A repository for research on medium-sized language models.

Python 453 62 Updated Aug 3, 2024

Official release of InternLM2.5 base and chat models. 1M context support

Python 6,085 431 Updated Aug 14, 2024

Twitter Scraper

Python 450 62 Updated Jun 29, 2024

Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI

Python 219 24 Updated Apr 29, 2024
Python 170 9 Updated May 5, 2024

auto fine tune of models with synthetic data

Python 70 3 Updated Feb 14, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 35,874 3,768 Updated Jul 28, 2024

Minimalistic large language model 3D-parallelism training

Python 1,033 99 Updated Aug 16, 2024

structured outputs for llms

Python 7,146 577 Updated Aug 10, 2024

LUI: Autonomous Collective Decision Making via Large Language Models

Python 104 6 Updated Apr 23, 2023

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, Du…

Rust 3,703 200 Updated Aug 16, 2024
Jupyter Notebook 343 56 Updated Jun 26, 2023

Free Data Engineering course!

Jupyter Notebook 24,167 5,182 Updated Aug 16, 2024

Scripts to create a basic search on podcast data in general

Python 10 1 Updated Dec 23, 2022

An end-to-end implementation of intent prediction with Metaflow and other cool tools

Python 834 64 Updated Jun 16, 2023

Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.

Python 1,268 157 Updated Apr 3, 2023

Next.js app for serverless deployments of OpenAI Whisper on Banana.dev

JavaScript 92 33 Updated Sep 22, 2022

AI-powered CLI tool to help you remember bash commands.

Rust 327 17 Updated Jul 6, 2024

Construct a modern data stack and orchestrate the workflows to create high-quality data for analytics and ML applications.

Jupyter Notebook 193 30 Updated Sep 12, 2022

An Obsidian.md plugin to save tweets as Markdown files.

TypeScript 190 12 Updated May 8, 2023

Supporting materials/code examples for my course in data engineering for machine learning.

Python 38 7 Updated Nov 15, 2022

Resumes generated using GitHub information

JavaScript 61,773 1,351 Updated Feb 15, 2023

An underground, wireless, open-source, low-cost system for monitoring oxygen, temperature, and soil moisture

C++ 6 Updated Nov 19, 2021

Free MLOps course from DataTalks.Club

Jupyter Notebook 10,866 2,086 Updated Aug 6, 2024

Building a real-time Twitter graph of your friends

C# 268 14 Updated May 15, 2022

CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

Python 4,860 375 Updated Mar 17, 2024

Repo for Ecosystem Creator project based on Synthetic Silviculture Paper

C++ 4 Updated Nov 2, 2021

a cheat-sheet for mathematical notation in code form

15,004 1,071 Updated Mar 8, 2022

Question Generation - Question Answering for Automatic Flashcards

JavaScript 64 5 Updated Mar 14, 2022

My notes on using Linux

Shell 860 94 Updated Jun 23, 2024