
LLM training in simple, raw C/CUDA

Cuda 22,718 2,540 Updated Aug 16, 2024

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Python 12,825 1,262 Updated Aug 20, 2024

🙌 OpenHands: Code Less, Make More

Python 30,219 3,486 Updated Aug 20, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 8,927 818 Updated Jul 1, 2024

Machine Learning Compiler Road Map

Jupyter Notebook 40 4 Updated Sep 12, 2023

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

Jupyter Notebook 2,558 155 Updated Aug 14, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Python 1,908 184 Updated Aug 20, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,281 4,020 Updated Jul 17, 2024

An index of algorithms for offline reinforcement learning (offline-rl)

893 87 Updated May 23, 2024

Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)

Shell 24,865 3,103 Updated Aug 15, 2024

An MIT-licensed, deployable starter kit for building and customizing your own version of AI town, a virtual town where AI characters live, chat, and socialize.

TypeScript 7,299 669 Updated Jul 3, 2024

Generative Agents: Interactive Simulacra of Human Behavior

16,098 2,044 Updated Aug 5, 2024

Example models using DeepSpeed

Python 5,957 1,004 Updated Aug 20, 2024

A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges

197 20 Updated Jan 16, 2024

SkyAGI: Emerging human-behavior simulation capability in LLM

TypeScript 752 53 Updated Sep 21, 2023

An attempt to build a working, locally-running cheap version of Generative Agents: Interactive Simulacra of Human Behavior

Jupyter Notebook 910 143 Updated May 6, 2023

Making large AI models cheaper, faster and more accessible

Python 38,502 4,320 Updated Aug 20, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 34,383 4,020 Updated Aug 20, 2024

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python 7,663 668 Updated Jan 14, 2024

ChatGLM-6B: An Open Bilingual Dialogue Language Model

Python 40,268 5,172 Updated Jun 27, 2024

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 90,932 14,440 Updated Aug 20, 2024

TradeMaster is an open-source platform for quantitative trading empowered by reinforcement learning 🔥 ⚡ 🌈

Jupyter Notebook 1,291 262 Updated Feb 28, 2024

PyTorch Implementation of Distributed Prioritized Experience Replay (Ape-X)

Python 149 17 Updated Apr 28, 2019

Useful CMake Examples

CMake 12,262 2,478 Updated Feb 28, 2024

Pokémon battle simulator.

TypeScript 4,691 2,739 Updated Aug 19, 2024

Obfuscator for Python

Python 361 61 Updated Feb 23, 2024

PyTorch deep learning projects made easy.

Python 4,669 1,081 Updated Jun 4, 2024

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 32,709 5,540 Updated Aug 20, 2024

Implementations of deep RL papers and random experimentation

Python 177 47 Updated Apr 7, 2018