zhang-yi-chi

zhang-yi-chi

7 followers · 5 following

https://zhang-yi-chi.github.io

Achievements

Block or Report

Block or report zhang-yi-chi

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Stars

karpathy / llm.c

LLM training in simple, raw C/CUDA

Cuda 22,718 2,540 Updated Aug 16, 2024

princeton-nlp / SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Python 12,825 1,262 Updated Aug 20, 2024

All-Hands-AI / OpenHands

🙌 OpenHands: Code Less, Make More

Python 30,219 3,486 Updated Aug 20, 2024

karpathy / minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 8,927 818 Updated Jul 1, 2024

l1nkr / DL-Compiler-Navigation

Machine Learning Compiler Road Map

Jupyter Notebook 40 4 Updated Sep 12, 2023

facebookresearch / Pearl

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

Jupyter Notebook 2,558 155 Updated Aug 14, 2024

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Python 1,908 184 Updated Aug 20, 2024

tatsu-lab / stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,281 4,020 Updated Jul 17, 2024

hanjuku-kaso / awesome-offline-rl

An index of algorithms for offline reinforcement learning (offline-rl)

893 87 Updated May 23, 2024

OpenBMB / ChatDev

Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)

Shell 24,865 3,103 Updated Aug 15, 2024

a16z-infra / ai-town

A MIT-licensed, deployable starter kit for building and customizing your own version of AI town - a virtual town where AI characters live, chat and socialize.

TypeScript 7,299 669 Updated Jul 3, 2024

joonspk-research / generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

16,098 2,044 Updated Aug 5, 2024

microsoft / DeepSpeedExamples

Example models using DeepSpeed

Python 5,957 1,004 Updated Aug 20, 2024

Plankson / awesome-explainable-reinforcement-learning

A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges

197 20 Updated Jan 16, 2024

litanlitudan / skyagi

SkyAGI: Emerging human-behavior simulation capability in LLM

TypeScript 752 53 Updated Sep 21, 2023

mkturkcan / generative-agents

An attempt to build a working, locally-running cheap version of Generative Agents: Interactive Simulacra of Human Behavior

Jupyter Notebook 910 143 Updated May 6, 2023

hpcaitech / ColossalAI

Making large AI models cheaper, faster and more accessible

Python 38,502 4,320 Updated Aug 20, 2024

microsoft / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 34,383 4,020 Updated Aug 20, 2024

lucidrains / PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python 7,663 668 Updated Jan 14, 2024

THUDM / ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Python 40,268 5,172 Updated Jun 27, 2024

langchain-ai / langchain

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 90,932 14,440 Updated Aug 20, 2024

TradeMaster-NTU / TradeMaster

TradeMaster is an open-source platform for quantitative trading empowered by reinforcement learning 🔥 ⚡ 🌈

Jupyter Notebook 1,291 262 Updated Feb 28, 2024

younggyoseo / Ape-X

PyTorch Implementation of Distributed Prioritized Experience Replay(Ape-X)

Python 149 17 Updated Apr 28, 2019

ttroy50 / cmake-examples

Useful CMake Examples

CMake 12,262 2,478 Updated Feb 28, 2024

smogon / pokemon-showdown

Pokémon battle simulator.

TypeScript 4,691 2,739 Updated Aug 19, 2024

QQuick / Opy

Obfuscator for Python

Python 361 61 Updated Feb 23, 2024

victoresque / pytorch-template

PyTorch deep learning projects made easy.

Python 4,669 1,081 Updated Jun 4, 2024

xupe / mistake-in-retro-contest-of-OpenAI

Python 46 9 Updated Jun 19, 2018

ray-project / ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 32,709 5,540 Updated Aug 20, 2024

steveKapturowski / tensorflow-rl

Implementations of deep RL papers and random experimentation

Python 177 47 Updated Apr 7, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

zhang-yi-chi

Achievements

Achievements

Block or report zhang-yi-chi

Stars

karpathy / llm.c

princeton-nlp / SWE-agent

All-Hands-AI / OpenHands

karpathy / minbpe

l1nkr / DL-Compiler-Navigation

facebookresearch / Pearl

OpenRLHF / OpenRLHF

tatsu-lab / stanford_alpaca

hanjuku-kaso / awesome-offline-rl

OpenBMB / ChatDev

a16z-infra / ai-town

joonspk-research / generative_agents

microsoft / DeepSpeedExamples

Plankson / awesome-explainable-reinforcement-learning

litanlitudan / skyagi

mkturkcan / generative-agents

hpcaitech / ColossalAI

microsoft / DeepSpeed

lucidrains / PaLM-rlhf-pytorch

THUDM / ChatGLM-6B

langchain-ai / langchain

TradeMaster-NTU / TradeMaster

younggyoseo / Ape-X

ttroy50 / cmake-examples

smogon / pokemon-showdown

QQuick / Opy

victoresque / pytorch-template

xupe / mistake-in-retro-contest-of-OpenAI

ray-project / ray

steveKapturowski / tensorflow-rl