Skip to content
View gao-xiao-bai's full-sized avatar

Highlights

  • Pro

Block or report gao-xiao-bai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code and data used in the paper: "Training on Incorrect Synthetic Data via RL Scales LLM Math Reasoning Eight-Fold"

20 Updated Jun 16, 2024

Agentless🐱: an agentless approach to automatically solve software development problems

Python 620 64 Updated Aug 20, 2024

Fast and memory-efficient exact attention

Python 13,079 1,177 Updated Aug 23, 2024
148 17 Updated Jun 12, 2024

aider is AI pair programming in your terminal

Python 16,675 1,564 Updated Aug 24, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 12,734 1,019 Updated May 23, 2024

[ACL 2024] Exploring Safety Generalization Challenges of Large Language Models via Code

Python 14 Updated May 20, 2024

A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 30.67% tasks (pass@1) in SWE-bench lite and 38.40% tasks (pass@1) in SWE-bench verified wi…

Python 2,564 256 Updated Aug 19, 2024

🐙 OctoPack: Instruction Tuning Code Large Language Models

Jupyter Notebook 415 27 Updated Aug 6, 2024

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 3,621 296 Updated Aug 21, 2024

OpenMMLab Foundational Library for Training Deep Learning Models

Python 1,126 335 Updated Aug 21, 2024

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Python 13,085 1,280 Updated Aug 20, 2024

The first real AI developer

Python 29,391 2,931 Updated Aug 19, 2024

Official release of InternLM2.5 base and chat models. 1M context support

Python 6,125 433 Updated Aug 14, 2024

[ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world Github Issues?

Python 1,663 279 Updated Aug 20, 2024

🙌 OpenHands: Code Less, Make More

Python 30,629 3,524 Updated Aug 24, 2024

A Comprehensive Benchmark for Software Development.

Python 82 4 Updated May 30, 2024

算法面试必备,推荐刷题网站www.lintcode.com。北大学霸的《LeetCode刷题模板》+V领取: jiuzhangfeifei

3,201 774 Updated Sep 8, 2022

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 19,721 2,445 Updated Aug 15, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 25,359 3,667 Updated Aug 24, 2024

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 2,168 258 Updated Aug 19, 2024

HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal

Jupyter Notebook 246 43 Updated Aug 16, 2024

JsonTuning: Towards Generalizable, Robust, and Controllable Instruction Tuning

Python 8 Updated Feb 23, 2024

A quick guide (especially) for trending instruction finetuning datasets

2,373 155 Updated Nov 28, 2023

MOSS-RLHF

Python 1,254 95 Updated Mar 3, 2024

A framework for few-shot evaluation of language models.

Python 6,217 1,646 Updated Aug 23, 2024

DeepSeek Coder: Let the Code Write Itself

Python 6,396 452 Updated May 21, 2024

A curation of awesome tools, documents and projects about LLM Security.

848 83 Updated Aug 17, 2024

本项目旨在分享大模型相关技术原理以及实战经验。

HTML 8,700 848 Updated Aug 11, 2024
Next