Skip to content
View huybery's full-sized avatar
🚀
Accelerate
🚀
Accelerate

Organizations

@QwenLM @OpenLemur @OpenDevin
Block or Report

Block or report huybery

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A collection of phenomenons observed during the scaling of big foundation models, which may be developed into consensus, principles, or laws in the future

274 19 Updated Aug 13, 2023

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

Python 173 4 Updated Jul 15, 2024

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 6,602 384 Updated Jul 29, 2024

CodeQwen1.5 is the code version of Qwen, the large language model series developed by Qwen team, Alibaba Cloud.

Python 395 23 Updated May 23, 2024

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Python 12,156 1,221 Updated Jul 29, 2024

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Python 1,076 128 Updated Jul 31, 2024

CodeUltraFeedback: aligning large language models to coding preferences

Python 57 2 Updated Jun 25, 2024

🐚 OpenDevin: Code Less, Make More

Python 29,264 3,382 Updated Jul 31, 2024

Sailor: Open Language Models for South-East Asia

Python 86 7 Updated Jul 11, 2024

Home of StarCoder2!

Python 1,642 154 Updated Mar 21, 2024
Python 43 6 Updated Jun 16, 2024

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 1,359 143 Updated Jul 19, 2024

Generative Representational Instruction Tuning

Jupyter Notebook 493 34 Updated Jul 24, 2024

Blog post

16 Updated Feb 16, 2024

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Python 731 42 Updated Apr 15, 2024

Awesome Papers related to Mamba.

1,009 51 Updated Jul 19, 2024

👑 Qwen Blog.

HTML 12 10 Updated Jul 16, 2024

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Python 934 43 Updated Jan 16, 2024

Mamba SSM architecture

Python 11,975 1,002 Updated Jul 30, 2024

CRUXEval: Code Reasoning, Understanding, and Execution Evaluation

Python 93 8 Updated Apr 17, 2024

Repository for paper Tools Are Instrumental for Language Agents in Complex Environments

32 2 Updated Jan 4, 2024

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

673 27 Updated Jul 24, 2024

Generative AI for Math: MathPile

Python 364 19 Updated Jun 23, 2024

《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing

Java 90,605 11,417 Updated Jul 30, 2024

High Accuracy and efficiency multi-task fine-tuning framework for Code LLMs. This work has been accepted by KDD 2024.

Python 599 64 Updated Jun 11, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 12,855 1,038 Updated Jul 30, 2024

A dataset of LLM-generated chain-of-thought steps annotated with mistake location.

Python 60 6 Updated Jan 25, 2024

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Python 55,163 6,737 Updated Jul 30, 2024

Rigourous evaluation of LLM-synthesized code - NeurIPS 2023

Python 1,083 91 Updated Jul 26, 2024
Next