Starred repositories (suhmily)

This repository contains the paper list for the paper: Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents

323 25 Updated Nov 25, 2023

CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)

Python 8,061 582 Updated Aug 13, 2024

GLM-4 series: Open Multilingual Multimodal Chat LMs

Python 4,215 318 Updated Aug 17, 2024

[ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world GitHub Issues?

Python 1,621 269 Updated Aug 17, 2024

Chinese LLaMA & Alpaca large language models, plus local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)

Python 18,072 1,847 Updated Apr 30, 2024

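Llama3 and Llama3.1 Chinese repository (work in progress... interesting weights from community and vendor fine-tunes and modified versions, plus tutorial videos and docs for training, inference, evaluation, and deployment)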

Python 3,800 309 Updated Aug 16, 2024

Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!

Python 330 33 Updated Aug 5, 2024

Finetune Llama-3-8b on the MathInstruct dataset

Python 85 16 Updated Aug 13, 2024

A family of compressed models obtained via pruning and knowledge distillation

126 10 Updated Aug 16, 2024
Jupyter Notebook 239 13 Updated Jul 22, 2024

[ACL 2024] The project of Symbol-LLM

Python 37 1 Updated Jul 10, 2024

PaL: Program-Aided Language Models (ICML 2023)

Python 456 56 Updated Jun 30, 2023

The Mix of Minimal Optimal Sets (MMOS) dataset has two advantages for math reasoning: higher performance and lower construction cost.

Python 66 2 Updated Jul 27, 2024

ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].

Python 916 70 Updated Feb 22, 2024

Train transformer language models with reinforcement learning.

Python 9,010 1,109 Updated Aug 17, 2024

A curated list of language modeling research for code and related datasets.

1,214 84 Updated Aug 3, 2024

Data processing for code LLMs (pre-training, fine-tuning, and DPO): a state-of-the-art industry processing pipeline

Python 24 4 Updated Jul 25, 2024

Lightweight and portable LLM sandbox runtime (code interpreter) Python library.

Python 15 2 Updated Jul 13, 2024

Parse LaTeX math expressions

Python 114 22 Updated Aug 5, 2024

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python 1,011 103 Updated Aug 17, 2024

(ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training

Python 166 16 Updated May 26, 2024

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 8,211 579 Updated Aug 17, 2024

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

Python 183 4 Updated Jul 15, 2024

Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capa…

Python 7,255 806 Updated Aug 14, 2024

This is the official implementation of ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting

Python 11 1 Updated Jul 30, 2024

KenLM: Faster and Smaller Language Model Queries

C++ 2,469 512 Updated Jul 30, 2024

Xueqiu stock data API, Python edition

Python 1,069 268 Updated Jul 15, 2024