Skip to content
View zchuz's full-sized avatar
  • Harbin Institute of Technology
  • Shenzhen, China
  • 07:23 (UTC +08:00)
Block or Report

Block or report zchuz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

Python 172 2 Updated Jul 15, 2024

[ICML 2024] CLLMs: Consistency Large Language Models

Python 324 14 Updated Jul 25, 2024

This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.

Python 39 2 Updated Jun 17, 2024
Python 12 Updated Jun 7, 2024

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search

Python 56 3 Updated Jun 7, 2024

The official Meta Llama 3 GitHub site

Python 24,550 2,673 Updated Jul 26, 2024

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 6,517 374 Updated Jul 18, 2024

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Python 726 41 Updated Apr 15, 2024

"Improving Mathematical Reasoning with Process Supervision" by OPENAI

Python 45 2 Updated Jul 22, 2024

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Python 3,915 285 Updated Jul 26, 2024

How to think step-by-step: A mechanistic understanding of chain-of-thought reasoning

Jupyter Notebook 12 Updated Jul 11, 2024

Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning

Python 136 11 Updated Feb 6, 2024

[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".

Shell 61 4 Updated May 28, 2024

SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 563 31 Updated Jul 20, 2024

A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).

Python 2,540 154 Updated Jul 25, 2024

An Awesome Collection for LLM Survey

266 27 Updated Jul 24, 2024

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

2,066 140 Updated Jul 25, 2024

MambaOut: Do We Really Need Mamba for Vision?

Python 1,898 29 Updated Jun 6, 2024
32 Updated Apr 14, 2024

A Survey of Attributions for Large Language Models

149 8 Updated May 30, 2024

CodeQwen1.5 is the code version of Qwen, the large language model series developed by Qwen team, Alibaba Cloud.

Python 392 22 Updated May 23, 2024

A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..

88 4 Updated Jul 23, 2024

awesome papers in LLM interpretability

200 11 Updated Jul 24, 2024

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 8,412 789 Updated Jul 22, 2024

A fast MoE impl for PyTorch

Python 1,491 180 Updated Jul 5, 2024

AAAI 2024 Papers: Explore a comprehensive collection of innovative research papers presented at one of the premier artificial intelligence conferences. Seamlessly integrate code implementations for…

Python 323 15 Updated May 30, 2024

contrastive decoding

Python 167 10 Updated Nov 14, 2022

A curated list for Efficient Large Language Models

Python 992 74 Updated Jul 26, 2024

The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023)

Python 16 Updated May 15, 2024

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…

421 26 Updated Jul 3, 2024
Next