Skip to content
View yule-BUAA's full-sized avatar
Block or Report

Block or report yule-BUAA

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

DataComp for Language Models

HTML 151 8 Updated Jul 4, 2024

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

1,228 63 Updated Jul 3, 2024
Python 10 Updated May 22, 2024
Python 109 5 Updated Jun 15, 2024

MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning

Python 302 16 Updated May 29, 2024

Mixture-of-Experts (MoE) Language Model

Python 159 38 Updated Jul 1, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 10,937 817 Updated May 23, 2024

My favorite C programming practices.

1,913 94 Updated Oct 1, 2020

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

2,878 107 Updated Jun 26, 2024

DeepSeek LLM: Let there be answers

Makefile 1,325 87 Updated Feb 4, 2024

Kolmogorov Arnold Networks

Jupyter Notebook 13,634 1,200 Updated Jun 28, 2024

Official repository for ORPO

Python 373 34 Updated May 31, 2024

RewardBench: the first evaluation tool for reward models.

Python 277 27 Updated Jul 5, 2024

Robust recipes to align language models with human and AI preferences

Python 4,178 354 Updated May 25, 2024

Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"

Python 50 2 Updated Jun 7, 2024

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 3,181 336 Updated Jul 5, 2024

Sailor: Open Language Models for South-East Asia

Python 83 7 Updated May 21, 2024

a huggingface mirror site.

196 23 Updated Mar 18, 2024

The official Meta Llama 3 GitHub site

Python 22,879 2,410 Updated Jul 3, 2024

ReFT: Representation Finetuning for Language Models

Python 947 77 Updated Jul 3, 2024

Official repository of Evolutionary Optimization of Model Merging Recipes

Python 1,082 72 Updated Mar 30, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 20,249 1,915 Updated Jul 5, 2024

Grok open release

Python 49,149 8,311 Updated May 29, 2024

Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".

Python 73 8 Updated Jun 8, 2023

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 8,153 827 Updated Jul 5, 2024

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 1,417 160 Updated Jul 2, 2024

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 21,526 2,182 Updated Jul 5, 2024

Reference implementation for DPO (Direct Preference Optimization)

Python 1,837 143 Updated May 23, 2024
Next