gaotianyu1350

Tianyu Gao gaotianyu1350

PhD student at Princeton University.

925 followers · 10 following

Achievements

Highlights

Organizations

Block or Report

Block or report gaotianyu1350

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Stars

OpenBMB / MiniCPM

MiniCPM-2B: An end-side LLM outperforming Llama2-13B.

Python 4,490 323 Updated Jul 31, 2024

abertsch72 / unlimiformer

Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"

Python 1,046 77 Updated Mar 7, 2024

bitsandbytes-foundation / bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Python 5,849 593 Updated Aug 2, 2024

thunlp / OpenDelta

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

Python 966 77 Updated Aug 16, 2023

tatsu-lab / stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,225 4,024 Updated Jul 17, 2024

neulab / knn-transformers

PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an implementation of kNN-LM and kNN-MT

Python 267 23 Updated Oct 20, 2022

OpenBMB / ModelCenter

Efficient, Low-Resource, Distributed transformer implementation based on BMTrain

Python 225 28 Updated Nov 27, 2023

OpenBMB / BMTrain

Efficient Training (including pre-training and fine-tuning) for Big Models

Python 539 75 Updated Jul 22, 2024

FMInference / FlexGen

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,099 532 Updated Jul 24, 2024

mit-han-lab / smoothquant

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Python 1,132 130 Updated Jul 12, 2024

facebookresearch / dpr-scale

Scalable training for dense retrieval models.

Python 262 24 Updated May 27, 2023

paperswithcode / galai

Model API for GALACTICA

Jupyter Notebook 2,672 275 Updated Mar 5, 2023

princeton-nlp / TRIME

[EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674

Python 189 13 Updated Jun 14, 2023

facebookresearch / contriever

Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning

Python 645 58 Updated Apr 7, 2023

shauryr / ACL-anthology-corpus

This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs

Jupyter Notebook 164 15 Updated Oct 12, 2023

THU-KEG / MAVEN-dataset

Source code and dataset for EMNLP 2020 paper "MAVEN: A Massive General Domain Event Detection Dataset".

Python 149 39 Updated Jan 5, 2022

CompVis / stable-diffusion

A latent text-to-image diffusion model

Jupyter Notebook 66,967 10,019 Updated Jun 18, 2024

martiansideofthemoon / rankgen

Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arxiv.org/abs/2205.09726).

Python 136 11 Updated Aug 2, 2023