Skip to content
View gaotianyu1350's full-sized avatar

Highlights

  • Pro

Organizations

@princeton-nlp
Block or Report

Block or report gaotianyu1350

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

MiniCPM-2B: An end-side LLM outperforming Llama2-13B.

Python 4,490 323 Updated Jul 31, 2024

Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"

Python 1,046 77 Updated Mar 7, 2024

Accessible large language models via k-bit quantization for PyTorch.

Python 5,849 593 Updated Aug 2, 2024

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

Python 966 77 Updated Aug 16, 2023

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,225 4,024 Updated Jul 17, 2024

PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an implementation of kNN-LM and kNN-MT

Python 267 23 Updated Oct 20, 2022

Efficient, Low-Resource, Distributed transformer implementation based on BMTrain

Python 225 28 Updated Nov 27, 2023

Efficient Training (including pre-training and fine-tuning) for Big Models

Python 539 75 Updated Jul 22, 2024

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,099 532 Updated Jul 24, 2024

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Python 1,132 130 Updated Jul 12, 2024

Scalable training for dense retrieval models.

Python 262 24 Updated May 27, 2023

Model API for GALACTICA

Jupyter Notebook 2,672 275 Updated Mar 5, 2023

[EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674

Python 189 13 Updated Jun 14, 2023

Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning

Python 645 58 Updated Apr 7, 2023

This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs

Jupyter Notebook 164 15 Updated Oct 12, 2023

Source code and dataset for EMNLP 2020 paper "MAVEN: A Massive General Domain Event Detection Dataset".

Python 149 39 Updated Jan 5, 2022

A latent text-to-image diffusion model

Jupyter Notebook 66,967 10,019 Updated Jun 18, 2024

Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arxiv.org/abs/2205.09726).

Python 136 11 Updated Aug 2, 2023

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

Python 10,452 2,095 Updated Nov 3, 2023
Python 95 32 Updated Aug 28, 2018

The source code of our COLING'18 paper "Few-Shot Charge Prediction with Discriminative Legal Attributes".

Python 129 29 Updated Dec 20, 2018

The respository of jec-qa.

Python 48 2 Updated Feb 2, 2020

Open Chinese Language Pre-trained Model Zoo

977 145 Updated Mar 18, 2020
Python 23 2 Updated Apr 11, 2020
77 49 Updated Jun 29, 2020

Must-read Papers on Legal Intelligence

459 59 Updated Jan 22, 2021

Source code and checkpoints for legal pre-trained language models.

Python 168 24 Updated May 9, 2021

Repo for external large-scale work

Python 6,446 722 Updated Apr 27, 2024

Generative model for code infilling and synthesis

Python 292 25 Updated Sep 9, 2023

A Collection of BM25 Algorithms in Python

Python 938 82 Updated May 28, 2024
Next