Skip to content
View ERnest666's full-sized avatar
:shipit:
Identified Idiot
:shipit:
Identified Idiot
  • Champaign, IL
Block or Report

Block or report ERnest666

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The code of paper "Toward Optimal LLM Alignments Using Two-Player Games".

Python 8 Updated Jun 20, 2024

ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate

Python 304 42 Updated Oct 3, 2023

📖 Paper reading list in conversational AI (constantly updating 🤗).

961 163 Updated Jun 23, 2024

To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.

Python 4,238 226 Updated Jul 3, 2024

Compress your input to ChatGPT or other LLMs, to let them process 2x more content and save 40% memory and GPU time.

Python 213 10 Updated Feb 12, 2024

Repository for the code of the "PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided Decoding" paper, NAACL'22

Python 61 9 Updated Oct 25, 2022

A quick guide (especially) for trending instruction finetuning datasets

2,258 148 Updated Nov 28, 2023

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,398 469 Updated Jan 8, 2024

PAL: Proxy-Guided Black-Box Attack on Large Language Models

Python 38 3 Updated Jun 2, 2024

LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA

Python 168 22 Updated May 23, 2023

Wrapper to easily generate the chat template for Llama2

Python 61 7 Updated Mar 10, 2024

Official implementation of ICLR'24 paper, "Curiosity-driven Red Teaming for Large Language Models" (https://openreview.net/pdf?id=4KqkizXgXU)

Jupyter Notebook 50 9 Updated Mar 15, 2024

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Python 2,556 261 Updated Jun 2, 2024

Code for AAAI 2023 paper 'Learning to Memorize Entailment and Discourse Relations for Persona-Consistent Dialogues'

Python 29 7 Updated May 27, 2023

Train transformer language models with reinforcement learning.

Python 8,808 1,083 Updated Jul 18, 2024

A modular RL library to fine-tune language models to human preferences

Python 2,132 190 Updated Mar 1, 2024

library supporting NLP and CV research on scientific papers

Python 643 48 Updated Apr 5, 2024

Datasets for "Evaluating Gender Bias of Pre-trained Language Models in Natural Language Inference by Considering All Labels"

Python 2 Updated May 18, 2024

Codes for NAACL'22 "A Study of the Attention Abnormality in Trojaned BERTs", a textural Trojan Detector

Python 6 1 Updated Sep 5, 2023

Ongoing research training transformer models at scale

Python 9,443 2,126 Updated Jul 18, 2024

Pre-processing and in some cases downloading of datasets for the paper "Content Selection in Deep Learning Models of Summarization."

Python 78 25 Updated Nov 2, 2022

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 9,839 628 Updated May 2, 2024

Diffusion-LM

Python 1,011 132 Updated Jul 2, 2024

Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation

Python 172 23 Updated Feb 10, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 35,786 4,405 Updated Jul 18, 2024

Acceptance rates for the major AI conferences

Jupyter Notebook 4,000 288 Updated Jun 22, 2024
Next