Skip to content
View bitmjy's full-sized avatar

Block or report bitmjy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

Python 240 6 Updated Jul 15, 2024
Python 334 34 Updated Sep 23, 2024
55 3 Updated Feb 22, 2024

Train transformer language models with reinforcement learning.

Python 9,571 1,196 Updated Oct 1, 2024
Python 14 1 Updated Jul 23, 2024

[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diverse strengths of multiple open-source LLMs. LLM-Blender cut …

Python 865 73 Updated Apr 29, 2024
Python 3 1 Updated Jun 17, 2024

Code and Data for "Long-context LLMs Struggle with Long In-context Learning"

Python 88 4 Updated Jul 1, 2024

This repository contains the code and data for the paper "SelfIE: Self-Interpretation of Large Language Model Embeddings" by Haozhe Chen, Carl Vondrick, and Chengzhi Mao.

Python 30 2 Updated Mar 25, 2024

AdaICL: Which Examples to Annotate of In-Context Learning? Towards Effective and Efficient Selection

Python 16 1 Updated Oct 30, 2023

ACL'23: Unified Demonstration Retriever for In-Context Learning

Python 31 6 Updated Dec 2, 2023
Python 58 6 Updated Nov 28, 2022

OpenICL is an open-source framework to facilitate research, development, and prototyping of in-context learning.

Python 529 28 Updated Oct 3, 2023

Visual and Embodied Concepts evaluation benchmark

21 1 Updated Oct 10, 2023

[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.

Jupyter Notebook 1,809 215 Updated Sep 26, 2024

Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"

Python 414 48 Updated Apr 24, 2024

Attack to induce LLMs within hallucinations

Python 100 12 Updated May 17, 2024

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Python 31,784 3,899 Updated Oct 1, 2024

a distributed deep learning platform

C++ 3,354 1,239 Updated Sep 17, 2024

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Python 3,374 511 Updated Jul 2, 2024

Paper List for In-context Learning 🌷

790 58 Updated Jul 7, 2024

Using sparse coding to find distributed representations used by neural networks.

Jupyter Notebook 167 28 Updated Nov 10, 2023

Universal Neurons in GPT2 Language Models

Jupyter Notebook 25 5 Updated May 28, 2024

This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Ajith, Mengzhou Xia, Yangsibo Huang, Daogao Liu , Terra Blevins…

Python 198 21 Updated Nov 3, 2023

Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, B…

Jupyter Notebook 1,973 168 Updated Aug 15, 2024

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

Python 6,827 772 Updated Aug 24, 2023

ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors [EMNLP 2024 Findings]

Python 139 6 Updated Sep 29, 2024
Python 288 15 Updated Jun 24, 2024

Do Large Language Models Know What They Don’t Know?

Python 84 5 Updated Dec 5, 2023
Next