LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing internal workings of Transformer-based language models. *Check out demo at* https://huggingface.co/spaces/facebook/l…

Python 708 48 Updated Aug 7, 2024

jbloomAus / SAELens

Training Sparse Autoencoders on Language Models

HTML 319 87 Updated Aug 18, 2024

TransformerLensOrg / TransformerLens

A library for mechanistic interpretability of GPT-style language models

Python 1,340 264 Updated Aug 19, 2024

openai / automated-interpretability

Python 938 110 Updated Mar 6, 2024

openai / transformer-debugger

Python 3,991 232 Updated Jun 4, 2024

jessevig / bertviz

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

Python 6,687 762 Updated Aug 24, 2023

HarderThenHarder / transformers_tasks

⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.

Jupyter Notebook 2,087 372 Updated Sep 29, 2023

maidacundo / MoE-LoRA

Python 20 Updated Jun 3, 2024

InsaneLife / ChineseNLPCorpus

中文自然语言处理数据集，平时做做实验的材料。欢迎补充提交合并。

Python 4,203 779 Updated Nov 21, 2023

lidangzzz / How-to-run

立党零基础转码笔记

TypeScript 5,273 337 Updated May 5, 2024

ZHO-ZHO-ZHO / ComfyUI-Workflows-ZHO

我的 ComfyUI 工作流合集 | My ComfyUI workflows collection

4,519 430 Updated Aug 5, 2024

liuqidong07 / MOELoRA-peft

[SIGIR'24] The official implementation code of MOELoRA.

Python 110 12 Updated Jul 22, 2024

jzhang38 / TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,515 443 Updated May 3, 2024

DLLXW / baby-llama2-chinese

用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库；24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.

Python 2,411 295 Updated May 21, 2024

charent / Phi2-mini-Chinese

Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型，支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.

Jupyter Notebook 456 48 Updated Jul 11, 2024

jiahe7ay / MINI_LLM

This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.

Python 313 50 Updated Apr 24, 2024

zjunlp / IEPile

[OneKE] [ACL 2024] IEPile: A Large-Scale Information Extraction Corpus

Python 145 13 Updated Jul 13, 2024

bmaltais / kohya_ss

Python 9,122 1,184 Updated Aug 19, 2024

huggingface / diffusion-models-class

Materials for the Hugging Face Diffusion Models Course

Jupyter Notebook 3,485 375 Updated Aug 19, 2024

lllyasviel / ControlNet

Let us control diffusion models!

Python 29,478 2,661 Updated Feb 25, 2024

PKU-YuanGroup / Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 11,141 993 Updated Aug 20, 2024

allenai / OLMo-Eval

Evaluation suite for LLMs

Python 285 31 Updated Jun 13, 2024

allenai / catwalk

This project studies the performance and robustness of language models and task-adaptation methods.

Python 140 14 Updated May 18, 2024

allenai / OLMo

Modeling, training, eval, and inference code for OLMo

Python 4,301 418 Updated Aug 20, 2024

allenai / dolma

Data and tools for generating and inspecting OLMo pre-training data.

Python 882 88 Updated Aug 17, 2024

chen700564 / RGB

Python 244 22 Updated May 17, 2024

VikParuchuri / surya

OCR, layout analysis, reading order, line detection in 90+ languages

Python 9,604 614 Updated Aug 16, 2024

alexriggio / BERT-LoRA-TensorRT

This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Rank Approximation (LoRA). The models are optimized for high p…

Jupyter Notebook 53 7 Updated Nov 14, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

wangzhanxd

Achievements