View wangzhanxd's full-sized avatar
Jupyter Notebook 19 2 Updated May 6, 2024

An Open Source Implementation of Anthropic's Paper: "Towards Monosemanticity: Decomposing Language Models with Dictionary Learning"

Python 16 2 Updated May 12, 2024

LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing the internal workings of Transformer-based language models. *Check out the demo at* https://huggingface.co/spaces/facebook/l…

Python 708 48 Updated Aug 7, 2024

Training Sparse Autoencoders on Language Models

HTML 319 87 Updated Aug 18, 2024

A library for mechanistic interpretability of GPT-style language models

Python 1,340 264 Updated Aug 19, 2024

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

Python 6,687 762 Updated Aug 24, 2023

⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.

Jupyter Notebook 2,087 372 Updated Sep 29, 2023
Python 20 Updated Jun 3, 2024

Chinese natural language processing datasets, material for everyday experiments. Additions and pull requests are welcome.

Python 4,203 779 Updated Nov 21, 2023

Lidang's (立党) notes on switching to a programming career from zero background

TypeScript 5,273 337 Updated May 5, 2024

My ComfyUI workflows collection

4,519 430 Updated Aug 5, 2024

[SIGIR'24] The official implementation code of MOELoRA.

Python 110 12 Updated Jul 22, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,515 443 Updated May 3, 2024

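A repository for pretraining from scratch and then applying SFT to a small-parameter Chinese LLaMa2; a single 24G GPU is enough to run it and obtain a chat-llama2 with basic Chinese question-answering ability.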

Python 2,411 295 Updated May 21, 2024

Phi2-Chinese-0.2B: train your own small Chinese Phi2 chat model from scratch, with support for integrating langchain to load a local knowledge base for retrieval-augmented generation (RAG).

Jupyter Notebook 456 48 Updated Jul 11, 2024

A repository for individuals to experiment with and reproduce the LLM pre-training process.

Python 313 50 Updated Apr 24, 2024

[OneKE] [ACL 2024] IEPile: A Large-Scale Information Extraction Corpus

Python 145 13 Updated Jul 13, 2024
Python 9,122 1,184 Updated Aug 19, 2024

Materials for the Hugging Face Diffusion Models Course

Jupyter Notebook 3,485 375 Updated Aug 19, 2024

Let us control diffusion models!

Python 29,478 2,661 Updated Feb 25, 2024

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.

Python 11,141 993 Updated Aug 20, 2024

Evaluation suite for LLMs

Python 285 31 Updated Jun 13, 2024

This project studies the performance and robustness of language models and task-adaptation methods.

Python 140 14 Updated May 18, 2024

Modeling, training, eval, and inference code for OLMo

Python 4,301 418 Updated Aug 20, 2024

Data and tools for generating and inspecting OLMo pre-training data.

Python 882 88 Updated Aug 17, 2024
Python 244 22 Updated May 17, 2024

OCR, layout analysis, reading order, line detection in 90+ languages

Python 9,604 614 Updated Aug 16, 2024

This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Rank Approximation (LoRA). The models are optimized for high p…

Jupyter Notebook 53 7 Updated Nov 14, 2023