  • ByteDance
  • Beijing

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

7,285 371 Updated Jul 16, 2023

Official implementation of paper "Meta Prompting for AI Systems" (https://arxiv.org/abs/2311.11482)

Python 58 8 Updated Jun 10, 2024

WIP - Automated Question Answering for ArXiv Papers with Large Language Models (https://arxiv.taesiri.xyz/)

Python 267 11 Updated Apr 15, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,361 432 Updated May 3, 2024

Awesome papers about unifying LLMs and KGs

1,742 130 Updated May 16, 2024

Resources of deep learning for mathematical reasoning (DL4MATH).

319 26 Updated Dec 22, 2023

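MNBVC (Massive Never-ending BT Vast Chinese corpus): an ultra-large-scale Chinese corpus, targeting the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) internet slang. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wiki, classical poetry, lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.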

3,238 223 Updated Jul 8, 2024

Safety Score for Pre-Trained Language Models

Python 92 6 Updated Oct 18, 2023

DSPy: The framework for programming—not prompting—foundation models

Python 14,574 1,113 Updated Jul 17, 2024

Next generation face swapper and enhancer

Python 16,673 2,462 Updated Jul 18, 2024

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

Python 1,305 63 Updated Mar 8, 2024

Task-based datasets, preprocessing, and evaluation for sequence models.

Python 545 58 Updated Jul 18, 2024

Multipack distributed sampler for fast padding-free training of LLMs

Python 159 12 Updated Jul 8, 2023

Model API for GALACTICA

Jupyter Notebook 2,667 275 Updated Mar 5, 2023
Jupyter Notebook 41 5 Updated Jul 13, 2024

[ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization

Python 26 4 Updated Jul 22, 2023

Pipeline for pulling and processing online language model pretraining data from the web

Python 170 21 Updated Jul 31, 2023

The all-in-one Desktop & Docker AI application with full RAG and AI Agent capabilities.

JavaScript 17,549 1,884 Updated Jul 19, 2024

🔥Highlighting the top ML papers every week.

9,426 543 Updated Jul 15, 2024

MTEB: Massive Text Embedding Benchmark

Python 1,658 216 Updated Jul 18, 2024

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Python 11,443 952 Updated Jul 5, 2024

Awesome Papers and Resources in Deep Neural Network Pruning with Source Code.

103 9 Updated Jun 13, 2024

A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research, and we are continuously improving the project. Welcome to PR the works (p…

1,733 203 Updated Apr 29, 2024

Chinese LLaMA & Alpaca large language models, plus local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)

Python 17,939 1,839 Updated Apr 30, 2024

Repository for Decomposed Prompting

Python 80 6 Updated Nov 15, 2023

AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features…

Python 2,547 341 Updated Jul 18, 2024

Using GPT to organize and access information, and generate questions. Long term goal is to make an agent-like research assistant.

Jupyter Notebook 623 48 Updated Dec 20, 2023

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 1,249 113 Updated Jun 13, 2024

Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

Jupyter Notebook 2,448 121 Updated Apr 22, 2024