Skip to content
View baofff's full-sized avatar

Block or report baofff

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

collection of diffusion model papers categorized by their subareas

1,060 50 Updated Aug 24, 2024

Mamba SSM architecture

Python 12,249 1,029 Updated Aug 15, 2024

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,325 230 Updated Aug 19, 2024

ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering

Python 1,152 49 Updated Aug 15, 2024

Consistency Distilled Diff VAE

Python 2,120 74 Updated Nov 7, 2023

A series of large language models trained from scratch by developers @01-ai

Jupyter Notebook 7,562 460 Updated Aug 22, 2024

Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models

Python 297 28 Updated Dec 28, 2023

A framework for few-shot evaluation of language models.

Python 6,216 1,646 Updated Aug 23, 2024

Scaling Data-Constrained Language Models

Jupyter Notebook 305 18 Updated Mar 22, 2024

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

7,332 374 Updated Jul 16, 2023

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,493 344 Updated Aug 8, 2024

Tools to download and cleanup Common Crawl data

Python 950 138 Updated Apr 25, 2023

A quick guide (especially) for trending instruction finetuning datasets

2,373 155 Updated Nov 28, 2023

Unofficial implementation for [ECCV'22] "Exploring Plain Vision Transformer Backbones for Object Detection"

Python 523 46 Updated Apr 24, 2022

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 9,876 773 Updated Aug 20, 2024

A series of large language models developed by Baichuan Intelligent Technology

Python 4,067 293 Updated Jun 22, 2024

[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列

Python 1,059 93 Updated Jun 13, 2024

Diffusion model papers, survey, and taxonomy

2,867 246 Updated Aug 9, 2024

Generative Agents: Interactive Simulacra of Human Behavior

16,122 2,052 Updated Aug 5, 2024

[IJCV2024] Exploiting Diffusion Prior for Real-World Image Super-Resolution

Python 2,063 129 Updated Jul 12, 2024

Awesome-LLM: a curated list of Large Language Model

16,953 1,365 Updated Aug 19, 2024

A collection of resources on digital human including clothed people digitalization, virtual try-on, and other related directions.

1,391 130 Updated Aug 19, 2024

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Python 15,583 4,854 Updated Aug 1, 2024

An elegant PyTorch deep reinforcement learning library.

Python 7,709 1,112 Updated Aug 24, 2024

A curated list of Multimodal Related Research.

Python 1,289 152 Updated Aug 5, 2023

Reading list for research topics in multimodal machine learning

5,803 840 Updated Aug 20, 2024

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Python 5,668 505 Updated Jul 18, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 36,237 4,453 Updated Aug 24, 2024

中国大模型

5,139 428 Updated Jun 7, 2024

A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.

28,377 3,262 Updated Mar 25, 2024
Next