Skip to content
View willyzw1221's full-sized avatar
Block or Report

Block or report willyzw1221

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 8,645 783 Updated Jun 17, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 19,077 2,434 Updated Jun 24, 2024
Python 6,983 539 Updated Jun 14, 2024

仿微信即时通讯聊天h5 app【前端项目】

JavaScript 20 8 Updated Nov 22, 2019

中文公开聊天语料库

Python 3,919 785 Updated Apr 23, 2024

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Python 4,271 316 Updated May 17, 2024

PhotoMaker

Jupyter Notebook 8,602 674 Updated Feb 28, 2024

ChatGPT带火了聊天机器人,主流的趋势都调整到了GPT类模式,本项目也与时俱进,会在近期更新GPT类版本。基于本项目和自己的语料可以训练出自己想要的聊天机器人,用于智能客服、在线问答、闲聊等场景。

Python 3,478 1,021 Updated Jun 26, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 21,808 3,076 Updated Jun 28, 2024

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Jupyter Notebook 4,442 290 Updated Jun 28, 2024

Unofficial Implementation of Animate Anyone

Python 2,832 228 Updated Mar 20, 2024

This repository consist of many login page example, whch can be used for any web or hybrid app developement.

HTML 541 323 Updated Mar 26, 2024

Hello, Flask!

1,794 2,503 Updated Jun 28, 2024

Example application for Flask tutorial "Flask 入门教程".

Python 253 122 Updated Jun 27, 2024

DeepSeek Coder: Let the Code Write Itself

Python 5,979 431 Updated May 21, 2024

The SimpleLogin back-end and web app

Python 4,831 410 Updated Jun 28, 2024

Emu Series: Generative Multimodal Models from BAAI

Python 1,556 79 Updated Mar 8, 2024

[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos. PIA,你的个性化图像动画生成器,利用文本提示将图像变为奇妙的动画

Python 779 62 Updated Jun 24, 2024

[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列

Python 1,033 92 Updated Jun 13, 2024

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Python 9,229 648 Updated Jun 16, 2024

This repository is the official implementation of Disentangling Writer and Character Styles for Handwriting Generation (CVPR23).

Python 919 72 Updated May 12, 2024
Python 2,455 296 Updated May 19, 2024

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Python 10,381 754 Updated May 30, 2024

Instant voice cloning by MyShell.

Python 26,994 2,611 Updated Jun 24, 2024

如何将ChatGPT调教成一只猫娘

2,620 154 Updated Jul 18, 2023

开源SFT数据集整理,随时补充

380 29 Updated Jun 2, 2023

A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt 收录各种各样的指令数据集, 用于训练 ChatLLM 模型。

420 19 Updated Apr 7, 2024

用于训练中英文对话系统的语料库 Datasets for Training Chatbot System

Python 2,015 500 Updated Sep 23, 2020

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

HTML 7,696 744 Updated Mar 15, 2024

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,175 220 Updated Jun 18, 2024