
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

202 4 Updated Jun 16, 2024
Python 2 Updated Apr 23, 2024

[ICML 2024] CLLMs: Consistency Large Language Models

Python 324 14 Updated Jul 25, 2024

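RAGOnMedicalKG: combines LLM-based RAG with a knowledge graph (KG) to build a demo-level question-answering system, intended to illustrate the basic approach.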

Python 149 23 Updated Mar 31, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 35,913 4,412 Updated Jul 25, 2024

Generate high-definition short videos with one click using large AI models (LLMs).

Python 15,301 2,358 Updated Jul 26, 2024

Build deep learning applications in a new and easy way.

Python 236 21 Updated Dec 8, 2022

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 35,177 5,435 Updated Jul 19, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 8,809 805 Updated Jul 1, 2024

PromptCBLUE: a large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain in Chinese

Python 301 30 Updated Jan 23, 2024

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts…

Jupyter Notebook 2,533 242 Updated Dec 12, 2023

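This project collects open-source datasets for table intelligence tasks (such as table question answering and table-to-text generation), converts the raw data into instruction-tuning format, and fine-tunes LLMs on it to strengthen their understanding of tabular data, with the goal of building a large language model dedicated to table intelligence tasks.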

357 30 Updated Apr 22, 2024

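YAYI information extraction LLM: instruction-tuned on million-scale, manually constructed, high-quality information extraction data; developed by the 中科闻歌 (Zhongke Wenge) algorithm team. (Repo for YAYI Unified Information Extraction Model)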

229 7 Updated Mar 28, 2024

Talk to any ArXiv paper using ChatGPT

TypeScript 486 27 Updated Jan 16, 2024

Make LLMs Easily

Python 1 Updated Jan 26, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 65,152 7,616 Updated Jul 22, 2024

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 14,443 2,557 Updated Jul 21, 2024

A curated list of practical guide resources of Medical LLMs (Medical LLMs Tree, Tables, and Papers)

805 70 Updated Jul 12, 2024

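A collection of industry-practice articles on search, recommendation, advertising, user growth, and related topics (sources: Zhihu, Datafuntalk, and technical WeChat official accounts).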

Python 1,913 256 Updated Jul 27, 2024

This project is an LLM application development tutorial for beginner developers. Read it online at https://datawhalechina.github.io/llm-universe/

Jupyter Notebook 3,907 504 Updated Jul 27, 2024

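A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and are cheaper to train; covers base models, domain-specific fine-tuning and applications, datasets, and tutorials.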

13,729 1,266 Updated Jul 21, 2024

Extracts text subtitle tracks from mkv files and provides a search interface.

Python 1 Updated Jul 8, 2013

Locating and editing factual associations in GPT (NeurIPS 2022)

Python 531 113 Updated Apr 20, 2024

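Wenda (闻达): an LLM invocation platform. It targets efficient content generation for specific environments, while accounting for the limited computing resources of individuals and small and medium-sized enterprises as well as knowledge security and privacy concerns.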

JavaScript 6,209 810 Updated Jul 26, 2024

GPT4 & LangChain Chatbot for large PDF docs

TypeScript 14,767 2,998 Updated Mar 25, 2024

Code for fine-tuning ChatGLM-6b using low-rank adaptation (LoRA)

Jupyter Notebook 722 63 Updated Jul 18, 2023

Chinese LLaMA & Alpaca large language models (LLMs), with local CPU/GPU training and deployment

Python 17,985 1,841 Updated Apr 30, 2024

Paper List for In-context Learning 🌷

771 57 Updated Jul 7, 2024

Chinese document classification with LayoutLMv3 and LayoutXLM

Python 34 5 Updated Oct 25, 2022

A PyTorch implementation of Global Pointer, a unified approach to nested and non-nested NER

Python 364 45 Updated Mar 23, 2023