Haixing-Hu

Stars

LLM

57 repositories

Code and model for the paper "Improving Language Understanding by Generative Pre-Training"

Python 2,160 502 Updated Jan 25, 2019

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Python 8,272 827 Updated Nov 14, 2024
Python 3,481 406 Updated May 17, 2024

Let ChatGPT teach your own chatbot in hours with a single GPU!

Python 3,167 285 Updated Mar 17, 2024

Examples and guides for using the OpenAI API

MDX 59,791 9,530 Updated Nov 13, 2024

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 94,896 15,369 Updated Nov 15, 2024

LlamaIndex is a data framework for your LLM applications

Python 36,744 5,270 Updated Nov 15, 2024

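text2vec, text to vector. A text vector representation tool that converts text into vector matrices; implements text representation and text similarity models including Word2Vec, RankBM25, Sentence-BERT, and CoSENT, ready to use out of the box.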

Python 4,500 400 Updated Oct 27, 2024

Chatbot for documentation that allows you to chat with your data. Privately deployable, provides AI knowledge sharing, and integrates knowledge into your AI workflow.

Python 15,016 1,598 Updated Nov 15, 2024

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Python 5,832 476 Updated Jul 11, 2024

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 12,392 844 Updated Oct 3, 2024

Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.

Python 6,316 1,278 Updated Aug 31, 2024

LLM inference in C/C++

C++ 67,882 9,734 Updated Nov 16, 2024

Python bindings for llama.cpp

Python 8,119 966 Updated Nov 16, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 35,464 4,119 Updated Nov 15, 2024

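Use ChatGPT to summarize arXiv papers. Speeds up the whole research workflow: uses ChatGPT for full-paper summarization, professional translation, polishing, peer review, and review responses.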

Python 18,486 1,936 Updated Apr 4, 2024

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

HTML 9,132 754 Updated Nov 15, 2024

clueai toolkit: customize the API you need with 3 lines of code in 3 minutes!

Python 231 32 Updated Apr 29, 2023

Label Studio is a multi-type data labeling and annotation tool with standardized output format

JavaScript 19,339 2,400 Updated Nov 15, 2024

A repo created to illustrate the usage of transformers in Chinese

Shell 2,328 400 Updated Aug 18, 2024

An introductory LLM tutorial for developers; the Chinese edition of Andrew Ng's large-model course series

Jupyter Notebook 11,919 1,484 Updated Sep 15, 2024

HuggingLLM, Hugging Future.

Jupyter Notebook 2,782 362 Updated Jun 27, 2024

A guidance language for controlling large language models.

Jupyter Notebook 19,098 1,042 Updated Nov 11, 2024

Library for fast text representation and classification.

HTML 25,945 4,717 Updated Mar 22, 2024

Pre-Training with Whole Word Masking for Chinese BERT (the Chinese BERT-wwm model series)

Python 9,689 1,385 Updated Jul 31, 2023

Language Technology Platform

Python 4,965 1,041 Updated Oct 12, 2024

Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment

Python 18,388 1,866 Updated Apr 30, 2024

Prompt framework made to optimise conversations with LLMs.

12 1 Updated May 30, 2023

🐙 Guides, papers, lectures, notebooks and resources for prompt engineering

MDX 50,232 4,864 Updated Oct 28, 2024

Inference code for Llama models

Python 56,427 9,569 Updated Aug 18, 2024