Lists (1)
Sort Name ascending (A-Z)
Stars
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
AgentTuning: Enabling Generalized Agent Abilities for LLMs
Chinese Financial Assistant Benchmark for Large Language Model
agentUniverse is a LLM multi-agent framework that allows developers to easily build multi-agent applications.
Basaran is an open-source alternative to the OpenAI text completion API. It provides a compatible streaming API for your Hugging Face Transformers-based text generation models.
OCR, layout analysis, reading order, line detection in 90+ languages
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts…
本项目旨在收集开源的表格智能任务数据集(比如表格问答、表格-文本生成等),将原始数据整理为指令微调格式的数据并微调LLM,进而增强LLM对于表格数据的理解,最终构建出专门面向表格智能任务的大型语言模型。
Reference implementation for DPO (Direct Preference Optimization)
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
[ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
A framework for few-shot evaluation of language models.
Automatically split your PyTorch models on multiple GPUs for training & inference
[ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Automatically evaluate your LLMs in Google Colab
[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark
CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. 【基于 PyTor…
ai副业赚钱大集合,教你如何利用ai做一些副业项目,赚取更多额外收益。The Ultimate Guide to Making Money with AI Side Hustles: Learn how to leverage AI for some cool side gigs and rake in some extra cash. Check out the English versi…