Starred repositories
Data-Copilot: Bridging Billions of Data and Humans with Autonomous Workflow
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
A high-throughput and memory-efficient inference and serving engine for LLMs
An open-source RAG-based tool for chatting with your documents.
Contextual Harnessing for Efficient SQL Synthesis
Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
Neo4j graph construction from unstructured data using LLMs
Chat-based SQL Client and Editor for the next decade
A modular graph-based Retrieval-Augmented Generation (RAG) system
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation,…
[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction
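An open-source knowledgeable large language model framework.
YAYI information extraction LLM: instruction-tuned on a million-scale set of manually constructed, high-quality information extraction data, developed by the 中科闻歌 (Zhongke Wenge) algorithm team. (Repo for YAYI Unified Information Extraction Model)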
Open-source vector similarity search for Postgres
Question and Answer based on Anything.
Convert PDF to markdown quickly with high accuracy
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts…
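This project aims to collect open-source datasets for table intelligence tasks (such as table question answering and table-to-text generation), convert the raw data into instruction-tuning format, and fine-tune LLMs on it, thereby strengthening LLMs' understanding of tabular data and ultimately building a large language model dedicated to table intelligence tasks.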
Awesome OCR toolkits for multiple programming languages, based on ONNXRuntime, OpenVINO and PaddlePaddle. (The PaddleOCR models are converted to run inference with ONNXRuntime, which is fast.)
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
A command-line installer for Windows.