Stars
An Open-Source Python3 tool for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conv…
该项目致力于从中文文字版PDF文档中,自动化构建出高质量的中文文本纠错语料。
🥚 Transform PDF to JSON or Markdown with ease and speed 🐣
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
🔥🔥🔥AI-driven database tool and SQL client, The hottest GUI client, supporting MySQL, Oracle, PostgreSQL, DB2, SQL Server, DB2, SQLite, H2, ClickHouse, and more.
A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
A Comprehensive Toolkit for High-Quality PDF Content Extraction
meta-comprehensive-rag-benchmark-kdd-cup-2024 phase1 task1 rank3
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
PDF GPT allows you to chat with the contents of your PDF file by using GPT capabilities. The most effective open source solution to turn your pdf files in a chatbot!
A modular graph-based Retrieval-Augmented Generation (RAG) system
ArtificialZeng / DeepLearing-Interview-Awesome-2024
Forked from 315386775/DeepLearing-Interview-Awesome-2024AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓,同时包含工作和科研过程中的新想法、新问题、新资源与新项目
Distributed LLM and StableDiffusion inference for mobile, desktop and server.
Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
ChatPilot: Chat Agent Web UI,实现Chat对话前端,支持Google搜索、文件网址对话(RAG)、代码解释器功能,复现了Kimi Chat(文件,拖进来;网址,发出来)。
A python wrapper for the Doc2X API and comes with native PDF processing (to improve PDF recall in RAG). | Doc2X API的python封装,同时附带本地的PDF处理(提升PDF在RAG中的召回率)。
cube studio开源云原生一站式机器学习/深度学习/大模型AI平台,支持sso登录,多租户,大数据平台对接,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU,边缘计算,serverless,标注平台,自动化标注,数据集管理,大模型微调,vllm大模型推理,llmops,私有知识库,AI模型应用商店,支持模型一键开发/推理/微调,…
SEED-Story: Multimodal Long Story Generation with Large Language Model
A project for processing neural networks and rendering to gain insights on the architecture and parameters of a model through a decluttered representation.
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
An experimental UI for text-to-knowledge-graph generation
This is the code for our KILT leaderboard submissions (KGI + Re2G models).
一眼看出该职位最后修改时间,绿色为2周之内,暗橙色为1.5个月之内,红色为1.5个月以上