An Open-Source Python3 tool for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conv…

Jupyter Notebook 1,806 176 Updated Aug 19, 2024

This project aims to automatically build a high-quality Chinese text-correction corpus from text-based Chinese PDF documents.

Python 6 1 Updated Aug 17, 2024

🥚 Transform PDF to JSON or Markdown with ease and speed 🐣

Python 572 54 Updated Sep 27, 2024

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 12,078 847 Updated Sep 13, 2024

🔥🔥🔥AI-driven database tool and SQL client, the hottest GUI client, supporting MySQL, Oracle, PostgreSQL, DB2, SQL Server, SQLite, H2, ClickHouse, and more.

Java 14,962 1,673 Updated Sep 25, 2024

A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.

Python 11,651 877 Updated Sep 27, 2024

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 4,919 335 Updated Sep 20, 2024

meta-comprehensive-rag-benchmark-kdd-cup-2024 phase1 task1 rank3

Python 11 4 Updated Jun 21, 2024

homer javaagent

Java 9 1 Updated Oct 19, 2021

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

Python 4,701 468 Updated Sep 25, 2024

PDF GPT allows you to chat with the contents of your PDF file by using GPT capabilities. The most effective open source solution to turn your PDF files into a chatbot!

Python 6,923 833 Updated Jan 23, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 17,587 1,681 Updated Sep 26, 2024

A collection of AIGC-interview/CV-interview/LLMs-interview questions and answers, along with new ideas, new questions, new resources, and new projects from work and research.

3 1 Updated Aug 6, 2024
Python 57 2 Updated Sep 18, 2024

Lightweight, performant, deep table extraction

Python 270 17 Updated Sep 23, 2024

TF-ID: Table/Figure IDentifier for academic papers

Python 212 9 Updated Jul 12, 2024

Distributed LLM and StableDiffusion inference for mobile, desktop and server.

Rust 2,494 131 Updated Aug 30, 2024

Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.

Python 1,392 158 Updated Sep 25, 2024

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Python 31,537 3,879 Updated Sep 26, 2024

ChatPilot: Chat Agent Web UI, a chat front end that supports Google search, chat over files and URLs (RAG), and a code interpreter; it reproduces Kimi Chat (drag in a file; send a URL).

Svelte 479 47 Updated Sep 24, 2024

A Python wrapper for the Doc2X API that also comes with native PDF processing (to improve PDF recall in RAG).

Python 178 10 Updated Sep 12, 2024

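cube studio: an open-source, cloud-native, one-stop machine learning / deep learning / large-model AI platform, supporting SSO login, multi-tenancy, big data platform integration, online notebook development, drag-and-drop pipeline orchestration, multi-node multi-GPU distributed training, hyperparameter search, inference serving with vGPU, edge computing, serverless, an annotation platform, automated annotation, dataset management, large-model fine-tuning, vLLM large-model inference, LLMOps, private knowledge bases, an AI model app store, one-click model development/inference/fine-tuning,…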

Jupyter Notebook 3,504 619 Updated Sep 24, 2024

SEED-Story: Multimodal Long Story Generation with Large Language Model

Python 707 55 Updated Jul 29, 2024

A project for processing neural networks and rendering to gain insights on the architecture and parameters of a model through a decluttered representation.

Python 1,073 185 Updated Jan 6, 2024

Python scraper based on AI

Python 14,594 1,193 Updated Sep 27, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model approaching GPT-4o performance.

Python 5,616 438 Updated Sep 19, 2024

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

HTML 8,599 703 Updated Sep 27, 2024

An experimental UI for text-to-knowledge-graph generation

HTML 727 56 Updated May 2, 2024

This is the code for our KILT leaderboard submissions (KGI + Re2G models).

Python 147 14 Updated May 16, 2023

See at a glance when a job posting was last modified: green means within 2 weeks, dark orange within 1.5 months, red more than 1.5 months.

JavaScript 1,043 53 Updated Jun 10, 2024