Stars
A high-quality, one-stop, open-source data extraction tool for converting PDF to Markdown and JSON.
GitHub Action for building executables with PyInstaller
ICRAR / ijson
Forked from isagalaev/ijson. Iterative JSON parser with Pythonic interfaces
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…
BAML is a language that helps you get structured data from LLMs, with the best DX possible. Works with all languages. Check out the promptfiddle.com playground
[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diverse strengths of multiple open-source LLMs. LLM-Blender cut …
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
🍁 Sycamore is an LLM-powered search and analytics platform for unstructured data.
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!
A maximum-strength name parser for record linkage.
A Flexible Deep Learning Approach to Fuzzy String Matching
A comprehensive and scalable set of string tokenizers and similarity measures in Python
Python package for performing Entity and Text Matching using Deep Learning.
An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.
List of entity resolution software and resources.
Demonstrate integration of Senzing and Neo4j to construct an Entity Resolved Knowledge Graph
Python implementation of TextRank algorithms ("textgraphs") for phrase extraction
spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking
Tutorials for Entity Resolved Knowledge Graphs
Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!
Implementation of Nougat Neural Optical Understanding for Academic Documents
No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents