Skip to content
View ClemDoum's full-sized avatar
  • France

Block or report ClemDoum

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。

Python 19,267 1,375 Updated Nov 30, 2024

Github Action for building executables with Pyinstaller

Shell 177 71 Updated Oct 24, 2024

Iterative JSON parser with Pythonic interfaces

Python 852 51 Updated Nov 26, 2024

Embed Python in Java

C 1,337 152 Updated Nov 25, 2024

Run periodic jobs in PostgreSQL

C 2,902 195 Updated Aug 22, 2024

AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…

Python 17,922 1,929 Updated Nov 29, 2024

BAML is a language that helps you get structured data from LLMs, with the best DX possible. Works with all languages. Check out the promptfiddle.com playground

Rust 1,417 51 Updated Nov 29, 2024

[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diverse strengths of multiple open-source LLMs. LLM-Blender cut …

Python 891 79 Updated Oct 22, 2024

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 34,225 5,808 Updated Nov 28, 2024

🍁 Sycamore is an LLM-powered search and analytics platform for unstructured data.

Python 399 46 Updated Nov 28, 2024

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!

Python 3,300 250 Updated Aug 10, 2024

A maximum-strength name parser for record linkage.

Python 34 Updated Aug 8, 2024

A Flexible Deep Learning Approach to Fuzzy String Matching

Jupyter Notebook 140 34 Updated Oct 16, 2024

A comprehensive and scalable set of string tokenizers and similarity measures in Python

Python 137 16 Updated Jul 17, 2024

Python package for performing Entity and Text Matching using Deep Learning.

Python 568 130 Updated Jun 18, 2024

An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.

Python 74 11 Updated Nov 11, 2024

List of entity resolution software and resources.

39 2 Updated Mar 2, 2024

Demonstrate integration of Senzing and Neo4j to construct an Entity Resolved Knowledge Graph

26 6 Updated Aug 14, 2024

Python implementation of TextRank algorithms ("textgraphs") for phrase extraction

Python 2,153 333 Updated Jul 16, 2024

Map ICIJ format to Senzing format.

Python 4 Updated Aug 22, 2024

spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking

Python 85 23 Updated Oct 6, 2022

Tutorials for Entity Resolved Knowledge Graphs

Jupyter Notebook 4 1 Updated Oct 19, 2024

Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!

Rust 4,850 338 Updated Nov 29, 2024

Metal I/O library for Rust.

Rust 6,398 739 Updated Nov 29, 2024

Rust async runtime based on io-uring.

Rust 4,035 226 Updated Nov 29, 2024

Neo4j GraphRAG for Python

Python 244 39 Updated Nov 26, 2024

A Python wrapper for Google Tesseract

Python 5,892 723 Updated Nov 22, 2024

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 9,018 572 Updated Apr 16, 2024

No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents

Python 2,513 152 Updated Nov 26, 2024

Delightful io_uring packages and resources

349 16 Updated Feb 13, 2024
Next