An open-source compound AI toolchain for fast and accurate entity matching, powered by LLMs.
-
Updated
Jul 8, 2024 - Python
Entity resolution (also known as data matching, data linkage, record linkage, and many other terms) is the task of finding entities in a dataset that refer to the same entity across different data sources (e.g., data files, books, websites, and databases). Entity resolution is necessary when joining different data sets based on entities that may or may not share a common identifier (e.g., database key, URI, National identification number), which may be due to differences in record shape, storage location, or curator style or preference.
An open-source compound AI toolchain for fast and accurate entity matching, powered by LLMs.
Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.
Code for the paper "Match, Compare, or Select? An Investigation of Large Language Models for Entity Matching"
An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.
Libem sample datasets.
A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning
Entity resolution for Elasticsearch.
Libem notebooks.
CERTA - Computing Entity Resolution explanations with TriAngles
Fair Entity Matching: A Fairness Suite for Auditing Entity Matching Approaches
Entity Matching Model solves the problem of matching company names between two possibly very large datasets.
An exploration of generalizable approaches to unsupervised entity matching for use in linking tabular public energy data sources.
An open source, high scalability toolkit in Java for Entity Resolution.
JOBSKAPE: A Framework for Generating Synthetic Job Postings to Enhance Skill Matching
MetaSRA: normalized sample-specific metadata for the Sequence Read Archive
Code and data for the paper "Bridging the Gap between Reality and Ideality of Entity Matching: A Revisiting and Benchmark Re-Construction"
CLK hash: hash pii for entity matching
Entity Matching specific Explanation tool. Landmark generates reliable and coherent explanations through a perturbation analysis.
AdapterEM: Pre-trained Language Model Adaptation for Generalized Entity Matching using Adapter-tuning
Created by Halbert L. Dunn
Released 1946