Entity matching on the DBLP-ACM dataset
-
Updated
Jan 10, 2023 - Python
Entity resolution (also known as data matching, data linkage, record linkage, and many other terms) is the task of finding entities in a dataset that refer to the same entity across different data sources (e.g., data files, books, websites, and databases). Entity resolution is necessary when joining different data sets based on entities that may or may not share a common identifier (e.g., database key, URI, National identification number), which may be due to differences in record shape, storage location, or curator style or preference.
Entity matching on the DBLP-ACM dataset
Research Article Matching Methods using Attention, Hybrid, RNN, and SIF
Course project for CS839 Spring18 at UW-Madison
Libem notebooks.
Submission Repository for Data Science Class Project
A Winner-Take-All Hashing-Based Unsupervised Model for Entity Resolution Problems. [B. Sc. Thesis]
Performed entity matching on Album music data across two different (extracted) tables from metacritic.com and wikipedia.
Neoplasm Entity Recognition: matching disease names to ontology classes
Reproducibility experiments for Generalized Supervised Meta-blocking
Scalable record-level matching rules
Code for the paper "Match, Compare, or Select? An Investigation of Large Language Models for Entity Matching"
JOBSKAPE: A Framework for Generating Synthetic Job Postings to Enhance Skill Matching
4 stage data science project
Submissions for Data Science: Principles, Algorithms, and Applications (CS839) @ UW-Madison
AdapterEM: Pre-trained Language Model Adaptation for Generalized Entity Matching using Adapter-tuning
Entity-Match books from goodreads.com and bookdepository.com
utilities for working with Entity Resolution models
Performance Analysis of Entity Matching With Fuzzy Wuzzy on Fairness AI Articles
A collection of awesome resources regarding Record Linkage.
Created by Halbert L. Dunn
Released 1946