Skip to content

Record Linkage project that uses Jaro-Winkler to find similar names across three sources

Notifications You must be signed in to change notification settings

SherazT/RecordLinkage

Repository files navigation

Data Engineering Intern Project Applecart

To test run pytest which will test test_input.csv - this csv includes the provided example in the instructions.md file.

To run with people.csv and return result results_final_applecart.csv please run python3 solution.py.

For your convenience and to save time the output file results_final_applecart.csv has already been provided in the root directory - it was run with the input file being people.csv. All matches are sorted by first_name and last_name.

About

Record Linkage project that uses Jaro-Winkler to find similar names across three sources

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages