Skip to content

yzhouum05/Text-Mining

Repository files navigation

Text Mining

All data are provided by University of Michigan.

  • Data Extraction using Regex extracts dates from messy medical records, stored in 'dates.txt'.

  • Spelling Recommender

    • First examines the linguistic characteristics of a novel, Moby Dick ('moby.txt') with NLTK
    • Then develops spelling recommenders based on 3 different similarity measures (Jaccard Distance on Trigram, Jaccard Distance on 4-gram and Edit Distance).

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published