An introductory guide to process textual data
Use the text preprocessing script to understand how you can remove noise from the text data that you're dealing with.
Use the text to features script to understand the different ways in which you can convert your text data to analyzable features.
The text_matching script includes codes for Levenshtein matching as well Cosine Matching. Levenshtein Matching - The number of changes that need to be made to document A to convert it to document B. Allowable transformations: Insertion, Deletion and single Substitution.