TomerAberbach / wikipedia-ngrams (Stars: 3)
📚 A Kotlin project which extracts ngram counts from Wikipedia data dumps.
Topics: kotlin nlp cli wikipedia ngram ngrams wikipedia-dump wikipedia-corpus wikiextractor wikipedia-data-dump extracts-ngram-counts wikipedia-ngrams
Updated Jul 3, 2023 · Kotlin
jeeveshkataria / WikipediaSearchEngine (Stars: 0)
A mini Wikipedia search engine built on the 40 GB Wikipedia data dump from 2020. Results are retrieved in less than a second.
Topics: search-engine information-retrieval corpus indexing information-extraction tf-idf nlp-machine-learning ranking-algorithm external-merge-sort wikipedia-data-dump
Updated Sep 28, 2020 · Python