Counting word occurrence in hewiki dumps downloaded from https://dumps.wikimedia.org/hewiki/.
Using WikiExtractor to extract text from the XML dump,
parsing each article with regular expressions to strip it from any non-hebrew characters.
Finally, writes results to csv.
Added Python version in branch "python" for comparsion/benchmarking purposes.