Phonemized MasakhaNER.

MasakhaNER data phonemized with epitran, for experiments.

data/swa is the original Swahili data, unchanged.
data/swa_no_word_boundaries is data/swa edited to have character-level labels.
data/swa_phonemes is data/swa with epitran applied to each word.
data/swa_phonemes_no_word_boundaries is data/swa_phonemes edited to have character-level labels.
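
The two transformations above can be sketched roughly as follows. This is a minimal illustration, not the script used to build the data. The function names are hypothetical, and the labeling rule (the first character of a `B-X` word keeps `B-X`, the remaining characters become `I-X`) is an assumption, since this README does not say how word-level tags were spread over characters. The epitran language code `swa-Latn` is also assumed.

```python
from typing import Callable, List, Tuple

Token = Tuple[str, str]  # (word, NER label), e.g. ("Juma", "B-PER")


def phonemize_tokens(tokens: List[Token],
                     transliterate: Callable[[str], str]) -> List[Token]:
    """Apply a word-level transliteration function, keeping each label."""
    return [(transliterate(word), label) for word, label in tokens]


def to_character_labels(tokens: List[Token]) -> List[Token]:
    """Expand word-level labels to one label per character.

    Assumption: a B-X word yields B-X on its first character and I-X on
    the rest; every other label is copied unchanged to each character.
    """
    out: List[Token] = []
    for word, label in tokens:
        for i, ch in enumerate(word):
            if i > 0 and label.startswith("B-"):
                out.append((ch, "I-" + label[2:]))
            else:
                out.append((ch, label))
    return out


def make_epitran_transliterator(code: str = "swa-Latn") -> Callable[[str], str]:
    """Return epitran's transliterate method (requires `pip install epitran`)."""
    import epitran  # third-party; imported lazily so the rest runs without it
    return epitran.Epitran(code).transliterate
```

With epitran installed, `phonemize_tokens(tokens, make_epitran_transliterator())` would give the word-level phonemized tokens, and passing that result to `to_character_labels` would give the character-level variant.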

We did the same for Kinyarwanda.
