Amharic English Machine Translation Corpus prepared through website crawelling and custom preprocessing.
-
Updated
Aug 2, 2018 - Python
Amharic English Machine Translation Corpus prepared through website crawelling and custom preprocessing.
An Amharic News Text classification Dataset
A toolset for Amharic Language pre-processing. Includes an Amharic Stemmer, Transliterator, Stopword remover , Lexical analyzer, Corpus indexer and Term weighter.
Amharic Spelling Corrector based on SymSpell - Spelling corrector which is 1 million times faster through Symmetric Delete spelling correction algorithm
simple bs4 based web crawl for a corpus in need of statistical machine translation
The set of files used for the development of the Amharic Corpus.
This repository contains implementations of various Natural Language Processing (NLP) tasks and tools specifically for the Amharic language using Java. The goal is to provide a comprehensive set of tools to facilitate NLP research and development for Amharic.
This repository contains the MSc thesis project titled "A Generic Multitask Summarizer for Amharic Text Documents". The project addresses the challenges of information overload and automatic text analysis by providing a versatile and parameterizable framework for extractive text summarization.
k`wat is collection of Amharic datasets includes peoples names, postcodes, tweets
Add a description, image, and links to the amharic-corpus topic page so that developers can more easily learn about it.
To associate your repository with the amharic-corpus topic, visit your repo's landing page and select "manage topics."