-
Mungana AI
- Pretoria
-
15:08
(UTC -12:00) - https://www.linkedin.com/in/ndamulelonemakhavhani/
- @NdamuleloNemakh
- @[email protected]
- https://credly.com/users/ndamulelo-nemakhavhani
Block or Report
Block or report ndamulelonemakh
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseNLP for All
Natural Language Toolkit for Indic Languages aims to provide out of the box support for various NLP tasks that an application developer might need
AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages
Data, Embeddings, Stopword lists, code, and baselines for COLING 2020 paper titled "KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text Classification for Kinyarwanda and Kirundi" by Rubungo Andre…
A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.
The dataset contains editions from the South African government magazine Vuk'uzenzele. Data was scraped from PDFs that have been placed in the data/raw folder. The PDFS were obtained from the Vuk'u…
Facebook Low Resource (FLoRes) MT Benchmark
Code + data for the EMNLP'20 publication "Transfer Learning and Distant Supervision for Multilingual Transformer Models: A Study on African Languages"
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
The data set contains cabinet statements from the South African government. Data was scraped from the governments website: https://www.gov.za/cabinet-statements
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
Code and documentation to train Stanford's Alpaca models, and generate the data.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Building an effective preprocessing tool for African languages
Tools to download and cleanup Common Crawl data
A multilingual parallel corpus created from translations of the Bible.
StableLM: Stability AI Language Models
The official gpt4free repository | various collection of powerful language models
MasakhaNEWS: News Topic Classification for African Languages
Official style files for papers submitted to venues of the Association for Computational Linguistics