Skip to content
View ndamulelonemakh's full-sized avatar
🍸
Solution explorer
🍸
Solution explorer
Block or Report

Block or report ndamulelonemakh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

NLP for All

NLP projects aimed at improving inclusiveness and accessibility in AI
35 repositories

Natural Language Toolkit for Indic Languages aims to provide out of the box support for various NLP tasks that an application developer might need

Python 811 164 Updated Jan 20, 2024

Machine Translation for Africa

Lua 271 206 Updated Jun 14, 2022

AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages

Python 63 20 Updated May 31, 2022

Data, Embeddings, Stopword lists, code, and baselines for COLING 2020 paper titled "KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text Classification for Kinyarwanda and Kirundi" by Rubungo Andre…

Python 12 4 Updated Apr 26, 2024

A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.

84 20 Updated Apr 26, 2024

The dataset contains editions from the South African government magazine Vuk'uzenzele. Data was scraped from PDFs that have been placed in the data/raw folder. The PDFS were obtained from the Vuk'u…

Jupyter Notebook 6 4 Updated Dec 6, 2023

Facebook Low Resource (FLoRes) MT Benchmark

Python 678 123 Updated Nov 20, 2023

Code + data for the EMNLP'20 publication "Transfer Learning and Distant Supervision for Multilingual Transformer Models: A Study on African Languages"

Python 3 5 Updated Dec 16, 2021
Jupyter Notebook 16 5 Updated Jan 12, 2023

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

Python 6,609 760 Updated Aug 24, 2023

Simple, fast unsupervised word aligner

C++ 728 159 Updated Jul 19, 2022

The data set contains cabinet statements from the South African government. Data was scraped from the governments website: https://www.gov.za/cabinet-statements

Jupyter Notebook 4 Updated May 10, 2024

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Python 10,806 1,159 Updated Jun 30, 2023

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,180 4,018 Updated Jul 17, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 35,808 4,405 Updated Jul 18, 2024

Building an effective preprocessing tool for African languages

Jupyter Notebook 11 10 Updated Jan 24, 2024

Tools to download and cleanup Common Crawl data

Python 938 139 Updated Apr 25, 2023

A multilingual parallel corpus created from translations of the Bible.

169 48 Updated Jun 17, 2024

StableLM: Stability AI Language Models

Jupyter Notebook 15,850 1,036 Updated Apr 8, 2024

Topic Inference with Zeroshot models

Python 61 7 Updated Jun 12, 2023

The official gpt4free repository | various collection of powerful language models

Python 59,361 13,185 Updated Jul 15, 2024

Curated corpora for Setswana. Used to train PuoBERTa.

2 Updated Oct 26, 2023

POS for African languages

Jupyter Notebook 16 19 Updated Feb 5, 2024
Jupyter Notebook 98 51 Updated Dec 19, 2023

MasakhaNEWS: News Topic Classification for African Languages

Python 14 16 Updated May 12, 2024

MAFAND-MT

Jupyter Notebook 52 25 Updated Jul 9, 2024

Official style files for papers submitted to venues of the Association for Computational Linguistics

TeX 636 168 Updated May 20, 2024

The official Meta Llama 3 GitHub site

Python 23,385 2,510 Updated Jul 17, 2024