-
Mungana AI
- Pretoria
-
20:40
(UTC -12:00) - https://www.linkedin.com/in/ndamulelonemakhavhani/
- @NdamuleloNemakh
- @[email protected]
- https://credly.com/users/ndamulelo-nemakhavhani
Block or Report
Block or report ndamulelonemakh
Contact GitHub support about this userโs behavior. Learn more about reporting abuse.
Report abuseNLP Toolbox
Repository with all what is necessary for sentiment analysis and related areas
TensorFlow code and pre-trained models for BERT
๐ค Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java
A framework to learn cross-lingual word embedding mappings
๐ A curated list of resources dedicated to Natural Language Processing (NLP)
About Muti-Label Text Classification Based on Neural Network.
PyTorch original implementation of Cross-lingual Language Model Pretraining.
Swift Core ML 3 implementations of GPT-2, DistilGPT-2, BERT, and DistilBERT for Question answering. Other Transformers coming soon!
A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton
Algorithm for Topic Modelling and Semantic Search
The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.
Argilla is a collaboration platform for AI engineers and domain experts that require high-quality outputs, full data ownership, and overall efficiency.
TextAttack ๐ is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/
The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.
Library for fast text representation and classification.
Software in C and data files for the popular GloVe model for distributed word representations, a.k.a. word vectors or embeddings
A tool for extracting plain text from Wikipedia dumps
Top2Vec learns jointly embedded topic, document and word vectors.
Robust Speech Recognition via Large-Scale Weak Supervision
Open source annotation tool for machine learning practitioners.
`pdfstructure` detects, splits and organizes the documents text content into its natural structure as envisioned by the author.
Community maintained fork of pdfminer - we fathom PDF
Get semantic HTML from PDFs, recover lost text, tables, data... in bulk.
A system for quickly generating training data with weak supervision