- Kazan, Tatarstan, Russia
- https://www.corpus.tatar/en
Stars
The home of the Unicode Common Locale Data Repository
MaSS - Multilingual corpus of Sentence-aligned Spoken utterances
Dockerfile best practices for writing production-worthy Docker images.
A repo listing known open source voice tools, ordered by where they sit in the voice stack
Veʹrdd is an open-source dictionary editing framework with a focus on low-resource and endangered languages. The framework is mainly built to facilitate collecting, importing, editing and export…
An NLP library for Uralic languages such as Finnish, Skolt Sami and Moksha. It also supports some non-Uralic languages such as Spanish, French, Arabic, Swedish, Norwegian, Russian and English.
Tools for the 3rd edition of the Constraint Grammar formalism.
The Docker Bench for Security is a script that checks for dozens of common best practices around deploying Docker containers in production.
Forced Alignments for Common Voice
This is an open-source book on deep learning.
📚 Freely available programming books
List of semantic domains from semdom.org, version 4, for JavaScript
Modules to convert numbers to words. 42 --> forty-two
🐢 🌎 📚 a community-owned language-learning platform
Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2
ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with diverse types of annotation.
A Vim plugin to display the indentation levels with thin vertical lines
🔦 [Vim script] JSX and TSX syntax pretty highlighting for vim.
Fork of the master branch from git:https://git.slackbuilds.org/slackbuilds.git (read more on the wiki). If you want to fork or open a pull request, do it only against master (the other branches are temporary and are always re…
Tool for creation, manipulation and maintenance of voice corpora
A memory-based morphological parser for Python