Stars
An evolving how-to guide for securing a Linux server.
Homework and templates for the LaTeX course at the Higher School of Economics
Linux virtual machines, with a focus on running containers
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
An expanded version of the previously released Kazakh text-to-speech (KazakhTTS) synthesis corpus. In KazakhTTS2, the overall size has increased from 93 hours to 271 hours, the number of speakers h…
Book and code for Think Complexity, 2nd edition
Test page for Chrome and Firefox screen / desktop capture and share feature, using WebRTC and node.js with a websocket for Peer-to-peer transport. WebRTC and JSEP used for offer / answer PeerConnec…
The first industrial-scale open-source Kazakh speech corpus. The KSC2 corpus subsumes two previously introduced corpora, KSC and KazakhTTS2, and supplements them with additional data from other sources. KSC2 …
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Awesome list of TTS papers with audio samples
Lecture notes and code for the Machine Learning practical course at CMC MSU
VQ-VAE for Acoustic Unit Discovery and Voice Conversion
PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]
An implementation of SkipVQVC with various settings.
An attempt at tracking the state of the art and recent results (bibliography) in speech recognition.