-
Intenginetech
- Beijing
- https://binglel.top
Block or Report
Block or report binglel
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
vits2 backbone with multilingual-bert
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
AI powered speech denoising and enhancement
Audio Normalization for Python/ffmpeg
draw.io is a JavaScript, client-side editor for general diagramming.
🦋 A Hexo Theme: Butterfly
Command line utility for forced alignment using Kaldi
MobileNet trained with VoxCeleb dataset and used for voice verification
simple speech enhancement with librosa
A statistical model-based Voice Activity Detector
mnist data training and testing with back propagation
Speech Recognition with DFCNN and Transformer
Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
Python implementation of Text-Image-Augmentation
SRZoo: An integrated repository for super-resolution using deep learning
Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.
Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.
A set of speech feature extraction functions for ASR and speaker identification written in matlab.
The most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。