Stars
pix2tex: Using a ViT to convert images of equations into LaTeX code.
SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.
Compare neural networks by their feature similarity
Self-Supervised Speech Pre-training and Representation Learning Toolkit
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processin…
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
ASPEN is a full Python toolkit to generate "Auditory Stimulus for Psychophysical ExperimeNt"