- St.Petersburg, Russia
- http:https://hci.nw.ru/en/employees/14
Block or Report
Block or report ElenaRyumina
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Read articles, explore effectiveness metrics for speech enhancement methodologies. Seamlessly integrate code implementations for better understanding, and stay at the forefront of advances in speec…
AAAI 2024 Papers: Explore a comprehensive collection of innovative research papers presented at one of the premier artificial intelligence conferences. Seamlessly integrate code implementations for…
FG 2024 Papers: Explore a comprehensive collection of research papers presented at one of the premier conferences on automatic face and gesture recognition. Seamlessly integrate code implementation…
YOLOv8 re-implementation for human detection using PyTorch
MiVOLO age & gender transformer neural network
Data manipulation and transformation for audio signal processing, powered by PyTorch
Foundational model for human-like, expressive TTS
Efficient face emotion recognition in photos and videos
EMNLP 2023 Papers: Explore cutting-edge research from EMNLP 2023, the premier conference for advancing empirical methods in natural language processing. Stay updated on the latest in machine learni…
WACV 2024 Papers: Discover cutting-edge research from WACV 2024, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ suppo…
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ suppo…
Implementation of ViViT: A Video Vision Transformer
Robust Speech Recognition via Large-Scale Weak Supervision
CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest developments in computer vision and deep learning. Code inclu…
The repository provides links to collections of influential and interesting research papers from top AI conferences, with open-source code to promote reproducibility and provide detailed implementa…
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal proce…
INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code …
Online translation as a Python module & command line tool. No key, no authentication needed.
AI-based assessment and forecasting of the state of complex technical objects.
A module for creating 3D ResNets with different depths and additional features.
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
The official PyTorch implementation of Towards Fast, Accurate and Stable 3D Dense Face Alignment, ECCV 2020.
MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversation