Skip to content
View ElenaRyumina's full-sized avatar
Block or Report

Block or report ElenaRyumina

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Stars

Showing results

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook 2,528 243 Updated Jul 1, 2024
Python 110 20 Updated Sep 18, 2023

Read articles, explore effectiveness metrics for speech enhancement methodologies. Seamlessly integrate code implementations for better understanding, and stay at the forefront of advances in speec…

Jupyter Notebook 12 Updated Apr 19, 2024

AAAI 2024 Papers: Explore a comprehensive collection of innovative research papers presented at one of the premier artificial intelligence conferences. Seamlessly integrate code implementations for…

Python 326 15 Updated May 30, 2024

FG 2024 Papers: Explore a comprehensive collection of research papers presented at one of the premier conferences on automatic face and gesture recognition. Seamlessly integrate code implementation…

7 1 Updated May 18, 2024

YOLOv8 re-implementation for human detection using PyTorch

Python 81 8 Updated Jan 11, 2024

MiVOLO age & gender transformer neural network

Python 280 47 Updated Apr 29, 2024

Auto-AVSR: Lip-Reading Sentences Project

Python 153 37 Updated Apr 16, 2024

Data manipulation and transformation for audio signal processing, powered by PyTorch

Python 2,442 643 Updated Jul 28, 2024

Foundational model for human-like, expressive TTS

Python 3,572 624 Updated Jul 21, 2024
Python 35 25 Updated Oct 10, 2023

Команды для работы в терминале

7 1 Updated Apr 11, 2023

Efficient face emotion recognition in photos and videos

Jupyter Notebook 627 119 Updated Jul 19, 2024

EMNLP 2023 Papers: Explore cutting-edge research from EMNLP 2023, the premier conference for advancing empirical methods in natural language processing. Stay updated on the latest in machine learni…

Python 87 4 Updated May 18, 2024

WACV 2024 Papers: Discover cutting-edge research from WACV 2024, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ suppo…

Python 69 9 Updated May 18, 2024

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Jupyter Notebook 7,490 1,051 Updated Jul 18, 2024

ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ suppo…

Python 904 39 Updated Jul 26, 2024

Implementation of ViViT: A Video Vision Transformer

Python 488 61 Updated Jun 21, 2021

Robust Speech Recognition via Large-Scale Weak Supervision

Python 65,202 7,625 Updated Jul 22, 2024

Russian speech technology links

168 10 Updated Feb 19, 2024

CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest developments in computer vision and deep learning. Code inclu…

Python 361 23 Updated Jul 15, 2024

The repository provides links to collections of influential and interesting research papers from top AI conferences, with open-source code to promote reproducibility and provide detailed implementa…

Python 83 2 Updated May 18, 2024

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal proce…

Python 309 15 Updated Jul 29, 2024

INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code …

617 42 Updated May 18, 2024

Online translation as a Python module & command line tool. No key, no authentication needed.

Python 720 155 Updated Apr 19, 2024

AI-based assessment and forecasting of the state of complex technical objects.

Python 4 1 Updated Jan 19, 2024

A module for creating 3D ResNets with different depths and additional features.

Python 8 3 Updated Oct 24, 2022

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Python 48,852 15,956 Updated Jul 29, 2024

The official PyTorch implementation of Towards Fast, Accurate and Stable 3D Dense Face Alignment, ECCV 2020.

Python 2,837 506 Updated Feb 2, 2024

MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversation

Python 770 198 Updated Mar 10, 2024