Skip to content
View vladbataev's full-sized avatar
🎯
Focusing
🎯
Focusing
Block or Report

Block or report vladbataev

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

YaFSDP: Yet another Fully Sharded Data Parallel

Python 808 37 Updated Jul 29, 2024

Awesome speech/audio LLMs, representation learning, and codec models

536 26 Updated May 29, 2024

Deep learning for dummies. All the practical details and useful utilities that go into working with real models.

Python 220 16 Updated May 24, 2024

Reference-aware automatic speech evaluation toolkit

Python 83 5 Updated Feb 22, 2024

A developer's guide to management: an open-sourced handbook for leading software engineering teams.

1,540 94 Updated Jan 24, 2020

A Neural Framework for MT Evaluation

Python 462 74 Updated Jul 29, 2024

The strictest and most opinionated python linter ever!

Python 2,481 381 Updated Aug 10, 2024

dataset of podcasts and episodes

Python 14 3 Updated Jan 16, 2018

A Unified Library for Parameter-Efficient and Modular Transfer Learning

Jupyter Notebook 2,492 333 Updated Aug 10, 2024

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 10,647 1,032 Updated Aug 5, 2024

Versatile audio super resolution (any -> 48kHz) with AudioSR.

Python 1,033 104 Updated May 10, 2024

Machine Learning Engineering Open Book

Python 10,422 628 Updated Aug 9, 2024

Text-to-Audio/Music Generation

Python 2,178 174 Updated Jul 26, 2024

Voice Conversion With Just Nearest Neighbors

Python 435 64 Updated Mar 18, 2024

FTP client package for Go

Go 1,268 357 Updated Jul 30, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 24,554 3,533 Updated Aug 10, 2024

Noise supression using deep filtering

Python 2,256 213 Updated Jul 31, 2024

A Very Low-Bitrate Codec for Speech Compression

C++ 3,806 353 Updated May 20, 2024

Stable Diffusion inference benchmarks

Python 10 Updated Jun 14, 2024

A timeline of the latest AI models for audio generation, starting in 2023!

1,876 67 Updated Jan 4, 2024

Download your Spotify playlists and songs along with album art and metadata (from YouTube if a match is found).

Python 15,887 1,510 Updated Aug 4, 2024

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Python 2,342 250 Updated Jan 27, 2024

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 34,153 4,046 Updated Jul 10, 2024

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Python 2,360 224 Updated Jun 2, 2024

Contrastive Language-Audio Pretraining

Python 1,285 125 Updated Jul 9, 2024

Audio Dataset for training CLAP and other models

Python 608 53 Updated Feb 5, 2024

An open-source efficient deep learning framework/compiler, written in python.

Python 636 51 Updated Jul 30, 2024

Official Pytorch Implementation of the paper: Wavelet Diffusion Models are fast and scalable Image Generators (CVPR'23)

Python 361 28 Updated Jul 23, 2024

PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.

Python 700 47 Updated Aug 6, 2024
Next