Lists (23)
Sort Name descending (Z-A)
videoGAN
useful
tts
try_on
tracking
time_series
segmentation
robotics
reinforcement_learning
reid
recommendation_systems
pose
optimizers
nlp_based
nlp
🚀 My stack
meta learning
✨ Inspiration
GNNS
gans
data_science
algorithms
3d_deep_learning
Starred repositories
💙 Programming language inspired by uzbek street tongue
Deezer source separation library including pretrained models.
kaldi-asr/kaldi is the official location of the Kaldi project.
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
Ongoing research training transformer models at scale
A multi-voice TTS system trained with an emphasis on quality
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
🔊 Text-Prompted Generative Audio Model
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Tutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101
Code repository for the paper "On the Benefits of 3D Pose and Tracking for Human Action Recognition", (CVPR 2023)
"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)
[ECCV 2020] SMAP: Single-Shot Multi-Person Absolute 3D Pose Estimation
Source Code for Paper "OrienterNet Visual Localization in 2D Public Maps with Neural Matching"
[CVPR 2023] BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects
An algorithm for reconstructing the radiance field of a large-scale scene from a single casually captured video.
Standardized Serverless ML Inference Platform on Kubernetes
Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app
Official Pytorch implementation of "Pose2Mesh: Graph Convolutional Network for 3D Human Pose and Mesh Recovery from a 2D Human Pose", ECCV 2020
[CVPR 2023] Learning Locally Editable Virtual Humans
General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX
Multi-Person 3D Pose and Shape Estimation via Inverse Kinematics and Refinement
Learning to Transfer Texture from Clothing Images to 3D Humans, CVPR 2020
CVPR 2022 - Official code repository for the paper: Accurate 3D Body Shape Regression using Metric and Semantic Attributes.
Reimplemented code for "Toward Characteristic-Preserving Image-based Virtual Try-On Network"