Stars
Improved Wave-U-Net implemented in PyTorch
recursal / GoldFinch-paper
Forked from SmerkyG/GoldFinch-paper
GoldFinch and other hybrid transformer components
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model
This is the official implementation of the SEMamba paper.
Yet another PyTorch implementation of Stable Diffusion (probably easy to read)
Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in PyTorch.
PyTorch implementation of Conformer: Convolution-augmented Transformer for Speech Recognition
PyTorch implementation of Conformer with a training script for end-to-end speech recognition on the LibriSpeech dataset.
Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation"
Official repository for RawNet, RawNet2, and RawNet3
High-level Deep Learning Framework written in Kotlin and inspired by Keras
AuraSR: GAN-based Super-Resolution for real-world
In this blog, we will build a small scale text-to-video model from scratch. We will input a text prompt, and our trained model will generate a video based on that prompt.
afiaka87 / dalle-pytorch
Forked from lucidrains/DALLE-pytorch
Text to Image Transformer in PyTorch
ShinkaiGAN is an image-to-image translation model designed to transform sketch images into beautiful anime scenes inspired by the style of Makoto Shinkai. This model utilizes a Hybrid Perception Bl…
An unofficial PyTorch implementation of StreamVC (Real-Time Low-Latency Voice Conversion)
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INTERSPEECH 2020)
PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)
Phase-aware speech enhancement with Deep Complex U-Net
PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)
Pytorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018.
PyTorch implementation of RNN-Transducer (RNN-T).
A PyTorch Implementation of "Attention Is All You Need"
PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"