Stars
[AAAI2024] FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning
FastAPI Best Practices and Conventions we used at our startup
Speed up Stable Diffusion with this one simple trick!
This app classifies text generated by AI tools like ChatGPT. It uses the roberta-base-openai-detector model from Hugging Face to detect AI-generated text.
Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers. https://subaligner.readthedocs.io/
Extrapolating knowledge graphs from unstructured text using GPT-3 🕵️‍♂️
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Deep Voice: Real-time Neural Text-to-Speech
Command line utility for forced alignment using Kaldi
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Code for the EMNLP 2022 paper "Improved grammatical error correction by ranking elementary edits"
Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA)
A simple interface for the CMU pronouncing dictionary
PyTorch Implementation and Explanation of Graph Representation Learning papers: DeepWalk, GCN, GraphSAGE, ChebNet & GAT.
A PyTorch implementation of "Capsule Graph Neural Network" (ICLR 2019).
Implementation of Perceiver, General Perception with Iterative Attention, in PyTorch
To eventually become an unofficial PyTorch implementation / replication of Alphafold2, as details of the architecture get released
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
Deep Learning model for lexical stress detection in spoken English
PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.
Image-to-Image Translation in PyTorch
Source code for Neural Quality Estimation with Multiple Hypotheses for Grammatical Error Correction
[EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction
Official implementation of the papers "GECToR – Grammatical Error Correction: Tag, Not Rewrite" (BEA-20) and "Text Simplification by Tagging" (BEA-21)
Multivariate Time Series Transformer, public version
Code for the paper "Jukebox: A Generative Model for Music"
pix2code: Generating Code from a Graphical User Interface Screenshot