Block or Report
Block or report pradipcyb
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
[IJCAI 2022] Learning Multi-dimensional Edge Feature-based AU Relation Graph for Facial Action Unit Recognition, Pytorch code
A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
Code for the paper "A Deep Reinforced Sequence-to-Set Model for Multi-Label Classification"
Create temporary files and temporary dirs in memory-based filesystems on Linux.
Voice Activity Projection Models: Self-supervised learning of Turn-taking Events
TextBox 2.0 is a text generation library with pre-trained language models
Code for "Towards Optimal Correlational Object Search" | ICRA 2022
Panoramic Graph Environment Annotation toolkit, for collecting audio and text annotations in panoramic graph environments such as Matterport3D and StreetLearn.
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
Evaluation code for various unsupervised automated metrics for Natural Language Generation.
Lexically constrained decoding for sequence generation using Grid Beam Search
Code of Dense Relational Captioning
Code accompanying the paper "Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs" (Chen et al., CVPR 2020, Oral).
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
End-to-end ASR/LM implementation with PyTorch
A python module to process data for Frame Semantic Parsing