Speech Security and Privacy Compendium - Mini

Python 6 Updated Jun 18, 2024
Python 23 Updated Jul 19, 2024

The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"

Python 334 60 Updated Jul 27, 2022

The OS for your personal finances

Ruby 28,800 2,186 Updated Jul 19, 2024

Official repository for "AM-RADIO: Reduce All Domains Into One"

Python 518 18 Updated Jul 15, 2024

The official repository of Dynamic-SUPERB.

Python 142 87 Updated Jul 19, 2024

PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation"

Jupyter Notebook 26 1 Updated Jan 6, 2024

DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning

Python 43 4 Updated Jan 18, 2024

NeMo: a toolkit for conversational AI

Python 7 1 Updated May 4, 2024

TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages

Jupyter Notebook 6 Updated May 23, 2024

End-to-End Speech Processing Toolkit

Python 1 1 Updated Jul 16, 2024

Spoofing-robust speaker verification evaluation toolkit

Python 5 1 Updated Jun 7, 2024

VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design

Jupyter Notebook 427 44 Updated Sep 11, 2023

A toolkit for the Spoken Language Understanding Evaluation (SLUE) benchmark. Refer to the paper https://arxiv.org/abs/2111.10367 for more details. Official website: https://asappresearch.github.io/slue-toolkit/

Python 58 14 Updated Feb 26, 2024

Script to download corpora from the Linguistic Data Consortium (LDC)

Shell 31 10 Updated Aug 5, 2022

End-to-End Speech Processing Toolkit

Python 1 Updated Jul 8, 2024

Confidence interval computation for evaluation in machine learning using the bootstrapping approach

Jupyter Notebook 62 5 Updated Apr 5, 2024

Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.

Python 16 2 Updated Jul 10, 2024

A repo containing download guidance and corresponding scripts for the VoxBlink dataset.

Python 17 Updated Apr 16, 2024

The VoxTube dataset official repository

HTML 57 1 Updated Feb 14, 2024

An open source implementation of CLIP.

Python 9,258 917 Updated Jul 4, 2024

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Python 600 105 Updated Jul 17, 2024

State-of-the-art 2D and 3D Face Analysis Project

Python 22,078 5,270 Updated Jul 17, 2024
Python 4 Updated Feb 3, 2022
Python 3 Updated Jul 2, 2024

Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation

Python 172 23 Updated Feb 10, 2024

A collection of papers related to speech model compression

22 3 Updated Jul 31, 2023

End-to-End Speech Processing Toolkit

Python 1 2 Updated Apr 3, 2024