Skip to content
View Park323's full-sized avatar
🧠
Happy
🧠
Happy
  • sogang univ. IIP lab
  • Mapo, Seoul

Block or report Park323

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 4,405 431 Updated Nov 22, 2024

A Python wrapper for the high-quality vocoder "World"

Cython 726 122 Updated Oct 23, 2023
Python 11 3 Updated Oct 25, 2024

한국어 음성인식 STT API 리스트. 각 성능 벤치마크.

342 17 Updated Jun 3, 2024

An OpenAI gym environment to evaluate the ability of LLMs (eg. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic multi-agent settings.

Python 62 9 Updated May 15, 2023

VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai

Python 917 194 Updated Dec 6, 2023

VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai

Python 33 19 Updated Mar 19, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 35,609 4,358 Updated Aug 16, 2024

CUDA-Warp RNN-Transducer

Python 211 41 Updated Feb 22, 2023

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal proce…

Python 403 17 Updated Nov 23, 2024

👩‍💻👨‍💻 AI 엔지니어 기술 면접 스터디 (⭐️ 1k+)

1,872 453 Updated Oct 12, 2024

Awesome speech/audio LLMs, representation learning, and codec models

706 36 Updated Nov 18, 2024

👦 👧 Technical-Interview guidelines written for those who started studying programming. I wish you all the best. 👾

19,814 4,608 Updated Aug 9, 2024

Google AI 2018 BERT pytorch implementation

Python 6,228 1,313 Updated Sep 15, 2023

Bridging Research and Practice with PyTorch

Python 70 6 Updated Jul 15, 2024

This repository contains the official implementation of GhostFaceNets, State-Of-The-Art lightweight face recognition models.

Python 199 38 Updated Jan 24, 2024

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

9,498 1,549 Updated May 23, 2024
Python 35 5 Updated Feb 5, 2023

Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)

JavaScript 32 2 Updated Oct 13, 2023

Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation

Python 134 11 Updated Jan 16, 2024

Implementation of ViViT: A Video Vision Transformer

Python 517 66 Updated Jun 21, 2021

A python package for whisper normalizer

Jupyter Notebook 44 8 Updated Jul 5, 2024

Pipeline to generate the Standardized Project Gutenberg Corpus

Python 159 39 Updated Jan 5, 2024

A tool for extracting plain text from Wikipedia dumps

Python 3,758 968 Updated May 23, 2024

dual-path multi-channel network for speech separation

Python 5 Updated Jan 15, 2024

Convert Wikipedia database dumps into plaintext files

Python 305 42 Updated May 23, 2021

This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.

104 5 Updated Aug 4, 2023

2023 한국어 AI 경진대회

Python 12 3 Updated Oct 30, 2023
Next