pkmital

Parag K Mital pkmital

Artist and researcher with 20+ years experience in AI and computational arts

1.2k followers · 84 following

Los Angeles, CA
https://pkmital.com
@pkmital

Achievements

x3 x4

Achievements

x3 x4

Highlights

Stars

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 4,334 418 Updated Nov 7, 2024

snakers4 / silero-models

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Jupyter Notebook 4,969 311 Updated Oct 18, 2023

idiap / coqui-ai-TTS

Forked from coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 543 56 Updated Nov 11, 2024

microsoft / muzic

Muzic: Music Understanding and Generation with Artificial Intelligence

Python 4,538 448 Updated Oct 12, 2024

myshell-ai / OpenVoice

Instant voice cloning by MIT and MyShell.

Python 29,729 2,925 Updated Aug 21, 2024

neonbjb / tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 13,191 1,822 Updated Aug 19, 2024

dunky11 / voicesmith

[WIP] VoiceSmith makes training text to speech models easy.

Python 222 32 Updated Oct 10, 2022

PINTO0309 / PINTO_model_zoo

A repository for storing models that have been inter-converted between various frameworks. Supported frameworks are TensorFlow, PyTorch, ONNX, OpenVINO, TFJS, TFTRT, TensorFlowLite (Float32/16/INT8…

Python 3,603 573 Updated Nov 6, 2024

Audio-AGI / AudioSep

Official implementation of "Separate Anything You Describe"

Python 1,623 117 Updated Oct 25, 2024

roymacdonald / ofxLineaDeTiempo

A new timeline addon for openframeworks.

C++ 40 3 Updated Jun 27, 2024

danomatika / loaf

loaf: lua, osc, and openFrameworks

C++ 53 4 Updated Feb 3, 2024

WongKinYiu / yolov7

Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

Jupyter Notebook 13,364 4,219 Updated Aug 19, 2024

f / awesome-chatgpt-prompts

This repo includes ChatGPT prompt curation to use ChatGPT better.

HTML 112,518 15,350 Updated Sep 26, 2024

lllyasviel / ControlNet

Let us control diffusion models!

Python 30,327 2,725 Updated Feb 25, 2024

FMInference / FlexLLMGen

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,188 549 Updated Oct 28, 2024

csteinmetz1 / auraloss

Collection of audio-focused loss functions in PyTorch

Python 739 67 Updated Jul 30, 2024

soham97 / sound_ai_progress

Tracking states of the arts and recent results (bibliography) on sound tasks.

32 2 Updated Jan 10, 2023

phoboslab / qoa

The “Quite OK Audio Format” for fast, lossy audio compression

C 767 42 Updated Oct 3, 2024

haoheliu / audioldm_eval

This toolbox aims to unify audio generation model evaluation for easier comparison.

Python 301 31 Updated Sep 29, 2024

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Python 26,117 5,381 Updated Nov 11, 2024

stanford-futuredata / ColBERT

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,054 388 Updated Sep 4, 2024

Kinyugo / msanii

A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.

Python 194 10 Updated Apr 27, 2023

archinetai / audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Python 1,956 168 Updated Jun 12, 2023

archinetai / archisound

A collection of pre-trained audio models, in PyTorch.

Python 110 4 Updated Jan 27, 2023

diff-usion / Awesome-Diffusion-Models

A collection of resources and papers on Diffusion Models

HTML 11,056 947 Updated Aug 1, 2024

kaegi / alass

"Automatic Language-Agnostic Subtitle Synchronization"

Rust 1,047 53 Updated Dec 28, 2023

kymatio / kymatio

Wavelet scattering transforms in Python with GPU acceleration

Python 760 138 Updated May 29, 2024

archinetai / audio-diffusion-pytorch-trainer

Trainer for audio-diffusion-pytorch

Python 127 22 Updated Jan 13, 2023

timsainb / AVGN

A generative network for animal vocalizations. For dimensionality reduction, sequencing, clustering, corpus-building, and generating novel 'stimulus spaces'. All with notebook examples using freely…

Jupyter Notebook 69 21 Updated Dec 27, 2022

nerfstudio-project / nerfstudio

A collaboration friendly studio for NeRFs

Python 9,521 1,297 Updated Nov 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parag K Mital pkmital

Achievements

Achievements

Highlights

Block or report pkmital

Stars

snakers4 / silero-vad

snakers4 / silero-models

idiap / coqui-ai-TTS

microsoft / muzic

myshell-ai / OpenVoice

neonbjb / tortoise-tts

dunky11 / voicesmith

PINTO0309 / PINTO_model_zoo

Audio-AGI / AudioSep

roymacdonald / ofxLineaDeTiempo

danomatika / loaf

WongKinYiu / yolov7

f / awesome-chatgpt-prompts

lllyasviel / ControlNet

FMInference / FlexLLMGen

csteinmetz1 / auraloss

soham97 / sound_ai_progress

phoboslab / qoa

haoheliu / audioldm_eval

huggingface / diffusers

stanford-futuredata / ColBERT

Kinyugo / msanii

archinetai / audio-diffusion-pytorch

archinetai / archisound

diff-usion / Awesome-Diffusion-Models

kaegi / alass

kymatio / kymatio

archinetai / audio-diffusion-pytorch-trainer

timsainb / AVGN

nerfstudio-project / nerfstudio