Lists (32)
Abuse Detection
Android
Audio Library
🎶 Audio/Speech related document
🏷️ Audio Tagging
🌕 AudioVisualizationCanditㅇㅅㅇa
Candidate for Audio Visualization application
☪️ Base library for
Currently, Speech and Audio
Playground
Basic learning resource
C++ library
🖥️ Development Resource
📎 Documents
🎧 DSP Open-source
📱 DSP-Related-iOS
🔢 Fixed-Point
Numerical Platform for embedded system
🗨️ Generation Speech Source
💻 Hardware
🖼️ Image
🤖 Micro device
🤖 Micro ML Model
🤖 ML Model
🏭 ML Platform
Model Compression
🦾 Optimization / Cross env
📎 Paper
💯 Performance Estimator
🌏 Physical Simulation
Products
Rust
🗣️ Speech Enhancement
📈 Visualization
🌏 Web
Stars
Language: Python
Stable Diffusion web UI
Robust Speech Recognition via Large-Scale Weak Supervision
Clone a voice in 5 seconds to generate arbitrary speech in real-time
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…
TensorFlow code and pre-trained models for BERT
Streamlit — A faster way to build and share data apps.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Deezer source separation library including pretrained models.
Open standard for machine learning interoperability
FauxPilot - an open-source alternative to GitHub Copilot server
Convert Machine Learning Code Between Frameworks
Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
Automatic headphone equalization from frequency responses
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch
Faster Whisper transcription with CTranslate2
PyTorch implementation of the U-Net for image semantic segmentation with high quality images
Manipulate audio with a simple and easy high level interface
Code for the paper Hybrid Spectrogram and Waveform Source Separation
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Code for the paper "Jukebox: A Generative Model for Music"
A flexible framework of neural networks for deep learning
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications