Stars
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
🔊 Text-Prompted Generative Audio Model
InstantStyle-Plus: Style Transfer with Content-Preserving in Text-to-Image Generation 🔥
700+ Pure CSS, SVG & Figma UI Icons, 6000+ glyphs, patterns, colors and layouts.
Instant voice cloning by MIT and MyShell.
The #1 open-source voice interface for desktop, mobile, and ESP32 chips.
A natural language interface for computers
Simple HTML5 Charts using the <canvas> tag
Lightweight Stable Diffusion v2.1 web UI: txt2img, img2img, depth2img, inpaint and upscale4x.
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Code and documentation to train Stanford's Alpaca models, and generate the data.
The AI-native open-source embedding database
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
The fastai book, published as Jupyter Notebooks
PyTorch package for the discrete VAE used for DALL·E.
Rapid fuzzy string matching in Python using various string metrics
If you stop typing for more than five seconds, all progress will be lost.