Stars
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Stable Diffusion web UI
《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
TensorFlow code and pre-trained models for BERT
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
A programming framework for agentic AI 🤖
A generative speech model for daily dialogue.
Code and documentation to train Stanford's Alpaca models, and generate the data.
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
SoftVC VITS Singing Voice Conversion
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
Convert PDF to markdown quickly with high accuracy
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
OCR, layout analysis, reading order, table recognition in 90+ languages
PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
A very fast and expressive template engine.
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capa…
An easy and fast way to create a Python GUI 🐍
Bayesian Modeling and Probabilistic Programming in Python
Make bilingual epub books Using AI translate
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.