Highlights
- Pro
Block or Report
Block or report hahashu
Contact GitHub support about this userβs behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
Single Image to 3D using Cross-Domain Diffusion for 3D Generation
Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.
A collection of scripts to download Audible audiobooks.
A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap
so-vits-svc fork with realtime support, improved interface and more features.
π Text-Prompted Generative Audio Model
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
A fast local neural text to speech engine for Mycroft
A multi-voice TTS system trained with an emphasis on quality
πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
π€ π¬ Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Automatic Generation of Visualizations and Infographics using Large Language Models
Sample implementation of a politeness model, trained on the Stanford Politeness Corpus
A deck tracker and deck manager for Hearthstone on Windows
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Virtual whiteboard for sketching hand-drawn like diagrams
Data manipulation and transformation for audio signal processing, powered by PyTorch
Specify what you want it to build, the AI asks for clarification, and then builds it.
Implementation of Meta-Voicebox : The first generative AI model for speech to generalize across tasks with state-of-the-art performance.
ImageBind One Embedding Space to Bind Them All
π Guides, papers, lecture, notebooks and resources for prompt engineering
Instructions, source code, and misc. resources needed for building a Tiny ML-powered artificial nose.
Official Code for DragGAN (SIGGRAPH 2023)
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
The official gpt4free repository | various collection of powerful language models
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.