Convert ebooks to audiobooks with chapters and metadata using dynamic AI models and voice cloning. Supports 1,107+ languages!
-
Updated
Jan 6, 2025 - Python
Convert ebooks to audiobooks with chapters and metadata using dynamic AI models and voice cloning. Supports 1,107+ languages!
Webui for using XTTS and for finetuning it
A simple FastAPI Server to run XTTSv2
End-to-end platform for building voice first multimodal agents
Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RVC-enhanced, XTTS fine-tuning) and LLM processing. It aspires to be a user-friendly app with a GUI, an installer and all-in-one packages.
OhanashiGPT is an application that generates personalized children's stories based on parameters like age and preferences. It narrates these stories using an AI-generated voice that mimics a parent, trained on their audio samples. The app also creates illustrations to accompany each story, providing a unique and engaging experience for children.
This is an interface that will offline convert anything pdf document you give it into an interview between two people discussing it.
A User Interface for XTTS-2 Text-Based Voice Cloning with 10 seconds
Converts epub e-book files to mp3 audiobook files.
A command line utility to easily finetune XTTS models in a fully automated way. Developed for Pandrator.
OhanashiGPT is an application that generates personalized children's stories based on parameters like age and preferences. It narrates these stories using an AI-generated voice that mimics a parent, trained on their audio samples. The app also creates illustrations to accompany each story, providing a unique and engaging experience for children.
Python voice assistant (based on SpeechRecognition, Whisper and XTTS models) designed to transcribe speech to text, translate across languages, engage in chat mode, and ultimately respond vocally.
OhanashiGPT is an application that generates personalized children's stories based on parameters like age and preferences. It narrates these stories using an AI-generated voice that mimics a parent, trained on their audio samples. The app also creates illustrations to accompany each story, providing a unique and engaging experience for children.
Add a description, image, and links to the xtts topic page so that developers can more easily learn about it.
To associate your repository with the xtts topic, visit your repo's landing page and select "manage topics."