-
01:28
(UTC -07:00)
Highlights
- Pro
Stars
The highest quality Pokemon Images and Assets.
Wrapper to use DynamiCrafter models in ComfyUI
[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation
On-device Speech Recognition for Apple Silicon
🔍 AI search engine - self-host with local or cloud LLMs
LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progres…
Allows recording of Personal Voices to a file using builtin say command in Mac OS Sonoma
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective…
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advanced features, such as a settings page, low VRAM support, D…
This is a python API which allows you to get the transcript/subtitles for a given YouTube video. It also works for automatically generated subtitles and it does not require an API key nor a headles…
If Reddit's content was completely AI-generated.
Y'all thought the dead internet theory wasn't real, but HERE IT IS
The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM models, execute structured function calls and get structured…
The #1 open-source voice interface for desktop, mobile, and ESP32 chips.
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
A collection of GPT system prompts and various prompt injection/leaking knowledge.
llama and other large language models on iOS and MacOS offline using GGML library.
Swift library to work with llama and other large language models.
Easily train a good VC model with voice data <= 10 mins!
Stable Diffusion and Flux in pure C/C++
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code