Stars
Takes screenshots online and compresses images in the browser with WebAssembly
Multilingual large voice generation model, providing full-stack inference, training, and deployment capability.
Wiseflow is an agile information mining tool that extracts concise messages from various sources such as websites, WeChat official accounts, social platforms, etc. It automatically categorizes and …
A modular graph-based Retrieval-Augmented Generation (RAG) system
daswer123 / hallo-webui
Forked from fudan-generative-vision/hallo
Webui for Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, LoRA
Seamlessly integrate state-of-the-art transformer models into robotics stacks
Anole: An Open, Autoregressive and Native Multimodal Model for Interleaved Image-Text Generation
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
Code release for "Segment Anything without Supervision"
GPT-4 Enhanced with Real-Time Web Browsing 🔗
TexPainter: Generative Mesh Texturing with Multi-view Consistency
Code for Reinforcement Learning from Vision Language Foundation Model Feedback
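Streamer-Sales 销冠 ("Sales Champion"): an LLM for livestream sales hosts 🛒🎁 that presents a product, given its features, from the angle of stimulating the viewer's desire to buy. 🚀⭐ Includes a detailed data-generation pipeline❗ 📦 Also integrates LMDeploy accelerated inference 🚀, RAG (retrieval-augmented generation) 📚, TTS (text-to-speech) 🔊, digital human generation 🦸, an Agent that queries the web for real-time information 🌐, and ASR (speech-to-text) 🎙️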
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
PyTorch implementation that adds new features to the Segment-Anything code. The features support batch input on the full-grid prompt (automatic mask generation) with post-process…
Vector (and Scalar) Quantization, in PyTorch
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.
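10,000 ChatTTS voices! A ChatTTS voice library, so you no longer have to fret over drawing voices at random. This is my first project: I stayed up late slowly producing 10,000 voices and uploading them to GitHub, so a little encouragement would be appreciated! Main domain: www.TTSlist.com Backup: https://ttslist.aiqbh.com/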
Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
🍦 ChatTTS-Forge is a project developed around the TTS generation model ChatTTS, implementing an API Server and a Gradio-based WebUI.
DeepFuze is a state-of-the-art deep learning tool that seamlessly integrates with ComfyUI to revolutionize facial transformations, lipsyncing, Face Swapping, Lipsync Translation, video generation, …