Skip to content
View forrestbing's full-sized avatar
  • Hangzhou, Zhejiang
Block or Report

Block or report forrestbing

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

RaDe-GS: Rasterizing Depth in Gaussian Splatting

C++ 337 10 Updated Jun 21, 2024

Kolors Team

Python 422 14 Updated Jul 6, 2024

Take a screenshot online and compresses images in browser with Webassembly

JavaScript 315 35 Updated Jul 6, 2024

Multilingual Voice Understanding Model

Python 477 35 Updated Jul 5, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 587 43 Updated Jul 6, 2024

Wiseflow is an agile information mining tool that extracts concise messages from various sources such as websites, WeChat official accounts, social platforms, etc. It automatically categorizes and …

JavaScript 894 111 Updated Jun 29, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 6,306 491 Updated Jul 6, 2024

Webui for Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Python 43 12 Updated Jun 21, 2024

A Neural-Symbolic Self-Training Framework

Python 79 2 Updated Jun 22, 2024

High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance

Python 612 51 Updated Jul 5, 2024

Controllable video and image Generation, SVD, Animate Anyone, ControlNet, LoRA

Python 290 10 Updated Jul 6, 2024

Seamlessly integrate state-of-the-art transformer models into robotics stacks

Python 113 13 Updated Jul 6, 2024

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Python 153 5 Updated Jul 5, 2024

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

777 14 Updated Jul 2, 2024

Code release for "Segment Anything without Supervision"

Jupyter Notebook 179 11 Updated Jul 7, 2024

GPT-4 Enhanced with Real-Time Web Browsing 🔗

Python 114 22 Updated Jul 1, 2024

TexPainter: Generative Mesh Texturing with Multi-view Consistency

Python 24 1 Updated Jul 3, 2024

Code for Reinforcement Learning from Vision Language Foundation Model Feedback

C++ 21 3 Updated May 22, 2024

Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️

Python 1,451 204 Updated Jul 3, 2024

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

Python 3,161 241 Updated Jul 5, 2024

This is Pytorch Implementation Code for adding new features in code of Segment-Anything. Here, the features support batch-input on the full-grid prompt (automatic mask generation) with post-process…

Python 118 9 Updated Dec 7, 2023

TTS

Jupyter Notebook 48 5 Updated Jun 4, 2024

Vector (and Scalar) Quantization, in Pytorch

Python 2,147 179 Updated Jul 6, 2024

StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

Python 1,095 99 Updated Apr 10, 2024

Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.

Python 147 9 Updated Jul 1, 2024

10000 chatTTS voices !chatTTS 音色库,再也不为音色抽卡烦恼啦。这是我第一个项目,熬夜龟速生产10000条音色并上传Github,给点鼓励呗哈!主域名:www.TTSlist.com 备用:http:https://ttslist.aiqbh.com/

HTML 98 6 Updated Jun 16, 2024

Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.

Python 55 4 Updated Jul 1, 2024

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Python 4,595 475 Updated Jul 3, 2024

🍦 ChatTTS-Forge is a project developed around the TTS generation model ChatTTS, implementing an API Server and a Gradio-based WebUI.

Python 454 58 Updated Jul 6, 2024

DeepFuze is a state-of-the-art deep learning tool that seamlessly integrates with ComfyUI to revolutionize facial transformations, lipsyncing, Face Swapping, Lipsync Translation, video generation, …

Python 221 13 Updated Jul 1, 2024
Next