H2O.ai
- Hsinchu, Taiwan
Stars
ComfyUI docker images for use in GPU cloud and local environments. Includes AI-Dock base for authentication and improved user experience.
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
[NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs
[ICCV 2023] PyTorch Implementation of "MotionBERT: A Unified Perspective on Learning Human Motion Representations"
A real-time motion capture system for animating 3D virtual characters.
Ikaros-521 / AI-Vtuber
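Forked from sandboxdream/AI-Vtuber
AI Vtuber is a virtual streamer [Live2D/UE/xuniren] driven by [ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/Wenda/Qwen/kimi/ollama] that can interact with viewers in real time during live streams on [Bilibili/Douyin/Kuaishou/WeChat Channels/Pinduoduo/Douyu/YouTube/twitch/TikTok], or chat directly on the local machine…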
Talk to any LLM with hands-free voice interaction, voice interruption, Live2D talking face, and long-term memory, running locally across platforms
ComfyUI's ControlNet Auxiliary Preprocessors
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
A ComfyUI node to automatically extract masks for body regions and clothing/fashion items. Made with 💚 by the CozyMantis squad.
My Workflows for Differential Diffusion
Workflow used in this video:
Character Animation (AnimateAnyone, Face Reenactment)
Unofficial Implementation of Animate Anyone
A customized UI program that removes distortion from images, video, and webcam feeds, producing better results for computer vision
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
[TPAMI 2024 & CVPR 2022] Attention Concatenation Volume for Accurate and Efficient Stereo Matching
OpenStereo: A Comprehensive Benchmark for Stereo Matching and Strong Baseline
Hierarchical Deep Stereo Matching on High Resolution Images, CVPR 2019.
Python scripts for performing stereo depth estimation using the high-res stereo model in PyTorch.
HITNet: Hierarchical Iterative Tile Refinement Network for Real-time Stereo Matching
ICRA 2019 "FastDepth: Fast Monocular Depth Estimation on Embedded Systems"
Photogrammetric Computer Vision Framework
Code for robust monocular depth estimation described in "Ranftl et al., Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer, TPAMI 2022"