-
Zhejiang University
- Hang Zhou, Zhe Jiang
- @ChaoLia02213535
Lists (7)
Sort Name ascending (A-Z)
Stars
[ICCV23] Bird’s-Eye-View Scene Graph for Vision-Language Navigation
Implementation of the paper Text Augmented Spatial Aware Zero-shot Referring Image Segmentation (Findings of EMNLP 2023)
[IJCV 2024] Slimmable Networks for Contrastive Self-supervised Learning.
CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets
Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch and TensorFlow.
(ICML 2024) Improve Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning
(NeurIPS 2024 Spotlight) TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment
[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
Lora beYond Conventional methods, Other Rank adaptation Implementations for Stable diffusion.
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
[ECCV 2024] HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting.
Blind&Invisible Watermark ,图片盲水印,提取水印无须原图!
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)
[AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.
EVA: Zero-shot Accurate Attributes and Multi-Object Video Editing
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
[CVPR 2024 Highlight] "MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis" (Official Implementation)
[IJCAI 2023 ORAL] "Pyramid Diffusion Models For Low-light Image Enhancement" (Official Implementation)
[CVPR2024] CapHuman: Capture Your Moments in Parallel Universes
Tips for Writing a Research Paper using LaTeX
Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models