inFaaa

🍭

A done thesis is better than a perfect thesis.

Jinfa Huang inFaaa

🍭

A done thesis is better than a perfect thesis.

Vision & Language

82 followers · 147 following

UR & PKU & UESTC
23:05 (UTC -04:00)
https://infaaa.github.io/

Achievements

Highlights

Block or Report

Block or report inFaaa

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Lists (3)

Sort

Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

feizc / DiT-MoE

Scaling Diffusion Transformers with Mixture of Experts

Python 93 3 Updated Jul 21, 2024

NVlabs / VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Python 1,027 75 Updated Jul 7, 2024

google-research-datasets / cube

CUBE is a benchmark to evaluate the Cultural Competence of T2I models

3 Updated Jul 18, 2024

para-lost / RVP

Recursive Visual Programming

Python 8 Updated Jul 14, 2024

luosiallen / latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Python 4,224 219 Updated Jun 14, 2024

mihirp1998 / VADER

Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various r…

Python 114 4 Updated Jul 20, 2024

wfanyue / DPG-T2I-Personalization

[ECCV 2024] Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning

Python 18 Updated Jul 19, 2024

Vchitect / VEnhancer

65 2 Updated Jul 10, 2024

saibr / hypvl

This repository is related to 'Intriguing Properties of Hyperbolic Embeddings in Vision-Language Models', published at TMLR (2024), https://openreview.net/pdf?id=P5D2gfi4Gg

Python 6 Updated Jul 5, 2024

lllyasviel / Paints-UNDO

Understand Human Behavior to Align True Needs

Python 2,877 242 Updated Jul 20, 2024

snap-research / VIMI

HTML 10 Updated Jul 10, 2024

Adamdad / vico

Vico: Compositional Video Generation as Flow Equalization

Python 39 Updated Jul 9, 2024

Ji4chenLi / rg-lcd

Reward Guided Latent Consistency Distillation

Python 12 Updated May 28, 2024

orrzohar / Video-STaR

Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision

Python 37 3 Updated Jul 10, 2024

Kwai-Kolors / MPS

Python 43 1 Updated Jul 12, 2024

layerdiffusion / sd-forge-layerdiffuse

[WIP] Layer Diffusion for WebUI (via Forge)

Python 3,676 324 Updated Jun 12, 2024

NJU-PCALab / OpenVid-1M

Python 113 1 Updated Jul 15, 2024

test-time-training / ttt-lm-pytorch

Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Python 762 40 Updated Jul 14, 2024

Kwai-Kolors / Kolors

Kolors Team

Python 2,583 146 Updated Jul 19, 2024

Lordog / dive-into-llms

《动手学大模型Dive into LLMs》系列编程实践教程

2,666 220 Updated Jul 3, 2024

qrzou / ParCo

[ECCV 2024] Official PyTorch implement of paper "ParCo: Part-Coordinating Text-to-Motion Synthesis": http:https://arxiv.org/abs/2403.18512

Python 35 1 Updated Jul 1, 2024

gcorso / disco-diffdock

Code for the paper DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents, ICML 2024

Python 46 2 Updated Jun 12, 2024

DCDmllm / Momentor

Python 37 1 Updated Jun 27, 2024

G-U-N / Awesome-Consistency-Models

Awesome List of Consistency Models

19 Updated Jul 1, 2024

buoyancy99 / diffusion-forcing

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 329 11 Updated Jul 16, 2024

showlab / Awesome-GUI-Agent

💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

62 3 Updated Jul 21, 2024

PeterWang512 / AttributeByUnlearning

4 Updated Jun 14, 2024

ruocwang / dpo-diffusion

[ICML 2024] On Discrete Prompt Optimization for Diffusion Models - Google

12 Updated Jul 9, 2024

pinterest / atg-research

Python 39 Updated Apr 10, 2024

llyx97 / TempCompass

[ACL 2024 Findings] "TempCompass: Do Video LLMs Really Understand Videos?", Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, Lei Li, Sishuo Chen, Xu Sun, Lu Hou

Python 64 3 Updated Jun 11, 2024

Jinfa Huang inFaaa

Highlights

Block or report inFaaa

Lists (3)

LMM

t2vrl

video-llm

Starred repositories

video-captioning

vision-and-language

image-captioning

Neural Network

Tensorflow