zhang-tao-whu

zhangtao zhang-tao-whu

54 followers · 22 following

https://zhang-tao-whu.github.io/

Stars

black-forest-labs / flux

Official inference repo for FLUX.1 models

Python · 4,147 stars · 234 forks · Updated Aug 5, 2024

dvlab-research / Prompt-Highlighter

[CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMs

Python · 115 stars · 2 forks · Updated Jul 23, 2024

JUNJIE99 / MLVU

🔥🔥MLVU: Multi-task Long Video Understanding Benchmark

Python · 113 stars · Updated Jul 28, 2024

facebookresearch / segment-anything-2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook · 8,390 stars · 476 forks · Updated Aug 5, 2024

LTH14 / mar

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python · 390 stars · 17 forks · Updated Jul 31, 2024

lxa9867 / ControlVAR

This is the official implementation for ControlVAR.

Python · 14 stars · 1 fork · Updated Jul 29, 2024

lucidrains / autoregressive-diffusion-pytorch

Implementation of Autoregressive Diffusion in Pytorch

Python · 224 stars · 2 forks · Updated Jul 30, 2024

recursal / GoldFinch-paper

Forked from SmerkyG/GoldFinch-paper

GoldFinch and other hybrid transformer components

Python · 35 stars · 3 forks · Updated Jul 20, 2024

baaivision / EVE

EVE: Encoder-Free Vision-Language Models

Python · 181 stars · 3 forks · Updated Jul 20, 2024

yuecao0119 / MMInstruct

Python · 24 stars · Updated Aug 5, 2024

catcathh / UltraPixel

Implementation of UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks

Python · 385 stars · 14 forks · Updated Jul 28, 2024

karpathy / LLM101n

LLM101n: Let's build a Storyteller

26,543 stars · 1,432 forks · Updated Aug 1, 2024

mrwu-mac / R-Bench

Repo for the paper "Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models" (ICML 2024)

Python · 11 stars · Updated Jul 15, 2024

cilinyan / VISA

[ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model

Python · 93 stars · 1 fork · Updated Aug 5, 2024

baaivision / DenseFusion

DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception

Python · 81 stars · 1 fork · Updated Jul 31, 2024

test-time-training / ttt-lm-pytorch

Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Python · 886 stars · 50 forks · Updated Jul 14, 2024

kagawa588 / GvSeg

This is the official implementation of "GvSeg: General and Task-Oriented Video Segmentation" (Accepted at ECCV 2024).

5 stars · 1 fork · Updated Jul 15, 2024

streamlit / streamlit

Streamlit — A faster way to build and share data apps.

Python · 33,800 stars · 2,946 forks · Updated Aug 5, 2024

frank-xwang / UnSAM

Code release for "Segment Anything without Supervision"

Jupyter Notebook · 261 stars · 17 forks · Updated Jul 9, 2024

GAIR-NLP / anole

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Python · 577 stars · 29 forks · Updated Aug 5, 2024

buaacyw / MeshAnything

From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers"

Python · 1,851 stars · 73 forks · Updated Aug 5, 2024

rasbt / LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook · 24,090 stars · 2,528 forks · Updated Aug 5, 2024

callsys / ControlCap

[ECCV 2024] ControlCap: Controllable Region-level Captioning

Python · 42 stars · Updated Jul 2, 2024

hustvl / EVF-SAM

Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"

Python · 128 stars · 3 forks · Updated Aug 5, 2024

IVGSZ / Flash-VStream

Forked from IVG-SZ/Flash-VStream

This is the official implementation of "Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams"

Python · 79 stars · 4 forks · Updated Jul 8, 2024

zyc00 / Point-SAM

PhyscalX / gradio-image-prompter

gradio-app / gradio

jianzongwu / MotionBooth

PhoenixZ810 / MG-LLaVA