Ziyang412

Ziyang Wang Ziyang412

Ph.D. student at UNC Chapel Hill, Student Researcher at Meta FAIR

10 followers · 4 following

UNC Chapel Hill
https://ziyangw2000.github.io/

Stars

ZiyangW2000 / ZiyangW2000.github.io

SCSS 1 Updated Oct 30, 2024

daniel-cores / tvbench

TVBench: Redesigning Video-Language Evaluation

Python 7 Updated Oct 25, 2024

IDEA-Research / Grounded-SAM-2

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 1,102 103 Updated Nov 3, 2024

huangb23 / VTimeLLM

[CVPR'2024 Highlight] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments".

Python 225 11 Updated Jun 13, 2024

egoschema / EgoSchema

Python 72 Updated Dec 13, 2023

EvolvingLMMs-Lab / lmms-eval

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Python 2,049 149 Updated Nov 14, 2024

FreedomIntelligence / LongLLaVA

LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture

Python 179 12 Updated Oct 12, 2024

jzhang38 / LongMamba

Some preliminary explorations of Mamba's context scaling.

Python 191 10 Updated Feb 8, 2024

bigai-nlco / VideoLLaMB

Official Repository of VideoLLaMB: Long Video Understanding with Recurrent Memory Bridges

Python 49 Updated Sep 19, 2024

joslefaure / HERMES

[ECCVW'24] Long-form Video Understanding by Bridging Episodic Memory and Semantic Knowledge

Python 14 2 Updated Sep 27, 2024

WeiKangda / VideoTree

Forked from Ziyang412/VideoTree

Playground Code for paper "VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos"

Python 1 Updated Aug 6, 2024

IVGSZ / Flash-VStream

This is the official implementation of "Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams"

Python 129 7 Updated Aug 11, 2024

facebookresearch / ToMe

A method to increase the speed and lower the memory footprint of existing vision transformers.

Python 969 69 Updated Jun 17, 2024

EvolvingLMMs-Lab / LongVA

Long Context Transfer from Language to Vision

Python 334 17 Updated Oct 26, 2024

BradyFU / Video-MME

✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

406 12 Updated Jun 18, 2024

HL-hanlin / Ctrl-Adapter

Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model

Python 390 16 Updated Jun 15, 2024

jzhang38 / EasyContext

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Python 645 46 Updated Sep 27, 2024

longvideobench / LongVideoBench

[NeurIPS'24 D&B] Official Dataloader and Evaluation Scripts for LongVideoBench.

Python 66 2 Updated Jul 27, 2024

Ziyang412 / LLoVi

Forked from CeeZh/LLoVi

Official implementation for "A Simple LLM Framework for Long-Range Video Question-Answering"

Python 1 Updated Mar 20, 2024

wxh1996 / VideoAgent

facebookresearch / TimeSformer

xuguohai / X-CLIP

ttengwang / Awesome_Long_Form_Video_Understanding

antoyang / FrozenBiLM

aszala / DiagrammerGPT

doc-doc / NExT-QA

kyegomez / MC-ViT

dinobby / MAGDi

HanNight / soft_self_consistency

meetdavidwan / crg