
LLaVA-NeXT: A Strong Zero-shot Video Understanding Model

Install

  1. Clone this repository and navigate to the llava-next-video folder
git clone https://code.byted.org/ic-research/llava-next-video.git
cd llava-next-video
  2. Install the package
conda create -n llava python=3.10 -y
conda activate llava
pip install --upgrade pip  # enable PEP 660 support
pip install -e .
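
After the editable install, a quick import check can confirm the environment is set up; this is a minimal sketch, and the top-level package name llava is an assumption carried over from the upstream LLaVA codebase rather than something stated in this README.

# sanity_check.py -- minimal sketch, assumes the project installs a package named `llava`
import llava

print("llava imported from:", llava.__file__)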

Quick Start With HuggingFace

  1. Example model: liuhaotian/llava-v1.6-vicuna-7b

  2. Prompt mode: vicuna_v1

  3. Sampled frames: 32 (how many frames to sample from the video)

  4. Spatial pooling stride: 2 (each frame is originally encoded as a 24x24 grid of 576 tokens; with stride 2, pooling reduces it to a 12x12 grid of 144 tokens per frame; see the pooling sketch below the example command)

bash scripts/eval/video_description_from_t2v.sh ${EXAMPLE_MODEL} ${PROMPT_MODE} ${SAMPLED_FRAMES} True ${SPATIAL_POOLING_STRIDE} 8 True

# For example:
# bash scripts/eval/video_description_from_t2v.sh liuhaotian/llava-v1.6-vicuna-7b vicuna_v1 32 True 2 8 True
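
To make the token arithmetic from step 4 concrete, here is a minimal, hypothetical sketch of stride-2 average pooling over one frame's 24x24 token grid; it is not the repository's actual pooling code, and the hidden size of 1024 is chosen arbitrarily for illustration.

# pooling_sketch.py -- illustrative only, assumes a 24x24 token grid per frame
import torch
import torch.nn.functional as F

dim = 1024                                    # hypothetical hidden size
frame_tokens = torch.randn(1, dim, 24, 24)    # one frame: 24x24 = 576 visual tokens

# Average pooling with stride 2 halves each spatial side:
# 24x24 = 576 tokens -> 12x12 = 144 tokens per frame.
pooled = F.avg_pool2d(frame_tokens, kernel_size=2, stride=2)
print(pooled.shape)   # torch.Size([1, 1024, 12, 12])

# With 32 sampled frames, the video contributes 32 * 144 = 4608 tokens
# instead of 32 * 576 = 18432.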

GPT Evaluation Example

  1. Assume you have a pred.json (model-generated predictions) for the model llava-v1.6-vicuna-7b at ./work_dirs/eval_video_detail_description/llava-v1.6-vicuna-7b_vicuna_v1_frames_32_stride_2
bash scripts/eval/video_description_eval.sh llava-v1.6-vicuna-7b_vicuna_v1_frames_32_stride_2
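
Before launching the GPT-based evaluation, you can inspect the prediction file with a short Python snippet like the one below; this is a hedged sketch, since the exact schema of pred.json is defined by the generation script and is not documented here.

# inspect_preds.py -- illustrative only; the pred.json schema is an assumption
import json
from pathlib import Path

pred_dir = Path("./work_dirs/eval_video_detail_description/"
                "llava-v1.6-vicuna-7b_vicuna_v1_frames_32_stride_2")

with (pred_dir / "pred.json").open() as f:
    preds = json.load(f)

print(f"Loaded {len(preds)} prediction entries")
if isinstance(preds, list) and preds:
    print(preds[0])  # look at one record to verify the generation run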
