
LLaVA-NeXT: A Strong Zero-shot Video Understanding Model

Install

If you are not using Linux, do NOT proceed with these steps; see the separate instructions for macOS and Windows instead.

  1. Clone this repository and navigate to the llava-next-video folder
git clone https://code.byted.org/ic-research/llava-next-video.git
cd llava-next-video
  2. Install the package
conda create -n llava python=3.10 -y
conda activate llava
pip install --upgrade pip  # enable PEP 660 support
pip install -e .
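
To verify the install, try importing the package. A minimal sanity check, assuming the editable install above exposes the package under the name llava:

python -c "import llava"  # exits silently if the install succeeded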

Quick Start With HuggingFace

  1. Example model: liuhaotian/llava-v1.6-vicuna-7b
  2. Prompt mode: vicuna_v1
  3. Sampled frames: 32
  4. Spatial pooling stride: 2

bash scripts/eval/video_description_from_t2v.sh ${MODEL} ${PROMPT_MODE} ${NUM_FRAMES} True ${POOL_STRIDE} 8 True

# Example:
# bash scripts/eval/video_description_from_t2v.sh liuhaotian/llava-v1.6-vicuna-7b vicuna_v1 32 True 2 8 True
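
For readability, the positional arguments can be bound to named variables first. A minimal sketch of the same call; the variable names are illustrative, and the remaining True/8/True arguments are passed through exactly as in the template above (their meaning is defined by the script itself):

# Values from the list above
MODEL=liuhaotian/llava-v1.6-vicuna-7b
PROMPT_MODE=vicuna_v1
NUM_FRAMES=32
POOL_STRIDE=2
bash scripts/eval/video_description_from_t2v.sh "${MODEL}" "${PROMPT_MODE}" "${NUM_FRAMES}" True "${POOL_STRIDE}" 8 True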

GPT Evaluation Example

  1. Assume you have a pred.json (model-generated predictions) for llava-v1.6-vicuna-7b at ./work_dirs/eval_video_detail_description/llava-v1.6-vicuna-7b_vicuna_v1_frames_32_stride_2
bash scripts/eval/video_description_eval.sh llava-v1.6-vicuna-7b_vicuna_v1_frames_32_stride_2
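
Because the script locates predictions by run name, a quick existence check before launching avoids a wasted evaluation; a minimal sketch, assuming the directory layout from step 1:

RUN_NAME=llava-v1.6-vicuna-7b_vicuna_v1_frames_32_stride_2
PRED=./work_dirs/eval_video_detail_description/${RUN_NAME}/pred.json
# Fail fast if the predictions file is missing
[ -f "${PRED}" ] || { echo "missing ${PRED}" >&2; exit 1; }
bash scripts/eval/video_description_eval.sh "${RUN_NAME}"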
