LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models

Yuzhang Shang*, Mu Cai*, Bingxin Xu, Yong Jae Lee^, Yan Yan^

*Equal Contribution, ^Equal Advising

[Paper] [Project Page]

How to run

Step.0: Set the environment the same as LLaVA-1.5

Note that the core of our proposed module is here in the CLIP image encoder.

Step.1 (for inference): Download Checkpoints

Download the checkpoints (LoRA Version) from Yuzhang's Huggingface Homepage to checkpoints/llava-v1.5-7b-lora-prunemerge.

Step.2 (for inference): Change the methods (PruMerge or PruMerge+).

Change the call function of token reduction from here in the CLIP image encoder.

Step.3 (for inference): Run the script.

For example, the evaluation for TextVQA is:

CUDA_VISIBLE_DEVICES=7 XDG_CACHE_HOME='/data/shangyuzhang/' bash scripts/v1_5/eval/testvqa.sh

For other inference scripts, refer to LLaVA Evaluation.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
.devcontainer		.devcontainer
docs		docs
images		images
llava		llava
playground/data		playground/data
scripts		scripts
.dockerignore		.dockerignore
.editorconfig		.editorconfig
.gitattributes		.gitattributes
LICENSE		LICENSE
README.md		README.md
cog.yaml		cog.yaml
predict.py		predict.py
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models

How to run

Step.0: Set the environment the same as LLaVA-1.5

Step.1 (for inference): Download Checkpoints

Step.2 (for inference): Change the methods (PruMerge or PruMerge+).

Step.3 (for inference): Run the script.

About

Releases

Packages

Contributors 2

Languages

License

42Shawn/LLaVA-PruMerge

Folders and files

Latest commit

History

Repository files navigation

LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models

How to run

Step.0: Set the environment the same as LLaVA-1.5

Step.1 (for inference): Download Checkpoints

Step.2 (for inference): Change the methods (PruMerge or PruMerge+).

Step.3 (for inference): Run the script.

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages