Multimodal Direct Preference Optimization (mDPO)

Code for our paper mDPO: Conditional Preference Optimization for Multimodal Large Language Models. Please check our website for an overview of our work.

Updates

🔥 [2024-09-04] Initial release of the mDPO trainer. We are currently working on releasing the code for training and evaluating different models.

Installation

TBD

Training

To train Bunny with mDPO, use the following command:

python bunny/run_mdpo_bunny.py

Evaluation

TBD

Citation

Please cite the following paper if you find the repo helpful:

@article{wang2024mdpo,
  title={mDPO: Conditional Preference Optimization for Multimodal Large Language Models},
  author={Wang, Fei and Zhou, Wenxuan and Huang, James Y and Xu, Nan and Zhang, Sheng and Poon, Hoifung and Chen, Muhao},
  journal={arXiv preprint arXiv:2406.11839},
  year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
bunny		bunny
data		data
figures		figures
README.md		README.md
mdpo_trainer.py		mdpo_trainer.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multimodal Direct Preference Optimization (mDPO)

Updates

Installation

Training

Evaluation

Citation

About

Releases

Packages

Languages

luka-group/mDPO

Folders and files

Latest commit

History

Repository files navigation

Multimodal Direct Preference Optimization (mDPO)

Updates

Installation

Training

Evaluation

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages