The repository contains the official implementation for the paper "DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery" (CVPR 2024).
DPMesh is an innovative framework for occluded human Mesh recovery that capitalizes on the profound Diffusion Prior about object structure and spatial relationships embedded in a pre-trained text-to-image diffusion model.
- Release model and inference code.
- Release code for training dataloader .
We recommend you to use an Anaconda virtual environment. If you have installed Anaconda, run the following commands to create and activate a virtual environment.
conda env create -f environment.yaml
conda activate dpmesh
pip install git+https://github.com/cloneofsimo/lora.git
We prepare the data in a samilar way as 3DCrowdNet & JOTR. Please refer to here for dataset, SMPL model, VPoser model.
For 3DPW-OC and 3DPW-PC, we apply the same input key-points annotations as JOTR. Please refer to 3DPW-OC & 3DPW-PC.
For evaluation only, you can just prepare 3DPW dataset (images and annotations) and the joint regressors, we provide the directory structure below.
|-- common
| |-- utils
| | |-- human_model_files
| | |-- smplpytorch
|-- data
| |-- J_regressor_extra.npy
| |-- 3DPW
| | |-- 3DPW_latest_test.json
| | |-- 3DPW_oc.json
| | |-- 3DPW_pc.json
| | |-- 3DPW_validation_crowd_hhrnet_result.json
| | |-- imageFiles
| | |-- sequenceFiles
| |-- Human36M
| | |-- J_regressor_h36m_correct.npy
| |-- MSCOCO
| | |-- J_regressor_coco_hip_smpl.npy
Please download our pretrained checkpoints from this link and put them under ./checkpoints
. The file directory should be:
|-- checkpoints
|--|-- 3dpw_best_ckpt.pth.tar
|--|-- 3dpw-crowd_best_ckpt.pth.tar
|--|-- 3dpw-oc_best_ckpt.pth.tar
|--|-- 3dpw-pc_best_ckpt.pth.tar
You can test DPMesh use following commands:
CUDA_VISIBLE_DEVICES=0 \
torchrun \
--master_port 29591 \
--nproc_per_node 1 \
eval.py \
--cfg ./configs/main_train.yml \
--exp_id="main_train" \
--distributed \
The evaluation process can be done with one Nvidia GeForce RTX 4090 GPU (24GB VRAM). You can use more GPUs by specifying the GPU ids.
We would like to express our sincere thanks to the author of JOTR for the clear code base and quick response to our issues.
We also thank ControlNet, VPD and LoRA, for our code is partially borrowing from them.
- If you find an error as below, please refer to this link for help.
RuntimeError: Subtraction, the `-` operator, with a bool tensor is not supported. If you are trying to invert a mask, use the `~` or `logical_not()` operator instead.
@article{zhu2024dpmesh,
title={DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery},
author={Zhu, Yixuan and Li, Ao and Tang, Yansong and Zhao, Wenliang and Zhou, Jie and Lu, Jiwen},
journal={arXiv preprint arXiv:2404.01424},
year={2024}
}
This code is distributed under an MIT LICENSE.