
resolution ratio of input image #23

Open
chky1997 opened this issue Feb 27, 2023 · 5 comments
chky1997 commented Feb 27, 2023

Hi, the results look a little blurry when visualized with your gui_human.py. Is it the resolution ratio (input_ratio in the yaml) that causes the problem? Will the results look much clearer if the parameter is set to 1.0 for training and inference? Thank you!

@haotongl
Member

Although I haven't conducted that particular experiment yet, my experience with other datasets suggests that training a model with full views (21 views for ZJU-MoCap) and an input ratio of 1.0 can lead to optimal rendering results.
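For reference, the full-resolution setting described above might look like this in the config. Only input_ratio is named in this thread; the surrounding keys are illustrative assumptions, not the repo's actual schema:

```yaml
# Illustrative sketch only: input_ratio is the key mentioned in this thread;
# the surrounding structure is an assumption, not the repo's real config layout.
train_dataset:
  input_ratio: 1.0   # train on full-resolution inputs
test_dataset:
  input_ratio: 1.0   # render at full resolution as well
```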

@chky1997
Author

chky1997 commented Mar 3, 2023

About the outdoor dataset, what resolution do your cameras record the videos at? Do you resize the images to 1024×1024 right after recording, before estimating the SMPL keypoints?
On the project page, the video of the outdoor dataset also looks clearer than the ZJU-MoCap dataset. Is there any difference between the two datasets during the recording stage?

@haotongl
Member

haotongl commented Mar 3, 2023

The ZJU-MoCap dataset is captured with 21 industrial cameras (2048×2048). We resize the images to 1024×1024.
I think estimating the SMPL keypoints at a different resolution will not affect the rendering results much, since the keypoints are only used to define a bbox that bounds the foreground region.

The outdoor dataset is captured with 18 GoPro cameras (1920×1080). We keep the original resolution.
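The keypoint-to-bbox step mentioned above can be sketched as follows. This is a generic illustration, not the repo's actual code; the function name and the padding value are hypothetical:

```python
import numpy as np

def keypoints_to_bbox(keypoints, padding=0.3):
    """Axis-aligned 3D bbox around SMPL keypoints.

    keypoints: (N, 3) array of 3D joint positions in world space.
    padding:   margin added on every side (hypothetical value, in metres),
               so the bbox bounds the whole body, not just the joints.
    Returns a (2, 3) array: [min corner, max corner].
    """
    kps = np.asarray(keypoints, dtype=np.float32)
    lo = kps.min(axis=0) - padding
    hi = kps.max(axis=0) + padding
    return np.stack([lo, hi])
```

Because the bbox is just a padded min/max over the joints, moderate keypoint noise from running the estimator at a lower resolution only shifts the box slightly, which is consistent with the point above that it barely affects rendering.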

@chky1997
Author

chky1997 commented Mar 6, 2023

About the outdoor dataset, I found that the vhull dir contains the 3D bbox information. But I wonder how background.ply is obtained. Is it generated from the 18 background images? Also, I noticed that the outdoor dataset no longer needs the SMPL points; it just needs the human images, the human 3D mask (generated from the 2D mask and converted to 3D using the camera intri and extri), and the background information. Is that right?
By the way, could you tell me the average distance between the GoPro cameras? Thank you!

@haotongl
Member

haotongl commented Mar 6, 2023

  1. background.ply is the SfM sparse point cloud, which is computed during calibration.
  2. The outdoor dataset does not need human mask information. To obtain the 3D bbox, you can follow this suggestion:
    The vedio of the ZJU-Mocap #27 (comment)
  3. About 0.1-0.3 m. The exact value can be obtained by computing the distance between camera positions from extri.yml. Units in extri.yml have been normalized to meters.
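Point 3 can be sketched with standard pinhole geometry. This is not the repo's own loader; it assumes extri.yml has already been parsed into per-camera (R, T) pairs in the usual world-to-camera convention (x_cam = R · x_world + T), so the camera centre is C = -Rᵀ T:

```python
import numpy as np

def camera_center(R, T):
    """World-space camera centre from a world-to-camera extrinsic.

    With x_cam = R @ x_world + T, the centre satisfies R @ C + T = 0,
    hence C = -R.T @ T.
    """
    R = np.asarray(R, dtype=np.float64).reshape(3, 3)
    T = np.asarray(T, dtype=np.float64).reshape(3)
    return -R.T @ T

def neighbour_distances(extrinsics):
    """Distances between consecutive cameras, given a list of (R, T)
    pairs ordered around the rig. Units follow extri.yml (metres here)."""
    centers = np.array([camera_center(R, T) for R, T in extrinsics])
    return np.linalg.norm(np.diff(centers, axis=0), axis=1)
```

Feeding all 18 GoPro extrinsics through neighbour_distances and averaging should reproduce the quoted 0.1-0.3 m spacing.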
