Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Explanation on the demo #9

Open
Angus-Qi opened this issue Dec 27, 2023 · 3 comments
Open

Explanation on the demo #9

Angus-Qi opened this issue Dec 27, 2023 · 3 comments

Comments

@Angus-Qi
Copy link

Thanks for the contribution and releasing the code for this project, the work done is really interesting.

Regarding the visual_3dbbox.py demo, could you explain what are the prompt.txt and target.txt? I look forward to testing on other prompt and target images.

@ddz16
Copy link

ddz16 commented Dec 27, 2023

I have the same question. Hope for the author's reply. Thanks! @paulpanwang @zhiwenfan

@ddz16
Copy link

ddz16 commented Jan 25, 2024

I seem to have figured out what these two txt files mean. The 3x4 matrix in prompt.txt represents the rotation matrix (3x3) and translation (3x1) of the object relative to the camera when capturing the prompt.png image. Similarly, target.txt represents the rotation matrix and translation of the object relative to the camera when capturing the target.png image. @Angus-Qi

@bsamadi
Copy link

bsamadi commented Feb 1, 2024

@ddz16, thank you for your response. Where can we see a definition? If we know the pose of the object in the target image, what is being calculated?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants