Skip to content
/ VG4D Public

Implementation of the paper: VG4D: Vision-Language Model Goes 4D Video Recognition(ICRA 2024)

Notifications You must be signed in to change notification settings

Shark0-0/VG4D

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 

Repository files navigation


VG4D: Vision-Language Model Goes 4D Video Recognition

ICRA, 2024
Zhichao Deng · Xiangtai Li · Xia Li · Yunhai Tong
Shen Zhao* . Mengyuan Liu*

arXiv PDF Project Page


VG4D Framework

avatar

News

  • The code will be available before the meeting

[Paper] [CODE]

Citation

If you think VG4D is useful for your research, please consider referring VG4D:

@inproceedings{deng2024vg4d,
  title={VG4D: Vision-Language Model Goes 4D Video Recognition},
  author={Zhichao Deng, Xiangtai Li, Xia Li, Yunhai Tong, Shen Zhao, Mengyuan Liu},
  booktitle={ICRA},
  year={2024}
}

Acknowledgement

This work is built upon the PSTNet, ULIP, XCLIP.

License

MIT

About

Implementation of the paper: VG4D: Vision-Language Model Goes 4D Video Recognition(ICRA 2024)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published