-
Tau Motors
- Redwood City, CA
๐น skate ๐น
VideoLLM: Modeling Video Sequence with Large Language Models
Implementation of DynIBaR Neural Dynamic Image-Based Rendering (CVPR 2023)
Cool experiments at the intersection of Computer Vision and Sports โฝ๐
4DHumans: Reconstructing and Tracking Humans with Transformers
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
[Image and Vision Computing (Vol.147 Jul. '24)] Interactive Natural Image Matting with Segment Anything Models
Open-source and strong foundation image recognition models.
Official Code for DragGAN (SIGGRAPH 2023)
Code repository for the paper "On the Benefits of 3D Pose and Tracking for Human Action Recognition", (CVPR 2023)
[NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs
LightGlue: Local Feature Matching at Light Speed (ICCV 2023)
SAM-PT: Extending SAM to zero-shot video segmentation with point-based tracking.
Hiera: A fast, powerful, and simple hierarchical vision transformer.
[NeurIPS2023] Code release for "Hierarchical Open-vocabulary Universal Image Segmentation"
The official repository of the RePoGen paper
ICCV 2023 "Neural Video Depth Stabilizer" (NVDS) & TPAMI 2024 "NVDS+: Towards Efficient and Versatile Neural Stabilizer for Video Depth Estimation" (NVDS+)
[ICCV 2023] OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation
[CVPR 2023] Official repository for downloading, processing, visualizing, and training models on the ARCTIC dataset.
Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024)
[ICCV 2023] Spectrum-guided Multi-granularity Referring Video Object Segmentation.
Accelerated pose estimation and tracking using semi-supervised convolutional networks.
"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)
Code for our ICCV'2023 paper "SHERF: Generalizable Human NeRF from a Single Image"