Skip to content
View wondervictor's full-sized avatar
๐Ÿฅต
overwhelmed with papers & coding.....
๐Ÿฅต
overwhelmed with papers & coding.....

Highlights

  • Pro

Organizations

@hustvl @msra-alumni @HRNet @TencentARC

Block or report wondervictor

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
wondervictor/README.md

Hi there ๐Ÿ‘‹

I'm Tianheng Cheng, pursuing my Ph.D. now and working on Computer Vision and Machine Intelligence.

My research goal is to enable machines/robots to see and understand the world.

Previous works/publications are listed at Google Scholar ๐Ÿ“š.

Currently, I'm devoted to research on large multimodal models, foundational visual-language modeling, and image generation. Before that, I mainly focused on fundamental tasks such as object detection and instance segmentation, as well as visual perception for autonomous driving.

Highlighted Works of those pinned works:

  • The latest works ๐Ÿ”ฅ: EVF-SAM (arXiv) empowers segment-anything (SAM, SAM-2) with the strong text-prompting ability. Try our demo on HuggingFace.
  • OSP (ECCV 2024) explores sparse set of points to predict 3D semantic occupancy for autonomous vehicles, which is a brand new formulation!
  • YOLO-World (CVPR 2024) for real-time open-vocabulary object detection; Symphonies (CVPR 2024) for camera-based 3D scene completion.
  • SparseInst (CVPR 2022) aims for real-time instance segmentation with a simple fully convolutional framework! MobileInst (AAAI 2024) further explores temporal consistency and kernel reuse for efficient mobile video instance segmentation.
  • BoxTeacher (CVPR 2023) bridges the gap between fully supervised and box-supervised instance segmentation. With ~1/10 annotation cost, BoxTeacher can achieve 93% performance versus fully supervised methods.
  • BMask R-CNN (ECCV 2020) is the first work to introduce boundary modeling for objects and aims for high-performance instance segmentation. It leads the research about object boundaries for instance segmentation.
  • GKT (arXiv) addresses the ill-posed 2D-to-3D (Surrounding views to Bird-Eye views) transformation with the concern about accuracy and speed, especially for practical implementation for autonomous systems.

Pinned Loading

  1. AILab-CVC/YOLO-World AILab-CVC/YOLO-World Public

    [CVPR 2024] Real-Time Open-Vocabulary Object Detection

    Python 4.2k 407

  2. hustvl/SparseInst hustvl/SparseInst Public

    [CVPR 2022] SparseInst: Sparse Instance Activation for Real-Time Instance Segmentation

    Python 577 72

  3. hustvl/GKT hustvl/GKT Public

    Efficient and Robust 2D-to-BEV Representation Learning via Geometry-guided Kernel Transformer

    Python 221 18

  4. hustvl/Symphonies hustvl/Symphonies Public

    [CVPR 2024] Symphonies (Scene-from-Insts): Symphonize 3D Semantic Scene Completion with Contextual Instance Queries

    Python 151 8

  5. TencentARC/mllm-npu TencentARC/mllm-npu Public

    mllm-npu: training multimodal large language models on Ascend NPUs

    Python 68 2

  6. hustvl/EVF-SAM hustvl/EVF-SAM Public

    Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"

    Python 196 7