ifzhang
🐶
Focusing

Organizations

@hustvl

WeakTr: Exploring Plain Vision Transformer for Weakly-supervised Semantic Segmentation

Python 120 2 Updated Nov 12, 2023

This is an official implementation of our CVPR 2023 paper "Human Pose as Compositional Tokens" (https://arxiv.org/pdf/2303.11638.pdf)

Python 291 18 Updated Jun 12, 2023

[ICCV 2023] VAD: Vectorized Scene Representation for Efficient Autonomous Driving

Python 497 45 Updated Jun 17, 2024

VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)

Python 679 56 Updated Jun 3, 2023

Official PyTorch implementation for a conditional diffusion probability model in BEV perception

Python 236 10 Updated Apr 4, 2023

[ECCV 2024] Lane Graph as Path: Continuity-preserving Path-wise Modeling for Online Lane Graph Construction

112 2 Updated Sep 6, 2023

[CVPR'23] Universal Instance Perception as Object Discovery and Retrieval

Python 1,471 158 Updated Jul 18, 2023

[ICCV 2023] OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception

Python 552 48 Updated Sep 17, 2023

(CVPR2023) CAPE: Camera View Position Embedding for Multi-View 3D Object Detection

Python 99 7 Updated May 5, 2023

[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.

Python 3,064 499 Updated May 3, 2024

Code for "Multimodal Trajectory Prediction Conditioned on Lane-Graph Traversals," CoRL 2021.

Python 207 35 Updated Sep 13, 2022

An academic alternative to Tesla's occupancy network for autonomous driving.

Python 1,096 102 Updated May 29, 2024

A Simple Adaptive Unfolding Network for Hyperspectral Image Reconstruction

Python 30 2 Updated Feb 1, 2023

[ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; PyTorch implementation of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"

Python 1,398 82 Updated Jan 23, 2024

[ICCV 2023] Cross Modal Transformer: Towards Fast and Robust 3D Object Detection

Python 310 34 Updated Oct 7, 2023

Forecasting from LiDAR via Future Object Detection. CVPR '22

Python 114 13 Updated May 3, 2022

Deep Learning for Vision-based Prediction

TeX 311 59 Updated Feb 25, 2024

Official code base of the BEVDet series.

Python 1,325 245 Updated Jul 4, 2024

[CVPR 2023 Best Paper] Planning-oriented Autonomous Driving

Python 3,089 328 Updated Jul 8, 2024

EPNet++: Cascade Bi-directional Fusion for Multi-Modal 3D Object Detection (TPAMI-2022)

Python 50 5 Updated Dec 24, 2022

DAMO-YOLO: a fast and accurate object detection method with some new techs, including NAS backbones, efficient RepGFPN, ZeroHead, AlignedOTA, and distillation enhancement.

Python 3,718 466 Updated May 25, 2024

Detection Transformers with Assignment

Python 238 20 Updated Sep 16, 2023

The official code of PolarBEV (CoRL 2022).

Python 53 3 Updated Sep 17, 2022

This repo contains the code for the paper "DiffusionInst: Diffusion Model for Instance Segmentation" (ICASSP'24).

Python 222 11 Updated Jul 5, 2024

Official PyTorch implementation of "BEVFusion: A Simple and Robust LiDAR-Camera Fusion Framework"

Python 690 101 Updated Apr 5, 2023

FlyCV is a high-performance library for processing computer visual tasks.

C++ 575 57 Updated Jun 2, 2023

[NeurIPS 2022] DeepInteraction: 3D Object Detection via Modality Interaction

Python 196 14 Updated Jan 30, 2024