-
Baidu | HUST
- Shang Hai, China
- https://ifzhang.github.io/
Block or Report
Block or report ifzhang
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
WeakTr: Exploring Plain Vision Transformer for Weakly-supervised Semantic Segmentation
This is an official implementation of our CVPR 2023 paper "Human Pose as Compositional Tokens" (https://arxiv.org/pdf/2303.11638.pdf)
[ICCV 2023] VAD: Vectorized Scene Representation for Efficient Autonomous Driving
VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)
Official PyTorch implementation for a conditional diffusion probability model in BEV perception
[ECCV 2024] Lane Graph as Path: Continuity-preserving Path-wise Modeling for Online Lane Graph Construction
[CVPR'23] Universal Instance Perception as Object Discovery and Retrieval
[ICCV 2023] OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception
(CVPR2023) CAPE: Camera View Position Embedding for Multi-View 3D Object Detection
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
Code for "Multimodal Trajectory Prediction Conditioned on Lane-Graph Traversals," CoRL 2021.
An academic alternative to Tesla's occupancy network for autonomous driving.
A Simple Adaptive Unfolding Network for Hyperspectral Image Reconstruction
[ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"
[ICCV 2023] Cross Modal Transformer: Towards Fast and Robust 3D Object Detection
Forecasting from LiDAR via Future Object Detection. CVPR '22
Deep Learning for Vision-based Prediction
Official code base of the BEVDet series .
[CVPR 2023 Best Paper] Planning-oriented Autonomous Driving
EPNet++: Cascade Bi-directional Fusion for Multi-Modal 3D Object Detection (TPAMI-2022)
DAMO-YOLO: a fast and accurate object detection method with some new techs, including NAS backbones, efficient RepGFPN, ZeroHead, AlignedOTA, and distillation enhancement.
This repo is the code of paper "DiffusionInst: Diffusion Model for Instance Segmentation" (ICASSP'24).
Offical PyTorch implementation of "BEVFusion: A Simple and Robust LiDAR-Camera Fusion Framework"
FlyCV is a high-performance library for processing computer visual tasks.
[NeurIPS 2022] DeepInteraction: 3D Object Detection via Modality Interaction