| reached_out_link | title | arxiv_id | GitHub | type | num_models | num_datasets | num_spaces |
|---|---|---|---|---|---|---|---|
| | Is Retain Set All You Need in Machine Unlearning? Restoring Performance of Unlearned Models with Out-Of-Distribution Images | 2404.12922 | https://github.com/jbonato1/scar | Poster | 0 | 0 | 0 |
| | Octopus: Embodied Vision-Language Programmer from Environmental Feedback | 2310.08588 | https://github.com/dongyh20/octopus | Poster | 0 | 0 | 0 |
| | FunQA: Towards Surprising Video Comprehension | 2306.14899 | https://github.com/jingkang50/funqa | Poster | 0 | 0 | 0 |
| | 4D Contrastive Superflows are Dense 3D Representation Learners | 2407.06190 | https://github.com/xiangxu-0103/superflow | Poster | 0 | 0 | 0 |
| | ItTakesTwo: Leveraging Peer Representations for Semi-supervised LiDAR Semantic Segmentation | 2407.07171 | https://github.com/yyliu01/it2 | Poster | 0 | 0 | 0 |
| | Ponymation: Learning Articulated 3D Animal Motions from Unlabeled Online Videos | 2312.13604 | | Poster | 0 | 0 | 0 |
| | Robust Fitting on a Gate Quantum Computer | | | Oral | 0 | 0 | 0 |
| | H-V2X: A Large Scale Highway Dataset for BEV Perception | | | Oral | 0 | 0 | 0 |
| | Learning Camouflaged Object Detection from Noisy Pseudo Label | 2407.13157 | | Poster | 0 | 0 | 0 |
| | Weakly Supervised 3D Object Detection via Multi-Level Visual Guidance | 2312.07530 | https://github.com/kuanchihhuang/vg-w3d | Poster | 0 | 0 | 0 |
| | Deblur e-NeRF: NeRF from Motion-Blurred Events under High-speed or Low-light Conditions | | | Poster | 0 | 0 | 0 |
| | CLR-GAN: Improving GANs Stability and Quality via Consistent Latent Representation and Reconstruction | | | Poster | 0 | 0 | 0 |
| | Learn from the Learnt: Source-Free Active Domain Adaptation via Contrastive Sampling and Visual Persistence | 2407.18899 | | Poster | 0 | 0 | 0 |
| | PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via Prompts | 2403.04993 | | Poster | 0 | 0 | 0 |
| | Motion Mamba: Efficient and Long Sequence Motion Generation | 2403.07487 | https://github.com/steve-zeyu-zhang/MotionMamba | Poster | 0 | 0 | 0 |
| | Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis | 2403.04116 | https://github.com/caiyuanhao1998/x-gaussian | Poster | 0 | 0 | 0 |
| | Tracking Meets LoRA: Faster Training, Larger Model, Stronger Performance | 2403.05231 | https://github.com/litinglin/lorat | Poster | 0 | 0 | 0 |
| | A Direct Approach to Viewing Graph Solvability | | | Oral | 0 | 0 | 0 |
| | CoR-GS: Sparse-View 3D Gaussian Splatting via Co-Regularization | 2405.12110 | | Poster | 0 | 0 | 0 |
| | SeFlow: A Self-Supervised Scene Flow Method in Autonomous Driving | 2407.01702 | https://github.com/kth-rpl/deflow | Poster | 0 | 0 | 0 |
| | ZeST: Zero-Shot Material Transfer from a Single Image | 2404.06425 | | Poster | 0 | 0 | 0 |
| | 3D Congealing: 3D-Aware Image Alignment in the Wild | 2404.02125 | | Poster | 0 | 0 | 0 |
| | SMooDi: Stylized Motion Diffusion Model | 2407.12783 | | Poster | 0 | 0 | 0 |
| | ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs | 2311.13600 | | Poster | 0 | 0 | 0 |
| | SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion | 2403.12008 | | Oral | 1 | 0 | 0 |
| | WordRobe: Text-Guided Generation of Textured 3D Garments | 2403.17541 | | Poster | 0 | 0 | 0 |
| | Learning to Generate Conditional Tri-plane for 3D-aware Expression Controllable Portrait Animation | 2404.00636 | | Poster | 0 | 0 | 0 |
| | SimPB: A Single Model for 2D and 3D Object Detection from Multiple Cameras | 2403.10353 | https://github.com/nullmax-vision/simpb | Poster | 0 | 0 | 0 |
| | EMDM: Efficient Motion Diffusion Model for Fast, High-Quality Human Motion Generation | 2312.02256 | | Poster | 0 | 0 | 0 |
| | Editable Image Elements for Controllable Synthesis | 2404.16029 | | Poster | 0 | 0 | 0 |
| | Improving 2D Feature Representations by 3D-Aware Fine-Tuning | 2407.20229 | | Poster | 1 | 0 | 1 |
| | Self-supervised Feature Adaptation for 3D Industrial Anomaly Detection | 2401.03145 | | Poster | 0 | 0 | 0 |
| | PCF-Lift: Panoptic Lifting by Probabilistic Contrastive Fusion | | | Poster | 0 | 0 | 0 |
| | SemGrasp: Semantic Grasp Generation via Language Aligned Discretization | 2404.03590 | | Oral | 0 | 0 | 0 |
| | MANIKIN: Biomechanically Accurate Neural Inverse Kinematics for Human Motion Estimation | | | Poster | 0 | 0 | 0 |
| | Simple Unsupervised Knowledge Distillation With Space Similarity | | | Poster | 0 | 0 | 0 |
| | DragAPart: Learning a Part-Level Motion Prior for Articulated Objects | 2403.15382 | | Poster | 0 | 0 | 0 |
| | Diffusion Bridges for 3D Point Cloud Denoising | 2408.16325 | | Poster | 0 | 0 | 0 |
| | Optimizing Illuminant Estimation in Dual-Exposure HDR Imaging | 2403.02449 | | Poster | 0 | 0 | 0 |
| | BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos | 2312.00083 | https://github.com/Pilhyeon/BAM-DETR | Poster | 0 | 0 | 0 |
| | MarineInst: A Foundation Model for Marine Image Analysis with Instance Visual Description | | | Oral | 0 | 0 | 0 |
| | Superpixel-informed Implicit Neural Representation for Multi-Dimensional Data | | | Poster | 0 | 0 | 0 |
| | EgoPoser: Robust Real-Time Egocentric Pose Estimation from Sparse and Intermittent Observations Everywhere | 2308.06493 | | Poster | 0 | 0 | 0 |
| | Physics-Free Spectrally Multiplexed Photometric Stereo under Unknown Spectral Composition | | | Oral | 0 | 0 | 0 |
| | SplatFields: Neural Gaussian Splats for Sparse 3D and 4D Reconstruction | | | Poster | 0 | 0 | 0 |
| | VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models | 2403.12034 | | Poster | 2 | 0 | 8 |
| | Alignist: CAD-Informed Orientation Distribution Estimation by Fusing Shape and Correspondences | | | Poster | 0 | 0 | 0 |
| | Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs | 2403.11755 | https://github.com/jmiemirza/meta-prompting | Poster | 0 | 0 | 0 |
| | Physics-Based Interaction with 3D Objects via Video Generation | 2404.13026 | | Oral | 0 | 0 | 0 |
| | Reconstruction and Simulation of Elastic Objects with Spring-Mass 3D Gaussians | 2403.09434 | | Poster | 0 | 0 | 0 |
| | Deep Patch Visual SLAM | 2408.01654 | https://github.com/princeton-vl/dpvo | Poster | 0 | 0 | 0 |
| | Surface Reconstruction for 3D Gaussian Splatting via Local Structural Hints | | | Poster | 0 | 0 | 0 |
| | HeadGaS: Real-Time Animatable Head Avatars via 3D Gaussian Splatting | 2312.02902 | | Poster | 0 | 0 | 0 |
| | LayeredFlow: A Real-World Benchmark for Non-Lambertian Multi-Layer Optical Flow | | | Poster | 0 | 0 | 0 |
| | Learning 3D Geometry and Feature Consistent Gaussian Splatting for Object Removal | 2404.13679 | | Poster | 0 | 0 | 0 |
| | Motion-prior Contrast Maximization for Dense Continuous-Time Motion Estimation | 2407.10802 | https://github.com/tub-rip/motionpriorcmax | Poster | 0 | 0 | 0 |
| | Efficient Few-Shot Action Recognition via Multi-Level Post-Reasoning | | | Poster | 0 | 0 | 0 |
| | Text2Place: Affordance-aware Text Guided Human Placement | 2407.15446 | | Poster | 0 | 0 | 0 |
| | OGNI-DC: Robust Depth Completion with Optimization-Guided Neural Iterations | 2406.11711 | https://github.com/princeton-vl/ogni-dc | Poster | 0 | 0 | 0 |
| | Zero-Shot Multi-Object Scene Completion | 2403.14628 | | Poster | 0 | 0 | 0 |
| | Beta-Tuned Timestep Diffusion Model | | | Poster | 0 | 0 | 0 |
| | POA: Pre-training Once for Models of All Sizes | 2408.01031 | https://github.com/qichuzyy/poa | Poster | 0 | 0 | 0 |
| | Taming Latent Diffusion Model for Neural Radiance Field Inpainting | 2404.09995 | | Poster | 0 | 0 | 0 |
| | MapDistill: Boosting Efficient Camera-based HD Map Construction via Camera-LiDAR Fusion Model Distillation | 2407.11682 | | Poster | 0 | 0 | 0 |
| | ByteEdit: Boost, Comply and Accelerate Generative Image Editing | 2404.04860 | | Poster | 0 | 0 | 0 |
| | ProDepth: Boosting Self-Supervised Multi-Frame Monocular Depth with Probabilistic Fusion | 2407.09303 | https://github.com/sungmin-woo/ProDepth | Poster | 0 | 0 | 0 |
| | High-Resolution and Few-shot View Synthesis from Asymmetric Dual-lens Inputs | | | Poster | 0 | 0 | 0 |
| | Accelerating Image Super-Resolution Networks with Pixel-Level Classification | 2407.21448 | | Poster | 0 | 0 | 0 |
| | LASS3D: Language-Assisted Semi-Supervised 3D Semantic Segmentation with Progressive Unreliable Data Exploitation | | | Poster | 0 | 0 | 0 |
| | Contourlet Residual for Prompt Learning Enhanced Infrared Image Super-Resolution | | | Poster | 0 | 0 | 0 |
| | Click-Gaussian: Interactive Segmentation to Any 3D Gaussians | 2407.11793 | | Poster | 0 | 0 | 0 |
| | Random Walk on Pixel Manifolds for Anomaly Segmentation of Complex Driving Scenes | 2404.17961 | https://github.com/zelongzeng/rwpm | Poster | 0 | 0 | 0 |
| | DySeT: a Dynamic Masked Self-distillation Approach for Robust Trajectory Prediction | | | Poster | 0 | 0 | 0 |
| | Track Everything Everywhere Fast and Robustly | 2403.17931 | | Poster | 0 | 0 | 0 |
| | Towards Open-ended Visual Quality Comparison | 2402.16641 | | Oral | 2 | 1 | 1 |
| | FreeInit: Bridging Initialization Gap in Video Diffusion Models | 2312.07537 | https://github.com/tianxingwu/freeinit | Poster | 0 | 0 | 2 |
| | DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs | 2403.19588 | https://github.com/naver-ai/rdnet | Poster | 5 | 0 | 0 |
| | Eliminating Feature Ambiguity for Few-Shot Segmentation | 2407.09842 | https://github.com/sam1224/aenet | Poster | 0 | 0 | 0 |
| | Soft Prompt Generation for Domain Generalization | 2404.19286 | https://github.com/renytek13/soft-prompt-generation-with-cgan | Poster | 0 | 0 | 0 |
| | Shedding More Light on Robust Classifiers under the lens of Energy-based Models | 2407.06315 | https://github.com/omnai-lab/robust-classifiers-under-the-lens-of-ebm | Poster | 0 | 0 | 0 |
| | LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation | 2402.05054 | | Oral | 18 | 0 | 20 |
| | Mahalanobis Distance-based Multi-view Optimal Transport for Multi-view Crowd Localization | | | Poster | 0 | 0 | 0 |
| | RAW-Adapter: Adapting Pretrained Visual Model to Camera RAW Images | 2408.14802 | | Poster | 0 | 0 | 0 |
| | SLEDGE: Synthesizing Driving Environments with Generative Models and Rule-Based Traffic | 2403.17933 | https://github.com/autonomousvision/sledge | Poster | 0 | 0 | 0 |
| | AFreeCA: Annotation-Free Counting for All | 2403.04943 | https://github.com/adrian-dalessandro/afreeca | Poster | 0 | 0 | 0 |
| | Adversarially Robust Distillation by Reducing the Student-Teacher Variance Gap | | | Poster | 0 | 0 | 0 |
| | LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation | 2403.12019 | https://github.com/nirvanalan/ln3diff | Poster | 1 | 0 | 1 |
| | Hierarchical Temporal Context Learning for Camera-based Semantic Scene Completion | 2407.02077 | https://github.com/arlo0o/htcl | Poster | 0 | 0 | 0 |
| | Equi-GSPR: Equivariant SE(3) Graph Network Model for Sparse Point Cloud Registration | | | Oral | 0 | 0 | 0 |
| | GTP-4o: Modality-prompted Heterogeneous Graph Learning for Omni-modal Biomedical Representation | 2407.05540 | | Poster | 0 | 0 | 0 |
| | PromptCCD: Learning Gaussian Mixture Prompt Pool for Continual Category Discovery | 2407.19001 | | Poster | 0 | 0 | 0 |
| | Sapiens: Foundation for Human Vision Models | 2408.12569 | | Oral | 1 | 0 | 4 |
| | Linearly Controllable GAN: Unsupervised Feature Categorization and Decomposition for Image Generation and Manipulation | | | Poster | 0 | 0 | 0 |
| | Generating Human Interaction Motions in Scenes with Text Control | 2404.10685 | | Poster | 0 | 0 | 0 |
| | NOVUM: Neural Object Volumes for Robust Object Classification | 2305.14668 | https://github.com/genintel/novum | Poster | 0 | 0 | 0 |
| | Align before Collaborate: Mitigating Feature Misalignment for Robust Multi-Agent Perception | | | Oral | 0 | 0 | 0 |
| | HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects | 2407.12371 | | Poster | 0 | 0 | 0 |
| | SAIR: Learning Semantic-aware Implicit Representation | 2310.09285 | | Poster | 0 | 0 | 0 |
| | ColorMNet: A Memory-based Deep Spatial-Temporal Feature Propagation Network for Video Colorization | 2404.06251 | https://github.com/yyang181/colormnet | Poster | 0 | 0 | 0 |
| | UNIC: Universal Classification Models via Multi-teacher Distillation | 2408.05088 | | Poster | 0 | 0 | 0 |