Lists (1)
Sort Name ascending (A-Z)
Stars
Script for downloading Flickr-SoundNet dataset used in Look, Listen and Learn (Arandjelovic, Zisserman; 2017)
Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".
Repository of the WACV'24 paper "Can CLIP Help Sound Source Localization?"
Localizing Visual Sounds the Hard Way
Codebase and Dataset for the paper: Learning to Localize Sound Source in Visual Scenes
A curated list of different papers and datasets in various areas of audio-visual processing
Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders
Repository providing a wide range of self-supervised pretrained models for computer vision tasks.
Improved Transferability of Self-Supervised Learning Models Through Batch Normalization Finetuning
VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
A Contrastive Learning Boost from Intermediate Pre-Trained Representations
A repo for publishing solution to 3DCoMPaT++ challenge on an improved large-scale 3D vision dataset for compositional recognition
3DCoMPaT++: An improved large-scale 3D vision dataset for compositional recognition
A paper list of object detection using deep learning.
A series of tutorial notebooks on denoising diffusion probabilistic models in PyTorch
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
End-to-End Object Detection with Transformers
Code for ICML 2019 paper "Simple Black-box Adversarial Attacks"
A targeted adversarial attack method, which won the NIPS 2017 targeted adversarial attacks competition
A non-targeted adversarial attack method, which won the first place in NIPS 2017 non-targeted adversarial attacks competition
Official Code for Efficient and Effective Augmentation Strategy for Adversarial Training (NeurIPS-2022)
This repository contains the implementation of three adversarial example attack methods FGSM, IFGSM, MI-FGSM and one Distillation as defense against all attacks using MNIST dataset.
PyTorch implementation of adversarial attacks [torchattacks]
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch