Skip to content
View WKlee0607's full-sized avatar

Block or report WKlee0607

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Script for downloading Flickr-SoundNet dataset used in Look, Listen and Learn (Arandjelovic, Zisserman; 2017)

Python 9 4 Updated Aug 7, 2021

Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".

Python 222 22 Updated Mar 20, 2024

Scripts for download AudioSet

Jupyter Notebook 65 45 Updated Nov 7, 2017

download the vggsound dataset

Shell 18 2 Updated Feb 22, 2022

Repository of the WACV'24 paper "Can CLIP Help Sound Source Localization?"

Python 13 2 Updated Mar 27, 2024

Localizing Visual Sounds the Hard Way

Python 76 15 Updated Jul 6, 2022

Codebase and Dataset for the paper: Learning to Localize Sound Source in Visual Scenes

Python 81 22 Updated May 20, 2021

A curated list of different papers and datasets in various areas of audio-visual processing

654 70 Updated Jan 30, 2024

Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders

Python 85 5 Updated Jul 31, 2024
Python 17 1 Updated Jul 23, 2024

Repository providing a wide range of self-supervised pretrained models for computer vision tasks.

Python 61 5 Updated Mar 31, 2021

Improved Transferability of Self-Supervised Learning Models Through Batch Normalization Finetuning

Python 2 Updated Aug 2, 2024

VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.

Jupyter Notebook 3,250 331 Updated Mar 3, 2024

PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

Python 6,214 904 Updated Jul 3, 2024

A Contrastive Learning Boost from Intermediate Pre-Trained Representations

Python 34 3 Updated Sep 9, 2024

A repo for publishing solution to 3DCoMPaT++ challenge on an improved large-scale 3D vision dataset for compositional recognition

Python 12 Updated Jun 22, 2023

3DCoMPaT++: An improved large-scale 3D vision dataset for compositional recognition

Python 74 7 Updated Jul 9, 2024

A paper list of object detection using deep learning.

Python 11,288 2,782 Updated Feb 12, 2024

A series of tutorial notebooks on denoising diffusion probabilistic models in PyTorch

Jupyter Notebook 626 75 Updated Nov 12, 2022

Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.

Python 12,594 2,862 Updated Sep 11, 2024

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Python 8,823 1,370 Updated Aug 9, 2024

End-to-End Object Detection with Transformers

Python 13,359 2,408 Updated Mar 12, 2024

Code for ICML 2019 paper "Simple Black-box Adversarial Attacks"

Python 191 56 Updated Mar 27, 2023
Python 10 3 Updated Jun 26, 2024

A targeted adversarial attack method, which won the NIPS 2017 targeted adversarial attacks competition

Python 129 38 Updated May 29, 2018

A non-targeted adversarial attack method, which won the first place in NIPS 2017 non-targeted adversarial attacks competition

Python 242 53 Updated Oct 30, 2019

Official Code for Efficient and Effective Augmentation Strategy for Adversarial Training (NeurIPS-2022)

Python 15 1 Updated Mar 29, 2023

This repository contains the implementation of three adversarial example attack methods FGSM, IFGSM, MI-FGSM and one Distillation as defense against all attacks using MNIST dataset.

Jupyter Notebook 115 26 Updated Dec 17, 2020

PyTorch implementation of adversarial attacks [torchattacks]

Python 1,849 345 Updated Jun 29, 2024

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 19,625 2,959 Updated Aug 28, 2024
Next