Sammy42779
  • SUSTech
  • Shenzhen, China

Highlights

  • Pro


Starred repositories

A list of tools, papers and code related to Deepfake Detection.

996 95 Updated Sep 2, 2024

AnyDoor: Test-Time Backdoor Attacks on Multimodal Large Language Models

Python 43 1 Updated Apr 8, 2024

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

1,373 71 Updated Oct 9, 2024

Awesome work on word sense disambiguation in general

3 Updated Apr 7, 2023

✨✨Latest Advances on Multimodal Large Language Models

12,099 774 Updated Oct 9, 2024

PyTorch implementation of convolutional neural network adversarial attack techniques

Python 350 61 Updated Dec 3, 2018
Python 8 4 Updated Jun 26, 2024
Python 85 6 Updated Feb 16, 2024
Python 12 2 Updated Jul 25, 2022

A curated list of papers & resources on backdoor attacks and defenses in deep learning.

Python 167 15 Updated Mar 15, 2024

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,342 2,912 Updated Sep 2, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 19,668 2,164 Updated Aug 12, 2024

X-modaler is a versatile and high-performance codebase for cross-modal analytics (e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsens…

Python 1,022 111 Updated Feb 27, 2023

Transferable Decoding with Visual Entities for Zero-Shot Image Captioning, ICCV 2023

Python 144 5 Updated Sep 9, 2024

PyTorch implementation of adversarial attacks [torchattacks]

Python 1,865 349 Updated Jun 29, 2024

Reading list on instruction tuning, a trend that started with Natural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).

750 24 Updated Jul 20, 2023

An open-source framework for training large multimodal models.

Python 3,690 280 Updated Aug 31, 2024

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Python 3,560 241 Updated Mar 5, 2024

Awesome Multimodal Assistant is a curated list of multimodal chatbots/conversational assistants that utilize various modes of interaction, such as text, speech, images, and videos, to provide a sea…

64 6 Updated Jun 18, 2023

Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models"

Python 374 9 Updated Mar 25, 2024

[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

Python 2,494 232 Updated Aug 1, 2024

Official implementation and data release of the paper "Visual Prompting via Image Inpainting".

Jupyter Notebook 298 20 Updated Aug 7, 2023

PyTorch implementation of MAE https://arxiv.org/abs/2111.06377

Python 7,219 1,206 Updated Jul 23, 2024

❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119

Python 1,007 91 Updated Sep 2, 2023

[CVPR23] Visual Prompt Multi-Modal Tracking

Python 250 18 Updated Jul 28, 2023

🐙 Guides, papers, lectures, notebooks and resources for prompt engineering

MDX 49,228 4,779 Updated Sep 19, 2024

A curated list of awesome Mix

62 1 Updated Dec 23, 2022

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 9,747 954 Updated Aug 23, 2024

ImageNet-R(endition) and DeepAugment (ICCV 2021)

Python 250 17 Updated Jul 23, 2021

EasyRobust: an Easy-to-use library for state-of-the-art Robust Computer Vision Research with PyTorch.

Jupyter Notebook 321 37 Updated Jun 30, 2024