sammy Sammy42779

@sustech. Primarily focused on trustworthy AI, specifically the robustness of multimodal large language models.

14 followers · 13 following

SUSTech
Shenzhen, China

Lists (2)

MM

NeurIPS2023

Starred repositories

Daisy-Zhang / Awesome-Deepfakes-Detection

A list of tools, papers and code related to Deepfake Detection.

996 stars · 95 forks · Updated Sep 2, 2024

sail-sg / AnyDoor

AnyDoor: Test-Time Backdoor Attacks on Multimodal Large Language Models

Python · 43 stars · 1 fork · Updated Apr 8, 2024

yunlong10 / Awesome-LLMs-for-Video-Understanding

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

1,373 stars · 71 forks · Updated Oct 9, 2024

RyanLiut / awesome_word_sense_disambiguation

Awesome work on word sense disambiguation in general

3 stars · Updated Apr 7, 2023

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

12,099 stars · 774 forks · Updated Oct 9, 2024

utkuozbulak / pytorch-cnn-adversarial-attacks

Pytorch implementation of convolutional neural network adversarial attack techniques

Python · 350 stars · 61 forks · Updated Dec 3, 2018

Farhamdur / CGBA

Python · 8 stars · 4 forks · Updated Jun 26, 2024

thu-ml / Attack-Bard

Python · 85 stars · 6 forks · Updated Feb 16, 2024

ShawnXYang / C-GSP

Python · 12 stars · 2 forks · Updated Jul 25, 2022

zihao-ai / Awesome-Backdoor-in-Deep-Learning

A curated list of papers & resources on backdoor attacks and defenses in deep learning.

Python · 167 stars · 15 forks · Updated Mar 15, 2024

Vision-CAIR / MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python · 25,342 stars · 2,912 forks · Updated Sep 2, 2024

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python · 19,668 stars · 2,164 forks · Updated Aug 12, 2024

YehLi / xmodaler

X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsens…

Python · 1,022 stars · 111 forks · Updated Feb 27, 2023

FeiElysia / ViECap

Transferable Decoding with Visual Entities for Zero-Shot Image Captioning, ICCV 2023

Python · 144 stars · 5 forks · Updated Sep 9, 2024

Harry24k / adversarial-attacks-pytorch

PyTorch implementation of adversarial attacks [torchattacks]

Python · 1,865 stars · 349 forks · Updated Jun 29, 2024

SinclairCoder / Instruction-Tuning-Papers

Reading list of Instruction-tuning. A trend starts from Natural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).

750 stars · 24 forks · Updated Jul 20, 2023

mlfoundations / open_flamingo

An open-source framework for training large multimodal models.

Python · 3,690 stars · 280 forks · Updated Aug 31, 2024

Luodian / Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Python · 3,560 stars · 241 forks · Updated Mar 5, 2024

zjr2000 / Awesome-Multimodal-Chatbot

Awesome Multimodal Assistant is a curated list of multimodal chatbots/conversational assistants that utilize various modes of interaction, such as text, speech, images, and videos, to provide a sea…

64 stars · 6 forks · Updated Jun 18, 2023

Zhendong-Wang / Prompt-Diffusion

Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models"

Python · 374 stars · 9 forks · Updated Mar 25, 2024

OpenGVLab / InternImage

[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

Python · 2,494 stars · 232 forks · Updated Aug 1, 2024

amirbar / visual_prompting

Official implementation and data release of the paper "Visual Prompting via Image Inpainting".

Jupyter Notebook · 298 stars · 20 forks · Updated Aug 7, 2023

facebookresearch / mae

PyTorch implementation of MAE https://arxiv.org/abs/2111.06377

Python · 7,219 stars · 1,206 forks · Updated Jul 23, 2024

KMnP / vpt

❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119

Python · 1,007 stars · 91 forks · Updated Sep 2, 2023

jiawen-zhu / ViPT

[CVPR23] Visual Prompt Multi-Modal Tracking

Python · 250 stars · 18 forks · Updated Jul 28, 2023

dair-ai / Prompt-Engineering-Guide

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

MDX · 49,228 stars · 4,779 forks · Updated Sep 19, 2024

ChengtaiCao / Awesome-Mix

A curated list of awesome Mix

62 stars · 1 fork · Updated Dec 23, 2022

salesforce / LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook · 9,747 stars · 954 forks · Updated Aug 23, 2024

hendrycks / imagenet-r

ImageNet-R(endition) and DeepAugment (ICCV 2021)

Python · 250 stars · 17 forks · Updated Jul 23, 2021

alibaba / easyrobust

EasyRobust: an Easy-to-use library for state-of-the-art Robust Computer Vision Research with PyTorch.

Jupyter Notebook · 321 stars · 37 forks · Updated Jun 30, 2024

Starred topics

yelp