Starred repositories
A curated list of resources for using LLMs to develop more competitive grant applications.
[NeurIPS 2022 Spotlight] A Unified Model for Multi-class Anomaly Detection
[CVPR 2024] Official PyTorch Code of SeaBird: Segmentation in Bird's View with Dice Loss Improves Monocular 3D Detection of Large Objects
✨✨Latest Advances on Multimodal Large Language Models
Documentation for Google's Gen AI site - including the Gemini API and Gemma
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
LPIPS metric. pip install lpips
OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]
Simple image captioning model
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Notebooks using the Hugging Face libraries 🤗
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Unofficial implementation of MVSS-Net (ICCV 2021) with Pytorch including training code.
Lossless Training Speed Up by Unbiased Dynamic Data Pruning
LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)
《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases
[ICCV 2023] BlendFace: Re-designing Identity Encoders for Face-Swapping https://arxiv.org/abs/2307.10854
[CVPR 2024🔥] EditGuard: Versatile Image Watermarking for Tamper Localization and Copyright Protection
Hierarchical Fine-Grained Image Forgery Detection and Localization (CVPR2023 and IJCV2024)
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)