Highlights
- Pro
Block or Report
Block or report shuishui616
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
Simple image captioning model
Video+code lecture on building nanoGPT from scratch
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
Taming Transformers for High-Resolution Image Synthesis
Reading list for research topics in multimodal machine learning
【IJCAI 2023】RaSa: Relation and Sensitivity Aware Representation Learning for Text-based Person Search
CLIP-Driven Fine-grained Text-Image Person Re-identification
Master programming by recreating your favorite technologies from scratch.
A playbook for systematically maximizing the performance of deep learning models.
TOMM2020 Dual-Path Convolutional Image-Text Embedding 🐾 https://arxiv.org/abs/1711.05535
A PyTorch implementation of the Transformer model in "Attention is All You Need".
deep learning for image processing including classification and object-detection etc.
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
The official code of "Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark"
A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".
Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"
implementation of paper https://arxiv.org/abs/2210.04559
High-Resolution Image Synthesis with Latent Diffusion Models
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Materials for the Hugging Face Diffusion Models Course
Image to prompt with BLIP and CLIP
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
🧑🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gan…
Stable Diffusion web UI