Block or Report
Block or report lim142857
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks"
Data and Code for Program of Thoughts (TMLR 2023)
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models
[ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction
Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers"
Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]
Generative Models by Stability AI
[ICCV 2023] Simple Baselines for Interactive Video Retrieval with Questions and Answers
[ACM TOMM 2023] - Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features
An open source implementation of CLIP.
Fashion 200K dataset used in paper "Automatic Spatially-aware Fashion Concept Discovery."
[ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion
[EMNLP'21] Visual News: Benchmark and Challenges in News Image Captioning
Official repository of ICCV 2021 - Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models
[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".
Demo code for CVPR2023 paper "Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient Vision Transformers"
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning
Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Integrating ChatGPT into your browser deeply, everything you need is here
Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复
Vision-Language Pre-training for Image Captioning and Question Answering