Skip to content
View 13633491388's full-sized avatar

Block or report 13633491388

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer

Python 356 16 Updated Jul 5, 2024

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Python 2,427 150 Updated Aug 30, 2024

Restore a damaged (truncated) mp4, m4v, mov, 3gp video. Provided you have a similar not broken video.

C++ 1,606 225 Updated May 23, 2024

A Framework of Small-scale Large Multimodal Models

Python 549 53 Updated Aug 14, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 5,315 418 Updated Aug 20, 2024

【ICLR 2024, Spotlight】Sentence-level Prompts Benefit Composed Image Retrieval

Python 57 3 Updated Apr 16, 2024

Official implementation of SEED-LLaMA (ICLR 2024).

Python 555 31 Updated Apr 11, 2024

Generative Models by Stability AI

Python 23,937 2,664 Updated Aug 21, 2024
Jupyter Notebook 30 2 Updated Dec 20, 2023

PyTorch implementation of Spatial Transformer Network (STN) with Thin Plate Spline (TPS)

Python 922 154 Updated Jul 15, 2021

CLIP-Driven Fine-grained Text-Image Person Re-identification

Python 33 1 Updated Nov 22, 2023

[ECCV2022] A pytorch implementation for TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval

Python 72 9 Updated Nov 29, 2022

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Python 24,937 5,163 Updated Sep 1, 2024

开源社区第一个能下载、能运行的中文 LLaMA2 模型!

Python 2,220 202 Updated Oct 26, 2023

Inference code for Llama models

Python 55,310 9,421 Updated Aug 18, 2024

Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.

Python 6,281 1,279 Updated Aug 31, 2024
Python 63 4 Updated Jun 28, 2023

Bridging Vision and Language Model

Python 279 31 Updated Mar 27, 2023

FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model.

Python 3,817 415 Updated Apr 28, 2024

TaiSu(太素)--a large-scale Chinese multimodal dataset(亿级大规模中文视觉语言预训练数据集)

Python 171 11 Updated Nov 17, 2023
Python 155 23 Updated Nov 9, 2023

【AAAI 2024】An Empirical Study of CLIP for Text-based Person Search

Python 44 4 Updated Apr 15, 2024

Fast inference engine for Transformer models

C++ 3,185 281 Updated Aug 26, 2024

A latent text-to-image diffusion model

Jupyter Notebook 67,323 10,067 Updated Jun 18, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 9,570 938 Updated Aug 23, 2024

Neural Machine Translation (NMT) tutorial. Data preprocessing, model training, evaluation, and deployment.

Jupyter Notebook 147 26 Updated Apr 17, 2024

Open Source Neural Machine Translation and (Large) Language Models in PyTorch

Python 6,700 2,247 Updated Jun 27, 2024
Next