🎯
Focusing
  • Dalian University of Technology
  • Dalian, Liaoning, China


A general fine-tuning kit geared toward diffusion models.

Python 1,749 158 Updated Nov 2, 2024

An open source implementation of CLIP.

Python 10,189 975 Updated Oct 30, 2024

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Python 4,487 462 Updated Aug 6, 2024

OpenMMLab Text Detection, Recognition and Understanding Toolbox

Python 4,336 749 Updated Jul 15, 2024

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, diffusion models, for text-to-image genera…

Jupyter Notebook 6,935 1,059 Updated Aug 6, 2024

OpenMMLab Pre-training Toolbox and Benchmark

Python 3,429 1,059 Updated Nov 1, 2024

Official inference repo for FLUX.1 models

Python 15,508 1,114 Updated Oct 8, 2024

Generative Models by Stability AI

Python 24,511 2,729 Updated Sep 4, 2024

21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Jupyter Notebook 64,634 32,918 Updated Oct 15, 2024

[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…

Python 4,200 310 Updated Oct 6, 2024

VMamba: Visual State Space Models; code is based on Mamba

Python 2,160 134 Updated Oct 28, 2024

A collection of resources and papers on Diffusion Models

HTML 11,016 944 Updated Aug 1, 2024

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Python 25,985 5,352 Updated Nov 2, 2024

Awesome-LLM: a curated list of Large Language Models

18,563 1,515 Updated Oct 29, 2024

LLM101n: Let's build a Storyteller

29,577 1,620 Updated Aug 1, 2024

Multimodal Models in Real World

Jupyter Notebook 396 16 Updated Oct 28, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.

Python 5,902 459 Updated Oct 29, 2024

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 12,440 871 Updated Oct 22, 2024

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 1,396 85 Updated Sep 23, 2024

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 2,064 87 Updated Aug 6, 2024

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 2,769 177 Updated Oct 31, 2024

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Python 11,506 1,023 Updated Nov 1, 2024

FastPillars: A Deployment-friendly Pillar-based 3D Detector

135 12 Updated Feb 7, 2024

ICLR2024: LiDAR-PTQ: Post-Training Quantization for Point Cloud 3D Object Detection.

68 Updated Sep 20, 2024

[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

Python 1,802 158 Updated Sep 28, 2024

Mamba SSM architecture

Python 13,061 1,111 Updated Oct 28, 2024

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 2,953 196 Updated Sep 19, 2024

State-of-the-art bilingual open-sourced Math reasoning LLMs.

Python 429 25 Updated Oct 22, 2024

Official Code for Stable Cascade

Jupyter Notebook 6,541 533 Updated Jul 25, 2024

Mixture-of-Experts for Large Vision-Language Models

Python 1,967 125 Updated May 15, 2024