[ECCV2024] PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects

Python 19 Updated Sep 17, 2024

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 2,035 85 Updated Aug 6, 2024

LaVIT: Empower the Large Language Model to Understand and Generate Visual Content

Jupyter Notebook 504 26 Updated Jul 1, 2024

This repo contains the code for our paper An Image is Worth 32 Tokens for Reconstruction and Generation

Jupyter Notebook 399 16 Updated Sep 25, 2024

The open-source tool for building high-quality datasets and computer vision models

Python 8,146 543 Updated Sep 27, 2024

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,773 108 Updated Jul 29, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,214 48 Updated Aug 15, 2024
Python 60 Updated Jul 26, 2024

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 10,031 996 Updated Sep 27, 2024

A quick guide (especially) for trending instruction finetuning datasets

2,469 158 Updated Nov 28, 2023

A framework for few-shot evaluation of language models.

Python 6,547 1,734 Updated Sep 26, 2024

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

C++ 69,608 7,616 Updated Sep 27, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,688 451 Updated May 3, 2024

Use PEFT or Full-parameter to finetune 350+ LLMs or 90+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vi…

Python 3,607 311 Updated Sep 27, 2024

Awesome-LLM: a curated list of Large Language Models

17,616 1,432 Updated Sep 23, 2024

📋 A list of open LLMs available for commercial use.

10,963 699 Updated Jul 5, 2024

The official Meta Llama 3 GitHub site

Python 26,353 2,968 Updated Aug 12, 2024

DataComp: In search of the next generation of multimodal datasets

Python 641 54 Updated Jan 2, 2024

[CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"

Python 162 5 Updated Jun 9, 2024

A PyTorch implementation of MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis

Python 507 26 Updated Mar 10, 2023

[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

Python 543 57 Updated Jun 7, 2024

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 132,676 26,435 Updated Sep 27, 2024

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly …

Python 4,030 301 Updated Jul 16, 2024

[CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale

Python 1,037 82 Updated Aug 8, 2024

[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection

Python 130 6 Updated Mar 25, 2024
Python 7,092 549 Updated Aug 12, 2024

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 6,056 535 Updated May 31, 2024

calflops is designed to calculate FLOPs, MACs, and parameters for a wide range of neural networks, such as Linear, CNN, RNN, GCN, and Transformer models (BERT, LLaMA, and other large language models)

Python 504 16 Updated Jun 27, 2024

FRP Fork

Go 124 18 Updated Aug 30, 2024