Skip to content
View JosephKJ's full-sized avatar
🙇‍♂️
Work hard!
🙇‍♂️
Work hard!

Block or report JosephKJ

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 16,220 1,108 Updated Sep 30, 2024

[NeurIPS 2024] This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models"

Python 142 2 Updated Sep 26, 2024

✨✨Latest Advances on Multimodal Large Language Models

11,968 768 Updated Sep 25, 2024

Python Library to evaluate VLM models' robustness across diverse benchmarks

Jupyter Notebook 163 8 Updated Sep 26, 2024

Official release of InternLM2.5 base and chat models. 1M context support

Python 6,287 440 Updated Sep 6, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 11,125 944 Updated Sep 29, 2024

SEED-Story: Multimodal Long Story Generation with Large Language Model

Python 711 55 Updated Sep 29, 2024

next gen smart vlm reasoner

Python 5 Updated Jun 22, 2024
Python 210 15 Updated Apr 10, 2024

Continual Few-Shot Learning of New Actions With Prompt Tuning

1 Updated Jul 5, 2024

Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"

Python 844 52 Updated Mar 19, 2024

The open-source tool for building high-quality datasets and computer vision models

Python 8,149 544 Updated Sep 30, 2024

🔥 [CVPR 2024] The official repo for Zero-Painter!

Python 57 3 Updated Jun 8, 2024

[CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"

Python 483 25 Updated Jul 16, 2024

OpenCOLE: Towards Reproducible Automatic Graphic Design Generation [Inoue+, CVPRW2024 (GDUG)]

Python 41 4 Updated Sep 9, 2024

Recent LLM-based CV and related works. Welcome to comment/contribute!

830 35 Updated Jun 5, 2024

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 14,740 2,581 Updated Aug 20, 2024

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,188 279 Updated May 4, 2024

[CVPR24 Oral] Official repository for RALF: Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation

Python 93 1 Updated Jul 6, 2024
Python 437 26 Updated Jul 29, 2024

Go ahead and axolotl questions

Python 7,646 839 Updated Sep 30, 2024

Accepted as [NeurIPS 2024] Spotlight Presentation Paper

Jupyter Notebook 5,828 581 Updated Sep 26, 2024

Multimodal language model benchmark, featuring challenging examples

Python 145 6 Updated Aug 13, 2024

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Python 798 57 Updated Jul 10, 2024

Official Repo of Graphist

93 2 Updated Apr 23, 2024

A Framework of Continual Learning

Python 73 4 Updated Sep 8, 2024

[CVPR 2024] AMUSE: Emotional Speech-driven 3D Body Animation via Disentangled Latent Diffusion

Python 81 4 Updated Aug 28, 2024

Control Color: Multimodal Diffusion-based Interactive Image Colorization

103 2 Updated Feb 22, 2024
Python 8,331 485 Updated Jan 27, 2024
Next