View dddraxxx's full-sized avatar

Starred repositories

Helper functions to create COCO datasets

Python 769 179 Updated Jun 20, 2024

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 11,195 1,459 Updated Feb 29, 2024

Image Prompter for Gradio

JavaScript 57 9 Updated Dec 14, 2023

Cornell NLVR and NLVR2 are natural language grounding datasets. Each example shows a visual input and a sentence describing it, and is annotated with the truth-value of the sentence.

HTML 252 59 Updated Aug 18, 2022

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 3,453 280 Updated Jul 22, 2024

OMG-LLaVA and OMG-Seg codebase

Python 1,165 45 Updated Jul 23, 2024

Evaluation code for Ref-L4, a new REC benchmark in the LMM era

Python 8 Updated Jul 11, 2024

[CVPR2023] The code for "Position-guided Text Prompt for Vision-Language Pre-training"

Python 147 4 Updated Jun 7, 2023

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 8,104 573 Updated Jul 26, 2024

Open-source evaluation toolkit for large vision-language models (LVLMs), supporting ~100 VLMs and 30+ benchmarks

Python 764 87 Updated Jul 28, 2024
Python 78 11 Updated Jul 4, 2024
Python 257 7 Updated Jan 27, 2024

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Python 2,294 140 Updated Jul 25, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. A commercially usable open-source multimodal chat model approaching GPT-4o performance

Python 4,425 341 Updated Jul 27, 2024

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

Python 3,405 357 Updated Jul 26, 2024

[ICCV2023] VLPart: Going Denser with Open-Vocabulary Part Segmentation

Python 338 16 Updated Sep 19, 2023

Emu Series: Generative Multimodal Models from BAAI

Python 1,576 81 Updated Mar 8, 2024

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 3,475 326 Updated Jun 16, 2024

This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models"

Python 129 1 Updated Apr 17, 2024

We write your reusable computer vision tools. 💜

Python 18,159 1,397 Updated Jul 27, 2024

A Jupyter widget for annotating images with bounding boxes

Python 103 18 Updated May 31, 2023

The Amazon S3 Connector for PyTorch delivers high throughput for PyTorch training jobs that access and store data in Amazon S3.

Python 103 12 Updated Jul 1, 2024

PyTorch re-implementation of DeepLab v2 on COCO-Stuff / PASCAL VOC datasets

Python 1,090 280 Updated Oct 14, 2022

An official PyTorch implementation of the CRIS paper

Python 237 36 Updated Jun 9, 2024

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 3,538 318 Updated Jul 28, 2024

(CVPR2024) A benchmark for evaluating Multimodal LLMs using multiple-choice questions.

Python 284 8 Updated Jul 11, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 4,434 334 Updated May 28, 2024

⭐ Vim for Visual Studio Code

TypeScript 13,582 1,297 Updated Jul 26, 2024

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,109 277 Updated May 4, 2024