dddraxxx

Follow

Drax dddraxxx

Follow

Dong, Qihua. Interested in discovering intelligence in M-LLM and building general AI!

13 followers · 12 following

Northeastern University, SmileLab
Boston
https://dddraxxx.github.io/
https://orcid.org/0000-0001-5125-7218

Achievements

Achievements

Highlights

Pro

Block or Report

Block or report dddraxxx

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Lists (9)

Sort

Awesome-series

🐱‍🏍Browser Extension

👨‍🏫 chat_ui

⌨️ Editor

Good Segmentor

🐱‍🏍GTW

🚀 My stack

Segmentation Dataset

smileLab

Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

waspinator / pycococreator

Helper functions to create COCO datasets

Python 769 179 Updated Jun 20, 2024

CompVis / latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 11,195 1,459 Updated Feb 29, 2024

ShiArthur03 / ShiArthur03

MATLAB 10,237 1,934 Updated Jul 16, 2024

PhyscalX / gradio-image-prompter

Image Prompter for Gradio

JavaScript 57 9 Updated Dec 14, 2023

lil-lab / nlvr

Cornell NLVR and NLVR2 are natural language grounding datasets. Each example shows a visual input and a sentence describing it, and is annotated with the truth-value of the sentence.

HTML 252 59 Updated Aug 18, 2022

InternLM / xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 3,453 280 Updated Jul 22, 2024

lxtGH / OMG-Seg

OMG-LLaVA and OMG-Seg codebase

Python 1,165 45 Updated Jul 23, 2024

JierunChen / Ref-L4

Evaluation code for Ref-L4, a new REC benchmark in the LMM era

Python 8 Updated Jul 11, 2024

sail-sg / ptp

[CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》

Python 147 4 Updated Jun 7, 2023

facebookresearch / xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 8,104 573 Updated Jul 26, 2024

open-compass / VLMEvalKit

Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 30+ benchmarks

Python 764 87 Updated Jul 28, 2024

scenarios / WeMM

Python 78 11 Updated Jul 4, 2024

tsb0601 / MMVP

Python 257 7 Updated Jan 27, 2024

InternLM / InternLM-XComposer

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Python 2,294 140 Updated Jul 25, 2024

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的可商用开源多模态对话模型

Python 4,425 341 Updated Jul 27, 2024

open-compass / opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 3,405 357 Updated Jul 26, 2024

facebookresearch / VLPart

[ICCV2023] VLPart: Going Denser with Open-Vocabulary Part Segmentation

Python 338 16 Updated Sep 19, 2023

baaivision / Emu

Emu Series: Generative Multimodal Models from BAAI

Python 1,576 81 Updated Mar 8, 2024

rom1504 / img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 3,475 326 Updated Jun 16, 2024

MMStar-Benchmark / MMStar

This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models"

Python 129 1 Updated Apr 17, 2024

roboflow / supervision

We write your reusable computer vision tools. 💜

Python 18,159 1,397 Updated Jul 27, 2024

gereleth / jupyter-bbox-widget

A Jupyter widget for annotating images with bounding boxes

Python 103 18 Updated May 31, 2023

awslabs / s3-connector-for-pytorch

The Amazon S3 Connector for PyTorch delivers high throughput for PyTorch training jobs that access and store data in Amazon S3.

Python 103 12 Updated Jul 1, 2024

kazuto1011 / deeplab-pytorch

PyTorch re-implementation of DeepLab v2 on COCO-Stuff / PASCAL VOC datasets

Python 1,090 280 Updated Oct 14, 2022

DerrickWang005 / CRIS.pytorch

An official PyTorch implementation of the CRIS paper

Python 237 36 Updated Jun 9, 2024

InternLM / lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 3,538 318 Updated Jul 28, 2024

AILab-CVC / SEED-Bench

(CVPR2024)A benchmark for evaluating Multimodal LLMs using multiple-choice questions.

Python 284 8 Updated Jul 11, 2024

QwenLM / Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 4,434 334 Updated May 28, 2024

VSCodeVim / Vim

⭐ Vim for Visual Studio Code

TypeScript 13,582 1,297 Updated Jul 26, 2024

dvlab-research / MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,109 277 Updated May 4, 2024

Starred topics

Chrome extension

Vim