Skip to content
View theFool32's full-sized avatar
😴
sleeping
😴
sleeping

Block or report theFool32

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM

632 25 Updated Aug 12, 2024

assistant tools for attention visualization in deep learning

Jupyter Notebook 942 76 Updated Jun 9, 2022

Cool Papers - Immersive Paper Discovery

HTML 334 4 Updated Aug 17, 2024

Firefox user.js for speed, privacy, and security. Your favorite browser, but better.

JavaScript 5,018 131 Updated Aug 24, 2024

Accepted by IJCAI-24 Survey Track

Python 99 2 Updated Aug 25, 2024

📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).

328 9 Updated Aug 20, 2024

Rime 配置:雾凇拼音 | 长期维护的简体词库

Lua 9 Updated Aug 25, 2024

YOLOv10: Real-Time End-to-End Object Detection

Python 9,002 815 Updated Aug 8, 2024

API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Python 688 21 Updated Aug 9, 2024

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 4,196 409 Updated Jul 30, 2024

A curated list of trustworthy Generative AI papers. Daily updating...

67 5 Updated Sep 14, 2023

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 11,130 785 Updated Aug 25, 2024

Paper list and datasets for industrial image anomaly/defect detection (updating). 工业异常/瑕疵检测论文及数据集检索库(持续更新)。

1,343 129 Updated Aug 19, 2024

[ECCV2024] This is an official implementation for "PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model"

Python 167 7 Updated Jun 26, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 21,418 2,053 Updated Aug 9, 2024

DUSt3R: Geometric 3D Vision Made Easy

Python 4,898 540 Updated Aug 10, 2024

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 11,170 995 Updated Aug 26, 2024

[CVPR2024] Official Pytorch Implementation of SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation.

Python 107 8 Updated May 30, 2024

[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥

Python 2,130 230 Updated Aug 27, 2024

Generative Representational Instruction Tuning

Jupyter Notebook 511 36 Updated Aug 23, 2024
Python 7,064 547 Updated Aug 12, 2024

✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models

416 44 Updated Jul 10, 2024
Python 107 6 Updated Jun 6, 2024

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

Python 1,332 64 Updated Mar 8, 2024

Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.

498 30 Updated Aug 25, 2024

A collection of visual instruction tuning datasets.

Python 73 3 Updated Mar 14, 2024

Recent LLM-based CV and related works. Welcome to comment/contribute!

817 35 Updated Jun 5, 2024

This repository is for the first comprehensive survey on Meta AI's Segment Anything Model (SAM).

745 49 Updated Aug 27, 2024
Next