
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Python 4,223 366 Updated Apr 9, 2024

A curated list of publications and resources on open-vocabulary semantic segmentation and related areas (e.g. zero-shot semantic segmentation).

317 14 Updated Jul 20, 2024

Project page for "OmniCount: Multi-label Object Counting with Semantic-Geometric Priors"

HTML 2 Updated Apr 4, 2024

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Python 24,188 4,993 Updated Jul 21, 2024

Synthesize, Rank, Count; A Method for unsupervised crowd counting using latent diffusion models

Python 3 Updated Oct 4, 2023

A collection of diffusion model papers categorized by their subareas

967 42 Updated Jul 19, 2024

🧙🏻‍♂️A list of papers curated for you to dive into the Awesome Radiance Field-based 3D Editing.

349 12 Updated Jul 18, 2024

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Python 6,495 500 Updated Jul 17, 2024

[CVPR 2024] An End-to-End Tile-Based Framework for High-Resolution Monocular Metric Depth Estimation

Python 927 62 Updated Jul 1, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 12,730 1,027 Updated Jun 27, 2024

Reading list for Multimodal Large Language Models

59 7 Updated Aug 17, 2023

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,174 2,899 Updated Apr 22, 2024

Video datasets

1,043 88 Updated Mar 8, 2023

CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks

Jupyter Notebook 325 21 Updated Apr 29, 2023

"Automatically Discovering and Learning New Visual Categories with Ranking Statistics" by Kai Han, Sylvestre-Alvise Rebuffi, Sebastien Ehrhardt, Andrea Vedaldi, Andrew Zisserman (ICLR 2020)

Python 220 20 Updated Feb 13, 2020

A list of papers that study Novel Class Discovery

405 53 Updated Jul 18, 2024

Includes FSC-147-D and the code for training and testing the CounTX model from the paper Open-world Text-specified Object Counting.

Jupyter Notebook 32 3 Updated Jul 8, 2024

[ACM MM23] CLIP-Count: Towards Text-Guided Zero-Shot Object Counting

Python 75 6 Updated Mar 20, 2024

[CVPR 2023] CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model

Jupyter Notebook 68 5 Updated Jul 28, 2023

This repo lists relevant papers summarized in our survey paper: A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models.

314 21 Updated Jan 21, 2024

Inference code for Llama models

Python 54,308 9,332 Updated Jul 19, 2024

Awesome Crowd Counting

2,339 469 Updated Jul 16, 2024

Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23]

Python 19 Updated Oct 20, 2023

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Jupyter Notebook 14,249 1,313 Updated Jul 16, 2024

Code for the paper "End-to-End Video Object Detection with Spatial-Temporal Transformers"

Python 206 28 Updated Oct 12, 2023

A curated list of awesome temporal action segmentation resources.

133 10 Updated Apr 4, 2024

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 45,681 5,409 Updated Jun 24, 2024

A collection of CVPR 2024 papers and open-source projects

17,279 2,547 Updated Jul 4, 2024
Python 25 3 Updated Apr 15, 2024