Skip to content
View mondalanindya's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report mondalanindya

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 11,145 950 Updated Oct 1, 2024

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Python 4,327 388 Updated Aug 19, 2024

A curated publication list on open vocabulary semantic segmentation and related area (e.g. zero-shot semantic segmentation) resources..

399 16 Updated Sep 14, 2024

Project page for "OmniCount: Multi-label Object Counting with Semantic-Geometric Priors"

HTML 2 Updated Sep 3, 2024

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Python 25,360 5,251 Updated Oct 1, 2024

Synthesize, Rank, Count; A Method for unsupervised crowd counting using latent diffusion models

Python 3 Updated Oct 4, 2023

collection of diffusion model papers categorized by their subareas

1,157 57 Updated Sep 29, 2024

🧙🏻‍♂️A list of papers curated for you to dive into the Awesome Radiance Field-based 3D Editing.

394 12 Updated Sep 29, 2024

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Python 6,826 523 Updated Jul 17, 2024

[CVPR 2024] An End-to-End Tile-Based Framework for High-Resolution Monocular Metric Depth Estimation

Python 956 63 Updated Aug 16, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 13,577 1,105 Updated Sep 24, 2024

Reading list for Multimodal Large Language Models

64 7 Updated Aug 17, 2023

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,326 2,910 Updated Sep 2, 2024

Video datasets

1,144 91 Updated Mar 8, 2023

CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks

Jupyter Notebook 350 23 Updated Apr 29, 2023

"Automatically Discovering and Learning New Visual Categories with Ranking Statistics" by Kai Han, Sylvestre-Alvise Rebuffi, Sebastien Ehrhardt, Andrea Vedaldi, Andrew Zisserman (ICLR 2020)

Python 222 20 Updated Feb 13, 2020

A list of papers that studies Novel Class Discovery

429 55 Updated Sep 8, 2024

Includes FSC-147-D and the code for training and testing the CounTX model from the paper Open-world Text-specified Object Counting.

Jupyter Notebook 32 3 Updated Sep 27, 2024

[ACM MM23] CLIP-Count: Towards Text-Guided Zero-Shot Object Counting

Python 82 6 Updated Mar 20, 2024

[CVPR 2023] CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model

Jupyter Notebook 72 7 Updated Jul 28, 2023

This repo lists relevant papers summarized in our survey paper: A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models.

356 25 Updated Jan 21, 2024

Inference code for Llama models

Python 55,767 9,504 Updated Aug 18, 2024

Awesome Crowd Counting

2,377 473 Updated Aug 30, 2024

Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23]

Python 21 Updated Oct 20, 2023

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 14,866 1,377 Updated Sep 5, 2024

The repository is the code for the paper "End-to-End Video Object Detection with Spatial-TemporalTransformers"

Python 212 28 Updated Oct 12, 2023

A curated list of awesome temporal action segmentation resources.

149 12 Updated Apr 4, 2024

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 46,939 5,554 Updated Sep 18, 2024

CVPR 2024 论文和开源项目合集

17,845 2,571 Updated Jul 4, 2024
Next