Skip to content
View hanoonaR's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@mbzuai-oryx

Block or report hanoonaR

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results

[MICCAI 2024] Official code repository of paper titled "BAPLe: Backdoor Attacks on Medical Foundation Models using Prompt Learning" accepted in MICCAI 2024 conference.

Python 43 Updated Aug 23, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 9,874 724 Updated Aug 21, 2024

Official implementation of paper titled "GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model"

Python 54 3 Updated Jul 19, 2024

Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding

Python 176 11 Updated Aug 11, 2024
Python 2,076 138 Updated Aug 23, 2024

CoreNet: A library for training deep neural networks

Python 6,894 536 Updated May 28, 2024

MobiLlama : Small Language Model tailored for edge devices

Python 577 42 Updated Mar 3, 2024

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Python 782 56 Updated Jul 10, 2024

Efficient Video Object Segmentation via Modulated Cross-Attention Memory

45 2 Updated Mar 28, 2024

[ECCV 2024] The official code of paper "Open-Vocabulary SAM".

Python 883 27 Updated Jul 31, 2024

PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models

Python 233 11 Updated Jan 2, 2024

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python 726 37 Updated Jun 2, 2024

[MICCAI 2023][Early Accept] Official code repository of paper titled "Cross-modulated Few-shot Image Generation for Colorectal Tissue Classification"

Python 44 Updated Sep 28, 2023

How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges

30 1 Updated Sep 24, 2023

[MICCAI 2023] Official code repository of paper titled "Frequency Domain Adversarial Training for Robust Volumetric Medical Segmentation" accepted in MICCAI 2023 conference.

Python 47 Updated Nov 14, 2023

[BIONLP@ACL 2024] XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models.

Python 454 52 Updated Aug 8, 2024

[EMNLP'23] ClimateGPT: a specialized LLM for conversations related to Climate Change and Sustainability topics in both English and Arabic languages.

Python 73 9 Updated Jan 30, 2024

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted fo…

Python 1,130 96 Updated Jun 16, 2024

Multi-modality pre-training

Python 465 36 Updated May 8, 2024

[ICCV'23] Official repository of paper SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications

Python 238 25 Updated Jan 12, 2024

[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-…

Jupyter Notebook 764 104 Updated Aug 24, 2023

PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space

Python 3,047 890 Updated Aug 26, 2022

Implementation for ECCV 2022 paper Language-Grounded Indoor 3D Semantic Segmentation in the Wild

Python 97 14 Updated Nov 3, 2022

[CVPR 2022] DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting

Python 505 37 Updated Sep 15, 2023

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

Python 1,647 189 Updated May 20, 2024

[NeurIPS 2022] Official repository of paper titled "Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection".

Jupyter Notebook 285 18 Updated Oct 12, 2022

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python 7,664 668 Updated Jan 14, 2024

This repo includes ChatGPT prompt curation to use ChatGPT better.

HTML 108,369 14,851 Updated Aug 16, 2024
Next