- Abu Dhabi, UAE
- https://www.muhammadmaaz.com
Block or Report
Block or report mmaaz60
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLists (1)
Sort Name ascending (A-Z)
Stars
Language
Sort by: Recently starred
Hate-CLIPper: Multimodal Hateful Meme Classification with Explicit Cross-modal Interaction of CLIP features - Accepted at EMNLP 2022 Workshop
[IEEE TMI-2024] UNETR++: Delving into Efficient and Accurate 3D Medical Image Segmentation
[CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".
ML model trained on data from Bayut.com to predict housing prices in Dubai
🔮 Instill Core is a full-stack AI infrastructure tool for data, model and pipeline orchestration, designed to streamline every aspect of building versatile AI-first applications
[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".
Official repository of paper titled "D3Former: Debiased Dual Distilled Transformer for Incremental Learning".
MDPI Journal: Remote Sensing Track 2023
A list of 3D computer vision papers with Transformers
Source code for MICCAI 2022 paper entitled: 'Self-Ensembling Vision Transformer (SEViT) for Robust Medical Image Classification'
(BMVC 2022--Oral) Official repository for "Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations"
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
CVNets: A library for training computer vision networks
Collect some papers about transformer for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV)
[NeurIPS 2022] Official repository of paper titled "Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection".
A curated list of prompt-based paper in computer vision and vision-language learning.
[CADL'22, ECCVW] Official repository of paper titled "EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications".
Official PyTorch implementation of the paper: "Solving ImageNet: a Unified Scheme for Training any Backbone to Top Results" (2022)
The repository contains the code for Object Detection in Aerial Images (iSAID dataset) using Faster RCNN and scale-aware data augmentation (SA-AutoAug).
[ECCV 2022] Source code of "EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers"
[ECCV'22] Official repository of paper titled "Class-agnostic Object Detection with Multi-modal Transformer".
A collection of resources on applications of Transformers in Medical Imaging.
naufil601 / darknet
Forked from AlexeyAB/darknetWindows and Linux version of Darknet Yolo v3 & v2 Neural Networks for object detection (Tensor Cores are used)
Scale-aware Automatic Augmentation for Object Detection (CVPR 2021)
Official repository for "Intriguing Properties of Vision Transformers" (NeurIPS 2021--Spotlight)