mmaaz60

Follow

😀

Muhammad Maaz mmaaz60

😀

Follow

An Electrical Engineer with experience in Computer Vision software development. Skilled in Machine Learning, Deep Learning and Computer Vision.

119 followers · 4 following

Achievements

BetaSend feedback

Achievements

BetaSend feedback

Organizations

Block or Report

Block or report mmaaz60

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

mmaaz60/README.md

Hi there 👋

🔭 I’m currently working on multi-modal transformers and multi-task learning
🌱 I’m currently learning to play Table Tennis 🏓
📫 How to reach me: [email protected]

Pinned

mbzuai-oryx/Video-ChatGPT mbzuai-oryx/Video-ChatGPT Public

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted fo…

Python 1k 92
mbzuai-oryx/groundingLMM mbzuai-oryx/groundingLMM Public

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python 625 30
mbzuai-oryx/VideoGPT-plus mbzuai-oryx/VideoGPT-plus Public

Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding

Python 103 5
mbzuai-oryx/LLaVA-pp mbzuai-oryx/LLaVA-pp Public

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Python 718 48
mbzuai-oryx/PALO mbzuai-oryx/PALO Public

Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, Hindi, Bengali and Urdu.

Python 73 3
EdgeNeXt EdgeNeXt Public

[CADL'22, ECCVW] Official repository of paper titled "EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications".

Python 330 37