Skip to content
View ellenzhuwang's full-sized avatar
🏠
Working from home
🏠
Working from home
  • UIC
  • Chicago
Block or Report

Block or report ellenzhuwang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Jupyter Notebook 1 Updated Jun 12, 2024
Python 398 20 Updated Jun 17, 2024

An open source implementation of CLIP.

Jupyter Notebook 8,989 894 Updated Jun 22, 2024

Multi-Modal learning toolkit based on PaddlePaddle and PyTorch, supporting multiple applications such as multi-modal classification, cross-modal retrieval and image caption.

Python 558 99 Updated May 7, 2023

[CVPR 2022 - Demo Track] - Effective conditioned and composed image retrieval combining CLIP-based features

SCSS 70 8 Updated Mar 28, 2023

Generating captions on image datasets using MiniGPT-v2

Python 4 Updated Dec 23, 2023

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,103 2,897 Updated Apr 22, 2024

Run any open-source LLMs, such as Llama 2, Mistral, as OpenAI compatible API endpoint in the cloud.

Python 9,168 581 Updated Jun 17, 2024

Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案,结构参考alpaca

C 4,140 427 Updated Mar 7, 2024

An end-to-end vision and language model incorporating explicit knowledge graphs and OOD-detection.

Python 3 Updated May 3, 2024

QuIP quantization

Python 34 3 Updated Mar 17, 2024

😎 up-to-date & curated list of awesome LMM hallucinations papers, methods & resources.

129 11 Updated Mar 23, 2024

Finetuning Large Language Models on One Consumer GPU in Under 4 Bits

Python 682 73 Updated May 25, 2024

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Python 1,771 143 Updated Mar 27, 2024

简单易懂的LLaMA微调指南。

Python 306 33 Updated Jul 5, 2023

Code for a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models

Python 43 3 Updated Jun 17, 2024

✨✨Latest Advances on Multimodal Large Language Models

10,167 673 Updated Jun 22, 2024

Open-source and strong foundation image recognition models.

Jupyter Notebook 2,542 240 Updated Jun 12, 2024

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,059 275 Updated May 4, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 17,635 1,893 Updated May 28, 2024

Fast Segment Anything

Python 7,053 658 Updated Feb 29, 2024

The official homepage of the COCO-Stuff dataset.

Shell 817 145 Updated Sep 9, 2022

[ICLR 24] MaGIC: Multi-modality Guided Image Completion

Python 40 3 Updated Apr 24, 2024

ai副业赚钱大集合,教你如何利用ai做一些副业项目,赚取更多额外收益。The Ultimate Guide to Making Money with AI Side Hustles: Learn how to leverage AI for some cool side gigs and rake in some extra cash. Check out the English versi…

10,918 1,001 Updated Jun 19, 2024

[ACM MM23] CLIP-Count: Towards Text-Guided Zero-Shot Object Counting

Python 73 6 Updated Mar 20, 2024

[ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"

Python 606 37 Updated Jan 22, 2024

This method uses Segment Anything and CLIP to ground and count any object that matches a custom text prompt, without requiring any point or box annotation.

Python 119 14 Updated Apr 22, 2023

Segment Anything Labelling Tool

Python 996 127 Updated Feb 19, 2024

[NeurIPS2023] Code release for "Hierarchical Open-vocabulary Universal Image Segmentation"

Jupyter Notebook 246 18 Updated Mar 21, 2024
Next