Skip to content
View leeloolee's full-sized avatar
😶
😶

Highlights

  • Pro
Block or Report

Block or report leeloolee

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Contextual Object Detection with Multimodal Large Language Models

171 4 Updated May 30, 2023

A UI-Focused Agent for Windows OS Interaction.

Python 7,192 871 Updated Jul 25, 2024

WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?

Python 94 6 Updated Jul 17, 2024

Scenic: A Jax Library for Computer Vision Research and Beyond

Python 3,170 421 Updated Jul 25, 2024

SVIT: Scaling up Visual Instruction Tuning

Python 154 4 Updated Jun 20, 2024

Data release for the ImageInWords (IIW) paper.

JavaScript 186 7 Updated May 25, 2024

List of references and online resources related to data science, machine learning and deep learning.

136 41 Updated Jul 27, 2024

Universal LLM Deployment Engine with ML Compilation

Python 17,950 1,426 Updated Jul 29, 2024

Using pre-trained Diffusion models as priors for inference tasks

Jupyter Notebook 184 13 Updated Feb 9, 2023

Generative Diffusion Prior for Unified Image Restoration and Enhancement (CVPR2023)

Shell 249 28 Updated Jul 18, 2023

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 14,326 1,322 Updated Jul 16, 2024

🧬 Generative modeling of regulatory DNA sequences with diffusion probabilistic models 💨

Jupyter Notebook 351 49 Updated Jul 25, 2024

Medical Image Segmentation with Diffusion Model

Python 982 147 Updated May 24, 2024

v objective diffusion inference code for PyTorch.

Python 708 109 Updated Nov 29, 2022

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)

Jupyter Notebook 1,614 90 Updated Jun 6, 2024

Dataset introduced in PlotQA: Reasoning over Scientific Plots

64 7 Updated Jun 20, 2023

Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization

Python 50 3 Updated Jan 30, 2024

A lightweight, scalable, and general framework for visual question answering research

Python 314 64 Updated Sep 3, 2021

A collection of resources on applications of multi-modal learning in medical imaging.

411 43 Updated Jul 18, 2024

Pythonic wrappers for Cider/CiderD evaluation metrics. Provides CIDEr as well as CIDEr-D (CIDEr Defended) which is more robust to gaming effects. We also add the possibility to replace the original…

Python 7 Updated Nov 6, 2023

Let's build better datasets, together!

Jupyter Notebook 186 28 Updated Jul 24, 2024

🤗 AutoTrain Advanced

Python 3,677 446 Updated Jul 28, 2024

MATH-Vision dataset and code to measure Multimodal Mathematical Reasoning capabilities.

Python 35 4 Updated Jul 19, 2024

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 8,611 804 Updated Jul 29, 2024

LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.

Python 497 60 Updated Jul 29, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 11,100 2,315 Updated Jul 29, 2024

ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.

Python 1,305 127 Updated May 27, 2024

Tutel MoE: An Optimized Mixture-of-Experts Implementation

Python 690 85 Updated Jul 25, 2024

Mixture-of-Experts for Large Vision-Language Models

Python 1,863 114 Updated May 15, 2024
Next