Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 14,326 1,322 Updated Jul 16, 2024

pinellolab / DNA-Diffusion

🧬 Generative modeling of regulatory DNA sequences with diffusion probabilistic models 💨

Jupyter Notebook 351 49 Updated Jul 25, 2024

KidsWithTokens / MedSegDiff

Medical Image Segmentation with Diffusion Model

Python 982 147 Updated May 24, 2024

crowsonkb / v-diffusion-pytorch

v objective diffusion inference code for PyTorch.

Python 708 109 Updated Nov 29, 2022

YangLing0818 / RPG-DiffusionMaster

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)

Jupyter Notebook 1,614 90 Updated Jun 6, 2024

NiteshMethani / PlotQA

Dataset introduced in PlotQA: Reasoning over Scientific Plots

64 7 Updated Jun 20, 2023

opendatalab / HA-DPO

Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization

Python 50 3 Updated Jan 30, 2024

MILVLG / openvqa

A lightweight, scalable, and general framework for visual question answering research

Python 314 64 Updated Sep 3, 2021

richard-peng-xia / awesome-multimodal-in-medical-imaging

A collection of resources on applications of multi-modal learning in medical imaging.

411 43 Updated Jul 18, 2024

michelecafagna26 / cider

Pythonic wrappers for Cider/CiderD evaluation metrics. Provides CIDEr as well as CIDEr-D (CIDEr Defended) which is more robust to gaming effects. We also add the possibility to replace the original…

Python 7 Updated Nov 6, 2023

huggingface / data-is-better-together

Let's build better datasets, together!

Jupyter Notebook 186 28 Updated Jul 24, 2024

huggingface / autotrain-advanced

🤗 AutoTrain Advanced

Python 3,677 446 Updated Jul 28, 2024

mathvision-cuhk / MATH-V

MATH-Vision dataset and code to measure Multimodal Mathematical Reasoning capabilities.

Python 35 4 Updated Jul 19, 2024

stanford-oval / storm

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 8,611 804 Updated Jul 29, 2024

huggingface / lighteval

LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.

Python 497 60 Updated Jul 29, 2024

NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 11,100 2,315 Updated Jul 29, 2024

Farama-Foundation / chatarena

ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.

Python 1,305 127 Updated May 27, 2024

microsoft / tutel

Tutel MoE: An Optimized Mixture-of-Experts Implementation

Python 690 85 Updated Jul 25, 2024

PKU-YuanGroup / MoE-LLaVA

Mixture-of-Experts for Large Vision-Language Models

Python 1,863 114 Updated May 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DokyoonYoon leeloolee

Achievements

Achievements

Highlights

Block or report leeloolee

Stars

yuhangzang / ContextDET

apple / ml-entity-deduction-arena

microsoft / UFO

ServiceNow / WorkArena

google-research / scenic

BAAI-DCAI / Visual-Instruction-Tuning

google / imageinwords

peggy1502 / Amazing-Resources

mlc-ai / mlc-llm

AlexGraikos / diffusion_priors

Fayeben / GenerativeDiffusionPrior

IDEA-Research / Grounded-Segment-Anything