The successful integration of Qwen2-VL-Instruct into the ComfyUI platform has enabled a smooth operation, supporting (but not limited to) text-based queries, video queries, single-image queries, an…

Python 55 6 Updated Sep 26, 2024

sayakpaul / simple-image-recaptioning

Recaption large (Web)Datasets with vllm and save the artifacts.

Python 29 2 Updated Sep 24, 2024

zhangfaen / finetune-Qwen2-VL

Python 112 16 Updated Sep 26, 2024

wjbmattingly / qwen2-vl-finetune-huggingface

This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.

Python 38 8 Updated Sep 18, 2024

hijkzzz / Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

3,950 214 Updated Oct 1, 2024

AIDC-AI / Ovis

A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.

Python 347 18 Updated Sep 19, 2024

YingqingHe / Awesome-LLMs-meet-Multimodal-Generation

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

HTML 311 16 Updated Sep 25, 2024

gpu-mode / lectures

Material for gpu-mode lectures

Jupyter Notebook 2,599 260 Updated Oct 1, 2024

gpu-mode / awesomeMLSys

An ML Systems Onboarding list

516 20 Updated Jul 23, 2024

gpu-mode / resource-stream

GPU programming related news and material links

1,140 69 Updated Sep 23, 2024

linkedin / Liger-Kernel

Efficient Triton Kernels for LLM Training

Python 3,103 159 Updated Sep 30, 2024

microsoft / only_train_once

OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM

Python 19 5 Updated Sep 13, 2024

jxzhangjhu / llm-uncertainty

Forked from MiaoXiong2320/llm-uncertainty

code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs"

Python 1 Updated Mar 14, 2024

jxzhangjhu / Say-I-Dont-Know

Forked from OpenMOSS/Say-I-Dont-Know

[ICML'2024] Can AI Assistants Know What They Don't Know?

Python 2 Updated Feb 5, 2024

Cinnamon / kotaemon

An open-source RAG-based tool for chatting with your documents.

Python 13,094 978 Updated Oct 1, 2024

JUSTSUJAY / nlp-zero-to-hero

NLP Zero to Hero in just 10 Kernels

Jupyter Notebook 476 61 Updated Sep 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

songkq

Achievements

Achievements

Block or report songkq

Stars

THUDM / CogVideo

math-eval / MathEval

LLaVA-VL / LLaVA-NeXT

friedrichor / Awesome-Multimodal-Papers

guoyww / AnimateDiff

continue-revolution / sd-webui-animatediff

aigc-apps / EasyAnimate

Yuan-ManX / ai-multimodal-timeline

baaivision / Emu3

GAIR-NLP / ProX

raphael-baena / DTLR

yzhang2016 / video-generation-survey

NJU-PCALab / OpenVid-1M

wjn1996 / Awesome-LLM-Reasoning-Openai-o1-Survey

IuvenisSapiens / ComfyUI_Qwen2-VL-Instruct