Stars
Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Tr…
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
LlamaIndex is a data framework for your LLM applications
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
An open source implementation of CLIP.
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
Reinforcement Learning algorithms and use-cases, including DQN, PG, A3C, PPO etc. and RLHF, AlphaZero implementations. Designed for clarity, ease of use, and educational purposes.
欢迎来到 LLM-Dojo,这里是一个开源大模型学习场所,使用简洁且易阅读的代码构建模型训练框架(支持各种主流模型如Qwen、Llama、GLM等等)、RLHF框架(DPO/CPO/KTO/PPO)等各种功能。👩🎓👨🎓
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
从0到1构建一个MiniLLM (pretrain+sft+dpo实践中)
This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.
Well documented, unit tested, type checked and formatted implementation of a vanilla transformer - for educational purposes.
[CVPR 2024 Oral] MemSAM: Taming Segment Anything Model for Echocardiography Video Segmentation.
llama3 implementation one matrix multiplication at a time
The simplest, fastest repository for training/finetuning medium-sized GPTs.
This is the official repository for our recent work: PIDNet
A personal list of papers and resources of image matching and pose estimation, including perspective images and panoramas.
Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.
Effortless AI-assisted data labeling with AI support from YOLO, Segment Anything (SAM+SAM2), MobileSAM!!
ALIKE: Accurate and Lightweight Keypoint Detection and Descriptor Extraction