Stars
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Effortless data labeling with AI support from Segment Anything and other awesome models.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
[ECCV 2024] Code for "Unleashing the Power of Prompt-driven Nucleus Instance Segmentation"
Official repository of Benchmarking Self-Supervised Learning on Diverse Pathology Datasets
The official code for the ICCV 2021 oral presentation "Rethinking Counting and Localization in Crowds: A Purely Point-Based Framework"
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Medical SAM 2: Segment Medical Images As Video Via Segment Anything Model 2
Many studies have shown that the performance of deep learning is significantly affected by the volume of training data. The MedicalNet project provides a series of 3D-ResNet pre-trained models and rela…
doreamon-design / clash
Forked from fossabot/clash. A rule-based tunnel in Go.
Transformer: PyTorch Implementation of "Attention Is All You Need"
An annotated implementation of the Transformer paper.
A generative speech model for daily dialogue.
OpenAI-style API for open large language models; use LLMs just like ChatGPT! Supports LLaMA, LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, Xverse, SqlCoder, CodeLLaMA, ChatGLM, ChatGLM2, ChatGLM3 etc.…
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
ICAFusion: Iterative Cross-Attention Guided Feature Fusion for Multispectral Object Detection, Pattern Recognition
A collection of deep learning based RGB-T-Fusion methods, codes, and datasets. The main directions involved are Multispectral Pedestrian Detection, RGB-T Aerial Object Detection, RGB-T Semantic Seg…
Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications built on Langchain and language models such as ChatGLM, Qwen, and Llama | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
All deep learning-based infrared and visible image fusion algorithms in a whole framework
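Chinese-language repository for Llama3 and Llama3.1 (accompanying book in progress... interesting fine-tuned and modified weights from community members and vendors, plus training, inference, evaluation, and deployment tutorial videos and docs)
"Open-Source LLM Usage Guide": quickly deploy open-source large models in a Linux environment; a deployment tutorial better suited to Chinese beginners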
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
Drone-based RGB-Infrared Cross-Modality Vehicle Detection via Uncertainty-Aware Learning
Labeling extension for Automatic1111's Web UI