Block or Report
Block or report richjjj
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLists (5)
Sort Name ascending (A-Z)
Stars
Language
Sort by: Recently starred
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
OC_SORT implemented in C++ with Eigen Library
An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.
This repository provides optical character detection and recognition solution optimized on Nvidia devices.
Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
Large Language Model Text Generation Inference
Official PyTorch implementation code for realizing the technical part of Mamba-based traversal of rationale (Meteor) to improve performance of numerous vision language performances for diverse capa…
Machine Learning Containers for NVIDIA Jetson and JetPack-L4T
User-friendly WebUI for LLMs (Formerly Ollama WebUI)
A fast, easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.
ms-swift: Use PEFT or Full-parameter to finetune 250+ LLMs or 35+ MLLMs. (Qwen2, GLM4, Internlm2, Yi, Llama3, Llava, Deepseek, Baichuan2...)
Multi-platform auto-proxy client, supporting Sing-box, X-ray, TUIC, Hysteria, Reality, Trojan, SSH etc. It’s an open-source, secure and ad-free.
A large number of cuda/tensorrt cases . 大量案例来学习cuda/tensorrt
A Toolkit to Help Optimize Large Onnx Model
🚀 TensorRT-YOLO: Supports YOLOv3, YOLOv5, YOLOv6, YOLOv7, YOLOv8, YOLOv9, YOLOv10, and PP-YOLOE using TensorRT acceleration with EfficientNMS, CUDA Kernels and CUDA Graphs!
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
Get up and running with Llama 3, Mistral, Gemma, and other large language models.
Unify Efficient Fine-Tuning of 100+ LLMs
[AAAI 2024] UCMCTrack: Multi-Object Tracking with Uniform Camera Motion Compensation. UCMCTrack achieves SOTA on MOT17 using estimated camera parameters.
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization and sparsity. It compresses deep learning models for downstream deployment frame…
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
[MIR 24] Fully 1x1 Convolutional Network for Lightweight Image Super-Resolution
A toolbox for deep learning model deployment using C++ YoloX | YoloV7 | YoloV8 | Gan | OCR | MobileVit | Scrfd | MobileSAM | StableDiffusion