A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

C++ 1,102 138 Updated Jun 16, 2024

datawhalechina / tiny-universe

"A Guide to Building Large Models as a White Box": a fully hand-built Tiny-Universe

Python 433 53 Updated Jun 23, 2024

Postroggy / OC_SORT_CPP

OC_SORT implemented in C++ with Eigen Library

C++ 39 11 Updated Jun 19, 2024

matatonic / openedai-vision

An OpenAI API-compatible server for chatting with image input and asking questions about the images (i.e., multimodal).

Python 75 6 Updated Jun 19, 2024

NVIDIA-AI-IOT / NVIDIA-Optical-Character-Detection-and-Recognition-Solution

This repository provides an optical character detection and recognition solution optimized for NVIDIA devices.

C++ 49 3 Updated Jun 11, 2024

unslothai / unsloth

Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 11,849 763 Updated Jun 21, 2024

EricLBuehler / mistral.rs

Blazingly fast LLM inference.

Rust 2,724 204 Updated Jun 23, 2024

huggingface / text-generation-inference

Large Language Model Text Generation Inference

Python 8,278 932 Updated Jun 22, 2024

ByungKwanLee / Meteor

Official PyTorch implementation code for realizing the technical part of Mamba-based traversal of rationale (Meteor) to improve performance of numerous vision language performances for diverse capa…

Python 81 4 Updated May 30, 2024

dusty-nv / jetson-containers

Machine Learning Containers for NVIDIA Jetson and JetPack-L4T

Python 1,836 413 Updated Jun 23, 2024

open-webui / open-webui

User-friendly WebUI for LLMs (Formerly Ollama WebUI)

Svelte 28,709 3,085 Updated Jun 22, 2024

yulinghan / ImageQualityEnhancement

C++ 44 10 Updated May 8, 2024

roboflow / inference

A fast, easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.

Python 1,110 81 Updated Jun 21, 2024

modelscope / swift

ms-swift: Use PEFT or Full-parameter to finetune 250+ LLMs or 35+ MLLMs. (Qwen2, GLM4, Internlm2, Yi, Llama3, Llava, Deepseek, Baichuan2...)

Python 2,042 196 Updated Jun 23, 2024

hiddify / hiddify-next

Multi-platform auto-proxy client, supporting Sing-box, X-ray, TUIC, Hysteria, Reality, Trojan, SSH, etc. It is open-source, secure, and ad-free.

Dart 12,390 1,134 Updated Jun 10, 2024

jinmin527 / learning-cuda-trt

A large collection of CUDA/TensorRT examples for learning CUDA and TensorRT.

C++ 90 134 Updated Jul 24, 2022

tsingmicro-toolchain / OnnxSlim

A Toolkit to Help Optimize Large ONNX Models

Python 123 9 Updated May 16, 2024

laugh12321 / TensorRT-YOLO

🚀 TensorRT-YOLO: Supports YOLOv3, YOLOv5, YOLOv6, YOLOv7, YOLOv8, YOLOv9, YOLOv10, and PP-YOLOE using TensorRT acceleration with EfficientNMS, CUDA Kernels and CUDA Graphs!

Python 339 46 Updated Jun 22, 2024

langgenius / dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 34,861 4,680 Updated Jun 23, 2024

ollama / ollama

Get up and running with Llama 3, Mistral, Gemma, and other large language models.

Go 75,069 5,617 Updated Jun 23, 2024

hiyouga / LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Python 24,803 3,063 Updated Jun 22, 2024

corfyi / UCMCTrack

[AAAI 2024] UCMCTrack: Multi-Object Tracking with Uniform Camera Motion Compensation. UCMCTrack achieves SOTA on MOT17 using estimated camera parameters.

Python 232 18 Updated Mar 26, 2024

Efficient-Large-Model / VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Python 848 55 Updated Jun 21, 2024

NVIDIA / TensorRT-Model-Optimizer

TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization and sparsity. It compresses deep learning models for downstream deployment frame…

Python 268 14 Updated Jun 15, 2024