richjjj

rich richjjj

Tinkering around; ta-da!

16 followers · 22 following

Lists (5)

encoder

✨ Inspiration

15 repositories

magic internet

Data augmentation

Lane line detection

Stars

lobehub / lobe-chat

🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers (OpenAI / Claude 3 / Gemini / Ollama / Azure / DeepSeek), Knowledge Base (file upload / knowledge manageme…

TypeScript 40,019 9,101 Updated Sep 6, 2024

opendatalab / MinerU

A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.

Python 10,842 797 Updated Sep 6, 2024

QwenLM / Qwen2-VL

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 1,508 84 Updated Sep 6, 2024

NirDiamant / RAG_Techniques

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 5,538 495 Updated Sep 6, 2024

neufieldrobotics / NeuFlow_v2

Python 59 4 Updated Sep 2, 2024

Dicklesworthstone / llm_aided_ocr

Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.

Python 1,939 117 Updated Aug 21, 2024

netease-youdao / BCEmbedding

Netease Youdao's open-source embedding and reranker models for RAG products.

Python 1,346 90 Updated Sep 6, 2024

opendatalab / PDF-Extract-Kit

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 4,582 303 Updated Aug 28, 2024

VITA-MLLM / VITA

✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM

Python 690 29 Updated Sep 6, 2024

BubblyYi / MMPedestron

[ECCV2024] Official implementation of the paper "When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset"

Python 33 2 Updated Jul 16, 2024

hustvl / SparseTrack

Official PyTorch implementation of SparseTrack (the new version of code will come soon)

Python 131 12 Updated Dec 6, 2023

WongKinYiu / YOLO

An MIT rewrite of YOLOv9

Python 461 46 Updated Aug 23, 2024

LayTextLLM / LayTextLLM

Python 52 8 Updated Aug 27, 2024

Keylost / jetson-ffmpeg

Forked from jocover/jetson-ffmpeg

ffmpeg support on jetson nano

Makefile 63 24 Updated Jul 4, 2024

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 4,943 338 Updated Sep 6, 2024

om-ai-lab / OmDet

Real-time and accurate open-vocabulary end-to-end object detection

Python 1,483 143 Updated Sep 6, 2024

noamgat / lm-format-enforcer

Enforce the output format (JSON Schema, Regex etc) of a language model

Python 1,350 59 Updated Aug 21, 2024

outlines-dev / outlines

Structured Text Generation

Python 8,146 411 Updated Sep 4, 2024

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 19,481 2,481 Updated Aug 28, 2024

MCG-NJU / MeMOTR

[ICCV 2023] MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking

Python 140 8 Updated May 9, 2024

spacewalk01 / depth-anything-tensorrt

TensorRT implementation of Depth-Anything V1, V2

Python 262 31 Updated Jun 20, 2024

648540858 / wvp-GB28181-pro

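WEB VIDEO PLATFORM is a network video platform based on the GB28181-2016 standard. It supports NAT traversal and access for IPC, NVR, and DVR devices from brands such as Hikvision, Dahua, and Uniview. It supports GB (national standard) cascading, forwarding rtsp/rtmp video streams to a GB platform, and forwarding rtsp/rtmp push streams to a GB platform.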

Java 4,800 1,422 Updated Sep 6, 2024

Gmgge / ImageAnalysisService

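An image analysis web service built on lightweight models, including skew-corrected OCR, official seal (stamp) detection and recognition, and license plate recognition. The API uses FastAPI + Gunicorn, and a gradio demo is provided.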

Python 61 10 Updated Apr 30, 2024

DefTruth / CUDA-Learn-Notes

🎉CUDA/C++ notes / technical blog: fp32, fp16/bf16, fp8/int8, flash_attn, sgemm, sgemv, warp/block reduce, dot prod, elementwise, softmax, layernorm, rmsnorm, hist, etc.

Cuda 1,109 109 Updated Sep 4, 2024

FeiGeChuanShu / ncnn_ppstructure

ppstructure deploy by ncnn

23 2 Updated Jul 16, 2024

huchenlei / ComfyUI_omost

ComfyUI implementation of Omost

Python 404 27 Updated Aug 4, 2024

luanshiyinyang / awesome-multiple-object-tracking

Resources for Multiple Object Tracking (MOT)

1,092 165 Updated Jul 5, 2024

alexw914 / RK_VideoPipe

C++ 61 10 Updated Aug 1, 2024

ServerlessLLM / ServerlessLLM

Cost-efficient and fast multi-LLM serving.

Python 167 14 Updated Sep 4, 2024

NisaarAgharia / Advanced_RAG

Advanced Retrieval-Augmented Generation (RAG) through practical notebooks, using the power of Langchain, OpenAI GPTs, META LLAMA3, and Agents.

Jupyter Notebook 153 29 Updated Apr 26, 2024