
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports multiple AI providers (OpenAI / Claude 3 / Gemini / Ollama / Azure / DeepSeek), Knowledge Base (file upload / knowledge manageme…

TypeScript 40,019 9,101 Updated Sep 6, 2024

A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.

Python 10,842 797 Updated Sep 6, 2024

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 1,508 84 Updated Sep 6, 2024

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 5,538 495 Updated Sep 6, 2024

Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.

Python 1,939 117 Updated Aug 21, 2024

Netease Youdao's open-source embedding and reranker models for RAG products.

Python 1,346 90 Updated Sep 6, 2024

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 4,582 303 Updated Aug 28, 2024

✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM

Python 690 29 Updated Sep 6, 2024

[ECCV2024] Official implementation of the paper "When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset"

Python 33 2 Updated Jul 16, 2024

Official PyTorch implementation of SparseTrack (the new version of code will come soon)

Python 131 12 Updated Dec 6, 2023

An MIT rewrite of YOLOv9

Python 461 46 Updated Aug 23, 2024
Python 52 8 Updated Aug 27, 2024

ffmpeg support on jetson nano

Makefile 63 24 Updated Jul 4, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 4,943 338 Updated Sep 6, 2024

Real-time and accurate open-vocabulary end-to-end object detection

Python 1,483 143 Updated Sep 6, 2024

Enforce the output format (JSON Schema, Regex, etc.) of a language model

Python 1,350 59 Updated Aug 21, 2024

Structured Text Generation

Python 8,146 411 Updated Sep 4, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 19,481 2,481 Updated Aug 28, 2024

[ICCV 2023] MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking

Python 140 8 Updated May 9, 2024

TensorRT implementation of Depth-Anything V1, V2

Python 262 31 Updated Jun 20, 2024

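WEB VIDEO PLATFORM is a network video platform based on the GB28181-2016 standard. It supports NAT traversal and access for IPC, NVR, and DVR devices from brands such as Hikvision, Dahua, and Uniview. It also supports GB28181 cascading, forwarding rtsp/rtmp video streams to a GB28181 platform, and forwarding rtsp/rtmp push streams to a GB28181 platform.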

Java 4,800 1,422 Updated Sep 6, 2024

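An image-analysis web service built on lightweight models, including skew-corrected OCR, official seal (stamp) detection and recognition, and license plate recognition. The API uses FastAPI + Gunicorn, with a gradio demo.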

Python 61 10 Updated Apr 30, 2024

🎉CUDA/C++ notes / technical blog: fp32, fp16/bf16, fp8/int8, flash_attn, sgemm, sgemv, warp/block reduce, dot prod, elementwise, softmax, layernorm, rmsnorm, hist, etc.

Cuda 1,109 109 Updated Sep 4, 2024

ppstructure deploy by ncnn

23 2 Updated Jul 16, 2024

ComfyUI implementation of Omost

Python 404 27 Updated Aug 4, 2024

Resources for Multiple Object Tracking (MOT)

1,092 165 Updated Jul 5, 2024
C++ 61 10 Updated Aug 1, 2024

Cost-efficient and fast multi-LLM serving.

Python 167 14 Updated Sep 4, 2024

Advanced Retrieval-Augmented Generation (RAG) through practical notebooks, using the power of Langchain, OpenAI GPTs, META LLAMA3, and Agents.

Jupyter Notebook 153 29 Updated Apr 26, 2024