![scala logo](https://raw.githubusercontent.com/github/explore/80688e429a7d4ef2fca1e82350fe8e3517d3494d/topics/scala/scala.png)
Block or Report
Block or report zuoxiaolei
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLists (19)
Sort Name ascending (A-Z)
Language
Sort by: Recently starred
Starred repositories
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
Agentic components of the Llama Stack APIs
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
基于python的网页自动化工具。既能控制浏览器,也能收发数据包。可兼顾浏览器自动化的便利性和requests的高效率。功能强大,内置无数人性化设计和便捷功能。语法简洁而优雅,代码量少。
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
A curated list of awesome libraries, packages, strategies, books, blogs, tutorials for systematic trading.
A Comprehensive Toolkit for High-Quality PDF Content Extraction
A simple and modern Java and Kotlin web framework
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Wo…
#1 Locally hosted web application that allows you to perform various operations on PDF files
Images to inference with no labeling (use foundation models to train supervised models).
PyTorch code and models for the DINOv2 self-supervised learning method.
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
【间隙·树·排序算法】 对OCR结果或PDF提取的文本进行版面分析,按人类阅读顺序进行排序。
This repository contains demos I made with the Transformers library by HuggingFace.
A curated list of resources for Document Understanding (DU) topic
Tesseract Open Source OCR Engine (main repository)
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
A modular graph-based Retrieval-Augmented Generation (RAG) system
Formula recognition based on LaTeX-OCR and ONNXRuntime.
DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including x86 and ARMv9.