Lists (1)
Sort Name ascending (A-Z)
Stars
Document (PDF) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown
Bringing Old Photo Back to Life (CVPR 2020 oral)
Image super resolution models for PyTorch.
🔎 Super-scale your images and run experiments with Residual Dense and Adversarial Networks.
Convert the model in PaddleOCR to ONNX format
A simple screen parsing tool towards pure vision based GUI agent
Image restoration with neural networks but without learning.
Official codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior
The state-of-the-art image restoration model without nonlinear activation functions.
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
An Intelligent Agentic System for Complex Image Restoration Problems
Handwritten Text Recognition and Character Detection
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.
Export Apple Notes to html, plain text, Markdown (eg for Obsidian), PDF, and DOCX (word) including images / attachments
📃 A better UX for chat, writing content, and coding with LLMs.
Multimodal LLM Application with PyMuPDF4LLM
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Knowledge Table is an open-source package designed to simplify extracting and exploring structured data from unstructured documents.
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
商用级开源语音自动识别程序库,开箱即用,全平台支持,中英文混合识别。A Cross-platform implementation of ASR inference. It's based on ONNXRuntime and FunASR. We provide a set of easier APIs to call ASR models.