Skip to content
View mgaerber's full-sized avatar

Organizations

@lindau-nobel

Block or report mgaerber

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

FastStream is a powerful and easy-to-use Python framework for building asynchronous services interacting with event streams such as Apache Kafka, RabbitMQ, NATS and Redis.

Python 3,056 154 Updated Nov 3, 2024

Zero shot pdf OCR with gpt-4o-mini

Python 5,751 307 Updated Nov 2, 2024

Huly — All-in-One Project Management Platform (alternative to Linear, Jira, Slack, Notion, Motion)

TypeScript 16,714 1,009 Updated Nov 3, 2024

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 13,650 855 Updated Nov 3, 2024

The authentication glue you need.

Python 13,409 893 Updated Nov 2, 2024

A CLI and a set of examples to learn XSLT with the lxml and saxonche Python parsers.

HTML 2 Updated May 10, 2023

Gaphor is the simple modeling tool

Python 1,871 201 Updated Nov 2, 2024

SysIDE provides SysML v2 language support in VS Code

TypeScript 21 1 Updated Aug 28, 2024

A ShareX/file upload server that is easy to use, packed with features, and with an easy setup!

TypeScript 1,535 141 Updated Oct 23, 2024

A machine learning software for extracting information from scholarly documents

Java 3,533 452 Updated Oct 30, 2024

A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents

Python 19 1 Updated Dec 8, 2022

mupdf mirror

C 1,520 307 Updated Nov 1, 2024

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Python 5,438 511 Updated Nov 1, 2024

API for validating and transforming RDF, ShEx, SHACL and more.

Scala 36 10 Updated Jul 10, 2024

Select, put and delete data from JSON, TOML, YAML, XML and CSV files with a single tool. Supports conversion between formats and can be used as a Go package.

Go 7,146 135 Updated Oct 31, 2024

A shell parser, formatter, and interpreter with bash support; includes shfmt

Go 7,277 345 Updated Oct 20, 2024

A flexible commandline tool for template rendering. Supports lots of local and remote datasources.

Go 2,686 185 Updated Oct 23, 2024

Micro frontend framework

JavaScript 830 171 Updated Oct 31, 2024

FastAPI framework plugins

Python 366 21 Updated Jun 3, 2024

A minimalist production ready plugin system

Python 1,288 123 Updated Oct 29, 2024

This plugin provides the Jenkins integration for Gitea.

Java 206 58 Updated Nov 2, 2024

RDFGraphGen: A Synthetic RDF Graph Generator based on SHACL Constraints.

Python 22 1 Updated Jul 26, 2024

lakeFS - Data version control for your data lake | Git for data

Go 4,426 351 Updated Nov 3, 2024

The new Azure Storage data transfer utility - AzCopy v10

Go 613 221 Updated Nov 1, 2024

Apache Camel is an open source integration framework that empowers you to quickly and easily integrate various systems consuming or producing data.

Java 5,558 4,946 Updated Nov 3, 2024

Airgap Container Swiss Army Knife

Go 127 30 Updated Nov 1, 2024

🔎 Open source distributed and RESTful search engine.

Java 9,739 1,805 Updated Nov 2, 2024

The OpenSearch Catalog is designed to make it easier for developers and community to contribute, search and install artifacts like plugins, visualization dashboards, ingestion to visualization cont…

HTML 21 19 Updated Sep 6, 2024

The Metadata Platform for your Data Stack

Java 9,862 2,915 Updated Nov 1, 2024

📙 Awesome Data Catalogs and Observability Platforms.

708 53 Updated Jul 27, 2024
Next