View keen85's full-sized avatar
  • Germany
  • 01:21 (UTC +02:00)


A Spark plugin for reading and writing Excel files

Scala 463 147 Updated Oct 1, 2024

A native Delta implementation for integration with any query engine

Rust 123 33 Updated Oct 2, 2024

DuckDB extension for Delta Lake

C++ 123 14 Updated Sep 24, 2024

Python SQL Parser and Transpiler

Python 6,493 670 Updated Oct 2, 2024

Fabric Python Notebooks examples

Jupyter Notebook 36 4 Updated Oct 1, 2024

Scan documents to PDF and more, as simply as possible.

C# 2,742 321 Updated Sep 22, 2024

Apache PyIceberg

Python 403 147 Updated Oct 2, 2024

LakeSail's computation framework with a mission to unify stream processing, batch processing, and compute-intensive (AI) workloads.

Rust 344 9 Updated Oct 2, 2024

Turning PySpark Into a Universal DataFrame API

Python 289 8 Updated Oct 1, 2024

Free universal database tool and SQL client

Java 39,465 3,415 Updated Oct 2, 2024

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, Du…

Rust 3,832 212 Updated Oct 2, 2024

Open, Multi-modal Catalog for Data & AI

Java 2,276 357 Updated Oct 2, 2024

GUI Tool To Remove Ads From Various Places Around Windows 11

C# 6,579 220 Updated Sep 12, 2024

Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

Scala 1,160 420 Updated Oct 2, 2024

Qubole Sparklens tool for performance tuning Apache Spark

Scala 562 138 Updated Jun 26, 2024

A DataOps framework for building a lakehouse.

Python 26 2 Updated Oct 1, 2024

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

Java 2,350 928 Updated Sep 30, 2024

Tools for Microsoft Fabric

Python 12 1 Updated May 13, 2024

An Open Standard for lineage metadata collection

Java 1,724 300 Updated Oct 2, 2024

A fast static code analyzer & language server for Python

Rust 2,389 33 Updated Sep 26, 2024

Distributed DataFrame for Python designed for the cloud, powered by Rust

Rust 2,162 145 Updated Oct 2, 2024

Watch a file or folder and automatically commit changes to a git repo easily.

Shell 1,512 218 Updated May 20, 2024

Git Extensions is a standalone UI tool for managing git repositories. It also integrates with Windows Explorer and Microsoft Visual Studio (2015/2017/2019).

C# 7,715 2,079 Updated Sep 28, 2024

The Data Engineering Cookbook

13,620 2,499 Updated Aug 1, 2024

A stand-alone test framework for writing unit tests for Data Factory pipelines on Microsoft Fabric, Azure Data Factory and Azure Synapse Analytics.

Python 81 18 Updated Oct 2, 2024

DacFx, SqlPackage, and other SQL development libraries enable declarative database development and database portability across SQL versions and environments. Share feedback here on dacpacs, bacpacs…

C# 313 19 Updated Sep 25, 2024

fsspec-compatible Azure Datalake and Azure Blob Storage access

Python 175 104 Updated Aug 15, 2024
Python 2 Updated Jul 13, 2023

ripgrep recursively searches directories for a regex pattern while respecting your gitignore

Rust 47,940 1,971 Updated Sep 30, 2024