tomkrol
  • Bayer
  • Warsaw


Robust recipes to align language models with human and AI preferences

Python · 4,533 stars · 393 forks · Updated Sep 23, 2024

JunoDB is PayPal's home-grown secure, consistent and highly available key-value store providing low, single-digit millisecond latency at any scale.

Go · 2,565 stars · 164 forks · Updated Jun 21, 2024

Extensible Rules Engine for custom Dataframe / Dataset validation

Scala · 134 stars · 30 forks · Updated May 7, 2024

Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.

Java · 784 stars · 184 forks · Updated Sep 26, 2024

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Python · 36,543 stars · 14,150 forks · Updated Oct 3, 2024

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Java · 10,266 stars · 2,951 forks · Updated Oct 3, 2024

Essential Spark extensions and helper methods ✨😲

Scala · 750 stars · 151 forks · Updated Sep 26, 2024

Running Presto on k8s

38 stars · 16 forks · Updated Aug 26, 2019

This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spark jobs. It focuses on easing the collection and examination…

Scala · 699 stars · 144 forks · Updated Aug 13, 2024

Learn Scala by examples

Scala · 245 stars · 121 forks · Updated Mar 29, 2018

Examples for High Performance Spark

Scala · 498 stars · 233 forks · Updated Aug 27, 2024

Apache Spark - A unified analytics engine for large-scale data processing

Scala · 39,405 stars · 28,221 forks · Updated Oct 3, 2024