SourabhKr
  • Berlin

Free, simple, and intuitive online database design tool and SQL generator.

JavaScript 19,306 1,332 Updated Aug 24, 2024

Code for "Efficient Data Processing in Spark" Course

Python 204 46 Updated May 29, 2024

A tool for exploring each layer in a docker image

Go 45,109 1,712 Updated Jul 15, 2024

A curated list of awesome blogs, videos, tools and resources about Data Contracts

157 20 Updated Aug 12, 2024

Open, Multi-modal Catalog for Data & AI

Java 2,132 318 Updated Aug 22, 2024

Understanding Deep Learning - Simon J.D. Prince

Jupyter Notebook 5,965 1,237 Updated Aug 23, 2024

Shows how the CFT modules can be composed to build a secure cloud foundation

HCL 1,204 706 Updated Aug 23, 2024

Deploys a secured BigQuery data warehouse

HCL 77 35 Updated Aug 16, 2024

A collection of learning resources for curious software engineers

Python 46,145 3,696 Updated Aug 20, 2024

1️⃣🐝🏎️ The One Billion Row Challenge -- A fun exploration of how quickly 1B rows from a text file can be aggregated with Java

Java 5,973 1,797 Updated Aug 20, 2024

Dataform is a framework for managing SQL-based data operations in BigQuery

TypeScript 823 160 Updated Aug 23, 2024

This Dataform project processes various marketing data sources and creates a Marketing Data Store (MDS) to be used in several use cases: a) retain historical marketing data; b) create high performanc…

JavaScript 54 31 Updated Jul 16, 2024

📚 Tech blogs & talks by companies that run Apache Flink in production

146 11 Updated Aug 16, 2024

A comprehensive list of books on Software Architecture.

9,679 768 Updated Mar 15, 2023

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 7,378 1,662 Updated Aug 23, 2024

Free Data Engineering course!

Jupyter Notebook 24,241 5,196 Updated Aug 16, 2024

A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!

Python 527 111 Updated Apr 16, 2022

A collective list of free APIs

Python 310,599 33,178 Updated Aug 19, 2024

12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all

HTML 68,828 14,185 Updated Aug 17, 2024

Code snippets from the Streaming Systems book (streamingbook.net).

Java 238 59 Updated Apr 12, 2022

A cheap, serverless version of Snowplow deployed with Terraform that runs on dumky.net

HCL 43 5 Updated Feb 8, 2024

Template for a data contract used in a data mesh.

460 85 Updated Mar 13, 2024

Data Engineering Practice Problems

Dockerfile 1,610 447 Updated Jun 10, 2024

A framework for managing and maintaining multi-language pre-commit hooks.

Python 12,607 789 Updated Aug 5, 2024

A collection of useful .gitignore templates

160,616 83,183 Updated Aug 21, 2024

DuckDB is an analytical in-process SQL database management system

C++ 22,006 1,760 Updated Aug 24, 2024

Python composable command line interface toolkit

Python 15,447 1,391 Updated Aug 24, 2024

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Python 10,807 1,154 Updated Jun 30, 2023

Schema modelling framework for decentralised domain-driven ownership of data.

Java 243 15 Updated Dec 5, 2023

Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.

Java 10,352 2,481 Updated Aug 23, 2024