- Lead Data Engineer
- New York
- https://soumilshah.com/
- in/shah-soumil
- channel/UC_eOodxvwS_H7x2uLQa-svw
- https://soumilshah1995.blogspot.com
Highlights
- Pro
datahub Public
Forked from datahub-project/datahub
The Metadata Platform for your Data Stack
Java Apache License 2.0 Updated Jul 6, 2024
-
Hudi-spark-sql-minio Public
Hudi-spark-sql-minio
-
Hudi-streamer-emr-7.1.0 Public
Hudi-streamer-emr-7.1.0
GNU General Public License v3.0 Updated Jun 28, 2024
-
apache hudi delta streamer labs
-
apache-x-table-sync-aws-cloud-shell
GNU General Public License v3.0 Updated Jun 19, 2024
-
unitycatalog Public
Forked from unitycatalog/unitycatalog
Open, Multi-modal Catalog for Data & AI
Java Apache License 2.0 Updated Jun 14, 2024
-
Multiple Spark Writers with Apache Hudi
Python Apache License 2.0 Updated Jun 4, 2024
-
hudi-streamer-pulsar Public
hudi-streamer-pulsar
BSD 2-Clause "Simplified" License Updated May 25, 2024
-
election-stock-analysis Public
election-stock-analysis
GNU General Public License v3.0 Updated May 24, 2024
-
HudiDeltaStreamer-SCD-Trino Public
HudiDeltaStreamer-SCD-Trino
-
DeltaStream-BroadcastJoinETL Public
DeltaStream-BroadcastJoinETL
Apache License 2.0 Updated May 20, 2024
-
LinkedIn-Easy-Apply-Bot Public
Forked from nicolomantini/LinkedIn-Easy-Apply-Bot
Automate the application process on LinkedIn
-
hudi-daft-lambda Public
hudi-daft-lambda
-
DaftHudi Public
Forked from dipankarmazumdar/DaftHudi
Build Analytical Applications on Data Lakehouse with Apache Hudi, Daft & Streamlit
Python MIT License Updated May 11, 2024
-
hudi-trino-integeration-guide
-
Daft Public
Forked from Eventual-Inc/Daft
Distributed DataFrame for Python designed for the cloud, powered by Rust
Rust Apache License 2.0 Updated May 2, 2024
-
flink-iceberg-hive Public
flink-iceberg-hive
-
trino-kafka-demo Public
Forked from sorieux/trino-kafka-demo
Hands-on demo for querying Kafka streams using SQL with Trino and data integration with PostgreSQL.
-
DebeziumFlinkHudiSync Public
Bringing Data from MySQL to Kafka Using Debezium, Joining Kafka Topics with Flink, Upserting into a New Kafka Topic, and Ingesting into Hudi Real-Time
-
universal-datalakehouse-mysql-ingestion-deltastreamer
-
universal-datalakehouse-postgres-ingestion-deltastreamer
-
universal-data-lakehouse-xTable-MinIO-Trino
-
DataLakeHouseX-Apache-XTable-MinIO-StarRocks-DeltaStreamer-Hudi-IceBerg-Delta-Interoperability- Public
DataLakeHouseX: Apache XTable, MinIO, StarRocks, DeltaStreamer, Hudi, IceBerg, Delta Interoperability