Skip to content
View ikmalzulkifli's full-sized avatar
🏠
Working from home
🏠
Working from home

Block or report ikmalzulkifli

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Simple repo to demonstrate how to submit a spark job to EMR from Airflow

Python 32 23 Updated Oct 18, 2020

An attempt to answer the age old interview question "What happens when you type google.com into your browser and press enter?"

40,121 5,549 Updated Aug 19, 2024

The best place to learn data engineering. Built and maintained by the data engineering community.

CSS 1,422 153 Updated Nov 9, 2024

Python or SQL for data transformation

Python 8 Updated Jul 4, 2024

This project demonstrates an end-to-end solution for processing and analyzing real-time conversations data from a JSON file using GCP services and infrastructure automation, showcasing data storage…

Python 8 1 Updated Apr 29, 2024

Sample repo for startdataengineering DE 101 free course

35 23 Updated Jun 24, 2024

Code for blog at https://www.startdataengineering.com/post/python-for-de/

Python 55 61 Updated Jun 7, 2024

Cost Efficient Data Pipelines with DuckDB

C 46 66 Updated Jul 31, 2024

Free full version of exam testing engine vumingo

45 76 Updated Apr 13, 2023

All Algorithms implemented in Python

Python 194,214 45,635 Updated Nov 11, 2024

This repo contains all the code used in the Python for Data Engineering Course

Jupyter Notebook 229 618 Updated Apr 24, 2024

Data on Malaysian parliamentary election results + dataviz with the consolidated datasets

Jupyter Notebook 81 38 Updated Aug 13, 2023

Data which, to the best of my knowledge, I am the first / only to collate and make freely available in a machine-readable way. I will delete files for which I discover a better previous source.

Python 8 3 Updated Apr 6, 2023

open data for blog content at https://www.startdataengineering.com/

1 1 Updated May 2, 2020

Minimalist Hugo theme based on Hyde

CSS 1 Updated May 25, 2020

Simple repo to demonstrate how to submit a spark job

3 Updated Oct 18, 2020

unit test example in DBT

Shell 5 2 Updated Feb 6, 2021

Apache Superset Demp

2 4 Updated Feb 23, 2021

Simple example showing how to trigger a spark job with AWS Lambda

Shell 4 6 Updated Apr 6, 2021

Making data pipelines idempotent

Python 5 5 Updated May 25, 2021

Example repo to create end to end tests for data pipeline.

Python 21 6 Updated Jun 14, 2024

public file hosting

Shell 1 1 Updated Dec 19, 2023

Profile readme

6 4 Updated Jun 1, 2024

Repository showing how to automate data testing as part of CI

Python 9 7 Updated Jul 3, 2022

Multiple node presto cluster on docker container

Makefile 3 Updated Jul 8, 2022

Repo to explain development, CI/CD cycle in dbt

7 21 Updated Sep 1, 2022

Near real time ETL to populate a dashboard.

Python 70 37 Updated Jun 17, 2024

End to end data engineering project

Python 49 17 Updated Oct 27, 2022

Code for dbt tutorial

143 74 Updated May 31, 2024
Next