Organizations

@NTL-DE
longNguyen010203/README.md

Hey, I'm Long Nguyen 👋

I am an Artificial Intelligence major in Vietnam with a passion for data engineering, and I am actively seeking job opportunities in this field.

📦 Technologies

Languages: Python SQL PySpark Shell C++

Architecture: ETL ELT Lambda Kappa Star Schema Snowflake Schema

Processing: Spark Kafka Flink dbt Pandas Polars Airbyte NumPy

Storage: PostgreSQL MySQL SQL Server Redshift Snowflake MinIO S3 SQLite

Cloud: S3 EC2 IAM VPC Redshift EMR Glue

Visualization: Seaborn Matplotlib

Scraping: BeautifulSoup Selenium

Orchestration: Airflow Dagster

DevOps: Docker Zookeeper Terraform Git GitLab

Testing & Logging: Unittest Pytest Logging

⚑ Fun fact

  • One-Punch Man is my favorite anime.
  • I enjoy listening to gentle songs, but sometimes I also like remixes.
  • I'm 21 years old but I don't know how to swim.

📫 Contact

Connect with me on LinkedIn

Pinned

  1. Spark-Processing-AWS Public

    👷🌇 Set up and build a big data processing pipeline with Apache Spark and 📦 AWS services (S3, EMR, EC2, IAM, VPC, Redshift), using Terraform to set up the infrastructure and Airflow integration to automate wor…

    Python 1

  2. Youtube-ETL-Pipeline Public

    💜🌈📊 A data engineering project that implements an ETL data pipeline using Dagster, Apache Spark, Streamlit, MinIO, Metabase, dbt, Polars, and Docker. Data comes from Kaggle and the YouTube API 🌺

    Jupyter Notebook 9 1

  3. Spark-Kafka-Self-Learning Public

    📚🌊🎓 A third-year student is self-studying Spark and Kafka as part of their 👷 data engineering journey, with the goal of securing an 📬 internship or fresher job in 2024.

    Shell 1

  4. ECommerce-ELT-Pipeline Public

    🌄📈📉 A data engineering project 🌈 that implements an ELT data pipeline using Dagster, Docker, dbt, Polars, Snowflake, and PostgreSQL. Data comes from the Kaggle website 🔥

    Python 1

  5. Bank-DataWarehouse Public

    📊🌈🏛 This project develops a data warehouse for a bank using Amazon Redshift, VPC, Glue, S3, and dbt, following a ⭐ Star Schema architecture. The goal is to store, manage, and optimize data to suppo…

    1

  6. InspireAI-Web-2024 Public

    🤖💎📺 This project builds an AI chatbot web application with Django, using OpenAI's ChatGPT, DALL-E, and Codex 🍁

    Python 2 1