BERT, AWS RDS, AWS Forecast, EMR Spark Cluster, Hive, Serverless, Google Assistant + Raspberry Pi, Infrared, Google Cloud Platform Natural Language, Anomaly detection, Tensorflow, Mathematics
-
Updated
Aug 6, 2021 - Jupyter Notebook
BERT, AWS RDS, AWS Forecast, EMR Spark Cluster, Hive, Serverless, Google Assistant + Raspberry Pi, Infrared, Google Cloud Platform Natural Language, Anomaly detection, Tensorflow, Mathematics
Terraform module to create AWS EMR resources 🇺🇦
Analysis performed on data from the Steam platform using Apache Spark and Cloud services such as Amazon Web Services.
Detect Tight Communities in a social Network
Shell scripts for AWS EMR clusters
EMR + Hadoop to Redshift ELT workflow using spark steps API and orchestrated by Apache-Airflow, which ingests disparate datasets focused around 7Gb of I94 arrivals information to produce a simple star schema in Redshift
Lambda to start EMR and run a map reduce job
Analysis of Airline On Time Performance Dataset
Performing various product review analysis on Amazon dataset using Apache Spark and MongoDB
Data Engineering Projects including Data Modeling, Data Warehouse, Data Lake Development
Daily Incremental load ETL pipeline for Ecommerce company using AWS Lambda and AWS EMR cluster, Deployed using Apache airflow in a docker container.
Run a Spark job within Amazon EMR
Implemented random forest machine learning algorithm using pyspark on AWS EMR to classify the wines. The model is then deployed in docker container.
AWS EMR backed Spark cluster for analyzing Yelp Data
Example for provisioning AWS EMR service with Terraform
Load data from the Million Song Dataset into a final dimensional model stored in S3.
TU Berlin Cloud Computing - correctly implemented assignment4
Add a description, image, and links to the aws-emr-clusters topic page so that developers can more easily learn about it.
To associate your repository with the aws-emr-clusters topic, visit your repo's landing page and select "manage topics."