cleaning-data

Star

Here are 339 public repositories matching this topic...

amarlearning / exploring-the-evolution-of-Linux

Star

Data Analysis about the development of the Linux operating system by exploring its Git repository history.

linux data data-analysis data-wrangling git-history first-commit datacamp cleaning-data

Updated Dec 11, 2018
Jupyter Notebook

HRNowak / cruise-reviews

Star

Cruise Reviews - NLP - Text Classification

data-science data webscraping nlp-machine-learning cleaning-data cruise-industry

Updated May 6, 2020
Jupyter Notebook

PeterSchuld / Udacity_DataAnalystNanodegree-DataWrangling

Star

Project No. 4 in the Udacity Data Analyst Nanodegree Winter 2019-2020. Using Python, we’ll gather data from a variety of sources, assess its quality and tidiness, then clean it. We’ll document our wrangling efforts in a Jupyter Notebook, plus showcase them through analyses and visualizations using Python and SQL.

json-data api-server python3 file-format cleaning-data

Updated May 31, 2021
Jupyter Notebook

marke0816 / Movies-ETL

Star

This repository creates an ETL pipeline which takes in movie data from Kaggle and Wikipedia. The ETL_create_database.ipynb file contains all the code necessary to perform all three steps.

etl regex cleaning-data

Updated Apr 8, 2021
Jupyter Notebook

Fuenj / Data-Wrangling-and-Analyzing-Twitter-Data

Star

Wrangling and analyzing we rate dogs twitter account which rates people's dogs with a humorous comment about the dog.

python data udacity twitter analysis numpy pandas wordcloud udacity-nanodegree gathering datavisualization dataanalysis textfile cleaning-data weratedogs gathering-data assessing-data wrangling-cleaning wrangling-data

Updated Nov 22, 2020
Jupyter Notebook

Abanoub8yuossef / Data-Wrangling-Project

Star

Data wrangling project from Udacity professional data analysis Nanodegree

visualization python numpy pandas data-analysis wrangling cleaning-data gathering-data assessing-data

Updated Jun 11, 2022
Jupyter Notebook

vmcapilla / Project-1_Divvy_Analysis

Star

ggplot2 analysis lubridate cleaning-data

Updated Dec 7, 2022
R

nhthaonguyen / Customer-Segmentation---RFM-Analysis

Star

Customer Segmentation is one of crucial analysis for business Marketing Strategy. In this dataset, from a raw customer purchasing history, use Python to clean, explore and prepare for further analysis. I applied 2 different approaches of Customer Segmentation: traditional and RFM.

python exploratory-data-analysis data-visualization data-analysis powerbi cleaning-data

Updated Apr 24, 2023
Jupyter Notebook

Apurva1205 / Time-Series-Forecasting-For-Energy-Consumption

Star

Energy management, grid dependability, and the distribution of sustainable resources all depend heavily on understanding and forecasting energy demand trends. This project is extremely important in a number of areas.

data-science machine-learning time-series exploratory-data-analysis python3 xgboost-algorithm cleaning-data

Updated Dec 5, 2023
Jupyter Notebook

fatm2 / Survey-Analysis-of-Data-Professionals-Dashboard

Star

dashboard analytics business-intelligence powerbi cleaning-data

Updated Dec 19, 2023

SebastienPavot / BWIN-Marketing-Datamart-R

Star

R / Shiny - Clean, merge and visualize into Shiny a BWIN Datamart.

r shiny cleaning-data

Updated Dec 12, 2020
R

AbdelrahmanAmr3 / Wrangle-and-Analyze-Data-Udacity-Project

Star

Udacity Data Analyst Nanodegree - Project IV

visualization python json numpy csv-files plot pandas report data-analysis matplotlib data-wrangling tweepy udacity-data-analyst-nanodegree assesment cleaning-data jypyternotebook tsv-files

Updated Dec 15, 2020
Jupyter Notebook

vpetrova13 / dirty_data_project_VM

Star

Dirty data project completed as part of Data Analysis course 🎓

dataanalysis dirtydata cleaning-script cleaning-data

Updated Mar 22, 2021
HTML

samer-alhalabi / Data-Wrangling

Star

Repo to show some basics techniques of data wrangling

numpy pandas datascience wrangling cleaning-data

Updated Feb 24, 2021
Jupyter Notebook

c-morey / challenge-data-analysis

Star

This repository provides a Jupyter notebook on basic data cleaning and exploratory data analysis process with a CSV file that was scrapped from a real estate website in Belgium.

data analysis jupyter-notebook cleaning-data cleaning-dataset

Updated Jul 23, 2021
Jupyter Notebook

PrateekDutta2001 / Play-Store-App-Analysis

Star

The Play Store apps data has enormous potential to drive app-making businesses to success. Actionable insights can be drawn for developers to work on and capture the Android market!

data-visualization python3 cleaning-data

Updated Jun 22, 2022
Jupyter Notebook

ritikaga / Festive-Season-Sales-Analysis

Star

Analyze Diwali Sales data using Pandas, NumPy, Matplotlib, and Seaborn Libraries to Improve customer experience and also sales.

visualization python analysis numpy eda pandas seaborn visualization-dashboard cleaning-data cleaning-data-in-python mataplotlib

Updated Aug 22, 2023
Python

pankjsalunkhe / Data-Science-Project

Star

End-to-end projects: customer churning prediction using the Random Forest Classifier Algorithm with 97% accuracy; performing pre-processing steps; EDA and Visulization fitting data into the algorithm; and hyper-parameter tuning to reduce TN and FN values to perform our model with new data. Finally, deploy the model using the Streamlit web app.

visualization webpage deployment random-forest model eda feature-selection accuracy feature-engineering cleaning-data modelselection streamlit readingdatasets

Updated Sep 13, 2023
HTML

OrenYuval / DataCleaningProject

Star

TL:DR - I've done this project in order to excercise Data Cleaning Through SQL which was published and created by Shuki Molk. I'm adding here a link to the excercise

sql database cleaning-data

Updated Aug 29, 2023
TSQL

Omkarnk816 / Kickstarter_Data_Analytics

Star

This dataset analyses roughly of 380,000 Kickstarter projects. It will lead you through a simple data exploration with excel to reveal interesting insights in Kickstarter projects and what attributes are important when it comes to examining the success (or failure) of a certain project.

dashboard excel powerpivot powerquery cleaning-data

Updated Mar 6, 2024

Improve this page

Add a description, image, and links to the cleaning-data topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the cleaning-data topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cleaning-data

Here are 339 public repositories matching this topic...

amarlearning / exploring-the-evolution-of-Linux

HRNowak / cruise-reviews

PeterSchuld / Udacity_DataAnalystNanodegree-DataWrangling

marke0816 / Movies-ETL

Fuenj / Data-Wrangling-and-Analyzing-Twitter-Data

Abanoub8yuossef / Data-Wrangling-Project

vmcapilla / Project-1_Divvy_Analysis

nhthaonguyen / Customer-Segmentation---RFM-Analysis

Apurva1205 / Time-Series-Forecasting-For-Energy-Consumption

fatm2 / Survey-Analysis-of-Data-Professionals-Dashboard

SebastienPavot / BWIN-Marketing-Datamart-R

AbdelrahmanAmr3 / Wrangle-and-Analyze-Data-Udacity-Project

vpetrova13 / dirty_data_project_VM

samer-alhalabi / Data-Wrangling

c-morey / challenge-data-analysis

PrateekDutta2001 / Play-Store-App-Analysis

ritikaga / Festive-Season-Sales-Analysis

pankjsalunkhe / Data-Science-Project

OrenYuval / DataCleaningProject

Omkarnk816 / Kickstarter_Data_Analytics

Improve this page

Add this topic to your repo