johannaschmidle / House-Price-Predictor

A machine learning model that predicts house prices from features such as quality, size, and location, using the Random Forest and XGBoost algorithms (Python)

visualization python machine-learning random-forest sklearn cross-validation xgboost house-price-prediction xgboost-model target-encoding sklearn-library onehot-encoding anova-test ordinal-encoding random-forest-regressors

Updated Jul 26, 2024
Jupyter Notebook

tanviparmarrajput / DATA-CLAENING-DATA-VISUALIZATION

Data cleaning and data visualization with Python libraries such as NumPy, pandas, scikit-learn, seaborn, and matplotlib.pyplot

python encoding numpy pandas-dataframe sklearn pandas seaborn boxplot dataengineering datavisualization pandas-library datacleaning matplotlib-pyplot onehot-encoding imputer ordinal-encoding labelencoding

Updated Apr 17, 2024
Jupyter Notebook

sushantnair / Feature_Engineering

Feature engineering steps implemented in Google Colab, presented step by step.

machine-learning binning standardization feature-engineering normalization onehot-encoding handling-outlier

Updated Mar 29, 2024
Jupyter Notebook

erdogant / df2onehot

Convert an unstructured array into a structured dataframe.

python preprocessing structuring onehot-encoding

Updated Mar 26, 2024
Python

MoeinGazestanii / Car-Sales-Price-Prediction

Car Sales Price Prediction (Streamlit)

data-science regression eda outlier-detection predictive-modeling datavisualization datacleaning onehot-encoding streamlit

Updated Mar 21, 2024
Python

NotTheStallion / Data_preparation_4_ML_algorithm

This project focuses on data preparation and follows these steps: data cleaning, handling text and categorical attributes, and feature scaling.

data-science ml data-preprocessing data-preparation data-cleaning feature-scaling onehot-encoder onehot-encoding

Updated Mar 4, 2024
Jupyter Notebook

yujansaya / kaggle_customer_segmentation

Mall Customer Segmentation Data

data-visualization pca-analysis kmeans-clustering hierarchical-clustering dendogram agglomerative-clustering onehot-encoding labelencoder

Updated Feb 22, 2024
Jupyter Notebook

abibatoki / Classification-Model

A model that predicts startup success from data on early-stage investments in the Crunchbase database.

random-forest heatmap logistic-regression test-data training-data missing-value-handling model-creation onehot-encoding

Updated Feb 19, 2024

norhanreda / Arabic-Text-Diacritization

Diacritics are spoken short vowels of constant length. The same word in Arabic can have different meanings and pronunciations depending on how it is diacritized. In this project, we implement a pipeline that predicts the diacritic of each character in an Arabic text using Natural Language Processing techniques.

nlp neural-network word2vec lstm onehot-encoding arabert embedings arabic-text-diacritization

Updated Feb 15, 2024
Jupyter Notebook

mishika12 / Regression-Predicting_Life_Expectancy

Analyzing and predicting the life expectancy of a country based on multiple factors, using multiple regression techniques

kfold-cross-validation onehot-encoding featurescaling

Updated Jan 5, 2024
Jupyter Notebook

yanganYNU / AFFGCN

Attention Feature Fusion based on a Spatial-Temporal Graph Convolutional Network (AFFGCN)

attention-mechanism gcn multimodal-deep-learning timeseries-forecasting traffic-flow-prediction onehot-encoding spatiotemporal-data-analysis feature-fusion-network

Updated Jan 2, 2024
Python

OmBaval / Airline-Customer-Satisfaction

This project uses a dataset of 103,904 entries with 25 features and the XGBoost classifier to predict whether a customer will be satisfied with a flight. The workflow involves data fetching, feature selection, preprocessing, correlation analysis, best-feature selection, data rescaling, train-test split, and target balancing.

machine-learning feature-selection artificial-intelligence xgboost feature-engineering multicollinearity correlation-analysis onehot-encoding xgboost-classifier anova-test

Updated Jan 1, 2024
Jupyter Notebook

TrilokiDA / Kaggle

sales machine-learning pipeline random-forest forecast kaggle-titanic kaggle-competition support-vector-machines kaggle-dataset house-price-prediction gridsearchcv sklearn-library kaggle-solution xgboost-regression mini-course onehot-encoding

Updated Dec 20, 2023
Jupyter Notebook

hassaanhameed786 / Probability-and-Statistics-for-Computer-Science

Covers the most common discrete and continuous distributions, shows how they are used in decision and estimation problems and applications, and constructs computer algorithms for generating observations from the various distributions. Lastly, it covers entropy, described as the most important concept.

decision entropy seaborn matplotlib statistical-functions spam-detection continuous-distributions nhanes binomial-distribution onehot-encoding

Updated Nov 28, 2023
Jupyter Notebook

Lynn425 / Spam-Email-Classification

Created numeric features from the email text, guided by exploratory data analysis, and used them to train a logistic regression binary classifier. Used cross-validation for feature and model selection.

spam sklearn cross-validation text-analysis seaborn logistic-regression spam-classification binary-classification onehot-encoding