Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Salary Prediction of Data Professions #670

Closed
Harshit-code-tech opened this issue Jun 19, 2024 · 8 comments · Fixed by #675
Closed

Salary Prediction of Data Professions #670

Harshit-code-tech opened this issue Jun 19, 2024 · 8 comments · Fixed by #675
Assignees
Labels
Advanced Points 40 - SSOC 2024 Assigned 💻 Issue has been assigned to a contributor SSOC

Comments

@Harshit-code-tech
Copy link
Contributor

Salary Prediction

Description

We are looking to enhance our Salary Prediction model to improve its accuracy and robustness. The goal of this issue is to implement, evaluate, and document enhancements to the existing Salary Prediction model. This issue involves several key tasks:

Tasks

Data Preprocessing Enhancements

  • Identify and handle outliers in the dataset.
  • Implement advanced feature engineering techniques to create more predictive features.
  • Explore and document different methods for handling missing values.

Model Development

  • Experiment with different machine learning algorithms and ensemble methods.
  • Perform hyperparameter tuning to optimize model performance.
  • Implement cross-validation to ensure the model's robustness.

Model Evaluation

  • Evaluate the models using relevant performance metrics (e.g., MAE, MSE, RMSE, R2).
  • Compare the performance of different models and document the results.
  • Generate and include visualization plots to illustrate model performance.

Documentation

  • Update the project documentation to reflect the new enhancements.
  • Include a detailed README file explaining the changes made, how to run the enhanced model, and the results obtained.
  • Ensure the code is well-commented for better understanding and maintenance.

Web Application Integration

  • Integrate the best-performing model into the existing Flask web application.
  • Update the web application's front-end to accommodate any new input features or changes in the model.
  • Test the web application to ensure it functions correctly with the new model.

Requirements

  • Strong knowledge of Python and machine learning libraries (pandas, scikit-learn, joblib, etc.).
  • Experience with data preprocessing, feature engineering, and model evaluation.
  • Familiarity with Flask and web application development.
  • Good documentation practices.

Expected Outcome

  • An enhanced Salary Prediction model with improved accuracy and robustness.
  • Comprehensive documentation detailing the enhancements and their impact on model performance.
  • A fully integrated and tested web application showcasing the new model.
Copy link

Thank you for creating this issue! We'll look into it as soon as possible. Your contributions are highly appreciated! 😊

@Harshit-code-tech
Copy link
Contributor Author

I am a contributor on Social Summer Of Code Season 3 please assign me this issue

@abhisheks008
Copy link
Owner

What are the models you are planning to implement?

@Harshit-code-tech
Copy link
Contributor Author

Models to Implement

  1. Linear Regression
  • Reason for Selection: Provides a simple baseline to compare against more complex models. It is easy to interpret and implement.
  1. Decision Tree Regressor
  • Reason for Selection: Captures non-linear relationships and provides easy interpretation through tree visualization.
  1. Random Forest Regressor
  • Reason for Selection: Reduces overfitting and improves accuracy by averaging multiple decision trees.
  1. Gradient Boosting Regressor
  • Reason for Selection: Often outperforms other models by focusing on correcting errors iteratively, providing high accuracy.

@abhisheks008
Copy link
Owner

Can you share the dataset URL, as this type of Salary Prediction model is already present in this repository.

@Harshit-code-tech
Copy link
Contributor Author

@abhisheks008 abhisheks008 changed the title Salary Prediction Salary Prediction of Data Professions Jun 20, 2024
@abhisheks008
Copy link
Owner

Assigned this issue to you @Harshit-code-tech

@abhisheks008 abhisheks008 added Assigned 💻 Issue has been assigned to a contributor Intermediate Points 30 - SSOC 2024 SSOC labels Jun 20, 2024
@abhisheks008 abhisheks008 added Advanced Points 40 - SSOC 2024 and removed Intermediate Points 30 - SSOC 2024 labels Jun 23, 2024
Copy link

Hello @Harshit-code-tech! Your issue #670 has been closed. Thank you for your contribution!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Advanced Points 40 - SSOC 2024 Assigned 💻 Issue has been assigned to a contributor SSOC
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants