Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Predicting Molecular Properties #678

Merged
merged 8 commits into from
Jun 23, 2024
Merged

Conversation

Sgvkamalakar
Copy link
Contributor

@Sgvkamalakar Sgvkamalakar commented Jun 23, 2024

Pull Request for ML-Crate 💡

Issue Title: Predicting Molecular Properties #636

  • Info about the related issue (Aim of the project) : This project aims to develop predictive models to estimate scalar coupling constants between atom pairs in molecules based on their structural features and conduct exploratory data analysis (EDA) to uncover relationships between molecular properties and structural characteristics.
  • Name: S G V Kamalakar
  • Email ID for further communication: [email protected]
  • GitHub ID: Sgvkamalakar
  • Idenitfy yourself: SSOC S3

Closes: #636

Describe the add-ons or changes you've made 📃

Give a clear description of what have you added or modifications made

Modifications Made:

Added interactive 3D visualizations of molecular structures using Plotly, color-coded by atom type, and implemented animations to highlight atoms and bonds.

Analyzed and visualized the distribution of unique atom counts with histograms. Conducted extensive EDA, visualizing scalar coupling constants, Mulliken charges, dipole moments, and potential energy.

Created new features based on atomic interactions, standardized molecular data, and trained multiple regression models to predict scalar coupling constants, evaluating performance with MAE, MSE, and R².

Type of change ☑️

What sort of change have you made:

  • New feature (non-breaking change which adds functionality)

How Has This Been Tested? ⚙️

Describe how it has been tested
Describe how have you verified the changes made

The changes were tested and verified through various steps. Interactive 3D visualizations were rendered and cross-checked against known molecular structures.

Histograms of unique atom counts were generated and validated by comparing them with raw data. EDA visualizations were inspected for accuracy and consistency.

Newly engineered features were analyzed for relevance and contribution to model performance.

image

Finally, regression models (Linear Regressor, Random Forest Regressor, K Nearest Neighbors, Support Vector Regressor, Decision Tree Regressor, and Simple Feed Forward Neural Network) were evaluated against benchmarks, and the entire workflow was validated for reproducibility in a clean environment.

Checklist: ☑️

  • My code follows the guidelines of this project.
  • I have performed a self-review of my own code.
  • I have commented my code, particularly wherever it was hard to understand.
  • I have made corresponding changes to the documentation.
  • My changes generate no new warnings.
  • I have added things that prove my fix is effective or that my feature works.
  • Any dependent changes have been merged and published in downstream modules.

Copy link

Our team will soon review your PR. Thanks @Sgvkamalakar :)

Copy link
Owner

@abhisheks008 abhisheks008 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good to me. Approved @Sgvkamalakar

@abhisheks008 abhisheks008 added Approved ✅ This PR is approved by the PR or, Mentors. Intermediate Points 30 - SSOC 2024 SSOC labels Jun 23, 2024
@abhisheks008 abhisheks008 merged commit 28a26ce into abhisheks008:main Jun 23, 2024
1 check passed
@abhisheks008 abhisheks008 added the Points Added 🎉 This issue's points has been added to the leaderboard. label Jun 23, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Approved ✅ This PR is approved by the PR or, Mentors. Intermediate Points 30 - SSOC 2024 Points Added 🎉 This issue's points has been added to the leaderboard. SSOC
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[Project Addition] Predicting Molecular Properties
2 participants