Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

evaluate RDkit and chemprop models #2

Open
kaichop opened this issue Jun 11, 2024 · 4 comments
Open

evaluate RDkit and chemprop models #2

kaichop opened this issue Jun 11, 2024 · 4 comments
Assignees

Comments

@kaichop
Copy link
Contributor

kaichop commented Jun 11, 2024

Start from the antibiotic prediction paper and see whether we can reproduce a similar approach of building a prediction model using chemoprop, and then add features extracted from RDkit from all molecules.

@wangwpi wangwpi self-assigned this Jun 17, 2024
@wangwpi
Copy link
Contributor

wangwpi commented Jun 17, 2024

I'm going to experiment the parameters of morgan fingerprint (radius and bit) to see if we could find the best radius and bit for the morgan fingerprint. This is a good tutorial for the fingerprints from the kaggle challenge: https://www.kaggle.com/code/towardsentropy/fingerprint-tips-and-tricks

@Euchiz
Copy link
Contributor

Euchiz commented Jul 2, 2024

I've submitted the results from a chemprop-base-1v5 model with 0.388 public score. Still running a similar version that adds building blocks into features. Code uploaded

@Euchiz Euchiz self-assigned this Jul 2, 2024
@wangwpi
Copy link
Contributor

wangwpi commented Jul 3, 2024

I tested the simple NN model using new morgan fingerprint (radius=4, bits=2048), however it only achived 0.343 public score, which is even lower than using simpler morgan fingerprint (radius=3, bits=1024). We are hitting the ceiling of simple NN model and morgan fingerprint. We could just move on for the GNN or transformer model.

@Euchiz
Copy link
Contributor

Euchiz commented Jul 3, 2024

I've submitted the results from a chemprop-base-1v5 model with 0.388 public score. Still running a similar version that adds building blocks into features. Code uploaded

I've submitted the results from a chemprop-base-1v5 model with 0.388 public score. Still running a similar version that adds building blocks into features. Code uploaded

Just submitted a 1:1 model with building-block features. Got 0.410 score. Still trying higher ratio (a bit difficult as it requires more ram in one run. considering training on small subsets consecutively).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants