-
Notifications
You must be signed in to change notification settings - Fork 56
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ANI-2x training data set #39
Comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Hi,
How does one know what was the exact data set used to train ANI-2x?
In the original ANI-2x paper, it is said that the training data set is composed of molecules from a variety of sources, including the GDB-11 database, the CheMBL database, the s66x8 benchmark, and some randomly generated amino acids and dipeptides.
Nevertheless, from what I understood, these data sets are not included integrally because some specific sampling techniques are then employed.
Is it possible to know which were the exact molecules used for training?
Thank you.
Best,
João
The text was updated successfully, but these errors were encountered: