ANI-2x training data set #39

JMorado · 2021-06-22T12:49:48Z

Hi,

How does one know what was the exact data set used to train ANI-2x?

In the original ANI-2x paper, it is said that the training data set is composed of molecules from a variety of sources, including the GDB-11 database, the CheMBL database, the s66x8 benchmark, and some randomly generated amino acids and dipeptides.
Nevertheless, from what I understood, these data sets are not included integrally because some specific sampling techniques are then employed.

Is it possible to know which were the exact molecules used for training?

Thank you.
Best,
João

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ANI-2x training data set #39

ANI-2x training data set #39

JMorado commented Jun 22, 2021 •

edited

Loading

ANI-2x training data set #39

ANI-2x training data set #39

Comments

JMorado commented Jun 22, 2021 • edited Loading

JMorado commented Jun 22, 2021 •

edited

Loading