Basis adaptor and Data Repository #18

Open
Dom1L opened this issue Dec 18, 2017 · 1 comment

Comments

Dom1L commented Dec 18, 2017

What was the reason for choosing the 6-311G** basis set in combination with the wB97X-D functional? I would have thought that a basis set augmented with diffuse functions would do a better job when dealing with electrostatics or charged molecules in general. Was it just because your initial training set didn't contain any charged molecules, or was there another reason?

Greetings,
Dominik

jparkhill (Owner) commented

Actually, another good reason to use an augmented (aug) basis is to smooth out BSSE (basis set superposition error).
The reasons we did it this way for 0.1:

  • Our training data has some zwitterionic but no ionic species. This was intentional, because the local PES of an N+1 or N-1 atom should not be the same as that of the neutral, i.e., the Behler graph needs some electronic configuration input beyond the geometry to treat charged species properly (imo). Correct solvation of charged species with electronic state variables is a goal for 0.2.
  • Generation of training data is a relatively low priority for us, and we got "locked-in" after amassing a fair amount of data in this basis.

I wouldn't call using 6-311G** an "issue", but there are two issues I would raise to this effect:

  • Developing an adaptor that allows training on mixed-basis-set data by adjusting for the atomization energies within each ab initio method.
  • Developing a procedure to share and distribute training data. It's pretty tragic that there are like 3-4 government-funded places to store molecular data in the US and EU, and all of them are useless for the purposes of sharing 10 GB datasets. They are only capable of showing you a drawing of aspirin and giving the HOMO-LUMO gap with 10 functionals (Yay funding agencies :P).
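
To make the adaptor idea a bit more concrete, here is a minimal sketch of one way it might work: convert each total energy to an atomization energy using free-atom energies computed with the same method and basis, so that labels from different levels of theory land on a comparable scale. This is not code from the project. The method names, the `ATOM_ENERGIES` table, the `atomization_energy` helper, and all numbers are hypothetical placeholders chosen for illustration.

```python
# Hypothetical free-atom energies (Hartree) for two levels of theory.
# These numbers are made-up placeholders, not real calculations.
ATOM_ENERGIES = {
    "method_A": {"H": -0.50, "O": -75.00},
    "method_B": {"H": -0.51, "O": -75.10},
}


def atomization_energy(total_energy, atom_counts, method):
    """Subtract the summed free-atom energies of the same method.

    total_energy: molecular total energy from `method` (Hartree).
    atom_counts:  dict mapping element symbol to number of atoms.
    method:       key into ATOM_ENERGIES naming the level of theory.
    """
    refs = ATOM_ENERGIES[method]
    reference = sum(n * refs[element] for element, n in atom_counts.items())
    return total_energy - reference


# Toy records: one H2O-like composition labeled by both methods.
records = [
    {"method": "method_A", "atoms": {"H": 2, "O": 1}, "energy": -76.35},
    {"method": "method_B", "atoms": {"H": 2, "O": 1}, "energy": -76.47},
]

# Labels on a shared atomization-energy scale, usable in one training set.
labels = [
    atomization_energy(r["energy"], r["atoms"], r["method"]) for r in records
]
```

In this toy case the raw total energies differ by about 0.12 Hartree between the two methods, while the adjusted labels agree. Real data would leave a residual method-dependent difference that this simple shift does not remove.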

If you have any data you'd like to share, or need help training on your data, don't hesitate to reach out. It's non-trivial.

John

jparkhill changed the title from "Choice of DFT Functional and Basis Set" to "Basis adaptor and Data Repository" on Dec 18, 2017