Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Execluding some species from training #385

Open
m-a-saleh opened this issue Dec 4, 2023 · 1 comment
Open

Execluding some species from training #385

m-a-saleh opened this issue Dec 4, 2023 · 1 comment

Comments

@m-a-saleh
Copy link

Dear developers,

I have a Si slab with varying thickness t_si and whose top and bottom surfaces are passivated by "H" atoms. I need to train an SGP model on AIMD data for it. Training the model on the AIMD data with the two species (H & Si) gives a positive likelihood. However, I think that excluding the "H" atoms from the training will lead to a more accurate trained model as I am interested only in the forces on Si atoms.

To exclude the "H" atoms, I have tried the following, which all resulted in a negative likelihood:
-Remove lines starting with "H" from the AIMD.xyz file for all frames
-Assign zero forces to "H" atoms in the AIMD.xyz file
-Use small cutoffs in cutoff_matrix for "H" (e.g, ctuoff_matrix = [[1.0, 1.0], [1.0, 5.0]])

Any idea on how to exclude those surface "H" atoms?

Thanks

@cjowen1
Copy link
Collaborator

cjowen1 commented Dec 4, 2023

Hi m-a-saleh,

Thanks for your question. I would be cautious in using this approach, since the presence of H atoms also influence the forces on the Si atoms at the interface. Moreover, H atoms reduce the total energy of the system by filling the 'dangling-bonds' of Si that are formed from cleaving the bulk crystal, so learning the energy by just removing H in this way might be problematic. Would it be too expensive to generate some pure Si data? Moreover, you can train on both H and Si, but use the resultant model to model pure Si slabs and bulk (as long as you have also seen pure Si slabs and bulk in the training).

Best,
Cameron

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants