Unverified Commit 6dd414bc authored by Jan Janssen's avatar Jan Janssen Committed by GitHub
Browse files

Merge pull request #48 from pyiron/updates

Updates
parents a41b5fea ae1a5ae5
%% Cell type:code id:political-robinson tags:
%% Cell type:markdown id:entitled-break tags:
# Fitting an EAM potential
EAM potentials are pair functionals.
In a generalised form they are equal to Finnis-Sinclair, effective medium theory or glue potentials. Their total energy can be written as
$E = \sum_{ij}V(r_{ij}) + \sum_i F(\rho_i)$
with
$\rho_i = \sum_j \rho(r_{ij})$
The original functions for V, $\rho$ and F were derived from different theories, but they can be chosen freely.
Fitting is done using atomicrex https://atomicrex.org. In the fit process an objective or cost function is minimized. The objective function is defined as
$\chi^2 = \sum_i w_i r_i$
where $w_i$ is a weight and $r_i$ is a residual that describes the difference to target values. This residual can be defined in different ways, so it is not possible to simply compare the residual for different fitting processes or codes. A more in depth explanation and some examples can be found on https://atomicrex.org/overview.html#objective-function.
%% Cell type:code id:boring-simon tags:
``` python
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
from pyiron import Project, ase_to_pyiron
```
%% Cell type:code id:tracked-postcard tags:
%% Cell type:markdown id:southwest-southwest tags:
### Import the training data
%% Cell type:code id:adaptive-yacht tags:
``` python
data_pr = Project("../../datasets")
if len(data_pr.job_table()) == 0:
data_pr.unpack("Cu_training_archive")
data_pr.job_table()
```
%%%% Output: execute_result
id status chemicalformula job \
0 619 finished None df1_A1_A2_A3_EV_elast_phon
1 620 finished None df3_10k
2 621 finished None df2_1k
0 759 finished None df1_A1_A2_A3_EV_elast_phon
1 760 finished None df3_10k
2 761 finished None df2_1k
subjob projectpath \
0 /df1_A1_A2_A3_EV_elast_phon /home/niklas/pyiron/projects/
1 /df3_10k /home/niklas/pyiron/projects/
2 /df2_1k /home/niklas/pyiron/projects/
project timestart timestop \
0 import_database/Cu_database/ 2021-02-08 10:33:52.341472 None
1 import_database/Cu_database/ 2021-02-08 10:33:53.993230 None
2 import_database/Cu_database/ 2021-02-08 10:33:54.435308 None
project \
0 pyiron_potentialfit/datasets/imported_datasets/Cu_database/
1 pyiron_potentialfit/datasets/imported_datasets/Cu_database/
2 pyiron_potentialfit/datasets/imported_datasets/Cu_database/
timestart timestop totalcputime computer \
0 2021-02-08 10:33:52.341472 None None zora@cmti001#1
1 2021-02-08 10:33:53.993230 None None zora@cmti001#1
2 2021-02-08 10:33:54.435308 None None zora@cmti001#1
totalcputime computer hamilton hamversion parentid masterid
0 None zora@cmti001#1 GenericJob 0.4 None None
1 None zora@cmti001#1 GenericJob 0.4 None None
2 None zora@cmti001#1 GenericJob 0.4 None None
hamilton hamversion parentid masterid
0 TrainingContainer 0.4 None None
1 TrainingContainer 0.4 None None
2 TrainingContainer 0.4 None None
%% Cell type:code id:authentic-substance tags:
%% Cell type:code id:whole-dragon tags:
``` python
data_job = data_pr.load("df1_A1_A2_A3_EV_elast_phon")
df = data_job.to_pandas()
```
%% Cell type:code id:vertical-simon tags:
%% Cell type:code id:portuguese-trance tags:
``` python
pr = Project("WorkshopPotential")
#pr.remove_jobs()
```
%%%% Output: stream
Are you sure you want to delete all jobs from 'WorkshopPotential'? y/(n) y
%% Cell type:code id:finite-planner tags:
%% Cell type:markdown id:vanilla-asset tags:
### Create an atomicrex job
%% Cell type:code id:characteristic-corruption tags:
``` python
job = pr.create_job(pr.job_type.Atomicrex, "PotentialDF1")
```
%% Cell type:markdown id:fiscal-advocate tags:
%% Cell type:markdown id:competent-produce tags:
### Add the structures that should be fitted.
It is possible to assign different weights to certain structures or properties, depending on what should be investigated using the potential. Here every structure has the same weight, but the force vector with N*3 values is normalized to have the same total weight as the single value energy. Therefore it is divided by the number of atoms.
%% Cell type:code id:funded-offense tags:
%% Cell type:code id:streaming-distance tags:
``` python
for id, row in df.iterrows():
struct = ase_to_pyiron(row.atoms)
s = job.structures.add_structure(struct, f"id{id}", relative_weight=1)
s.fit_properties.add_FitProperty("atomic-energy", target_value=row.energy/row.number_of_atoms, relative_weight=1)
s.fit_properties.add_FitProperty("atomic-forces", target_value=row.forces, relative_weight=1/row.number_of_atoms)
```
%% Cell type:markdown id:simplified-withdrawal tags:
%% Cell type:markdown id:acoustic-developer tags:
### Define the type of potential and necessary functions.
In this case an eam potential is fitted.
%% Cell type:code id:worth-electricity tags:
%% Cell type:code id:compliant-pharmacy tags:
``` python
job.potential = job.factories.potentials.eam_potential()
```
%% Cell type:markdown id:earlier-arrow tags:
%% Cell type:markdown id:respective-writer tags:
It is necessary to define a pair potential, an electronic density function and an embedding function.
For all of those it is possible to choose between different functional forms.
Classic pair potentials are physically motivated and have a very limited number of paramaters that are derived from a experimentally measured quantity.
Splines or polynomials offer more flexibility, but are not directly physically motivated and can lead to unphysical oscillations or overfitting.
Classic pair potentials are physically motivated and have a very limited number of paramaters that can often be derived from an experimentally measurable quantity.
Splines or polynomials offer more flexibility, but can lead to unphysical oscillations or overfitting. Compared with the machine learning potentials shown later the number of parameters is very low no matter which functions you choose and the problem is highly non linear.
In this case a generalized morse function is used for the pair interaction. It has the form
$(\frac{D_0}{S-1}exp(-\beta \sqrt{2S}(r-r_0))-\frac{D_0S}{S-1}exp(-\beta\sqrt{2/S}(r-r_0)))+\delta $
The parameters in the morse potential can be derived from phyiscal quantities, here they are just educated guesses. For example $r_0$ is the equilibrium distance of a dimer. The nearest neighbor distance in fcc Cu is about 2.5 $\mathring A$ so it is taken as initial value.
In the case of analytic functions the initial parameter choices should not matter too much, since the functional form is constrained.
The electronic density and embedding function will be splines. Their initial parameters require more testing and hand tuning than the parameters of analytic functions. Depending on the properties that are calculated other functional forms could give better results.
%% Cell type:code id:electric-corps tags:
In this case a generalized morse function is used for the pair interaction, while the electronic density and embedding function will be splines. Depending on the properties that are calculated other functional forms could give better results.
``` python
V = job.factories.functions.morse_B(identifier="V_CuCu", D0=0.35, r0=2.5, beta=2, S=2, delta=0)
```
%% Cell type:markdown id:retired-motor tags:
The parameters in D0=3.5 and r0=1.8 are the approximate cohesive energy and the equilibrium lattice constant. Beta and S can also be derived from physical quantities but are chosen randomly in a typical range in this case. Delta is a parameter that shifts the whole function up or down. The initial parameter choices should not matter too much as long as they are somewhat reasonable since they will be optimized in the fitting process anyway.
Pre defined functions like the morse function can be plotted to see the influence of the initial parameter values
%% Cell type:code id:grave-settlement tags:
%% Cell type:code id:affected-tamil tags:
``` python
V = job.factories.functions.morse_B(identifier="V_CuCu", D0=3.5, r0=1.8, beta=2, S=2, delta=0)
V.plot()
```
%% Cell type:code id:sharp-photographer tags:
%%%% Output: execute_result
(<Figure size 720x504 with 1 Axes>,
<AxesSubplot:xlabel='r [$\\AA$]', ylabel='func(r)'>)
%%%% Output: display_data
![]()
%% Cell type:markdown id:forced-listing tags:
Additionally it is a good idea to define limits for the parameters. This is optional for local minimizers, but the fit can quickly run away without limits. Global optimizers typically require them to constrain the sampled space.
%% Cell type:code id:sunset-beginning tags:
``` python
V.parameters.D0.min_val = 0
V.parameters.D0.max_val = 5
V.parameters.r0.min_val = 1
V.parameters.r0.max_val = 2.5
V.parameters.D0.max_val = 2
V.parameters.r0.min_val = 1.5
V.parameters.r0.max_val = 3.0
V.parameters.S.min_val = 1.1
V.parameters.S.max_val = 10.0
V.parameters.delta.min_val = -1
V.parameters.delta.max_val = 1
V.parameters.beta.min_val = 0.1
V.parameters.beta.max_val = 10
```
%% Cell type:markdown id:rough-purchase tags:
%% Cell type:markdown id:raised-kansas tags:
Additionally a screening function needs to be defined for the morse potential
A screening function needs to be defined for the morse potential
%% Cell type:code id:practical-details tags:
%% Cell type:code id:republican-rabbit tags:
``` python
V.screening = job.factories.functions.exp_A_screening(identifier="V_cutoff", cutoff=7)
```
%% Cell type:markdown id:external-ready tags:
%% Cell type:markdown id:diverse-smith tags:
The electron density is chosen to be a spline function. The cutoff has to be defined. Derivatives left and right are optional, they default to 0. For the right cutoff this is fine, since the forces should smoothly go to 0. For the left this is not necessarily the best choice, since the function value should increase at very close distances.
The electron density is chosen to be a spline function. The cutoff has to be defined. Derivatives left and right are optional, they default to 0. For the right cutoff this is fine, since the forces should smoothly go to 0. For the left this is not necessarily the best choice, since the function value should increase at very close distances. Very large absolute values will lead to osciallations and should be avoided.
%% Cell type:code id:departmental-dynamics tags:
%% Cell type:code id:described-blond tags:
``` python
rho = job.factories.functions.spline(identifier="rho_CuCu", cutoff=7, derivative_left=-1)
```
%% Cell type:markdown id:latter-wright tags:
%% Cell type:markdown id:solved-debut tags:
For a spline function it is necessary to define node points. They can be equally spaced or sampled with higher density around turning points, f.e. the first neighbor distance.
Too few nodepoints mean low flexibilty, too many lead to overfitting. This requires some testing to find an optimal choice.
%% Cell type:code id:double-engineering tags:
%% Cell type:code id:hazardous-lying tags:
``` python
rho_nodes = np.linspace(0.5, 7.0, 7).round(2)
rho_nodes
```
%%%% Output: execute_result
array([0.5 , 1.58, 2.67, 3.75, 4.83, 5.92, 7. ])
%% Cell type:markdown id:handled-housing tags:
%% Cell type:markdown id:blocked-volleyball tags:
The nodes need initial values. The electron density should be proportional to $e^{-r}$, so this function is chosen to calculate them.
%% Cell type:code id:developing-apache tags:
%% Cell type:code id:geographic-legislation tags:
``` python
decaying_exp = lambda r: np.exp(-r)
rho_initial = decaying_exp(rho_nodes)
```
%% Cell type:markdown id:separated-journal tags:
%% Cell type:markdown id:outdoor-seven tags:
Additionally it is a good idea to define limits for the node points. This is optional for local minimizers, but the fit can quickly run away without limits. Global optimizers typically require them to constrain the sampled space.
A density can't be negative so the lower limit is set to 0. The upper limit is chosen to be 3 times the initial values. These choices aswell as the choice for $e^{-r}$ as initial values are somewhat arbitrary. The electron density from single atoms does not directly influence the calculated energies and forces, instead the summed up density at some place is used in the embedding function, so the final numerical values are an interplay between electron density and embedding function. Since the latter will also be a spline function it can only be defined for a certain range of rho values as node points. Therefore it is better to limit the range of electron density values and define larger limits for the embedding function instead.
A density can't be negative so the lower limit is set to 0. The upper limit is chosen to be 3 times the initial values. These choices aswell as the choice for $e^{-r}$ as initial values are somewhat arbitrary, but don't matter much. The electron density from single atoms does not directly influence the calculated energies and forces, instead the summed up density at some place is used in the embedding function, so the final numerical values are an interplay between electron density and embedding function. Since the latter will also be a spline function it can only be defined for a certain range of rho values as node points. Therefore it is better to limit the range of electron density values and define larger limits for the embedding function instead.
%% Cell type:code id:colored-discount tags:
%% Cell type:code id:collectible-viewer tags:
``` python
rho_mins = np.zeros((len(rho_nodes)))
rho_maxs = 3*rho_initial.round(6)
rho.parameters.create_from_arrays(rho_nodes, rho_initial, min_vals=rho_mins, max_vals=rho_maxs)
```
%% Cell type:raw id:preliminary-peace tags:
%% Cell type:raw id:bottom-potato tags:
Finally the last node point at the cutoff range is set to 0 and fitting is disabled to prevent a discontinuous change of energy at the cutoff.
%% Cell type:code id:lasting-reply tags:
%% Cell type:code id:narrow-hearts tags:
``` python
rho.parameters["node_7.0"].start_val = 0
rho.parameters["node_7.0"].enabled = False
```
%% Cell type:markdown id:saving-vegetable tags:
%% Cell type:markdown id:miniature-department tags:
$-\sqrt(\rho)$ can be used as initial guess for the embedding energy, which is taken from second moment approximation tight binding.
The node points have to be chosen in a range compatible to the electron density. This can be estimated by calculating it for a densely packed structure.
Alternatively atomicrex writes the maximum electron density of all structures to the output. This can be used as a hint for the node points for consequent fits.
Everything else is similar to the electron density.
%% Cell type:code id:overall-flower tags:
%% Cell type:code id:younger-freight tags:
``` python
F = job.factories.functions.spline(identifier="F_CuCu", cutoff=5)
F_nodes = np.linspace(0.0, 5.0, 7).round(2) #9 is worse 11 is worse 7 is best
F_nodes = np.linspace(0.0, 5.0, 7).round(2)
F_init = -np.sqrt(F_nodes)
F_maxs = np.zeros(len(F_nodes))
F_mins = -np.ones(len(F_nodes))*5
F.parameters.create_from_arrays(F_nodes, F_init, F_mins, F_maxs)
F.parameters["node_0.0"].enabled=False
F.parameters["node_0.0"].start_val = 0
F.parameters["node_5.0"].enabled=False
F.parameters["node_5.0"].start_val = 0
```
%% Cell type:markdown id:neutral-gateway tags:
%% Cell type:markdown id:alert-stake tags:
The functions have to be assigned to the potential
%% Cell type:code id:major-capacity tags:
%% Cell type:code id:banned-allocation tags:
``` python
job.potential.pair_interactions[V.identifier] = V
job.potential.electron_densities[rho.identifier] = rho
job.potential.embedding_energies[F.identifier] = F
```
%% Cell type:markdown id:unusual-retirement tags:
%% Cell type:markdown id:positive-frost tags:
### Define fitting procedure
Finally a few parameters need to be set that influence the fitting process.
The minimization can be done with different algorithms. Atomicrex itself implements the BFGS algorithm. Additionally the algorithms from the nlopt library can be used.
Finally atom types and fit algorithm have to be assigned.
Atomicrex itself implements the BFGS algorithm. Additionally the algorithms from the nlopt library can be used.
%% Cell type:code id:productive-spare tags:
%% Cell type:code id:existing-average tags:
``` python
## Define the atom types of the potential
job.input.atom_types.Cu = None
##
## Limited number of iterations for the workshop
job.input.fit_algorithm = job.factories.algorithms.ar_lbfgs(max_iter=500)
job.run()
job.output
```
%%%% Output: stream
The job PotentialDF1 was saved and received the ID: 622
The job PotentialDF1 was saved and received the ID: 848
%%%% Output: execute_result
Output({'error': None, 'residual': array([1.39371e+03, 1.39371e+03, 1.39371e+03, ..., 1.52231e-01,
1.52231e-01, 1.52231e-01]), 'iterations': array([ 1, 2, 3, ..., 1998, 1999, 2000], dtype=uint32)})
%% Cell type:markdown id:regulation-foster tags:
### Plot the results
%% Cell type:markdown id:black-assets tags:
Plot the resiudal over steps to see how the calculation converges
%% Cell type:code id:partial-burner tags:
``` python
plt.plot(job.output.iterations, job.output.residual)
plt.yscale("log")
```
%%%% Output: display_data
![]()
%% Cell type:markdown id:appreciated-bracelet tags:
Finally it is a good idea to have a look at the final potential. This can reveal unphysical behavior
%% Cell type:code id:demographic-marijuana tags:
``` python
job.plot_final_potential()
```
%%%% Output: execute_result
(<Figure size 576x1296 with 3 Axes>,
array([[<AxesSubplot:title={'center':'Cu F'}, xlabel='$\\rho $ [a.u.]'>],
[<AxesSubplot:title={'center':'Cu rho_CuCu'}, xlabel='r [$\\AA$]'>],
[<AxesSubplot:title={'center':'Cu V_CuCu'}, xlabel='r [$\\AA$]'>]],
dtype=object))
%%%% Output: display_data
![]()
%% Cell type:markdown id:permanent-convention tags:
### Run a lammps simulations with the fitted potential
%% Cell type:code id:inclusive-brick tags:
``` python
lmp = pr.create_job("Lammps", "template")
lmp.structure = pr.create_ase_bulk("Cu", cubic=True)
lmp.potential = job.lammps_potential
murn = lmp.create_job("Murnaghan", "PotentialTest")
murn.run()
```
%%%% Output: stream
The job PotentialTest was saved and received the ID: 849
The job strain_0_9 was saved and received the ID: 850
The job strain_0_92 was saved and received the ID: 851
The job strain_0_94 was saved and received the ID: 852
The job strain_0_96 was saved and received the ID: 853
The job strain_0_98 was saved and received the ID: 854
The job strain_1_0 was saved and received the ID: 855
The job strain_1_02 was saved and received the ID: 856
The job strain_1_04 was saved and received the ID: 857
The job strain_1_06 was saved and received the ID: 858
The job strain_1_08 was saved and received the ID: 859
The job strain_1_1 was saved and received the ID: 860
job_id: 850 finished
job_id: 851 finished
job_id: 852 finished
job_id: 853 finished
job_id: 854 finished
job_id: 855 finished
job_id: 856 finished
job_id: 857 finished
job_id: 858 finished
job_id: 859 finished
job_id: 860 finished
%% Cell type:code id:straight-honduras tags:
``` python
murn.plot()
```
%%%% Output: display_data
![]()
%% Cell type:code id:thousand-caribbean tags:
``` python
murn["output/equilibrium_volume"]**(1/3)
```
%%%% Output: execute_result
Output({'error': None, 'residual': 0.162222, 'iterations': 19588})
3.632164318463751
%% Cell type:markdown id:abandoned-reform tags:
%% Cell type:markdown id:impressed-exhaust tags:
### Same for the 1000 structures dataset
The final parameters of the 100 structure fit can be used for the 1000 Structure fit. This speeds up the fitting process and often leads to better results.
### Same cane be done for the 1000 structures dataset
The final parameters of the 100 structure fit can be used for the 1000 Structure fit. This can speed up the fitting process and often leads to better results, especially if the initially guessed values are far from the optimum.
In general it is a good idea to start with few structures and try around with different functions and initial parameters. This is much faster than using all structures from the beginning and gives good guesses for the initial values of the parameters. It also allows to use global optimization with millions of steps in short time spans.
### This can take long and writes 1000 seperate POSCAR files, TAKE CARE
Run these jobs with more cores or more time after the workshop. Also increase the number of iterations for better results
%% Cell type:code id:preceding-march tags:
%% Cell type:code id:pretty-marina tags:
``` python
pr = Project("PotentialDF2")
#pr.remove_jobs()
j = pr.create_job(pr.job_type.Atomicrex, "PotentialDF2")
j.potential = job.potential.copy()
## Use the final parameters as starting values for the new fit
j.potential.copy_final_to_initial_params()
j.input = job.input.copy()
```
%% Cell type:code id:ancient-september tags:
%%%% Output: stream
Are you sure you want to delete all jobs from 'PotentialDF2'? y/(n) y
%% Cell type:code id:proprietary-breed tags:
``` python
df = data_pr.load("df2_1k").to_pandas()
for id, row in df.iterrows():
struct = ase_to_pyiron(row.atoms)
s = j.structures.add_structure(struct, f"id{id}", relative_weight=1)
s.fit_properties.add_FitProperty("atomic-energy", target_value=row.energy/row.number_of_atoms, relative_weight=1)
s.fit_properties.add_FitProperty("atomic-forces", target_value=row.forces, relative_weight=1/row.number_of_atoms)
```
%% Cell type:code id:correct-israeli tags:
%% Cell type:code id:convinced-gambling tags:
``` python
import time
```
%% Cell type:code id:multiple-harbor tags:
%% Cell type:code id:still-counter tags:
``` python
j.input.atom_types.Cu = None
j.input.fit_algorithm = j.factories.algorithms.ar_lbfgs(max_iter=100000)
## if possible increase number of cores
#j.server.cores = 16
t1 = time.time()
j.run()
## Uncomment if you want to run the job
#j.run()
t2 = time.time()
```
%%%% Output: stream
The job PotentialDF2 was saved and received the ID: 623
The job PotentialDF2 was saved and received the ID: 861
%% Cell type:code id:eligible-soviet tags:
%% Cell type:code id:photographic-division tags:
``` python
t2-t1
```
%%%% Output: execute_result
538.0122790336609
850.7202744483948
%% Cell type:code id:subsequent-diploma tags:
%% Cell type:code id:normal-newspaper tags:
``` python
j.output
```