Here every structure has the same weight, but the force vector with N*3 values is normalized to have the same total weight as the single-value energy. Therefore it is divided by the number of atoms. " ] }, { "cell_type": "code", "execution_count": 6, Niklas Leimeroth committed Mar 08, 2021 "id": "located-individual", Niklas Leimeroth committed Mar 01, 2021 "metadata": {}, "outputs": [], "source": [ "for id, row in df.iterrows():\n", "    struct = ase_to_pyiron(row.atoms)\n", "    s = job.structures.add_structure(struct, f\"id{id}\", relative_weight=1)\n", "    s.fit_properties.add_FitProperty(\"atomic-energy\", target_value=row.energy/row.number_of_atoms, relative_weight=1)\n", "    s.fit_properties.add_FitProperty(\"atomic-forces\", target_value=row.forces, relative_weight=1/row.number_of_atoms)" ] }, { "cell_type": "markdown", "id": "angry-leader", "metadata": {}, "source": [ "### Define the type of potential and necessary functions.\n", "In this case an EAM potential is fitted."
] }, { "cell_type": "code", "execution_count": 7, "id": "functional-formation", "metadata": {}, "outputs": [], "source": [ "job.potential = job.factories.potentials.eam_potential()" ] }, { "cell_type": "markdown", "id": "realistic-karaoke", "metadata": {}, "source": [ "It is necessary to define a pair potential, an electronic density function and an embedding function.\n", "For each of these it is possible to choose between different functional forms.\n", "Classic pair potentials are physically motivated and have a very limited number of parameters that are derived from experimentally measured quantities.\n", "Splines or polynomials offer more flexibility, but can lead to unphysical oscillations or overfitting. Compared with the machine learning potentials shown later, the number of parameters is very low no matter which functions you choose, and the problem is highly nonlinear.\n", "\n", "In this case a generalized Morse function is used for the pair interaction. It has the form\n", "\n", "$(\\frac{D_0}{S-1}\\exp(-\\beta \\sqrt{2S}(r-r_0))-\\frac{D_0S}{S-1}\\exp(-\\beta\\sqrt{2/S}(r-r_0)))+\\delta$\n", "\n", "The parameters in the Morse potential can be derived from physical quantities, but in this case they are just educated guesses. For example $r_0$ is the equilibrium distance of a dimer. 
The nearest neighbor distance in fcc Cu is about 2.5 $\\mathring{A}$ so it is taken as the initial value.\n", "In the case of analytic functions the initial parameter choices should not matter too much, since the functional form is constrained.\n", "\n", "The electronic density and embedding function will be splines. Depending on the properties that are calculated other functional forms could give better results. The initial parameters require more testing and hand tuning than the parameters of analytic functions." ] }, { "cell_type": "code", "execution_count": 8, "id": "interpreted-orange", "metadata": {}, "outputs": [], "source": [ "V = job.factories.functions.morse_B(identifier=\"V_CuCu\", D0=0.35, r0=2.5, beta=2, S=2, delta=0)" ] }, { "cell_type": "code", "execution_count": 9, "id": "mathematical-gasoline", "metadata": {}, "outputs": [], "source": [ "V.parameters.D0.min_val = 0\n", "V.parameters.D0.max_val = 2\n", "V.parameters.r0.min_val = 1.5\n", "V.parameters.r0.max_val = 3.0\n", "V.parameters.S.min_val = 1.1\n", "V.parameters.S.max_val = 10.0\n", "V.parameters.delta.min_val = -1\n", "V.parameters.delta.max_val = 1\n", "V.parameters.beta.min_val = 0.1\n", "V.parameters.beta.max_val = 10" ] }, { "cell_type": "markdown", "id": "written-commission", "metadata": {}, "source": [ "Additionally a screening function needs to be defined for the Morse 
potential." ] }, { "cell_type": "code", "execution_count": 10, "id": "discrete-terminology", "metadata": {}, "outputs": [], "source": [ "V.screening = job.factories.functions.exp_A_screening(identifier=\"V_cutoff\", cutoff=7)" ] }, { "cell_type": "markdown", "id": "wireless-parts", "metadata": {}, "source": [ "The electron density is chosen to be a spline function. The cutoff has to be defined. Derivatives left and right are optional; they default to 0. For the right cutoff this is fine, since the forces should smoothly go to 0. For the left this is not necessarily the best choice, since the function value should increase at very close distances. Very large absolute values will lead to oscillations and should be avoided." ] }, { "cell_type": "code", "execution_count": 11, "id": "authentic-expression", "metadata": {}, "outputs": [], "source": [ "rho = job.factories.functions.spline(identifier=\"rho_CuCu\", cutoff=7, derivative_left=-1)" ] }, { "cell_type": "markdown", "id": "bored-afternoon", "metadata": {}, "source": [ "For a spline function it is necessary to define node points. They can be equally spaced or sampled with higher density around turning points, e.g. the first neighbor distance.\n", "Too few node points mean low flexibility, too many lead to overfitting. This requires some testing to find an optimal choice."
] }, { "cell_type": "code", "execution_count": 12, "id": "hidden-wildlife", "metadata": {}, "outputs": [ { "data": { "text/plain": [ "array([0.5 , 1.58, 2.67, 3.75, 4.83, 5.92, 7. ])" ] }, "execution_count": 12, "metadata": {}, "output_type": "execute_result" } ], "source": [ "rho_nodes = np.linspace(0.5, 7.0, 7).round(2)\n", "rho_nodes" ] }, { "cell_type": "markdown", "id": "binary-devil", "metadata": {}, "source": [ "The nodes need initial values. The electron density should be proportional to $e^{-r}$, so this function is chosen to calculate them." ] }, { "cell_type": "code", "execution_count": 13, "id": "comparative-brush", "metadata": {}, "outputs": [], "source": [ "decaying_exp = lambda r: np.exp(-r)\n", "rho_initial = decaying_exp(rho_nodes)" ] }, { "cell_type": "markdown", "id": "gentle-infrastructure", "metadata": {}, "source": [ "Additionally it is a good idea to define limits for the node points. This is optional for local minimizers, but the fit can quickly run away without limits. Global optimizers typically require them to constrain the sampled space.\n", "\n", "A density can't be negative so the lower limit is set to 0. The upper limit is chosen to be 3 times the initial values. These choices as well as the choice of $e^{-r}$ as initial values are somewhat arbitrary, but don't matter much. 
The electron density from single atoms does not directly influence the calculated energies and forces; instead the summed-up density at some place is used in the embedding function, so the final numerical values are an interplay between electron density and embedding function. Since the latter will also be a spline function it can only be defined for a certain range of rho values as node points. Therefore it is better to limit the range of electron density values and define larger limits for the embedding function instead. " ] }, { "cell_type": "code", "execution_count": 14, "id": "funny-trinidad", "metadata": {}, "outputs": [], "source": [ "rho_mins = np.zeros(len(rho_nodes))\n", "rho_maxs = 3*rho_initial.round(6)\n", "rho.parameters.create_from_arrays(rho_nodes, rho_initial, min_vals=rho_mins, max_vals=rho_maxs)" ] }, { "cell_type": "raw", "id": "promising-draft", "metadata": {}, "source": [ "Finally the last node point at the cutoff range is set to 0 and fitting is disabled to prevent a discontinuous change of energy at the cutoff." ] }, { "cell_type": "code", "execution_count": 15, "id": "mexican-absence", "metadata": {}, "outputs": [], "source": [ "rho.parameters[\"node_7.0\"].start_val = 0\n", "rho.parameters[\"node_7.0\"].enabled = False" ] }, { "cell_type": "markdown", "id": "standard-relative", "metadata": {}, "source": [ "$-\\sqrt{\\rho}$ can be used as initial guess for the embedding energy, which is taken from the second-moment approximation of tight binding. 
\n", "The node points have to be chosen in a range compatible with the electron density. This can be estimated by calculating it for a densely packed structure.\n", "Alternatively atomicrex writes the maximum electron density of all structures to the output. This can be used as a hint for the node points in subsequent fits.\n", "Everything else is similar to the electron density." ] }, { "cell_type": "code", "execution_count": 16, "id": "large-rating", "metadata": {}, "outputs": [], "source": [ "F = job.factories.functions.spline(identifier=\"F_CuCu\", cutoff=5)\n", "F_nodes = np.linspace(0.0, 5.0, 7).round(2)  # 7 nodes worked best; 9 and 11 were worse\n", "F_init = -np.sqrt(F_nodes)\n", "F_maxs = np.zeros(len(F_nodes))\n", "F_mins = -np.ones(len(F_nodes))*5\n", "F.parameters.create_from_arrays(F_nodes, F_init, F_mins, F_maxs)\n", "F.parameters[\"node_0.0\"].enabled = False\n", "F.parameters[\"node_0.0\"].start_val = 0\n", "F.parameters[\"node_5.0\"].enabled = False\n", "F.parameters[\"node_5.0\"].start_val = 0" ] }, { "cell_type": "markdown", "id": "several-mercy", "metadata": {}, "source": [ "The functions have to be assigned to the potential." ] }, { "cell_type": "code", "execution_count": 17, "id": "heavy-acoustic", "metadata": {}, "outputs": [], "source": [ "job.potential.pair_interactions[V.identifier] = V\n", "job.potential.electron_densities[rho.identifier] = rho\n", "job.potential.embedding_energies[F.identifier] = F" ] }, { "cell_type": "markdown", "id": "alien-chancellor", 
"metadata": {}, "source": [ "### Define fitting procedure\n", "Finally a few parameters need to be set that influence the fitting process.\n", "The minimization can be done with different algorithms. Atomicrex itself implements the BFGS algorithm. Additionally the algorithms from the NLopt library can be used." ] }, { "cell_type": "code", "execution_count": 18, "id": "enormous-segment", "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "The job PotentialDF1 was saved and received the ID: 819\n" ] }, { "data": { "application/json": { "error": "None", "iterations": "array([ 1, 2, 3, ..., 1998, 1999, 2000], dtype=uint32)", "residual": "array([1.39371e+03, 1.39371e+03, 1.39371e+03, ..., 1.52231e-01,\n 1.52231e-01, 1.52231e-01])" }, "text/plain": [ "Output({'error': None, 'residual': array([1.39371e+03, 1.39371e+03, 1.39371e+03, ..., 