Kernels
Kernel regression is activated when MILADY
is executed in the mode ml_type=1
.
The generation of a kernel potential is made in two steps as it is described
in Examples : (i) firstly, using ml_type=-2
mode with an
appropriate algorithm we choose the sparse points that define the kernel and (ii)
the parametrization of the kernel using the mode ml_type=1
. Here we will note the options for the
first step as \(k_1\), whilst for the second \(k_2\).
Kernels definitions
- kernel_type (integer),
-
\(k_2\) option.
-
kernel_type = 1
Square-exponential kernel(1)\[\tilde{k}(\mathbf{D}^{s,a}, \mathbf{x}^m) = \sigma_{SE}^2 \exp{-\frac{|\mathbf{D}^{s,a} - \mathbf{x}^m |^2}{2l_{SE}^2}}\]the values to define are \(\sigma_{SE}\) and \(l_{SE}\).
-
kernel_type = 4
Polynomial kernel shifted(2)\[\tilde{k}(\mathbf{D}^{s,a}, \mathbf{x}^m) = \left(\sigma_{SE}^2 + \frac{\mathbf{D}^{s,a} \cdot \mathbf{x}^m }{2l_{SE}^2} \right)^p\]the values to define are \(\sigma_{SE}\), \(l_{SE}\) and \(p\).
-
kernel_type = 5
Polynomial kernel scaled(3)\[\tilde{k}(\mathbf{D}^{s,a}, \mathbf{x}^m) = \sigma_{SE}^2 \left( \frac{\mathbf{D}^{s,a} \cdot \mathbf{x}^m }{|\mathbf{D}^{s,a}| |\mathbf{x}^m|} \right)^p\]the value to define \(\sigma_{SE}\) and \(p\).
-
kernel_type = 6
Mahalanobis - Batchattarya kernel(4)\[\tilde{k}(\mathbf{D}^{s,a}, \mathbf{D}^m) = \left[ \left( \mathbf{D}^{s,a} - \mathbf{x}^m \right)^\top \Sigma^{-1} \left( \mathbf{D}^{s,a} - \mathbf{x}^m \right) \right]^p\]\(p\) should be defined. For the case \(p=1/2\) there is Mahalanobis distance.
kernel_type = 7
Square-exponential kernel for which the covariance matrix is sampled randomly on some linear basis. The only parameter to define issigma_kernel
( typical value issigma_kernel = 0.05
but try more values to be sure that it is adapted for your descriptor).kernel_type = 44
Polynomial kernel for which the covariance matrix is sampled randomly on some linear basis. The only parameter to define iskernel_power
. Usuallykernel_power=4
it is a resonable value (at least on what we have tested, such as, Fe, W, some HEA and aspirin)
Defalutkernel_type = 4
-
- length_kernel (real)
-
\(k_2\) option. It defines \(l_{SE}\) from Eqs. (1) and (2).
Default
length_kernel = 0.05d0
Selections of kernel sparse points
In Milady
the selection of sparse points (ml_type=-2
and write_krnel_matrix=.true.
)
is driven by our intuition based on physics considerations
that some parts of the database are more important than the others.
For example if we are interested in having nice elastic constants we will make a
special treatment for the database classes that contain elastic deformations.
Consequently, from the complete database
we select some classes that we
will name as database_reference
. Beyond the algorithm of sparse points selection
(defined by kernel_dump
) we bias the selection by the number of points to be selected
from database
and database_reference
i.e. np_kernel_full
and np_kernel_ref
,
respectively.
The selection of classes that encompass database_reference
is given by the character
variable classes_for_mcd
(the name is weird because it corresponds sometimes to the most
“smooth” classes without outliers such as molecular dynamics for perfect bulk configuration,
elastic deformations etc). Finally the kernel is written
- write_kernel_matrix (logical)
-
\(k_1\) option. Writes or not the kernel if it is
.true.
or.false.
, respectively. The kernel is written in the filekernel_matrix.dat
, which has the shapenumber_of_data_kernel+1
\(\times\)dim_desc + 4
. The ASCII filekernel_matrix.dat
has the following structure:2314 59 1 a1 a2 ... aD 43 27 07_111_000003 2 b1 b2 ... bD 234 12 09_111_000010 . . . . . . . . . . . . . . . . . . . . . 2314 c1 c2 ... cD 10 127 11_111_000023
The first line gives the number of kernel sparse points (2314 in this example) and the number of columns for each sparse point (59). Each following lines (again, 2314) contains in first position the id of the sparse point, then followed by
D
real values with theD
components of the descriptor and finally there are three labels that identify the origin of that sparse point: an internal id used forMilady
, which identify the system, the id of atom in that system and the human readable name of the system similar toposcar
name file described in Database file names. In above example for sparse point1
is part of system43
and i correspond to atom number27
from the file07_111_000003.poscar
.Default
.false.
- kernel_dump (integer)
-
\(k_1\) option. Algorithm used for the selection of the sparse points.
kernel_dump=1
normalized selection of sparse points using MCD/Mahalanobis distances. There are 4 parameters to set:power_mcd
,np_kernel_ref
,np_kernel_full
and reference classes given byclasses_for_mcd
. Is what we advice to use. More details in the paper A. Zhong et al. 2022 (refered as normalized MCD/Mahalanobis sparse points selection)kernel_dump=2
draft selection of sparse points using MCD/Mahalanobis distances. There are 4 parameters to set:power_mcd
,np_kernel_ref
,np_kernel_full
and reference classes given byclasses_for_mcd
. More details in the paper A. Zhong et al. 2022 (refered as MCD/Mahalanobis sparse points selection)kernel_dump=3
selection based on CUR decomposition. REF Mahoney . There are three parameters that should be set:np_kernel_ref
,np_kernel_full
as well as the reference classes given byclasses_for_mcd
. For advanced applications there are others options for CUR descoposition, such as:cur_kval
,cur_rval
andcur_eps
. However, the selection of sparse points is not very sensible to these last 3 parameters.
- classes_for_mcd (character)
-
\(k_1\) option. It defines the classes that define the
database_reference
. FFor examplesclasses_for_mcd="10 11"
defines the collections of all the atomic environements from the classes10
and11
. Moreover, for the casekernel_dump=1
orkernel_dump=2
the atomic configurations, which belong to these classes, are used to build the sample covariance matrix used to compute MCD/Mahalanobis distance.Default
classes_for_mcd= " 01 "
- np_kernel_ref (integer)
-
\(k_1\) option. Number of sparse points selected from the
database_reference
(defined by the atomic environements specified byclasses_for_mcd
). However, depending on the selection algorithm the selected sparse points can be lower or can have a small noise around the mediam value.Default
np_kernel_ref= 200
- np_kernel_full (integer)
-
\(k_1\) option. Number of points selected from the whole database. However, depending on the selection algorithm the final number of selected points can be lower or larger (but not very different).
Default
np_kernel_full= 800
- power_mcd (real)
-
\(k_1\) option. Defines the power of MCD / Mahalanobis statistical distance on which the selection grid od sparse points is made. More details in A. Zhong et al. 2022. If you do not know what to do … leave the default value.
Default
power_mcd = 0.05d0
- cur_kval (integer)
-
\(k_1\) option. Defines the order of SVD decomposition of the atomic desing matrix in order to perform the leverage column score as was introduced by REF Mahoney. it cannot be larger than the rank(atomic desing matrix) or the number of columns and rows of atomic design matrix. If it is given a negative value then the optimal value i.e. the rank(atomic desing matrix) is used.
Default
cur_kval = -1
- cur_rval (integer)
-
\(k_1\) option. Defines the number of rows selection in CUR decomposition as it was introduced by REF Mahoney. If you are not sure about your choice, choose the default value i.e. -1.
Default
cur_rval = -1
- cur_eps (real)
-
\(k_1\) option. Defines the error of sampling in CUR decomposition as it was introduced by REF Mahoney. If you are nor sure about your choise, choose the default value i.e. 1.
Default
cur_eps = 1.d0