H.W. Darwish et al. / Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy 122 (2014) 744–750
747
predictive ability of the model and to reduce its complexity [31].
Several techniques devoted to variable selection in PLS models ap-
plied to spectral data have been presented [32,33]. It has already
been shown that Genetic Algorithms (GAs) can be successfully
used as a variable selection technique [34,35]. The architecture of
a GA can be divided into five components: Initiation, Evaluation,
Exploitation, Exploration and Mutation. An important issue of suc-
cessful GA performance is the adjustment of GA parameters [36].
The fitness values were used as response variables. Mutation
rate was 0.005 in all cases as when it increased above this value,
no convergence occurred between average fitness and best fitness
values and model stop. The adjusted GA parameters are shown in
Table 2.
The GA was run on 170, 90 and 130 variables for AML, VAL and
HCT, respectively, using a PLS with maximum number of factors
allowed is the optimal number of components determined by
cross-validation on the model containing all the variables, and
the selected variables were used for running of PLS model and as
input data for ANN. GA reduced absorbance matrix to about
the optimum number of factors, which involves selecting that
model including the smallest number of factors that results in an
insignificant difference between the corresponding RMSECV and
the minimum RMSECV Fig. 3.
ANN
The large number of nodes in input layer of the network
(wavelength readings) increases the CPU time for ANN modeling,
the absorbance matrix was reduced either by Genetic Algorithm
(variable selection procedure) to about 36–46% of the original ma-
trix or Principal Component Analysis (PCA) (variable compression
procedure) to three principal components. Thus three ANNs
(ANN, GA-ANN and PC-ANN) were applied in our work. The output
layer is the concentration matrix of one component. The hidden
layer consists of just single layer which has been considered suffi-
cient to solve similar or more complex problems. Moreover, more
hidden layers may cause overfitting [25].
The values of the optimized ANN parameters for each drug are
shown in Table 3. From the most important parameters that should
be optimized carefully, transfer function pair. Choosing of transfer
function depends on the nature of data you work on. In our case,
purelin–purelin transfer function was implemented in our models
due to linear correlation between absorbance and concentration.
After optimization of architectures and parameters of the ANNs,
the training step was done. ANN was trained by different training
functions and there was no difference in performance (no decrease
in Mean Square Error MSE). Levenberg–Marquardt back propaga-
tion (TRAINLM) [42] was thus preferred as it is time saving. To
avoid overfitting of our model, the validation set was encountered
in training step and ANN stops when MSE of calibration set de-
creased and that of validation set increased. Analysis from raw
data, Genetic Algorithm model and Principal Component Analysis
was implemented to test for improvement of predictions.
The proposed chemometric methods were run on the calibra-
tion data using optimal parameters. The concentrations of the
three drugs in the calibration set (15 mixtures) were calculated.
By plotting predicted concentrations of each component versus
actual concentrations, a straight line was obtained (Table 1, Sup-
plementary Material). In order to validate the proposed methods,
the validation set (10 mixtures) was analyzed with the proposed
methods (Table 4).
3
6–46% of the original matrix (62, 42 and 54 wavelengths for
AML, VAL and HCT, respectively).
Partial Least Squares (PLS-1)
The purpose of PLS method is to build a calibration model be-
tween the concentration of the analytes under study (AML, VAL
and HCT in our case) and the latent variables of the data matrix
[
37]. Two different approaches can be used in Partial Least Squares
called PLS-1 and PLS-2. PLS-2 uses the whole information about the
concentration of all components to form latent variables (LVs),
while PLS-1 uses only the information about the concentration of
one component to create the LVs used by the model [38].
Including extra LVs in the model increases the possibility of the
known problem of overfitting. On the other hand, if the number of
LVs was too small meaningful data that could be necessary for the
calibration might be discarded. Therefore optimization of number
of the LVs is a critical issue in PLS method. Leave one out (LOO)
cross validation [39] and the bootstrap [40] can be applied to
predict the optimum number of PLS components.
PLS-1 method was run on the calibration data of absorption
spectra. To select the number of factors in the PLS-1 algorithm, a
cross validation (CV) method leaving out one sample at a time
The proposed PLS-1, GA-PLS, ANN, GA-ANN and PCA-ANN
[
41] was applied using calibration set of 15 calibration spectra.
methods were successfully applied for the determination of AML,
RMSECV (Root Mean Squares Error of Cross-Validation) indicates
both of the precision and accuracy of predictions. It was recalcu-
lated upon addition of each new factor to the PLS-1. The method
developed by Haaland and Thomas [28] was used for selecting
VAL and HCT in Exforge HCTÒ tablets, Table 5. The validity of the
proposed methods was further assessed by applying the standard
addition technique (Table 2, Supplementary Material).
The results obtained for the analysis of AML, VAL and HCT in Ex-
forge HCTÒ tablets by the suggested methods were statistically
compared with those obtained by applying the reported HPLC
method [16] and no significant difference between the results
was obtained as shown in Table 6.
Table 2
Parameters of the Genetic Algorithms.
GA reduced the optimal number of latent variables of PLS-1
model for AML from three into two factors. Also, recoveries and
RMSEP (Root Mean Square Error of Prediction) were decreased
indicating a better resolution power of GA-PLS model than PLS-1
model (Table 4).
GA allowed the use of less number of neurons (shorter training
time) for AML than those used in the network utilized the raw data.
While PCA-ANN did not show any improvement than ANN, even
the results were worse (Table 4). These results indicate that vari-
able selection models (GA) are more suitable than data compres-
sion procedure (PCA), when preceding ANN, for the analysis of
this ternary mixture. This result may be attributed to the fact
that GA introduces the most relevant wavelengths to the drug
concentration.
Parameter
Value
Population size
Maximum generations
Mutation rate
The number of variables in a window (window width)
Per cent of population the same at Convergence
20
50
0.005
2
100, except VAL
(
50)
%
Wavelengths used at initiation
50
Single
3, except AML (2)
Random
Crossover type
Maximum number of latent variables
Cross validation
Number of subsets to divide
Data into for cross validation
Number of iterations for cross validation at each
generation
4
2