Limits...
Optimization of Parameter Selection for Partial Least Squares Model Development.

Zhao N, Wu ZS, Zhang Q, Shi XY, Ma Q, Qiao YJ - Sci Rep (2015)

Bottom Line: In multivariate calibration using a spectral dataset, it is difficult to optimize nonsystematic parameters in a quantitative model, i.e., spectral pretreatment, latent factors and variable selection.In this study, we describe a novel and systematic approach that uses a processing trajectory to select three parameters including different spectral pretreatments, variable importance in the projection (VIP) for variable selection and latent factors in the Partial Least-Square (PLS) model.The PLS model optimizes modeling parameters step-by-step, but the robust model described here demonstrates better efficiency than other published papers.

View Article: PubMed Central - PubMed

Affiliation: 1] Beijing University of Chinese Medicine, Beijing 100102, China [2] Beijing Key Laboratory for Basic and Development Research on Chinese Medicine, Beijing, 100102, China [3] Key Laboratory of TCM-information Engineer of State Administration of TCM, Beijing, China, 100102.

ABSTRACT
In multivariate calibration using a spectral dataset, it is difficult to optimize nonsystematic parameters in a quantitative model, i.e., spectral pretreatment, latent factors and variable selection. In this study, we describe a novel and systematic approach that uses a processing trajectory to select three parameters including different spectral pretreatments, variable importance in the projection (VIP) for variable selection and latent factors in the Partial Least-Square (PLS) model. The root mean square errors of calibration (RMSEC), the root mean square errors of prediction (RMSEP), the ratio of standard error of prediction to standard deviation (RPD), and the determination coefficient of calibration (Rcal(2)) and validation (Rpre(2)) were simultaneously assessed to optimize the best modeling path. We used three different near-infrared (NIR) datasets, which illustrated that there was more than one modeling path to ensure good modeling. The PLS model optimizes modeling parameters step-by-step, but the robust model described here demonstrates better efficiency than other published papers.

No MeSH data available.


Related in: MedlinePlus

Schematic diagram of processing trajectory of PLS model corn samples (a), Yinhuang granules samples (b) and pharmaceutical tablets sample (c).
© Copyright Policy - open-access
Related In: Results  -  Collection

License
getmorefigures.php?uid=PMC4499800&req=5

f3: Schematic diagram of processing trajectory of PLS model corn samples (a), Yinhuang granules samples (b) and pharmaceutical tablets sample (c).

Mentions: However, this result showed that there was more than one modeling path that can ensure a successful model. There were two fair PLS models with RPD between 2.5 to 3 (Fig. 3a) including: 1) a combinational method of SG(9) spectral pretreatment, VIP, and 10 factors, and 2) a combinational method using the second derivative spectra (2D) combined with SG(9) spectral pretreatment, VIP, and 10 factors. Most of PLS models were fair and there were also some good model paths with RPD values greater than 3 (Fig. 3b). In Fig. 3c, there were many models with good performance that adopted a processing trajectory.


Optimization of Parameter Selection for Partial Least Squares Model Development.

Zhao N, Wu ZS, Zhang Q, Shi XY, Ma Q, Qiao YJ - Sci Rep (2015)

Schematic diagram of processing trajectory of PLS model corn samples (a), Yinhuang granules samples (b) and pharmaceutical tablets sample (c).
© Copyright Policy - open-access
Related In: Results  -  Collection

License
Show All Figures
getmorefigures.php?uid=PMC4499800&req=5

f3: Schematic diagram of processing trajectory of PLS model corn samples (a), Yinhuang granules samples (b) and pharmaceutical tablets sample (c).
Mentions: However, this result showed that there was more than one modeling path that can ensure a successful model. There were two fair PLS models with RPD between 2.5 to 3 (Fig. 3a) including: 1) a combinational method of SG(9) spectral pretreatment, VIP, and 10 factors, and 2) a combinational method using the second derivative spectra (2D) combined with SG(9) spectral pretreatment, VIP, and 10 factors. Most of PLS models were fair and there were also some good model paths with RPD values greater than 3 (Fig. 3b). In Fig. 3c, there were many models with good performance that adopted a processing trajectory.

Bottom Line: In multivariate calibration using a spectral dataset, it is difficult to optimize nonsystematic parameters in a quantitative model, i.e., spectral pretreatment, latent factors and variable selection.In this study, we describe a novel and systematic approach that uses a processing trajectory to select three parameters including different spectral pretreatments, variable importance in the projection (VIP) for variable selection and latent factors in the Partial Least-Square (PLS) model.The PLS model optimizes modeling parameters step-by-step, but the robust model described here demonstrates better efficiency than other published papers.

View Article: PubMed Central - PubMed

Affiliation: 1] Beijing University of Chinese Medicine, Beijing 100102, China [2] Beijing Key Laboratory for Basic and Development Research on Chinese Medicine, Beijing, 100102, China [3] Key Laboratory of TCM-information Engineer of State Administration of TCM, Beijing, China, 100102.

ABSTRACT
In multivariate calibration using a spectral dataset, it is difficult to optimize nonsystematic parameters in a quantitative model, i.e., spectral pretreatment, latent factors and variable selection. In this study, we describe a novel and systematic approach that uses a processing trajectory to select three parameters including different spectral pretreatments, variable importance in the projection (VIP) for variable selection and latent factors in the Partial Least-Square (PLS) model. The root mean square errors of calibration (RMSEC), the root mean square errors of prediction (RMSEP), the ratio of standard error of prediction to standard deviation (RPD), and the determination coefficient of calibration (Rcal(2)) and validation (Rpre(2)) were simultaneously assessed to optimize the best modeling path. We used three different near-infrared (NIR) datasets, which illustrated that there was more than one modeling path to ensure good modeling. The PLS model optimizes modeling parameters step-by-step, but the robust model described here demonstrates better efficiency than other published papers.

No MeSH data available.


Related in: MedlinePlus