Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

Quantitative Determination of the Multicomponents With Overlapping Ultraviolet Spectra Using Wavelet-Packed Transform and Partial Least Squares

Quantitative Determination of the Multicomponents With Overlapping Ultraviolet Spectra Using... INTRODUCTION Many efforts have been made in order to resolve overlapping signals in spectrophotometry. As a consequence of peak overlapping, the quality of analytical information is lower than what is derived from isolated peaks; the extent of the loss depends on the extent of overlap. In complex samples, however, spectral overlap is often occurring. Strongly overlapped signals do not permit direct determination by traditional methods without previous separation. To overcome this difficulty, multivariate analysis [1–3] such as partial least squares (PLS) and principal components regression (PCR), and so forth have been proved to be useful. When PLS and PCR methods were applied to analyze samples, the first problem encountered is to determine the number of components that they contain. Unfortunately, data obtained from instrumental measurements can be contaminated by noise. The presence of noise often causes overestimating the true number of chemical components. In order to eliminate noise, wavelet packet (WP) denoising method was used as a preprocessing step. Wavelet packet transform (WPT) is an important extension of wavelet transform (WT). WT is a powerful tool with a very rich mathematical content and great potential for application [4, 5]. WPT inherits the property of having a sparse representation of the original signal and time-frequency localization, and offers more flexibility than wavelet analysis [6, 7]. A novel approach tried here is to combine WPT with PLS to eliminate noise and improve the quality of regression. Aniline-type compounds are widely applied in industries such as chemistry, printing, and pharmacy, and are one of the most important raw materials for synthetic medicine, dye, insecticides, polymer, and explosives. Anilinetype compounds are highly poisonous, and can also cause cancer. Therefore, it is very important to test and analyze aniline-type compounds in environmental samples. Simultaneous determination of aniline-type compounds is very difficult due to their overlapping spectra. In this paper, WPTPLS method was developed and used to perform simultaneous determination of α-naphthylamine, p-nitroaniline, and benzidine. Experimental results showed the proposed method to be successful and better than PLS. 2. 2.1. THEORY WPT denoising A wavelet packet W jnk is generated from the base function W jnk (x) = 2− j/2 Wn 2− j x − k , (1) where indices j, n, k are the scale, the oscillation, and the 2 localization parameter, respectively. j, k ∈ Z, Z means the set of integers, n = 0, 1, 2, . . . , 2 j − 1. The discrete wavelet transform (DWT) can be implemented by means of Mallat’s pyramid algorithm [8]. DWT can be characterized as a recursive application of the high-pass and low-pass filters that form a quadrature mirror filter (QMF) pair. The theoretical background about DWT has been described in details [9]. The difference between WT and WPT is the decomposition path. In WPT, both the approximations and details are analyzed. The recursion is simply to filter and downsample all output of the previous level. A fast wavelet packet transform (FWPT) is expressed as W j+1 2n = HW jn , W j+1 2n+1 = GW jn , (2) Journal of Automated Methods and Management in Chemistry measurements were made by a pH-3B digital pH-meter with a glass-saturated calomel dual electrode. All reagents were of analytical reagent grade. The water used was doubly distilled and deionized. Stock standard solutions of 2.000 mgml−1 αnaphthylamine, p-nitroaniline, and benzidine were prepared from correspondent reagents with water as solvents. Standard solutions were then prepared from their stock standard solutions by serial dilution as required. Acetic acid (HAc)sodium acetate (NaAc) buffer solution (pH 6.30) was used. Procedures where W0,0 indicates the measured signal f , H = {hl }l∈Z and G = {gl }l∈Z are the low-pass and high-pass filters matrices. The first and second indices of W indicate the level of decomposition and its position at that level. The reconstruction can be implemented by W jn = H ∗ W j+1 2n + G∗ W j+1 2n+1 , (3) A series of mixed standard solutions containing various ratios of the three kinds of organic compounds was prepared in 25 ml standard flasks, 10.00 ml of HAc-NaAc buffer solution (pH 6.30) was added, and dilution with distilled water to mark. A blank solution was prepared similarly. Spectra were measured in 1 cm cuvettes between 250 nm and 460 nm at 2 nm intervals with respect to a reagent blank. An absorption matrix D was built up. All the values measured were means of three replicate. 3.3. Evaluation of the performance of the test methods where H ∗ and G∗ represent the conjugate matrices of H and G. The wavelet packet denoising procedures include four steps: (1) WPT, (2) estimation of the best basis, (3) thresholding of wavelet packet coefficients, and (4) reconstruction. The best basis is selected according to entropy-based criterion proposed by Coifman and Wickerhauser [6]. Shannon entropy was applied in this case. The thresholding operation is implemented by the SURE method proposed by Donoho [10] based on Stein’s unbiased risk estimation. 2.2. The wavelet packet transform partial least squares method In the method, WPT is used as a tool for removing noise from original data. The denoising is applied to the wavelet packet domain as described above, prior to backtransforming it to original domain. The reconstructed matrices from standard and unknown mixtures were obtained for further PLS operation. The PLS algorithm is built on the properties of the nonlinear iterative partial least squares (NIPALS) algorithm by calculating one latent vector at a time. The NIPALSPLS algorithm and calculating details were described previously [11]. According to this algorithm, the program called PWPTPLS was designed to perform data compression and denoising as well as simultaneous determination. 3. EXPERIMENTAL Absolute and relative standard errors of prediction (SEP and RSEP) were used as the criteria for comparing the performances of the test methods. The SEP for a single component is given by (4); that for all components by (5). The RSEP is given by (6) [12]: SEP = n i=1 m j =1 Ci j − Ci j m (4) SEP = m j =1 Ci j − Ci j nm (5) RSEP = n i=1 2 m j =1 Ci j − Ci j , n m 2 i=1 j =1 Ci j (6) where Ci j and Ci j are the actual and estimated concentrations, respectively, for the ith component in the jth mixture, m is the number of mixtures, and n is the number of components. 4. 4.1. RESULTS AND DISCUSSIONS Absorbance spectra of the system 3.1. Apparatus and reagents The Shimadzu UV-240 spectrophotometer furnished with OPI-2 function was used for all experiments; a legend Pentium IV microcomputer was used for all the calculations; pH Figure 1 shows the absorption spectra of p-nitroaniline αnaphthylamine, and benzidine and their mixed solution. The maximum absorptions of three components were 380 nm, L. Gao and S. Ren Abs. 250 Wavelength (nm) Figure 1: The absorpation spectra 4.000 μgml−1 p-nitroaniline (1); 12.00 μgml−1 α-naphthylamine (2); 3.200 μgml−1 benzidine (3), and their mixture compounds (4). 1 0.5 0 0 20 40 60 w(0, 0) 80 100 120 2 1 0 0 20 w(1, 0) 40 60 20 w(1, 1) 2 1 0 0 20 w(2, 0) 40 0.2 0 0 20 w(2, 1) 40 0.5 0 0 20 w(2, 2) 40 20 w(2, 3) 4 2 0 0 10 w(3, 0) 20 0.1 0 0 10 w(3, 1) 20 0.5 0 0 10 w(3, 2) 20 0.2 0 0 10 w(3, 3) 20 0.2 0 0 10 w(3, 4) 20 0.5 0 0 10 w(3, 5) 20 0.5 0 0 10 w(3, 6) 20 10 w(3, 7) Figure 2: The parts of WP coefficients obtained by wavelet packet transform. 302 nm, and 278 nm, respectively. It can be seen from Figure 1 that the absorption spectra of three components exhibited are seriously overlapped in their absorbing regions, so that for mixed solution only one peak can be recognized. Original 0.5 0.5 Added 0.5% noise Journal of Automated Methods and Management in Chemistry Added 1% noise 0.5 0.5 Added 2% noise Figure 3: Original raw spectra (row 1), reconstructed spectra (row 2), and their difference obtained by means of WP denoising (row 3) for different amounts of noise added (indicated at the head of columns). 4.2. Wavelet packet transform and wavelet packet denoising Here, we selected mean spectra of D matrix as original signal f . WPT of the signal f was carried out using FWPT algorithm. The part of WP coefficients obtained by FWPT is shown in Figure 2. Each coefficient is identified by the couple of index ( j, n), where j is the level of decomposition and n is the position at that level. From Figure 2, it is obvious that the w( j, 0) only contains a positive part and is similar to the original signal. The others are composed of both positive and negative parts. Each block of the coefficients describes the components of the signal f related to a certain frequency band. This flexible time-frequency resolution enables the WP to characterize locally the most relevant parts of a signal and hence to adequately represent a signal with relatively small number of coefficients. In the spectrophotometric measurements, the analytical signals usually center in low-frequency part, whereas the noise in high-frequency part. The aim of WP denoising is to extract the desired signal from a complex instrument output, where the signal is present along with noise. Random Gaussian noise was added to the mean spectra for assessing the WP denoising method. Original and reconstructed spectra as well as their difference with different added noise are displayed in Figure 3. It was found that WPT provides an appropriate approach for denoising even in case where 2% noise is added. Thus the method is safe for preprocessing two-dimensional raw data matrix in the following WPTPLS operation. 4.3. The wavelet packet transform partial least squares method Each of the wavelet functions has different characteristics. The wavelet function, which is optimal for a given signal, is not necessarily the best for another type of signal. Thus, the choice of the wavelet functions is very important for this technique. In this work, the wavelet functions tested were Coiflet 1, 2, . . . , 5, Daubechies 4, 6, . . . , 20, Symmlet 4, 5, . . . , 8. It is possible to use the predictive parameters SEP and RSEP to find the optimum choice of functions. In similar way, one-to-six decomposition levels L were tested L. Gao and S. Ren Table 1: Optimization of wavelet functions. Wavelet function Daubechies 2 Daubechies 4 Daubechies 6 Daubechies 8 Daubechies 10 Daubechies 12 Daubechies 14 Daubechies 16 Daubechies 18 Daubechies 20 Symmlet 5 Symmlet 6 Symmlet 7 Symmlet 8 Coiflet 1 Coiflet 2 Coiflet 3 Coiflet 4 Coiflet 5 SEP ( μgml−1 ) 2.62 2.66 2.70 2.76 2.52 2.57 2.41 2.23 2.47 2.43 2.71 2.72 2.75 2.71 3.35 2.73 2.72 2.69 2.74 RSEP (%) 0.227 0.230 0.233 0.239 0.217 0.222 0.208 0.192 0.214 0.210 0.234 0.235 0.238 0.234 0.289 0.235 0.235 0.232 0.237 L 1 2 3 4 5 6 Table 2: Optimization of wavelet decomposition level. SEP ( μgml−1 ) 0.234 0.236 0.230 0.302 1.39 1.95 RSEP (%) 2.71 2.74 2.66 3.49 16.1 22.6 Table 3: Composition of the training set. Sample no. 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 Concentration (μgml−1 ) II 2.000 4.000 16.00 24.00 2.000 4.000 16.00 24.00 2.000 4.000 16.00 24.00 2.000 4.000 16.00 24.00 too. The influence of wavelet functions and decomposition levels is listed in Tables 1 and 2. According to these experimental results, wavelet functions and decomposition level for WPTPLS method were selected as Daubechies (Db) 16 and 3. A training set of 16 samples formed by the mixture of the three organic compounds was designed according to four-level orthogonal array design with the L16 (45 ) matrix. Table 3 summarizes the composition of the training set. The experimental data obtained from the training set were arranged in matrix D, where each column corresponds the absorbance of different mixtures at a given wavelength and each row represents the spectrum obtained at a given mixture. With FWPT, one can treat each spectrum at a given mixture. Therefore, in the same way each row vector of matrix D and Du was decomposed, and denoised by best-basis selection and thresholding operation, then reconstructed by applying inverse FWPT. Determining the number of factors is one of the most important steps in PLS method. The essence of the step is the pseudorank determination of the raw experimental data. Three principal factors for the case were selected based on previously reported methods [13]. Before starting the WPTPLS calculation, mean centering and data standardization were performed as preprocessing. After this transform, the matrix where each column had zero mean and a variance equals to the unity was obtained. Using program PWPTPLS, the concentrations of the three organic compounds for a test set were calculated. Actual concentrations, found concentrations, and their recoveries are listed in Table 4. The experimental results showed that the SEP and RSEP for all components were 0.192 μgml−1 and 2.23%. I 0.8000 0.8000 0.8000 0.8000 2.400 2.400 2.400 2.400 5.600 5.600 5.600 5.600 8.000 8.000 8.000 8.000 III 0.4000 2.4000 5.600 8.000 2.400 0.4000 8.000 5.600 5.600 8.000 0.4000 2.400 2.400 8.000 5.600 0.4000 A comparison of WPTPLS and PLS In order to evaluate WPTPLS method, two methods were tested in the study with a set of synthetic unknown samples. The RSEP for the two methods are given in Table 5. The RSEP for all components calculated by WPTPLS and PLS methods were 2.23% and 2.71%, respectively. The results demonstrated that the WPTPLS method had better performance than PLS method. 5. CONCLUSIONS A method named WPTPLS was developed for multicomponent spectrophotometric determination. The method combines the idea of the WPT denoising with PLS regression for enhancing noise removal ability and quality of the regression. In WP denoising the time-frequency localization, bestbasis algorithm and thresholding operation were used to improve the quality of denoising. In PLS operation, errors both in the concentration and spectra were taken into account to improve predictive properties. Experimental results show the clear superiority of WPTPLS over PLS method. Journal of Automated Methods and Management in Chemistry Table 4: Actual concentration, found concentration, and percentage recovery of the synthetic unknowns. Actual concentration Sample no. I 1 2 3 4 5 6 7 8 9 1.600 1.600 1.600 4.000 4.000 4.000 6.400 6.400 6.400 (μgml−1 ) II 3.200 12.00 20.00 3.200 12.00 20.00 3.200 12.00 20.00 III 0.8000 4.000 6.400 4.000 6.400 0.8000 6.400 0.8000 4.000 I 1.758 1.547 1.543 3.863 3.987 4.036 6.458 6.323 6.484 Found concentration (μgml−1 ) WPTPLS II 3.474 12.27 19.91 2.790 12.57 19.75 2.961 12.16 19.71 III 0.822 3.975 6.433 4.121 6.317 0.7501 6.495 0.6775 4.009 I 1.760 1.541 1.556 3.866 3.980 4.033 6.460 6.324 6.480 PLS II 3.379 12.48 19.66 2.722 12.75 19.84 2.866 12.09 19.82 III 0.8377 3.951 6.460 4.131 6.295 0.7404 6.501 0.6885 3.995 I 109.9 96.7 96.4 96.6 99.7 100.9 100.9 98.8 101.3 WPTPLS II 108.6 102.2 99.5 87.2 104.8 98.7 92.5 101.3 98.6 III 102.7 99.4 100.5 103.0 98.7 93.8 101.5 84.7 100.2 I 110.0 96.3 97.2 96.6 99.5 100.8 100.9 98.8 101.3 Recovery (%) PLS II 105.6 104.0 98.3 85.1 106.3 99.2 89.6 100.7 99.1 III 104.7 98.8 100.9 103.3 98.4 92.5 101.6 86.1 99.9 Table 5: SEP and RSEP values for organic compounds system by two methods. Methods WPTPLS PLS SEP (μgml−1 ) III 0.0749 0.0829 RSEP (%) III 1.71 1.89 I 0.0866 0.0859 II 0.313 0.387 Total components 0.192 0.234 I 1.94 1.93 II 2.30 2.84 Total components 2.23 2.71 ACKNOWLEDGMENT The authors would like to thank National Natural Science Foundation of China and Natural Science Foundation of Inner Mongolia for financial support of this project. REFERENCES [1] S. Wold, M. Sjostrom, and L. Eriksson, “PLS-regression: a basic tool of chemometrics,” Chemometrics and Intelligent Laboratory Systems, vol. 58, no. 2, pp. 109–130, 2001. [2] F. Yacoub and J. F. MacGregor, “Product optimization and control in the latent variable space of nonlinear PLS models,” Chemometrics and Intelligent Laboratory Systems, vol. 70, no. 1, pp. 63–74, 2004. [3] E. P. Tsaousoglou, S. D. Bolis, and C. E. Efstathiou, “Monte Carlo simulation for the prediction of precision of absorbance measurements with a miniature CCD spectrometer,” Journal of Automated Methods & Management in Chemistry, vol. 25, no. 2, pp. 35–42, 2003. [4] I. Daubechies, “Orthogonal bases of compactly supported wavelets,” Communications on Pure and Applied Mathematics, vol. 41, pp. 909–996, 1988. [5] G. Beylkin, “On the representation of operators in bases of compactly supported wavelets,” SIAM Journal on Numerical Analysis, vol. 29, no. 6, pp. 1716–1740, 1992. [6] R. R. Coifman and M. V. Wickerhauser, “Entropy-based algorithms for best basis selection,” IEEE Transactions on Information Theory, vol. 38, no. 2, pp. 713–718, 1992. [7] B. Jawerth and W. Sweldens, “An overview of wavelet based multiresolution analyses,” SIAM Review, vol. 36, no. 3, pp. 377–412, 1994. [8] S. G. Mallat, “Multiresolution approximations and wavelet orthonormal bases of L2 (R),” Transactions of the American Mathematical Society, vol. 315, no. 1, pp. 69–87, 1989. [9] S. Ren and L. Gao, “Simultaneous quantitative analysis of overlapping spectrophotometric signals using wavelet multiresolution analysis and partial least squares,” Talanta, vol. 50, no. 6, pp. 1163–1173, 2000. [10] D. L. Donoho, “De-noising by soft-thresholding,” IEEE Transactions on Information Theory, vol. 41, no. 3, pp. 613–627, 1995. [11] L. Gao and S. Ren, “Simultaneous spectrometric determination of manganese, zinc and cobalt by Kernel partial leastsquares method,” Journal of Automatic Chemistry, vol. 20, no. 6, pp. 179–183, 1998. [12] M. Otto and W. Wegscheider, “Spectrophotometric multicomponent analysis applied to trace metal determinations,” Analytical Chemistry, vol. 57, no. 1, pp. 63–69, 1985. [13] S. Ren and L. Gao, “Simultaneous spectrophotometric determination of copper (II), lead (II), and cadmium (II),” Journal of Automatic Chemistry, vol. 17, no. 3, pp. 115–118, 1995. http://www.deepdyve.com/assets/images/DeepDyve-Logo-lg.png Journal of Automated Methods and Management in Chemistry Hindawi Publishing Corporation

Quantitative Determination of the Multicomponents With Overlapping Ultraviolet Spectra Using Wavelet-Packed Transform and Partial Least Squares

Loading next page...
 
/lp/hindawi-publishing-corporation/quantitative-determination-of-the-multicomponents-with-overlapping-zFPPOr5Z4c

References

References for this paper are not available at this time. We will be adding them shortly, thank you for your patience.

Publisher
Hindawi Publishing Corporation
Copyright
Copyright © 2006 Ling Gao and Shouxin Ren.
ISSN
1463-9246
eISSN
1464-5068
Publisher site
See Article on Publisher Site

Abstract

INTRODUCTION Many efforts have been made in order to resolve overlapping signals in spectrophotometry. As a consequence of peak overlapping, the quality of analytical information is lower than what is derived from isolated peaks; the extent of the loss depends on the extent of overlap. In complex samples, however, spectral overlap is often occurring. Strongly overlapped signals do not permit direct determination by traditional methods without previous separation. To overcome this difficulty, multivariate analysis [1–3] such as partial least squares (PLS) and principal components regression (PCR), and so forth have been proved to be useful. When PLS and PCR methods were applied to analyze samples, the first problem encountered is to determine the number of components that they contain. Unfortunately, data obtained from instrumental measurements can be contaminated by noise. The presence of noise often causes overestimating the true number of chemical components. In order to eliminate noise, wavelet packet (WP) denoising method was used as a preprocessing step. Wavelet packet transform (WPT) is an important extension of wavelet transform (WT). WT is a powerful tool with a very rich mathematical content and great potential for application [4, 5]. WPT inherits the property of having a sparse representation of the original signal and time-frequency localization, and offers more flexibility than wavelet analysis [6, 7]. A novel approach tried here is to combine WPT with PLS to eliminate noise and improve the quality of regression. Aniline-type compounds are widely applied in industries such as chemistry, printing, and pharmacy, and are one of the most important raw materials for synthetic medicine, dye, insecticides, polymer, and explosives. Anilinetype compounds are highly poisonous, and can also cause cancer. Therefore, it is very important to test and analyze aniline-type compounds in environmental samples. Simultaneous determination of aniline-type compounds is very difficult due to their overlapping spectra. In this paper, WPTPLS method was developed and used to perform simultaneous determination of α-naphthylamine, p-nitroaniline, and benzidine. Experimental results showed the proposed method to be successful and better than PLS. 2. 2.1. THEORY WPT denoising A wavelet packet W jnk is generated from the base function W jnk (x) = 2− j/2 Wn 2− j x − k , (1) where indices j, n, k are the scale, the oscillation, and the 2 localization parameter, respectively. j, k ∈ Z, Z means the set of integers, n = 0, 1, 2, . . . , 2 j − 1. The discrete wavelet transform (DWT) can be implemented by means of Mallat’s pyramid algorithm [8]. DWT can be characterized as a recursive application of the high-pass and low-pass filters that form a quadrature mirror filter (QMF) pair. The theoretical background about DWT has been described in details [9]. The difference between WT and WPT is the decomposition path. In WPT, both the approximations and details are analyzed. The recursion is simply to filter and downsample all output of the previous level. A fast wavelet packet transform (FWPT) is expressed as W j+1 2n = HW jn , W j+1 2n+1 = GW jn , (2) Journal of Automated Methods and Management in Chemistry measurements were made by a pH-3B digital pH-meter with a glass-saturated calomel dual electrode. All reagents were of analytical reagent grade. The water used was doubly distilled and deionized. Stock standard solutions of 2.000 mgml−1 αnaphthylamine, p-nitroaniline, and benzidine were prepared from correspondent reagents with water as solvents. Standard solutions were then prepared from their stock standard solutions by serial dilution as required. Acetic acid (HAc)sodium acetate (NaAc) buffer solution (pH 6.30) was used. Procedures where W0,0 indicates the measured signal f , H = {hl }l∈Z and G = {gl }l∈Z are the low-pass and high-pass filters matrices. The first and second indices of W indicate the level of decomposition and its position at that level. The reconstruction can be implemented by W jn = H ∗ W j+1 2n + G∗ W j+1 2n+1 , (3) A series of mixed standard solutions containing various ratios of the three kinds of organic compounds was prepared in 25 ml standard flasks, 10.00 ml of HAc-NaAc buffer solution (pH 6.30) was added, and dilution with distilled water to mark. A blank solution was prepared similarly. Spectra were measured in 1 cm cuvettes between 250 nm and 460 nm at 2 nm intervals with respect to a reagent blank. An absorption matrix D was built up. All the values measured were means of three replicate. 3.3. Evaluation of the performance of the test methods where H ∗ and G∗ represent the conjugate matrices of H and G. The wavelet packet denoising procedures include four steps: (1) WPT, (2) estimation of the best basis, (3) thresholding of wavelet packet coefficients, and (4) reconstruction. The best basis is selected according to entropy-based criterion proposed by Coifman and Wickerhauser [6]. Shannon entropy was applied in this case. The thresholding operation is implemented by the SURE method proposed by Donoho [10] based on Stein’s unbiased risk estimation. 2.2. The wavelet packet transform partial least squares method In the method, WPT is used as a tool for removing noise from original data. The denoising is applied to the wavelet packet domain as described above, prior to backtransforming it to original domain. The reconstructed matrices from standard and unknown mixtures were obtained for further PLS operation. The PLS algorithm is built on the properties of the nonlinear iterative partial least squares (NIPALS) algorithm by calculating one latent vector at a time. The NIPALSPLS algorithm and calculating details were described previously [11]. According to this algorithm, the program called PWPTPLS was designed to perform data compression and denoising as well as simultaneous determination. 3. EXPERIMENTAL Absolute and relative standard errors of prediction (SEP and RSEP) were used as the criteria for comparing the performances of the test methods. The SEP for a single component is given by (4); that for all components by (5). The RSEP is given by (6) [12]: SEP = n i=1 m j =1 Ci j − Ci j m (4) SEP = m j =1 Ci j − Ci j nm (5) RSEP = n i=1 2 m j =1 Ci j − Ci j , n m 2 i=1 j =1 Ci j (6) where Ci j and Ci j are the actual and estimated concentrations, respectively, for the ith component in the jth mixture, m is the number of mixtures, and n is the number of components. 4. 4.1. RESULTS AND DISCUSSIONS Absorbance spectra of the system 3.1. Apparatus and reagents The Shimadzu UV-240 spectrophotometer furnished with OPI-2 function was used for all experiments; a legend Pentium IV microcomputer was used for all the calculations; pH Figure 1 shows the absorption spectra of p-nitroaniline αnaphthylamine, and benzidine and their mixed solution. The maximum absorptions of three components were 380 nm, L. Gao and S. Ren Abs. 250 Wavelength (nm) Figure 1: The absorpation spectra 4.000 μgml−1 p-nitroaniline (1); 12.00 μgml−1 α-naphthylamine (2); 3.200 μgml−1 benzidine (3), and their mixture compounds (4). 1 0.5 0 0 20 40 60 w(0, 0) 80 100 120 2 1 0 0 20 w(1, 0) 40 60 20 w(1, 1) 2 1 0 0 20 w(2, 0) 40 0.2 0 0 20 w(2, 1) 40 0.5 0 0 20 w(2, 2) 40 20 w(2, 3) 4 2 0 0 10 w(3, 0) 20 0.1 0 0 10 w(3, 1) 20 0.5 0 0 10 w(3, 2) 20 0.2 0 0 10 w(3, 3) 20 0.2 0 0 10 w(3, 4) 20 0.5 0 0 10 w(3, 5) 20 0.5 0 0 10 w(3, 6) 20 10 w(3, 7) Figure 2: The parts of WP coefficients obtained by wavelet packet transform. 302 nm, and 278 nm, respectively. It can be seen from Figure 1 that the absorption spectra of three components exhibited are seriously overlapped in their absorbing regions, so that for mixed solution only one peak can be recognized. Original 0.5 0.5 Added 0.5% noise Journal of Automated Methods and Management in Chemistry Added 1% noise 0.5 0.5 Added 2% noise Figure 3: Original raw spectra (row 1), reconstructed spectra (row 2), and their difference obtained by means of WP denoising (row 3) for different amounts of noise added (indicated at the head of columns). 4.2. Wavelet packet transform and wavelet packet denoising Here, we selected mean spectra of D matrix as original signal f . WPT of the signal f was carried out using FWPT algorithm. The part of WP coefficients obtained by FWPT is shown in Figure 2. Each coefficient is identified by the couple of index ( j, n), where j is the level of decomposition and n is the position at that level. From Figure 2, it is obvious that the w( j, 0) only contains a positive part and is similar to the original signal. The others are composed of both positive and negative parts. Each block of the coefficients describes the components of the signal f related to a certain frequency band. This flexible time-frequency resolution enables the WP to characterize locally the most relevant parts of a signal and hence to adequately represent a signal with relatively small number of coefficients. In the spectrophotometric measurements, the analytical signals usually center in low-frequency part, whereas the noise in high-frequency part. The aim of WP denoising is to extract the desired signal from a complex instrument output, where the signal is present along with noise. Random Gaussian noise was added to the mean spectra for assessing the WP denoising method. Original and reconstructed spectra as well as their difference with different added noise are displayed in Figure 3. It was found that WPT provides an appropriate approach for denoising even in case where 2% noise is added. Thus the method is safe for preprocessing two-dimensional raw data matrix in the following WPTPLS operation. 4.3. The wavelet packet transform partial least squares method Each of the wavelet functions has different characteristics. The wavelet function, which is optimal for a given signal, is not necessarily the best for another type of signal. Thus, the choice of the wavelet functions is very important for this technique. In this work, the wavelet functions tested were Coiflet 1, 2, . . . , 5, Daubechies 4, 6, . . . , 20, Symmlet 4, 5, . . . , 8. It is possible to use the predictive parameters SEP and RSEP to find the optimum choice of functions. In similar way, one-to-six decomposition levels L were tested L. Gao and S. Ren Table 1: Optimization of wavelet functions. Wavelet function Daubechies 2 Daubechies 4 Daubechies 6 Daubechies 8 Daubechies 10 Daubechies 12 Daubechies 14 Daubechies 16 Daubechies 18 Daubechies 20 Symmlet 5 Symmlet 6 Symmlet 7 Symmlet 8 Coiflet 1 Coiflet 2 Coiflet 3 Coiflet 4 Coiflet 5 SEP ( μgml−1 ) 2.62 2.66 2.70 2.76 2.52 2.57 2.41 2.23 2.47 2.43 2.71 2.72 2.75 2.71 3.35 2.73 2.72 2.69 2.74 RSEP (%) 0.227 0.230 0.233 0.239 0.217 0.222 0.208 0.192 0.214 0.210 0.234 0.235 0.238 0.234 0.289 0.235 0.235 0.232 0.237 L 1 2 3 4 5 6 Table 2: Optimization of wavelet decomposition level. SEP ( μgml−1 ) 0.234 0.236 0.230 0.302 1.39 1.95 RSEP (%) 2.71 2.74 2.66 3.49 16.1 22.6 Table 3: Composition of the training set. Sample no. 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 Concentration (μgml−1 ) II 2.000 4.000 16.00 24.00 2.000 4.000 16.00 24.00 2.000 4.000 16.00 24.00 2.000 4.000 16.00 24.00 too. The influence of wavelet functions and decomposition levels is listed in Tables 1 and 2. According to these experimental results, wavelet functions and decomposition level for WPTPLS method were selected as Daubechies (Db) 16 and 3. A training set of 16 samples formed by the mixture of the three organic compounds was designed according to four-level orthogonal array design with the L16 (45 ) matrix. Table 3 summarizes the composition of the training set. The experimental data obtained from the training set were arranged in matrix D, where each column corresponds the absorbance of different mixtures at a given wavelength and each row represents the spectrum obtained at a given mixture. With FWPT, one can treat each spectrum at a given mixture. Therefore, in the same way each row vector of matrix D and Du was decomposed, and denoised by best-basis selection and thresholding operation, then reconstructed by applying inverse FWPT. Determining the number of factors is one of the most important steps in PLS method. The essence of the step is the pseudorank determination of the raw experimental data. Three principal factors for the case were selected based on previously reported methods [13]. Before starting the WPTPLS calculation, mean centering and data standardization were performed as preprocessing. After this transform, the matrix where each column had zero mean and a variance equals to the unity was obtained. Using program PWPTPLS, the concentrations of the three organic compounds for a test set were calculated. Actual concentrations, found concentrations, and their recoveries are listed in Table 4. The experimental results showed that the SEP and RSEP for all components were 0.192 μgml−1 and 2.23%. I 0.8000 0.8000 0.8000 0.8000 2.400 2.400 2.400 2.400 5.600 5.600 5.600 5.600 8.000 8.000 8.000 8.000 III 0.4000 2.4000 5.600 8.000 2.400 0.4000 8.000 5.600 5.600 8.000 0.4000 2.400 2.400 8.000 5.600 0.4000 A comparison of WPTPLS and PLS In order to evaluate WPTPLS method, two methods were tested in the study with a set of synthetic unknown samples. The RSEP for the two methods are given in Table 5. The RSEP for all components calculated by WPTPLS and PLS methods were 2.23% and 2.71%, respectively. The results demonstrated that the WPTPLS method had better performance than PLS method. 5. CONCLUSIONS A method named WPTPLS was developed for multicomponent spectrophotometric determination. The method combines the idea of the WPT denoising with PLS regression for enhancing noise removal ability and quality of the regression. In WP denoising the time-frequency localization, bestbasis algorithm and thresholding operation were used to improve the quality of denoising. In PLS operation, errors both in the concentration and spectra were taken into account to improve predictive properties. Experimental results show the clear superiority of WPTPLS over PLS method. Journal of Automated Methods and Management in Chemistry Table 4: Actual concentration, found concentration, and percentage recovery of the synthetic unknowns. Actual concentration Sample no. I 1 2 3 4 5 6 7 8 9 1.600 1.600 1.600 4.000 4.000 4.000 6.400 6.400 6.400 (μgml−1 ) II 3.200 12.00 20.00 3.200 12.00 20.00 3.200 12.00 20.00 III 0.8000 4.000 6.400 4.000 6.400 0.8000 6.400 0.8000 4.000 I 1.758 1.547 1.543 3.863 3.987 4.036 6.458 6.323 6.484 Found concentration (μgml−1 ) WPTPLS II 3.474 12.27 19.91 2.790 12.57 19.75 2.961 12.16 19.71 III 0.822 3.975 6.433 4.121 6.317 0.7501 6.495 0.6775 4.009 I 1.760 1.541 1.556 3.866 3.980 4.033 6.460 6.324 6.480 PLS II 3.379 12.48 19.66 2.722 12.75 19.84 2.866 12.09 19.82 III 0.8377 3.951 6.460 4.131 6.295 0.7404 6.501 0.6885 3.995 I 109.9 96.7 96.4 96.6 99.7 100.9 100.9 98.8 101.3 WPTPLS II 108.6 102.2 99.5 87.2 104.8 98.7 92.5 101.3 98.6 III 102.7 99.4 100.5 103.0 98.7 93.8 101.5 84.7 100.2 I 110.0 96.3 97.2 96.6 99.5 100.8 100.9 98.8 101.3 Recovery (%) PLS II 105.6 104.0 98.3 85.1 106.3 99.2 89.6 100.7 99.1 III 104.7 98.8 100.9 103.3 98.4 92.5 101.6 86.1 99.9 Table 5: SEP and RSEP values for organic compounds system by two methods. Methods WPTPLS PLS SEP (μgml−1 ) III 0.0749 0.0829 RSEP (%) III 1.71 1.89 I 0.0866 0.0859 II 0.313 0.387 Total components 0.192 0.234 I 1.94 1.93 II 2.30 2.84 Total components 2.23 2.71 ACKNOWLEDGMENT The authors would like to thank National Natural Science Foundation of China and Natural Science Foundation of Inner Mongolia for financial support of this project. REFERENCES [1] S. Wold, M. Sjostrom, and L. Eriksson, “PLS-regression: a basic tool of chemometrics,” Chemometrics and Intelligent Laboratory Systems, vol. 58, no. 2, pp. 109–130, 2001. [2] F. Yacoub and J. F. MacGregor, “Product optimization and control in the latent variable space of nonlinear PLS models,” Chemometrics and Intelligent Laboratory Systems, vol. 70, no. 1, pp. 63–74, 2004. [3] E. P. Tsaousoglou, S. D. Bolis, and C. E. Efstathiou, “Monte Carlo simulation for the prediction of precision of absorbance measurements with a miniature CCD spectrometer,” Journal of Automated Methods & Management in Chemistry, vol. 25, no. 2, pp. 35–42, 2003. [4] I. Daubechies, “Orthogonal bases of compactly supported wavelets,” Communications on Pure and Applied Mathematics, vol. 41, pp. 909–996, 1988. [5] G. Beylkin, “On the representation of operators in bases of compactly supported wavelets,” SIAM Journal on Numerical Analysis, vol. 29, no. 6, pp. 1716–1740, 1992. [6] R. R. Coifman and M. V. Wickerhauser, “Entropy-based algorithms for best basis selection,” IEEE Transactions on Information Theory, vol. 38, no. 2, pp. 713–718, 1992. [7] B. Jawerth and W. Sweldens, “An overview of wavelet based multiresolution analyses,” SIAM Review, vol. 36, no. 3, pp. 377–412, 1994. [8] S. G. Mallat, “Multiresolution approximations and wavelet orthonormal bases of L2 (R),” Transactions of the American Mathematical Society, vol. 315, no. 1, pp. 69–87, 1989. [9] S. Ren and L. Gao, “Simultaneous quantitative analysis of overlapping spectrophotometric signals using wavelet multiresolution analysis and partial least squares,” Talanta, vol. 50, no. 6, pp. 1163–1173, 2000. [10] D. L. Donoho, “De-noising by soft-thresholding,” IEEE Transactions on Information Theory, vol. 41, no. 3, pp. 613–627, 1995. [11] L. Gao and S. Ren, “Simultaneous spectrometric determination of manganese, zinc and cobalt by Kernel partial leastsquares method,” Journal of Automatic Chemistry, vol. 20, no. 6, pp. 179–183, 1998. [12] M. Otto and W. Wegscheider, “Spectrophotometric multicomponent analysis applied to trace metal determinations,” Analytical Chemistry, vol. 57, no. 1, pp. 63–69, 1985. [13] S. Ren and L. Gao, “Simultaneous spectrophotometric determination of copper (II), lead (II), and cadmium (II),” Journal of Automatic Chemistry, vol. 17, no. 3, pp. 115–118, 1995.

Journal

Journal of Automated Methods and Management in ChemistryHindawi Publishing Corporation

Published: May 11, 2006

There are no references for this article.