Abstract
Introduction
UV-vis spectroscopy is a cheap and fast technique for quantification of solutes, mainly organic molecules. According to the assumptions of Beer-Lambert equation, if there are several solutes with overlapping spectra, and no mutual interaction affects the spectrum profile; the result would be a superposition of the individual spectra of the solutes.
1
Therefore, when dealing with a sample that may contain two or more solutes with overlapping spectra, there is a need to deconvolute the superimposed spectra.
2
Such spectral resolution may be essential when actual separation of the absorbing species is impossible or may affect the result, as in the case of tautomeric equilibria, short-lived intermediates and low stability complexes.
2
Spectral deconvolution was also suggested in several other applications as in identification and enumeration of the water born parasite
The problem of overlapping spectral bands may be addressed in two main ways:
Mathematical deconvolution of the superimposed spectra to single bands represented as mathematical function (usually Gaussian function) and assigning the bands to the analytes. 2 This method is very useful when the components of the mixture are unknown but may generate artifacts by adding inexistent components while lowering the root mean square error (RMSE). In addition it should be kept in mind that since electronic absorption spectra usually appears as a series of broad and smooth bands rather than as sharp peaks, 6 the representation of the single band as a mathematical function is just an approximation and therefore presents another cause for errors. 2 Several commercial software products are available that provide such spectral deconvolution as “Specfit” (http://www.pha.jhu.edu/~gak/specfit.html) and “Peakfit” Sysat softwer Inc. (http://www.systat.com/products/PeakFit/).
The second approach, which is useful when having previous knowledge of the identity of the absorbing components, and seeking for their concentration, is selection of the best combination of the several previously known solutes which yields the best fit to the measured spectrum. The drawback of the method is that it is essential to know the spectra of the absorbing components. However, if such knowledge is available this method is less susceptible for artifacts.
In previous studies a preliminary version of a similar program was prepared using electronic worksheet program. 7 However, the program was slow and only two components were considered. In this study we present a program based on a Matlab code that performs such deconvolution for up to three base spectra. The main principle is systematically searching the combination of the three spectra of the components that yield the spectrum which is most similar to that of the mixture.
Assumptions
The assumptions required for such spectral resolution are:
Individual spectra are not affected by interactions with other sample ingredients (in other wordssuperposition can be assumed). 1
Overlapping spectra are not completely identical, at least in part of the measured range.
Detection limits allow measuring the two or three solutes in the same dilution and still obtain acceptable O.D. values. Thus, optical densities of all components in the measured sample are of the same order of magnitude.
The Matlab programming environment is specifically designed for manipulating vectors and matrices. Since spectra can be easily displayed as a row vector in which the position of each number is related to the wavelength (or wave number) and value is related to the measured optical density (O.D.) in each wavelength, the program yields accurate and very fast deconvolutions.
Materials and Methods
Calculation scheme
The program accepts 4 row vectors, each representing a spectrum: the mixture spectra, with the unknown amounts of up to three components (
Assuming
where
where
The problem of resolving a vector to a number of its components with a tolerance level for deviations has typically more than one solution. Although, this problem may be addressed by matrix calculation algorithms available in Matlab as well as other linear algebra softwares, these algorithms uses iterative procedures that returns the first solution encountered by the calculation procedure that conforms to the defined tolerance. Thus, using such approach yields a solution that is based on a local RMSE minimum which is not necessarily the best solution. In the code presented here, the program searches all the possible solutions (within the defined range and using a defined interval) and chooses the best one, thus, the value of RMSE is an absolute minimum and therefore provides a higher reliability of the calculation procedure.
The User Interface window
The graphical user interface (Fig. 1) enables the user to redefine the spectra and the calculation parameters while viewing the graphical representation of the calculated and the measured spectra.

The user interface window.
Examples:
In order to verify the efficiency and accuracy of the spectral resolution code, 14 test tubes containing 10 ml mixtures of 2,4,5-trichlorophenol (TCP), 2-chlorophenol (2CP) and 4-chlorophenol (4CP) were prepared. All of the tubes were buffered to pH 10 to ensure that the three solutes existed only as phenolate anions. These solutes were selected due to the high similarity of their UV-vis spectra (Fig. 2), in order to emphasize the abilities of the deconvolution process. All test tubes were measured in an HP 8452A diode-array spectrophotometer in the range of 230-350 nm and the measured spectra were resolved to find TCP, 2CP and 4CP concentrations. In order to make the test as realistic as possible the preparation of the mixtures and the spectral resolution process were done by two different people and the researcher conducting the spectral resolution was unaware of the real composition of the mixture until the end of the process.

UV-vis spectra of TCP, 2CP and 4CP.
The pKa values of TCP and pentachlorophenol (PCP) were estimated by the following procedure: 200 ml of analyte solution (0.2 mM and 0.05 mM for TCP and PCP respectively) were placed on a magnetic stirrer and pH was changed by adding concentrated HCl or NaOH. Added volumes of NaOH and HCl were very small (≤50 μl) thus had negligible effect on analyte concentrations. Samples in the pH range of 4.45-9.06 and 3.53-4.94 for TCP and PCP respectively, were taken out and measured by an HP 8452A diode-array spectrophotometer. All of the measured spectra were resolved to the spectra representing the completely protonated (represented by samples with pH 9.06 and 8.3 for TCP and PCP respectively) and completely deprotonated (represented by samples with pH 4.45 and 2.74 for TCP and PCP respectively).
Results
Method verification
the results of the verification of the spectral separation procedure are presented in Figure 3. The results show a very good fit between the added concentration and the concentration deduced from the spectral resolution process. The root mean square error between the added and calculated concentrations was 1.29 μ which is very reasonable for UV-vis measurements of such solutes.

Added and calculated concentrations of TCP, 2CP and 4CP.
The spectra of TCP and PCP in different pH values are presented in Figures 4a and 4b respectively. As may be observed in Figure 4, the spectra in the intermediate pH value appear to be a superposition of the spectra at a high pH and the spectra at a low pH (representing the deprotonated phenolate specie and the protonated specie respectively).
The protonation reaction of TCP/PCP can be written as

The UV-vis spectra of TCP and PCP at different pH values.
where

(A−)(H+) versus
Calculating the -Log10 of the slope of the TCP line in Figure 5a yields pKa = 6.41. This value is lower than reported in the literature 6.7-6.94.8,9 In order to estimate the reliability of the resolution process, mass conservation of TCP was calculated by summing concentrations of protonated and deprotonated species and dividing the result by the total TCP concentration (i.e. 0.1 mM). The average mass conservation was 100.4% ± 0.458%. Such accuracy in the mass balance may indicate the reliability of the process even though the slight discrepancies with literature values.
Calculating the -Log10 of the slope of the PCP line in Figure 5b yields pKa = 4.14. This value is much lower than reported in the literature for experimental values 4.4-5.26.10,11 The mass conservation of PCP was calculated the same way mentioned for TCP and was 99.3% ± 6.9%.
The very wide range of literature values for PCP may indicate a problem of pKa determination using the conventional titration method (probably because of its limited water solubility). On the other hand, the method of deducing pKa from the slope of such plot appears very sensitive: For example, a change of 0.1 pH units in the pKa value calculated for PCP, would result in a 20.6% decrease or 26% increase in the slope for downward and upward changes respectively.
Even though example [b] focuses on mixtures of two components, example [a] demonstrate that the program can accurately deal with three different components in a mixture. Additional possible mixtures in environmental conditions can be three different dyes dissolved in a textile factory effluents, or two species of the same chemical (like in example [b]), in which the solution includes also unknown concentration of a known impurity, with a well defined UV-visible spectrum, as humic acid- for example.
Discussion
The software presented here, may be useful whenever two or three known solutes with overlapping spectra need to be quantified. In addition, whereas most diode-array operating softwares are able to subtract linear baselines from the spectra, the software presented here may be used for subtraction of any background spectra needed.
The software enables the user to adjust the number of wavelengths (vector length), and the wavelength interval, thus it can be used for different types of diode array spectrophotometers.
Although, the method presented here works even when overlapping between the component spectra are very high (see example a), the main drawback of the method is that it relies on perfect superposition of the component spectra. Thus, whenever there is a deviation from Beer-Lamberts law (for examples the formation of dimers and aggregates), a mutual influence of solutes (for example-complexation) or even influence of pH or ionic strength on the spectra, this method should be avoided.
An important advantage of the proposed software arises from its being an “open code” computing solution, this enables any user with elementary knowledge in programming to understand the calculation procedure and add or change features to suite any specific requirements. As long as the assumptions mentioned earlier are met, the code may be adjusted for use in different analytical methods as fluorescence spectroscopy, chromatography (while time is used instead of wavelength), and with some adjustment possibly even ATR infra red spectrophotometry. The full code was not presented here for brevity; and it may be received by E-mail (free of charge) along with a short user manual following a request to the authors.
