Abstract
Introduction
Infrared (IR) and Raman spectroscopies are called as vibrational spectroscopy which are commonly used for analysis of different states of samples (solid, liquid, and semi-solid). Vibrational spectroscopy (VS) allows the production of unique fingerprint spectra due to the interaction between electromagnetic radiation (EMR) in infrared regions with biomedical samples to resulting the molecular vibrations [6,43]. VS is used to describe the collective terms of two powerful spectroscopic techniques, namely IR spectroscopy either in near region corresponding to wavenumber (
Raman and IR are complementary techniques that allow real-time analysis of analytes of interest in biomedical samples with good sensitivity as expressed with relatively low detection limit, since both techniques are non-destructive analytical technique in which the analyzed samples could be analyzed further with other analytical techniques such as chromatographic-based methods [38]. Besides, VS provides highly specific information of target compounds which allow the identification of analytes by a structural and biochemical diversity. Both techniques are also considered as green analytical technique because of its minimal uses of solvents and chemical reagents during analysis, therefore the environment pollution can be hindered [15].
Due to the advancement of instrumentation, Fourier transform infrared (FTIR) and FT-Raman have been introduced and popular over the dispersive instruments. FTIR spectroscopy measures the absorption, reflection, or transmission of EMR in mid-infrared, due to the interaction of the electric dipole moment of the molecule bands with the infrared radiation [12]. FTIR spectrum is a plot of intensity of the absorbed or transmitted light (
Methods
During performing this review, we explored several databases such as Science Citation index, PubMed, Medline, Scopus, and Google Scholar to identify and to download the abstracts, reports, review articles and research papers related to the vibrational spectroscopy (FTIR and Raman) and its application in quantitative analysis of biomolecules in the biofluids. The keywords used during searching of information were: FTIR spectroscopy or Raman spectroscopy + chemometrics + biomolecules (lipid, protein, nucleic acid and carbohydrates).
Analysis of biomolecules in biofluids
Biofluids have several biomolecules which can be used as biomarkers to detect some types of diseases, since these biomolecules are in direct contact with the human organs [27,34]. The other advantage of biofluids as matrix samples is its availability and ease to get blood and urine for repeated testing. Biofluids are also potential to be used in monitoring the progression of diseases or treatment [4]. With this in mind, the level changes of biomolecules in biofluids can provide substantial amount of information about patient diseases in order to improve the clinical analysis. Maes et al. [42] reported that there is a relationship between the changes of lipid levels in the membranes of red blood cells with the severity of affective disorders. Other studies also reported that depression correlated with the increased concentration of the triglycerides and low-density lipoprotein (LDL)-cholesterol. The deficiency of phospholipids is also used as depression marker [2,64]. The measurement of protein concentrations is useful to interpret some biological parameters, discuss the nutritional status, and to help the diagnosis of some diseases. Specific proteins like immunoglobulin G and human serum albumin may be changed during certain diseases such as inflammation and infection [5]. The changes of nucleic acids are also correlated with certain diseases like cancers [50]. In addition, the levels of carbohydrates were associated with diabetes mellitus [24].
Chemometrics
FTIR and Raman spectra in biofluids are complex in nature, therefore, the sophisticated statistical technique, known as chemometrics, must be used to solve the analytical problems related to the quantitative analysis purposes. Chemometrics is the application of mathematics and statistics to treat chemical data such as biomolecules in biofluids which help analytical chemists to deal with all steps of analytical procedures, starting from the experimental design and optimization through extraction of chemical information and final decision [14]. Chemometrics allows the simultaneous determination of several independent variables (factors) against several dependent variables or responses [32].
The chemometrics techniques commonly used in FTIR and Raman spectroscopy include (1) spectral processing using derivatization (first and second derivatives), standard normal variate (SNV), multiplicative scattering correction (MSC), filtering, wavelet transformations (WT), feature selection, and folding-unfolding, (2) classification techniques which can be either supervised pattern recognition such as principal component analysis (PCA) or unsupervised pattern recognition such as linear discriminant analysis (LDA), partial least square-discriminant analysis (PLS-DA), and orthogonal projection to latent structures-discriminant analysis (OPLS-DA) and (3) multivariate calibrations used for quantitative analysis of biomolecules such as stepwise multiple linear regression (SMLR), principle component analysis (PCR), partial least square regression (PLSR) and modified PLSR [37,52].

The general steps during the application of vibrational spectroscopy (FTIR and Raman spectroscopy) combined with chemometrics for analysis of biomolecules such as proteins, lipids and nucleic acids in biofluids (adapted from depciuch and parlinska-wojtan [16]).
Figure 1 showed the general scheme of chemometrics techniques applied to analysis of biomolecules in biofluid using FTIR and Raman spectra as variables [59]. Some user friendly statistical and chemometrics softwares are available which are either free or commercially available to perform statistical analysis of complex data with its advantages and features. Such softwares are Unscrambler®, SIMCA® SIRIUS®, and Pirouette® offering standard methods of multivariate statistical analysis including chemometrics classification (PCA and SIMCA) as well as multivariate calibrations (PCR, PLS, SMLR). The softwares of Minitab® Matlab® are routinely designed to facilitate the writing of personal programs, while Grams® 32 is particularly useful for calibration modelling during quantitative analysis rather than for the exploration of a data matrix and classification by different pattern recognition techniques [46,51].
The combination of FTIR spectroscopy and rapid sampling technique such as attenuated total reflectance (ATR) has accelerated the application of this method. ATR-FTIR spectroscopy has been reported to be suitable technique for analysis of analytes in biomedical materials, including serum. Currently, FTIR spectroscopy, combined with multivariate calibrations, are popular to be applied in quantitative analysis in biomedical analysis because they obey the principles of the Beer-Lambert law in which the absorbance of certain functional groups are proportional with concentrations of analytes [43,62]. This enables ATR-FTIR spectroscopy to quantify specific biomolecule concentrations, as the proportion of light absorbed by the sample will correlate with the concentration of molecules within a sample. Table 1 provided the use of FTIR spectroscopy for analysis of biomolecules (proteins, lipids, nucleic acids and carbohydrates) in the biomedical samples. The use of FTIR spectroscopy combined with certain chemometrics allows the direct measurement of chemical constituents in biomedical matrices without the use of specific reagents. This could be a step forward for the simplification of analytical steps in clinical laboratories. Problems related to the analytical procedure such as improper handling and lot-to-lot variations could be eliminated by direct measurement [49,57].
The application of FTIR spectroscopy combined with chemometrics for analysis of biomolecules in biological fluids
PLSR = partial least square regression ; PCA = principal component analysis ; PCA-LDA = principal component analysis-linear discriminant analysis ; GA-LDA = genetic algorithm-linear discriminant analysis .
The application of FTIR spectroscopy combined with chemometrics for analysis of biomolecules in biological fluids

Typical FTIR spectra of biomolecules in biofluids at wavenumber of 3.000–800 cm−1, along with corresponding functional groups.
Figure 2 revealed the functional groups responsible for IR absorption for the specific biomolecules in biomedical fluids especially proteins, lipids, nucleic acids and carbohydrates. The amide groups correspond to
FTIR spectra of lipids were characterized by absorption peaks at 2963 cm−1, 2922 cm−1 and 2850 cm−1 corresponding to asymmetric stretching vibrations of CH3, CH2 and symmetric vibration of CH2, respectively. In addition, phospholipid was characterized by absorption peaks at
Jessen et al. [33] have developed FTIR spectroscopy combined with PLSR for quantitative analysis of glucose in serum using combined 1/
Qualitative and quantitative analysis of protein
The changes in protein concentration in human blood can be used as an indicator for the nutritional status and certain illnesses state. ATR-FTIR spectroscopy has been applied to detect the minor difference of protein qualitatively and quantitatively in the biological samples [60]. The human serum samples spiked with proteins of human serum albumin (HSA) and immunoglobulin G (IgG) at seven different concentrations were directly analyzed on crystal ATR and their spectra were used for making prediction and validation models using multivariate calibration of PLSR. The wavenumber (
The combination of ATR-FTIR spectra and multivariate calibration of PLSR was succesfully applied for quantification of IgA, IgG, IgM in the plasma samples. The spectra of dried human plasma samples were analyzed at mid IR region (4000 to 500 cm−1) at resolution of 2 cm−1 and number of scanning of 32. The
The complex nature of biological fluid samples is still the limiting factors for direct analysis using ATR-FTIR spectroscopy. For this reason, some innovations by combining sample preparation step and direct measurement using FTIR spectroscopy has been proposed. Elsohaby et al. [22] have coupled centrifugal ultrafiltration and ATR-FTIR spectroscopy for quantitative analysis of IgA in human serum. The actual values of IgA were determined using reference method of immunoturbidimetric assay. The whole serum samples and residue reconstituted with saline after centrifugal ultrafiltration were subjected to FTIR spectral measurement at 4000-650 cm−1. PLSR calibration models using 5 factors (for whole serum) and 9 factors (for the reconstituted residue) was succesfully applied for relating the actual values of IgA and FTIR predicted values. The RMSEP value reported was in the range of 79-102 mg/dL.
ATR-FTIR at
Quantitative analysis of lipids
ATR-FTIR spectra have been applied for rapid analysis of very low density lipoprotein-triglyceride (VLDL-TG) serum in maternal serum without any sample preparation. The multivariate calibration of PLS was applied for prediction the levels of VLDL-TG. Using second derivative FTIR spectra at combined
The presence of triglycerides (TG) in serum was analyzed using FTIR spectroscopy and multivariate calibration of PLSR. FTIR spectra were subjected to mean centering and first derivatization. The combined 1/
Analysis of nucleic acids in biomedical fluids
Quantification of DNA in eukaryotic cells has been performed using FTIR spectroscopy method in combination with chemometrics of PLS regression. DNA quantification was performed using standard consisting of DNA and hemoglobin standards. PLS model was built at the fingerprint region of FTIR spectra (1800-850 cm−1). The model was evaluated using
FTIR spectroscopy has been used for analysis of nucleic acid in bladder cancer subject. FTIR measurement was performed at mid infrared region (4000-400 cm−1) using resolution of 4 at 100 scans number. The peak of PO2
− stretching vibration in nucleic acid appeared specific at 1086 cm−1 measured using FTIR spectrophotometer. In the condition of bladder cancer, the peak shifted to around 1087 ± 0.87 cm−1 compared to control. Some vibrations especially stretching vibration was increased significantly in cancer subject compared to control. The peak appeared at 967 cm−1 could be considered as marker peak to detect tumors. FTIR measurement of sample from bladder cancer subjects showed slight shifting from 967.22 ± 0.33 cm−1 to 967.40 ± 0.31 cm−1. Moreover, the peak at
Application of Raman spectroscopy for analysis of biomolecules in biomedical fluids
There are several variants of FT-Raman spectroscopy applying the phenomenon of Raman scattering in different ways which included Conventional Raman Spectroscopy (CRS), Surface-enhanced Raman Spectroscopy (SERS), Resonance Raman Spectroscopy (RRS), coherent anti-Stokes Raman spectroscopy (CARS), resonance enhanced Raman scattering (RRS) and spatially offset Raman spectroscopy (SORS). The selection of these variants depends on some inherent factors namely the sample complexity, the type of samples, and the concentrations of the target analytes. CRS is suitable technique in biomedical analysis due to its capability to provide high efficiency, but this technique is not suitable for analysis of analytes with low concentration, therefore, analyst should use other variants capable of amplifying the analyte signals [11]. Table 2 provided the use of Raman spectroscopy for analysis of biomolecules (proteins, lipids, nucleic acids and glucose) in biomedical samples.
The application of Raman spectroscopy combined with chemometrics for analysis of biomolecules in biological fluids
PLSR = partial least square regression ; PCA = principal component analysis ; PCA-LDA = principal component analysis-linear discriminant analysis ; PLS-DA = partial least square-discriminant analysis .
The application of Raman spectroscopy combined with chemometrics for analysis of biomolecules in biological fluids
Application of Raman spectroscopy has been explored for glucose analysis in blood serum. Sugar concentration in human diabetic serum has been successfully analyzed using Raman Spectroscopy. Blood serum samples were obtained from normal and diabetic subjects. Raman measurement was performed in the range of 2000-800 cm−1 at 532 nm. Raman spectra of glucose in diabetic mellitus patients appeared at 1168 cm−1 which correspond to the carbohydrate peak. This peak appeared at very low intensity or almost lacking intensity in the normal serum (healthy subjects). The C=C (carbon double bond) could be observed from peak appeared at 1650 cm−1. One specific peak at 1463 cm−1 appeared only in blood serum from diabetic subjects that can be used as marker peak of diabetic subject. The glucose peaks at 1168 cm−1 and 1531 cm−1 were plotted against intensity to determine glucose concentration. Eventhough glucose from normal serum still could be detected, the intensity of glucose peaks from diabetic serum showed very high intensity than normal serum. Therefore, the increasing concentration of glucose could be detected using Raman spectroscopy to differentiate glucose level from serum of normal and diabetic subjects [26].
Quantitative analysis of glucose in human serum has been performed using Raman spectroscopy and chemometrics of PLS regression method. Raman measurement was carried out using diode laser at 830 nm, and each sample was measured in three replicates. The measurement was performed in the range from 1800-400 cm−1. The bands of glucose were observed at 507 cm−1 (skeletal modes of glucose), 1065 cm−1 (C–C, C–O, C–H and C–OH modes), and 1128 cm−1 (C–O stretching). Raman spectra were subjected to PLS regression to predict glucose concentration in human serum measured by Raman spectroscopy. The actual value of glucose was obtained from colorimetric method measurement. PLS model was evaluated using
Qualitative and quantitative analysis of lipids using Raman spectroscopy
Conventional Raman spectroscopy (CRS) has been used for analysis of very low-density lipoprotein-triglyceride (VLDL-TG) serum in maternal serum without any sample preparation. The samples were excited using Nd:YAG 1064 nm laser at power of 300 mW. Raman signal was collected at 1/
Quantification of lipid components has been performed using Raman spectroscopy and chemometrics of PLS regression. The lipid components consist of cholesterol, triglycerides, high-density lipoprotein (HDL), and low-density lipoprotein (LDL). Raman spectral data were collected using a near-infrared dispersive Raman spectrometer. The diode laser is 830 nm with 25 mW. For excitation and collection of Raman spectra, a Raman probe was used and connected to spectrometer. PLS model was carried out using wavenumber of 1800-400 cm−1. The actual concentration of cholesterol, triglycerides, HDL, and LDL were measured using colorimetric methods. Lipid bands could be observed at 1676 cm−1 (C=C stretching of cholesterol), 1659 cm−1 (C=C stretching of unsaturated lipid), 1452 cm−1 (CH3, CH2 bending of lipids), 1269 cm−1 (C=C stretching of saturated lipid), 1085 cm−1 (C–O stretching or C–C skeletal of unsaturated lipid), and 700 cm−1 (stretching of steroid ring from cholesterol). The
Raman spectroscopy has also been used for the quantitative analysis of some compounds in serum including cholesterol, HDL, LDL, triglycerides, and glucose. The concentration was compared to that obtained from enzymatic assay. Chemometrics of PLSR was used to build calibration models. PLSR showed good correlation between Raman and enzymatic assay as shown by the correlation coefficient value for all measured compound was higher than 0.940. The region used for building PLSR model was in the range of 4000-450 cm−1 and the peaks were normalized to the peak at 1514 cm−1 (peak with highest intensity). The use of Raman was also applied for serum differentiation from normal serum and gastric cancer serum samples. Raman spectra were subjected to chemometrics of partial least square-discriminant analysis (PLS-DA) for classification. Serum from normal sample could be clearly differentiated with serum from gastric cancer sample by using PLS-DA method [3].
Analysis of nucleic acid in biomedical fluids using Raman spectroscopy
Surface-enhanced Raman spectroscopy (SERS) and multivariate analysis of principal component analysis-linear discriminant analysis (PCA-LDA) have been used for analysis of ribonucleic acid (RNA) in serum from normal and colorectal cancer subjects. Mixture of colloidal silver solution and MgSO4 were used as substrate for RNA measurement to enhance the Raman intensity of RNA. Raman measurement was performed using diode laser at 785 nm in the region of 1500-500 cm−1. Raman spectra from normal subjects were similar with those from colorectal cancer subjects. SERS peaks of RNA appeared at 527 cm−1 (uracil), 573 cm−1 (cytosine, guanine), 678 cm−1 (guanine), 714 and 736 cm−1 (adenine), 859 cm−1 (phosphate group), 1008 and 1033 cm−1 (C–O and C–C stretching), 1240 cm−1 (phosphate asymmetric stretching), 1277 cm−1 (nucleic acid and phosphate), 1316 cm−1 (guanine), and 1391 cm−1 (C–H rocking). However, there were several differences located in several peaks, especially in the peak intensities of RNA between normal and cancer groups. In normal subjects, the peaks at 1391, 1240, 1008, 860, and 714 cm−1 are higher than in cancer subjects, whereas the peak intensities at 1316, 1277, 1033, and 573 cm−1 are higher in cancer subjects than in normal. Multivariate analysis of PCA-LDA successfully differentiated RNA serum from normal and cancer subjects. RNA serum samples from normal and cancer subjects were clearly separated using LDA analysis. LDA showed good accuracy (89.1%) for predicting RNA samples from cancer subjects and showed good accuracy (92.35%) for predicting RNA samples from normal subjects [13].
Raman spectroscopy in combination with chemometrics of PCA and PLS-DA was used for detection of nucleic acids and protein changes during breast cancer development [45]. Some Raman spectral markers were identified in relation to the specific biomolecule markers such as peak at Raman shift of 689 cm−1 (nucleotide conformation), 788 cm−1 (phosphodiester bands in DNA), 828 cm−1 (tyrosine/protein), 848 and 1663 cm−1 (DNA) have not previously been identified. Principal component analysis-factorial discriminant analysis (PCA-FDA) was successfully used for the classification of Raman spectra related to biomolecules changes in the breast cancer patients, according to different stages of the disease.
Surface-enhanced Raman Spectroscopy (SERS) has been used for analysis of nucleic acids in human infected Hepatitis C (HCV). The Raman spectra of healthy samples were different from those in infected patients with HCV. Several peaks from infected samples were used as markers for nucleic acid in the infected patients. The peaks appeared at 720 cm−1 (adenine), 1077 cm−1 (PO2 RNA stretching), 1678 cm−1 (C–O stretching of RNA), 178 cm−1 (RNA), 1641 cm−1 (amide), 1721 cm−1 (protein stretching), and 1738 cm−1 (C–O ester group in lipids). Principal component analysis has been successfully used for differentiation between normal serum and infected Hepatitis C serum. SERS spectra from normal subjects were clearly separated from SERS spectra obtained from cancer subjects as shown by PCA score plot [19].
Protein analysis in biomedical fluids using Raman spectroscopy
Surface-enhanced Raman spectroscopy (SERS) combined with multivariate analysis of PLSR has been successfully applied to differentiate saliva protein from healthy subject, patient with benign breast tumor, and patient with malignant breast cancer. There are some differences of SERS spectra resulted from normal and tumor group. Some intense peaks appeared at 1684 cm−1, 1340 cm−1, 1208 cm−1, and 1004 cm−1 in benign and malignant breast tumor group, whereas peaks at 1176 cm−1, 1049 cm−1, and 621 cm−1 are higher in normal group. This indicates the changes in saliva proteins either in benign or malignant breast tumors. Spectra shifting in saliva protein of malignant breast cancer could also be monitored using SERS. The peak of saliva protein at 1310 cm−1 which appeares in normal subject is shifted to 1318 cm−1 in malignant breast cancer subject. The changes in SERS peak could be used as marker peak for the detection of benign and malignant breast cancer. More exploration was carried out to confirm the spectra from three different groups using multivariate analysis of PLS-DA. Model validation was carried out using leave one out cross validation technique and the score plot of PLS-DA showed clear classification among these three groups (normal, benign tumor, and malignant tumor). PLS-DA showed good diagnostic sensitivity (75.75%, 72.73%, and 74.19%) and good diagnostic specificity (87.63%, 78.35%, and 82.47%) for normal, benign, and malignant breast tumor subject, respectively [25].
Raman spectroscopy and multivariate analysis of PLS have been used for qualitative and quantitative analysis of albumin and immunoglobulin protein in human serum samples. Raman measurement was performed at 785 nm. Raman spectra of protein fraction could be observed at the band of 1650 cm−1 (vibration of amide I), 1436-1460 cm−1 (CH2 deformation), 1300-1343 cm−1 (CH deformation), 1004 cm−1 (phenylalanine), 944 and 890 cm−1 (C–C stretching), and 856 cm−1 (tyrosine) [18,35,66]. Raman spectra of albumin, Immunoglobulin A (Ig A), and Immunoglobulin G (Ig G) are similar. However, the intensities of albumin spectra have the highest spectra intensity compared to Ig A and Ig G. Quantitative analysis for determination of albumin, Ig A, and Ig G was performed using PLS regression. PLS models were evaluated using
The combination of Raman spectroscopy and chemometrics of PLS has been used for protein analysis of human blood fractions. Raman measurement was carried out at 785 nm and the blood samples were divided into two categories; whole blood and blood plasma. Raman spectra of blood plasma showed several peaks corresponding to protein or protein component signals such as 820 cm−1 (tyrosine), 950 cm−1 (deformation of C-H group), 1002 and 1080 cm−1 (phenylalanine), 1160 cm−1 (deformation of C-C group), 1250 cm−1 (
Raman spectroscopy has been employed for protein analysis in patient with asthma. Protein could be used as marker molecule to differentiate between normal and pathology condition. In pathology condition, certain protein could be different either in composition or structure from normal condition. Serum samples from asthma patients have been analyzed using Raman spectroscopy to investigate the changes in protein. Result showed that the proteins structures changed as the increasing severity of asthma [53]. Raman spectroscopy has been used for protein analysis in serum. Human blood serum in normal health was collected and then measured using Raman at wavelength 780 nm. Protein could be detected using Raman and the signals appeared at several regions. Most of protein signals appeared in the region of 1544 cm−1 and 1648 cm−1, which correspond to the signal of amide II and amide I. Other peak at 904 cm−1 is associated with deformation of C-H bond in protein. Proteins have either single bond of C–O or double bond of C=O that appear at 1740 cm−1 for single bond stretching vibration as well as 1780 cm−1 and 1816 cm−1 for double bond stretching vibration [17].
Conclusion
FTIR and Raman spectroscopy are non-destructive, fast techniques, and fingerprint techniques for analysis of biomolecules including protein, glucose, lipids, and nucleic acid in biomedical fluid. Combined with chemometrics of multivariate analysis either pattern recognition or regression, FTIR and Raman spectroscopy offer powerful and rapid analysis of biomolecules in biomedical fluids.
Conflict of interest
The authors have no conflict of interest to report.
