Abstract
Keywords
Introduction
Among diseases, cancer has the highest rate of mortality worldwide. Prevention and early diagnosis of cancer are the daunting tasks of the medical fraternity. It is now well established that diet has a significant effect on cancer incidences. 1 For many years food was accepted as the source of all nutrients required to accomplish the physiological functions needed for development, growth, health, and reproduction. For humans, macronutrients like Na, Ca, Mg, K, and Cl are required in large quantity, whereas trace elements (TE) like Cr, Mn, Fe, Co, Cu, Zn and Mo are required in quantities of less than 100 mg per day and are called micronutrients. 2 It has been observed that an imbalance of TEs is one of the significant causative factors for diseases.3,4 Further, there is a strong association between macronutrients and TEs resulting in the buildup of toxic or carcinogenic metals at the expense of macronutrients, leading to cancer.5–9
Deficiency in any trace element leads to undesirable pathological conditions that can be prevented or reversed by adequate supplementation. However, supplementation should be carefully administered given the toxic effects of TEs when taken in excess of the required amount. In order to prevent excess consumption of TE, a Tolerable Upper Intake Level (TUIL), the highest level of nutrient that is likely to pose no risk to the general population, has been prescribed by the U.S. Food and Nutrition Board of the Institute of Medicine 10 for Mn, Fe, Cu, Zn and Se and not for any other elements. The risk of adverse effects also increases with any intake above the TUIL. 11 However, the estimates made for TUILs is based on a risk assessment model with varying uncertainty factors and never considered the initial concentration of TE in each individual. 12 Furthermore, the risk assessment model never considered the complex interaction among nutrients 13 or drug nutrient interactions, and hence requires scientific validation. 11
The variation in concentration of some of TEs in the blood when compared to their inherent initial concentration is used for cancer screening on an individual by individual basis. The existence of this inherent initial concentration leads to a correlation among the TEs and forms a dynamic balance due to complex interactions among them. The existing chemometric techniques like Multivariate Linear Regressions (MLR), step wise regression, Principal Component Analysis (PCA), Artificial neural networks (ANNs), and Backpropagation neural networks (BpNN) are unable to predict the initial concentration of TEs and hence do not provide satisfactory results for cancer screening.14,15 Furthermore, supplementation in order to treat a particular disease should consider the toxic levels of all the other TEs 16 that these chemometric techniques fail to account.
In this paper, instead of estimating the initial concentration, we propose an information theory based Expert System (ES) for estimating the Lowest Limit of Toxicity Association (LLTA) for cancer screening, supplementation, and mitigation of toxicity of TE. Medical and biomedical intelligent data analysis is a complex field based on statistical methods. Medicinal doctors admit that they are still doing evidential medicine instead of making diagnosis based on hard facts, whereas an ES can guide them in their decision. 17 As carcinogenic toxic TEs build up at the expense of macronutrients, leading to cancer, the ES estimates the LLTA between the macro and micronutrients. The ES is based on minimizing the total toxicity of any particular TE by decreasing its association with the macro and micronutrients in blood samples. The ES considers the dynamic environment of interacting TEs and minimizes the total toxicity using the information theoretic concept of mutual information (MI). MI is a generalized measure of correlation, analogous to a linear correlation coefficient, but is sensitive to non-linear dependencies between TEs. In particular, a vanishing MI implies that the toxicity among the nutrients are independent, but not so with vanishing Pearson coefficient. 18 Thus MI provides a general measure of association between nutrients that is applicable regardless of the shape of their concentration distribution. Furthermore MI, unlike linear or rank order correlation, is insensitive to non-monotonic dependence between macro and micro nutrients. 19 The ES is based on an algorithm maximizing the MI by estimating the bounds for the correlated information between nutrients, using the technique of determinant Inequalities (TDI) developed by the authors. 20 This technique is unlike other ES which are restricted to a group of compounds having similar structure. 21 The advantage of the ES is that for any specific treatment due to toxicity of a particular TE, its LLTA for the rest of the macro and micronutrients are computed so that a complete toxic fingerprint of the entire TE in the body is available to the physician. Such a fingerprint would enable the physician to prescribe required medication containing the macronutrients to annul the toxicity of cancer risk TE. 22 We demonstrate the superiority of our ES based on MI over the Principal Component Analysis (PCA) in the analysis of a blood sample.
We first formulate the relationship between toxicity and the concentration of the nutrients using covariance information. As the interaction between TEs and macronutrients are stronger in cancer risk groups, we then use the information theoretic concept of MI to depict the strong correlation between them. We then deal with TDI to maximize the MI and the algorithm developed to estimate LLTA. We apply our technique to the analysis of blood samples and discuss the results. The flow chart for cancer screening is depicted in flow chart 1; the flow chart 2 portrays cancer management with macronutrients.
Flow Chart 1. Cancer screening using trace elements.
Flow Chart 2. Cancer management by supplementing macronutrients.
Covariance Information Model for Toxicity
Let
Using the first order perturbation theory, we can calculate any change in
Using the notation
The value of (
In Eq. (3) the product of the concentrations (
The correlation coefficient
Preliminaries on Mutual Information
Let,
Mutual Information,
When
When,
Figure 1 depicts pictorially the association and independence of toxicities for

The association and independence of toxicities for
where G = | Det. ρ | is the absolute value of the determinant of the correlation matrix ρ. Thus,
Here G can be maximized by finding the upper and lower bounds of
Technique of Determinant Inequalities
Information theory is endowed with a multitude of powerful theorems for computing bounds on the optimum representation and transmission of information bearing channels. Here, we develop the technique of determinant inequalities to estimate the upper and lower bounds for the constant inherent initial toxicity
The determinant G is positive when
The upper and lower bounds are determined by solving the polynomial equation G (
Then G = Det. ρ ≥ 0 requires that
From the above equation, it is clear that
Algorithm
The algorithm 24 based on TDI is as follows. Let us designate,
Gi: Determinant with ith row and column deleted,
Gij: Determinant with ith and th row and column deleted, (Note that when G has only two rows and columns then G12= 1)
gij: Determinant with ρii = 0,
gij: Determinant with ρij = 0, and row j and column i deleted.
According to Eq. (7), G ≥ 0 and hence Gi and Gij are also Gram determinants of lower order. Thus G ≥ 0, Gi > 0, Gij > 0 and we can establish the following Inequalities:
where g
Thus for the uncorrelated component, the lower bound is ρii ≥ –gii/Gi, while for the correlated component the upper and lower bounds are
According to Hadamard's inequality,
The equality is achieved if and only if ρ
Results
Blood samples from 100 patients, consisting of 58 women and 42 men from ages ranging between 20 and 80, were collected in this study. Extreme precaution was taken in the collection of these samples in order to prevent contamination from the exogenous Trace Element (TE). For each of the blood samples, multi-element concentrations were estimated using Neutron Activation Analysis (NAA). NAA is an attractive technique for rapid multi-element analysis of biological samples. 25 The multi-element data, which is the concentration of the macronutrients and the TE for the above 100 patients, was reduced to correlation matrix using the standard chemometric procedures. 26
The concentration correlation matrix for eleven elements in blood plasma is depicted in Table 1. Among these eleven, we focused our attention on the TE Cr, intake of which increases breast cancer mortality
16
In Table 2, the lower bound values of
Correlation matrix (ρ) between the concentrations of trace elements in blood plasma.
The estimation of lower bounds for
Comparison of
Cancer Screening and Management
The flow chart for cancer screening is depicted in flow chart 1. The sequence of screening is depicted serially from 1 to 5 as follows: (1) We collect the blood sample of the patients; (2) We subject these blood samples to NAA; (3) The NAA enables us to determine the elemental concentration of both the macronutrients and the TE; (4) Using these concentration values, we generate the correlation matrix using chemometric techniques; and (5) As TE builds up at the expense of macronutrients, we examine critically the value of correlation coefficient of cancer causing TE with the macronutrients.
Cancer management by supplementation is depicted in flow chart 2. We start our analysis with the correlation matrix. These correlation matrix are the input data to our technique of Determinant Inequalities (TDI). Upper Bound (UB) and Lower Bound (LB) values for each of correlation coefficients are generated by TDI. The values of either the (UB) or the (LB) which gives the highest value of Mutual Information (MI) is used for supplementation of macronutrients.
Discussion
The variation in concentration of TEs in the blood from inherent initial concentrations is an indication of malignancies and is used for cancer screening and diagnosis in individuals. All existing chemometric techniques like MLR, PCA, ANNs, and BpNN do not provide initial inherent concentration
As carcinogenic toxic TEs build up at the expense of macronutrients, we have proposed an Expert System (ES) for cancer screening which involves minimizing the strength of the association between macronutrients and TEs using MI. MI is maximum for a higher value of G, leading to least association of toxicities. Hence, our technique gives a scientific validation for the estimation of
Furthermore, from Table 2, the lower bound values are all much lower compared to PCA. As PCA values are higher, they do not lead to decreased association between the macronutrient and the TEs and accordingly cannot be used to predict LLTA. Thus, selective mitigation of toxicity of a specific TE is possible by decreasing its association with the other macro and micronutrients, using an algorithm based on MI. Even though we have chosen Cr for selective mitigation, our algorithm enables us to obtain the bounds for each of the TEs so that their association with the other elements can be selectively decreased by maximization of MI. Such TE-wise mitigation of toxicity is possible only by MI and not by either PCA or Independent Component Analysis (ICA).
As regards supplementation, a careful analysis of Table 2 would reveal that the lower bound values are all anti-correlated compared to existing values, which in only some cases are anti-correlated. As mentioned earlier, when macronutrients are consumed by humans in large quantities, because of their anti-correlation with TEs, they reduce the toxicity of TE. Thus, in the above case, the patient who consumes Mg, Fe and Zn in large quantities would not be at risk of breast cancer due to Cr toxicity. The LLTA value gives a scientific proof for supplementation of Mg, Fe, and Zn to boost the immune system against cancer.
Most expert systems provide prediction of toxicity only for a restricted group of compounds based on Quantitative Structure Activity Relationships (QSAR) and never provide toxicity profile for all of the elements present in a sample. Unlike those based on QSAR, our Expert System is not restricted to specific compounds but rather provides the LLTA for all TEs that are in association with any of micro and macronutrients.
Author Contributions
Conceived and designed the experiments: PTK, PTV. Analysed the data PTK, PTV, SSI, PI. Wrote the first draft of the manuscript PTK, PTV. Agree with the manuscript results and conclusions: PTK, PTV, VP, SSI, PI. Jointly developed the structure and arguments for the paper: PTK, PTV, VP, SSI, PI. Made critical revisions and approved final revision: PTK, PTV, VP, SSI, PI. All authors reviewed and approved of the final manuscript.
Funding
Author(s) disclose no funding sources.
Competing Interests
Author(s) disclose no potential conflicts of interest.
Disclosures and Ethics
As a requirement of publication author(s) have provided to the publisher signed confirmation of compliance with legal and ethical obligations including but not limited to the following: authorship and contributorship, conflicts of interest, privacy and confidentiality and (where applicable) protection of human and animal research subjects. The authors have read and confirmed their agreement with the ICMJE authorship and conflict of interest criteria. The authors have also confirmed that this article is unique and not under consideration or published in any other publication, and that they have permission from rights holders to reproduce any copyrighted material. Any disclosures are made in this section. The external blind peer reviewers report no conflicts of interest.
