Abstract
Introduction
Humans are exposed to mixtures of chemicals that may be influential for cancer risk. For example, risk of non-Hodgkin lymphoma (NHL) is suspected to be associated with several chemicals through environmental or occupational routes of exposure, and geographic variation in NHL rates suggests the importance of environmental risk factors. 1 Positive associations have been found with persistent organochlorine chemicals, including polychlorinated biphenyls (PCBs), 2 particularly PCB congener 180,3–5 and dichlorodiphenyldichloroethylene.2,3
Environmental exposure patterns are typically complex with inherent correlations among co-occurring chemicals or their metabolites. 6 For example, many PCB congeners exhibit a high degree of correlation. Important questions in the analysis of mixtures include whether and how the health effect of one chemical should be adjusted for other chemicals present, even when those chemicals are highly correlated. Furthermore, the relationship between environmental chemicals and health effects (eg, cancer risk) is not always constant across a study area. 6 Exposure levels may be different spatially due to environmental factors. For example, pesticide levels measured in house dust may be higher in agricultural communities (eg, in Iowa) or those in temperate climates where more pesticides are applied throughout the year (eg, Los Angeles) compared to the levels in urban locations (eg, Detroit). Acknowledging the principle that “the dose makes the poison,” the risk of adverse health effects such as NHL is greater in regions where exposure is higher. Thus, environmental health models that account for these spatially changing exposure/risk regions can be informative.
Models with spatially varying coefficients include geographically weighted regression (GWR 7 ), which is similar to local linear regression (eg, references8–10) in that both methods use a kernel function to calculate weights that are applied to observations in a series of local weighted regression models. One issue with GWR is that GWR models have been found to be affected by local collinearity.11–15 Local collinearity in weighted explanatory variables can lead to GWR coefficient estimates that are correlated locally and across space, have inflated variances, and are at times counterintuitive and contradictory in sign to the global regression estimates, ie, evidence of the reversal paradox.12,16
To illustrate, Wheeler and Tiefelsdorf 11 highlighted the issue of collinearity in GWR in a simple model to explain white male bladder cancer mortality rates (1970–1994) in the 508 State Economic Areas of the US. Their model consisted of two explanatory variables: population density, a proxy for environmental and behavioral differences in urban/rural life, and lung cancer mortality rates, a proxy for the risk factor smoking, a known risk factor for bladder cancer. These two variables had a global correlation estimate of −0.59; however, local correlation estimates were generally more extreme (ie, more strongly negative; median = −0.63; Q3 = −0.71 as approximated from their Fig. 4), with strongest inverse association in parts of Northeastern and Midwestern US (their Fig. 3). The resulting maps of GWR coefficients for population density and the smoking proxy showed a clear inverse map pattern. When the local smoking proxy parameter was high (primarily in the West and Northeast), the local population density parameter was negative. When the local smoking proxy parameter was negligible, the population density parameter was large and positive (primarily in the Midwest and Southeast). As noted by Wheeler and Tiefelsdorf, 11 the important question is whether this complementary relationship in the parameters is real, meaningful, and interpretable, or whether it is an artifact of the statistical method. The natural research question is whether such inverse patterning in regression coefficients is an example of the reversal paradox 16 due to strong local correlations between the two variables.
According to the reversal paradox, the association between two variables can be reversed, diminished, or enhanced when another variable is statistically controlled for.
16
For example, consider two explanatory variables,

Standardized partial regression coefficients in a multiple regression model with two variables.
To address the issue of collinearity with GWR and to limit its effects, the geographically weighted lasso (GWL) adds a constraint on the magnitude of the estimated regression coefficients. 14 The GWL also performs local model selection by potentially shrinking some of the estimated regression coefficients to zero in some locations of the study area, thereby diminishing the adverse effects of the correlation pattern. However, when accurate variable selection is the focus of the analysis, such a strategy makes it difficult to determine whether a variable was excluded from the model due to a lack of association with the outcome or due to its correlation with variables in the model.
Our objective in this study is to evaluate the impact of collinearity of the geographically weighted regression models GWR and GWL in a chemical exposure and risk assessment context. We use a simulated data set for which the truth is known and further assess the ability of GWL to control collinearity effects, such as the reversal paradox, when the effects of correlated environmental chemicals are of interest. We begin by describing the process used to simulate data that we propose are environmentally relevant – ie, regions with low exposure and regions with higher exposures and where different chemicals may have related exposure patterns but not necessarily the same association with a health effect of interest. We conduct GWR and GWL analyses in a scenario with independent chemicals and a scenario with correlated chemicals.
Methods
Simulating Spatially Varying Exposure- and Dose-Dependent Association with an Outcome
Consider the scenario in which there are three predictor variables (eg, environmental chemicals) that vary over space in a study area (Fig. 2). We assumed that the first predictor variable,

Plots of average simulated concentration values across 100 simulated data sets over a square study area for two scenarios: independent chemicals (Case 1) and correlated chemicals (Case 2).
For each case, multivariate normal data were simulated separately for Region 1 and Region 2. We assumed that the levels of
GWR Model
In GWR, the spatial coordinates of data are used in the calculation of distances that are input into a kernel function to determine weights for spatial dependence among observations. Local regression models are related through shared data, but the dependence between regression coefficients at different locations is not specified. For example, consider
GWL Model
The lasso is defined
17
as.
Evaluation of Models
The focus of the study was to determine whether the methods were able to correctly detect a strong relationship between
For each model, we calculated the root-mean-square error (RMSE) from estimation, the RMSPE, and the
To evaluate the performance of GWL in terms of variable selection, the percentages of coefficient estimates that were positive, negative, or zero were calculated by region for each simulated data set. We summarized the results across the simulated examples using medians and IQRs. Because GWR does not perform variable selection, we calculated the percentage of coefficient estimates that were positive and negative within each region. Additionally, in an effort to further evaluate the performance of GWR, we approximated the variance of the estimated GWR regression coefficients and created confidence intervals for the estimates at each location based on one and two standard errors (SEs) (ie,
Results
The average observed concentration levels across the 100 simulated examples are plotted over the study area for each case in Figure 2, where in we see that the average levels of
Average predictor and response values across the 100 simulated data sets for the cases of independent chemicals (Case 1) and correlated chemicals (Case 2).
The summary statistics across the 100 data sets are listed in Table 2. GWL outperformed GWR in terms of RMSPE in the uncorrelated case, while in the correlated case, GWL outperformed GWR in terms of both RMSPE and RMSE, with a greater improvement for prediction of the outcome (RMSPE) than for estimation of the outcome (RMSE).
Median (interquartile range) of summary statistics for GWR and GWL models across the 100 simulated data sets for the cases of independent chemicals (Case 1) and correlated chemicals (Case 2).
Pairwise plots of the average regression coefficients are shown in Figure 3. Correlation in the parameter estimates is evident for both GWR and GWL in the cases of both independent and correlated chemicals. In the uncorrelated case, the relationship is most pronounced between the intercept and β1 parameters (denoted by b0 and b1, respectively). In the correlated case, there is a noticeable pattern among all of the parameter estimates, with a strong linear relationship evident between the estimates for and β3 (denoted by b2 and b3, respectively). While GWL breaks up some of the strong correlation among the parameter estimates that is evident in GWR, strong relationships are still present between many of the regression coefficients.

Pairwise plots of average regression coefficients across the 100 simulated data sets for the cases of independent chemicals (Case 1) and correlated chemicals (Case 2) for GWR and GWL.
As demonstrated in the box plots of the averaged regression coefficients from the models for the 100 simulated data sets (Fig. 4), GWR appears to accurately capture the importance of

Box plots of average GWR and GWL regression coefficients across 100 simulated data sets for the two study regions for the cases of independent chemicals (Case 1) and correlated chemicals (Case 2).
The GWR and GWL regression coefficient estimates from the 100 simulated data sets were averaged at each location and are plotted in Figure 5. The coefficient maps reveal a high degree of correlation between the GWR estimates of β0 and β1 in both the independent and the correlated cases. This strong negative relationship is also evident in the pairwise scatter plots of the regression coefficients (Fig. 3). Similarly, correlation in the intercept and β1 is also apparent in the GWL models, although the correlation between the estimates is not as strong and is largely positive. When examining the coefficient maps for β1 in both the independent and the correlated cases, GWR and GWL correctly identified Region 1 as the area of highest activity for

Average GWR and GWL regression coefficient estimates over 100 simulated data sets for the cases of independent chemicals (Case 1) and correlated chemicals (Case 2).
When considering the estimated β2 coefficients, GWR appears to identify several clusters of positive and negative associations in the independent case, while in the case of correlation, it finds a positive association in Region 1 and a negative association in Region 2, probably a reflection of the high degree of correlation between the predictors
Finally, with respect to β3, GWR appears to incorrectly identify several clusters of a stronger positive relationship between
The percentages of positive and negative GWR coefficient estimates are summarized by region for each correlation case in the left side of Table 3. We see that across the simulated data sets, the GWR estimates of β1 were positive nearly 100% of the time in Regions 1 and 2 for both the independent and the correlated cases. This is further evidence that GWR overstates the importance of β1 in Region 2. When considering β2, 53% of the GWR estimates in Regions 1 and 2 were negative in the case of independence for at least half of the simulated data sets, while 28% and 77% of the GWR estimates in Regions 1 and 2, respectively, were negative in the case of correlation. Given that
Median (interquartile range) percentage of GWR and GWL coefficient estimates that were positive, negative, and zero across the 100 simulated data sets for the cases of independent chemicals (Case 1) and correlated chemicals (Case 2).
Similarly, as shown in the right side of Table 3, the GWL estimates of β1 were positive nearly 100% of the time in Regions 1 and 2 for both the independent and the correlated cases. This indicates that GWL failed to appropriately perform variable selection for
The results of applying one and two SEs to the GWR estimated coefficients to classify them as positive, negative, or zero are listed in Table 4. Using the one-SE criteria, GWR incorrectly classified 83% of β1 estimates in Region 2 as positive at least half of the time for the independent case and incorrectly classified 84% of β1 estimates in Region 2 as positive at least half of the time when the predictors were correlated. Similarly, when applying the two-SE criteria, GWR incorrectly classified 64% of β1 estimates in Region 2 as positive at least half of the time for the independent case and incorrectly classified 66% of β1 estimates in Region 2 as positive at least half of the time in the correlated case. This implies that GWR frequently yields nonnegligible positive estimates of β1 in the region of inactivity.
Median (interquartile range) percentage of GWR coefficient estimates that were positive, negative, and zero across the 100 simulated data sets when considering ±1 and ±2 standard errors of regression coefficient estimates for the cases of independent chemicals (Case 1) and correlated chemicals (Case 2).
Furthermore, when applying the one-SE rule, we see that in the case of independence, GWR correctly classified 64% and 72% of the β2 estimates as zero in the upper and lower regions, respectively, at least half of the time. In the case of correlated predictors, only 51% and 44% of the β2 estimates were correctly classified at least half of the time in the upper and lower regions, respectively. Finally, when using the two-SE criteria, 29% of the β2 estimates in Region 1 were incorrectly classified as positive at least half of the time when the predictors were correlated. Thus, even when allowing “small” estimates to be considered as negligible, GWR results can still lead to erroneous inferences about the nature of a predictor variable that is not associated with the response.
As an illustrative example, we randomly chose one simulated data set for each correlation case and plotted the corresponding estimated regression coefficients from GWR and GWL, using open circles for the negligible estimates (ie, GWR estimates with confidence intervals containing zero or GWL estimates of zero) (Fig. 6). This example visually supports the aggregate results given in Tables 3 and 4. GWR accurately identified Region 1 as the region of high activity for

Estimated regression coefficients from GWR and GWL for one simulation of data under the cases of independent chemicals (Case 1) and correlated chemicals (Case 2).
Discussion and Conclusion
We have evaluated the ability of the geographically weighted regression methods of GWR and GWL to detect signal from noise in the context of modeling the associations of environmental chemicals and an adverse health effect using a simulation study with both independent and correlated chemicals. We found that GWR was able to identify regions of high activity for an important chemical when the predictors were independent and when they were highly correlated, but it demonstrated a tendency to overstate the importance of this chemical in its region of inactivity. Furthermore, GWR suffered from the reversal paradox for less-important chemicals when the chemicals were correlated, as the variable that was not associated with the outcome was largely positive in the upper study region and largely negative in the lower study region. We also found that with GWL, the signal of the most important chemical was diminished, with less distinction between the inactive and active study regions, regardless of the correlation among the chemicals.
Previous work has addressed the issue of collinearity in GWR. Wheeler and Tiefelsdorf 11 first demonstrated the link between collinearity in GWR and correlation of estimated regression coefficients using simulation studies. These authors introduced systematic collinearity into the model by adding correlation to a pair of covariates and found consistent evidence of increasing correlation in GWR coefficients with increasing collinearity. Wheeler and Calder 13 used two simulation studies to evaluate the coverage probability and accuracy of the regression coefficients from GWR. Results of the simulation studies include low coverage probabilities for the GWR coefficients and consistently increasing error in the coefficients when collinearity is increased. Wheeler 12 conducted a simple experiment by systematically increasing collinearity in a data set to demonstrate that a penalized form of GWR, geographically weighted ridge regression, reduces the extreme effects of collinearity that afflict GWR. More recent simulation study work confirms that a nonnegligible amount of spatial variation of and correlation between GWR coefficient surfaces is inherently generated by the method. 15 This work finds that the false-positive rates for GWR coefficients are typically much higher than convention would mandate, from <10% to >50% of the time (depending on the true correlation level between two covariates) when the true underlying process is stationary.
Wheeler 14 expanded the simulation study of Wheeler and Calder 13 to contain four explanatory variables and 196 observations in a study of the performance of GWR and GWL. This work compared the coefficient accuracy and the predictive performance of the models in the presence of collinearity. In these experiments, 100 realizations of a data-generating process were used with the true local coefficients sampled from a multivariate normal distribution. These simulation studies show that the performance of GWR in terms of both prediction and coefficient accuracy can be improved by constraining the magnitude of its regression coefficients with techniques designed to remediate collinearity. However, the experiments reported in that study show that the correlation between local coefficients is reduced but not eliminated with GWL, and that although GWL can shrink some coefficients to zero to stabilize the model, the estimates still tend to be positively correlated with those from GWR. 15
We have extended these results in the case of three environmental chemicals to identify evidence of the reversal paradox and evaluate the correct identification of local “hot spots” or regions of high activity for one chemical. Our results demonstrate that while GWR can correctly identify a region of high activity for one chemical, it has difficulty in identifying regions of inactivity or low exposure. Additionally, GWR artificially induces spatial patterning and suffers from the reversal paradox in the setting of highly correlated predictor variables. Finally, we have shown that while GWL reduces the correlation among the coefficient estimates and tempers the reversal paradox that is problematic with GWR, it suffers from an inability to adequately distinguish local regions of high activity regardless of the relationship among the predictor variables. The implications of our findings for environmental risk analysis is that GWR may incorrectly identify some chemicals as positively or negatively associated with disease risk, and GWL may not correctly estimate the magnitude of association for an important chemical in some regions of the study area. Given these findings, more methodological development is required to better estimate the effects of correlated environmental chemicals on diseases associated with environmental factors, such as many cancers.
Author Contributions
Conceived and designed the experiments: JC, DCW, CG. Analyzed the data: JC, DCW, CG. Wrote the first draft of the manuscript: CG, JC, DCW. Contributed to the writing of the manuscript: JC, DCW, CG. Agree with manuscript results and conclusions: JC, DCW, CG. Jointly developed the structure and arguments for the paper: JC, DCW, CG. Made critical revisions and approved final version: JC, DCW, CG. All authors reviewed and approved of the final manuscript.
