Abstract
Keywords
1 Introduction
Diagnostic test accuracy studies observe the result of a gold standard procedure that defines the presence or absence of a disease and the result of a diagnostic test. They typically report the number of true positives (diseased subjects correctly diagnosed), false positives (non-diseased subjects incorrectly diagnosed as diseased), true negatives (non-diseased subjects correctly diagnosed as non-diseased) and false negatives (diseased subjects incorrectly diagnosed as non-diseased). However, diagnostic test outcomes can be non-evaluable. 1 This is the case for coronary computed tomography (CT) angiography studies which have non-evaluable results of index text in various ways such as when transferring a segment/vessel to a patient based evaluation. 2
Synthesis of diagnostic test accuracy studies is the most common medical application of multivariate meta-analysis.3,4 The purpose of a meta-analysis of diagnostic test accuracy studies is to combine information over different studies and provide an integrated analysis that will have more statistical power to detect an accurate diagnostic test than an analysis based on a single study. Nevertheless, the existence of non-evaluable subjects is an important issue that could lead to biased meta-analytic estimates of index test accuracy.2,5,6 Schuetz et al. 2 studied different ad-hoc approaches dealing with diagnostic test non-evaluable subjects, such as non-evaluable subjects are excluded from the study, non-evaluable positives (non-evaluable diseased subjects) are taken as true positives and non-evaluable negatives (non-evaluable non-diseased subjects) are taken as false positives, non-evaluable positives are taken as false negatives and non-evaluable negatives are taken as true negatives, and non-evaluable positives as false negatives and non-evaluable negatives as false positives. In all of these approaches, Schuetz et al. 2 used the bivariate generalized linear mixed model (BGLMM) 7 and concluded that excluding the index test non-evaluable subjects leads to overestimation of the meta-analytic estimates of sensitivity and specificity and recommended the intent-to-diagnose approach by treating non-evaluable positives as false negatives and non-evaluable negatives as false positives.
Ma et al. 5 used a trivariate generalized mixed model (TGLMM) approach by treating the non-evaluable subjects as missing data under a missing at random assumption (MAR). Ma et al., 5 with extensive simulation studies, showed that the intent-to-diagnose approach 2 under-estimates both meta-analytic estimates of sensitivity and specificity, while the TGLMM approach under the MAR assumption gives nearly unbiased estimates of sensitivity, specificity and prevalence.
Nikoloulopoulos, 6 similar to Ma et al., 5 extended the vine copula mixed model for trivariate meta-analysis of diagnostic test accuracy studies accounting for disease prevalence 8 to additionally account for non-evaluable subjects. The extended trivariate vine copula mixed model includes the extended TGLMM as a special case and can also model sensitivity, specificity and prevalence on the original scale. Nikoloulopoulos 6 demonstrated that the extended TGLMM leads to biased meta-analytic estimates of sensitivity, specificity and prevalence when the univariate random effects are misspecified and that the extended vine copula mixed model gives nearly unbiased estimates of test accuracy indices and disease prevalence.
A recurrent theme underlying the methodologies of Ma et al. 5 and Nikoloulopoulos 6 is the need to make the MAR assumption that cannot be verified based on the observed data. Hence, it is natural to be concerned about robustness or sensitivity of inferences to departures from the MAR assumption. The within-study model assumes that the number of true negatives, false negatives, false positives, true positives, non-evaluable negatives and non-evaluable positives are multinomially distributed, given the latent (random) vector of sensitivity, specificity, disease prevalence, probability of non-evaluable positives and probability of non-evaluable negatives. Under the MAR assumption, the multinomial probability mass function (pmf) decomposes into a product of independent binomial pmfs given the random effects. Hence, the within-study model actually assumes that the number of true negatives, number of true positives, number of diseased subjects, number of non-evaluable negatives and number of non-evaluable positives are conditionally independent and binomially distributed given the random effects. The triplet of latent sensitivity, specificity and prevalence are independent of the missing probabilities, hence the joint likelihood factors into two components, one involving only the sensitivity, specificity and disease prevalence, and the other involving only the probabilities of non-evaluable positives and non-evaluable negatives. Therefore, the methodology of Chu et al. 9 or Nikoloulopoulos 8 is applied to the first likelihood component to infer about the sensitivity, specificity and disease prevalence. Hence, the models in Ma et al. 5 and Nikoloulopoulos 6 extend the BGLMM 7 and the bivariate vine copula mixed model, 10 respectively, to the trivariate case by adding the disease prevalence as a third outcome to indirectly account for the non-evaluable results. On the one hand, the number of diseased subjects are binomially distributed with probability of success the latent prevalence and a support that includes the number of non-evaluable positives and the number of non-evaluable negatives, but the true positives and true negatives are binomially distributed with probability of success the latent sensitivity and specificity, respectively, and a support that does not include either the number of non-evaluable positives or the number of non-evaluable negatives on the other, just like in the BGLMM 7 and the bivariate vine copula mixed model. 10 Note in passing that a special case of the bivariate copula mixed model is the BGLMM, that is, a copula mixed model composed of a bivariate normal (BVN) copula with normal margins.
In this paper, in order to remedy this situation of ignoring the non-evaluable subjects in the derivation of the meta-analytic estimates of sensitivity and specificity, we include the number of non-evaluable positives and the number of non-evaluable negatives as separate non-missing response categories. Interestingly, the proposed model extends the bivariate copula mixed model 10 to the quadrivariate case by directly adding the number of non-evaluable positives and number of non-evaluable negatives as a third and fourth outcome, respectively. Hence, it directly utilizes all the available data. The bivariate copula mixed model 10 assumes independent binomial distributions for the true positives and true negatives, conditional on the latent pair of sensitivity and specificity in each study. In the proposed methodology for the meta-analysis of diagnostic tests where we additionally account for non-evaluable outcomes of the diagnostic test, we will assume independent multinomial distributions for the true and non-evaluable positives, and, the true and non-evaluable negatives, conditional on the latent sensitivity, specificity, probability of non-evaluable positives and probability of non-evaluable negatives in each study.
For the random effects distribution, we employ a regular vine copula.
11
Regular vine copulas are suitable for high-dimensional data, hence given the low dimension
The remainder of the paper proceeds as follows. Section 2 introduces the multinomial quadrivariate D-vine copula mixed model for meta-analysis of diagnostic studies accounting for non-evaluable results and provides computational details for maximum likelihood (ML) estimation. Section 3 studies the small-sample efficiency and robustness of the ML estimation of the multinomial quadrivariate D-vine copula mixed model. Section 4 applies our methodology to data from a published meta-analysis for diagnostic accuracy studies of coronary computed tomography angiography for the detection of coronary artery disease. We conclude with some discussion in Section 5, followed by a brief section with software details.
2 The multinomial quadrivariate D-vine copula mixed model
In this section, we introduce the multinomial quadrivariate D-vine copula mixed model. In Subsections 2.1 and 2.2, a D-vine copula representation of the random effects distribution with normal and beta margins, respectively, is presented. We complete this section with details on maximum likelihood estimation.
2.1 The multinomial quadrivariate D-vine copula mixed model with normal margins
We first introduce the notation used in this paper. The data are
Data from an individual study in a 3 × 2 table.
The diseased subjects have three possible states: false negative, true positive, and non-evaluable positive. The multinomial observation is therefore the number of diseased subjects where the diagnostic test is in each of its states. Hence, we assume that the false negatives
In a similar manner, the non-diseased subjects have also three possible states: true negative, false positive, and non-evaluable negative. Hence, we assume that the true negatives
After defining the within-studies model in equations (1) and (2), we next define the between-studies model. The stochastic representation of the between studies model takes the form
The quadrivariate D-vine copula is built via successive mixing from bivariate pair-copulas on different levels. The pairs at level 1 are
The models in equations (1)–(3) together specify a multinomial quadrivariate D-vine copula mixed model with joint likelihood
Below we transform the original integral into an integral over a unit hypercube using the inversion method. Hence, the joint likelihood becomes
2.2 The multinomial D-vine copula mixed model with beta margins
In this section, we use the parametrization proposed by Wilson
18
in order the latent sensitivity and specificity to remain on the original scale. The within-study model takes the form
The stochastic representation of the between studies model is
The models in equations (5) and (6) together specify a vine copula mixed model with joint likelihood
2.3 Maximum likelihood estimation and computational details
Estimation of the model parameters can be approached by the standard maximum likelihood (ML) method, by maximizing the logarithm of the joint likelihood. The estimated parameters can be obtained by using a quasi-Newton 19 method applied to the logarithm of the joint likelihood. This numerical method requires only the objective function, i.e. the logarithm of the joint likelihood, while the gradients are computed numerically and the Hessian matrix of the second-order derivatives is updated in each iteration. The standard errors (SEs) of the ML estimates can be also obtained via the gradients and the Hessian computed numerically during the maximization process.
For the multinomial quadrivariate D-vine copula mixed model, numerical evaluation of the joint pmf can be achieved with the following steps:
where 3. Numerically evaluate the joint pmf, e.g.
in a quadruple sum
With Gauss-Legendre quadrature, the same nodes and weights are used for different functions; this helps in yielding smooth numerical derivatives for numerical optimization via quasi-Newton.
3 Simulations
In this section, we study the small-sample efficiency and robustness of the ML estimation of the multinomial quadrivariate D-vine copula mixed model. In Section 3.1, we gauge the small-sample efficiency of the ML method and investigate the misspecification of the parametric margin or bivariate pair-copulas of the random effects distribution. In Section 3.2, we investigate the mixed model misspecification by using both the proposed model and the extended trivariate vine copula mixed model 6 as true models.
We set the sample size and the true univariate and dependence parameters to mimic the data analyzed in Section 4. In each model, we use six different linking copula families: normal, Frank, and Clayton copula along with its rotated versions (see our previous papers on copula mixed models8,10,21,22 for definitions) to cover different types of dependence structure. To make it easier to compare strengths of dependence, we convert the BVN, Frank, and rotated Clayton estimated parameters to Kendall’s
3.1 Small-sample efficiency–misspecification of the random effects distribution
We randomly generate samples of size 1. Simulate 2. • Convert to normal realizations via
• Convert to beta realizations via
3. Simulate the size of diseased and non-diseased subjects
and round off 4. • For normal margins, draw • For beta margins, draw
and
Tables 2 and 3 contain the resultant biases, root mean square errors (RMSE), and standard deviations (SD), along with the square root of the average theoretical variances (
Small sample of sizes
Note: Cln{
BVN: bivariate normal
True model.
Small sample of sizes
Note: Cln{
BVN: bivariate normal.
True model.
Conclusions from the values in the tables are the following:
ML with the true multinomial D-vine copula mixed model is highly efficient according to the simulated biases, SDs and RMSEs. The ML estimates of the univariate meta-analytic parameters and their SDs are robust under copula misspecification, but are not robust to margin misspecification. The ML estimates of
3.2 Misspecification of the copula mixed model that accounts for non-evaluable outcomes
We randomly generate samples of size
In Section 3.1 and in Nikoloulopoulos,
6
it has been revealed that (a) the estimation of the univariate meta-analytic parameters is a univariate inference, and hence it is the univariate marginal distribution that matters and not the type of the copula, and (b) estimated Kendall’s
Tables 4 and 5 contain the resultant biases, RMSEs, and SDs, along with the square root of the average theoretical variances (
Small sample of sizes
The non-evaluable outcomes are excluded
The resulting model is the same as the BGLMM.
The non-evaluable positives and negatives are included as false negatives and positives, respectively.
The resulting model is the same as the extended TGLMM.
Small sample of sizes
The non-evaluable outcomes are excluded.
The resulting model is the same as the BGLMM.
The non-evaluable positives and negatives are included as false negatives and positives, respectively.
The resulting model is the same as the extended TGLMM.
Conclusions from the values in the tables are the following:
The bivariate copula mixed model where the non-evaluable outcomes are disregarded and the extended trivariate vine copula mixed model showed similar performance. Both led to unbiased (biased) and efficient (inefficient) estimates when the true model is the trivariate (quadrivariate multinomial) vine copula mixed model. The bivariate copula mixed model where the non-evaluable positives and negatives included as false negatives and false positives, respectively, and the multinomial D-vine copula mixed model with beta margins showed similar performance. Both led to unbiased (biased) and efficient (inefficient) estimates when the true model is the quadrivariate multinomial vine copula mixed model with beta margins (trivariate vine copula mixed model or quadrivariate multinomial vine copula mixed model with normal margins).
4 Meta-analysis of coronary computed tomography angiography studies
We apply the multinomial quadrivariate D-vine copula mixed model for the meta-analysis of diagnostic accuracy studies accounting for non-evaluable subjects to data on 30 studies from a systematic review for diagnostic accuracy studies of coronary computed tomography angiography for the detection of coronary artery disease. 27
We fit the multinomial quadrivariate D-vine copula mixed model for all different decompositions of the D-vine copula density, for both beta and normal margins and different pair-copulas at the level 1; for levels 2 and 3, we use BVN copulas. In cases when fitting the multinomial quadrivariate D-vine copula mixed model, the resultant estimate of one of the conditional dependence parameters was close to the right boundary of its parameter space (that is clear indication that the model with a full structure provides more dependence structure than it is actually required 8 ), we used a truncated model, i.e. we captured the strongest dependence in the first tree and then just used the independence copulas in lower order trees, i.e. conditional independence. Joe et al. 28 showed that in order for a vine copula to have (tail) dependence for all bivariate margins, it is only necessary for the bivariate copulas in level 1 to have (tail) dependence and it is not necessary for the conditional bivariate copulas in levels 2 and 3, to have tail dependence. Hence, one can either use BVN or independence copulas at levels 2 and 3 without sacrificing the tail dependence of the vine copula distribution.
In Table 6, we present the results from the decomposition of the vine copula density in equation (4), as a different decompositions led to similar results due to the small sample size. This is consistent with our previous studies on vine copula mixed models.6,8 Since the number of parameters is not the same between the models, we use the Akaike information criterion (AIC), that is −2× log-likelihood +2× (#model parameters) as a rough diagnostic measure for goodness of fit between the models. The AICs showed that a (truncated) multinomial quadrivariate D-vine copula mixed model with Clayton copulas rotated by
AICs, ML estimates, and standard errors (SE) of the multinomial quadrivariate D-vine copula mixed models for diagnostic accuracy studies of coronary computed tomography angiography.
Note: Cln{
AIC: akaike information criterion; BVN: bivariate normal.
Best fit.
In real data (in contrast with the simulated data in Section 3), the truth is unknown, so it is important to compare between the proposed and other existing approaches in terms of point estimation and variance. First, in order to reveal if the use of the proposed model is worthy, when a standard bivariate analysis (either ignoring the non-evaluable outcomes or including the non-evaluable positives and negatives as false negatives and positives, respectively) is easy, we also fit the bivariate copula mixed model 10 with both beta and normal margins and different bivariate copulas. According to the likelihood principle, a bivariate copula mixed model with a Clayton and Clayton copula rotated by 180° (to model the association between the latent sensitivity and specificity) and beta margins provides the best fit for both different ad-hoc approaches to handle the non-evaluable outcomes (Table 7). It is revealed that a bivariate copula mixed model with the sensitivity and specificity on the original scale provides better fit than the BGLMM, 7 which models the sensitivity and specificity on a transformed scale.
AICs, ML estimates, and standard errors (SE) of the best fitted bivariate copula and extended trivariate vine copula mixed models with beta margins for diagnostic accuracy studies of coronary computed tomography angiography.
AIC: akaike information criterion.
The non-evaluable outcomes are excluded.
The non-evaluable positives and negatives are included as false negatives and positives, respectively.
Then, in order to compare the proposed approach with the ones that use the MAR assumption, we fit the extended trivariate vine copula mixed model 6 with both beta and normal margins and different pair-copulas. According to the likelihood principle, a vine copula mixed model composed of a Clayton copula to model the association between the sensitivity and specificity, a Clayton copula rotated by 90° to model both the associations between the specificity and prevalence and between the sensitivity and prevalence given the specificity, and beta margins provides the best fit (Table 7). It is revealed that an extended trivariate vine copula mixed model with the sensitivity, specificity, and disease prevalence on the original scale provides better fit than the extended TGLMM, 5 which models the sensitivity, specificity, and disease prevalence on a transformed scale.
It has been shown that the trivariate analysis does not change the conclusions from the bivariate analysis excluding the non-evaluable outcomes. It is also apparent that the results from the quadrivariate analysis differentiate from the ones from bivariate (excluding the non-evaluable outcomes) and trivariate analyses which are fairly similar. The meta-analytic estimates of sensitivity and specificity from the latter approaches are blown, because in both approaches it is assumed that
Although typically the focus of meta-analysis has been to derive the summary-effect estimates, there is increasing interest in drawing predictive inference. Summary receiver operating characteristic curves (SROC) can be deduced from the D-vine copula mixed model with the sensitivity and specificity on the original scale through the quantile regression techniques developed for the bivariate copula mixed model.
10
SROC essentially shows the effect of different model (random effect distribution) assumptions, since it is an inference that depends on the joint distribution. An SROC curve has been deduced for the bivariate copula mixed model
10
through a median regression curve of
Figure 1 demonstrates the SROC curves with a confidence region and summary operating points (a pair of the model-based sensitivity and specificity; shown by the black square) from the best fitted multinomial quadrivariate D-vine copula mixed model, the best fitted trivariate vine copula mixed model, and the best fitted bivariate copula mixed models along with the study estimates (shown by the circles in Figure 1). For the upper panel graphs, the sensitivity and specificity at study

SROC curves with a confidence region and summary operating points (a pair of the model-based sensitivity and specificity) from the best fitted multinomial quadrivariate D-vine, extended trivariate vine and bivariate copula mixed models along with the study estimates.
5 Discussion
Motivated by the existence of non-evaluable results in diagnostic test accuracy studies, this paper proposed a multinomial quadrivariate D-vine copula mixed model for meta-analysis of diagnostic test accuracy studies accounting for non-evaluable subjects. Our general statistical model allows for selection of pair-copulas independently among a variety of parametric copula families, i.e. there are no constraints in the choices of bivariate parametric families of copulas and can also operate on the original scale of sensitivity and specificity.
For the random effects, we have used a quadrivariate D-vine copula distribution or a truncated at level 1 quadrivariate D-vine copula (conditional independence), which allows flexible (tail) dependence. 28 We have proposed a numerically stable ML estimation technique based on Gauss-Legendre quadrature; the crucial step is to convert from independent to dependent quadrature points that follow a quadrivariate D-vine distribution.
In an era of evidence-based medicine, decision makers need high-quality procedures such as the one developed in this article to support decisions about whether or not to use a diagnostic test in a specific clinical situation. The multinomial quadrivariate vine-copula mixed model is not an ad-hoc 2 but rather a sophisticated approach that utilizes all the available data in decision making and can satisfy the intention-to-diagnose principle. Using an intention to diagnose principle, i.e. a conservative approach, ensures that both the sensitivity and specificity are not overestimated. Hence, it formally enables decision makers to be more cautious in solely relying to the overly optimistic meta-analytic estimates of sensitivity and specificity derived from the extended trivariate vine copula mixed model that indirectly accounts for the non-evaluable outcomes.
