Sage Journals: Discover world-class research

Abstract

Diagnostic test accuracy studies observe the result of a gold standard procedure that defines the presence or absence of a disease and the result of a diagnostic test. They typically report the number of true positives, false positives, true negatives and false negatives. However, diagnostic test outcomes can also be either non-evaluable positives or non-evaluable negatives. We propose a novel model for the meta-analysis of diagnostic studies in the presence of non-evaluable outcomes, which assumes independent multinomial distributions for the true and non-evaluable positives, and, the true and non-evaluable negatives, conditional on the latent sensitivity, specificity, probability of non-evaluable positives and probability of non-evaluable negatives in each study. For the random effects distribution of the latent proportions, we employ a drawable vine copula that can successively model the dependence in the joint tails. Our methodology is demonstrated with an extensive simulation study and applied to data from diagnostic accuracy studies of coronary computed tomography angiography for the detection of coronary artery disease. The comparison of our method with the existing approaches yields findings in the real data application that change the current conclusions.

Keywords

Diagnostic tests multivariate meta-analysis sensitivity specificity summary receiver operating characteristic curves

1 Introduction

Diagnostic test accuracy studies observe the result of a gold standard procedure that defines the presence or absence of a disease and the result of a diagnostic test. They typically report the number of true positives (diseased subjects correctly diagnosed), false positives (non-diseased subjects incorrectly diagnosed as diseased), true negatives (non-diseased subjects correctly diagnosed as non-diseased) and false negatives (diseased subjects incorrectly diagnosed as non-diseased). However, diagnostic test outcomes can be non-evaluable.¹ This is the case for coronary computed tomography (CT) angiography studies which have non-evaluable results of index text in various ways such as when transferring a segment/vessel to a patient based evaluation.²

Synthesis of diagnostic test accuracy studies is the most common medical application of multivariate meta-analysis.^3,4 The purpose of a meta-analysis of diagnostic test accuracy studies is to combine information over different studies and provide an integrated analysis that will have more statistical power to detect an accurate diagnostic test than an analysis based on a single study. Nevertheless, the existence of non-evaluable subjects is an important issue that could lead to biased meta-analytic estimates of index test accuracy.^2,5,6 Schuetz et al.² studied different ad-hoc approaches dealing with diagnostic test non-evaluable subjects, such as non-evaluable subjects are excluded from the study, non-evaluable positives (non-evaluable diseased subjects) are taken as true positives and non-evaluable negatives (non-evaluable non-diseased subjects) are taken as false positives, non-evaluable positives are taken as false negatives and non-evaluable negatives are taken as true negatives, and non-evaluable positives as false negatives and non-evaluable negatives as false positives. In all of these approaches, Schuetz et al.² used the bivariate generalized linear mixed model (BGLMM)⁷ and concluded that excluding the index test non-evaluable subjects leads to overestimation of the meta-analytic estimates of sensitivity and specificity and recommended the intent-to-diagnose approach by treating non-evaluable positives as false negatives and non-evaluable negatives as false positives.

Ma et al.⁵ used a trivariate generalized mixed model (TGLMM) approach by treating the non-evaluable subjects as missing data under a missing at random assumption (MAR). Ma et al.,⁵ with extensive simulation studies, showed that the intent-to-diagnose approach² under-estimates both meta-analytic estimates of sensitivity and specificity, while the TGLMM approach under the MAR assumption gives nearly unbiased estimates of sensitivity, specificity and prevalence.

Nikoloulopoulos,⁶ similar to Ma et al.,⁵ extended the vine copula mixed model for trivariate meta-analysis of diagnostic test accuracy studies accounting for disease prevalence⁸ to additionally account for non-evaluable subjects. The extended trivariate vine copula mixed model includes the extended TGLMM as a special case and can also model sensitivity, specificity and prevalence on the original scale. Nikoloulopoulos⁶ demonstrated that the extended TGLMM leads to biased meta-analytic estimates of sensitivity, specificity and prevalence when the univariate random effects are misspecified and that the extended vine copula mixed model gives nearly unbiased estimates of test accuracy indices and disease prevalence.

A recurrent theme underlying the methodologies of Ma et al.⁵ and Nikoloulopoulos⁶ is the need to make the MAR assumption that cannot be verified based on the observed data. Hence, it is natural to be concerned about robustness or sensitivity of inferences to departures from the MAR assumption. The within-study model assumes that the number of true negatives, false negatives, false positives, true positives, non-evaluable negatives and non-evaluable positives are multinomially distributed, given the latent (random) vector of sensitivity, specificity, disease prevalence, probability of non-evaluable positives and probability of non-evaluable negatives. Under the MAR assumption, the multinomial probability mass function (pmf) decomposes into a product of independent binomial pmfs given the random effects. Hence, the within-study model actually assumes that the number of true negatives, number of true positives, number of diseased subjects, number of non-evaluable negatives and number of non-evaluable positives are conditionally independent and binomially distributed given the random effects. The triplet of latent sensitivity, specificity and prevalence are independent of the missing probabilities, hence the joint likelihood factors into two components, one involving only the sensitivity, specificity and disease prevalence, and the other involving only the probabilities of non-evaluable positives and non-evaluable negatives. Therefore, the methodology of Chu et al.⁹ or Nikoloulopoulos⁸ is applied to the first likelihood component to infer about the sensitivity, specificity and disease prevalence. Hence, the models in Ma et al.⁵ and Nikoloulopoulos⁶ extend the BGLMM⁷ and the bivariate vine copula mixed model,¹⁰ respectively, to the trivariate case by adding the disease prevalence as a third outcome to indirectly account for the non-evaluable results. On the one hand, the number of diseased subjects are binomially distributed with probability of success the latent prevalence and a support that includes the number of non-evaluable positives and the number of non-evaluable negatives, but the true positives and true negatives are binomially distributed with probability of success the latent sensitivity and specificity, respectively, and a support that does not include either the number of non-evaluable positives or the number of non-evaluable negatives on the other, just like in the BGLMM⁷ and the bivariate vine copula mixed model.¹⁰ Note in passing that a special case of the bivariate copula mixed model is the BGLMM, that is, a copula mixed model composed of a bivariate normal (BVN) copula with normal margins.

In this paper, in order to remedy this situation of ignoring the non-evaluable subjects in the derivation of the meta-analytic estimates of sensitivity and specificity, we include the number of non-evaluable positives and the number of non-evaluable negatives as separate non-missing response categories. Interestingly, the proposed model extends the bivariate copula mixed model¹⁰ to the quadrivariate case by directly adding the number of non-evaluable positives and number of non-evaluable negatives as a third and fourth outcome, respectively. Hence, it directly utilizes all the available data. The bivariate copula mixed model¹⁰ assumes independent binomial distributions for the true positives and true negatives, conditional on the latent pair of sensitivity and specificity in each study. In the proposed methodology for the meta-analysis of diagnostic tests where we additionally account for non-evaluable outcomes of the diagnostic test, we will assume independent multinomial distributions for the true and non-evaluable positives, and, the true and non-evaluable negatives, conditional on the latent sensitivity, specificity, probability of non-evaluable positives and probability of non-evaluable negatives in each study.

For the random effects distribution, we employ a regular vine copula.¹¹ Regular vine copulas are suitable for high-dimensional data, hence given the low dimension d = 4, where d is the dimension, we use their boundary case namely a drawable vine (D-vine) copula. D-vine copulas have become important in many applications areas such as finance^12,13 and biological sciences,^14,15 to just name a few, in order to deal with dependence in the joint tails. Another boundary case of regular vine copulas is the canonical vine copula, but this parametric family of copulas is only suitable if there exists a (pilot) variable that drives the dependence among the variables,^16,17 which apparently is not the case in this application area.

The remainder of the paper proceeds as follows. Section 2 introduces the multinomial quadrivariate D-vine copula mixed model for meta-analysis of diagnostic studies accounting for non-evaluable results and provides computational details for maximum likelihood (ML) estimation. Section 3 studies the small-sample efficiency and robustness of the ML estimation of the multinomial quadrivariate D-vine copula mixed model. Section 4 applies our methodology to data from a published meta-analysis for diagnostic accuracy studies of coronary computed tomography angiography for the detection of coronary artery disease. We conclude with some discussion in Section 5, followed by a brief section with software details.

2 The multinomial quadrivariate D-vine copula mixed model

In this section, we introduce the multinomial quadrivariate D-vine copula mixed model. In Subsections 2.1 and 2.2, a D-vine copula representation of the random effects distribution with normal and beta margins, respectively, is presented. We complete this section with details on maximum likelihood estimation.

2.1 The multinomial quadrivariate D-vine copula mixed model with normal margins

We first introduce the notation used in this paper. The data are $y_{ijk}, i = 1, \dots, N, j = 0, 1, 2, k = 0, 1$ , where i is an index for the individual studies, j is an index for the test outcome (0: negative; 1: positive; 2: non-evaluable) and k is an index for the disease outcome (0: non-diseased; 1: diseased). The “classic” 2 × 2 table is extended to a 3 × 2 table (Table 1). Each cell in Table 1 provides the cell frequency corresponding to a combination of index test and disease outcomes in study i.

Table 1.

Data from an individual study in a 3 × 2 table.

	Disease (by gold standard)
Test	–	+	Total
–	$y_{i 00}$	$y_{i 01}$	$y_{i 0 +}$
+	$y_{i 10}$	$y_{i 11}$	$y_{i 1 +}$
Non-evaluable	$y_{i 20}$	$y_{i 21}$	$y_{i 2 +}$
Total	$y_{i + 0}$	$y_{i + 1}$	$y_{i + +}$

The diseased subjects have three possible states: false negative, true positive, and non-evaluable positive. The multinomial observation is therefore the number of diseased subjects where the diagnostic test is in each of its states. Hence, we assume that the false negatives $Y_{i 01}$ , the true positives $Y_{i 11}$ , and the non-evaluable positives $Y_{i 21}$ are multinomially distributed given $(X_{1} = x_{1}, X_{3} = x_{3})$ , viz. $\begin{array}{l} (Y_{i 01}, Y_{i 11}, Y_{i 21}) | (X_{1} = x_{1}, X_{3} = x_{3}) \\ \sim M_{3} (y_{i + 1}, 1 - l^{- 1} (x_{1}, x_{3}) - l^{- 1} (x_{3}, x_{1}), l^{- 1} (x_{1}, x_{3}), l^{- 1} (x_{3}, x_{1})) \end{array}$ (1)where (X₁, X₃) is the bivariate latent pair of transformed sensitivity and probability of non-evaluable positives and $l^{- 1} (x_{j}, x_{k}) = \frac{e^{x_{j}}}{1 + e^{x_{j}} + e^{x_{k}}}$ is the inverse multinomial logit link. Note that $M_{T} (n, p_{1}, \dots, p_{T})$ is shorthand notation for the multinomial distribution, where T is the number of cells, n is the number of observations, and $(p_{1}, \dots, p_{T})$ with $p_{1} + \dots + p_{T} = 1$ is the T-dimensional vector of success probabilities.

In a similar manner, the non-diseased subjects have also three possible states: true negative, false positive, and non-evaluable negative. Hence, we assume that the true negatives $Y_{i 00}$ , the false positives $Y_{i 10}$ , and the non-evaluable negatives $Y_{i 20}$ are multinomially distributed given $(X_{2} = x_{2}, X_{4} = x_{4})$ , viz. $\begin{array}{l} (Y_{i 00}, Y_{i 10}, Y_{i 20}) | (X_{2} = x_{2}, X_{4} = x_{4}) \\ \sim M_{3} (y_{i + 0}, l^{- 1} (x_{2}, x_{4}), 1 - l^{- 1} (x_{2}, x_{4}) - l^{- 1} (x_{4}, x_{2}), l^{- 1} (x_{4}, x_{2})) \end{array}$ (2)where (X₂, X₄) is the bivariate latent pair of transformed specificity and probability of non-evaluable negatives.

After defining the within-studies model in equations (1) and (2), we next define the between-studies model. The stochastic representation of the between studies model takes the form $\begin{array}{l} (Φ (X_{1}; l (π_{1}, π_{3}), σ_{1}^{2}), Φ (X_{2}; l (π_{2}, π_{4}), σ_{2}^{2}), \\ Φ (X_{3}; l (π_{3}, π_{1}), σ_{3}^{2}), Φ (X_{4}; l (π_{4}, π_{2}), σ_{4}^{2})) \sim C (\cdot; θ) \end{array}$ (3)where $C (\cdot; θ)$ is a quadrivariate D-vine copula with dependence parameter vector $θ = (θ_{12}, θ_{23}, θ_{34}, θ_{13 | 2}, θ_{24 | 3}, θ_{14 | 23})$ and $Φ (\cdot; μ, σ^{2})$ is the cumulative distribution function (cdf) of the N( $μ, σ^{2}$ ) distribution, and $l (π_{j}, π_{k}) = \log (\frac{π_{j}}{1 - π_{j} - π_{k}})$ is the multinomial logit link. The copula parameter vector $θ$ has parameters of the random effects model and they are separated from the univariate parameters $(π_{j}, σ_{j}), j = 1, \dots, 4$ . The parameters π₁ and π₂ are those of actual interest denoting the meta-analytic parameters for the sensitivity and specificity, while the parameters π₃ and π₄ denote the probabilities of non-evaluable positives and negatives, respectively. The univariate parameters $σ_{1}^{2}, σ_{2}^{2}, σ_{3}^{2}, σ_{4}^{2}$ denote the variabilities of the random effects.

The quadrivariate D-vine copula is built via successive mixing from bivariate pair-copulas on different levels. The pairs at level 1 are $j, j + 1$ , for j = 1, 2, 3, and for level $ℓ$ ( $2 \leq ℓ < 4$ ), the (conditional) pairs are $j, j + ℓ | j + 1, \dots, j + ℓ - 1$ for $j = 1, \dots, 4 - ℓ$ . That is, for the four-dimensional D-vine, the copulas for variables j and $j + ℓ$ given the variables indexed in between capture the conditional dependence.¹³ When all the bivariate pair-copulas are BVN copulas with correlation (copula) parameters $ρ_{12}, ρ_{23}, ρ_{34}$ (1st level) and partial correlation parameters $ρ_{13 | 2}, ρ_{24 | 3}, ρ_{14 | 23}$ (2nd and 3rd level), the resulting distribution is the quadrivariate normal with mean vector $μ = {(l (π_{1}, π_{3}), l (π_{2}, π_{3}), l (π_{3}, π_{1}), l (π_{4}, π_{2}))}^{⊤}$ and variance covariance matrix $\sum = (\begin{matrix} σ_{1}^{2} & ρ_{12} σ_{1} σ_{2} & ρ_{13} σ_{1} σ_{3} & ρ_{14} σ_{1} σ_{4} \\ ρ_{12} σ_{1} σ_{2} & σ_{2}^{2} & ρ_{23} σ_{2} σ_{3} & ρ_{24} σ_{2} σ_{4} \\ ρ_{13} σ_{1} σ_{3} & ρ_{23} σ_{2} σ_{3} & σ_{3}^{2} & ρ_{34} σ_{3} σ_{4} \\ ρ_{14} σ_{1} σ_{4} & ρ_{24} σ_{2} σ_{4} & ρ_{34} σ_{3} σ_{4} & σ_{4}^{2} \end{matrix})$ where $ρ_{13} = ρ_{13 | 2} \sqrt{1 - ρ_{12}^{2}} \sqrt{1 - ρ_{23}^{2}} + ρ_{12} ρ_{23}$ , $ρ_{24} = ρ_{24 | 3} \sqrt{1 - ρ_{23}^{2}} \sqrt{1 - ρ_{34}^{2}} + ρ_{23} ρ_{34}$ , $ρ_{14} = ρ_{14 | 2} \sqrt{1 - ρ_{12}^{2}} \sqrt{1 - ρ_{24}^{2}} + ρ_{12} ρ_{24}$ , $ρ_{14 | 2} = ρ_{14 | 23} \sqrt{1 - ρ_{13 | 2}^{2}} \sqrt{1 - ρ_{34 | 2}^{2}} + ρ_{13 | 2} ρ_{34 | 2}$ , $ρ_{13 | 2} = (ρ_{13} - ρ_{12} ρ_{23}) / \sqrt{1 - ρ_{12}^{2}} / \sqrt{1 - ρ_{23}^{2}}$ and $ρ_{34 | 2} = (ρ_{34} - ρ_{23} ρ_{24}) / \sqrt{1 - ρ_{23}^{2}} / \sqrt{1 - ρ_{24}^{2}}$ .¹⁵ Other choices of copulas are better if there is more dependence in joint upper or lower tail.

The models in equations (1)–(3) together specify a multinomial quadrivariate D-vine copula mixed model with joint likelihood $\begin{array}{l} L (π_{1}, π_{2}, π_{3}, π_{4}, σ_{1}, σ_{2}, σ_{3}, σ_{4}, θ) \\ = \prod_{i = 1}^{N} \int_{- \infty}^{\infty} \int_{- \infty}^{\infty} \int_{- \infty}^{\infty} \int_{- \infty}^{\infty} g (y_{i 11}, y_{i 21}; y_{i + 1}, l^{- 1} (x_{1}, x_{3}), l^{- 1} (x_{3}, x_{1})) \\ \times g (y_{i 00}, y_{i 20}; y_{i + 0}, l^{- 1} (x_{2}, x_{4}), l^{- 1} (x_{4}, x_{2})) f_{1234} (x_{1}, x_{2}, x_{3}, x_{4}; θ) d x_{1} d x_{2} d x_{3} d x_{4} \end{array}$ where $g (; n, p_{1}, \dots, p_{T - 1})$ is the $M_{T} (n, p_{1}, \dots, p_{T})$ pmf and $f_{1234} (\cdot; θ)$ is the quadrivariate D-vine density, viz. $f_{1234} (x_{1}, x_{2}, x_{3}, x_{4}; θ) = ϕ (x_{1}) ϕ (x_{2}) ϕ (x_{3}) ϕ (x_{4}) c_{1234} (Φ (x_{1}), Φ (x_{2}), Φ (x_{3}), Φ (x_{4}); θ)$ (4)with $\begin{array}{l} c_{1234} (Φ (x_{1}), Φ (x_{2}), Φ (x_{3}), Φ (x_{4}); θ) \\ = c_{12} (Φ (x_{1}), Φ (x_{2}); θ_{12}) c_{23} (Φ (x_{2}), Φ (x_{3}); θ_{23}) c_{34} (Φ (x_{3}), Φ (x_{4}); θ_{34}) \\ \times c_{13 | 2} (F_{1 | 2} (x_{1} | x_{2}), F_{3 | 2} (x_{3} | x_{2}); θ_{13 | 2}) c_{24 | 3} (F_{2 | 3} (x_{2} | x_{3}), F_{4 | 3} (x_{4} | x_{3}); θ_{24 | 3}) \\ \times c_{14 | 23} (F_{1 | 23} (x_{1} | x_{2}, x_{3}), F_{4 | 23} (x_{4} | x_{2}, x_{3}); θ_{14 | 23}) \end{array}$ where $ϕ (x)$ and $Φ (x)$ is shorthand notation for the density $ϕ (x; μ, σ^{2})$ and cdf $Φ (x; μ, σ^{2})$ of the $N (μ, σ^{2})$ distribution, $c_{j k}, c_{j k | ℓ}, c_{14 | 23}$ are bivariate copula densities, $F_{j | k} (x_{j} | x_{k}) = \frac{\partial C_{j k} (Φ_{j} (x_{j}), Φ_{k} (x_{k}))}{\partial Φ_{k} (x_{k})}$ , $F_{1 | 23} (x_{1} | x_{2}, x_{3}) = \frac{\partial C_{13 | 2} (F_{1 | 2} (x_{1} | x_{2}), F_{3 | 2} (x_{3} | x_{2}))}{\partial Φ (x_{2})}$ and $F_{4 | 23} (x_{4} | x_{2}, x_{3}) = \frac{\partial C_{24 | 3} (F_{2 | 3} (x_{2} | x_{3}), F_{4 | 3} (x_{4} | x_{3}))}{\partial Φ (x_{3})}$ ; $C_{j k}, C_{j k | ℓ}$ are bivariate copula cdfs. Note that a for a four-dimensional D-vine copula density there are 12 different decompositions.¹² To be concrete in the exposition of the theory, we use the decomposition in equation (4); the theory though also applies to the other 11 decompositions.

Below we transform the original integral into an integral over a unit hypercube using the inversion method. Hence, the joint likelihood becomes $\begin{array}{l} \prod_{i = 1}^{N} \int_{0}^{1} \int_{0}^{1} \int_{0}^{1} \int_{0}^{1} g (y_{i 11}, y_{i 21}; y_{i + 1}, l^{- 1} (Φ^{- 1} (u_{1}, l (π_{1}, π_{3}), σ_{1}^{2}), Φ^{- 1} (u_{3}, l (π_{3}, π_{1}), σ_{3}^{2})), \\ l^{- 1} (Φ^{- 1} (u_{3}, l (π_{3}, π_{1}), σ_{3}^{2}), Φ^{- 1} (u_{1}, l (π_{1}, π_{3}), σ_{1}^{2}))) \\ \times g (y_{i 00}, y_{i 20}; y_{i + 0}, l^{- 1} (Φ^{- 1} (u_{2}, l (π_{2}, π_{4}), σ_{2}^{2}), Φ^{- 1} (u_{4}, l (π_{4}, π_{2}), σ_{4}^{2})), \\ l^{- 1} (Φ^{- 1} (u_{4}, l (π_{4}, π_{2}), σ_{4}^{2}), Φ^{- 1} (u_{2}, l (π_{2}, π_{4}), σ_{2}^{2}))) c_{1234} (u_{1}, u_{2}, u_{3}, u_{4}; θ) d u \end{array}$

2.2 The multinomial D-vine copula mixed model with beta margins

In this section, we use the parametrization proposed by Wilson¹⁸ in order the latent sensitivity and specificity to remain on the original scale. The within-study model takes the form $\begin{array}{l} (Y_{i 01}, Y_{i 11}, Y_{i 21} | (X_{1} = x_{1}, X_{3} = x_{3}) \sim M_{3} (y_{i + 1}, 1 - x_{1} - x_{3} (1 - x_{1}), x_{1}, x_{3} (1 - x_{1})); \\ (Y_{i 00}, Y_{i 10}, Y_{i 20} | (X_{2} = x_{2}, X_{4} = x_{4}) \sim M_{3} (y_{i + 0}, x_{2}, 1 - x_{2} - x_{4} (1 - x_{2}), x_{4} (1 - x_{2})) \end{array}$ (5)

The stochastic representation of the between studies model is $(F (X_{1}; π_{1}, γ_{1}), F (X_{2}; π_{2}, γ_{2}), F (X_{3}; \frac{π_{3}}{1 - π_{1}}, γ_{3}), F (X_{4}; \frac{π_{4}}{1 - π_{2}}, γ_{4})) \sim C (\cdot; θ)$ (6)where $C (\cdot; θ)$ is a D-vine copula with dependence parameter vector $θ$ and $F (\cdot; π, γ)$ is the cdf of the Beta( $π, γ$ ) distribution with π the mean and γ the dispersion parameter. The copula parameter vector $θ$ has the dependence parameters of the random effects model and they are separated from the univariate parameters $(π_{j}, γ_{j}), j = 1, \dots, 4$ . The parameters π₁ and π₂ are those of actual interest denoting the meta-analytic parameters for the sensitivity and specificity, while the parameters π₃ and π₄ denote the probabilities of non-evaluable positives and negatives, respectively. The univariate parameters $γ_{1}, γ_{2}, γ_{3}, γ_{4}$ denote the variabilities of the random effects. In contrast with the model in the preceding subsection, the random effects of sensitivity and specificity are on the original scale.

The models in equations (5) and (6) together specify a vine copula mixed model with joint likelihood $\begin{array}{l} L (π_{1}, π_{2}, π_{3}, π_{4}, γ_{1}, γ_{2}, γ_{3}, γ_{4}, θ) \\ = \prod_{i = 1}^{N} \int_{0}^{1} \int_{0}^{1} \int_{0}^{1} \int_{0}^{1} g (y_{i 11}, y_{i 21}; y_{i + 1}, x_{1}, x_{3} (1 - x_{1})) g (y_{i 00}, y_{i 02}; y_{i + 0}, x_{2}, x_{4} (1 - x_{2})) \\ \times f_{1234} (x_{1}, x_{2}, x_{3}, x_{4}; θ) d x_{1} d x_{2} d x_{3} d x_{4} \end{array}$ where $f_{1234} (\cdot; θ)$ is as in equation (4) where we use beta instead of normal marginal distributions. Below we transform the integral into an integral over a unit hypercube using the inversion method. Hence, the joint likelihood becomes $\begin{array}{l} \prod_{i = 1}^{N} \int_{0}^{1} \int_{0}^{1} \int_{0}^{1} \int_{0}^{1} g (y_{i 11}, y_{i 21}; y_{i + 1}, F^{- 1} (u_{1}; π_{1}, γ_{1}), F^{- 1} (u_{3}; \frac{π_{3}}{1 - π_{1}}, γ_{3}) (1 - F^{- 1} (u_{1}; π_{1}, γ_{1}))) \\ \times g (y_{i 00}, y_{i 20}; y_{i + 0}, F^{- 1} (u_{2}; π_{2}, γ_{2}), F^{- 1} (u_{4}; \frac{π_{4}}{1 - π_{2}}, γ_{4}) (1 - F^{- 1} (u_{2}; π_{2}, γ_{2}))) \\ \times c_{1234} (u_{1}, u_{2}, u_{3}, u_{4}; θ) d u_{1} d u_{2} d u_{3} d u_{4} \end{array}$

2.3 Maximum likelihood estimation and computational details

Estimation of the model parameters can be approached by the standard maximum likelihood (ML) method, by maximizing the logarithm of the joint likelihood. The estimated parameters can be obtained by using a quasi-Newton¹⁹ method applied to the logarithm of the joint likelihood. This numerical method requires only the objective function, i.e. the logarithm of the joint likelihood, while the gradients are computed numerically and the Hessian matrix of the second-order derivatives is updated in each iteration. The standard errors (SEs) of the ML estimates can be also obtained via the gradients and the Hessian computed numerically during the maximization process.

For the multinomial quadrivariate D-vine copula mixed model, numerical evaluation of the joint pmf can be achieved with the following steps:

Calculate Gauss-Legendre²⁰ quadrature points ${u_{q} : q = 1, \dots, n_{q}}$ and weights ${w_{q} : q = 1, \dots, n_{q}}$ in terms of standard uniform.

Convert from independent uniform random variables ${u_{q_{1}} : q_{1} = 1, \dots, n_{q}}, {u_{q_{2}} : q_{2} = 1, \dots, n_{q}}, {u_{q_{3}} : q_{3} = 1, \dots, n_{q}}$ , and ${u_{q_{4}} : q_{4} = 1, \dots, n_{q}}$ to dependent uniform random variables $v_{q_{1}}, v_{q_{2} | q_{1}}, v_{q_{3} | q_{1}; q_{2}}$ , and $v_{q_{4} | q_{1}; q_{2}, q_{3}}$ that have a D-vine distribution $C (\cdot; θ)$ using the algorithm in Nikoloulopoulos¹⁵:

Set $v_{q_{1}} = u_{q_{1}}$

$v_{q_{2} | q_{1}} = C_{2 | 1}^{- 1} (u_{q_{2}} | u_{q_{1}}; θ_{12})$

$t_{1} = C_{1 | 2} (v_{q_{1}} | v_{q_{2} | q_{1}}; θ_{12})$

$t_{2} = C_{3 | 1; 2}^{- 1} (u_{q_{3}} | t_{1}; θ_{12}), θ_{13 | 2})$

$v_{q_{3} | q_{1}; q_{2}} = C_{3 | 2}^{- 1} (t_{2} | v_{q_{2} | q_{1}}; θ_{23})$

$t_{3} = C_{2 | 3} (v_{q_{2} | q_{1}} | v_{q_{3} | q_{1}; q_{2}}; θ_{23})$

$t_{4} = C_{1 | 3; 2} (t_{1} | t_{2}; θ_{13 | 2})$

$t_{5} = C_{4 | 1; 2, 3} (u_{q_{4}} | t_{4}; θ_{14 | 23})$

$t_{6} = C_{4 | 2; 3}^{- 1} (t_{5} | t_{3}; θ_{24 | 3})$

$v_{q_{4} | q_{1}; q_{2}, q_{3}} = C_{4 | 3}^{- 1} (t_{6} | v_{q_{3} | q_{1}; q_{2}}; θ_{34})$

where $C (v | u; θ)$ and $C^{- 1} (v | u; θ)$ are conditional copula cdfs and their inverses.

3. Numerically evaluate the joint pmf, e.g. $\begin{array}{l} \prod_{i = 1}^{N} \int_{0}^{1} \int_{0}^{1} \int_{0}^{1} \int_{0}^{1} g (y_{i 11}, y_{i 21}; y_{i + 1}, F^{- 1} (u_{1}; π_{1}, γ_{1}), F^{- 1} (u_{3}; \frac{π_{3}}{1 - π_{1}}, γ_{3}) (1 - F^{- 1} (u_{1}; π_{1}, γ_{1}))) \\ \times g (y_{i 00}, y_{i 20}; y_{i + 0}, F^{- 1} (u_{2}; π_{2}, γ_{2}), F^{- 1} (u_{4}; \frac{π_{4}}{1 - π_{2}}, γ_{4}) (1 - F^{- 1} (u_{2}; π_{2}, γ_{2}))) \\ \times c_{1234} (u_{1}, u_{2}, u_{3}, u_{4}; θ) d u_{1} d u_{2} d u_{3} d u_{4} \end{array}$

in a quadruple sum $\begin{array}{l} \sum_{q_{1} = 1}^{n_{q}} \sum_{q_{2} = 1}^{n_{q}} \sum_{q_{3} = 1}^{n_{q}} \sum_{q_{4} = 1}^{n_{q}} w_{q_{1}} w_{q_{2}} w_{q_{3}} w_{q_{4}} g (y_{i 11}, y_{i 21}; y_{i + 1}, F^{- 1} (v_{q_{1}}; π_{1}, γ_{1}), \\ F^{- 1} (v_{q_{3} | q_{1}; q_{2}}; \frac{π_{3}}{1 - π_{1}}, γ_{3}) (1 - F^{- 1} (v_{q_{1}}; π_{1}, γ_{1}))) g (y_{i 00}, y_{i 20}; y_{i + 0}, \\ F^{- 1} (v_{q_{2} | q_{1}}; π_{2}, γ_{2}), F^{- 1} (v_{q_{4} | q_{1}; q_{2}, q_{3}}; \frac{π_{4}}{1 - π_{2}}, γ_{4}) (1 - F^{- 1} (v_{q_{2} | q_{1}}; π_{2}, γ_{2}))) \end{array}$

With Gauss-Legendre quadrature, the same nodes and weights are used for different functions; this helps in yielding smooth numerical derivatives for numerical optimization via quasi-Newton.

3 Simulations

In this section, we study the small-sample efficiency and robustness of the ML estimation of the multinomial quadrivariate D-vine copula mixed model. In Section 3.1, we gauge the small-sample efficiency of the ML method and investigate the misspecification of the parametric margin or bivariate pair-copulas of the random effects distribution. In Section 3.2, we investigate the mixed model misspecification by using both the proposed model and the extended trivariate vine copula mixed model⁶ as true models.

We set the sample size and the true univariate and dependence parameters to mimic the data analyzed in Section 4. In each model, we use six different linking copula families: normal, Frank, and Clayton copula along with its rotated versions (see our previous papers on copula mixed models^8,10,21,22 for definitions) to cover different types of dependence structure. To make it easier to compare strengths of dependence, we convert the BVN, Frank, and rotated Clayton estimated parameters to Kendall’s τ’s in $(- 1, 1)$ via the following relations^23,24 $\begin{array}{l} τ = \frac{2}{π} \arcsin (θ) \\ τ = {\begin{matrix} 1 - 4 θ^{- 1} - 4 θ^{- 2} \int_{θ}^{0} \frac{t}{e^{t} - 1} d t, & θ < 0 \\ 1 - 4 θ^{- 1} + 4 θ^{- 2} \int_{0}^{θ} \frac{t}{e^{t} - 1} d t, & θ > 0 \end{matrix} \end{array}$ and²⁵ $τ = {\begin{matrix} θ / (θ + 2), & by 0 ° or 180 ° \\ - θ / (θ + 2), & by 90 ° or 270 ° \end{matrix}$

3.1 Small-sample efficiency–misspecification of the random effects distribution

We randomly generate samples of size N = 30 from the multinomial quadrivariate D-vine copula mixed model with both normal and beta margins. The simulation process is as below:

1. Simulate $(u_{1}, u_{2}, u_{3}, u_{4})$ from a D-vine distribution $C (\cdot; τ_{12}, τ_{23}, τ_{34}, τ_{13 | 2} = 0, τ_{24 | 3} = 0, τ_{14 | 23} = 0)$ .

2. • Convert to normal realizations via $\begin{array}{l} x_{1} = Φ^{- 1} (u_{1}; \log \frac{π_{1}}{1 - π_{1} - π_{3}}, σ_{1}) & x_{2} = Φ^{- 1} (u_{2}; \log \frac{π_{2}}{1 - π_{2} - π_{4}}, σ_{2}) \\ x_{3} = Φ^{- 1} (u_{3}; \log \frac{π_{3}}{1 - π_{1} - π_{3}}, σ_{3}) & x_{4} = Φ^{- 1} (u_{4}; \log \frac{π_{4}}{1 - π_{2} - π_{4}}, σ_{4}) \end{array}$

• Convert to beta realizations via $\begin{array}{l} x_{1} = F^{- 1} (u_{1}; π_{1}, γ_{1}) & x_{2} = F^{- 1} (u_{2}; π_{2}, γ_{2}) \\ x_{3} = F^{- 1} (u_{3}; \frac{π_{3}}{1 - π_{1}}, σ_{1}) & x_{4} = F^{- 1} (u_{4}; \log \frac{π_{4}}{1 - π_{2}}, γ_{4}) \end{array}$

3. Simulate the size of diseased and non-diseased subjects n₁ and n₂, respectively, from a shifted gamma distribution to obtain heterogeneous study sizes,²⁶ i.e. $\begin{array}{l} n_{1} \sim sGamma (α = 1.2, β = 0.01, lag = 30) \\ n_{2} \sim sGamma (α = 1.2, β = 0.01, lag = 30) \end{array}$

and round off n₁ and n₂ to the nearest integers.

4. • For normal margins, draw $(y_{01}, y_{11}, y_{21})$ from $M_{3} (n_{1}, \frac{1}{1 + e^{x_{1}} + e^{x_{3}}}, \frac{e^{x_{1}}}{1 + e^{x_{1}} + e^{x_{3}}}, \frac{e^{x_{3}}}{1 + e^{x_{1}} + e^{x_{3}}})$ and $(y_{00}, y_{10}, y_{20})$ from $M_{3} (n_{2}, \frac{e^{x_{2}}}{1 + e^{x_{2}} + e^{x_{4}}}, \frac{1}{1 + e^{x_{2}} + e^{x_{4}}}, \frac{e^{x_{4}}}{1 + e^{x_{2}} + e^{x_{4}}})$

• For beta margins, draw $(y_{01}, y_{11}, y_{21})$ from $M_{3} (n_{1}, 1 - x_{1} - x_{3} (1 - x_{1}), x_{1}, x_{3} (1 - x_{1}))$

and $(y_{00}, y_{10}, y_{20})$ from $M_{3} (n_{2}, x_{2}, 1 - x_{2} - x_{4} (1 - x_{2}), x_{4} (1 - x_{2}))$

Tables 2 and 3 contain the resultant biases, root mean square errors (RMSE), and standard deviations (SD), along with the square root of the average theoretical variances ( $\sqrt{\bar{V}}$ ), scaled by 100, for the ML estimates under different pair-copulas and marginal choices from the multinomial D-vine copula mixed model with beta and normal margins, respectively. The true (simulated) pair-copula distributions are the Clayton copulas rotated by $180 °$ for both the $C_{12} (; τ_{12})$ and $C_{34} (; τ_{34})$ pair-copulas and the Clayton copula rotated by $90 °$ for the $C_{23} (; τ_{23})$ pair-copula.

Table 2.

Small sample of sizes N = 30 simulations (10³ replications; n_q = 15) from the multinomial quadrivariate D-vine copula mixed model with beta margins and resultant biases, root mean square errors (RMSE) and standard deviations (SD), along with the square root of the average theoretical variances ( $\sqrt{\bar{V}}$ ), scaled by 100, for the ML estimates under different pair-copula choices and margins.

	Margin	Copula	$π_{1}$ = 0.90	$π_{2} =$ 0.77	$π_{3}$ = 0.06	$π_{4}$ = 0.11	$γ_{1}$ = 0.09	$γ_{2}$ = 0.08	$γ_{3}$ = 0.37	$γ_{4}$ = 0.15	$τ_{12}$ = 0.82	$τ_{23}$ = –0.52	$τ_{34}$ = 0.26
Bias	Normal	BVN	4.20	3.49	–1.97	–1.91	–	–	–	–	–22.37	36.27	16.07
	Beta		–0.08	–0.03	0.38	0.03	–0.10	–0.21	–4.81	–0.12	–5.01	6.21	1.97
	Normal	Frank	4.24	3.68	–1.96	–1.86	–	–	–	–	–21.28	34.84	15.74
	Beta		0.21	0.43	0.11	–0.18	–0.01	–0.17	–4.25	–0.09	–2.58	5.14	2.00
	Normal	Cln{180°, 90°}	4.20	3.37	–2.00	–1.84	–	–	–	–	–15.90	30.22	15.57
	Beta^a		–0.21	–0.16	0.31	0.11	–0.17	–0.28	–1.75	–0.52	0.60	0.71	1.37
	Normal	Cln{0°, 270°}	4.14	3.52	–1.90	–1.85	–	–	–	–	–30.10	38.76	12.30
	Beta		–0.08	0.11	0.53	0.02	0.82	0.49	–6.33	0.14	–3.62	15.15	–2.62
SD	Normal	BVN	1.84	2.68	1.59	1.74	24.50	14.06	28.18	17.58	24.52	27.58	25.93
	Beta		1.95	2.53	1.71	1.67	2.97	2.28	8.29	4.35	10.26	14.27	17.16
	Normal	Frank	1.89	2.74	1.65	1.81	24.70	14.04	28.44	17.80	24.90	28.57	26.45
	Beta		1.84	2.37	1.61	1.58	3.00	2.22	8.53	4.34	8.02	14.71	17.31
	Normal	Cln{180°, 90°}	1.88	2.67	1.62	1.73	23.97	13.53	27.78	17.86	23.67	25.46	21.90
	Beta^a		1.98	2.52	1.68	1.67	2.85	2.15	8.89	4.28	9.18	14.65	15.85
	Normal	Cln{0°, 270°}	1.88	2.76	1.62	1.79	26.28	15.89	30.75	18.60	33.78	28.34	30.21
	Beta		1.98	2.63	1.74	1.71	3.59	2.83	9.05	4.54	16.13	16.23	19.53
$\sqrt{\bar{V}}$	Normal	BVN	1.38	2.39	1.17	1.66	16.86	10.85	25.40	14.73	15.55	15.62	15.66
	Beta		1.34	1.99	1.21	1.46	1.97	1.82	7.92	4.06	9.04	13.14	14.88
	Normal	Frank	1.31	2.28	1.12	1.62	16.21	10.76	24.94	14.66	13.13	13.75	14.50
	Beta		1.18	1.85	1.10	1.36	1.84	1.94	8.25	4.05	7.84	13.07	15.20
	Normal	Cln{180°, 90°}	1.36	2.34	1.15	1.63	16.49	10.37	24.44	14.14	13.51	15.53	13.77
	Beta^a		1.33	1.99	1.21	1.44	1.92	1.83	8.00	3.97	8.08	13.45	14.33
	Normal	Cln{0°, 270°}	1.38	2.40	1.18	1.66	16.04	10.92	27.34	14.84	13.47	12.44	16.15
	Beta		1.22	1.85	1.10	1.36	2.10	1.94	7.84	4.14	10.83	12.75	16.73
RMSE	Normal	BVN	4.59	4.40	2.53	2.58	–	–	–	–	33.19	45.56	30.51
	Beta		1.95	2.53	1.75	1.67	2.97	2.28	9.58	4.35	11.42	15.56	17.28
	Normal	Frank	4.64	4.59	2.57	2.59	–	–	–	–	32.75	45.05	30.78
	Beta		1.85	2.41	1.61	1.59	3.00	2.23	9.53	4.35	8.43	15.58	17.42
	Normal	Cln{180°, 90°}	4.60	4.30	2.58	2.53	–	–	–	–	28.52	39.52	26.87
	Beta^a		1.99	2.52	1.70	1.67	2.85	2.17	9.06	4.31	9.20	14.67	15.91
	Normal	Cln{0°, 270°}	4.55	4.47	2.50	2.58	–	–	–	–	45.24	48.01	32.62
	Beta		1.98	2.63	1.81	1.71	3.69	2.87	11.04	4.54	16.54	22.20	19.70

Note: Cln{ $ω_{1}^{°}, ω_{2}^{°}$ }: The $C_{12} (\cdot; τ_{12}), C_{34} (\cdot; τ_{34})$ and $C_{23} (\cdot; τ_{23})$ pair-copulas are Clayton rotated by ω₁ and ω₂ degrees, respectively.

BVN: bivariate normal

True model.

Table 3.

Small sample of sizes N = 30 simulations (10³ replications; n_q = 15) from the multinomial quadrivariate D-vine copula mixed model with normal margins and resultant biases, root mean square errors (RMSE) and standard deviations (SD), along with the square root of the average theoretical variances ( $\sqrt{\bar{V}}$ ), scaled by 100, for the ML estimates under different pair-copula choices and margins.

	Margin	Copula	$π_{1}$ = 0.94	$π_{2}$ = 0.79	$π_{3}$ = 0.03	$π_{4}$ = 0.09	$σ_{1}$ = 0.75	$σ_{2}$ = 0.65	$σ_{3}$ = 1.20	$σ_{4}$ = 0.69	$τ_{12}$ = 0.82	$τ_{23}$ = –0.38	$τ_{34}$ = 0.29
Bias	Normal	BVN	–0.64	–0.33	0.61	0.25	0.99	–1.22	–5.03	–0.88	–6.98	4.30	5.50
	Beta		–6.16	–4.21	4.08	2.29	–	–	–	–	–15.26	–13.26	12.79
	Normal	Frank	–0.63	–0.17	0.61	0.22	0.82	–1.05	–5.73	–0.86	–6.67	2.53	5.45
	Beta		–5.97	–3.96	3.96	2.25	–	–	–	–	–12.55	–14.92	12.18
	Normal^a	Cln{180°, 90°}	–0.63	–0.44	0.57	0.33	–1.13	–1.96	–2.71	–0.97	–1.54	–2.42	2.31
	Beta		–6.37	–4.42	4.10	2.50	–	–	–	–	–10.10	–19.62	9.71
	Normal	Cln{0°, 270°}	–0.72	–0.24	0.71	0.24	3.57	1.36	–3.63	–0.46	–4.08	11.78	4.52
	Beta		–6.20	–4.25	4.23	2.37	–	–	–	–	–21.47	–6.61	10.91
SD	Normal	BVN	2.12	2.75	1.83	1.84	18.29	11.62	23.06	14.40	17.54	17.42	19.35
	Beta		2.99	2.94	2.31	1.94	5.26	3.00	6.51	3.42	10.98	18.63	22.53
	Normal	Frank	2.20	2.80	1.91	1.88	17.92	11.60	23.42	14.50	14.46	18.54	20.16
	Beta		2.97	3.00	2.35	2.00	5.23	3.17	6.71	3.43	10.68	19.30	22.22
	Normal^a	Cln{180°, 90°}	2.14	2.77	1.84	1.86	17.74	11.44	22.79	14.36	15.47	19.08	16.82
	Beta		3.06	3.03	2.34	2.01	5.16	3.25	7.07	3.50	11.46	20.35	21.01
	Normal	Cln{0°, 270°}	2.15	2.81	1.85	1.86	19.92	13.08	24.72	15.13	22.16	19.47	24.34
	Beta		2.99	3.02	2.33	1.98	5.73	3.39	6.60	3.46	16.13	21.83	30.25
$\sqrt{\bar{V}}$	Normal	BVN	1.43	2.45	1.19	1.62	15.81	10.23	22.66	12.43	18.18	15.88	15.91
	Beta		1.35	2.10	1.17	1.45	2.04	2.11	6.09	3.10	8.09	14.89	17.57
	Normal	Frank	1.33	2.30	1.11	1.55	15.53	10.13	22.28	12.37	11.75	15.14	16.00
	Beta		1.28	2.07	1.13	1.43	2.01	2.29	6.35	3.10	7.70	15.89	17.29
	Normal^a	Cln{180°, 90°}	1.41	2.38	1.18	1.59	14.92	9.88	21.71	12.04	14.06	16.53	14.14
	Beta		1.31	2.14	1.17	1.45	1.97	2.29	6.58	3.13	7.93	14.92	16.86
	Normal	Cln{0°, 270°}	1.39	2.41	1.17	1.60	16.20	10.56	23.22	12.61	18.50	15.09	18.85
	Beta		1.26	1.95	1.08	1.34	2.20	2.09	5.72	3.15	8.89	18.63	20.45
RMSE	Normal	BVN	2.22	2.77	1.93	1.86	18.32	11.68	23.60	14.42	18.88	17.94	20.12
	Beta		6.85	5.13	4.69	3.00	–	–	–	–	18.80	22.87	25.91
	Normal	Frank	2.29	2.81	2.00	1.89	17.93	11.65	24.11	14.53	15.93	18.71	20.88
	Beta		6.67	4.96	4.61	3.01	–	–	–	–	16.48	24.39	25.34
	Normal^a	Cln{180°, 90°}	2.23	2.80	1.93	1.89	17.77	11.61	22.95	14.39	15.55	19.24	16.98
	Beta		7.07	5.36	4.72	3.21	–	–	–	–	15.28	28.27	23.14
	Normal	Cln{0°, 270°}	2.27	2.82	1.98	1.88	20.24	13.15	24.98	15.13	22.53	22.75	24.76
	Beta		6.88	5.22	4.83	3.09	–	–	–	–	26.85	22.81	32.15

BVN: bivariate normal.

True model.

Conclusions from the values in the tables are the following:

ML with the true multinomial D-vine copula mixed model is highly efficient according to the simulated biases, SDs and RMSEs.

The ML estimates of the univariate meta-analytic parameters and their SDs are robust under copula misspecification, but are not robust to margin misspecification.

The ML estimates of τ’s and their SDs are robust to copula misspecification, but they are not robust to margin misspecification.

3.2 Misspecification of the copula mixed model that accounts for non-evaluable outcomes

We randomly generate samples of size N = 30 from the multinomial quadrivariate D-vine copula mixed model and the extended trivariate vine copula mixed model with both normal and beta margins using the algorithm in Section 3.1 and in Nikoloulopoulos,⁶ respectively. We compare the ML estimates of common parameters for both approaches under misspecification and also include in the comparison the bivariate copula mixed model estimates where the non-evaluable positives and negatives are either excluded or included as false negatives and false positives (intention to diagnose approach), respectively.

In Section 3.1 and in Nikoloulopoulos,⁶ it has been revealed that (a) the estimation of the univariate meta-analytic parameters is a univariate inference, and hence it is the univariate marginal distribution that matters and not the type of the copula, and (b) estimated Kendall’s τ is similar among different families of copulas. Hence, as the ML estimates are nearly not affected by the type of the pair-copula, we provide here the results when all the bivariate copulas are BVN.

Tables 4 and 5 contain the resultant biases, RMSEs, and SDs, along with the square root of the average theoretical variances ( $\sqrt{\bar{V}}$ ), scaled by 100, for the ML estimates under different copula mixed models. The true quadrivariate multinomial vine copula mixed model is composed by the Clayton copulas rotated by $180^{°}$ for both the $C_{12} (; τ_{12})$ and $C_{34} (; τ_{34})$ pair-copulas and the Clayton copula rotated by $90^{°}$ for the $C_{23} (; τ_{23})$ pair-copula. The true trivariate vine copula mixed model is composed by the Clayton copula for $C_{12} (; τ_{12})$ and the Clayton rotated by $90^{°}$ for both the $C_{13} (; τ_{13})$ and $C_{23 | 1} (; τ_{23 | 1})$ pair-copulas.

Table 4.

Small sample of sizes N = 30 simulations (10³ replications; n_q = 15) from the multinomial quadrivariate D-vine and trivariate vine copula mixed models with beta margins and resultant biases, root mean square errors (RMSE) and standard deviations (SD), along with the square root of the average theoretical variances ( $\sqrt{\bar{V}}$ ), scaled by 100, for the ML estimates under different copula mixed models.

			True vine copula mixed model
			Trivariate					Quadrivariate
	Fitted copulamixed model	Margin	$π_{1}$ = 0.97	$π_{2}$ = 0.85	$γ_{1}$ = 0.03	$γ_{2}$ = 0.06	$τ$ = 0.39	$π_{1}$ = 0.90	$π_{2}$ = 0.77	$γ_{1}$ = 0.09	$γ_{2}$ = 0.08	$τ$ = 0.82
Bias	Bivariate^a	Beta	0.04	0.22	–0.11	–0.15	11.37	7.10	9.63	–5.84	–2.25	–42.16
		Normal^b	0.91	2.38	–	–	14.24	8.26	11.78	–	–	–40.39
	Bivariate^c	Beta	–3.18	–5.46	1.70	0.14	–2.54	–0.08	–0.03	–0.08	–0.22	–4.75
		Normal^b	–1.59	–3.50	–	–	–1.26	2.79	1.47	–	–	–2.35
	Trivariate	Beta	–0.03	–0.09	–0.10	–0.06	8.97	7.10	9.60	–5.81	–2.24	–42.47
		Normal^d	0.86	2.13	–	–	10.89	8.25	11.76	–	–	–40.61
	Quadrivariate	Beta	–3.18	–5.46	1.71	0.13	–1.63	–0.08	–0.03	–0.10	–0.21	–5.01
		Normal	–0.46	–1.10	–	–	21.73	4.20	3.49	–	–	–22.37
SD	Bivariate^a	Beta	0.64	1.91	1.29	2.00	24.08	0.81	1.83	1.91	1.99	17.07
		Normal^b	0.57	1.90	23.49	14.40	25.41	0.49	1.72	23.15	13.93	17.40
	Bivariate^c	beta	0.77	1.85	1.58	1.92	17.45	1.94	2.54	2.94	2.22	10.03
		Normal^b	0.80	1.94	19.38	12.22	18.29	1.87	2.86	25.11	14.07	9.06
	Trivariate	Beta	0.66	1.89	1.27	2.02	22.71	0.81	1.83	1.93	1.99	16.92
		Normal^d	0.58	1.88	23.45	14.46	23.79	0.48	1.72	23.51	14.10	17.35
	Quadrivariate	Beta	0.77	1.87	1.57	1.93	18.20	1.95	2.53	2.97	2.28	10.25
		Normal	0.68	1.91	23.35	13.83	23.92	1.84	2.68	24.50	14.06	24.90
$\sqrt{\bar{V}}$	Bivariate^a	Beta	0.63	1.80	1.34	2.02	27.90	0.60	1.61	1.21	1.69	15.20
		Normal^b	0.53	1.73	24.78	14.05	26.16	0.45	1.52	19.40	11.91	15.35
	Bivariate^c	beta	1.08	2.09	1.78	2.00	16.39	1.31	1.97	1.93	1.79	8.79
		Normal^b	0.93	2.10	19.55	11.99	17.09	1.27	2.04	16.09	10.47	8.03
	Trivariate	Beta	0.66	1.86	1.37	2.06	25.75	0.60	1.60	1.20	1.68	15.06
		Normal^d	0.54	1.77	24.03	14.24	22.36	0.45	1.52	19.13	11.82	15.12
	Quadrivariate	Beta	1.09	2.09	1.78	2.00	17.08	1.34	1.99	1.97	1.82	9.04
		Normal	0.81	2.01	24.81	13.61	22.07	1.38	2.39	16.86	10.85	15.55
RMSE	Bivariate^a	Beta	0.64	1.92	1.29	2.00	26.63	7.15	9.80	6.14	3.00	45.48
		Normal^b	1.08	3.04	–	–	29.13	8.28	11.91	–	–	43.98
	Bivariate^c	Beta	3.27	5.77	2.32	1.92	17.64	1.94	2.54	2.94	2.23	11.10
		Normal^b	1.78	4.01	–	–	18.34	3.36	3.21	–	–	9.36
	Trivariate	Beta	0.67	1.89	1.28	2.02	24.42	7.15	9.77	6.12	3.00	45.72
		Normal^d	1.04	2.84	–	–	26.16	8.27	11.88	–	–	44.16
	Quadrivariate	Beta	3.27	5.77	2.32	1.94	18.27	1.95	2.53	2.97	2.28	11.42
		Normal	0.82	2.21	–	–	32.32	4.59	4.40	–	–	33.19

The non-evaluable outcomes are excluded

The resulting model is the same as the BGLMM.

The non-evaluable positives and negatives are included as false negatives and positives, respectively.

The resulting model is the same as the extended TGLMM.

Table 5.

Small sample of sizes N = 30 simulations (10³ replications; n_q = 15) from the multinomial quadrivariate D-vine and trivariate vine copula mixed models with normal margins and resultant biases, root mean square errors (RMSE) and standard deviations (SD), along with the square root of the average theoretical variances ( $\sqrt{\bar{V}}$ ), scaled by 100, for the ML estimates under different copula mixed models.

			True vine copula mixed model
			Trivariate					Quadrivariate
	Fitted copulamixed model	Margin	$π_{1}$ = 0.98	$π_{2}$ = 0.88	$σ_{1}$ = 0.90	$σ_{2}$ = 0.73	$τ$ = 0.39	$π_{1}$ = 0.94	$π_{2}$ = 0.79	$σ_{1}$ = 0.75	$σ_{2}$ = 0.65	$τ$ = 0.82
Bias	Bivariate^a	Beta	–0.85	–1.91	–	–	13.57	2.69	6.59	–	–	–4.63
		Normal^b	–0.03	0.17	–6.52	–1.39	15.81	3.35	8.18	–1.80	–1.96	–4.69
	Bivariate^c	Beta	–4.04	–7.58	–	–	–1.43	–5.97	–3.98	–	–	–17.33
		Normal^b	–2.46	–5.61	–0.79	–6.12	–0.35	–1.75	–1.93	63.85	17.11	–14.39
	Trivariate	Beta	–0.92	–2.19	–	–	10.97	2.63	6.45	–	–	–7.20
		Normal^d	–0.08	–0.07	–6.28	–1.15	12.26	3.31	8.09	–1.64	–2.09	–7.31
	Quadrivariate	Beta	–4.04	–7.57	–	–	–0.13	–6.16	–4.21	–	–	–15.26
		Normal	–1.39	–3.28	–8.23	–3.69	23.31	–0.64	–0.33	0.99	–1.22	–6.98
SD	Bivariate^a	Beta	0.69	1.94	1.43	2.13	25.07	0.54	1.61	0.80	1.56	17.37
		Normal^b	0.52	1.75	24.43	13.65	26.48	0.44	1.48	17.34	11.50	16.85
	Bivariate^c	Beta	0.78	1.83	1.54	1.94	17.78	2.95	2.86	5.50	3.08	10.51
		Normal^b	0.76	1.84	18.23	11.72	18.50	2.11	2.96	25.41	13.97	9.93
	Trivariate	Beta	0.72	1.92	1.43	2.15	23.29	0.60	1.69	0.86	1.55	17.14
		Normal^d	0.54	1.73	24.00	13.63	24.34	0.47	1.55	17.87	11.40	16.62
	Quadrivariate	Beta	0.79	1.84	1.53	1.96	18.77	2.99	2.94	5.26	3.00	10.98
		Normal	0.66	1.78	24.11	13.34	25.91	2.12	2.75	18.29	11.62	17.54
$\sqrt{\bar{V}}$	Bivariate^a	Beta	0.59	1.75	1.18	1.93	29.79	0.50	1.48	0.71	1.38	19.52
		Normal^b	0.50	1.67	23.60	13.40	30.27	0.45	1.45	16.23	10.60	23.19
	Bivariate^c	Beta	1.06	2.06	1.71	1.97	16.68	1.33	2.10	2.01	2.13	8.30
		Normal^b	0.91	2.07	18.92	11.82	16.94	1.34	2.22	15.15	11.02	8.07
	Trivariate	Beta	0.61	1.78	1.20	1.94	25.49	0.51	1.48	0.72	1.37	18.94
		Normal^d	0.51	1.70	22.86	13.25	22.97	0.45	1.45	15.63	10.37	19.41
	Quadrivariate	Beta	1.06	2.07	1.70	1.98	16.83	1.35	2.10	2.04	2.11	8.09
		Normal	0.79	1.96	22.00	12.84	24.51	1.43	2.45	15.81	10.23	18.18
RMSE	Bivariate^a	Beta	1.10	2.72	–	–	28.51	2.74	6.79	–	–	17.98
		Normal^b	0.52	1.75	25.28	13.72	30.84	3.38	8.31	17.43	11.66	17.49
	Bivariate^c	Beta	4.11	7.80	–	–	17.83	6.66	4.90	–	–	20.27
		Normal^b	2.58	5.91	18.25	13.22	18.50	2.74	3.53	68.73	22.09	17.49
	Trivariate	Beta	1.17	2.91	–	–	25.74	2.70	6.67	–	–	18.59
		Normal^d	0.55	1.73	24.81	13.67	27.25	3.35	8.23	17.94	11.59	18.16
	Quadrivariate	Beta	4.12	7.79	–	–	18.77	6.85	5.13	–	–	18.80
		Normal	1.54	3.73	25.47	13.84	34.85	2.22	2.77	18.32	11.68	18.88

The non-evaluable outcomes are excluded.

The resulting model is the same as the BGLMM.

The non-evaluable positives and negatives are included as false negatives and positives, respectively.

The resulting model is the same as the extended TGLMM.

Conclusions from the values in the tables are the following:

The bivariate copula mixed model where the non-evaluable outcomes are disregarded and the extended trivariate vine copula mixed model showed similar performance. Both led to unbiased (biased) and efficient (inefficient) estimates when the true model is the trivariate (quadrivariate multinomial) vine copula mixed model.

The bivariate copula mixed model where the non-evaluable positives and negatives included as false negatives and false positives, respectively, and the multinomial D-vine copula mixed model with beta margins showed similar performance. Both led to unbiased (biased) and efficient (inefficient) estimates when the true model is the quadrivariate multinomial vine copula mixed model with beta margins (trivariate vine copula mixed model or quadrivariate multinomial vine copula mixed model with normal margins).

4 Meta-analysis of coronary computed tomography angiography studies

We apply the multinomial quadrivariate D-vine copula mixed model for the meta-analysis of diagnostic accuracy studies accounting for non-evaluable subjects to data on 30 studies from a systematic review for diagnostic accuracy studies of coronary computed tomography angiography for the detection of coronary artery disease.²⁷

We fit the multinomial quadrivariate D-vine copula mixed model for all different decompositions of the D-vine copula density, for both beta and normal margins and different pair-copulas at the level 1; for levels 2 and 3, we use BVN copulas. In cases when fitting the multinomial quadrivariate D-vine copula mixed model, the resultant estimate of one of the conditional dependence parameters was close to the right boundary of its parameter space (that is clear indication that the model with a full structure provides more dependence structure than it is actually required⁸), we used a truncated model, i.e. we captured the strongest dependence in the first tree and then just used the independence copulas in lower order trees, i.e. conditional independence. Joe et al.²⁸ showed that in order for a vine copula to have (tail) dependence for all bivariate margins, it is only necessary for the bivariate copulas in level 1 to have (tail) dependence and it is not necessary for the conditional bivariate copulas in levels 2 and 3, to have tail dependence. Hence, one can either use BVN or independence copulas at levels 2 and 3 without sacrificing the tail dependence of the vine copula distribution.

In Table 6, we present the results from the decomposition of the vine copula density in equation (4), as a different decompositions led to similar results due to the small sample size. This is consistent with our previous studies on vine copula mixed models.^6,8 Since the number of parameters is not the same between the models, we use the Akaike information criterion (AIC), that is −2× log-likelihood +2× (#model parameters) as a rough diagnostic measure for goodness of fit between the models. The AICs showed that a (truncated) multinomial quadrivariate D-vine copula mixed model with Clayton copulas rotated by $180 °$ for both the $C_{12} (; τ_{12})$ and $C_{34} (; τ_{34})$ pair-copulas and the Clayton copula rotated by $90 °$ for the $C_{23} (; τ_{23})$ pair-copula and beta margins (Table 6) provides the best fit.

Table 6.

AICs, ML estimates, and standard errors (SE) of the multinomial quadrivariate D-vine copula mixed models for diagnostic accuracy studies of coronary computed tomography angiography.

	BVN		Frank		Cln{180°, 90°}^a		Cln{180°, 270°}
	Est.	SE	Est.	SE	Est.	SE	Est.	SE
Normal margins
π₁	0.94	0.01	0.95	0.01	0.94	0.02	0.94	0.02
π₂	0.80	0.03	0.80	0.03	0.79	0.03	0.79	0.03
π₃	0.04	0.01	0.03	0.01	0.03	0.01	0.04	0.01
π₄	0.09	0.02	0.09	0.02	0.09	0.02	0.09	0.02
σ₁	0.89	0.20	0.91	0.19	0.75	0.17	0.83	0.17
σ₂	0.72	0.15	0.65	0.13	0.65	0.12	0.67	0.13
σ₃	1.32	0.36	1.37	0.36	1.20	0.31	1.19	0.33
σ₄	0.80	0.23	0.70	0.21	0.69	0.19	0.73	0.19
τ₁₂	0.54	0.22	0.49	0.20	0.82	0.19	0.82	0.18
τ₂₃	–0.16	0.20	–0.31	0.17	–0.38	0.24	–0.04	0.15
τ₃₄	0.22	0.23	0.11	0.24	0.29	0.17	0.37	0.17
$τ_{13 \| 2}$	0.43	0.34	0.67	0.23	–	–	–	–
$τ_{24 \| 3}$	0.11	0.22	–0.03	0.24	–	–	–	–
$τ_{14 \| 23}$	–0.39	0.32	–0.36	0.49	–	–	–	–
AIC	4013.22		4010.80		4007.72		4009.36
Beta margins
π₁	0.90	0.02	0.90	0.02	0.90	0.01	0.89	0.01
π₂	0.76	0.03	0.77	0.02	0.77	0.02	0.76	0.02
π₃	0.06	0.01	0.06	0.01	0.06	0.01	0.07	0.01
π₄	0.11	0.02	0.11	0.02	0.11	0.02	0.11	0.02
γ₁	0.08	0.03	0.09	0.03	0.09	0.03	0.10	0.03
γ₂	0.09	0.03	0.09	0.02	0.08	0.02	0.09	0.02
γ₃	0.32	0.12	0.32	0.13	0.37	0.12	0.28	0.12
γ₄	0.15	0.07	0.16	0.07	0.15	0.07	0.15	0.06
τ₁₂	0.71	0.11	0.74	0.08	0.82	0.08	0.79	0.07
τ₂₃	–0.35	0.17	–0.34	0.12	–0.52	0.14	–0.23	0.10
τ₃₄	0.23	0.22	0.20	0.21	0.26	0.18	0.21	0.17
$τ_{13 \| 2}$	–0.66	0.38	–	–	–	–	–	–
$τ_{24 \| 3}$	–0.10	0.20	–	–	–	–	–	–
$τ_{14 \| 23}$	–0.02	0.57	–	–	–	–	–	–
AIC	4009.42		4005.93		4002.17		4004.92

AIC: akaike information criterion; BVN: bivariate normal.

Best fit.

In real data (in contrast with the simulated data in Section 3), the truth is unknown, so it is important to compare between the proposed and other existing approaches in terms of point estimation and variance. First, in order to reveal if the use of the proposed model is worthy, when a standard bivariate analysis (either ignoring the non-evaluable outcomes or including the non-evaluable positives and negatives as false negatives and positives, respectively) is easy, we also fit the bivariate copula mixed model¹⁰ with both beta and normal margins and different bivariate copulas. According to the likelihood principle, a bivariate copula mixed model with a Clayton and Clayton copula rotated by 180° (to model the association between the latent sensitivity and specificity) and beta margins provides the best fit for both different ad-hoc approaches to handle the non-evaluable outcomes (Table 7). It is revealed that a bivariate copula mixed model with the sensitivity and specificity on the original scale provides better fit than the BGLMM,⁷ which models the sensitivity and specificity on a transformed scale.

Table 7.

AICs, ML estimates, and standard errors (SE) of the best fitted bivariate copula and extended trivariate vine copula mixed models with beta margins for diagnostic accuracy studies of coronary computed tomography angiography.

	Bivariate				Trivariate
	Clayton^a		Clayton $180 °$ ^b		Clayton ${0 °, 90 °}$
	Est.	SE	Est.	SE	Est.	SE
π ₁	0.97	0.01	0.90	0.01	0.97	0.01
π ₂	0.85	0.02	0.77	0.02	0.85	0.02
π ₃	–	–	–	–	0.49	0.03
γ ₁	0.03	0.01	0.09	0.03	0.03	0.01
γ ₂	0.06	0.02	0.08	0.02	0.06	0.02
γ ₃	–	–	–	–	0.11	0.02
τ ₁₂	0.42	0.19	0.82	0.08	0.39	0.20
τ ₁₃	–	–	–	–	0.02	0.23
$τ_{23 \| 1}$	–	–	–	–	–0.28	0.17
AIC	244.82		321.91		492.26

AIC: akaike information criterion.

The non-evaluable outcomes are excluded.

The non-evaluable positives and negatives are included as false negatives and positives, respectively.

Then, in order to compare the proposed approach with the ones that use the MAR assumption, we fit the extended trivariate vine copula mixed model⁶ with both beta and normal margins and different pair-copulas. According to the likelihood principle, a vine copula mixed model composed of a Clayton copula to model the association between the sensitivity and specificity, a Clayton copula rotated by 90° to model both the associations between the specificity and prevalence and between the sensitivity and prevalence given the specificity, and beta margins provides the best fit (Table 7). It is revealed that an extended trivariate vine copula mixed model with the sensitivity, specificity, and disease prevalence on the original scale provides better fit than the extended TGLMM,⁵ which models the sensitivity, specificity, and disease prevalence on a transformed scale.

It has been shown that the trivariate analysis does not change the conclusions from the bivariate analysis excluding the non-evaluable outcomes. It is also apparent that the results from the quadrivariate analysis differentiate from the ones from bivariate (excluding the non-evaluable outcomes) and trivariate analyses which are fairly similar. The meta-analytic estimates of sensitivity and specificity from the latter approaches are blown, because in both approaches it is assumed that $Y_{i 11} | X_{1} = x_{1} \sim Binomial (y_{i 01} + y_{i 11}, x_{1}) and Y_{i 00} | X_{2} = x_{2} \sim Binomial (y_{i 00} + y_{i 10}, x_{2})$ i.e. their support ignores the number of non-evaluable positives $y_{i 21}$ and the number of non-evaluable negatives $y_{i 20}$ . The conclusions from the quadrivariate analysis with the latent proportions on the original scale are quite similar with the ones from the bivariate analysis where the non-evaluable positives and negatives are included as false negatives and positives, respectively. These results are consistent with the findings in the simulations in Section 3.2. Note in passing that comparing the AIC values among the quadrivariate, trivariate and bivariate copula mixed models is inconclusive as they use a different number of responses.

Although typically the focus of meta-analysis has been to derive the summary-effect estimates, there is increasing interest in drawing predictive inference. Summary receiver operating characteristic curves (SROC) can be deduced from the D-vine copula mixed model with the sensitivity and specificity on the original scale through the quantile regression techniques developed for the bivariate copula mixed model.¹⁰ SROC essentially shows the effect of different model (random effect distribution) assumptions, since it is an inference that depends on the joint distribution. An SROC curve has been deduced for the bivariate copula mixed model¹⁰ through a median regression curve of X₁ on X₂. For the copula mixed model, the model parameters (including dependence parameters), the choice of the copula, and the choice of the margin affect the shape of the SROC curve.¹⁰ However, there is no priori reason to regress X₁ on X₂ instead of the other way around, so a median regression curve of X₂ on X₁ has also been provided. Rucker and Schumacher²⁹ stated that instead of summarizing data using an SROC, it might be preferable to give confidence regions. Hence, in addition to using just median regression curves, quantile regression curves with a focus on high (q = 0.99) and low quantiles (q = 0.01), which are strongly associated with the upper and lower tail dependence imposed from each parametric family of copulas, have been proposed.¹⁰ These can been seen as confidence regions of the median regression SROC curve.

Figure 1 demonstrates the SROC curves with a confidence region and summary operating points (a pair of the model-based sensitivity and specificity; shown by the black square) from the best fitted multinomial quadrivariate D-vine copula mixed model, the best fitted trivariate vine copula mixed model, and the best fitted bivariate copula mixed models along with the study estimates (shown by the circles in Figure 1). For the upper panel graphs, the sensitivity and specificity at study i (point estimates) have been calculated with the typical definitions of sensitivity and specificity, viz. $\frac{y_{i 11}}{y_{i 01} + y_{i 11}} and \frac{y_{i 00}}{y_{i 00} + y_{i 10}}$ respectively, as only patients with positive or negative results are considered, while for the lower panel graphs, the sensitivity and specificity at study i have been calculated with the definitions of sensitivity and specificity in Simel et al.,³⁰ viz. $\frac{y_{i 11}}{y_{i + 1}} and \frac{y_{i 00}}{y_{i + 0}}$ respectively, since the number of non-evaluable positives $y_{i 21}$ contributes to the diseased population and the number of non-evaluable negatives $y_{i 20}$ contributes to the non-diseased population.

Figure 1.

SROC curves with a confidence region and summary operating points (a pair of the model-based sensitivity and specificity) from the best fitted multinomial quadrivariate D-vine, extended trivariate vine and bivariate copula mixed models along with the study estimates.

5 Discussion

Motivated by the existence of non-evaluable results in diagnostic test accuracy studies, this paper proposed a multinomial quadrivariate D-vine copula mixed model for meta-analysis of diagnostic test accuracy studies accounting for non-evaluable subjects. Our general statistical model allows for selection of pair-copulas independently among a variety of parametric copula families, i.e. there are no constraints in the choices of bivariate parametric families of copulas and can also operate on the original scale of sensitivity and specificity.

For the random effects, we have used a quadrivariate D-vine copula distribution or a truncated at level 1 quadrivariate D-vine copula (conditional independence), which allows flexible (tail) dependence.²⁸ We have proposed a numerically stable ML estimation technique based on Gauss-Legendre quadrature; the crucial step is to convert from independent to dependent quadrature points that follow a quadrivariate D-vine distribution.

In an era of evidence-based medicine, decision makers need high-quality procedures such as the one developed in this article to support decisions about whether or not to use a diagnostic test in a specific clinical situation. The multinomial quadrivariate vine-copula mixed model is not an ad-hoc² but rather a sophisticated approach that utilizes all the available data in decision making and can satisfy the intention-to-diagnose principle. Using an intention to diagnose principle, i.e. a conservative approach, ensures that both the sensitivity and specificity are not overestimated. Hence, it formally enables decision makers to be more cautious in solely relying to the overly optimistic meta-analytic estimates of sensitivity and specificity derived from the extended trivariate vine copula mixed model that indirectly accounts for the non-evaluable outcomes.

Footnotes

Acknowledgements

We would like to thank the referees for insightful comments leading to an improved presentation. The simulations presented in this paper were carried out on the High Performance Computing Cluster supported by the Research and Specialist Computing Support service at the University of East Anglia.

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research,authorship,and/or publication of this article.

Funding

The author(s) received no financial support for the research,authorship,and/or publication of this article.

ORCID iD

Aristidis K Nikoloulopoulos

Software

R functions to implement and simulate from the multinomial quadrivariate D-vine copula mixed model for meta-analysis of diagnostic tests with non-evaluable subjects are part of the R package CopulaREMADA. 31 The data and code used in Section 4 are given as data and code examples in the package,respectively.

References

Begg

Greenes

Iglewicz

The influence of uninterpretability on the assessment of diagnostic tests. J Chronic Dis 1986; 39: 575–584.

Schuetz

Schlattmann

Dewey

Use of 3 × 2 tables with an intention to diagnose approach to assess clinical performance of diagnostic tests: Meta-analytical evaluation of coronary CT angiography studies.

BMJ (Online) 2012; 345: e6717.

Jackson

Riley

White

IR.

Multivariate meta-analysis: potential and promise. Stat Med 2011; 30: 2481–2498.

Mavridis

Salanti

A practical introduction to multivariate meta-analysis.

Stat Meth Med Res 2013; 22: 133–158.

Suri

MFK

Chu

A trivariate meta-analysis of diagnostic studies accounting for prevalence and non-evaluable subjects: re-evaluation of the meta-analysis of coronary CT angiography studies.

BMC Med Res Methodol 2014; 14: 128.

Nikoloulopoulos

AK.

An extended trivariate vine copula mixed model for meta-analysis of diagnostic studies in the presence of non-evaluable outcomes. Int J Biostat 2020. DOI: 10.1515/ijb-2019-0107

Chu

Cole

SR.

Bivariate meta-analysis of sensitivity and specificity with sparse data: a generalized linear mixed model approach.

J Clin Epidemiol 2006; 59: 1331–1332.

Nikoloulopoulos

AK.

A vine copula mixed effect model for trivariate meta-analysis of diagnostic test accuracy studies accounting for disease prevalence.

Stat Meth Med Res 2017; 26: 2270–2286.

Chu

Nie

Cole

, et al. Meta-analysis of diagnostic accuracy studies accounting for disease prevalence: alternative parameterizations and model selection. Stat Med 2009; 28: 2384–2399.

10.

Nikoloulopoulos

AK.

A mixed effect model for bivariate meta-analysis of diagnostic test accuracy studies using a copula representation of the random effects distribution.

Stat Med 2015; 34: 3842–3865.

11.

Bedford

Cooke

RM.

Vines – a new graphical model for dependent random variables. Ann Stat 2002; 30: 1031–1068.

12.

Aas

Czado

Frigessi

, et al. Pair-copula constructions of multiple dependence. Insurance: Math Econ 2009; 44: 182–198.

13.

Nikoloulopoulos

Joe

Vine copulas with asymmetric tail dependence and applications to financial return data. Comput Stat Data Anal 2012; 56: 659–3673.

14.

Killiches

Czado

A D-vine copula-based model for repeated measurements extending linear mixed models with homogeneous correlation structure.

Biometrics 2018; 74: 997–1005.

15.

Nikoloulopoulos

AK.

A D-vine copula mixed model for joint meta-analysis and comparison of diagnostic tests.

Stat Meth Med Res 2019; 28: 3286–3300.

16.

Nikoloulopoulos

Joe

Factor copula models for item response data.

Psychometrika 2015; 80: 126–150.

17.

Czado

Schepsmeier

Min

Maximum likelihood estimation of mixed C-vines with application to exchange rates. Stat Model 2012; 12: 229–255.

18.

Wilson

KJ.

Specification of informative prior distributions for multinomial models using vine copulas. Bayesian Anal 2018; 13: 749–766.

19.

Nash

Compact numerical methods for computers: linear algebra and function minimisation. 2nd ed. New York: Hilger, 1990.

20.

Stroud

Secrest

Gaussian quadrature formulas. Englewood Cliffs, NJ: Prentice-Hall, 1966.

21.

Nikoloulopoulos

AK.

Hybrid copula mixed models for combining case-control and cohort studies in meta-analysis of diagnostic tests.

Stat Meth Med Res 2018; 27: 2540–2553.

22.

Nikoloulopoulos

AK.

On composite likelihood in bivariate meta-analysis of diagnostic test accuracy studies. AStA Adv Stat Anal 2018; 102: 211–227.

23.

Hult

Lindskog

Multivariate extremes, aggregation and dependence in elliptical distributions. Adv Appl Prob 2002; 34: 587–608.

24.

Genest

Frank’s family of bivariate distributions. Biometrika 1987; 74: 549–555.

25.

Genest

MacKay

The joy of copulas: bivariate distributions with uniform marginals. Am Stat 1986; 40: 280–283.

26.

Paul

Riebler

Bachmann

, et al. Bayesian bivariate meta-analysis of diagnostic test studies using integrated nested Laplace approximations. Stat Med 2010; 29: 1325–1339.

27.

Menke

Kowalski

Diagnostic accuracy and utility of coronary CT angiography with consideration of unevaluable results: a systematic review and multivariate Bayesian random-effects meta-analysis with intention to diagnose.

Eur Radiol 2016; 26: 451–458.

28.

Joe

Nikoloulopoulos

AK.

Tail dependence functions and vine copulas. J Multivar Anal 2010; 101: 252–270.

29.

Rücker

Schumacher

Letter to the editor.

Biostatistics 2009; 10: 806–807.

30.

Simel

Feussner

Delong

, et al. Intermediate, indeterminate, and uninterpretable diagnostic test results. Med Decis Mak 1987; 7: 107–114.

31.

Nikoloulopoulos

AK.

CopulaREMADA: Copula mixed models for multivariate meta-analysis of diagnostic test accuracy studies, 2019. R package version 1.3, http://CRAN.R-project.org/package=CopulaREMADA (accessed 13 March 2020).

A multinomial quadrivariate D-vine copula mixed model for meta-analysis of diagnostic studies in the presence of non-evaluable subjects

Abstract

Keywords

1 Introduction

2 The multinomial quadrivariate D-vine copula mixed model

2.1 The multinomial quadrivariate D-vine copula mixed model with normal margins

2.2 The multinomial D-vine copula mixed model with beta margins

2.3 Maximum likelihood estimation and computational details

3 Simulations

3.1 Small-sample efficiency–misspecification of the random effects distribution

3.2 Misspecification of the copula mixed model that accounts for non-evaluable outcomes

4 Meta-analysis of coronary computed tomography angiography studies

5 Discussion

Footnotes

Acknowledgements

Declaration of conflicting interests

Funding

ORCID iD

Software

References