Abstract
Keywords
1 Introduction
Binary outcomes have drawn methodologically more attention for being the most prevalent in systematic reviews1,2 and rather straightforward to handle. 3 Although less widespread in systematic reviews for being often more complex to interpret and more labour-intensive to measure compared to binary outcomes, 1 continuous outcomes play an important role in decision-making and clinical practice. Similar to binary outcomes, continuous outcomes are also prone to missing outcome data (MOD). For instance, in a systematic review on respiratory rehabilitation in chronic obstructive pulmonary disease, Ebrahim et al. 4 observed a MOD rate ranging from 0% to 38% across 31 included trials. In a collection of 190 Cochrane systematic reviews published between 2009 and 2012 in three mental health Cochrane Groups, 27 out of 140 selected meta-analyses considered a continuous primary outcome; of those, 14 meta-analyses reported the total number of MOD in each arm of every trial. 5 In another collection of systematic reviews with at least three interventions published between 2009 and 2017, 92 out of 387 systematic reviews investigated a continuous primary outcome; 6 of these, only five reported the total number of MOD in each arm of every included trial. Figure 1 illustrates the distribution of the percentage of MOD across the different health fields in both surveys.

The distribution of missing continuous outcome data (MCOD; expressed as a percentage) across the different health fields in selected network meta-analyses (left plot with split violins) and the pairwise meta-analyses (right plot with split violins) from two surveys on the reporting and handling of aggregate missing outcome data.5,6 The red violins illustrate the density of differences in the percentage of MCOD between the compared arms across trials, and the grey violins indicate the distribution of the total percentage of MCOD across trials. The red and grey points indicate the median value in each split violin. DANG, Cochrane Depression, Anxiety and Neurosis Group; DPLPG, Cochrane Developmental, Psychosocial and Learning Problems Group; SG, Cochrane Schizophrenia Group.
Currently, there are a handful of methodological articles providing guidance on how to handle aggregate missing continuous outcome data (MCOD). Ebrahim et al.4,7 developed a three-step imputation approach to address MCOD in meta-analysis. The authors used information directly from the included trials to determine the scenarios for the imputed means and standard deviations, and they applied the same scenarios to all included trials. This approach is easy to implement; however, it fails to account for the uncertainty induced by MCOD in each arm of every trial as it merely imputes the missing observations before analysis. Depending on the amount and mechanisms of MCOD, this approach may seriously implicate the credibility of conclusions. In the case of considerable MCOD (e.g. over 20%), imputation provides spuriously more precise treatment effects 8 and increases the risk of false-positive conclusions.
Mavridis et al. 9 refined the two-stage pattern-mixture model, initially proposed by White et al. 10 for binary MOD, to operate for MCOD and applied it to a published systematic review on mental health. Their approach incorporates an informative missingness parameter (IMP) that reflects researchers’ belief(s) about the missingness mechanism for each arm of every trial with the premise to adjust the within-trial results for MCOD. We distinguish this sophisticated approach for being both conceptually and statistically more appropriate to handle MCOD as it models rather than imputes the missing observations before analysis and therefore, it fully accounts for the uncertainty induced by MOD. 9 However, the proposed two-stage approach has a shortcoming: it sets the within-trial treatment effects and standard errors fixed to the mean and variance of the missingness parameter’s distribution. Therefore, this approach does not allow the observed data to contribute to the estimation of the missingness parameter – while borrowing strength across the trials – to ‘learn’ about the missingness mechanisms. 11 In their article, the authors study a few modelling options concerning the structure of the IMP, thus potentially overlooking alternatives that would allow the researcher to investigate in more detail the implications of different structures on the conclusions.
Occasioned by the limitations of the two-stage approach by Mavridis et al., 9 the present study aims to provide a one-stage pattern-mixture model approach under the Bayesian framework for MCOD to gain knowledge with respect to the missingness mechanisms across different interventions and trials in a network. A one-stage model approach allows for an eloquent synthesis of the data in a single step, and it allows us to learn about the missingness process via the estimation of the missingness parameter, thus offering an immediate advantage over the two-stage pattern-mixture model approach.
The article has the following structure. In Section 2, we introduce two published systematic reviews with network meta-analysis (NMA) with different amount of MCOD; an NMA of treatment options for Parkinson’s disease with a considerable amount of MCOD in many trials (>20% MCOD) and an NMA of physical activities for type-2 diabetes patients with a moderate amount of MCOD. In Section 3, we describe the one-stage pattern-mixture model and the missingness parameters with different structures for their prior distribution. In Section 4, we apply our method to the motivating examples. We conclude with a discussion in Section 5 and brief recommendations in Section 6.
2 Motivating examples
We consider two motivating examples: (a) the network of Stowe et al. 12 investigating antiparkinsonian drugs by measuring the change from baseline of patient off-time reduction (Table 1) and (b) the network of Schwingshackl et al. 13 assessing the effect of different training modalities on HbA1c for patients with type 2 diabetes (Table 2). In both examples, negative values of mean difference (MD), and standardised mean difference (SMD) and positive values of the ratio of means (RoM) in the logarithmic scale – the most commonly used effect sizes in the synthesis of continuous outcomes – favour the first intervention in the comparison. The rationale of the choice of these networks heavily depends on the amount of missingness, quantified as the percentage of MCOD (%MCOD) from our broader collection of five networks (Figure 2; Table S1). The network of Stowe et al. 12 has a considerable %MCOD (>20%) in many trials, whereas the network of Schwingshackl et al. 13 suffers from a moderate amount of MCOD.
Trials on four different antiparkinsonian interventions (Stowe et al. 12 )a
A: Placebo plus levodopa; B: dopamine agonist plus levodopa; C: catechol-O-methyl transferase inhibitors plus levodopa; D: monoamine oxidase type B inhibitors plus levodopa; MCOD: missing continuous outcome data.
aOf the total 33 two-arm trials, we excluded three trials for reporting opposite signs in the mean change from baseline in the compared arms (log RoM cannot be calculated for these trials), and one trial due to inaccuracies in the available information regarding MCOD (Figure 1 in Stowe et al. 12 ).
bGreen indicates a low risk of attrition bias (≤5%), red indicates a substantial risk of attrition bias (>20%), and orange indicates a moderate risk of attrition bias.
Trials on three different training modalities (Schwingshackl et al. 13 )
A: aerobic; B: resistance; C: combined training; MCOD: missing continuous outcome data.
aThe number of randomised participants was obtained from the corresponding published reports, and then we calculated the missing outcome data in each trial.
bGreen implies a low risk of attrition bias (≤5%), red indicates a substantial risk of attrition bias (>20%), and orange implies a moderate risk of attrition bias.
cCalculated as the difference between the arms with the maximum and minimum percentage of missing outcome data.

Networks on a primary continuous outcome with extractable missing continuous outcome data (MCOD) from five published systematic reviews. The size of the node is proportional to the number of observed treatment comparisons that include that node. The thickness of the edge is proportional to the number of trials that investigated that comparison. A low, moderate and large amount of percentage of total MCOD (%MCOD) is represented in each node and edge with green (%MCOD
In more detail, in the network of Stowe et al.,
12
10 trials (34%) have a considerable %MCOD (range: 28–56), while only six trials (21%) are MCOD-free (Table 1). Of the four treatments, placebo plus levodopa (placebo+LD) has the lowest %MCOD across trials, whereas catechol-O-methyl transferase inhibitors plus levodopa (COMTI+LD) has the largest %MCOD (%MCOD median: 35, interquartile range (IQR): 0–51). The network of Schwingshackl et al.
13
has a moderate amount of MCOD compared to Stowe et al.
12
as half of the trials are MCOD-free and the remaining half has moderate to considerable %MCOD (range: 8–31; Table 2). Among the three competing interventions, resistance is the most ‘problematic’ (%MCOD median: 2, IQR: 0–15) followed by aerobic (%MCOD median: 2, IQR: 0–7). Moreover, the imbalance of %MCOD within the trials is less profound in Schwingshackl et al.
13
as the majority of trials has a lack of or low imbalance in %MCOD (
3 Methods
3.1 Notation
Consider a series of
By convention, we assume
In the presence of MCOD, we do not have information on
and
3.2 One-stage pattern-mixture model
In each trial, we are interested in estimating the unknown parameter
3.2.1 Informative missingness parameters
In order to quantify the missingness process, appropriate missingness parameters have been proposed. Following Mavridis et al., 9 we consider the following IMPs for MCOD:
Informative missingness difference of means (IMDoM)
The IMDoM is defined as the difference between the mean outcome among missing participants and the mean outcome among the completers
The
Informative missingness ratio of means (IMRoM)
The IMRoM is defined as the ratio of the mean outcome among the missing participants to the mean outcome among the completers
By replacing
Then, we can assign a normal prior distribution on
3.2.2 Structural assumptions for the missingness parameters
There are various options of increasing flexibility with respect to how the IMPs across trials and arms can be structured.11,14 We focus on the following:
common-within-network; the IMPs are assumed to be the same in the whole network, and only one parameter is estimated per network; trial-specific; the IMPs are different across trials but assumed to be the same in the compared arms resulting in down-weighting trials with unbalanced MCOD;10 intervention-specific; allowing the IMP to be different across interventions but shared across trials, thus resulting in down-weighting trials with higher total MCOD.
10
The IMPs can be further assumed to be identical, hierarchical or independent, which corresponds to hypothesising that these parameters are constant, exchangeable or different, respectively. In this work, the latter is assumed to be either uncorrelated or correlated across the arms of every trial with correlation
Assumptions and prior structure of the missingness parameters
3.3 Effect measures for a continuous outcome
3.3.1 Mean difference
We use an identity function to link
3.3.2 Standardised mean difference
We use the following link function
However, in the presence of MCOD, we have no information on
Ratio of (arithmetic) means
For the RoM, the link function is the following
3.4 Bayesian random-effects network meta-analysis model
For all the aforementioned effect measures, we assume
where
We use the surface under the cumulative ranking curve (SUCRA) to order the interventions from the best to the worst.
20
For the proposed one-stage pattern-mixture models, we use the normal likelihood with known variance for each arm of every trial (example 5 in the Appendix of Dias et al.
21
). For the location parameters
4 Application of the models
We apply the proposed models to the networks of Stowe et al.
12
and Schwingshackl et al.
13
We focus on MD and SMD for being the most prevalent effect measures for synthesising a continuous outcome. We present the results on log RoM for the primary and sensitivity analyses in the Supporting Information (Figure S1 –S5; Tables S2, and S4). Figure 3 depicts the posterior mean and 95% credible intervals (CrI) of all models for the comparisons with the reference intervention (i.e. placebo+LD in Stowe et al.,
12
and aerobic in Schwingshackl et al.,
13
respectively), and the posterior median and 95% CrI of

Interval plots of the posterior distribution of MD and SMD for comparisons with the reference intervention of Stowe et al.
12
(panel A)) and Schwingshackl et al.
13
(panel B)), and the posterior distribution of
4.1 Network meta-analysis results
4.1.1 Network with considerable MCOD (antiparkinsonian drugs)
The results obtained for MD and SMD analyses are very similar, following the same pattern, while on a different scale (Figure 3(A)). The posterior means for the identical and hierarchical structures are almost identical (after rounding to the second decimal) for the common-within-network, trial-specific and intervention-specific assumptions. The same conclusion can be drawn for the correlated and uncorrelated assumptions, whose estimates are found to be very similar to each other.
The intervention-specific assumption results in a slightly larger patient off-time reduction for dopamine agonist plus levodopa (DA+LD) versus placebo+LD as compared with the competing assumptions whose point estimates are identical or very similar to ACA (posterior mean of MD: −1.49 vs. −1.44). In this comparison, the results are almost interchangeable across the different assumptions because most of the included trials have low or moderate %MCOD (<10%) that is quite balanced in the compared arms as opposed to the other two comparisons.
The results for COMTI+LD versus placebo+LD are more variable across the assumptions, as expected because 6 out of 11 trials that were included suffer from considerable %MCOD that are unbalanced in the compared arms. All assumptions consistently lead to lower patient off-time reduction when compared with ACA, similarly in MD and SMD: the posterior mean is 14–30% lower than ACA across the assumptions. Among the different assumptions, the intervention-specific assumption leads to the lowest patient off-time reduction, whereas both assumptions under the independent structure yield the largest patient off-time reduction.
For the comparison of monoamine oxidase type B inhibitors plus levodopa (MAOBI+LD) versus placebo+LD, the competing assumptions yield almost interchangeable posterior means (MD ranging from −0.83 to −0.81, SMD from −0.37 to −0.36) that are close to ACA (MD: −0.83, SMD: −0.36), except for the common-within-network assumption that leads to a lower patient off-time reduction on average. Contrary to the other comparisons, the 95% CrIs are wider since two trials only inform this comparison, one of which has 35% MCOD (Table 1).
All assumptions yield a lower posterior median of

Barplots of the SUCRA values under MD, and SMD for the network of Stowe et al. 12 (panel A)) and the network of Schwingshackl et al. 13 (panel B)). The one-stage pattern-mixture model under the hierarchical, identical, and independent structure of IMDoM under the common-within-network, trial-specific, intervention-specific, within-trial correlated and uncorrelated assumptions. Results under ACA are also presented. ACA: available case analysis; COMTI+LD: catechol-O-methyl transferase inhibitors plus levodopa; MAOBI+LD: monoamine oxidase type B inhibitors plus levodopa; SUCRA: surface under the cumulative ranking.
By increasing the variance of the prior distribution for IMDoM at 32, and therefore our uncertainty on our belief about the MAR assumption, the results follow the same pattern with the primary analysis (only results on MD are shown), but the 95% CrIs are now wider overall (Figure S4A in the Supporting Information).
4.1.2 Network with moderate MCOD (different training modalities)
In line with the previous example, MD and SMD provide very similar results in terms of pattern, but on a different scale (Figure 3(B)). The point estimates are almost identical under the hierarchical and identical structures for the common-within-network, trial-specific and intervention-specific assumptions, and similar for the correlated and uncorrelated assumptions under the independent structure. The intervention-specific assumption leads to a slightly large reduction in HbA1c in favour of resistance, but the evidence is weak (the 95% Crl includes zero). All other assumptions give similar results with each other and with ACA strongly favouring resistance. Furthermore, under the intervention-specific assumption, the comparison of combined training versus aerobic results in lowering the reduction in HbA1c by approximately 37% compared with the competing assumptions. In contrast, all other assumptions result in similar findings with each other and with ACA but slightly more precise than ACA. Except for two trials with considerable %MCOD, the low to moderate %MCOD across and within the trials may explain the low variability of the results across the assumptions for each comparison with aerobic.
All assumptions lead to a lower posterior median and wider 95% CrI of
According to SUCRA values, combined training and resistance are consistently the best and the worst interventions across all assumptions, respectively, except for the intervention-specific assumption that leads to overlapping 95% CrIs (Figure 4(b)). The intervention-specific assumption yields wider CrI for comparisons with aerobic, which can explain the wide 95% CrI for SUCRA as well.
In line with the previous network, increasing the prior variance of the prior distribution of IMDoM at 32 leads to results of the same pattern with the primary analysis (only results on MD are shown), but with less precise estimates overall (Figure S4A in the Supporting Information).
4.2 Learning about the missingness mechanism
The posterior distribution of IMDoM under the common-within-network and intervention-specific assumptions is given in Table 4 for both network examples. A posterior mean away from zero is an indication that the MAR assumption may not be plausible; that is, the missingness process may be informative. Similarly, a 95% CrI excluding zero and protruding from the interval of the prior distribution of IMDoM is a strong indication of informative missingness; otherwise, the data provide little information to conclude for or against the MAR assumption. For the remaining structural assumptions of IMDoM, the posterior distribution for the most ‘problematic’ studies (i.e. studies with a considerable amount of MCOD) is shown in Figure 5 for the network of Stowe et al. 12 The vertical lines in each plot refer to the prior mean (middle grey line) and 95% prior interval (both sides dashed lines) under the MAR assumption on average.
Posterior mean and 95% CrI of IMDoM under different structural assumptions.a
COMTI+LD: catechol-O-methyl transferase inhibitors plus levodopa; CrI: credible interval; DA+LD: dopamine agonist plus levodopa; IMDoM: informative missingness difference of means; MAOBI+LD: monoamine oxidase type B inhibitors plus levodopa; PBO+LD: placebo plus levodopa.
aMean difference with informative missingness difference of means.

Interval plots of the posterior distribution of IMDoM using MD in the network of Stowe et al. 12 The one-stage pattern-mixture model under the hierarchical and identical structure assuming trial-specific IMDoMs and under the independent structure assuming within-trial correlated and uncorrelated IMDoMs. The vertical lines indicate the prior distribution for IMDoM. Only the trials with considerable participant losses are presented. COMTI+LD: catechol-O-methyl transferase inhibitors plus levodopa; CrI: credible interval; DA+LD: dopamine agonist plus levodopa; IMDoM; informative missingness difference of means; MAOBI+LD: monoamine oxidase type B inhibitors plus levodopa; PBO+LD: placebo plus levodopa.
4.2.1 Network with considerable MCOD (antiparkinsonian drugs)
The results under the common-within-network assumption suggest likely informative missingness as point estimates are positive (hierarchical: 0.99, identical: 1.02) and their corresponding CrIs do not include zero. However, the 95% CrIs do not protrude the upper bound of the prior interval (Table 4). The same conclusions are drawn when we assume a larger prior variance of IMDoM; though the posterior mean is larger (hierarchical: 1.17, identical: 1.20) and the 95% CrI are wider when compared to the primary analysis, as expected (Table S3). However, the common-within-network assumption does not reveal the source(s) of such informative missingness. On the contrary, the intervention-specific assumption indicates that for all interventions, except COMTI+LD, the data provide little information to conclude in favour of or against the MAR assumption as the 95% CrIs of the corresponding posterior distributions include zero (Table 4). Only the posterior mean and 95% CrI for COMTI+LD are far from zero suggesting informative missingness. The conclusions do not change when we assume a larger prior variance of IMDoM (Table S3). This is not surprising because COMTI+LD has attracted the highest %MCOD in the network (Table 1). COMTI+LD may also be responsible for pulling the posterior distribution of IMDoM away from zero under the common-within-network assumption. Conclusions are the same in both structures, though the identical structure provides more precise results than the hierarchical structure.
The trial-specific assumption further indicates that in trials with total %MCOD above 20% and severe imbalance in the compared arms (i.e. the trials that compare COMTI+LD with placebo+LD; Table 1), missing participants may have a smaller patient off-time reduction on average than completers, though the indication is weak (Figure 5). For these trials, the assumption of within-trial correlated IMDoMs provides further insights on the missingness process as it reveals that this behaviour is more profound in the active arm (i.e. COMTI+LD) than in the placebo arm (Figure 5). However, the assumption of within-trial uncorrelated IMDoMs indicates that only missing participants receiving COMTI+LD in these trials appear to have a smaller patient off-time reduction on average than completers (Figure 5). It is evident that by using different assumptions about the missingness parameter, we gain a different level of knowledge regarding the missingness mechanisms in the network.
4.2.2 Network with moderate MCOD (different training modalities)
The data provide little information to conclude for or against the MAR assumption using the common-within-network assumption given the wide CrIs of both hierarchical and identical structures that include zero (Table 4). This finding is shared in the results under the intervention-specific and trial-specific assumptions where for all treatments and trials, the intervals of the posterior distribution of IMDoM are considerably wide and include zero (last three lines of Table 4; Figure 6). The conclusions are similar for both assumptions under the independent structure (Figure 6). Contrary to the network of antiparkinsonian drugs, the trials of this network suffer mildly from MCOD, which are balanced in the compared arms. Therefore, there is not enough information to conclude for or against the MAR assumption using different assumptions for the missingness parameter.

Interval plots of the posterior distribution of IMDoM using MD in the network of Schwingshackl et al. 13 The one-stage pattern-mixture model under the hierarchical and identical structure assuming trial-specific IMDoMs and under the independent structure assuming within-trial correlated and uncorrelated IMDoMs. The vertical lines indicate the prior distribution for IMDoM. CrI: credible interval; IMDoM: informative missingness difference of means.
5 Discussion
We have proposed a one-stage pattern-mixture model approach under the Bayesian framework that accounts for MCOD from all trials in a single step while allowing the observed data to contribute to the estimation of the missingness parameters to learn about the missingness mechanism. The hierarchical structure of the proposed models facilitates the incorporation of various prior structures and assumptions about the missingness parameter to investigate the implications of MCOD on the conclusions. These features make the proposed model approach particularly attractive to handle aggregate MCOD properly. On the contrary, the two-stage approach does not offer enough flexibility in the analysis of MCOD as it requires strong assumptions about the estimated within-trial variance (considered known) and the missingness parameter (considered independent of the amount of MOD). 9
Due to variation in the amount of MCOD within and across the trials, especially, in the network of Stowe et al. 12 (Table 1), we consider the independent structure for the missingness parameters to be the most plausible, and the common-within-network assumption to be the least plausible in our motivating examples. The intervention-specific assumption is particularly relevant if one is interested to learn about the missingness mechanism in each or specific intervention(s). Common-within-network is a strong and perhaps the least realistic assumption. In a typical network with different trial-designs (e.g. placebo-controlled and active-controlled) and different interventions (e.g. placebo, active and old interventions), the amount of and reasons for MOD may vary across all arms and trials making the common-within-network assumption implausible. The trial-specific assumption is plausible when trials with different characteristics (in terms of design and conduct) are associated with different mechanisms of MOD.27–29 For further discussion on the situations to consider the different prior structures and assumptions for the missingness parameter (Table 3), the interested readers can refer to the literature.11,14,29
There is already a simulation study on the comparison of different one-stage pattern-mixture models for aggregate binary MOD. 29 This study considered the identical and hierarchical structures for the common-within-network, intervention-specific and trial-specific assumptions for the missingness parameter. In the present study on continuous outcomes, we observed the same behaviour of the proposed methods with the one from the simulation study. For instance, the intervention-specific prior structure led to larger posterior standard deviation of the treatment effects (similarly for the hierarchical and identical assumption) as compared to the other structures, especially, for a large amount of MCOD in the network (Figure 3). We expect the one-stage pattern-mixture model for continuous MOD to behave similarly to the one-stage pattern-mixture model for binary MOD for the different structural assumptions of the missingness parameter.
Furthermore, modelling MCOD appeared to have explained part of the between-trial variance in both networks, and particularly, under the intervention-specific assumption which yielded the smallest
We illustrated the proposed one-stage pattern-mixture model approach using three different effect measures for the continuous outcome. In line with Mavridis et al., 9 we apply MD and SMD in conjunction with the IMDoM, and log RoM together with the log IMRoM, because they are intuitively related. To select among these three effect measures for a specific outcome, the researcher should consider the trade-off among the ease of interpretation, statistical properties (e.g. low variability of the treatment effect across the trials 31 ) and the goodness of fit. According to the posterior mean of residual deviance, we found that using the log RoM, none of the models fit the data adequately for the network of Stowe et al. 12 (Table S5 in the Supporting Information). In the network of Cipriani et al., 32 none of the models fit the data adequately for MD and SMD (Table S5 in the Supporting Information). For a discussion on the statistical properties and performance of these effect measures in the synthesis of trials, the readers should refer to the relevant literature.33–35
The proposed one-stage pattern-mixture model approach can be particularly useful in living systematic reviews,
36
where learning about the missingness process via the estimated missingness parameters can inform the design of a future randomised trial. For instance, assuming that Stowe et al.
12
was a living systematic review, we have learned that trials comparing COMTI+LD with placebo+LD have the most participant losses (Table 1) and missing participants randomised in COMTI+LD tend to have smaller patient off-time reduction as compared to completers in that arm (according to the intervention-specific assumption and the independent structure). Moreover, scrutiny of the COMTI+LD versus placebo+LD trials in the network using the Cochrane Risk of Bias tool will further shed light on why participant losses are almost negligible in some trials (
We were able to extract MOD in each arm of every trial in five networks only (out of the 92 networks with a continuous primary outcome). Therefore, an empirical study using this dataset would not offer sufficient evidence to compare the one-stage with the two-stage pattern-mixture model empirically. It is particularly challenging to collect a sufficient number of networks (or meta-analyses) to conduct an empirical study concerning MOD as the ability to extract MOD, and the quality of the extraction strongly depends on the reporting quality of the systematic reviews under investigation. 38 However, a recent simulation study investigated the performance of modelling the exact distribution (one-stage approach) versus the approximately normal distribution (two-stage approach) of aggregate binary data in the presence of MOD in a triangle of two-arm trials (submitted). Both approaches showed substantial bias overall in the relative treatment effects, especially, for comparisons with the non-reference interventions of the network, when the amount of MOD was considerable. However, the one-stage approach is both conceptually and statistically more appropriate than the two-stage approach when the approximate normality assumption cannot be defended; for instance, the outcome is skewed and/or trials have small size. 39 Since the proposed one-stage pattern-mixture model approach is based on the normal distribution (as the exact likelihood), we would expect some bias in the treatment effects when the synthesis dataset is dominated by small trials with a skewed outcome. In these situations, we recommend that until a more competent one-stage model is developed, the researchers apply our proposed one-stage PM approach to handle MCOD, and they fully acknowledge the limitations of this approach when they discuss the results.
6 Conclusions
Similar to binary MOD, a proper analysis of MCOD should be discussed thoroughly already in the protocol phase of the systematic review. The analysis plan should comprise the description of the one-stage model with respect to the proper assumptions and prior structure of the missingness parameter. For this purpose, prior knowledge (or expectations) of the missingness process that aligns with the interventions and the design of the trials on the condition of interest is necessary. For instance, inpatients randomised to placebo are more likely to leave the trial early for not experiencing immediate improvement in their schizophrenia symptoms as compared to participants randomised to antipsychotics. 27 In the absence of such knowledge, we recommend that the researchers consider the intervention-specific assumption alongside the hierarchical structure under the MAR assumption (with liberal uncertainty about that belief) in the primary analysis and investigate the robustness of the results under the independent structure while increasing the variance of the prior distribution of the missingness parameter, as a sensitivity analysis. In a network with few comparisons, which are informed by a handful of trials, the identical structure may be advantageous to the independent structure for estimating comparatively fewer missingness parameters. Then, the hierarchical structure alongside the intervention-specific assumption may be considered as a sensitivity analysis. The proposed models can also be applied in a pairwise meta-analysis straightforward to benefit from the variety of structural assumptions about the missingness parameter.
Supplemental Material
sj-pdf-1-smm-10.1177_0962280220983544 - Supplemental material for Continuous(ly) missing outcome data in network meta-analysis: A one-stage pattern-mixture model approach
Supplemental material, sj-pdf-1-smm-10.1177_0962280220983544 for Continuous(ly) missing outcome data in network meta-analysis: A one-stage pattern-mixture model approach by Loukia M Spineli, Chrysostomos Kalyvas and Katerina Papadimitropoulou in Statistical Methods in Medical Research
Footnotes
Acknowledgements
Data Availability
Declaration of conflicting interests
Funding
Supplemental Material
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
