Abstract
Keywords
Introduction
Increasingly often, researchers have access to large databases with electronic healthcare records, providing information on patient characteristics, hospital admission, treatment procedures and clinical outcomes. Although these databases with real-world data (RWD) offer new opportunities to study the effectiveness of medical interventions, exposure and outcome variables are often unavailable for key study dates.1–4 Missing outcome data is particularly common in registries, where the measurement interval of outcome variables is irregular and tends to vary across individuals due to a lack of formalized data recording processes. The analysis of RWD therefore often requires censoring individuals with incomplete outcome data, which is problematic when outcome variables are incomplete for most individuals and may lead to bias when the censoring mechanism is informative. 5
As an alternative to censoring, it is possible to replace missing outcomes for key study dates with a simple average or by a neighboring observation. Single imputation methods for this purpose are commonly applied in clinical trials, 6 but may lead to biased treatment effect estimates and inflated type-I errors.7,8 Over the last few decades, more advanced imputation methods have been proposed that involve estimation of the (multivariate) data distribution. 9 We here propose an extension of multilevel multiple imputation (MLMI) that can be used to recover the entire trajectory of a longitudinal variable. Motivated by challenges in comparative effectiveness research in multiple sclerosis (MS), this method is intended for situations where longitudinal data are used to define survival outcomes and are affected by irregular and informative patterns of missingness. We conduct a simulation study to compare the proposed method to existing imputation approaches. We also illustrate the imputation method in a case study comparing dimethyl fumarate (DMF) to fingolimod for treating patients with relapsing-remitting MS (RRMS).
Motivating example
MS is a chronic progressive disorder that affects approximately 2.8 million people worldwide. Several disease-modifying therapies (DMTs) are available for RRMS, the most common disease course in which patients experience episodes of disease activity, called relapses. In MS, RWD, such as MS registries, typically gather longitudinal information on patients where data are collected when the patient comes to the healthcare center for a clinical visit. Accumulating RWD on DMT usage in clinical practice opened the ground for comparative effectiveness research.
The comparative effectiveness of two DMTs is typically demonstrated using time to confirmed disability progression (CDP) as the primary or secondary outcome. Disability is characterized by the Expanded Disability Status Scale (EDSS) score, an ordinal measure taking values between 0 (no disability) and 10 (death). Disability progression is defined as an increase (e.g., 1-point) in EDSS score from an initial (e.g., baseline) EDSS measurement, provided that the increase is confirmed at a follow-up visit 3 (or 6) months later. Time to CDP is then derived as the time from baseline until a confirmed increase in EDSS score. Therefore, the definition of time to CDP depends on the ability to capture and confirm a progression given that the underlying process, EDSS score, is not measured continuously but rather only available at certain time points after baseline.
The definition of time to CDP is suitable when data are collected according to a standardized follow-up schedule but difficult to apply when the schedule for data collection is irregular. In clinical trials, follow-up visits are planned to occur every 3 months such that a disability progression observed at any given visit can always be confirmed 3 months later as per the planned schedule. a In this context of standardized follow-up schedule, the ability to capture a disability progression and to confirm it is the same across patients. On the contrary, RWD data are collected when the individual comes to the healthcare center for a visit such that the timing of visits varies across and within individuals.
The ability to capture a disability progression is compromised in RWD. For example, consider two patients, one visiting his treating physician every 3 months and the other once a year. Capturing a progression for the former patient is easier as more instances of EDSS are measured compared to the latter patient. Moreover, confirming a progression is not straightforward when patients do not have a visit at the 3-month confirmation mark after their initial increase in EDSS score.
Methods
Notation
We consider a multicenter study where patients attend visits (e.g., clinical visit to the center) over time. Let
Figure 1(a) shows irregular visits for five fictive patients superimposed on a regular schedule with visits every 3 months (the center subscript was dropped for clarity). All patients have an initial (baseline) visit

Examples of irregular visit schedules (a) and imputations on a 3-month grid using last observation carried forward (b) and rounding (c) for 5 fictive subjects. Squares and triangles represent observed and imputed outcomes, respectively.
Typically used methods convert the irregular visits in Figure 1(a) to a standardized schedule with equally spaced visits that allows recovering time to CDP. We describe two single imputation methods commonly used in MS: LOCF and rounding.
LOCF is one of the simplest and most common methods to account for missing data in a longitudinal analysis of repeated measures over time. It involves replacing missing patient outcomes with their most recent observation. That is, when no visit is available for time
A natural extension of LOCF is to replace missing patient outcomes at a given visit by their nearest observation in time. This method is called rounding. In contrast to LOCF, missing values for
Finally, a third strategy is to combine LOCF and rounding, and to replace missing patient outcomes with the average of the observed outcomes closest in time before the missing visit and after the missing visit (henceforth AVG).
We use a toy example to illustrate how rounding can lead to misleading results when comparing two treatments on their ability to slow disability progression in the MS context. Consider data measured on two individuals receiving either treatment A or B (see Figure 2). For simplicity, assume that both individuals have similar prognosis and were equally likely to receive one treatment or the other such that any difference in disability progression is attributable to the treatment only. Further assume that both treatments are equivalent in terms of their ability to slow disease progression such that the two individuals have the same underlying EDSS trajectory (solid gray lines), with CDP occurring 6 months after treatment initiation. However, the two individuals follow irregular visit schedules such that their EDSS trajectories are observed at different times (black squares). With rounding, the observed EDSS scores are mapped to a standardized 3-month grid (blue triangles). This leads to both individuals having similar rounded EDSS trajectories (dashed blue lines) except at 6 months when the individual on treatment A is attributed an EDSS value of 2 (using the observed EDSS at 7 months) while the one on treatment B is attributed a value of 1 (based on the observed EDSS at 4.2 months). Consequently, the time to CDP is 6 months on treatment A compared to 9 months on treatment B, mistakenly suggesting that treatment B slows disability progression. In this toy example, the difference in observed time to CDP is entirely due to differences in visit schedules between individuals on treatment A and B.

Expanded Disability Status Scale (EDSS) underlying trajectories (solid lines) and observed trajectories (dashed lines) for two patients on treatments A and B. The black squares are EDSS scores observed on an irregular visit schedule which differs between the two patients. The blue triangles are the observed EDSS mapped to a regular 3-month grid. mo: months.
As an alternative to aforementioned single imputation methods, it is possible to adopt multiple imputation methods that better account for uncertainty in imputed values. For example, when dealing with time-to-event outcomes such as CDP, substantive-model-compatible multiple imputations could be used to recover missing values. 10 Unfortunately, because disease progression can rarely be confirmed in RWD, the lack of outcome events severely hampers the implementation of the substantive (survival) model. It, therefore, seems more promising to adopt imputation methods that recover the longitudinal EDSS outcomes conditional on individual characteristics. Huque et al. 9 provide a comprehensive overview of multiple imputation methods for this purpose. These methods consider repeated measurements of time-dependent variables as distinct variables or impute each incomplete time-dependent variable separately using hierarchical models.11–13 The former approach can only be used when data are collected at fixed time intervals and is therefore not suitable to recover CDP outcomes. Although hierarchical models seem more promising for imputing irregularly spaced data, existing methods have several limitations.
First of all, CDP outcomes are defined using repeated measurements of the EDSS score and require considering their entire sequence. Existing imputation methods typically assume that repeated observations are independent once clustering has been taken into account. In practice, more complex correlation structures are needed to allow for dependencies between repeated observations. A second limitation of existing multilevel imputation methods is that they usually allow for only one level of clustering. It is not uncommon that longitudinal RWD is collected from multiple practices, hospitals, or health care centers, and therefore requires the implementation of three-level imputation models. 14 However, such methods are not always available in mainstream statistical software. Finally, and perhaps most importantly, imputation methods typically assume that data are missing at random. This assumption is extremely unrealistic for EDSS outcomes because patient visits to the clinic are often determined by changes in patient characteristics, physician's preference, or treatment status.2,15 For this reason, we consider an extension to multilevel imputation methods that can accommodate for aforementioned limitations.
Similar to existing multilevel imputation methods, we propose to estimate the outcome trajectories of all individuals by adopting multilevel (hierarchical) models.16,17 These longitudinal methods have several advantages. First, they account for the multiple repeated measures nested within each individual and within each cluster (e.g., healthcare institution). Second, they do not require observations at rigidly fixed intervals (as is rarely the case in clinical practice). Third, they allow accounting for important covariates, such as confounders, prognostic factors, or predictors of treatment effect. Our imputation model adopts the following equation to describe the longitudinal outcome (
In the context of MS and CDP,
The residuals
When visit times do not depend on unobserved covariates that also affect
We, therefore, develop a new class of one-stage imputation methods that can be used to impute longitudinal trajectories in large data sets with clustering. We build upon the theory of multivariate imputation by chained equations (MICE), and adopt an (approximate) Bayesian approach for drawing parameters in the imputation model.21,22
We define the vector of observable outcomes for individual
First, we estimate the model parameters
Hierarchical imputation of missing responses
After drawing
Note that equation (3) yields imputed values that are continuous, and therefore do not necessarily reflect realistic EDSS scores. One possible solution is to round individual draws for
We adopt the following PMM algorithm to generate more realistic imputations for the unobserved EDSS scores Draw Calculate Calculate the vector For each Randomly sample one value from (
In case multiple imputations are required, repeat steps 1–5, each time saving the completed data set. The resulting imputation algorithm has been implemented in R and is freely available from Github.
Simulation study
We conduct an extensive simulation study to compare the three approaches for dealing with informative missing outcome data. We consider a multicenter study where individuals are treated with one of two DMTs. The outcome is time to CDP defined as a function of the EDSS score recorded at each visit. To mimic the irregular visit schedules in clinical practice, visits were deleted according to an informative missingness procedure. That is, for some scenarios, we consider that EDSS scores are missing not at random.
Data-generating mechanism
We consider 20 clinical centers with 500 individuals each. Individuals receive DMT A or DMT B according to a nonrandomized treatment allocation and are followed for a maximum of 60 months. Data were generated according to an existing observational cohort of individuals with RRMS in clinical practice settings in Europe, Australia, Argentina, and Canada, 25 such that the baseline EDSS scores have a mean value of 3.45 and a standard deviation of 1.64 (Supplemental Information A2.1). The EDSS score for each individual is observable on a monthly basis and defined by an individual-specific intercept (a random effect to personalize disease severity at treatment start), a center-specific intercept (a random effect to introduce between-center heterogeneity in baseline disease severity), a time effect (to mimic an annual EDSS increase of 0.168), 26 a treatment effect, and an effect of age at treatment start. We consider three types of treatment effect: (a) Treatment does not affect disease progression, (2) DMT B decreases annual EDSS progression from 0.168 to 0.084 (moderate treatment effect), and (3) DMT B stops annual EDSS progression (strong treatment effect). For each individual, we generated 61 EDSS scores that reflect hypothetical observations in monthly intervals. Subsequently, each score is rounded to the nearest half-integer and truncated between 0 and 9.5. Note that this transformation implies that EDSS scores are no longer normally distributed, which may affect the validity of MLMI. Details of the data-generating mechanism are reported in the Supplemental Information.
We subsequently generated irregular visit patterns by randomly setting some of the 61 generated EDSS scores as missing, except for the baseline EDSS score, which was always observed. To this purpose, we consider six patterns of missingness where the visit probability for each patient varies according to their MS center, received treatment and/or unobserved EDSS scores (Table 1). The latter scenario mimics situations where EDSS scores are missing not at random. Technical details of each visit pattern and an illustration of them are provided in the Supplemental Information.
Data analysis and evaluation
Simulated scenarios.
Simulated scenarios.
EDSS: Expanded Disability Status Scale; DMT: disease-modifying therapy.
None: beta = 0, moderate: beta = −0.007, strong: beta = −0.014.
Individuals receiving DMT A have a visit every 6 months, whereas individuals receiving DMT B have a visit every 9 months.
Individuals receiving DMT A have a visit every 3 months, whereas individuals receiving DMT B have a visit every 9 months.
After imputation, the longitudinal EDSS scores were converted into time to CDP, where CDP is defined as a
We compare the three imputation methods in terms of three performance criteria across the 500 simulation runs: (a) the root mean squared error (RMSE) of the imputed EDSS scores across all visits, (b) the estimated hazard ratios for treatment in terms of average bias and RMSE, and (c) the coverage of estimated hazard ratios. An overview of all scenarios is summarized in Table 1.
Simulation study results for all 18*500 generated datasets with 10,000 individuals each are depicted in the Supplemental Information (A3). Figure S3 indicates that the RMSE of imputed EDSS scores is highest when adopting LOCF, and lowest when adopting MLMI-PMM, regardless of the true underlying treatment effect. For example, in the scenarios where the visit pattern depends on center only (visit pattern 1), the RMSE decreases from 0.61 (LOCF) to 0.48 (MLMI-PMM), which corresponds to a relative improvement of 21%. Conversely, when other imputation methods were applied, the RMSE ranged from 0.49 (MLMI-RND) to 0.53 (rounding). These differences remained consistent across all 18 evaluated scenarios, even when EDSS scores were missing not at random (MNAR) (scenarios 16–18).
The RMSE of estimated hazard ratios was highest when adopting LOCF, and lowest when adopting multilevel imputation with PMM (Table 2). Results in the Supplemental Information (Figure S4) indicate that these discrepancies mainly arise due to bias. For example, in scenario 16 where EDSS scores are MNAR and both treatments are equally effective, the average treatment effect estimate varies between 1.12 (LOCF), 1.07 (rounding), and 1.00 (MLMI-PMM). This corresponds to a bias of 0.12 for LOCF (where RMSE = 0.13), a bias of 0.07 for rounding (where RMSE = 0.08) and zero bias for multilevel imputation (where RMSE = 0.03).
Simulation results.
The reference HR was derived by calculating the counterfactual outcome for each individual and estimating the hazard ratio in the resulting sample of 20,000 individuals. The mean across 500 simulations was then used as “unbiased” source. For scenarios where there is no treatment effect, the reference HR was set to 1. LOCF: last observation carried forward; RND: rounding; MLMI-RND: multilevel imputation with rounding; MLMI-PMM: multilevel imputation with predictive mean matching.
Simulation results.
The reference HR was derived by calculating the counterfactual outcome for each individual and estimating the hazard ratio in the resulting sample of 20,000 individuals. The mean across 500 simulations was then used as “unbiased” source. For scenarios where there is no treatment effect, the reference HR was set to 1. LOCF: last observation carried forward; RND: rounding; MLMI-RND: multilevel imputation with rounding; MLMI-PMM: multilevel imputation with predictive mean matching.
Finally, simulation study results in Supplemental Figures S5 and S6 indicate that estimated confidence intervals (CIs) were too narrow when adopting LOCF, rounding, AVG, or multilevel imputation with rounding. The poorest coverage rates were obtained for LOCF, which yielded 0% coverage in many scenarios. Coverage improved when adopting multilevel imputation, especially when draws for the EDSS scores derived using MLMI-PMM. Although the coverage for multilevel imputation was too low when generated draws were rounded to the nearest half-integer, MLMI-PMM yielded a coverage close to the nominal level of 95%.
We used data from the Multiple Sclerosis Partners Advancing Technology and Health Solutions (MS PATHS), a collaborative network of healthcare institutions collecting standardized measurements on patients living with MS.
28
MS PATHS is the first demonstration project of a learning health system in MS, collecting standardized clinical and imaging data on MS patients across 10 healthcare institutions in the United States (
Cohort definition and methods
The study population consisted of new users of DMF or fingolimod (FTY) between November 2015 and July 2021 with any type of MS. New users were defined using the self-reported current DMT recorded at each visit. A treatment sequence consisted of consecutive visits on the same DMT. We assumed that the first visit of the treatment sequence corresponded to treatment initiation and labeled that first visit as a baseline. We defined the follow-up time from baseline until the last MS PATHS follow-up visit, treatment switch, or treatment discontinuation, whichever comes first. Disease progression was defined with the Patient Determined Disease Steps (PDDS) score, a patient-reported measure of disability strongly correlated with EDSS with ordinal scores ranging from 0 (normal) to 8 (bedridden) in 1-point increment. 29 The outcome was time to CDP calculated as the time in days from baseline until disease progression, defined as a 1-point increase in PDDS score from baseline PDDS, provided the increase can be confirmed at a follow-up visit 3 months later. Patients with complete baseline data were kept for the primary analysis. Additional details on cohort definitions are available in A4.1 of the Supplemental Information.
We compared LOCF, rounding, and the proposed multi-level modeling approach to generate imputations for PDDS score and derive the corresponding time to CDP. We reconstructed the entire PDDS trajectories following pre-specified grids of visits every 3 and 6 months. If no CDP was identified, patients were censored at the end of the treatment sequence. If a disease progression was observed less than 3 months before the end of treatment sequence, we allowed to impute beyond the follow-up time to confirm the progression. An inverse probability weighted Cox regression model accounting for clustering by MS PATHS site was used to compare the effect of DMF vs FTY on time to CDP with marginal hazard ratios. The following baseline covariates were considered: age, sex, MS type, years of education, disease duration, PDDS score, self-reported number of relapses in the past 12 months, prior DMT classified as no prior DMT or high, medium, or low DMT efficacy, b history of cardiovascular disease, and history of diabetes. Corrected sandwich variance estimation was used to derive robust SEs using stabilized weights. 27 LOCF and rounding each generated single imputation for time to CDP, so the Cox model was applied once for both methods. With our multi-level modeling approach, we generated 100 imputations, estimated the HR and corresponding SE for each imputed time to CDP, and combined the results using Rubin's rule. Details on the multi-level model specification and weight calculation are available from A4.2 of the Supplemental Information.
As a sensitivity analysis, we adopted multiple imputations by chained equations in the entire cohort with incomplete baseline data. To this purpose, we implemented LOCF, rounding and the proposed multi-level modeling as a conditional modeling approach in the R package
Results
As of July 2021, the MS PATHS database included 16,152 patients. Of those, we identified 254 and 202, new users of DMF and FTY, respectively, with complete baseline data. Table 3 shows the baseline characteristics of the cohort by treatment group.
Baseline characteristics of 456 patients with complete data who initiated DMF or FTY in MS PATHS.
*Interferons and glatiramer acetate were classified as low efficacy. Teriflunomide, methotrexate and mycophenolate mofetil were classified as medium efficacy. Alemtuzumab, ocrelizumab, natalizumab, and rituximab were classified as high efficacy. DMF: dimethyl fumarate; DMT: disease-modifying therapy; FTY: fingolimod; MS: multiple sclerosis; PDDS: Patient-Determined Disease Steps; SD: standard deviation.
Baseline characteristics of 456 patients with complete data who initiated DMF or FTY in MS PATHS.
*Interferons and glatiramer acetate were classified as low efficacy. Teriflunomide, methotrexate and mycophenolate mofetil were classified as medium efficacy. Alemtuzumab, ocrelizumab, natalizumab, and rituximab were classified as high efficacy. DMF: dimethyl fumarate; DMT: disease-modifying therapy; FTY: fingolimod; MS: multiple sclerosis; PDDS: Patient-Determined Disease Steps; SD: standard deviation.
The median number of visits per patient was 2 (DMF: 2, FTY: 3), with 102 patients (DMF: 62, FTY: 40) having only a baseline visit. The distribution of patient visits was similar for both treatment groups (Figure 3), and the median duration between consecutive visits was 185 days (interquartile range (IQR): 120; 261) for DMF, and 188 days (IQR: 120; 267.5) for FTY. Despite these similarities between the treatment groups, DMF and FTY patients substantially differed with respect to most baseline characteristics (A4.3 of the Supplemental Information). When adjusting for confounders, we found that DMF reduced PDDS by 0.02 point per year (not statistically significant) as compared to FTY, and that the residual autocorrelation between monthly PDDS scores was 0.13 (A4.4 of the Supplemental Information).

Distribution of number of days between consecutive visits for 192 new DMF users and 162 new FTY users having at least one follow-up visit beyond baseline. DMF: dimethyl fumarate; FTY: fingolimod.
We subsequently imputed PDDS scores on a 3-month and 6-month grid using LOCF, rounding and the proposed multilevel model. Results in the Supplemental Information indicate that these methods sometimes lead to different disease trajectories (A4.5) and survival data (A4.6). For example, we found that the median censoring time was 270 days for LOCF, 231 days for rounding, and 222 days for multilevel imputation with PMM. We also found some differences in overall survival. For example, the one-year risk of developing CDP varied between 17% (LOCF and MLMI) and 21% (rounding) for patients that received FTY, and between 14% (LOCF and MLMI) and 23% (rounding) for patients that received DMF.
The estimated HR and corresponding 95% CI are shown in Table 4 for the three imputation methods using either a 3- or 6-month pre-specified grid of visits. There was no evidence of a relative difference in effectiveness between DMF and FTY for any imputation method. All methods yielded an HR close to 1, with LOCF providing the largest SEs. SEs generally increased when the treatment effect was evaluated on a 6-month (rather than 3-month) grid. Similar results were found when chained equations imputation was applied in the cohort of 601 patients with incomplete baseline data.
Estimated HR of DMF vs FTY for time to CDP with three methods according to two imputation grids.
*Treatment effect estimates are only informed by patients with at least two visits. The effective sample size may therefore differ from the total number of patients (all of which are used to estimate the multilevel imputation model).
In this paper, we evaluated various methods for imputing longitudinal observations at arbitrary measurement times and deriving survival outcomes for subsequent comparative effectiveness research. These methods can be used in datasets where repeated measurements are available for variables of interest to facilitate analyses that require observations at specific time points or intervals. We extended previously proposed imputation methods for multicenter data, 21 and adopted additional random effect terms to account for the clustering of repeated observations. Further, we incorporated an autocorrelation structure to exploit the potential presence of correlated residual errors. This situation typically arises when the imputation model does not adequately capture the continuous time process of the repeated measurements; for instance, because the frequency of visits is driven by unobserved information. Moreover, to facilitate the imputation of semi-continuous and skewed variables, we implemented PMM. Finally, we coupled the proposed methods with a comparative effectiveness analysis by deriving a survival outcome from the imputed longitudinal trajectories and using inverse probability of treatment weighting to account for confounding. The proposed imputation approach can directly be implemented in MICE, and is, therefore, suitable to handle multiple and mixed-type time-dependent covariates that typically arise in electronic healthcare registries.
Results from an extensive simulation study demonstrated that our proposed method yields more accurate imputations than commonly used methods based on time series analysis. When the imputed values were used for statistical inference, we found that hierarchical imputation was least prone to bias and attained coverage levels that were close to the nominal level. Although the proposed imputation methods are designed for situations where observations are missing at random, results from the simulation study suggested that they may also work reasonably well when data are MNAR. Most likely, imputation performance under MNAR conditions strongly depends on the presence of autocorrelation between successive observations, as this allows to reduce bias in the imputed values. Further research is needed to evaluate this issue. Further, we found that MLMI offers better statistical properties when based on PMM. Possibly, this is because the variation of imputed values decreases when they are simply rounded to the nearest-half integer. Conversely, PMM generates imputed values by resampling from the observed data and therefore offers improved capabilities of preserving uncertainty.
The simulation study setup was motivated by the MS clinical context which uses the time to CDP derived from longitudinal measurements of the EDSS score. Thus, we demonstrated the performance of the new method for a particular application, that is, to recover a time-to-event outcome from a longitudinal variable captured at irregular measurement points within and across individuals. Other disease areas beyond MS use similar endpoints defined as the time from baseline to the occurrence of an event, where the event is derived from an underlying continuous or semi-continuous process (e.g., a biomarker or a score on a scale) only measured at certain time points. Nevertheless, the proposed method is suitable in much broader contexts, with any longitudinal outcome or variable. Moreover, the MS clinical context also informed the intermittent visit patterns in the simulation study, which were chosen to reflect realistic covariate- and outcome-driven follow-up patterns in MS. This led to a 1–2 time-points per year for collecting the longitudinal variable of interest. Other applications may have more or less frequent visits. For example, it would be more realistic to consider set-ups with more frequent visits for diabetes patients needing to monitor hemoglobin A1C every 3 months.
In the illustrative example, we used the proposed imputation method to recover the trajectory of PDDS scores in MS patients identified in MS PATHS, an ongoing demonstration project of a learning health system collecting data in clinical practice. The proposed method imputed PDDS scores according to a standardized follow-up visit schedule, which allowed one to observe and confirm a disability progression. We subsequently derived time to CDP, an important clinical outcome in MS, and evaluated the comparative effectiveness between DMF and FTY in terms of CDP. By adopting hierarchical imputation, we were able to obtain smaller SEs for treatment effect estimates (as compared to LOCF) and to include patients with missing covariate data in the analysis. All imputation methods suggested similar effectiveness of DMF and FTY, substantiating evidence from the literature.30,31
Some limitations need to be considered in this research. First, we considered that a reasonable number of repeated observations are available to generate imputations. When longitudinal responses are very sparse or limited in number, regression models with autocorrelation may not well be capable to accurately recover their temporal relationship. As MS studies using RWD typically have shorter follow-up times than the one used in the simulation study and may thus have less repeated measurements, it would be worth evaluating the methods in this context. Second, we implemented the R package
In summary, we recommend MLMI to address the presence of missing values in registry data with repeated measurements. In contrast to ad-hoc solutions that are based on time series analysis, our approach leads to less bias and is better capable of preserving adequate levels of uncertainty.
Supplemental Material
sj-docx-1-smm-10.1177_09622802231172032 - Supplemental material for Methods for comparative effectiveness based on time to confirmed disability progression with irregular observations in multiple sclerosis
Supplemental material, sj-docx-1-smm-10.1177_09622802231172032 for Methods for comparative effectiveness based on time to confirmed disability progression with irregular observations in multiple sclerosis by Thomas PA Debray, Gabrielle Simoneau, Massimiliano Copetti, Robert W Platt, Changyu Shen, Fabio Pellegrini and Carl de Moor in Statistical Methods in Medical Research
Supplemental Material
sj-zip-2-smm-10.1177_09622802231172032 - Supplemental material for Methods for comparative effectiveness based on time to confirmed disability progression with irregular observations in multiple sclerosis
Supplemental material, sj-zip-2-smm-10.1177_09622802231172032 for Methods for comparative effectiveness based on time to confirmed disability progression with irregular observations in multiple sclerosis by Thomas PA Debray, Gabrielle Simoneau, Massimiliano Copetti, Robert W Platt, Changyu Shen, Fabio Pellegrini and Carl de Moor in Statistical Methods in Medical Research
Footnotes
Declaration of conflicting interests
Funding
Supplemental material
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
