Abstract
Keywords
1 Introduction
In many clinical studies, increasingly complex data are collected. The complexity of the data may be due to its multivariate and longitudinal nature as measurements are often obtained for multiple biomarkers over time. Data of this kind have a complex correlation structure with correlation, for each patient, between measurements of a biomarker at different time points and between observed values of multiple biomarkers at a single time point. An additional complication is that collected data are often of varying types, with data being potentially continuous, counts, binary, or having multiple categories. Finally, the time points at which biomarkers are measured may be different between biomarkers and between individuals for a given biomarker.
Frequent clinical interest is in being able to classify patients into various groups corresponding to severity of their disease status or disease progression, based on the evolution of biomarkers observed over time. Our goal in this paper is to present a flexible and dynamic approach in which we use available longitudinal data on multiple biomarkers of various types to accurately classify patients into groups (such as diagnosis groups) in a discriminant analysis and to do so as early as possible.
1.1 Clinical motivation
We consider data from a study of patients with epilepsy to motivate our developments. We are interested in being able to identify those patients who will not achieve remission from seizures within five years of commencing treatment. For the purposes of this paper, this group of patients will be referred to as the refractory group. By contrast, a patient is defined as being in remission if they have had a continuous 12-month period without any seizures at any point within five years from diagnosis. Our aim is to use multivariate longitudinal clinical data from patients with epilepsy to identify, as early as possible, if a particular patient belongs to the refractory group. Early classification would allow clinicians to try alternative treatments with the hope of achieving adequate seizure control. Consequently, patients could be spared some time on unsuitable treatment regimes and receive more effective, individualised treatment.
Data were acquired from the Standard and New Antiepileptic Drugs (SANAD) study1,2 which involved patients diagnosed with epilepsy between December 1999 and August 2004. Follow-up data on these patients are available up until January 2006. Here, 1772 patients from the SANAD database are considered. These patients have been followed up sufficiently long to be known to belong to either the refractory group or the remission group. For all patients, biomarkers of different type (continuous, counts and binary) were collected over time. We remark that it is indeed possible for a patient to achieve remission and then begin to have seizures again but this is not considered in our application. For simplicity, once a patient achieves remission, they are considered as belonging to the remission group and all longitudinal measurements subsequent to the visit at which remission was achieved are discarded.
Most patients had annual clinic visits, although in some cases, the visits took place more often than annually. Information about the number and type of seizures as well as adverse events the patient has experienced since the previous visit were collected. A number of baseline covariates were also collected at the commencement of treatment (based on clinical relevance), including the patient’s age and gender, epilepsy type, whether any family members had a history of seizures, whether the patient had learning or neurological difficulties and to which arm of the SANAD study the patient had been assigned.
Out of the 1772 patients investigated, 1593 patients were in the remission group and 179 patients were in the refractory group. The median (min, lower-quartile, upper-quartile, max) follow-up times (in days) in the remission group was 710 (365, 480, 863, 1821) whilst in the refractory group was 1512 (1463, 1659, 1825, 1825). The difference in medians is easily explained due to the fact that patients who achieve remission will generally be observed for less than five years (the majority achieving remission within three years), whereas refractory patients need to be observed for at least four years to determine the refractory status.
In the following, we will consider three longitudinal markers, namely whether a patient had seizures or not since their last visit, which is binary, a transformation of the total number of seizures since their last visit (using the transformation
Figure 1 shows the change over time in the levels of each of the considered biomarkers for a sample of 20 patients in each diagnostic group. As expected, in the remission group, fewer patients experience seizures since their last visit than in the refractory group. In the refractory group, the likelihood of the patient having experienced seizures since their last visit increases with time, whilst in the remission group it decreases. For patients who achieve remission, the number of seizures decreases over time, whereas for refractory patients, the number of seizures experienced remains high. It is interesting to note the most dramatic increase/decrease occurs within the first 500 days of receiving treatment. In the refractory group, the number of seizures experienced increases with time, with again, the main increase occurring during the first 500 days. The difference between the two groups for the number of adverse events experienced is less noticeable. Initially, both groups experience similar numbers of adverse events but as time increases the refractory patients appear more likely to be experiencing more adverse events than the remission patients.
Observed longitudinal profiles of an indicator of whether a patient had seizures, 
In summary, Figure 1 highlights the challenges of the epilepsy data: having three longitudinal markers of different type, measured at different time points within and across subjects. The differences in remission and refractory groups can be subtle when each biomarker is considered individually. In this work, we aim to model the markers simultaneously and to use the model for discrimination between groups.
1.2 Dynamic longitudinal discriminant analysis
The SANAD data have been primarily analysed elsewhere,1–4 with most previous work concentrated on modelling of time to seizures using the baseline characteristics as prognostic factors. 3 A different problem will be tackled in this paper. For each patient in our dataset, we have information on not only their baseline characteristics and values of the longitudinal biomarkers but also on whether they belong to the refractory or to the remission group. It is our aim to use these data to develop a statistical approach which can be used to predict the five-year seizure status (i.e., pertinence into either the refractory or the remission group) of a new patient based on their baseline characteristics as well as longitudinally gathered biomarkers. As such, the problem can be classified as a problem of longitudinal discriminant analysis (LoDA).
In addition, we aim to refine the prediction of the seizure status whenever new longitudinal observations become available at each consecutive visit. To predict the patient’s seizure status at a particular time point, we can use not only the last available longitudinal measurements (as is often the case in clinical practice) but the whole longitudinal history of relevant biomarkers known by the time we are conducting the prediction. Due to this dynamic update of the seizure status prediction, we will refer to dynamic LoDA.
To formalize our research problem, let us assume that patients are to be classified into
For given
A task of the dynamic LoDA is to use, at a given time point
In order to avoid misunderstanding, we point out that a similar term
In contrast to those methods, we do not deal with dynamic estimation of a subject-specific time-to-event distribution. We consider dynamic discriminant analysis where we aim to use historical data to predict dynamically (also as new longitudinal information becomes available) the group membership of a patient which is only known in the future.
Finally, note that most of the longitudinal biomarkers in the SANAD data (and many other clinical applications) are either binary or counts, in which case existing methodology for LoDA is scarce and largely unsuitable as will be indicated below.
1.3 LoDA based on mixed models
Classical methods of discriminant analysis, see, e.g., Chapter 4 of Hastie et al.10 like linear discriminant analysis or discrimination based on logistic regression do not apply in our context. These methods are often applied when only baseline characteristics or other cross-sectional characteristics related to a chosen time point, common for all patients, are to be used for discrimination. In more recent years, relevant work has been done in capturing the longitudinal nature of clinical data and using it for classification via methods of LoDA.11–18 These authors base their LoDA methodologies on the classical linear mixed model
19
and propose discriminant methods based on longitudinal measurements of a single (
Nevertheless, using a single marker may be insufficient to accurately classify the subjects into prognostic groups. By using multiple markers (
As indicated above, most methods of LoDA exploit mixed model methodology. A benefit of its usage is that data do not have to be measured at regular intervals. It is possible for patients to be observed different numbers of times and at irregularly spaced intervals. In addition, it is not necessary for all biomarkers to be measured on each patient at each visit. For example, it is possible for one biomarker to be measured at one visit and then another at a different visit. This flexibility is useful in clinical applications where regularly spaced observations are rarely achieved, and not all biomarkers are measured at the same time point.
Unfortunately, the above referenced methodologies are not directly suitable when there are markers that are not all continuous (as in our application). A related development made towards LoDA with multiple markers of various types has been made by Fieuws et al. 23 who predict renal graft failure using combination of linear, non-linear and generalized linear mixed models (GLMMs). All considered markers are combined into a multivariate mixed model by specifying a joint distribution for the random effects. Computational complexity of the maximum likelihood estimation (MLE) is tackled by using a so-called pairwise fitting approach which proved to be a useful approximation towards MLE. They also show that the prediction is better when considering multiple markers than by considering only a single marker.
In LoDA methods based on multivariate longitudinal markers, the complex correlation structure between various markers is mostly taken into account by assuming a joint distribution for all random effects in the underlying mixed models. In each of the references mentioned previously, except for Komárek et al., 21 the random effects are assumed to follow a normal distribution. However, as shown by Verbeke and Lesaffre, 24 this assumption cannot easily be checked. Moreover, under misspecification of the random effects distribution, estimates of the mixed model parameters may become seriously biased 25 and consequently, the performance of the discriminant procedure may also be affected. In the mixed models literature, several extensions avoiding the normality assumption for the random effects have been proposed. 26 Nevertheless, applications of such models in the LoDA context are still rare. One of the few works in this direction is described by Komárek et al. 21 who consider a multivariate linear mixed model with distribution of random effects specified as a finite normal mixture, which robustifies the model towards misspecification of the random effects distribution. To overcome the computational complexity of the MLE, they use Markov Chain Monte Carlo (MCMC) methodology within a Bayesian framework.
1.4 Towards robust LoDA based on multivariate longitudinal markers of different types
The aim of this paper is to extend a multivariate LoDA method so that (i) it allows for multiple longitudinal markers of different types as requested by data from the SANAD study and (ii) the underlying model is robustified against possible misspecification of the random effects distribution.
We are aware of two methodologies available in the literature that satisfy either (i) or (ii) but none of them both of the requirements. The approach of Komárek et al. 21 fulfils (ii) but only continuous markers can be used. We allow for binary and count biomarkers by replacing the underlying multivariate linear mixed model with the multivariate generalized linear mixed model (MGLMM).
On the other hand, the method of Fieuws et al. 23 allows for markers of different nature but normality of the random effects is assumed. By using a pairwise fitting approach, these authors attempt to overcome the complexity of finding the maximum likelihood parameter estimates. In this paper we take a different approach by using Bayesian methods with MCMC estimation and considering a normal mixture in the distribution of random effects to robustify the model against misspecification of the random effects distribution.
Conceptually, the LoDA methodology proposed here follows that of Komárek et al. 21 Nevertheless, to allow also for binary and count biomarkers, we replace the underlying multivariate linear mixed model used therein by the MGLMM. To robustify the model against misspecification of the random effects distribution, we shall consider a normal mixture in the distribution of random effects. In this paper, we obtain a robust group-specific model that will be further used in the LoDA procedure.
An outline of the remainder of the paper is as follows. In Section 2, we describe the MGLMM with a mixture distribution for the random effects. This allows us to jointly model the longitudinal profile of each marker in each prognostic group. We also describe the MCMC procedure that is applied to infer on the model parameters. Section 3 describes the LoDA used to classify new patients into prognostic groups. An example of our methodology applied to the SANAD data is shown in Section 4 with a summary provided in Section 5.
2 MGLMMs with a normal mixture in the random effects distribution
2.1 Model
The basis for the LoDA procedure, explained further in Section 3, is a MGLMM with a normal mixture in the random effects distribution. This is assumed for the longitudinal evolution of considered markers in each prognostic group. Specifically, given
In (1),
In our SANAD example, we consider
Possible correlation between repeated observations of both the same marker and different markers measured on the same patient is accounted for by inclusion of the random effect vector
As mentioned above, a primary purpose of usage of the mixture in equation (2) is to robustify our model against misspecification of the random effects distribution. At this place, we should mention that also in other contexts, mixtures proved to provide a flexible distributional model26–28and hence can be considered as a sort of robustification against violation of the assumption on the random effects distribution.
In the following, let
2.2 Sampling-based Bayesian inference
For a training dataset of size
Due to a mixture nature of the likelihood (6) which additionally involves analytically intractable integration where the integrand combines a general exponential family and a normal density, maximum-likelihood-based inference is tractable only with difficulties. For this reason, a MCMC-based Bayesian estimation as proposed by Komárek and Komárková
29
will be adopted here. In the following, let
Komárek and Komárková
29
describe (i) how to specify the prior distribution
Finally, if it is assumed that the model parameters for different prognostic groups are apriori independent and a
3 LoDA procedure
Let
3.1 Full Bayesian prediction
Having proposed the Bayesian inference for the model parameters using the training dataset
Here,
To calculate
With the frequentist (non-Bayesian) LoDA methodologies,32,33 classification of the new subjects is usually based on the group probabilities (10), in which the unknown parameters
Finally, we note that when evaluating (11), analytically intractable integral from (3) is in general involved in calculation of the marginal densities
3.2 Marginal, conditional and random effects prediction
In several previous works on LoDA based on the mixed models,21,32,33 the authors distinguish so-called
The
To calculate the
The mean of this distribution, which is, in fact, the empirical Bayes estimator of the random effect value given the group is usually exploited in the LoDA procedure.
33
With the Bayesian approach, it is natural to consider, in the mood of the Bayesian data augmentation,
34
the unobservable random effect value
In a similar way, the
With the MCMC-based Bayesian inference, the estimators
We have described here three possible methods of prediction. It is entirely possible that different choices of method would result in different predicted group status for a particular patient. In the process of testing and building the model, one must assess the predictive ability of any of the three methods to determine which works best.
4 Application to SANAD data
Section 1 gives an overview of the SANAD data and summary information. In this section, we present the results of the methodology presented in Sections 2 and 3 when applied to the SANAD data.
As described in Section 1, we consider three longitudinal markers to predict refractory or remission patients. For the binary marker, whether a patient had seizures or not since their last visit, we use a logistic model as the form of the GLMM. For the number of adverse events (count marker), we consider a log-Poisson model. Finally, for the number of seizures experienced since the previous visit, we utilize a log transformation of the form,
As explanatory fixed effect covariates, we will use (in both prognostic groups) (1) time since last visit (
4.1 Dynamic LoDA procedure
As indicated in Section 1.2, we update the probabilities of a future patient’s group membership each time new information is available. This is achieved by applying repeatedly the formulas of Section 3 while taking information available by each visit time in place of
We proceed as follows. We consider the first clinic visit for each patient. If the estimated probability of being in the refractory group is greater than a chosen cutoff,
Of course, other schemes would have been possible. If it was the case that we were equally interested in both remission and refractory patients, we could assign a patient to either group if their probability of belonging to the group was greater than
For either scheme, the cutoff
In the following analysis, we use 70% of our data to train the models and the remaining 30% to test the predictive accuracy. We repeat this process 100 times in a cross-validation procedure. For each split of the data into training and test sets, we calculate various measures of predictive accuracy and average them across the 100 splits.
4.2 Selecting the number of components in the mixture distribution
The MGLMM introduced in Section 2 which forms the basis of the discrimination procedure considers a normal mixture (2) in the random effects distribution. In general, the number of mixture components for each of the prognostic groups,
Penalized expected deviance for models with
These values were based upon the full data available in each group. The models with the best PED values are shown in bold for each group.
Comparison of the choice of
PCC: probability of correct classification; AUC: area under curve; PPV: positive predictive value; NPV: negative predictive value.
The predictions are based on 100 splits of the data where 70% of the patients in each group were used to train the MGLMMs and the remaining 30% were used to test the predictive accuracy.
4.3 Results of the dynamic LoDA
Having shown that there is an advantage to selecting
Posterior summary statistics and highest posterior density (HPD) credible intervals for the fixed effects, and random effects in a model with
SD: standard deviation; TLFU: Time since Last Follow Up.
These statistics are based on the full longitudinal data available in each group.
By comparing the parameter estimates in Table 3, we can see that for patients who will ultimately achieve remission, older patients, male patients and patients without generalized epilepsy are less likely to have seizures and expected to have fewer seizures than young patients, female patients and patients with generalized epilepsy, respectively. Patients in both models are expected to experience fewer adverse events as time from diagnosis increases, perhaps because the clinicians have had more time to find suitable medication to avoid side effects in some patients.
The marginal and conditional dynamic LoDA approaches give good classification, as shown by high sensitivity, specificity and PCC values (see Table 5, first two columns). The random effects prediction approach works less well in this case. We are not the first to have noticed differences in the predictive accuracy of the three approaches. Komárek et al.
21
found that the random effects prediction was the best when considering a study of primary biliary cirrhosis, whilst Morrell et al.
33
found that the marginal method was the most successful at identifying prostate cancer patients. Which dynamic LoDA method works best seems to depend upon the application considered. The cutoff value regarded as optimal (e.g. 0.74 for the marginal prediction in Table 5) corresponds to the point closest to the top left hand corner of the ROC curve (see Figure 2). We point out here that the three methods of prediction are to be regarded as alternative competing potential classifiers. As such, there is no reason to expect that they give similar performance. Each method has a different cutoff that is optimal for that method. This is to be expected, and we note that these cutoffs are not directly comparable, since the probabilities they relate to are not the same.
Receiver Operating Characteristic curves of the dynamic LoDA using the marginal (solid red), conditional (dashed blue) and random effects (dot dashed green) prediction methods. The longitudinal observations on a randomly selected refractory and remission patient. The refractory patient was a 35-year-old male with generalized epilepsy randomized before 6 June 2001, whilst the remission patient was a 44-year-old male with generalized epilepsy also randomized before 6 June 2001. Summary of the classification accuracy for each of the marginal, conditional and random effects methods and for traditional LDA and QDA. LDA: linear discriminant analysis; QDA: quadratic discriminant analysis; PCC: probability of correct classification; AUC: area under curve; PPV: positive predictive value; NPV: negative predictive value. These results are based on averages across 100 splits of the data into training and test sets. For the dynamic LoDA (first three columns), prediction stops if a patient is predicted as refractory whilst for full data predictions (columns 4 to 6), all data up until the visit before the group status is confirmed is used in the prediction. The final two columns present the results of prediction using LDA and QDA based on baseline characteristics and using no longitudinal information.
To illustrate our allocation scheme outlined in Section 4.1 and to help to interpret the parameter estimates in context of discrimination, we present the longitudinal data of two patients in Table 4, one patient who achieved remission and another who had refractory epilepsy. We present for each patient the time of their clinic visits and their longitudinal information gathered at each visit. First consider the refractory patient, Patient (a). At his first four appointments, although he has had many seizures and in some cases experienced adverse events, his probability of being in the refractory group does not yet rise above 0.74 (the cutoff determined to be optimal in Table 5). Up until this point, he would not be predicted as refractory and would remain under observation. Only at the fifth visit does this probability rise above the cutoff of 0.74 and at this point the marginal prediction method allocates him to the refractory group. For this particular patient, this turned out to be the correct prediction as can be seen by viewing his further clinic visits. By considering his baseline characteristics with the estimated model parameters in Table 3, we see that patients with generalized epilepsy have increased likelihood of experiencing seizures (and in fact many seizures) even if the patient would ultimately achieve remission. This is one reason why Patient (a) initially has low probability of being in the refractory group despite experiencing seizures. At these early time points, we are not yet sufficiently confident that we can predict he will be refractory. However, we are still able to accurately classify him after 833 days (approximately two years and three months) which is considerably earlier than waiting five years to determine their status.
In contrast, Patient (b) ultimately achieves remission. He has initially low probabilities of being refractory due to having no seizures. When he does experience seizures, his probability of being refractory increases to 0.23 but is still well below the required cutoff of 0.74. As this patient experiences no further seizures, his probability of being refractory drops again and at the visit prior to remission being confirmed he is correctly classified as remission. This is confirmed to be correct at his visit when
The allocation scheme has been specifically designed to identify refractory patients. We have set up a scheme whereby as soon as a patient is classified as refractory; we stop predicting for this patient and investigate alternative treatment options. Questions may arise in these kind of settings as to how long one must wait to be confident of the prediction. We have shown that by observing a patient until their probability of being refractory is greater than 0.74 then over 90% of remission and refractory patients are correctly identified.
A further significant finding in the example is the gain in lead time by using the dynamic approach. We define the lead time as the average time, before clinical classification can be confirmed, at which our method can correctly predict a patient as belonging to the refractory group. The corresponding prediction time is the average time since diagnosis at which patients are correctly identified as belonging to the refractory group. We emphasize that these two measures are calculated using those patients who were truly refractory and also predicted to be refractory by the model. The lead times shown in Table 5 consider those patients who are truly in the refractory group and are predicted to be in the refractory group. For the dynamic marginal prediction method, the lead time is 651 days. This means that we can identify those patients who will not achieve remission from seizures almost two years before they are clinically observed as such on average. This is a good time gain, allowing clinicians to consider other forms of treatment, so that patients do not have to endure the adverse side effects of unsuitable treatments.
We now further explore the dynamic LoDA scheme. We chose one of the 100 splits of data into training and test sets. We chose a split such that the sensitivity and specificity were close to the average sensitivities and specificities over the 100 splits. Using a cutoff of 0.74 (determined to be optimal for the marginal prediction, see Table 5), patients were predicted as either refractory or remission using our proposed allocation scheme. The profiles of patients assigned to each of the remission and refractory groups based on a marginal prediction scheme are shown in Figure 3. Refractory patients that are misclassified as remission cases (three patients, top row) have low probabilities. This was due to infrequent seizures and generally low numbers of seizures.
Changes of marginal group membership probabilities over time. The profiles are from one test set of 30% of patients. Their probabilities are calculated using the model developed on the remaining 70% of patients. The top row shows those patients whose true status is refractory whilst the bottom row shows the true remission patients. The left hand panels show all patients who are classed as remission within five years. The right panels show the patients who are predicted as refractory (up until the point at which they are classified as refractory).
Most of the patients who are predicted correctly as refractory have high probabilities almost immediately of being in the refractory group. These are identified early which is consistent with the good lead times achieved, as shown in Table 5. Some of the patients who are truly refractory but were classed as remission could be correctly classified by lowering the cutoff (e.g., to 0.5). However, this would be at the cost of increasing the misclassification rate of remission cases.
In the bottom row of Figure 3, the true remission cases are shown. Most of the patients correctly identified as being in the remission group have high probabilities of being in the remission group very early on. Those patients who are wrongly predicted as refractory are generally those who have been observed for longer and hence taken longer to achieve remission. Such patients may initially have high numbers of seizures and so have initially high probabilities of being in the refractory group. A limitation of our allocation scheme is that these patients would be classed as refractory and then prediction would stop for these patients. It is possible that if they were observed for longer, their probabilities of being in the remission group would increase. This is a limitation with any classification scheme where an intervention is planned following a positive result.
4.4 Which longitudinal biomarkers to use
Comparison of possible models under the marginal prediction scheme based on averages of 100 splits of the data into training and test sets.
PCC: probability of correct classification; AUC: area under curve; PPV: positive predictive value; NPV: negative predictive value.
The predictive accuracy of the univariate model involving the binary variable,
4.5 Benefits of dynamic LoDA
Dynamic LoDA has received increased attention in recent years in the statistical literature. It has become very desirable to have methods of prediction that can be updated at each time point. The alternative to this is to wait until all the data are gathered and then make a prediction. In our application, this would involve waiting for almost five years in order to determine patients’ status. Obviously, in this scenario, there would be no need for classification methods since we would simply have observed which group patients belong to. We would have no misclassification but at the cost of giving some patients ineffective treatment for potentially five years.
By contrast, with our dynamic allocation scheme, the risk is that a patient could have been wrongly classified as refractory, when if followed up a bit longer they would have been classified as remission.
It is commonly thought that observing a patient for longer leads to increased information and so increased accuracy in prediction. We explored this in our example, by comparing the prediction results in the first three columns of Table 5 with those obtained by using all information gathered on a patient up until the visit before their status was confirmed (columns 4 to 6 of Table 5). In this setting, we use all available longitudinal information for each patient. The benefit of waiting until all information is gathered is a small increase in the PCC and specificity, while no benefit is observed in sensitivity or AUC for the marginal prediction (Table 5 and Figure 4). The most evident advantage of the dynamic LoDA over the use of the full data is the significant difference in lead times and prediction times. By waiting for all the data to be collected, patients would have to wait more than two years extra to be classified, whilst only making a minimal gain in predictive accuracy.
Receiver Operating Characteristic curves of the prediction using the marginal (solid red), conditional (dashed blue) and random effects (dot dashed green) prediction methods. The thick lines represent the dynamic allocations whilst the thin lines represent the use of the full data.
At the other extreme, an alternative would be to simply predict a patients’ group membership at diagnosis, based on various baseline characteristics and take no account of accumulating longitudinal information. We examined this possibility using traditional linear and quadratic discriminant analysis methods (LDA and QDA see Chapter 4 in Hastie et al. 10 ). These results, also based on 100 splits of the data into training and test sets, are presented in the final two columns of Table 5. Although reasonably accurate prediction can be made at diagnosis, significant improvements in predictive accuracy can be obtained by updating predictions as new information becomes available for each patient.
In this section, we show that there is merit in considering how a patient’s clinical data change over time during observation and updating the prediction of their five-year status each time new information is available. In addition, we have shown that allocating a patient to the refractory group as soon as their probability of being in the refractory group rises above a cutoff (as opposed to observing the patient for five years) does not decrease the predictive accuracy and allows refractory patients to be identified much earlier on.
5 Discussion
In this paper, we propose a time-dependent discriminant analysis approach that allows for the inclusion of multiple longitudinal biomarkers of various types. Binary, Poisson and continuous longitudinal markers can be included within a MGLMM. An implementation of the methods described in this paper has now been added to the package mixAK 30 of the R software. 31
The longitudinal profiles of considered biomarkers are described using GLMMs. We have allowed for extra flexibility through the inclusion of a mixture distribution of the random effects. These random effects capture the correlation between markers and between observations of a particular marker.
In the clinical application with SANAD data, the inclusion of a normal mixture for the random effects distribution showed only a mild impact in classification accuracy. Nevertheless, the impact can in general be much more considerable. An example of such situation is when one of the groups is characterized by subdivisions of different longitudinal behaviour of the considered markers. This subdivision might not be of interest for classification, nevertheless, if properly taken into account, e.g., by assuming a mixture distribution for random effects, it may considerably improve the classification accuracy. Moreover, since mixtures are in general considered as a suitable semi-parametric model for unknown distributions, they are more able to adapt to model misspecification, and so should be considered as a way of limiting the effect of model misspecification. In addition to reducing the chances of model misspecification, including mixtures may in some cases improve the fit of the model by reducing measures such as the PED. Checking improvement of model fit and in predictive accuracy will determine if this methodology will be a useful tool in any particular example.
In our context, the SANAD database which has more than 1700 patients allowed us to fit reasonably complex models containing three longitudinal markers and six covariates for each of them. We must point out that if very small sample sizes were available then more simple models may need to be considered. Some insight into how large sample is needed to fit models of given complexity can be gained from Komárek and Komárková 29 who present results of a simulation study towards properties of the estimators of parameters of the MGLMM that is behind our LoDA procedure.
One of the limitations in our application is that once a patient has achieved remission, the follow-up data after achieving remission are discarded. This has a direct effect on the length of follow up for some patients in the remission group, although conceptually one could argue that only the profile of the longitudinal biomarkers before achieving remission are of interest. A possible consequence of this is that the longitudinal profile of remission patients at late time points may be less accurately estimated since fewer remission patients are observed for that long. The limitation from a clinical point of view is that relapse in patients with epilepsy after achieving remission is not considered here.
Using longitudinal information along with dynamic LoDA schemes has been seen to give good classification results, yielding good prediction accuracy. In addition, we are able to make predictions about patients substantially earlier than is currently possible showing the potential benefits of such an approach.
With our dynamic classification scheme used for the SANAD application, we dynamically update the allocation probabilities as new longitudinal information arrives, nevertheless, prediction of the group pertinence is performed for each patient only once. Indeed, each patient remains unclassified till either his allocation probability of being refractory exceeds the cutoff value or those allocation probabilities remain below the cutoff value for a predefined period of time (five years in our case). Consequently, standard accuracy measures (such as AUC, sensitivity, specificity, etc.) were applied to evaluate discrimination ability of our procedure. Alternatively, at each visit, we could have used the allocation probabilities and predicted the group allocation. This would then also possibly change dynamically over time and different approach would have to be taken to evaluate a discriminant ability of the LoDA procedure. To this end, one could adopt an extended definition of sensitivity, specificity and dynamic AUC as proposed by Heagerty and Zheng 36 in context of survival analysis and then further generalized in different contexts. 37 Nevertheless, since our main focus here was ultimately on identifying refractory patients at any point within the five-year period, we do not pursue this idea further in this paper.
We compared three approaches to prediction, namely marginal, conditional and random effects prediction and found that for our application both the marginal and the conditional approaches gave good prediction, with the marginal approach most often being the best. The random effects prediction was less accurate for the SANAD data.
We believe our methods could be used in a wide variety of applications. They allow for irregularly collected data, multivariate longitudinal data and can incorporate data of different types. Classification into prognostic groups based on biomarker evolution is an increasingly important aspect of clinical practice and the approach proposed here has the flexibility to be used with many different clinical biomarkers, increasing the options available to researchers. A useful extension to this work would be to allow for discrimination using genuine categorical or ordinal biomarkers. To this end, suitable regression models suggested recently in the literature38,39 for such outcomes could be considered.
In this paper, we present an example where patients are classified into one of two groups. However, the methods here presented are applicable for classification into three or more groups as, for example, in applications where the aim is to classify patients into various stages of cancer (as opposed to simply cancer against cancer free patients) giving wider applicability to the methods proposed.
