Abstract
Keywords
Introduction
At present, there is no universally accepted definition of rare disease. In the USA, rare diseases are defined as those affecting less than 200,000 individuals (prevalence <650/million population). In Europe, a disease is defined as rare when it affects 5 persons in 10,000 (prevalence: 500/million). A systematic search for definitions related to rare disease from organizations in 32 international jurisdictions resulted in 296 definitions from 1109 organizations published by Richter and colleagues. 1 The average prevalence threshold across organizations within individual jurisdictions ranged from 5 to 76 cases/100,000 people. The global average prevalence from the definitions used in the different jurisdictions would be 40 cases/100,000 people. Ultra-rare diseases are defined as those affecting one patient per 50,000 people (about 20 patients in 1 million). The limited size of a study population for an orphan indication (sometimes only a few dozen or 100 patients) makes the safety profile of an orphan drug only partially defined at the time of approval.2,3 In general, rare adverse drug reactions (ADRs) are difficult to detect for all drugs but for orphan drugs it is very likely that they remain undetected during the development of orphan drugs and possibly also for long periods following their approval. It is therefore extremely important that the elements and procedures used for safety monitoring for all drugs, but in particular for orphan drugs for rare diseases, are carefully evaluated and based on relevant information available about the drugs from all sources, including chemistry, manufacturing and controls, nonclinical toxicology information and any other data from previous experience in humans. If available, safety profiles of other approved drugs of the same class for other indications should also be used. 4
Big databases of suspected adverse drug reactions (sADRs) represent an important source of data for signal detection activities. Different statistical disproportionality methods have been evaluated to facilitate the detection of data suggesting potential signals from large databases.5–12 There is no method of disproportionality calculation preferred to others.13–17 EudraVigilance is the system for managing and analysing information on suspected adverse reactions to medicines that have been authorized or being studied in clinical trials in the European Economic Area (EEA). The European Medicines Agency (EMA) operates the system on behalf of the European Union (EU) medicines regulatory network. EudraVigilance supports the electronic exchange of ICSRs between EMA, national competent authorities (NCAs), MAHs and sponsors of clinical trials in the EEA, early detection and evaluation of possible safety signals and better product information for medicines authorized in the EEA.
In EudraVigilance different ADR terminologies are standardized by use of the Medical Dictionary for Regulatory Activities (MedDRA). This is a dictionary designed to be used in the registration, documentation and safety monitoring of products during and after the marketing authorization process. Developed by the International Council for Harmonisation of Technical Requirements for Pharmaceuticals for Human Use (ICH; multidisciplinary topic M1), MedDRA contains highly specific, standardized medical terminology. The structure of MedDRA is logical. There are five levels in the MedDRA hierarchy, organized from the very specific to the very general. At the most specific level, called ‘lowest level terms’ (LLTs), there are more than 70,000 terms, which reflect how information is communicated. Thus, these LLTs reflects how an observation might be reported in clinical practice. In the next level there are MedDRA ‘preferred terms’ (PTs). These are distinct descriptors (single medical concept) for a symptom, sign, disease diagnosis, therapeutic indication, investigation, surgical or medical procedure, as well as medical, social, or family history characteristics. Each LLT is linked to only one PT. Each PT has at least one LLT (itself). Related PTs are grouped together in ‘high-level terms’ (HLTs) based upon anatomy, pathology, physiology, aetiology, or function. HLTs are, in turn, linked to ‘high-level group terms’ (HLGTs). Finally, HLGTs are grouped into ‘system organ classes’ (SOCs) which are groupings by aetiology (e.g. infections and infestations), anatomical site (e.g. gastrointestinal disorders), or purpose (e.g. surgical and medical procedures). In addition, there is a SOC that includes issues pertaining to product and one containing social circumstances. The use of this drug dictionary is vital to ensure consistency. MedDRA is governed by a maintenance and support service organization (MSSO), which maintains supporting documentation up to date with each release of the dictionary and includes MedDRA training materials. This standardization eliminates the need for conversion from one terminology to another, thus preventing distortion of data. It guarantees consistency through the different stages of development, which facilitates effective cross-referencing. MedDRA is used by regulatory authorities, pharmaceutical companies and clinical research organizations worldwide, and its use is a regulatory requirement of MAHs and marketing authorization applicants in the European Union (EU).
The disproportionality method used in EudraVigilance 18 is the reporting odds ratio (ROR): proportion of cases for a drug–reaction/event combination (DEC) in relation to the proportion of cases that would be expected if no association existed between the drug and the reaction/event. The usefulness of this statistical method for signal detection is based on the hypothesis that when a product causes an event, the number of observed reports for the DEC will tend to exceed the number based on chance alone. The calculation of the ROR is based on a two-by-two contingency table19–23 (please refer to Table 1).
Two-by-two contingency table used to calculate the reporting odds ratio (ROR) in EudraVigilance.
Number of individual cases with the suspected medicinal product and the event of interest.
Number of individual cases with the suspected medicinal product but with no event of interest.
Number of individual cases with all other suspected medicinal products in EudraVigilance and the event of interest.
Number of individual cases with all other suspected medicinal products in EudraVigilance and without the event of interest.
A set of rules, known as a signal detection algorithm (SDA), based on the observed value of the disproportionality statistic and, usually, also on other variables (e.g. number of cases reported), is applied in the EudraVigilance Data Analysis System (EVDAS) to indicate when a given DEC should be highlighted as a signal of disproportionate reporting (SDR). SDRs could potentially indicate the presence of a new ADR for the product or a known ADR (i.e. ADR listed in the concerned labelling) reported more than expected in the database. Scientific and medical judgment have to be exercised to determine whether further analyses/evaluations of data is necessary to confirm or refute an SDR as a signal. SDRs are included in electronic reaction monitoring reports (eRMR) made available to MAHs. The eRMR for a particular drug substance is an output containing aggregated data and line listings of suspect ADR cases. Further details of the individual cases per DEC can be generated by MAH
A SDR is identified when the following conditions are met: (1) ROR lower bound of the 95% confidence interval greater than one in at least one of the geographical regions Europe, North America, Japan, Asia and rest of the world; (2) the presence in EudraVigilance of at least three or five cases including the event of interest with the active substance contained in the medicinal product under signal detection evaluation (threshold set to three cases if the active substance is in the additional monitoring list as per REG 726/2004 EC article 23 and as also defined in GVP Module IX or for paediatric population); (3) the event is included in the important medical event (IME) and/or designated medical event (DME) list as published on the EMA website.19–23
On the basis of the 2018 Annual Report on EudraVigilance for the European Parliament, the Council and the Commission the database currently holds over 14.5 million individual case safety reports (ICSRs) referring to over 8.3 million cases and is one of the largest pharmacovigilance databases in the world. It has undergone significant development in recent years. This has delivered enhanced functionalities allowing for a better support of pharma-covigilance activities and the protection of public health. Disproportionality methods used in EudraVigilance have demonstrated the ability to detect about 50% of ADRs as compared with other currently used methods of signal detection. 24
In 2018, the EMA’s signal management team reviewed in detail 2204 potential signals, that is, drug–event pairs from screening of the EudraVigilance database (78.7%), medical literature (17.8%), or information received from regulatory authorities or other sources.
In the context of a mandated pilot phase of use of eRMRs from EudraVigilance for signal detection by MAHs, which started in February 2018, 25 six validated signals had been notified to EMA by the end of 2018.
In 2018, the Pharmacovigilance Risk Assessment Committee (PRAC) of the EMA prioritized and assessed 114 confirmed signals (a 39% increase compared with 2017); 79% included data from EudraVigilance. Fifty of the assessed signals (44%) resulted in a recommendation for an update of the product information for patients and healthcare professionals, thus providing updated guidance on the safe and effective use of the medicines. In six of these cases, the PRAC also recommended a dear healthcare professional communication (DHPC) to highlight new important safety information to prescribers, and in two cases, also to update the risk management plan (RMP). One additional signal led to the update of the RMP to fully characterize and investigate the concern, in the absence of a product information update. In 24 cases (21%) continuing with routine safety monitoring of the medicine was considered sufficient. The evaluation of 39 signals (34%) was ongoing at the time of the annual report including 22
A list of all signals prioritized and assessed by the PRAC in 2018, some of which concerned orphan drugs, has been included in the attachment V to the 2018 annual report on EudraVigilance for the European Parliament, the Council and the Commission. 26
Spontaneous reports are submitted to Eudra-Vigilance only when a patient experiences an adverse event that may have been caused by a medicine. The screening of the EudraVigilance data does not provide information about the number of patients taking the medicine who do not experience an adverse event or the number of patients experiencing the same clinical event without taking a medicine, simply because such cases without an adverse event are not reportable to EudraVigilance. As consequence the relevant associations measured at population level (e.g. relative risks or odds) for clinical events for patients taking a medicine compared with those not taking the medicine cannot be calculated. Instead, signal detection relies on disproportionality measures that are relative proportions conditional on reporting to the database. 23 Interpretation of the statistical analyses using data from reporting systems such as EudraVigilance are necessarily accompanied by medical and clinical judgment and consideration of what is known about the product, the disease, the class, the identified and potential risks, exposure, etc.
The ‘crude’ ROR (cROR) applied to orphan drugs, calculated from the ratios of proportions of orphan drug–event combinations (ODECs) and proportion of DECs in the rest of the population present in the whole database treated with other medicines is affected by different limitations. A cROR would result from the comparison of a proportion of a DEC occurring in the small population suffering a rare disease (in most instances life-threatening condition) and the proportion of the same event combined to all the other drugs in the database, administered for a wide range of indications (please refer to Figure 1).

Crude reporting odd ratio for orphan drugs from EudraVigilance.
The small patient population and the associated small number of cases increases the width of the ROR confidence interval and as result, SDRs may not be identified only because the cROR lower bound of the 95% confidence interval would not be greater than 1 in at least 1 of the 5 geographical regions (but with the other 2 criteria of the SDR fulfilled). Some examples of this effect, which may be false negatives, that is, not determined as SDRs in eRMRs available from EVDAS have been identified for six different orphan drugs and are discussed in this article.
Nevertheless, it is important to note that for the same medicinal products, there were some ODECs that resulted in SDRs matching ADRs already labelled in the concerned summary of product characteristics (SmPC), but these are not discussed in this article because the purpose of this study is to identify whether false negatives might occur.
Material and methods
Between 25 March 2019 and 26 March 2019, eRMRs ‘fixed reference period type’ with no filter applied to the MedDRA hierarchy (i.e. radio button ‘none’ selected for the field ‘Reactions from the MedDRA hierarchy to filter the report results’) have been downloaded from EudraVigilance for six of orphan drugs reported in the additional monitoring list as per REG 726/2004 EC article 23 (eRMR time run on 25/03/2019: cabozantinib, daratumumab and panobinostat; eRMR time run on 26/03/2019: pomalidomide, ponatinib and venetoclax).
For each active substance the concerned eRMR was filtered as follows: (a) the field ‘IME/DME’ only to include the values ‘IME’ or ‘IME/DME’; (b) the field ‘SDR All’ only to include the value ‘No’; (c) the fields ‘ROR (-) Europe’, ‘ROR (-) N America’, ‘ROR (-) Japan’, ‘ROR (-) Asia’ and ‘ROR (-) Rest of the world’ only to include values lower than ‘1’; (d) the threshold was also fixed to include for each region only ODECs including adverse event terms reported at least three times to EudraVigilance. This resulted in a list of terms that were terms included in the IME/DME lists, were a SDR not detected based on the ROR lower bound of the 95% confidence interval, were a ROR from all regions which included values lower than 1 and were terms for which there had been at least 3 case reports reported to EudraVigilance.
For each active substance, the reaction MedDRA preferred terms (PTs) included in the eRMR resulting from the filter described above have been checked against the list of ADRs as included in section 4.8 of the concerned SmPC to determine the presence of possible false negatives in the eRMR (please refer to Figure 2).

Flow diagram showing the methodology followed to obtain ADRs resulting as false negatives.
Results
The results from this study are provided in Table 2. Overall, a total of 41 MedDRA PTs meeting the criteria of important medical events, each reported at least three times for at least one of the regions (‘ROR (-) Europe’, ‘ROR (-) N America’, ‘ROR (-) Japan’, ‘ROR (-) Asia’, ‘ROR (-) Rest’) and with cROR (i.e. obtained for each region) lower than 1 with the lower bound of the 95% confidence interval have been identified.
MedDRA PTs with cROR lower than 1 with the lower bound of the 95% confidence interval from eRMR obtained from EudraVigilance mapped against the SmPC.
eRMRs, fixed reference period, time run on 25 March 2019.
eRMR, fixed reference period, time run on 26 March 2019.
Adverse drug reactions (ADRs) reported as: VC/C, very common (⩾1/10) or common (⩾1/100 to <1/10): U, uncommon (⩾1/1000 to <1/100); R, rare (⩾1/10,000 to <1/1000); VR, very rare (<1/10,000); NK, not known (cannot be estimated from the available data) in the Summary of Product Characteristics (SmPC).
ROR for the region ‘Europe’.
ROR for the region ‘North America’.
ROR for the region ‘Japan’.
ROR for the region ‘Asia’.
ROR for the region ‘Rest of the World’.
DME, designated medical event; eRMR, electronic reaction monitoring report; IME, important medical event; MedDRA, Medical Dictionary for Regulatory Activity; PT, preferred term; ROR, reporting odds ratio; SDR, signal of disproportionate reporting.
Seven of the 41 MedDRA PTs, matched ADRs noted as ‘very common’ (frequency ⩾1/10) or ‘common’ (frequency ⩾1/100 to <1/10) in the concerned SmPC as summarized as follows:
- neutropenia (cabozantinib: cROR Europe = 0.31; cROR N America = 0.25);
- haemorrhage (cabozantinib: cROR Europe = 0.63; cROR N America = 0.48);
- renal failure (panobinostat: cROR Europe = 0.81; cROR N America = 0.09; cROR Japan = 0.94);
- myocardial infarction (pomalidomide: cROR N America = 0.44);
- neuropathy peripheral (ponatinib: cROR Europe = 0.46; cROR N America = 0.97; cROR Asia = 0.75);
- pulmonary embolism (ponatinib: cROR Europe = 0.09; cROR N America = 0.64; cROR Asia = 0.65);
- hyperkalaemia (venetoclax: cROR Europe = 0.79; cROR N America = 0.89).
Of the 41 MedDRA PTs, one matched with a ‘rare’ (frequency ⩾1/10,000 to <1/1000) ADR in the concerned SmPC: anaphylactic reaction (daratumumab: cROR Europe = 0.55; cROR N America = 0.55).
Two of the 41 MedDRA PTs, matched ADRs with frequency ‘not known’ (frequency cannot be estimated from the available data) in the concerned SmPC summarized as follows:
- myocardial infarction (cabozantinib: cROR Europe = 0.06; cROR N America = 0.18);
- cerebrovascular accident (cabozantinib: cROR Europe = 0.06; cROR N America = 0.72).
For 31 of the 41 MedDRA PTs, an exact match has not been found in the related approved SmPCs.
Discussion
This study investigated some of the limitations of eRMRs obtained from data in EudraVigilance for orphan drugs. As expected, the results from this study have confirmed some weaknesses of the abovementioned statistical calculations, which were not sensitive enough to detect certain ADRs, including some ADRs that are already listed as common or very common complications in the approved labelling of orphan drugs. The study had a specific focus on those SDRs not highlighted as potential SDRs by EVDAS as consequence of cRORs with the lower bound of the 95% confidence interval lower than 1. This may be explained by the small population suffering the rare disease and the very limited safety data available for orphan drugs in EudraVigilance. It is plausible that the under-reporting (e.g. low number of cases for orphan drugs; nonreported ADRs) plays a significant role in causing false negatives because this results in wider confidence intervals for the cROR.
Moreover, in most instances the calculation of cRORs applied to orphan drugs is influenced by different factors such as the indication of the drug, the wide spectrum of medical conditions/diseases of patients from whom reporting of ratios are derived (e.g. ratios of proportions of ODECs in the small patient population suffering the rare disease and proportion of DECs in the rest of the population treated with other medicines for a wide range of indications prescribed for completely different pathologies represented in the whole database). These factors may result in false negatives.
For all six different orphan drugs selected there were ADRs listed in the concerned SmPCs for which a positive SDR was not reported and, therefore, these represent false negatives.
This study did not investigate the generalizability of the same findings to other orphan drugs. It is possible that by applying the same methodology to other substances, which are orphan drugs, different results could be obtained. It should be also noted that this study evaluated eRMRs downloaded from EVDAS only once for each analysed orphan drug. It has not been investigated whether, for the same six orphan drugs used for the analysis, eRMRs downloaded in subsequent periods would provide similar results. This study did not examine further the relevance of the identified ODECs that did not match ADRs listed in the corresponding SmPCs and not defined as SDRs (i.e. whether they could be false negatives/false positives/true negatives/true positives, etc.).
Complementary analyses could be performed to evaluate whether sensitivity of the methodology could possibly be increased by replacing the cROR with a ROR obtained from a dataset filtered by relevant medical condition (disease/indication) of the drug of interest using the MAH EVDAS version. As example of an inappropriate comparison would be for, a SDR for an adverse event of haemorrhage occurring in the in the context of a haemato-oncological rare and life-threatening condition for which an orphan drug has been administered, compared with the SDR for the same event in patients who have been administered an over-the-counter medicinal product for the treatment of a nonserious condition (e.g. treatment with acetyl salicylic acid or with ibuprofen for headache). By filtering the whole EudraVigilance database by medical condition or common pathology ratios of proportions of ODECs in the small patient population suffering the rare disease would be compared with proportions of DECs in patients suffering similar medical conditions (e.g. for a drug used for haematological–oncological disease, sorting the dataset for comparison by haemato-oncological/oncological diseases). However, it is recognized that these details might not always be available in EudraVigilance. It is likewise important to consider that restrictions of the database to accommodate subgroup analysis may introduce other biases potentially causing the under detection of signals that would be detected if no filters to the whole dataset were introduced. As example, for a drug used to treat cancer the restriction of the whole dataset to cancer drugs could result in background disease-related events and ADRs of therapies becoming too similar for drug of interest and the comparator drugs thus resulting in the under-detection of signals. The fundamental basis of use of disproportionality, whereby the variety of drugs and indications form the basis of production of an expected reporting ratio, could break down and even frequently reported events (with an underlying causal link to the drug) might become difficult to detect.
Hauben
Following an approach similar to that described by Hauben and colleagues, 27 it would be interesting to investigate whether by applying a Standardized MedDRA Query (SMQ)-based analysis to signal detection for orphan drugs on data from EudraVigilance false negatives would be reduced by grouping different PTs with similar medical concepts. The report on the EVDAS pilot phase for MAHs is expected to produce further clarifications and recommendations on the use of EVDAS MAH version.
It is recognized that these subanalyses described above could be quite difficult to implement, necessitating significant experience and be quite burdensome from a resource perspective. Medical judgment of results of quantitative statistical analyses from databases should, wherever possible, be complemented by qualitative evaluations of ICSRs (for orphan drugs paying attention also at those ADRs not marked as SDRs in eRMRs) by experienced personnel in an attempt to identify potential false negatives.
