Development and validation of COMPASS: clinical evidence of orphan medicinal products – an assessment tool
© Picavet et al.; licensee BioMed Central Ltd. 2013
Received: 25 June 2013
Accepted: 5 October 2013
Published: 9 October 2013
Rare diseases are defined as life-threatening or chronically debilitating diseases with a prevalence of 50 out of 100,000 individuals or less. Orphan medicinal products (OMPs) are intended for the treatment of rare diseases. The assessment of quality of evidence in small populations is often complex. Many generic tools are unfit. Therefore, the aim of this study was to develop and validate a new tool to assess the quality of OMPs' clinical evidence (COMPASS).
Firstly, a draft version of the COMPASS tool, developed by the authors and consisting of three parts, was amended based on suggestions obtained in four rounds of expert consultation. Secondly, the tool was put through three rounds of validation. The data source was information provided on the Orphanet website and in European Public Assessment Report (EPAR) document of the European Medicines Agency.
The first pilot round revealed a high (92.2%) inter-rater agreement for part one of the tool. After further improvements, the final inter-rater agreement was 86.4% for part two (on methodological quality) and three (on quality of reporting) of the tool. The COMPASS tool does not attempt to score or rank the quality of clinical evidence, but rather to give an outline of various, key elements with respect to quality of clinical evidence of OMP studies.
The COMPASS tool can be applied to assess the quality of evidence of an OMP based on information in the registration dossier, for example by local reimbursement agencies, pharmacists or clinicians. In that way, the tool can contribute to making reimbursement and/or treatment decisions increasingly more founded on the principles of evidence-based decision making.
KeywordsQuality assessment Quality assessment tool Quality of reporting
Rare diseases are defined as life-threatening or chronically debilitating diseases with a prevalence of 50 out of 100,000 individuals or less . It is estimated that there are currently between 5,000 and 7,000 rare diseases . Orphan medicinal products (OMPs) are intended for the treatment of rare diseases . Studies to evaluate the effect of an OMP in patients with rare diseases are often hampered by the difficulty of enrolling a sufficient number of patients [3, 4]. For example, N-acetylglutamate synthase (NAGS) deficiency is a very rare disorder that can cause neonatal life-threatening hyperammonemia. To date, only few patients with NAGS deficiency have been identified. In Europe, treatment with carglumic acid for NAGS deficiency has been authorized based on efficacy data from four case reports and 12 patients in a retrospective data collection study [5, 6]. For some OMPs, it is clear that the quantity of clinical evidence cannot be obtained. Even so, achieving the highest quality of evidence should still be aimed for .
But how do we define quality of clinical evidence? According to GRADE (Grading of Recommendations Assessment, Development and Evaluation), quality of evidence reflects the extent of our confidence that the estimates of an effect are correct . Traditionally, randomized controlled studies are regarded as the gold standard in achieving high quality of evidence whereas case controls or case studies are considered of lesser, but not lacking, value [7, 9, 10]. Nevertheless, high quality evidence is needed to guide clinical decision-making .
The assessment of quality of evidence in small populations at the time of registration and/or reimbursement is often complex. However, the rising number of authorized OMPs and their increased use in clinical practice emphasizes the need for an objective assessment. Quality assessment involves evaluation of a study’s validity, i.e. the degree to which its design, conduct and analysis have minimised biases or errors . In general, there are three ways to assess the quality of studies: individual markers, checklists and scales [13–15]. Many generic tools, from simple checklists to extensive questionnaires, are currently used to assess studies [12, 16]. However, the majority of these instruments are unfit to assess clinical studies of OMPs, as they do not take into consideration the difficulties (ie small sample size, use of surrogate endpoints, etc.…) that are inextricably bound up with these studies. According to Khan, new tools can be developed, keeping in mind that all components of the tool should be selected with due consideration for its purpose. These components should capture both generic methodological issues and issues specific to the subject under review .
Therefore, the aim of this study was to develop and validate a new tool, COMPASS (Clinical evidence of Orphan Medicinal Products – an ASSessment tool), to assess the quality of OMPs clinical evidence that is presented for OMPs at the time of marketing authorization in the EU.
Design of the tool
The design of the tool was conceptualised after consulting the Centre for Evidence-Based Medicine (CEBAM), Leuven, Belgium. A draft version of the tool was drawn up based on elements derived from existing checklists supplemented with items specifically related to rare diseases and OMPs [13–15].
Validity of the COMPASS tool
Overview of expert consultations
Expert consultation #1
Physician - Regulatory
Academic - Regulatory
Expert consultation #2
Academic - Physician
Expert consultation #3
Expert consultation #4
Academic - Regulatory
The data source of the tool consisted only of information provided on the Orphanet website and in European Public Assessment Report (EPAR) and/or the Scientific Discussion (SD) document prepared by the Committee for Human Medicinal Products (CHMP) of the European Medicines Agency (EMA). These documents provide information about chemical, pharmaceutical, biological, toxico-pharmacological and clinical aspects of a drug . For practical and privacy reasons, we did not have access to the original documents submitted to EMA. However, we anticipated that the publicly EPARs sufficiently reflect these original documents. The assessment of the methodological quality was restricted to studies that were described as 'pivotal’ or 'main’ clinical studies. The analyses were performed per study, as opposed to per orphan medicinal product, due to possible methodological differences between the studies. No additional data from publications of those studies was used due to possible unsystematic reporting and publication bias.
Consistency of the COMPASS tool
A first pilot round was undertaken, in which five randomly selected OMPs (i.e. one from each “type”: betaine anhydrous (one indication, only literature reports), histamine dihydrochloride (one indication, open label study), idursulfase (one indication, RCT), pirfenidone (one indication, two RCTs) and sorafenib (two indications, two RCTs)) were analysed by two raters (E.P. and S.S.). The two raters completed all three parts of the COMPASS tool. A second pilot round was undertaken, in which four alphabetically successive OMPs (ziconotide, agalsidase alfa, sildenafil citrate and lenalidomide) were analysed by two raters (E.P. and D.C.). One rater completed all three parts of the tool, whereas the other completed the second and third part. The final version (Additional file 1) of the COMPASS tool was drawn up in accordance with the issues raised during the two pilot rounds.
The COMPASS tool consists of three parts; the first part collects general descriptive information about the OMP and its marketing authorization. The second part focuses on the assessment of the methodological quality (i.e. specifically related to study design, patient and study population, control arm, blinding, randomization and allocation, outcomes, adherence and statistical analysis) of the study. The last part assesses the quality of reporting. The Orphanet website was consulted to provide information on the prevalence of the rare disease in which the indication is authorized and its therapeutic need . The registration of the pivotal studies on EudraCT and/or clinicaltrials.gov was evaluated on their respective websites [17, 18].
In a third and final round, two raters (E.P. and S.S.) completed the tool for a sample of OMPs (n = 29). One rater (E.P.) completed all three parts of the tool, whereas the other (S.S.) completed the second and third part. Additionally, expertise in the medical field was believed necessary to answer the question “Is the duration of the study relevant to the natural history of the disease?”. Therefore, a third rater (D.C.) (ie a trained physician) also answered this question for all 29 OMPs. Upon disagreement between the raters, the assessment of D.C. was considered decisive.
In all three rounds, raters completed the tool independently and once-only. Additionally, raters were blinded with respect to results of others. The same information was available to all raters. After data collection, E.P. was responsible for comparison of the results.
All analyses (including the calculation of inter-rater agreement, by percent agreement calculation) were performed using MS Office Excel 2010.
Validity of the COMPASS tool
Expert consultation - number of issues discussed
Number of issues discussed
Expert consultation #1
Expert consultation #2
Expert consultation #3
Expert consultation #4
Consistency of the COMPASS tool
Inter-rater agreement (%)
1st pilot round (n = 5)
2nd pilot round (n = 4)
3rd round (n = 29)
All three raters independently evaluated the relevance of the study duration with respect to the natural history of the disease. There was agreement between three raters for 65.4% of the studies. In case of disagreement, the rater with a medical background was more inclined to assess the study duration as appropriate (77.8%) than raters without a medical background (E.P. 27.8% and S.S. 22.2%). Additionally, the raters without a medical background experienced more difficulties in evaluating study duration, as shown by their choice of “Don’t know” respectively nine and two times.
The goal of this research was to develop and validate a new tool COMPASS to assess the quality of OMPs clinical evidence presented at the time of marketing authorization in the EU. The COMPASS tool (Additional file 1) does not attempt to score or rank the quality of clinical evidence, but rather to give an outline of various, key elements with respect to quality of clinical evidence, as seen by experts. Ultimately, it is up to the evaluator to define minimum conditions of quality for an individual OMP or a set of similar OMPs. Ideally, these conditions should be defined without taking into account unmet need and disease severity, as these are not determining factors for quality of clinical evidence.
This COMPASS tool can be applied to assess the quality of evidence of an OMP based on the information in the registrations dossiers, for example by local reimbursement agencies for the review of clinical evidence or by pharmacists and clinicians upon considering a (new) treatment. Registration and reimbursement bodies currently acknowledge the data limitations for OMPs by providing them special considerations . However, over time they are likely to become more sensitive to data requirements . For example, non-binding recommendations for the approval of cancer drugs and biologics were issued by the Food and Drug Administration (FDA) several years ago . Nowadays, deviations from the guidelines, stated in a similar European document, should be thoroughly justified .
To improve the reliability of results, data extraction should be performed independently by at least two raters. The reliability of results has shown to be independent of blinding, for that reason data extraction should not necessarily be blinded . An exception on the number of raters can be made for the first part of the tool, as it collects more general and descriptive information about the OMP. Yet, minor differences between the raters can occur depending on the information source. To reduce variability, the source of information was pre-specified for some queries (ie specified the source of prevalence data to EPAR and Orphanet). There are more subjective questions in part two of the tool, emphasizing the need for independent raters.
Study quality is dependent, not only on methodological quality, but also on the quality of reporting . Indeed, shortcomings in the reporting can complicate the interpretation of the methodological quality. Correct and complete information should provide the reader with the ability to make informed judgements about the validity of a study . In practice, it was (nearly) impossible to assess the methodological quality of those (few) studies that were only very briefly discussed in the EPAR. Additionally, the interpretation was also complicated for those EPARs that consisted only of literature studies. To address the issue of poor reporting, part three of the tool focuses on quality of reporting. Additionally, for some questions, an additional check box was provided with 'Not reported’. Whilst in some cases the difference between answering 'No’ or 'Not reported’ may be vague, the subtle difference influences the evaluation of the quality of reporting.
The COMPASS tool has several strengths and weaknesses. The tool was developed after iterative rounds of expert consultation and underwent several pilot rounds to increase its validity. The tool assesses the level of clinical evidence that is presented in the pivotal studies at the time of marketing authorization. As such, it does not take into consideration the evidence in any of the supporting studies or evidence generated as part of post-marketing commitments. Adding data from publications would allow for quality control of the EPAR data, but was considered outside the scope of this study. Also, the tool is dependent on the quality of reporting in the EPAR and/or SD documents. Finally, a general medical knowledge of the rater is advisable to complete the tool. Additional check boxes ('Don’t know’ and 'Not reported’) were provided, to account for possible problems related to these last issues.
In conclusion, we developed and validated a new tool for the assessment of clinical evidence of OMPs. The COMPASS tool can for example be used by local reimbursement agencies for the review of clinical evidence from OMP registration dossiers or by clinicians and pharmacists upon considering a (new) treatment. Furthermore, we hope that the COMPASS tool can initiate and add to the open debate on study standards for orphan medicinal products. In that way, the COMPASS tool can contribute to making reimbursement decisions increasingly more founded on the principles of evidence-based decision making.
Clinical evidence of orphan medicinal products – an assessment tool
Orphan medicinal product
European public assessment report
Food and drug administration
Committee for human medicinal products
European medicines agency.
No sources of funding were used to assist in the preparation of this manuscript. The authors would like to thank the expert panel – Bogaert M., Cleemput I., De Nys K., Dooms M., Hulstaert F. and Tambuyzer E. – for their valuable comments and suggestions to improve the tool.
- Commission E: Regulation (EC) No 141/2000 of the European parliament and of the council of 16 December 1999 on orphan medicinal products. Off J Euro Comm. 2000, L18: 1-5.Google Scholar
- Orphanet - The portal for rare diseases and orphan drugs. http://www.orpha.net,
- Wilcken B: Rare diseases and the assessment of intervention: what sorts of clinical trials can we use?. J Inherit Metab Dis. 2001, 24: 291-298. 10.1023/A:1010387522195.PubMedView ArticleGoogle Scholar
- Tambuyzer E: Rare diseases, orphan drugs and their regulation: questions and misconceptions. Nat Rev Drug Discov. 2010, 9: 921-929. 10.1038/nrd3275.PubMedView ArticleGoogle Scholar
- European Medicines Agency. http://www.ema.europa.eu/ema/,
- Haberle J: Role of carglumic acid in the treatment of acute hyperammonemia due to N-acetylglutamate synthase deficiency. Ther Clin Risk Manag. 2011, 7: 327-332.PubMed CentralPubMedView ArticleGoogle Scholar
- Day S: Evidence-based medicine and rare diseases. Rare Dis Epidemiol. 2010, 686: 41-53. 10.1007/978-90-481-9485-8_3.View ArticleGoogle Scholar
- Balshem H, Helfand M, Schunemann HJ, Oxman AD, Kunz R, Brozek J, Vist GE, Falck-Ytter Y, Meerpohl J, Norris S, Guyatt GH: GRADE guidelines: 3. Rating the quality of evidence. J Clin Epidemiol. 2011, 64: 401-406. 10.1016/j.jclinepi.2010.07.015.PubMedView ArticleGoogle Scholar
- Barton S: Which clinical studies provide the best evidence? The best RCT still trumps the best observational study. BMJ. 2000, 321: 255-256. 10.1136/bmj.321.7256.255.PubMed CentralPubMedView ArticleGoogle Scholar
- Griggs RC, Batshaw M, Dunkle M, Gopal-Srivastava R, Kaye E, Krischer J, Nguyen T, Paulus K, Merkel PA: Clinical research for rare disease: opportunities, challenges, and solutions. Mol Genet Metab. 2009, 96: 20-26. 10.1016/j.ymgme.2008.10.003.PubMed CentralPubMedView ArticleGoogle Scholar
- Kruer MC, Steiner RD: The role of evidence-based medicine and clinical trials in rare genetic disorders. Clin Genet. 2008, 74: 197-207. 10.1111/j.1399-0004.2008.01041.x.PubMedView ArticleGoogle Scholar
- Undertaking systematic reviews of research on effectiveness. http://www.york.ac.uk/inst/crd/pdf/Systematic_Reviews.pdf,
- Jadad AR, Moore RA, Carroll D, Jenkinson C, Reynolds DJ, Gavaghan DJ, McQuay HJ: Assessing the quality of reports of randomized clinical trials: is blinding necessary?. Control Clin Trials. 1996, 17: 1-12. 10.1016/0197-2456(95)00134-4.PubMedView ArticleGoogle Scholar
- Deeks JJ, Dinnes J, D'Amico R, Sowden AJ, Sakarovitch C, Song F, Petticrew M, Altman DG: Evaluating non-randomised intervention studies. Health Technol Assess. 2003, 7: iii-173.View ArticleGoogle Scholar
- Dechartres A, Charles P, Hopewell S, Ravaud P, Altman DG: Reviews assessing the quality or the reporting of randomized controlled trials are increasing over time but raised questions about how quality is assessed. J Clin Epidemiol. 2011, 64: 136-144. 10.1016/j.jclinepi.2010.04.015.PubMedView ArticleGoogle Scholar
- Olivo SA, Macedo LG, Gadotti IC, Fuentes J, Stanton T, Magee DJ: Scales to assess the quality of randomized controlled trials: a systematic review. Phys Ther. 2008, 88: 156-175. 10.2522/ptj.20070147.PubMedView ArticleGoogle Scholar
- EUdraCT. http://eudract.ema.europa.eu/,
- ClinicalTrials.gov. clinicaltrials.gov/
- Heron L, Laurenson S, Costello J, Anderl C, Knospe J: Gaining reimbursement of orphan products in Europe: challenges Due to wide variations in evidence requirements and processes. Value in Health. 2012, 15: A309.View ArticleGoogle Scholar
- Jeffery M, White R: Orphan drug pricing and access - current situation and future trends in Eu. Value in Health. 2012, 15: A309.View ArticleGoogle Scholar
- Food and Drug Administration: Guidance for Industry: Clinical Trial Endpoints for the Approval of Cancer Drugs and Biologics. http://www.fda.gov/downloads/Drugs/GuidanceComplianceRegulatoryInformation/Guidances/ucm071590.pdf,
- European Medicines Agency: Guideline on the evaluation of anticancer medicinal products in man. http://www.ema.europa.eu/docs/en_GB/document_library/Scientific_guideline/2013/01/WC500137128.pdf,
- Begg C, Cho M, Eastwood S, Horton R, Moher D, Olkin I, Pitkin R, Rennie D, Schulz KF, Simel D, Stroup DF: Improving the quality of reporting of randomized controlled trials - The CONSORT statement. JAMA. 1996, 276: 637-639. 10.1001/jama.1996.03540080059030.PubMedView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.