Assessing the diagnostic utility of the Gaucher Earlier Diagnosis Consensus (GED-C) scoring system using real-world data

Revel-Vilk, Shoshana; Shalev, Varda; Gill, Aidan; Paltiel, Ora; Manor, Orly; Tenenbaum, Avraham; Azani, Liat; Chodick, Gabriel

doi:10.1186/s13023-024-03042-y

Research
Open access
Published: 16 February 2024

Assessing the diagnostic utility of the Gaucher Earlier Diagnosis Consensus (GED-C) scoring system using real-world data

Shoshana Revel-Vilk ORCID: orcid.org/0000-0001-9151-0337^1,2,3,
Varda Shalev⁴,
Aidan Gill⁵,
Ora Paltiel^2,3,6,
Orly Manor³,
Avraham Tenenbaum⁴,
Liat Azani⁷ &
…
Gabriel Chodick^4,7

Orphanet Journal of Rare Diseases volume 19, Article number: 71 (2024) Cite this article

749 Accesses
1 Altmetric
Metrics details

Abstract

Background

Gaucher disease (GD) is a rare autosomal recessive condition associated with clinical features such as splenomegaly, hepatomegaly, anemia, thrombocytopenia, and bone abnormalities. Three clinical forms of GD have been defined based on the absence (type 1, GD1) or presence (types 2 and 3) of neurological signs. Early diagnosis can reduce the likelihood of severe, often irreversible complications. The aim of this study was to validate the ability of factors from the Gaucher Earlier Diagnosis Consensus (GED-C) scoring system to discriminate between patients with GD1 and controls using real-world data from electronic patient medical records from Maccabi Healthcare Services, Israel’s second-largest state-mandated healthcare provider.

Methods

We applied the GED-C scoring system to 265 confirmed cases of GD and 3445 non-GD controls matched for year of birth, sex, and socioeconomic status identified from 1998 to 2022. The analyses were based on two databases: (1) all available data and (2) all data except free-text notes. Features from the GED-C scoring system applicable to GD1 were extracted for each individual. Patients and controls were compared for the proportion of the specific features and overall GED-C scores. Decision tree and random forest models were trained to identify the main features distinguishing GD from non-GD controls.

Results

The GED-C scoring distinguished individuals with GD from controls using both databases. Decision tree models for the databases showed good accuracy (0.96 [95% CI 0.95–0.97] for Database 1; 0.95 [95% CI 0.94–0.96] for Database 2), high specificity (0.99 [95% CI 0.99–1]) for Database 1; 1.0 [95% CI 0.99–1] for Database 2), but relatively low sensitivity (0.53 [95% CI 0.46–0.59] for Database 1; 0.32 [95% CI 0.25–0.38]) for Database 2). The clinical features of splenomegaly, thrombocytopenia (< 50 × 10⁹/L), and hyperferritinemia (300–1000 ng/mL) were found to be the three most accurate classifiers of GD in both databases.

Conclusion

In this analysis of real-world patient data, certain individual features of the GED-C score discriminate more successfully between patients with GD and controls than the overall score. An enhanced diagnostic model may lead to earlier, reliable diagnoses of Gaucher disease, aiming to minimize the severe complications associated with this disease.

Background

Gaucher disease (GD) is a rare autosomal recessive condition characterized by a deficiency of the lysosomal enzyme beta-glucocerebrosidase (GBA1). Accumulation of glucosylceramide in macrophages throughout the body leads to the onset of multisystemic disease manifestations such as splenomegaly, hepatomegaly, anemia, thrombocytopenia, and bone abnormalities, hallmarks of type 1 GD [1]; neurological involvement is characteristic of the more severe type 2 and type 3 GD [2, 3]. The estimated prevalence of all three GD types is 0.45–25.0 per 100,000 live births, although type 1 GD is substantially more common among people with Ashkenazi Jewish heritage (estimated at 1 in 850 live births for type 1 GD) [4, 5].

Timely initiation of appropriate GD-specific therapy (i.e., enzyme replacement therapy or substrate reduction therapy) early in the disease course [6,7,8] has been shown to improve patient outcomes, with significant effects on hematologic and visceral outcomes, and may prevent the onset of irreversible bone disease and severe growth retardation, and reduce the risk of bleeding [8]. However, delayed diagnosis or misdiagnosis is frequent, owing to the complex and non-specific clinical presentation, together with a lack of awareness about this rare disease [1, 9,10,11]. Approximately one in six patients report remaining undiagnosed for 7 years or more after first consulting a doctor with symptoms [9]. Physicians and patients both report multiple referrals to a range of different specialties prior to GD diagnosis, with primary care, hematology/hematology-oncology, and pediatrics being the main specialties to which patients first present symptoms [9].

The Gaucher Earlier Diagnosis Consensus (GED-C) scoring system was developed by a panel of 22 expert physicians using Delphi methodology regarding the signs and covariables considered important for diagnosing type 1 and type 3 GD, to help clinicians identify potential individuals to test further, thereby reducing diagnostic delay [12]. Preliminary validation of the GED-C was able to discriminate between patients with GD and those with overlapping manifestations from other disorders, in studies from the United Kingdom [13] and Finland [14]. We previously carried out a description of the GED-C scoring system in 265 confirmed patients with GD using real-world data from the Maccabi Healthcare Services (MHS), Israel’s second-largest state-mandated healthcare provider, representing 2.5 million members (25% of the Israeli population) [15]. The aim of the current study was to assess the ability of the GED-C score to discriminate between individuals with and without GD, and to identify the best discriminatory features using real-world data from electronic patient medical records from the MHS, with and without the use of free-text notes.

Methods

Data source and study design

Electronic patient medical records from the MHS were used as the data source for this study. In the MHS, clinical records have been fully computerized for > 20 years, and are fully integrated with an automated central laboratory, fully digitized imaging, and pharmacy purchase data. In addition, the MHS is associated with a biobank of samples collected from consenting sample donors among MHS participants. The study design was approved by the MHS Institutional Review Board (0013-21-ASMC). The requirement for patient consent was waived owing to the use of de-identified, anonymized data.

Population

All eligible individuals with a confirmed diagnosis of GD in the MHS database were included in the study, as described previously [15]. The records of patients identified with an MHS diagnosis code for GD were screened for evidence of GD-specific treatment authorization such as enzyme replacement therapy or substrate reduction therapy. In the absence of evidence of GD treatment, patient records were screened for any medical notes indicating GD (e.g., physicians’ notes and hospital discharges). Thirteen randomly selected controls per case, matched for year of birth, sex, and socioeconomic status (per MHS data), were extracted from the MHS database for retrospective analysis.

Data extraction

Data for each of the GED-C items applicable to type 1 GD were extracted for each individual, including demographics, diagnosis codes (International Classification of Diseases [ICD], Ninth Revision [ICD-9]), laboratory values, imaging reports, the MHS osteoporosis register, and weight and height measurements. Free-text notes were screened manually for the terms presented in Table 1 for all patients with GD and non-GD controls. For the non-GD controls, free-text notes were screened manually in 1 control from the 13 controls available for each patient with GD (chosen randomly). We assumed that the proportion of the terms in the 12 matched controls was similar to the control that was checked manually. Data were extracted from the first record available until 1 year after GD diagnosis. For controls, data were extracted up to 1 year after the GD diagnosis of the matched patient. The 1-year post-diagnosis cutoff was chosen to allow the maximum time for the capture of features on completion of relevant confirmatory testing for GD. Items for GED-C scores were extracted as quantitative values where possible. GED-C scores were calculated for these features, as indicated in Table 2 for all available data, including free text-notes from patient visits (database 1) and Table 3, data from free-text notes from patient visits were not included (database 2).

Table 1 GED-C score parameters extracted from free-text notes from patient visits

Full size table

Table 2 Characteristics used to calculate GED-C scores in patients with GD and control patients: database 1

Full size table

Table 3 Characteristics used to calculate GED-C scores in patients with GD and control patients: database 2

Full size table

Assessing the GED-C scoring system as a discrimination tool

Individuals with GD were compared with controls using features of the GED-C scoring system applicable to patients with type 1 GD [12, 15], and total points were determined for each group. Computation of the GED-C score was carried out as described previously [15]. Briefly, the weight of each feature was set according to the published score [12]. For laboratory data, the maximum or minimum levels (as appropriate) were considered and defined abnormal as indicated in Tables 2 and 3. Dichotomous variables were coded as “yes” or “no.” Evaluation of multiples of normal in spleen and liver size was not feasible; “splenomegaly” received a score of 3 irrespective of size, and “hepatomegaly” was scored as 1.5 points. Growth retardation (based on height, weight, and body mass index measures) was defined as equal to or less than the fifth percentile for age. Asthenia and fatigue were scored by one item because both descriptions use the same word in Hebrew. No ICD-9 code was available for family history of GD.

For constructing discrimination models, 16 available features were used as defined in the GED-C scoring system [12]. These included age, anemia, bleeding, bone issues, dyslipidemia, fatigue, gallstones, gammopathy, growth retardation, hepatomegaly, hyperferritinemia, Jewish ancestry, low bone mineral density, leukopenia, splenomegaly, and thrombocytopenia.

Two models were constructed. In the first, all available data were used, including free-text notes from patient visits (database 1). For the second, data from free-text notes from patient visits were not included (database 2).

Statistical analyses

GED-C summary scores were reported using the median and interquartile range (IQR). Absolute and relative frequencies were reported for nominal data, with the relationship between GD and controls expressed as odds ratio (OR) and 95% CI. A decision tree model using all 16 available features (as detailed above) was constructed using “training data” (80%) and “test data” (20%), and a random forest model was selected and trained in order to identify the main features that can distinguish GD from non-GD controls. The model performance on the validation dataset was evaluated using receiver operating characteristic (ROC) curve and f1-score as a measure of accuracy. The area under the ROC curve (AUC) results are considered excellent for AUC values between 0.9 and 1.0, good for AUC values between 0.8 and 0.9, fair for AUC values between 0.7 and 0.8, poor for AUC values between 0.6 and 0.7, and failed for AUC values between 0.5 and 0.6 [16]. All statistical analyses were performed using R programming version 1.4.1103, packages dplyr, stringr, tidiverse, lubridate, caret, random forest, and ggplot2.

Results

Of 346 individuals with a GD diagnosis code identified from the MHS database, 265 were confirmed as patients with GD following the screening of patient records. A total of 3445 control individuals without GD (13 per GD case) were included in the study.

Manual screening of free-text notes for terms defined in the GED-C scoring system revealed a marked difference in the proportions of individuals with symptoms between GD and control groups, with ORs of approximately 10 or above for splenomegaly, growth retardation, and hepatomegaly (Table 1).

Overall GED-C summary scores were calculated for each database (Fig. 1A, B). For database 1, which included free-text notes from patient visits, median GED-C scores were 7.5 (range 0–17.5; IQR 6.5) for patients with GD (n = 265) and 4.0 (range 0–12.5; IQR 3.5) for controls (n = 3445) (Fig. 1A). GED-C scoring distinguished patients with GD from controls with an AUC of 0.81 (95% CI, 0.78–0.84) (Fig. 2A). For database 2, in which data from free-text notes from patient visits were not included, median GED-C scores were 6.5 (range 0–17.0; IQR 5.0) for patients with GD (n = 265) and 4.0 (range 0–12.5; IQR 3.0) for controls (n = 3445) (Fig. 1B). GED-C scoring distinguished patients with GD from controls with an AUC of 0.74 (95% CI 0.71–0.78) (Fig. 2B).

The magnitude of difference in GED-C scoring between patients with GD and controls was greater for some features (e.g., splenomegaly [OR 80.3 in database 1 and 61.9 in database 2], severe thrombocytopenia [platelet count < 50 × 10⁹/L; OR 57.9 in both databases], and hyperferritinemia [ferritin 300–1000 ng/mL; OR 28.9 in both databases]) (Tables 2 and 3).

Decision tree models were constructed for each database (Fig. 3). The models showed good accuracy (0.96 [95% CI 0.95–0.97] for database 1; 0.95 [95% CI 0.94–0.96] for database 2), high specificity (0.99 [95% CI 0.99–1] for database 1; 1.0 [95% CI 0.99–1] for database 2), but relatively low sensitivity (0.53 [95% CI 0.46–0.59] for database 1; 0.32 [95% CI 0.25–0.38] for database 2). The AUC was higher for database 1 (0.79 [95% CI 0.73–0.87]) compared with database 2 (0.69 [95% CI 0.60–0.73]). The f1-scores were 0.66 for database 1 and 0.46 for database 2.

A random forest model was developed using the two databases to outline the 10 most important variables for distinguishing patients with GD from controls (Fig. 4). The clinical features of splenomegaly, thrombocytopenia (platelet count < 50 × 10⁹/L), and hyperferritinemia (ferritin 300–1000 ng/mL) were found to be the top three most accurate classifiers of GD in both databases.

Discussion

This retrospective observational analysis used real-world data to assess the utility of the GED-C scoring system for discriminating between patients with GD and controls. We found that the GED-C scoring system performance at distinguishing between GD and controls was ‘good’ when using a database that included free-text notes from patient visits and ‘fair’ when using a database that excluded free-text notes. Analysis of both databases consistently identified the most important GED-C features for discriminating GD from non-GD as splenomegaly, thrombocytopenia, and hyperferritinemia.

The three features, splenomegaly, thrombocytopenia and hyperferritinemia were also considered by GD experts to be important features in the diagnosis of GD [12]. There are still some differences that need to be discussed. First, the weight given by experts for these three features was significantly lower than the actual odds of exposure found among patients with GD compared with controls in this real-world dataset. Second, GD experts gave a greater weight (2 points) for mild thrombocytopenia (platelet count 50–150 × 10⁹/L) compared with severe thrombocytopenia (platelet count < 50 × 10⁹/L; 1 point), whereas the actual odds of exposure among patients with GD from this study was acutely higher for severe thrombocytopenia (with a wide CI owing to a lower number). For hyperferritinemia, GD experts gave a lower weight for ferritin levels > 1000 ng/mL (1 point), whereas the odds of exposure among patients in this study was approximately 30 for elevated ferritin levels, regardless of the exact levels. Based on our findings, we can expect that a supervised machine learning classifier, that will not use predefined weights or laboratory cutoffs, could perform better in distinguishing between patients with GD and controls.

Although the models developed in this study show high specificity, they were shown to have low sensitivity. Missing or incomplete data for many features may have been a contributing factor. The definitions used in the GED-scoring may further impact the sensitivity of models to discriminate between patients with GD and controls. For example, the definition of anemia using a threshold of 14 g/L for hemoglobin concentration is relevant to adult males, but this may result in the misclassification of normal hemoglobin levels in patients whose hemoglobin would otherwise fall into the anemic range as specified by their age and gender. Modifying the scoring system may help correctly diagnose more patients with GD at an earlier time in the disease course.

As expected, the database that included data from free-text notes (database 1) showed better performance compared with database 2 because physicians frequently documented GD-related features (such as splenomegaly, hepatomegaly, fatigue, and bone pain) in the visit notes but then failed to code them. This is in line with a study of electronic records for splenomegaly in the Danish National Registry of Patients; the total number of patients coded for splenomegaly was lower than expected, leading the authors to conclude that splenomegaly as a clinical finding was probably under-coded in the ICD system [17]. Similarly, machine learning algorithms using ICD-9 codes did not perform well in identifying patients with systemic sclerosis; the highest-performing algorithms in this study incorporated clinical data with ICD, Tenth Revision (ICD-10), codes [18]. Unfortunately, ICD-10 codes were not available in the MHS database. Under-coding could be related to the difficulty in translating signs and symptoms into a clear and unambiguous classification code, coupled with time pressures on physicians [19].

The ability of the GED-C to discriminate between individuals with and without GD was also shown by other groups. Mehta et al. compared adults with GD (n = 25) to adults with liver disease, hematological malignancy or immune thrombocytopenia (n = 75) [13]. The data were derived from hospital records, with 80%-90% completeness of data. Analyses, based on 11 possible factors, showed good discrimination between those with GD and non-GD individuals, with AUC of 0.88 (95% CI 0.78–0.97) being comparable to the result from database A in our study. Patients with liver disease and hematological malignancy were most likely to have manifestations overlapping with GD [13]. In a study from Finland, clinical data from five patients with GD were compared to electronic health record data of ~ 170,000 adults from a biobank [14]. The score of patients with GD ranged between 6 and 18.5 (based on the available data on 28–29 possible factors). Only 0.72% of adults from the biobank were assigned at least 6 points, but none had a point score as high as 18.5. A follow-up study using the same approach with another Finnish biobank, showed similar results [20]. The score of 8 patients with GD was 6–22.5 points, while only 0.77% of controls had 10 points or higher. Data from patients with GD collected from hospital electronic medical records being compared with control data collected from a biobank, may explain the significant difference found in the studies from Finland.

The use of real-world data is both an advantage and a limitation of the study. The use of real-world data confirmed the significance of the majority of features in the GED-C scoring system. These features would need to be included in future machine-learning models. However, real-world data sets have their limitations. They may contain incomplete information or be prone to errors during data entry. In addition, because GD is a rare disease, the number of individuals with GD, even within this relatively large database of individuals, is restricted and there are inherent limitations to the development of an algorithm using such a small data set of patients. In our research, we employed a manual examination of unstructured text notes to identify features related to GD. The reason for resorting to manual screening was that the existing Hebrew resources for training natural language processing (NLP) are inadequate to accomplish this task using computer-based methods [21]. Manual screening was feasible when limited to 265 patients and 265 controls but not for the entire control cohort. Similarly, machine learning diagnostic algorithms will not be able to be based on manual screening of free-text notes. In addition, our study analyzed data from patients retrospectively, a prospective design would have been preferable in terms of standardization of data collection and reducing potential sources of bias.

The current study was set up to extract patient data on parameters from the GED-C score, chosen as a set of readily available clinical data points established by expert clinician consensus to have diagnostic relevance for patients with GD, and also validated in different geographic patient cohorts [13, 14]. However, we acknowledge that findings on the diagnostic utility of this set of variables are limited to the patient population studied and may not be applicable for populations with different ethnic backgrounds and disease severity. Other clinical manifestations of GD not included in the GED-C scoring system should be considered in future algorithms developed for GD diagnosis, for example co-morbidities that may be associated with GD such as neoplasms, endocrinological disorders like insulin-like growth hormone deficiency, as well as abnormalities in parathyroid hormone levels, phospho-calcium metabolism, and vitamin D levels [22,23,24,25].

In the era of digitized medical records, the opportunity for comparing and combining electronic health data from a wide variety of patient populations is likely to result in better refinement of data-mining tools. As seen with other diseases, use of machine learning for analysis of large amounts of clinical data will assist the development of algorithms for detecting undiagnosed patients with GD, as well as tools optimised for use in particular patient sub-sets [26, 27]. This, accompanied by large-scale testing of biobank samples using alternative diagnostic techniques, is needed to address the challenges of GD diagnosis in the future.

This study forms part of the basis for the development of an artificial intelligence–based algorithm for the accurate diagnosis of GD using machine learning, designed to shorten the diagnostic journey of patients with GD (Fig. 5). Machine learning technologies are being developed for a number of disorders that can potentially accelerate accurate diagnosis by calculating disease probabilities based on symptoms [28, 29]. Integration of machine learning models employing quantifiable variables that are readily discernible and autonomous of clinical evaluations to facilitate screening for rare diseases, such as GD, are needed to improve early detection and treatment [30]. In the next phase of the research process, the models evaluated in this study are to be further refined, by incorporating additional parameters and using supervised machine-learning methods. The best-performing model would then be applied to the MHS healthcare database population to identify those who may have unidentified GD. These data would then be used for further refinement of the machine learning classifiers for GD diagnosis.

Conclusion

The GED-C score, developed by Delphi expert consensus, shows good discrimination between patients with GD and controls and could be the basis for future models. In our study, as expected, the database that included data from physician notes on frequently documented GD-related features showed the best performance at distinguishing GD patients from controls. The application of machine learning techniques to our cohort is expected to result in an improved diagnostic model with the best possible sensitivity that may be used for screening undiagnosed GD cases.

Availability of data and materials

The data that support the findings of this study are available from the authors but restrictions apply to the availability of these data, which were used under license from the Maccabi Health Services (Israel) for the current study, and so are not publicly available. Data are, however, available from the authors upon reasonable request and with permission from MaccabiTech, Maccabi Health Services, research center.

Abbreviations

AUC:: Area under the receiver operating characteristic curve
GD:: Gaucher disease
GED-C:: Gaucher earlier diagnosis consensus
ICD-9/10:: International classification of diseases, ninth/tenth revision
IQR:: Interquartile range
MHS:: Maccabi Healthcare Services
OR:: Odds ratio
ROC:: Receiver operating characteristic

References

Stirnemann J, et al. A review of Gaucher disease pathophysiology, clinical presentation and treatments. Int J Mol Sci. 2017;18:441.
Article PubMed PubMed Central Google Scholar
Schwartz IVD, et al. Characteristics of 26 patients with type 3 Gaucher disease: a descriptive analysis from the Gaucher outcome survey. Mol Genet Metab Rep. 2018;14:73–9.
Article CAS PubMed Google Scholar
El-Beshlawy A, et al. Long-term hematological, visceral, and growth outcomes in children with Gaucher disease type 3 treated with imiglucerase in the international collaborative Gaucher group Gaucher registry. Mol Genet Metab. 2017;120:47–56.
Article CAS PubMed Google Scholar
Castillon G, Chang SC, Moride Y. Global incidence and prevalence of Gaucher disease: a targeted literature review. J Clin Med. 2022;12:85.
Article PubMed PubMed Central Google Scholar
Revel-Vilk S, Szer J, Zimran A. Gaucher disease and related lysosomal storage diseases. In: Williams Hematology. New York: McGraw-Hill Education; 2021. p. 1189–202.
Google Scholar
Gonzalez DE, et al. Enzyme replacement therapy with velaglucerase alfa in Gaucher disease: results from a randomized, double-blind, multinational, Phase 3 study. Am J Hematol. 2013;88:166–71.
Article CAS PubMed Google Scholar
Hughes DA, et al. Velaglucerase alfa (VPRIV) enzyme replacement therapy in patients with Gaucher disease: long-term data from phase III clinical trials. Am J Hematol. 2015;90:584–91.
Article CAS PubMed PubMed Central Google Scholar
Mistry PK, et al. Timing of initiation of enzyme replacement therapy after diagnosis of type 1 Gaucher disease: effect on incidence of avascular necrosis. Br J Haematol. 2009;147:561–70.
Article CAS PubMed PubMed Central Google Scholar
Mehta A, et al. Exploring the patient journey to diagnosis of Gaucher disease from the perspective of 212 patients with Gaucher disease and 16 Gaucher expert physicians. Mol Genet Metab. 2017;122:122–9.
Article CAS PubMed Google Scholar
Mistry PK, et al. A reappraisal of Gaucher disease-diagnosis and disease management algorithms. Am J Hematol. 2011;86:110–5.
Article PubMed PubMed Central Google Scholar
Mistry PK, Sadan S, Yang R, Yee J, Yang M. Consequences of diagnostic delays in type 1 Gaucher disease: the need for greater awareness among hematologists-oncologists and an opportunity for early diagnosis and intervention. Am J Hematol. 2007;82:697–701.
Article PubMed Google Scholar
Mehta A, et al. Presenting signs and patient co-variables in Gaucher disease: outcome of the Gaucher earlier diagnosis consensus (GED-C) Delphi initiative. Intern Med J. 2019;49:578–91.
Article PubMed PubMed Central Google Scholar
Mehta A, et al. Scoring system to facilitate diagnosis of Gaucher disease. Intern Med J. 2020;50:1538–46.
Article CAS PubMed PubMed Central Google Scholar
Savolainen MJ, et al. The Gaucher earlier diagnosis consensus point-scoring system (GED-C PSS): evaluation of a prototype in Finnish Gaucher disease patients and feasibility of screening retrospective electronic health record data for the recognition of potential undiagnosed patients in Finland. Mol Genet Metab Rep. 2021;27:100725.
Article CAS PubMed PubMed Central Google Scholar
Revel-Vilk S, et al. Using the Gaucher earlier diagnosis consensus (GED-C) delphi score in a real-world dataset. Int J Transl Med. 2022;2:506–14.
Google Scholar
Nahm FS. Receiver operating characteristic curve: overview and practical use for clinicians. Korean J Anesthesiol. 2022;75:25–36.
Article PubMed PubMed Central Google Scholar
Curovic RE, et al. Splenomegaly - diagnostic validity, work-up, and underlying causes. PLoS ONE. 2017;12:e0186674.
Article Google Scholar
Jamian L, Wheless L, Crofford LJ, Barnado A. Rule-based and machine learning algorithms identify patients with systemic sclerosis accurately in the electronic health record. Arthritis Res Ther. 2019;21:305.
Article PubMed PubMed Central Google Scholar
Tang KL, Lucyk K, Quan H. Coder perspectives on physician-related barriers to producing high-quality administrative data: a qualitative study. CMAJ Open. 2017;5:E617-622.
Article PubMed PubMed Central Google Scholar
Pehrsson M, et al. Screening for potential undiagnosed Gaucher disease patients: utilisation of the Gaucher earlier diagnosis consensus point-scoring system (GED-C PSS) in conjunction with electronic health record data, tissue specimens, and small nucleotide polymorphism (SNP) genotype data available in Finnish biobanks. Mol Genet Metab Rep. 2022;33:100911.
Article CAS PubMed PubMed Central Google Scholar
Névéol A, Dalianis H, Velupillai S, Savova G, Zweigenbaum P. Clinical Natural Language Processing in languages other than English: opportunities and challenges. J Biomed Semant. 2018;9:12.
Article Google Scholar
Hughes D, et al. Gaucher disease in bone: from pathophysiology to practice. J Bone Miner Res. 2019;34:996–1013.
Article PubMed Google Scholar
Mikosch P, et al. Patients with Gaucher disease living in England show a high prevalence of vitamin D insufficiency with correlation to osteodensitometry. Mol Genet Metab. 2009;96:113–20.
Article CAS PubMed Google Scholar
Rite S, et al. Insulin-like growth factors in childhood-onset Gaucher disease. Pediatr Res. 2002;52:109–12.
Article CAS PubMed Google Scholar
Mistry PK, Taddei T, vom Dahl S, Rosenbloom BE. Gaucher disease and malignancy: a model for cancer pathogenesis in an inborn error of metabolism. Crit Rev Oncog. 2013;18:235–46.
Article PubMed PubMed Central Google Scholar
Knevel R, Liao KP. From real-world electronic health record data to real-world results using artificial intelligence. Ann Rheum Dis. 2023;82:306–11.
Article PubMed Google Scholar
Riskin D, et al. Using artificial intelligence to identify patients with migraine and associated symptoms and conditions within electronic health records. BMC Med Inform Decis Mak. 2023;23:121.
Article PubMed PubMed Central Google Scholar
Ronicke S, et al. Can a decision support system accelerate rare disease diagnosis? Evaluating the potential impact of Ada DX in a retrospective study. Orphanet J Rare Dis. 2019;14:69.
Article PubMed PubMed Central Google Scholar
Gurovich Y, et al. Identifying facial phenotypes of genetic disorders using deep learning. Nat Med. 2019;25:60–4.
Article CAS PubMed Google Scholar
Wilson A, et al. Development of a rare disease algorithm to identify persons at risk of Gaucher disease using electronic health records in the United States. Orphanet J Rare Dis. 2023;18:280.
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

Under the direction of the authors Isobel Lever, PhD, employee of Excel Medical Affairs, provided writing assistance for this publication. Editorial assistance in formatting, proofreading, copy editing, and fact-checking was also provided by Excel Scientific Solutions, Inc.

Funding

This research was funded by Takeda Pharmaceuticals International AG. Under the direction of the authors, Excel Scientific Solutions, Inc. provided writing assistance for this publication. Takeda Development Center Americas, Inc., provided funding to Excel Scientific Solutions, Inc. for support in writing and editing this manuscript under the direction of the authors.

Author information

Authors and Affiliations

Gaucher Unit, Shaare Zedek Medical Center, Jerusalem, Israel
Shoshana Revel-Vilk
Faculty of Medicine, Hebrew University, Jerusalem, Israel
Shoshana Revel-Vilk & Ora Paltiel
Braun School of Public Health and Community Medicine, Hebrew University, Jerusalem, Israel
Shoshana Revel-Vilk, Ora Paltiel & Orly Manor
Sackler School of Medicine, Tel Aviv University, Tel Aviv, Israel
Varda Shalev, Avraham Tenenbaum & Gabriel Chodick
Takeda Pharmaceuticals International AG, Zurich, Switzerland
Aidan Gill
Department of Hematology , Hadassah Medical Organization, Jerusalem, Israel
Ora Paltiel
MaccabiTech, Maccabi Healthcare Services, Tel Aviv, Israel
Liat Azani & Gabriel Chodick

Authors

Shoshana Revel-Vilk
View author publications
You can also search for this author in PubMed Google Scholar
Varda Shalev
View author publications
You can also search for this author in PubMed Google Scholar
Aidan Gill
View author publications
You can also search for this author in PubMed Google Scholar
Ora Paltiel
View author publications
You can also search for this author in PubMed Google Scholar
Orly Manor
View author publications
You can also search for this author in PubMed Google Scholar
Avraham Tenenbaum
View author publications
You can also search for this author in PubMed Google Scholar
Liat Azani
View author publications
You can also search for this author in PubMed Google Scholar
Gabriel Chodick
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

SRV and GC: conceptualization; SRV, GC, VS, OP, and OM: methodology; LA: data curation; SRV, GC, VS, AG, OP, OM, AT, and GC: writing—review and editing; AG: funding acquisition. All authors have read and agreed to the final version of the manuscript.

Corresponding author

Correspondence to Shoshana Revel-Vilk.

Ethics declarations

Ethics approval and consent to participate

The study design was approved by MHS Institutional Review Board (0013–21-ASMC). Patient consent was waived owing to the use of de-identified, anonymized data.

Consent for publication

Patient consent was waived owing to the use of de-identified, anonymized data.

Competing interests

SRV has received research grants, speaker fees, and travel support from Pfizer, Sanofi Genzyme, and Takeda; VS is the chief marketing officer of Alike and has received speaker fees from Novartis and Roche; AG is a former employee and a stockholder of Takeda; LA and GC are employees of MaccabiTech; and OP, OM, and AT declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Aidan Gill: Affiliated in Takeda Pharmaceuticals International AG, Zurich, Switzerland at the time of the study

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Revel-Vilk, S., Shalev, V., Gill, A. et al. Assessing the diagnostic utility of the Gaucher Earlier Diagnosis Consensus (GED-C) scoring system using real-world data. Orphanet J Rare Dis 19, 71 (2024). https://doi.org/10.1186/s13023-024-03042-y

Download citation

Received: 28 July 2023
Accepted: 19 January 2024
Published: 16 February 2024
DOI: https://doi.org/10.1186/s13023-024-03042-y

Assessing the diagnostic utility of the Gaucher Earlier Diagnosis Consensus (GED-C) scoring system using real-world data