Skip to main content

Assessing the diagnostic utility of the Gaucher Earlier Diagnosis Consensus (GED-C) scoring system using real-world data

Abstract

Background

Gaucher disease (GD) is a rare autosomal recessive condition associated with clinical features such as splenomegaly, hepatomegaly, anemia, thrombocytopenia, and bone abnormalities. Three clinical forms of GD have been defined based on the absence (type 1, GD1) or presence (types 2 and 3) of neurological signs. Early diagnosis can reduce the likelihood of severe, often irreversible complications. The aim of this study was to validate the ability of factors from the Gaucher Earlier Diagnosis Consensus (GED-C) scoring system to discriminate between patients with GD1 and controls using real-world data from electronic patient medical records from Maccabi Healthcare Services, Israel’s second-largest state-mandated healthcare provider.

Methods

We applied the GED-C scoring system to 265 confirmed cases of GD and 3445 non-GD controls matched for year of birth, sex, and socioeconomic status identified from 1998 to 2022. The analyses were based on two databases: (1) all available data and (2) all data except free-text notes. Features from the GED-C scoring system applicable to GD1 were extracted for each individual. Patients and controls were compared for the proportion of the specific features and overall GED-C scores. Decision tree and random forest models were trained to identify the main features distinguishing GD from non-GD controls.

Results

The GED-C scoring distinguished individuals with GD from controls using both databases. Decision tree models for the databases showed good accuracy (0.96 [95% CI 0.95–0.97] for Database 1; 0.95 [95% CI 0.94–0.96] for Database 2), high specificity (0.99 [95% CI 0.99–1]) for Database 1; 1.0 [95% CI 0.99–1] for Database 2), but relatively low sensitivity (0.53 [95% CI 0.46–0.59] for Database 1; 0.32 [95% CI 0.25–0.38]) for Database 2). The clinical features of splenomegaly, thrombocytopenia (< 50 × 109/L), and hyperferritinemia (300–1000 ng/mL) were found to be the three most accurate classifiers of GD in both databases.

Conclusion

In this analysis of real-world patient data, certain individual features of the GED-C score discriminate more successfully between patients with GD and controls than the overall score. An enhanced diagnostic model may lead to earlier, reliable diagnoses of Gaucher disease, aiming to minimize the severe complications associated with this disease.

Background

Gaucher disease (GD) is a rare autosomal recessive condition characterized by a deficiency of the lysosomal enzyme beta-glucocerebrosidase (GBA1). Accumulation of glucosylceramide in macrophages throughout the body leads to the onset of multisystemic disease manifestations such as splenomegaly, hepatomegaly, anemia, thrombocytopenia, and bone abnormalities, hallmarks of type 1 GD [1]; neurological involvement is characteristic of the more severe type 2 and type 3 GD [2, 3]. The estimated prevalence of all three GD types is 0.45–25.0 per 100,000 live births, although type 1 GD is substantially more common among people with Ashkenazi Jewish heritage (estimated at 1 in 850 live births for type 1 GD) [4, 5].

Timely initiation of appropriate GD-specific therapy (i.e., enzyme replacement therapy or substrate reduction therapy) early in the disease course [6,7,8] has been shown to improve patient outcomes, with significant effects on hematologic and visceral outcomes, and may prevent the onset of irreversible bone disease and severe growth retardation, and reduce the risk of bleeding [8]. However, delayed diagnosis or misdiagnosis is frequent, owing to the complex and non-specific clinical presentation, together with a lack of awareness about this rare disease [1, 9,10,11]. Approximately one in six patients report remaining undiagnosed for 7 years or more after first consulting a doctor with symptoms [9]. Physicians and patients both report multiple referrals to a range of different specialties prior to GD diagnosis, with primary care, hematology/hematology-oncology, and pediatrics being the main specialties to which patients first present symptoms [9].

The Gaucher Earlier Diagnosis Consensus (GED-C) scoring system was developed by a panel of 22 expert physicians using Delphi methodology regarding the signs and covariables considered important for diagnosing type 1 and type 3 GD, to help clinicians identify potential individuals to test further, thereby reducing diagnostic delay [12]. Preliminary validation of the GED-C was able to discriminate between patients with GD and those with overlapping manifestations from other disorders, in studies from the United Kingdom [13] and Finland [14]. We previously carried out a description of the GED-C scoring system in 265 confirmed patients with GD using real-world data from the Maccabi Healthcare Services (MHS), Israel’s second-largest state-mandated healthcare provider, representing 2.5 million members (25% of the Israeli population) [15]. The aim of the current study was to assess the ability of the GED-C score to discriminate between individuals with and without GD, and to identify the best discriminatory features using real-world data from electronic patient medical records from the MHS, with and without the use of free-text notes.

Methods

Data source and study design

Electronic patient medical records from the MHS were used as the data source for this study. In the MHS, clinical records have been fully computerized for > 20 years, and are fully integrated with an automated central laboratory, fully digitized imaging, and pharmacy purchase data. In addition, the MHS is associated with a biobank of samples collected from consenting sample donors among MHS participants. The study design was approved by the MHS Institutional Review Board (0013-21-ASMC). The requirement for patient consent was waived owing to the use of de-identified, anonymized data.

Population

All eligible individuals with a confirmed diagnosis of GD in the MHS database were included in the study, as described previously [15]. The records of patients identified with an MHS diagnosis code for GD were screened for evidence of GD-specific treatment authorization such as enzyme replacement therapy or substrate reduction therapy. In the absence of evidence of GD treatment, patient records were screened for any medical notes indicating GD (e.g., physicians’ notes and hospital discharges). Thirteen randomly selected controls per case, matched for year of birth, sex, and socioeconomic status (per MHS data), were extracted from the MHS database for retrospective analysis.

Data extraction

Data for each of the GED-C items applicable to type 1 GD were extracted for each individual, including demographics, diagnosis codes (International Classification of Diseases [ICD], Ninth Revision [ICD-9]), laboratory values, imaging reports, the MHS osteoporosis register, and weight and height measurements. Free-text notes were screened manually for the terms presented in Table 1 for all patients with GD and non-GD controls. For the non-GD controls, free-text notes were screened manually in 1 control from the 13 controls available for each patient with GD (chosen randomly). We assumed that the proportion of the terms in the 12 matched controls was similar to the control that was checked manually. Data were extracted from the first record available until 1 year after GD diagnosis. For controls, data were extracted up to 1 year after the GD diagnosis of the matched patient. The 1-year post-diagnosis cutoff was chosen to allow the maximum time for the capture of features on completion of relevant confirmatory testing for GD. Items for GED-C scores were extracted as quantitative values where possible. GED-C scores were calculated for these features, as indicated in Table 2 for all available data, including free text-notes from patient visits (database 1) and Table 3, data from free-text notes from patient visits were not included (database 2).

Table 1 GED-C score parameters extracted from free-text notes from patient visits
Table 2 Characteristics used to calculate GED-C scores in patients with GD and control patients: database 1
Table 3 Characteristics used to calculate GED-C scores in patients with GD and control patients: database 2

Assessing the GED-C scoring system as a discrimination tool

Individuals with GD were compared with controls using features of the GED-C scoring system applicable to patients with type 1 GD [12, 15], and total points were determined for each group. Computation of the GED-C score was carried out as described previously [15]. Briefly, the weight of each feature was set according to the published score [12]. For laboratory data, the maximum or minimum levels (as appropriate) were considered and defined abnormal as indicated in Tables 2 and 3. Dichotomous variables were coded as “yes” or “no.” Evaluation of multiples of normal in spleen and liver size was not feasible; “splenomegaly” received a score of 3 irrespective of size, and “hepatomegaly” was scored as 1.5 points. Growth retardation (based on height, weight, and body mass index measures) was defined as equal to or less than the fifth percentile for age. Asthenia and fatigue were scored by one item because both descriptions use the same word in Hebrew. No ICD-9 code was available for family history of GD.

For constructing discrimination models, 16 available features were used as defined in the GED-C scoring system [12]. These included age, anemia, bleeding, bone issues, dyslipidemia, fatigue, gallstones, gammopathy, growth retardation, hepatomegaly, hyperferritinemia, Jewish ancestry, low bone mineral density, leukopenia, splenomegaly, and thrombocytopenia.

Two models were constructed. In the first, all available data were used, including free-text notes from patient visits (database 1). For the second, data from free-text notes from patient visits were not included (database 2).

Statistical analyses

GED-C summary scores were reported using the median and interquartile range (IQR). Absolute and relative frequencies were reported for nominal data, with the relationship between GD and controls expressed as odds ratio (OR) and 95% CI. A decision tree model using all 16 available features (as detailed above) was constructed using “training data” (80%) and “test data” (20%), and a random forest model was selected and trained in order to identify the main features that can distinguish GD from non-GD controls. The model performance on the validation dataset was evaluated using receiver operating characteristic (ROC) curve and f1-score as a measure of accuracy. The area under the ROC curve (AUC) results are considered excellent for AUC values between 0.9 and 1.0, good for AUC values between 0.8 and 0.9, fair for AUC values between 0.7 and 0.8, poor for AUC values between 0.6 and 0.7, and failed for AUC values between 0.5 and 0.6 [16]. All statistical analyses were performed using R programming version 1.4.1103, packages dplyr, stringr, tidiverse, lubridate, caret, random forest, and ggplot2.

Results

Of 346 individuals with a GD diagnosis code identified from the MHS database, 265 were confirmed as patients with GD following the screening of patient records. A total of 3445 control individuals without GD (13 per GD case) were included in the study.

Manual screening of free-text notes for terms defined in the GED-C scoring system revealed a marked difference in the proportions of individuals with symptoms between GD and control groups, with ORs of approximately 10 or above for splenomegaly, growth retardation, and hepatomegaly (Table 1).

Overall GED-C summary scores were calculated for each database (Fig. 1A, B). For database 1, which included free-text notes from patient visits, median GED-C scores were 7.5 (range 0–17.5; IQR 6.5) for patients with GD (n = 265) and 4.0 (range 0–12.5; IQR 3.5) for controls (n = 3445) (Fig. 1A). GED-C scoring distinguished patients with GD from controls with an AUC of 0.81 (95% CI, 0.78–0.84) (Fig. 2A). For database 2, in which data from free-text notes from patient visits were not included, median GED-C scores were 6.5 (range 0–17.0; IQR 5.0) for patients with GD (n = 265) and 4.0 (range 0–12.5; IQR 3.0) for controls (n = 3445) (Fig. 1B). GED-C scoring distinguished patients with GD from controls with an AUC of 0.74 (95% CI 0.71–0.78) (Fig. 2B).

Fig. 1
figure 1

Boxplot of overall GED-C summary scores. A Patients with GD versus controls for database 1 (Table 2). B Patients with GD versus controls for database 2 (Table 3)

Fig. 2
figure 2

ROC curves for total GED-C scoring. A Patients with GD versus controls for database 1 (Table 2). B Patients with GD versus controls for database 2 (Table 3). FPR, false positive rate

The magnitude of difference in GED-C scoring between patients with GD and controls was greater for some features (e.g., splenomegaly [OR 80.3 in database 1 and 61.9 in database 2], severe thrombocytopenia [platelet count < 50 × 109/L; OR 57.9 in both databases], and hyperferritinemia [ferritin 300–1000 ng/mL; OR 28.9 in both databases]) (Tables 2 and 3).

Decision tree models were constructed for each database (Fig. 3). The models showed good accuracy (0.96 [95% CI 0.95–0.97] for database 1; 0.95 [95% CI 0.94–0.96] for database 2), high specificity (0.99 [95% CI 0.99–1] for database 1; 1.0 [95% CI 0.99–1] for database 2), but relatively low sensitivity (0.53 [95% CI 0.46–0.59] for database 1; 0.32 [95% CI 0.25–0.38] for database 2). The AUC was higher for database 1 (0.79 [95% CI 0.73–0.87]) compared with database 2 (0.69 [95% CI 0.60–0.73]). The f1-scores were 0.66 for database 1 and 0.46 for database 2.

Fig. 3
figure 3

Decision tree models for A database 1 and B database 2. Features of GD based on GED-C scoring as defined in Table 2 for database 1 and in Table 3 for database 2. As per GED-C scoring: anemia is defined as hemoglobin < 14.0 g/dL, hyperferritinemia is defined as ferritin > 300 ng/mL, and thrombocytopenia is defined as platelet count < 150 × 109/L. Decimal values for p parameter (range 0–1) used to show percentage of the split. HDL, high-density lipoprotein cholesterol

A random forest model was developed using the two databases to outline the 10 most important variables for distinguishing patients with GD from controls (Fig. 4). The clinical features of splenomegaly, thrombocytopenia (platelet count < 50 × 109/L), and hyperferritinemia (ferritin 300–1000 ng/mL) were found to be the top three most accurate classifiers of GD in both databases.

Fig. 4
figure 4

Random forest model showing the top 10 variables for distinguishing patients with GD from controls. A database 1 and B database 2. Gini impurity describes the probability of incorrect classification at a given node in the decision tree based on training data. Mean decrease in Gini is a measure of a variable’s importance for determining GD across all the decision trees in the random forest: the mean of a variable’s total decrease in node impurity, weighted by the proportion of samples reaching that node in each decision tree in the random forest. *HDL cholesterol < 35 mg/dL. Bone_dx, bone issues, including pain, crises, avascular necrosis, and fractures; HDL high-density lipoprotein

Discussion

This retrospective observational analysis used real-world data to assess the utility of the GED-C scoring system for discriminating between patients with GD and controls. We found that the GED-C scoring system performance at distinguishing between GD and controls was ‘good’ when using a database that included free-text notes from patient visits and ‘fair’ when using a database that excluded free-text notes. Analysis of both databases consistently identified the most important GED-C features for discriminating GD from non-GD as splenomegaly, thrombocytopenia, and hyperferritinemia.

The three features, splenomegaly, thrombocytopenia and hyperferritinemia were also considered by GD experts to be important features in the diagnosis of GD [12]. There are still some differences that need to be discussed. First, the weight given by experts for these three features was significantly lower than the actual odds of exposure found among patients with GD compared with controls in this real-world dataset. Second, GD experts gave a greater weight (2 points) for mild thrombocytopenia (platelet count 50–150 × 109/L) compared with severe thrombocytopenia (platelet count < 50 × 109/L; 1 point), whereas the actual odds of exposure among patients with GD from this study was acutely higher for severe thrombocytopenia (with a wide CI owing to a lower number). For hyperferritinemia, GD experts gave a lower weight for ferritin levels > 1000 ng/mL (1 point), whereas the odds of exposure among patients in this study was approximately 30 for elevated ferritin levels, regardless of the exact levels. Based on our findings, we can expect that a supervised machine learning classifier, that will not use predefined weights or laboratory cutoffs, could perform better in distinguishing between patients with GD and controls.

Although the models developed in this study show high specificity, they were shown to have low sensitivity. Missing or incomplete data for many features may have been a contributing factor. The definitions used in the GED-scoring may further impact the sensitivity of models to discriminate between patients with GD and controls. For example, the definition of anemia using a threshold of 14 g/L for hemoglobin concentration is relevant to adult males, but this may result in the misclassification of normal hemoglobin levels in patients whose hemoglobin would otherwise fall into the anemic range as specified by their age and gender. Modifying the scoring system may help correctly diagnose more patients with GD at an earlier time in the disease course.

As expected, the database that included data from free-text notes (database 1) showed better performance compared with database 2 because physicians frequently documented GD-related features (such as splenomegaly, hepatomegaly, fatigue, and bone pain) in the visit notes but then failed to code them. This is in line with a study of electronic records for splenomegaly in the Danish National Registry of Patients; the total number of patients coded for splenomegaly was lower than expected, leading the authors to conclude that splenomegaly as a clinical finding was probably under-coded in the ICD system [17]. Similarly, machine learning algorithms using ICD-9 codes did not perform well in identifying patients with systemic sclerosis; the highest-performing algorithms in this study incorporated clinical data with ICD, Tenth Revision (ICD-10), codes [18]. Unfortunately, ICD-10 codes were not available in the MHS database. Under-coding could be related to the difficulty in translating signs and symptoms into a clear and unambiguous classification code, coupled with time pressures on physicians [19].

The ability of the GED-C to discriminate between individuals with and without GD was also shown by other groups. Mehta et al. compared adults with GD (n = 25) to adults with liver disease, hematological malignancy or immune thrombocytopenia (n = 75) [13]. The data were derived from hospital records, with 80%-90% completeness of data. Analyses, based on 11 possible factors, showed good discrimination between those with GD and non-GD individuals, with AUC of 0.88 (95% CI 0.78–0.97) being comparable to the result from database A in our study. Patients with liver disease and hematological malignancy were most likely to have manifestations overlapping with GD [13]. In a study from Finland, clinical data from five patients with GD were compared to electronic health record data of ~ 170,000 adults from a biobank [14]. The score of patients with GD ranged between 6 and 18.5 (based on the available data on 28–29 possible factors). Only 0.72% of adults from the biobank were assigned at least 6 points, but none had a point score as high as 18.5. A follow-up study using the same approach with another Finnish biobank, showed similar results [20]. The score of 8 patients with GD was 6–22.5 points, while only 0.77% of controls had 10 points or higher. Data from patients with GD collected from hospital electronic medical records being compared with control data collected from a biobank, may explain the significant difference found in the studies from Finland.

The use of real-world data is both an advantage and a limitation of the study. The use of real-world data confirmed the significance of the majority of features in the GED-C scoring system. These features would need to be included in future machine-learning models. However, real-world data sets have their limitations. They may contain incomplete information or be prone to errors during data entry. In addition, because GD is a rare disease, the number of individuals with GD, even within this relatively large database of individuals, is restricted and there are inherent limitations to the development of an algorithm using such a small data set of patients. In our research, we employed a manual examination of unstructured text notes to identify features related to GD. The reason for resorting to manual screening was that the existing Hebrew resources for training natural language processing (NLP) are inadequate to accomplish this task using computer-based methods [21]. Manual screening was feasible when limited to 265 patients and 265 controls but not for the entire control cohort. Similarly, machine learning diagnostic algorithms will not be able to be based on manual screening of free-text notes. In addition, our study analyzed data from patients retrospectively, a prospective design would have been preferable in terms of standardization of data collection and reducing potential sources of bias.

The current study was set up to extract patient data on parameters from the GED-C score, chosen as a set of readily available clinical data points established by expert clinician consensus to have diagnostic relevance for patients with GD, and also validated in different geographic patient cohorts [13, 14]. However, we acknowledge that findings on the diagnostic utility of this set of variables are limited to the patient population studied and may not be applicable for populations with different ethnic backgrounds and disease severity. Other clinical manifestations of GD not included in the GED-C scoring system should be considered in future algorithms developed for GD diagnosis, for example co-morbidities that may be associated with GD such as neoplasms, endocrinological disorders like insulin-like growth hormone deficiency, as well as abnormalities in parathyroid hormone levels, phospho-calcium metabolism, and vitamin D levels [22,23,24,25].

In the era of digitized medical records, the opportunity for comparing and combining electronic health data from a wide variety of patient populations is likely to result in better refinement of data-mining tools. As seen with other diseases, use of machine learning for analysis of large amounts of clinical data will assist the development of algorithms for detecting undiagnosed patients with GD, as well as tools optimised for use in particular patient sub-sets [26, 27]. This, accompanied by large-scale testing of biobank samples using alternative diagnostic techniques, is needed to address the challenges of GD diagnosis in the future.

This study forms part of the basis for the development of an artificial intelligence–based algorithm for the accurate diagnosis of GD using machine learning, designed to shorten the diagnostic journey of patients with GD (Fig. 5). Machine learning technologies are being developed for a number of disorders that can potentially accelerate accurate diagnosis by calculating disease probabilities based on symptoms [28, 29]. Integration of machine learning models employing quantifiable variables that are readily discernible and autonomous of clinical evaluations to facilitate screening for rare diseases, such as GD, are needed to improve early detection and treatment [30]. In the next phase of the research process, the models evaluated in this study are to be further refined, by incorporating additional parameters and using supervised machine-learning methods. The best-performing model would then be applied to the MHS healthcare database population to identify those who may have unidentified GD. These data would then be used for further refinement of the machine learning classifiers for GD diagnosis.

Fig. 5
figure 5

Study design for the development of an artificial intelligence–based algorithm. GD Gaucher disease, GED-C Gaucher early diagnosis consensus, MHS Maccabi Health Services

Conclusion

The GED-C score, developed by Delphi expert consensus, shows good discrimination between patients with GD and controls and could be the basis for future models. In our study, as expected, the database that included data from physician notes on frequently documented GD-related features showed the best performance at distinguishing GD patients from controls. The application of machine learning techniques to our cohort is expected to result in an improved diagnostic model with the best possible sensitivity that may be used for screening undiagnosed GD cases.

Availability of data and materials

The data that support the findings of this study are available from the authors but restrictions apply to the availability of these data, which were used under license from the Maccabi Health Services (Israel) for the current study, and so are not publicly available. Data are, however, available from the authors upon reasonable request and with permission from MaccabiTech, Maccabi Health Services, research center.

Abbreviations

AUC:

Area under the receiver operating characteristic curve

GD:

Gaucher disease

GED-C:

Gaucher earlier diagnosis consensus

ICD-9/10:

International classification of diseases, ninth/tenth revision

IQR:

Interquartile range

MHS:

Maccabi Healthcare Services

OR:

Odds ratio

ROC:

Receiver operating characteristic

References

  1. Stirnemann J, et al. A review of Gaucher disease pathophysiology, clinical presentation and treatments. Int J Mol Sci. 2017;18:441.

    Article  PubMed  PubMed Central  Google Scholar 

  2. Schwartz IVD, et al. Characteristics of 26 patients with type 3 Gaucher disease: a descriptive analysis from the Gaucher outcome survey. Mol Genet Metab Rep. 2018;14:73–9.

    Article  CAS  PubMed  Google Scholar 

  3. El-Beshlawy A, et al. Long-term hematological, visceral, and growth outcomes in children with Gaucher disease type 3 treated with imiglucerase in the international collaborative Gaucher group Gaucher registry. Mol Genet Metab. 2017;120:47–56.

    Article  CAS  PubMed  Google Scholar 

  4. Castillon G, Chang SC, Moride Y. Global incidence and prevalence of Gaucher disease: a targeted literature review. J Clin Med. 2022;12:85.

    Article  PubMed  PubMed Central  Google Scholar 

  5. Revel-Vilk S, Szer J, Zimran A. Gaucher disease and related lysosomal storage diseases. In: Williams Hematology. New York: McGraw-Hill Education; 2021. p. 1189–202.

    Google Scholar 

  6. Gonzalez DE, et al. Enzyme replacement therapy with velaglucerase alfa in Gaucher disease: results from a randomized, double-blind, multinational, Phase 3 study. Am J Hematol. 2013;88:166–71.

    Article  CAS  PubMed  Google Scholar 

  7. Hughes DA, et al. Velaglucerase alfa (VPRIV) enzyme replacement therapy in patients with Gaucher disease: long-term data from phase III clinical trials. Am J Hematol. 2015;90:584–91.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. Mistry PK, et al. Timing of initiation of enzyme replacement therapy after diagnosis of type 1 Gaucher disease: effect on incidence of avascular necrosis. Br J Haematol. 2009;147:561–70.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Mehta A, et al. Exploring the patient journey to diagnosis of Gaucher disease from the perspective of 212 patients with Gaucher disease and 16 Gaucher expert physicians. Mol Genet Metab. 2017;122:122–9.

    Article  CAS  PubMed  Google Scholar 

  10. Mistry PK, et al. A reappraisal of Gaucher disease-diagnosis and disease management algorithms. Am J Hematol. 2011;86:110–5.

    Article  PubMed  PubMed Central  Google Scholar 

  11. Mistry PK, Sadan S, Yang R, Yee J, Yang M. Consequences of diagnostic delays in type 1 Gaucher disease: the need for greater awareness among hematologists-oncologists and an opportunity for early diagnosis and intervention. Am J Hematol. 2007;82:697–701.

    Article  PubMed  Google Scholar 

  12. Mehta A, et al. Presenting signs and patient co-variables in Gaucher disease: outcome of the Gaucher earlier diagnosis consensus (GED-C) Delphi initiative. Intern Med J. 2019;49:578–91.

    Article  PubMed  PubMed Central  Google Scholar 

  13. Mehta A, et al. Scoring system to facilitate diagnosis of Gaucher disease. Intern Med J. 2020;50:1538–46.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  14. Savolainen MJ, et al. The Gaucher earlier diagnosis consensus point-scoring system (GED-C PSS): evaluation of a prototype in Finnish Gaucher disease patients and feasibility of screening retrospective electronic health record data for the recognition of potential undiagnosed patients in Finland. Mol Genet Metab Rep. 2021;27:100725.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  15. Revel-Vilk S, et al. Using the Gaucher earlier diagnosis consensus (GED-C) delphi score in a real-world dataset. Int J Transl Med. 2022;2:506–14.

    Google Scholar 

  16. Nahm FS. Receiver operating characteristic curve: overview and practical use for clinicians. Korean J Anesthesiol. 2022;75:25–36.

    Article  PubMed  PubMed Central  Google Scholar 

  17. Curovic RE, et al. Splenomegaly - diagnostic validity, work-up, and underlying causes. PLoS ONE. 2017;12:e0186674.

    Article  Google Scholar 

  18. Jamian L, Wheless L, Crofford LJ, Barnado A. Rule-based and machine learning algorithms identify patients with systemic sclerosis accurately in the electronic health record. Arthritis Res Ther. 2019;21:305.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Tang KL, Lucyk K, Quan H. Coder perspectives on physician-related barriers to producing high-quality administrative data: a qualitative study. CMAJ Open. 2017;5:E617-622.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Pehrsson M, et al. Screening for potential undiagnosed Gaucher disease patients: utilisation of the Gaucher earlier diagnosis consensus point-scoring system (GED-C PSS) in conjunction with electronic health record data, tissue specimens, and small nucleotide polymorphism (SNP) genotype data available in Finnish biobanks. Mol Genet Metab Rep. 2022;33:100911.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  21. Névéol A, Dalianis H, Velupillai S, Savova G, Zweigenbaum P. Clinical Natural Language Processing in languages other than English: opportunities and challenges. J Biomed Semant. 2018;9:12.

    Article  Google Scholar 

  22. Hughes D, et al. Gaucher disease in bone: from pathophysiology to practice. J Bone Miner Res. 2019;34:996–1013.

    Article  PubMed  Google Scholar 

  23. Mikosch P, et al. Patients with Gaucher disease living in England show a high prevalence of vitamin D insufficiency with correlation to osteodensitometry. Mol Genet Metab. 2009;96:113–20.

    Article  CAS  PubMed  Google Scholar 

  24. Rite S, et al. Insulin-like growth factors in childhood-onset Gaucher disease. Pediatr Res. 2002;52:109–12.

    Article  CAS  PubMed  Google Scholar 

  25. Mistry PK, Taddei T, vom Dahl S, Rosenbloom BE. Gaucher disease and malignancy: a model for cancer pathogenesis in an inborn error of metabolism. Crit Rev Oncog. 2013;18:235–46.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Knevel R, Liao KP. From real-world electronic health record data to real-world results using artificial intelligence. Ann Rheum Dis. 2023;82:306–11.

    Article  PubMed  Google Scholar 

  27. Riskin D, et al. Using artificial intelligence to identify patients with migraine and associated symptoms and conditions within electronic health records. BMC Med Inform Decis Mak. 2023;23:121.

    Article  PubMed  PubMed Central  Google Scholar 

  28. Ronicke S, et al. Can a decision support system accelerate rare disease diagnosis? Evaluating the potential impact of Ada DX in a retrospective study. Orphanet J Rare Dis. 2019;14:69.

    Article  PubMed  PubMed Central  Google Scholar 

  29. Gurovich Y, et al. Identifying facial phenotypes of genetic disorders using deep learning. Nat Med. 2019;25:60–4.

    Article  CAS  PubMed  Google Scholar 

  30. Wilson A, et al. Development of a rare disease algorithm to identify persons at risk of Gaucher disease using electronic health records in the United States. Orphanet J Rare Dis. 2023;18:280.

    Article  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

Under the direction of the authors Isobel Lever, PhD, employee of Excel Medical Affairs, provided writing assistance for this publication. Editorial assistance in formatting, proofreading, copy editing, and fact-checking was also provided by Excel Scientific Solutions, Inc.

Funding

This research was funded by Takeda Pharmaceuticals International AG. Under the direction of the authors, Excel Scientific Solutions, Inc. provided writing assistance for this publication. Takeda Development Center Americas, Inc., provided funding to Excel Scientific Solutions, Inc. for support in writing and editing this manuscript under the direction of the authors.

Author information

Authors and Affiliations

Authors

Contributions

SRV and GC: conceptualization; SRV, GC, VS, OP, and OM: methodology; LA: data curation; SRV, GC, VS, AG, OP, OM, AT, and GC: writing—review and editing; AG: funding acquisition. All authors have read and agreed to the final version of the manuscript.

Corresponding author

Correspondence to Shoshana Revel-Vilk.

Ethics declarations

Ethics approval and consent to participate

The study design was approved by MHS Institutional Review Board (0013–21-ASMC). Patient consent was waived owing to the use of de-identified, anonymized data.

Consent for publication

Patient consent was waived owing to the use of de-identified, anonymized data.

Competing interests

SRV has received research grants, speaker fees, and travel support from Pfizer, Sanofi Genzyme, and Takeda; VS is the chief marketing officer of Alike and has received speaker fees from Novartis and Roche; AG is a former employee and a stockholder of Takeda; LA and GC are employees of MaccabiTech; and OP, OM, and AT declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Aidan Gill: Affiliated in Takeda Pharmaceuticals International AG, Zurich, Switzerland at the time of the study

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Revel-Vilk, S., Shalev, V., Gill, A. et al. Assessing the diagnostic utility of the Gaucher Earlier Diagnosis Consensus (GED-C) scoring system using real-world data. Orphanet J Rare Dis 19, 71 (2024). https://doi.org/10.1186/s13023-024-03042-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s13023-024-03042-y

Keywords