Skip to main content

Are the European reference networks for rare diseases ready to embrace machine learning? A mixed-methods study

Abstract

Background

The delay in diagnosis for rare disease (RD) patients is often longer than for patients with common diseases. Machine learning (ML) technologies have the potential to speed up and increase the precision of diagnosis in this population group. We aim to explore the expectations and experiences of the members of the European Reference Networks (ERNs) for RDs with those technologies and their potential for application.

Methods

We used a mixed-methods approach with an online survey followed by a focus group discussion. Our study targeted primarily medical professionals but also other individuals affiliated with any of the 24 ERNs.

Results

The online survey yielded 423 responses from ERN members. Participants reported a limited degree of knowledge of and experience with ML technologies. They considered improved diagnostic accuracy the most important potential benefit, closely followed by the synthesis of clinical information, and indicated the lack of training in these new technologies, which hinders adoption and implementation in routine care. Most respondents supported the option that ML should be an optional but recommended part of the diagnostic process for RDs. Most ERN members saw the use of ML limited to specialised units only in the next 5 years, where those technologies should be funded by public sources. Focus group discussions concluded that the potential of ML technologies is substantial and confirmed that the technologies will have an important impact on healthcare and RDs in particular. As ML technologies are not the core competency of health care professionals, participants deemed a close collaboration with developers necessary to ensure that results are valid and reliable. However, based on our results, we call for more research to understand other stakeholders’ opinions and expectations, including the views of patient organisations.

Conclusions

We found enthusiasm to implement and apply ML technologies, especially diagnostic tools in the field of RDs, despite the perceived lack of experience. Early dialogue and collaboration between health care professionals, developers, industry, policymakers, and patient associations seem to be crucial to building trust, improving performance, and ultimately increasing the willingness to accept diagnostics based on ML technologies.

Background

In the EU, rare diseases (RDs) were first outlined as a health policy priority by the Community action programme on RDs (1999–2003). This programme defined RD as severe conditions affecting no more than 5 per 10,000 persons in the EU [1, 2]. Owing to their complex nature, RDs stood out as a distinctive domain for international coordinated action at the European and international levels [3, 4]. In this context, the EU adopted a series of legislative acts, culminating in the Council Recommendation of June 8, 2009, outlining further action in the RD field [3,4,5,6].

Even though each RD has a low prevalence, between 5000 and 8000 separate rare conditions are nowadays identified, affecting between 6 and 8% of the population at a certain point in their lives [2, 7]. Most RD patients actually have significantly less common diseases that could impact one in every 100 000 persons or less [2]. Because their numbers are very small on a national scale, these people are extremely isolated and vulnerable [2]. They often have a long delay in diagnosis, spending considerable time and resources in seeking advice and testing, commonly referred to as a diagnostic odyssey [8].

Machine learning (ML) represents a new paradigm in RD diagnostics and management. By examining massive volumes of phenotype and genotype data and discovering complex multiallelic patterns, ML-based tools have the potential to increase the precision and speed of RD diagnosis [9]. Nevertheless, the successful implementation of these new technologies encompasses three important milestones: availability (in terms of market authorization) [10]; accessibility (in terms of coverage or reimbursement) [10]; and routine application in clinical practice. Furthermore, compared to other health technologies, clinical studies on ML-based tools have limitations and often lack adapted, robust, and complete evidence, which results in vague and unreliable estimates of efficacy and cost effectiveness [11,12,13]. Finally, ML is far more than a diagnostic technique. It is a health system transformation modality that could produce significant changes and impacts at numerous layers [14].

While decisions on availability and accessibility are mainly made by regulators, payers, healthcare providers, and professionals, the routine application in clinical practice largely depends on the individual medical specialists’ knowledge, attitudes, and willingness to accept and adopt these new health technologies [15,16,17].

The European Reference Networks (ERNs) represent key opinion leaders in the field of RDs. These are virtual networks that connect healthcare providers across Europe. ERNs aim to facilitate discussion on complex or rare conditions that require highly specialised knowledge and concentrated expertise [18]. In 2017, the first 24 ERNs were established, incorporating over 900 highly specialised healthcare units from over 300 hospitals throughout EU Member States [18]. These were gradually joined by more than 600 new member centres of expertise, bringing the total number of ERN members to about 1500 by January 2022 [19].

Cooperation and transfer of knowledge among ERNs have proven to be a very efficient strategy to address RDs in Europe. The added value of ERNs to society is particularly high due to the rarity of these conditions, which implies both a limited number of patients and a scarcity of expertise within a single jurisdiction [2, 18, 20, 21]. ERNs thus unite the most crucial RD healthcare providers in the EU and play an important role in RD policymaking both at the EU and national levels.

The purpose of this study is to explore ERN members’ expectations towards and acceptance of ML and its potential application, to understand the key benefits and risks perceived regarding its potential use, and to identify the key factors being considered by ERN members to promote the use of and access to ML technologies in the diagnostic process.

Methods

We applied a two-stage research framework consisting of an online survey followed by a focus group discussion. As there is no universally accepted definition of ML, prior to the survey design, we consulted with ML stakeholders and came to the following working definition: "Machine learning (ML) is a computer-aided technique that may help physicians make a diagnosis by using information from past patient data". This formulation was presented to all study participants during all stages of the research.

Study setting

This study was conducted as part of the Screen4Care public–private partnership. Screen4Care is funded by the Innovative Medicines Initiative and aims to accelerate RD diagnosis through ML technologies and genetic newborn screening [22]. Thus, the project contributes to people living with a rare genetic disorder by reducing the delay in diagnosis, to a sustainable healthcare system by avoiding inconclusive consultations and costly misdiagnosis, as well as to effective treatments and efficient use of healthcare resources [22, 23].

Our study is positioned within Work Package 1 of the Scree4Care project. This work package aims to understand the business, ethical, and regulatory environment for RD screening and diagnosis in Europe [22]. In particular, Work Package 1 explores the complex decision-making process for funding, reimbursement, and adoption of health technologies based on genetic screening or ML. This is of paramount importance, as the Screen4Care project deliverables, once available, could be implemented in practice [22].

Study participants

Our study targeted primarily medical professionals but also other individuals affiliated with any of the 24 ERNs. We therefore formed a convenience sample of all the health care professionals with publicly available email addresses who were listed either on the ERNs’ websites or on the Orphanet database.

Screening those websites and databases resulted in a total of 2212 individuals that we contacted by email. The recipients received an invitation to participate in the survey with an invitation letter that described the study. In addition, we approached ERN coordinators and asked them to share the survey link within their ERN. We did not provide any incentives for participation.

Survey

We developed the scope and format of the survey based on a literature review on perceptions and expectations of clinical artificial intelligence applications, as identified by Scott et al. [24]. The questionnaire consisted of 23 questions grouped into four sections: (1) socio-demographic and career profile; (2) knowledge and attitude towards ML; (3) attitudes towards ML’s potential implementation and integration in healthcare; and (4) attitudes towards ML’s prospects for disease diagnosis. Each question contained a free text field for providing additional input.

The questionnaire was piloted among a small group of medical professionals to improve consistency and clarity. The full final survey is presented in Table 1 (Appendix). We started the survey on April 19, 2022, and sent monthly reminders using LimeSurvey. The survey was active until September 1, 2022.

Focus group

We drew focus group participants from the respondents who declared their willingness to participate and provided their contact information in the survey. In total, we invited 42 individuals, and 10 finally confirmed their participation.

The focus group discussion took place online on October 19, 2022. We provided the participants with a list of eight questions and obtained their informed consent for audio-visual recording and transcription before the meeting. During the discussion, we reminded the attendees to take their own perspective and experience into account when advising on ML in the diagnostic process of RDs.

Data analysis

Descriptive statistics were applied. The self-reported ML experience was used as the main factor for comparison and analysis of ML attitudes and expectations. Chi-square test and Mann–Whitney U-test were used to compare the group of respondents that reported no ML experience at all to those reporting limited or extensive experience. Statistical significance was considered if the p-value was less than 0.05.

Ethics committee approval

Approval by an Ethics committee was not required for this research. The survey and the focus group discussion were sociological from a methodological point of view and did not involve clinical research.

Results

Survey results

Socio-demographic and career profile of the respondents

423 individual responses were collected. The highest number of responses came from ERN members based in Italy (n = 96, 22.7%), followed by those from Germany and the Netherlands (both n = 38, 9%) Table 2 (Appendix). Respondents indicated an average of 22.7 (SD = 11.3) years of professional experience. The three most common medical specialties were paediatrics (n = 128), nephrology (n = 102), and endocrinology (n = 57).

Medical professionals from ERKNet, the ERN on kidney diseases, most actively took part in the survey (129 respondents), followed by members of Endo-ERN, the ERN on endocrine conditions (72 respondents) Table 2 (Appendix). Only three ERNs returned less than 10 responses: the ERN on rare multisystemic vascular diseases VASCERN (n = 8), the ERN on neuromuscular diseases EURO-NMD (n = 6), and the ERN on connective tissue and musculoskeletal diseases ReCONNET (n = 4). Note that respondents could indicate their affiliation with multiple ERNs.

Knowledge and attitude towards ML

Respondents reported a relatively limited degree of knowledge of and experience with ML. About 50% (n = 208) assessed their own knowledge of ML on the lowest end of a 1–5 scale, whereas only 15 participants rated their knowledge at a 5 (Fig. 1). Similarly, around 60% (n = 251) declared to have never used ML in their clinical practice, and only 14 respondents indicated to have extensive ML experience (Fig. 2).

Fig. 1
figure 1

Distribution of respondents by self-assessed knowledge of ML on a 1–5 scale

Fig. 2
figure 2

Distribution of respondents by self-reported ML experience

Survey participants assessed the importance of ML’s potential benefits and risks on a 1–5 scale, with 5 being the highest level. They rated improved diagnostic accuracy (mean = 3.65, SD = 1.52) as the most important benefit of ML, closely followed by synthesis of clinical information (mean = 3.54, SD = 1.47) (Fig. 3). Respondents who indicated having some ML experience consistently rated the benefits of ML higher. Improved diagnostic accuracy (4.09 vs. 3.35, p < 0.001), more efficient workflows (3.74 vs. 3.20, p = 0.015), and improved access to care (3.64 vs. 2.87, p < 0.001) were all graded significantly higher by this group compared to the group of participants who declared no ML experience.

Fig. 3
figure 3

Assessment of ML potential benefits on a 1–5 scale

The respondents identified insufficient training and continuing professional development around the use of ML for clinical purposes (mean = 3.28, SD = 1.50) as the most important risks of ML’s adoption and implementation, followed by liability for ML-mediated errors (mean = 3.10, SD = 1.57) (Fig. 4). They perceived reputational loss and reduced demand for specialist opinion (mean = 2.38, SD = 1.40) as the least important potential risks. Again, participants who reported some ML experience assessed the importance of those risks higher. Liability of ML-mediated errors (3.41 vs. 2.88, p = 0.047), insufficient training and continuing professional development (3.64 vs. 3.04, p = 0.002), and lack of accuracy, fairness, transparency, and decision-making power of the ML outcomes (3.22 vs. 2.67, p = 0.016) were all graded significantly higher compared to the group of participants who declared no ML experience.

Fig. 4
figure 4

Assessment of ML potential risks on a 1–5 scale

Attitudes towards ML’s potential implementation and integration in healthcare

Survey participants were asked to indicate their attitudes towards different modalities for ML’s potential implementation and integration in healthcare. Most respondents (n = 343, 81.1%) supported the option that ML should be an optional but recommended part of the diagnostic process for RDs. Only 33 respondents (7.8%) believed that ML technologies should be mandatory for RD diagnosis. The presence or lack of ML experience did have an impact on these findings (p = 0.034). A higher proportion of the participants who declared ML experience supported ML being a mandatory part (12.2% vs. 4.8%). On the other hand, 84.9% of the respondents without ML experience believed ML should be optional in this process, compared to 75.6% among those with ML experience.

We could not find a clear consensus on the scope and type of ML diagnostic findings to be disclosed to RD patients (Fig. 5). While 22.7% (n = 92) believed that all ML diagnostic results should be disclosed to patients, 19.1% (n = 81) stated that there is a need for guidance at the EU level on what specific information to disclose.

Fig. 5
figure 5

Distribution of respondents by preferred scope and type of ML diagnostic findings to be disclosed to patients

Public funding was preferred as the main source to cover ML diagnostics of RDs by 50.1% (n = 212) of the survey’s participants (Fig. 6). The rest split their answers mainly between two options: funding through research projects and subsequent public funding if justified (n = 94, 22.2%), and mixed coverage (public–private funding) (n = 89, 21.0%). No significant association was found between the funding preferences and the self-reported ML experience (p = 0.144).

Fig. 6
figure 6

Distribution of respondents by preferred funding of ML diagnostics

About half of the respondents (n = 219, 51.8%) supported the notion that anonymized ML-generated diagnostic data should be available for secondary use only with patients’ consent (consent required), while 28.6% (n = 121) were open to sharing anonymized ML-generated diagnostic data for secondary research if patients did not opt-out explicitly (Fig. 7). We found no significant influence of the ML experience on these findings (p = 0.451).

Fig. 7
figure 7

Distribution of respondents by preferred secondary use of ML anonymous data for research

Attitudes towards ML’s prospects

A majority of respondents (n = 251, 59.3%) expected ML’s clinical application in the next 5 years to be restricted to specialised units (e.g., designated centres of expertise and ERNs). About a quarter (n = 109, 25.8%) expected no change from the current situation, and only 14.9% (n = 63) believed ML could be more widely applied in all clinical settings and all levels of health care, including autonomous application by patients. We found a significant association between the self-reported ML experience and the participants’ attitudes towards the future of ML (p < 0.001). Respondents with ML experience were more optimistic, with 20.3% believing that ML would be routinely applied in all clinical settings, compared to 11.2% among those with no ML experience. On the other hand, a higher portion of the individuals without ML experience expected no change from the current situation, 31.9% vs. 16.9%.

Survey participants were further asked to assess the importance of factors that could encourage the routine application of ML outside of research projects on a 1–5 scale, with 5 being the highest level of importance. Improving clinical decision-making and outcomes (mean = 4.06, SD = 1.30), ensuring accuracy, freedom from bias, and trustworthiness (mean = 3.96, SD = 1.42), and ensuring data privacy, confidentiality, and security (mean = 3.84, SD = 1.40) were considered most important for the routine application of ML. The respondents with and without ML experience only differed significantly regarding the importance of ensuring accuracy, freedom from bias, and trustworthiness, with the experienced group rating this factor higher (4.22 vs. 3.78, p = 0.029).

Respondents assessed the importance of several policy measures suggested to promote the routine application of and access to ML outside research projects on a 1–5 scale, with 5 being the most important. Among those, participants expressed the strongest support for focusing on improved effectiveness in clinical decision-making (mean = 3.84, SD = 1.35), and adherence to legal and community expectations regarding privacy, confidentiality, and security of health and medical data (mean = 3.82, SD = 1.45). Elaboration of regulatory standards that are robust, transparent, and responsive (mean = 3.74, SD = 1.37), clear lines of responsibility regarding liability for error (mean = 3.74, SD = 1.41), and respect for human-to-human interaction and shared decision-making (mean = 3.73, SD = 1.43) were highly appraised. Significant differences were found between the respondents with and without ML experience regarding the requirements for ML tools to be based on models that have involved domain experts (4.04 vs. 3.50, p = 0.006), to fit to and complement routine clinical workflows (3.60 vs. 3.13, p = 0.007), and to be developed with a focus on maximising explainability and transparency in terms of their inner workings (3.70 vs. 3.14, p = 0.002).

Focus group results

Out of the 10 respondents initially confirming their participation in the online focus group, four participants attended online, and two participants provided their written responses instead. The participants were from six European countries and affiliated with seven ERNs.

The focus group lasted for 90 min. Two co-moderators provided discussion prompts following the structure of the online survey. Participants were presented with the survey’s preliminary outcomes before they were invited to comment with their own opinion and explore the answer patterns, while interaction between the focus group participants was encouraged by the co-moderators.

Experience with ML in clinical practice

Focus group participants indicated having little to no experience with ML in their own clinical practice. Previous ML interactions came entirely from research projects. Nevertheless, there was a clear agreement that ML’s potential is substantial and will have an important impact on health care and the RD field.

Participant#1: “I am not [using] it in my clinical practice myself, but I think it is a very interesting way of working. So, I feel that it's necessary to know more.”

Participant#4: “We have a project where we test ML for electronic records, … [but] it is not part of everyday clinical management yet. It is a pilot, and it aims to be an everyday technology when we know that it functions.”

Insufficient training as a main obstacle to the successful implementation of ML in the diagnostic process of RDs

Participants agreed that it is not the responsibility of medical specialists to be experts in ML. Nevertheless, they believed it necessary for clinicians to partner with ML tool developers to ensure that ML-generated outcomes are valid and reliable. To this end, ML training for physicians is considered beneficial if such training is tailored to the actual characteristics and needs of the various medical specialties and conditions. Some of the attendees favoured a top-down approach in the promotion of ML training, starting from highly specialised centres of expertise and eventually trickling down to undergraduate training.

Participant#1: “It is necessary to have some work by medical societies – national, European, and international. They should promote ML in specific areas of interest and continuous education. And after that, we can think about promoting ML at the undergraduate level of training.”

Participant#2: “What is important for clinicians is to understand the process behind it. Use of ML commercial solutions does not require too much explanation and training, but it may require some training to better understand the technologies. I mean, in cases that are more research- and development-oriented, it is important to understand what is behind [the outcomes]. For example, whether, from a clinical perspective, the right data sets are used in the right way. Questions posed in this way make outcomes effective and reliable. However, the bulk of technical work behind it is not what physicians would need to understand or to work on. This is clearly the task of computer scientists and IT professionals.”

Participant#4: “I think we, as clinicians, do not need to be experts on the process behind the ML algorithm, but we need to be sure that the algorithm itself is valid and gives results that we can rely on. This is what is important. We do not need any training per se to develop these ML tools. We need, of course, collaboration with ML experts who know what ML can give us.”

Scope and type of ML diagnostic findings to be disclosed to RD patients

Most participants believe that current regulations and guidelines on the scope and type of diagnostic findings to be disclosed to patients are sufficient to respond well to ML’s implementation and its potential challenges.

Participant#2: “We will decide based on the legislation and the clinical need. ML will be there only to increase our capacity. I do not see any difference if you get this type of information by ML or by any other technology.”

Participant#4: “ML is just a tool, and we already have regulations and guidelines for what kind of incidental findings are mandatory to disclose to the patient.”

Funding of ML diagnostics

Similar to the survey’s findings, participants stated that public funding should be the preferred option in case of RD diagnosis. Nevertheless, there must be a distinction between routine ML diagnostics in clinical settings and research projects. The latter should be funded by the current mix of public, private, and mixed resources.

Participant#4: “When we talk about diagnostics, then it must, of course, be public funding because it is a tool for diagnostics and for the patients’ best [interest].”

Participant#5: “ML should be publicly funded, so that all patients can access it. Otherwise, it may become an add-on that only the wealthy or those with particular insurance can access.”

Participant#6: “If the evidence is sound and demonstrates dramatic improvement in RD diagnosis, the use of these models should be publicly funded and available, just like for new drugs or devices.”

ML application in the next five years

Participants mostly agreed that ML’s application in the next five years should be restricted to specialised units only. They stressed the fact that diagnosis also includes interpretation of the findings and communication with patients and their families.

Participant#1: “…When we are dealing with RDs, it is not only the diagnosis, but it is also the interpretation and the information to the patients and their families. ML seems to be a very fast-moving field, but I do not see it [being applied broadly] right now. But who knows for the next five years.”

Participant#5: “It depends on what the ML is being used for. In a limited sense, various web programmes might be considered ML. Although "Dr. Google" has diagnosed some diseases, it has not been helpful in most cases, and patients become very anxious. However, in the appropriate hands, ML helps to identify a disease that clinicians may only see once in a lifetime and enables rapid referrals. I am in favour of ML for RDs being available locally but with support from centres of expertise. Local centres could use it but need training and must know who their specialty links are.”

Discussion

Our study aimed to explore whether members of ERNs are willing to use ML-based tools for RD diagnostics. However, ultimate decisions on the adoption of and access to these technologies depend on more than ERNs but also on many other stakeholders, such as policymakers, industry, and health care payers, and their experiences and expectations. Nevertheless, medical professionals do represent a distinct link between patients and payers. Therefore, the insights provided by this community are of high interest for adopting novel ML technologies in diagnosis and reducing the delay in diagnosis. Furthermore, ERNs represent pan-European hubs of research and knowledge. Thus, they enjoy a special status in the ecosystem of RDs taking on a leadership role [2,3,4, 20, 21].

Expected benefits and risks of ML-based RD diagnosis

We found a distinctly positive attitude among ERN members towards the use of ML for the diagnosis of RDs, despite the large share of respondents who indicated a lack of knowledge of and experience with such tools. This trend was even more pronounced in the participants who already had some ML experience. As the use of ML becomes more common in medicine and healthcare, the role of trustworthiness of these tools must be discussed [15]. This is a recurrent problem reported by previous research [15, 16, 24], which hinders the successful integration of ML-based tools into existing healthcare workflows [25]. There is a need for ML training curricula targeting medical specialists. However, their focus should be on enhancing ML literacy among clinicians rather than building technical skills [26].

Our focus group indicated that medical specialists should not be held responsible for being ML experts. It is important, however, to promote early dialogue and collaboration between ML developers and members of ERNs. ERNs can inform the research and development of novel ML-based tools for diagnosing RDs. Together, healthcare professionals and ML developers can design tailored ML training across all medical specialties. This kind of much-needed synergy could be the critical starting point for building trust in and willingness to adopt ML-based tools.

Participants firmly expected that the RD field would largely benefit from the adoption of ML-based diagnostic tools. Rapid and reliable disease diagnosis, as well as the synthesis of clinical information, were deemed to be the most significant benefits of this new paradigm. ML-based RD diagnosis seems to be a promising technology, and the reported positive attitudes could be easily transformed into routine clinical application [27]. However, like other studies before, the diagnostic ability of ML is regarded as subject to a physician’s evaluation and final decision [28].

The latter aspect was repeatedly underlined by our focus group attendees. They stressed the fact that interpretation of the findings and communication with patients and their families represent integral parts of the RD diagnostic process and cannot be substituted by any technology. In fact, our participants considered ML-based self-diagnosis by patients very problematic with the current development stage of ML technologies. Contrarily to some previously reported results, reputational loss and reduced demand for specialist opinion due to ML implementation were perceived as the least important potential risks [24].

Preferred implementation modalities of ML-based RD diagnosis

We explored the level of agreement about different implementation modalities of ML-based RD diagnosis. One topic for which our study revealed strongly diverging opinions among ERN members was the secondary use of anonymous ML-based diagnostic findings, which highlight the complex ethical considerations surrounding data sharing and consent in the context of ML technologies [29]. While most of our survey’s respondents believed that anonymized ML-generated diagnostic data should be available for secondary use only with patients’ consent (consent required), a substantial minority were open to the idea of sharing anonymized ML-generated diagnostic data for secondary research if patients did not explicitly opt-out. Further debating the issues of data privacy protection, previous studies have also underlined the importance of establishing guidelines and frameworks to ensure accountability and responsibility in cases of breaches and hacking [30,31,32].

Public funding was clearly preferred as the main source to support ML-based diagnostics of RDs in both our survey and focus group. However, this is not surprising, as we did not ask healthcare payers, such as health insurance companies or national health authorities. One of the participants explicitly tied the question of funding to the issue of equity and equal access to diagnosis and treatment. In the end, it will be important to distinguish between research endeavours and routine ML-based diagnosis. Our results indicate physicians’ preference for financing the latter via a mix of public and private resources.

Most of our study’s participants agreed that ML’s application in the next five years should be restricted to specialised units. We interpret the tendency to specialisation as ML-based technologies being perceived as having the most immediate impact and the biggest potential in tertiary clinical settings, where there is a substantial accumulation of expertise and resources for RDs [20].

However, even if ML technologies spread across healthcare settings, the respondents highlighted the importance of the patient-physician relationship and the interpretation of diagnostic findings, which cannot be replaced by technology. This impact of ML on the medical profession has been underlined by several studies [33]. In particular, ML tools for patient use offer a more optimistic outlook on this complex matter, although healthcare systems’ infrastructure might not be ready to facilitate autonomous ML usage by patients [34].

Limitations

This study has a number of limitations. Convenience sampling was applied, thus some ERNs may be under- or overrepresented in the sample of responses. Therefore, our findings may not be considered fully representative of all ERNs’ attitudes and opinions about ML-based diagnostics of RDs. Nevertheless, to our best knowledge, this is the first attempt to explore the question of ML acceptance among ERNs. While we consider that our study marks an important starting point, the debate around routine application of ML for the diagnosis of RDs is expected to continue in the future.

Second, our survey respondents reported a limited degree of knowledge and experience with ML. This relative lack of expertise may have been reflected in the overall results and conclusions of our research. Nonetheless, we believe that this specific outcome well describes the current RD ecosystem in Europe regarding ERN members’ perception and understanding of this novel technology.

Third, our research provides information on ERNs’ members only. Professionals from other levels of the health care system, especially primary care specialists, should be consulted as well. Successful implementation of ML-based diagnostic tools for RDs, including coverage and reimbursement, will need to be agreed upon and worked on by various other stakeholders as well, including patient organisations. Therefore, it is equally important to also explore the attitudes and expectations of these specific groups.

Last but not least, our study did not explore the technical aspects of ML-based diagnostics for RDs. It is equally important that the findability, accessibility, interoperability, and reusability of the existing RD data sources are surveyed and analysed in order to provide a more objective overview of the short- and long-term prospects of the ML-based diagnosis of RDs in the EU. Within the Screen4Care project, there is a separate work task to address the latter question [35], and its research outcomes would greatly inform this ongoing debate.

Conclusions

We found enthusiasm to implement and apply ML technologies, especially diagnostic tools in the field of RDs, despite the perceived lack of experience. While these findings are subject to limitations, to our best knowledge, they provide the first insights into that complex issue and could serve as a starting point for further research on the potential use of ML within the ERNs. Early dialogue and collaboration between health care professionals, developers, industry, policymakers, and patient associations seem to be crucial to building trust, improving performance, and ultimately increasing the willingness to accept diagnostics based on ML technologies.

Availability of data and materials

The data that support the findings of this study are not openly available due to reasons of sensitivity and are available from the corresponding author upon reasonable request. Data are located in controlled access data storage at the Institute for Rare Diseases (Plovdiv, Bulgaria).

Abbreviations

ERN:

European Reference Network

ML:

Machine learning

RD:

Rare disease

References

  1. Decision No 1295/1999/EC of the European Parliament and of the Council of 29 April 1999 adopting a programme of Community action on rare diseases within the framework for action in the field of public health (1999 to 2003) (OJ L 155, 22.6.1999, p. 1). Decision repealed by Decision No 1786/2002/EC (OJ L 271, 9.10.2002, p. 1).

  2. Council Recommendation of 8 June 2009 on an action in the field of rare diseases (2009/C 151/02) (OJ C 151, 3.7.2009, pp. 7–10).

  3. Rodwell C, Aymé S. Rare disease policies to improve care for patients in Europe. Biochim Biophys Acta. 2015;1852(101):2329–35. https://doi.org/10.1016/j.bbadis.2015.02.008.

    Article  CAS  PubMed  Google Scholar 

  4. Moliner AM, Waligora J. The European union policy in the field of rare diseases. Adv Exp Med Biol. 2017;1031:561–87. https://doi.org/10.1007/978-3-319-67144-4_30.

    Article  PubMed  Google Scholar 

  5. Montserrat Moliner A, Waligóra J. The European union policy in the field of rare diseases. Public Health Gen. 2013;16(6):268–77. https://doi.org/10.1159/000355930.

    Article  CAS  Google Scholar 

  6. Aymé S, Rodwell C. The European union committee of experts on rare diseases: three productive years at the service of the rare disease community. Orphanet J Rare Dis. 2014;28(9):30. https://doi.org/10.1186/1750-1172-9-30.

    Article  Google Scholar 

  7. Communication from the Commission to the European Parliament, the Council, the European Economic and Social Committee and the Committee of the Regions on Rare Diseases - Europe's challenges (COM(2008) 679 final, 11.11.2008)

  8. Miller IM, Yashar BM. Undiagnosed disease network; macnamara EF continuing a search for a diagnosis: the impact of adolescence and family dynamics. Orphanet J Rare Dis. 2023;18(1):6. https://doi.org/10.1186/s13023-022-02598-x.

    Article  PubMed  PubMed Central  Google Scholar 

  9. Roman-Naranjo P, Parra-Perez AM, Lopez-Escamez JA. A systematic review on machine learning approaches in the diagnosis and prognosis of rare genetic diseases. J Biomed Inform. 2023;22(143): 104429. https://doi.org/10.1016/j.jbi.2023.104429.

    Article  Google Scholar 

  10. Blankart CR, Stargardt T, Schreyögg J. Availability of and access to orphan drugs: an international comparison of pharmaceutical treatments for pulmonary arterial hypertension, Fabry disease, hereditary angioedema and chronic myeloid leukaemia. Pharmacoeconomics. 2011;29(1):63–82. https://doi.org/10.2165/11539190-000000000-00000.

    Article  PubMed  Google Scholar 

  11. Farah L, Davaze-Schneider J, Martin T, Nguyen P, Borget I, Martelli N. Are current clinical studies on artificial intelligence-based medical devices comprehensive enough to support a full health technology assessment? A systematic review. Artif Intell Med. 2023;140: 102547. https://doi.org/10.1016/j.artmed.2023.102547.

    Article  PubMed  Google Scholar 

  12. Sounderajah V, Ashrafian H, Golub RM, Shetty S, De Fauw J, Hooft L, Moons K, Collins G, Moher D, Bossuyt PM, Darzi A, Karthikesalingam A, Denniston AK, Mateen BA, Ting D, Treanor D, King D, Greaves F, Godwin J, Pearson-Stuttard J, Harling L, McInnes M, Rifai N, Tomasev N, Normahani P, Whiting P, Aggarwal R, Vollmer S, Markar SR, Panch T, Liu X. STARD-AI Steering committee. Developing a reporting guideline for artificial intelligence-centred diagnostic test accuracy studies: the STARD-AI protocol. BMJ Open. 2021;11(6):1047709. https://doi.org/10.1136/bmjopen-2020-047709.

    Article  Google Scholar 

  13. Norgeot B, Quer G, Beaulieu-Jones BK, Torkamani A, Dias R, Gianfrancesco M, Arnaout R, Kohane IS, Saria S, Topol E, Obermeyer Z, Yu B, Butte AJ. Minimum information about clinical artificial intelligence modeling: the MI-CLAIM checklist. Nat Med. 2020;26(9):1320–4. https://doi.org/10.1038/s41591-020-1041-y.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  14. Alami H, Lehoux P, Auclair Y, de Guise M, Gagnon MP, Shaw J, Roy D, Fleet R, Ag Ahmed MA, Fortin JP. Artificial intelligence and health technology assessment: anticipating a new level of complexity. J Med Internet Res. 2020;22(7): e17707. https://doi.org/10.2196/17707.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Hallowell N, Badger S, Sauerbrei A, Nellåker C, Kerasidou A. “I don’t think people are ready to trust these algorithms at face value”: trust and the use of machine learning algorithms in the diagnosis of rare disease. BMC Med Ethics. 2022;23(1):112. https://doi.org/10.1186/s12910-022-00842-4.

    Article  PubMed  PubMed Central  Google Scholar 

  16. Messner DA, Al Naber J, Koay P, Cook-Deegan R, Majumder M, Javitt G, Deverka P, Dvoskin R, Bollinger J, Curnutte M, Chandrasekharan S, McGuire A. Barriers to clinical adoption of next generation sequencing: perspectives of a policy Delphi panel. Appl Transl Genom. 2016;25(10):19–24. https://doi.org/10.1016/j.atg.2016.05.004.

    Article  Google Scholar 

  17. Varabyova Y, Blankart CR, Greer AL, Schreyögg J. The determinants of medical technology adoption in different decisional systems: a systematic literature review. Health Policy. 2017;121(3):230–42. https://doi.org/10.1016/j.healthpol.2017.01.005.

    Article  PubMed  Google Scholar 

  18. European Commission. Directorate-General for Health and Food Safety. European Reference Networks. Overview. Available from: https://health.ec.europa.eu/european-reference-networks/overview_en Accessed 5 September 2023

  19. European Commission. Directorate-General for Health and Food Safety. 620 newly approved members to join the existing European Reference Networks (ERNs) from 1 January 2022. Available from: https://ec.europa.eu/newsroom/sante/newsletter-archives/36147 Accessed 5 September 2023

  20. Graessner H, Storf H, Schaefer F. Versorgernetzwerke für Menschen mit Seltenen Erkrankungen: Daten und Expertise bündeln [Healthcare networks for people with rare diseases: integrating data and expertise]. Bundesgesundheitsblatt Gesundheitsforschung Gesundheitsschutz. 2022;65(11):1164–9. https://doi.org/10.1007/s00103-022-03592-1.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Talarico R, Aguilera S, Alexander T, Amoura Z, Andersen J, Arnaud L, Avcin T, Marsal Barril S, Beretta L, Bombardieri S, Bortoluzzi A, Bouillot C, Bulina I, Burmester GR, Cannizzo S, Cavagna L, Chaigne B, Cornet A, Corti P, Costedoat-Chalumeau N, Dāvidsone Z, Doria A, Fenech C, Ferraris A, Fischer-Betz R, Fonseca JE, Frank C, Gaglioti A, Galetti I, Guimarães V, Hachulla E, Holmner M, Houssiau F, Iaccarino L, Jacobsen S, Limper M, Malfait F, Mariette X, Marinello D, Martin T, Matthews L, Matucci-Cerinic M, Meyer A, Milas-Ahić J, Moinzadeh P, Montecucco C, Mouthon L, Müller-Ladner U, Nagy G, Patarata E, Pileckyte M, Pruunsild C, Rednic S, Romão VC, Schneider M, Scirè CA, Smith V, Sulli A, Tamirou F, Tani C, Taruscio D, Taulaigo AV, Tincani A, Ticciati S, Turchetti G, van Hagen PM, van Laar JM, Viera A, de Vries-Bouwstra JK, Zschocke J, Cutolo M, Mosca M. The added value of a European Reference Network on rare and complex connective tissue and musculoskeletal diseases: insights after the first 5 years of the ERN ReCONNET. Clin Exp Rheumatol. 2022;40(Suppl 134 5):3–11. https://doi.org/10.55563/clinexprheumatol/d2qz38.

    Article  PubMed  Google Scholar 

  22. Screen4Care. Available from: https://screen4care.eu/ Accessed 5 September 2023

  23. Bick D, Ahmed A, Deen D, Ferlini A, Garnier N, Kasperaviciute D, Leblond M, Pichini A, Rendon A, Satija A, Tuff-Lacey A, Scott RH. Newborn screening by genomic sequencing: opportunities and challenges. Int J Neonatal Screen. 2022;8(3):40. https://doi.org/10.3390/ijns8030040.

    Article  PubMed  PubMed Central  Google Scholar 

  24. Scott IA, Carter SM, Coiera E. Exploring stakeholder attitudes towards AI in clinical practice. BMJ Health Care Inform. 2021;28(1): e100450. https://doi.org/10.1136/bmjhci-2021-100450.

    Article  PubMed  PubMed Central  Google Scholar 

  25. Bajwa J, Munir U, Nori A, Williams B. Artificial intelligence in healthcare: transforming the practice of medicine. Future Healthc J. 2021;8(2):e188–94. https://doi.org/10.7861/fhj.2021-0095.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Kolachalama VB, Garg PS. Machine learning and medical education. NPJ Digit Med. 2018;27(1):54. https://doi.org/10.1038/s41746-018-0061-1.

    Article  Google Scholar 

  27. Ahsan MM, Luna SA, Siddique Z. Machine-learning-based disease diagnosis: a comprehensive review. Healthcare (Basel). 2022;10(3):541. https://doi.org/10.3390/healthcare10030541.

    Article  PubMed  Google Scholar 

  28. Al-Edresee T. Physician acceptance of machine learning for diagnostic purposes: caution, bumpy road ahead! Stud Health Technol Inform. 2022;29(295):83–6. https://doi.org/10.3233/SHTI220666.

    Article  Google Scholar 

  29. Vayena E, Blasimme A, Cohen IG. Machine learning in medicine: addressing ethical challenges. PLoS Med. 2018;15(11): e1002689. https://doi.org/10.1371/journal.pmed.1002689.

    Article  PubMed  PubMed Central  Google Scholar 

  30. Mansouri-Benssassi E, Rogers S, Reel S, Malone M, Smith J, Ritchie F, Jefferson E. Disclosure control of machine learning models from trusted research environments (TRE): new challenges and opportunities. Heliyon. 2023;9(4): e15143. https://doi.org/10.1016/j.heliyon.2023.e15143.

    Article  PubMed  PubMed Central  Google Scholar 

  31. Yoon CH, Torrance R, Scheinerman N. Machine learning in medicine: should the pursuit of enhanced interpretability be abandoned? J Med Ethics. 2022;48(9):581–5. https://doi.org/10.1136/medethics-2020-107102.

    Article  PubMed  Google Scholar 

  32. Rasheed K, Qayyum A, Ghaly M, Al-Fuqaha A, Razi A, Qadir J. Explainable, trustworthy, and ethical machine learning for healthcare: a survey. Comput Biol Med. 2022;149: 106043. https://doi.org/10.1016/j.compbiomed.2022.106043.

    Article  PubMed  Google Scholar 

  33. Darcy AM, Louie AK, Roberts LW. Machine learning and the profession of medicine. JAMA. 2016;315(6):551–2. https://doi.org/10.1001/jama.2015.18421.

    Article  CAS  PubMed  Google Scholar 

  34. Molina-Mula J, Gallo-Estrada J. Impact of nurse-patient relationship on quality of care and patient autonomy in decision-making. Int J Environ Res Public Health. 2020;17(3):835. https://doi.org/10.3390/ijerph17030835.

    Article  PubMed  PubMed Central  Google Scholar 

  35. Raycheva R, Kostadinov K, Mitova E, Bogoeva N, Iskrov G, Stefanov G, Stefanov R. Challenges in mapping European rare disease databases, relevant for ML-based screening technologies in terms of organizational, FAIR and legal principles: scoping review. Front Public Health. 2023;15(11):1214766. https://doi.org/10.3389/fpubh.2023.1214766.

    Article  Google Scholar 

Download references

Acknowledgements

We are grateful for valuable feedback during its early conception by Kosta Shatrov, Anne-Sophie Chalandon, Alicia Granados, Andreas Clemens, Julian Nam, David Wolters, and Kimon Doukoumetzidis.

Funding

Screen4Care project has received funding from the Innovative Medicines Initiative 2 Joint Undertaking (JU) under Grant Agreement No. 101034427. The JU receives support from the European Union’s Horizon 2020 research and innovation programme and EFPIA.

Author information

Authors and Affiliations

Authors

Contributions

The views in this paper reflect only the authors’ views, and the JU is not responsible for any use that may be made of the information it contains. GI, RR and KK contributed to the conception and design of the study, recruitment of study participants, analyzing the data, and writing the draft manuscript. SG, EM, SS, and GS contributed to the conception and design of the study, recruitment of study participants, analyzing the data, and critical review of the manuscript. RB, ESG, GG and RS contributed to the conception and design of the study, and critical review of the manuscript. All authors contributed to the article and approved the submitted version.

Corresponding author

Correspondence to Georgi Iskrov.

Ethics declarations

Ethics approval and consent to participate

Approval by an Ethics committee was not required for this research. The survey and the focus group discussion were sociological from a methodological point of view, with no clinical research. Survey participants provided electronic consent after reading the study’s description and before launching the online questionnaire. Focus group participants provided informed consent for audio-visual recording and transcription before the meeting.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

Appendix

See Tables 1 and 2.

Table 1 Survey questionnaire
Table 2 Socio-demographic and career profile of the respondents

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Iskrov, G., Raycheva, R., Kostadinov, K. et al. Are the European reference networks for rare diseases ready to embrace machine learning? A mixed-methods study. Orphanet J Rare Dis 19, 25 (2024). https://doi.org/10.1186/s13023-024-03047-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s13023-024-03047-7

Keywords