Identification of two novel bullous pemphigoid-associated alleles, HLA-DQA1*05:05 and -DRB1*07:01, in Germans


Bullous pemphigoid (BP) is the most common autoimmune skin blistering disease and is characterized by autoimmunity against the hemidesmosomal proteins BP180 (type XVII collagen) and BP230. To elucidate the genetic basis of susceptibility to BP, we performed the first genome-wide association study (GWAS) in Germans. This GWAS was combined with targeted sequencing of the HLA locus in an additional independent BP cohort. The strongest associations with BP in the Germans tested in this study were observed for two HLA alleles, HLA-DQA1*05:05 and HLA-DRB1*07:01. Further studies with increased sample sizes and complex studies integrating multiple pathogenic drivers will be conducted.

Bullous pemphigoid (BP) is the most common autoimmune skin blistering disease in Europe. BP is characterized by autoimmunity against the hemidesmosomal proteins BP180 (type XVII collagen) and BP230 [1]. The pathophysiology of BP is incompletely understood, and the genetic basis of susceptibility to BP is largely unknown because large-scale genetic studies have so far been hampered by the low prevalence of the disease.

Therefore, we set out to perform the first genome-wide association study (GWAS) in Germans to identify the gene variants predisposing to BP. For this purpose, 446 BP patients were recruited by the German AIBD Genetics Study Group and 433 German age- and sex-matched controls were retrieved from the Popgen biobank (Kiel, Germany). The cohorts were genotyped in two batches, both containing patient and control samples, using Applied Biosystems™ UK Biobank Axiom™ Array chips, containing 825,927 markers (Additional file 1: Materials and methods).

The meta-analysis of the GWAS revealed a strong association with SNPs within the HLA locus (6p21.1–21.3) (Additional file 1: Materials and methods; Table 2; Additional file 2: dataset 1), reaching genome-wide significance for 6 SNPs (p < 5E−08). In addition to the HLA locus, multiple loci of suggestive association exceeding the background noise were identified. A λ-GC of 0.8501081 for the meta-analysis, adjusted for 100 cases and controls, indicates that the results for the non-HLA loci may be conservatively biased and include more false negatives than expected. We therefore focused on the HLA locus for further analysis. Allele calling based on the raw GWAS data (Batch 1 and Batch 2, as a discovery study) showed 18 HLA alleles associated with BP (p < 0.05), including DQA1*05:05 (p = 1.23189E−08), DQB1*03:01 (p = 1.10574E−05) and DRB1*07:01 (p = 0.000236558; Additional file 3: dataset 2). To confirm these findings, the entire HLA locus was deep sequenced in samples from 87 independent BP patients and analysed with reference to a northern German blood donor cohort (n = 547), coded according to the National Marrow Donor Program (NMDP) standard, as a replication cohort (Additional file 1: Materials and methods; Additional file 3: dataset 2). A meta-analysis of the discovery and the replication cohorts confirmed two of the 18 HLA alleles identified in the discovery study, HLA-DQA1*05:05 and -DRB1*07:01 (Table 1).
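To illustrate what a genomic inflation factor (λ-GC) below 1 means, the following minimal Python sketch computes λ-GC from a list of association p-values as the median observed 1-df chi-square statistic divided by its expected median under the null. The function names, the rescaling helper and its `n_ref` parameter are illustrative assumptions; the rescaling uses a commonly applied formula, and whether it matches the adjustment used in this study is not confirmed by the text.

```python
from statistics import NormalDist, median

STD_NORMAL = NormalDist()

# Median of a chi-square distribution with 1 degree of freedom (about 0.455).
CHI2_1DF_MEDIAN = STD_NORMAL.inv_cdf(0.75) ** 2


def p_to_chi2(p):
    """Convert a two-sided p-value (0 < p <= 1) to a 1-df chi-square statistic."""
    z = STD_NORMAL.inv_cdf(p / 2.0)  # lower tail avoids rounding for tiny p
    return z * z


def lambda_gc(p_values):
    """Genomic inflation factor: median observed chi-square / expected median."""
    return median(p_to_chi2(p) for p in p_values) / CHI2_1DF_MEDIAN


def rescale_lambda(lam, n_cases, n_controls, n_ref):
    """Rescale lambda to a study with n_ref cases and n_ref controls.

    Assumption: a commonly used rescaling, not necessarily the one
    applied by the authors.
    """
    scale = (1.0 / n_cases + 1.0 / n_controls) / (2.0 / n_ref)
    return 1.0 + (lam - 1.0) * scale


# Synthetic p-values on a uniform grid mimic a well-calibrated null GWAS.
null_p_values = [(i + 0.5) / 1000 for i in range(1000)]
print(lambda_gc(null_p_values))
```

A value near 1 indicates well-calibrated test statistics, values above 1 indicate inflation, and values below 1, as reported here, indicate deflated statistics and therefore a conservative test.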

Of the identified HLA alleles, the association of DQA1*05:05 (p = 8.9783E−7; Table 1) is in line with previous reports: it was identified as a BP susceptibility allele in Brazilian [2] and Chinese [3] BP cohorts. The HLA-DRB1 gene allele DRB1*07:01 was previously identified as a protective allele in a Chinese population [3]. However, these reports were based on a small number of non-European cohorts. To the best of our knowledge, these alleles have not been reported in German or other European BP patients. The allele HLA-DRB1*07:01 has been reported to be associated with increased susceptibility to systemic lupus erythematosus and with the production of autoantibodies (anti-Sm) in Koreans [4].

Interestingly, the allele DQA1*05:05 is reported to be in linkage disequilibrium in different populations [8, 9] with the allele DQB1*03:01, which is reportedly associated with BP in multiple ethnic backgrounds, including Caucasians [2, 5–7]. The functional impact of DQB1*03:01 has been well documented [10, 11], as has its strong association with drug-induced BP [12]. In a conditional analysis of our data, the association of DQA1*05:05 was conditional on DQB1*03:01 and vice versa (Additional file 4: dataset 3). Although both alleles are significantly associated with BP in the discovery cohort, neither effect remains statistically significant at the 0.05 level when conditioned on the other allele. This finding is consistent with linkage disequilibrium between the two alleles. Yet, the confidence interval of the effect of DQB1*03:01 in the meta-analysis is still strictly positive (Table 1). A similar phenomenon is observed with the DRB1*11:01 allele, which has also been reported to be associated with BP [12, 13] and to be in significant linkage disequilibrium with HLA-DQA1*05:05 [9]; its effect in the meta-analysis is positive despite its statistical non-significance (Table 1).
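The distinction between a strictly positive confidence interval and a positive but non-significant effect can be sketched with a pooled estimate. The Python example below assumes an inverse-variance fixed-effect meta-analysis, which may differ from the method described in Additional file 1; the function name and all effect sizes and standard errors are hypothetical and are not taken from Table 1.

```python
from math import sqrt
from statistics import NormalDist

STD_NORMAL = NormalDist()


def pool_fixed_effect(estimates):
    """Inverse-variance fixed-effect pooling of (beta, standard_error) pairs."""
    weights = [1.0 / se ** 2 for _, se in estimates]
    total = sum(weights)
    beta = sum(w * b for w, (b, _) in zip(weights, estimates)) / total
    se = sqrt(1.0 / total)
    p = 2.0 * STD_NORMAL.cdf(-abs(beta / se))  # two-sided p-value
    half_width = STD_NORMAL.inv_cdf(0.975) * se  # 95% confidence interval
    return {"beta": beta, "se": se, "p": p,
            "ci": (beta - half_width, beta + half_width)}


# Hypothetical log odds ratios (beta) and standard errors for two alleles,
# each given as [discovery, replication].
allele_a = [(0.60, 0.15), (0.30, 0.30)]  # interval excludes zero
allele_b = [(0.20, 0.20), (0.10, 0.30)]  # positive effect, interval spans zero

for name, cohorts in (("allele_a", allele_a), ("allele_b", allele_b)):
    result = pool_fixed_effect(cohorts)
    print(name, result["beta"], result["ci"], result["p"])
```

In this sketch the first allele has a pooled interval lying entirely above zero, whereas the second has a positive point estimate whose interval includes zero, mirroring the two situations described for DQB1*03:01 and DRB1*11:01.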

As the sample size of our study is small compared with today’s standard for GWAS of common diseases, such as cardiovascular diseases, the detection of associated variants is limited to common variants shared between the patients, i.e., the HLA locus. Indeed, the allele frequency of DRB1*07:01 is approximately 0.121 in Germans [14]. However, considering the rare nature of BP, disease susceptibility may be attributable to rare variants spread across many different genes other than the HLA locus, affecting shared pathways. These gene variants would therefore possess only small effect sizes and weak associations, which has in recent times been characterized as a defining feature of GWA studies and accounts for what is occasionally referred to as ‘missing heritability’ [15]. To address this issue of minor-effect variants, which is typical for multifactorial and polygenic disorders, targeted sequencing approaches are currently being employed. Another potential explanation for the lack of significant association outside the HLA locus in this GWAS is the involvement of environmental factors (e.g., diet, commensal bacteria) in the pathogenesis of BP. Therefore, the complex gene-environment interactions will be further investigated by the German AIBD Genetics Study Group.
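The effect of sample size and allele frequency on detectability can be illustrated with an approximate power calculation. The Python sketch below uses a normal approximation for a two-sided allelic test comparing allele frequencies between cases and controls at the genome-wide threshold of 5E−08. The cohort sizes (446 cases, 433 controls) and the allele frequency of about 0.12 come from the text; the odds ratio of 1.5, the rare-variant frequency of 0.01 and the function names are hypothetical choices for illustration only.

```python
from math import sqrt
from statistics import NormalDist

STD_NORMAL = NormalDist()


def case_allele_frequency(control_freq, odds_ratio):
    """Allele frequency in cases implied by the control frequency and odds ratio."""
    odds = odds_ratio * control_freq / (1.0 - control_freq)
    return odds / (1.0 + odds)


def allelic_test_power(n_cases, n_controls, control_freq, odds_ratio, alpha=5e-8):
    """Approximate power of a two-sided allelic test (normal approximation)."""
    case_freq = case_allele_frequency(control_freq, odds_ratio)
    # Each individual contributes two alleles.
    se = sqrt(case_freq * (1.0 - case_freq) / (2 * n_cases)
              + control_freq * (1.0 - control_freq) / (2 * n_controls))
    z_crit = -STD_NORMAL.inv_cdf(alpha / 2.0)
    return STD_NORMAL.cdf(abs(case_freq - control_freq) / se - z_crit)


# Hypothetical odds ratio of 1.5 for a common and a rare allele.
print(allelic_test_power(446, 433, control_freq=0.12, odds_ratio=1.5))
print(allelic_test_power(446, 433, control_freq=0.01, odds_ratio=1.5))
```

Under these assumptions, power at genome-wide significance falls steeply as the allele frequency decreases, which is consistent with the argument that a cohort of this size can mainly detect common variants with comparatively large effects.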

In conclusion, we performed the first GWAS in BP using the largest cohort in the world. Together with the results of targeted sequencing of the HLA locus in an additional independent BP cohort, the strongest associations with BP in the Germans tested in this study were observed for the HLA alleles HLA-DQA1*05:05 and HLA-DRB1*07:01. Further studies using increased sample sizes and complex studies integrating multiple pathogenic drivers will be conducted.

Table 1 HLA alleles associated with BP identified in this study

Availability of data and materials

Datasets related to this article were submitted to the European Genome-phenome Archive (EGA) with ID: EGAD00010001956 (GWAS Called Data Batch 2 Called genotypes of samples in batch 2 of CRU303 GWAS), EGAD00010001955 (GWAS Raw Data Batch 1 Controls Raw data files of samples in batch 1 of CRU303), EGAD00010001954 (GWAS Raw Data Batch 1 Cases Raw data files of samples in batch 1 of CRU303 GWAS), EGAD00010001953 (GWAS Raw Data Batch 2 Controls Raw data files of samples in batch 2 of CRU303), EGAD00010001952 (GWAS Raw Data Batch 2 Cases Raw data files of samples in batch 2 of CRU303 GWAS), and EGAD00010001951 (GWAS Called Data Batch 1 Called genotypes of samples in batch 1 of CRU303 GWAS). These data are available upon request.


The authors would like to extend their gratitude towards all patients for their participation in this study.


This work was supported by the German Research Foundation (Deutsche Forschungsgemeinschaft, DFG) through the Clinical Research Unit 303 Pemphigoid Diseases (to NvB, IRK, JEr, HB, CDS, MHi, DZ, ES, and SMI).

CS: data curation, data analysis, writing-original draft, writing-reviewing and editing. DG, AB, JEr, HB, IRK, MWi, AF: GWAS data and HLA data formal analysis, data validation, methodology, writing-reviewing and editing. MF: Sample preparation, writing-reviewing and editing. PN, MT: Performed array genotyping, methodology, writing-reviewing and editing. WL, AF: Popgen cohort recruitment, writing-reviewing and editing. MMH, AD, NvB, MWo, MS, JEh, CG, RG, WKP, MS, RE, MHe, SB, MG, CP, MK, AK, DZ, CDS, ES: Patient recruitment, writing-reviewing and editing. NvB, JEr, HB, DZ, CDS, MHi, IRK, ES, SMI: Funding acquisition, writing-reviewing and editing. MHi: Writing the original draft, writing-reviewing and editing. SMI: Conceptualization, direction, funding acquisition, writing-original draft, writing-reviewing and editing. All authors read and approved the final manuscript.

