Whole exome analysis of patients in Japan with hearing loss reveals high heterogeneity among responsible and novel candidate genes
Orphanet Journal of Rare Diseases volume 17, Article number: 114 (2022)
Heterogeneous genetic loci contribute to hereditary hearing loss; more than 100 deafness genes have been identified, and the number is increasing. To detect pathogenic variants in multiple deafness genes, as well as novel candidate genes associated with hearing loss, whole exome sequencing (WES) followed by analysis prioritizing genes categorized into four tiers was applied.
Trios from families with non-syndromic or syndromic hearing loss (n = 72) were subjected to WES. After segregation analysis and interpretation according to American College of Medical Genetics and Genomics guidelines, candidate pathogenic variants in 11 previously reported deafness genes (STRC, MYO15A, CDH23, PDZD7, PTPN11, SOX10, EYA1, MYO6, OTOF, OTOG, and ZNF335) were identified in 21 families. Discrepancy between pedigree inheritance and genetic inheritance was present in one family. In addition, eight genes (SLC12A2, BAIAP2L2, HKDC1, SVEP1, CACNG1, GTPBP4, PCNX2, and TBC1D8) were screened as single candidate genes in 10 families.
Our findings demonstrate that four-tier assessment of WES data is efficient and can detect novel candidate genes associated with hearing loss, in addition to pathogenic variants of known deafness genes.
Approximately 1 in every 500 newborns exhibits a degree of hearing loss, and more than half of cases are associated with genetic mutations. Genes responsible for hereditary hearing loss are highly heterogeneous. Recent advances in clinical genome sequencing, focusing on known deafness gene panels, have been used to efficiently detect pathogenic variants, inform appropriate clinical intervention (such as cochlear implants), and estimate prognosis in terms of symptoms [2,3,4]. To date, more than 100 genes have been reported as associated with non-syndromic hearing loss. Further, according to Online Mendelian Inheritance in Man (OMIM), hundreds of genes are associated with syndromic hearing loss. Targeted resequencing of deafness genes is cost-effective and, therefore, beneficial for diagnostic purposes; however, it is not suitable for detection of very rare or novel deafness genes.
Whole exome sequencing (WES) involves sequencing of coding exons comprising approximately 2% of the whole human genome, which are estimated to contain approximately 85% of pathogenic variants associated with monogenic disease. For efficient identification of pathogenic variants in patient samples by analysis of WES data, detected variants are often categorized in several groups, where those in genes previously associated with the clinical features of interest are the first priority for analysis [7, 8]. Sets of prioritized genes can be modified during analysis to increase the number of targeted genes, without resequencing the same samples. For comprehensive investigation of the genetic heterogeneity of diseases with a wide range of causative genes, such as hearing loss, and to identify novel candidate genes, WES analysis overcomes the limitations of targeted analysis and is considerably more cost-effective than whole genome sequencing (WGS) analysis.
In this study, we sought to explore the wide spectrum of genetic heterogeneity associated with hearing loss in Japan, and to discover novel candidate genes associated with hearing loss, using trio analysis of probands and their parents, and four originally developed gene groups ranked by priority (tiers), as a new strategy to filter candidate variants. Using this strategy, we successfully detected candidate pathogenic variants in 11 previously reported deafness genes in 21 families, as well as eight single candidate deafness genes in 10 families.
Editorial policies and ethical considerations
The Ethics Review Committees of the National Hospital Organization Tokyo Medical Center (approval number: R1-0703009) and all collaborating institutes approved the study procedures. All procedures were conducted after written informed consent had been obtained from each subject or their parents.
All subjects were patients visiting the National Hospital Organization Tokyo Medical Center or collaborating hospitals. Medical histories were obtained, and clinical information, such as the results of physical, audiological, and blood tests, was collected from subjects and family members, when available. Hearing loss severity was determined according to the recommendations of the Genetic Deafness study group, using audiological tests, including pure-tone audiometry, auditory steady-state response, conditioned orientation reflex audiometry, or play audiometry, depending on the age of the patient and availability. Subjects with hearing loss related to environmental factors, such as meningitis, premature birth, and rubella, were excluded.
Genomic DNA was obtained from blood samples collected from probands and their family members, mostly parents. Probands with known high prevalence deafness gene variants, and those with specific clinical features suggesting subsets of deafness genes, were filtered using the following methods. All probands were screened for GJB2 variants and for the mitochondrial m.1555A>G and m.3243A>G variants, which are frequently detected in Japanese patients with hereditary hearing loss, as described previously. Probands were also screened for SLC26A4 variants when enlarged vestibular aqueduct was detected by computed tomography (CT), or when they were not examined by CT. Probands with auditory neuropathy, which manifests as normal otoacoustic emission and loss of auditory brainstem responses, were subjected to Sanger sequencing analysis of OTOF. To rule out congenital cytomegalovirus infection, PCR examination for cytomegalovirus in the preserved umbilical cords of probands was conducted, when samples were available.
WES protocols have been reported previously. In brief, genomic DNA extracted from blood was subjected to whole exome region capture using a Nextera Rapid Capture Exome kit (Illumina) and to massively parallel sequencing using the HiSeq2500 platform (Illumina). Sequence reads were mapped onto the human reference genome (GRCh37) with a decoy sequence (hs37d5), using BWA-MEM (v.0.7.5a), and variants were called using Picard (v.1.106) and the Genome Analysis Toolkit (GATK) 3.4.46. Individual variants were joint-called, together with in-house data (WES, n = 498 and WGS, n = 1037), using GenotypeGVCFs. Variants were then annotated using Annovar. Variants in repeat elements or low complexity regions, or considered to result from strand bias, were omitted from further analyses. Average mapping rate, read depth, and numbers of SNVs and indels are presented in Additional file 1.
A schematic flowchart of the WES analysis conducted in this study is shown in Additional file 2. To identify candidate pathogenic changes, variants predicted to alter the encoded protein were first filtered according to minor allele frequency (MAF), as previously described. In brief, a threshold MAF of < 0.001 was applied for AD inheritance mode analysis of global public databases (Database of Single Nucleotide Polymorphisms (dbSNP), East Asian population of 1000 Genomes, NHLBI Exome Variant Server (ESP6500), Exome Aggregation Consortium (ExAC), Genome Aggregation Database (gnomAD), Human Genetic Variation Database (HGVD) ver1.42 based on 1208 healthy Japanese subjects, and an in-house database including 1037 healthy Japanese subjects); a MAF threshold of < 0.003 was applied for sporadic cases and AR inheritance mode analysis of global databases, except that a threshold of < 0.005 was used for the HGVD and in-house databases, as previously described. Variants were further excluded from candidates if all the in silico analyses (LRT, LR, Mutation Assessor, Mutation Taster, Polyphen 2-HDIV, Polyphen 2-HVAR, RadialSVM, SIFT) predicted no, benign, or tolerated effect of the variant. The effect of the splice site variants was predicted by MaxEntScan and Human Splice Finder 3.0 with default threshold values.
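The frequency and in silico filters described above can be sketched as follows. This is a minimal illustration, not the pipeline used in the study: the database keys, the dictionary layout, the treatment of missing frequencies as zero, and the prediction labels are all assumptions made for the example.

```python
# Illustrative sketch; database keys and data layout are hypothetical.
JAPANESE_DBS = {"HGVD", "in_house"}  # Japanese-specific databases named in the text


def maf_threshold(database: str, mode: str) -> float:
    """Return the MAF cutoff for one database; mode is 'AD', 'AR', or 'sporadic'."""
    if mode == "AD":
        return 0.001
    # Sporadic and AR analyses: 0.005 for HGVD/in-house, 0.003 elsewhere.
    return 0.005 if database in JAPANESE_DBS else 0.003


def passes_maf(mafs: dict, mode: str) -> bool:
    """Keep a variant only if its MAF is below the cutoff in every database.

    Assumption: a missing frequency (None) means the variant was not observed.
    """
    return all((maf or 0.0) < maf_threshold(db, mode) for db, maf in mafs.items())


BENIGN_CALLS = {"no effect", "benign", "tolerated"}


def passes_in_silico(predictions: dict) -> bool:
    """Exclude a variant only when every available predictor calls it benign."""
    if not predictions:
        return True  # assumption: nothing to exclude on without predictions
    return not all(call in BENIGN_CALLS for call in predictions.values())


variant_mafs = {"gnomAD": 0.002, "HGVD": 0.004}
print(passes_maf(variant_mafs, "AR"))  # True
print(passes_maf(variant_mafs, "AD"))  # False
```

Note that the in silico rule removes a variant only on unanimous benign predictions, so a single damaging call is enough to retain it.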
Remaining variants were prioritized in four categories before analysis of co-segregation with the disease: (1) Tier 1 genes were reported as associated with non-syndromic, syndromic hearing loss, and diseases including hearing loss as a non-characteristic symptom registered in OMIM (n = 293, gene list in ); (2) Tier 2 genes were associated with hearing loss in animal models by the Mouse Genome Informatics or International Mouse Phenotyping Consortium, and not included in Tier 1 (n = 328, gene list in ); and (3) Tier 3 genes were expressed at > twofold higher levels in M. fascicularis cochlea than in other tissues and were not included in Tier 1 or Tier 2 (n = 305; Additional file 3). Genes with high expression levels in M. fascicularis cochlea are enriched for deafness genes and may therefore contain novel candidates. In total, 926 genes were categorized in Tiers 1–3. Genes not included in Tiers 1–3 were categorized as Tier 4.
Among selected variants co-segregating with hearing loss, those in Tier 1 genes were searched in the OMIM, Human Genome Mutation Database (HGMD) (last accessed March 12, 2019) and ClinVar (last accessed March 12, 2019) to determine the consistency of the clinical features of the individuals in this study, according to PP4 criterion in the ACMG guidelines. Genes associated with syndromic hearing loss were excluded if they met the following criteria: (1) the variants had not been reported as pathogenic or likely pathogenic, and (2) the proband did not exhibit the characteristic symptoms of multiple organ disease caused by that gene.
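The tier assignment amounts to an ordered lookup in which the first matching list wins. The sketch below uses tiny placeholder gene sets drawn from genes named in this article; the full lists (293, 328, and 305 genes) are not reproduced here, and the function name is hypothetical.

```python
# Placeholder subsets for illustration only; the real tier lists are far larger.
TIER1 = {"STRC", "MYO15A", "CDH23"}   # known deafness genes (OMIM)
TIER2 = {"SLC12A2", "BAIAP2L2"}       # hearing loss in animal models
TIER3 = {"HKDC1"}                     # enriched in M. fascicularis cochlea


def assign_tier(gene: str) -> int:
    """Return the priority tier of a gene; earlier tiers take precedence."""
    if gene in TIER1:
        return 1
    if gene in TIER2:
        return 2
    if gene in TIER3:
        return 3
    return 4  # every other gene


for gene in ("STRC", "BAIAP2L2", "HKDC1", "SVEP1"):
    print(gene, assign_tier(gene))

# The three curated tiers are mutually exclusive, so their sizes add up.
print(293 + 328 + 305)  # 926
```

Because the lists are checked in order, a gene that qualifies for more than one list is always reported in its highest-priority tier, which matches the "not included in Tier 1" wording of the definitions.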
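The exclusion rule for syndromic genes is a conjunction of the two criteria, so a gene is dropped only when both hold. A minimal sketch, with hypothetical argument names:

```python
def exclude_syndromic_gene(reported_pathogenic: bool,
                           has_characteristic_symptoms: bool) -> bool:
    """Return True when a syndromic-gene candidate should be excluded.

    Both criteria must be met: the variant has no pathogenic or likely
    pathogenic report, and the proband lacks the characteristic symptoms.
    """
    return (not reported_pathogenic) and (not has_characteristic_symptoms)


# An established pathogenic variant is retained even without syndromic features.
print(exclude_syndromic_gene(reported_pathogenic=True,
                             has_characteristic_symptoms=False))   # False
# A novel variant in a proband without the syndrome's features is excluded.
print(exclude_syndromic_gene(reported_pathogenic=False,
                             has_characteristic_symptoms=False))   # True
```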
Remaining candidate variants were subjected to PCR and Sanger sequencing. Primer sets used in this study are shown in Additional file 5. Representative electropherograms of variants detected in each proband are shown in Additional file 6 and Additional file 10.
Assessment of large deletion allele of STRC
A suspected homozygous deletion of STRC, mapping to chromosome 15q15.3 [28, 29], and detected in patients using IGV, was validated by MLPA (kit P461, MRC-Holland, Amsterdam, Netherlands), according to the manufacturer’s protocols. Copy numbers of exon 19 and the 5′ flanking region of STRC were also examined by duplicate quantitative PCR (qPCR) in subjects and their family members. Primers for copy number quantification of the exon 10 region of MYO7A (NM_000260.3) were used as a reference. Primer sets used in this study are presented in Additional file 5.
Overview of subjects
Seventy-two families including 215 individuals (71 families with the proband and parents, and one with the proband and mother) were recruited for this study (Table 1). Most of the participants were Japanese, while a father with normal hearing in one family was Korean. Within the families, the majority of probands appeared to be sporadic cases (52 families, 72%). In addition, 9 (13%) and 10 (14%) families were presumed to have autosomal-dominant (AD) and autosomal-recessive (AR) inheritance modes, respectively, based on the symptoms of family members. The inheritance mode was not determined in one family, since the proband and both parents had hearing loss. The majority of probands had non-syndromic hearing loss (58 families, 81%).
Detection of candidate pathogenic variants in previously known deafness genes
In WES analysis, a mean read depth of approximately 144 with > 99.9% average mapping rate, was obtained (Additional file 1). Approximately 0.4% of the targeted regions (848 out of 212,158 regions) showed insufficient read depth (< 20) consistently. Among them, 5 regions were included in Tier 1 genes (Additional file 4). In Tier 1 genes, approximately 0.67% of targeted bases (4,644 out of 697,091 bases) showed insufficient read depth consistently.
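The two low-coverage percentages follow directly from the counts given above, as this short check shows:

```python
# Counts taken from the text above.
low_regions, total_regions = 848, 212_158
low_bases, total_bases = 4_644, 697_091

pct_regions = 100 * low_regions / total_regions
pct_bases = 100 * low_bases / total_bases

print(f"{pct_regions:.2f}% of targeted regions")   # 0.40% of targeted regions
print(f"{pct_bases:.2f}% of Tier 1 target bases")  # 0.67% of Tier 1 target bases
```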
By trio analysis, 11 previously reported deafness genes were considered to be responsible for hearing loss in 21 families (Fig. 1). As described in “Methods” section, probands were prescreened for GJB2 variants and the mitochondrial m.1555A>G and m.3243A>G variants, as well as SLC26A4 and OTOF variants, depending on their clinical features. All detected genes, genotypes, diseases, and clinical features of probands are summarized in Table 2. Additional bioinformatic data for each variant, including allele frequencies in population databases, in silico analyses, and conservation among vertebrate species, are shown in Table 3. Partial Sanger sequencing electropherograms validating each variant are presented in Additional file 6. All the variants reported in this study fulfilled at least one criterion (PM2_Supporting, absent or extremely low frequency in population databases), according to the American College of Medical Genetics and Genomics (ACMG) guidelines and the modification of PM2 by ClinGen Sequence Variant Interpretation Working Group (https://www.clinicalgenome.org/site/assets/files/5182/pm2_-_svi_recommendation_-_approved_sept2020.pdf).
Regarding non-syndromic deafness genes, compound heterozygous variants of MYO15A were identified in four sporadic cases (families 1470, 1540, 1479, and 1688); compound heterozygous variants of CDH23 in two sporadic cases (families 1644 and 1528); and compound heterozygous variants of PDZD7 in families 1397 and 1597; all three candidate PDZD7 variants mapped to exon 4 in regions encoding one of the PDZ domains, which are structural anchors that tether the protein to cytoskeletal components. Although ADGRV1 and PDZD7 have been proposed as genes responsible for Usher syndrome type IIC, no candidate variants of ADGRV1 were detected among our patients. A homozygous variant of OTOF was identified in a sporadic case in family 1648. The pathogenicity of the c.5816G>A (p.Arg1939Gln) variant is established [11, 33]. As the proband had not been tested for otoacoustic emission, which is necessary to detect auditory neuropathy, this case was subjected to WES without prescreening for OTOF. Compound heterozygous variants of OTOG were identified in a sporadic case from family 739; one variant was predicted to disrupt splicing at the donor site (5′ splice site) of exon 11 and the other was a nonsense mutation. Loss-of-function of both alleles of OTOG was considered to be sufficient explanation for hearing impairment.
Regarding syndromic deafness genes, two de novo variants of PTPN11 were identified in sporadic cases in families 1631 and 1543. Both detected variants, c.836A>G (p.Tyr279Cys) and c.1529A>G (p.Gln510Arg), reside in regions encoding the catalytic sites of the non-receptor type protein-tyrosine phosphatase, and are established pathogenic variants causing Noonan syndrome 1 (NS1) [35, 36]. The proband in family 1631 showed syndromic symptoms (short stature with subtle ocular hypertelorism, café-au-lait pigmentation, Table 2). Although evaluation of the developmental status of the proband was limited because of the age at the time of genetic testing (2 years 0 months), developmental delay was not noted. The proband in family 1543 was 1 year 10 months old at the time of genetic testing. No clinical features other than hearing loss were noted. Two de novo variants of SOX10 were identified in sporadic cases in families 1583 and 1651. The c.570C>A (p.Cys190Ter) variant maps to exon 3, and the transcript is predicted to be degraded by nonsense-mediated decay (NMD), whereas the other variant, c.1122del (p.Thr375ProfsTer127), maps to the last exon (exon 4) and is predicted to escape NMD. No neurologic disorders were recorded in the proband of family 1583, with the c.570C>A variant, whereas the neurologic symptoms of the proband of family 1651, with c.1122del, were consistent with Waardenburg syndrome, with neurological phenotypes (peripheral demyelinating neuropathy, central dysmyelination) associated with escape from NMD. A heterozygous c.1082G>A (p.Arg361Gln) variant of EYA1, a gene responsible for Branchiootorenal syndrome 1 (BOR1), was identified in family 1636. The proband with the variant had amblyopia with refractive errors, which have not previously been reported in BOR1, while his father with the heterozygous p.Arg361Gln variant showed mild hearing loss without additional noticeable symptoms.
The proband’s mother with normal hearing did not have the variant, and no other family members showed hearing loss. Compound heterozygous variants of ZNF335 were identified in the proband of family 1456. The two variants were both predicted to affect the region encoding the C2H2-type zinc finger domain; the genetic and clinical features of this family have been reported by others.
Assessment of homozygous large deletion spanning STRC
While detection of copy number variants (CNVs) from the results of WES is challenging using a single program, inspection using the Integrative Genomics Viewer (IGV) suggested that probands in families 1410, 1564, 1436, and 1700, and the mother (I-2) from family 1633, showed extremely low read depths across exon 16 and from exons 19 to 26 of STRC, in contrast to a control (III-1 of family 1470), with similar read depths covering all STRC exons (Fig. 2A–E, Additional file 7B). Moreover, this large homozygous deleted region appeared to extend to the adjacent gene (exons 8–10 of CKMT1B), as well as the entire CATSPER2 locus, in all probands (Additional file 7C). Homozygous deletion of both STRC and CATSPER2 has been reported to be associated with deafness-infertility syndrome (OMIM: 611102). The non-reduced read depths at other exons (including exons 1–15 and 27–29 of STRC, and exon 8 of CATSPER2) were likely due to multiple mapping of the sequences of the highly homologous pseudogenes, STRCP1 and CATSPER2P1 (Additional file 7D, E) [28, 41].
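In this study the low-depth exons were found by visual inspection in IGV. A simple automated analogue, offered only as a sketch, would compare per-exon mean depth in a proband with that of a control and flag exons where the ratio is close to zero. The depth values, the 0.1 ratio cutoff, and the function name below are hypothetical.

```python
def low_depth_exons(proband: dict, control: dict, max_ratio: float = 0.1) -> list:
    """Return exons whose proband depth is at most max_ratio of the control depth."""
    flagged = []
    for exon, control_depth in control.items():
        if control_depth <= 0:
            continue  # no usable reference depth for this exon
        if proband.get(exon, 0) / control_depth <= max_ratio:
            flagged.append(exon)
    return sorted(flagged)


# Hypothetical mean read depths keyed by STRC exon number.
control_depth = {16: 120, 19: 110, 20: 130, 27: 125}
proband_depth = {16: 2, 19: 0, 20: 3, 27: 118}

print(low_depth_exons(proband_depth, control_depth))  # [16, 19, 20]
```

As the text notes, exons shared with the pseudogenes can retain apparently normal depth through multiple mapping, so an unflagged exon (exon 27 in this toy example) does not by itself show that the exon is present.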
Multiplex ligation-dependent probe amplification (MLPA) analysis of the probands from families 1410 and 1700 demonstrated homozygous deletion of a genomic region spanning from exon 8 of CKMT1B to exon 1 of CATSPER2 (Additional file 8). According to the positions of the MLPA probes, the 5′ breakpoint was predicted to be between exon 27 of PPIP5K1 (NC_000015.10:g.43851168) and exon 8 of CKMT1B (g.43890333), whereas the 3′ breakpoint was mapped between exon 1 of CATSPER2 (g.43940784) and exon 1 of PDIA3 (g.44038794). Based on the inner and outer boundaries, the deleted region was estimated to be between 50.5 and 187.6 kb. This structural variant resembled those recorded in dbVar (for example, nsv868983 (62.4 kb) and nsv3109791 (145.9 kb)). qPCR targeting of exon 19 and the 5′ UTR of STRC also demonstrated absence of these regions of STRC in patients from families 1410, 1564, 1436, and 1700 (Fig. 2F), consistent with the results of IGV and MLPA analyses (Fig. 2F, Additional file 7B and Additional file 8). In addition, heterozygous large deletion of an STRC allele in the parents of families 1410, 1436, and 1700 was also detected by qPCR (Fig. 2F); however, the copy numbers in the mother (II-5) of family 1564 were difficult to measure, making the exact genotypes predicted to carry the large STRC deletion allele ambiguous. The proband (III-3) and a sibling (III-2) in family 1410 had vision loss, in addition to hearing loss.
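The size range of the deletion follows from the probe coordinates quoted above: the inner pair of probes (both deleted) gives the minimum size and the outer pair (both retained) gives the maximum. A short check, with variable names chosen for this example:

```python
# MLPA probe positions on NC_000015.10, as quoted in the text.
PPIP5K1_EXON27 = 43_851_168   # retained, 5' outer boundary
CKMT1B_EXON8 = 43_890_333     # deleted, 5' inner boundary
CATSPER2_EXON1 = 43_940_784   # deleted, 3' inner boundary
PDIA3_EXON1 = 44_038_794      # retained, 3' outer boundary

min_bp = CATSPER2_EXON1 - CKMT1B_EXON8   # span between deleted probes
max_bp = PDIA3_EXON1 - PPIP5K1_EXON27    # span between retained probes

print(min_bp, max_bp)                                # 50451 187626
print(f"{min_bp / 1000:.1f}-{max_bp / 1000:.1f} kb")  # 50.5-187.6 kb
```

Both dbVar examples cited in the text (62.4 kb and 145.9 kb) fall inside this interval.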
Intriguingly, IGV predicted homozygous deletion of STRC in the mother (I-2) of family 1633 (Additional file 7B), a family initially presumed to have an AD mode of inheritance (Fig. 2E). qPCR demonstrated that the mother (I-2) and the proband (II-1) had homozygous and heterozygous deletion of STRC, respectively, whereas the father (I-1) did not appear to have copy number loss of this gene. The trio of family 1633 was reanalyzed under the assumption that a distinct gene was responsible for hearing loss in the proband. Consequently, a de novo variant of MYO6 (c.1325G>A (p.Cys442Tyr)) was identified in the proband.
Because OTOA is also known to have a highly homologous pseudogene, OTOAP1, especially across exons 21–29, we searched for differences in read depths of OTOA. However, we could not detect any changes suggesting large deletion or duplication of OTOA in any probands.
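The de novo call in a trio can be expressed as a simple genotype test. The sketch below encodes each genotype as the number of alternate alleles (0, 1, or 2); this encoding and the function name are assumptions for illustration, and real calls would also need depth and quality checks in all three samples.

```python
def is_de_novo(child: int, father: int, mother: int) -> bool:
    """True when the child carries an alternate allele that neither parent carries.

    Genotypes are alternate-allele counts: 0 (hom-ref), 1 (het), 2 (hom-alt).
    """
    return child > 0 and father == 0 and mother == 0


# Pattern reported for MYO6 c.1325G>A in family 1633: proband only.
print(is_de_novo(child=1, father=0, mother=0))  # True
# A variant inherited from a heterozygous parent is not de novo.
print(is_de_novo(child=1, father=0, mother=1))  # False
```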
Novel candidate genes associated with hearing loss
In addition to the previously known deafness genes categorized to Tier 1, eight additional genes were narrowed down as single candidates by WES analysis in a total of 10 families (Figs. 1, 3, 4). Two of these genes (SLC12A2 and BAIAP2L2, Tier 2) cause hearing loss phenotypes in mouse models, and one (HKDC1, Tier 3) is predominantly expressed in Macaca fascicularis cochlea. The other five genes (SVEP1, CACNG1, GTPBP4, PCNX2, and TBC1D8) were categorized as Tier 4 genes, with no known association with hearing loss. Genetic information for each variant is presented in Additional file 9. Partial Sanger sequencing electropherograms validating each variant are presented in Additional file 10. Association of SLC12A2 variants with hearing loss has been reported and registered as DFNA78 in OMIM (619081).
A heterozygous variant of BAIAP2L2 was identified as the candidate cause for AD inheritance mode hearing loss in family 1427 (Fig. 3A). This gene encodes the membrane protein, brain-specific angiogenesis inhibitor 1-associated protein 2-like protein 2, which localizes to the plasma membrane in intestine and kidney epithelial cells. Further, single-cell RNA sequencing analysis demonstrated predominant Baiap2l2 expression in hair cells in neonatal mouse cochlear epithelium (Additional file 11), and mice deficient for Baiap2l2 have an increased auditory brainstem response threshold [24, 45]. The c.506T>C (p.Val169Ala) variant is predicted to reside in the IRSp53/MIM homology domain (IMD), which can bind to membranes and interact with a small GTPase (PROSITE: PRU00668). The proband with the heterozygous BAIAP2L2 variant showed congenital, progressive, severe, steep sloping hearing loss without other symptoms.
Compound heterozygous HKDC1 variants were identified as the candidate cause of the sporadic hearing loss in family 1676 (Fig. 3B). This gene encodes hexokinase domain-containing 1, which catalyzes phosphorylation of glucose to generate glucose-6-phosphate. A genome-wide association study (GWAS) identified HKDC1 as a risk factor for gestational hyperglycemia. The missense variant found in the proband (c.1771A>C (p.Lys591Gln)) was predicted to reside in the hexokinase small subdomain 2, whereas the other compound heterozygous variant was predicted to affect splicing (c.376-2A>G). The proband had congenital, mild-to-moderate hearing loss, without other symptoms.
Compound heterozygous variants of SVEP1 were identified as the candidate cause of the sporadic hearing loss in two families: 1535 and 1555 (Fig. 3C). This gene encodes Sushi von Willebrand factor type A EGF and pentraxin domain-containing 1, which may function in cell attachment via integrin α9β1. A GWAS detected SVEP1 as a risk factor for coronary artery disease and knockout of Svep1 in mice is embryonic lethal, with multiple developmental defects. All four variants (c.6766C>G (p.Pro2256Ala), c.7357G>A (p.Val2453Met), c.6977C>T (p.Pro2326Leu), and c.10294T>C (p.Tyr3432His)) found in this study reside in the stretched sushi domains. The probands in families 1535 and 1555 carried the compound heterozygous variants c.[6766C>G];[7357G>A] and c.[6977C>T];[10294T>C], respectively, and had congenital, severe-to-profound non-syndromic hearing loss, without other symptoms.
A de novo heterozygous variant of CACNG1 was identified as the candidate cause of the sporadic hearing loss in family 1669 (Fig. 3D). This gene encodes voltage-dependent calcium channel gamma-1 subunit. The c.461C>T (p.Ser154Leu) variant of CACNG1 is predicted to encode a residue in the transmembrane domain of the putative protein product. Cacng1-knockout mice show dysregulated calcium transport in skeletal muscle. The proband with the variant showed congenital, severe-to-profound hearing loss, without other symptoms.
A de novo heterozygous variant of GTPBP4 was identified as the candidate cause of the sporadic hearing loss in family 1696 (Fig. 4A). This gene encodes a nucleolar GTP-binding protein, and the variant (c.967C>G (p.Leu323Val)) in this gene affects the predicted GTP-binding domain. GTPBP4 mediates ribosomal RNA processing, suppresses schwannoma cell growth, and promotes colorectal carcinoma metastasis in vitro. The proband (III-4) with the variant showed congenital, mild, and mid-frequency hearing loss, without other symptoms.
Compound heterozygous variants of PCNX2 were identified as the candidate cause of the AR inheritance mode hearing loss in family 1685 (Fig. 4B). This gene encodes Pecanex-like protein 2 and is frequently mutated in colorectal carcinomas with high microsatellite instability. The detected variants were a nonsense change (c.4777C>T (p.Arg1593Ter)) and a missense variant (c.3505C>T (p.Arg1169Trp)), residing in the intracellular region of the plasma membrane protein. Pcnx2 deficiency modifies seizure-like behaviors in mice. The proband had congenital, progressive hearing loss, resulting in profound hearing loss at 2 years old, as well as abnormal pulmonary venous return, which was surgically treated at 1 day after birth.
A de novo heterozygous variant of TBC1D8 was identified as a candidate cause of the sporadic hearing loss in family 1575 (Fig. 4C). This gene encodes Tre-2 BUB2p and Cdc16p domain 1 family member 8, which functions as a GTPase-activator of Rab family proteins and promotes tumorigenesis of ovarian cancer. TBC1D8 has also been reported to be within a susceptibility locus for osteoporosis-related traits. The variant c.1997C>T (p.Ser666Leu) was predicted to reside in the putative carboxyl-terminal Rab-GTPase-TBC domain with unknown function. The proband with the variant had congenital, moderate low-frequency hearing loss, without other symptoms.
Identification of variants in known deafness genes by WES analysis
Analysis of Tier 1 prioritized genes using WES data led to successful identification of pathogenic or likely pathogenic variants in 11 known deafness genes in 21 of 72 families, after screening of common deafness genes. Due to higher coverage of coding regions, WES is considered to detect pathogenic variants more efficiently and more cost-effectively than WGS. In addition, we narrowed down eight single genes as candidates associated with hearing loss in 10 families. Analysis of prioritized Tier 1 genes was similarly effective to targeted NGS analysis and enabled efficient determination of the genes responsible for hearing loss in probands. After prescreening for GJB2, m.1555A>G, and m.3243A>G variants, as well as SLC26A4 and OTOF variants, when patient data suggested, the two most frequently identified genes in this study were STRC (DFNB16, five families) and MYO15A (DFNB3, four families). These two genes have been reported as relatively frequent causes of genetic hearing loss in Japan [59, 60] and in studies of other ethnic regions [4, 61]. Subsequently, CDH23, PDZD7, and PTPN11 were detected as causative genes in two families each. Unlike CDH23 and PDZD7, which cause non-syndromic hearing loss or Usher syndrome presenting as non-syndromic hearing loss during childhood, PTPN11 is associated with NS1, which shows a variety of phenotypes in multiple organs. Although two probands with PTPN11 variants had short stature, and one exhibited café-au-lait pigmentation, these clinical features had been unnoticed by the primary physicians. Our findings highlight that NS1 with no-to-mild symptoms, other than hearing loss, can be categorized as non-syndromic hearing loss in certain cases; hence PTPN11 may be a much more frequent cause of hearing loss than previously recognized.
Although a straightforward method to detect CNVs from WES data has yet to be established, homozygous deletion of STRC, which harbors a tandem homologous pseudogene sequence at its genomic locus, with potential for non-allelic homologous recombination, was successfully detected by combined assessment of read depths for each coding exon, MLPA, and qPCR. More extensive analyses of structural variants using several programs [63, 64], WGS, and long read sequencing would reveal exact breakpoints of STRC CNVs.
In addition, this study demonstrated that trio WES analysis is a potent method of deciphering the reasons for discrepancies between pedigree and genetic inheritance, as shown in family 1633, where there was an initial presumption of AD inheritance, but mutations at two separate loci (STRC and MYO6) were detected. This study also demonstrates that WES analysis can be used to identify genes responsible for hearing loss and other factors suspected of influencing coexisting symptoms, to explain the clinical features in families; for example, families 1636 (EYA1 variant with amblyopia) and 1410 (STRC variant with vision loss).
Strategy to discover novel candidate deafness genes by WES analysis
Our strategy to discover novel candidate genes associated with hearing loss from Tiers 2–4 genes was based on the assumption that deafness genes would also cause auditory phenotypes in animal models [24, 25], which we categorized as Tier 2 genes, and that many genes critical for proper hearing in humans would also show predominant expression in M. fascicularis cochlea, which we categorized as Tier 3 genes. We identified SLC12A2, BAIAP2L2, and HKDC1 as promising candidate genes warranting investigation for pathogenicity; however, identification of additional patients with variants in the same candidate genes will be critical for confirming their involvement. SVEP1 variants were detected in two families and are plausible candidates for further investigation, such as in vitro functional analysis or generation of an animal model with the identified variants knocked in. Confirmation of novel deafness genes will improve genetic tests for hearing loss.
We were unable to narrow the candidates down to a single gene in 35 families, and no candidate variants emerged from WES analysis in six families. Hearing loss in these families may be attributable to pathogenic variants in untranslated regions, introns, cryptic splice sites, promoter or enhancer regions, or intergenic regions; to multigenic causes; to chromosomal rearrangements, including CNVs; or to unidentified environmental factors. We are also aware that five exonic regions in Tier 1 genes showed insufficient read depth. Variants in these exons may have gone undetected. In addition, our in silico filtering strategy did not use REVEL scores recommended by Hearing Loss Expert Panel guidelines. In fact, our filtering strategy is considered conservative; variants were filtered out only when all the in silico analyses (see “Methods” section) predicted no, benign, or tolerated effect. As a result, two candidate variants on our list showed low REVEL scores (PDZD7:c.503G>C, REVEL = 0.123 (Table 3), and PCNX2:c.3505C>T, REVEL = 0.139 (Additional file 9)). Although we cannot exclude the possibility that pathogenic variants were filtered out based on in silico prediction, this is considered quite unlikely.
Another possibility is that we may have missed causative genes due to discrepancies between the typical clinical features caused by the gene and those observed in our probands. For example, Tier 1 genes included KDM6A, a gene responsible for syndromic hearing loss (Kabuki syndrome 2; OMIM: 300827). Variants of this gene were not considered as candidates when the proband had non-syndromic hearing loss; however, we cannot exclude the possibility that these variants can be associated with very mild or normal phenotypes, except for hearing loss. As we experienced in the case with a known pathogenic variant of PTPN11 in family 1543, clinical features of several diseases such as Noonan syndrome show a wide spectrum of symptoms, including non-syndromic hearing loss, and these atypical features in patients could have been overlooked and affected the diagnostic yield. It is also possible that symptoms other than hearing loss are late-onset and were overlooked at the time of genetic testing. These are the limitations of this study in detecting Tier 1 genes associated with hearing loss using WES analysis.
WES analysis using a tier system to prioritize genetic analysis is an efficient method to identify pathogenic variants of known deafness genes, as well as novel candidate deafness genes. Further analyses, including accumulation of variants and clinical features of patients, will expand perspectives on hereditary hearing loss.
Availability of data and materials
The ethics committee approves public sharing of a filtered and limited number of variants detected in each subject, but does not approve making the sequencing data of each individual publicly available. All the pathogenic or candidate pathogenic variants detected are included in the manuscript and its Additional files.
WES: Whole exome sequencing
CNV: Copy number variant
Morton CC, Nance WE. Newborn hearing screening—a silent revolution. N Engl J Med. 2006;354:2151–64.
Brownstein Z, Friedman LM, Shahin H, Oron-Karni V, Kol N, Abu Rayyan A, et al. Targeted genomic capture and massively parallel sequencing to identify genes for hereditary hearing loss in Middle Eastern families. Genome Biol. 2011;12:R89.
Mutai H, Suzuki N, Shimizu A, Torii C, Namba K, Morimoto N, et al. Diverse spectrum of rare deafness genes underlies early-childhood hearing loss in Japanese patients: a cross-sectional, multi-center next-generation sequencing study. Orphanet J Rare Dis. 2013;8:172.
Azaiez H, Booth KT, Ephraim SS, Crone B, Black-Ziegelbein EA, Marini RJ, et al. Genomic landscape and mutational signatures of deafness-associated genes. Am J Hum Genet. 2018;103:484–97.
Hereditary Hearing Loss Homepage. http://hereditaryhearingloss.org.
Botstein D, Risch N. Discovering genotypes underlying human phenotypes: past successes for mendelian disease, future approaches for complex disease. Nat Genet. 2003;33(Suppl):228–37.
Taylor JC, Martin HC, Lise S, Broxholme J, Cazier JB, Rimmer A, et al. Factors influencing success of clinical genome sequencing across a broad spectrum of disorders. Nat Genet. 2015;47:717–26.
Guan Q, Balciuniene J, Cao K, Fan Z, Biswas S, Wilkens A, et al. AUDIOME: a tiered exome sequencing-based comprehensive gene panel for the diagnosis of heterogeneous nonsyndromic sensorineural hearing loss. Genet Med. 2018;20:1600–8.
Stephens D. Audiological terms. In: Martini A, Mazzoli M, Stephens D, Read A, editors. Definitions, protocols and guidelines in genetic hearing impairment. New York: Wiley; 2009.
Yamamoto N, Mutai H, Namba K, Morita N, Masuda S, Nishi Y, et al. Prevalence of TECTA mutation in patients with mid-frequency sensorineural hearing loss. Orphanet J Rare Dis. 2017;12:157.
Matsunaga T, Mutai H, Kunishima S, Namba K, Morimoto N, Shinjo Y, et al. A prevalent founder mutation and genotype–phenotype correlations of OTOF in Japanese patients with auditory neuropathy. Clin Genet. 2012;82:425–32.
Mutai H, Wasano K, Momozawa Y, Kamatani Y, Miya F, Masuda S, et al. Variants encoding a restricted carboxy-terminal domain of SLC12A2 cause hereditary hearing loss in humans. PLoS Genet. 2020;16:e1008643.
Shigemizu D, Momozawa Y, Abe T, Morizono T, Boroevich KA, Takata S, et al. Performance comparison of four commercial human whole-exome capture platforms. Sci Rep. 2015;5:12742.
DePristo MA, Banks E, Poplin R, Garimella KV, Maguire JR, Hartl C, et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet. 2011;43:491–8.
Okada Y, Momozawa Y, Sakaue S, Kanai M, Ishigaki K, Akiyama M, et al. Deep whole-genome sequencing reveals recent selection signatures linked to evolution and disease risk of Japanese. Nat Commun. 2018;9:1631.
Wang K, Li M, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38:e164.
Kitts A, Sherry S. The single nucleotide polymorphism database (dbSNP) of nucleotide sequence variation. In: McEntyre J, Ostell J, editors. The NCBI handbook. Bethesda: National Center for Biotechnology Information (US); 2002.
1000 Genomes Project Consortium, Auton A, Brooks LD, Durbin RM, Garrison EP, Kang HM, et al. A global reference for human genetic variation. Nature. 2015;526:68–74.
Lek M, Karczewski KJ, Minikel EV, Samocha KE, Banks E, Fennell T, et al. Analysis of protein-coding genetic variation in 60,706 humans. Nature. 2016;536:285–91.
Karczewski KJ, Francioli LC, Tiao G, Cummings BB, Alfoldi J, Wang Q, et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature. 2020;581:434–43.
Higasa K, Miyake N, Yoshimura J, Okamura K, Niihori T, Saitsu H, et al. Human genetic variation database, a reference database of genetic variations in the Japanese population. J Hum Genet. 2016;61:547–53.
Reese MG, Eeckman FH, Kulp D, Haussler D. Improved splice site detection in Genie. J Comput Biol. 1997;4:311–23.
Desmet FO, Hamroun D, Lalande M, Collod-Beroud G, Claustres M, Beroud C. Human Splicing Finder: an online bioinformatics tool to predict splicing signals. Nucleic Acids Res. 2009;37:e67.
Meehan TF, Conte N, West DB, Jacobsen JO, Mason J, Warren J, et al. Disease model discovery from 3328 gene knockouts by The International Mouse Phenotyping Consortium. Nat Genet. 2017;49:1231–8.
Dickinson ME, Flenniken AM, Ji X, Teboul L, Wong MD, White JK, et al. High-throughput discovery of novel developmental phenotypes. Nature. 2016;537:508–14.
Mutai H, Miya F, Shibata H, Yasutomi Y, Tsunoda T, Matsunaga T. Gene expression dataset for whole cochlea of Macaca fascicularis. Sci Rep. 2018;8:15554.
Richards S, Aziz N, Bale S, Bick D, Das S, Gastier-Foster J, et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet Med. 2015;17:405–24.
Moteki H, Azaiez H, Sloan-Heggen CM, Booth K, Nishio SY, Wakui K, et al. Detection and confirmation of deafness-causing copy number variations in the STRC gene by massively parallel sequencing and comparative genomic hybridization. Ann Otol Rhinol Laryngol. 2016;125:918–23.
Shearer AE, Kolbe DL, Azaiez H, Sloan CM, Frees KL, Weaver AE, et al. Copy number variants are a common cause of non-syndromic hearing loss. Genome Med. 2014;6:37.
Robinson JT, Thorvaldsdottir H, Winckler W, Guttman M, Lander ES, Getz G, et al. Integrative genomics viewer. Nat Biotechnol. 2011;29:24–6.
Lee SY, Han JH, Kim BJ, Oh SH, Lee S, Oh DY, et al. Identification of a potential founder effect of a novel PDZD7 variant involved in moderate-to-severe sensorineural hearing loss in Koreans. Int J Mol Sci. 2019;20:4174.
Ebermann I, Wiesen MH, Zrenner E, Lopez I, Pigeon R, Kohl S, et al. GPR98 mutations cause Usher syndrome type 2 in males. J Med Genet. 2009;46:277–80.
Varga R, Kelley PM, Keats BJ, Starr A, Leal SM, Cohn E, et al. Non-syndromic recessive auditory neuropathy is the result of mutations in the otoferlin (OTOF) gene. J Med Genet. 2003;40:45–50.
Kontaridis MI, Swanson KD, David FS, Barford D, Neel BG. PTPN11 (Shp2) mutations in LEOPARD syndrome have dominant negative, not activating, effects. J Biol Chem. 2006;281:6785–92.
Tartaglia M, Kalidas K, Shaw A, Song X, Musat DL, van der Burgt I, et al. PTPN11 mutations in Noonan syndrome: molecular spectrum, genotype–phenotype correlation, and phenotypic heterogeneity. Am J Hum Genet. 2002;70:1555–63.
Bertola DR, Pereira AC, Passetti F, de Oliveira PS, Messiaen L, Gelb BD, et al. Neurofibromatosis–Noonan syndrome: molecular evidence of the concurrence of both disorders in a patient. Am J Med Genet A. 2005;136:242–5.
Kurosaki T, Maquat LE. Nonsense-mediated mRNA decay in humans at a glance. J Cell Sci. 2016;129:461–7.
Inoue K, Khajavi M, Ohyama T, Hirabayashi S, Wilson J, Reggin JD, et al. Molecular mechanism for distinct neurological phenotypes conveyed by allelic truncating mutations. Nat Genet. 2004;36:361–9.
Sato R, Takanashi J, Tsuyusaki Y, Kato M, Saitsu H, Matsumoto N, et al. Association between invisible basal ganglia and ZNF335 mutations: a case report. Pediatrics. 2016;138:e20160897.
Tan R, Wang Y, Kleinstein SE, Liu Y, Zhu X, Guo H, et al. An evaluation of copy number variation detection tools from whole-exome sequencing data. Hum Mutat. 2014;35:899–907.
Mandelker D, Amr SS, Pugh T, Gowrisankar S, Shakhbatyan R, Duffy E, et al. Comprehensive diagnostic testing for stereocilin: an approach for analyzing medically important genes with high homology. J Mol Diagn. 2014;16:639–47.
Lappalainen I, Lopez J, Skipper L, Hefferon T, Spalding JD, Garner J, et al. DbVar and DGVa: public archives for genomic structural variation. Nucleic Acids Res. 2013;41:D936–41.
Pykalainen A, Boczkowska M, Zhao H, Saarikangas J, Rebowski G, Jansen M, et al. Pinkbar is an epithelial-specific BAR domain protein that generates planar membrane structures. Nat Struct Mol Biol. 2011;18:902–7.
Kolla L, Kelly MC, Mann ZF, Anaya-Rocha A, Ellis K, Lemons A, et al. Characterization of the development of the mouse cochlear epithelium at the single cell level. Nat Commun. 2020;11:2389.
Carlton AJ, Halford J, Underhill A, Jeng JY, Avenarius MR, Gilbert ML, et al. Loss of Baiap2l2 destabilizes the transducing stereocilia of cochlear hair cells and leads to deafness. J Physiol. 2021;599:1173–98.
Hulo N, Bairoch A, Bulliard V, Cerutti L, De Castro E, Langendijk-Genevaux PS, et al. The PROSITE database. Nucleic Acids Res. 2006;34:D227–30.
Guo C, Ludvik AE, Arlotto ME, Hayes MG, Armstrong LL, Scholtens DM, et al. Coordinated regulatory variation associated with gestational hyperglycaemia regulates expression of the novel hexokinase HKDC1. Nat Commun. 2015;6:6069.
Hayes MG, Urbanek M, Hivert MF, Armstrong LL, Morrison J, Guo C, et al. Identification of HKDC1 and BACE2 as genes influencing glycemic traits during pregnancy through genome-wide association studies. Diabetes. 2013;62:3282–91.
Sato-Nishiuchi R, Nakano I, Ozawa A, Sato Y, Takeichi M, Kiyozumi D, et al. Polydom/SVEP1 is a ligand for integrin alpha9beta1. J Biol Chem. 2012;287:25615–30.
Myocardial Infarction Genetics and CARDIoGRAM Exome Consortia Investigators, Stitziel NO, Stirrups KE, Masca NG, Erdmann J, et al. Coding variation in ANGPTL4, LPL, and SVEP1 and the risk of coronary disease. N Engl J Med. 2016;374:1134–44.
Freise D, Held B, Wissenbach U, Pfeifer A, Trost C, Himmerkus N, et al. Absence of the gamma subunit of the skeletal muscle dihydropyridine receptor increases L-type Ca2+ currents and alters channel inactivation properties. J Biol Chem. 2000;275:14476–81.
Jensen BC, Wang Q, Kifer CT, Parsons M. The NOG1 GTP-binding protein is required for biogenesis of the 60 S ribosomal subunit. J Biol Chem. 2003;278:32204–11.
Lee H, Kim D, Dan HC, Wu EL, Gritsko TM, Cao C, et al. Identification and characterization of putative tumor suppressor NGB, a GTP-binding protein that interacts with the neurofibromatosis 2 protein. Mol Cell Biol. 2007;27:2103–19.
Yu H, Jin S, Zhang N, Xu Q. Up-regulation of GTPBP4 in colorectal carcinoma is responsible for tumor metastasis. Biochem Biophys Res Commun. 2016;480:48–54.
Kim NG, Rhee H, Li LS, Kim H, Lee JS, Kim JH, et al. Identification of MARCKS, FLJ11383 and TAF1B as putative novel target genes in colorectal carcinomas with microsatellite instability. Oncogene. 2002;21:5081–7.
Frankel WN, Mahaffey CL, McGarr TC, Beyer BJ, Letts VA. Unraveling genetic modifiers in the gria4 mouse model of absence epilepsy. PLoS Genet. 2014;10:e1004454.
Chen M, Sheng XJ, Qin YY, Zhu S, Wu QX, Jia L, et al. TBC1D8 amplification drives tumorigenesis through metabolism reprogramming in ovarian cancer. Theranostics. 2019;9:676–90.
Hsu YH, Zillikens MC, Wilson SG, Farber CR, Demissie S, Soranzo N, et al. An integration of genome-wide association study and gene expression profiling to prioritize the discovery of novel susceptibility Loci for osteoporosis-related traits. PLoS Genet. 2010;6:e1000977.
Moteki H, Azaiez H, Booth KT, Shearer AE, Sloan CM, Kolbe DL, et al. Comprehensive genetic testing with ethnic-specific filtering by allele frequency in a Japanese hearing-loss population. Clin Genet. 2015;89:466–72.
Miyagawa M, Naito T, Nishio SY, Kamatani N, Usami S. Targeted exon sequencing successfully discovers rare causative genes and clarifies the molecular epidemiology of Japanese deafness patients. PLoS ONE. 2013;8:e71381.
Sloan-Heggen CM, Bierer AO, Shearer AE, Kolbe DL, Nishimura CJ, Frees KL, et al. Comprehensive genetic testing in the clinical evaluation of 1119 patients with hearing loss. Hum Genet. 2016;135:441–50.
Weckselblatt B, Rudd MK. Human structural variation: mechanisms of chromosome rearrangements. Trends Genet. 2015;31:587–99.
Yao R, Zhang C, Yu T, Li N, Hu X, Wang X, et al. Evaluation of three read-depth based CNV detection tools using whole-exome sequencing data. Mol Cytogenet. 2017;10:30.
Shigemizu D, Miya F, Akiyama S, Okuda S, Boroevich KA, Fujimoto A, et al. IMSindel: an accurate intermediate-size indel detection tool incorporating de novo assembly and gapped global-local alignment with split read analysis. Sci Rep. 2018;8:5608.
Gross AM, Ajay SS, Rajan V, Brown C, Bluske K, Burns NJ, et al. Copy-number variants in clinical genome sequencing: deployment and interpretation for rare and undiagnosed disease. Genet Med. 2019;21:1121–30.
Jain M, Fiddes IT, Miga KH, Olsen HE, Paten B, Akeson M. Improved data analysis for the MinION nanopore sequencer. Nat Methods. 2015;12:351–6.
Oza AM, DiStefano MT, Hemphill SE, Cushman BJ, Grant AR, Siegert RK, et al. Expert specification of the ACMG/AMP variant interpretation guidelines for genetic hearing loss. Hum Mutat. 2018;39:1593–613.
The authors wish to thank the families for their participation in this study. We would also like to thank Dr. Atsuko Shimano (Division of Hearing and Balance Research, National Institute of Sensory Organs, National Hospital Organization Tokyo Medical Center) for technical assistance.
This work was supported by a Grant-in-Aid for Clinical Research from the National Hospital Organization of Japan (H30–NHO (kankakuki)–01) to TM, and by High-quality genetic research to identify susceptibility genes of common diseases, Tailor-made Medical Treatment Program (Biobank Japan Project, 17km0305002) to MK.
Ethics approval and consent to participate
The Ethics Review Committees of the National Hospital Organization Tokyo Medical Center (approval number: R1-0703009) and all collaborating institutes approved the study procedures. All procedures were conducted after written informed consent had been obtained from each subject or their parents.
Consent for publication
Consent for publication was obtained from each subject or their parents.
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Summary of whole exome sequencing results.
Flowchart of WES analysis. All detected variants affecting protein-coding sequences with low minor allele frequencies (MAF) in global and Japanese populations were subjected to further filtration. Variants were categorized in four tiers of genes and subjected to co-segregation analysis. See “Materials and methods” for details.
List of genes categorized in Tier 3 in this study.
List of captured regions with insufficient average read depths (<20) of Tier 1 genes in this study.
Primers used in this study.
Partial electropherograms of variants in known deafness genes detected in this study. Green, blue, black, and red peaks indicate nucleotides A, C, G, and T, respectively. Data were derived from probands from: (A) and (B), family 1470; (C), family 1540; (D) and (E), family 1479; (F), family 1688; (G) and (H), family 1644; (I), family 1528; (J) and (K), family 1397; (L), family 1597; (M), family 1648; (N) and (O), family 739; (P), family 1633; (Q), family 1543; (R), family 1631; (S), family 1583; (T), family 1651; (U), family 1636; (V) and (W), family 1456. Reverse complementary sequences are shown in (F), (M), (O), (R), and (T). Segregation of all variants in probands and their parents was validated by Sanger sequencing. Note that in (F), the c.8969delG variant of MYO15A is named according to right-normalized nomenclature, and not as c.8968-1delG as it appears in the electropherogram.
Genome map of the STRC locus and a homozygous large deletion of STRC and CATSPER2 visualized using Integrative Genomics Viewer (IGV). (A), Partial chromosomal 15q15.3 locus visualized using IGV. Genes are shown in blue. (B) and (C), Representative IGV images of WES reads mapped to CKMT1B, STRC (B), and CATSPER2 (C) in probands from families 1410, 1564, 1436, and 1700, and I-2 from family 1633. WES reads in the proband of family 1470 are shown as a control to represent normally mapped reads in the locus. Positions of exons examined by MLPA or mentioned in the manuscript are indicated with arrows. (D) and (E), Multiple mapped reads (blank boxes) at, for example, the exon 1–15 and exon 27–29 regions of STRC, due to inability to distinguish sequences from STRC and STRCP1 (D), and exon 8 of CATSPER2 due to inability to distinguish sequences from CATSPER2 and CATSPER2P1 (E). Single mapped reads are shown in gray boxes.
Homozygous large deletion of the locus containing STRC and CATSPER2 detected by multiplex ligation-dependent probe amplification (MLPA). Representative MLPA results showing homozygous deletion of the region including the partial CKMT1B and entire STRC and CATSPER2 genes in the probands from families 1410 and 1700. Estimated copy numbers of each exon are shown as mean ± S.D.
Variants of novel candidate genes associated with hearing loss.
Electropherograms showing variants in novel candidate genes associated with hearing loss. Data are derived from probands from (A), family 1427; (B) and (C), family 1676; (D) and (E), family 1535; (F) and (G), family 1555; (H), family 1669; (I), family 1696; (J) and (K), family 1685; and (L), family 1575. Reverse complementary sequences are shown in (C), (K), and (L).
Predominant expression of Baiap2l2 in auditory hair cell clusters. Images are derived from single-cell RNA sequencing analysis of mouse cochlear epithelium at postnatal day 1 from the gEAR portal (https://umgear.org). For detailed classification of the cell clusters, see Kolla et al. (2020). DC, Deiters' cells row 1–3; Hensen, Hensen’s cells; IHC, inner hair cells; IPC, inner pillar cells; IPhC, inner phalangeal cells/border cells; IS, inner sulcus cells; IdC, interdental cells; LGER, lateral greater epithelial ridge cells group 1–3; MGER, medial greater epithelial ridge cells; OHC, outer hair cells; OPC, outer pillar cells; OS, outer sulcus cells; Oc90, Oc90-positive cells; eIHC, less mature developing inner hair cells; eOHC, less mature developing outer hair cells.
About this article
Cite this article
Mutai, H., Momozawa, Y., Kamatani, Y. et al. Whole exome analysis of patients in Japan with hearing loss reveals high heterogeneity among responsible and novel candidate genes. Orphanet J Rare Dis 17, 114 (2022). https://doi.org/10.1186/s13023-022-02262-4
- Whole exome sequencing analysis
- Hearing loss
- Deafness genes