- Open Access
Pathway-based classification of genetic diseases
Molecular Cytogeneticsvolume 12, Article number: 4 (2019)
In medical genetics, diseases are classified according to the nature (hypothetical nature) of the underlying genetic defect. The classification is “gene-centric” and “factor-centric”; a disease may be, thereby, designated as monogenic, oligogenic or polygenic/multifactorial. Chromosomal diseases/syndromes and abnormalities are generally considered apart from these designations due to distinctly different formation mechanisms and simultaneous encompassing from several to several hundreds of co-localized genes. These definitions are ubiquitously used and are perfectly suitable for human genetics issues in historical and academic perspective. However, recent achievements in systems biology have offered a possibility to explore the consequences of a genetic defect from genomic variations to molecular/cellular pathway alterations unique to a disease. Since pathogenetic mechanisms (pathways) are more influential on our understating of disease presentation and progression than genetic defects per se, a need for a disease classification reflecting both genetic causes and molecular/cellular mechanisms appears to exist. Here, we propose an extension to the common disease classification based on the underlying genetic defects, which focuses on disease-specific molecular pathways.
The basic idea of our classification is to propose pathways as parameters for designating a genetic disease. To proceed, we have followed the tradition of using ancient Greek words and prefixes to create the terms for the pathway-based classification of genetic diseases. We have chosen the word “griphos” (γρῖφος), which simultaneously means “net” and “puzzle”, accurately symbolizing the term “pathway” currently used in molecular biology and medicine. Thus, diseases may be classified as monogryphic (single pathway is altered to result in a phenotype), digryphic (two pathways are altered to result in a phenotype), etc.; additionally, diseases may be designated as oligogryphic (several pathways are altered to result in a phenotype), polygryphic (numerous pathways or cascades of pathways are altered to result in a phenotype) and homeogryphic in cases of comorbid diseases resulted from shared pathway alterations. We suppose that classifying illness this way using both “gene-centric” and “pathway-centric” concepts is able to revolutionize current views on genetic diseases.
Classically, genetic point of view suggests diseases to be monogenic (digenic etc.), oligogenic, polygenic/multifactorial (complex) or chromosomal. This classification is based on either hypothetical or known nature of genetic defects underlying diseases [1, 2]. Actually, this classification is “gene-centric” and “factor-centric” leading to the dogma that genes and genetic-environmental interactions are the only parameters required to describe a disease with a genetic background. However, advances in genome research evidence that genetic diseases cannot be fully designated using genes and other regulatory elements [3, 4]. Furthermore, the concepts of designating genetic diseases (developed mainly for monogenic and multifactorial diseases) cannot be limited to specific genes or gene-gene interactions, but require extensive knowledge of gene-specific ontological properties and processes occurring at higher levels of causal interactions (i.e. systems biology hierarchy) [5, 6]. These requirements appear to be especially important for the description of chromosomal imbalances and disease-causing copy number variations (CNV). A problematic task is to classify diseases resulting from CNV, which are able to affect either single genes or several genes with random genomic localization producing extreme phenotypic heterogeneity. Consequently, manifestations of pathogenic CNV can be equal either to a monogenic disease or a chromosomal aberration [7,8,9]. On the other hand, monogenic diseases are not simple, exhibiting extreme variability in phenotypic manifestations and molecular/cellular mechanisms [6, 10]. These properties are also applicable to chromosomal syndromes (diseases) and abnormalities that usually encompass from several to several hundred of genes [11,12,13]. In total, the designation of genetic diseases as monogenic, polygenic/multifactorial or chromosomal superficially indicates possible or known genetic cause without reflecting the etiology, as a whole. Since etiology comprises the multilateral evaluation of how a disease can be classified, defined, and discovered , current classification of genetic diseases appears to require an update.
Although (cyto)genomic analysis is the permanent starting point for uncovering the mechanism and etiology of a disease, an indication of gene amount and a speculation about possible genetic-environmental interaction is certainly not enough for the disease designation at the present stage of development in the fields of (cyto)genomics and molecular (systems) medicine. The knowledge of the nature of genetic defects alone poorly defines the etiology of a disease. More precisely, mechanisms of phenotypic outcomes and molecular/cellular pathways to disease remain obscure without a presentation of additional etiologic aspects. Particularly, addressing numerical and structural abnormalities of chromosomes using “gene-centric” concepts is usually confined to the determination of amount of affected genes. However, it is possible that a limited number of genes within the rearranged chromosomal region are intrinsically involved in the clinical outcome. CNV or mutations in different genes attributed to the same pathway may have clinical outcomes similar to chromosome rearrangements or vice versa [7, 13, 15]. For instance, our previous studies of mutation-negative cases of a monogenic disease (Rett syndrome) have shown that the disease can be caused by subchromosome rearrangements (microdeletions), as well . Even though it is the same disease from a clinical point of view, one has to differ between “monogenic” and “chromosomal” Rett syndrome. Single gene mutations are able to produce chromosomal/genomic instability, which is the underlying cause of the clinical outcome [17, 18]. Genome/chromosome instability syndromes (monogenic syndromes) usually exhibit severe manifestations inasmuch as numerous molecular and cellular pathways are altered due to a mutation in a regulatory gene. Thus, it is quite strange that diseases associated with a single pathway defect (e.g. single enzymatic defect) are attributed to the same category as diseases associated with an extensive cascade of abnormal molecular and cellular events. More importantly, these genetic conditions can be merely defined as monogenic, because the underlying cause of the disease manifestations is chromosome/genome instability representing the simultaneous presence of multiple DNA sequence mutations and/or chromosome abnormalities. Accordingly, it is necessary to highlight another problem in classifying diseases caused by genomic variations, which derives from the presence of multiple rearrangements in an individual genome possessing cumulative effect and producing interindividual phenotypic heterogeneity and intercellular genetic variability. The latter underlies numerous complex (polygenic or multifactorial) diseases (i.e. cancer, neurodegenerative and neuropsychiatric disorders) and can only be properly assessed by network-based/pathway-based analyses [19, 20]. Multilateral genomic instability in cancer has a wide range of origins (genetic and environmental). However, regardless of the nature of “initial” genetic defect (monogenic or chromosomal) or starting point of clonal somatic genome evolution, almost all cancers are likely to result from alterations to shared and specific pathways [21, 22]. Pathway-based analysis provides more precise cancer classification than focusing on multitude of different genomic and chromosomal variations in malignant cell populations . Genetic architecture of other complex diseases (e.g. neuropsychiatric and neurodegenerative disorders) seems to be alike . It is highly probable that neuropsychiatric disorders are the result of complex interactions and cascades of single gene mutations, chromosomal abnormalities, somatic mosaicism/genome instability and genetic-environmental interactions . In autism, shared and specific molecular/cellular pathways are disabled by a wide spectrum of genomic alterations. Similarly to cancer, pathway-based analysis determines converging molecular pathways to the disease [26, 27]. Despite a stricter clinical definition, Alzheimer’s disease also exhibits extreme variability of the underlying causes. Additionally, Alzheimer’s disease is associated with somatic mosaicism and chromosome (genome) instability confined to the diseased brain originating from disruptions of cell cycle checkpoint, mitotic signaling and DNA replication pathways [28,29,30]. Since brain pathology featuring Alzheimer’s disease is able to be produced either by single-gene mutations or by environmental factors, it is recommended to designate the disease using pathway-based approaches to unravel the pathogenetic mechanisms . Finally, mutations of genes integrated in a single pathway (functionally related genes) with similar clinical outcomes seem to be the most probable explanation for comorbidity and blurred distinction between monogenic and complex forms [32, 33]. It is noteworthy that pathway-based (network-based) approaches to molecular etiology of complex disease comorbidities shed light on the mechanisms and fascinate the development of targeted therapeutic strategies . In summary, pathway-based analysis of genomic variations is able to add significant information to the data on DNA sequence changes [19,20,21,22,23, 25,26,27, 30, 33, 34]. Therefore, a classification for genetic diseases is likely to benefit from the knowledge of pathways altered to result in specific phenotype.
The idea of human disease classification using pathway-based approaches to the molecular and cellular mechanisms has marked the beginning of the postgenomic era . Focusing on uncovering underlying disease mechanisms using high-resolution genomic data, pathway-based analysis has provided numerous discoveries in the field of molecular medicine. As a result, it is possible to associate a disease not only with specific genomic variations, but also with specific cellular phenotypes and biomarkers or, in other words, with molecular pathways unique to a disease [36,37,38]. To understand functional consequence of genome changes, genomic variation has to be assessed by systems biology approaches to unveil molecular pathways . These approaches are also applicable for chromosome abnormalities resulting in narrowing genotype-phenotype correlations and uncovering intrinsic causal interactions at genomic, transcriptomic, proteomic, and metabolomic levels . Moreover, it has been consistently shown that the etiology of complex diseases may be unraveled almost exclusively by pathway-based analysis of genomic data [41, 42]. In this context, it is to note that complex (multiple) phenotypes resulted from genomic variations are better classified using pathway-based multivariate analysis . Finally, pathway-based analysis, providing the knowledge about the number and extent of alterations to the molecular and cellular pathways, is able to form theoretical and even empirical basis for the treatment of presumably incurable genetic conditions (i.e. chromosome instability syndromes, structural chromosome abnormalities, complex diseases) [44,45,46]. As one can see, pathways are as important as genes for understanding disease etiology.
The “gene-centric” model established during the last three decades has become almost non-competitive for classification of genetic diseases [3, 4]. From the theoretical point of view, a model’s ability to describe phenomena should come at the expense as our knowledge deepens. To provide actual scientific explanation of phenomena, collecting (combining) models seems to be the way to increase the explanatory power . Interestingly, theoretic analysis of biological explanations (definitions of biomedical phenomena) accentuates the role of pathway concept in explaining multicausal relationships between components of a biological system . Therefore, an extension of “gene-centric” model for classifying genetic diseases by combining it with “pathway-centric” model is able to extend the explanatory power. Eventually, theoretical and empirical considerations indicate that classifying genetic diseases using the number of altered pathways in addition to designating hypothetical or known nature of the genetic defects has to be given a chance to be developed.
Following the tradition of creating biomedical terms using ancient Greek words and prefixes, we have proposed to use the word “griphos” (γρῖφος) for making disease designations. The ancient Greek word “griphos” simultaneously means “net” and “puzzle” and adequately symbolizes the term “pathway” in the context of molecular biology and medicine. Combining the word “griphos” and Greek prefixes, following disease designations are suggested: monogryphic — single pathway is altered to result in a phenotype; digryphic, trigriphic, etc.… — two, three, etc. pathways are altered to result in a phenotype; oligogryphic — several pathways are altered to result in a phenotype; polygryphic — numerous pathways or cascades of pathways are altered to result in a phenotype; homeogryphic — shared pathway alterations can result in comorbid diseases.
Natural limitations of the pathway-based classification are associated with a possible lack of knowledge about specific pathways altered in a given disease. However, this is also the case for genetic variants (defects), which are attempted to be associated with complex diseases. Poor reproducibility of findings in genetic studies of complex diseases indicates that pathway-based classification focused on candidate processes might be a solution for etiologic analysis of multifactorial disorders [19, 20, 27, 40]. Additional limitation of the classification can result from difficulties of pathway definitions. The hierarchy of pathways is not precisely determined suggesting the existence of general pathways encompassing less sophisticated ones. In this case, it is hard to indicate whether the disease is associated with a single general pathway (monogryphic) or numerous pathways (polygryphic) of “lower hierarchical levels” are implicated in the etiology. It is highly likely that forthcoming studies on pathway hierarchy may give a solution to this problem.
We outline below some examples of using the proposed classification extension:
A disease caused by a single gene mutation and associated with a phenotype produced by an alteration to a single pathway (i.e. monogenic metabolic diseases) would be defined as “monogenic-monogryphic disease”.
A disease caused by a single gene mutation resulting in a pathogenetic cascade altering numerous pathways (i.e. chromatin remodeling diseases, chromosome instability syndromes) would be defined as “monogenic-polygryphic disease”.
Chromosomal abnormalities altering several pathways would be defined as “chromosomal-oligogryphic diseases”.
A chromosomal syndrome resulting in a rearrangement of a chromosomal locus containing several genes, two of which alter two pathways specific for the syndrome manifestations would be defined as “chromosomal-digryphic disease”.
Comorbid diseases, which are caused by mutations in different genes involved in a single pathway, would be defined as “monogenic-homeogryphic diseases”.
Cases of alterations to shared pathways due to the complex interaction between genetic and environmental factors would be defined as “polygenic or multifactorial-homeogryphic diseases”.
A complex disease caused by an alteration to a specific pathway, which may occur due to a variety of single-gene mutations, chromosome abnormalities and genetic-environmental interactions would be defined as “polygenic or multifactorial-monogryphic disease”.
To show the applicability of the disease classification, we provide more precise examples:
phenylketonuria (a disorder caused by mutations in the PAH gene, encoding phenylalanine hydroxylase catalyzing a reaction of the hydroxylation of phenylalanine to tyrosine) — monogenic-monogryphic disease;
Rett syndrome (a disorder caused by genetic defects in MECP2, a gene involved in several pathways mainly regulating genome activity) — monogenic-polygryphic disease;
ataxia-telangiectasia (a chromosome instability syndrome caused by genetic defects in ATM, a gene involved in a multitude of pathways regulating genome stability maintenance, cell cycle, programed cell death etc.) — monogenic-polygryphic disease;
familial Alzheimer’s disease (rare familial cases of Alzheimer’s disease, mainly considered as multifactorial, are caused by mutations in single genes implicated in multiple pathways) — monogenic-polygryphic disease;
sporadic Alzheimer’s disease (a multifactorial disorder associated with a variety of genetic defects resulting in alterations to multiple pathways) — multifactorial-polygryphic disease;
Williams syndrome (a chromosomal syndrome caused by microdeletions at 7q11.23 leading to a disbalance of 20–30 genes affecting several pathways) — chromosomal-oligogryphic disease;
A unique case of chromosomal microdeletion at 3p22.1p21.31 resulting in alterations of two pathways (for more details, see ) — chromosomal-digryphic disease.
The present extension to the common disease classification is not suggested to substitute the ultimately accepted designations of human diseases (i.e. monogenic, poygenic/multifactorial/complex and chromosomal). Indeed, an addition to the indication of the nature (hypothetical nature) of the underlying genetic defect highlighting the disease-specific molecular pathway appears to be required both for medical studies and for academic research. To this end, we do hope that this classification extension using both “gene-centric” and “pathway-centric” concepts may revolutionize current views on genetic diseases.
Copy number variations
Passarge E. The human genome and disease. In: Feinendegen LE, Shreeve WW, Eckelman WC, Bahk YW, Wagner HN, editors. Molecular Nuclear Medicine. Berlin, Heidelberg: Springer; 2003. p. 31–7.
Turnpenny PD, Ellard S. Emery’s elements of medical genetics. In: E-book. Elsevier Health Sciences. 15th ed; 2016.
Heng HH. The genome-centric concept: resynthesis of evolutionary theory. BioEssays. 2009;31(5):512–25.
Heng HH, Liu G, Stevens JB, Bremer SW, Ye KJ, Abdallah BY, Horne SD, Ye CJ. Decoding the genome beyond sequencing: the new phase of genomic research. Genomics. 2011;98(4):242–52.
Stewart J. Genetics, biology and multifactorial diseases. Acta Biotheor. 2002;50(4):323–9.
Dekeuwer C. Conceptualization of genetic disease. In: Schramme T, Edwards S, editors. Handbook of the philosophy of medicine. Dordrecht: Springer; 2015. p. 1–18.
Iourov IY, Vorsanova SG, Yurov YB. Molecular cytogenetics and cytogenomics of brain diseases. Curr Genomics. 2008;9(7):452–65.
Zarrei M, MacDonald JR, Merico D, Scherer SW. A copy number variation map of the human genome. Nat Rev Genet. 2015;16(3):172–83.
Aouiche C, Shang X, Chen B. Copy number variation related disease genes. Quant Biol. 2018;6(2):99–112.
Antonarakis SE, Beckmann JS. Mendelian disorders deserve more attention. Nat Rev Genet. 2006;7(4):277–82.
Gersen SL, Keagle MB, editors. The principles of clinical cytogenetics. Trenton: Humana Press Inc; 2005.
Iourov IY, Vorsanova SG, Yurov YB. Chromosomal variation in mammalian neuronal cells: known facts and attractive hypotheses. Int Rev Cytol. 2006;249:143–91.
Liehr T. Benign & pathological chromosomal imbalances. 1st ed. Oxford: Academic; 2014.
Ross LN. The doctrine of specific etiology. Biol Philos. 2018;33:37.
Heng HH, Horne SD, Chaudhry S, Regan SM, Liu G, Abdallah BY, Ye CJ. A postgenomic perspective on molecular cytogenetics. Curr Genomics. 2018;19(3):227–39.
Iourov IY, Vorsanova SG, Voinova VY, Kurinnaia OS, Zelenova MA, Demidova IA, Yurov YB. Xq28 (MECP2) microdeletions are common in mutation-negative females with Rett syndrome and cause mild subtypes of the disease. Mol Cytogenet. 2013;6(1):53.
Iourov IY, Vorsanova SG, Liehr T, Kolotii AD, Yurov YB. Increased chromosome instability dramatically disrupts neural genome integrity and mediates cerebellar degeneration in the ataxia-telangiectasia brain. Hum Mol Genet. 2009;18(14):2656–69.
Iourov IY, Vorsanova SG, Zelenova MA, Korostelev SA, Yurov YB. Genomic copy number variation affecting genes involved in the cell cycle pathway: implications for somatic mosaicism. Int J Genomics. 2015;2015:757680.
Heng HH, Regan S. A systems biology perspective on molecular cytogenetics. Curr Bioinforma. 2017;12(1):4–10.
Yurov YB, Iourov IY, Vorsanova SG. Network-based classification of molecular cytogenetic data. Curr Bioinforma. 2017;12(1):27–33.
Chuang HY, Lee E, Liu YT, Lee D, Ideker T. Network-based classification of breast cancer metastasis. Mol Syst Biol. 2007;3:140.
Kim S, Kon M, DeLisi C. Pathway-based classification of cancer subtypes. Biol Direct. 2012;7:21.
Lee E, Chuang HY, Kim JW, Ideker T, Lee D. Inferring pathway activity toward precise disease classification. PLoS Comput Biol. 2008;4(11):e1000217.
Willsey AJ, Morris MT, Wang S, Willsey HR, Sun N, Teerikorpi N, Baum TB, Cagney G, Bender KJ, Desai TA, Srivastava D, Davis GW, Doudna J, Chang E, Sohal V, Lowenstein DH, Li H, Agard D, Keiser MJ, Shoichet B, von Zastrow M, Mucke L, Finkbeiner S, Gan L, Sestan N, Ward ME, Huttenhain R, Nowakowski TJ, Bellen HJ, Frank LM, Khokha MK, Lifton RP, Kampmann M, Ideker T, State MW, Krogan NJ. The psychiatric cell map initiative: a convergent systems biological approach to illuminating key molecular pathways in neuropsychiatric disorders. Cell. 2018;174(3):505–20.
Iourov IY, Vorsanova SG, Yurov YB. Somatic cell genomics of brain disorders: a new opportunity to clarify genetic-environmental interactions. Cytogenet Genome Res. 2013;139(3):181–8.
Sakai Y, Shaw CA, Dawson BC, Dugas DV, Al-Mohtaseb Z, Hill DE, Zoghbi HY. Protein interactome reveals converging molecular pathways among autism disorders. Sci Transl Med. 2011;3(86):86ra49.
Vorsanova SG, Yurov YB, Iourov IY. Neurogenomic pathway of autism spectrum disorders: linking germline and somatic mutations to genetic-environmental interactions. Curr Bioinforma. 2017;12(1):19–26.
Iourov IY, Liehr T, Yurov YB. Aneuploidy in the normal, Alzheimer's disease and ataxia-telangiectasia brain: differential expression and pathological meaning. Neurobiol Dis. 2009;34(2):212–20.
Iourov IY, Vorsanova SG, Yurov YB. Genomic landscape of the Alzheimer's disease brain: chromosome instability — aneuploidy, but not tetraploidy — mediates neurodegeneration. Neurodegener Dis. 2011;8(1–2):35–7.
Yurov YB, Vorsanova SG, Iourov IY. The DNA replication stress hypothesis of Alzheimer's disease. ScientificWorldJournal. 2011;11:2602–12.
Hu YS, Xin J, Hu Y, Zhang L, Wang J. Analyzing the genes related to Alzheimer's disease via a network and pathway-based approach. Alzheimers Res Ther. 2017;9(1):29.
Oti M, Brunner HG. The modular nature of genetic diseases. Clin Genet. 2007;71(1):1–11.
Dozmorov MG. Disease classification: from phenotypic similarity to integrative genomics and beyond. Brief Bioinform. 2018. https://doi.org/10.1093/bib/bby049.
Hu JX, Thomas CE, Brunak S. Network biology concepts in complex disease comorbidities. Nat Rev Genet. 2016;17(10):615–29.
Loscalzo J, Kohane I, Barabasi AL. Human disease classification in the postgenomic era: a complex systems approach to human pathobiology. Mol Syst Biol. 2007;3:124.
Wu X, Jiang R, Zhang MQ, Li S. Network-based global inference of human disease genes. Mol Syst Biol. 2008;4:189.
Zanzoni A, Soler-López M, Aloy P. A network medicine approach to human disease. FEBS Lett. 2009;583(11):1759–65.
Barabasi AL, Gulbahce N, Loscalzo J. Network medicine: a network-based approach to human disease. Nat Rev Genet. 2011;12(1):56–68.
Atias N, Istrail S, Sharan R. Pathway-based analysis of genomic variation data. Curr Opin Genet Dev. 2013;23(6):622–6.
Iourov IY, Vorsanova SG, Yurov YB. In silico molecular cytogenetics: a bioinformatic approach to prioritization of candidate genes and copy number variations for basic and clinical genome research. Mol Cytogenet. 2014;7(1):98.
Elbers CC, van Eijk KR, Franke L, Mulder F, van der Schouw YT, Wijmenga C, Onland-Moret NC. Using genome-wide pathway analysis to unravel the etiology of complex diseases. Genet Epidemiol. 2009;33(5):419–31.
Jin L, Zuo XY, Su WY, Zhao XL, Yuan MQ, Han LZ, Zhao X, Chen YD, Rao SQ. Pathway-based analysis tools for complex diseases: a review. Genomics Proteomics Bioinformatics. 2014;12(5):210–20.
Lee S, Kim Y, Choi S, Hwang H, Park T. Pathway-based approach using hierarchical components of rare variants to analyze multiple phenotypes. BMC Bioinformatics. 2018;19(Suppl 4):79.
Yurov YB, Iourov IY, Vorsanova SG. Neurodegeneration mediated by chromosome instability suggests changes in strategy for therapy development in ataxia-telangiectasia. Med Hypotheses. 2009;73(6):1075–6.
Iourov IY, Vorsanova SG, Voinova VY, Yurov YB. 3p22.1p21.31 microdeletion identifies CCK as Asperger syndrome candidate gene and shows the way for therapeutic strategies in chromosome imbalances. Mol Cytogenet. 2015;8:82.
Marshall LJ, Austin CP, Casey W, Fitzpatrick SC, Willett C. Recommendations toward a human pathway-based approach to disease research. Drug Discov Today. 2018;23(11):1824–32.
Hochstein E. Why one model is never enough: a defense of explanatory holism. Biol Philos. 2017;32(6):1105–25.
Ross LN. Causal selection and the pathway concept. Philos Sci. 2018;85(4):551–72.
Authors are supported by RFBR and CITMA according to the research project №18–515-34005.
Availability of data and materials
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.