Body maps on the human genome
© Cherniak and Rodriguez-Esteban; licensee BioMed Central Ltd. 2013
Received: 15 October 2013
Accepted: 5 December 2013
Published: 20 December 2013
Chromosomes have territories, or preferred locales, in the cell nucleus. When these sites are taken into account, some large-scale structure of the human genome emerges.
The synoptic picture is that genes highly expressed in particular topologically compact tissues are not randomly distributed on the genome. Rather, such tissue-specific genes tend to map somatotopically onto the complete chromosome set. They seem to form a “genome homunculus”: a multi-dimensional, genome-wide body representation extending across chromosome territories of the entire spermcell nucleus. The antero-posterior axis of the body significantly corresponds to the head-tail axis of the nucleus, and the dorso-ventral body axis to the central-peripheral nucleus axis.
This large-scale genomic structure includes thousands of genes. One rationale for a homuncular genome structure would be to minimize connection costs in genetic networks. Somatotopic maps in cerebral cortex have been reported for over a century.
The human genome may show “little evidence of organization”  and be in “an alarming state of disarray” , but it seems to have a global landscape, with large-scale patterns encompassing all chromosomes together. One key to revealing this structure is chromosome territories, that is, their sites in the cell nucleus. Tissue-specific genes of the adult human body then appear to map somatotopically onto the genome, in multiple dimensions. The holistic arrangement of tissue gene-positions in the complete chromosome set significantly mirrors the antero-posterior, and dorso-ventral, configuration of the tissue-locations in the body. Unlike hox complexes  or collinearity phenomena , this anatomical mapping includes thousands of genes in the entire chromosome set of the genome. Such a multi-chromosomal bodymap may help as a navigation guide in uncovering genes involved in pathologies of corresponding tissues.
There appears to be little prior study of this extensive structure. Danchin et al.  discussed such a mapping idea for the prokaryotic chromosome. Caron et al.  described clustering on individual human chromosomes of highly expressed genes into regions of increased gene expression. In a survey of gene expression in human tissues, Shyamsundar et al.  reported clustering according to anatomic locations or types of tissues (e.g., “lymphoid tissues”, including thymus, spleen, etc.); but not any higher-order pattern of whole-organism, or whole-genome, mapping.
The map results here are based on combining published data about chromosome territory locations in the nucleus, and about tissue-specific gene expression levels. The chromosome territory model of the nucleus is that chromosomes are not randomly sited, but rather each has preferred locations [8, 9]. A notable result is that, using fluorescent tag techniques, Bolzer et al. (Figure one, ref. ) depict territory sites for all chromosomes in one human fibroblast nucleus.
For the first analysis here, nine adult normal tissues were selected; unlike many in the Shyamsundar et al. study , each is compact and localizable (as opposed to, e.g., skin or blood). Each tissue also has the largest number of tissue-specific genes of all compact tissues analyzed (see below). Genes of related contiguous tissues were aggregated (e.g., our “brain” gene count includes hippocampus, thalamus, etc. of the Liang et al. report ). For the approximate centroid of each organ, the antero-posterior order of positions in the body is: brain, thymus, heart, liver, spleen, pancreas, kidney, ovary, testis. Thus, tissues were sampled across different organ systems -- nervous, endocrine, circulatory, digestive, excretory, and reproductive.
The antero-posterior and dorso-ventral axes were each analyzed separately. Because of bilateral symmetry in the vertebrate bodyplan, the lateral (left/right) body axis has a more limited set of distinct tissue loci. For instance, brain, thymus, kidney, ovary, testis all have lateral centroids approximately at the midline; as opposed to heart, liver, spleen, pancreas.
We explored the conjecture that chromosome locations are more stable in germ than somatic cells. Chromosome territories were mapped with data derived from a comprehensive study by Manvelyan et al.  of sperm cell nuclei. Chromosome architecture in sperm cells has a distinctive packing, with the chromosomes condensed, that is, tightly coiled. Manvelyan et al. employed multicolor banding techniques to obtain information on all 24 chromosome locations; each chromosome was observed in 30 nuclei. (Because of its smallest gene count, the Y chromosome was excluded from the analyses here.) Their Figure four summarized distribution of chromosomes on the “head” - “tail” axis, i.e., from apex to base of nucleus. The location of each chromosome in the 30 nuclei sampled had only been classified in terms of head, middle, or tail zones of the nucleus. On the model of the “moment” of classical mechanics, we transformed the head - middle - tail distribution of each chromosome into a single resultant position score i = h*1 + m*2 + t*3. The Manvelyan Figure two includes corresponding chromosome location data for the orthogonal “central” - “peripheral” axis of the nucleus. Measurements from each figure were compiled to determine a mean position-score of each chromosome on each axis. For example, chromosome X occupies the first position at the tail of the nucleus, and chromosome 13 the last position at the head; chromosome 7 is in the first position at the periphery, and chromosome 22 is in the last position, at the center. In the Additional file 1: Tables S1 and S2 map these locations for the entire chromosome set in the head-tail and central-peripheral axes, respectively.
A tissue’s genes are not in general entirely exclusive to that tissue; shared genes tend to decrease contrast between tissues, and to blur any bodymapping. For each tissue, its set of maximally-selective genes was first drawn from results of Liang et al. . This study included one of the largest sets of tissue-specific genes for brain. Using Tukey HSD tests on U133A and U133B DNA microarray data, the study  identified nearly 4,000 genes that are each significantly preferentially expressed in six or less tissue types out of 97. (A finding supportive of this methodology is Zou et al. (Figure S2, ref. ), which associates high pleiotropy with low expression levels in C. elegans genes, so high expression suggests a role in a narrower set of traits.) The count of high-contrast tissue-selective genes for each tissue on each chromosome was compiled (e.g., 98 brain tissue genes on chromosome 1); see in Additional file 1: Table S3. Additional file 1: Table S4 lists for each chromosome and tissue the ratio of such tissue-specific genes to the chromosome’s combined total tissue specific genes for all tissues in the Liang study (e.g., for brain genes in chromosome 1, the high proportion 0.153).
Liang et al. database
Of course, genes of each of the nine localized tissues are not mainly concentrated on a single particular chromosome (see Additional file 1: Table S3). But, at the opposite extreme, genes of each tissue also are not uniformly distributed on all chromosomes. For instance, the proportion of brain genes ranges from 36.7% in chromosome 13, to 1.8% in chromosome 21 (see x-axis of Figure 2). Similarly, the highest mean proportion of tissue genes in all chromosomes combined is 17.0% brain genes, while the lowest mean proportion is 2.4% pancreas genes.
In addition, tissue gene distributions on the chromosomes show a significant intermediate division of labor. For instance, genome-wide positions of genes that express most strongly in brain, heart, kidney, ovary, etc. respectively tend significantly to correspond to the antero-posterior order of those organs in the body. In particular, for anterior organs (e.g., brain), the gradient of their tail-to-head gene distribution in the spermcell nucleus is increasing (see Figure 2): That is, the more anterior the tissue, the greater the proportion of its genes in chromosomes of the nucleus head. For mid-positioned organs (e.g., heart), their gene distribution slope shifts from increasing to flat. Then, for posterior organs (e.g., ovary), the relation reverses to decreasing (see Figure 3).
The two body axes were each also cross-tested for goodness of fit to the two nucleus axes. The contrast is great: For unweighted data, when the antero-posterior body axis is evaluated instead for correlation with the central-peripheral nucleus axis, r2 drops appreciably, from 0.49 to 0.09. Similarly, for the dorso-ventral body axis correlation with the head-tail nucleus axis, its r2 also diminishes markedly, from 0.40 to 0.09. As mentioned, data for lateral (left/right) body axis is limited; its correlation with each nucleus axis is similarly poor.
There is also evidence of mapping of brain subregions, e.g., telencephalon and metencephalon, extending from head to tail of the nucleus like the overall brain genes gradient of Figure 2 above. These “stacked” subregion gradients each have the same antero-posterior orientation as the brain gradient; that is, telencephalon and metencephalon genes concentrate more on chromosomes at the head than tail of the nucleus. (In addition, we have found a significant pattern of bodymaps on individual chromosomes.) This constitutes further convergent support for a genome homunculus hypothesis.
Xiao et al. database
To check stability of the bodymapping result, we also performed a replication with another tissue-selective gene compilation, the TiSGeD database of Xiao et al. . Unlike the procedures of Liang et al. , this study identified tissue-specific genes by transforming the expression profile of each gene into a vector, and using its scalar projection for a given tissue. Selectivity of a gene for a tissue is set by a specificity measure SPM, ranging from 0 to 1, where a higher value narrows selectivity. We used SPM ≥ 0.6, which increases the set of tissue-selective genes for normal adult tissues to 4,664 -- comparable to our Liang geneset. The 11 topologically compact tissue groups with the largest tissue-specific genesets differ somewhat from the Liang set. They were, in antero-posterior order: brain, salivary gland and tongue (together), thyroid, thymus, heart, lung, liver, pancreas, kidney, ovary, testis.
The TiSGeD database also includes genes selectively expressed in particular cancerous, as opposed to normal, adult tissues. A natural question concerns whether some oncogenic (and/or genetic) disorders are associated with disruption of the supra-chromosomal bodymap. In particular, for genes expressed in cancer tissues, is the mapping more disordered? We assigned each cancer tissue gene set to the locus of its corresponding normal tissue group: neuroblastoma – Brain; hepatoma, HepG2 – Liver; kidney carcinoma – Kidney; prostate cancer - Testis; colorectal adenocarcinoma, leukemia, and lymphoma - Other. Even though the total gene count increases by nearly 15%, from 4,664 to 5,463, the antero-posterior correlation for this combined tissue geneset drops (from r 2 = 0.63) to r 2 = 0.53; p < 0.01 (two-tailed). We also constructed a series, by successively adding one cancer gene set after another (in the above sequence) to the gene set of the normal tissue groups: The body-genome antero-posterior correlations of each of these 7 gene sets then themselves tend to grow progressively weaker, with a significant negative trend, r 2 = −0.66; p < 0.03 (two-tailed). This picture motivates further examination of the idea that genes of some cancer tissues tend not to conform to the genome bodymap pattern.
The perspective shift here is to view the whole genome as a unified system with its chromosomes meshing together, instead of as isolated, separate components. This approach yields evidence of a genome-wide map of the human body.
The correlation of dorso-ventral tissue positions with tissue genes’ central-peripheral nucleus sites can be compared with other models of chromosome location on the central-peripheral axis. One is that more gene-dense chromosomes tend to locate more toward the nucleus center . Another finding is that chromosomes with more active genes tend to locate more toward the center .
We have noted that we evaluated the genome bodymap model for the mature adult organism. Of course, over the developmental trajectory, tissues are moving targets, with changing sites in the embryo. This suggests a question, Are tissue-specific gene sites on the genome adapted for functions of the adult organism, but not for those of its earlier embryological development?
Somatotopic maps have been observed in mammal sensorimotor cortex [17, 18] since the 19th century. One possible function or evolutionary design rationale for a default genome homunculus might be to help minimize message-passing costs by shortening interconnections among related genes in genetic systems; neighboring tissues in the organism may be more likely to be so related. In this way, connections would shape architecture. The question then is whether information transmission is not cost-free even within a cell, nucleus, or genome. Fine-grained connection optimization has been observed in nervous system wiring . -- Thus: Genome as “nanobrain”.
This work raises natural next questions concerning prevalence of genome body maps. Does the genome, like the cortex, contain multiple maps -- e.g., “motor” output vs input maps, or overlapping submaps? Does the familiar antero-posterior polarity of the egg cell in fact also resolve into a body-tissue ordering, and a mapping, when the large scale chromosome territory structure of the genome is taken into account? As opposed to a default configuration for haploid germ cells, how much of this bodyplan modeling do specialized, mature somatic cells retain? Attention naturally turns to global genome structure at later developmental stages. Structure of the germ cell genome may serve as a scaffold for subsequent efficient structure of the somatic cell genome. And, in contrast to ontogenetic development, from a phylogenetic perspective, does this type of genome bodymap already appear for simpler eukaryotes?
- Gallagher R, Dennis C: The Draft Human Genome: A Repetitious Genome. : Wellcome Trust Website: The Human Genome; 2001. . Accessed 28 July 2011 [http://genome.wellcome.ac.uk/doc_WTD020733.html]Google Scholar
- Alberts B, Johnson A, Lewis J, Raff M, Roberts K, Walter P: Molecular Biology of the Cell. 5th edition. New York: Garland Science; 2007:206.Google Scholar
- McGinnis W, Levine MS, Hafen E, Kuroiwa A, Gehring WJ: A conserved DNA sequence in homoeotic genes of the Drosophila Antennapedia and bithorax complexes. Nature 1984, 308: 428–433. 10.1038/308428a0View ArticlePubMedGoogle Scholar
- Lewis E: A gene complex controlling segmentation in Drosophila. Nature 1978, 276: 565–570. 10.1038/276565a0View ArticlePubMedGoogle Scholar
- Danchin A, Guerdoux-Jamet P, Moszer I, Nitschké P: Mapping the bacterial cell architecture into the chromosome. Philos Trans R Soc Lond B Biol Sci 2000, 355: 179–190. Review 10.1098/rstb.2000.0557PubMed CentralView ArticlePubMedGoogle Scholar
- Caron H, van Schaik B, van der Mee M, Baas F, Riggins G, van Sluis P, Hermus MC, van Asperen R, Boon K, Voûte PA, Heisterkamp S, van Kampen A, Versteeg R: The human transcriptome map: clustering of highly expressed genes in chromosomal domains. Science 2001, 291: 1289–1292. 10.1126/science.1056794View ArticlePubMedGoogle Scholar
- Shyamsundar R, Kim YH, Higgins JP, Montgomery K, Jorden M, Sethuraman A, van de Rijn M, Botstein D, Brown PO, Pollack JR: A DNA microarray survey of gene expression in normal human tissues. Genome Biol 2005, 6: R22. Epub 2005 Feb 14. Erratum in: Genome Biol 6, 404, 404.2 10.1186/gb-2005-6-3-r22PubMed CentralView ArticlePubMedGoogle Scholar
- Cremer T, Cremer C, Schneider T, Baumann H, Hens L, Kirsch-Volders M: Analysis of chromosome positions in the interphase nucleus of Chinese hamster cells by laser-UV-microirradiation experiments. Hum Genet 1982, 62: 201–209. 10.1007/BF00333519View ArticlePubMedGoogle Scholar
- Cremer T, Cremer C: Chromosome territories, nuclear architecture and gene regulation in mammalian cells. Nat Rev Genet 2001, 2: 292–301. 10.1038/35066075View ArticlePubMedGoogle Scholar
- Bolzer A, Kreth G, Solovei I, Koehler D, Saracoglu K, Fauth C, Müller S, Eils R, Cremer C, Speicher MR, Cremer T: Three-dimensional maps of all chromosomes in human male fibroblast nuclei and prometaphase rosettes. PLoS Biol 2005, 3: e157. 10.1371/journal.pbio.0030157PubMed CentralView ArticlePubMedGoogle Scholar
- Liang S, Li Y, Be X, Howes S, Liu W: Detecting and profiling tissue-selective genes. Physiol Genomics 2006, 26: 158–162. 10.1152/physiolgenomics.00313.2005View ArticlePubMedGoogle Scholar
- Manvelyan M, Hunstig F, Bhatt S, Mrasek K, Pellestor F, Weise A, Simonyan I, Aroutiounian R, Liehr T: Chromosome distribution in human sperm – a 3 D multicolor banding-study. Mol Cytogenet 2008, 1: 25. 10.1186/1755-8166-1-25PubMed CentralView ArticlePubMedGoogle Scholar
- Zou L, Sriwasdi S, Ross B, Missiuro PV, Liu J, Ge H: Systematic analysis of pleiotropy in C. elegans early embryogenesis. PLoS Comput Biol 2008, 4: e1000003. 10.1371/journal.pcbi.1000003PubMed CentralView ArticlePubMedGoogle Scholar
- Xiao S, Zhang C, Zou Q, Ji Z: TiSGeD: a database for tissue-specific genes. Bioinformatics 2010, 26: 1273–1275. 10.1093/bioinformatics/btq109PubMed CentralView ArticlePubMedGoogle Scholar
- Croft JA, Bridger JM, Boyle S, Perry P, Teague P, Bickmore WA: Differences in the localization and morphology of chromosomes in the human nucleus. J Cell Biol 1999, 145: 1119–1131. 10.1083/jcb.145.6.1119PubMed CentralView ArticlePubMedGoogle Scholar
- Gilbert DM: Nuclear position leaves its mark on replication timing. J Cell Biol 2001, 152: F11-F15. 10.1083/jcb.152.2.F11PubMed CentralView ArticlePubMedGoogle Scholar
- Fritsch G, Hitzig E: Ueber die elektrische Erregbarkeit des Grosshirns. Arch Anat Physiol Wiss Med Leipzig 1870, 37: 300–332. In Some Papers on the Cerebral Cortex. (trans von Bonin G). Springfield Il: Charles C Thomas; 1960:73–96Google Scholar
- Penfield W, Rasmussen T: The Cerebral Cortex of Man: A Clinical Study of Localization of Function. New York: Macmillan; 1950.Google Scholar
- Cherniak C, Mokhtarzada Z, Rodriguez-Esteban R, Changizi K: Global optimization of cerebral cortex layout. Proc Natl Acad Sci U S A 1950, 101: 1081–1086.View ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.