GSTM1 copy number variation in the context of single nucleotide polymorphisms in the human GSTM cluster
© Khrunin et al. 2016
Received: 1 March 2016
Accepted: 1 April 2016
Published: 19 April 2016
GSTM1 gene deletion is one of the most known copy number polymorphisms in human genome. It is most likely caused by homologous recombination between the repeats flanking the gene. However, taking into account that the deletion has no crucial effects on human well-being, and the ability of other GSTMs to compensate for the lack of GSTM1, a role for additional factors affecting GSTM1 deletion can be proposed. Our goal was to explore the relationships between GSTM1 deletion polymorphism and single nucleotide polymorphisms (SNPs) in the region of the GSTM cluster that includes GSTM2, GSTM3, GSTM4, and GSTM5 in addition to GSTM1.
Real-time polymerase chain reaction was used to quantify the number of GSTM1 copies. Fourteen SNPs from the region were tested and their allelic patterns were compared in groups of Russian individuals subdivided according to their GSTM1 deletion genotypes. Linkage disequilibrium-based haplotype analysis showed substantial differences of haplotype frequencies between the groups, especially between individuals with homozygous GSTM1 −/− and +/+ genotypes. Exploration of the results of phasing of GSTM1 and SNP genotypes revealed unequal segregation of GSTM1 + and − alleles at different haplotypes.
The observed differences in haplotype patterns suggest the potential role of genetic context in GSTM1 deletion frequency (appearance) and in the determination of the deletion-related effects.
Glutathione-S-transferases (GSTs) are a group of enzymes that play an important role in the metabolism and detoxification of a wide range of exo- and endogenous compounds. Functionally, GSTs act by conjugating electrophilic centers of the compounds with molecules of reduced glutathione. The reaction products are generally less reactive and readily excreted [1, 2]. Although many GST substrates can be potentially harmful to cellular macromolecules, some GST genes, particularly GSTT1 and GSTM1, are homozygously deleted in a high percentage (15 to 60 %) of individuals in various human populations . GSTT1 and GSTM1 copy number variations are most likely caused by homologous recombination between the repeats flanking the both genes [4, 5]. However, the reasons for the high frequency of the deletions across different human populations are not well understood, especially if considered in the context of their associations with a higher risk for the development of a variety of pathophysiological conditions in individuals carrying deletion phenotypes . A possible explanation could be the capability of other GSTs to compensate, at least in part, for the absence of GSTT1- and GSTM1-related enzyme activities under normal conditions . Recent studies [7, 8] showed distinct connections between the levels of GST enzymatic activity and the activity of genes belonging to the same class (i.e., GSTT2B and GSTM2), evidently suggesting a role for local genetic context in the deletion frequency (deletion appearance).
In the current study, we investigated relationships between GSTM1 deletion polymorphism and SNP-based haplotypes in the region of the GSTM cluster that includes GSTM2, GSTM3, GSTM4, and GSTM5 in addition to GSTM1. The obtained results demonstrated substantial differences in haplotype distribution between groups of individuals subdivided according to GSTM1 deletion genotypes.
Blood samples were obtained with informed consent from Russian donors from three locations in the European part of Russia (Andreapolsky District of the Tver Oblast, n = 96; Muromsky District of the Vladimir Oblast, n = 96; and Kursky and Oktyabrsky districts of the Kursk Oblast, n = 93). The ethnicity of the donors was determined by an interview. To be included, individuals had to be unrelated and represent the native ethnic group in the regions studied (i.e., they belonged to at least the third generation living in a particular geographic region). The interview protocol and informed-consent form were approved by the Ethic Commission of Institute of Molecular Genetics of the Russian Academy of Sciences.
The DNA was isolated from peripheral leukocytes of the blood with a standard technique using proteinase K treatment and phenol–chloroform extraction .
Primers and probes used in the study
Forward primer: 5′-TTGTTTCACTGTCCTGAGGACTATTTAT-3′
Reverse primer: 5′-ATGTTACTCTGTCAATGTTCTCCACAT-3'
Forward primer: 5'-CTGAGCCCTGCTCGGTTTAG-3'
Reverse primer: 5'-ATGGGCATGGTGCTGGTT-3'
PCR was performed in 25 μL of 1× PCR buffer, containing 2.5 mM MgCl2, 200 μM each of dNTP, 20 pM each of GSTM1 and B2M primers, 1.25 units of Hot-Rescue Taq DNA polymerase (Syntol, Moscow, Russia), 10 pM and 5 pM of GSTM1- and B2M-specific probes, respectively, and 10–20 ng of genomic DNA. Thermal cycling and fluorescence intensity measurement were conducted using a StepOnePlus Real-Time PCR System (Applied Biosystems, Waltham, MA, USA). The samples were initially incubated for 10 min at 95 °C and then cycled 35 times at 95 °C for 20 s, followed by 60 °C for 60 s. All samples were tested in pentaplicate.
To quantify the number of GSTM1 copies a comparative Ct method was used . The ratio (R) of the GSTM1 to B2M gene dosage was calculated using the formula R = 2−ΔΔCt, where ΔΔCt = (Ctcontrol_B2M – Ctcontrol_GSTM1) – (Ctsample_B2M – Ctsample_GSTM1). We used a control sample known to be heterozygous for GSTM1. Based on the observed variability, R values higher than 1.4 were interpreted as an indication that the sample carried two functional variants (two copies) of GSTM1. Ratios between 0.7 and 1.3 were considered attributable to samples with heterozygous deletions (containing one GSTM1 copy).
Population SNP genotypes were obtained from our previous study, in which they were generated using Illumina Human CNV370-Duo and Human 660 W-Quad chips . The set of SNPs was chosen by considering the chromosomal region in which the genes of the GSTM family were located.
Data on individual GSTM1 deletion genotypes and SNP genotypes in other populations (i.e., CEU) were obtained from the database of the 1000 Genomes Project .
To explore patterns of genetic variation across GSTM cluster, a haplotype analysis was performed. Two approaches were used. The first was based on an analysis of haplotypes in the haplotype blocks (haploblocks). The haplotype blocks were defined using a block definition based on the linkage disequilibrium (LD) measure D′ and its confidence interval. The corresponding pairwise LD statistics between SNPs and the frequencies of haplotypes were estimated using Haploview software (version 4.2) . Comparisons of haplotype frequencies between groups of individuals were performed using GraphPad InStat (version 3.00, GraphPad Software, San Diego, CA, USA). P < 0.05 was considered significant.
The second approach consisted of exploring the patterns of haplotypes at which the GSTM1 null and non-null allele(s) segregated in populations. To obtain such structural haplotypes, the GSTM1 and SNP genotypes of populations were phased together using the Beagle software package (version 4, release 1399) using default parameters . Visualization of the sets of phased GSTM1 and SNP alleles was conducted using a custom R script kindly provided by Robert E. Handsaker (Broad Institute of MIT and Harvard, Cambridge, MA, USA) .
In previous studies based on both whole-genome polymorphism analysis and testing SNPs in GSTA and GSTM clusters, high similarity between three Russian populations from the Central European part of Russia was demonstrated [12, 17]. To create a more effective sample, particularly in the context of generally under-represented GSTM1 +/+ genotype carriers, three Russian populations were combined. In total, 128 individuals with a GSTM1 −/− genotype, 121 individuals with a GSTM1 −/+ genotype, and 36 individuals with a GSTM1 +/+ genotype were detected in the sample. Genotype frequencies did not differ from those predicted by the Hardy–Weinberg rule (P = 0.45).
Haplotype frequencies in groups of individuals subdivided according to GSTM1 deletion polymorphism
Number of haploblock
Individuals with GSTM1 −/− genotype (2 N = 256)
Individuals with GSTM1 −/+ genotype (2 N = 242)
Individuals with GSTM1 +/+ genotype (2 N = 72)
Statistics (P-values) of intergroup comparisons of haplotype frequenciesa
Number of haploblock
Individuals with GSTM1 −/− genotype vs. Individuals with GSTM1 +/+ genotype
Individuals with GSTM1 −/− genotype vs. Individuals with GSTM1 −/+ genotype
Individuals with GSTM1 +/+ genotype vs. Individuals with GSTM1 −/+ genotype
The GSTM1 homozygous deletion is very common in human populations, varying between 16 and 40 % in Africans (sub-Saharan Africa), 42 and 55 % in Europeans, and 42 and 65 % in East Asian populations . The frequency of the deletion suggests that GSTM1 has not been subjected to strong environmental selection pressure during evolution, and thus it might have resulted from an ancestral deletion that was widely spread across populations because of a founder effect.
An alternative explanation can be proposed in the context of the molecular structure of the GSTM1 region. Xu and coauthors  found that GSTM1 in GSTM1 +/+ individuals was flanked by two highly homologous 4.2 kb regions. By contrast, individuals with a completely deleted GSTM1 had only one segment that was identical to the regions in GSTM1 +/+ individuals. Such a structural organization of the region (i.e., existence of two highly homologous repeats) allows for the possibility of nonallelic homologous recombination in the chromosomal area, resulting in gene deletion , particularly the GSTM1 deletion, which in principle might occur independently on multiple occasions. The results of our intergroup haplotype comparisons support the hypothesis that the deletion might occur at different haplotypes because there were no differences in haplotype spectra between the groups. At the same time, substantial differences in the frequencies of the haplotypes were found. The greatest differences were observed between groups of individuals with homozygous GSTM1 −/− and +/+ genotypes, which differed in haplotype frequency in all four haploblocks inferred. We hypothesized that the differences may show the potential influence of genomic context on the occurrence of the deletion. The assumption was supported by the results of exploring structural haplotypes in the region of the GSTM1 deletion, where unequal occurrence of GSTM1 − and + alleles at different SNP haplotypes was demonstrated. In what way could the context affect the deletion frequency (appearance)? It is unlikely that individual SNPs could affect recombination if they were not in a recombination hotspot . However, taking into account the data on the possibility of functional compensation of GSTM1 −/− by GSTM2 , one can suggest that the correlations found may reflect the existence of functional associations between GSTM1 and allelic variants of other genes of the GSTM cluster. Such associations can explain both why GSTM1 has not been subjected to strong environmental selection and the high frequency of GSTM1 deletion. The associations seem to be also relevant to the conflicting results reported in studies that correlated GSTM1 with risk of cancer and other diseases [6, 20–24] in which the absence of GSTM1-related enzymatic activity could be masked by catalytic activities of other GSTMs and resulted in no or reduced impact of GSTM1 deletion on disease risk. Finally, taken in the context of the results of associative studies, our findings highlight the necessity of parallel examination of allelic status of functionally and structurally related members of the gene family.
In summary, we have reported the results of exploring the haplotype structure in the GSTM cluster region in relation to GSTM1 deletion polymorphism. By using both haploblock-based and extended phased haplotypes, substantial differences in haplotype distribution were observed when they were correlated with the GSTM1 genotypes and alleles. The results suggest the potential role of genetic context in GSTM1 deletion frequency (appearance), and in the determination of deletion-related effects.
The authors are grateful to Robert E. Handsaker for a custom R script and for productive discussion of the results obtained. This study was supported by grants from the Program “Molecular and Cell Biology” of the Russian Academy of Sciences, the Federal Support of Leading Scientific Schools and the Russian Foundation for Basic Research.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Hayes JD, Flanagan JU, Jowsey IR. Glutathione transferases. Annu Rev Pharmacol Toxicol. 2005;45:51–88.View ArticlePubMedGoogle Scholar
- Oakley A. Glutathione transferases: a structural perspective. Drug Metab Rev. 2011;43:138–51.View ArticlePubMedGoogle Scholar
- Saitou M, Ishida T. Distributions of the GSTM1 and GSTT1 null genotypes worldwide are characterized by latitudinal clines. Asian Pac J Cancer Prev. 2015;16:355–61.View ArticlePubMedGoogle Scholar
- Xu S, Wang Y, Roe B, Pearson WR. Characterization of the human class Mu glutathione S-transferase gene cluster and the GSTM1 deletion. J Biol Chem. 1998;273:3517–27.View ArticlePubMedGoogle Scholar
- Sprenger R, Schlagenhaufer R, Kerb R, Bruhn C, Brockmöller J, Roots I, Brinkmann U. Characterization of the glutathione S-transferase GSTT1 deletion: discrimination of all genotypes by polymerase chain reaction indicates a trimodular genotype-phenotype correlation. Pharmacogenetics. 2000;10:557–65.Google Scholar
- Bolt HM, Thier R. Relevance of the deletion polymorphisms of the glutathione S-transferases GSTT1 and GSTM1 in pharmacology and toxicology. Curr Drug Metab. 2006;7:613–28.View ArticlePubMedGoogle Scholar
- Zhao Y, Marotta M, Eichler EE, Eng C, Tanaka H. Linkage disequilibrium between two high-frequency deletion polymorphisms: implications for association studies involving the glutathione-S transferase (GST) genes. PLoS Genet. 2009;5:e1000472.View ArticlePubMedPubMed CentralGoogle Scholar
- Bhattacharjee P, Paul S, Banerjee M, Patra D, Banerjee P, Ghoshal N, Bandyopadhyay A, Giri AK. Functional compensation of glutathione S-transferase M1 (GSTM1) null by another GST superfamily member, GSTM2. Sci Rep. 2013;3:2704.Google Scholar
- Milligan BG. Total DNA, isolation. In: Hoelzel AR, editor. Molecular genetic analysis of populations. London: Oxford University Press; 1998. p. 29–60.Google Scholar
- Khrunin AV, Khokhrin DV, Limborskaia SA. Glutathione-S-transferase gene polymorphism in Russian populations of European Russia. Genetika. 2008;44:1429–34.PubMedGoogle Scholar
- Shadrina MI, Semenova EV, Slominsky PA, Bagyeva GH, Illarioshkin SN, Ivanova-Smolenskaia II, Limborska SA. Effective quantitative real-time polymerase chain reaction analysis of the parkin gene (PARK2) exon 1-12 dosage. BMC Med Genet. 2007;8:6.Google Scholar
- Khrunin AV, Khokhrin DV, Filippova IN, Esko T, Nelis M, Bebyakova NA, et al. A genome-wide analysis of populations from European Russia reveals a new pole of genetic diversity in northern Europe. PLoS One. 2013;8:e58552.Google Scholar
- 1000 Genomes: A Deep Catalog of Human Genetic Variation http://www.1000genomes.org. Accessed 25 June 2015.
- Barrett J, Fry B, Maller J. Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics. 2005;21:263–5.View ArticlePubMedGoogle Scholar
- Browning SR, Browning BL. Rapid and accurate haplotype phasing and missing data inference for whole genome association studies using localized haplotype clustering. Am J Hum Genet. 2007;81:1084–97.View ArticlePubMedPubMed CentralGoogle Scholar
- Handsaker RE, Van Doren V, Berman JR, Genovese G, Kashin S, Boettger LM, McCarroll SA. Large multiallelic copy number variations in humans. Nat Genet. 2015;47:296–303.Google Scholar
- Filippova IN, Khrunin AV, Limborska SA. Analysis of DNA variations in GSTA and GSTM gene clusters based on the results of genome-wide data from three Russian populations taken as an example. BMC Genet. 2012;13:89.View ArticlePubMedPubMed CentralGoogle Scholar
- Gu W, Zhang F, Lupski JR. Mechanisms for human genomic rearrangements. Pathogenetics. 2008;1:4.View ArticlePubMedPubMed CentralGoogle Scholar
- Myers S, Spencer CC, Auton A, Bottolo L, Freeman C, Donnelly P, McVean G. The distribution and causes of meiotic recombination in the human genome. Biochem Soc Trans. 2006;34:526–30.Google Scholar
- Pietro G, Magno LA, Rios-Santos F. Glutathione S-transferases: an overview in cancer research. Expert Opin Drug Metab Toxicol. 2010;6:153–70.View ArticlePubMedGoogle Scholar
- Carlsten C, Sagoo GS, Frodsham AJ, Burke W, Higgins JP. Glutathione S-transferase M1 (GSTM1) polymorphisms and lung cancer: a literature-based systematic HuGE review and meta-analysis. Am J Epidemiol. 2008;167:759–74.View ArticlePubMedGoogle Scholar
- Economopoulos KP, Sergentanis TN. GSTM1, GSTT1, GSTP1, GSTA1 and colorectal cancer risk: a comprehensive meta-analysis. Eur J Cancer. 2010;46:1617–31.View ArticlePubMedGoogle Scholar
- Minelli C, Granell R, Newson R, Rose-Zerilli MJ, Torrent M, Ring SM, et al. Glutathione-S-transferase genes and asthma phenotypes: a Human Genome Epidemiology (HuGE) systematic review and meta-analysis including unpublished data. Int J Epidemiol. 2010;39:539–62.View ArticlePubMedGoogle Scholar
- Liu H, Ma HF, Chen YK. Association between GSTM1 polymorphisms and lung cancer: an updated meta-analysis. Genet Mol Res. 2015;14:1385–92.View ArticlePubMedGoogle Scholar
- Covault J, Abreu C, Kranzler H, Oncken C. Quantitative real-time PCR for gene dosage determinations in microdeletion genotypes. Biotechniques. 2003;35:594–7.PubMedGoogle Scholar
- Nørskov MS, Frikke-Schmidt R, Loft S, Tybjaerg-Hansen A. High-throughput genotyping of copy number variation in glutathione S-transferases M1 and T1 using real-time PCR in 20,687 individuals. Clin Biochem. 2009;42:201–9.View ArticlePubMedGoogle Scholar