Construction of BAC contig maps of homoeologous chromosomes A12 and D12 of Gossypium hirsutum L. acc. TM-1
© Lv et al. 2015
Received: 24 April 2015
Accepted: 9 July 2015
Published: 28 July 2015
The Gossypium hirsutum homoeologous chromosome 12 encodes important genes that contribute to fiber fuzz, lethality, gland development and male sterility. In this study a physical map of the cotton TM-1 chromosome 12 was constructed. A number of large-insert cotton genome libraries are available, and genome-wide physical mapping using large insert segments combined with bacterial cloning is a thriving area of genome research. However, sequencing of the cotton genome is difficult due to sequence repeats and homoeologous regions. In order to effectively distinguish the homologous segments, a new method for adjusting the parameters of the FPC software was applied for contig map construction.
All available markers on chromosomes A12 and D12 were used to screen the TM-1 BAC library by PCR. A total of 775 clones (387 for A12, 388 for D12) were obtained using Hind III fingerprinting and used for construction of the contig map. Seven pairs of SSR markers located on A12 and D12 were chosen for contig analysis. Following optimization of the tolerance (10) and cutoff (1e-12) parameters, combining all clones from A12 and D12 produced two separate contigs.
The BAC contig map of chromosomes A12 and D12 was constructed and FPC software parameters were optimized for analysis. The resulting approach is a powerful platform for genome-wide and evolutionary research on cotton.
BACs (bacterial artificial chromosomes) are important resources for map-based cloning and large-scale sequencing of complex genomes due to their ability to stably maintain large DNA fragments that facilitate easy manipulation [1, 2]. BAC clones can be arranged into contigs which are contiguous, gap-free overlapping clones [3, 4] which can be used to identify the minimum tilling path fingerprinted contigs required for further gene cloning. BAC libraries have been constructed from various different cotton species, including Maxxa , Suyuan7235 , Zhongmiansuo 12 , 0-613-2R , and Pima 90–53 . These libraries can be used to construct a high quality physical map of the cotton genome.
Cotton is the source of the world most important plant-derived fiber, and is also an important oilseed crop, as well as a model species for the study of plant polyploidy, cellulose biosynthesis and cell wall biogenesis . The genus Gossypium consists of over 50 species, including 40–45 diploids (2n = 2 × = 26) and five allotetraploids (2n = 4 × = 52)  that were formed from separate A and D genomes through polyploidization 1–2 million years ago [12–14]. The tetraploid cotton genome contains 13 pairs of homologous chromosomes [15–17] which have been incorporated into specific BAC clones, and a new chromosome nomenclature for tetraploid cotton has been proposed . A12 and D12 chromosomes in allotetraploid cotton are among the most important pairs of homoeologous chromosomes, and include the alleles (N 1 ,n 2 ) at naked seeds , Le 1 , Le 2 at hybrid lethality [20, 21], gl 2 , gl 3 at gossypol glanding, ne 1 ,ne 2 at mecariless, Bw 1 , Bw 2 at withering bract, ms 8 ,ms 9 at male sterile [22–26]. For map-based cloning and whole-genome sequencing of tetraploid cotton, a BAC library for G. hirsutum acc. TM-1 has been constructed [8, 27, 28]. Furthermore, a high-density gene-rich genetic map containing 2247 loci and covering 3540.4-cM, with an average inter-marker distance of 1.58-cM, has been constructed from the BC1 (TM-1 × Hai7124) cross [29–34].
In this study, to facilitate construction of BAC contig maps for the G. hirsutum L. Acc. TM-1 homoeologous chromosomes A12 and D12, the molecular markers identified in the aforementioned genetic map were used to screen the BAC library. BACs containing genetic markers that produced characteristic fingerprints when digested with the restriction enzyme Hind III were used to build contigs using the FPC software. A BAC fingerprint-based contig map of both A12 and D12 was successfully constructed by adjusting the tolerance and cutoff parameters in FPC. This approach will facilitate future map-based cloning of important cotton genes and expand our understanding of the relationships between the genetic and physical maps of A12 and D12.
Identifying BAC clones containing genes from chromosomes A12 and D12
BAC library screening was based on the published linkage map , and a total of 101 and 124 SSR markers from tetraploid cotton homoeologous chromosomes A12 and D12, respectively, were selected for screening the TM-1 library by a PCR-based method . The PCR products were abundant and well-resolved, and positive BAC clones were identified by the SSR marker. In total, 775 positive BAC clones were screened out using the SSR markers, and each marker identified and average of five clones.
Adjustment of FPC parameters and BAC fingerprinting
BACs identified by seven pairs of SSR markers
1 81G1, 255 F14, 79G11, 89P7
30 N23,367 M15,318I15
367 M17,91 L21,19 N16,34 J24,34G22, 273 J12, 282G8,91B21, 363Q13
194H15, 031 K3
225C9, 206 L15, 254 L5,192 F22
64E13, 79A4, z84A22
051 L3, 82D24, 081 K8
215O23, 266E15, 45H11, 115D23
14P1, 22Q13, 69 K1, 86E19, 269B19, 271D5, 294Q20, 30I21, 68015, 084P1, 74E18, 74 F9, 96I20
075A11,321 J20,102 N2
For NAU2715, with a tolerance value of 5 and a cutoff value of 1e-12, combining all BAC clones produced two contigs from A12 and D12, respectively (Fig. 6b). However when the tolerance value was increased to 7 and the cutoff value was 1e-12, BAC clones produced two contigs from A12 and D12, but BAC 81G1 appeared in D12 (Fig. 6b). Increasing the tolerance value to 10 and produced a one single contig from D12 (Fig. 6b).
Combining all BAC clones from NAU1237 produced two contigs with a tolerance value of 7 and a cutoff value of 1e-10, which could not distinguish contigs between chromosomes A12 and D12 (Fig. 6c). Two contigs for A12 and D12 were produced if the tolerance value was 5 and the cutoff value was 1e-11 (Fig. 6c). When the cutoff value was 1e-11 and the tolerance value was increased from 6 to 8, or when the tolerance value was 5 and the cutoff value was increased to 1e-12, this also produced separate contigs from A12 and D12, but BAC 045H11 disappeared from chromosome D12. When the tolerance value was 7 or 8 and the cutoff value was increased to 1e-12, the BAC clones generated separate contigs from A12 and D12, but BAC clone 266E15 disappeared from chromosome D12.
Combining all BAC clones from NAU2356 produced a single contig when the tolerance value was 7 and the cutoff value was 1e-10, which could not distinguish between A12 and D12 (Fig. 6d). With a tolerance value of 7 and a cutoff value of 1e-12, all BAC clones from NAU2356 produced two separate contigs for A12 and D12 (Fig. 6d). Increasing the tolerance value from 8 and 9 and setting the cutoff value at 1e-12 produced the contig from A12 and D12, respectively, but BAC 68015 disappeared from A12 (Fig. 6d). Increasing the tolerance value to 10 and setting the cutoff value at 1e-12 produced the contig from A12 and D12, respectively, but BAC 68015 and 075A11 disappeared from contigs (Fig. 6d).
Combining BAC clones from NAU3441 produced one contig when the parameters were set at the lower end of the range (Fig. 6e). However if the tolerance value was increased to 10 and the cutoff value was set at 1e-11, a single contig was generated (Fig. 6e), whereas with a tolerance value of 10 and a cutoff value of 1e-12, two separate contigs for A12 and D12 resulted (Fig. 6e).
Combining BAC clones from NAU3293 produced a single contig with a tolerance value of 7 and a cutoff value of 1e-01 to 1e-09 (Fig. 6f). However, increasing the cutoff value from 1e-10 to 1e-12 produced two contigs from A12 and D12, respectively (Fig. 6f). A tolerance value of 10 and a cutoff value of 1e-12 were therefore optimal.
Using BACs as a tool for constructing contig maps
Various strategies have been developed for constructing physical maps, and BAC clones are especially important for both map-based cloning and sequencing of large complex genomes containing abundant repetitive sequences and highly homologous segments. How to effectively distinguish between the homologous segments of the tetraploid cotton genome, verifying the quality of the physical map is crucial. To avoid interference from sequence similarity, BACs identified by SSR markers based on the linkage map of 12A and 12D were used for assembly in this study. Full use was made of SSR markers from high density genetic mapping to screen the BAC library and to distinguish between the A and D subgenomes. BAC clones obtained by PCR were located to A12 or D12 homoeologous chromosomes according to the published linkage map . Although BAC library screening was time consuming initially, the resulting BAC contigs were accurate and this simplified the analysis later. The first integrated cytogenetic and linkage maps of homoeologous chromosomes 12A and 12D indicate that the orders of most genetic markers tested are colinear with corresponding BAC FISH signals. Although the orders and positions of polymorphic and monomorphic BACs on chromosome 12A and 12D were concordant with marker positions in the corresponding linkage map, the detailed chromosomal view of genome size variation between homoeologous chromosomes 12A and 12D was provided that show 1.3-fold size variation . Furthermore, upon integrating with molecular markers, the BAC contig map of A12 and D12 will facilitate map-based cloning of QTLs or genes associated with important agronomic traits and marker-assisted selection, as well as comparative studies for analysis of the evolution of cotton genomes using sequencing of homologous segments.
Improving contig map quality
The quality of the BAC library DNA is important for sharp, clear fingerprinting maps, and several steps in the preparation of BAC DNA are critical to the success of the process. Due to the low copy number, at least 3 ml of cell culture should be used for DNA preparation, and both cell growth and BAC production should be adequate. The cell pellet should be thoroughly suspended and cell lysis should proceed for less than 5 min. For construction of contigs, the tolerance and cutoff parameters were highly influential. We tested cutoff values from 1e-01 to 1e-12 and tolerance values from 5 to 10, which span the range used widely for agarose-based enzyme fingerprinting. A tolerance value of between 3 and 5 and a cutoff value between 1e-30 and 1e-50 are generally used in HICF fingerprint analysis . A tolerance value of 4 and a cutoff value from 1e-20 to 1e-04 were selected for the BIBAC contig map, which contained at least five shared clones , and a tolerance value of 8 and a cutoff value of 1e-10 were used in the Gossypium raimondii D-genome physical map . In this study, we adjusted the parameters in the FPCV9.3 software to improve the quality of the contig map for analysis by enzyme fingerprinting using homologous BACs. A tolerance value of 10 and a cutoff value of 1e-12 were found to be the optimal parameters.
In this study, we successfully constructed a BAC contig map of the homoeologous chromosome 12 of G. hirsutum TM-1. The BAC-based method was an effective strategy for construction of the cotton physical map, but further work is necessary to improve the contig map. Even so, this work introduces a novel method for combined analysis of homoeologous chromosomes, and could provide an important framework for sequencing of the cotton genome. This work could also facilitate research into the evolution of the cotton genome through sequencing of homologous segments and comparison of genome sequences with BAC sequences. This could also generate information on the genome formation and evolutionary processes involved in cotton polyploidization.
Source BAC library and BAC library screening
A BAC library constructed from G. hirsutum cv. Texas Marker-1 (TM-1) was used in this study . The library was constructed from cotton DNA partially digested with Hind III and incorporated into the BAC vector pIndigBAC-5. The library consists of 147,456 clones with an average insert size of 122.8 kb that ranges from 97 to 240 kb. Approximately 96 % of the clones contain inserts over 100 kb, therefore this library represents 7.4 haploid genome equivalents in theory, based on an AD genome size of 2425 Mb. Clones were stored in 384-well plates and arrayed into multiplex pools for rapid and reliable library screening. BAC screening was carried using four-round PCR using SSR markers selected from the A12 and D12 high-density genetic maps derived from populations of the tetraploid Gossypium species.
BAC-DNA isolation and fingerprinting
BAC clones were inoculated into 96-well 2.2 mL plates, and each well contained 1.5 mL of 2xYT medium (12.5 μg/mL CM). Plates were covered with sealing film and incubated at 37 °C for 20–24 h on a shaker. BAC DNA was isolated using standard alkaline lysis , digested with HindIII, and subjected to 1 % agarose gel electrophoresis at 40 V for 16 h. Restriction fragment identification was performed using IMAGE 3.10b software  with extensive manual editing. Fragments ranging from 53 to 155 bases were used for contig assembly. Bands derived from the BAC vector (pIndigBAC-5) and BACs containing less than five bands were manually deleted from the image files.
BAC contigs assembly
The computer program FPC V9.3 (http://www.agcol.arizona.edu/software/fpc/) was used to assemble the physical map contigs from the BAC fingerprints. A series of tests were conducted in which fingerprints of a set of overlapping clones were compared using different tolerance values (from 5 to 10) and cutoff values (from 1e-01 to 1e-12). Based on these results, a tolerance of 10 and a primary cutoff of 1e-12 were selected for contig assembly.
This study was financially supported in part by grants from the National Natural Science Foundation (31201250), the Natural Science Foundation in Jiangsu Province (BK2010441), the Priority Academic Program Development of Jiangsu Higher Education Institutions and 111 Project.
- Shizuya H, Birren B, Kim UJ, Mancino V, Slepak T, Tachiiri Y, et al. Cloning and stable maintenance of 300-kilobase-pair fragments of human DNA in escherichia-coli using an F-factor-based vector. Proc Natl Acad Sci U S A. 1992;89(18):8794–7.PubMed CentralPubMedView ArticleGoogle Scholar
- Woo SS, Jiang J, Gill BS, Paterson AH, Wing RA. Construction and characterization of a bacterial artificial chromosome library of Sorghum bicolor. Nucleic Acids Res. 1994;22(23):4922–31.PubMed CentralPubMedView ArticleGoogle Scholar
- Staden R. A mew computer method for the storage and manipulation of DNA gel reading data. Nucleic Acids Res. 1980;8(16):3673–94.PubMed CentralPubMedView ArticleGoogle Scholar
- Coulson A, Sulston J, Brenner S, Karn J. Toward a physical map of the genome of the nematode Caenorhabditis elegans. Proc Natl Acad Sci U S A. 1986;83(20):7821–5.PubMed CentralPubMedView ArticleGoogle Scholar
- Frelichowski Jr JE, Palmer MB, Main D, Tomkins JP, Cantrell RG, Stelly DM. Cotton genome mapping with new microsatellites from Acala ‘Maxxa’ BAC-ends. Mol Gen Genet. 2006;275:479–91.View ArticleGoogle Scholar
- Wang XF, Jun MA, Ma ZY, Zhang GY, Zheng YM. BAC library construction and characterization of Suyuan7235, a cotton germplasm with high fiber strength. Cotton Sci. 2006;18:200–3 (in Chinese with an English abstract).Google Scholar
- Zheng YM, Wang XF, Zhang GY, Li XH, Ma ZY. BAC library construction of Zhongmiansuo12 with high yield, high quality and disease resistance. J Hebei Agric Univ. 2004;27:17–20 (in Chinese with an English abstract).Google Scholar
- Yin JM, Guo WZ, Yang LM, Liu LW, Zhang TZ. Physical mapping of the Rf 1 fertility-restoring gene to a 100 kb region in cotton. Theor Appl Genet. 2006;112(7):1318–25.PubMedView ArticleGoogle Scholar
- Wang XF, Ma J, Wang WS, Zheng YM, Zhang GY, Liu CJ. Construction and characterization of the first bacterial artificial chromosome library for the cotton species Gossypium barbadense L. Genome. 2006;49:1393–8.PubMedView ArticleGoogle Scholar
- Kim HJ, Triplett BA. Cotton fiber growth in planta and in vitro. Models for plant cell elongation and cell wall biogenesis. Plant Physiol. 2001;127(4):1361–6.PubMed CentralPubMedView ArticleGoogle Scholar
- Fryxell P. A revised taxonomic interpretation of Gossypium L. (Malvaceae). Rheedea. 1992;2:108–65.Google Scholar
- Wendel JF. New World tetraploid cottons contain Old World cytoplasm. Proc Natl Acad Sci U S A. 1989;86(11):4132–6.PubMed CentralPubMedView ArticleGoogle Scholar
- Seelanan T, Schnabel A, Wendel JF. Congruence and consensus in the cotton tribe (Malvaceae). Syst Bot. 1997;22(2):259–90.View ArticleGoogle Scholar
- Cronn RC, Small RL, Haselkorn T, Wendel JF. Rapid diversification of the cotton genus (Gossypium: Malvaceae) revealed by analysis of sixteen nuclear and chloroplast genes. Am J Bot. 2002;89(4):707–25.PubMedView ArticleGoogle Scholar
- Wang K, Guo WZ, Zhang TZ. Detection and mapping of homologous and homoeologous segments in homoeologous groups of allotetraploid cotton by BAC-FISH. BMC Genomics. 2007;8:178.PubMed CentralPubMedView ArticleGoogle Scholar
- Rong JK, Abbey C, Bowers JE, Brubaker CL, Chang C, Chee PW, et al. A 3347-locus genetic recombination map of sequence-tagged sites reveals features of genome organization, transmission and evolution of cotton (Gossypium). Genetics. 2004;166(1):389–417.PubMed CentralPubMedView ArticleGoogle Scholar
- Nguyen TB, Giband M, Brottier P, Risterucci AM, Lacape JM. Wide coverage of the tetraploid cotton genome using newly developed microsatellite markers. Theor Appl Genet. 2004;109(1):167–75.PubMedView ArticleGoogle Scholar
- Wang K, Song XL, Han ZG, Guo WZ, Yu JZ, Sun J, et al. Complete assignment of the chromosomes of Gossypium hirsutum L. by translocation and fluorescence in situ hybridization mapping. Theor Appl Genet. 2006;113(1):73–80.PubMedView ArticleGoogle Scholar
- Song L, Guo WZ, Qin HD, Ding YZ, Zhang TZ. Genetic analysis and molecular validation of chromosome assignment for fuzzless genes N 1 and n 2 in cotton. J Nanjing Agric Univ. 2010;33(1):21–6 (In Chinese).Google Scholar
- Stelly DM. Localization of the Le 2 Locus of Cotton (Gossypium-Hirsutum L). J Hered. 1990;81:193–7.Google Scholar
- Samora P, Stelly D, Kohel R. Localization and mapping of the Le 1 , and Gl 2 loci of cotton (Gossypium hirsutum L.). J Hered. 1994;85:152–7.Google Scholar
- Kohel RJ, Stelly DM, Yu J. Tests of six cotton (Gossypium hirsutum L.) mutants for association with aneuploids. J Hered. 2002;93:130–2.PubMedView ArticleGoogle Scholar
- Endrizzi JE, Turcotte EL, Kohel RJ. Genetics, cytogenetics and evolution of Gossypium. Adv Genet. 1985;23:271–5.View ArticleGoogle Scholar
- Rong J, Pierce GJ, Waghmare VN, Rogers CJ, Desai A, Chee PW, et al. Genetic mapping and comparative analysis of seven mutants related to seed fiber development in cotton. Theor Appl Genet. 2005;111(6):1137–46.PubMedView ArticleGoogle Scholar
- Dong CG, Ding YZ, Guo WZ, Zhang TZ. Fine mapping of the dominant glandless Gene Gl 2 e in Sea-island cotton (Gossypium barbadense L.). Chin Sci Bull. 2007;52(22):3105–9.View ArticleGoogle Scholar
- Chen D, Ding Y, Guo W, Zhang T. Molecular mapping of genic male-sterile genes ms 15 , ms 5 and ms 6 in tetraploid cotton. Plant Breed. 2009;128(2):193–8.View ArticleGoogle Scholar
- Hu Y, Guo WZ, Zhang TZ. Construction of a bacterial artificial chromosome library of TM-1, a standard line for genetics and genomics in Upland cotton. J Integr Plant Biol. 2009;51(1):107–12.PubMedView ArticleGoogle Scholar
- Xu Z, Kohel RJ, Song G, Cho J, Yu J, Yu S, et al. An integrated genetic and physical map of homoeologous chromosomes 12 and 26 in Upland cotton (G. hirsutum L.). BMC Genomics. 2008;9:108.PubMed CentralPubMedView ArticleGoogle Scholar
- Han ZG, Guo WZ, Song XL, Zhang TZ. Genetic mapping of EST-derived microsatellites from the diploid Gossypium arboreum in allotetraploid cotton. Mol Genet Genomics. 2004;272(3):308–27.PubMedView ArticleGoogle Scholar
- Song XL, Wang K, Guo WZ, Zhang J, Zhang TZ. A comparison of genetic maps constructed from haploid and BC1 mapping populations from the same crossing between Gossypium hirsutum L. and Gossypium barbadense L. Genome. 2005;48(3):378–90.PubMedView ArticleGoogle Scholar
- Han ZG, Wang CB, Song XL, Guo WZ, Gou JY, Li CH, et al. Characteristics, development and mapping of Gossypium hirsutum derived EST-SSRs in allotetraploid cotton. Theor Appl Genet. 2006;112(3):430–9.PubMedView ArticleGoogle Scholar
- Guo WZ, Cai CP, Wang CB, Han ZG, Song XL, Wang K, et al. A microsatellite-based, gene-rich linkage map reveals genome structure, function and evolution in gossypium. Genetics. 2007;176(1):527–41.PubMed CentralPubMedView ArticleGoogle Scholar
- Guo WZ, Cai C, Wang C, Zhao L, Wang L, Zhang TZ. A preliminary analysis of genome structure and composition in Gossypium hirsutum. BMC Genomics. 2008;9:314.PubMed CentralPubMedView ArticleGoogle Scholar
- Zhao L, YD L, Cai CP, Tong XC, Chen XD, Zhang W, et al. Toward allotetraploid cotton genome assembly: integration of a high-density molecular genetic linkage map with dna sequence information. BMC Genomics. 2012;13:539.PubMed CentralPubMedView ArticleGoogle Scholar
- Wang K, Guo W, Yang Z, Hu Y, Zhang W, Zhou B, et al. Structure and size variations between 12A and 12D homoeologous chromosomes based on high-resolution cytogenetic map in allotetraploid cotton. Chromosoma. 2010;119:255–66.PubMedView ArticleGoogle Scholar
- Luo MC, Thomas C, You FM, Hsiao J, Shu OY, Buell CR, et al. High-throughput fingerprinting of bacterial artificial chromosomes using the SNaPshot labeling kit and sizing of restriction fragments by capillary electrophoresis. Genomics. 2003;82(3):378–89.PubMedView ArticleGoogle Scholar
- Zhang M, Zhang Y, Huang JJ, Zhang X, Lee MK, Stelly DM, et al. Genome physical mapping of polyploids: a BIBAC physical map of cultivated tetraploid cotton, Gossypium hirsutum L. Plos One. 2012;7(3):e33644.PubMed CentralPubMedView ArticleGoogle Scholar
- Lin L, Pierce GJ, Bowers JE, Estill JC, Compton RO, Rainville LK, et al. A draft physical map of a D-genome cotton species (Gossypium raimondii). BMC Genomics. 2010;11:395.PubMed CentralPubMedView ArticleGoogle Scholar
- Sambrook J, Fritsch EF, Maniatis T. Molecular Cloning: A Laboratory Manual. 2nd ed. New York: Cold Spring Harbor Laboratory Press; 1989.Google Scholar
- Sulston J, Mallett F, Durvin R, Horsnell T. Image analysis of restriction enzyme fingerprint audioradiograms. Comput Appl Biosci. 1989;5:101–6.PubMedGoogle Scholar
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.