Copy number variants prioritization after array-CGH analysis – a cohort of 1000 patients
- Isabel Marques Carreira1, 2, 3, 4Email author,
- Susana Isabel Ferreira1,
- Eunice Matoso2, 5,
- Luís Miguel Pires1,
- José Ferrão1,
- Ana Jardim1,
- Alexandra Mascarenhas1,
- Marta Pinto1,
- Nuno Lavoura1,
- Cláudia Pais1,
- Patrícia Paiva1,
- Lúcia Simões1,
- Francisco Caramelo6,
- Lina Ramos6,
- Margarida Venâncio3, 7,
- Fabiana Ramos3, 7,
- Ana Beleza7,
- Joaquim Sá7,
- Jorge Saraiva3, 7 and
- Joana Barbosa de Melo1, 2, 3, 4
© Carreira et al. 2015
Received: 10 August 2015
Accepted: 17 December 2015
Published: 30 December 2015
Array-based comparative genomic hybridization has been assumed to be the first genetic test offered to detect genomic imbalances in patients with unexplained intellectual disability with or without dysmorphisms, multiple congenital anomalies, learning difficulties and autism spectrum disorders.
Our study contributes to the genotype/phenotype correlation with the delineation of laboratory criteria which help to classify the different copy number variants (CNVs) detected. We clustered our findings into five classes ranging from an imbalance detected in a microdeletion/duplication syndrome region (class I) to imbalances that had previously been reported in normal subjects in the Database of Genomic Variants (DGV) and thus considered common variants (class IV).
All the analyzed 1000 patients had at least one CNV independently of its clinical significance. Most of them, as expected, were alterations already reported in the DGV for normal individuals (class IV) or without known coding genes (class III-B). In approximately 14 % of the patients an imbalance involving known coding genes, but with partially overlapping or low frequency of CNVs described in the DGV was identified (class IIIA). In 10.4 % of the patients a pathogenic CNV that explained the phenotype was identified consisting of: 40 class I imbalances, 44 class II de novo imbalances and 21 class II X-chromosome imbalances in male patients. In 20 % of the patients a familial pathogenic or potentially pathogenic CNV, consisting of inherited class II imbalances, was identified that implied a family evaluation by the clinical geneticists.
As this interpretation can be sometimes difficult, particularly if it is not possible to study the parents, using the proposed classification we were able to prioritize the multiple imbalances that are identified in each patient without immediately having to classify them as pathogenic or benign.
KeywordsArray comparative genomic hybridization (array-CGH) Copy number variation (CNV) classification Intellectual disability Multiple congenital anomalies Learning difficulties Autism spectrum disorders
Microarray-based comparative genomic hybridization (array-CGH), also called chromosomal microarray or molecular karyotyping, allows the possibility to screen the whole genome at once and with high resolution. It is currently assumed that array-CGH should be the first genetic test offered to detect genomic imbalances in patients with intellectual disability (ID) with or without dysmorphisms, multiple congenital anomalies (MCA), learning difficulties and autism spectrum disorders (ASD) [1, 2]. As array-CGH allows the detection of imbalances below the 5–10 Mb resolution level of conventional cytogenetics, the average diagnostic yield can be up to 10 % higher [3–7]. It also allows the detection of a large number of Copy Number Variants (CNVs) in these patients as well as in healthy individuals, which poses a great challenge in the interpretation of the results . In addition to this increase in the detection of CNVs, the use of array-CGH in large cohorts of patients with ID, ASD and MCA has led to the identification of novel microdeletion and microduplication syndromes [9, 10].
In this study we report the frequency of imbalances, their inheritance and conclusions of a cohort of 1000 patients with ID, ASD, MCA and learning disabilities studied by array-CGH. We propose a classification of CNVs into five different classes as a way to prioritize the multiple imbalances detected in each patient, helping not only the organization and interpretation of the results, but also in those cases where parents were unwillingly to be tested or were unavailable, the elaboration of a more comprehensive report.
When a class I or class II CNV was found, the study of the parents was requested in order to ascertain its origin and delineate the strategy for family management and future prenatal diagnosis. For cases where a IIIA CNV was identified, decision whether or not to pursue with parental evaluation was left to the clinicians. For class IIIB and class IV CNVs no study of parents was needed, and a normal array-CGH result was issued. Since class IIIB CNVs at present do not have genes reported, no causality was attributed, but the information was included in a summary table in the report, to allow a future revision, if genes or regulatory regions are later described.
There were cases where the origin of the imbalances was not possible to determine, either because of unwillingness of both parents to be evaluated, in situations of adoption or in monoparental cases in which the available progenitor was not a carrier of the proband’s CNVs.
Class I imbalances distributed according to the type of imbalance and inheritance
List of all the syndromes identified (class I imbalances) and the number of cases
Number of cases
Chromosome 1q21.1 deletion syndrome
Chromosome 1q21.1 duplication syndrome
Chromosome 3q29 deletion syndrome
Chromosome 3q29 duplication syndrome
2 (1pat, 1na)
Cri du Chat syndrome
Williams-Beuren region duplication syndrome
2 (1dn, 1na)
Chromosome 9p deletion syndrome
Chromosome15q11q13 duplication syndrome
Chromosome 15q13.3 deletion syndrome
Chromosome 16p11.2 deletion syndrome
5 (1pat, 2dn, 2na)
Chromosome 16p11.2 duplication syndrome
3 (1dn, 2na)
Smith –Magenis syndrome
Chromosome 17q12 duplication syndrome
Chromosome 19q13.11 deletion syndrome
Chromosome 22q11.2 deletion syndrome, distal
5 (1pat, 3dn, 1na)
Chromosome 22q11.2 duplication syndrome
3 (1pat, 1dn, 1na)
Class II imbalances distributed according to the type of imbalance and inheritance
Maternal and paternal
Class IIIA imbalances distributed according to the type of imbalance and inheritance
We identified 185 class IIIB imbalances, fifty-six deletions (30.3 %), thirty duplications (16.2 %) and ninety-nine variations (53.5 %), ranging from 284 bp to 1.4 Mb, with 84 % of them having between 284 bp-200 kb.
Array-CGH has proven to be an important tool not only in the detection of submicroscopic chromosomal imbalances and delineation of structural imbalances but also in the establishment of genotype/phenotype correlation. All of the 1000 cases studied in this cohort presented ID with or without dysmorphisms, learning difficulties, MCA or ASD. All of them had imbalances detected by array-CGH. Overall, there were 843 imbalances (class I-III) in 611 of the 1000 cases (Fig. 1). Imbalances classified as class IV are not discussed as they correspond to benign CNVs. In order to define the relevance of the identified imbalances and their correlation with the identified phenotypes we classified them into five classes:
Class I - Deletion or duplication in a region associated with a microdeletion or a microduplication syndrome
Of the 1000 patients, forty (4 %) fitted into this category with more deletions compared to duplications. In a study by Shoukier and collaborators, in 342 patients, 42 (12 %) had an imbalance that corresponded to a known microdeletion or microduplication syndrome, with a higher proportion of deletions to duplications (33 vs 9) . The most common findings in our cohort were imbalances involving 16p11.2 and 22q11.2 microdeletion/duplication syndromes region, interestingly the same result as recently reported by Ahn et al, 2013 . In the cases where pattern of inheritance was studied, as expected, most of them were de novo. One of these cases had a de novo 19q13.11q13.12 deletion that provides evidence for the involvement of UBA2 gene haploinsuffiency in cutis aplasia .
Of the six inherited imbalances, three were deletions in chromosomes 3q29, 16p11.2 and 22q11.2, and three were duplications, in 3q29, 17q12 and 22q11.2. The 3q29 deletion and the 17q12 duplication were maternally inherited, while the 3q29 and 22q11.2 duplications and the 16p11.2 and 22q11.2 deletions were paternal in origin. In the results by Shoukier and collaborators, half of the 42 imbalances were de novo, 9 were maternally inherited, with 4 imbalances on the X-chromosome, and in 12 inheritance was unknown .
Concerning inherited CNVs, the difficulties in interpretation are due to: 1) incomplete penetrance; 2) phenotypic variability; and 3) lack of clinical evaluation of the carrier progenitor. The chromosome 16p11.2 deletion was inherited from a healthy father, which is commonly reported in the literature and in agreement with the variable clinical outcome associated with this imbalance that can include a normal phenotype [12, 15]. Concerning chromosome 22q11.2 imbalances, the father with the 22q11.2 duplication is healthy, having however only completed the basic scholarship but is currently not able to read or write and his ID is similar to his daughter. The chromosome 22q11.2 deletion is inherited from the paternal grandmother, and the proband is the only one with phenotypic features. These cases clearly show the inter and intra-familial phenotypic variability associated with this imbalance . The 3q29 deletion was inherited from the mother without ID but that had learning difficulties, and actually presents speech difficulties and psychiatric disease. The 3q29 duplication and the 17q12 duplication were inherited from healthy progenitors that have only completed basic scholarship.
The inherited cases posed a delicate problem, in terms of future pregnancies, in healthy parents that had an apparently normal phenotype, even though many of them had only completed basic schooling.
Class I imbalances are considered pathogenic and molecular/conventional cytogenetics-FISH study of the progenitors is recommended, namely to exclude a balanced rearrangement (Table 1).
Class II - Deletion or duplication not reported in the DGV and involving known coding genes
In the studied cohort, 34 % of the patients demonstrated 452 class II imbalances (Fig. 1). This high percentage is most probably due to the fact that the first 250 cases of this cohort of patients were highly selected as they had a strong clinical phenotype, normal karyotype and most had a normal subtelomeric analysis by fluorescence in situ hybridization (FISH) or multiplex ligation-dependent probe amplification (MLPA). Patient selection is one of the factors that most influences the diagnostic yield, as stated by Shoukier and collaborators, whose cohort of patients had already performed conventional cytogenetics and subtelomere screening before array-CGH, similar to our first 250 patients .
Almost 50 % of the identified class II imbalances were inherited, making the genotype/phenotype correlation difficult to establish as most of the carrier progenitors had apparently normal phenotypes, although they have not been through a thorough clinical evaluation. As previously mentioned, larger inherited imbalances also had a phenotypic manifestation in the carrier progenitor. Smaller inherited imbalances can either not be the cause of the proband’s phenotype or can have incomplete penetrance in the carrier progenitor. There are some exceptions, as for example the 12.5 Mb inherited imbalance that corresponds to an X-chromosome duplication in a boy, inherited from his healthy mother, which illustrates how X-chromosome inactivation in females can mask large imbalances. In the range of 8–9 Mb we identified a chromosome 4 maternal duplication in a girl with attention-deficit hyperactivity disorder whose mother presented learning difficulties at school . In the 6–7 Mb range two imbalances were identified: in 18p11.32p11.31 with 6.9 Mb duplication and in 8p21.3p21.1 with 6.2 Mb duplication both inherited from affected fathers, one with ID and the other with learning difficulties, as he only completed basic school.
Class II imbalances have a spectrum that ranges from pathogenic, potentially pathogenic to uncertain significance but likely pathogenic (Table 1). In this type of imbalance, the study of parents is recommended and helps the prioritization of CNVs – if de novo, CNV is most likely pathogenic.
Class IIIA - Deletion or duplication reported in low frequency in the DGV or that do not totally overlap with the ones reported, involving known coding genes
Imbalances classified in this class corresponded to 20 % of all the identified CNVs (Fig. 1). These CNVs can be population CNVs with low representation in the public databases. This class includes imbalances of uncertain significance tending to be probably benign. In only one patient inheritance was determined to be de novo, and in this patient the imbalance might be considered of uncertain significance but likely pathogenic.
Class IIIA imbalances have uncertain significance, ranging from likely pathogenic to likely benign (Table 1). In this type of CNVs, the study of parents is optional and the management of the origin and its impact has to be evaluated in the clinical context.
Class IIIB - Deletion or duplication reported in low frequency in the DGV or that do not totally overlap with the ones reported, without known coding genes
In this cohort of patients, 22 % of the imbalances were classified as IIIB. Accordingly to the available data, as they do not include genes, these imbalances were not considered relevant for the phenotype. Nevertheless, they were included in a table attached to the report sent to the clinician, allowing these cases to be revisited and reclassified in the light of future knowledge.
Class IIIB imbalances have uncertain significance, but being likely benign (Table 1). The study of parents is not recommended.
Array-CGH has revolutionized post natal cytogenetics diagnostic, increasing the detection rate and replacing conventional cytogenetics . However, karyotyping cannot be completely abandoned and, in certain cases, both techniques should be performed in order to understand the biological mechanism involved in the origin of an imbalance and contribute, not only to identify the carriers, but also to do a correct evaluation of the recurrence risk in these families. The importance of the proposed CNVs prioritization arises, specially, when a preliminary report is done requesting parental studies. It was important for the clinical management of the family to have an indication of the relationship between the genotype and the phenotype. Classifying CNVs in different classes allowed us to establish the continuum pathogenic–benign and their final impact on the phenotype, after determining their origin. In classes I and II, parental blood samples were always requested to determine the origin of the imbalance either by array-CGH, MLPA, FISH or karyotype. Prenatal diagnosis was recommended in subsequent gestations to progenitors from classes I and II probands. In class IIIA imbalances it was left to clinicians’ criteria, after the genetic counseling, the necessity to perform subsequent laboratory and familial studies. For classes IIIB and IV no parental blood samples were requested by the laboratory, but these results were sent to the clinician in a table attached to the report for future management.
This classification allowed a prioritization of the cases, particularly when a report had to be issued requesting parents study.
A cohort of 1000 unrelated patient samples was analyzed by oligoarray-CGH in our Cytogenetics and Genomics laboratory over a period of 3 years. The samples were provided by clinical geneticists and pediatricians from 3 national hospitals, as part of a common undertaking to determine the etiology of ID with or without dysmorphisms, learning difficulties, MCA and ASD. The levels of ID of our patients ranged from mild to moderate or severe as determined by the gold standard scales and the ASD patients were evaluated by ADI-R and ADOS. The study was approved by the Hospitals Ethics Committee.
Various techniques were used to confirm and characterize the imbalances identified by oligoarray-CGH, such as, high resolution conventional cytogenetics, FISH or MLPA, depending on the type and dimension of the imbalance and availability of FISH and MLPA probes in the laboratory. In order to determine the origin of the imbalances, parental blood samples were requested and, additionally to the previous mentioned techniques, array-CGH was also used in particular cases.
In 325 patients conventional cytogenetics had been previously performed. Fourteen patients with an abnormal karyotype were analyzed by array-CGH. Of those, five patients presented a deletion, two a duplication, two had apparently balanced translocations, one had a 47, XXY karyotype, three patients presented inversions and one had a derivative chromosome. These cases were analyzed by oligoarray-CGH to refine the breakpoints and the genomic content of the rearrangements, or to determine if there were genomic imbalances involved in apparently balanced translocations, as they can remain undetected , and inversions carriers, or even to clarify if there was an additional imbalance, like in the case of a Klinefelter patient, which could explain a severe phenotype.
Genomic DNA was extracted from peripheral blood lymphocytes using Jetquick blood and cell culture DNA Midi Spin kit according to the manufacturers’ instructions (Genomed, Löhne, Germany). DNA concentration and purity were measured using a NanoDrop1000 Spectrophotometer (Thermo Scientific, Waltham, USA).
High-resolution whole genome analysis was performed using Agilent SurePrint G3 Human Genome microarrays. Although 180 K oligonucleotide microarray format (Agilent Technologies, Santa Clara, CA, USA) was the most used, the 60 K was requested for fifty-four cases. These microarrays contain approximately 180 000 and 60 000 sixty-mer probes with a 17 kb and 54 kb average probe spacing, respectively. Labeling was performed using Agilent Genomic DNA enzymatic labeling kit (Agilent Technologies, Santa Clara, CA, USA) and clean-up of labeled genomic DNA was performed using Amicon ultra 0.5 ml centrifugal filters according to manufacturers’ instructions (Millipore, Billerica, MA, USA). Slides were scanned on an Agilent scanner and processed with Feature Extraction software (v10.7). Each patient DNA was labeled both in Cy3 and Cy5 and hybridized against the DNA of two phenotypically distinct patients according to the loop model . Results were analyzed using Agilent Genomic Workbench (v6.0 and v6.5) software with the following settings: ADM2 as aberration algorithm, threshold of 6.0, moving average 2 Mb. The results are according to Human Genome build 19 and include imbalances with at least three consecutive probes with abnormal log2 ratios. All the imbalances were interpreted consulting the UCSC genome browser (http://genome.ucsc.edu), Decipher (Database of Chromosomal Imbalance and Phenotype in Humans Using Ensembl Resources - http://decipher.sanger.ac.uk/), ClinGen (Clinical Genome Resource - https://www.clinicalgenome.org/), OMIM (Online Mendelian Inheritance in Man - http://www.ncbi.nlm.nih.gov/omim) and DGV (http://projects.tcag.ca) databases.
The statistical analysis was mainly conducted aiming to describe the proportions found in each group resorting to suitable figures and tables. Association hypotheses regarding nominal variables were tested using the chi-square test taking into account the Cochran rules for validity of the test. The level of significance adopted was 0.05 %.
The authors wish to thank the patients and their families.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Miller DT, Adam MP, Aradhya S, Biesecker LG, Brothman AR, Carter NP, et al. Consensus statement: chromosomal microarray is a first-tier clinical diagnostic test for individuals with developmental disabilities or congenital anomalies. Am J Hum Genet. 2010;86:749–64.PubMedPubMed CentralView ArticleGoogle Scholar
- Friedman JM, Baross A, Delaney AD, Ally A, Arbour L, Armstrong L, et al. Oligonucleotide microarray analysis of genomic imbalance in children with mental retardation. Am J Hum Genet. 2006;79:500–13.PubMedPubMed CentralView ArticleGoogle Scholar
- Lu X, Shaw CA, Patel A, Li J, Cooper ML, Wells WR, et al. Clinical implementation of chromosomal microarray analysis: summary of 2513 postnatal cases. PLoS One. 2007;2, e327.PubMedPubMed CentralView ArticleGoogle Scholar
- Rodríguez-Revenga L, Vallespín E, Madrigal I, Palomares M, Mur A, García-Miñaur S, et al. A parallel study of different array-CGH platforms in a set of Spanish patients with developmental delay and intellectual disability. Gene. 2013;521:82–6.PubMedView ArticleGoogle Scholar
- D'Arrigo S, Gavazzi F, Alfei E, Zuffardi O, Montomoli C, Corso B, Buzzi E, Sciacca FL, Bulgheroni S, Riva D, Pantaleoni C. The Diagnostic Yield of Array Comparative Genomic Hybridization Is High Regardless of Severity of Intellectual Disability/Developmental Delay in Children. J Child Neurol 2015.Google Scholar
- Coutton C, Dieterich K, Satre V, Vieville G, Amblard F, David M, et al. Array-CGH in children with mild intellectual disability: a population-based study. Eur J Pediatr. 2015;174:75–83.PubMedView ArticleGoogle Scholar
- Nicholl J, Waters W, Mulley JC, Suwalski S, Brown S, Hull Y, et al. Cognitive deficit and autism spectrum disorders: prospective diagnosis by array CGH. Pathology. 2014;46:41–5.PubMedView ArticleGoogle Scholar
- Gijsbers AC, Schoumans J, Ruivenkamp CA. Interpretation of array comparative genome hybridization data: a major challenge. Cytogenet Genome Res. 2011;135:222–7.PubMedView ArticleGoogle Scholar
- Slavotinek AM. Novel microdeletion syndromes detected by chromosome microarrays. Hum Genet. 2008;124:1–17.PubMedView ArticleGoogle Scholar
- Shaffer LG, Bejjani BA, Torchia B, Kirkpatrick S, Coppinger J, Ballif BC. Theidentification of microdeletion syndromes and other chromosome abnormalities: Cytogenetic methods of the past, new technologies for the future. Am J Med Genet Part C Semin Med Genet. 2007;145:335–45.View ArticleGoogle Scholar
- Choucair N, Ghoch JA, Corbani S, Cacciagli P, Mignon-Ravix C, Salem N, et al. Contribution of copy number variants (CNVs) to congenital, unexplained intellectual and developmental disabilities in Lebanese patients. Mol Cytogenet. 2015;8:26.PubMedPubMed CentralView ArticleGoogle Scholar
- Shoukier M, Klein N, Auber B, Wickert J, Schr¨oder J, Zoll B, et al. Array CGH in patients with developmental delay or intellectual disability: are there phenotypic clues to pathogenic copy number variants? Clin Genet. 2013;83:53–65.PubMedView ArticleGoogle Scholar
- Ahn JW, Bint S, Bergbaum A, Mann K, Hall RP, Ogilvie CM. Array CGH as a first line diagnostic test in place of karyotyping for postnatal referrals - results from four years’ clinical application for over 8,700 patients. Mol Cytogenet. 2013;6:16.PubMedPubMed CentralView ArticleGoogle Scholar
- Melo JB, Estevinho A, Saraiva J, Ramos L, Carreira IM. Cutis Aplasia as a clinical hallmark for the syndrome associated with 19q13.11 deletion: the possible role for UBA2 gene. Mol Cytogenet. 2015;8:21.PubMedPubMed CentralView ArticleGoogle Scholar
- Bijlsma EK, Gijsbers AC, Schuurs-Hoeijmakers JH, van Haeringen A, Fransen van de Putte DE, Anderlid BM, et al. Extending the phenotype of recurrent rearrangements of 16p11.2: deletions in mentally retarded patients without autism and in normal individuals. Eur J Med Genet. 2009;52:77–87.PubMedView ArticleGoogle Scholar
- Tan TY, Collins A, James PA, McGillivray G, Stark Z, Gordon CT, et al. Phenotypic variability of distal 22q11.2 copy number abnormalities. Am J Med Genet A. 2011;155:1623–33.View ArticleGoogle Scholar
- Matoso E, Melo JB, Ferreira SI, Jardim A, Castelo TM, Weise A, et al. Insertional translocation leading to a 4q13 duplication including the EPHA5 gene in two siblings with attention-deficit hyperactivity disorder. Am J Med Genet A. 2013;161:1923–8.View ArticleGoogle Scholar
- Di Gregorio E, Savin E, Biamino E, Belligni EF, Naretto VG, D’Alessandro G, et al. Large cryptic genomic rearrangements with apparently normal karyotypes detected by array-CGH. Mol Cytogenet. 2014;7:82.PubMedPubMed CentralView ArticleGoogle Scholar
- Menten B, Maas N, Thienpont B, Buysse K, Vandesompele J, Melotte C, et al. Emerging patterns of cryptic chromosomal imbalance in patients with idiopathic mental retardation and multiple congenital anomalies: a new series of 140 patients and review of published reports. J Med Genet. 2006;43:625–33.PubMedPubMed CentralView ArticleGoogle Scholar