Acessibilidade / Reportar erro

Copy number variation (CNV) identification, interpretation, and database from Brazilian patients

Abstract

Copy number variations (CNVs) constitute an important class of variation in the human genome and the interpretation of their pathogenicity considering different frequencies across populations is still a challenge for geneticists. Since the CNV databases are predominantly composed of European and non-admixed individuals, and Brazilian genetic constitution is admixed and ethnically diverse, diagnostic screenings on Brazilian variants are greatly difficulted by the lack of populational references. We analyzed a clinical sample of 268 Brazilian individuals, including patients with neurodevelopment disorders and/or congenital malformations. The pathogenicity of CNVs was classified according to their gene content and overlap with known benign and pathogenic variants. A total of 1,504 autosomal CNVs (1,207 gains and 297 losses) were classified as benign (92.9%), likely benign (1.6%), VUS (2.6%), likely pathogenic (0.2%) and pathogenic (2.7%). Some of the CNVs were recurrent and with frequency increased in our sample, when compared to populational open resources of structural variants: 14q32.33, 22q11.22, 1q21.1, and 1p36.32 gains. Thus, these highly recurrent CNVs classified as likely benign or VUS were considered non-pathogenic in our Brazilian sample. This study shows the relevance of introducing CNV data from diverse cohorts to improve on the interpretation of clinical impact of genomic variations.

Keywords:
CNV; copy number variation; database; CNV classification

Introduction

CNVs are characterized by losses or gains of DNA sequences that are larger than 50 bp (Alkan et al. 2011Alkan C, Coe BP and Eichler EE (2011) Genome structural variation discovery and genotyping. Nat Rev Genet 12:363-76.; MacDonald et al. 2014MacDonald JR, Ziman R, Yuen RK, Feuk L and Scherer SW (2014) The database of genomic variants: A curated collection of structural variation in the human genome. Nucleic Acids Res 42:986-92. ; Zarrei et al. 2015Zarrei M, MacDonald JR, Merico D and Scherer SW (2015) A copy number variation map of the human genome. Nat Rev Genet 16:172-83.). They are a relevant class of variants due to the large number of genome segments that differs in the dosage between individuals, conferring great interindividual diversity (Iafrate et al. 2004Iafrate AJ, Feuk L, Rivera MN, Listewnik ML, Donahoe PK, Qi Y, Scherer SW and Lee C (2004) Detection of large-scale variation in the human genome. Nat Genet 36:949-51. ). Such variants are also present in healthy individuals with no apparent association with disease phenotypes, being considered benign CNVs (Iafrate et al. 2004Iafrate AJ, Feuk L, Rivera MN, Listewnik ML, Donahoe PK, Qi Y, Scherer SW and Lee C (2004) Detection of large-scale variation in the human genome. Nat Genet 36:949-51. ; Sebat et al. 2004Sebat J, Lakshmi B, Troge J, Alexander J, Young J, Lundin P, Månér S, Massa H, Walker M, Chi M et al. (2004) Large-scale copy number polymorphism in the human genome. Science 305:525-28. ; Redon et al. 2006Redon R, Ishikawa S, Fitch KR, Feuk L, Perry GH, Andrews TD, Fiegler H, Shapero MH, Carson AR, Chen W et al. (2006) Global variation in copy number in the human genome. Nature 444:444-454. ; Korbel et al. 2007Korbel JO, Urban AE, Affourtit JP, Godwin B, Grubert F, Simons JF, Kim PM, Palejev D, Carriero NJ, Du L et al. (2007) Paired-end mapping reveals extensive structural variation in the human genome. Science 318:420-426.; Conrad et al. 2010Conrad DF, Pinto D, Redon R, Feuk L, Gokcumen O, Zhang Y, Aerts J, Andrews TD, Barnes C, Campbell P et al. (2010) Origins and functional impact of copy number variation in the human genome. Nature 464:704-712.). There are also CNVs that are responsible for the etiology of numerous human diseases, such as multiple syndromes that are associated to congenital anomalies, complex neurodegenerative and neuropsychiatric disorders, intellectual disability, cancer and immunological diseases (Stankiewicz and Lupski 2010Stankiewicz P and Lupski JR (2010) Structural variation in the human genome and its role in disease. Annu Rev Med 61:437-55.; Girirajan et al. 2011Girirajan S, Campbell CD and Eichler EE (2011) Human copy number variation and complex genetic disease. Annu Rev Genet 45:203-226.). The recognition of their influence on the phenotype, however, is not an easy task. Although some CNV maps and databases have been constructed (Zarrei et al. 2015Zarrei M, MacDonald JR, Merico D and Scherer SW (2015) A copy number variation map of the human genome. Nat Rev Genet 16:172-83.) for both, healthy individuals (e.g. DGVDatabase of Genomic Variants (DGV) (http://projects.tcag.ca/variation).
http://projects.tcag.ca/variation...
- http://dgv.tcag.ca/dgv/app/home) and affected patients (e.g. DECIPHERDatabase of genomic variation and phenotype in humans using ensembl Resources (Decipher) (http://decipher.sanger.ac.uk).
http://decipher.sanger.ac.uk...
- https://decipher.sanger.ac.uk/), it is still a great challenge to assess the CNVs’ clinical impact (Lee et al. 2007Lee C, Iafrate AJ and Brothman AR (2007) Copy number variations and clinical cytogenetic diagnosis of constitutional disorders. Nat Genet 39:S48-S54.; Gijsbers et al. 2011Gijsbers AC, Schoumans J and Ruivenkamp CA (2011) Interpretation of array comparative genome hybridization data: a major challenge. Cytogenet Genome Res 135:222-7.; Brnich et al. 2019Brnich S, Abou Tayoun A, Greenblatt MS, Heinen CD, Kanavy D, Luo X, McNulty SM, Starita LM, Tavtigian SV et al. (2019). PgmNr 352: Recommendations for determining the clinical validity of functional studies for use in variant interpretation. In: American Society of Human Genetics 69th Annual Meeting, p 445.). The American College of Medical Genetics and Genomics (ACMG) presented guidelines for CNVs interpretation and recommended the use of specific standard terminology: “pathogenic”, “likely pathogenic”, “uncertain significance”, “likely benign”, and “benign” (Richards et al. 2015Richards S, Aziz N, Bale S, Bick D, Das S, Gastier-Foster J, Grody WW, Hegde M, Lyon E, Spector E et al. (2015) Standards and guidelines for the interpretation of sequence variants: A joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet Med 17:405-24.). This guide was recently updated to assist clinical laboratories in the classification and reporting of CNVs. These professional standards will guide the evaluation of constitutional CNVs and encourage consistency and transparency across clinical laboratories (Riggs et al. 2019Riggs E, Andersen E, Cherry A, Kantarci S, Kearney H, Patel A, Raca G, Ritter DI, South S, Thorland E et al. (2019) Technical standards for the interpretation and reporting of constitutional copy-number variants: A joint consensus recommendation of the American College of Medical Genetics and Genomics (ACMG) and the Clinical Genome Resource (ClinGen). Genet Med 22:245-257.). However, there are no generally established rules for CNV analysis, interpretation, and classification, and the guidelines can change over time due to the scientific information evolution (de Leeuw et al. 2012de Leeuw N, Dijkhuizen T, Hehir-Kwa JY, Carter NP, Feuk L, Firth HV, Kuhn RM, Ledbetter DH, Martin CL, van Ravenswaaij-Arts CMA et al. (2012) Diagnostic interpretation of array data using public databases and internet sources. Hum Mutat 33:930-940.; Palmer et al. 2014Palmer E, Speirs H, Taylor PJ, Mullan G, Turner G, Einfeld S, Tonge B and Mowat D (2014) Changing interpretation of chromosomal microarray over time in a community cohort with intellectual disability. Am J Med Genet A 164A:377-85.; Brnich et al, 2019Brnich S, Abou Tayoun A, Greenblatt MS, Heinen CD, Kanavy D, Luo X, McNulty SM, Starita LM, Tavtigian SV et al. (2019). PgmNr 352: Recommendations for determining the clinical validity of functional studies for use in variant interpretation. In: American Society of Human Genetics 69th Annual Meeting, p 445.; Riggs et al. 2019Riggs E, Andersen E, Cherry A, Kantarci S, Kearney H, Patel A, Raca G, Ritter D, South S, Thorland E et al. (2019) PgmNr 353: Validation of scoring metrics to guide the classification of constitutional copy number variants. In: American Society of Human Genetics 69th Annual Meeting, p. 447.). Furthermore, it has been shown that the CNV distribution can differ across ethnic populations (Li et al. 2009Li J, Yang T, Wang L, Yan H, Zhang Y, Guo Y, Pan F, Zhang Z, Peng Y, Zhou Q et al. (2009) Whole genome distribution and ethnic differentiation of copy number variation in caucasian and asian populations. PLoS One 4:e7958.; Collins et al. 2019Collins RL, Brand H, Karczewski KJ, Zhao X, Alföldi J, Khera AV, Francioli LC, Gauthier LD, Wang H, Watts NA et al. (2019) PgmNr 351: gnomAD-SV: An open resource of structural variation for medical and population genetics. In: American Society of Human Genetics 69th Annual Meeting, p. 443.). The Brazilian population is highly admixed and still underrepresented in genomic databanks (Naslavsky et al. 2017Naslavsky MS, Yamamoto GL, de Almeida TF, Ezquina SAM, Sunaga DY, Pho N, Bozoklian D, Sandberg TOM, Brito LA, Lazar M et al. (2017) Exomic variants of an elderly cohort of brazilians in the ABraOM database. Hum Mutat 38:751-763.; Andrade et al. 2018Andrade RB, Amador MAT, Cavalcante GC, Leitão LPC, Fernandes MR, Modesto AAC, Moreira FC, Khayat AS, Assumpção PP, Ribeiro-dos-Santos A et al. (2018). Estimating Asian contribution to the Brazilian population: A new application of a validated set of 61 ancestry informative markers. G3 Genes/Genomes/Genetics 8:3577-3582). Thus, the main goal of this study was to survey and classify large CNVs to assemble a database from Brazilian patients in order to improve the interpretation of their clinical impact.

Subjects and Methods

Individuals studied

A sample composed of 268 microarrays performed in patients with phenotypic alterations was studied (Table S1 Table S1 - Detailed information about all CNVs analyzed in our sample. ). The patients were recruited from the Medical Genetics Center of the Universidade Federal de São Paulo, outpatient clinics of the Hospital São Paulo and other genetics centers in the state of São Paulo, Brazil. This project was approved by the University Ethics Committee and all participants or parents signed informed consents. All procedures performed involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its later amendmentsor comparable ethical standards. We studied microarrays from 143 patients with normal karyotypes (69 presented with diverse phenotypic alterations and 74 were patients with phenotype of the oculoauriculovertebral spectrum - OAVS) and 125 patients with previously identified genomic imbalances/chromosomal alterations, who participated in specific studies in our laboratory (50 presented with 22q11.2 deletion, 23 with apparently balanced translocations, nine with marker chromosomes, five with 18q deletions, four with 18p deletions, four with ring chromosomes, and 30 with other abnormal G-banding karyotypes exclusive to a single patient).

Microarray-based copy number variation assay and quality control

Genomic DNA was obtained from peripheral blood using the Gentra Puregene kit (Qiagen-Sciences, Maryland, USA). DNA samples were then analyzed using the Genome-Wide Human Array 6.0 SNP array (n= 59 individuals), CytoScan 750K (n= 54 individuals), and CytoScan High-Density SNP array (n= 155 individuals), following the manufacturer’s instructions (Affymetrix, Santa Clara, CA, US). Array analyses were performed using the Chromosome Analysis Suite software (ChAS), version 3.3 (Affymetrix, Santa Clara, CA, USA). The quality control (QC) parameters were applied according to the manufacturer´s recommendations. For Genome-Wide Human Array 6.0 SNP array platform, samples with Median of the Absolute values of all Pairwise Differences (MAPD) ≤ 0.35 were included in the sample. For CytoScan High-Density (HD) SNP array and CytoScan 750K platforms, samples with MAPD ≤ 0.25, SNP quality control (SNPQC) ≥ 15 (or ≥ 12 when all other parameters met the requirements), and waviness standard deviation (waviness SD) ≤ 0.12 (when all other parameters met the requirements) were also included in the sample.

CNVs analysis and classification

The CNV classification was performed by the same investigator in a blind manner considering array type and patients´ phenotype. Autosomal CNVs that had a minimum coverage of 50 probes and a minimum size of 200 kb for gains and 150 kb for losses were considered for the analysis of pathogenicity, since deletions can be more deleterious for the phenotype. The CNVs previously detected by other cytogenomic tests and undoubtedly causative to the patients’ phenotype such as, deletions 18p, 18q and deletion 22q11.2, were excluded from the analysis in order to avoid super representation of these loci. The genomic imbalances were annotated based on the GRCh37/hg19 Genome Build (Feb 2009). CNVs analysis was performed using the UCSCUniversity of California-Santa Cruz Human Genome Browser (UCSC) (http://genome.ucsc.edu/).
http://genome.ucsc.edu/...
with DGVDatabase of Genomic Variants (DGV) (http://projects.tcag.ca/variation).
http://projects.tcag.ca/variation...
track (May 15 2016 version), DECIPHERLi JZ, Absher DM, Tang H, Southwick AM, Casto AM, Ramachandran S, Cann HM, Barsh GS, Feldman M, Cavalli-Sforza LL et al. (2008) Worldwide human relationships inferred from genome-wide patterns of variation. Science 319:1100-4 track (Oct 30 2018 version), ClinVarClinVar (https://www.ncbi.nlm.nih.gov/clinvar/).
https://www.ncbi.nlm.nih.gov/clinvar/...
track (Oct 2017 version), ClinGenClinGen (https://www.clinicalgenome.org/).
https://www.clinicalgenome.org/...
track (Oct 2017 version), OMIM track for analysis of genes associated with diseases (Oct 10 2018 version) and NCBI RefSeq Genes track (April 19 2017 version). A flowchart for CNVs classification (Figure 1) was built based on the criteria described by Lee et al. (2007Lee C, Iafrate AJ and Brothman AR (2007) Copy number variations and clinical cytogenetic diagnosis of constitutional disorders. Nat Genet 39:S48-S54.), Vermeesch et al. (2012Vermeesch JR, Brady PD, Sanlaville D, Kok K and Hastings RJ (2012) Genome-wide arrays: Quality criteria and platforms to be used in routine diagnostics. Hum Mutat 33:906-15. ), Palmer et al. (2014Palmer E, Speirs H, Taylor PJ, Mullan G, Turner G, Einfeld S, Tonge B and Mowat D (2014) Changing interpretation of chromosomal microarray over time in a community cohort with intellectual disability. Am J Med Genet A 164A:377-85.), Richards et al. (2015Richards S, Aziz N, Bale S, Bick D, Das S, Gastier-Foster J, Grody WW, Hegde M, Lyon E, Spector E et al. (2015) Standards and guidelines for the interpretation of sequence variants: A joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet Med 17:405-24.), and Nowakowska (2017Nowakowska B (2017) Clinical interpretation of copy number variants in the human genome. J Appl Genet 58:449-457. ). CNVs were classified into five categories proposed by the ACMG guidelines (Richards et al. 2015Richards S, Aziz N, Bale S, Bick D, Das S, Gastier-Foster J, Grody WW, Hegde M, Lyon E, Spector E et al. (2015) Standards and guidelines for the interpretation of sequence variants: A joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet Med 17:405-24.): benign, likely benign, variant of uncertain significance (VUS), likely pathogenic and pathogenic. A CNV was considered benign when there was more than 50% overlap, in size and location, with DGVDatabase of Genomic Variants (DGV) (http://projects.tcag.ca/variation).
http://projects.tcag.ca/variation...
-CNVs of the same nature (i.e., deletion or duplication) from at least three unaffected individuals, and the nonoverlapping segment did not exceed 50 percent of the DGVDatabase of Genomic Variants (DGV) (http://projects.tcag.ca/variation).
http://projects.tcag.ca/variation...
-CNVs’ length. CNVs were considered likely benign when they did not contain genes or if they did, the genes within them were not OMIM genes and the CNVs did not overlap with the ones found in databases of affected individuals (DECIPHERDatabase of genomic variation and phenotype in humans using ensembl Resources (Decipher) (http://decipher.sanger.ac.uk).
http://decipher.sanger.ac.uk...
, ClinVar ClinVar (https://www.ncbi.nlm.nih.gov/clinvar/).
https://www.ncbi.nlm.nih.gov/clinvar/...
and ClinGenClinGen (https://www.clinicalgenome.org/).
https://www.clinicalgenome.org/...
). CNVs were considered VUS when (1) they contained genes that were not OMIM genes, and they overlapped at least one CNV found in databases of affected individuals, with more than 50% overlap, in size and location; (2) when they contained OMIM genes but did not overlap CNVs found in databases of affected individuals; or (3) when they had OMIM genes and overlapped CNVs found in databases of affected individuals, but do not show a clear correlation with phenotypic alterations and consistency of classifications in the analyzed databases. CNVs were considered likely pathogenic when they presented genes described in OMIM, overlapped CNVs present in databases of genomic imbalances in affected individuals, and had no clear association with phenotypic alterations, but with consistency in the databases indicating phenotypic alterations. CNVs were considered pathogenic when (1) they were more than 3 Mb in length; (2) they overlapped with regions associated with DECIPHERDatabase of genomic variation and phenotype in humans using ensembl Resources (Decipher) (http://decipher.sanger.ac.uk).
http://decipher.sanger.ac.uk...
microdeletion/microduplication syndromes; or (3) when, even though their size were not more than 3 Mb in length, they harbored OMIM genes, overlapped with CNVs found in databases of affected individuals that showed consistent correlation with phenotypic alterations. After this classification, recurrent CNVs found in a high percentage (≥ 2%) of patients in our sample were reclassified as benign ones.

Figure 1 -
Flowchart for CNVs analysis and interpretation.

Results

CNVs type, size and pathogenicity

A total of 1,504 autosome CNVs from 268 microarrays were considered for downstream analysis of pathogenicity. Among them, 1,207 (80.3%) were gains, and 297 (19.7%) were losses. Table 1 present the number and percentage of loss, gain, and total CNVs according to the different sizes and classifications of pathogenicity. According to the classification criteria, 1,397 of them (92.9%) were considered benign, 25 (1.6%) were likely benign, 39 (2.6%) were VUS, 3 (0.2%) were likely pathogenic, and 40 (2.7%) were pathogenic. The mean size of the CNVs was ~763 kb, being ~704 kb for gains and ~1.0 Mb for losses. The mean size of benign, likely benign, VUS, likely pathogenic and pathogenic CNVs was 586 kb, 522 kb, 621 kb, 1.0 Mb and 7.2 Mb, respectively. The sizes of the CNVs ranged from 200 kb to 24.5 Mb for gains and from 150 kb to 20 Mb for losses. The mean number of CNVs per patient was 5.6, being 12.6 for 6.0 SNP array, 2.9 for 750K array, and 3.9 for HD array.

Table 1 -
Number and frequency (in percentage) of CNVs in our sample according to their type of genomic imbalance, size, and classification.

Recurrent CNVs in our sample

Benign, likely benign, and VUS CNVs that were found in, arbitrarily, more than 2% arrays within a certain genomic region, were grouped and were considered as recurrent CNVs in our sample. According to these criteria, our sample presented with recurrent copy gains in four genomic regions: 14q32.33 (97.8% patients), 22q11.22 (32.1%), 1p36.32 (9.7%), and 1q21.1 (6,7%). In our pathogenicity evaluation, these first two copy gains had been classified as benign, the third as VUS and the fourth as likely benign. Given their high frequency, VUS CNVs in 1p36.32 and likely benign CNVs in 1q21.1 were reclassified as benign (i.e. non-pathogenic) CNVs in our Brazilian sample and already included in analyzes (Table S2 Table S2 - Recurrent gain CNVs in 1p36.32 and 1q21.1 showing their interpretation after re-classification accounting for CNV populational frequency. ).

Discussion

In our array sample, a higher number of gain CNVs (80.3%) compared to loss CNVs (19.7%) was found, in agreement with previous studies from the literature (Kang et al. 2008Kang Tae-wook, Yeo-jin Jeon, Eunsu Jang, Hee-jin Kim, Hwan Kim, Jong-lyul Park, Siwoo Lee, Yong Sung Kim, Jong Yeol Kim and Seon-young Kim (2008) Copy number variations (CNVs) identified in Korean individuals. BMC Genomics 9:492. ; Pietiläinen et al. 2011Pietiläinen OPH, Rehnström K, Jakkula E, Service SK, Congdon E, Tilgmann C, Hartikainen AL, Taanila A, Heikura U, Paunio T et al. (2011) Phenotype mining in CNV carriers from a population cohort. Hum Mol Genet 20:2686-2695.; Palmer et al. 2014Palmer E, Speirs H, Taylor PJ, Mullan G, Turner G, Einfeld S, Tonge B and Mowat D (2014) Changing interpretation of chromosomal microarray over time in a community cohort with intellectual disability. Am J Med Genet A 164A:377-85.). Most of the CNVs in the present study were classified as benign (92.9%), and about half (62.6%) of those were smaller than 500 kb. Furthermore, in accordance to the literature (Lee et al. 2007Lee C, Iafrate AJ and Brothman AR (2007) Copy number variations and clinical cytogenetic diagnosis of constitutional disorders. Nat Genet 39:S48-S54.), we observed that the CNV mean size increased according to pathogenicity, that is, the greater the imbalance, the greater the pathogenicity. We found that the CNVs distribution differed between platforms. Although the 6.0 SNP array platform was used in only 22% of the performed arrays, 49.6% of the CNVs were identified in this platform, with a mean of 12.6 CNVs/individual, while 750K and HD platforms showed averages of 2.9 and 3.9 CNVs/individual, respectively, that indicate a significant difference (p<0.001). The mean number of CNVs per patient was 5.6, being 12.6 for Human Array 6.0 SNP array, 2.9 for 750K array and 3.9 for HD array. These differences could be related to the characteristics of the arrays used. The 6.0 SNP array platform presents more than 1.8 million probes distributed along all the genome including probes in segmental duplication regions and in pseudogenes, while the 750 K and HD arrays, containing about 750,000 and 2.7 million probes, respectively, are more recent technologies than the 6.0 SNP array platform and are composed of probes with greater specificity and sensitivity, focusing in clinically relevant regions that results in a higher accuracy in genomic imbalances detection. It is important to highlight that this difference between platforms is not associated with QC metrics, since only high-quality genotyping reactions were considered in this analysis. Thus, we must consider that the variation of the array type, company origin, and the filtering used for CNV analysis, are factors that may difficult the comparison between published datasets and the CNVs reported in open databases. Some benign, VUS and likely benign CNVs were found to be recurrent in our sample and showed high frequency in four genomic regions (14q32.33, 22q11.22, 1p36.32, and 1q21.1), being exclusively gain CNVs. The 14q32.33 and 22q11.22 CNVs, found in all three different platforms, were classified as benign in all individuals due to overlap with CNVs from the DGVDatabase of Genomic Variants (DGV) (http://projects.tcag.ca/variation).
http://projects.tcag.ca/variation...
. In contrast, some 1q21.1 and 1p36.32 CNVs were reclassified considering their high frequency in our sample (Table S2 Table S2 - Recurrent gain CNVs in 1p36.32 and 1q21.1 showing their interpretation after re-classification accounting for CNV populational frequency. ). The 1p36.32 CNVs were found exclusively in 750k (35.2% patients) and HD (4.5%) platforms and were reclassified as benign CNVs due to their high frequency. Interestingly, among the seven CNVs found using the HD platform, five were first classified as benign by DGVDatabase of Genomic Variants (DGV) (http://projects.tcag.ca/variation).
http://projects.tcag.ca/variation...
, one was classified first as VUS and after reclassified as benign due their high frequency, and another was maintained as likely benign because it had a different genomic coordinates (chr1:2412626-2729513) comparing to the others.

The 1q21.1 gain CNVs were found only in the 6.0 SNP array platform (20.3% patients) and HD (3.9%) platforms. Among the 12 CNVs found in the 6.0 SNP array platform, only one was first classified as benign by DGVDatabase of Genomic Variants (DGV) (http://projects.tcag.ca/variation).
http://projects.tcag.ca/variation...
while the other 11 were further reclassified as benign ones. In the HD platform, the smaller five out the six CNVs found were classified as benign, and the other, larger (952kb), was reclassified as benign. It is important to note that, among the reclassified CNVs, all of them were larger than the CNV classified first as benign or presented little different coordinates. Thus, although these findings reveal differences in CNV distribution between platforms, there was a consistency in recurrent CNVs within each platform, which demonstrates the reliability of our classification criteria.

We looked for these recurrent CNVs in other studies from the literature. Benign 14q32.33 gain CNV was also found in a high frequency (>90%) in some studies from populations with European, African or East Asian ancestry (de Smith et al. 2007de Smith AJ, Tsalenko A, Sampas N, Scheffer A, Yamada NA, Tsang P, Ben-Dor A, Yakhini Z, Ellis RJ, Bruhn L et al. (2007) Array CGH analysis of copy number variation identifies 1284 new genes variant in healthy white males: Implications for association studies of complex diseases. Hum Mol Genet 16:2783-94. ; Conrad et al. 2010Conrad DF, Pinto D, Redon R, Feuk L, Gokcumen O, Zhang Y, Aerts J, Andrews TD, Barnes C, Campbell P et al. (2010) Origins and functional impact of copy number variation in the human genome. Nature 464:704-712.) but not in other studies in individuals from Ontario, Thailand and Caucasians and African-Americans (Sebat et al. 2004Sebat J, Lakshmi B, Troge J, Alexander J, Young J, Lundin P, Månér S, Massa H, Walker M, Chi M et al. (2004) Large-scale copy number polymorphism in the human genome. Science 305:525-28. ; Shaikh et al. 2009Shaikh TH, Gai X, Perin JC, Glessner JT, Xie H, Murphy K, O’Hara R, Casalunovo T, Conlin LK, D'Arcy M et al. (2009) High-resolution mapping and analysis of copy number variations in the human genome: A data resource for clinical and research applications. Genome Res 19:1682-1690. ; Suktitipat et al. 2014Suktitipat B, Naktang C, Mhuantong W, Tularak T, Artiwet P, Pasomsap E, Jongjaroenprasert W, Fuchareon S, Mahasirimongkol S, Chantratita W et al. (2014) Copy number variation in Thai population. PLoS One 9:e104355.; Uddin et al. 2015Uddin M, Thiruvahindrapuram B, Walker S, Wang Z, Hu P, Lamoureux S, Wei J, MacDonald JR, Pellecchia G, Lu C et al. (2015) A high-resolution copy number variation resource for clinical and population genetics. Genet Med 17:747-752.). The 22q11.22 gain CNV, considered as benign, had a higher frequency in our sample comparing with studies from DGVDatabase of Genomic Variants (DGV) (http://projects.tcag.ca/variation).
http://projects.tcag.ca/variation...
that also referred a CNV in this region (Itsara et al. 2009Itsara A, Cooper GM, Baker C, Girirajan S, Li J, Absher D, Krauss RM, Myers RM, Ridker PM, Chasman DI et al. (2009) Population analysis of large copy number variants and hotspots of human genetic disease. Am J Hum Genet 84:148-161.; Shaikh et al. 2009Shaikh TH, Gai X, Perin JC, Glessner JT, Xie H, Murphy K, O’Hara R, Casalunovo T, Conlin LK, D'Arcy M et al. (2009) High-resolution mapping and analysis of copy number variations in the human genome: A data resource for clinical and research applications. Genome Res 19:1682-1690. ; Cooper et al. 2011Cooper GM, Coe BP, Girirajan S, Rosenfeld JA, Vu TH, Baker C, Williams C, Stalker H, Hamid R, Hannig V et al. (2011) A copy number variation morbidity map of developmental delay. Nat Genet 43:838-46.). Deletions in the 22q11.2 region have already been described in patients with OAVS (Xu et al. 2008Xu J, Fan YS and Siu VM (2008) A child with features of Goldenhar Syndrome and a novel 1.12 Mb deletion in 22q11.2 by cytogenetics and oligonucleotide array CGH : Is this a candidate region for the syndrome?. Am J Med Genet A 146A:1886-1889. ; Digilio et al. 2009Digilio MC, McDonald-McGinn DM, Heike C, Catania C, Dallapiccola B, Marino B and Zackai EH (2009) Three patients with Oculo-Auriculo-Vertebral Spectrum and microdeletion 22q11.2. Am J Med Genet A 0:2860-2864.). Given the previous association of this region to the OAVS spectrum, there could result in a bias in the CNV frequency in this region in our sample. However, among the 268 individuals analyzed in the present study, this 22q11.22 CNV was found in the 30 out of 74 (40.5%) patients with a clinical diagnosis of OAVS and in 56 out of 194 (28.9%) patients with other diseases. Thus, this CNV was found not only in OAVs patients in our sample, but also in patients with different phenotypes, indicating that this CNV can be found in a higher frequency in the Brazilian population. The 1q21.1 gain CNV, classified at first as likely benign, was also considered as recurrent in our sample. In the DGVDatabase of Genomic Variants (DGV) (http://projects.tcag.ca/variation).
http://projects.tcag.ca/variation...
database, this variant was reported in only one study, in which a 1q21.1 gain was observed in two out of 29,084 individuals from the USA and Canada (Coe et al. 2014Coe BP, Witherspoon K, Rosenfeld JA, van Bon BWM, Vulto-van Silfhout AT, Bosco P, Friend KL, Baker C, Buono S, Vissers LELM et al. (2014) Refining analyses of copy number variation identifies specific genes associated with developmental delay. Nat Genet 46:1063-1071.). Due to the high frequency of this CNV in our sample, we reclassified this CNV as benign. Analogously, the 1p36.32 gain CNV, primely considered as VUS, was reclassified as benign due to its recurrence in our sample. This variant has a higher frequency in our sample when compared to cohorts described in DGVDatabase of Genomic Variants (DGV) (http://projects.tcag.ca/variation).
http://projects.tcag.ca/variation...
that detected this CNV in other ancestries (Redon et al. 2006Redon R, Ishikawa S, Fitch KR, Feuk L, Perry GH, Andrews TD, Fiegler H, Shapero MH, Carson AR, Chen W et al. (2006) Global variation in copy number in the human genome. Nature 444:444-454. ) (the HapMap collection). These differences in CNV frequencies between the reported data from the literature and our data may indicate distinct composition of the individuals studied since there is a great populational heterogeneity among the publications. However, these frequency divergencies may also reflect differences in the ability in CNV detection, since a diversity of approaches (such as ROMA, BAC-aCGHs, and SNP array) are used for gain and loss detection. This is a relevant factor that does not permit a reliable comparison between data obtained from different papers from the literature, which can show variability in the detection of certain CNVs, as shown in our data from three different platform used, even from the same company. The data obtained in our study indicated that the established analysis flowchart was highly effective in the classification of the CNVs’ pathogenicity and allowed the establishment of an CNVs database based on Brazilian individuals. This resource has been remarkably valuable for the diagnostic screenings in our laboratory, considering admixed genetic background of the Brazilian population. The data leveraged in this study may contribute for the pathogenicity interpretation of CNVs in other populations underrepresented in currently available open resources for structural variants.

Acknowledgements

The authors thank the patients and their parents for their participation in this study.

This study was supported by Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP) (grant 2014/11572-8 to M.I.M.) and Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES).

References

  • Alkan C, Coe BP and Eichler EE (2011) Genome structural variation discovery and genotyping. Nat Rev Genet 12:363-76.
  • Andrade RB, Amador MAT, Cavalcante GC, Leitão LPC, Fernandes MR, Modesto AAC, Moreira FC, Khayat AS, Assumpção PP, Ribeiro-dos-Santos A et al (2018). Estimating Asian contribution to the Brazilian population: A new application of a validated set of 61 ancestry informative markers. G3 Genes/Genomes/Genetics 8:3577-3582
  • Brnich S, Abou Tayoun A, Greenblatt MS, Heinen CD, Kanavy D, Luo X, McNulty SM, Starita LM, Tavtigian SV et al (2019). PgmNr 352: Recommendations for determining the clinical validity of functional studies for use in variant interpretation. In: American Society of Human Genetics 69th Annual Meeting, p 445.
  • Coe BP, Witherspoon K, Rosenfeld JA, van Bon BWM, Vulto-van Silfhout AT, Bosco P, Friend KL, Baker C, Buono S, Vissers LELM et al (2014) Refining analyses of copy number variation identifies specific genes associated with developmental delay. Nat Genet 46:1063-1071.
  • Collins RL, Brand H, Karczewski KJ, Zhao X, Alföldi J, Khera AV, Francioli LC, Gauthier LD, Wang H, Watts NA et al (2019) PgmNr 351: gnomAD-SV: An open resource of structural variation for medical and population genetics. In: American Society of Human Genetics 69th Annual Meeting, p. 443.
  • Conrad DF, Pinto D, Redon R, Feuk L, Gokcumen O, Zhang Y, Aerts J, Andrews TD, Barnes C, Campbell P et al (2010) Origins and functional impact of copy number variation in the human genome. Nature 464:704-712.
  • Cooper GM, Coe BP, Girirajan S, Rosenfeld JA, Vu TH, Baker C, Williams C, Stalker H, Hamid R, Hannig V et al (2011) A copy number variation morbidity map of developmental delay. Nat Genet 43:838-46.
  • Digilio MC, McDonald-McGinn DM, Heike C, Catania C, Dallapiccola B, Marino B and Zackai EH (2009) Three patients with Oculo-Auriculo-Vertebral Spectrum and microdeletion 22q11.2. Am J Med Genet A 0:2860-2864.
  • Gijsbers AC, Schoumans J and Ruivenkamp CA (2011) Interpretation of array comparative genome hybridization data: a major challenge. Cytogenet Genome Res 135:222-7.
  • Girirajan S, Campbell CD and Eichler EE (2011) Human copy number variation and complex genetic disease. Annu Rev Genet 45:203-226.
  • Iafrate AJ, Feuk L, Rivera MN, Listewnik ML, Donahoe PK, Qi Y, Scherer SW and Lee C (2004) Detection of large-scale variation in the human genome. Nat Genet 36:949-51.
  • Itsara A, Cooper GM, Baker C, Girirajan S, Li J, Absher D, Krauss RM, Myers RM, Ridker PM, Chasman DI et al (2009) Population analysis of large copy number variants and hotspots of human genetic disease. Am J Hum Genet 84:148-161.
  • Kang Tae-wook, Yeo-jin Jeon, Eunsu Jang, Hee-jin Kim, Hwan Kim, Jong-lyul Park, Siwoo Lee, Yong Sung Kim, Jong Yeol Kim and Seon-young Kim (2008) Copy number variations (CNVs) identified in Korean individuals. BMC Genomics 9:492.
  • Korbel JO, Urban AE, Affourtit JP, Godwin B, Grubert F, Simons JF, Kim PM, Palejev D, Carriero NJ, Du L et al (2007) Paired-end mapping reveals extensive structural variation in the human genome. Science 318:420-426.
  • Lee C, Iafrate AJ and Brothman AR (2007) Copy number variations and clinical cytogenetic diagnosis of constitutional disorders. Nat Genet 39:S48-S54.
  • de Leeuw N, Dijkhuizen T, Hehir-Kwa JY, Carter NP, Feuk L, Firth HV, Kuhn RM, Ledbetter DH, Martin CL, van Ravenswaaij-Arts CMA et al (2012) Diagnostic interpretation of array data using public databases and internet sources. Hum Mutat 33:930-940.
  • Li J, Yang T, Wang L, Yan H, Zhang Y, Guo Y, Pan F, Zhang Z, Peng Y, Zhou Q et al (2009) Whole genome distribution and ethnic differentiation of copy number variation in caucasian and asian populations. PLoS One 4:e7958.
  • Li JZ, Absher DM, Tang H, Southwick AM, Casto AM, Ramachandran S, Cann HM, Barsh GS, Feldman M, Cavalli-Sforza LL et al (2008) Worldwide human relationships inferred from genome-wide patterns of variation. Science 319:1100-4
  • MacDonald JR, Ziman R, Yuen RK, Feuk L and Scherer SW (2014) The database of genomic variants: A curated collection of structural variation in the human genome. Nucleic Acids Res 42:986-92.
  • Naslavsky MS, Yamamoto GL, de Almeida TF, Ezquina SAM, Sunaga DY, Pho N, Bozoklian D, Sandberg TOM, Brito LA, Lazar M et al (2017) Exomic variants of an elderly cohort of brazilians in the ABraOM database. Hum Mutat 38:751-763.
  • Nowakowska B (2017) Clinical interpretation of copy number variants in the human genome. J Appl Genet 58:449-457.
  • Palmer E, Speirs H, Taylor PJ, Mullan G, Turner G, Einfeld S, Tonge B and Mowat D (2014) Changing interpretation of chromosomal microarray over time in a community cohort with intellectual disability. Am J Med Genet A 164A:377-85.
  • Pietiläinen OPH, Rehnström K, Jakkula E, Service SK, Congdon E, Tilgmann C, Hartikainen AL, Taanila A, Heikura U, Paunio T et al (2011) Phenotype mining in CNV carriers from a population cohort. Hum Mol Genet 20:2686-2695.
  • Redon R, Ishikawa S, Fitch KR, Feuk L, Perry GH, Andrews TD, Fiegler H, Shapero MH, Carson AR, Chen W et al (2006) Global variation in copy number in the human genome. Nature 444:444-454.
  • Richards S, Aziz N, Bale S, Bick D, Das S, Gastier-Foster J, Grody WW, Hegde M, Lyon E, Spector E et al (2015) Standards and guidelines for the interpretation of sequence variants: A joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet Med 17:405-24.
  • Riggs E, Andersen E, Cherry A, Kantarci S, Kearney H, Patel A, Raca G, Ritter D, South S, Thorland E et al (2019) PgmNr 353: Validation of scoring metrics to guide the classification of constitutional copy number variants. In: American Society of Human Genetics 69th Annual Meeting, p. 447.
  • Riggs E, Andersen E, Cherry A, Kantarci S, Kearney H, Patel A, Raca G, Ritter DI, South S, Thorland E et al (2019) Technical standards for the interpretation and reporting of constitutional copy-number variants: A joint consensus recommendation of the American College of Medical Genetics and Genomics (ACMG) and the Clinical Genome Resource (ClinGen). Genet Med 22:245-257.
  • Sebat J, Lakshmi B, Troge J, Alexander J, Young J, Lundin P, Månér S, Massa H, Walker M, Chi M et al (2004) Large-scale copy number polymorphism in the human genome. Science 305:525-28.
  • Shaikh TH, Gai X, Perin JC, Glessner JT, Xie H, Murphy K, O’Hara R, Casalunovo T, Conlin LK, D'Arcy M et al (2009) High-resolution mapping and analysis of copy number variations in the human genome: A data resource for clinical and research applications. Genome Res 19:1682-1690.
  • de Smith AJ, Tsalenko A, Sampas N, Scheffer A, Yamada NA, Tsang P, Ben-Dor A, Yakhini Z, Ellis RJ, Bruhn L et al (2007) Array CGH analysis of copy number variation identifies 1284 new genes variant in healthy white males: Implications for association studies of complex diseases. Hum Mol Genet 16:2783-94.
  • Stankiewicz P and Lupski JR (2010) Structural variation in the human genome and its role in disease. Annu Rev Med 61:437-55.
  • Suktitipat B, Naktang C, Mhuantong W, Tularak T, Artiwet P, Pasomsap E, Jongjaroenprasert W, Fuchareon S, Mahasirimongkol S, Chantratita W et al (2014) Copy number variation in Thai population. PLoS One 9:e104355.
  • Uddin M, Thiruvahindrapuram B, Walker S, Wang Z, Hu P, Lamoureux S, Wei J, MacDonald JR, Pellecchia G, Lu C et al (2015) A high-resolution copy number variation resource for clinical and population genetics. Genet Med 17:747-752.
  • Vermeesch JR, Brady PD, Sanlaville D, Kok K and Hastings RJ (2012) Genome-wide arrays: Quality criteria and platforms to be used in routine diagnostics. Hum Mutat 33:906-15.
  • Xu J, Fan YS and Siu VM (2008) A child with features of Goldenhar Syndrome and a novel 1.12 Mb deletion in 22q11.2 by cytogenetics and oligonucleotide array CGH : Is this a candidate region for the syndrome?. Am J Med Genet A 146A:1886-1889.
  • Zarrei M, MacDonald JR, Merico D and Scherer SW (2015) A copy number variation map of the human genome. Nat Rev Genet 16:172-83.

Internet Resources

Data availability

Data citations

Database of Genomic Variants (DGV) (http://projects.tcag.ca/variation).

Database of genomic variation and phenotype in humans using ensembl Resources (Decipher) (http://decipher.sanger.ac.uk).

ClinGen (https://www.clinicalgenome.org/).

PubMed (http://www.ncbi.nlm.nih.gov/pubmed/).

Publication Dates

  • Publication in this collection
    13 Nov 2020
  • Date of issue
    2020

History

  • Received
    12 July 2019
  • Accepted
    25 Sept 2020
Sociedade Brasileira de Genética Rua Cap. Adelmio Norberto da Silva, 736, 14025-670 Ribeirão Preto SP Brazil, Tel.: (55 16) 3911-4130 / Fax.: (55 16) 3621-3552 - Ribeirão Preto - SP - Brazil
E-mail: editor@gmb.org.br