Open Access is an initiative that aims to make scientific research freely available to all. To date our community has made over 100 million downloads. It’s based on principles of collaboration, unobstructed discovery, and, most importantly, scientific progression. As PhD students, we found it difficult to access the research we needed, so we decided to create a new Open Access publisher that levels the playing field for scientists across the world. How? By making research easy to access, and puts the academic needs of the researchers before the business interests of publishers.
We are a community of more than 103,000 authors and editors from 3,291 institutions spanning 160 countries, including Nobel Prize winners and some of the world’s most-cited researchers. Publishing on IntechOpen allows authors to earn citations and find new collaborators, meaning more people see your work not only from your own field of study, but from other related fields too.
To purchase hard copies of this book, please contact the representative in India:
CBS Publishers & Distributors Pvt. Ltd.
www.cbspd.com
|
customercare@cbspd.com
Electrical Engineering Department, College of Engineering and Petroleum, Kuwait University, Kuwait
Fuad M. Alkoot
The Higher Institute of Telecommunication and Navigation, Public Authority for Applied Education and Training, Kuwait
Mash’el S. Aldabbous
Department of Biological Sciences, College of Science, Kuwait University, Kuwait
*Address all correspondence to: al.qallaf@ku.edu.kw
1. Introduction
Autism is a complex neurodevelopmental disorder. It is characterized by social isolation, language deficits and repetitive or stereotyped behaviors. Autism spectrum disorder (ASD) has received a great deal of attention in the recent years not only due to the increasing rate of affected children but also because of the social and economical impact of the disorder on their families. Various studies and researches have been proposed to deal with and tackle the ASD. They can be divided into three categories as follows.
The basis and causes of the disorder. Different hypotheses have been proposed in an attempt to determine and discover the originality of autism. Genetic risk factors represented by abnormal chromosomal variations and rearrangements, and non-genetic factors represented by environmental agents that have been claimed to contribute to ASD, such as exposure of children to vaccines, infection, certain foods or heavy metals.
The methodologies and techniques for characterizing and diagnosing the disorder. Several instrumental diagnostic protocols are commonly used in autism research such as the Autism Diagnostic Interview-Revised (ADI-R) and the Autism Diagnostic Observation Schedule (ADOS). The advances in neuro-imaging techniques such as the functional-Magnetic Resonance Imaging (f-MRI) have allowed scientists to model the structural and functional differences in the human brain tissues of the individuals with ASD. The clinical genetics evaluation provide reliable alternative to the interview-based protocols and screening approaches. It allows geneticists to link an estimate of approximately 40% of the cases to genetic contributors.
The treatments and therapies of autistic patients. The available approaches for treatments include applied behavior analysis (ABA), developmental models, structured teaching, speech and language therapy and social skills therapy. When behavioral treatment fails, many medications are used to treat ASD symptoms.
Figure 1 demonstrates the interaction of the autism spectrum disorders researches and studies.
The advancements of the technologies in the field of genetics provide the opportunities for researchers and scientists to explore in depth the biological information and to convert it into meaningful biological knowledge through computational-based models.
In this chapter, we will investigate the genetics origins of autism and demonstrate the latest techniques and technologies available for diagnosing the complex disorder. We will also propose a robust approach for detecting and identifying the targeted disorder based upon the advantages and strengths of the publically available and commercial approaches while avoiding their weaknesses. The proposed approach is divided into two steps. The preprocessing step is a feature-extraction method used to clearly map and detect the genetic variations and structural rearrangements followed by a statistical-based model as feature-selection to evaluate and measure the statistical and biological significance of the predicted variations. The classification step is to discover the relationship among the tested samples into groups and/or subgroups, and to provide insight into the complex pattern of the genome.
The results suggest that autism is associated with an increased amount of alterations in unstable segments of the genome. The experimental results also show that using high-resolution custom-tiled samples improve the accuracy of our proposed approach in determining previously reported and new genetic contributors that warrant investigation.
This chapter aims at utilizing research to bring benefits to individuals and families affected by autism spectrum disorders and to improve the quality of their life. And this can be done by clear mapping and identifying the biomarkers associated with ASD at the early childhood stages which are essential to provide better treatments and therapies. Finally, the proposed approach presented in this chapter is broadly applicable to case-control studies of genetic diseases beyond the ASD.
The chapter is organized as follows. In section 2, we demonstrate the genetic data generating techniques, data modeling and chromosomal variations that are associated with the targeted disorder, ASD. Section 3 is devoted for the methods used to analyze the genetic data trying to discover the variant regions along the genome and to identify the tested samples. In section 4, we apply molecular test to evaluate the predictive power of the proposed approach. Finally, discussion and conclusion based on the results are presented in section 5.
2.1. Genomic structural variations and ASD susceptibility
Genetic alterations in the form of chromosomal rearrangements are genomic structural variations that lead to changes in the DNA copy number such as duplications and deletions of the DNA copies. However, copy number changes do not include other genomic structural variations such as inversions, insertions and reciprocal translocations. Figure 2 demonstrates different types of chromosomal rearrangements.
Chromosome region
Gene
Phenotype
Mendelian Syndromes
6q23.3
AHI1
Joubert syndrome
7q35-q36.1
CNTNAP2
Recessive EPI syndrome, ASD, ADHD, TS, OCD
9q34.13
TSC1
Tuberous Sclerosis type I
10q23.31
PTEN
Cowden disease*
11q13.4
DHCR7
Smith-Lemli-Opitz syndrome
12p13.33
CACNA1C
Timothy syndrome
15q11.2
UBE3A
Angelman syndrome
16p13.3
TSC2
Tuberous Sclerosis type II
17q11.2
NF1
Neurofibromatosis
Xp21.2
DMD
Duchenne muscular dystrophy
Xp21.3
ARX
LIS, XLID, EPI, ASD
Xp22.13
CDKL5
X-linked infantile spasm syndrome
Xq27.3
FMR1
Fragile X syndrome
Xq28
MECP2
Rett syndrome
Rare Variants
1q21.1
NBPF9
ASD, ID, SCZ, ADHD, EPI
2p16.3
NRXN1
ASD, ID, language delay, SCZ.
3p13
FOXP1
ID, ASD, SLI
6q16.3
GRIK2
Recessive ID
7q11.23
FKBP6/CLIP2
ASD, ID, language delay
7q31.1
FOXP2
SLI
11q13.3-q13.4
SHANK2
ASD, ID
15q11-q13
MAGEL2/ NDN
ASD, EPI, ID
16p11.2
VPS35/ORC6
ASD, ADHD, ID, EPI, SCZ
16p13.3
A2BP1
ID, ASD, EPI, SCZ, ADHD
17q11.2
SLC6A4
ASD, OCD
17q12
ACCN1/PNMT
ASD, SCZ, EPI
22q11.21
DiGeorge syndrome, SCZ, ASD, ID.BPAD
22q13.33
SHANK3
ASD, Phelan McDermid syndrome**
Xq13.1
NLGN3
ASD
Xp22.11
PTCHD1
ASD, ID
Xp22.32-p22.31
NLGN4X
ASD, ID, TS, ADHD
Common Alleles
1q42.2
DISC1
SCZ,BPAD
2q31.1
SLC25A12
ASD
3p25.3
OXTR
ASD
7q31.2
MET
ASD, Diabetes II
7q22.1
RELN
ASD
7q36.3
EN2
ASD
12q14.2
AVPR1A
ASD
17q21.32
ITGB3
ASD
Table 1.
Chromosomal regions and genes that are implicated in risk for ASD, and associated genetic disorders and syndroms [68& 69].
*A rare autosomal dominant inherited disorder characterized by multiple tumor-like growths, increased risk of certain forms of cancer, and diverse clinical features including neurologic features such as autism and Lhermitte Duclos disease [39& 40].
** A genetic syndrome caused by disruption of the SHANK3 gene which codes for the shank3 protein. The protein most important role is in the brain. It is involved in processes crucial for learning and memory. It also has an important role in brain development. It is also known as 22q13.3 deletion syndrome and is highly associated with autism.
Human (Homo sapiens) Genome Browser Gateway, http://genome.ucsc.edu/cgi-bin/hgGateway.
A set of chromosomal regions and genes that are implicated with ASD are listed in Table 1. Some of the regions are associated with known Mendelian syndromes. In some individuals affected with these syndromes, ASD occurs as a secondary diagnosis. In other regions and genes, genetic variations causing ASD include a wide range of possibilities each with very low frequency among the cases (rare variants). In some cases the rare variants are found only once in the population. In contrast to rare variants we see that in other chromosomal regions and genes only few common genetic variations (common alleles) account for ASD susceptibility.
2.2. Data Generating
Figure 3 illustrates the process of generating DNA copy number data using Microarray-based comparative genomic hybridization (array CGH) technology.
2.3. Data Modeling
As illustrated in Figure 3, aCGH technology is an experimental approach for genome-wide scanning of differences in DCN samples. It provides a high-resolution method to map and measure relative changes in DCN simultaneously at thousands of genomic loci. In a biological experiment, unknown (test) and reference (normal) DNA samples are labeled with fluorescent dyes Cy3 and Cy5, respectively. Then, they are combined and competitively co-hybridized to an array containing genomic DNA targets that have been spotted on a glass slide. The resulting ratio of the fluorescence intensities is proportional to the ratio of the copy numbers of DNA sequences in the test and reference genomes measured in a logarithmic scale for a certain genomic location. These intensity ratios are informative about DNA copy number changes. We expect to see duplication (gain) for positive ratio, deletion (loss) for negative ratio and normal state for neutral ratio. Due to the logarithmic scale and the probes performance, the data can be approximated as a piecewise function of short and long intervals with different intensity levels that are not equally-spaced along the genome. Moreover, microarray experiments suffer from many sources of error due to human factors, array printer performance, labeling, and hybridization efficiency.
According to the data description and properties generated by microarray technology, the DCN cell line can be approximated as a one-dimensional piecewise constant (PWC) discrete-time signal contaminated with some error. A good model of the genetic data generated by the aCGH technology can be model as follows.
E1
where y[n] is the contaminated genetic signal and x[n] is the true value of the genetic variation to be estimated at genomic location n of the length N. εn is assumed to be modeled as additive wihte Gaussian noise with zero mean and some variance σ2.
As described in (1), Figure 4 illustrates the genetic data in the form of DNA copy number generated by aCGH technology where 4 variant segements are presented with different intensity levels.
Although the recent advantecment in microarray technologies and sequencing now make it easy to measure the genetic variations with high-resolution through scanning large number of samples, small changes, particularly at the low copy repeat (LCRs) regions, remain difficult to detect due to different noise conditions. Thus, the challenging problem is to differentiate between the true biological signaling and the noise measurements.
Various methods have been proposed as preprocessing techniques to tackle this problem. These methods have been motivated by either well-known signal processing techniques or statistical-based models.
METHOD
COMPUTATIONAL
SMOOTHING TECHNIQUES
COMPLEXITY
SIGMA FILTERING (Alqallaf et al., 2007)
O(N)
SMOOTHING AND EDGE DETECTION (Huang et al., 2004)
O(N)
WAVELETS (Hsu et al., 2005)
O(N log N)
STATISTICAL-BASED MODELS
CIRCULAR BINARY SEGMENTATION (Olshen et al., 2004)
O(N2)
HIDDEN MARKOV MODELS (Fridlyand et al., 2004)
O(C2N)
SPARSE BAYESIAN LEARNING (Pique-Regi et al., 2008)
O(N log N)
Table 2.
Comparison based on the computational complexity of the proposed denoising techniques.
In Table 2, we present a comparison study based on the computational cost of the most recent and successful approaches. As can be noticed that the smoothing techniques are well suited to process very large amount of data such as the genetic signals compared to the statistical-based models. However, these techniques include important features such as the variant regions boundaries in the smoothing process.
Here we present our previously proposed method (Alqallaf et al., 2007), Sigma filter (SF). It is a nonlinear method used as a feature extraction to detect the variant segments edges and to smooth the rest of the genetic data. The filter is conceptually simple but effective noise smoothing algorithm. Based on the assumption of the aCGH data modeling, the SF algorithm is well suited to denoise the tested samples before further analysis. SF algorithm is motivated by the sigma probability of the Gaussian distribution, and it smooths the noise by averaging only those neighborhood variant segments which have the intensities within a fixed sigma range of the center data point. Consequently, variant segmets edges are preserved, and subtle details are retained.
3.2. Statistical significance
Few studies in the literature have addressed the power of class discovery of the recurrent copy number variations (CNVs) across multiple samples of the genetic data [52& 53]. However, they did not consider denoising the data prior to applying the statistical analysis.
To reduce the dimensionality of the detected variant regions, we apply a simple statistical-based approach to measure the significance of the candidate gemonic regions. The approach is based on the frequency difference between the case and control samples at each gemonic location. It is used as a feature selection algorithm to select a small subset of variant segments as features for classification. Figure 5 is an illustration of three RCVNs with different sizes of filtered DCN data for multiple samples of normal control (Ci) and autistic (Ai) individuals, respectively. After selecting the informative segments of the genome, we then applied comparative classification algorithms on the reduced data.
3.3. Data classification
Based on the collected and processed genetic data, we apply a system of classifiers that are used to identify autistic individuals based on their genetic information. This system will help improve detection, identification and diagnosis of autism, which will benefit both the patients and the society in general and will lead to early diagnosis and treatment.
Generally classifiers are used by researchers faced with the task of classification based on a given data. Classifiers are mathematical models that are able to perform the task of classification or decision making, based on a previously provided data. Classifier’s ability to spot trends and relationships in large data sets makes it well suited for many applications. In the field of medicine classifiers can be used to classify accurately diseases, genes, tumors, and other medical phenomena [54; 55; 56; 57; 58; 59& 60]. Although some attempts were made to use classifiers in genetics [61]. Our attempt is to use three comparative classifiers, namely, k-Nearest Neighbor, Neural Network, and Support Vector Machine, to help in diagnosing patients with ASD.
The leave-one-out cross-validation (LOOCV) is applied to evaluate the proposed classifiers by measuring the classification performance to accurately identify the association between the tested samples and the targeted disorder, ASD. The LOOCV involves using a single variant segment from the original sample as the validation data, and the remaining segments as the training data. This is repeated such that each variant segment in the sample is used once as the validation data.
3.3.1. k-Nearest Neighbor Classifier
The k-Nearest Neighbor (k-NN) classifier [64] is a well known nonparametric classifier. To classify a new input x, the k nearest neighbors are retrieved from the training data. The input x is then labeled with the majority class label corresponding to the k nearest neighbors. For the k-NN classifier, we used the Euclidean distance as the distance metric, and the best k between 1 and 10 was found by performing LOOCV on the training data.
3.3.2. Neural Network
Neural networks are another type of classifier or mathematical models used for classification, regression or decision making. Their structure is inspired by the human neural system and brain. It consists of many neurons, interconnected at different stages. The direction of flow of information is usually from the input stage to the output stage. Each neuron has an input and an output, where an activation function converts a neurons input to its output. The output of each neuron is connected to the next stage through a weighted connection. A learning function determines the value of the weights of all the connections. The weights are updated based on a mathematical function that relates the network together. Therefore, a neural network is considered as an adaptive network that changes its structure during the learning or training phase, based on mathematical functions that relate input data to the corresponding class labels. The sum of all neurons at the different layers and the weighted interconnections make up a complex network that is commonly referred to as a black box.
Before its use to classify a test sample, the neural network is trained on a given data set with known classes or labels. During the training phase the weights are updated to minimize the output error. The selected value of the minimum acceptable error determines when the training stops. For a difficult data where it is impossible to reach the set minimum error, the maximum number of epochs is used as criteria for stopping the training process.
3.3.3. Support Vector Machine
The Support Vector Machine (SVM) belongs to a new generation of learning system based on recent advances in statistical learning theory [65]. A linear SVM, which is used in our system, aims to find the separating hyper-plane with the largest margin, defined as the sum of the distances from a hyper-plane (implied by a linear classifier) to the closest positive and negative exemplars. The expectation is that the larger the margin, the better the generalization of the classifier. In a non-separable case, a linear SVM seeks a trade-off between maximizing the margin and minimizing the number of errors.
Figure 6 illustrates the LOOCV classification accuracies using the tested classifiers, k-NN, NN, and SVM. The x-axis is associated with the number of selected top-ranked variant segments and the y-axis shows the average LOOCV accuracy.
To evaluate our predictive power of our method in detecting and identifying patients with ASD, we use molecular test, quantitative Polymerase Chain Reaction (qPCR). It is a very sensitive and precise tool used for the quantification of nucleic acids. It can detect and quantify very small amounts of specific nucleic acid sequence. It is based on the method of PCR, developed by Kary Mullis in the 1980s. It allows the amplification of specific nucleic acid sequence (DNA) more than a billion-fold. Using qPCR allows scientists to quantify the starting amount of a specific DNA sequence in the sample before the amplification by PCR method [62].
Quantitative PCR is an indispensable tool for researchers in various fields including fundamental biology, molecular diagnostics, biotechnology, and forensic sciences. Critical points and limitations of qPCR-based assays must be considered to increase the reliability of the obtained data. For the detection of qPCR four technologies are commonly used all of which are based on the measurement of fluorescence during the PCR. One principle is based on intercalation of double-stranded DNA-binding dyes (simplest and cheapest). The other three principles are based on the introduction of an additional fluorescence-labeled oligonucleotide (probe). Detectable fluorescence are only released either after cleavage of the probe (hydrolysis probes) or during hybridization of one (molecular beacon) or two (hybridization probes) oligonucleotides to the amplicon. The introduction of an additional probe increases the specificity of the quantified PCR product and allows the development of multiplex reactions. Other technologies have been described for the detection of qPCR [63].
The qPCR method quickly became the first choice when it comes to quantitative analysis of nucleic acid because of many reasons. It is highly sensitive and it allows the detection of less than five copies (one copy in some cases) of a target sequence. It has good reproducibility. In addition, it has broad dynamic quantification range, at least 5 log units. It is also easy to use and has reasonable good value for money (low consumable and instrumentation costs).
For the purpose of this chapter, we are focusing on one of the many applications of qPCR, which is indispensable for research and diagnostics, the genetic variations.
Array
CBS
SF
1
14
20
2
10
20
3
20
21
4
13
21
Table 3.
Representation of the number of events (CNVs) detected by the circular binary segmentation (CBS) and sigma filtering methods, respectively, for 22 qPCR confirmed CNVs.
Table 3 shows that the number of qPCR-confirmed CNVs detected by the sigma filtering (SF) method is considerably higher than those detected using the circular binary segmentation (CBS), ranging from 4.5% to 36% more for 4 different array experiments. The results show that applying the averaging window of 2Kb allow the algorithms to be well suited for detecting variations in high-density microarray data, especially at the LCR-rich regions.
The etiology of Autism spectrum disorders involves genetic and environmental risk factors. In this chapter, we have discussed the genetics basis of the complex disorder, autism. With the recent advances in the new screening technologies to investigate the entire genome such as array comparative genomic hybridization (aCGH) and whole genome sequencing, provide the opportunities insight into the pattern of the genetic variations and reveal their roles in the genetic diseases. In this study, we have demonstrated an overview for the analysis of genetic variations in the form of DNA copy number changes and their association with autism susceptibility.
Through mathematical-based models and computational-based approaches, we analyze the genetic data trying to discover and identify the relationship between the structural chromosomal rearrangements along the genome and the targeted disorder, ASD. In conclusion, the results show strong evidence that the genetic variations contribute in the complex disorder, autism.
References
1.FreemanJ. L.PerryG. H.FeukL.RedonR.Mc CarrollS. A.AltshulerD. M.AburataniH.JonesK. W.Tyler-SmithC.MEHurles2006Copy number variation: new insights in genome diversity.168949961
2.LeeJ. A.LupskiJ. R.2006Genomic rearrangements and gene copy-number alterations as a cause of nervous system disorders521103121
3.StankiewiczP.LupskiJ. R.2002Molecular-evolutionary mechanisms for genomic disorders.123312319
5.MarshallC. R.NoorA.VincentJ. B.LionelA. C.FeukL.SkaugJ.ShagoM.MoessnerR.PintoD.RenY.2008Structural variation of chromosomes in autism spectrum disorder822477488
6.SebatJ.LakshmiB.MalhotraD.TrogeJ.Lese-MartinC.WalshT.YamromB.YoonS.KrasnitzA.KendallJ.2007Strong association of de novo copy number mutations with autism.3165823445449
7.KirovG.GumusD.ChenW.NortonN.GeorgievaL.SariM.O’DonovanM. C.ErdoganF.MJOwenRopersH. H.2008Comparative genome hybridization suggests a role for NRXN1 and APBA2 in schizophrenia173458465
8.SharpA. J.HansenS.SelzerR. R.ChengZ.ReganR.HurstJ. A.StewartH.PriceS. M.BlairE.HennekamR. C.2006Discovery of previously unidentified genomic disorders from the duplication architecture of the human genome.38910381042
9.SharpA. J.MeffordH. C.LiK.BakerC.SkinnerC.StevensonR. E.SchroerR. J.NovaraF.De GregoriM.CicconeR.2008A recurrent 15q13.3 microdeletion syndrome associated with mental retardation and seizures403322328
11.StefanssonH.RujescuD.CichonS.PietilainenO. P.IngasonA.SteinbergS.FossdalR.SigurdssonE.SigmundssonT.Buizer-VoskampJ. E.2008Large recurrent microdeletions associated with schizophrenia.4557210232236
12.StoneJ. L.O’DonovanM. C.GurlingH.KirovG. K.BlackwoodD. H.CorvinA.CraddockN. J.GillM.HultmanC. M.LichtensteinP.2008Rare chromosomal deletions and duplications increase risk of schizophrenia4557210237241
13.WalshT.Mc ClellanJ. M.Mc CarthyS. E.AddingtonA. M.PierceS. B.CooperG. M.ASNordKusendaM.MalhotraD.BhandariA.2008Rare structural variants disrupt multiple genes in neurodevelopmental pathways in schizophrenia3205875539543
14.EstivillX.ArmengolL.2007Copy number variants and common disorders: filling the gaps and exploring complexity in genome-wide association studies.31017871799
15.GreerM. K.BrownF. R.PaiG. S.ChoudryS. H.KleinA. J.1997Cognitive, adaptive, and behavioral characteristics of Williams syndrome.745521525
16.SomervilleM. J.MervisC. B.YoungE. J.SeoE. J.del CampoM.BamforthS.PeregrineE.LooW.LilleyM.Perez-JuradoL. A.2005Severe expressive-language delay related to duplication of the Williams-Beuren locus.3531616941701
17.Aavan der N.RoomsL.VandeweyerG.van denEnde. J.ReyniersE.FicheraM.RomanoC.DelleChiaie. B.MortierG.MentenB.2009Fourteen new cases contribute to the characterization of the 7q11.23 microduplication syndrome522-394100
18.DepienneC.HeronD.BetancurC.BenyahiaB.TrouillardO.BouteillerD.VerloesA.Le GuernE.LeboyerM.BriceA.2007Autism, language delay and mental retardation in a patient with 7q11 duplication447452458
19.BalciunieneJ.FengN.IyaduraiK.HirschB.CharnasL.BillB. R.EasterdayM. C.StaafJ.OsethL.Czapansky-BeilmanD.2007Recurrent 10q22-q23 deletions: a genomic disorder on 10q associated with cognitive and behavioral abnormalities805938947
20.LaumonnierF.RogerS.GuerinP.MolinariF.M’RadR.CahardD.BelhadjA.HalayemM.PersicoA. M.EliaM.2006Association of a functional deficit of the BKCa channel, a synaptic regulator of neuronal excitability, with autism and mental retardation.163916221629
23.PapolosD. F.FaeddaG. L.VeitS.GoldbergR.MorrowB.KucherlapatiR.ShprintzenR. J.1996Bipolar spectrum disorders in patients diagnosed with velo-cardio-facial syndrome: does a hemizygous deletion of chromosome 22q11 result in bipolar affective disorder?1531215411547
24.NiklassonL.RasmussenP.OskarsdottirS.GillbergC.2001Neuropsychiatric disorders in the 22q11 deletion syndrome.317984
25.AntonellA.de LuisO.Domingo-RouraX.Perez-JuradoL. A.2005Evolutionary mechanisms shaping the genomic structure of the Williams-Beuren syndrome chromosomal region at human 7q11.23.15911791188
26.VorstmanJ. A.MorcusM. E.DuijffS. N.KlaassenP. W.Heineman-deBoer. J. A.BeemerF. A.SwaabH.KahnR. S.van EngelandH.2006The 22q11.2 deletion in children: high rate of autistic disorders and early onset of psychotic symptoms45911041113
27.Hertz-PicciottoI.CroenL. A.HansenR.JonesC. R.van de WaterJ.PessahI. N.2006The CHARGE study: an epidemiologic investigation of genetic and environmental factors contributing to autism114711191125
28.RedonR.IshikawaS.FitchK. R.FeukL.PerryG. H.AndrewsT. D.FieglerH.ShaperoM. H.CarsonA. R.ChenW.2006Global variation in copy number in the human genome.4447118444454
29.ShenF.HuangJ.FitchK. R.TruongV. B.KirbyA.ChenW.ZhangJ.LiuG.Mc CarrollS. A.JonesK. W.2008Improved detection of global copy number variation using high density, non-polymorphic oligonucleotide probes927
30.ConradD. F.PintoD.RedonR.FeukL.GokcumenO.ZhangY.AertsJ.AndrewsT. D.BarnesC.CampbellP.2010Origins and functional impact of copy number variation in the human genome4647289704712
31.PerryG. H.Ben-DorA.TsalenkoA.SampasN.Rodriguez-RevengaL.TranC. W.SchefferA.SteinfeldI.TsangP.YamadaN. A.2008The fine-scale and complex architecture of human copy-number variation823685695
32.BergJ. S.Brunetti-PierriN.PetersS. U.KangS. H.FongC. T.SalamoneJ.FreedenbergD.HannigV. L.ProckL. A.MillerD. T.2007Speech delay and autism spectrum behaviors are frequently associated with duplication of the 7q11.23 Williams-Beuren syndrome region.97427441
33.PotockiL.BiW.Treadwell-DeeringD.CarvalhoC. M.EifertA.FriedmanE. M.GlazeD.KrullK.LeeJ. A.LewisR. A.2007Characterization of Potocki-Lupski syndrome (dup(17)(p11.2p11.2)) and delineation of a dosage-sensitive critical interval that can convey an autism phenotype.804633649
34.CarvalhoC. M.LupskiJ. R.2008Copy number variation at the breakpoint region of isochromosome 17q181117241732
35.MukaddesN. M.HergunerS.2007Autistic disorder and 22q11.2 duplication.82127130
36.BarabashA.MarcosA.AncinI.Vazquez-AlvarezB.de UgarteC.GilP.FernandezC.EncinasM.Lopez-IborJ. J.CabranesJ. A.2009APOE, ACT and CHRNA7 genes in the conversion from amnestic mild cognitive impairment to Alzheimer’s disease30812541264
37.MillerD. T.ShenY.WeissL. A.KornJ.AnselmI.BridgemohanC.CoxG. F.DickinsonH.GentileJ.HarrisD. J.2009Microdeletion/duplication at 15q13.2q13.3 among individuals with features of autism and other neuropsychiatric disorders464242248
38.ShinawiM.CPSchaafBhattS. S.XiaZ.PatelA.CheungS. W.LanpherB.NaglS.HerdingH. S.Nevinny-StickelC.2009A small recurrent deletion within 15q13.3 is associated with a range of neurodevelopmental phenotypes411212691271
39.TsuchiyaK. D.WiesnerG.CassidyS. B.LimwongseC.BoyleJ. T.SchwartzS.1998Deletion 10q23.2-q23.33 in a patient with gastrointestinal juvenile polyposis and other features of a Cowden-like syndrome.212113118
40.ZhouX. P.Woodford-RichensK.LehtonenR.KuroseK.AldredM.HampelH.LaunonenV.VirtaS.PilarskiR.SalovaaraR.2001Germline mutations in BMPR1A/ALK3 cause a subset of cases of juvenile polyposis syndrome and of Cowden and Bannayan-Riley-Ruvalcaba syndromes.694704711
41.FombonneE.2003Epidemiological Surveys of Autism and Other Pervasive Developmental Disorders: An Update.334365382
43.AbrahamsB.GeschwindD.2008Advances in autism genetics: on the threshold of a new neurobiology95341355
44.AlqallafA.TewfikA.SelleckS.2009Genetic variation detection using maximum likelihood estimator.978-1-42444-761-9Minnesota, USA.
45.KallioniemiA.KallioniemiO.SudarD.RutovitzD.GrayJ.WaldmanF.PinkelD.1992Comparative genomic hybridization for molecular cytogenetic analysis of solid tumors.2585083818821
46.WangY.WangS.2007A novel stationary wavelet denoising algorithm for array-based DNA copy number data.32206222
47.WillenbrockH.FridlyandJ.2005A comparison study: applying segmentation to array CGH data for downstream analyses212240844091
48.RissanenJ.1978Modeling by Shortest Data Description14465471
49.LarsonR.CastieJ.1982Principles of Dynamic Programming1-2Marcel Dekker Inc., NY.
50.BioDiscovery Inc.2012Nexus: Copy Number Professional software package (BioDiscovery), Inc., El Segundo, CA) available from:http://www.biodiscovery.com.
51.VenkatramanE.OlshenA.2007A faster circular binary segmentation algorithm for the analysis of array CGH data236657663
52.GrantG.ManduchiE.CheungV.EwensW.1999Significance testing for direct identity-by-descent mapping.635441454
53.DiskinS.EckT.GreshockJ.MosseY.NaylorT.StoeckertC.WeberB.MarisJ.GrantG.2006STAC: A method for testing the significance of DNA copy-number aberrations across multiple array-CGH experiments.16911491158
54.RohlfingT.RussakoffD.MaurerC.2004Performance based classifier combination in atlas based image segmentation using expectation maximization parameter estimation.238983994
55.GulerI.UbeyliE.2005ECG beat classifier designed by combined neural network model38199208
56.SampatM. .BovikA.AggarwalJ.CastlemanK.2005Supervised parametric and non parametric classification of chromosome images3812091223
57.BruijneM.NielsonM.2004Shape particle filtering for image segmentation1168175
58.ShinH.SohnS.2005Selected tree classifier combination based on both accuracy and error diversity38191197
59.TsymbalA.CunninghamP.PechenizkiyM.PuuronenS.2003Search strategies for ensemble feature selection in medical diagnosis.16th IEEE symposium on computer based medical systems,124129
60.KookH.GuptaL.MolfeseD.FademK.2005Multi-stimuli multi-channel data and decision fusion strategies for dyslexia prediction using neonatal ERPS381121742184
61.MoonH.AhnH.KodellR.LinC.BaekS.ChenJ.2006Classification methods for the development of genomic signatures from high-dimensional data712121
62.ValasekM.RepaJ.2005The power of real-time PCR,293151159
63.KleinD.2002Quantification using real-time PCR technology: applications and limitations.86257260
64.DasarathyB.1991Nearest Neighbor Norms: NN Pattern Classification Techniques. IEEEComputer Society Press.
66.ShinawiM.CheungS.2008The array CGH and its clinical applications1317-18760770
67.SchwabM.AlitaloK.KlempnauerK.VarmusH.BishopJ.GilbertF.BrodeurG.GoldsteinM.TrentJ.1983Amplified DNA with limited homol- ogy to MYC cellular oncogene is shared by human neuroblastoma cell lines and a neuroblastoma tumour.3055931245248
68.AldingerK.PlummerJ.QiuS.LevittP.2011SnapShot: Genetics of Autism.72418418
69.SandersS.et al.2011Multiple Recurrent De Novo CNVs, Including Duplications of the 7q11.23 Williams Syndrome Region, Are Strongly Associated with Autism70863885
Written By
Abdullah K. Alqallaf, Fuad M. Alkoot and Mash’el S. Aldabbous