JOIV : Int. Inform. Visualization, 9. - January 2025 210-215 INTERNATIONAL JOURNAL ON INFORMATICS VISUALIZATION INTERNATIONAL JOURNAL ON INFORMATICS VISUALIZATION journal homepage : w. org/index. php/joiv Adaptive Deep Convolution Neural Network for Early Diagnosis of Autism through Combining Personal Characteristic with Eye Tracking Path Imaging Revathi Kesavan a. Naveen Palanichamy b,*. Su-Cheng Haw b. Kok-Why Ng b Department of Artificial Intelligence and Data Science. Dhanalakshmi College of Engineering. Manimangalam. Chennai. India Faculty of Computing and Informatics. Multimedia University. Selangor. Malaysia Corresponding author: *p. naveen@mmu. AbstractAiAutism is a large set of illnesses related to brain development, also referred to as autism spectrum disorder (ASD). According to WHO reports, 1 in 100 children is expected to have ASD. Numerous behavioral domains are affected, including linguistic, interpersonal skills, stereotypical and repetitive behaviors which represent an extreme instance of a neurodevelopmental abnormality. Identifying ASD can be difficult and exhausting because its symptoms are remarkably identical to those of many other disorders of the Medical professionals can improve diagnosis efficiency by adapting deep learning practices. In clinics for autism spectrum disorders, eye-tracking scan pathways (ETSP) have become a more common instrument. This approach uses quantitative eye movement analysis to study attentional processes, and it exhibits promising results in the development of indicators that can be used in clinical studies for autism. ASD can be identified by comparing the abnormal attention span patterns of childrenAos having the disorder to the childrenAos who are typically developing. The recommended model makes use of two publicly viable datasets, namely ABIDE and ETSP The proposed deep convolutional network consists of four hidden convolution layers and uses 5-fold cross-validation strategy. The performance of the proposed model is validated against multilayer perceptron (MLP) and conventional machine learning classifiers like decision tree (DT), k-nearest neighbor (KNN) and Random Forest (RF) using metrics like sensitivity, specificity and area under curve (AUC). The findings demonstrated that without the need for human assistance, the suggested model is capable of correctly identifying children with ASD. KeywordsAiAutism spectrum disorder. convolutional neural network. early diagnosis. eye-tracking scan path. machine learning. Manuscript received 28 Jul. revised 25 Oct. accepted 12 Dec. Date of publication 31 Jan. International Journal on Informatics Visualization is licensed under a Creative Commons Attribution-Share Alike 4. 0 International License. support, which is made possible by earlier identification plus accurate diagnosis of autism . In India, the frequency of autism has been rising significantly. A 2021 study that was written up in the Indian Journal of Pediatrics states the In India, it is believed that 1 in 68 children have autism There is a roughly 3:1 male-to-female ratio in cases of autism, with boys being more frequently affected than A blood test or any other medical test cannot be used to detect ASD, which makes diagnosis difficult. Medical practitioners consider a child's developmental history and behavior when making a diagnosis. Sometimes, even in children as young as 18 months old. ASD can be recognized. By the age of two, a diagnosis from a trained specialist can be relied upon. However, many children don't receive a conclusive diagnosis until much later in life. Some people INTRODUCTION ASD is a neurodevelopmental disease caused by differences in the brain. Social engagement and involvement, as well as restricted or repetitive behaviors or hobbies, can be problematic for people with ASD. Additionally, people with ASD may move, pay attention, or learn differently. For those with ASD, social interaction and communication skills might be difficult. Some people with ASD may exhibit peculiar habits or hobbies. Many ASD sufferers also exhibit comparable traits. These could consist of seizure conditions, unusual sleeping and eating patterns, delayed language acquisition, movement and learning or cognitive abilities. Millions of youngsters worldwide suffer with ASD, and India is not an exception. The long-term results for impacted children are greatly improved by prompt intervention and wait until they are adults or teenagers before receiving a This delay may prevent people with ASD from receiving the essential early assistance. It's critical to diagnose ASD in children as early as possible to ensure they receive the support and assistance they require to realize their full potential . Autism early diagnosis is revolutionary for a number of reasons: Early Intervention: Early diagnosis makes it possible to put early intervention plans in place that are customized to the individual requirements of the child. This can greatly accelerate their growth in important domains like behavior, social skills, and Brain Plasticity: The brain is very malleable throughout the first few years of life, allowing it to more easily adjust and develop new connections. excellent chance to positively impact the growth of a child's brain exists at this phase. Early identification of ASD allows for better utilization of the ability of the brain to improve a child's cognitive function. Parental Training and Assistance: Early diagnosis gives parents and other caregivers a better grasp of their children's requirements as well as the information and tools they need to help them successfully. Early intervention of ASD greatly helps parents of the child to procure treatment plans where the childAos brain also easily shaped due to the plasticity and learn skills to face challenges. The existing efforts in early diagnosis of ASD majorly focused on brain imaging datasets. Whereas the proposed attempt makes use of personal characteristic of ASD patient with ETSP images to attain promising accuracy. The article is organized according to the following structure: A review of latest studies which have established particular models for ASD identification is presented in Section 2. Furthermore. Section 3 offers a thorough explanation of the deep learning methods and datasets used in this investigation. The outcomes of the experiments are provided and analysed in Section 4. Finally. Section 5 offered a conclusion. is used, these dominant models obtain the maximum prediction accuracy of almost 100% when compared to other analyzed models. In supporting earlier detection of ASD. Ulhaq et al. demonstrated the efficiency of machine learning algorithms through adapting feature scaling namely quantile, power, normalizer and max abs scalar in their work. As an outset Ada Boost exhibited the promising the result when comparing all To improve the prediction accuracy Jhanjhi et al. applied the stochastic gradient decent model and obtained the accuracy of 99. 7% for adult dataset in UCI ML repository. Ada boost algorithm is renowned as an efficient machine learning algorithm and applied in enormous efforts of past researchers like Suhas et al. The promising way to increase the detection accuracy of ASD by adapting personality characteristics in the dataset is discussed by Parikh et al. Further the application of discriminant analysis for obtaining high accuracy with reduced computational complexity is narrated by Nishat et al. Deep learning (DL) is especially helpful for picture identification and other jobs where the characteristics are hard to identify. DL techniques can handle huge composite datasets which would be challenging for customary machine learning models to comprehend. Automatic feature extraction from the data eliminates the need for human feature engineering, which is only one of the many advantages of deep learning. Salari . attempted to apply 2D convolutional neural network (CNN) to bring better prediction rate. Whereas Selah and Chern . investigated the effectiveness of 3D Convnets / CNN for detecting ASD and they noticed the performance accuracy of CNN as 97. as the highest one comparing to conventional machine learning algorithms. The role of deep learning and adaptation of optimization techniques for increasing the prediction accuracy is exercised by Adinew et al. Meneguzzi et al. built a deep neural network (DNN) by stacking autoencoders for diagnosing the ASD over the brain images obtained from ABIDE dataset. The resultant DNN provided 70% accuracy while compared to other state-of-art techniques. The eye-tracking device delivers useful information about kids' visual activity for a timely and precise diagnosis. Its working process is initiated by scanning the eye's path to produce a series of eye projection points on the image, which is then used to examine the behavior of kids with autism. Raseem et al. conducted an experiment to demonstrate the outcome of deep learning models enriched with a feature extraction mechanism. The result evidenced that feature extraction preceded by classifier highly impacts the overall Kanhirakadavath and Chandran . built a deep network, allowing feature selection with help of principal component analysis (PCA) algorithm before CNN. This model presented its overall accuracy as 97%. The importance of early intervention and the efficacy of using the eye tracking paths in determining ASD are explained by Lanvin et al. The impact analysis of eye patterns over the years greatly influences the prediction rate of ASD and is done by Kong et . The significant past efforts carried out in this domain are summarized as Table 1. II. MATERIALS AND METHOD The summarization of renowned efforts of investigators contributed towards early detection of ASD is discussed in this section. The need for automated systems to predict ASD and significance of machine learning algorithms in detecting ASD is deeply investigated by several studies . , . , . , . The machine learning algorithms like DT, nayve bayes (NB). KNN and support vector machine (SVM) grasped its popularity due to its performance . , . , . , . Roggala et al. addressed the way to improvising the diagnosis of ASD through account EEG signal analysis. Khudhur and Khudhur . exercised different machine learning models that classified the ASD over multiple datasets extracted from Kaggle and UCI repository pertaining to four age groups namely toddler, child, adolescent and adult. The results demonstrated that the decision tree, logistic regression and random forest classification algorithms are the most prevalent models after applying various ML techniques to the preprocessing datasets mentioned above. For every dataset that TABLE I SUMMARY OF SIGNIFICANT EFFORTS IN DETECTING AUTISM Name of the Author Khudhur and Khudhur Dataset Kaggle and UCI ML Repository Ulhaq et al. UCI ML Repository Jhanjhi et al. UCI ML Repository Kumar et al. AQ-10 Salari et al. Saleh and Chern . Meneguzzi et al. Raseem et al. ABIDE ABIDE ABIDE Figshare Data Repository (Eye tracking scan-path Kanhirakadavath and Chandran . Eye tracking Dataset by Elbattah Methods SVM KNN Logistic Regression (LR) Ada Boost (AB) Gaussian Nayve Bayes (GNB) KNN SVM CN2 rule induction KNN Stochastic gradient decent (SGD) SVM SVM CNN . D Convnet. CNN . D Convnet. DNN (Two stacked autoencoder. Artificial neural network (ANN) Feed forward neural network (FFNN) Hybrid model (Enriched hybrid feature selection which combines local binary pattern (LBP) and grey level cooccurrence matrix (GLCM)) CNN with PCA The dataset comprises eye tracking habits of informed consents is presented by Cilia et al. Apart from eye movement, facial features can be used in diagnosing the ASD patients demonstrated by Alam et al. , . For an accurate diagnosis and course of treatment, structural magnetic resonance imaging . MRI) must be used to identify trustworthy biomarkers for ASD. A comparative study of machine learning algorithms with sMRI is carried out by Bahathiq et al. The early intervention of ASD detection helps a lot and the accuracy of detection rate is improved by adapting deep learning practices accounting ETSP images with personality characteristic of the person. The primary goals of the proposed model and structural details of the proposed mechanism are presented below. Accuracy Among other models DT. LR and RF provided 100% AB - 99. 25% . or toddler. 95% . or childre. AB - 99. 8% . or toddle. SGD - 99. 7% . or adul. FFNN & ANN Ae 99. process, entails three separate operations: converting RGB to grayscale, resizing images and addressing missing values in ABIDE dataset. Using Equation 1, the following expression is used to convert an RGB image to a grayscale image. The generated grayscale photos are then resized from 640 x 480 to 100 x 100. By taking this step, the computer's processing load is lessened, enabling speedier computing. A 3. 0AA AG 3. 0AA AB 3. GS Im age A R Primary Goals of the Proposed Model The primary goals of the proposed model are listed below: To diagnose ASD in early stage by combining personality characteristics and eye tracking path . ehavioral patter. of the individual To improve the prediction rate through deploying promising deep learning techniques Fig. 1 The structure of the proposed model for detecting autism Adaptive Deep Model: Structure and Details The overall structure of the proposed deep network is captured in Figure 1. Pre-processing, the initial phase in the The application of Discrete Wavelet Transform Techniques (DWT) lowers the dimensionality of the image, chooses relevant features, and efficiently extracts them. increase process efficacy. DWT is coordinated using kernel Principal Component Analysis (PCA) . By finding just a few principal components . rthogonal linear combination. of the input variables with the largest variance. PCA reduces the Thus, the proposed feature extractor is named as HPCA. Next step is applying classifier to predict ASD. The four alternating convolutional and max-pooling layers that make up the CNN's structure are designated as Levels 1 through 4 is utilized here. Convolution layers are made up of numerous kernels that work together to extract characteristics from digital images through a mathematical process called The feature maps' size is decreased by combining the max-pooling layers with convolutional layers . , . This process is also referred to as down-sampling. A flatten layer is accessible at the output terminal of all CNNbased classification models to reformat the output matrix and prepare it for the following step. Here, the spatial data from the features at level 4 is deleted and substituted with a channel Consequently, distinct channels are created for every retrieved feature map. The configuration of the CNN is provided in Table 2. It imitates the hierarchy of command and hunting style of adult grey wolves. The four types of grey wolves that are utilized to model the leadership hierarchy are alpha, beta, delta, and omega . Grey wolves' social structure and hunting patterns are mathematically modelled in order to produce GWO. Omega wolves adjust their placements in each iteration according to positions , , and alpha, beta, and delta because prey is more likely to be found at these In order to mathematically mimic the grey wolves' last hunting strategy of attacking their prey once it stops moving, we need to decrease the value of a . When a drops from 2 to 0 in the interval [-2a, 2. , a random number A is assigned to it throughout the repetitions. RESULT AND DISCUSSION For examining the personal characteristic. Autism Brain Imaging Data Exchange (ABIDE) database is utilized. The personal characteristics taken into account are age, sex, verbal IQ, performance IQ. Full scale IQ and handedness . eft, right or ambidextrou. The second dataset is composed by Mahmoud Elbattah that includes 547 eye-tracking scan path images extracted from 59 participants. The ASD dataset, which includes ETSP images, was used to test the suggested A total of 547 images is found in the collection that is split into two classes: ASD with 219 images and 328 with No ASD. Here is a mapping of personal characteristics of ASD with respective ETSP images and No ASD is The dataset was split up as follows: 20% was put aside for testing, while the remaining 80% was used for training and validation which is called as 80:20 validation strategies. assess the performance of the optimized deep neural network is evaluated using the parameters as follows: Positive Prediction Rate (PPR) represents the probability of getting the condition following the positive outcome of the test. In the event of an adverse test result, the Negative Prediction Rate (NPR) indicates the likelihood of not possessing the ailment. Sensitivity, also called recall, is the ratio known as RC, which is calculated by dividing the total number of true positive and false negative values by the number of true positive predictions. The same is expressed as Equation . RC A TP A FN TABLE i CONFIGURATION OF CNN Layer Layer Layer Layer Layer Function 2D Convolution Activation - ReLU Max Pooling 2D Convolution Activation - ReLU Max Pooling 2D Convolution Activation - ReLU Max Pooling 2D Convolution Activation - ReLU Max Pooling Flatten Layer Dense Layer 1 Dense Layer 2 Output Layer Dimensions (No. Kernel and Siz. 16, 3*3 Dimension of Output 16, 100*100 16, 2*2 32, 3*3 16, 50*50 32, 50*50 32, 2*2 64, 3*3 32, 25*25 64, 25*25 64, 2*2 128, 3*3 64, 12*12 128, 12*12 128, 2*2 128, 6*6 1*4608 Furthermore, to increase prediction accuracy, grey wolf optimization (GWO) is exercised in this model. It is one of the population-based meta-heuristics algorithms and it was introduced by Mirjalili et al. and Revathi and Samydurai . by the year 2014. The process flow of GWO is best illustrated in Figure 2. The ratio known as Specificity SP is the proportion of accurate negative predictions to all negative forecasts and is specified as Equation . SP A TN A FN Area under the ROC Curve (AUC) determines the complete two-dimensional area from . , . , . beneath the entire ROC curve, through applying integral computation. AUC provides an overall performance score over all possible categorization criteria. The AUC and cross-validated ROC curve for the deep model using the ETSP dataset is displayed in Figure 3 and 4 respectively. Fig. 2 The generic flow of GWO rate as compared to DT, 28% greater than KNN, 23% than MLP, 20% than RF, 17% than TDSA and 8% than CNN. Fig. 5 Analysis chart of proposed model Furthermore, employing metrics like accuracy and ROC can be troublesome when the classifications under consideration have non-uniform distributions. We have an unbalanced dataset when one class significantly dominates the distribution of all the data classes in it. It is therefore more reasonable to use metrics like sensitivity, specificity and Unweighted Average Recall (UAR) to assess the performance of the classifiers in the presence of an unbalanced data The UAR is recently developed evaluation It is predominantly used to address the class imbalance Without taking the number of occurrences into account, it simultaneously takes into account the mean sensitivity for one class and the mean specificity for another. Fig. 3 AUC of the proposed model Fig. 4 Cross-validated ROC on the proposed deep model IV. CONCLUSION Classifying ASD, employing deep learning as a method is the primary objective of this work. The outcomes unequivocally demonstrate that using deep learning is effective at identifying ASD. The pre-processed image information is collected, filtered based on personality characteristics and eye scan pattern with the help of HPCA. The CNN algorithm empowered with GWO is implemented over the Jupyter Notebook, to categorize the ASD and nonASD patients. Lastly, a statistical analysis is done to assess the performance of the deep network built as proposed model. The greatest prediction rate obtained by this investigation in terms of AUC was 94. Further, to make more clinically ready approach along with personal characteristics and ETSP, the brain imaging of ASD can be annotated. To improve the dataset quality, transfer learning techniques can be applied. Moreover, to obtain acute performance of the classifier, new evaluation metrics like UAR can be utilized. Lastly, it is a good idea to combine professional analysis using statistical hypothesis validation. The performance analysis of proposed model is best evident by comparing the same with traditional ML algorithms like CNN. DT. KNN. MLP. RF and two stacked denoising autoencoders (TDSA) which is a kind of ensemble method. The resultant analysis is summarized in Table 3. The graphical chart of the performance is better arrested in Figure 5. TABLE i COMPARATIVE ANALYSIS OF EVALUATION RESULTS Model KNN MLP Two Stacked Denoising Autoencoders CNN Proposed Model (CNN with GWO) PPR Measures in % NPR Sensitivity Specificity AUC ACKNOWLEDGMENT This research was completely supported by Multimedia University. Malaysia and Dhanalakshmi College of Engineering. India under the scheme International Matching Grants 2023 (Ref: MMUE/230. The trapezoidal rule can be applied to determine AUC. AUCs of 0. 5 typically show no discrimination . , the ability to distinguish between patients who have the illness or disorder and those who do not, based on the test. values in the range of 0. 7 to 0. 8 are viewed as acceptable. values in the range of 0. 8 to 0. 9 are regarded excellent. and values greater 9 are weighed remarkable. From this figure, it is clearly observed that the deep model built outperformed against machine learning models. The suggested model ensures approximately 30% increased AUC REFERENCES