International Journal of Electrical and Computer Engineering (IJECE) Vol. No. February 2017, pp. ISSN: 2088-8708. DOI: 10. 11591/ijece. Recognition of Tomato Late Blight by using DWT and Component Analysis Hiteshwari Sabrol1. Satish Kumar2 Department of Computer Science & Applications. Panjab University. Chandigarh. Chandigarh 160014. India Department of Computer Applications. SSG Regional Centre. Hoshiarpur, 146023. India (Panjab University. Chandigar. Article Info ABSTRACT Article history: Plant disease recognition concept is one of the successful and important applications of image processing and able to provide accurate and useful information to timely prediction and control of plant diseases. In the study, the wavelet based features computed from RGB images of late blight infected images and healthy images. The extracted features submitted to Principal Component Analysis (PCA). Kernel Principal Component Analysis (KPCA) and Independent Component Analysis performed (ICA) for reducing dimensions in feature data processing and classification. recognize and classify late blight from healthy plant images are classified into two classes i. late blight infected or healthy. The Euclidean Distance measure is used to compute the distance by these two classes of training and testing dataset for tomato late blight recognition and classification. Finally, the three-component analysis is compared for late blight recognition The Kernel Principal Component Analysis (KPCA) yielded overall recognition accuracy with 96. Received May 13, 2016 Revised Jul 6, 2016 Accepted Jul 20, 2016 Keyword: Independent component Kernel principal component Tomato late blight disease Principal component analysis Copyright A 2017 Institute of Advanced Engineering and Science. All rights reserved. Corresponding Author: Hiteshwari Sabrol. Department of Computer Science & Applications. Panjab University. Chandigarh. Chandigarh 160014. India. Email: hiteshwarisabrol@gmail. INTRODUCTION AuPlant disease is an impairment of the normal state of the plant that interrupts or modifies its vital functionsAy . There are several types of plant disease could cause several losses to the production of crops. The presence of the pathogen depends on the favorable environment conditions and varieties of crops grown, which is the reason for occurrence and prevalence of plant diseases. There are various plant disease management programs that will help to reduce losses in yields and grain quality. An automatic plant disease recognition and diagnosis system can design by using image processing and pattern recognition techniques . There is various kind of abnormality states present on the plants leaf which can be identified by mean of manual inspection. The image processing and pattern recognition techniques play the worth full role to convert manual process to automate the process. The automatic diagnosis system based on plant disease features reduces the dependency on experts in the area concerned. In the study, the late blight-infected tomato leaf images and healthy leaf images were taken for the experiment. The two kind of leaf divided into two categories or classes, i. , late blight infected and healthy. The plant disease features extracted after applying discrete wavelet transformation. Excessive extracted features reduced by using Auprincipal component analysisAy (PCA) . Auindependent component analysisAy (ICA) . , . and Aukernel principal component analysisAy (KPCA) . , . and some other methods could be used to reduce dimensions of the feature data. Journal homepage: http://iaesjournal. com/online/index. php/IJECE IJECE ISSN: 2088-8708 This three-component analysis also used for classification purpose by learning technique. There is The tomato late blight disease recognition from healthy leaf images includes eight steps as mentioned in Figure 1. Step 1 Late blight infected: 48 and healthy (Norma. : 58. Step 2 Created training and testing dataset. Step 3 Image preprocessing: Perform Adaptive histogram equalization and reshaping the images. Extracting each R. B components and Apply whitening for ICA Step 4. Feature Extraction using single level twodimensional Haar Wavelet. Step 5 Extracting low and high frequency of each R. G, and B component. Step 6 Data summarization and classification by PCA. KPCA & ICA. Step 7 Apply Euclidean Distance to measure the similarity distance between training and testing data by two classes (Late blight infected and healthy. Step 8 Tomato Late Blight Recognition and Classification comparison and cross-validation of three Component Analysis (PCA. KPCA, and ICA). Image acquisition by common digital Testing Set Training Set Image Preprocessing Extracting each R,G,B components from images Apply Histogram Equalization and reshaping to each extracted R. B Component For PCA and KPCA Apply Whiten to each component for ICA Feature Extraction using single level two dimensional Haar Wavelet (Extracting low and high frequenc. Extracting low and high frequency of each R. B component Data summarization and classification by PCA. KPCA & ICA Input and Target Data Testing Data Apply Euclidean Distance to measure the similarity distance between training and testing data on the basis of two classes (Late blight infected and healthy Tomato Late Blight Recognition and Classification comparison and cross validation of three Component Analysis (PCA. KPCA and ICA) Figure 1. Steps for Recognition and Classification of Tomato Late Blight by using DWT and Component Analysis RESEARCH METHOD Tomato Leaf Blight The late blight found on tomato plants is caused by the fungus Phytophthora infestans. The occurrence of late blight found in later season or features often not appearing until after blossom. The symptoms or features of late blight first appear on the lower, older leaves as water-soaked, gray-green spots. The image affected from late blight shown in Figure 2. Recognition of Tomato Late Blight by Using DWT and Component Analysis (Hiteshwari Sabro. A ISSN: 2088-8708 . Figure 2. & . Late Blight Affected Tomato Leaves . Healthy Tomato Leaves Image DataSet In this study, a dataset of total 106 digital images of late blight tomato plant disease and healthy were obtained by using the common digital camera. From which the 48 images are late blight infected and 58 images are healthy. Total five times the images are randomly used for training and testing purpose. For training, randomly the set of 90%, 80%, 70%, 60% and 50% images from the total dataset used. For the testing, randomly the set of 10%, 20%, 30%, 40% and 50% images from the total dataset used. The proposed algorithm implemented with MATLAB 2014b. Image pre-processing Apply contrast limited adaptive histogram equalization to enhance the contrast of the intensity and perform reshaping on RGB images of late blight and healthy. Feature Extraction using Discrete Wavelet Transformation Wavelet transformation is used to extract wavelet features from tomato late blight affected and healthy images. As wavelet transformation convert digital signal to wavelet signal or decomposes the digital signal into wavelet function. The wavelet transformation can analysis the detail of image like vertical, diagonal and horizontal subbands of the image. The detailed information of the image can be retrieved by low pass and high pass filtering . In the experiment, the discrete wavelet transform used for extracting features from the disease affected the area of the plant. First R. B components extracted from the images individually. Second compute the low and high frequency of each R. B component by using single level two-dimensional discrete Haar wavelet. The Haar Wavelet function defined by the following formula . ( ) Third compute the standard deviation of each extracted component to be in the scalar form. Finally, combined the three scalar forms of each R. G, and B component and submitted to PCA. KPCA and ICA for data summarization, training, and testing purpose. Component Analysis The two or more independent variables composed in such way to analyze their independence is called Component analysis. To identify the singular component and determine the influenced component due to the changing behavior is the main aim of component analysis. Eliminate ineffective or less effective components for the purpose of reduction of indifferent elements, maintenance and improving generalization. The PCA. KPCA, and ICA are three important feature extraction techniques used for dimensionality reduction and work on self-organizing unsupervised learning approach. We used all these three techniques (PCA. KPCA, and ICA) for data reduction and classification. The most effective and simple multivariate analysis based to eigenvector is AuPrincipal Component AnalysisAy. PCA an orthogonal linear transformation used to convert the data to a new coordinate system. also used for signal processing and data reduction in statistical pattern recognition. Self-organized learning provides an adaptive algorithm for PCA. It maximizes the rate of decrease of variance using Eigen IJECE Vol. No. February 2017 : 194 Ae 199 IJECE ISSN: 2088-8708 decomposition method or value decomposition theorem. We have used Singular value decomposition theorem better then Eigen decomposition. In mathematical terms, calculate the eigenvalues of the covariance matrix of the original outputs. PCA linearly transform a high-dimensional space which is uncorrelated and First, compute the eigenvalues and eigenvectors, then sort the eigenvalues in descending order and ignore the really small values. Then we transform data into the eigenspace formed by the selected The mean vector of the population is defined as: To find the covariance matrix where Cx is the covariance. K is the samples from random values, and . -m. -m. T are the n*n order of . where A is the transformation matrix to map x into vector y and the diagonal matrix is Cy and containing elements with the main diagonal are the eigenvalues of C x. The matrix Ak is designed from k eigenvectors equivalent to the k largest eigenvalues, producing a transformation matrix of order k*n. The reconstruction of vector by Ak is: The kernel based method used for nonlinear PCA is a Kernel Principal Component Analysis. The kernel method is used to transforms the original inputs into high dimensional feature space . nner product kerne. , where the computation performed on feature space which is non-linearly related to input space. Kernel PCA, the relation between input space and feature space is nonlinear but implementation relies on linear algebra, so we think that itAos an extension of ordinary PCA. In this study, we were using Gaussiantype kernel. Mathematically, we would like to find direction vector in the feature space. All the matters remain same as in PCA, but with kernel matrix U ( ) ( )U The projection onto the optimal direction is U ( ) U U ( )Oc ( )U Oc Independent Component Analysis: (Cardoaso 2. stated that AuIndependent Component Analysis is the decomposition of random vector into linear components that are statistically independent as possible, where the term independence is understood in its strongest statistical sense. ICA goes beyond . econd orde. decorrelation and therefore requires that the observations representing the data vector be non-GaussianAy. In the study, we used whitening as a preprocessing step for ICA algorithm to find the uncorrelated components but need not be independent, and the variance is equal to the unity. Next, apply discrete Haar wavelet transformation to extract wavelet features. EVALUATION We use the Euclidean Distance method to measure the distance between the training set and testing Either through supervised learning or unsupervised learning are known, an unknown object, may be called test object can recognize by associating it with one of those class using similarity . r in other words, dissimilarity or distanc. To measure the dissimilarity or distance of two classes i. Tomato late blight infected or healthy images, we used Euclidean Distance measure as following oc Recognition of Tomato Late Blight by Using DWT and Component Analysis (Hiteshwari Sabro. A ISSN: 2088-8708 Using PCA and KPCA, we measure the distance between eigenvalue of training data and testing In the case of ICA, the distance measured between eigenvector computed. If the distance is less from a particular class, then the testing image belongs to that class. RESULTS AND DISCUSSION The recognition and classification of tomato late blight by using discrete wavelet transformation and component analysis yielded accuracy 96. The PCA. KPCA, and ICA are used to the reduction of dimensions of the data using wavelet features and then these techniques used for tomato plant disease recognition and classification purpose. The two classes i. late blight infected and healthy used to classify whether the image is affected by late blight disease or not. To calculating the minimum distance between training and testing data. Euclidean distance measure was used to classify that testing image is belongs to which class infected or healthy. Table 1. shows the comparative results of the three-component analysis. The PCA producing second overall recognition accuracy with 89. 8% after KPCA. In the first, test dataset of 10%. ICA resulting the worst recognition accuracy and yielding overall 34% recognition only. CONCLUSION AND FUTURE SCOPE In the study. PCA. KPCA, and ICA used for data reduction and recognition of tomato late blight from a healthy leaf in digital images. The plant disease recognition and classification based on two classes images affected by late blight and healthy. For feature extraction, we used discrete wavelet transformation based on Haar wavelet. As the results show, extracted feature data from the images of plant diseases by reducing dimensions could reduce the running time for classification, and acceptable recognition results could obtain. The method used in the study proved that PCA. KPCA, and ICA used for extracted feature data reduction from tomato late blight images and also used for recognition purpose. Finally, the distance between training and testing dataset computed by Euclidean distance measure by two classes. If the distance is less from any of two classes, then the particular testing image belongs to one of the two classes either late blight infected or healthy one. The PCA is a kind of linear projection, and it could not correctly handle non-linear data, but it could handle using KPCA. All the properties of ordinary PCA carry over to KPCA. The Kernal PCA is linear in feature space and nonlinear in the input space. As such, it can be applied to all those domains where ordinary PCA used for feature extraction or data reduction, for which nonlinear extension would make sense. As compare to PCA. KPCA and ICA is not only statistical independent up to second order but up to all the individual components of the output vector and involves no constraint of Table 1. Recognition and Classification of Tomato Late Blight accuracy Testing Dataset Overall Accuracy Principal Component Analysis Late Blight Healthy Infected Kernel Principal Component Analysis Late Blight Healthy Infected Independent Component Analysis Late Blight Healthy Infected ACKNOWLEDGEMENTS We thank Dr. Ashutosh Sharma. Assistant Professor. Department of Agriculture Sciences. DAV University. Jalandhar. Punjab India. , for guidance and sharing his expertise in the field of plant pathology. We would also like to thanks. Mr. Toran Verma. Assistant Professor. Department of Computer Science Engineering and Technology. Bhilai. Chattisgarh. India, for his help in taking the images used in the REFERENCES