International Journal of Electrical and Computer Engineering (IJECE) Vol. No. February 2017, pp. ISSN: 2088-8708. DOI: 10. 11591/ijece. GIS-MAP based Spatial Analysis of Rainfall Data of Andhra Pradesh and Telangana States using R Jeevan Nagendra Kumar1. Rajini Kanth2 Department of Information Technology. Gokaraju Rangaraju Institute of Engineering and Technology. Hyderabad. India Department of Computer Science Engineering. SNIST. Hyderabad Article Info ABSTRACT Article history: The rainfall conditions across wide geographical location and varied topographic conditions of India throw challenge to researchers and scientists in predicting rainfall effectively. India is Agriculture based country and it mainly depends on rainfall. Seasons in India are divided into four, which is winter in January and February, summer is from March to May, monsoon is from June to September and post monsoon is from October to December. India is Agriculture based country and it mainly depends on rainfall. It is very difficult to develop suitable rainfall patterns from the highly volatile weather conditions. In this Paper, it is proposed that Map based Spatial Analysis of rainfall data of Andhra Pradesh and Telangana states is made using R software apart from Hybrid Machine learning techniques. A Study will be made on rainfall patterns based on spatial locations. The Visual analytics were also made for effective study using statistical methods and Data Mining Techniques. This paper also introduced Spatial mining for effective retrieval of Remote sensed Data to deal with retrieval of information from the database and presents them in the form of map using R Received Jun 5, 2016 Revised Aug 31, 2016 Accepted Sep 15, 2016 Keyword: Hybrid machine learning Map Rainfall patterns Spatial analysis Visual analytics Copyright A 2017 Institute of Advanced Engineering and Science. All rights reserved. Corresponding Author: Yalla Jeevan Nagendra Kumar. Departement of Information Technology. Gokaraju Rangaraju Institute of Engineering and Technology. Flat No: 303. Datta Sree Avenue. Annapurna Enclave. Chanda nagar. Hyderabad 500050. Telangana. India. Email: jeevannagendra@gmail. INTRODUCTION India has an ample variety of climatic circumstances across geological range and diverse geography which is very difficult for generalization. India has six weather sub categories, ranging from alpine tundra, glaciers in the north, humid tropical regions supporting rainforests in the southwest, the island territories and arid desert in the west. Many areas have different micro climates. Its geology and topography are climatically crucial. In North West there is a Thar Desert and in North side there are Himalayas. They work in cycle to affect ethnically and cost-effectively important for monsoonal system. India is considered as climatically unstable because of monsoonal, tropics, and other conditions. The summer, monsoon, thunderstorms and wet seasons are controls the climate in India . Southeast storms are originating from a high-stress mass centered Indian Ocean and the monsoonal torrents provide over 80% of India's yearly Rainfall data . is one of the meteorological parameters that have a greater bearing on the livelihood of individualsA world over. The main objective was to develop a spatial framework for flood and drought disasters in Zambia. Precise inference of the temporal and spatial allocation of rainfall . is a vital input parameter for hydrologic reproduction and substantiation. The rain measures used to monitor rainfall is insufficient to settle the spatial and chronological allocations of rainfall. New techniques were developed to increase the precision of radar rainfall Journal homepage: http://iaesjournal. com/online/index. php/IJECE IJECE ISSN: 2088-8708 approximations, and construct high spatial and chronological rainfall data in standardization and India is an agricultural country and most of economy of India depends upon the agriculture. Rainfall plays an important role in agriculture so premature prediction of rainfall is necessary for the better economic growth of our country. Rainfall prediction . has been one of the most challenging concerns around the world in last year. Regression analysis, classification, clustering, and Artificial Neural Networks (ANN) etc. techniques are using widely in prediction process. LITERATURE SURVEY Reddy et al . evaluated LARS-WG model for southern Telangana and Andhra Pradesh region, used thirty year climate statistics from 1980 to 2010 to produce the enduring climate series for 2011 - 2060. The version forecasted the rise in standard yearly rainfall in 2030 is 5. 16% and in 2060 is 9. for Yacharam related to Hayath nagar. It was ranked as best model in terms of effectiveness in all selected Rangareddy mandals of Telangana and is applied to all other mandals because the climatic conditions are similar in those regions. Ishappa Muniyappa Rathod et al . identified that the rainfall is the vital aspect which decides the crop and yield model of a region. The success and failure of the crop depends upon the climatic They studied the precipitation traits like spatial allocation, seasonal variability of the Coimbatore district. The research is built on Forty Nine years of rainfall data for thirty three rain scale places. There is a heavy precipitation in north, south parts and less precipitation in east part of the district. Rajinikanth TV et al . stated that there are a quite a lot of climatic conditions in various time periods that are varied geologically. It has substantial precipitation in Chirapunji, high warmth at Rajasthan and cold environment at Himalayas. These extremes make us uncomfortable and predictions of climate requires systematic approaches like machine learning procedures. K-means algorithm. J48 classification methods for efficient study and extrapolations of climatic conditions. Kusre B. et al . analyzed the spatiotemporal disparity of the precipitation in Nagaland. The study illustrates that there is a huge dissimilarity in the precipitation with disparity from 859 mm to 2123 Yearly rainfall model indicates the northern part has high rainfall as related to east, west of Nagaland. the same way the north part receives more rainfall in monsoon season and less rainfall in winter season as related to east and west part of Nagaland. Marc G. Genton et al . stated that the use of vigorous geo-statistical techniques on the statistics of rainfall dimensions for Switzerland. They are de-trended through non parametric approximation with leveling The finest trend is calculated with a flattening factor based on cross validation. The variogram is then calculated by a vigorous evaluator. The parametric variogram prototype is comprehended by considering variance Ae covariance composition of variogram approximates. Fascinating outcomes are yielded by comparing kriging with initial quantities. All of these estimates are done with new methods in AS SPATIALSTATSA software. Sarala et al . stated that rainfall is irregular in India. It presents rainfall analysis by taking geological method in preparation of maps in geographical systems and characterizing spatial, temporal dissemination of monthly and yearly rainfall in Telangana with the help of trend exploration. The initial study is built on the information from 10 districts and 457 mandals. In this analysis, numerous GIS remote sensor practices were used by incorporating various geo reference data sets in the generation of maps of rainfall in Telangana. Nagini. Rajinikanth T. et al . stated in their paper titled AuEffective Analysis of Land Surface Water Resources of Andhra Pradesh with Rough Set based Hybrid Data Mining Techniques Using RAy, that Agriculture plays an important role in economy of India. More than half of the population in India depends on Agriculture. It provides raw material for many Industries. In early days, more than half of the land mass is used for Agriculture and over the years there is decline in agriculture land. Various factors like the urbanization and development results in the growth of Non-Agriculture land year by year. Agriculture is the largest abstractor and prime consumer of ground water resources across the globe and hence studies of agroeconomies that are ground water dependent became widely popular. Agriculture Irrigation. Surface water and Ground water resources are interlinked to each other. Water Usage and Food Production are dependent on each other extensively. Water is the major parameter that controls the crop yield. In many countries, the agriculture yield depends on the rain fall. Many times, the rainfall is not sufficient to crop yields. It made researchers to do rigorous analysis on water resource availability and suggest farmers for its effective This paper aims at, development and application of new Hybrid Data Mining (HDM) Techniques for effective analysis of Land Surface Water Resources (LSWR) like Canals. Tube wells. Tanks and other water resources. Apart from that analysis is also made on various Agriculture yieldAs i. , both for Cereals and Millets namely Kharif. Rabi. Sugarcane. Maize. Ragi. Wheat. Barley, etc. , using new Hybrid Data GIS-MAP Based Spatial Analysis of Rainfall Data of Andhra Pradesh and Telangana A (Y. Nagendra K. ISSN: 2088-8708 Mining (HDM) techniques. To model the complex logic. Decision Tables (DT) is used. The results were proved to be good when new Rough Set Based Hybrid Data Mining (RSBHDM) Techniques are applied over the refined data sets. Rajasekhar. RajiniKanth T. stated that Weather Prediction is the application of science and technology to estimate the state of the atmosphere at the particular spatial location. Due to the availability ofhuge data researchers, got interest to analyze and forecast the weather. Accurate prediction helps the human being existence and prosperity. Forecasting techniques are helpful in predicting natural disasters, crop and jungle growth, nautical routing, air craft scheming and armed functions. The Data Mining techniques are better than the obtainable methodologies or conventional methods. They were projected hybrid support vector machine replica to predict, analyze the climatic data and to discover the prototypes exist in it. They considered Krishna district climate data for the case study and it produced high quality results rather than machine learning methods in the process of prediction. Ananthoju Vijay Kumar. RajiniKanth T. stated that the rainfall has intense consequence on A standard rainfall is crucial for vegetation. Excessive or diminutive rainfall can damage Diminutive can abolish cultivation and excessive can help to grow dangerous fungus. Cultivation in India largely depends on rainfall, so an effort is made to forecast the stimulus of rainfall on harvest of For this the data set is constructed with yearly capacities of crop and rainfall for sixty two years. The data was gathered from various Government sectors. The investigation exposed that the crop is destructively prejudiced by rainfall PROPOSED APPROACH In the Proposed approach initially the various years rainfall data of Andhra Pradesh. Telangana was taken and preprocessed for cleaning, removal of redundancy, filling the missing values with suitable mean values and molded into required format. Then apply hybridization of Data Mining (HDM) Techniques on the preprocessed Rainfall dataset. The results thus obtained were analyzed effectively by constructing various GIS Maps using the Rainfall data set with the help of R software . It has proved that there is a substantial progress in performance. IMPLEMENTATION OF PROPOSED METHODOLOGY Initially the raw spatial data set is Pre-processed and converted in to the required format thus obtained is called refined spatial data set, suitable for further processing. Info-Gain Attribute Evaluation procedure along with Ranker Algorithm is applied and attributes selection was done. This concept finds the value of an attribute by measuring information gain for a given class. The optimized spatial data set is divided into Train data set and Test data set. It is then subjected to Machine learning Algorithm namely Classification algorithm of Data mining technique called J48 tree classification. The performance is calculated and the resultant decision Tree J48 classifier with refined data set is subjected to Association Rule Mining Algorithm namely Apriori Algorithm. Then the generated Association Rules will be analyzed for the The refined spatial data set is used to construct customized maps . using required R software The visual analytics were used for spatial analysis of the rainfall data sets of Andhra Pradesh and Telangana states. RESULTS AND ANALYSIS The attributes of the Rainfall data set are namely State. District. Latitude. Longitude. Year. January. March. April. May. June. July. August. September. October. November. December and Annual Total. The Info-Gain of the attributes was calculated and it was found that, except the attributes namely State. District. Longitude. Latitude. May. July. August. October. November. December, other attributes has zero Info-Gain After that the Classification Algorithm Class for generating a pruned or un-pruned C4. 5 decision tree known as J48 . is applied and the resultant Classifier Decision Tree represented by Figure 1. This Decision tree says that the Andhra Pradesh and Telangana lies in between Longitude boundaries are 77. 897where as the Latitude boundaries are 19. 664, 13. The resultant decision tree classifier data set is subjected to again Info Gain and removed the attributes whose Gain value is zero or near to it. The final attributes are State. District. Longitude. Latitude. May. July. August. October. November and December. Over this data set. Predictive Apriori Algorithm was The Predictive Apriori Algorithm . is to extract association rules in the given Class. The algorithm discovers the best rules by considering the threshold and support based confidence rate. It adds IJECE Vol. No. February 2017 : 460 Ae 468 IJECE ISSN: 2088-8708 those rules which are on far with the anticipated accuracy. The Association Rules were found and they are listed Figure 2. State = Telangana | Longitude <= 78. | | Longitude <= 78. | | | Longitude <= 77. | | | | December <= 1. 7: RangaReddy . | | | | December > 1. 7: RangaReddy . | | | Longitude > 77. 836728: Mahaboobnagar . | | Longitude > 78. | | | Longitude <= 78. | | | | December <= 0. 2: Nizamabad . | | | | December > 0. 2: Nizamabad . 0/1. | | | Longitude > 78. | | | | November <= 13. 5: Medak . | | | | November > 13. 5: Medak . 0/1. | Longitude > 78. | | Longitude <= 78. | | | Longitude <= 78. 486671: Hyderabad . | | | Longitude > 78. 486671: Adilabad . | | Longitude > 78. | | | Longitude <= 79. 128838: Karimnagar . | | | Longitude > 79. | | | | Longitude <= 79. | | | | | December<= 0. 2: warangal . | | | | | December> 0. 2: warangal . | | | | Longitude > 79. | | | | | May <= 61. 4: Khammam . | | | | | May > 61. 4: Khammam . The J48 Pruned Tree State = Andhra Pradesh | Longitude <= 80. | | Longitude <= 78. | | | Longitude <= 77. 600591: Anantapur . | | | Longitude > 77. | | | | Longitude <= 78. | | | | | December<= 2. 4: Kurnool . | | | | | December > 2. 4: Kurnool . | | | | Longitude > 78. 261853: cuddappah . | | Longitude > 78. | | | Longitude <= 79. 128838: chittore . | | | Longitude > 79. | | | | Longitude <= 79. | | | | | December <= 15. 3: Prakasham . | | | | | December> 15. 3: Prakasham . | | | | Longitude > 79. | | | | | October<= 247. 4: Nellore . 0/1. | | | | | October> 247. 4: Nellore . | Longitude > 80. | | Longitude <= 81. | | | Longitude <= 80. 43654: Guntur . | | | Longitude > 80. | | | | Longitude <= 80. | | | | | May <= 102. 1: Krishna . | | | | | May > 102. 1: Krishna . | | | | Longitude > 80. | | | | | November<= 118: West Godavari . | | | | | November> 118: West Godavari . | | Longitude > 81. | | | Longitude <= 83. | | | | Longitude <= 82. 040714: East Godavari . | | | | Longitude > 82. | | | | | November<= 70: Visakapatnam . | | | | | November> 70: Visakapatnam . | | | Longitude > 83. | | | | Longitude <= 83. | | | | | August<= 231. 2: Vizayanagaram . | | | | | August> 231. 2: Vizayanagaram . 0/1. | | | | Longitude > 83. | | | | | November<= 78: Srikakulam . | | | | | November> 78: Srikakulam . Figure 1. The Classifier Tree Association Rules : Predictive Apriori ============================= Best rules found: May=0,N=0. D=4 ==>Adlabad. State=Telangana Jul=170. 3,D=1. 7 ==>Krishna. State=Andhra Pradesh N=8. D=0. 3 ==>East Godavari. State=Andhra Pradesh May=2 ==>Nizamabad. State=Telangana May=21 ==>Medak. State=Telangana May=40. 6 ==>Prakasam. State=Andhra Pradesh May=45. D=0 ==>Srikakulam. State=Andhra Pradesh May=63. 3 ==>Guntur. State=Andhra Pradesh May=67. Jul=24. N=3. ==>Kurnool. State=Andhra Pradesh Jul=41. 6 ==>Mahaboobnagar. State=Telangana A=225. 9 ==>Hyderabad. State=Telangana O=36. N=33. 6 ==>Khammam. State=Telangana O=74. 9 ==>Cuddapa. State=Andhra Pradesh N=12. D=0. D=1. D=3. 3 ==>Anantapur. State=Andhra Pradesh Longitude=77. Latitude=17. Longitude=78. Latitude=16. Longitude=78. Latitude=18. Longitude=78. Latitude=18. Longitude=78. Latitude=17. Longitude=78. Latitude=19. Longitude=79. Latitude=17. Longitude=79. Latitude=18. Longitude=80. Latitude=17. 247253 ==> State=Telangana Longitude=78. Latitude=15. Longitude=78. Latitude=14. Longitude=79. Latitude=13. Longitude=79. Latitude=15. Longitude=79. Latitude=14. Longitude=80. Latitude=16. Longitude=80. Latitude=16. Longitude=81. Latitude=16. Longitude=82. Latitude=17. Longitude=83. Latitude=17. Longitude=83. Latitude=18. Longitude=83. Latitude=18. Latitude=13. Longitude=79. Latitude=14. Longitude=79. Latitude=14. Longitude=77. 600591, ==> State=Andhra Pradesh GIS-MAP Based Spatial Analysis of Rainfall Data of Andhra Pradesh and Telangana A (Y. Nagendra K. ISSN: 2088-8708 Figure 2. Association rules generated using Predictive Apriori Algorithm The Latitude and longitude values corresponding to the States Andhra Pradesh and Telangana are associated using Predictive Apriori Algorithm and in the same way the Rain fall values corresponding to the Districts were also perfectly associated by this Predictive Apriori Algorithm. For example May=0,N=0. D=4values are associated with the district Adilabad. State=Telangana and May=67. Jul=24. N=3. 9 values are associated with the district Kurnool. State=Andhra Pradesh. The Annual rain fall for the various years from 2004 to 2010 is shown in Figure 3. In this graph, the Districts were taken along X-axis and the Annual Rain fall was taken along Y-axis. From the Graph, it is inferred that in the year 2005 Karimnagar has received high rainfall followed by Nellore. Chittore and Visakapatnam. In the year 2006 Nellore received high annual Rainfall followed by Vizayanagaram. Srikakulum and Visakapatnam. In the year 2010 East Godavari received highest rainfall followed by West Godavari. Krishna and Khammam. The Figure 4 shows map . of the Rainfall data of Andhra Pradesh along with Telangana States across districts and years. Figure 5, the Rainfall data of 2009 across the districts of Andhra Pradesh and Telangana were shown in constructed customized map Ae Prakasam district rain fall data is shown in Pop up window. In Figure6, the Rainfall data of 2009 across districts of Andhra Pradesh and Telangana were shown in constructed customized map Ae west Godavari district rainfall shown in pop up window. In Figure 7, the Rainfall data of 2009 across districts of Andhra Pradesh and Telangana were shown in constructed customized map with Annual rain fall data values. In Figure 8, the Rainfall data of 2010 across districts of Andhra Pradesh and Telangana were shown in constructed customized map with Annual Rainfall data values. In Figure 9, the Rainfall data of 2004 across districts of Andhra Pradesh and Telangana were shown in constructed customized map with Annual Rainfall Data Values. In Figure 10, the Rainfall data of 2005 across districts of Andhra Pradesh and Telangana were shown in constructed customized map with Annual Rainfall data values. Series1 District vs Total Rainfall Adilabad Adilabad Adilabad Anantapur Anantapur Chittor Chittor Chittor Cuddapah Cuddapah East Godavari East Godavari East Godavari Guntur Guntur Hyderabad Hyderabad Hyderabad Karimnagar Karimnagar Khammam Khammam Khammam Krishna Krishna Kurnool Kurnool Kurnool Mahbubnagar Mahbubnagar Medak Medak Medak Nellore Nellore Nizamabad Nizamabad Nizamabad Prakasam Prakasam Rangareddy Rangareddy Rangareddy Srikakulam Srikakulam Vishakhapatnam Vishakhapatnam Vishakhapatnam Vizianagaram Vizianagaram Warangal Warangal Warangal West Godavari West Godavari Total rainfall Districts Figure 3. Districts Vs years Total Rain fall Data CONCLUSION The Annual Rainfall Data set was refined with Pre-processing Techniques and tested with Hybrid Data Mining Techniques namely Classification i. Decision Tree Classifier Algorithm namely J48. Apriori alogirthm was applied on the resultant data set for finding Association Rules. The results of Latitude and Longitude values are perfectly associated with the respective states namely Andhra Pradesh and Telangana. The Highest Rainfall has taken in the year 2005 across the districts. The Maps shows, the spread of Rainfall data across districts of Andhra Pradesh and Telangana. From these maps it is evident that the coastal area districts namely Nellore. Visakapatnam. Vizayanagaram. East Godavari and West Godavari etc. IJECE Vol. No. February 2017 : 460 Ae 468 IJECE ISSN: 2088-8708 more rainfall than compared to other Areas of the states of Andhra Pradesh and Telangana. There is an exceptional case with karimnagarin 2005 year. FIGURES AND TABLES Figure 4. The Rainfall data of Andhra Pradesh along with Telangana across districts and across years were shown in the Map Figure 5. The Rainfall data of 2009 across districts of Andhra Pradesh and Telangana were shown in constructed customized map Ae Prakasam district rain fall data is shown in Pop up window Figure 6. The Rainfall data of 2009 across districts of Andhra Pradesh and Telangana were shown in constructed customized map Ae west Godavari district rainfall shown in pop up window GIS-MAP Based Spatial Analysis of Rainfall Data of Andhra Pradesh and Telangana A (Y. Nagendra K. ISSN: 2088-8708 Figure 7. The Rainfall data of 2009 across districts of Andhra Pradesh and Telangana were shown in constructed customized map with Annual rain fall data values Figure 8. The Rainfall data of 2010 across districts of Andhra Pradesh and Telangana were shown in constructed customized map with Annual Rainfall data values Figure 9. The Rainfall data of 2004 across districts of Andhra Pradesh and Telangana were shown in constructed customized map with Annual Rainfall Data Values IJECE Vol. No. February 2017 : 460 Ae 468 IJECE ISSN: 2088-8708 Figure 10. The Rainfall data of 2005 across districts of Andhra Pradesh and Telangana were shown in constructed customized map with Annual Rainfall data values REFERENCES