Geo-spatial Technology for Landslide Hazard Zonation and Prediction

Similar to other geo hazards, landslides cannot be avoided in mountainous terrain. It is the most common natural hazard in the mountain regions and can result in enormous damage to both property and life every year. Better understanding of the hazard will help people to live in harmony with the pristine nature. Since India has 15% of its land area prone to landslides, preparation of landslide susceptibility zo‐ nation (LSZ) maps for these areas is of utmost importance. These susceptibility zo‐ nation maps will give the areas that are prone to landslides and the safe areas, which in-turn help the administrators for safer planning and future development activities. There are various methods for the preparation of LSZ maps such as based on Fuzzy logic, Artificial Neural Network, Discriminant Analysis, Direct Mapping, Regression Analysis, Neuro-Fuzzy approach and other techniques. These different approaches apply different rating system and the weights, which are area and fac‐ tors dependent. Therefore, these weights and ratings play a vital role in the prepa‐ ration of susceptibility maps using any of the approach. However, one technique that gives very high accuracy in certain might not be applicable to other parts of the world due to change in various factors, weights and ratings. Hence, only one meth‐ od cannot be suggested to be applied in any other terrain. Therefore, an under‐ standing of these approaches, factors and weights needs to be enhanced so that their execution in Geographic Information System (GIS) environment could give better results and yield actual ground like scenarios for landslide susceptibility mapping. Hence, the available and applicable approaches are discussed in this chapter along with detailed account of the literature survey in the areas of LSZ mapping. Also a case study of Garhwal area where Support Vector Machine (SVM) technique is used for preparing LSZ is also given. These LSZ maps will also be an important input for preparing the risk assessment of LSZ.


Introduction
According to the information on the International Red Cross, there are roughly 200 major natural disasters that occur each year in the world. These natural disasters cause an annual average loss of nearly 130,000 persons, and more than 140 million normal lives are affected. The frequency of occurrences of these natural disasters has increased many times in the recent past, and its effects are becoming more severe in the coming years. The major attribute is being the population growth, urbanization/industrialization leading to climate change. In general, most of the "natural risks" are accentuated by humans themselves by direct or indirect interference with the nature. Understanding a natural disaster is very difficult as it is a very complex system that involves various controlling and contributing factors. This means that no easy, one-sided solutions can be found, but applying the holistic approach to tackle such problems could yield some beneficial results. Currently, many researches are being carried out to understand the phenomenon acting behind these natural disasters such as floods, tsunamis, cyclones, earthquakes, landslides, etc. So to combat these natural risks, the holistic concepts should be developed and applied, particularly to tackle landslide risk as landslides are one of the major environmental problems in our society.
The adverse impacts of climate change on developing countries have been highly consequential. High-magnitude flash floods and increased rains has been one of the pertinent causes of extensive landslides, which accounts for around 4.89% of the globally occurring natural disasters during the last two decades. The unplanned urbanization and development coupled with continued deforestation may be attributed to this rise in figure. Landslides are quite frequent along the tectonically active Himalayan region. In the year 1984, Varnes defined the term hazard as "the probability of occurrence of a potentially damaging phenomenon within a specified period of time and within a given area". When such spatial distributions of hazards are represented on maps into various classes, it gives zonation maps. Thus, landslide hazard zonation refers to the division of area into various classes, which is categorised on the basis of degrees of actual/potential hazard caused by landslides. Hence, hazard zonation forms a critical factor for effective landslide management and is used as a tool for planning mitigation measures. The preparation of the landslide hazard map requires the analysis of most determining factors that leads to soil failure. The preparation of landslide hazard zonation requires detailed landslide inventory, processes involved in slope instability, triggering factors and many other associated studies. Landslides may occur due to a variety of conditional and triggering factors such as change in slope angle, slope aspect, faults, lithology, deforestation, improper drainage system, rainfall, and earthquakes. Thus, this zonation can be carried out at various scales from national (1:1 million) to local (1:5000). Depending on the scale of map, the parameters/factors and their accuracy varies.
With the advent of satellite data and various sensors, the scope of remote sensing has increased widely. The bird's eye view of the area at moderate to fine resolution gives fast and quick information about the terrain. Clubbed with the spectral and temporal characteristics of the satellite, the ability to identify and recognise landslides for the preparation of inventory map has been improved a lot. Both visual as well as automatic processes are well developed for recognition of landslide features. This preparation of inventory map has been made more effective with recent developments of resolution merging where data from different sensors could be merged to obtain better, sharp and good resolution images. Not only in the identification of landslides but also in the preparation of other contributing and controlling factors, remote sensing plays a crucial role. The elevation data from DEM (Digital Elevation Model) are used for the preparation of slope, aspect, relief, curvature, etc., parameter that controls the behaviour of landslide as well as the slope stability/instability. Not only the optical and multispectral data but the Radar and SAR data are being used for the analysis of landslides. The interferometric SAR technique is capable of distinguishing very minute changes in elevation and slope; hence, it is used for the identification of higher-resolution and correspondingly smaller area. Data from various sensors, i.e. optical, multispectral, thermal and microwave/radar, are being used for landslide studies.
There are various methods for the preparation of Landslide Susceptibility Zonation (LSZ) such as based on Fuzzy logic, Artificial Neural Network, Discriminant Analysis, Direct Mapping, Regression Analysis, Neuro-Fuzzy approach and other techniques. These different approaches apply different rating system and the weights, which are area and factors dependent. Therefore, these weights and ratings play a vital role in the preparation of susceptibility maps using any of the approach. However, one technique that gives very high accuracy in certain might not be applicable to other parts of the world due to change in various factors, weights and ratings. Hence, only one method cannot be suggested to be applied in any other terrain. This chapter discusses the methods being used in the field of LSZ, what are the input parameters being used, what the accuracy is and how best the method map the LSZ. However, it should be kept in mind that most of these methods/analysis are based on landslide inventory of any area, so the first and foremost step for working towards LSZ should be preparation of landslide inventory. Finally, this chapter discusses a case study of application of geo-spatial technology for preparation of LSZ in Garhwal Himalayan region, which is tectonically very active and prone to landsliding.

Regression Analysis
People are normally interested in finding the relationship between different variables. For example, whether smoking causes lung cancer? Regression analysis is the statistical method of finding relationship between dependent/predicted variable (denoted as y) and independent/ predictor variables (denoted as x 1 , x 2 , … , x n ), where n denotes the number of predictor variables [1]. The true relationship between y and x 1 , x 2 , … , x n can be approximated by the regression model as indicated in equation 1 ( ) 1 2 , , , where ε is assumed to be a random error representing the discrepancy in the approximation. It accounts for the failure of the model to fit the data exactly [2]. Typically, regression analysis is used for one of these three purposes [3] viz. (i) Modelling the relationship between x and y, (ii) Prediction of target variable, and (iii) Testing of hypotheses.
There are three types of regression models: Simple linear regression: It models the linear relationship between two variables; out of which one is dependent variable y, and other is independent variable x. In this model, regression equation is given as below in equation 2 where a = slope of regression line, b = intercept and ε = random error. Simple linear regression is shown in Figure.1.

Multiple linear regression:
There are many situations when result depends on one or more predictor variables. In such situations, simple linear regression is not sufficient to model the output, hence it requires a regression equation as given in eq 3, which models the linear relationship between one dependent variable y and more than one independent variables x 1 , x 2 , … , x n . In this model, regression equation is given as below where a 1 , a 2 , …, a n are regression coefficients, a 0 = intercept and ε = random error After the determination of regression model, its parameters are estimated based on the collected data. This is called as parameter estimation and model fitting. Most commonly used method of estimation is called the least square method [1,2,3].
Nonlinear regression: When the relationship between dependent and independent variable cannot be modelled using straight line, nonlinear regression is used. For example, nonlinear regression model for growth of a particular organism (y) as a function of time (t) can be written as where α and β are model parameters and ε = random error. All nonlinear functions that can be transformed into linear functions are called linearizable functions [2,3].

Estimation Using Least Square
The least square method for linear regression finds regression coefficients a 0 , a 1 , a 2 , …, a n such that sum of squared distance from actual value y i and fitted value y i ˰ reaches minimum for all possible choices of regression coefficients a 0 , a 1 , a 2 , …, a n , [1,4] using the given eq 5.
The difference between observed value y i and fitted value y i ˰ is called residual. to random error ε [3].
When dealing with regression analysis, if there is only one response variable, regression analysis is called univariate regression, and in case of two or more response variables, the regression is called multivariate regression. The difference between simple and multiple regressions is determined by the number of predictor variables (i.e. simple means one predictor variable and multiple means two or more predictor variables), whereas the difference between univariate and multivariate regressions is determined by the number of response variables. A brief summary of various classifications is given in Table-1. Out of all these regression types, logistic regression method is used a lot since most variables in hazard zonation mapping tends to be qualitative rather than quantitative.

Univariate
Only one quantitative response variable

Multivariate
Two or more quantitative response variables

Simple
Only one predictor variable

Multiple
Two or more predictor variables

Linear
All parameters enter the equation linearly, possibly after transformation of the data

Nonlinear
The relationship between the response and some of the predictors is nonlinear or some of the parameters appear nonlinearly, but no transformation is possible to make the parameters appear linearly

Analysis of Variance
All predictors are qualitative variables

Analysis of Covariance
Some predictors are quantitative variables and others are qualitative variables

Logistic
The response variable is qualitative

Logistic Regression
Logistic regression model is a general linear model, which models the data with binary responses [1], i.e. it predicts the presence or absence of an outcome based on the values of a set of predictor variables [5]. The dependent variable in logistic regression is binary (i.e. 0 or 1, true or false), whereas the independent variable can be categorical, dichotomous or interval [6]. For landslide study, dependent variable is binary, showing either the presence or the absence of landslide.
Example: For determining risk factor for cancer, health data of several people were collected on several variables such as age, sex, smoking, diet, and the family's medical history. The response variable "y" is the person having cancer (y=1) or not having cancer (y=0) [2].
Coefficients of logistic regression can be used to calculate ratios for each independent variable in the model. Logistic regression model can be represented in simplest form as shown in equation 7 where p is the probability of occurrence of an event (varies between 0 and 1 on S-shaped curve), and y is dependent variable and calculated using the logistic regression equation 8 where a 1 , a 2 , …, a n are logistic regression coefficients and a 0 = intercept, x 1 , x 2 , … ., x n are independent variables [7]. [2,4] i.

Applications
Agricultural sciences (e.g. analysis of data of milk production).
ii. Management, industrial and labour relations (e.g. Do chief executive officers (CEOs) and their top managers always agree on the goals of the company?).

iii.
Environmental sciences (e.g. exploration of relationship between water quality and land use).

iv.
Psychology (e.g. What are the factors that impact the likelihood of a moonlighting worker becoming aggressive toward his or her supervisor?).

v.
Geography (Can the population of an urban area be estimated without taking a census?).

Landslide Hazard Zonation using Regression Analysis
Regression analysis is one of the most widely used statistical tool as it provides simple methods for establishing a functional relationship among variables. Logistic regression has been used widely for preparation of landslide hazard zonation maps [5,6,8,9]. Slope, aspect, curvature, distance from drainage, lithology, distance from lineaments, land cover, vegetation index, and precipitation are considered as landslide-causing factors in many literatures. In logistic regression model, LHI is calculated by solving the regression equation. Correlation between landslide event and landslide affecting factors is estimated, and then, equation predicting the landslide is obtained.

Analytic Hierarchy Process
AHP, developed by Thomas L. Saaty in 1975, is an effective tool for decision making. It helps the decision makers in setting priorities and making best decision on complex decisive problems. It distributes the problems in hierarchy of criteria and options (alternatives), i.e. it reduces complex decisions to pairwise comparisons and then synthesizes the result. The AHP considers both the rational and the intuitive to select the best from a number of alternatives evaluated with respect to several criteria. It checks for consistencies in decision maker's evaluation and also allows limited inconsistencies in judgements.

Working of AHP
The AHP uses a set of evaluation criteria and a set of alternative options among which the best decision is to be made. It generates a weight for each evaluation criteria according to pairwise comparisons of criteria. The criteria with higher weight are selected since it is most important of all the criteria. Further, for fixed criteria, it assigns a score to each alternative option according to pairwise comparisons of options based on those criteria. Higher the score for an option, better the performance of that option w.r.t. considered criteria. Information is then arranged in a hierarchical tree. Finally, the AHP generates global score for each option using the combinations of the criteria weights and options scores and determines relative ranking of alternatives. A simple hierarchy with three levels is shown in Figure.2.

Implementation of AHP
AHP can be implemented in three simple steps i.

Computation of weight vector for all criteria
ii.
Computation of score matrix for all options iii.

Ranking of options based on final score
Once the goal has been set, then for all the alternatives, different ranks are given based on the criterion fixed to reach that goal. In this way, the priorities are set, and these factors are compared pairwise. For example, in case of landslide zonation, the goal could be to identify the areas that are prone to landsliding and the factors/parameters, such as slope, elevation, soil type, rock type, distance to drainage, etc., controlling it would become the alternatives. And to select the areas prone to landsliding, the criteria could be fixed such as slope should be more than 45º, soil type should be clayey, rock type should be other than granite/gneiss (hard rock), etc. Hence, the area fulfilling these criteria will be selected. This way of preparing the landslide susceptibility map is area specific, and the criteria applicable to one location may not be true for other location. Hence, a different approach is needed where the system adjust itself with the given conditions and scenarios.

The Fundamental Scale
The AHP is a general theory of measurement and is used to derive relative priorities of different criteria on absolute scales. Pairwise comparison judgments in the AHP are applied to pairs of homogeneous elements. The fundamental scale represents the intensities of judgments. In many cases, the elements to be compared are almost equal in measurements. In this situation, comparison must be made not on what fraction it is larger than the other [10]. Pairwise comparisons of criteria and/or options are performed based on the scale given in Table- Extreme importance The evidence favouring one activity over another is of the highest possible order of affirmation  [10,12,13].

Applications of AHP
i.

Evaluation of cities for livelihood and planning
ii.

Ranking of countries
iii.
Customers adoption of mobile devices and mobile services iv.
Human organ transplants v.

vi.
Natural resource management

Landslide Hazard Zonation using AHP
Various authors [14,15,16,17,18,19] have used AHP for giving weights to various factors of landslide occurrence. The effect of each factor and factor classes, on landslide occurrence, is determined using pairwise comparison, and an equation is modelled for landside susceptible index (LSI), as given below in equation 9 where Factor i = landslide conditioning factor such as slope, aspect, lithology, etc. W AH P i = Weightage for each causative factor. Pixel (LSI) values derived from above equation are classified into various susceptibility classes (low, moderate, high, and very high) based on natural break.

Artificial Neural Network
Artificial neural network attempts to model the information processing capabilities of the brain. The operation of the brain is based on simple basic elements called as neurons. Neurons are connected to each other with transmission lines called as axons and receptive lines called as dendrites. Information is stored at synapses. Each neuron has an activation level that ranges between some minimum and maximum value [20,21]. A neural network is a massively parallel distributed processor made from simple processing units, which can store knowledge gained from experiments and can utilize it later. It replicates the processing of the brain in two respects [22].

i.
Knowledge is acquired by the network from its environment through a learning process.

ii.
Synaptic weights are used to store the acquired knowledge. In 1943, McCulloch and Pitts proposed a computational model for artificial neuron, based on binary threshold [23]. This neuron calculates a weighted sum of 'n' input signals, x j where j = 1, 2, 3…...n, and generates an output of 1 if this sum is above a certain threshold 'u', else output 0. The model [24] is shown in Figure. 3 and given by equation 10. 1 1, , 0, y otherwise = ANN is a weighted directed graph, in which artificial neurons are nodes and directed edges with weights are connections between neuron outputs and neuron inputs. ANN can be grouped in two categories [20,22].

i.
Feed-forward network, where graph has no loops, as shown in Figure. 4. Here, all the nodes in each layers are connected to every other node in forward layer, hence it is called fully connected network. If some of the links are missing, then it is called partially connected network. Example: single-layer perceptron, multilayer perceptron, radial basis function, etc.

ii.
Recurrent or feedback network, where graph has loops because of feedback connections, as shown in Figure. 5. Here, output from all the neurons is applied to input using feedback connection. Example: self-organizing map, adaptive resonance theory model, Hopfield network, etc.

Learning Algorithms
To be able to learn is the fundamental trait of intelligence. Although it is difficult to formulate a precise definition of learning, the process of learning in the context of ANN can be defined as the problem of updating network architecture and connection weights so that a network can efficiently perform a specific task [20]. Artificial neural network tries to learn input-output relationships from the given collection of representative examples, instead of following a set of rules specified by human experts. This is one of the major advantages of neural networks over traditional expert systems. A learning algorithm refers to a procedure in which learning rules are used for adjusting the weights. Some examples of learning algorithms are (i) Error correction learning, (ii) Memory-based learning, (iii) Hebbian learning, (iv) Competitive learning, (v) Boltzmann learning, etc. [23,25].

Feed-Forward Back-Propagation Network (Based on error correction learning)
It is basically a feed-forward multilayer perceptron with back-propagation as learning/training algorithm. In order to train a neural network to perform desired task, the weight of each input has to be adjusted, such that the error between the desired and actual output is minimal (Figure.

Applications of ANN
1. Image processing, classification of satellite data, compression of large images, etc.

Application of ANN in Landslide Hazard Zonation
ANN has been used widely in the preparation of LHZ maps [34][35][36][37]. People have used variations of ANN with one input layer, two hidden layers, and one output layer for various factors controlling landslide occurrence. ANN connection weights are used to provide weights or rankings to the input data source (landslide-causative factors). Weights of factors and rankings of categories are integrated to provide LSZ map.

Support Vector Machine
Support Vector Machine is a data classification technique, developed by Vapnik in 1990. Classification process involves separating data into training and testing sets. Each element in the training set contains a corresponding target value (i.e. the class labels) and several attribute (i.e. the features of elements). The ultimate goal of SVM is to predict the target value for the test data, with only attributes of the test data given [38,39]. Support vector machines are based on the concept of decision planes that define decision boundaries [40]. SVM finds the best hyperplane (n-dimensional plane) that separates all data points of one class from those of other class. It uses kernel method to project linearly non-separable data to a higher dimension. The kernel can separate classes even if mean values are near to each other. A simple illustration of the method is shown in Figure.7. The data points shown are linearly separable. The maximum margin hyper plane is shown in red, and the margin between the support vectors is shown by the parallel light blue lines. The two classes do not overlap. The support vectors (patterns that are on the margin) are shown [41] as yellow circles for class 1 and triangles for class 2. Let m-dimensional training inputs x i (i=1,...,M) belong to Class 1 or 2 and the associated labels be y i = 1 for Class 1 and −1 for Class 2. If these data are linearly separable, we can determine the decision function, which is represented by equation 11 [42] ( ) = + D x w x b T (11) where w and b are weight and bias, respectively, to map the input into a higher dimensional space. The optimal separating hyper plane (i.e., w T x + b = 0) is located where the margin between the two classes is maximized, and the misclassification is minimized. The optimal hyper plane satisfies the following constrained minimization as given by equations 12-13 where α i = Lagrange's multiplier and α i ≥ 0. SVM can perform only binary classification; however, classifying data in more than two classes can be performed using pairwise classification [42,43].

i.
Image processing for classification of satellite images [44].

Advantages and Disadvantages
All these methods mentioned above have certain advantages as well as disadvantage over the other, hence a detailed comparative Table 3 showing their advantages and disadvantages are given below.

Regression Analysis
Model developer has full knowledge of variables. It requires the data to be independent.
It is most strongly predictive of an outcome. It is sensitive to outliers.
It runs faster than neural network/support vector machine-based models.
It is not "black box" as ANN.
Analytic Hierarchy

Process (AHP)
It is simple, flexible and powerful. It requires a large number of comparisons.
All the calculations are driven by decision maker's experience.
Limitation of the use of 9 point T. L. Saaty's scale.
It does not require an expert system with the decision maker's knowledge embedded in it.
It adds extra burden on decision maker for complex problem. Single-layer perceptron work only on linearly separable classification problems.

Rank reversal
Availability of multiple training algorithms. It requires greater computational resources.
It is prone to over fitting.
Can trap in local minima.

Machine (SVM)
It has high prediction accuracy and good mathematical foundation.
The biggest limitation of the support vector approach is the choice of the kernel.
Overfitting does not occur. It requires long training time.
It does not trap in local minima, i.e. it finds the global solution.
Problem has to be formulated as two-class problem.
It works well with fewer training samples (i.e. number of support vectors do not matter much).

Literature Survey
The literature survey of some of the available research works carried out for Landslide Susceptibility Zonation is shown in Table 4  The results obtained showed that the Artificial Neuro Fuzzy (ANF) modeling is a very useful and powerful tool for the regional landslide susceptibility risk assessments. Various membership functions should be selected and a number of training sets should be carefully and optimally selected to prevent over learning of the model. Therefore, the results that are to be obtained from the ANF modeling should be assessed carefully because the over learning may cause misleading results [35]. As a final recommendation, the results obtained from various papers showed that the methods followed in the study based on Neuro-Fuzzy approach exhibits a high performance. However, it is not forgotten that the performance of such type maps depends not only on the methodology followed but also on the quality of the available data and the factors considered for preparing LSZ. These input factors can be natural factors (like rainfall, lithology, slope, etc.) and anthropogenic factors (like road construction, mining, etc.). For this reason, if the quality of the data increases, the performance of the maps produced by these methods could increase. The detailed literature survey where various different models have been used for landslide hazard zonation is given below: Lee and Pradhan [5] used frequency ratio and logistic regression model for mapping the landslide susceptible areas by considering slope, aspect, curvature, distance from drainage, lithology, distance from lineaments, land cover, vegetation index, and precipitation as landslide stimulating factors. They calculated the Landslide Hazard Index (LHI) by summation of frequency ratios for all the factors and solving the regression equation, respectively, for both methods and concluded that the frequency ratio model has 2.7% (93.04-90.34%) better predication accuracy than the logistic regression model.
Pradhan et al [57] combined frequency ratio and fuzzy algorithm for generating landslide hazard maps. Fuzzy membership values were calculated using frequency ratio and detected landslides. Fuzzy algebraic operators (such as fuzzy and, or, product, sum) and fuzzy gamma operators were applied on fuzzy membership values for landslide hazard mapping. Value of fuzzy gamma operator was set to 0.025, 0.05, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 0.95, and 0.975 for detecting its effect on landslide hazard maps. After verification, they found that out of 17 cases tested, the gamma operator with value 0.8 performed best (prediction accuracy 80.26%), while 'Fuzzy algebraic sum' and 'fuzzy or' showed worst accuracy of 64.77% and 56.86%, respectively.
Pourghasemi et al [14] showed the applicability of fuzzy logic and analytic hierarchy process in the mapping and zonation of landslide susceptible areas. A total of 12 data layers, which correspond to 12 landslide conditioning factors, were exploited to detect the most susceptible areas. Fuzzy membership values to all pixels were assigned based on the frequency ratio model. Landslide susceptibility was then identified using fuzzy if then else rules. Using the AHP model, weightage of each contributing factor was identified using pairwise comparisons and an equation was modelled for landside susceptible index. Validation of the maps created using both the methods was performed using ROC curve. They concluded that the model with fuzzy logic has the highest area under the curve (AUC) value 0.9194, whereas AHP has 0.8887.
Devkota et al [6] compared certainty factor, index of entropy and logistic regression methods for landslide susceptibility mapping. Slope gradient, slope aspect, altitude, plan curvature, lithology, land use, distance from faults, rivers and roads, topographic wetness index, stream power index and sediment transport index were considered as prominent factors for landslide susceptibility study. The value of the certainty factor ranges between −1 and +1. A positive value means an increasing certainty in landslide occurrence, while a negative value corre-sponds to a decreasing certainty in landslide occurrence. CF values of the landslide conditioning factors were combined pairwise to generate landslide susceptibility index. Natural breaks were used to classify LSI value to Landslide Hazard Zones. The performance of landslide susceptibility models was assessed using ROC curves. They found that the hazard map prepared using the index of the entropy model has the highest prediction accuracy (90.16%), followed by the logistic regression model (86.29%) and the certainty factor model (83.57%).
Nourani et al [8] prepared landslide hazard zonation maps using genetic programming and compared it with frequency ratio, logistic regression, artificial neural network. Seven factors, i.e. lithology, slope, aspect, elevation, land cover, distance to stream, and distance to road, were considered prominent for landslide hazard zonation study. In the frequency ratio model, landslide hazard index was calculated by summation of frequency ratios for all the factors. In the logistic regression model, LHI was calculated by solving the regression equation. Correlation between landslide event and landslide affecting factors was estimated, and then, equation predicting the landslide was obtained. Three layered feed-forward neural network with back-propagation as training algorithm was used for calculation of LHI. Two different criteria were used to measure the efficiency of the ANN method, i.e. the root mean square error (RMSE) and the determination coefficient (DC). For producing the best landslide susceptibility maps, sensitivity analysis was also implemented in ANN. For verification of LSM, produced by FR, LR, ANN, and GP methods, landslide testing data were compared with these maps. The assessment of AUCs showed that the prediction accuracy of FR, LR, ANN, and GP methods were 89.42%, 87.57%, 92.37%, and 93.27%, respectively.
Bui et al [37] compared the accuracy of landslide prediction, using support vector machine, multilayer perceptron neural network, radial basis function neural network, kernel logistic regression and logistic model tree. Slope, aspect, altitude, relief amplitude, topographic wetness index, stream power index, sediment transport index, lithology, fault density, land use, and rainfall were studied as landslide conditioning factors. For choosing the best subset of conditioning factors, predictive ability of the factors was assessed using the information gain ratio with 10-fold cross-validation technique. The analysis of landslide inventory map showed that landslides mainly occurred during and after the heavy rainfall. The performance of landslide susceptibility models was assessed using receiver operating characteristics (ROC) curves, and reliability was assessed using kappa index. They found that the MLP neural net model has the highest prediction capability of 90.2%, followed by the SVM model 88.7%, the KLR model 87.9%, the RBF neural net model 87.1%, and the LMT model 86.1%.
Youssef et al [9] combined logistic regression and frequency ratio for removing their weaknesses and producing landslide susceptibility maps with better accuracy. Altitude, curvature, distance from wadis, distance from road, distance from fault, stream power index, topographic wetness index, soil type, geology, slope, and aspect were used as contributing factors in landslide occurrences. Frequency ratio was calculated by analyzing the relationship between 11 conditioning factors and landslide occurrence. Landslide hazard index was calculated by summation of frequency ratios for all the factors and solving the regression equation, respectively, for the frequency ratio and logistic regression methods. After this, the probability index for ensemble of FR and LR was calculated and normalized to be between 0 and 1. For calculating the landslide susceptibility map from ensemble method, the probability index value was classified in five categories using quantile classifier. Probability index value represents the predicted probability of landslide for each pixel in the presence of given set of conditioning factor. Validation of all three models was performed using ROC curves, and they observed that the prediction accuracy of ensemble of FR and LR was higher (82%) than that of FR (58%) and LR (77%) separately.

Case Study
The landslide susceptibility mapping is carried out in the Mandakini River basin of Uttarakhand, which covers an area of about 2439 sq. km and is situated between 30°19'00"N to 30°49'00"N latitude and 78°49'00"E to 79°20'00"E longitude ( Figure. 8a) falling in Survey of India toposheet Nos. 53J and 53N.

Geological setting of the Study Area
The lithological mapping of the area ( Figure.  The high susceptibility to landslides in the Mandakini River basin is mainly due to complex geological settings, varying slopes and relief, heavy rainfall, along with ever-increasing human interference in the ecosystem. Extreme climatic events increase the instability of the terrain, which results in landslides, example includes the Kedarnath disaster [60].

Data Used
The Survey of India (SOI) toposheet Nos. 53N and 53J were used to create the base map of the study area. Landsat satellite image of October 2008 with 30-m spatial resolution was taken to finalize the tectonic and geologic map of the study area (after) [59]. Elevation data were taken from ASTER-GDEM (Advance Spaceborne Thermal Emission and Reflection Radiometer, Global Digital Elevation Model) having spatial resolution of 30 m with an accuracy of ±10 m.
These data sets were analyzed, preprocessed and then categorized using Arc GIS 9.3, ERDAS Imagine 9.1 software to generate various thematic layers such as elevation, slope, aspect, drainages, geology/lithology, soil, buffer of thrusts/faults, and buffer of streams in the study area (Figure 8 a-h).

Model Selection and Results
All the data sets were generated in Geographic Information System (GIS) environment at 30 × 30 m pixel resolution, the vector layers were converted to raster format with other raster data sets. These raster data sets were converted to ASCII format to be read in MATLAB for using Support Vector Machine (SVM) for prediction of Landslide susceptibility. The landslide data for Okhimath River basin, procured from Geological Survey of India (GSI), were considered to test the SVM model and generate the predictive susceptibility map. The study area contains 1,805,548 pixels, while 2207 pixels are present as landslides. Thus, the pixels representing the landslides are mere 0.125% of the whole study area. The purpose of this study is to predict the landslide, so 1 denotes that pixel involved in landslide and −1 represents pixels that are not involved in landslide. In the whole study area, 2207 pixels were mapped as landslide based on the past data from GSI and other published reports. The whole set of data were divided into 60% as training data and 40% as testing data. Hence, the landslide susceptibility map for Mandakini River basin was prepared using the Proximal Support Vector Machine (PSVM) model ( Figure. 9). It is evident from this figure that the PSVM model classified more areas in landslide susceptible zone as compared to certain landslides have been missed. Hence, various performance metrics such as average prediction accuracy (AA), true positive rate (TPR), true negative rate (TNR) and relative operating characteristic curve (ROC) were computed on testing data to validate the performance of prediction models [64,65,66]. The validation results in terms of AUC, and their corresponding testing accuracy showed that the PSVM model has higher AUC values when rainfall data from TRMM were considered with respect to when not considered as shown in Figure 10. The PSVM model with TRMM and without TRMM has an AA of 82.85% and 84.20%, TPR of 79.43% and 72.46%, TNR of 82.85% and 84.22% and an AUC value of 81.15% and 78.34%, respectively (Table 5). The high value of TNR (82.85% and 84.22%) achieved by the PSVM model in this case is due to the large number of pixels for the study area as compared to pixels forming the landslides. Hence, this model predicted/demarcated the safe areas with 84.22% accuracy when TRMM data were taken into consideration, while it predicted the areas prone to landslide with 79.43% accuracy when TRMM data were taken in consideration because of less number of landslide pixels. Though the AUC values (78.34% and 81.15%) are good, the average accuracy for the PSVM model is quite high between 82.85% and 84.20%. Similar results were also obtained by Pradhan [67] where SVM yielded 81.46% AUC when applied on altitude, slope angle, plan curvature, distance from drainage, distance from road, soil type and NDVI as the input parameters considered for landslide susceptibility mapping for Penang Island in Malaysia. Best results are shown in bold. AA(%) is the average accuracy, TPR(%) is the true predictive rate, TNR(%) is the true negative rate and AUC(%) is the area under the curve.

Conclusion
In Garhwal Himalaya, Mandakini River basin is highly vulnerable to landslides, especially the town of Okhimath and its nearby villages. In the vicinity of the study area, Mandakini River crosses various Himalayan thrusts, and due to the presence of these tectonically active MCT zones, the rocks shows high shearing and fracturing and becomes more susceptible for landsliding. The susceptibility to landslide is mainly controlled by valley slopes, attitude of discontinuity of surfaces, soil type, presence of drainage, nature of rocks exposed, and structural and tectonic features present, besides human interaction in the terrain.
Hence, recently developed Support Vector Machine (SVM) learning technique was applied on this area to demarcate the landslide prone and safe areas. The PSVM method has been applied for landslide susceptibility mapping of the study area. The PSVM model showed higher average accuracy (AA) of 82.82%-84.20% for this study area, and the ROC curve indicates that the PSVM model has the prediction accuracy of 81.15%. Nevertheless, this model can be effectively used for landslide susceptibility mapping in this area or similar terrain with these sets of input parameters.