Variables selected for SVM model.
The quantitative and qualitative measurement, prediction and evaluation of urban sprawl have come to play a central role in land-system science. One of the most important and most implemented artificial intelligence (AI) techniques in terms of urban systems simulation is cellular automata (CA) like SLEUTH. SLEUTH models the physical urban expansion by accomplishing four simple growth rules with every modeling step. Simultaneously, SLEUTH also reflects main drawbacks of CA since they contain a higher degree of stochastic variation leading to a simulation uncertainty. This chapter will explain how the simulation power of CA can be optimized by combining them with the machine learning algorithm support vector machines (SVMs). Conceptually in SVMs, input vectors are projected in a higher-dimensional feature space in which an optimal separating hyperplane can be constructed for separating the input data into two or more classes. In the comparative analysis, the integrated modeling approach is carried out for a unique postindustrial European agglomeration: The Ruhr Area. It will be demonstrated how the AI learning approach is implemented, calibrated, validated and applied for the prediction of the regional urban land-cover pattern between 1975 and 2005. Finally, the probability effects will be visualized with the concept of urban DNA.
- support vector machines
- Cellular automata
- urban growth
- probability maps
The American landscape designer Earle Draper initiated a term describing the unaesthetic and uneconomic settlement structure of American cities in his 1937 lecture on town planning. In reflecting the obliteration of rural and urban spaces, this term grew more and more in popularity: “perhaps diffusion is too kind of word. in bursting its bounds, the city actually sprawled and made the countryside ugly, uneconomic [in terms] of services and doubtful social value ”. Ever since the term ‘sprawl’ emerged in a geographical context, nearly 80 years have passed and the specific patterns and processes of growing cities have developed from an American to a European problem in general and a German problem in particular [2–5]. Being one of the most challenging land-use and land-cover changes implicating several consequences for the anthropogenic and geobiophysical spheres, urban sprawl has become an inherent part of the international sustainability discourse in the context of global change [6–10]. The same holds true for geographical research with its long history of diverse theories and models of land-use change in general and urban sprawl in specific [11–16]. One of the most important streams of urban models understands urban areas as complex systems. Those geosimulation techniques make use of artificial intelligence (AI) to model the micro processes being responsible for the macro pattern of urban systems [11, 12, 17]. Following the famous quote by Aristotle “the whole is more than the sum of its parts”, they model urban systems in a bottom-up manner. Cellular automata (CA) are very popular geosimulation tools. SLEUTH is a well-known urban CA . It is a purely growth-oriented model and as a bottom-up approach it is not dependent on intensive prestudies about the general causes of urban growth or the location-specific determining factors. Based on the principles of neighborhood effects and spatial autocorrelation, the simulation rules are relatively simple. However, SLEUTH is able to capture complex emergences of urban patterns so that it has been applied in several urban growth studies all over the world . Although the performance of CA for spatially explicit urban land-use simulation is very high, it gives no direct insight into the relationship between nonspatial human and ecological driving forces, spatial determining factors and the emergence of urban growth. The semistatistical method called support vector machines (SVMs)  is able to avoid this disadvantage: developed for solving nonlinear classification problems, SVMs are also suitable for analyzing the spatial driving factors of urban land-use change. They are a machine learning concept based on statistical learning theory. The basic idea is to project input vectors on a higher dimensional feature space, in which an optimal separating hyperplane can be constructed for separating the data into two or more classes. By using a specific feature selection, neglectable features can be separated and important features can be identified.
The objective of this contribution is to demonstrate how the simulation skills of CA on the one hand and SVM on the other hand can be integrated in order to achieve an optimized modeling approach. The anchor point for joining AI and machine learning physique is the exclusion layer of SLEUTH. Instead of the standard input map of the CA defining restriction areas where no urbanization is allowed to take place, a probability map derived from SVM will be utilized. In a nutshell, the coupling of SLEUTH and SVM is undertaken in order to answer the following research questions:
Which driving factors influence the physical urban growth in the Ruhr and how do they take effect according to a SVM feature selection?
Do SVM guide SLEUTH and do they optimize its modeling ability?
How well do SVM perform in comparison to standard SLEUTH implementations?
The main study area which SLEUTH-SVM is set up for lies in the western part of Germany and in the central part of NRW: the Ruhr (Figure 1). With a polycentric and administratively fragmented structure but a homogenous and extensive urban area, the Ruhr is a worldwide unique urban entity. In general, 11 cities and 4 districts form the biggest agglomeration (1150 people per km2) in Germany, and with its 443,969 ha it is the fifth largest urban region in Europe. Concerning the geosimulation of urban systems, the Ruhr proves to be suitable due to two principal aspects: Firstly, it exhibits the highest absolute rates of urban sprawl in Germany. Between 1975 and 2005, the agglomeration grew around 37,022 ha with a total urban area of 94,990 to 132,012 . Compared to other sprawling cities in Germany, the Ruhr’s urban expansion can be described as a ‘metropolitan suburban sprawl’ type with high values of new land consumption and with low fragmentations and dispersion patterns in the core area coexisting with lower densities and patched open spaces in the suburban area . Secondly, the Ruhr acts as a ‘hero’ in the scientific discourse of demographic decline and structural transformations in old industrialized cities. Like other members of the ‘rusty fellowship’, the Ruhr has to struggle with archetypical problems of the former monofunctional manufacturing cities depending on mining and heavy engineering: a demographic decline, an aging population, high unemployment rates, an incipient brain drain and a lack of incentives to attract prosperous companies of the service sector, especially the ‘new economy’ [22, 23]. Before the CA is optimized, a brief overview of the implications of urban sprawl, the complexity of urban systems and important urban models in geography, as well as an introduction into the geosimulation with CA is given.
2. Geosimulation of urban sprawl
2.1. The complexity of urban systems
Urban sprawl is often treated as a specific of urbanization and urban growth. The classification of those processes is not homogenous and their definitions oscillate between the pure expansion of impervious surfaces and the distribution of urban activities and life styles in addition [2, 4, 24]. The study focuses on the land consumption aspects of urban sprawl. That is the conversion from nonurban to urban land cover and land uses for the main part indicating a certain amount of newly built-up, impervious areas . The natural impacts of urban sprawl are manifold and concern several natural spheres. Cities consist by more than 50% of impervious land cover . The sealing of even fruitful and agricultural valuable soils leads to a loss of fertility, transformation, filtering and buffering services . The infiltration is disturbed and the surface runoff can be increased by five times with a sealing fracture of more than 90% . The ecological footprint of a street can be estimated by 2 km and that of a European agglomeration by 1000 times of its area . For a long time, the natural increase of people and migration flows into the cities’ core areas as well as the economic expansion of the industrial age constituted the most important factors responsible for the growth of German cities. During the last 50 years, however, the trends of land consumption and population growth have moved apart: While the German population has grown around one fifth, the amount of settlement and traffic areas has nearly doubled . In the literature, the following aspects are mentioned regularly: economic wealth; separation of individuals; the persuasion of the wish for owning a house on greenfield sites; new supply types; demographic change; aging of population; parish-pump politics and dominance of the automobile [2, 29–34]. Those causes build the main underlying driving forces of urban sprawl in Germany. But how do they interact? Which direct impacts do they evoke and what are their spatial outcomes? Why do they induce the emergence of sprawling urban areas even in regions affected by demographic and economic shrinkage? In order to measure, model and understand urban sprawl, one has to think of it as a kind of urban land-use change embedded in the global land system . The changes of land use and land cover and their driving factors construct nonlinear, complex systems including irrational behavior of their human actors. Therefore, urban areas must be treated as open and dynamic systems in which macro level patterns are a result of behavioral driven processes of micro level [36, 37]. Urban systems exhibit characteristics of hysteresis so that future developments and changes of urban systems are not only influenced by the current environment but also by the past one [32, 38, 39]. The initial configuration of its states affects the future decision making of its actors. Exemplary, road expansions do not only improve the infrastructural development but also change the spatial pattern affecting the circulatory system of a region’s economy and feeding back to road improvements . Thus, the observation scales of time (1) and space (2) are fundamental elements for the analysis of urban systems: (1) Technological innovations or new policies are exogenous drivers and affect sprawling urban growth in a short term. Due to coevolutionary interaction between states and actors, they become endogenous and are affected by urban dynamics in a long term . (2) While on an aggregate level, residential areas may be clustered resulting in a positive spatial autocorrelation, and on an individual level, a certain range of distance may be kept resulting in negative spatial autocorrelation [41, 42].
2.2. Geosimulation – Using the AI of cells
The new wave of urban simulation often goes under the name of geosimulation. They shifted the modelling paradigm from macrostatics to microdynamics, from aggregation to disaggregation, from homogeneity to heterogeneity and from equilibrium to disequilibrium in urban modeling. Mandl defines geosimulation as a simulation where the modeled system processes, compartments and applications are spatially characterized and exhibit spatial relations . Benenson and Torrens further comprehend geosimulation as “catch title that can be used to represent a very recent wave of research in geography … [which] is concerned with the design and construction of object-based high resolution spatial models … to explore ideas and hypotheses about how spatial systems operate ”. The implementation of AI methods for urban simulation purposes was heavily influenced by technical progresses in terms of computation, land-use data acquisition, geographic information systems (GISs), complexity studies, as well as the development and use of AI approaches in natural and social sciences outside of geography [17, 44]. Indeed, “geographers arrived somewhat late to the party” of addressing individual behavior alteration as design strategy of simulation applications . The displacement of equations by code offered the possibility to explain the pattern and processes of complex open system dynamics on multiple organizational levels theoretically and experimentally . Hence, the research object of urban sprawl could leave the suggestions of order, stability, linearity and rationality rooted in general system theory.
One of the most important and most implemented geosimulation techniques in terms of urban systems simulation is CA. The invention of CA is attributed to the mathematicians von Neumann (1951) and Ulam (1952), and “one can say that the ‘cellular’ comes from Ulam and the ‘automata’ comes from von Neumann” [47–49]. The final breakthrough of CA came with John Conway’s ‘The Game of Life’ in 1970. Urban CA is often defined by (1) a raster lattice representing the spatial context, (2) a set of states associating a cell with a certain land-use type, (3) neighborhoods influencing the spatial configuration and (4) transition rules regulating the conversion of a cell state with every (5) time step . The gridded two-dimensional character of a CA environment makes it well suited for the simulation of urban land-use and land-cover conversion. Here, the most popular CA modeling urban growth was developed by the authors mentioned in references [48–55]. The progress of CA implementations profited heavily by innovations in remote sensing techniques. The world’s radiances are recorded from a bird’s eye view and stored regularly pixel-by-pixel. Hence, the world’s surface is represented in a two-dimensional raster lattice facilitating a paradigm of modeling from the pixel. Natural or administrative borders are completely neglected so that land-use and land-cover patterns become the only things that matter. While maps always depend on a more or less subjective semiotics and imply reduced content, remotely sensed images provide the unmediated biophysical context of coupled human–environment systems : By classifying a data set of radiances with the help of specific spectral characteristics, a continuous spatial texture is turned into a discrete spatial pattern. Accordingly, a classified time series of LANDSAT data of 1975, 1984, 2001 and 2005 is applied for his study. The data sets were classified using a hybrid approach of supervised classification algorithms and knowledge-based decision trees. The resultant classification simply identifies ‘urban’ and ‘nonurban’ areas, where an “urban” area is defined as having a surface imperviousness of a minimum of 25%. A validation analysis of the classification documented an accuracy of >85%. In order to balance the spatial resolution and the spatial extent of the Ruhr, a grid resolution of 100 m was used. This classification procedure is described in detail in references [21, 61]. For the calibration of SLEUTH, the 1984 data comprise the base year and the 2001 data constitute the reference year. For the validation of SLEUTH, the 1975 data serve as the base year and the 2005 data constitute the reference year. Finally, the urban growth detected in the classified LANDSAT data between 1984 and 2001 is used to train the SVM model.
3. Modelling urban growth with an optimized cellular automaton
3.1. SLEUTH – An urban cellular automaton
Clarke’s urban growth model (UGM), mostly identified as SLEUTH, understands urbanization as a diffusion process where complex urban patterns spread as a whole. It has been applied in several urban growth studies all over the world  and offers many set screws for an enhancement of its performance. SLEUTH is an acronym of its initial input factors, which are slope, land-use, exclusion, transport and hillshade. The base data consist of the urban land-use configuration and the mandatory slope and transportation layer. The exclusion layer is optional, but recommended because it prevents the location of urban cells in, for example, conservation areas or water bodies. Additionally, the exclusion layer can be combined with a probability map to enhance the simulation performance. Five growth coefficients (dispersion, breed, spread, slope and road gravity) define the four growth rules of SLEUTH: spontaneous growth, representing the random emergence of new urban areas; new spreading center growth; edge growth depicting extensive urban sprawl and road-influenced growth (Figure 2). The last one is SLEUTH-specific and relocates a temporary cell along the road to its final position. Space and time are treated discretely in the CA. Thus, one growth cycle represents 1 year of urban growth and consists of the four aforementioned successive growth simulations. Each selected new urban cell is tested against the local slope and exclusion information as well as a random value during each growth cycle. One model step consists of all growth cycles (years) between the starting and the end date of the calibration phase.
The calibration process of SLEUTH is based on the brute-force method. Every parameter combination of the particular growth coefficients between values of 0 and 100 is tested until the optimal balance is assessed. Every modeling process for each parameter combination is run several times by using Monte-Carlo (MC) iterations. Goetzke modified SLEUTH and implemented it into XULU (eXtendable Unified Land-use Modeling Platform), a modeling environment developed at the University of Bonn [57, 58]. The standard calibration evaluation method of SLEUTH has been replaced by the multiple resolution validation (MRV) . The MRV procedure compares an observed simulated map with a validation map at different spatial resolutions. High -resolution maps are weighted more than maps at lower resolution. The basic idea behind MRV is to attenuate the impact of localization errors by extending the conventional cell-by-cell comparison and considering the similarity of the entire neighborhood of a cell. Thus, the ‘fuzziness’ of categorical maps is addressed , and spatial patterns can be simulated quite precisely by accurately classifying only a relatively few map cells. Additionally, urban land-use calibration with MRV requires only two maps: a map of the initial calibration year and one of the final year.
3.2. Support Vector Machines
In their contemporary form, SVM was firstly formulated by Cortes and Vapnik . Along with artificial neural networks and genetic programming they represent a new generation of machine learning algorithms. With their robust operation architecture and their valid classification results, SVM is a very popular classification technique for remote sensing data . Fundamentally, SVM is a linear binary classifier that labels a sample of empirical data by constructing the optimal separating hyperplane (Figure 3, left). Traditional machine learning methods try to minimize the empirical training error, causing a tendency to overfit [61, 62]. They are strongly tailored to the training data, so extending them to additional data becomes difficult. According to the principles of structural risk minimization, SVM tries to minimize the upper bound of the expected generalization error through maximizing the margin between the separating hyperplane and the data. The margin concept is the key element in the SVM approach for it is an indicator of its generalization capability [63, 64].
The principal advantage of SVM is the option to transform the model in order to solve a nonlinear classification problem without any a priori knowledge. Using the so-called ‘kernel trick’ (Eqs 7–9), the input vectors are reprojected to a higher dimensional space in which they can be classified linearly (Figure 3, right).
Considering the scenario of a set of training vectors
In this scenario, the dimension of the input space is determined by the range of urban growth driving forces. A hyperplane needs to be found which separates the positive from the negative feature vectors. The ‘separating hyperplane’
For the linearly separable case, two hyperplanes
The constant C is called penalty parameter and
SVMs are well suited for this operation since the training data
Instead of predicting the label directly, the class probability is calculated (Eq 11) which delivers the basis for the probability maps of urban growth. Platt approximates the probabilities for binary SVMs using a sigmoid function
3.3. Optimizing SLEUTH
In this study, SVM is applied to raster layer stack consisting of different geophysical, socioeconomic as well as demographic driving factors of urban growth in order to optimize the CA SLEUTH. The selection is based on recent empirical studies dealing with urban sprawl in Central Europe [2, 5, 24]. Most studies of SVM in the context of urban growth modeling employ ordinary distance variables to represent proximity [66, 68]. In some cases, varying accessibility has a significant effect on the forces driving land use decisions. Hence, the effects of accessibility to markets or important infrastructure facilities are measured by weighting distances with a road variable that was derived using a categorized road network data set. Demographic and socioeconomic data that are exclusively available at the district level have been disaggregated to dasymetric maps . For the other socioeconomic variables that are not related to population, statistical data were projected to the center points of each district and inverse distance weighting was used for interpolation. To minimize spatial autocorrelation effects, a stratified sampling method was applied . This procedure generated a training data set containing 4000 image pixels for each urban growth and no urban growth class, with a 1 km minimum separation distance between equal pixels. For the construction of the SVM model one can use the software tool imageSVM® implemented in the EnMAP Toolbox®. Originally, imageSVM was created for solving classification problems in the context of multi- and hyperspectral satellite imagery . The output of a SVM classification with imageSVM is not only a classified binary image but also a probability image based on the principles of (Eq 11). The crucial step for constructing a SVM model is determining optimal parameter settings, including appropriate values for the penalty parameter
The driving forces (Table 1) form the feature space (Eq 2) and build the base for training the 1984–2001 SVM urban growth model. The rank of a variable within the SVM feature selection (Table 1) is a reliable indicator for assessing the influence of the urban growth driving forces that were selected as model input parameters . Following several subsequent feature selections, nine variables were eliminated. It can be stated that the characteristic attributes of the distance-related variables  and of the number of jobs are more suitable for constructing the SVM model than other socioeconomic or demographic variables. The distance to the next railway station in particular seems to be a very good indicator for a possible urbanization. This might be a link to the industrial past traversing the urban area by a freight transport network. Beside elevation – and slope, which already is a part of SLEUTH – no other geophysical variable is usable for the selection of areas suitable for urban growth with SVM. The parameters “Jobs” related to the labor market and “NetDwellArea” variable which is a descriptor of living conditions are other important urban growth drivers.
|Dist Airport||Cost-weighted distance (CWD) to next international airport||5|
|Dist City||CWD to next city >25.000 inhabitants||3|
|DistHighway||CWD to next highway exit||2|
|Dist Railway||CWD to next railway station||1|
|Dist River||Euclidian distance to next river||6|
|Highway Buffer||500 m buffer to highways||n.i. X|
|Elevation||Elevation above sea level (m)||11|
|Soil depth°||Vertical extent of soil layer (cm)||n.i.|
|Soil type°||Soil type defined by grain size (nominal)||n.i.|
|Soil quality°||Agricultural appropriateness (from [temporary] ‘not usable’ to ‘very good|
|Waterlogging°||Waterlogging type (from ‘low’ to ‘very high’)||n.i.|
|Water table||Depth of complete water saturation below ground (cm)||n.i.|
|Income||Inverse distance-weighted (IDW) average income per month in district 1991||n.i.|
|Jobs||IDW number of jobs 1991||4|
|Land Price||IDW land value 1990||7|
|NetDwellArea||IDW per capita net dwelling area 1990||8|
|Unemployment||IDW unemployed per population 1991||9|
|Cars||Number of cars in district; density function (10 km kernel) DF||n.i.|
|Migration25–50||Difference between in- and out-migration per settlement of the group aged 25 to 50||n.i.|
|PopDens||Population density 1984; DF||10|
The output of the imageSVM® classification is not only a classified land-cover image but also an image containing the probabilities of each pixel for ‘urban growth’ and ‘nonurban growth’ . The receiving operator characteristic (ROC) is an approved index for the accuracy assessment of binary categorical probability estimations . The ROC divides the probability outcomes into percentile groups from high to low probability and compares the individual probability groups with the cumulative real values. The ROC only considers the positive values estimated by the model; in our case all urban growth cells. To define the ROC, true positive and false positive rates are plotted for every percentile group. The result is a curve where the area under the curve (AUC) is the measure that represents the ROC statistic. If a model acts randomly, then the curve will be a line through the origin with a slope of 1 and an AUC of 0.5. If a model acts perfectly, then the AUC is 1. Figure 4 shows the ROC of the SVM model as well as the created probability map and the observed urban growth between 1984 and 2001. The curve of the SVM model clearly reaches a stable level already at low percentile groups. The resulting AUC values confirm the visual impression. The SVM model achieves a value of 0.94 – an outstanding performance for the AUC.
3.4. Calibration and validation of SLEUTH-SVM
Together with the SVM-based allocation probability map of urban growth, SLEUTH-SVM is calibrated using the produced urban land-cover maps 1984 and 2001 as the start and the testing year. For the validation of the model, the predicted urban growth is compared with the observed urban growth of 2005, starting the simulation in 1975. It is clearly distinguished between the data sets used for calibration, and the data set used for validation of the model. The SVM probability map based on various driving forces of urban growth is combined with the exclusion layer of SLEUTH. In total, the three different versions of SLEUTH are compiled:
SLEUTH: without exclusion layer.
SLEUTH-AR: exclusion layer (areas where urban growth is restricted such as water bodies, natural reserves, etc.).
SLEUTH-SVM: exclusion layer and probabilities for urban growth based on the SVM analysis.
The validation of complex system models such as CA not only includes careful error estimation but also addresses the uncertainty of the model results . Unwin defines uncertainty in the context of spatial simulations as a measure of doubt and distrust in results which can be seen as a certain kind of vagueness due to stochastic variability . Aerts et al.  demonstrated that a detailed examination of the outcomes after 100 MC iterations is a valid procedure for assessing the uncertainty of CA like SLEUTH. The resulting map shows how often a certain cell has been depicted by the model to be urbanized. In order to transform it into a binary land-use map, a probability of 33% as cut-off value is used which was elaborated via a standard histogram frequency method [78, 79]. With this value the best balance between location and quantification performance could be achieved.
Figure 5 contains the comparison map of the predicted and the observed urban growth for 2005. It is important to emphasize that SLEUTH is a purely growth-oriented model. The CA exclusively models the transformation from nonurban to urban cell states. There is no ability to model in a spatially explicit manner the reverse process of urban contraction due to demolition or removal of sealed surfaces. In the near future, however, regional planning will have to deal with this process of extensive urban demolition (or urban perforation). For the 1975–2005 study period, spatial urban growth occurred concurrently with a contracting population . There are only few areas where the simulation acts ‘false-positively’, meaning the model predicted urban growth where no growth has occurred . These are nearly always located in open-spaced inner city areas or clustered along recreational parks of the river Emscher between Bottrop and Essen. In contrast, there are more areas where the model simulates ‘false-negatively’, which is predicting persistence where urban areas have spread in reality. Admittedly, it is virtually impossible to allocate greenfield development with industrial estates or the emergence of new traffic areas with a spatial certainty of 100 m without any local planning knowledge. Indeed, the urban areas in the district of Ennepe-Ruhr should have been actually captured. It seems that the slope layer of SLEUTH suppressed urbanization in this hilly region during the simulation run. However, in total, 108,220 ha of built-up land were predicted in the Ruhr for 2005. This is a growth of nearly 14% since 1975. In comparison, the observed urban growth rate is 39%. The underestimation of the quantity of change is specific for spatially explicit land-use change models [59, 80, 81]. For this reason, the model is validated regarding its overall simulation performance (overall agreement) in comparison with randomness (Cohen’s Kappa), the quantity estimations of urban growth (
|New urban cells (ha)||110,668||108,220||110,445|
The achieved results are on a very good level. They show that the optimized CA SLEUTH-SVM outperforms the SLEUTH model without any exclusion information as well as the SLEUTH model exclusively containing the restriction areas. The lower quantification performance is due to the simulation algorithm of SLEUTH: if SLEUTH is run without a probability map or an exclusion layer, then nearly every cell has the same probability of urbanization. Utilizing probability maps increase the possibility that a particular cell is defined as not suitable for urbanization and the resulting growth rate decreases. In several urban land-use change studies, a low signal of urban growth in comparison to a very high signal of persistent nonurban cells was discussed [59, 81].
The MRV results show a consistently excellent level of overall accuracy. This can be attributed to the ability of all three models to almost perfectly predict the location and the quantity of nonurban growth cell states. SLEUTH-SVM is the only model exceeding the level of the null model already at the resolution level 1 of 100 m (Figure 6). The agreement curve of the null model forms a straight line, reflecting the low signal of urban growth in comparison to the very high signal of persistent nonurban growth cells. Again, the optimized version of SLEUTH shows a higher agreement between the predicted and the observed urban growth on the level of high resolutions up to 400 m than SLEUTH and SLEUTH-AR. As soon the resolution gets coarser, the impact of allocation errors decreases and the performance of SLEUTH-SVM stagnates.
The question of how the SVM-probability map influences the spatial extension of the Ruhr’s urban growth will be analyzed with the concept of urban DNA . Analogous to the biological DNA, it postulates fundamental elements that are common to each urban area and determine their future growth pattern . Gazulis and Clarke apply the concept to an abstract space representation mimicking the variable input of SLEUTH . Hence, this grid image reflects a kind of digital petri dish consisting of an urban area, a slope layer, transport information as well as exclusion areas (Figure 7). The urban input is just a single urban cell in the middle of the image whereas all other cells are defined as nonurban. The slope has a minimum value of 0% and increases concentric-radially to a maximum value which is equal to the maximum slope value to be found in the Ruhr (70%). The transport network is represented by a single road crossing the center of the image from north to south. In this study, the exclusion layer is exchanged by the SVM probability map of urban growth of the Ruhr. For estimating the spatial impact of the SVM optimization procedure, the particular probability map is allocated with a linear transition from high to low probabilities equivalent to their particular value range. In the south of the urban center, the probabilities decrease from 1 to the particular medium value. This medium level continues northwards from the urban center and decreases to zero. Thus, the maps are divided lengthwise from west to east.
The models SLEUTH and SLEUTH-SVM are run with the calibrated growth coefficients and 100 MC iterations. Hence, one can observe the allocation behavior of the CA under the SVM-defined conditions of the Ruhr’s urban areas. The border of high and medium probabilities of urban growth is distinct in the SVM-optimized version of SLEUTH. The CA is clearly guided whereas the regular version allocates the cells more randomly in the surroundings of already built-up areas. In the northern part of the synthetic region, the urban pattern is slim while in the southern part it broadens. All in all, the DNA of the Ruhr’s urban areas reveals a tendency to edge growth and a high influence of the road network. The SVM-based probability optimizes the allocation performance of the urban CA SLEUTH.
4. Conclusion and outlook
The aim of this study was to optimize the CA SLEUTH by using SVM and to assess its performance in comparison to its standard configuration. A SVM-based probability map including the impact of driving forces on the allocation of new urban areas was combined with the exclusion layer of SLEUTH. Hence, the model could be guided and its stochastic variability regarding the emergence of urban cells depressed. The application of SVM additionally delivered insights into the most important factors driving the local urbanization suitability. Thus, the CA provided a theoretical foundation. The importance of distance-related variables over socioeconomic or demographic variables in the SVM model was clear. Geophysical variables were not useful to select areas suitable for urban growth with SVM. The models have been applied to the polycentric agglomeration of the Ruhr. The region’s specific settlement structure is characterized by large urban areas solely divided through administrative borders. The scattered rural districts complete the image of a challenging study region in terms of organizational hierarchies, migration flows and heterogenic growth conditions. The accuracy of SLEUTH-SVM has been assessed regarding the overall agreement, the fuzziness, the ability to allocate new urban cells, the performance in comparison with the randomness as well as the quantity and the allocation ability of urban growth. The calibration and the validation of the model have been separated carefully. The validation indices for SLEUTH-SVM showed good values around 0.91 (kappa) and 0.93 (MRV). As a reliable result, it can be stated that the allocation performance of SLEUTH is optimized clearly when coupling it with a SVM-based probability map. Its spatial impacts are visualized with the concept of urban DNA and a digital petri dish. Hence, the generic growth elements of the Ruhr’s urban area were uncovered.
As a next step, coupling different modeling techniques for prediction and process analyses of urban land-use and land-cover change should be pushed beyond the physical restriction of pixels. To incorporate the irrational human component impacting on all spatial and temporal levels between the household and the global scale, pixels must be coupled with people. This can, for instance, be done by extending the enhanced version of SLEUTH with a multiagent system. That way, one can study emergence phenomena resulting from complex behavioral interaction processes on the micro scale. Hence, the simulation of urban change could be extended from analyzing growth processes to the estimation of urban decline. Coupling actors with factors would be a chance to overcome formal, differential equations. Therefore, the present ride on the surface of an ocean of driving forces, pressures, states, impacts and responses would be optimized into diving right into it.
Wassmer RW. An economist’s perspective on urban sprawl, Part 1—Defining excessive decentralization in California and other western states. Sacramento: California Senate Office of Research; 2002. 33 p.
EEA. Urban sprawl in Europe: The ignored challenge. Luxembourg: Office for Official Publications of the European Communities; 2006. 60 p.
Kasanko M, Barredo JI, Lavalle C, McCormic kN, Demicheli L, Sagris V. Are European cities becoming dispersed? A comparative analysis of 15 European urban areas. Landsc Urban Plan. 2006; 77(1–2):111–130. DOI:10.1016/j.landurbplan.2005.02.003.
Hoymann J, Dosch F, Beckmann G. Trends der Siedlungsflächenentwicklung. Status quo und Projektion 2030. Bonn: BBSR-Analysen KOMPAKT; 2012. 20 p.
Siedentop S, Fina S. Monitoring urban sprawl in Germany: Towards a GIS-based measurement and assessment approach. J Land Use Sci. 2010; 5(2):73–104. DOI: 10.1080/1747423X.2010.481075.
GLP. Science plan and implementation strategy. Stockholm: IGBP Secretaria; 2005. 74 p.
IPCC. Renewable energy sources and climate change mitigation: Special report of the Intergovernmental Panel on Climate Change. Cambridge: Cambridge University Press; 2012. 1088 p.
Millenium Ecosystem Assessment. Ecosystems and human well-being: Synthesis. Washington: Island Press; 2005. 155 p.
Ramankutty N, Graumlich L, Achard F, Alves D, Chhabra A, DeFries RS. Global land-cover change: Recent progress, remaining challenges. In: Lambin EF, Geist H, editors. Land-Use and Land-Cover Change. Springer Berlin Heidelberg; 2006, p. 9–39.
UNEP. Global environment outlook: GEO4: Environment for development. Nairobi: United Nations Environment Programme; 2012, 572 p.
Batty M. Fifty years of urban modeling: macro-statics to micro-dynamics. In: Albeverio S, Andrey D, Giordano P, Vancheri A, editors. The dynamics of complex urban systems. Springer, London; 2008, p. 1–20.
Benenson I, Kharbash V, Xie Y, Brown, DG. Geographic automata: from paradigm to software and back to paradigm. Proc 8th Int Conf GeoComputation Univ Mich U S Am July–3 August 2005. University of Michigan, 2005, p. 1–12.
Silva E, Wu N. Surveying models in urban land studies. J Plan Lit 2012; 27:1–14. DOI: 10.1177/0885412211430477.
Steven DP, Hoffman M, Parker DC, Manson SM, Manson SM, Janssen MA. Multi-agent systems for the simulation of land-use and land-cover change: A review. Ann Assoc Am Geogr 2002; 93:314–337. DOI: 10.1111/1467-8306.9302004.
Schwarz N, Haase D, Seppelt R. Omnipresent sprawl? A review of urban simulation models with respect to urban shrinkage. Environ Plan B Plan Des. 2010; 37(2):265–283. DOI: 10.1068/b35087.
Matthews RB, Gilbert NG, Roach A, Polhill JG, Gotts NM. Agent-based land-use models: A review of applications. Landsc Ecol. 2007; 22(10):1447–1459. DOI: 10.1007/s10980-007-9135-1.
Couclelis H. Computational human geography. In: Kitchin R, Thrift N, editors. International Encyclopedia of Human Geography. Oxford: Elsevier; 2009. p. 245–250.
Clarke KC. Mapping and modelling land use change: an application of the SLEUTH model. In: Pettit C, Cartwright W, Bishop I, Lowell K, Pullar D, Duncan D, editors. Landscape analysis and visualisation. Springer Berlin Heidelberg; 2008. p. 353–366.
Chaudhuri G, Clarke KC. The SLEUTH land use change model: A review. Int J Environ Resour Res. 2013; 1(1):88–104.
Cortes C, Vapnik V. Support-vector networks. Mach Learn. 1995; 20(3):273–297.
Goetzke R, Over M, Braun M. A method to map land-use change and urban growth in North Rhine-Westphalia (Germany). Proc 2nd Workshop EARSeL SIG Land Use Land Cover, ZFL Bonn, 2006, p. 102–10.
Blotevogel HH. Gemeindetypisierung Nordrhein-Westfalens nach demographischen Merkmalen. In: Danielzyk R, Kilper H, editors. Räumliche Konsequenzen des Demographischen Wandels Teil 8: Demographischer Wandel in ausgewählten Regionaltypen Nordrhein-Westfalens – Herausforderungen und Chancen für die regionale Politik. Hannover: Akademie für Raumforschung und Landesplanung – ARL; 2006. p. 17–33.
Couch C, Karecha J, Nuissl H, Rink D. Decline and sprawl: An evolving type of urban development – Observed in Liverpool and Leipzig. Eur Plan Stud. 2005; 13(1):117–136. DOI: 10.1080/0965431042000312433.
Antrop M. Landscape change and the urbanization process in Europe. Landsc Urban Plan. 15. 2004; 67(1–4):9–26. DOI: 10.1016/S0169-2046(03)00026-4.
Ulmer F, Renn O, Ruther-Mehlis A, Jany A, Lilienthal M, Malburg-Graf B. Erfolgsfaktoren zur Reduzierung des Flächenverbrauchs in Deutschland – Evaluation der Ratsempfehlungen “Mehr Wert für die Fläche: das Ziel 30ha“. Eine Studie im Auftrag des Rates für Nachhaltige Entwicklung. Stuttgart: Rat für Nachhaltige Entwicklung; 2007. 66 p.
Hostert P. Advances in urban remote sensing: Examples from Berlin (Germany). In: Netzband DM, Stefanov DWL, Redman PC, editors. Applied remote sensing for urban planning, governance and sustainability. Berlin: Springer; 2007. p. 37–51.
Blum WEH. Flächenverbrauch und Auswirkungen auf die Ökologische Bodennutzung. In: Umweltbundesamt, editors. Versiegelt Österreich? Der Flächenverbrauch und seine Eignung als Indikator für Umweltbeeinträchtigungen, Wien, 15. März 2001, p. 74–79.
Arnold CL, Gibbons CJ. Impervious surface coverage: The emergence of a key environmental indicator. J Am Plann Assoc. 1996; 62(2):243–258.
Mielke B, Münter A. Demographischer Wandel und Flächeninanspruchnahme. In: Danielzyk R, Meyer C, Grüber-Töpfer W, editors. Demographischer Wandel in Nordrhein-Westfalen. 2nd ed. Dortmund: Institut für Landes- und Stadtentwicklungsforschung und Bauwesen des Landes Nordrhein-Westfalen (ILS NRW); 2008. p. 58–64.
Sieverts T. Zwischenstadt: zwischen Ort und Welt, Raum und Zeit, Stadt und Land. 3rd ed. Berlin: Bertelsmann Fachzeitschriften; 2001, 191 p.
Kistenmacher H. Ursachen und räumliche Wirkungen der Suburbanisierung. In: ARL Arbeitsmaterial 276. 2001. p. 17–30.
Geist H, McConnell W, Lambin EF, Moran E, Alves D, Rudel T. Causes and trajectories of land-use/cover change. In: Lambin EF, Geist H, editors. Land-use and land-cover change. Berlin: Springer; 2006. p. 41–70.
Klijn JA. Driving forces behind landscape transformation in Europe, from a conceptual approach to policy options. In: Jongman RHG, editor. The new dimensions of the European landscape. Dordrecht; 2004. p. 201–218.
Siedentop S. Urban sprawl – verstehen, messen, steuern. Ansatzpunkte für ein empirisches Mess- und Evaluationskonzept der urbanen Siedlungsentwicklung. DISP. 2006; 160:23–35.
Verburg PH, Eck JRR van, Nijs TCM de, Dijst MJ, Schot P. Determinants of land-use change patterns in the Netherlands. Environ Plan B Plan Des. 2004; 31(1):125–150. DOI: 10.1068/b307.
Veldkamp A, Lambin EF. Predicting land-use change. Agric Ecosyst Environ. 2001; 85(1):1–6. DOI: 10.1016/S0167-8809(01)00199-2.
Verburg PH. Simulating feedbacks in land use and land cover change models. Landsc Ecol. 2006; 21(8):1171–1183. DOI: 10.1007/s10980-006-0029-4.
Alcamo J, Kok K, Busch G, Priess JA, Eickhout B, Rounsevell M. Searching for the future of land: Scenarios from the local to global scale. In: Lambin EF, Geist HJ, editors. Land-use and land-cover change: Local processes and global impacts. Berlin: Springer; 2006. p. 137–157.
Lambin EF, Geist HJ. Introduction: Local processes with global impacts. In: Lambin EF, Geist HJ, editors. Land-use and land-cover Change: local processes and global impacts. Berlin: Springer; 2006. p. 1–8.
Verburg PH. Modeling land-use and land-cover change. In: Lambin EF, Geist HJ, editors. Land-use and land-cover change: Local processes and global impacts. Berlin: Springer; 2006. p. 117–136.
Lesschen JP, Verburg PH, Staal SJ. Statistical methods for analysing the spatial dimension of changes in land use and farming systems. Wageningen: LUCC Report Series 7 (IGBP); 2005. 81 p.
Overmars KP, de Koning GHJ, Veldkamp A. Spatial autocorrelation in multi-scale land use models. Ecol Model. 2003; 164(2–3):257–270. DOI: 10.1016/S0304-3800(03)00070-X.
Mandl P. Geo-Simulation-Experimentieren und Problemlösen mit GIS-Modellen. In: Strobl J, Blaschke T, Griesebner G, editors. Beiträge zum AGIT-Symposium Salzburg. Heidelberg: Wichmann; 2000. p. 345–356.
Miller EJ, Douglas Hunt J, Abraham JE, Salvini PA. Microsimulating urban systems. Comput Environ Urban Syst. 2004; 28(1–2):9–44. DOI: 10.1016/S0198-9715(02)00044-3.
Benenson I, Torrens PM. Geosimulation: Automata-based modeling of urban phenomena. West Sussex: John Wiley & Sons; 2004. 287 p.
Bethell T. The search for artificial intelligence. Am Spect. 2006; 39(6):26–35.
Rucker R. Seek! Selected Nonfiction. New York: Four Walls Eight Windows; 1999. 356 p.
Ulam S. Random processes and transformations. In: Proceedings of the International Congress on Mathematics. Cambridge: Cambridge University Press; 1952. p. 264–275.
von Neumann J. The general and logical theory of automata, Cerebral Mechanisms in Behavior. The Hixon Symposium, John Wiley & Sons, Inc., New York, NY; Chapman & Hall, Ltd., London, 1951, p. 1–31.
Barredo JI, Lavalle C, Kasanko M, Demicheli L, McCormick N. Sustainable urban and regional planning: The MOLAND activities on urban scenario modelling and forecast. Luxemburg: EC; 2003. 54 p.
Landis J. CUF, CUF II, and CURBA: A family of spatially explicit urban growth and land-use policy simulation models. In: Brail R, Klostermann R, editors. Planning support systems: integrating geographic information systems, models, and visualization tools. Redlands: Esri Press; 2001. p. 159–200.
Batty M, Xie Y. Possible urban automata. Environ Plan B Plan Des. 1997; 24(2):175–192.
Clarke KC, Hoppen S, Gaydos L. A self-modifying cellular automaton model of historical urbanization in the San Francisco Bay area. Environ Plan B Plan Des 1997; 24:247–262.
Hilferink M, Rietveld P. Land use scanner: An integrated GIS based model for long term projections of land use in urban and rural areas. J Geogr Syst. 1999; 1(2):155–177.
Tobler W. Cellular geography. In: Gale S, Ollson G, editors. Philosophy in geography. Dordrecht: D. Reidel Publishing Company; 1975. p. 379–386.
White R, Engelen G. Cellular automata and fractal urban form: A cellular modelling approach to the evolution of urban land-use patterns. Environ Plan A. 1993; 25(8):1175–1199.
Goetzke R. Entwicklung eines fernerkundungsgestützten Modellverbundes zur Simulation des urban-ruralen Landnutzungswandels in Nordrhein-Westfalen. Hamburg: Disserta Verlag; 2012. 296 p.
Schmitz M, Bode T, Thamm H-P, Cremers AB. XULU – A generic JAVA-based platform to simulate land use and land cover change (LUCC). MODSIM 2007 Int Congr Model Simul Model Simul Soc Aust N Z Dec 2007. 2007; p. 4–7.
Pontius RG, Boersma W, Castella J-C, Clarke K, Nijs T, Dietzel C. Comparing the input, output, and validation maps for several models of land change. Ann Reg Sci. 2008; 42(1):11–37. DOI: 10.1007/s00168-007-0138-2.
Costanza R, Maxwell T. Spatial ecosystem modelling using parallel processors. Ecol Model. 1991; 58(1–4):159–183.
Schoettker B. Monitoring statewide urban development using multitemporal multisensoral satellite data covering a 40-year time span in north Rhine-Westphalia (Germany). Proc SPIE 10th Int Symp Remote Sens 8-12th Sept 2003 Barc. 2003; p. 252–61.
Mountrakis G, Im J, Ogole C. Support vector machines in remote sensing: A review. ISPRS J Photogramm Remote Sens. 2011; 66(3):247–259. DOI: 10.1016/j.isprsjprs.2010.11.001.
Vapnik V. Statistical learning theory. New York: John Wiley & Sons; 1998. 736 p.
Xie C. Support vector machines for land use change modeling. Calgary: UCGE Reports; 2006. 128 p.
Burges CJC. A tutorial on support vector machines for pattern recognition. Data Min Knowl Discov 1998; 2:121–167.
Huang B, Xie C, Tay R. Support vector machines for urban growth modeling. GeoInformatica. 2010; 14(1):83–99. DOI: 10.1007/s10707-009-0077-4.
Platt JC. Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. Adv Large Margin Classif. 1999;61–74.
Okwuashi O, McConchie J, Nwilo P, Eyo E. Stochastic GIS cellular automata for land use change simulation: Application of a kernel based model. Proc 10th Int Conf GeoComputation Univ New South Wales Syd Aust 30 Nov – 02 Dec 2009. 2009; p. 1–7.
Langford M, Unwin DJ. Generating and mapping population density surfaces within a geographical information system. Cartogr J. 1994; 31(1):21–26.
Congalton RG. A review of assessing the accuracy of classifications of remotely sensed data. Remote Sens Environ 1991; 37:35–46.
Waske B, van der Linden S, Benediktsson JA, Rabe A, Hostert P. Sensitivity of support vector machines to random feature selection in classification of hyperspectral data. IEEE Trans Geosci Remote Sens. 2010; 48(7):2880–2889.
Hsu C-W, Chang C-C, Lin C-J. A practical guide to support vector classification. Taipei: Department of Computer Science, National Taiwan University; 2010. 16 p.
Hughes GF. On the mean accuracy of statistical pattern recognizers. IEEE Trans Information Theory. 1968; 14(1):55–63.
Nguyen MH, de la Torre F. Optimal feature selection for support vector machines. Pattern Recognit. 2010; 43(3):584–591. DOI: 10.1016/j.patcog.2009.09.003.
Koomen E, Borsboom-van Beurden J. Land-use modelling in planning practice. Dordrecht: Springer; 2011. 214 p.
Pontius RG, Schneider LC. Land-cover change model validation by an ROC method for the Ipswich watershed, Massachusetts, USA Agric Ecosyst Environ 2001; 85(1–3):239–248. DOI: 10.1016/S0167-8809(01)00187-6.
Aerts JCJH, Clarke KC, Keuper AD. Testing popular visualization techniques for representing model uncertainty. Cartogr Geogr Inf Sci. 2003; 30(3):249–261. DOI: 10.1559/152304003100011180.
Rafiee R, Mahiny AS, Khorasani N, Darvishsefat AA, Danekar A. Simulating urban growth in Mashad City, Iran through the SLEUTH model (UGM). Cities. 2009; 26(1):19–26. DOI: 10.1016/j.cities.2008.11.005.
Wu X, Hu Y, He HS, Bu R, Onsted J, Xi F. Performance evaluation of the SLEUTH model in the Shenyang metropolitan area of northeastern China. Environ Model Assess. 2008; 14(2):221–230. DOI: 10.1007/s10666-008-9154-6.
Pontius RG, Malizia NR. Effect of category aggregation on map comparison. In: Egenhofer MJ, Freska C, Miller HJ, editors. GIScience 2004. Springer, Berlin 2004, p. 251–268.
Pontius RG. Quantification error versus location error in comparison of categorical maps. Photogramm Eng Remote Sens. 2000;66(8):1011–1016.
Silva EA. The DNA of our regions: Artificial intelligence in regional planning. Futures. 2004; 36(10):1077–1094. DOI: 10.1016/j.futures.2004.03.014.
Silva EA, Clarke KC. Complexity, emergence and cellular urban models: Lessons learned from applying SLEUTH—To two Portuguese metropolitan areas. Eur Plan Stud. 2005; 13(1):93–115.
Gazulis N, Clarke KC. Exploring the DNA of our regions: Classification of outputs from the SLEUTH model. In: Yacoubi SE, Chopard B, Bandini S, editors. Cellular Automata. Berlin: Springer; 2006.:462–471.