Open access peer-reviewed chapter

An Open-Data-Based Methodology for the Creation of a Graph of Critical Infrastructure Dependencies at an Urban Scale

Written By

Antonio Di Pietro, Alessandro Calabrese, Antonio De Nicola, Daniele Ferneti, Luisa Franchina, Josè Martì and Tommaso Ruocco

Reviewed: 29 August 2023 Published: 16 November 2023

DOI: 10.5772/intechopen.113045

From the Edited Volume

Critical Infrastructure - Modern Approach and New Developments

Edited by Antonio Di Pietro and Josè Martì

Chapter metrics overview

52 Chapter Downloads

View Full Metrics

Abstract

This paper presents the MARIS (Modeling infrAstructuRe dependencIes at an urban Scale) methodology, allowing the generalization of one of the possible graphs modeling Critical Infrastructure (CI, hereafter) interdependencies at an urban scale starting from uncertain data. This leverages a set of known interdependencies at the system level, topological open data of local services and Points of Interest collected at an urban scale, and some heuristics. Indeed, interdependencies at an urban scale are usually not known to decision makers (e.g., CI operators, emergency planners) due to, for example, a lack of integration of knowledge held by different critical infrastructure operators and privacy restrictions. Here, these interdependencies are determined through geographic-based strategies. The resulting graph can be a valuable input to simulate emergency scenarios of CIs in the area of interest and, thus, plan proper countermeasures.

Keywords

  • interdependencies
  • open-data
  • critical infrastructures
  • graph
  • GIS

1. Introduction

Nowadays, Critical Infrastructures (CI) (roads and railways, electrical and telecommunication networks, gas and water pipelines, etc.), supplying primary services to citizens, are mutually connected as they provide their services not only to final users but also to other infrastructures. In this aspect, infrastructures are said to be dependent on each other and” interdependent.” The concept of dependency stands on the fact that an infrastructure (e.g., the electrical network) provides service to another (e.g., the telecommunication network), and, thus, the latter is said to be dependent on the former. CI interdependencies can involve different abstraction layers. For instance, a general statement about the dependency between a railway and a power station involves a different layer related to a railway and a power station located in a given city. For the sake of simplicity, in the following, we refer to the former as system dependency and to the latter as urban dependency.

Whereas repositories of system dependencies already exist1, these are not sufficient to build urban dependencies due to several factors including lack of integration of knowledge held by different critical infrastructure operators and privacy restrictions. Furthermore, these data are constantly changing and difficult to collect because different stakeholders keep them. In order to build simulation models, the unavailability of information on real interdependencies is generally overcome using literature data or through survey data analysis addressed to critical infrastructure experts [1]. However, interdependency data would allow more reliable simulation for risk assessment and crisis management in case of natural events, such as earthquakes, or other events, such as cyber-attacks.

In this context, this paper aims to define a methodology to generate one of the possible graphs modeling CI interdependencies at an urban scale starting from uncertain data. The proposed methodology, named MARIS (Modeling infrAstructuRe dependencIes at an urban Scale), allows the discover of possible hidden dependencies using Geographical Information System (GIS) Open data of CIs and Points of Interest (e.g., the location of substations of an electric distribution network serving an urban area) and to apply proximity criteria to model dependencies when these are not known.

In addition, the paper presents the results of a survey on CIs conducted by several experts from Italy, aiming at quantifying the dependencies between critical infrastructures. This can be a valuable input for applying the MARIS methodology to produce realistic graphs of interdependencies in an urban context.

The rest of the paper is organized as it follows. Section 2 gives an overview of the present work on CI interdependencies. Section 3 presents the MARIS methodology. Section 4 presents a survey analysis of experts on the dependencies between Critical Infrastructures. Section 5 describes a case study related to Rome in Italy. Finally, Section 6 concludes the paper.

Advertisement

2. Related work

Modeling of CI interdependencies has been considered as a relevant problem since the seminal paper of [2].

In Chiara et al. [3], the authors proposed the Mixed Holistic Reductionist (MHR) methodology based on the interaction of three layers: (i) a holistic layer where CIs are seen as singular entities with defined boundaries and functional properties; (ii) a reductionist layer modeling the behavior of individual CI components; and (iii) a service layer that describes the functional relationships between components and the infrastructure at different levels of granularity.

An ontology design pattern to model CI interdependencies was proposed by Ref. [4] as part of the TERMINUS ontology. Crisis management [5] and risk assessment [67] are two possible applications of ontologies modeling critical infrastructures [8].

In Rosato et al. [9], the authors focus on modeling cyber dependencies of a set of infrastructures in an urban context. In particular, a dependency matrix [10] was used to reveal the potential vulnerability of a given node to the unavailability, corruption, or disclosure of data from an interdependent node regardless of the current state of the shared data infrastructure.

The work of Michel et al. [11] allowed us to define a dependency matrix generated based on the analysis of CI disruptions gathered from public media in the Netherlands from 2004 to 2010. In particular, a specific set of news sources containing impact keywords were acquired by Google News and further organized to save information including the affected CI sector and service, the initiating event (if any), and its dependency on another affected CI service.

In Franchina et al. [12], the authors present a methodology able to classify critical infrastructures starting from citizenship basic needs and foresee possible cascading effects.

In the MARIS methodology, we address all types of dependencies (logical, cyber, and physical) that are modeled by a specific system dependency layer and acquire open data of CIs and points of interest in order to create an urban dependency layer that can be valuable for further CI dependency analysis.

Advertisement

3. The MARIS methodology

3.1 Overview

Dependencies and interdependencies increase the vulnerability of infrastructures as they allow failures and perturbations to propagate from one system to another with the consequence that an infrastructure, which is not directly affected by some event that undermines its functioning, can be perturbed by the lack of functionality coming from another infrastructure which is turn hit by the event. Whereas from a conceptual side, the phenomenon is clear, on the practical side, dependencies and interdependencies are responsible for producing complex phenomena as, for instance, propagation time-scales can be quite different depending on the first perturbed infrastructure and on the infrastructure where perturbation will flow. For example, whereas perturbation on an electrical line may affect a very short time scale (seconds or less), an electrical perturbation to the traffic system or the water distribution network may take hours to produce sizeable effects. In that, if one would consider a” dependency matrix” where row and columns represent infrastructures and the i-j element their interaction, such a matrix is highly nonsymmetric (the ij element may be largely different from the ji one) as some infrastructure might perturb (severely and rapidly) other infrastructures which, in turn, have a very poor effect on the former when, in turn, perturbed. In some cases (i.e. when perturbation originates from the electrical system), the” coupling” strength is often very strong (i.e. many systems depend primarily on the electrical power), and the resulting reduction (or loss) of function established in a very short time scale. The presence of such strong coupling between infrastructures leads the system to be a” unique” system that cannot be treated and approximated as a system of independent (or nearly independent) infrastructure but as a” system of systems” that cannot be linearized A further element which leads the interdependency problem even more complex to be treated originates from the possibility of closed loops involving more than a couple of infrastructures. Perturbation from a first infrastructure might flow on several infrastructures before returning to the first one, providing negative feedback.

All these issues lead to an operational approach to interdependency that is extremely complex due primarily to an incomplete description of system’s dependencies, time scales, and latencies. All dependency and interdependency data can be achieved through direct and indirect methods [12], but a complete theory of interdependency is still lacking.

Despite all that, it is possible to empirically deal with the problem of describing a number of dependent and interdependent systems with the aim of providing some type of decision support systems (DSS) enable to support decision makers (e.g., CI and Civil Protection managers) in the risk management process.

The MARIS methodology makes it possible to transform the layer of interdependencies known at the system level into a layer of interdependencies at the level of the physical components that lie in an urban context, as shown in Figure 1.

Figure 1.

Georeferenced dependency upscaling.

In the following, first, we give some definitions of the main concepts involved in the MARIS methodology, and then, we give an overview of its main steps.

3.2 Definitions

The MARIS methodology is based on seven fundamental concepts pertaining to the two above-mentioned layers. The ones dealing with the system dependency layer are the following:

  • System: It denotes a Critical infrastructure from an overarching perspective [13]. Accordingly, examples are transportation, energy, water, waste, telecommunication, education, and health.

  • Subsystem: It represents a further refinement of a system; for instance, the Water system includes several subsystems such as drinking water, wastewater/sewage, and stemming of surface water.

  • Subsystem Dependencies: These can be physical, logical, and cyber. For example, a hospital depends on an electric distribution network.

Those pertaining to the urban infrastructure layer are:

  • Entity Type: It represents a physical component of a subsystem that is responsible for the provision of a service (e.g., the generic substation component of a distribution power grid).

  • Item: It represents the resource, good, data, or functionality provided to a customer (or a citizen) that is produced and/or consumed by an entity.

  • Entity: It represents the specific instance of an Entity Type (e.g., the set of substations of a distribution power grid in the area of interest).

  • Urban dependencies: These can be physical, logical, and cyber. Unlike SD, UD involves Entity only. For example, a specific substation of a distribution power grid may depend on a nearby base transceiver station (BTS) that provides communication functionality required by the power grid’s supervisory control and data acquisition (SCADA) system.

3.3 Main steps

Figure 2 describes the main steps of the MARIS methodology. First, dependencies at system scale are collected through a survey eliciting knowledge on the dependencies between critical infrastructures. As mentioned, these dependencies concern subsystems. Information on entities are retrieved from a GIS (Geographical Information System) such as OpenStreetMap (OSM). Then, similar to the approach presented in Ref. [14], subsystems are matched with entity types. The resulting pairs are included in an annotation table. Finally, all the above information, i.e., subsystems dependencies, entities, and the annotation table, are used to infer urban dependencies and create an interdependency graph.

Figure 2.

Main steps of the MARIS methodology.

Advertisement

4. Survey analysis on the dependencies between critical infrastructures

In this section, we present the results of a survey on Cis conducted with experts from Italy coming from various economic and social sectors.

This activity, in addition to providing a set of real interdependencies useful for the application of the MARIS methodology, was used as input to the DOMINO simulation model [12], developed by Tesseract Srl company, aiming at studying and quantifying the consequences of negative, unexpected, and disruptive events to the Supply and Value Chain systems of a country. The purpose of this model is to reproduce the impacts that would occur on such systems if relevant functions of the Chains were disrupted by natural or malicious events. This DOMINO approach leverages the concept of an item (as defined in Section 3.1) based on the NACE2 and ATECO3 classifications. The survey was conducted by involving industry experts to detail the direct consequences produced by their own organizations due to the loss (or degradation) of a specific item. In particular, for each of the 117 items considered (Table A1 in the Appendix section), a set of the impacted items was collected. In addition, data regarding the duration of the propagation (in terms of hours, days, months, and years) were also collected in order to perform impact analysis through the DOMINO model.

The overall model will thus consider all the interdependencies between the various sectors (including Cis) that make up the Country Supply Chain System, based on the data reported by the various experts.

In the following, two examples of dependency trees are discussed. Figures 3 and 4 show the direct dependencies of the Drinking water and Bank items respectively. At the top of each tree a timeline is displayed to show the “falling time” of each ITEM, starting from the “time zero,” corresponding to the root node.

Figure 3.

DOMINO model from tesseract Srl: Drinking water dependency tree.

Figure 4.

DOMINO model from tesseract Srl: Bank dependency tree (items with impact higher than 30 days have been removed for the sake of space).

In the first dependency tree, it can be noted that, following the disruption of the Drinking Water item, two items, i.e., Education and Research, are impacted in the range of only 4 hours. On day 1, 45 items are impacted. Most of the dependencies represented are directly linked to the root node, but few of them are activated due to second (or higher) order effects (i.e., Maintenance services, Agriculture, and products).

In the second dependency tree, a few items are impacted in the order of a few minutes. However, most of the items are impacted after 3 days from the initiating failure.

Advertisement

5. Case study

The case study concerns an area of the city of Rome characterized by a high concentration of various business activities and power centers, as well as the presence of CIs, which are essential to maintaining vital societal functions. As shown in Table 1, the mentioned area includes 𝑁 = 10 systems and 11 subsystems.

SystemSubsystemEntity typeEntity occurrences
EducationUniversityUniv. building27
SchoolSchool building181
CollegeCollege building11
EnergyPowerSubstation facility1000
Financial servicesBankBank building413
InsuranceInsurance building40
FoodRestaurantRestaurant building2492
Fast foodFast food building701
GovernmentEmbassyEmbassy building184
ParliamentParliament2
HealthClinicClinic building40
PharmacyPharmacy building506
HospitalHospital facility31
DentistDentist building48
InternetInternet servicesStreet cabinet34
TelecommunicationTelecomBTS building122
TransportationRailwayRailway Station188
AirportsAirport facility4
WaterWater SupplyWater tower2
Total:6025

Table 1.

Dataset of infrastructure types in this study, categorized under 11 subsystems and 10 systems in the area of interest.

5.1 System dependency layer

In this step, we imported the subset of subsystem dependencies that were collected by means of the survey described in Section 4. It is worth noting that these dependencies expand those more generic related to the system level, which are addressed, for instance, in Rinaldi et al. [2]. Figure 5 shows an example of how system dependencies are extended to subsystem dependencies.

Figure 5.

(a) Example of system dependency layer; (b) example of subsystem dependency layer obtained from open data in the area of interest; (c) a fragment of the entity layer.

5.2 Territorial open-data acquisition

The development of the local dataset in the case study is based on the integration of open data collected and provided by OSM. The aim of this platform is to provide free geospatial information of world features. OSM uses open tagging mechanism to add meaning to geographic objects, so that any OSM user can add a new tag to them. For instance, some keys can be used to classify OSM entities into classes (e.g., highway, building, and amenity) while other keys play the role of attributes (e.g., name and maxspeed). OSM also allows to set attribute values, while values for classes are used to classify class members into categories (e.g., residential, hotel, monument).

In order to facilitate the understanding of OSM data and reduce the number of incorrect entity-classes associations [15], we used the Taginfo platform4 to gather tag statistics (e.g., providing the tags are actually in the database, the number of users choosing those tags) and to semantically enrich the entities. For example, when analyzing the railway subsystem, by setting the tag railway = station in Taginfo, we were able to select the railway station entities located in the area of interest and discard those combinations (e.g., railway = crossing, railway = radio) that were associated with different concepts.

Then, we used Overpass turbo5, a web-based data mining tool for OSM, to acquire the OSM data of interest representing entity type occurrences of Table A1 (e.g., university buildings and water towers) according to a geoJSON data format.

Therefore, the GIS open data acquired for CIs and Points of Interest were classified according to the specific subsystem.

5.3 Urban infrastructure layer

Regarding the dependencies to be associated to the entity layer, we considered those at the subsystem level and scaled them to the entity level. In other words, given that, according to the system dependency layer, the health system depends on the energy system, when scaling to the entity layer, we adopted the criterion that the specific hospital X depends on the energy supplied by the electrical substation Y (in particular, the one closest to the hospital).

Figure 5 shows the result of the application of the MARIS workflow, i.e., a fragment of the dependency at an urban level graph. It can be noticed that a geographical proximity criterion was applied to set up dependencies between CI components.

Advertisement

6. Conclusion

The presented MARIS methodology allows the generation of a graph at urban scale that can capture the known dependencies of CIs and Points of interest and model those dependencies that are unknown (because of partial, uncertain, or sensitive data) through the application of proximity criteria.

The use of open software and data (Taginfo, Overpass turbo, OSM) and the knowledge of CI interdependencies data at the system level were used to estimate the interdependencies at an urban level and to produce a realistic graph of interdependencies.

The methodology was applied to the city of Rome to create an interdependency graph characterized by 6025 entities representing real CIs and Points of interest. Future developments will concern the application of dynamic simulation models to the interdependency graph, obtained through the MARIS methodology, in order to reproduce the impacts that would occur on such systems when relevant functions of the supply chain of an area of interest were disrupted by natural or anthropic events.

Advertisement

Appendix A

List of items considered in the DOMINO model

SystemItem
WaterIrrigation Water
WaterWater For Industrial Use
WaterWater For Industrial Use
Agriculture, Livestock, Fisheries, ForestsAgriculture And Products
Agriculture, Livestock, Fisheries, ForestsFarming And Products
Agriculture, Livestock, Fisheries, ForestsForests
Agriculture, Livestock, Fisheries, ForestsLumber
Agriculture, Livestock, Fisheries, ForestsFisheries And Products
EnvironmentWaste Water
EnvironmentDams
EnvironmentNonhazardous Waste
EnvironmentHazardous Waste
EnvironmentMeteorology and Climate Services
Food ChainLong-Life Foods
Food ChainPerishable Foods
Food ChainFrozen Foods
Food ChainDrinks and Bottled Water
TradeRetail Trade
TradeWholesale
Culture, Icons, VenuesArts and Sports
Culture, Icons, VenuesAssociationism
Culture, Icons, VenuesEducation
Culture, Icons, VenuesReligion
Culture, Icons, VenuesResearch
EnergyCoal
EnergyFuels (Oil, Diesel, Biodiesel, etc.)
EnergyElectricity distribution
EnergyRenewable Sources Biomass
EnergyWind Renewable Sources
EnergyGeothermal Renewable Sources
EnergyRenewable Water Sources
EnergySolar Renewable Sources
EnergyLiquid Natural Gas
EnergyGreggio
EnergyMethane
EnergyNuclear
EnergyElectric Transport
EnergyElectricity Production
FinanceInsurance, Reinsurance, and Pension Funds
FinanceCash
FinanceStock Market
FinanceLoans And Mortgages
FinanceBanks
FinanceServices and Payment Systems (Excluding Cash)
IndustryClothes and Footwear
IndustryOther
IndustryPaper Production
IndustryChemical Industry
IndustryConstruction
IndustryElectrical Devices Production
IndustryElectronics
IndustryRubber and Plastics Production
IndustryMetal Machinery
IndustryMetallurgy
IndustryNonmetal Mining
IndustryMetal Mining
IndustryFurniture and Supply Chain
IndustryLeather Goods
IndustryCeramic Production
IndustryWood Goods
IndustryMetal Goods Production
IndustryTextile Industry
IndustryGlass and the Supply Chain
Institutions and Public AdministrationCentral Public Administrations
Institutions and Public AdministrationLocal Public Administrations
Institutions and Public AdministrationCivil Defense and Firefighters
Institutions and Public AdministrationPolice and Law Enforcement
Institutions and public administrationLocal political institutions
Institutions and Public AdministrationNational Political Institutions
Institutions and Public AdministrationCivil Protection
Institutions and Public AdministrationGovernment Financial Services
Institutions and Public AdministrationPrison System
Institutions and Public AdministrationDefense
ManpowerPhysical Manpower
ManpowerVirtual Manpower
HealthcareSocial Assistance
HealthcareProduction of Medicines and Medical Devices
HealthcareHealth Emergency Services
HealthcarePrivate Medical Services
HealthcarePublic Medical Services
HealthcareVeterinary Services
HealthcareSale of Medicines and Medical Devices
ServicesOther (Opticians and Photographers)
ServicesLegal Estate Services
ServicesLegal Advisory Services
ServicesInformatics Industry
ServicesTemporary Employment
ServicesMarketing and Advertising
ServicesRestaurants
ServicesAccommodation Services
ServicesMaintenance Services
ServicesCleaning Services
ServicesPrivate Security Services
ServicesSoftware Industry
ICTDigital Terrestrial Connection System
ICTData Processing, Hosting
ICTProvision of Internet Services (Isp)
ICTWeb Portals
ICTTelevision Production
ICTPublishing (Books, Periodicals, and Newspapers)
ICTRadio Broadcasting
ICTRadio and Communication Services
ICTSatellite Services
ICTFixed Telecommunications
ICTFiber Optic Telecommunications
ICTMobile Telecommunications
Transport and LogisticsAir Transport of Goods and Logistics
Transport and LogisticsPassenger Air Transport
Transport and logisticsLogistics by sea and ocean
Transport And LogisticsLogistics By Road
Transport And LogisticsTransport On Inland Waterways
Transport And LogisticsRail Freight And Logistics Transport
Transport And LogisticsPassenger Rail Transport

Table A1.

List of items.

References

  1. 1. Laugé A, Hernantes J, Sarriegi JM. Critical infrastructure dependencies: A holistic, dynamic and quantitative approach. International Journal of Critical Infrastructure Protection. 2015;8:16-23. DOI: 10.1016/j.ijcip.2014.12.004. Available from: https://www.sciencedirect.com/science/article/pii/S187454821400081X
  2. 2. Rinaldi SM, Peerenboom JP, Kelly TK. Identifying, understanding, and analyzing critical infrastructure interdependencies. IEEE Control Systems Magazine. 2001;21(6):11-25. DOI: 10.1109/37.969131
  3. 3. Foglietta C, Panzieri S, Pascucci F. Algorithms and tools for risk/impact evaluation in critical infrastructures. International Journal of Critical Infrastructures. 2015;565:227-238. DOI: 10.1007/978-3-662-44160-2_8
  4. 4. Coletti A, De Nicola A, Vicoli G, Villani ML. Semantic modeling of cascading risks in interoperable socio-technical systems. In: Popplewell K, Thoben K-D, Knothe T, Poler R, editors. Enterprise Interoperability viii. Cham: Springer International Publishing; 2019. pp. 119-129
  5. 5. Nicola D, Antonio MM, Villani ML. Creative design of emergency management scenarios driven by semantics: An application to smart cities. Information Systems. 2019;81:21-48. DOI: 10.1016/j.is.2018.10.005. Available from: https://www.sciencedirect.com/science/article/pii/S0306437918304277
  6. 6. Coletti A, De Nicola A, Villani ML. Building climate change into risk assessments. Natural Hazards. 2016;84(2):1307-1325. DOI: 10.1007/s11069-016-2487-6
  7. 7. Coletti A, De Nicola A, Di Pietro A, La Porta L, Pollino M, Rosato V, et al. A comprehensive system for semantic spatiotemporal assessment of risk in urban areas. Journal of Contingencies and Crisis Management. 2020;28(3):178-193. DOI: 10.1111/1468-5973.12309. Available from: https://onlinelibrary.wiley.com
  8. 8. Antonio DN, Villani ML. Smart city ontologies and their applications: A systematic literature review. Sustainability. 2021;13(10):20-60. DOI: 10.3390/su13105578. Available from: https://www.mdpi.com/2071-1050/13/10/5578
  9. 9. Rosato V, Di Pietro A, Kotzanikolaou P, Stergiopoulos G, Smedile G. Integrating resilience in time-based dependency analysis: A large-scale case study for urban critical infrastructures. In: Rosato V, Di Pietro A, editors. Issues on Risk Analysis for Critical Infrastructure Protection, Chap. 5. Rijeka: IntechOpen; 2021. DOI: 10.5772/intechopen.97809
  10. 10. Macaulay T. Critical Infrastructure: Understanding its Component Parts, Vulnerabilities, Operating Risks, and Interdependencies. Taylor & Francis; 2008
  11. 11. Eeten M, Nieuwenhuijs A, Luiijf E, Klaver M, Cruz E. The state and the threat of cascading failure across critical infrastructures: The implications of empirical evidence from media incident reports. Public Administration. 2011;89:381. DOI: 10.1111/j.1467-9299.2011.01926.x
  12. 12. Franchina L, Carbonelli M, Crisci M, Gratta L, Perucchini D. An effective approach for cascading effects prevision in critical infrastructures. Studies in Computational Intelligence. 2011a;7:73-90. DOI: 10.1504/IJCIS.2011.038958
  13. 13. Nirandjan S, Koks EE, Ward PJ, Aerts JCJH. A spatially-explicit harmonized global dataset of critical infrastructure. Scientific Data. 2022;9(1):150. DOI: 10.1038/s41597-022-01218-4
  14. 14. Minardi R, Villani ML, De Nicola A. Semantic reasoning for geolocalized assessment of crime risk in smart cities. Smart Cities. 2023;6(1):179-195. DOI: 10.3390/smartcities6010010. Available from: https://www.mdpi.com/2624-6511/6/1/10
  15. 15. Ali AL, Schmid F. Data quality assurance for volunteered geographic information. In: Duckham M, Pebesma E, Stewart K, Frank AU, editors. Geographic Information Science. Cham: Springer International Publishing; 2014. pp. 126-141

Notes

  • https://websites.fraunhofer.de/CIPedia/index.php/Interdependency.
  • https://www.en.wikipedia.org/wiki/Statistical_Classification_of_Economic_Activities_in_the_European_Community.
  • https://www.istat.it/it/archivio/17888.
  • http://taginfo.openstreetmap.org.
  • Overpass-turbo.eu.

Written By

Antonio Di Pietro, Alessandro Calabrese, Antonio De Nicola, Daniele Ferneti, Luisa Franchina, Josè Martì and Tommaso Ruocco

Reviewed: 29 August 2023 Published: 16 November 2023