Application of the Bayesian Approach to Incorporate Helium Isotope Ratios in Long-Term Probabilistic Volcanic Hazard Assessments in Tohoku, Japan



Introduction
Geological hazard assessments based on established statistical techniques are now commonly used as a basis for decisions that may affect society over the long term (0.1-1 Ma). Volcanic risk essentially consists of: (1) the probability of a 'volcanic event' occurring, such as a dike intrusion or the formation of a new strato- or caldera volcano [e.g. 1-7]; and (2) the consequences of the volcanic event [e.g. 8-9].
A challenge with the long-term probabilistic assessment of future volcanism in relation to the siting of, for example, geological repositories is that, because new volcano formation is rare, uncertainties in models are inherently large [10]. Sites for nuclear facilities in particular must be located in areas of very low geologic risk [11]. Recent studies have examined the hazard posed by volcanoes to nuclear power plants in Armenia [e.g. 9,12] and Java, Indonesia [e.g. 13]. In those studies the focus was more on the consequences of an eruption at an existing volcano for the safety of an operating nuclear power plant. In the case of a geological repository for high and/or low level radioactive waste, the emphasis is on the consequences of new igneous activity, such as a dike that may intrude the repository [e.g. 14] and transport the waste to the surface. In this case, the probability of a new volcano forming in the first place is very low (typically < $10^{-7}$/a), since by definition such facilities should be located away from existing Quaternary volcanoes. However, the lack of volcano 'data' implies that additional information on the processes that control the future long-term spatio-temporal distribution of volcanism is needed. This has motivated several investigators to incorporate datasets in addition to the distribution and timing of past volcanic activity in volcanic probabilistic analyses [e.g. 15]. Bayesian inference has been used to combine geophysical datasets with probability distributions constructed from known historic volcano locations in order to estimate the location of future volcanism on a regional scale [1]. More recently, [16] used Bayesian inference to merge prior information and past data to construct a probability map of vent opening at the Campi Flegrei caldera in Italy.
Here we revisit the Bayesian approach developed by [1], in which seismic tomographs and geothermal gradients were incorporated into probabilistic assessments by Bayesian inference in Tohoku. We apply the same Bayesian technique in the same study area to incorporate recently acquired helium isotope data into probabilistic hazard assessments. Noble gases such as helium have been shown to be excellent natural tracers of mantle-crust interaction owing to their inert chemical properties, which mean that they are not altered by complex chemical processes. Moreover, helium isotopes provide evidence for the presence of mantle-derived materials in the crust, owing to the distinct isotopic compositions of the crust and the upper mantle [e.g. 17,18]. We examine the link between volcanism and ³He/⁴He ratios, which may indicate possible regions of magma generation and hence volcano formation. Such links between magmatism and elevated ³He/⁴He ratios have been proposed [e.g. 19,20], but they have not been examined quantitatively in probabilistic models. Finally, we discuss the Bayesian method developed by [1] in the context of recent approaches to incorporating multiple datasets [e.g. 21,22].

Japan and the Tohoku region
Japan is one of the most tectonically active regions in the world. Owing to the dynamics of four plates, Quaternary volcanoes have formed along distinct volcanic fronts in east and west Japan (Figure 1).
The Tohoku region (Figure 2) is arguably one of the most extensively studied volcanic arcs in the world, particularly regarding the relationship between volcanism and tectonics. Moreover, numerous geological and geophysical investigations have yielded high-quality datasets [e.g., 23-29].
Tohoku is a mature double volcanic arc with a back-arc marginal sea basin, located on the convergent boundary between the subducting Pacific plate and the North American plate (Figure 1). The location and orientation of the volcanic front (grey line in Figure 2) have been linked to the opening of the Sea of Japan and the subduction angle of the Pacific plate [e.g., 26,33]. From 60 Ma until about 10 Ma the volcanic front migrated east and west several times; however, it has been relatively static during the last 8 Ma [26].
Presently there are 15 known historically active volcanoes in the Tohoku region and a total of 170 volcanoes that formed during the Quaternary [30]. Volcanism has gradually become more clustered and localized from 14 Ma to the present [34]; thus volcano clustering is a characteristic feature of Tohoku.

Defining the volcanic event
What do we mean here by 'volcano'? In developing probabilistic models, one of the most difficult and challenging tasks is defining the 'volcanic event'. This is because the volcanic event defined has to be simple and consistent enough for the probabilistic models to handle. To a certain extent, the degree of consistency that can realistically be included in a model is constrained by the size of the study area and by the amount and quality of geological, geochemical and/or geophysical data available. The volcanic event could range from a single eruption to a series of eruptions. It could be defined as the existence of a relatively young cinder cone, spatter mound, maar, tuff ring, tuff cone, pyroclastic fall, lava flow or even a large composite volcano. On the other hand, older edifices may have been eroded and/or covered by sedimentary deposits such as alluvium, and thus be more difficult to locate and/or easily overlooked. Magnetic and gravity data have been used as evidence for locating such hidden volcanic events, which in turn had an impact on the resulting probabilities at given locations [e.g. 15].
Figure 1. (Modified from [1].) The four main islands that make up Japan are located on or near the boundaries of four plates. Black triangles denote Quaternary volcanoes and the red lines depict the main volcanic fronts. The thin contour lines denote the depth of the subducting Pacific Plate beneath Japan. The velocities and arrows indicate the subduction rates and directions, respectively, of the Pacific, Philippine Sea and Eurasian plates relative to the North American plate.
If we were carrying out a hazard assessment on a single volcano, we might be interested in defining the event as a series of pyroclastic flows or surges, or as eruptions that generate lava flows exceeding a certain volume [e.g. 9]. This is particularly relevant to volcanic hazard assessments carried out at volcanoes near densely urbanized areas such as the Campi Flegrei caldera in southern Italy [16].
Figure 2. ([28,30]; modified from [1].) The highest density of volcanoes in Tohoku is the cluster in the Sengan region. Other notable volcanoes are the Towada volcano, which has been the site of late Quaternary large-volume felsic eruptions resulting in large caldera formation [e.g., 27,31], and the Iwaki and Chokai [32] volcanoes, which are active andesitic volcanoes on the back-arc side of Tohoku. The grey line denotes the present-day volcanic front.
Updates in Volcanology - New Advances in Understanding Volcanic Systems
Several aligned edifices with the same eruption age may also be considered as a single volcanic event. Such vent alignments typically developed simultaneously as a result of magma supply from a single dike. For example, the vent alignments in the Higashi-Izu monogenetic volcanic group [e.g. 35] could well be classified as a single volcanic event temporally, even though spatially they are multiple. Where age data have been limited, some authors have implemented a condition whereby a cone or cones can only be defined as a volcanic event if they are associated with a single linear dike or a dike system with more complex geometry [e.g. 36].
Many of the advances made in modelling future spatial or spatio-temporal patterns of volcanism were carried out in monogenetic volcano fields, owing to the apparent relative ease of treating such volcanoes as point processes [e.g., 37,38]. However, as composite or established polygenetic volcanoes represent multiple eruptions from the same conduit occurring over several tens to hundreds of thousands of years, defining the volcanic event is not so easy if the focus is on single eruption episodes, because the type of eruption can evolve significantly during the lifetime of the composite volcano. In fact, the temporal definition of a monogenetic volcano is not so straightforward either, as its formation can take from several days to a few weeks or longer. For example, the Ukinrek maar in Alaska formed in about eight days [39], and the 1913 eruption of Ambrym volcano, Vanuatu, in the southwest Pacific took place over just a few days [40]. Moreover, [41] argued that monogenetic volcanoes can be both spatially and temporally more complex than a single eruptive event. In other words, so-called 'monogenetic' volcanoes can also be 'polygenetic', albeit on a smaller scale than large-volume complex strato- or caldera volcanoes. On this basis there could be a case for looking again at the volcanic event definition used in earlier probabilistic assessments carried out in monogenetic volcanic fields [e.g. 42,43].
In Tohoku, new volcanoes forming at new locations typically evolve into large complex strato- and/or caldera volcanoes containing multiple vents, e.g. Akitakomagatake volcano [44]. Such large polygenetic volcanoes in Tohoku have been sub-grouped into unstable types, where the eruptive centre has migrated more than 1.5 km within 10 ka, and stable types, where the vents are more concentrated around the geographic centre of the volcano [44].
The volcanic event definition requires information on both the temporal and spatial aspects; the temporal definition relates to the recurrence rate $\lambda_t$ (number of volcanic events per unit time), and the spatial definition to the intensity or spatial recurrence rate $\lambda_{x,y}$ (number of volcanic events per unit area). $\lambda_t$ and $\lambda_{x,y}$ can also be combined as a spatio-temporal recurrence rate $\lambda_{x,y,t}$ (number of volcanic events per unit area per unit time) [42].
The temporal definition of a volcanic event could range from a single eruption occurring in one day or less to an eruption cycle in which active periods of eruptions occur between dormant periods. The time scale of an active period may vary from several years to thousands of years. In previous volcanic hazard analyses carried out on complex, large-volume strato- and/or caldera volcanoes, volcanologists have typically defined volcanic events as single eruptions, or as several eruptions within some defined time period separated by periods in which there is no activity [e.g. 4]. This is because the focus at such established volcanoes is not on the probability of a new volcano forming in the vicinity of the existing volcano but rather on the probability of the next eruption or eruption phase.

Tohoku volcanic event definition
In the context of siting a geological repository, the main concern is the formation of a new volcano in a region where volcanoes do not already exist. Thus the distinction between monogenetic (simple or complex) and polygenetic (complex strato- and/or caldera) volcanism is not relevant for the definition of the volcanic event here. Table 1 is a compilation of all Quaternary volcanoes in the Tohoku volcanic arc, modified from the Catalog of Quaternary Volcanoes in Japan [1,30]. Volcano complexes refer to magma systems that have evolved over the long term (order of 0.1 Ma) and appear as regional-scale clusters. In this chapter we use the same definition of volcanic event as [1], taking into account eruption volumes. The volcanic event is depicted as a white triangle in Figure 3 and is the average geographic location of the vents (white dots). The eruption products released from the vents are represented by the dark grey regions in Figure 3. The lighter grey areas in Figure 3a are the eruption products of a separate volcanic event. Each volcanic event typically has a time gap of more than 10 ka, and/or is differentiated from other volcanic events according to geochemistry.
Table 1 note [30,28]: Dense-rock equivalent (DRE) of eruptive volumes is the product of volume and density of the respective volcanic deposits.

Bayesian model
The following is a condensed description of the Bayesian methodology published in [1]. A two-dimensional surface distribution is set up showing the continuous probability of one or more volcanic event(s) forming within a region of interest, in an arbitrary time frame of the order of 0.1-1 Ma. The volcanic event definition given above means that we are estimating, with known uncertainty, the probability of a new volcano forming at a given location $(x, y)$. [1] noted that a challenge with estimating the long-term future spatial distribution of volcanism is the fact that we are trying to model something that we cannot sample directly, namely the locations of future volcanoes. In this chapter we incorporate ³He/⁴He ratios, as these may be indicative of conduits in the Earth's crust through which magma may rise, resulting in future volcano formation [19,20].
Information, no matter how obtained, can be described by a probability density function (PDF) [e.g. 45,46]. Once a dataset is expressed as a PDF, it can be combined with our initial PDF, which is based on a priori assumptions about volcanism. Bayesian inference is a powerful tool that allows us to construct an a posteriori PDF given the a priori assumptions and the PDF generated from the new dataset.
Essentially, two stages are performed to yield the a posteriori PDF. The first is to make a long-term future prediction based solely on the distribution and ages of past volcanic events, creating an a priori PDF. The a priori assumption is that the past and the present provide information about the future; in other words, the locations of past and present volcanism are used as an initial guide to estimating future long-term spatial patterns of volcanism. The basic logic behind the a priori assumption is that a new volcano does not form far from existing volcanoes. The a priori assumption can be quite vague in the first step, as it is simply the starting point. The next stage is to update or modify the a priori assumptions by incorporating information that is likely to be indicative of the locations of future volcanism and/or that reflects an improved understanding of the process that controls the location of volcanism. This new information and/or knowledge, obtained from chemical and/or geophysical data, is used to modify the a priori PDF to form an a posteriori PDF that is expected to better reflect the location of future volcanism. The cycle can be repeated any number of times for other datasets by treating the a posteriori PDF as the new a priori PDF in the first step above.

Bayesian inference and Bayes' theorem
Bayes' theorem [e.g. 47] is used to set up a model providing a joint probability distribution for the locations of known volcanic events (a priori PDF) and the current contoured $R/R_A$ datasets recast as a PDF (likelihood function). The joint probability density function, or a posteriori PDF, can be written as the product of two PDFs, the a priori PDF and the sampling or likelihood PDF:
$$P(x, y \mid \theta) = \frac{P(x, y)\, L(\theta \mid x, y)}{\int_A P(x, y)\, L(\theta \mid x, y)\, dx\, dy} \qquad (1)$$
where $x$ and $y$ represent grid point locations within the volcanic field $A$, $\theta$ is the additional dataset, $P(x, y)$ is the a priori PDF, $L(\theta \mid x, y)$ is the likelihood function generated by conditioning the additional data on the locations of volcanic events, and $P(x, y \mid \theta)$ is the resulting a posteriori PDF [1]. The a posteriori PDF is normalized to unity by integrating over the entire Tohoku volcanic field; hence the total cumulative probability does not change, but the shape of the 2-D surface distribution is modified according to the likelihood function.
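As an illustration only, the update can be sketched on a discrete grid in Python. The sketch assumes that the a posteriori value in each cell is the product of the a priori value and the likelihood, renormalized so that the grid sums to one; the function name `bayes_update` and the 2 x 2 values are hypothetical and are not taken from [1].

```python
def bayes_update(prior, likelihood):
    """Combine a gridded a priori PDF with a likelihood surface, cell by cell.

    Both inputs are 2-D lists of non-negative numbers on the same grid.
    The product is renormalized so that the a posteriori values sum to 1.
    """
    product = [[p * l for p, l in zip(p_row, l_row)]
               for p_row, l_row in zip(prior, likelihood)]
    total = sum(sum(row) for row in product)
    if total <= 0:
        raise ValueError("prior and likelihood have no overlapping support")
    return [[v / total for v in row] for row in product]


# Hypothetical 2 x 2 grid: uniform prior, likelihood favouring the top-left cell.
prior = [[0.25, 0.25], [0.25, 0.25]]
likelihood = [[0.6, 0.2], [0.1, 0.1]]
posterior = bayes_update(prior, likelihood)
```

Because the output is itself a normalized grid, it can be passed back in as the prior when a further dataset is added, which mirrors the repeated cycle described in the text.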

A priori PDF
We assume that past and present volcanic events can be used to estimate future locations of volcanoes over the long term, as well as to constrain upper-bound recurrence rates in the volcanic field. The spatial distribution of volcanoes in volcanic arcs like Tohoku is random [48]; hence, by treating volcanism in the Tohoku volcanic arc as a low-frequency, random event, it is assumed that the underlying process can be approximated by a Poisson process [1]. Moreover, by treating the locations of volcanic events as random points within some set, the spatial distribution of volcanism can be modeled as a spatial point process [1], where a spatial point process is a stochastic model that can be described as the process controlling the spatial locations of the events $s_1, \ldots, s_n$ in some arbitrary set $S$ [49]. In applying point process models to volcanism, [42] eloquently defined $s_1, \ldots, s_n$ as volcanic events and $S$ as the volcanic field.
The Poisson process is 'homogeneous' if the spatial distribution of point events is completely random [49]. However, as with many volcanic fields, spatial patterns of volcanism in the Tohoku volcanic arc are clustered [34,50]; hence the distribution of volcanoes is not completely random and is therefore non-homogeneous (also referred to as inhomogeneous). Applying the Clark-Evans nearest-neighbour test [51], [1] showed that the distribution of the volcanic events defined above is clustered with greater than 95% confidence. A non-homogeneous Poisson process is the simplest alternative for modeling such clustered events. Moreover, point process models based on non-homogeneous Poisson processes have been used extensively in modeling the spatial and spatio-temporal characteristics of several volcano fields (e.g. the Springerville volcanic field in Arizona [38] and the Higashi-Izu monogenetic volcano group, Shizuoka Prefecture, Japan [43]). In these models the local spatial density of volcanic events $\lambda_{x,y}$ is calculated using a kernel function [37,52]. The kernel function itself is a density function used to obtain the intensity of volcanic events at a sampling point $(x_p, y_p)$, calculated as a function of the distance to nearby volcanoes and a smoothing constant $h$ (Figure 4).
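The nearest-neighbour test mentioned in the text can be sketched as follows. This is a minimal version of the Clark-Evans statistic without edge correction, which is an assumption, since the variant applied in [1] is not stated here; the point coordinates and study-area size are hypothetical. A ratio R below 1 indicates clustering, and under the usual normal approximation a z value below about -1.645 corresponds to one-sided 95% confidence.

```python
import math


def clark_evans(points, area):
    """Return (R, z) for the Clark-Evans nearest-neighbour test.

    points: list of (x, y) coordinates; area: size of the study region.
    No edge correction is applied (simplifying assumption).
    """
    n = len(points)
    if n < 2:
        raise ValueError("need at least two points")
    # Distance from each point to its nearest neighbour.
    nearest = []
    for i, (xi, yi) in enumerate(points):
        nearest.append(min(math.hypot(xi - xj, yi - yj)
                           for j, (xj, yj) in enumerate(points) if j != i))
    r_obs = sum(nearest) / n
    density = n / area
    r_exp = 0.5 / math.sqrt(density)            # expectation under complete randomness
    std_err = 0.26136 / math.sqrt(n * density)  # standard error of the mean distance
    return r_obs / r_exp, (r_obs - r_exp) / std_err


# Hypothetical example: two tight clusters in a 100 km x 100 km region.
points = [(10, 10), (10, 11), (11, 10), (11, 11),
          (80, 80), (80, 81), (81, 80), (81, 81)]
ratio, z = clark_evans(points, area=100.0 * 100.0)
clustered = ratio < 1.0 and z < -1.645
```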
As noted by [42], the choice of kernel function and of appropriate values of $h$ has consequences for the parameter estimation, because it controls how $\lambda_{x,y}$ varies with distance from existing volcanoes. The Gaussian kernel has been widely used in probabilistic assessments carried out in monogenetic volcanic fields [e.g. 15,43], since it was assumed that the next volcano to form would not be far from an existing volcano. In order to include extreme volcanic events further afield, however, [1] modelled spatial patterns using the Cauchy kernel, which has thicker tails than the Gaussian kernel. [1] also showed that the spatial distribution of volcanic events in the Tohoku volcanic arc fits a Cauchy distribution, whereas monogenetic fields such as the Higashi-Izu Monogenetic Volcano Group [43] tend to be Gaussian. We therefore also use a two-dimensional Cauchy kernel here to calculate the spatial recurrence rate $\lambda_{x,y}$ at grid point $(x_p, y_p)$ using equation (2), where $x_{vi}, y_{vi}$ are the Cartesian coordinates of the $i$th volcanic event, $N$ is the number of volcanic events used in the calculation and $l_{vi}$ is a factor for weighting the eruption volume of the corresponding $i$th volcanic event. $l_{vi}$ is set to unity when eruption volume is excluded. The calculation is repeated on a 10 km mesh in the study area, 139° to 143°E longitude and 37° to 41.6°N latitude, and the resulting PDF is normalized to unity. The 10 km grid spacing was selected taking into account the resolution of the available geophysical or geochemical datasets across the entire Tohoku volcanic arc.
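As an illustration only, the kernel calculation can be sketched in Python. The kernel shape used below, a weight $l_{vi}$ divided by $1 + d^2/h^2$ where $d$ is the distance from the grid point to the $i$th volcanic event, is an assumption standing in for the Cauchy kernel of [1] and should not be read as the published equation; the function name, the toy grid and the single event are hypothetical. Because the result is normalized to unity over the grid, any constant prefactor drops out.

```python
def cauchy_intensity(grid_points, events, h, weights=None):
    """Heavy-tailed kernel estimate of spatial intensity, normalized over the grid.

    grid_points: list of (x_p, y_p); events: list of (x_v, y_v); h: smoothing
    coefficient in the same length unit. weights plays the role of l_vi and
    defaults to 1 for every event (eruption volume excluded).
    """
    if weights is None:
        weights = [1.0] * len(events)
    raw = []
    for xp, yp in grid_points:
        total = 0.0
        for (xv, yv), w in zip(events, weights):
            d2 = (xp - xv) ** 2 + (yp - yv) ** 2
            total += w / (1.0 + d2 / h ** 2)   # assumed Cauchy-type kernel shape
        raw.append(total)
    norm = sum(raw)
    return [v / norm for v in raw]


# Hypothetical 5 x 5 grid at 10 km spacing with one event at its centre.
grid_points = [(x, y) for y in range(0, 50, 10) for x in range(0, 50, 10)]
events = [(20, 20)]
lam = cauchy_intensity(grid_points, events, h=1.5)
```

Swapping the kernel line for a Gaussian term would give much thinner tails, which is the contrast drawn in the text between the two kernels.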

Estimating an optimum smoothing coefficient h for the volcanoes in Tohoku
The choice of the smoothing coefficient depends on a combination of the size of the volcanic field, the size and degree of clustering, and the amount of robustness and conservatism required at specific points within or near the volcanic field in question. In order to estimate the most likely optimum value of the smoothing coefficient, [1] plotted cumulative probability densities (Figure 5). The cumulative plots in Figure 5 suggest that the spatial distribution of volcanic events in the Tohoku volcanic arc fits a Cauchy distribution with smoothing coefficients of h = 1-1.5 km.

A priori probabilities
Probability estimates for each grid point $(x_p, y_p)$ are computed using a Poisson distribution in which $\lambda_{x,y}$ is the intensity parameter computed using equation (2):
$$P\{N(t) \geq 1\} = 1 - \exp\left(-t\, \lambda_t\, \lambda_{x,y}\, \Delta x\, \Delta y\right) \qquad (3)$$
where $N(t)$ represents the number of future volcanic vents that occur within time $t$ and area $\Delta x \Delta y$ (10 km x 10 km). The parameter $\lambda_{x,y}$ is normalized to unity across Tohoku, so equation (3) represents the probability of one or more volcanic event(s) forming in an area $\Delta x \Delta y$ centred on point $(x_p, y_p)$, given the formation of a new volcanic event in Tohoku. This calculation is repeated on a grid throughout Tohoku. The resolution is such that the spatial recurrence rate $\lambda_{x,y}$ does not vary within each cell. For the regional recurrence rate $\lambda_t$, an average of 120 volcanic events per million years is used, effectively taking the average Quaternary activity [1].
Using smoothing coefficients of 1-1.5 km for the Cauchy kernel, as well as weighting by eruption volume, probability plots were constructed using equation (3). A probability contour plot for one case is shown in Figure 6.
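A short numerical sketch may help to fix the orders of magnitude. It assumes the standard Poisson form $P\{N(t) \geq 1\} = 1 - \exp(-t\,\lambda_t\,\lambda_{x,y}\,\Delta x \Delta y)$; the function name and the share of spatial intensity assigned to the cell are hypothetical, while the regional rate of 120 events per million years and the 100 ka time frame are taken from the text.

```python
import math


def prob_one_or_more(lambda_xy, lambda_t, t, cell_area=1.0):
    """Poisson probability of at least one event in a cell within time t.

    lambda_xy: normalized spatial intensity (per unit area, or per cell if
    cell_area is left at 1); lambda_t: regional recurrence rate per year;
    t: time frame in years.
    """
    expected = t * lambda_t * lambda_xy * cell_area
    return 1.0 - math.exp(-expected)


lambda_t = 120 / 1.0e6      # 120 volcanic events per million years
t = 1.0e5                   # 100 ka
cell_share = 0.001          # hypothetical: cell holds 0.1% of the spatial intensity
p = prob_one_or_more(cell_share, lambda_t, t)
annual_rate = p / t         # rough per-year equivalent for comparison with /a values
```

For small expected counts the probability is close to the expected number of events itself (here about 0.012), which is why the long-term values are often quoted as approximate annual rates.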

Figure 6. Probabilities of one or more volcanic events occurring in the next 100 ka based on the a priori PDF (Cauchy kernel, h = 1.5 km). White triangles denote the volcanic events used in the calculation and black lines are active faults [35]. Map labels: monogenetic volcanoes, Iwaki, Chokai, Towada.
The highest probabilities are located in the Sengan region ($10^{-6}$-$10^{-5}$/a), which has the highest density of volcanic events in the Tohoku volcanic arc. By testing the two volcanic event sub-definitions (weighted with and without eruption volume), [1] found that the probabilities in the vicinity of monogenetic volcanoes in the back-arc region were higher when volcanic events were not weighted with eruption volumes (1-4 x $10^{-7}$/a, weighted; 1-4 x $10^{-6}$/a, unweighted), whereas the probabilities around established centers such as Iwaki, Towada, Sengan and Chokai were reduced slightly. This is expected, as volcanoes with large eruption volumes are the sites of highest magma production. However, if the focus of the assessment is on new volcanic event formation, irrespective of whether or not the new volcano evolves into a large complex stratovolcano and/or caldera, then selecting the volcanic event definition that is not weighted with eruption volume would seem more appropriate.

The likelihood function
Here the a priori PDF is conditioned on ³He/⁴He ratios. This is done by normalizing the additional data into a likelihood function according to how such information is judged by the expert, and/or indicated by experimental results, to relate to the distribution of volcanism [1]. Helium isotopes have been shown to provide evidence for the presence of mantle-derived materials in the crust, and hence potential volcanism, based on distinct ³He/⁴He ratios (Figure 7) [17,18]. [1] looked at seismic tomographs and geothermal gradients. P velocity perturbations ($\Delta V/V$) were used in particular at 40 km depth [29], which is a good estimate of the minimum depth of partial melting in the mantle for most of the volcanoes in Tohoku. Geothermal gradients, on the other hand, were used by [1] as an additional aid to P velocity perturbations, since it is not possible to differentiate heat from P wave velocity alone.
Figure 7. Distribution of $R/R_A$ data ($R_A$ denotes the atmospheric ³He/⁴He ratio of $1.4 \times 10^{-6}$) taken from boreholes and hot springs [20,54,55].
In order to compare the $R/R_A$ ratios, cumulative plots of values around all volcanic events and of values in 10 km² bins over all of Tohoku are plotted. Figure 8 shows $R/R_A$ ratios below all volcanic events (8a) and below volcanic events younger than 100 ka (8b). In both cases approximately 90% of all volcanic events are distributed in regions with $R/R_A$ ratios greater than 3. In other words, 90% of volcanoes are located in regions where ³He/⁴He is elevated.
The $R/R_A$ ratios are interpolated to represent a continuous, differentiable surface, and the spatial data are then mapped into a likelihood function based on the percentage of recent volcanic events that lie within the binned $R/R_A$ ratios in Figure 8. For low P velocity perturbation, [1] assumed an inverse linear relationship, based on the interpretation that low P velocity perturbation corresponds to partial melting (and hence increased probability of volcanism). In that case, for example, 10% of the volcanoes younger than 100 ka were located in regions where $\Delta V/V$ ranged from -6% to -5%, and so on for the other bins. For geothermal gradients, [1] used a linear relationship for recasting the data values as a PDF.
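The binning step can be sketched as follows, as an illustration rather than the procedure actually used for Figure 8. The sketch assumes that each grid cell receives, as its likelihood, the fraction of recent volcanic events whose local ratio falls in the same bin; the bin edges, the event values and the grid values are all hypothetical, and values outside the bin range are clamped to the end bins (a further assumption).

```python
import bisect


def bin_index(value, edges):
    """Index of the bin containing value; out-of-range values use the end bins."""
    i = bisect.bisect_right(edges, value) - 1
    return min(max(i, 0), len(edges) - 2)


def bin_fractions(event_values, edges):
    """Fraction of volcanic events whose ratio falls in each bin."""
    counts = [0] * (len(edges) - 1)
    for v in event_values:
        counts[bin_index(v, edges)] += 1
    return [c / len(event_values) for c in counts]


def likelihood_surface(grid_values, edges, fractions):
    """Map each gridded ratio to the event fraction of its bin."""
    return [[fractions[bin_index(v, edges)] for v in row] for row in grid_values]


# Hypothetical ratios at ten recent volcanic events and on a 2 x 2 grid.
edges = [0.0, 3.0, 6.0, 9.0]
event_values = [1.0, 3.5, 4.0, 5.5, 6.5, 7.0, 7.5, 8.0, 4.5, 6.2]
fractions = bin_fractions(event_values, edges)
grid_values = [[0.5, 4.0], [7.0, 2.9]]
likelihood = likelihood_surface(grid_values, edges, fractions)
```

The resulting surface does not need to sum to one, because the a posteriori PDF is renormalized after it is multiplied by the a priori PDF.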

A posteriori probabilities
Finally, the a posteriori PDF is calculated from the likelihood function and the a priori PDF using equation (1). The integral across the entire field of both the a priori and the a posteriori PDFs is set to unity; however, the shape of the distribution is modified by the likelihood function. The probability of a new volcanic event is calculated for each grid point using equation (3). Using equations (1) to (3) above, two-dimensional probability plots are subsequently constructed showing the probability of one or more future volcanic event(s) forming during the long term, given that a volcanic event will occur in the Tohoku volcanic arc during 100 ka. Figure 9 shows a comparison of the a priori probability (9a) and the a posteriori probability (9b), conditioned on $R/R_A$ ratios, of one or more volcanic events forming in 100 ka.
The probability of new volcanic event formation in the forearc region to the east of the volcanic front is reduced slightly in the a posteriori probability calculation. This is more evident when we repeat the calculation for new volcanic events forming in the next 1 Ma (Figure 10).
Figure 10. Probability of the formation of a new volcanic event over the next 1 Ma; a priori (a) and a posteriori (b) probability plots calculated with a Cauchy kernel (h = 1.5 km) conditioned on $R/R_A$ ratios.
The $R/R_A$ analyses are compared with the probability calculations conditioned on P velocity perturbations (10 or 40 km depth) (Figure 11) and geothermal gradients (Figure 12) [1]. The a posteriori probability below the Iwaki volcanic event is particularly low when conditioned on the 40 km depth P velocity perturbation dataset, whereas there is no significant change beneath Chokai, another andesitic volcano on the back-arc side of Tohoku. With the a posteriori calculation conditioned on the $R/R_A$ analyses, no decrease is seen in the probabilities below Iwaki volcano when compared with the a priori plots. Similar results can be seen when probabilities are conditioned on 10 km depth P velocity perturbations (Figure 11a) or geothermal gradients (Figure 12) [1]. [1] found that a posteriori probabilities are not reduced relative to a priori probabilities in the northern regions when conditioning on shallower (10 km) P velocity perturbations or on geothermal gradients. This seems reasonable, as the seismic velocity structure [57] and the depth of Curie isotherms [58] in this part of Tohoku reveal high-temperature-like geophysical anomalies at depths of up to 10 km below Iwaki volcano, which may be indicative of shallower (ca. 10 km) magma chambers.

Discussion
The main advantage of probabilistic models over deterministic models is that the probability of new volcanic event formation is never zero. [1] showed that Bayesian inference is well suited to formally combining observations relevant to the imaging of the magma source region (e.g. seismic tomography) with quantitative methods for estimating volcano intensity. Moreover, the strength of Bayesian inference is that probabilistic assessments can be improved with increased understanding of the physical processes governing magmatism and/or with data that may be indicative of future volcanism, such as the helium isotope ratios presented here. Nevertheless, it is worth examining the logic behind what we perceive to be 'data' and what we mean by a priori information and knowledge.

Which datasets are a priori information?
[1] used the volcano geographical datasets themselves as a starting point in their analysis. The same approach was applied in this chapter. In the first step a Cauchy kernel was used to calculate $\lambda_{x,y}$. This means that the probability of new volcanic event formation decreases with increasing distance from existing volcanic events. In the case of selecting a location for a geological repository, there may be a need to have a conservative estimate and accept that extreme events may occur. In this case, selection of the Cauchy kernel for the a priori PDF would be most appropriate owing to the thickness of its tails. This is especially the case if we have to make probability calculations for periods of 1 Ma, over which the tectonic setting can change and the location of the volcanic front may shift. The probabilities in distal regions would only be reduced in the a posteriori probability calculation if newly obtained evidence in such regions shows that the likelihood of volcano formation is zero or close to zero. Since $R/R_A$ ratios vary owing to the heterogeneous release of mantle helium, and elevated ratios are likely to indicate the presence of partial melting [e.g. 20], such datasets may give some indication of the future location of volcanism even in non-volcanic regions. Seismic tomography, on the other hand, offers a direct view of the mantle that can be interpreted in terms of the degree of partial melting [e.g. 58,59].
It could equally be argued, however, that the logic of [1] should be reversed, in that the models based on seismic tomography or elevated helium isotope ratios are in fact a priori information or knowledge, and the locations of volcanic events are the 'data'. The philosophy here is that we assume new volcanic events will form in regions where partial melting is likely to be occurring now, and that the distribution of known volcanic events is the dataset updating our model and/or knowledge. However, this may be true for very recent volcanism, up to about 1,000 years old say, but how relevant are volcanic events that formed 100,000 years ago or more to the present-day geophysical snapshot of the Earth's crust or upper mantle? This question is difficult to answer, as there is very little information on the temporal behaviour of partial melting in the mantle. This is also evident when we try to evaluate our forecasts below. A problem here is that we are always trying to predict the formation of future volcanic events, which may or may not be related to historic volcanic events. Our closest 'data' to such future events are thus present-day geophysical snapshots of the current conditions in the crust or upper mantle and/or newly formed or forming volcanic events. This was the motivation for [1] to use such geophysical data or models as the basis of the likelihood function.
On the other hand, there are also practical aspects to be considered, particularly when starting a hazard analysis in a region where there have not been many studies. In such a case, the only data available to begin with might be the geographical locations of volcanoes. Information from more complicated and expensive surface-based investigations might not come until later.
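The thick-tail argument for the Cauchy kernel made above can be illustrated with a short numerical sketch. The exact kernel expression used in the analysis is not reproduced in this section, so the code assumes the standard isotropic bivariate Cauchy density, which integrates to one over the plane, and compares it with a Gaussian kernel of the same bandwidth. The event coordinates and evaluation points are hypothetical and serve only to show the behaviour near and far from existing events.

```python
import numpy as np

def cauchy_kernel_2d(d, h):
    """Isotropic bivariate Cauchy density at distance d (km), bandwidth h (km).

    Assumed form: 1 / (2*pi*h^2) * (1 + (d/h)^2)^(-3/2).
    """
    return 1.0 / (2.0 * np.pi * h**2) * (1.0 + (d / h) ** 2) ** -1.5

def gaussian_kernel_2d(d, h):
    """Isotropic bivariate Gaussian density, shown only for comparison."""
    return 1.0 / (2.0 * np.pi * h**2) * np.exp(-0.5 * (d / h) ** 2)

def spatial_intensity(points_xy, events_xy, h, kernel):
    """Mean kernel contribution of all events at each point (per km^2)."""
    diff = points_xy[:, None, :] - events_xy[None, :, :]
    d = np.linalg.norm(diff, axis=2)  # point-to-event distances in km
    return kernel(d, h).mean(axis=1)

# Hypothetical volcanic event coordinates (km), not taken from the Tohoku data.
events = np.array([[0.0, 0.0], [2.0, 1.0], [1.0, 3.0]])
# One point inside the cluster and one distal point.
points = np.array([[1.0, 1.0], [50.0, 50.0]])

lam_cauchy = spatial_intensity(points, events, h=1.5, kernel=cauchy_kernel_2d)
lam_gauss = spatial_intensity(points, events, h=1.5, kernel=gaussian_kernel_2d)
```

At the distal point the Gaussian estimate is effectively zero, whereas the Cauchy estimate remains small but finite. This is the conservative behaviour described above, which new evidence can then reduce in the a posteriori calculation.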

Model evaluation
Since it is not possible to infer directly the location of volcanic events that will form in the next 0.1 to 1 Ma, models can instead be evaluated by calculating the probability of the volcanic events that formed after some time in the past, using all volcanic events that formed before that time [1,38]. Since we calculate the probability of future volcanism in the next 100 ka in most of the analyses described here, 100 ka is selected as the timeframe in the verification calculations. In Tohoku, as there are a large number of dated volcanic events, it is possible to verify the Bayesian models to a certain extent by using all volcanic events that formed before 100 ka to predict the location of volcanic events that formed between 100 ka and the present day. Since the 'new' volcanic events are still in the past, it is possible to compare probability plots with the locations of the volcanic events we are attempting to forecast. Figure 13 shows probability plots for the Cauchy PDF (h=1.5 km) and the a posteriori probability conditioned on R/RA ratios. All Quaternary volcanic events that formed before 100 ka (white triangles) were used to make a forecast for the period from 100 ka to the present day. All volcanic events that formed during the forecast period are shown in red. The probability calculations are then compared with the locations of the volcanoes that formed during the forecast period.
In both cases, all subsequent volcanoes formed in regions where the probability was at least 10%. Approximately 50% of the newly formed volcanic events formed in regions where the probability was at least 25%. There was an increase of approximately 10% in the probabilities at the locations where volcanoes formed in the a posteriori probability calculations.
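The verification procedure described above can be sketched as two small steps: splitting the dated events at the 100 ka cutoff, and counting the fraction of forecast-period events that lie where the forecast value meets a chosen threshold. The function names, ages, and forecast values below are hypothetical illustrations, not the Tohoku dataset or the results quoted above.

```python
import numpy as np

def hindcast_split(ages_ka, cutoff_ka=100.0):
    """Return boolean masks (training, target) for events split by age.

    Events older than the cutoff are used to build the forecast;
    events at or younger than the cutoff are the ones to be forecast.
    """
    ages_ka = np.asarray(ages_ka, dtype=float)
    training = ages_ka > cutoff_ka
    return training, ~training

def fraction_captured(forecast_at_targets, threshold):
    """Fraction of target events located where the forecast >= threshold."""
    forecast_at_targets = np.asarray(forecast_at_targets, dtype=float)
    return float(np.mean(forecast_at_targets >= threshold))

# Hypothetical event ages in ka.
ages = [1500.0, 800.0, 300.0, 90.0, 20.0, 5.0]
training_mask, target_mask = hindcast_split(ages, cutoff_ka=100.0)

# Hypothetical forecast values read off a probability map at four target events.
forecast_values = [0.30, 0.12, 0.26, 0.40]
captured_10 = fraction_captured(forecast_values, 0.10)
captured_25 = fraction_captured(forecast_values, 0.25)
```

Running the same count for the a priori and a posteriori maps gives a simple, like-for-like comparison of the two forecasts.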
The probability calculations above were made using a single inference on one set of data. However, Bayes' theorem allows beliefs to be updated as additional information becomes available. [1] attempted this by combining geothermal and seismic tomography datasets (Figure 14).
When conditioned on P velocity perturbations at 40 km depth, the model assigned a low probability (< 10^-9/a) to the region in which the Iwaki volcano formed. This could be improved upon by including both P velocity perturbations at 10 km depth and geothermal gradients [1].
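The idea of updating beliefs as further datasets become available can be sketched on a discretised grid, where the a posteriori value in each cell is proportional to the a priori value multiplied by a likelihood derived from the new dataset. The sketch below assumes that the datasets are conditionally independent, so that the posterior from one update can serve as the prior for the next. The cell values and the two likelihood arrays are hypothetical and do not represent the way the likelihood functions were constructed in [1].

```python
import numpy as np

def bayes_update(prior, likelihood):
    """Cell-wise Bayes update: posterior is prior * likelihood, renormalised."""
    prior = np.asarray(prior, dtype=float)
    likelihood = np.asarray(likelihood, dtype=float)
    unnormalised = prior * likelihood
    total = unnormalised.sum()
    if total <= 0.0:
        raise ValueError("prior and likelihood have no overlapping support")
    return unnormalised / total

# Hypothetical a priori spatial probabilities for four grid cells (sum to 1).
prior = np.array([0.50, 0.30, 0.15, 0.05])
# Hypothetical likelihoods derived from two independent datasets.
like_dataset_a = np.array([0.2, 0.4, 0.3, 0.1])
like_dataset_b = np.array([0.1, 0.5, 0.3, 0.1])

post_a = bayes_update(prior, like_dataset_a)        # first update
post_ab = bayes_update(post_a, like_dataset_b)      # second, sequential update
post_ba = bayes_update(bayes_update(prior, like_dataset_b), like_dataset_a)
```

Under the independence assumption the order of the updates does not matter, so `post_ab` and `post_ba` agree. If the datasets are correlated, multiplying their likelihoods in this way would overstate the evidence.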

Varying the temporal recurrence rate
The temporal recurrence rate in Tohoku has been in steady state from 0.5 Ma to the present [28]. This implies that the recurrence rate is likely to remain in steady state for at least the next 0.1 Ma. However, if we need to assess volcanism over a much longer time frame, such as 1.0 Ma, more care is needed. In addition to temporal recurrence rates, the type of volcanism can also change over extended timeframes. For example, [28] used eruptive volumes of volcanic products along the volcanic front in Tohoku to identify three sub-stages with distinct types of volcanism and volumetric changes in the last 2.0 Ma. From 2.0 to 1.2 Ma, large-scale felsic eruptions were predominant; from 1.2 to 0.5 Ma, the crustal stress changed to compression, leading to the formation of strato-volcanoes all along the Tohoku volcanic arc. Finally, from 0.5 Ma to the present day, volcanically active areas became localized [34]. The volcanic front also shifted over a 2.0 Ma period [60] (Figure 15). It can thus be argued that for periods beyond 0.1 Ma, it is unreasonable to treat λ_t in equation (3) as constant or steady state. One option might be to assign, say, a Weibull function in which recurrence rates can increase or decrease with time [61], provided there is sufficient age data to indicate temporal trends statistically. Alternatively, one could assume that the temporal recurrence rates are entirely random with a tendency to cluster temporally [e.g. 22,62]. Moreover, [22] showed that time clustering can have an impact on the spatial intensity of volcanoes.

Figure 13. Verification probability plots calculated using all volcanic events before 100 ka (white triangles) in order to predict the subsequent distribution of volcanic events that formed from 100 ka to present (red triangles) for (a) the a priori probability (Cauchy, h=1.5 km, eruption volume weighting included) and (b) the a posteriori probability conditioned on R/RA ratios.
A challenge with utilizing temporal data, though, is the quantity and quality of the age datasets, together with the need for consistent temporal definitions, since eruptions may last for several days, weeks, months, years or even longer. Having a consistent temporal definition is especially challenging when handling volcanic datasets on the regional scale described in this chapter. As highlighted in section 3, even for monogenetic volcanoes, the temporal definition is not so straightforward [41]. It was for this reason that [42] argued that a drawback of nearest-neighbour models, which are a function of both spatial and temporal parameters, is that they require the ages of every single volcanic event within the volcanic field in question. Nevertheless, in certain cases, such as tectonically controlled basaltic fields, eruptions can be time predictable [63]; hence there is potential to improve on the Bayesian model presented here by taking into account time clustering in the temporal rate parameter.
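The Weibull option mentioned above for a time-varying recurrence rate can be sketched as follows. The code assumes the common power-law (Weibull) intensity for a non-homogeneous Poisson process, λ(t) = (k/θ)(t/θ)^(k-1), which may differ in detail from the formulation in [61], and it assumes a Poisson model for the probability of one or more events. The parameter names `k` and `theta` and all numerical values are hypothetical; time is measured from an arbitrary origin in any consistent unit.

```python
import math

def weibull_rate(t, k, theta):
    """Weibull (power-law) recurrence rate at time t > 0.

    k > 1: rate increases with time; k < 1: rate decreases;
    k == 1: constant (steady-state) rate equal to 1/theta.
    """
    return (k / theta) * (t / theta) ** (k - 1.0)

def expected_events(t0, t1, k, theta):
    """Expected number of events in [t0, t1]: the integral of the rate."""
    return (t1 / theta) ** k - (t0 / theta) ** k

def prob_one_or_more(t0, t1, k, theta):
    """Poisson probability of at least one event in [t0, t1]."""
    return 1.0 - math.exp(-expected_events(t0, t1, k, theta))

# Hypothetical parameters: with k = 1 the steady-state case is recovered.
steady = weibull_rate(5.0, k=1.0, theta=10.0)
waxing_early = weibull_rate(1.0, k=1.5, theta=10.0)
waxing_late = weibull_rate(5.0, k=1.5, theta=10.0)
p_window = prob_one_or_more(1.0, 2.0, k=1.5, theta=10.0)
```

Setting `k = 1` reduces the model to the constant-rate assumption, so a fitted value of `k` close to one would indicate that the age data do not support a temporal trend.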

Conclusions
Bayes' theorem is a powerful statistical tool for incorporating additional datasets. In this chapter, R/RA ratios were used in probabilistic volcanic hazard assessments applying the methodology developed by [1]. These were compared with earlier assessments in Tohoku incorporating low P velocity perturbations at 10 km and 40 km depth and geothermal gradients. Probabilities of one or more volcanic event(s) forming in Tohoku were found to be similar for both analyses, ranging from 10^-10 to 10^-9/a between clusters and 10^-5/a within clusters. The Cauchy kernel, combined with multiple datasets, successfully captures all subsequent volcanic events, including extreme events. This is particularly important when making calculations over 1 Ma, when the tectonic setting is likely to change, resulting in a potential shift of the volcanic front.
Although the Cauchy kernel appears to be overly conservative for regions east of the volcanic front, where probabilities are expected to be negligible, values there are reduced when R/RA ratios are included.