## 1. Introduction

Modern statistical and dynamic forecast models continue to demonstrate low forecast skill in identifying the onset of rapid intensification (hereafter known as RI) within tropical cyclones (hereafter known as TCs). Even though storms which rapidly intensify can cost governments billions of dollars in damage upon landfall (e.g. by destroying property through flooding–as with Katrina in 2005) and RI forecasting is considered one of the top priorities for the National Hurricane Center [1], little advancement has been made in improvement of probabilistic tropical cyclone RI forecasting. While previous studies have examined the intensification patterns between RI and non-RI TCs [2, 3], the technological impairments, coupled with the complexity of these systems, have left gaps in understanding the large-scale structures associated with RI storms [1–4]. These gaps result in poor statistical forecast model accuracy, which requires prior knowledge of relevant RI variables. While recent research shows modest improvements in RI forecasts, and global models have steadily improved in their ability to predict the large-scale environmental conditions of TCs [3], forecast skill scores still remain inadequate.

Current statistical forecast models blend both thermodynamic and kinematic variables in attempts to increase the skill, emphasizing meteorological processes deemed more crucial to RI prediction [1, 4–6]. Improvements to the Statistical Hurricane Intensity Prediction Scheme Rapid Intensification Index (SHIPS-RII) continue to be added regularly since the original implementation by the National Hurricane Center (NHC) in 2004 for the North Atlantic [1]. The latest enhanced SHIPS-RII consists of 10 predictors, including previous 12-hour intensity change, vertical shear, divergence at 200 hPa, total precipitable water, GOES-IR imagery, potential intensity, oceanic heat content, max sustained wind, and an inner-core dry air predictor [1]. Despite the addition of new predictors, Brier skill scores (BSS) relative to climatology for Atlantic RI forecasts remain below 20% [1]. Additionally, verification of all operational consensus intensity forecast models for the NHC, including their official intensity forecast, showed only limited improvement as Peirce skill scores remained below 0.2 [1]. Other studies have included predictors that resolve the inner-core environment more effectively, utilizing microwave passive imagery predictors in a probabilistic logistic regression (LR) model. Despite this effort, BSS values only improved to roughly 22% with either simulated real-time LR models or LR models utilizing reanalysis data [4]. Additionally, using a baseline peak wind speed of 25 knot intensity (at all RI thresholds) severely reduces skill to below 15% when compared to a probabilistic LR model utilizing current SHIPS parameters previously developed in [7].

In order to improve statistical model prediction of the onset of TC RI, the ability to identify distinguishing meteorological characteristics of the storm structure between RI and non-RI TCs with 24 hours lead time is crucial. While research of this nature is not new [2, 3], the approaches have differed (e.g. data selection, data reduction, meteorological variables chosen, compositing approaches). For example, Kaplan and DeMaria [2] and Kaplan et al. [8] noted that RI was more likely to occur for TCs that were situated over regions of higher than average sea surface temperature (SST), strong upper-level divergence, large low- to mid-tropospheric moisture, and weaker than average vertical wind shear [2, 8]. Other research (see [3]) also observed that RI events occurred in environments with weaker deep-layer shear (as was found in [2, 8]) and greater conditional instability in the Atlantic basin than non-RI events. This research also noted that TCs moving over a warm ocean anomaly were found to be equally likely to intensify slowly or rapidly given other assumptions are met [3], a result in contrast with the work shown in [2, 8]. While recent research suggests environmental, internal dynamic processes, and oceanic conditions [2, 3, 8] all play a role in RI, research performed in [3] concluded RI is mostly controlled by internal dynamical processes, provided a pre-existing favorable environment exists. Research performed in [4] reiterated this sentiment suggesting Atlantic basin forecasts benefit more from the inclusion of storm structure information (more than the Pacific basins), which has yet to be explained.

In an effort to continue improving the understanding of the internal dynamics of TCs undergoing RI, the current study sought to identify important diagnostic variables in the North Atlantic basin, looking not only at which levels, but also at which spatial points in proximity to the cyclone are distinguishable between the two types of systems. The primary research question being considered is: What meteorological parameters discriminate RI from non-RI storms most effectively, and what spatial location in the TC domain provide the largest differences in these fields? These findings give key insights to which variables should be used in future development of a prognostic artificial intelligence classification scheme to assist with operational forecasts of RI. Section 2 provides a description of the data and methodology, while Section 3 presents the results of the work and Section 4 provides a discussion and conclusions.

## 2. Data and methodology

### 2.1. Dataset description

While the NHC defines RI as an increase in wind of 30 knots (kt) in 24 hours, several RI definitions are usually considered during the research phase of model development [1, 4]. This study examined three separate definitions of RI (following [1]), including the operational definition of a 30kt increase of wind speed in 24 hours and two experimental definitions of 25kt and 40kt increases. All Atlantic tropical and subtropical systems, from 1985–2009 from the NHC Atlantic best track data (HURDAT–[9]) were considered. For the three different RI definitions, the full database of 298 TC events were divided into RI and non-RI groups, yielding 152 RI and 146 non-RI cases with the 25kt definition, 119 RI and 179 non-RI for the 30kt definition, and 46 RI and 252 non-RI for the 40kt definition (**Figure 1** breaks these down by Saffir-Simpson scale category). Since a forecast proxy was desired, base-state meteorological fields from the National Centers for Environmental Prediction (NCEP) Global Ensemble Forecast System (GEFS–[10]) reforecast database were retained 24 hours prior to the period of greatest intensification for all storms (RI and non-RI). GEFS reforecast data are provided at a 1° resolution at 3-hour forecast intervals from 0 to 72-hours. Three-dimensional base-state meteorological fields at eight vertical levels (1000–100 hPa) were utilized, including: geopotential height, temperature, *u* and *v* wind components, and specific humidity. Additionally, single-layer variables were considered, including mean sea level pressure (MSLP), skin temperature (a proxy for SST), latent heat flux, sensible heat flux, convective available potential energy (CAPE), convective inhibition (CIN), and vertical velocity at 850 hPa were evaluated.

As a primary goal was to diagnose RI using TC structure relative to the storm center, storm-centric GEFS reforecast domains for each cyclone were obtained. Storm centers were identified by determining the local minimum in GEFS MSLP nearest the NHC-defined TC center, 24-hours prior to the timestep associated with the greatest intensification. Each variable was retained on a 15° × 11° latitude/longitude grid centered on this domain. In the event multiple occurrences of peak intensification occurred for an individual TC (which occurred 28 times when using 25kt/24-hours, 13 times for 30kt/24-hours, and once for 40kt/24-hours), the first was chosen. Thus, the results presented herein deal with the first instance of peak intensification regardless of the frequency of peak intensification for a given TC.

### 2.2. RPCA

As the primary goal of this research was the identification of variables and spatial locations most favorable for distinguishing RI and non-RI storms, discriminatory statistical methods were needed. One method, rotated principal component analysis (RPCA), has been shown to be useful in discriminating meteorological environments of different types [11–14]. These studies also used permutation testing to evaluate magnitude differences in diagnostic variables for each environment. Both of these techniques were utilized in the current study so that both spatial configuration and magnitude difference could be assessed.

#### 2.2.1. S-mode RPCA

The first approach to RPCA, S-mode analysis [13], provided a diagnosis of the spatial relationship among gridpoints for all cases. For S-mode, the similarity matrix is computed on the individual spatial locations and is eigenanalyzed to identify particular locations that group together. The S-mode rotated principal component (RPC) loadings are maps that demonstrate these spatial relationships (known as modes of variability), with the RPC scores revealing the similarity between the individual cases and the resulting S-mode loading maps. To reduce the dimensions of the eigenvector matrix, truncation of RPCs was completed by evaluating a scree plot, as well as using a congruence test. A congruence test is a way to measure pattern and magnitude similarity of a dataset, corresponding to the cosine of the angular separation between the loadings, by maximizing the dissimilarity of the two loading patterns [15]. The congruence coefficient presenting a strong relationship for any absolute value greater than 0.81 was marked as the truncation point. RI and non-RI datasets (consisting only of base-state variables for all 298 cases) were combined, where the analysis of both RI and non-RI event deviations and the loading patterns provided information on how the systems are grouping together (e.g. cooler SSTs versus warmer, upper level trough/ridge patterns, and influence of land at the surface 24-hours prior). To demonstrate the lack of linear separability in the resulting RPCs, a pairwise scatterplot of all six PC score vectors was formulated (**Figure 2**). There is significant overlap among the RI and non-RI PC scores, rendering separation via classification very difficult, motivating the need to consider additional analysis techniques.

#### 2.2.2. T-mode RPCA

While S-mode helped reveal the difficulties in identifying relevant RI/non-RI distinguishing characteristics, the results did not provide the necessary discrimination capability of interest in this work. Recent work has shown the value of composite analysis with T-mode RPCA in identifying discriminating characteristics for different meteorological event types [11, 12]. Following the methodology of [11, 12], a T-mode varimax-rotated RPCA [11, 16], conducted simultaneously on all GEFS reforecast fields, was completed on all RI events and all non-RI events separately. T-mode contrasts S-mode in that in T-mode, the relationships between events, as opposed to spatial locations, are of interest, and thus the correlation matrix is computed on the event dimension of the data. Following methods established in [11, 12], the resulting uncorrelated eigenvector matrix and associated eigenvalues reduced to a subset of RPCs for each event type and each RI definition (**Table 1**). Similar to the S-mode RPCA approach, the truncation point was determined through utilization of a scree plot and the congruence test. The resulting RPC loadings maintain the same dimension as the event dimension, so events were clustered by RPC loading magnitude using hierarchical clustering with Ward’s minimum variance method [16]. To assess cluster quality, a cluster verification statistic (silhouette coefficient [17]) was found that includes two components:

a measure of intra-cluster spread (cluster cohesion–should be small) and

a measure of inter-cluster spread (cluster separation–should be large) [11].

In this study, the mean of the silhouette coefficient values for all events considered in the cluster analysis was retained as a measure of cluster analysis performance. With the silhouette coefficient, values approaching 1 suggest a minimization of cluster cohesion and a maximization of cluster separation. Negative values suggest a particular event was misclustered. The cluster analysis revealed six clusters each for RI and non-RI storm types using the 25kt/24-hours definition, seven non-RI and six RI using the 30kt/24-hours definition, and seven non-RI and five RI using the 40kt/24-hours definition (**Table 2** provides the number of events per cluster, as well as silhouette coefficient values). Events within each cluster were averaged together, yielding map types that retained unique synoptic-scale structures and provided more detailed map types of RI and non-RI TC environments than simply averaging all events together. The resulting composites allowed for the identification of spatial structure among RI/non-RI events.

### 2.3. Permutation testing

While the composites resulting from the RPCA approach are useful for diagnosing spatial characteristics within RI and non-RI environments, magnitude differences are diagnosed more effectively using hypothesis testing. In this study, permutation tests [16] comparing magnitudes of diagnostic fields in RI and non-RI storms were utilized at each gridpoint from the study domains, yielding a spatial map of significance values associated with each variable tested. The resulting plots provided specific regions in the study domain where statistically significant magnitude differences between RI and non-RI storms existed for individual GEFS reforecast variables. These results provided insight not only into the scope of these magnitude differences but into the spatial locations of the differences, which complement the RPCA results well.

## 3. Analysis

Based on previous work done [12], variables selected for evaluation consisted of base-state variables and derived variables as listed in Section 2.1. Through the S-mode technique, only base-state variables were examined; therefore, only notable characterizations are summarized in the text below. The T-mode and cluster analysis technique; however, yielded numerous composite fields for consideration. To minimize this impact, the cluster from each RI group and each RI definition that contained the largest number of events (bolded in **Table 2**) is provided for discussion below.

### 3.1. RI and non-RI S-mode maps

As stated previously, S-mode analysis of the base-state meteorological fields was conducted first. **Table 1** shows that RPC1 contained the largest variance explained (roughly 24%), and the loading map (**Figure 3**) revealed that areas of higher heights in the northeast quadrant quadrant and over the storm center co-varied with higher MSLP, lower temperatures, lower moisture and latent heat content. A total of 16 events’ RPC scores (**Table 3**) exceeded 2 standard deviations above the mean, suggesting strong positive correlation between those events and RPC1. Of these 16 events, six were classified as RI cases with the 25kt/24-hour definition of RI. In fact, the highest positive deviation (approximately 5 standard deviations above the mean) was an RI case, while the second largest positive deviations were a mixture of RI and non-RI events. These results suggest a blend of RI and non-RI events for RPC1. Similarly, RPC2 results (which explained approximately 12% of the variability) revealed lower heights in proximity of the storm center (**Figure 4**) co-varied with lower MSLP, cooler low-level temperatures in the southwest quadrant of the cyclone and overall higher specific humidity and latent heat flux values. Additionally, patterns revealing a wrap-around of moisture over the storm-center were revealed. However, only three of the eight events that exceeded 2 standard deviations from the mean were RI events, again showing the blending of RI and non-RI cases in these results. RPC3 (which explained approximately 7% of the total variance), exhibited higher heights co-located with higher MSLP, temperature, specific humidity, and latent heat flux over the storm center and to the south, but lower heights, temperature, specific humidity, and latent heat flux North of the storm center (**Figure 5**). This RPC profile, along with RPCs 4–6 (not shown), is indicative of baroclinic environmental influence associated with both storm types. These first three RPCs, explaining over half of the variability combined, demonstrated a recurring problem in the S-mode analysis, namely the inability to separate RI and non-RI events (as was seen in **Figure 2**).

S-mode score extremes (±) 2 SDs above mean | ||||
---|---|---|---|---|

+2 SD RI | −2 SD RI | +2 SD non-RI | −2 SD non-RI | |

RPC 1 | 6 | 0 | 10 | 0 |

RPC 2 | 3 | 4 | 5 | 4 |

RPC 3 | 1 | 5 | 0 | 8 |

RPC 4 | 3 | 4 | 4 | 5 |

RPC 5 | 2 | 7 | 4 | 4 |

RPC 6 | 1 | 6 | 2 | 2 |

Through this analysis, the inherent difficulty of classifying RI and non-RI storms is apparent, as the base-state fields considered seem to be equally present in RI and non-RI events for all RPCs. Despite this, the results suggest some modest classification ability of RI and non-RI events through the temperature and moisture patterns, as well as variables more indicative of environmental interaction (e.g. vorticity and static stability) as the largest influences on TC RI processes.

### 3.2. RI and non-RI T-mode composites

T-mode composites of the base-state meteorological fields, as well as derived fields including: divergence, relative vorticity, vertical speed and directional wind shears (see [18] for clarification on the difference), equivalent potential temperature, and static stability (as defined in [19]) were formulated next. The analysis below is broken down by variable.

#### 3.2.1. Geopotential height and mean sea level pressure characteristics

The map types for RI and non-RI systems revealed a better lower to mid-level structure, with lower heights for a larger radius overall for RI systems. This suggests the RI core is physically distinct from its surrounding environment. In general, for all RI definitions at all height levels, the highest heights are in the northeast quadrant quadrant of the composites, with low-levels for all RI definitions also exhibiting higher heights around the core, indicative of deeper convection (**Figure 6**). In the mid-levels and low-levels for 30kt/24-hours (four out of the six RI clusters) and 40kt/24-hours (two out of the five RI clusters) definitions, all of the RI clusters contain lower heights over the storm core for a larger radius. Map types for MSLP reveal instances when RI composites exhibit a smaller diameter of lower MSLP over the storm center (cluster 6 for RI using 30kt/24-hours and cluster 2 for 40kt/24-hours) with tighter gradients. Comparing these results to non-RI composites, three of the seven clusters maintain a uniform appearance (30kt/24-hours) or even mirror a traditional midlatitude trough/ridge pattern (in one non-RI map type). It is important to note that two non-RI cases using the 40kt/24-hours definition had larger regions of lower MSLP, which is explainable given the frequency of strong (category 3 or 4) non-RI storms associated with this RI definition (12% of the non-RI dataset). Regardless, the dominant pattern among all clusters shows a tighter gradient in low-level geopotential height and MSLP surrounding the TC core in RI systems. Permutation testing revealed a magnitude difference most apparent with MSLP composites for all RI definitions, where RI cases are exhibiting a statistically significantly larger radius of lower heights and pressures than for the non-RI systems, especially for 25kt and 30kt definitions. These results are supported by permutation test results for geopotential heights, which reveal the storm center in the low- and mid- levels as statistically significant at the 95% level in distinguishing between RI and non-RI storms (**Table 4**). It is also notable that the region of significance for MSLP increases as the wind definition increases. In other words, the 40kt/24-hours has the entire permutation map exhibiting statistical significance suggesting geopotential heights are more distinct; however, this could be an artifact of the 40kt definition containing category 4 and 5 storms making up 70% of the dataset versus non-RI containing at most category 4 (4%).

#### 3.2.2. Thermodynamic characteristics

Specific humidity (25kt/24-hours and 30kt/24-hours) throughout the atmospheric profile contain larger magnitudes (see **Table 5**) for a greater diameter around the storm center and in the northeast quadrant quadrant for RI cases. RI TCs also contain maximum magnitude over the storm center in the mid- and upper- levels, or in the northeast quadrant quadrant, compared to non-RI cases which see a shift of the maximum magnitude towards the ENE region for 25kt/24-hours and 30kt/24-hours definition (**Figure 7**). Cross sections show drier air infiltrating through the inflow regions of the non-RI storm (west side of latitudinal cross section for 25kt/24-hours definition) compared to a more even distribution for RI clusters on either side of the storm center (**Figure 8**).

Equivalent potential temperature (*θ _{e}*) fields show similar magnitudes among RI and non-RI cases, although the radius of maximum

*θ*is larger with the RI map type, suggesting the potential energy over the storm center is the important feature here. Additionally, the

_{e}*θ*field is largely symmetric around the storm center for RI map types (

_{e}**Figure 9a**). However, for the non-RI using the 25kt/24-hours definition, the

*θ*field is non-symmetric, instead showing a tilted core in the composite fields (

_{e}**Figure 9b**). This tilt suggests a cutting off of the moisture source over the storm center, especially in the mid- and upper- levels. These results do not hold up as well for the 30kt/24-hours and 40kt/24-hours RI definitions, as there are non-RI composites which show symmetric latitudinal

*θ*cross sections. While the more intense non-RI TCs have clustered together, distinguishing them from the non-RI group itself, it hinders classification ability using these RI definitions. Permutation tests revealed that at all pressure levels, for all RI definitions, the region directly over the storm center, as well as the inflow region for 25kt and 30kt definitions, is statistically significant at the 95% level in discriminating RI versus non-RI systems.

_{e}Static stability at 500 hPa, on average revealed magnitudes were approximately the same for all definitions between RI and non-RI storms. However, RI clusters, which contained stronger TCs (i.e. category 4 and 5s), had a closed off maximum static stability center over the core of the storm for all RI definitions (**Figure 10**). Permutation tests confirm stability in the mid-levels as statistically significant in discriminating between RI and non-RI systems for nearly the entire storm domain. Notably, for the 40kt/24-hours definition, the storm center in the low-levels was also significant, but is likely a result of higher magnitudes (i.e. category 5 cases) for these RI events.

#### 3.2.3. Kinematic characteristics

The first kinematic field considered was upper-level (200 hPa) divergence. Divergence (25kt/24-hours and 30kt/24-hours) showed RI and non-RI clusters similar in both magnitude and region of greatest divergence in the northeast quadrant quadrant of the systems (**Figure 11**); however, the 40kt/24-hours definition revealed RI systems had 30% larger magnitude near the storm center and in the northeast quadrant quadrant. RI cases, for all definitions, tended to have a larger coverage area of the composite exhibiting divergence, despite the similarities in the spatial orientation of the divergence on the composite maps. Permutation tests supported the conclusion that divergence magnitude (**Table 6**), rather than spatial orientation, was the distinguishing characteristic between RI and non-RI storms at a 95% significance level.

For relative vorticity, using both the 25kt/24-hours and 30kt/24-hours definitions, positive vorticity is noted in three out of the six RI clusters in proximity to storm center in the upper levels, which is notably absent from non-RI cluster map types. For the 40kt/24-hours definition, three out of five RI map types exhibited this feature as well, while only two out of seven non-RI map types showed the same positive vorticity area (**Figure 12**). Vorticity magnitudes were larger for RI TCs with all map types (at all levels) and definitions, and also the vorticity gradient near the center was steeper within the RI system versus the non-RI. The only exception was at 700mb, in which vorticity features are similar in both RI and non-RI. Permutation tests show that over the storm center, for all three pressure levels, all RI definitions, had a 95% level of significance in distinguishing RI from non-RI cases. Notably, the area of statistical significance around the storm center is larger in the mid- levels.

Map types of vertical speed shear (850–200 hPa – **Figure 13**), thought to be undesirable for RI to occur, revealed weaker 200 hPa winds than the 850 hPa winds within the RI. This is indicative of a closed off environment around the core for the RI systems to a greater degree than the non-RI. Permutation tests revealed all but the southwest quadrant to be statistically significant at the 95% level at discriminating RI from non-RI systems for all RI definitions.

#### 3.2.4. Non-significant variables

CAPE, CIN, vertical velocity at 850 hPa, latent heat flux, sensible heat flux, static stability at 850- and 700-hPa, and skin temperature composites, while all examined, did not reveal meaningful differences with regards to spatial orientation or magnitude for distinguishing between RI and non-RI cases. While some of the magnitudes were greater for RI clusters containing stronger systems (i.e. category 4 and 5 TCs), other non-RI clusters exhibited similar magnitudes which consisted mainly of tropical storm strength systems. Latent heat flux for example, revealed that for the 25kt/24-hour definition, more latent heat flux was available throughout the inflow region and around the core of the RI cases. However, with the 30kt/24-hour and 40kt/24-hour definitions, the main distinguishing feature seemed to only be higher magnitudes throughout the atmospheric profile. Otherwise, permutation tests surprisingly revealed the NW and northeast quadrant quadrant of the maps for all RI definitions as statistically significant at the 95% level for both CAPE and skin temperature. This is attributed to land influences of some TCs which were in proximity to land when the greatest intensification occurred. Results confirmed a lack of statistical significance in discriminating RI from non-RI with CIN, but confirmed a decent discrimination of magnitude with latent heat flux and sensible heat flux (**Table 5**). However, again, these results are likely being influenced by the proximity to land of some TCs, which would affect 1000 hPa level results for these variables.

Overall, the T-mode analysis revealed the discriminating spatial and magnitude differences between RI and non-RI storms. As suspected through S-mode analysis, moisture and surface temperature patterns, as well as variables indicating environmental influence including geopotential heights in the upper levels, relative vorticity, divergence, and static stability in the mid-levels had the largest influences on TC RI processes.

## 4. Conclusion

Distinguishing meteorological characteristics of RI and non-RI storm structure is critically important in order to improve statistical model prediction of the onset of RI. This research made efforts to continue improvement in identifying relevant large-scale internal dynamics of TCs undergoing RI in the North Atlantic basin, specifically noting important diagnostic variables in three-dimensional space. Base-state, as well as composite derived, meteorological parameters were evaluated through both S-mode and T-mode RPCA for three RI definitions. Specifically with T-mode, hierarchical cluster analysis techniques were used to formulate map types for RI and non-RI systems. To understand the internal dynamics within these complex systems, variables examined included: geopotential heights, temperature, *u* and *v* wind components, specific humidity, MSLP, CAPE, CIN, latent heat flux, sensible heat flux, surface temperature, vertical velocity at 850 hPa, divergence at 200 hPa, relative vorticity, vertical directional and speed shear, equivalent potential temperature, and static stability.

S-mode analysis results demonstrated the difficulty of establishing characteristic attributes for classifying RI and non-RI storms, as the base-state fields considered were equally present in RI and non-RI events for all RPCs. Two of the six RPC groups contained cases that were indicative of strong, well-structured TCs, exhibiting what you would expect for a sustainable environment for TC continuation and strengthening, regardless of storm type. Whereas, four of the six RPC groups contained cases influenced by baroclinic environmental effects on TCs, which further aids in the positive/negative aspects of environmental influence on TCs. While stronger outflow can lower stability, enhancing the outflow, it can also be detrimental to a TC [20, 21]. Despite this, results indicated a modest classification ability of RI and non-RI events through the temperature and moisture patterns, as well as those variables that would be more indicative of environmental interaction.

T-mode analysis, on the other hand, revealed several important distinguishing spatial features between RI and non-RI systems. Most notably:

Geopotential heights and MSLP in the low- and mid- levels were statistically significantly different between RI and non-RI systems for all RI definitions, where RI systems maintained lower heights directly over the core and higher heights around the core and in the northeast quadrant quadrant for a larger radius. These results suggest deeper convection for RI systems, re-emphasizing the importance of convective processes around the core of a TC [2, 4, 22, 23].

Specific humidity throughout the atmospheric profile contained larger magnitudes for a greater diameter around the storm center, as well as in the northeast quadrant quadrant for RI cases (which is in agreement with geopotential heights). Specifically, for the mid- and upper- levels, non-RI cases exhibited a shift of the largest specific humidity magnitudes towards the ENE region and cross sections suggested drier air infiltrating non-RI cases within the inflow regions of the storm for the 25kt/24-hours definition. This was in contrast to a more uniform horizontal and vertical moisture distribution in RI storms.

Equivalent potential temperature for all RI definitions gave similar results to specific humidity, where the focus of the potential energy was over the storm center for RI cases. Non-RI cases instead exhibited a shift towards the east-northeast (creating the tilted appearance in the composites). This lack of symmetry in non-RI storms for the 25kt/24-hours definition, suggests a cutting off of the moisture and heat source over the storm center, especially in the upper-levels. This region over the storm center was significant in discriminating and indicates enhanced eyewall convection and TC intensity [23].

Static stability for all RI definitions revealed RI systems were statistically significantly more stable over the storm center in the mid-levels than non-RI systems. This result implies a resistance to upward vertical motion, forcing subsidence over the storm center [5, 20], but also resistance to adverse effects of the environment, such as vertical shear and Rossby penetration depth, preventing tilting of the TC and allowing for maintenance of the vertical thermal structure [24, 25].

Divergence at 200 hPa (25kt/24-hours and 30kt/24-hours) showed RI and non-RI cluster composites similar in both spatial location and magnitude of greatest divergence in the northeast quadrant quadrant for the two types of systems. However, RI composites tended to have a statistically significantly larger magnitude.

Relative vorticity at 200 hPa for the 25kt/24-hours and 30kt/24rs revealed three RI clusters contained an upper-level area of positive vorticity over the storm center. This feature only appeared for the 40kt/24-hours definition for non-RI systems (which is attributed to occurrence of category 4 TC events). Throughout the mid- and upper- levels, RI cases had significantly higher magnitudes over the storm center, indicating a stronger spin.

Shear, often thought to hinder TC intensification by creating asymmetry in eyewall convection resulting in a loss of the warm core at upper levels through tilting [24–27], revealed vertical speed shear had a much larger area of statistical significance in discriminating RI from non-RI systems for all RI definitions, compared to vertical directional shear currently used in operational forecasts [1].

CAPE and skin temperature did not reveal any distinguishing feature between RI and non-RI cases through composite analysis. However, permutation tests suggested the NW and northeast quadrant quadrant of the maps for all RI definitions as statistically significant at distinguishing between storm types for both. Latent heat flux, fundamental in the maintenance of convection and increasing kinetic energy [23, 28], showed that RI systems have higher magnitudes for a larger area over the core, for all RI definitions, and throughout the inflow region for the 25kt/24-hours and 30kt/24-hours definitions. However, land masses could be influencing results at the 1000 hPa pressure level for these three, and the other surface variables; therefore, distinguishing whether these fields are different among RIs and non-RIs remains unclear.

While results of the RPCA analysis confirm previous findings such as the importance of moisture supply, stability within the core, and stronger relative vorticity for RI systems, it also argues against research findings suggesting magnitude is the main distinguisher between RI and non-RI events [29]. Results presented suggest the symmetry of the equivalent potential temperature and specific humidity profiles throughout the atmospheric column, as well as the storm-centered placement of these variables, and stability, directly over the inner-core (instead of shifted to the east-northeast as with several non-RI composites given lower RI definitions) are significant in discrimination of these event types. While there were some shortcomings, such as proximity to land potentially influencing results in the low levels and the inability to fully resolve the inner-core due to model resolution, the results provide a framework of diagnosis for RI processes within TCs. This framework, combined with an improved statistical modelling scheme, will ideally be of use for improving TC intensity forecasts in operational meteorology.