Chapter 4

Testing the generalization of the potential height framework

Abstract. Tropical cyclones (TCs) generate extreme storm surges that pose a significant and growing threat to coastal communities, a risk amplified by climate change. Identifying worst-case surge scenarios is computationally prohibitive using traditional simulation methods. This Chapter builds upon the new Bayesian optimization (BO) framework designed to efficiently discover TC trajectories that produce maximum storm surge heights from Chapter 3, known as the potential height. We present results from applying this framework to three distinct US coastal locations (New Orleans, Galveston, and Miami), demonstrating its ability to identify physically consistent worst-case scenarios under current and future climate conditions (calculated using the potential intensity and PI potential size from Chapter 2). Furthermore, we conduct sensitivity analysis of Gaussian Process (GP) kernel choices, a critical component of the BO surrogate model, which shows that the Matérn kernel chosen in Chapter 3 provides skillful predictions. Our findings broadly indicate that the choices made in Chapter 3 are sensible, that the random error in the BO results is relatively small, and that the BO framework generalizes reasonably well geographically.

4.1 Introduction

Tropical cyclones (TCs) are a primary driver of catastrophic storm surges, which can lead to extensive flooding and devastation in coastal regions (Resio and Westerink 2008). In a warming climate, the characteristics of TCs are changing (see e.g. Chapter 2), with projections indicating increases in average intensity, rainfall rates, and the proportion of the most severe storms (Knutson et al. 2020; GFDL 2024). Coupled with sea-level rise, these changes amplify the threat of coastal inundation, making the accurate assessment of potential storm surge heights a critical task for disaster preparedness and infrastructure planning (Oppenheimer et al. 2019).

An interesting challenge in hazard assessment is identifying the worst-case storm surge for a given location. This requires both considering the practical thermodynamic limits on TCs, and exploring a high-dimensional parameter space of how the TC characteristics, including track, intensity, size, and forward speed will interact with the complex geometry at the coast. High-fidelity hydrodynamic models like ADCIRC (Luettich et al. 1992) can numerically simulate surge events, but their computational expense makes a brute-force exploration of this parameter space intractable (Toro et al. 2008).

To address this challenge, Thomas et al. (2025, Chapter 3) introduced a framework for finding the potential height of storm surges using Bayesian optimization (BO), a sample-efficient global optimization technique for expensive black-box functions (see e.g.  Shahriari et al. 2015; Brochu et al. 2010), while assuming the TC potential intensity and PI potential size produced through thermodynamic arguments (see Chapter 2). This framework not only provides a robust estimate of the upper bound on surge height but also improves the estimation of high-return-period events (e.g., 1-in-100 or 1-in-500 year events). The BO approach iteratively builds a probabilistic surrogate model of the surge response and uses an acquisition function to intelligently select the next set of TC parameters to numerically simulate, efficiently converging on the most impactful storm scenarios (see Ide et al. 2024 for contemporaneous work not using thermodynamic TC bounds).

This Chapter presents key results from this framework and investigates the sensitivity of its machine learning components, and a demonstration that it can generalize geographically. In Section Section 4.2.1, we detail the BO methodology and define the metrics of interest. In Section Section 4.3, we showcase its application to New Orleans, Galveston, and Miami and present a comparative analysis of Gaussian Process kernels for modeling the surge response. Finally, in Section Section 4.4, we outline critical areas for future work to refine and extend this approach.

4.2 Methods

4.2.1 A Bayesian Optimization Framework for Storm Surges

Our methodology treats the high-fidelity ADCIRC storm surge model (Luettich et al. 1992) as an expensive black-box function whose inputs are the parameters of an idealized TC. The goal of the BO algorithm is to find the input parameters that maximize the peak storm surge at a specified observation point, which is the closest point on the mesh to a city of interest (New Orleans, Miami, or Galveston).

The TC parameters we vary, \(x\), define a three-dimensional search space (see Table 3.2 for a summary of all framework parameters): the landfall point relative to the observation location, \(c \in \left(-2^{\circ}, 2^{\circ}\right)\); the bearing of approach, \(\chi\), constrained to approach from the ocean1; and the translation speed, \(V_t \in \left(0,\; 15 \right) \text{ m s}^{-1}\). The TC’s wind and pressure profiles follow the model of Chavas et al. (2015), with intensity and size fixed at their potential limits for the given climate conditions (see Section Section 4.2.3).

As in Chapter 3, the BO process begins by initializing a dataset with 25 points selected via Latin hypercube sampling (McKay et al. 1979) to ensure good initial coverage of the parameter space. Subsequent points are chosen sequentially by maximizing an acquisition function. We use the ADCIRC model (Luettich et al. 1992) to numerically simulate the storm surge for the observed point, and assume that the maximum sea surface height at the observed point has a small Gaussian noise \(\sigma_{\varepsilon}\), and define \(f(x)\) as the negative of the maximum storm surge height at that point during the simulation. Since standard BO libraries are formulated as minimisation problems, we negate the surge height so that minimising \(f\) is equivalent to maximising the storm surge. The problem setup is currently as follows:

\[\begin{align} &\text{\textbf{Objective:}}\quad x^{i}=\arg\min_{x\in\mathcal{X}\subset\mathbb{R}^{d}} f(x), \qquad f:\mathcal{X}\!\to\!\mathbb{R}\;\text{deterministic}. \\& \text{\textbf{Prior on }}f:\quad \hat{f} \sim \mathcal{GP}\!\bigl(0,k_{\mathrm{M52}}\bigr)\\& k_{\mathrm{M52}}(x,x') =\sigma_f^{2}\!\Bigl(1+\tfrac{\sqrt5 r}{\ell} +\tfrac{5r^{2}}{3\ell^{2}}\Bigr) \exp\!\Bigl(-\tfrac{\sqrt5 r}{\ell}\Bigr),\; r=\lVert x-x'\rVert_2. \\& \text{\textbf{Data (noisy):}}\quad \mathcal{D}_n=\{(x_i,y_i)\}_{i=1}^{n},\quad y_i = f(x_i) + \varepsilon_i,\; \varepsilon_i\sim\mathcal{N}(0,\sigma_\varepsilon^{2}). \\& \text{\textbf{Posterior surrogate:}}\quad \hat{f} \mid \mathcal{D}_n \sim \mathcal{GP}\!\bigl(\mu_n(\cdot),k_n(\cdot,\cdot)\bigr) \\& \text{\textbf{Acquisition (Max-Value Entropy Search):}}\notag\\& \alpha_{\mathrm{MES}}(x)= H\!\bigl[\hat{f}^{i}\mid\mathcal{D}_n\bigr] \;-\; \mathbb{E}_{\,y(x)\mid\mathcal{D}_n}\! \Bigl[ H\!\bigl[\hat{f}^{i}\mid \mathcal{D}_n\cup\{(x,y(x))\}\bigr] \Bigr],\\& \hat{f}^{i}=\min_{x\in\mathcal{X}}\hat{f}(x),\quad x_{n+1}=\arg\max_{x\in\mathcal{X}}\alpha_{\mathrm{MES}}(x). \end{align}\](4.1)

Here \(\mathcal{X}\subset\mathbb{R}^{d}\) is the bounded search space and \(f:\mathcal{X}\!\to\!\mathbb{R}\) is the objective we want to minimize. The random field \(\hat{f}\mid\mathcal{D}_{n}\sim\mathcal{GP}(\mu_{n},k_{n})\) is the posterior surrogate model; its mean \(\mu_{n}(x)\) is a point predictor, while the covariance \(k_{n}(x,x')\) (specified here by the Matérn-5/2 kernel \(k_{\mathrm{M52}}\) with signal variance \(\sigma_{f}^{2}\), length-scale \(\ell\) and pairwise distance \(r=\lVert x-x'\rVert_{2}\)) encodes remaining uncertainty. The data set \(\mathcal{D}_{n}=\{(x_{i},y_{i})\}_{i=1}^{n}\) contains the evaluations \(y_{i}=f(x_{i})\), assumed to have a small random noise. For any sample path of the GP, \(\hat{f}^{i}=\min_{x\in\mathcal{X}}\hat{f}(x)\) is its (global) minimum value; thus \(\hat{f}^{i}\) is a random variable. The Max-Value Entropy Search acquisition \(\alpha_{\mathrm{MES}}(x)\) measures the expected information gain about \(\hat{f}^{i}\) obtained by evaluating \(f\) at \(x\); the next experimental design point is \(x_{n+1}=\arg\max_{x}\alpha_{\mathrm{MES}}(x)\).

4.2.2 Metrics

We evaluate the performance of the BO framework using two main categories of metrics: convergence to the optimal surge height and predictive accuracy of the GP surrogate model.

Convergence to the Optimal Surge Height

We track the maximum surge height found as a function of the number of ADCIRC simulations. This metric assesses how quickly the BO algorithm converges to the worst-case scenario.

Robustness Across Random Seeds

To ensure the reliability of the BO results, we repeat each experiment with multiple random seeds and report the mean and standard deviation of the maximum surge heights found.

Approximate Simple Regret

The approximate simple regret, \(r_i\), quantifies the suboptimality of the best point found after \(i\) iterations. It is defined as the difference between the function value of the best recommendation so far, \(x_n^{\text{best}}\), and the true minimum function value, \(f(x^*)\), (estimated by the lowest value found over all experiments), \[\begin{equation} r_n = f(x_i^{\text{best}}) - f(x^*), \end{equation}\](4.2) where \(x_n^{\text{best}} = \arg\min_{x_i \in \{x_1,\; \dots,\; x_n\}} y_i\). This quantifies the efficiency of the optimization process, and a lower simple regret indicates a more efficient optimization process.

Predictive Accuracy of the GP Surrogate

We assess the surrogate model’s accuracy against a test set, which evaluates how well the GP captures the underlying function.

Coefficient of determination (\(r^2\))

The \(r^2\) score indicates the proportion of the variance in the true surge heights that is predictable from the GP model. An \(r^2\) of 1 indicates a perfect fit. \[\begin{equation} r^2 = 1 - \frac{\sum_{i=1}^{N_{\text{test}}} (y_i - \mu_n(x_i))^2}{\sum_{i=1}^{N_{\text{test}}} (y_i - \bar{y})^2} \end{equation}\](4.3) where \(\bar{y} = \frac{1}{N_{\text{test}}} \sum_{i=1}^{N_{\text{test}}} y_i\) is the mean of the true surge heights in the test set.

Root Mean Square Error (RMSE)

The RMSE measures the standard deviation of the prediction errors, providing a measure of the surrogate’s overall predictive performance in the units of the target variable (meters), where an RMSE of 0 indicates a perfect fit. It is defined as, \[\begin{equation} \text{RMSE} = \sqrt{\frac{1}{N_{\text{test}}} \sum_{i=1}^{N_{\text{test}}} (y_i - \mu_n(x_i))^2}, \end{equation}\](4.4) where \(y_i\) is the true surge height for the \(i\)-th test point, and \(\mu_n(x_i)\) is the posterior mean prediction from the GP for the \(i\)-th test point.

Mean Predictive Log-Likelihood

The mean predictive log-likelihood of the model on the test set is evaluated, given the model parameters, so that we can assess how well the model fits probabilistically. For a single point, the predictive log-likelihood is defined as, \[\begin{equation} \mathcal{L}_i= \log p\!\bigl(y_i \mid x_i,\mathcal{D}_n\bigr) =-\tfrac12\Bigl[ \log\!\bigl(2\pi\bigl(\sigma_n^{2}(x_i)+\sigma_{\text{obs}}^{2}\bigr)\bigr) + \frac{\bigl(y_i-\mu_n(x_i)\bigr)^2} {\sigma_n^{2}(x_i)+\sigma_{\text{obs}}^{2}} \Bigr]. \end{equation}\](4.5) where \(\sigma_n^2(x_i)\) is the variance of the GP surrogate model at \(x_i\), \(\sigma_{\text{obs}}^2\) is the fitted observation noise, and \(\mu_n\) is the mean of the model at the test point. The predictive log-likelihood is a measure of how well the model fits the data, with higher values indicating a better fit. We then average over all test points to get the final score, \(\bar{\mathcal{L}}\).

4.2.3 Potential Intensity and Size for US Coastal Cities

We choose three coastal cities in the United States with distinct geographic and climatic characteristics: New Orleans, Louisiana; Galveston, Texas; and Miami, Florida. These locations are selected to represent a range of coastal environments, from the semi-enclosed basin of New Orleans to the open coastlines of Galveston and Miami. Each city has an infamous history of significant storm surge events, making them ideal candidates for testing the BO framework. We use the same ADCIRC mesh as in Chapter 3, with a resolution of approximately 1 km near the coast and a coarser resolution offshore. As this mesh covers all three locations, we reuse it in each case. The mesh includes detailed bathymetry and topography data to accurately capture the coastal geometry and its influence on storm surge dynamics.

The potential intensity (PI), \(V_p\) and the PI potential size (PS) that is possible at that intensity2 of TCs for each location are calculated using the methods outlined in Chapter 2. For the 2015 and 2100 climate conditions, we used the r4i1p1f1 member of CESM2 model from the SSP5-8.5 scenario, taking data from the August monthly average, selected at the closest ocean grid point to each city after the data had been bilinearly regridded to a 0.5\(^{\circ}\) grid using CDO. The PI and PS values for each location and time period are summarized in Table 4.1. As is expected, the effect of global warming (as indicated by the increasing sea surface temperature) leads to an increasing potential intensity and potential size.

Potential Intensity and Size for US Coastal Cities for 2015 and 2100. The potential intensity (PI) and PI potential size (PS) of tropical cyclones for coastal cells centered near New Orleans (90.25\(^{\circ}\)W, 29.25\(^{\circ}\)N), Galveston (94.25\(^{\circ}\)W, 28.25\(^{\circ}\)N), and Miami (79.75\(^{\circ}\)W, 25.75\(^{\circ}\)N) under 2015 and projected 2100 climate conditions. SST is the sea surface temperature at the nearest ocean grid point to each city. \(V_p\) is the potential intensity of the gradient level wind, \(r_3\) is the PI potential size’s radius of maximum wind, and \(r_{a3}\) is its radius of vanishing wind. The increase for each variable for each place is also shown. The increase in SST is calculated in degrees Kelvin, to account for temperature’s absolute scale.
Location Year SST [\(^{\circ}\)C] PI \(V_p\) [m s\(^{-1}\)] \(r_3\) [km] \(r_{a3}\) [km]
New Orleans 2015 31.3 105.0 24.4 1500
2100 34.8 111.4 28.4 1743
\(\Delta\) 0.9% 6.1% 16% 16%
Galveston 2015 30.5 94.9 29.8 1590
2100 34.3 105.0 31.9 1811
\(\Delta\) 1.3% 10.6% 7% 14%
Miami 2015 30.4 96.8 31.5 1727
2100 33.5 99.5 36.0 1928
\(\Delta\) 1.0% 2.7% 14% 12%

4.3 Results

4.3.1 Application to US Coastal Cities

We first present a summary of the results of applying the BO framework to find the worst-case storm surges for New Orleans, Galveston, and Miami using TC potential intensity and size for August 2015 climate conditions (with a summary of the same procedure applied to 2100 in Table 4.2). The results, shown in Figures 4.2, 4.3, and 4.4, reveal distinct optimal TC trajectories tailored to each local geography. They are shown at the moment of maximum storm surge height for the observation point, i.e. a snapshot of the numerical simulation at which the potential height found is realized. The dashed black line on each shows the straight line trajectory of the TC, the green arrow shows the direction of travel, and its size is proportional to the translation speed. The blue dot is the observation point, the red dot is the city itself, and the black dot is the “impact point" which is the point at which the TC goes through at a constant time in each numerical simulation. The colormap shows the storm surge height, and the black quivers show the 10m wind vectors.

For New Orleans (Figure 4.2), located within a semi-enclosed bay system, the worst-case surge of 15.8 m is produced by a very slow-moving TC (\(V_t=1.23 \text{ m s}^{-1}\)). This slow speed allows the cyclonic winds to efficiently pile water into the bay. In contrast, for Galveston and Miami (Figures 4.3 and 4.4), which are on open coastlines, the optimal trajectories involve much faster translation speeds (\(V_t=13.9 \text{ m s}^{-1}\) and \(V_t=13.1 \text{ m s}^{-1}\), respectively). This is consistent with previous findings that faster-moving storms tend to produce higher surges on open coasts (Lockwood et al. 2022). In all cases, the optimal track positions the northeast quadrant of the storm, typically the region of strongest winds and surge generation, travelling over the observation point.

Repeating the experiments under a high-emissions 2100 climate scenario (SSP5-8.5) consistently yields higher potential heights (Table 4.2), highlighting the framework’s utility for climate change impact assessment. It is interesting to note that Galveston’s largest relative potential height increase of 21.7% corresponds to a 10.6% increase in potential intensity and a 14% increase in PI potential outer size (Table 4.1). Whereas, Miami’s much smaller relative potential height increase of 7.1% corresponds to a lower potential intensity increase of 2.7% and a lower PI potential outer size increase of 12%. New Orleans, with its very high storm surge sensitivity (caused both by the bay and the large area of shallow water around it) has the largest absolute increase in potential height of 2.4 m, but an intermediate relative potential increase of 15.1% corresponding to its intermediate potential intensity increase of 6.1% and PI potential outer size increase of 16%. Notably, the optimal trajectories are broadly unchanged between 2015 and 2100: only the surge magnitude increases while the track parameters remain similar. This suggests that the basic geometry of the coastline, rather than the intensity or size of the storm, is the primary determinant of the worst-case trajectory. If the assumptions of a fixed coastline and sea level were relaxed to include sea-level rise and coastal evolution, the optimal trajectory could differ between the two periods.

New Orleans (15.8 m): \(V_t=1.23\text{ m s}^{-1}\), \(\chi=-80.0\;^{\circ}\), \(c=-0.02\;^{\circ}\)E.
Galveston (7.8 m):
\(V_t=13.9\text{ m s}^{-1}\), \(\chi=-6.55\;^{\circ}\), \(c=-0.24\;^{\circ}\)E.
Miami (5.6 m):
\(V_t=13.1 \text{ m s}^{-1}\), \(\chi=-33.3\;^{\circ}\), \(c=-0.34\;^{\circ}\)E.
Location 2015 2100 \(\boldsymbol{\Delta}\)
New Orleans 15.8 m 18.2 m 2.4 m (15.1%)
Galveston 7.8 m 9.5 m 1.7 m (21.7%)
Miami 5.6 m 6.0 m 0.4 m (7.1%)
Summary of potential heights for 2015 and 2100 (SSP5-8.5), with inputs from Table 4.1.
A summary of the results of extending the PotentialHeight framework along the coast. Panels (a)-(c) show snapshots from the ADCIRC simulation of the worst-case TC trajectory found for each location for August 2015 conditions corresponding to the potential height. The colormap indicates sea surface height, and quivers represent 10m wind vectors. The blue dot is the observation point. The large green arrow shows the trajectory direction and speed of the TC, starting at the centre of the TC. The dashed black line shows the trajectory of the TC over time. Panel (d) summarizes the increase in potential height from 2015 to 2100 in the SSP5-8.5 climate scenario.

4.3.2 Investigating random error in solution

To assess the reliability of finding the potential height and its corresponding optimal trajectory from Figure 4.1, we use the Bayesian optimization framework to find the potential height for different points on the US coastline, repeating each experiment three times with different random seeds, and for both 2015 and 2100 conditions. Using different random seeds results in distinct initial sets of 25 Latin hypercube samples (LHS) and different initial values for the GP’s parameters. The results are shown in Figure 4.5 for New Orleans (and in the Appendix Section Section 4.7, Figures 4.28 and 4.29 for Galveston and Miami respectively). The blue line shows the largest potential height (the highest storm surge height found so far) for 2015 during that Bayesian Optimization experiment, and the red shows the same for 2100 conditions. The potential height found is consistently higher for 2100 than 2015, as would be expected by the increase in both the size and intensity of the imposed idealized cyclone (see Table 4.1). Note that these potential heights do not include mean sea-level rise, which under SSP5-8.5 could add a further 0.5–1 m or more by 2100 (Oppenheimer et al. 2019), so the actual flood heights in a future climate would be higher still. The optimal trajectories do not seem to be significantly different between the two Augusts. Figure 4.5 shows that for one trial the highest value is found after a single additional Bayesian Optimization point has been numerically simulated, whereas for other trials they are still finding higher values for each of the additional 25 Bayesian optimization samples.

Figures 4.6, 4.10, 4.14, and 4.21 show snapshots of ADCIRC simulation from the point at which the maximum storm surge height is reached at the observation point for each of the three trials for 2015 and 2100 conditions for New Orleans, Galveston and Miami respectively. The optimal parameters found for each of these experiments are shown in Table 4.3, with the highest value for each location highlighted in bold (shown previously in Figure 4.1). As can be seen graphically, the snapshots highlight that for some locations the exact angle of approach \(\chi\) does not make a large difference, and we can see a cone of uncertainty for Miami (Figure 4.21) of between \(-33^{\circ}\) and \(-58^{\circ}\), whereas for Galveston (Figure 4.14) this cone is smaller, between \(-8^{\circ}\) and \(1^{\circ}\). New Orleans (Figures 4.6 and 4.10) has a moderate \(\chi\) sensitivity with a range between \(-68.8^{\circ}\) and \(-80^{\circ}\), although this could partly be due to the constraint on \(\chi\) for New Orleans. For these repeats (Figures 4.6 and 4.10), the trajectory travels so that the TC’s RHS roughly pushes water down the channel known as Lake Borgne, which enables the water to be piled up at the observation point (shown as a blue dot).

The optimal TC parameters found for the potential height for New Orleans, Galveston and Miami, for 2015 and 2100 conditions with three repeats. \(i\) is the iteration number at which the estimated potential height \(z^{*}\) was found. The top repeat’s potential height and TC characteristics are highlighted. For all experiments the limits are for \(c \in (-2^{\circ}E, 2^{\circ}E)\), \(V_t \in (0\text{ m s}^{-1}, 15\text{ m s}^{-1})\).
Name Year Trial \(i\) \(c\) [\(^{\circ}\)E] \(\chi\) [\(^{\circ}\)] \(V_t\) [m s\(^{-1}\)] \(z^{*}\) [m]
New Orleans 2015 0 47 0.150664 -68.583395 1.547884 15.455311
2015 1 26 -0.024005 -80.000000 1.225401 15.810684
2015 2 49 0.443851 -64.282176 1.394231 15.150438
New Orleans 2100 0 48 0.128358 -67.692634 2.230662 18.073754
2100 1 48 -0.134150 -80.000000 1.574318 18.180207
2100 2 38 -0.499875 -80.000000 2.076426 18.167737
Galveston 2015 0 49 -0.239934 -6.546827 13.934525 7.808678
2015 1 46 -0.202530 1.388188 14.135774 7.784917
2015 2 37 -0.242096 -4.413026 15.000000 7.792348
Galveston 2100 0 41 -0.260462 -7.956457 15.000000 9.456821
2100 1 40 -0.219618 -1.623654 15.000000 9.451374
2100 2 38 -0.228618 -4.128352 15.000000 9.449851
Miami 2015 0 49 -0.474863 -58.696815 15.000000 5.337554
2015 1 49 -0.414894 -44.251724 13.009463 5.467047
2015 2 42 -0.339169 -33.298199 13.140890 5.561214
Miami 2100 0 46 -0.645725 -57.052325 15.000000 5.756896
2100 1 44 -0.401420 -37.599006 12.778452 5.932592
2100 2 49 -0.403765 -36.752923 13.056293 5.952213
Using Bayesian optimization to find the Potential Height for 2015 (blue) and 2100 (red) for New Orleans. The first 25 samples are selected using Latin hypercube sampling, and the subsequent 25 points are selected by the MES acquisition function. Each point summarises a numerical simulation where the track displacement \(c\), track angle \(\chi\), and translation speed \(V_t\) are inputs and the maximum storm surge height at the observation point is the output. The solid lines in panel (a) show the highest SSH seen so far in each trial (the current estimated potential height). Each experiment (2015 TC and 2100 TC) is repeated three times with different random seeds.
New Orleans, 2015, Trial 0
New Orleans, 2015, Trial 1
New Orleans, 2015, Trial 2
Snapshots of the maximum storm surge height for New Orleans, for 2015 conditions. The trial number (0-2) is a repeat of the experiment with a different random seed. Snapshots are taken when the maximum storm surge height at the observation point (blue) was reached.
New Orleans, 2100, Trial 0
New Orleans, 2100, Trial 1
New Orleans, 2100, Trial 2
Snapshots of the maximum storm surge height for New Orleans, for 2100 conditions. The trial number (0-2) is a repeat of the experiment with a different random seed. Snapshots are taken when the maximum storm surge height at the observation point (blue) was reached.
Galveston, 2015, Trial 0
Galveston, 2100, Trial 0
Galveston, 2015, Trial 1
Galveston, 2100, Trial 1
Galveston, 2015, Trial 2
Galveston, 2100, Trial 2
Snapshots of the maximum storm surge height for Galveston, for 2015 and 2100 conditions. The trial number (0-2) is a repeat of the experiment with a different random seed. Snapshots are taken when the maximum storm surge height at the observation point (blue) was reached.
Miami, 2015, Trial 0
Miami, 2100, Trial 0
Miami, 2015, Trial 1
Miami, 2100, Trial 1
Miami, 2015, Trial 2
Miami, 2100, Trial 2
Snapshots of the maximum storm surge height for Miami, for 2015 and 2100 conditions. The trial number (0-2) is a repeat of the experiment with a different random seed. Snapshots are taken when the maximum storm surge height at the observation point (blue) was reached.

4.3.3 GP Kernel Sensitivity Analysis

The choice of covariance kernel is a critical design decision in any GP-based model, as it encodes prior assumptions about the function’s smoothness (Rasmussen and Williams 2006). To assess this, we perform a kernel comparison using a dataset of 787 sample locations generated via Latin hypercube sampling, and numerically simulated in high-resolution ADCIRC simulations for the New Orleans location with the 2015 idealized cyclone. Each point in this dataset has the three input parameters (translation speed \(V_t\), track angle \(\chi\), and track displacement, \(c\)), and the resulting maximum storm surge height at the observation point, \(y\).

We repeatedly split the dataset into a small training set of 25 points (mimicking the initial design in a BO run) and a large test set of the remaining 762 points. This should evaluate how well they will model the space at the start of the Bayesian optimization loop. If they are better surrogate models, we expect the Bayesian optimization loop to better estimate potential height. We then train GP models with various kernels and evaluate their predictive performance on the test set using three metrics: the mean predictive log-likelihood (\(\bar{\mathcal{L}}\)) for probabilistic predictions, Root Mean Squared Error (RMSE), a widely used measure of deterministic prediction accuracy, and the coefficient of determination (\(r^2\)) for a scaled score of deterministic accuracy that does not depend on the scale of the predictions. The experiment is repeated over \(N=500\) random train/test splits to ensure statistical robustness. Because the standard error of the mean scales as \(1/\sqrt{N}\), using 500 splits yields standard errors of approximately 0.01 for \(\bar{\mathcal{L}}\) and 0.005 for \(r^2\) (see Table 4.4), which are substantially smaller than the differences between kernel families and therefore sufficient to distinguish their performance.

We summarise the results in Table 4.4. The Matérn family of kernels, which allows for tunable smoothness, consistently outperforms the infinitely smooth squared-exponential (SE) kernel. The Matérn-3/2 kernel achieved the highest mean predictive log-likelihood (\(\bar{\mathcal{L}}=-2.219\pm0.011\)), indicating that it has the best probabilistic fit for the data. The Matérn-5/2 kernel yielded the best deterministic performance, with the lowest RMSE (\(2.281\pm0.015\text{ m}\)) and highest \(r^2\) (\(0.660 \pm 0.005\)). However, the difference between either kernel choice (Matérn-5/2 or Matérn-3/2, both with a zero mean function), is small, indicating that they are both reasonable choices if you are prioritising the mean prediction or the probabilistic prediction. For the Bayesian optimization task, it is probably more important to have accurate probabilistic predictions, so the Matérn-3/2 could be a small improvement. Both kernels significantly outperform the SE and Polynomial kernels, suggesting that the assumption of infinite smoothness is inappropriate for the storm surge response surface, which likely contains sharp gradients as found in other studies such as Gopinathan et al. (2021).

Comparison of GP kernels on a high-resolution ADCIRC dataset (N=787). A training set of 25 points is used to predict the remaining 762 points. Results are mean \(\pm\) standard error over 500 random splits. Bold values indicate the best performance for each metric. The Matérn-1/2 kernel is also known as the Exponential or Laplacian kernel. SE stands for the Squared Exponential kernel, also known as the Radial Basis Function kernel or RBF.
Kernel Mean Function \(\bar{\mathcal{L}}\) RMSE [m] \(r^2\)
Matérn-\({5}/{2}\) Constant -2.254 \(\pm\) 0.015 2.294 \(\pm\) 0.016 0.656 \(\pm\) 0.005
Matérn-\({5}/{2}\) Zero -2.246 \(\pm\) 0.014 2.281 \(\pm\) 0.015 0.660 \(\pm\) 0.005
Matérn-\({3}/{2}\) Constant -2.225 \(\pm\) 0.011 2.302 \(\pm\) 0.015 0.654 \(\pm\) 0.005
Matérn-\({3}/{2}\) Zero -2.219 \(\pm\) 0.011 2.292 \(\pm\) 0.015 0.657 \(\pm\) 0.005
Matérn-\({1}/{2}\) Constant -2.295 \(\pm\) 0.006 2.464 \(\pm\) 0.014 0.606 \(\pm\) 0.004
Matérn-\({1}/{2}\) Zero -2.290 \(\pm\) 0.006 2.455 \(\pm\) 0.013 0.609 \(\pm\) 0.004
SE Constant -2.382 \(\pm\) 0.025 2.330 \(\pm\) 0.016 0.645 \(\pm\) 0.005
SE Zero -2.365 \(\pm\) 0.024 2.312 \(\pm\) 0.015 0.651 \(\pm\) 0.005
Polynomial Constant -2.529 \(\pm\) 0.013 2.892 \(\pm\) 0.021 0.451 \(\pm\) 0.008
Polynomial Zero -2.512 \(\pm\) 0.012 2.851 \(\pm\) 0.020 0.467 \(\pm\) 0.008

4.4 Discussion and Future Work

4.4.1 Findings

The key findings from Section Section 4.3 can be summarized as follows:

4.4.2 Limitations

In Section Section 4.3, we found that the optimal trajectory for New Orleans consistently approached at the angular limit of \(-80^{\circ}\) (travelling roughly westward, deflected 10\(^{\circ}\) northward from due west), while for Galveston and Miami, the optimal translation speed was often at the upper limit of \(15 \text{ m s}^{-1}\). Potential height was also reached near \(V_t\approx 15 \text{ m s}^{-1}\) for Galveston and Miami. This means that the limits that we impose on the BO loop could be important in determining the potential height found, as if a larger range had been allowed, the potential height found could have been even larger.

For New Orleans, the angular limit of \(-80^{\circ}\) to \(80^{\circ}\) is chosen so that the TC approaches from the ocean, with the coastline roughly orientated as a line of constant latitude. TCs can sometimes skim the coastline (e.g. Typhoon Ragasa in 2025), but if the TC spends a significant portion of the approach partially on land, then this should likely reduce the plausible intensity and size of the TC (not modelled here, as we choose a single ocean point at which to calculate potential intensity and size for each location).

However, the choice of TC translation speed is much less certain, as it is chosen to be large compared to the climatology of TC translation speeds in the Atlantic from IBTrACS, and roughly twice the landfall speed of Hurricane Katrina in 2005. However, we have no explicit theory for how translation speed should interact with the potential intensity and potential size, and further work could be done to consider how to integrate the somewhat incompatible assumptions made in Chapter 2 that we are both at a steady equilibrium with the environment (not moving), but that the sea surface temperature and outflow temperature are kept at ambient conditions (which would need the storm to move fast enough to not significantly cool the ocean below it etc.). Considering the TC trade-off between speed, size and intensity could be an important area of future theoretical work.

4.4.3 Future Work

The performance and reliability of the potential height BO framework depend on several key design choices. Our kernel analysis in Section Section 4.3 highlights one such sensitivity, but also confirms that our previous choice was reasonable. Building on this, we identify several critical areas for future research to enhance the methodology.

4.4.3.0.1 Acquisition Functions.

Our current framework uses Max-Value Entropy Search (MES) (Wang and Jegelka 2017), an information-theoretic acquisition function. While powerful, it is important to compare its performance against other established functions. Expected Improvement (EI) (Jones et al. 1998) is a classic choice that focuses on direct improvement, and was used in the TC track optimization study Ide et al. (2024), while Upper Confidence Bound (UCB) (Srinivas et al. 2010) offers theoretical regret guarantees and explicit control over the exploration-exploitation trade-off. Recent work has shown that EI can be viewed as an approximation to MES, suggesting a deeper connection between these philosophies (Cheng et al. 2025). A systematic comparison of these functions, particularly in a batch setting (e.g., q-UCB), is needed to determine the most efficient strategy for the storm surge problem. We conduct some preparatory experiments with EI in Section Section 4.6, Table 4.5, Table 4.6, but a more thorough comparison is needed.

4.4.3.0.2 Noise Modeling.

We currently assume a small, fixed noise term to account for numerical artifacts in the ADCIRC simulation. However, the simulation is largely deterministic. Comparing this noisy assumption against a noise-free GP model is an important step. A noise-free model could fit the observed data more closely but risks overfitting, while a small “nugget” term can improve numerical stability and robustness (Rasmussen and Williams 2006). Investigating the impact of this choice on convergence speed and final optima is a key research question.

4.4.3.0.3 Informative Priors.

The standard GP assumes a zero-mean prior, learning the function’s behavior entirely from the data. This can be inefficient if prior physical knowledge is available. An alternative is to use an informative prior mean function (as shown for particle physics in Boltz et al. 2025), for example, derived from a faster, lower-fidelity surge model or an empirical formula (Mori et al. 2022, used a simple empirical formula (response function) and \(V_p\) to calculate their “maximum potential surge”). Such a “gray-box” approach could significantly accelerate convergence by guiding the initial search towards physically plausible high-surge regions, while retaining the ability to numerically simulate the full complexity of the coast.

4.4.3.0.4 Advanced Geometries and Physics.

The current work is limited to straight-line TC tracks. Extending the optimization space to include track curvature, as explored by Ide et al. (2024), would allow for the discovery of more complex and potentially more hazardous scenarios. Furthermore, our analysis only considers the peak surge at a single point. Expanding the framework to a multi-objective problem (Kandasamy et al. 2017), optimizing for maximum surge across a stretch of coastline or considering other impacts like wave height, would provide a more comprehensive hazard assessment. We might be able to leverage observational data (e.g. IBTrACS), but historical tracks do not directly define a continuous parameter space from which the BO loop can sample. Two avenues could bridge this gap: training a generative model on IBTrACS to produce realistic tracks from a small number of latent inputs, or extending the parameterisation to include additional degrees of freedom (e.g. track curvature, time-varying intensity) so that observed track variability is better represented in the search space.

4.5 Conclusion

This work demonstrates that Bayesian optimization is a powerful and efficient tool for identifying the potential height of tropical cyclone storm surges, a critical task for coastal risk assessment in a changing climate. Our application of the framework to diverse US coastal locations successfully identified physically plausible high-impact scenarios that are sensitive to both local geography and future climate conditions.

A key contribution of this work is the systematic evaluation of Gaussian Process kernels, which showed that the smoothness assumptions encoded in the kernel have a significant impact on predictive accuracy. The superior performance of Matérn kernels over the standard squared-exponential kernel provides valuable guidance for practitioners modeling similar complex physical systems. The outlined directions for future work, particularly concerning the choice of acquisition function and the incorporation of prior physical knowledge, promise to further enhance the power and reliability of this data-centric approach to understanding extreme environmental events.

Open source code

The code for this work is available at https://github.com/sdat2/PotentialHeight (Thomas 2025), and builds upon the trieste library (Picheny et al. 2023) and the sithom utility library (Thomas 2024).

4.6 Appendix: Additional Bayesian Optimization Results

In Table 4.5 we show the optimal TC parameters found for the potential height for New Orleans, Galveston and Miami, for 2015 and 2100 conditions with multiple repeats with the EI DAF. This can be compared to Table 4.6 which shows the same results for the MES DAF, for a greater number of repeats. If we plot the full experiments we should be able to better estimate the ability of each DAF to find the potential height quickly and reliably (i.e. minimize simple regret).

The optimal TC parameters found for the potential height for New Orleans, Galveston and Miami, for 2015 and 2100 conditions with multiple repeats with the EI DAF. \(i\) is the iteration number at which the maximum surge \(z^{*}\) was found.
Name Year Trial \(i\) \(c\) [\(^{\circ}\)] \(\chi\) [\(^{\circ}\)] \(V_t\) [m s\(^{-1}\)] \(z^{*}\) [m]
New Orleans 2015 0 35 0.106144 -65.765142 2.645223 16.623703
2015 1 27 -0.037976 -72.105791 2.610292 16.646118
2015 2 34 0.080311 -73.216037 2.137189 16.568913
2015 3 48 0.178416 -50.576019 3.424609 16.197922
New Orleans 2100 0 34 0.116110 -64.564155 2.502893 19.005556
2100 1 9 0.188782 -74.528809 3.266373 17.358231
2100 2 50 0.081534 -72.808089 1.050691 18.617483
2100 3 11 0.532410 -38.566895 6.979294 12.756316
Miami 2015 0 50 -0.907845 -65.559199 14.199726 5.808904
2015 1 45 -0.755691 -58.689019 12.467778 5.855880
2015 2 42 -0.317724 -14.295937 15.000000 5.878907
2015 3 44 -0.493195 -42.803518 9.394865 5.804811
2015 4 47 -0.436209 -42.899324 14.359218 6.025050
Miami 2100 0 50 -1.089736 -64.441735 14.217393 6.275769
2100 1 46 -0.812193 -59.644524 13.642048 6.352571
2100 2 48 -0.516151 -41.709921 15.000000 6.469617
2100 3 48 -0.505128 -41.683116 9.644382 6.181484
2100 4 45 -0.581573 -44.306367 14.214715 6.439702
Galveston 2015 0 49 -0.403830 -21.338870 8.279774 8.286314
2015 1 45 -0.350905 -12.918042 14.575252 9.227775
2015 2 47 -0.356173 -17.930728 14.526479 9.264994
2015 3 50 -0.302235 -16.091937 10.399270 8.754756
2015 4 50 -0.535234 -38.618675 8.980126 8.285849
Galveston 2100 0 50 -0.414398 -24.336226 8.494458 10.055245
2100 1 41 -0.374471 -16.827473 14.330211 11.007732
2100 2 42 -0.391046 -20.149699 14.586575 11.015003
2100 3 48 -0.355822 -15.085349 10.537436 10.511299
2100 4 45 -0.620599 -36.978461 9.332306 10.008612
The optimal TC parameters found for the potential height for New Orleans, Galveston and Miami, for 2015 and 2100 conditions with multiple repeats with the MES DAF. \(i\) is the iteration number at which the maximum surge \(z^{*}\) was found.
Name Year Trial \(i\) \(c\) [\(^{\circ}\)] \(\chi\) [\(^{\circ}\)] \(V_t\) [m s\(^{-1}\)] \(z^{*}\) [m]
New Orleans 2015 0 48 0.150664 -68.583395 1.547884 15.455311
2015 1 27 -0.024005 -80.000000 1.225401 15.810684
2015 2 50 0.443851 -64.282176 1.394231 15.150438
2015 3 49 0.435980 -79.011548 0.814035 15.760807
2015 4 48 0.061778 -69.469572 2.117449 16.331021
2015 5 50 0.032803 -80.000000 1.274389 16.346804
2015 6 42 -0.296418 -80.000000 1.688346 16.401895
2015 7 27 0.190199 -65.280805 2.360441 16.256331
2015 8 45 -0.292706 -80.000000 1.560420 16.415911
2015 9 50 0.500926 -9.271796 2.309178 13.210543
2015 10 50 0.192714 -60.737065 2.891723 16.203676
2015 11 48 0.277146 -75.769189 1.365527 16.150175
New Orleans 2100 0 49 0.128358 -67.692634 2.230662 18.073754
2100 1 49 -0.134150 -80.000000 1.574318 18.180207
2100 2 39 -0.499875 -80.000000 2.076426 18.167737
2100 3 50 0.184383 -61.389076 1.685182 18.433191
2100 4 50 0.113350 -67.124205 2.231882 18.678784
2100 5 14 0.313660 -64.816895 5.924606 12.823672
2100 6 39 0.165419 -59.671687 3.118457 18.771043
2100 7 1 -0.073730 -61.308594 2.074585 17.684511
2100 8 38 -0.333299 -80.000000 1.514461 18.738474
2100 9 3 1.586426 47.285156 3.143921 7.072265
2100 10 45 0.158070 -60.216438 3.126566 18.787441
2100 11 3 0.591370 65.363770 2.534866 9.210907
Miami 2015 0 50 -0.474863 -58.696815 15.000000 5.337554
2015 1 50 -0.414894 -44.251724 13.009463 5.467047
2015 2 43 -0.339169 -33.298199 13.140890 5.561214
2015 3 35 -0.384161 -29.815032 14.667681 5.629461
2015 4 46 -0.744002 -59.949934 9.760519 5.339923
2015 5 39 -0.873098 -60.735722 8.801652 5.295642
2015 6 46 -0.627557 -54.849255 12.125315 5.411477
2015 7 36 -0.463462 -38.612121 13.576390 5.620397
2015 8 44 -0.509733 -51.497712 8.433675 5.369272
2015 9 4 -0.413574 -32.714844 10.643921 5.571131
2015 10 10 -0.299561 -23.173828 10.701599 5.680423
2015 11 46 -0.318434 -28.584325 13.558010 5.647601
Miami 2100 0 47 -0.645725 -57.052325 15.000000 5.756896
2100 1 45 -0.401420 -37.599006 12.778452 5.932592
2100 2 50 -0.403765 -36.752923 13.056293 5.952213
2100 3 37 -0.404466 -33.068057 14.413090 6.051083
2100 4 50 -0.759946 -57.534220 14.095991 5.826954
2100 5 43 -0.922258 -58.354497 9.329459 5.666612
2100 6 46 -0.732460 -53.882079 9.551218 5.809021
2100 7 44 -0.501872 -38.525615 14.078191 6.022338
2100 8 24 -1.145447 -67.741699 6.491776 5.557811
2100 9 4 -0.413574 -32.714844 10.643921 5.946936
2100 10 50 -0.385609 -29.661194 11.205281 6.023767
2100 11 48 -0.357789 -22.981955 13.741247 6.076998
Galveston 2015 0 50 -0.239934 -6.546827 13.934525 7.808678
2015 1 47 -0.202530 1.388188 14.135774 7.784917
2015 2 38 -0.242096 -4.413026 15.000000 7.792348
2015 3 46 -0.295615 -8.472373 10.243142 7.897922
2015 4 46 -0.274242 -9.557076 12.083916 8.117956
2015 5 43 -0.315450 -21.033440 9.173075 7.755775
2015 6 50 -0.338784 -19.658241 9.593588 7.830658
2015 7 49 -0.341349 -15.711692 11.881489 8.107960
2015 8 42 -0.343098 -24.416809 7.671206 7.521423
2015 9 49 -0.337953 -22.515926 9.835710 7.837834
2015 10 50 -0.356872 -20.149886 10.369618 7.930439
2015 11 28 -0.227945 -1.538035 13.293142 8.148791
Galveston 2100 0 42 -0.260462 -7.956457 15.000000 9.456821
2100 1 41 -0.219618 -1.623654 15.000000 9.451374
2100 2 39 -0.228618 -4.128352 15.000000 9.449851
2100 3 50 -0.304825 -4.035788 11.587170 9.636491
2100 4 44 -0.293576 -6.798316 12.710099 9.812594
2100 5 46 -0.337807 -22.121024 8.961768 9.351685
2100 6 50 -0.488446 -24.842308 9.343398 9.301671
2100 7 47 -0.366055 -23.211359 9.987884 9.516553
2100 8 48 -0.368090 -26.706469 8.476046 9.216771
2100 9 49 -0.394500 -21.304301 9.912835 9.523946
2100 10 50 -0.384730 -20.524362 10.385365 9.584086
2100 11 28 -0.234762 -2.492068 13.252825 9.804772

4.7 Additional Bayesian Optimization Process Figures

Figures 4.28 and 4.29 are additional figures that show the same Bayesian optimization process as Figure 4.5, but for Galveston and Miami respectively. As before, the first 25 points before the dashed black line are for the Latin hypercube search sampling strategy, and the remaining 25 points are selected with the MES acquisition function. The blue points are for 2015 conditions, and the red points for 2100 conditions. Each experiment is repeated three times with different random seeds. The final potential height found is indicated by the dashed horizontal line of the same color.

Using Bayesian optimization to find the Potential Height for 2015 (blue) and 2100 (red) for Galveston. Each is repeated three times.
Using Bayesian optimization to find the Potential Height for 2015 (blue) and 2100 (red) for Miami. Each is repeated three times.

References

Boltz, Tobias, Jose L. Martinez, Connie Xu, et al. 2025. “Leveraging Prior Mean Models for Faster Bayesian Optimization of Particle Accelerators.” Scientific Reports 15: 12232. https://doi.org/10.1038/s41598-025-95297-z.
Brochu, Eric, Vlad M. Cora, and Nando de Freitas. 2010. “A Tutorial on Bayesian Optimization of Expensive Cost Functions, with Application to Active User Modeling and Hierarchical Reinforcement Learning.” arXiv Preprint arXiv:1012.2599. https://arxiv.org/abs/1012.2599.
Chavas, Daniel R., Ning Lin, and Kerry A Emanuel. 2015. “A Model for the Complete Radial Structure of the Tropical Cyclone Wind Field. Part I: Comparison with Observed Structure.” Journal of the Atmospheric Sciences 72 (9): 3647–62. https://doi.org/10.1175/JAS-D-15-0014.1.
Cheng, Nuojin, Leonard Papenmeier, Stephen Becker, and Luigi Nardi. 2025. “A Unified Framework for Entropy Search and Expected Improvement in Bayesian Optimization.” Proceedings of the 42nd International Conference on Machine Learning. https://openreview.net/forum?id=LbJQYNSH41.
GFDL. 2024. “Global Warming and Hurricanes.” Geophysical Fluid Dynamics Laboratory. https://www.gfdl.noaa.gov/global-warming-and-hurricanes/.
Gopinathan, Devaraj, Mohammad Heidarzadeh, and Serge Guillas. 2021. “Probabilistic Quantification of Tsunami Current Hazard Using Statistical Emulation.” Proceedings of the Royal Society A 477 (2250): 20210180.
Hennig, Philipp, and Christian J Schuler. 2012. “Entropy Search for Information-Efficient Global Optimization.” The Journal of Machine Learning Research 13 (1): 1809–37.
Ide, Yoshihiko, Shinichiro Ozaki, Masaru Yamashiro, and Mitsuyoshi Kodama. 2024. “Development and Improvement of a Method for Determining the Worst-Case Typhoon Path for Storm Surge Deviation Through Bayesian Optimization.” Engineering Applications of Artificial Intelligence 132: 107950.
Jones, Donald R., Matthias Schonlau, and William J. Welch. 1998. “Efficient Global Optimization of Expensive Black-Box Functions.” Journal of Global Optimization 13 (4): 455–92. https://doi.org/10.1023/a:1008306431147.
Kandasamy, Kirthevasan, Gautam Dasarathy, Jeff Schneider, and Barnabás Póczos. 2017. “Multi-Fidelity Bayesian Optimization with Continuous Approximations.” Proceedings of the 34th International Conference on Machine Learning 70: 1799–808. http://proceedings.mlr.press/v70/kandasamy17a/kandasamy17a.pdf.
Knutson, Thomas R., Suzana J. Camargo, Johnny C. L. Chan, et al. 2020. “Tropical Cyclones and Climate Change Assessment: Part II: Projected Response to Anthropogenic Warming.” Bulletin of the American Meteorological Society 101 (3): E303–22. https://doi.org/10.1175/BAMS-D-18-0194.1.
Lockwood, Joseph W, Ning Lin, Michael Oppenheimer, and Chin-Yao Lai. 2022. “Using Neural Networks to Predict Hurricane Storm Surge and to Assess the Sensitivity of Surge to Storm Characteristics.” Journal of Geophysical Research: Atmospheres, e2022JD037617.
Luettich, Richard Albert, Joannes J Westerink, Norman W Scheffner, et al. 1992. ADCIRC: An Advanced Three-Dimensional Circulation Model for Shelves, Coasts, and Estuaries. Report 1, Theory and Methodology of ADCIRC-2DD1 and ADCIRC-3DL.
McKay, Michael D, Richard J Beckman, and William J Conover. 1979. “A Comparison of Three Methods for Selecting Values of Input Variables in the Analysis of Output from a Computer Code.” Technometrics 21 (2): 239–45. https://doi.org/10.2307/1268522.
Mori, Sotaro, Tomoya Shimura, Takuya Miyashita, Adrean Webb, and Nobuhito Mori. 2022. “Future Changes in Extreme Storm Surge Based on a Maximum Potential Storm Surge Model for East Asia.” Coastal Engineering Journal 64 (4): 630–47. https://doi.org/10.1080/21664250.2022.2145682.
Oppenheimer, M., B. C. Glavovic, J. Hinkel, et al. 2019. “Chapter 4: Sea Level Rise and Implications for Low-Lying Islands, Coasts and Communities.” In IPCC Special Report on the Ocean and Cryosphere in a Changing Climate, edited by H.-O. Pörtner, D. C. Roberts, V. Masson-Delmotte, et al. Intergovernmental Panel on Climate Change (IPCC).
Picheny, Victor, Joel Berkeley, Henry B Moss, et al. 2023. Trieste: Efficiently Exploring the Depths of Black-Box Functions with TensorFlow.” arXiv Preprint arXiv:2302.08436.
Rasmussen, Carl E., and Christopher K. I. Williams. 2006. Gaussian Processes for Machine Learning. Vol. 2. MIT Press.
Resio, Donald T., and Joannes J. Westerink. 2008. “Modeling the Physics of Storm Surges.” Physics Today 61 (9): 33–38. https://doi.org/10.1063/1.2982120.
Shahriari, Bobak, Kevin Swersky, Ziyu Wang, Ryan P. Adams, and Nando de Freitas. 2015. “Taking the Human Out of the Loop: A Review of Bayesian Optimization.” Proceedings of the IEEE 104 (1): 148–75. https://doi.org/10.1109/JPROC.2015.2494218.
Srinivas, Niranjan, Andreas Krause, Sham M. Kakade, and Matthias Seeger. 2010. Gaussian Process Optimization in the Bandit Setting: No Regret and Experimental Design.” Proc. Of the 27th International Conference on Machine Learning (ICML), 1015–22.
Thomas, Simon D. A., Dani C. Jones, Talea Mayo, et al. 2025. “Finding the Potential Height of Tropical Cyclone Storm Surges in a Changing Climate Using Bayesian Optimization.” EarthArXiv (Submitted to Environmental Data Science), ahead of print. https://doi.org/10.31223/X57T5R.
Thomas, Simon Donald Alistair. 2024. Sithom’s Scientific Python Utilities Package. V. v0.1.1. Released. https://doi.org/10.5281/zenodo.7020109.
Thomas, Simon Donald Alistair. 2025. WorstSurge – Finding the potential height of a tropical cyclone storm surge in a changing climate using Bayesian Optimization. V. v0.0.4. Released. https://doi.org/10.5281/zenodo.15096627.
Toro, G. R., D. T. Resio, D. Divoky, A. W. Niedoroda, and C. Reed. 2008. “Efficient Joint-Probability Methods for Hurricane Surge Frequency Analysis.” Proceedings of the 10th International Workshop on Wave Hindcasting and Forecasting, ahead of print. https://doi.org/10.1016/j.oceaneng.2009.09.004.
Wang, Zi, and Stefanie Jegelka. 2017. “Max-Value Entropy Search for Efficient Bayesian Optimization.” Proc. Of the 34th International Conference on Machine Learning (ICML), 3627–35.

  1. \(\chi \in \left(-80, 80\right)\) for New Orleans, \(\chi \in \left(-80, 80\right)\) for Galveston, \(\chi \in \left(-120, 10\right)\) for Miami↩︎

  2. Both the PI potential outer size \(r_{a3}\) for the radius of vanishing wind, and the PI potential inner size \(r_3\) for the radius of maximum wind, both calculated for the intensity equal to the potential intensity.↩︎

Previous: Finding the potential height of tropical storm surges in a changing climate using Bayesian optimizationNext: SurgeNet: Beginning to train a spatio-temporal graph neural network for tropical cyclone storm surge prediction