Enhancing 3D Planetary Atmosphere Simulations with a Surrogate Radiative Transfer Model

Tara P. A. Tahseen,¹ João M. Mendonça,² Kai Hou Yip¹ and Ingo P. Waldmann¹
¹Department of Physics and Astronomy, University College London, Gower Street, WC1E 6BT London, United Kingdom
²Department of Space Research and Space Technology, Technical University of Denmark, Elektrovej 328, 2800 Kgs. Lyngby, Denmark
E-mail: tara.tahseen.22@ucl.ac.uk (TPAT)

(Accepted XXX. Received YYY; in original form ZZZ)

Abstract

This work introduces an approach to enhancing the computational efficiency of 3D atmospheric simulations by integrating a machine-learned surrogate model into the OASIS global circulation model (GCM). Traditional GCMs, which are based on repeatedly numerically integrating physical equations governing atmospheric processes across a series of time-steps, are time-intensive, leading to compromises in spatial and temporal resolution of simulations. This research improves upon this limitation, enabling higher resolution simulations within practical timeframes. Speeding up 3D simulations holds significant implications in multiple domains. Firstly, it facilitates the integration of 3D models into exoplanet inference pipelines, allowing for robust characterisation of exoplanets from a previously unseen wealth of data anticipated from JWST and post-JWST instruments. Secondly, acceleration of 3D models will enable higher resolution atmospheric simulations of Earth and Solar System planets, enabling more detailed insights into their atmospheric physics and chemistry. Our method replaces the radiative transfer module in OASIS with a recurrent neural network-based model trained on simulation inputs and outputs. Radiative transfer is typically one of the slowest components of a GCM, thus providing the largest scope for overall model speed-up. The surrogate model was trained and tested on the specific test case of the Venusian atmosphere, to benchmark the utility of this approach in the case of non-terrestrial atmospheres. This approach yields promising results, with the surrogate-integrated GCM demonstrating above 99.0% accuracy and 101 factor GPU speed-up of the entire simulation compared to using the matched original GCM under Venus-like conditions.

keywords:

planets and satellites: atmospheres – radiative transfer

pubyear: 2024

1 Introduction

3D atmospheric models, commonly known as global circulation models (GCMs), are key tools for studying the climates of Solar System planets, including the Earth, and are becoming increasingly important in the characterisation of exoplanet atmospheres. GCMs consist of multiple components which individually model different atmospheric processes: each component numerically solves the equations governing an atmospheric process across the elements of a grid and across many time-steps until simulation convergence criteria are reached. Due to the large number of numerical integrations involved, producing a simulated atmospheric state for a given input set of planetary and stellar parameters is computationally expensive and time-consuming.

GCMs are applied in exoplanet science as the forward-modelling component of Bayesian atmospheric retrieval pipelines. To retrieve posterior distributions on planetary and stellar parameters from a single transit spectrum, the Bayesian retrieval framework requires of the order of tens of thousands of forward simulations, each corresponding to a different sample of the input parameter space. With current state-of-the-art 3D modelling techniques, the compute resources and time needed to produce the required number of 3D simulations prohibit statistically rigorous inference from data from the James Webb Space Telescope (JWST, Gardner et al., 2006) and from next-generation observational instruments yet to come, namely the Ariel Space Telescope (Tinetti et al., 2021). Accelerating GCMs without compromising their accuracy is thus one clear route to facilitating inference using JWST and post-JWST data in exoplanet science.

Reducing GCM simulation time would also benefit the climate science of Earth and other Solar System planets. Faster computation would enable simulations to be run at greater resolution, and/or with more physics included, thus enabling more realistic climate simulations.

The past few years have yielded work in both exoplanet science and Earth climate science to accelerate 3D models. Much of this work in exoplanet science involves extending 1D models with extra parameters to reflect certain 3D atmospheric variations deemed necessary to account for 3D effects in phase-curve data (Changeat & Al-Refaie, 2020; Irwin et al., 2020; Chubb & Min, 2022; Himes et al., 2023; Nixon & Madhusudhan, 2022; Feng et al., 2020). This approach is prone to introducing biases into the simulated atmospheric states, and a fully 3D model parametrisation is essential to ensuring that simulations are robust against both known and unknown biases resulting from model oversimplifications (Pluriel et al., 2020). Earth climate science, benefiting from a wealth of high-resolution observational measurements, has employed a different set of methods, namely machine-learned surrogate models trained on such observations (Yao et al., 2023; Ukkonen, 2022). Surrogate models have not been broadly explored outside of Earth climate science, but demonstrate the potential for speed-up without requiring an oversimplified model parametrisation (Yao et al., 2023; Ukkonen, 2022).

Of the components within GCMs, radiative transfer is often the slowest or least-resolved process. In the OASIS GCM (Mendonca et al., 2014), the radiative transfer component accounts for 60-70% of the total simulation runtime for massive and complex atmospheres such as that of Venus (Mendonça & Buchhave, 2020). There is thus considerable scope to improve GCM computational efficiency by targeting the radiative transfer component specifically.

To assess the effectiveness of our new approach, we test it on Venus’ atmospheric conditions. Simulating the Venus atmosphere in 3D is very computationally intensive due to the need for a complex radiative transfer model to accurately represent the energy balance in the atmosphere (e.g. Eymet et al., 2009; Lee & Richardson, 2012; Mendonca et al., 2015). Venus has a substantial amount of CO₂, which generates a strong greenhouse effect (Sagan, 1962), and is covered by highly reflective sulfuric acid clouds, which obscure the planet’s surface. Only about 2.5$\%$ of the incoming solar radiation reaches the surface (e.g. Tomasko et al., 1980; Mendonca et al., 2015). The radiation model also requires fine spectral resolution to capture the spectral windows impacting energy exchange between the deep atmosphere and the upper layers above the clouds. Additionally, due to the high thermal inertia of the massive CO₂ atmosphere, the models need to be integrated over a long period to reach a statistically steady state (Mendonça & Read, 2016). Using a surrogate model to represent the radiative transfer is key to enhancing the performance of 3D simulations whilst enabling a more realistic depiction of radiative processes at minimal cost. Venus therefore presents a significant challenge and serves as a benchmark for the complexities our new modelling approach may encounter in future applications.

2 Data & Codes

2.1 OASIS

OASIS is a planetary climate model composed of different coupled modules representing physical and chemical processes within planetary atmospheres (Mendonça & Buchhave, 2020). The mathematics and assumptions of the radiative transfer component of OASIS are detailed in Mendonca et al. (2015).

The equations in OASIS are discretized over concentric icosahedral spatial grids (Mendonça et al., 2016; Deitrick et al., 2020). For the 3D simulations in this study, we set the grid to approximately 2 degrees in the horizontal and 49 vertical layers. This grid configuration results in a total of 10242 columns covering the entire model domain. Our simulations started from the converged state obtained in Mendonça & Buchhave (2020) and were integrated for 5 Venus solar days (each approximately 117 Earth days) using a time step of 15 seconds. The model configuration is similar to that of the simulations in Mendonça & Buchhave (2020), and in the following sections, we describe the main physical modules relevant to this work.

2.1.1 OASIS-RT

The radiative transfer module of OASIS (henceforth referred to as OASIS-RT) models the interaction of radiation with gas and cloud species in the atmosphere in a two-stream manner. Radiation from two different sources is modelled separately: solar radiation ( $0.1-5.5\ \mu\text{m}$ ) and thermal radiation ( $1.7-260\ \mu\text{m}$ ). The solar radiation code utilises the $\delta$ -Eddington approximation (Joseph et al., 1976) in combination with an adding-layer method (Liu & Weng, 2006; Mendonca et al., 2015), whilst the thermal code considers absorptivity and emissivity (Mendonca et al., 2015). The radiation scheme uses the $k$ -distribution method to represent the gas absorption cross sections, which are integrated over 353 spectral bands and 20 Gaussian points (Mendonça & Buchhave, 2020).

In the OASIS code used for this project, the spatial distribution and radiative properties of the clouds, which are composed of sulphuric acid and water, were taken from Crisp (1986) and Mendonca et al. (2015). The main cloud deck is located between roughly $45-65\,\text{km}$ altitude with layers of sub-micron particles below and above (Knollenberg & Hunten, 1980). More details on the cloud properties can be found in Mendonca et al. (2015).

The radiative transfer code involves two steps of computation for each timestep; these are:

1. Computing the gas optics: computing the optical properties of layer boundaries from the thermodynamic variables.

2. Computing the flow of radiation: computing the upward and downward-welling fluxes at each layer boundary, from the incident flux at the top of the column in combination with the optical properties at each layer boundary.

The model uses a $k$ -distribution table (Lacis & Oinas, 1991) to calculate the optical properties of each layer across the wavelength bands using pre-computed wavelength-dependent absorption coefficients for $\text{CO}_{2}$ , $\text{SO}_{2}$ and $\text{H}_{2}\text{O}$ . These coefficients are combined with a continuum absorption, mostly from $\text{CO}_{2}$ – $\text{CO}_{2}$ collisions, and Rayleigh scattering from $\text{CO}_{2}$ and $\text{N}_{2}$ (Mendonca et al., 2015).

The radiative transfer model takes as inputs the density $\rho$ , pressure $p$ , temperature $T$ and chemical composition per grid element, as output by the dynamical core THOR (Mendonça et al., 2016), the model component that governs the resolved 3D fluid flow evolution. Heating rates are then calculated per grid element from the fluxes output by the radiative transfer model, and these heating rates update the temperature profile of the atmosphere, which serves as input back into the dynamical core. Flux is used to compute the heating rate per layer according to equation 1.

\frac{dT}{dt}=\frac{1}{\rho c_{p}}\frac{dF^{net}}{dz}

(1)

where $F^{net}$ is the spectrally integrated net radiative flux ( $\text{W}\,\text{m}^{-2}$ ), $z$ is the altitude, $\rho$ is the atmospheric density ( $\text{kg}\,\text{m}^{-3}$ ), and $c_{p}$ is the specific heat capacity at constant pressure (900 $\text{J}\,\text{kg}^{-1}\,\text{K}^{-1}$ ).
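As an illustration of equation 1, the following minimal NumPy sketch computes a per-layer heating rate from net fluxes defined at layer boundaries. The function name, the array layout (boundaries ordered from the ground upwards) and the finite-difference form of the derivative are assumptions made for this example and are not taken from the OASIS source code; the sign convention of the net flux is simply that of equation 1 as written.

```python
import numpy as np

def heating_rate(f_net, z_bounds, rho, c_p=900.0):
    """Per-layer heating rate dT/dt (K/s) following equation 1.

    f_net    : net flux at the layer boundaries, shape (n_layers + 1,), W m^-2
    z_bounds : altitude of the layer boundaries, shape (n_layers + 1,), m
    rho      : layer density, shape (n_layers,), kg m^-3
    c_p      : specific heat capacity at constant pressure, J kg^-1 K^-1
    """
    f_net = np.asarray(f_net, dtype=float)
    z_bounds = np.asarray(z_bounds, dtype=float)
    rho = np.asarray(rho, dtype=float)
    # Finite-difference estimate of dF_net/dz across each layer (assumed form).
    dF_dz = np.diff(f_net) / np.diff(z_bounds)
    return dF_dz / (rho * c_p)

# Hypothetical two-layer column, for illustration only.
rates = heating_rate([0.0, 90.0, 180.0], [0.0, 1000.0, 2000.0], [1.0, 0.5])
```

For a column with 49 layers, `f_net` and `z_bounds` would hold 50 boundary values and the result would hold 49 heating rates.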

To improve the computational efficiency of 3D simulations with radiative transfer, Venus GCMs traditionally do not update the radiative fluxes from the solar and thermal schemes at every time step (Lebonnois et al., 2016; Mendonça & Read, 2016; Mendonça & Buchhave, 2020). In the 3D simulations with explicit radiative transfer used in this study, the fluxes calculated from the solar radiation scheme were updated every 2880 steps (12 hours of model time at the 15-second time step), and the thermal radiation fluxes were updated every 320 steps (80 minutes). These values are set by the model user and are specific to the Venus simulations. Larger values could potentially cause model instabilities. Although this approach introduces some inaccuracies in the heating/cooling rates, the robustness of the simulation is not compromised because the composition of the atmosphere and clouds remains constant over time. With the efficiency of the new surrogate model described in the next section, we are now able to update radiative fluxes at every time step. To enhance the stability of the model with the new surrogate model, we also apply the three-step Adams-Bashforth method to the heating rate calculated from the surrogate radiative fluxes.
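The three-step Adams-Bashforth update mentioned above can be sketched as follows. This is a generic textbook form of the scheme applied to a temperature tendency; how OASIS stores the heating-rate history and how it bootstraps the first two steps are not described in the text, so the function name, the history layout and the lower-order start-up steps are assumptions of this example.

```python
import numpy as np

# Adams-Bashforth weights, ordered from the most recent heating rate backwards.
AB_WEIGHTS = {
    1: (1.0,),                                   # forward Euler (start-up)
    2: (3.0 / 2.0, -1.0 / 2.0),                  # two-step (start-up)
    3: (23.0 / 12.0, -16.0 / 12.0, 5.0 / 12.0),  # three-step
}

def ab_temperature_step(temperature, rate_history, dt):
    """Advance temperature by one time step of length dt (s).

    rate_history : list of heating rates (K/s), most recent first:
                   [Q_n, Q_{n-1}, Q_{n-2}]; shorter lists fall back to the
                   lower-order schemes (an assumed start-up strategy).
    """
    rates = [np.asarray(q, dtype=float) for q in rate_history[:3]]
    weights = AB_WEIGHTS[len(rates)]
    increment = sum(w * q for w, q in zip(weights, rates))
    return np.asarray(temperature, dtype=float) + dt * increment

# Hypothetical single-layer example with a heating rate growing linearly in time.
t_new = ab_temperature_step(np.array([700.0]),
                            [np.array([2.0]), np.array([1.0]), np.array([0.0])],
                            dt=1.0)
```

Because the weights sum to one, a constant heating rate reduces to a plain forward-Euler update, while a smoothly varying rate is extrapolated from the last three values.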

2.2 Data

The data used for this project is 3D data simulated by OASIS corresponding to input parameters of Venus (Mendonça & Buchhave, 2020). The data consists of over 1,000 snapshots, each taken 12 hours apart, covering a period of 500 Earth days. The data covers the entire icosahedral grid with dimensions 10,242 columns $\times$ 49 layers.

At the sample level (per atmospheric column), the data comprises the following quantities: pressure $p$ , temperature $T$ , and gas density $\rho$ , all defined per layer; upwelling short-wave flux $\text{F}^{\text{SW},\uparrow}$ , downwelling short-wave flux $\text{F}^{\text{SW},\downarrow}$ , upwelling long-wave flux $\text{F}^{\text{LW},\uparrow}$ , and downwelling long-wave flux $\text{F}^{\text{LW},\downarrow}$ , all defined per layer boundary; and cosine of the solar zenith angle $\mu$ , short-wave surface albedo $\alpha_{\text{SW}}$ , long-wave surface albedo $\alpha_{\text{LW}}$ , and surface temperature $T_{0}$ , all defined per column.

Figures illustrating the distribution of samples within the dataset as a function of $\cos\mu$ and surface temperature are contained within appendix section C.

3 Methods

3.1 Surrogate Modelling

Surrogate models, or emulators, are approximate mathematical models of outcomes of interest, whereby the emulator mechanism does not necessarily reflect the physical mechanism which produces the outcomes. Surrogate models are useful in cases where the physical mechanism producing such outcomes is not well understood, or in cases where modelling the physical mechanism is excessively computationally demanding; in the latter case, the aim is to produce a surrogate model which is computationally efficient compared to the physical model whilst maintaining accuracy. Machine learning provides a framework by which surrogate models can be produced: by the Universal Approximation Theorem, deep neural networks can (in theory) approximate arbitrarily complex non-linear relationships, thus providing a function space in which a suitable surrogate function is almost certain to exist (Goodfellow et al., 2016).

Research into surrogate modelling of exoplanetary atmospheres is relatively nascent (Himes et al., 2022, 2023; Unlu et al., 2023). The use of surrogate models within Earth Climate Science, though still new, is much more established and well-explored (Yao et al., 2023; Mukkavilli et al., 2023; Ukkonen, 2022). The development of surrogate models for atmospheric models of exoplanets can thus be informed by the development of such surrogate models for the parameter space of Earth.

An example where surrogate modelling has already proved valuable within 3D modelling of exoplanetary atmospheres is work by Schneider et al. (2024), who utilised DeepSets to achieve fast and accurate mixing of correlated- $k$ opacities. Their work exemplifies how machine learning can be leveraged to successfully speed up a single process of a GCM whilst retaining accuracy, thus establishing a basis for further exploration into the integration of surrogates within GCM frameworks.

In this work, two surrogate models were produced for 3D modelling of Venus using OASIS: one surrogate model to emulate the short-wave radiative transfer schema, and one to emulate the long-wave radiative transfer schema (see section 2.1.1 for details on the computations for the two radiation schemas).

3.2 Data Preprocessing

Data were preprocessed separately for the long-wave and short-wave regimes. Each model took two types of input data: variables defined at the grid-element level, referred to as vector variables, of dimension $(n_{\text{layers}},)=(49,)$ per column, and variables defined at the column level, referred to as scalar variables.

The short-wave surrogate model took scalar inputs of: surface temperature, $T_{0}$ ; gas density of the lowest-altitude layer, $\rho_{0}$ ; pressure of the lowest-altitude layer, $p_{0}$ ; cosine of the solar zenith angle, $\mu$ ; and short-wave surface albedo, $\alpha_{SW}$ . The long-wave surrogate model took scalar inputs of: surface temperature, $T_{0}$ ; gas density of the lowest-altitude layer, $\rho_{0}$ ; pressure of the lowest-altitude layer, $p_{0}$ ; and long-wave surface albedo, $\alpha_{LW}$ .

Both models took input of the same vector variables, which were as follows: temperature, $T$ ; pressure $p$ ; and gas density $\rho$ . Vector variables were scaled as:

\tilde{x}_{i,j}=\frac{\log_{e}(x_{i,j})}{\log_{e}(x_{i,0})}

(2)

for $x\in\{T,p\}$ , and

\tilde{x}_{i,j}=\left(\frac{x_{i,j}}{x_{i,0}}\right)^{0.25}

(3)

for $x=\rho$ , for the $i$ th column and $j$ th atmospheric level, where a tilde denotes the scaled quantity.
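The two vector scalings of equations 2 and 3 can be written compactly in NumPy. The sketch below assumes arrays of shape (number of columns, number of layers) with index 0 along the second axis holding the lowest-altitude layer; the function names are illustrative only.

```python
import numpy as np

def scale_log_ratio(x):
    """Equation 2, used for temperature and pressure.

    x : positive values, shape (n_columns, n_layers); x[:, 0] is assumed to be
        the lowest-altitude layer and must not equal 1 (its log is the divisor).
    """
    x = np.asarray(x, dtype=float)
    return np.log(x) / np.log(x[:, :1])

def scale_quarter_power(rho):
    """Equation 3, used for gas density, with the same array layout."""
    rho = np.asarray(rho, dtype=float)
    return (rho / rho[:, :1]) ** 0.25

# Hypothetical single column with two layers, for illustration only.
t_scaled = scale_log_ratio([[700.0, 350.0]])
rho_scaled = scale_quarter_power([[16.0, 1.0]])
```

Both scalings map the lowest layer of every column to exactly 1, so the network sees each profile relative to its own near-surface value.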

Scalar variables were then re-scaled as

x_{i}^{\text{scaled}}=\frac{x_{i}-\min\limits_{i\in S_{\text{train}}}{x_{i}}}{\max\limits_{i\in S_{\text{train}}}{x_{i}}-\min\limits_{i\in S_{\text{train}}}{x_{i}}}

(4)

for $x\in\{T_{0},p_{0},\rho_{0}\}$ , where $S_{\text{train}}$ is the training set.
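A minimal sketch of the min-max scaling in equation 4 is given below. The key point it illustrates is that the minimum and maximum are taken over the training set only and then reused for any other data; the function names are hypothetical.

```python
import numpy as np

def fit_min_max(train_values):
    """Return (min, max) of a scalar variable over the training set."""
    train_values = np.asarray(train_values, dtype=float)
    return train_values.min(), train_values.max()

def apply_min_max(x, lo, hi):
    """Equation 4: rescale x using statistics fitted on the training set."""
    return (np.asarray(x, dtype=float) - lo) / (hi - lo)

# Hypothetical surface temperatures (K), for illustration only.
lo, hi = fit_min_max([700.0, 720.0, 740.0])
scaled = apply_min_max([730.0, 745.0], lo, hi)
```

Values outside the training range, such as the second entry above, map outside the interval [0, 1], which is one way to spot inputs the surrogate has not been trained on.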

Targets were scaled as follows:

u^{\text{LW}}_{j}=\frac{y^{\text{LW}}_{j}}{A^{\text{LW}}}

(5)

for $y_{j}\in\{F_{j}^{\text{LW},\uparrow},F_{j}^{\text{LW},\downarrow}\}$ and

u^{\text{SW}}_{j}=\frac{y_{j}^{\text{SW}}}{B^{\text{SW}}}

(6)

for $y_{j}\in\{F_{j}^{\text{SW},\uparrow},F_{j}^{\text{SW},\downarrow}\}$ , for the $j$ th altitude level ( $j\in[0,49]$ where 0 indexes the ground level and 49 indexes the top level of the atmospheric column), where $A^{\text{LW}}$ and $B^{\text{SW}}$ are scaling factors linear in $T_{0}$ and $\mu$ respectively, and fitted from the data:

A^{\text{LW}}(T_{0})=a_{1}\cdot T_{0}+a_{2}

(7)

B^{\text{SW}}(\mu)=b_{1}\cdot\mu+b_{2}

(8)

where $(a_{1},a_{2})$ are constants fitted using $y^{\text{LW},\uparrow}_{0}$ and $(b_{1},b_{2})$ are fitted using $y^{\text{SW},\downarrow}_{49}$ across all columns of the training set $S_{\text{train}}$ . Values of $(a_{1},a_{2},b_{1},b_{2})$ are retained for post-processing of the model predictions. Residuals between the targets $y^{\text{LW},\uparrow}_{0}$ , $y^{\text{SW},\downarrow}_{49}$ , and the respective scaling factors approximating the value of these targets, are displayed in figure 1. These simple linear scaling methods were chosen for data pre-processing in this work instead of exact computations of $y^{\text{LW},\uparrow}_{0}$ and $y^{\text{SW},\downarrow}_{49}$ , as the latter are much more involved, and the simpler computations produce results with an acceptably small difference in the values of the fluxes.
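The target scaling of equations 5 to 8, together with the inverse step needed when post-processing predictions, can be sketched as follows. The text states only that the linear coefficients are fitted from the data; an ordinary least-squares fit via `np.polyfit` is assumed here, and all function names are illustrative.

```python
import numpy as np

def fit_linear_scale(predictor, reference_flux):
    """Fit factor = slope * predictor + intercept (equations 7 and 8).

    predictor      : T_0 (long-wave) or mu (short-wave), shape (n_columns,)
    reference_flux : flux the factor should approximate, shape (n_columns,)
    Least squares is an assumption; the fitting method is not specified.
    """
    slope, intercept = np.polyfit(predictor, reference_flux, deg=1)
    return slope, intercept

def scale_targets(flux, predictor, slope, intercept):
    """Equations 5 and 6: divide each flux profile by its column's factor."""
    factor = slope * np.asarray(predictor, dtype=float) + intercept
    return np.asarray(flux, dtype=float) / factor[:, None]

def unscale_predictions(u, predictor, slope, intercept):
    """Post-processing: recover fluxes in W m^-2 from scaled predictions."""
    factor = slope * np.asarray(predictor, dtype=float) + intercept
    return np.asarray(u, dtype=float) * factor[:, None]

# Hypothetical long-wave example: four columns, three levels.
t0 = np.array([700.0, 710.0, 720.0, 730.0])
surface_flux = 2.0 * t0 + 5.0                      # stand-in reference values
flux = np.outer(surface_flux, [1.0, 0.5, 0.25])    # stand-in flux profiles
a1, a2 = fit_linear_scale(t0, surface_flux)
u = scale_targets(flux, t0, a1, a2)
recovered = unscale_predictions(u, t0, a1, a2)
```

The same three functions serve the short-wave case by passing the cosine of the solar zenith angle as the predictor; the factor must stay non-zero for every column for the division to be well defined.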

Columns across all epochs were shuffled and split into train, test and validation datasets, in the ratio $70:15:15$ .
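One way to realise such a shuffled 70:15:15 split is to permute the column indices once and slice the permutation, as in the sketch below. The random seed, the function name and the rounding of the subset sizes are assumptions of this example.

```python
import numpy as np

def split_indices(n_samples, fractions=(0.70, 0.15, 0.15), seed=0):
    """Shuffle sample indices and split them into train, test and validation."""
    rng = np.random.default_rng(seed)
    perm = rng.permutation(n_samples)
    n_train = int(round(fractions[0] * n_samples))
    n_test = int(round(fractions[1] * n_samples))
    train = perm[:n_train]
    test = perm[n_train:n_train + n_test]
    val = perm[n_train + n_test:]  # remainder goes to validation
    return train, test, val

# Hypothetical dataset of 1000 columns, for illustration only.
train_idx, test_idx, val_idx = split_indices(1000)
```

Indexing every input and target array with the same index arrays keeps each column's profiles, scalars and fluxes aligned across the three subsets.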

Figure 1: The figures above display scatter plots illustrating the residuals between scaling factors $A^{\text{LW}}(T_{0})$ and $B^{\text{SW}}(\mu)$ and their respective targets $y^{\text{LW},\uparrow}_{0}$ and $y^{\text{SW},\downarrow}_{49}$ , plotted over one epoch of 10,242 test columns covering the entire icosahedral grid. Left: This panel shows the residuals between the scaling factors $B^{\text{SW}}(\mu_{i})$ used for the short-wave targets of a given column $i$ and the downward-welling short-wave flux at the top of the column $\text{F}^{\text{SW},\downarrow}_{i,49}$ . $B^{\text{SW}}(\mu_{i})$ , a linear function of the cosine of the solar zenith angle $\mu_{i}$ , approximates $\text{F}^{\text{SW},\downarrow}_{i,49}$ . Right: This panel displays the residuals between the scaling factors $A^{\text{LW}}(T_{i,0})$ used for the long-wave targets of a given column $i$ and the upward-welling long-wave flux at the ground level $\text{F}^{\text{LW},\uparrow}_{i,0}$ . Here, $A^{\text{LW}}(T_{i,0})$ approximates $\text{F}^{\text{LW},\uparrow}_{i,0}$ as a linear function of surface temperature $T_{i,0}$ . Both: In both the long-wave and short-wave cases, the preferred quantities $\text{F}^{\text{LW},\uparrow}_{i,0}$ and $\text{F}^{\text{SW},\downarrow}_{i,49}$ to use for scaling flux profiles across atmospheric levels involve complex calculations. The figures demonstrate that simple linear functions $A^{\text{LW}}(T_{0})$ and $B^{\text{SW}}(\mu)$ yield close approximations with low residuals, making them suitable scaling factors instead.

3.3 Model Architecture

The model architecture was chosen to be based on recurrent neural networks (RNNs), as RNNs structurally incorporate the spatial dependence of the training data, which fits naturally in this scenario. RNN layers were implemented in the form of gated recurrent units (GRUs). The specific architecture of the benchmark was chosen to be that used by Ukkonen (2022) (illustrated in figure 2), which utilised a bi-directional RNN-based architecture to create a two-stream radiative transfer emulator for Earth, trained using observational data. Ukkonen’s model performed with $\leq 0.5\%$ mean absolute error for the upwelling and downwelling fluxes on the test set (Ukkonen, 2022), suggesting its potential efficacy for developing surrogates trained on analogous simulated data. The models used in this work were constructed and trained using TensorFlow version 2.12.0 (Abadi et al., 2016), and converted into ONNX format for integration within OASIS. Input data to the surrogate model are detailed in table 1, and surrogate model parameters are detailed in appendix section A.

Surrogate Schema	Short-wave	Long-wave
Vector Inputs	$\vec{X}_{i}=(\tilde{p}_{i},\tilde{T}_{i},\tilde{\rho}_{i})$	$\vec{X}_{i}=(\tilde{p}_{i},\tilde{T}_{i},\tilde{\rho}_{i})$
Scalar Inputs	$\vec{\beta}_{SW}=(\mu,p_{0},T_{0},\rho_{0},\alpha_{SW})$	$\vec{\beta}_{LW}=(p_{0},T_{0},\rho_{0},\alpha_{LW})$

Table 1: Surrogate model inputs: Surrogate models for both the long-wave and short-wave schemas took scaled vector inputs $\vec{X}_{i}=(\tilde{p}_{i},\tilde{T}_{i},\tilde{\rho}_{i})$ for the $i$ th layer, where $\tilde{p}_{i},\tilde{T}_{i},\tilde{\rho}_{i}$ are the scaled pressure, scaled temperature and scaled density of the $i$ th layer of the input atmospheric column, respectively (see 3.2 for details on scaling). The surrogate models also took scalar inputs $\vec{\beta}$ , where $\vec{\beta}_{SW}=(\mu,p_{0},T_{0},\rho_{0},\alpha_{SW})$ for the short-wave surrogate model, and $\vec{\beta}_{LW}=(p_{0},T_{0},\rho_{0},\alpha_{LW})$ for the long-wave surrogate model, where $\mu$ and $\alpha$ represent the cosine of the solar zenith angle and surface albedo, respectively.

3.4 Model Training

Both models were trained in a supervised, end-to-end fashion. An Adam (Kingma & Ba, 2017) optimiser was used in combination with a cyclical learning rate. Models were trained using TensorFlow on an NVIDIA A100 GPU.

3.4.1 Loss Function

The loss function was constructed as a combination of the mean percentage errors between the predictions and targets, per output. The mean absolute error (ME) for the $i$ th test column and $k$ th target variable is defined as follows:

\text{ME}_{i,k}=\sum_{j=0}^{n_{\text{levels}}-1}\frac{\left|\hat{y}_{i,j,k}-\tilde{y}_{i,j,k}\right|}{n_{\text{levels}}}

(9)

where $\hat{y}_{i,j,k}$ is the prediction and $\tilde{y}_{i,j,k}$ is the target for the $i$ th test column, $j$ th atmospheric level, and $k$ th target variable, where $k=0$ corresponds to down-welling flux and $k=1$ corresponds to up-welling flux, and $n_{\text{levels}}$ is the number of atmospheric levels ( $n_{\text{levels}}=50$ in this work). Normalisation factors were defined as

\text{norm}_{i,k}=\frac{\sum_{j=0}^{n_{\text{levels}}-1}\left|\tilde{y}_{i,j,k}\right|}{n_{\text{levels}}}

(10)

such that mean percentage error (MPE) of the $i$ th test column and $k$ th target variable can be expressed as

\text{MPE}_{i,k}=\frac{\text{ME}_{i,k}}{\text{norm}_{i,k}}

(11)

The loss function per sample was then defined as

\text{loss}_{i}=\frac{1}{2}\sum_{k}\text{MPE}_{i,k}

(12)

with the total loss defined as the sum over all test samples:

\text{loss}=\frac{1}{2}\sum_{i}\sum_{k}\text{MPE}_{i,k}

(13)
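Equations 9 to 13 can be sketched in NumPy as below, with `pred` holding the model predictions and `target` the reference fluxes, both of shape (samples, levels, target variables). The models in this work were trained with TensorFlow; this NumPy version only illustrates the arithmetic and is not the training code.

```python
import numpy as np

def mpe_loss(pred, target):
    """Per-sample and total loss following equations 9 to 13.

    pred, target : arrays of shape (n_samples, n_levels, n_targets)
    Returns (per_sample_loss, total_loss).
    """
    pred = np.asarray(pred, dtype=float)
    target = np.asarray(target, dtype=float)
    me = np.mean(np.abs(pred - target), axis=1)   # equation 9
    norm = np.mean(np.abs(target), axis=1)        # equation 10
    mpe = me / norm                               # equation 11
    per_sample = 0.5 * np.sum(mpe, axis=1)        # equation 12
    return per_sample, np.sum(per_sample)         # equation 13

# Hypothetical data: 2 columns, 50 levels, 2 target variables,
# with every prediction 10% above its target.
target = np.ones((2, 50, 2))
per_sample, total = mpe_loss(1.1 * target, target)
```

Normalising by the column-mean target magnitude, rather than level by level, avoids dividing by the near-zero fluxes that occur at some altitudes.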

3.4.2 Hyperparameter Tuning

Multiple models were trained with different hyperparameter values. The number of neurons in each RNN layer was varied across the values $[16,32,64,128]$ for both surrogate models. The best candidate models had 128 neurons per RNN layer for the short-wave surrogate model, and 32 neurons per RNN layer for the long-wave surrogate model.

3.5 Performance Analysis

Below, we detail the metrics used to analyse the performance of the surrogate models on the test set. In the results (section 4), different aggregations of absolute error (equation 15 for raw model outputs and equation 14 for postprocessed model outputs) are used to investigate the performance of both the long-wave and short-wave surrogate models.

\text{Absolute Error}\equiv\text{AE}_{i,j,k}=\left|\hat{y}_{i,j,k}-\tilde{y}_{i,j,k}\right|

(14)

where $\hat{y}_{i,j,k}$ are the post-processed model predictions, and $\tilde{y}_{i,j,k}$ are the unscaled target variables.

\widehat{\text{AE}}_{i,j,k}=\left|\hat{u}_{i,j,k}-\tilde{u}_{i,j,k}\right|

(15)

where $\hat{u}_{i,j,k}$ are the raw model predictions, and $\tilde{u}_{i,j,k}$ are target variables which have been scaled to lie in the interval $[0,1]$ using the scaling methods detailed in section 3.2.

Column-aggregated error quantities are defined as follows:

\text{CAE}_{i,k}=\sum_{j=0}^{n_{\text{levels}}-1}\left|\hat{y}_{i,j,k}-\tilde{y}_{i,j,k}\right|

(16)

\widehat{\text{CAE}}_{i,k}=\sum_{j=0}^{n_{\text{levels}}-1}\left|\hat{u}_{i,j,k}-\tilde{u}_{i,j,k}\right|

(17)

where $n_{\text{levels}}$ is the number of atmospheric levels.

Error quantities averaged across samples per altitude level are defined as follows:

\text{MAE}_{j,k}=\frac{\sum_{i=0}^{N-1}\left|\hat{y}_{i,j,k}-\tilde{y}_{i,j,k}\right|}{N}

(18)

\widehat{\text{MAE}}_{j,k}=\frac{\sum_{i=0}^{N-1}\left|\hat{u}_{i,j,k}-\tilde{u}_{i,j,k}\right|}{N}

(19)

where $N$ is the number of test samples.

Mean flux for the $k$ th target variable is defined as:

\text{Mean Flux}_{k}\equiv\bar{F}_{k}=\frac{\sum_{i=0}^{N-1}\sum_{j=0}^{n_{\text{levels}}-1}{\tilde{y}_{i,j,k}}}{N}

(20)

and the mean absolute error for the $k$ th target variable aggregated across all altitude levels is calculated as

\text{MAE}_{k}=\sum_{j=0}^{n_{\text{levels}}-1}{\text{MAE}_{j,k}}

(21)
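The aggregations defined in equations 14, 16, 18, 20 and 21 follow directly from one array of absolute errors, as the sketch below shows for post-processed predictions. The function name and the dictionary keys are illustrative; the same code applied to the raw outputs and scaled targets gives the hatted quantities of equations 15, 17 and 19.

```python
import numpy as np

def error_metrics(pred, target):
    """Error aggregations of section 3.5, implemented as written there.

    pred, target : arrays of shape (n_samples, n_levels, n_targets)
    """
    pred = np.asarray(pred, dtype=float)
    target = np.asarray(target, dtype=float)
    n_samples = target.shape[0]
    ae = np.abs(pred - target)                # equation 14
    cae = ae.sum(axis=1)                      # equation 16, per column
    mae_level = ae.sum(axis=0) / n_samples    # equation 18, per level
    mean_flux = target.sum(axis=(0, 1)) / n_samples  # equation 20
    mae_k = mae_level.sum(axis=0)             # equation 21
    return {
        "AE": ae,
        "CAE": cae,
        "MAE_level": mae_level,
        "mean_flux": mean_flux,
        "MAE_k": mae_k,
        "relative_MAE": mae_k / mean_flux,
    }

# Hypothetical data: 4 columns, 50 levels, 2 targets, constant error of 1 W m^-2.
target = np.full((4, 50, 2), 10.0)
metrics = error_metrics(target + 1.0, target)
```

Since equations 20 and 21 both sum over altitude levels, their ratio is unaffected by that common factor.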

4 Results & Discussion

Regime	Stream	Mean Flux $\bar{\textbf{F}}_{k}$ ( $\text{W}\,\text{m}^{-2}$ )	$\textbf{MAE}_{k}$ ( $\text{W}\,\text{m}^{-2}$ )	$\textbf{MAE}_{k}$ / $\bar{\textbf{F}}_{k}$
Long-wave	Upwelling	4211.0	18.8	0.45 %
	Downwelling	4139.3	16.9	0.41 %
Short-wave	Upwelling	577.9	6.4	1.11 %
	Downwelling	707.8	7.7	1.09 %

Table 2: The above table summarises the mean absolute error ( $\text{MAE}_{k}$ , defined in equation 21) across the four target variables, and relative to the mean values $\bar{F}_{k}$ of these four variables (as defined in equation 20), across the test set.

4.1 Model Performance on Test Set

Table 2 summarises the mean absolute error (MAE, equation 21) across the four target variables, and relative to the mean values of these four variables, across the test set. These MAE values are in line with those achieved using similar surrogate modelling methods for radiative transfer within the Earth’s atmosphere, with Ukkonen (2022) quoting MAE for short-wave fluxes of around 1% or less. For both the long-wave and short-wave regimes, the relative MAE is higher for the upwelling fluxes than for the downwelling fluxes: this is expected, as the physical computation of the upwelling flux depends on the computation of the downwelling flux, making this a more complicated mapping to emulate. The percentage errors for the short-wave targets are a factor of 2-3 greater than those for the long-wave targets: this is to be expected, as the magnitude of the short-wave targets is smaller, and the supervised learning task set for the short-wave model in this work is a more complex mapping from inputs to outputs than that for the long-wave model.

In the following subsections, we visualise and interrogate the variation of model prediction errors with altitude and with scalar variables: $\cos\mu$ for both long-wave and short-wave model predictions, and surface temperature for long-wave model predictions only. For completeness, further plots of error as a function of the remaining input scalar variables (surface temperature, surface pressure, surface gas density) are contained in appendix section D.

4.1.1 Short-Wave Surrogate Model

A. Variation of error vs. altitude
Figure 3 displays the average absolute error of predictions at different altitude levels across test samples. Below around $65\,\text{km}$ , the average error increases roughly in proportion to the average target fluxes. Above this altitude, the average error decreases for both target variables, even though the average values of these variables continue to rise with altitude. This change in the trend of the average error occurs around the top of the cloud deck.

Potential explanations as to why there is lower error in predicting targets above $65\,\text{km}$ are as follows: above the top of the cloud deck, there is less complexity in mapping from input variables to output fluxes, and so this can be naively assumed to be an easier task to learn; there is also tighter variance in the target variables within the test set above this altitude threshold.

Plots A and B of figure 3 display mean errors of less than 3% across all altitude levels, averaged across test samples in the [5, 95] percentile interval of column-aggregated errors. This accuracy falls within an acceptably small margin of error.
B. Variation of error vs. $\cos\mu$
Figure 5 displays the distribution in test errors as a function of $\cos\mu$ of the test column. Data in these plots have been binned to more simply display the spread of errors for a given $\cos\mu$ interval. For values of $\cos\mu$ close to 1, there is a narrower distribution of error, with test samples being predicted reliably more accurately as compared to test samples corresponding to lower values of $\cos\mu$ .

This variation in test error distribution as a function of $\cos\mu$ may be attributable to the approximations used during target preprocessing when training the model (see section 3.2); if indeed this is the case, then there is scope to mitigate against this by refining data preprocessing methods when producing future iterations of this surrogate model. This error variation with $\cos\mu$ may otherwise be due to the model not exactly capturing the complexities in how $\cos\mu$ is used in mapping inputs to outputs, which may be an acceptable and necessary trade-off in using a surrogate model for the purposes of model speed-up.

4.1.2 Long-Wave Surrogate Model

A. Variation of error vs. altitude
Figure 4 displays the average absolute error of predictions at different altitude levels across test samples. A similar trend to that described in 4.1.1 can be seen in these plots: the average magnitude of error roughly follows the same trend as the average magnitude of the target variables, except between $40-70\,\text{km}$ altitude (roughly the interval of the main cloud deck), where the error rises significantly at the top of the cloud deck and decreases going deeper into the cloud deck, for both target variables.

Below the cloud deck, plots A and B show that the mean error in target predictions is less than 5% of the mean target magnitude, for the given percentile interval of test samples. Above the cloud deck, the long-wave fluxes tend to zero, so although the percentage errors increase with altitude, these errors still fall within a reasonable margin for the model.

B. Variation of error vs. $\cos\mu$ and surface temperature
Variations in test error versus surface temperature (displayed in figure 5) appear to follow the same trend as the residuals in the scaling factor used in preprocessing (displayed in figure 1). This is promising, as it may indicate that scaling residuals are the limiting factor in model accuracy, and these residuals were initially deemed acceptably small for the purpose of this work. For the variation in test error versus the cosine of the solar zenith angle ($\cos\mu$), also displayed in figure 5, there is no discernible trend, though the error distribution appears narrowest for $\cos\mu$ approaching 1.

4.2 Model Performance in Simulation

To evaluate the performance of our new surrogate model in 3D simulations of Venus’s atmosphere, we ran two OASIS simulations: one using OASIS-RT for the radiative transfer scheme and the other using the new surrogate model presented in this work. To run the 3D simulation efficiently with OASIS-RT, as detailed in section 2.1.1, we updated the solar radiative fluxes every 2880 steps and the thermal radiative fluxes every 320 steps. Both simulations were run for 5 Venus solar days, each of which is approximately 117 Earth days.

Figure 6 displays the temperature of the simulation using OASIS-RT, averaged over the last simulated Venus day, in the left-hand plot, and the percentage difference in this quantity between the two simulations in the right-hand plot. Beneath the bottom of the cloud deck, the percentage difference between the time-averaged temperature profiles produced by the two simulations is below 1.5%; in the interval of the cloud deck, below 2.7%; and above the cloud deck, below 4.0%. These deviations in the final temperature profiles fall within the range of uncertainties in the measurements. This is a reasonable criterion for two main reasons: firstly, there will be inherent discrepancies between simulated and real temperatures arising from assumptions made in the physical model, and secondly, deviations will also naturally arise between the physical model and real temperature profiles due to the spatial resolution of the simulation. We also expected differences to arise from the frequency at which the radiative fluxes are updated: with the surrogate model, it is possible to update them at every physical time step.
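To make the update schedule and the comparison metric concrete, the following Python sketch counts how often a radiative transfer scheme is called for a given update interval and computes a per-level percentage difference between two profiles. It assumes that an update occurs at step 0 and at every multiple of the interval thereafter, and that the percentage difference is taken relative to the reference (OASIS-RT) profile; the actual scheduling logic in OASIS and the exact metric behind figure 6 may differ, and all function names are hypothetical.

```python
def rt_update_steps(n_steps, interval):
    """0-based steps at which fluxes are recomputed when the scheme is
    called once every `interval` dynamical steps, starting at step 0."""
    return list(range(0, n_steps, interval))


def count_rt_calls(n_steps, sw_interval=2880, lw_interval=320):
    """Number of short-wave and long-wave scheme calls in n_steps steps."""
    return {
        "short_wave": len(rt_update_steps(n_steps, sw_interval)),
        "long_wave": len(rt_update_steps(n_steps, lw_interval)),
    }


def percent_difference(reference, other):
    """Per-level percentage difference of `other` relative to `reference`."""
    return [100.0 * abs(o - r) / abs(r) for r, o in zip(reference, other)]
```

Under these assumptions, calling `count_rt_calls(1000)` gives one short-wave and four long-wave calls, whereas `count_rt_calls(1000, 1, 1)` gives 1000 calls of each, which indicates why updating a conventional scheme at every step is costly.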

4.3 Speed-up

The surrogate-integrated GCM, integrated for 5 Venus solar days, has been shown in figure 6 to produce temperature profiles with an acceptably small deviation from those produced over the same simulated duration by the original model. The two simulations described in the previous section had similar computational performance. However, when we compare OASIS simulations using OASIS-RT with the fluxes updated at every timestep against OASIS simulations using the new surrogate model, we find a speed-up by a factor of 101 for an integration of only 1000 timesteps. Additionally, a large amount of GPU global memory is saved in the simulations with the new surrogate model, since it does not require storing opacity cross-sections for gas absorption or clouds. With the potential to update the radiative fluxes at every physical timestep, our new approach allows for the inclusion of dynamical cloud feedback at a small computational cost. This addresses one of the main limitations of current 3D Venus atmospheric models.

4.4 Limitations

The surrogate models produced in this work do not take explicit input of the structure of the cloud and gas absorber constituents of the atmosphere being modelled, except for the gas density, $\rho_{i,j}$. The learning objective for the surrogate models is therefore to approximate the mapping from input thermodynamic column variables to output columnar flux profiles, conditioned on a specific cloud and absorber structure. Consequently, the models produced in this work are only applicable to planets with the planetary parameters and the cloud and absorber structure specific to Venus. This limits the generalisability of the surrogate-integrated GCM to other types of atmosphere.

5 Conclusions

This work introduces a surrogate-model approach to replacing the numerical short-wave and long-wave computations of a two-stream radiative transfer model, aimed at accelerating the Global Circulation Model (GCM) OASIS. The results show a significant GCM speed-up, by a factor of 101 on GPU, with surrogate models for both the long-wave and short-wave regimes achieving test-set errors of approximately 1%. Additionally, this approach replicates the temperature profile of the original Venus simulation, averaged across a Venus solar day, with differences below 4% after 5 Venus solar days of simulation.

This work is significant in that it enables:

1. $\sim 100\times$ faster simulations of planets with massive atmospheres that require complex radiative transfer schemes, such as the Venus atmosphere.

2. Longer simulations with a much higher spatial resolution ($\sim 10\times$).

3. Improved representation of the temperature evolution of short-term physical phenomena in the atmosphere. These can be atmospheric waves with timescales shorter than the period at which the radiative fluxes are updated in the simulation. In the case of our Venus simulations, we can measure the temperature change of atmospheric waves with timescales $<12$ hours.

4. A model free of model tuning to optimize performance, such as the frequency of how the radiative fluxes are updated.

5. The inclusion in 3D simulations of cloud dynamical feedback or higher-order, more complex radiative schemes with a small extra computational cost.

Achieving faster and/or higher spatial resolution atmospheric simulations will facilitate better insight into the nature of the atmosphere of Venus, as well as benchmarking the utility and applicability of such modelling techniques for use in exoplanet science.

Acknowledgements

We thank Dr Ahmed Al-Refaie, Nikita Pond and Max Hart for helpful conversations on integrating TensorFlow models within a C/C++ framework.

The authors acknowledge the use of the High Performance Computing facilities of University College London to carry out this work, specifically the Hypatia cluster. This work used computing equipment funded by the Research Capital Investment Fund (RCIF) provided by UKRI, and partially funded by the UCL Cosmoparticle Initiative. This research received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No. 758892/ExoAI), and from the Science and Technology Facilities Council (STFC; grant No. ST/W00254X/1 and grant No. ST/W50788X/1).

Data Availability

The code for this project will be made publicly available at https://github.com/ttahseen/oasis-rt-surrogate on acceptance of this paper.

References

Abadi et al. (2016) Abadi M., et al., 2016, TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems, doi:10.48550/arXiv.1603.04467, http://arxiv.org/abs/1603.04467
Agarap (2019) Agarap A. F., 2019, Deep Learning using Rectified Linear Units (ReLU), doi:10.48550/arXiv.1803.08375, http://arxiv.org/abs/1803.08375
Changeat & Al-Refaie (2020) Changeat Q., Al-Refaie A., 2020, The Astrophysical Journal, 898, 155
Chubb & Min (2022) Chubb K. L., Min M., 2022, Astronomy & Astrophysics, 665, A2
Crisp (1986) Crisp D., 1986, Icarus, 67, 484
Deitrick et al. (2020) Deitrick R., Mendonça J. M., Schroffenegger U., Grimm S. L., Tsai S.-M., Heng K., 2020, The Astrophysical Journal Supplement Series, 248, 30
Eymet et al. (2009) Eymet V., Fournier R., Dufresne J.-L., Lebonnois S., Hourdin F., Bullock M. A., 2009, Journal of Geophysical Research: Planets, 114
Feng et al. (2020) Feng Y. K., Line M. R., Fortney J. J., 2020, The Astronomical Journal, 160, 137
Gardner et al. (2006) Gardner J. P., et al., 2006, Space Science Reviews, 123, 485
Himes et al. (2022) Himes M. D., et al., 2022, The Planetary Science Journal, 3, 91
Himes et al. (2023) Himes M. D., Harrington J., Baydin A. G., 2023, Towards 3D Retrieval of Exoplanet Atmospheres: Assessing Thermochemical Equilibrium Estimation Methods, doi:10.48550/arXiv.2304.00073, http://arxiv.org/abs/2304.00073
Goodfellow et al. (2016) Goodfellow I., Bengio Y., Courville A., 2016, Deep Learning. MIT Press, http://www.deeplearningbook.org
Irwin et al. (2020) Irwin P. G. J., Parmentier V., Taylor J., Barstow J., Aigrain S., Lee E. K. H., Garland R., 2020, Monthly Notices of the Royal Astronomical Society, 493, 106
Joseph et al. (1976) Joseph J. H., Wiscombe W. J., Weinman J. A., 1976, Journal of the Atmospheric Sciences, 33, 2452
Kingma & Ba (2017) Kingma D. P., Ba J., 2017, Adam: A Method for Stochastic Optimization, doi:10.48550/arXiv.1412.6980, http://arxiv.org/abs/1412.6980
Knollenberg & Hunten (1980) Knollenberg R. G., Hunten D. M., 1980, Journal of Geophysical Research: Space Physics, 85, 8039
Lacis & Oinas (1991) Lacis A. A., Oinas V., 1991, Journal of Geophysical Research: Atmospheres, 96, 9027
Lebonnois et al. (2016) Lebonnois S., Sugimoto N., Gilli G., 2016, Icarus, 278, 38
Lee & Richardson (2012) Lee C., Richardson M. I., 2012, Icarus, 221, 1173
Liu & Weng (2006) Liu Q., Weng F., 2006, Journal of the Atmospheric Sciences, 63, 3459
Mendonca et al. (2014) Mendonca J., Read P., Wilson C., Lee C., 2014, Planetary and Space Science, 105
Mendonca et al. (2015) Mendonca J. M., Read P. L., Wilson C. F., Lee C., 2015, Planetary and Space Science, 105, 80
Mendonça & Buchhave (2020) Mendonça J. M., Buchhave L. A., 2020, Monthly Notices of the Royal Astronomical Society, 496, 3512
Mendonça & Read (2016) Mendonça J. M., Read P. L., 2016, Planetary and Space Science, 134, 1
Mendonça et al. (2016) Mendonça J. M., Grimm S. L., Grosheintz L., Heng K., 2016, The Astrophysical Journal, 829, 115
Mukkavilli et al. (2023) Mukkavilli S. K., et al., 2023, AI Foundation Models for Weather and Climate: Applications, Design, and Implementation, doi:10.48550/arXiv.2309.10808, http://arxiv.org/abs/2309.10808
Nixon & Madhusudhan (2022) Nixon M. C., Madhusudhan N., 2022, The Astrophysical Journal, 935, 73
Pluriel et al. (2020) Pluriel W., Zingales T., Leconte J., Parmentier V., 2020, Astronomy and Astrophysics, 636, A66
Sagan (1962) Sagan C., 1962, The Physical Environment of Venus: Models and Prospects. https://ui.adsabs.harvard.edu/abs/1962saa..conf..430S
Schneider et al. (2024) Schneider A. D., Mollière P., Louppe G., Carone L., Jørgensen U. G., Decin L., Helling C., 2024, Astronomy & Astrophysics, 682, A79
Tinetti et al. (2021) Tinetti G., et al., 2021, Ariel: Enabling planetary science across light-years, doi:10.48550/arXiv.2104.04824, https://ui.adsabs.harvard.edu/abs/2021arXiv210404824T
Tomasko et al. (1980) Tomasko M. G., Doose L. R., Smith P. H., Odell A. P., 1980, Journal of Geophysical Research: Space Physics, 85, 8167
Ukkonen (2022) Ukkonen P., 2022, Journal of Advances in Modeling Earth Systems, 14, e2021MS002875
Unlu et al. (2023) Unlu E. B., Forestano R. T., Matchev K. T., Matcheva K., 2023, Reproducing Bayesian Posterior Distributions for Exoplanet Atmospheric Parameter Retrievals with a Machine Learning Surrogate Model, http://arxiv.org/abs/2310.10521
Yao et al. (2023) Yao Y., Zhong X., Zheng Y., Wang Z., 2023, Journal of Advances in Modeling Earth Systems, 15, e2022MS003445

Appendix A Model Parameters

The tables below display the number of model parameters per layer of the two surrogate models produced in this work.

A.1 Short-Wave Surrogate Model

Layer	Output Shape	Number of Parameters
Main Inputs	[( $n$ , 49, 3)]	0
Auxiliary Inputs	[( $n$ , 5)]	0
$\text{GRU}_{\downarrow}$	[( $n$ , 49, 128), ( $n$ , 128)]	51,072
$\text{Dense}_{1}$	[( $n$ , 128)]	17,152
$\text{GRU}_{\uparrow}$	[( $n$ , 50, 128)]	99,072
$\text{Dense}_{\text{out}}$	[( $n$ , 50, 2)]	514
Total number of model parameters:		167,810

Table 3: This table displays the number of parameters across model layers for the short-wave surrogate model, as well as the output shapes of each layer. $n$ denotes the number of atmospheric columns passed as input to the model; layers correspond to those displayed in figure 2.

A.2 Long-Wave Surrogate Model

Layer	Output Shape	Number of Parameters
Main Inputs	[( $n$ , 49, 3)]	0
Auxiliary Inputs	[( $n$ , 4)]	0
$\text{GRU}_{\downarrow}$	[( $n$ , 49, 32), ( $n$ , 32)]	3,552
$\text{Dense}_{1}$	[( $n$ , 32)]	1,184
$\text{GRU}_{\uparrow}$	[( $n$ , 50, 32)]	6,336
$\text{Dense}_{\text{out}}$	[( $n$ , 50, 2)]	130
Total number of model parameters:		11,202

Table 4: This table displays the number of parameters across model layers for the long-wave surrogate model, as well as the output shapes of each layer.
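
The parameter counts in Tables 3 and 4 can be cross-checked with a short calculation. The Python sketch below uses the standard parameter count of a Keras-style GRU layer with `reset_after=True` (three gates, each with an input kernel, a recurrent kernel and two bias vectors) and of a dense layer. The input widths of $\text{Dense}_{1}$, $\text{GRU}_{\uparrow}$ and $\text{Dense}_{\text{out}}$ are not stated in the tables; the values used here are assumptions inferred because they reproduce the tabulated counts, and the function names are hypothetical.

```python
def gru_params(input_dim, units):
    """Parameters of a GRU layer in the Keras reset_after=True convention:
    3 gates x (input kernel + recurrent kernel + 2 bias vectors)."""
    return 3 * (units * (input_dim + units) + 2 * units)


def dense_params(input_dim, units):
    """Parameters of a fully connected layer: weights plus biases."""
    return input_dim * units + units


def surrogate_params(units, n_aux, n_main=3, n_out=2):
    """Per-layer and total parameter counts for one surrogate model."""
    layers = {
        "GRU_down": gru_params(n_main, units),
        # Assumed input: final GRU state concatenated with auxiliary inputs.
        "Dense_1": dense_params(units + n_aux, units),
        # Assumed input width equal to the number of hidden units.
        "GRU_up": gru_params(units, units),
        # Assumed input: two concatenated sequences of width `units`.
        "Dense_out": dense_params(2 * units, n_out),
    }
    layers["total"] = sum(layers.values())
    return layers


short_wave = surrogate_params(units=128, n_aux=5)
long_wave = surrogate_params(units=32, n_aux=4)
```

With these assumptions, `short_wave["total"]` and `long_wave["total"]` evaluate to 167,810 and 11,202 respectively, matching the totals in Tables 3 and 4.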