Preheating with deep learning

Jong-Hyun Yoon, Simon Cléry, Mathieu Gross and Yann Mambrini

Université Paris-Saclay, CNRS/IN2P3, IJCLab, 91405 Orsay, France

Abstract

We apply deep learning techniques to the late-time turbulent regime in a post-inflationary model where a real scalar inflaton field and the standard model Higgs doublet interact with renormalizable couplings between them. After inflation, the inflaton decays into the Higgs through a trilinear coupling and the Higgs field subsequently thermalizes with gauge bosons via its SU(2)×U(1)𝑆𝑈2𝑈1SU(2)\times U(1)italic_S italic_U ( 2 ) × italic_U ( 1 ) gauge interaction. Depending on the strength of the trilinear interaction and the Higgs self-coupling, the effective mass squared of Higgs can become negative, leading to the tachyonic production of Higgs particles. These produced Higgs particles would then share their energy with gauge bosons, potentially indicating thermalization. Since the model entails different non-perturbative effects, it is necessary to resort to numerical and semi-classical techniques. However, simulations require significant costs in terms of time and computational resources depending on the model used. Particularly, when SU(2)𝑆𝑈2SU(2)italic_S italic_U ( 2 ) gauge interactions are introduced, this becomes evident as the gauge field redistributes particle energies through rescattering processes, leading to an abundance of UV modes that disrupt simulation stability. This necessitates very small lattice spacings, resulting in exceedingly long simulation runtimes. Furthermore, the late-time behavior of preheating dynamics exhibits a universal form by wave kinetic theory. Therefore, we analyze patterns in the flow of particle numbers and predict future behavior using CNN-LSTM (Convolutional Neural Network combined with Long Short-Term Memory) time series analysis. In this way, we can reduce our dependence on simulations by orders of magnitude in terms of time and computational resources.

1 Introduction

Cosmic inflation has been demonstrated to play a pivotal role in establishing the initial conditions of the early universe within modern cosmology [1, 2, 3, 4]. These characteristics can be realized by a real scalar field, known as the inflaton. It allows for the exploration of various intriguing phenomenologies due to the interplay between the inflaton and other fields [5, 6, 7]. However, in any post-inflationary model, we must eventually explain a thermal universe where the constituents of thermal plasma possess distinctive properties such as thermal mass, necessitating a deeper comprehension of thermal field theory. During the initial stages of thermalization, defining thermal quantities becomes challenging, particularly with a low number of plasma constituents [8]. In the initially under-occupied systems, low-momentum particles form a bath, thermalize, and then dominate the system by absorbing the energy from the hard excitations. This process is dominated by number-changing interactions, which can interfere and therefore it is important to take into account modification of the interaction rate so-called the Landau–Pomeranchuk–Migdal (LPM) effect [9, 10, 11]. When the rate of plasma constituent production is sufficiently low, interaction dynamics can be treated perturbatively [12, 13], but when the rate is large, the system falls into the non-perturbative regime, where fields redistribute their energies by turbulent rescattering. This late-time phenomenon is known as free turbulence, where the dynamics are described by kinetic theories [14, 15]. Given the inherent challenges of demonstrating free turbulence and tracking thermalization using lattice simulations, it is tempting to infer patterns in the occupation number of fields, represented as self-similarity implied by the kinetic theories and analyze them using deep learning methods for its proficiency in pattern recognition.

Machine learning and deep learning have revolutionized various fields, including physics, by providing powerful tools to analyze complex systems and phenomena [16, 17, 18]. Machine learning involves algorithms that enable computers to learn from data and make predictions or decisions without being explicitly programmed, while deep learning utilizes neural networks with multiple layers to learn complex patterns and representations from data. Convolutional Neural Networks (CNNs) are particularly effective for processing spatial data, such as images, while Long Short-Term Memory (LSTM) networks specialize in capturing temporal dependencies in sequential data [19].

Therefore, combining Convolutional 1D (Conv1D) layers with LSTM networks offers a powerful solution for analyzing time series data in physics. Conv1D layers effectively capture local patterns in sequential data, while LSTM networks model long-term dependencies and temporal dynamics, enabling accurate analysis of time-dependent datasets across various domains.

In this study, we apply deep learning techniques to the following example model. We consider the post-inflationary dynamics of standard model Higgs particles abundantly produced by the inflaton, where the Higgs subsequently decays into gauge bosons through SU(2)×U(1)𝑆𝑈2𝑈1SU(2)\times U(1)italic_S italic_U ( 2 ) × italic_U ( 1 ) interactions, leading to the formation of a thermal sector. During the inflaton oscillation period, numerical simulations are often necessary since the production of scalar or gauge bosons after inflation involves non-perturbative dynamics (e.g. preheating)[20, 21, 22, 23, 24, 25, 26, 27, 28]. Since lattice simulations designed to analyze the preheating system have limitations in observing late-time dynamics up to reheating, we attempt to overcome these limitations by employing deep learning techniques.

Our paper is structured as follows: In Section 2, we examine the tachyonic production mechanism of Higgs via its trilinear interaction with the inflaton. In Appendix A, we investigate how post-inflationary SU(2)×U(1)𝑆𝑈2𝑈1SU(2)\times U(1)italic_S italic_U ( 2 ) × italic_U ( 1 ) gauge interactions are implemented on the lattice, and the simulation results will be presented in Section 3. We discuss the deep learning technique that we applied to our results in Section 4 and draw conclusions in Section 5. Appendix B provides an introduction to the hyperparameters used in deep learning.

2 Tachyonic production of Higgs

The minimal interactions between the inflaton ϕitalic-ϕ\phiitalic_ϕ and the Higgs H𝐻Hitalic_H are of the form [29]

ΔV=12mϕ2ϕ2+14λϕϕ4+12λϕhϕ2HH+σϕhϕHHmh2HH+λh(HH)2,Δ𝑉12subscriptsuperscript𝑚2italic-ϕsuperscriptitalic-ϕ214subscript𝜆italic-ϕsuperscriptitalic-ϕ412subscript𝜆italic-ϕsuperscriptitalic-ϕ2superscript𝐻𝐻subscript𝜎italic-ϕitalic-ϕsuperscript𝐻𝐻subscriptsuperscript𝑚2superscript𝐻𝐻subscript𝜆superscriptsuperscript𝐻𝐻2\Delta V=\frac{1}{2}{m^{2}_{\phi}}\,\phi^{2}+\frac{1}{4}{\lambda_{\phi}}\,\phi% ^{4}+\frac{1}{2}{\lambda_{\phi h}}\,\phi^{2}\,H^{\dagger}H+{\sigma_{\phi h}}\,% \phi\,H^{\dagger}H-{m^{2}_{h}}\,H^{\dagger}H+{\lambda_{h}}\,(H^{\dagger}H)^{2}\;,roman_Δ italic_V = divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_m start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT italic_ϕ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + divide start_ARG 1 end_ARG start_ARG 4 end_ARG italic_λ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT italic_ϕ start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT + divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_λ start_POSTSUBSCRIPT italic_ϕ italic_h end_POSTSUBSCRIPT italic_ϕ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_H start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_H + italic_σ start_POSTSUBSCRIPT italic_ϕ italic_h end_POSTSUBSCRIPT italic_ϕ italic_H start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_H - italic_m start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT italic_H start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_H + italic_λ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_H start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_H ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , (1)

where we assume λϕ=0,λϕh=0,formulae-sequencesubscript𝜆italic-ϕ0subscript𝜆italic-ϕ0\lambda_{\phi}=0,\,\lambda_{\phi h}=0,\,italic_λ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT = 0 , italic_λ start_POSTSUBSCRIPT italic_ϕ italic_h end_POSTSUBSCRIPT = 0 , and mh=0subscript𝑚0m_{h}=0italic_m start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT = 0 for simplicity.111In the case of the quartic inflaton potential, to elevate the global minimum of the potential from negative values to zero, one should introduce an undesirable offset term [31]. In unitary gauge

H=12(0h),𝐻12matrix0H=\frac{1}{\sqrt{2}}\begin{pmatrix}0\\ h\end{pmatrix}\,,italic_H = divide start_ARG 1 end_ARG start_ARG square-root start_ARG 2 end_ARG end_ARG ( start_ARG start_ROW start_CELL 0 end_CELL end_ROW start_ROW start_CELL italic_h end_CELL end_ROW end_ARG ) , (2)

the potential simplifies to

V=12mϕ2ϕ2+σϕh2ϕh2+λh4h4.𝑉12subscriptsuperscript𝑚2italic-ϕsuperscriptitalic-ϕ2subscript𝜎italic-ϕ2italic-ϕsuperscript2subscript𝜆4superscript4V=\frac{1}{2}{m^{2}_{\phi}}\,\phi^{2}+\frac{\sigma_{\phi h}}{2}\,\phi\,h^{2}+% \frac{\lambda_{h}}{4}\,h^{4}\,.italic_V = divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_m start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT italic_ϕ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + divide start_ARG italic_σ start_POSTSUBSCRIPT italic_ϕ italic_h end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG italic_ϕ italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + divide start_ARG italic_λ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_ARG start_ARG 4 end_ARG italic_h start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT . (3)

By rewriting Eq. (3) in the form

V=12(mϕϕ+σϕh2mϕh2)2+14(λhσϕh22mϕ2)h4,𝑉12superscriptsubscript𝑚italic-ϕitalic-ϕsubscript𝜎italic-ϕ2subscript𝑚italic-ϕsuperscript2214subscript𝜆subscriptsuperscript𝜎2italic-ϕ2subscriptsuperscript𝑚2italic-ϕsuperscript4V=\frac{1}{2}\left({m_{\phi}}\,\phi+\frac{\sigma_{\phi h}}{2m_{\phi}}h^{2}% \right)^{2}+\frac{1}{4}\,\left(\lambda_{h}-\frac{\sigma^{2}_{\phi h}}{2m^{2}_{% \phi}}\right)h^{4}\,,italic_V = divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( italic_m start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT italic_ϕ + divide start_ARG italic_σ start_POSTSUBSCRIPT italic_ϕ italic_h end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_m start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT end_ARG italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + divide start_ARG 1 end_ARG start_ARG 4 end_ARG ( italic_λ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT - divide start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ϕ italic_h end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_m start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT end_ARG ) italic_h start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT , (4)

one finds that λhsubscript𝜆\lambda_{h}italic_λ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT must be larger than σϕh2/2mϕ2subscriptsuperscript𝜎2italic-ϕ2subscriptsuperscript𝑚2italic-ϕ\sigma^{2}_{\phi h}/2m^{2}_{\phi}italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ϕ italic_h end_POSTSUBSCRIPT / 2 italic_m start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT to make the potential bounded from below [30]. The equation of motion of hhitalic_h is

h+V,h=0\Box h+V_{,h}=0□ italic_h + italic_V start_POSTSUBSCRIPT , italic_h end_POSTSUBSCRIPT = 0 (5)

Quantizing hhitalic_h with the eigenmodes hk(t)ei𝕜xsubscript𝑘𝑡superscript𝑒𝑖𝕜𝑥h_{k}(t)e^{i\mathbb{k}\vec{x}}italic_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_t ) italic_e start_POSTSUPERSCRIPT italic_i blackboard_k over→ start_ARG italic_x end_ARG end_POSTSUPERSCRIPT and 𝕜𝕜\mathbb{k}blackboard_k the comoving momentum, one may derive linear mode equations in momentum space222The modes are rescaled to factor out the scale factor, hka3/2hksubscript𝑘superscript𝑎32subscript𝑘h_{k}\rightarrow a^{3/2}h_{k}italic_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT → italic_a start_POSTSUPERSCRIPT 3 / 2 end_POSTSUPERSCRIPT italic_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT.

h¨k+ωk2hk=0,subscript¨𝑘subscriptsuperscript𝜔2𝑘subscript𝑘0\ddot{h}_{k}+\omega^{2}_{k}h_{k}=0,over¨ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_ω start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = 0 , (6)

where

ωk2=k2a2+σϕhϕ(t)+3λhh23a˙2/4a23a¨/2a.subscriptsuperscript𝜔2𝑘superscript𝑘2superscript𝑎2subscript𝜎italic-ϕitalic-ϕ𝑡3subscript𝜆delimited-⟨⟩superscript23superscript˙𝑎24superscript𝑎23¨𝑎2𝑎\omega^{2}_{k}=\frac{k^{2}}{a^{2}}+\sigma_{\phi h}\phi(t)+3\lambda_{h}\langle h% ^{2}\rangle-3\dot{a}^{2}/4a^{2}-3\ddot{a}/2a\,.italic_ω start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = divide start_ARG italic_k start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG + italic_σ start_POSTSUBSCRIPT italic_ϕ italic_h end_POSTSUBSCRIPT italic_ϕ ( italic_t ) + 3 italic_λ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ⟨ italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ⟩ - 3 over˙ start_ARG italic_a end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 4 italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 3 over¨ start_ARG italic_a end_ARG / 2 italic_a . (7)

Given that negative values of the inflaton background can make the entire frequency term negative, we expect a tachyonic growth of the Higgs mode during the inflaton oscillation period. While one can theoretically examine the mathematical structure of the mode equations and quantitatively determine the number of produced particles, practical implementation proves to be challenging for various reasons [31]. Each mode in the equation experiences movement across the instability band with varying Floquet quotient due to the expansion of space. Eq. (6) becomes linearized in terms of k𝑘kitalic_k only when we disregard the rescattering of modes with different k𝑘kitalic_k. This is not the case in the presence of the strong self-interaction and gauge interaction of Higgs, even at the onset of the inflaton oscillation period. To deal with the turbulent particle interactions, we opt to solve the equations of motion directly in spacetime through a semi-classical approach (see [32] for studies on gravitational wave production in a similar framework).

3 Post-inflationary weak interactions

Given the gauge interactions in the standard model, Higgs particles will interact with gauge bosons upon production. Previous studies have explored scenarios where considerable Higgs condensates generate gauge bosons via resonant channels [27, 28]. We examine a system in which field constituents, particularly focusing on the Higgs directly produced by the inflaton, interact with other fields in a turbulent manner.

Refer to caption
Refer to caption
Figure 1: Energy densities of interacting fields during preheating. Left: energy density of fields normalized by mϕ4superscriptsubscript𝑚italic-ϕ4m_{\phi}^{4}italic_m start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT. The x𝑥xitalic_x-axis represents time rescaled by the inflaton mass, mϕsubscript𝑚italic-ϕm_{\phi}italic_m start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT. Right: ratio of the energy density of each field to the total energy over the scale factor.

Using a publicly available code, CosmoLattice [33, 34], we conduct lattice simulations to model post-inflationary weak interactions of the Higgs (see A). We set g2=g1=0.6subscript𝑔2subscript𝑔10.6g_{2}=g_{1}=0.6italic_g start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 0.6 and λh=102subscript𝜆superscript102\lambda_{h}=10^{-2}italic_λ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT = 10 start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT for the coupling constants at the inflationary physics scale to be not too different from those of the standard model. As in [32], the inflaton field starts to roll down to the minimum of potential with the bare mass mϕ=1.6×1013subscript𝑚italic-ϕ1.6superscript1013m_{\phi}=1.6\times 10^{13}italic_m start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT = 1.6 × 10 start_POSTSUPERSCRIPT 13 end_POSTSUPERSCRIPT GeV, ϕinit=1mpsubscriptitalic-ϕ𝑖𝑛𝑖𝑡1subscript𝑚𝑝\phi_{init}=1\,m_{p}italic_ϕ start_POSTSUBSCRIPT italic_i italic_n italic_i italic_t end_POSTSUBSCRIPT = 1 italic_m start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT and ϕ˙init=0.71mϕmpsubscript˙italic-ϕ𝑖𝑛𝑖𝑡0.71subscript𝑚italic-ϕsubscript𝑚𝑝\dot{\phi}_{init}=-0.71\,m_{\phi}m_{p}over˙ start_ARG italic_ϕ end_ARG start_POSTSUBSCRIPT italic_i italic_n italic_i italic_t end_POSTSUBSCRIPT = - 0.71 italic_m start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT italic_m start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT, where the Planck mass mp2.435×1018 GeVsubscript𝑚𝑝2.435superscript1018 GeVm_{p}\approx 2.435\times 10^{18}\text{ GeV}italic_m start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ≈ 2.435 × 10 start_POSTSUPERSCRIPT 18 end_POSTSUPERSCRIPT GeV. Here, init𝑖𝑛𝑖𝑡inititalic_i italic_n italic_i italic_t refers to the initial values input into the lattice simulations. While we do not specify an inflationary model, it is assumed to converge to a quadratic inflaton potential some time after the end of inflation.333Around the end of inflation where ϕ>0italic-ϕ0\phi>0italic_ϕ > 0, the potential of the form Eq. (4) reaches its minimum for h=00h=0italic_h = 0. Parameters may vary within certain limits to accommodate running coupling constants or experimental constraints.

The energy densities of fields are defined

ρϕsubscript𝜌italic-ϕ\displaystyle\rho_{\phi}italic_ρ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT \displaystyle\equiv Kϕ+Gϕ+V(ϕ)subscript𝐾italic-ϕsubscript𝐺italic-ϕ𝑉italic-ϕ\displaystyle K_{\phi}+G_{\phi}+V(\phi)\,italic_K start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT + italic_G start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT + italic_V ( italic_ϕ ) (8)
ρHsubscript𝜌𝐻\displaystyle\rho_{H}italic_ρ start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT \displaystyle\equiv KH+GH+V(H)subscript𝐾𝐻subscript𝐺𝐻𝑉𝐻\displaystyle K_{H}+G_{H}+V(H)\,italic_K start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT + italic_G start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT + italic_V ( italic_H ) (9)
ρBsubscript𝜌𝐵\displaystyle\rho_{B}italic_ρ start_POSTSUBSCRIPT italic_B end_POSTSUBSCRIPT \displaystyle\equiv KU(1)+GU(1)subscript𝐾𝑈1subscript𝐺𝑈1\displaystyle K_{U(1)}+G_{U(1)}\,italic_K start_POSTSUBSCRIPT italic_U ( 1 ) end_POSTSUBSCRIPT + italic_G start_POSTSUBSCRIPT italic_U ( 1 ) end_POSTSUBSCRIPT (10)
ρWsubscript𝜌𝑊\displaystyle\rho_{W}italic_ρ start_POSTSUBSCRIPT italic_W end_POSTSUBSCRIPT \displaystyle\equiv KSU(2)+GSU(2),subscript𝐾𝑆𝑈2subscript𝐺𝑆𝑈2\displaystyle K_{SU(2)}+G_{SU(2)}\,,italic_K start_POSTSUBSCRIPT italic_S italic_U ( 2 ) end_POSTSUBSCRIPT + italic_G start_POSTSUBSCRIPT italic_S italic_U ( 2 ) end_POSTSUBSCRIPT , (11)

where V(ϕ)𝑉italic-ϕV(\phi)italic_V ( italic_ϕ ) and V(H)𝑉𝐻V(H)italic_V ( italic_H ) represent the potential of the inflaton and Higgs doublet, respectively. Fig. 1 shows the energy densities of fields over time for the case where σϕh=109GeVsubscript𝜎italic-ϕsuperscript109GeV\sigma_{\phi h}=10^{9}\,\text{GeV}italic_σ start_POSTSUBSCRIPT italic_ϕ italic_h end_POSTSUBSCRIPT = 10 start_POSTSUPERSCRIPT 9 end_POSTSUPERSCRIPT GeV. The total energy density of the system is still dominated by the energy of the inflaton field, while the Higgs contributes only a small fraction of the total energy. Due to the strong self-interaction of the Higgs (λh=102subscript𝜆superscript102\lambda_{h}=10^{-2}italic_λ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT = 10 start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT), tachyonic production is highly suppressed but its interaction with the inflaton is sufficiently strong to withstand the dilution by the quadratic inflaton potential (ρRa4proportional-tosubscript𝜌𝑅superscript𝑎4\rho_{R}\propto a^{-4}italic_ρ start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ∝ italic_a start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT for radiation while ρϕa3proportional-tosubscript𝜌italic-ϕsuperscript𝑎3\rho_{\phi}\propto a^{-3}italic_ρ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ∝ italic_a start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT for the massive inflaton field).

For a qualitative analysis, we illustrate the energy ratio of the Higgs and the SU(2)𝑆𝑈2SU(2)italic_S italic_U ( 2 ) gauge field to the U(1)𝑈1U(1)italic_U ( 1 ) gauge field on the left of Fig. 2. We anticipate observing the equipartition of energy in the gauge sector as an indication of thermalization, in a ratio of 4:6:2 (or equivalently 2:3:1) corresponding to the degrees of freedom of the Higgs, SU(2)𝑆𝑈2SU(2)italic_S italic_U ( 2 ) gauge fields, and U(1)𝑈1U(1)italic_U ( 1 ) gauge boson, respectively. We have captured the moment of energy equipartition shortly after the end of inflation (a60similar-to𝑎60a\sim 60italic_a ∼ 60 or mϕt700similar-tosubscript𝑚italic-ϕ𝑡700m_{\phi}t\sim 700italic_m start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT italic_t ∼ 700).

Refer to caption
Refer to caption
Figure 2: Indication of a quasi-equilibrium state in the gauge sector. Left: energy ratio of Higgs and SU(2)𝑆𝑈2SU(2)italic_S italic_U ( 2 ) gauge field to U(1)𝑈1U(1)italic_U ( 1 ) gauge field. The dashed grey horizontal line represents 3 which corresponds to the ratio of degrees of freedom for the SU(2)𝑆𝑈2SU(2)italic_S italic_U ( 2 ) gauge field to the one for the U(1)𝑈1U(1)italic_U ( 1 ) gauge field (ρW/ρB=3subscript𝜌𝑊subscript𝜌𝐵3\rho_{W}/\rho_{B}=3italic_ρ start_POSTSUBSCRIPT italic_W end_POSTSUBSCRIPT / italic_ρ start_POSTSUBSCRIPT italic_B end_POSTSUBSCRIPT = 3). Likewise, the red dashed horizontal line represents the same quantity but for the Higgs doublet (ρH/ρB=2subscript𝜌𝐻subscript𝜌𝐵2\rho_{H}/\rho_{B}=2italic_ρ start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT / italic_ρ start_POSTSUBSCRIPT italic_B end_POSTSUBSCRIPT = 2). Right: occupation number for the Higgs doublet (red to blue over time). The comoving momentum k𝑘kitalic_k is normalized by the inflaton mass. The mean particle distribution shifts from IR to UV as time progresses.

We excited initial particle modes for the comoving momentum in the range 5k/mϕ105𝑘subscript𝑚italic-ϕ105\leq k/m_{\phi}\leq 105 ≤ italic_k / italic_m start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ≤ 10 and evolved the system in the extended range 5k/mϕ6905𝑘subscript𝑚italic-ϕless-than-or-similar-to6905\leq k/m_{\phi}\lesssim 6905 ≤ italic_k / italic_m start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ≲ 690. It should be noted that our focus is on the tail end of spectra in the UV, which could potentially underestimate Higgs production in the IR. Expanding the momentum window in the IR direction leads to increased energy exchange due to the increased scattering of particles, necessitating further expansion in the UV direction as well. This results in a significant increase in simulation runtime and it is one of the key motivations for us to leverage deep learning techniques. For the subsequent discussion, we define the occupation number for the Higgs doublet as444Zero initial occupation number for scalar fields in theory corresponds to nk,init=1/2subscript𝑛𝑘𝑖𝑛𝑖𝑡12n_{k,init}=1/2italic_n start_POSTSUBSCRIPT italic_k , italic_i italic_n italic_i italic_t end_POSTSUBSCRIPT = 1 / 2 in our definition. For the Higgs doublet, we take into account 4 Higgs d.o.f and 1/2 for the normalization factor, resulting in nk,initH=1/2×4×1/2=1subscriptsuperscript𝑛𝐻𝑘𝑖𝑛𝑖𝑡124121n^{H}_{k,init}=1/2\times 4\times 1/2=1italic_n start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i italic_n italic_i italic_t end_POSTSUBSCRIPT = 1 / 2 × 4 × 1 / 2 = 1.

nkH=inkiiωki2(|(hki)|2/(ωki)2+|hki|2),subscriptsuperscript𝑛𝐻𝑘subscript𝑖subscriptsuperscript𝑛𝑖𝑘subscript𝑖subscriptsuperscript𝜔𝑖𝑘2superscriptsuperscriptsubscriptsuperscript𝑖𝑘2superscriptsubscriptsuperscript𝜔𝑖𝑘2superscriptsubscriptsuperscript𝑖𝑘2\displaystyle n^{H}_{k}=\sum_{i}n^{i}_{k}\equiv\sum_{i}\frac{\omega^{i}_{k}}{2% }\left(|{(h^{i}_{k})^{\prime}}|^{2}/({\omega^{i}_{k}})^{2}+|h^{i}_{k}|^{2}% \right)\,,italic_n start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_n start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≡ ∑ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT divide start_ARG italic_ω start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG ( | ( italic_h start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / ( italic_ω start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + | italic_h start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) , (12)

where i𝑖iitalic_i represents the Higgs d.o.fformulae-sequence𝑑𝑜𝑓d.o.fitalic_d . italic_o . italic_f and the frequency is

ωkik2+a2|d2V(H)/dhi2|.subscriptsuperscript𝜔𝑖𝑘superscript𝑘2superscript𝑎2superscript𝑑2𝑉𝐻𝑑superscriptsuperscript𝑖2\displaystyle\omega^{i}_{k}\equiv\sqrt{k^{2}+a^{2}\,|d^{2}V(H)/d{h^{i}}^{2}|}\,.italic_ω start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≡ square-root start_ARG italic_k start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | italic_d start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_V ( italic_H ) / italic_d italic_h start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | end_ARG . (13)

The prime denotes the conformal time derivative a(d/dt)𝑎𝑑𝑑𝑡a(d/dt)italic_a ( italic_d / italic_d italic_t ). For gauge bosons, since the occupation number is gauge-dependent, gauge fixing must be performed at each measurement [28, 35, 36, 37]. Due to its technical difficulty, we leave this aspect for future work.

The late-time classical reheating process of self-interacting fields or non-abelian fields exhibits a turbulent and self-similar evolution of distribution functions towards equilibrium [15, 38]. The shape of the spectra, along with the self-similar dynamics, can be comprehended within the framework of wave kinetic theory.555See [39, 40] for possible IR- and UV-cascades in the context of non-thermal fixed points in scalar field theories. This suggests the presence of a consistent trend in the evolution of the occupation number, offering a foundation for exploring the occupation number using deep learning methods.

4 CNN-LSTM time series analysis

The CNN-LSTM model integrates two fundamental components: the Convolutional Neural Network (CNN) and the Long Short-Term Memory (LSTM) network. Each component plays a distinct role in processing the input data to extract features and capture temporal dependencies.

The CNN component is responsible for extracting spatial features from the input data. It operates by applying convolutional filters over the input data to capture spatial patterns, such as edges, textures, and shapes. This process is facilitated by multiple convolutional layers followed by pooling layers, which help reduce spatial dimensions while retaining essential information.

On the other hand, the LSTM component specializes in modeling temporal dependencies in sequential data. It consists of stacked LSTM layers, each equipped with memory cells capable of retaining information over extended time periods. These memory cells are governed by gates that control the flow of information, enabling the model to selectively update, forget, or output information based on the input sequence.

In the parallel CNN-LSTM model, the input data are processed separately by the CNN and LSTM components. Once the CNN and LSTM components have processed their respective data streams, the output features are concatenated and fed into a linear layer (or layers) for further processing. This linear layer serves as the final step in the model, where the extracted features are transformed and mapped to the desired output space.

In our CNN-LSTM time series analysis, the input data have the shape (batch_size, seq_length, input_size), which corresponds to (the number of time points, the length of sequence, the number of k𝑘kitalic_k modes) for the preheating model that we discuss. Both the input and output contain nk(t)subscript𝑛𝑘𝑡n_{k}(t)italic_n start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_t ) of the Higgs, but the model output predicts values occurring after one sequence.666In a time series context, the term ‘time’ typically denotes the sequential order of data points rather than implying that the input variables must directly represent time intervals. We implement the model using PyTorch [41], a popular deep learning library in Python. The model architecture consists of a Conv1D layer followed by an LSTM layer. The Conv1D layer captures local patterns in the sequential data, while the LSTM layer captures long-term dependencies.

The model is trained using the training data, and the training process involves optimizing a loss function (mean absolute error) using the Adam optimizer. The training loop iterates over epochs, updating the model parameters to minimize the loss. Predictions are made for future time steps beyond the training data and compared with the actual test data. The test data are completely separated from the model training and forecasting process.

The free turbulence starts approximately around mϕt400subscript𝑚italic-ϕ𝑡400m_{\phi}t\approx 400italic_m start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT italic_t ≈ 400, so we divided the data into a training set for 400<mϕt<1000400subscript𝑚italic-ϕ𝑡1000400<m_{\phi}t<1000400 < italic_m start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT italic_t < 1000 and a test set for 1000<mϕt<15001000subscript𝑚italic-ϕ𝑡15001000<m_{\phi}t<15001000 < italic_m start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT italic_t < 1500. We make predictions up to mϕt2000subscript𝑚italic-ϕ𝑡2000m_{\phi}t\approx 2000italic_m start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT italic_t ≈ 2000.

Refer to caption
Refer to caption
Figure 3: Occupation number for the Higgs field as a function of t𝑡titalic_t with a logarithmic scale for two k𝑘kitalic_k modes. The gray dashed vertical line distinguishes between training data (orange dotted curve) and time series predictions (green solid curve). The trained predictions (blue solid curve) are obtained by feeding the input data into the model, while the time series predictions are recursively generated by applying the model to the output. Test data (red dotted curve) are used to calculate the validation loss. Left: nk(t)subscript𝑛𝑘𝑡n_{k}(t)italic_n start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_t ) for k5mϕ𝑘5subscript𝑚italic-ϕk\approx 5m_{\phi}italic_k ≈ 5 italic_m start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT. Since it corresponds to the region interacting with the inflaton, the pattern appears to be more complex. Right: nk(t)subscript𝑛𝑘𝑡n_{k}(t)italic_n start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_t ) for k400mϕ𝑘400subscript𝑚italic-ϕk\approx 400m_{\phi}italic_k ≈ 400 italic_m start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT.

The model’s predictions are visualized alongside the actual data as in Fig. 3. It represents the actual data used for training, the trained predictions, the test data not used for training, and the predictions for future time steps for two momentum modes among 15 modes (5k/mϕ4005𝑘subscript𝑚italic-ϕless-than-or-similar-to4005\leq k/m_{\phi}\lesssim 4005 ≤ italic_k / italic_m start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ≲ 400). It is remarkable that our deep learning model managed to learn patterns and trends in the limited amount of data and make predictions similar to actual simulation results.

The predictions for each mode are collected and reshaped in Fig. 4. At the end of our predictions, around mϕt2000subscript𝑚italic-ϕ𝑡2000m_{\phi}t\approx 2000italic_m start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT italic_t ≈ 2000, it appears that we have already reached a point where the system departs from the classical regime and quantum effects become significant, indicating proximity to thermalization. By further developing the deep learning model, we can implement a more realistic cosmological model that correctly incorporates the production of the Higgs, which has been underestimated in the IR regime.

Refer to caption
Refer to caption
Figure 4: Predicted spectra of the Higgs occupation number. Left: Occupation numbers computed for different momentum modes (15 points in total) are collected and reshaped. The darker magenta color indicates a later spectrum in time (500mϕt2000less-than-or-similar-to500subscript𝑚italic-ϕ𝑡less-than-or-similar-to2000500\lesssim m_{\phi}t\lesssim 2000500 ≲ italic_m start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT italic_t ≲ 2000) and the cyan solid (dashed) curve represents a spectrum at mϕt1000subscript𝑚italic-ϕ𝑡1000m_{\phi}t\approx 1000italic_m start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT italic_t ≈ 1000 (mϕt1500subscript𝑚italic-ϕ𝑡1500m_{\phi}t\approx 1500italic_m start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT italic_t ≈ 1500). Right: Self-similar form of the spectra corresponding to the left-hand side of Eq. (14). Only predicted values outside the data range were used to fit p𝑝pitalic_p and q𝑞qitalic_q (1500mϕt2000less-than-or-similar-to1500subscript𝑚italic-ϕ𝑡less-than-or-similar-to20001500\lesssim m_{\phi}t\lesssim 20001500 ≲ italic_m start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT italic_t ≲ 2000).

Still, we may use the predicted values to verify the self-similar evolution. Rewriting Eq. (4) from [15] yields

k~4n(k=k~τp,t)τq=k~4n(k=k~,t=t0),superscript~𝑘4𝑛𝑘~𝑘superscript𝜏𝑝𝑡superscript𝜏𝑞superscript~𝑘4𝑛formulae-sequence𝑘~𝑘𝑡subscript𝑡0\displaystyle\tilde{k}^{4}n(k=\tilde{k}\tau^{p},t)\tau^{q}=\tilde{k}^{4}n(k=% \tilde{k},t=t_{0})~{},over~ start_ARG italic_k end_ARG start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT italic_n ( italic_k = over~ start_ARG italic_k end_ARG italic_τ start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT , italic_t ) italic_τ start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT = over~ start_ARG italic_k end_ARG start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT italic_n ( italic_k = over~ start_ARG italic_k end_ARG , italic_t = italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) , (14)

where k~kτp~𝑘𝑘superscript𝜏𝑝\tilde{k}\equiv k\tau^{-p}over~ start_ARG italic_k end_ARG ≡ italic_k italic_τ start_POSTSUPERSCRIPT - italic_p end_POSTSUPERSCRIPT, τt/t0𝜏𝑡subscript𝑡0\tau\equiv t/t_{0}italic_τ ≡ italic_t / italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, and t0subscript𝑡0t_{0}italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT denotes a fixed point in time. p𝑝pitalic_p and q𝑞qitalic_q represent the fitting exponents, and p𝑝pitalic_p determines how quickly the system approaches equilibrium. Since the right-hand side of Eq. (14) is stationary, it was used as data for fitting, resulting in a good fit when p0.16𝑝0.16p\approx 0.16italic_p ≈ 0.16 and q0.63𝑞0.63q\approx 0.63italic_q ≈ 0.63, as illustrated in Fig. 4. For physical insights on the exponents, see [15, 38]. However, the aim of this work is not to precisely determine the exponents, but to apply deep learning techniques based on the consistent patterns in the distributions, referred to as self-similar evolution or non-thermal fixed points. The exponents may vary depending on the details of the self- or gauge interaction of the Higgs field as well as the fitting conditions. In fact, the Higgs field interacts not only through self- and gauge interactions but also with fermions, and intense fermion reactions are expected during the preheating period [42, 43]. Once the code is updated to include fermions, it will be interesting to directly observe how this affects the self-evolution or non-thermal fixed points in the Higgs distribution.

5 Conclusion

We consider a simple post-inflationary scenario where after inflation, the Standard Model Higgs decays into gauge bosons via weak interactions while forming a quasi-thermal state in the gauge sector. The non-perturbative nature of the system is mainly due to the SU(2)𝑆𝑈2SU(2)italic_S italic_U ( 2 ) gauge interaction (g=0.6𝑔0.6g=0.6italic_g = 0.6) and Higgs self-coupling (λh=102subscript𝜆superscript102\lambda_{h}=10^{-2}italic_λ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT = 10 start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT) along with the trilinear interaction (σϕ���hϕHHsubscript𝜎italic-ϕitalic-ϕsuperscript𝐻𝐻\sigma_{\phi h}\phi H^{\dagger}Hitalic_σ start_POSTSUBSCRIPT italic_ϕ italic_h end_POSTSUBSCRIPT italic_ϕ italic_H start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_H where σϕh=109subscript𝜎italic-ϕsuperscript109\sigma_{\phi h}=10^{9}\,italic_σ start_POSTSUBSCRIPT italic_ϕ italic_h end_POSTSUBSCRIPT = 10 start_POSTSUPERSCRIPT 9 end_POSTSUPERSCRIPTGeV) between Higgs and the inflaton. Analyzing the production and decay of Higgs in this model appears challenging without relying on numerical and semi-classical approaches. Demonstrating thermalization through simulation is also very challenging, primarily because simulations are costly in practical terms and quantum effects become significant in later stages. We introduced deep learning techniques as an effort to overcome the limitations of classical lattice simulations and ultimately reproduce the thermal universe. The later stage of preheating is referred to as the regime of free turbulence, where the flow of particle numbers can be analyzed through kinetic theory, making it a suitable material for applying deep learning techniques. LSTM networks are commonly used in time series analysis, but they have limitations in predicting beyond the trained range. And since the momentum modes of the fields in the free turbulence regime have localized influence rather than behaving independently, local patterns in the momentum space can play an important role. CNNs are useful for capturing localized patterns observed in images and similar data. Therefore, we utilized a deep learning model that combines LSTM with CNN, allowing a fully connected linear layer to learn the outputs of both networks simultaneously. We applied this model to the realistic post-inflationary preheating scenario and predicted the spectra of the Higgs field beyond simulation results. While still in its early stages, this study has demonstrated its potential for advancement.

Acknowledgements. J.Y. would like to thank Oleg Lebedev for useful discussions. This work was performed using HPC resources from the ‘‘Mésocentre’’ computing center of CentraleSupélec, École Normale Supérieure Paris-Saclay and Université Paris-Saclay supported by CNRS and Région Île-de-France (https://mesocentre.universite-paris-scalay.fr/). This project has received support from the European Union’s Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreement No 860881-HIDDeN, and the CNRS-IRP UCMN.

Appendix A SU(2)×U(1)𝑆𝑈2𝑈1SU(2)\times U(1)italic_S italic_U ( 2 ) × italic_U ( 1 ) on the lattice

Let us start with the SU(2)×U(1)𝑆𝑈2𝑈1SU(2)\times U(1)italic_S italic_U ( 2 ) × italic_U ( 1 ) gauge Lagrangian

gauge=14(Wμνa)214Bμν2+(DμH)(DμH)+mh2HHλh(HH)2,subscriptg𝑎𝑢𝑔𝑒14superscriptsubscriptsuperscript𝑊𝑎𝜇𝜈214subscriptsuperscript𝐵2𝜇𝜈superscriptsubscript𝐷𝜇𝐻subscript𝐷𝜇𝐻subscriptsuperscript𝑚2superscript𝐻𝐻subscript𝜆superscriptsuperscript𝐻𝐻2\mathcal{L}_{\textrm{g}auge}=-\frac{1}{4}{(W^{a}_{\mu\nu})^{2}}\,-\frac{1}{4}{% B^{2}_{\mu\nu}}\,+(D_{\mu}H)^{\dagger}(D_{\mu}H)+{m^{2}_{h}}\,H^{\dagger}H-{% \lambda_{h}}\,(H^{\dagger}H)^{2}\;,caligraphic_L start_POSTSUBSCRIPT g italic_a italic_u italic_g italic_e end_POSTSUBSCRIPT = - divide start_ARG 1 end_ARG start_ARG 4 end_ARG ( italic_W start_POSTSUPERSCRIPT italic_a end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_μ italic_ν end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 4 end_ARG italic_B start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_μ italic_ν end_POSTSUBSCRIPT + ( italic_D start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT italic_H ) start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ( italic_D start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT italic_H ) + italic_m start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT italic_H start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_H - italic_λ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_H start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_H ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , (15)

where Bμsubscript𝐵𝜇B_{\mu}italic_B start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT is the U(1)𝑈1U(1)italic_U ( 1 ) gauge boson, with Bμν=μBννBμsubscript𝐵𝜇𝜈subscript𝜇subscript𝐵𝜈subscript𝜈subscript𝐵𝜇B_{\mu\nu}=\partial_{\mu}B_{\nu}-\partial_{\nu}B_{\mu}italic_B start_POSTSUBSCRIPT italic_μ italic_ν end_POSTSUBSCRIPT = ∂ start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT - ∂ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT, and Wμasubscriptsuperscript𝑊𝑎𝜇W^{a}_{\mu}italic_W start_POSTSUPERSCRIPT italic_a end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT are the SU(2)𝑆𝑈2SU(2)italic_S italic_U ( 2 ) gauge bosons, with Wμνasubscriptsuperscript𝑊𝑎𝜇𝜈W^{a}_{\mu\nu}italic_W start_POSTSUPERSCRIPT italic_a end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_μ italic_ν end_POSTSUBSCRIPT their field strengths. The covariant derivative of Higgs doublet H𝐻Hitalic_H is

DμH=μHig2WμaτaH12ig1BμH,subscript𝐷𝜇𝐻subscript𝜇𝐻𝑖subscript𝑔2subscriptsuperscript𝑊𝑎𝜇superscript𝜏𝑎𝐻12𝑖subscript𝑔1subscript𝐵𝜇𝐻D_{\mu}H=\partial_{\mu}H-ig_{2}W^{a}_{\mu}\tau^{a}H-\frac{1}{2}ig_{1}B_{\mu}H,italic_D start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT italic_H = ∂ start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT italic_H - italic_i italic_g start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_W start_POSTSUPERSCRIPT italic_a end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT italic_τ start_POSTSUPERSCRIPT italic_a end_POSTSUPERSCRIPT italic_H - divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_i italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT italic_H , (16)

where g2subscript𝑔2g_{2}italic_g start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT and g1subscript𝑔1g_{1}italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT are the SU(2)𝑆𝑈2SU(2)italic_S italic_U ( 2 ) and U(1)𝑈1U(1)italic_U ( 1 ) couplings, respectively.

To present the equations of motion in an expanding universe, we extract useful expressions from [33]. First, we assume a flat Friedmann-Lemaître-Robertson-Walker (FLRW) metric, characterized by the line element:

ds2=gμνdxμdxν=a(η)2αdη2+a(η)2δijdxidxj,dsuperscript𝑠2subscript𝑔𝜇𝜈dsuperscript𝑥𝜇dsuperscript𝑥𝜈𝑎superscript𝜂2𝛼dsuperscript𝜂2𝑎superscript𝜂2subscript𝛿𝑖𝑗dsuperscript𝑥𝑖dsuperscript𝑥𝑗\text{d}s^{2}=g_{\mu\nu}\text{d}x^{\mu}\text{d}x^{\nu}=-a(\eta)^{2\alpha}\text% {d}\eta^{2}+a(\eta)^{2}\delta_{ij}\text{d}x^{i}\text{d}x^{j}\ ,d italic_s start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = italic_g start_POSTSUBSCRIPT italic_μ italic_ν end_POSTSUBSCRIPT d italic_x start_POSTSUPERSCRIPT italic_μ end_POSTSUPERSCRIPT d italic_x start_POSTSUPERSCRIPT italic_ν end_POSTSUPERSCRIPT = - italic_a ( italic_η ) start_POSTSUPERSCRIPT 2 italic_α end_POSTSUPERSCRIPT d italic_η start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_a ( italic_η ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT d italic_x start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT d italic_x start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT , (17)

Here, a(η)𝑎𝜂a(\eta)italic_a ( italic_η ) represents the scale factor, δijsubscript𝛿𝑖𝑗\delta_{ij}italic_δ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT denotes the Euclidean metric, and α𝛼\alphaitalic_α is a constant parameter chosen conveniently depending on the scenario.

The EOMs in an expanding universe are given by

ϕ′′a2(1α)\vv 2ϕ+(3α)aaϕsuperscriptitalic-ϕ′′superscript𝑎21𝛼\vvsuperscript2italic-ϕ3𝛼superscript𝑎𝑎superscriptitalic-ϕ\displaystyle\phi^{\prime\prime}-a^{-2(1-\alpha)}{\vv\nabla}^{\,2}\phi+(3-% \alpha)\frac{{a^{\prime}}}{a}{\phi^{\prime}}italic_ϕ start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT - italic_a start_POSTSUPERSCRIPT - 2 ( 1 - italic_α ) end_POSTSUPERSCRIPT ∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_ϕ + ( 3 - italic_α ) divide start_ARG italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_ARG start_ARG italic_a end_ARG italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT =\displaystyle== a2αV,ϕ,\displaystyle-a^{2\alpha}V_{,\phi}\ ,- italic_a start_POSTSUPERSCRIPT 2 italic_α end_POSTSUPERSCRIPT italic_V start_POSTSUBSCRIPT , italic_ϕ end_POSTSUBSCRIPT , (18)
H′′a2(1α)\vvD 2H+(3α)aaHsuperscript𝐻′′superscript𝑎21𝛼\vvsuperscript𝐷2𝐻3𝛼superscript𝑎𝑎superscript𝐻\displaystyle H^{\prime\prime}-a^{-2(1-\alpha)}{\vv D}^{\,2}H+(3-\alpha)\frac{% {a^{\prime}}}{a}{H^{\prime}}italic_H start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT - italic_a start_POSTSUPERSCRIPT - 2 ( 1 - italic_α ) end_POSTSUPERSCRIPT italic_D start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_H + ( 3 - italic_α ) divide start_ARG italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_ARG start_ARG italic_a end_ARG italic_H start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT =\displaystyle== a2αV,|H|2H|H|,\displaystyle-\frac{a^{2\alpha}V_{,|H|}}{2}\frac{H}{|H|}\ ,- divide start_ARG italic_a start_POSTSUPERSCRIPT 2 italic_α end_POSTSUPERSCRIPT italic_V start_POSTSUBSCRIPT , | italic_H | end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG divide start_ARG italic_H end_ARG start_ARG | italic_H | end_ARG , (19)
0F0ia2(1α)jFji+(1α)aaF0isubscript0subscript𝐹0𝑖superscript𝑎21𝛼subscript𝑗subscript𝐹𝑗𝑖1𝛼superscript𝑎𝑎subscript𝐹0𝑖\displaystyle\partial_{0}F_{0i}-a^{-2(1-\alpha)}\partial_{j}F_{ji}+(1-\alpha)% \frac{{a^{\prime}}}{a}F_{0i}∂ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 0 italic_i end_POSTSUBSCRIPT - italic_a start_POSTSUPERSCRIPT - 2 ( 1 - italic_α ) end_POSTSUPERSCRIPT ∂ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT italic_j italic_i end_POSTSUBSCRIPT + ( 1 - italic_α ) divide start_ARG italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_ARG start_ARG italic_a end_ARG italic_F start_POSTSUBSCRIPT 0 italic_i end_POSTSUBSCRIPT =\displaystyle== a2αJiB,superscript𝑎2𝛼subscriptsuperscript𝐽𝐵𝑖\displaystyle a^{2\alpha}J^{B}_{i}\ ,italic_a start_POSTSUPERSCRIPT 2 italic_α end_POSTSUPERSCRIPT italic_J start_POSTSUPERSCRIPT italic_B end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , (20)
(𝒟0)ab(G0i)ba2(1α)(𝒟j)ab(Gji)b+(1α)aa(G0i)bsubscriptsubscript𝒟0𝑎𝑏superscriptsubscript𝐺0𝑖𝑏superscript𝑎21𝛼subscriptsubscript𝒟𝑗𝑎𝑏superscriptsubscript𝐺𝑗𝑖𝑏1𝛼superscript𝑎𝑎superscriptsubscript𝐺0𝑖𝑏\displaystyle(\mathcal{D}_{0})_{ab}(G_{0i})^{b}-a^{-2(1-\alpha)}(\mathcal{D}_{% j})_{ab}(G_{ji})^{b}+(1-\alpha)\frac{{a^{\prime}}}{a}(G_{0i})^{b}( caligraphic_D start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT 0 italic_i end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT - italic_a start_POSTSUPERSCRIPT - 2 ( 1 - italic_α ) end_POSTSUPERSCRIPT ( caligraphic_D start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_j italic_i end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT + ( 1 - italic_α ) divide start_ARG italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_ARG start_ARG italic_a end_ARG ( italic_G start_POSTSUBSCRIPT 0 italic_i end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT =\displaystyle== a2α(Ji)a,superscript𝑎2𝛼subscriptsubscript𝐽𝑖𝑎\displaystyle a^{2\alpha}(J_{i})_{a}\ ,italic_a start_POSTSUPERSCRIPT 2 italic_α end_POSTSUPERSCRIPT ( italic_J start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , (21)
iF0isubscript𝑖subscript𝐹0𝑖\displaystyle\partial_{i}F_{0i}∂ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 0 italic_i end_POSTSUBSCRIPT =\displaystyle== a2J0B,superscript𝑎2subscriptsuperscript𝐽𝐵0\displaystyle a^{2}J^{B}_{0}\ ,italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_J start_POSTSUPERSCRIPT italic_B end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , (22)
(𝒟i)ab(G0i)bsubscriptsubscript𝒟𝑖𝑎𝑏superscriptsubscript𝐺0𝑖𝑏\displaystyle(\mathcal{D}_{i})_{ab}(G_{0i})^{b}( caligraphic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT 0 italic_i end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT =\displaystyle== a2(J0)a,superscript𝑎2subscriptsubscript𝐽0𝑎\displaystyle a^{2}(J_{0})_{a}\,,italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_J start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT , (23)

where the prime denotes α𝛼\alphaitalic_α-time derivative d/dη𝑑𝑑𝜂d/d\etaitalic_d / italic_d italic_η and (𝒟νO)a=(𝒟ν)abOb(δabνfabcWνc)Obsubscriptsubscript𝒟𝜈𝑂𝑎subscriptsubscript𝒟𝜈𝑎𝑏subscript𝑂𝑏subscript𝛿𝑎𝑏subscript𝜈subscript𝑓𝑎𝑏𝑐superscriptsubscript𝑊𝜈𝑐subscript𝑂𝑏(\mathcal{D}_{\nu}O)_{a}=(\mathcal{D}_{\nu})_{ab}O_{b}\equiv(\delta_{ab}% \partial_{\nu}-f_{abc}W_{\nu}^{c})O_{b}( caligraphic_D start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT italic_O ) start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT = ( caligraphic_D start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT italic_O start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ≡ ( italic_δ start_POSTSUBSCRIPT italic_a italic_b end_POSTSUBSCRIPT ∂ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT - italic_f start_POSTSUBSCRIPT italic_a italic_b italic_c end_POSTSUBSCRIPT italic_W start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT ) italic_O start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT. The field strength tensors are defined as

Fμνsubscript𝐹𝜇𝜈\displaystyle F_{\mu\nu}italic_F start_POSTSUBSCRIPT italic_μ italic_ν end_POSTSUBSCRIPT \displaystyle\equiv μBννBμ,subscript𝜇subscript𝐵𝜈subscript𝜈subscript𝐵𝜇\displaystyle\partial_{\mu}B_{\nu}-\partial_{\nu}B_{\mu}\ ,∂ start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT - ∂ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT , (24)
Gμνsubscript𝐺𝜇𝜈\displaystyle G_{\mu\nu}italic_G start_POSTSUBSCRIPT italic_μ italic_ν end_POSTSUBSCRIPT \displaystyle\equiv μWννWμig2[Wμ,Wν],subscript𝜇subscript𝑊𝜈subscript𝜈subscript𝑊𝜇𝑖subscript𝑔2subscript𝑊𝜇subscript𝑊𝜈\displaystyle\partial_{\mu}W_{\nu}-\partial_{\nu}W_{\mu}-ig_{2}[W_{\mu},W_{\nu% }]\,,∂ start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT italic_W start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT - ∂ start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT italic_W start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT - italic_i italic_g start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT [ italic_W start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT , italic_W start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT ] , (25)

and the currents are

(JB)μsuperscriptsuperscript𝐽𝐵𝜇\displaystyle\hskip 51.21504pt(J^{B})^{\mu}( italic_J start_POSTSUPERSCRIPT italic_B end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_μ end_POSTSUPERSCRIPT \displaystyle\equiv g1m[H(DμH)],subscript𝑔1𝑚delimited-[]superscript𝐻superscript𝐷𝜇𝐻\displaystyle g_{1}\mathcal{I}m[H^{\dagger}(D^{\mu}H)]\,,italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT caligraphic_I italic_m [ italic_H start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ( italic_D start_POSTSUPERSCRIPT italic_μ end_POSTSUPERSCRIPT italic_H ) ] , (26)
Jaμsuperscriptsubscript𝐽𝑎𝜇\displaystyle\hskip 51.21504ptJ_{a}^{\mu}italic_J start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_μ end_POSTSUPERSCRIPT \displaystyle\equiv 2g2m[HTa(DμH)].2subscript𝑔2𝑚delimited-[]superscript𝐻subscript𝑇𝑎superscript𝐷𝜇𝐻\displaystyle 2g_{2}\mathcal{I}m[H^{{\dagger}}T_{a}(D^{\mu}H)]\,.2 italic_g start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT caligraphic_I italic_m [ italic_H start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ( italic_D start_POSTSUPERSCRIPT italic_μ end_POSTSUPERSCRIPT italic_H ) ] . (27)

Eqs. (22) and (23) are the U(1)𝑈1U(1)italic_U ( 1 ) and SU(2)𝑆𝑈2SU(2)italic_S italic_U ( 2 ) Gauss constraints in an expanding background that can be used to check the consistency of numerical simulations. The evolution of the scale factor obeys the Friedmann equations

(aa)2superscriptsuperscript𝑎𝑎2\displaystyle\left({a^{\prime}\over a}\right)^{2}( divide start_ARG italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_ARG start_ARG italic_a end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT =\displaystyle== a2α3mp2Kϕ+KH+Gϕ+GH+KU(1)+GU(1)+KSU(2)+GSU(2)+V,superscript𝑎2𝛼3superscriptsubscript𝑚𝑝2delimited-⟨⟩subscript𝐾italic-ϕsubscript𝐾𝐻subscript𝐺italic-ϕsubscript𝐺𝐻subscript𝐾𝑈1subscript𝐺𝑈1subscript𝐾𝑆𝑈2subscript𝐺𝑆𝑈2𝑉\displaystyle\frac{a^{2\alpha}}{3m_{p}^{2}}\left\langle{K}_{\phi}+{K}_{H}+{G}_% {\phi}+{G}_{H}+{K}_{U(1)}+{G}_{U(1)}+{K}_{SU(2)}+{G}_{SU(2)}+{V}\right\rangle\,,divide start_ARG italic_a start_POSTSUPERSCRIPT 2 italic_α end_POSTSUPERSCRIPT end_ARG start_ARG 3 italic_m start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ⟨ italic_K start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT + italic_K start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT + italic_G start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT + italic_G start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT + italic_K start_POSTSUBSCRIPT italic_U ( 1 ) end_POSTSUBSCRIPT + italic_G start_POSTSUBSCRIPT italic_U ( 1 ) end_POSTSUBSCRIPT + italic_K start_POSTSUBSCRIPT italic_S italic_U ( 2 ) end_POSTSUBSCRIPT + italic_G start_POSTSUBSCRIPT italic_S italic_U ( 2 ) end_POSTSUBSCRIPT + italic_V ⟩ , (28)
a′′asuperscript𝑎′′𝑎\displaystyle{a^{\prime\prime}\over a}divide start_ARG italic_a start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT end_ARG start_ARG italic_a end_ARG =\displaystyle== a2α3mp2(α2)(Kϕ+KH)+α(Gϕ+GH)+(α+1)V\displaystyle\frac{a^{2\alpha}}{3m_{p}^{2}}\left\langle(\alpha-2)({K}_{\phi}+{% K}_{H})+\alpha({G}_{\phi}+{G}_{H})+(\alpha+1)V\right.divide start_ARG italic_a start_POSTSUPERSCRIPT 2 italic_α end_POSTSUPERSCRIPT end_ARG start_ARG 3 italic_m start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ⟨ ( italic_α - 2 ) ( italic_K start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT + italic_K start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT ) + italic_α ( italic_G start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT + italic_G start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT ) + ( italic_α + 1 ) italic_V
+(α1)(KU(1)+GU(1)+KSU(2)+GSU(2)),\displaystyle\hskip 56.9055pt\left.+~{}(\alpha-1)({K}_{U(1)}+{G}_{U(1)}+{K}_{% SU(2)}+{G}_{SU(2)})\right\rangle\,,+ ( italic_α - 1 ) ( italic_K start_POSTSUBSCRIPT italic_U ( 1 ) end_POSTSUBSCRIPT + italic_G start_POSTSUBSCRIPT italic_U ( 1 ) end_POSTSUBSCRIPT + italic_K start_POSTSUBSCRIPT italic_S italic_U ( 2 ) end_POSTSUBSCRIPT + italic_G start_POSTSUBSCRIPT italic_S italic_U ( 2 ) end_POSTSUBSCRIPT ) ⟩ ,

where delimited-⟨⟩\langle...\rangle⟨ … ⟩ represents values averaged over space. One of the two Friedmann equations is redundant and used to check consistency as Gauss constraints. The kinetic (K𝐾Kitalic_K) and gradient (G𝐺Gitalic_G) energy densities are

Kϕ=12a2αϕ2KH=1a2α(D0H)(D0H)KU(1)=12a2+2αiF0i2KSU(2)=12a2+2αa,i(G0ia)2;Gϕ=12a2i(iϕ)2GH=1a2i(DiH)(DiH)GU(1)=12a4i,j<iFij2GSU(2)=12a4a,i,j<i(Gija)2.subscript𝐾italic-ϕ12superscript𝑎2𝛼superscriptsuperscriptitalic-ϕ2subscript𝐾𝐻1superscript𝑎2𝛼superscriptsubscript𝐷0𝐻subscript𝐷0𝐻subscript𝐾𝑈112superscript𝑎22𝛼subscript𝑖superscriptsubscript𝐹0𝑖2subscript𝐾𝑆𝑈212superscript𝑎22𝛼subscript𝑎𝑖superscriptsuperscriptsubscript𝐺0𝑖𝑎2subscript𝐺italic-ϕ12superscript𝑎2subscript𝑖superscriptsubscript𝑖italic-ϕ2subscript𝐺𝐻1superscript𝑎2subscript𝑖superscriptsubscript𝐷𝑖𝐻subscript𝐷𝑖𝐻subscript𝐺𝑈112superscript𝑎4subscript𝑖𝑗𝑖superscriptsubscript𝐹𝑖𝑗2subscript𝐺𝑆𝑈212superscript𝑎4subscript𝑎𝑖𝑗𝑖superscriptsuperscriptsubscript𝐺𝑖𝑗𝑎2\displaystyle\begin{array}[]{lcl}{K}_{\phi}&=&\frac{1}{2a^{2\alpha}}{\phi^{% \prime}}^{2}\vspace{0.1cm}\vspace{0.1cm}\\ {K}_{H}&=&\frac{1}{a^{2\alpha}}(D_{0}H)^{\dagger}(D_{0}H)\vspace{0.1cm}\\ {K}_{U(1)}&=&\frac{1}{2a^{2+2\alpha}}\sum_{i}F_{0i}^{2}\vspace{0.1cm}\\ {K}_{SU(2)}&=&\frac{1}{2a^{2+2\alpha}}\sum_{a,i}(G_{0i}^{a})^{2}\vspace{0.1cm}% \\ \end{array}\hskip 2.84544pt;\hskip 11.38092pt\begin{array}[]{lcl}{G}_{\phi}&=&% \frac{1}{2a^{2}}\sum_{i}(\partial_{i}\phi)^{2}\vspace{0.1cm}\\ {G}_{H}&=&\frac{1}{a^{2}}\sum_{i}(D_{i}H)^{\dagger}(D_{i}H)\vspace{0.1cm}\\ {G}_{U(1)}&=&\frac{1}{2a^{4}}\sum_{i,j<i}F_{ij}^{2}\vspace{0.1cm}\\ {G}_{SU(2)}&=&\frac{1}{2a^{4}}\sum_{a,i,j<i}(G_{ij}^{a})^{2}\,.\vspace*{0.2cm}% \\ \end{array}start_ARRAY start_ROW start_CELL italic_K start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT end_CELL start_CELL = end_CELL start_CELL divide start_ARG 1 end_ARG start_ARG 2 italic_a start_POSTSUPERSCRIPT 2 italic_α end_POSTSUPERSCRIPT end_ARG italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL italic_K start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT end_CELL start_CELL = end_CELL start_CELL divide start_ARG 1 end_ARG start_ARG italic_a start_POSTSUPERSCRIPT 2 italic_α end_POSTSUPERSCRIPT end_ARG ( italic_D start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_H ) start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ( italic_D start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_H ) end_CELL end_ROW start_ROW start_CELL italic_K start_POSTSUBSCRIPT italic_U ( 1 ) end_POSTSUBSCRIPT end_CELL start_CELL = end_CELL start_CELL divide start_ARG 1 end_ARG start_ARG 2 italic_a start_POSTSUPERSCRIPT 2 + 2 italic_α end_POSTSUPERSCRIPT end_ARG ∑ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 0 italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL italic_K start_POSTSUBSCRIPT italic_S italic_U ( 2 ) end_POSTSUBSCRIPT end_CELL start_CELL = end_CELL start_CELL divide start_ARG 1 end_ARG start_ARG 2 italic_a start_POSTSUPERSCRIPT 2 + 2 italic_α end_POSTSUPERSCRIPT end_ARG ∑ start_POSTSUBSCRIPT italic_a , italic_i end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT 0 italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL end_ROW end_ARRAY ; start_ARRAY start_ROW start_CELL italic_G start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT end_CELL start_CELL = end_CELL start_CELL divide start_ARG 1 end_ARG start_ARG 2 italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ∑ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( ∂ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_ϕ ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL italic_G start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT end_CELL start_CELL = end_CELL start_CELL divide start_ARG 1 end_ARG start_ARG italic_a start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ∑ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_H ) start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ( italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_H ) end_CELL end_ROW start_ROW start_CELL italic_G start_POSTSUBSCRIPT italic_U ( 1 ) end_POSTSUBSCRIPT end_CELL start_CELL = end_CELL start_CELL divide start_ARG 1 end_ARG start_ARG 2 italic_a start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT end_ARG ∑ start_POSTSUBSCRIPT italic_i , italic_j < italic_i end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL italic_G start_POSTSUBSCRIPT italic_S italic_U ( 2 ) end_POSTSUBSCRIPT end_CELL start_CELL = end_CELL start_CELL divide start_ARG 1 end_ARG start_ARG 2 italic_a start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT end_ARG ∑ start_POSTSUBSCRIPT italic_a , italic_i , italic_j < italic_i end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . end_CELL end_ROW end_ARRAY (38)

One can solve the above equations in the temporal gauge with different equation-solving algorithms. We implement our model on the lattice using a publicly available code called CosmoLattice [33, 34].

Program variables used for our physics model are rescaled as follows:

α𝛼\displaystyle\alphaitalic_α =\displaystyle== 0,f=1.61013GeV,ω=1.61013GeV,dτ=aαωdt,formulae-sequence0subscript𝑓1.6superscript1013GeVformulae-sequencesubscript𝜔1.6superscript1013GeV𝑑𝜏superscript𝑎𝛼subscript𝜔𝑑𝑡\displaystyle 0~{}~{},~{}~{}f_{*}=1.6\cdot 10^{13}\,\text{GeV}~{}~{},~{}~{}% \omega_{*}=1.6\cdot 10^{13}\,\text{GeV}~{}~{},~{}~{}d\tau=a^{-\alpha}\omega_{*% }dt~{}~{},0 , italic_f start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT = 1.6 ⋅ 10 start_POSTSUPERSCRIPT 13 end_POSTSUPERSCRIPT GeV , italic_ω start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT = 1.6 ⋅ 10 start_POSTSUPERSCRIPT 13 end_POSTSUPERSCRIPT GeV , italic_d italic_τ = italic_a start_POSTSUPERSCRIPT - italic_α end_POSTSUPERSCRIPT italic_ω start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT italic_d italic_t , (39)
f~~𝑓\displaystyle\tilde{f}over~ start_ARG italic_f end_ARG \displaystyle\equiv f/f,V~V/(f2ω2),𝑓subscript𝑓~𝑉𝑉subscriptsuperscript𝑓2subscriptsuperscript𝜔2\displaystyle f/f_{*}~{}~{},~{}~{}\tilde{V}\equiv V/(f^{2}_{*}\omega^{2}_{*})~% {}~{},italic_f / italic_f start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT , over~ start_ARG italic_V end_ARG ≡ italic_V / ( italic_f start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT italic_ω start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT ) , (40)

where variables with tildes represent dimensionless program variables. Important input parameters used for our model are set as follows

evolver=VV2   ,   N=160   ,   dt=0.005   ,   kIR = 5   ,   kCutOff = 10.0   . (41)

We evolve the field equations using the second kind of Velocity-Verlet algorithm on 160 lattice points per dimension. dt represents the time step in program unit.

Appendix B Hyperparameters

Refer to caption
Figure 5: Train and validation losses versus epoch. Early stopping was performed at the epoch marked by the gray dashed vertical line.

Hyperparameters are parameters whose values are set before the learning process begins. They control aspects of the learning process itself and are not learned from the data. These parameters are external to the model and must be set based on heuristics, prior experience, or trial and error. Common hyperparameters include the learning rate, batch size, number of hidden layers, number of neurons in each layer, dropout rate, regularization strength, and optimizer choice. Proper selection and tuning of hyperparameters are crucial for achieving good performance and generalization of machine learning models. If hyperparameters are not tuned, the results obtained through the model can either underfit or overfit the data. Therefore, it is important to compute and monitor the train loss and validation loss. Typically, the train loss decreases over epochs, so by solely examining the validation loss, we can discern whether the model is underfitting or overfitting. If the validation loss continues to decrease, it indicates underfitting, suggesting room for improvement through further training. However, if the validation loss does not decrease, it implies overfitting, where the model merely memorizes the data without understanding beyond it. Thus, it is advisable to perform early stopping at this intermediate stage, as illustrated in Fig 5.

References

  • [1] A. A. Starobinsky, Phys. Lett. B 91 (1980) 99-102.
  • [2] A. H. Guth, Phys. Rev. D 23 (1981) 347-356.
  • [3] A. D. Linde, Phys. Lett. B 108 (1982) 389-393; Phys. Lett. B 129 (1983), 177-181.
  • [4] V. F. Mukhanov and G. V. Chibisov, JETP Lett. 33, 532-535 (1981).
  • [5] O. Lebedev and J. H. Yoon, Phys. Lett. B 821, 136614 (2021).
  • [6] O. Lebedev, T. Nerdi, T. Solomko and J. H. Yoon, Phys. Rev. D 106, no.4, 043537 (2022).
  • [7] O. Lebedev, T. Solomko and J. H. Yoon, JCAP 02, 035 (2023).
  • [8] A. Kurkela and G. D. Moore, JHEP 12, 044 (2011) doi:10.1007/JHEP12(2011)044 [arXiv:1107.5050 [hep-ph]].
  • [9] L. D. Landau and I. Pomeranchuk, Dokl. Akad. Nauk Ser. Fiz. 92, 535-536 (1953)
  • [10] A. B. Migdal, Phys. Rev. 103, 1811-1820 (1956) doi:10.1103/PhysRev.103.1811
  • [11] M. Gyulassy and X. n. Wang, Nucl. Phys. B 420, 583-614 (1994) doi:10.1016/0550-3213(94)90079-5 [arXiv:nucl-th/9306003 [nucl-th]].
  • [12] K. Harigaya and K. Mukaida, JHEP 05, 006 (2014) doi:10.1007/JHEP05(2014)006 [arXiv:1312.3097 [hep-ph]].
  • [13] K. Mukaida and M. Yamada, [arXiv:2402.14054 [hep-ph]].
  • [14] R. Micha and I. I. Tkachev, Phys. Rev. Lett. 90, 121301 (2003) doi:10.1103/PhysRevLett.90.121301 [arXiv:hep-ph/0210202 [hep-ph]].
  • [15] R. Micha and I. I. Tkachev, Phys. Rev. D 70, 043538 (2004) doi:10.1103/PhysRevD.70.043538 [arXiv:hep-ph/0403101 [hep-ph]].
  • [16] Y. LeCun, Y. Bengio and G. Hinton, Nature 521, 436-444 (2015) doi:10.1038/nature14539
  • [17] J. Schmidhuber, Neural Networks 61, 85-117 (2015) doi:10.1016/j.neunet.2014.09.003
  • [18] T. Plehn, A. Butter, B. Dillon, T. Heimel, C. Krause and R. Winterhalder, [arXiv:2211.01421 [hep-ph]].
  • [19] S. Hochreiter and J. Schmidhuber, Neural Comput. 9, no.8, 1735-1780 (1997) doi:10.1162/neco.1997.9.8.1735
  • [20] L. Kofman, A. D. Linde and A. A. Starobinsky, Phys. Rev. Lett. 73, 3195-3198 (1994) doi:10.1103/PhysRevLett.73.3195 [arXiv:hep-th/9405187 [hep-th]].
  • [21] L. Kofman, A. D. Linde and A. A. Starobinsky, Phys. Rev. D 56, 3258-3295 (1997) doi:10.1103/PhysRevD.56.3258 [arXiv:hep-ph/9704452 [hep-ph]].
  • [22] P. B. Greene, L. Kofman, A. D. Linde and A. A. Starobinsky, Phys. Rev. D 56, 6175-6192 (1997).
  • [23] A. Rajantie, P. M. Saffin and E. J. Copeland, Phys. Rev. D 63, 123512 (2001) doi:10.1103/PhysRevD.63.123512 [arXiv:hep-ph/0012097 [hep-ph]].
  • [24] J. I. Skullerud, J. Smit and A. Tranberg, JHEP 08, 045 (2003) doi:10.1088/1126-6708/2003/08/045 [arXiv:hep-ph/0307094 [hep-ph]].
  • [25] R. Micha and I. I. Tkachev, doi:10.1142/9789812704498_0020 [arXiv:hep-ph/0301249 [hep-ph]].
  • [26] J. Berges, S. Schlichting and D. Sexty, Phys. Rev. D 86, 074006 (2012) doi:10.1103/PhysRevD.86.074006 [arXiv:1203.4646 [hep-ph]].
  • [27] D. G. Figueroa, J. Garcia-Bellido and F. Torrenti, Phys. Rev. D 92, no.8, 083511 (2015) doi:10.1103/PhysRevD.92.083511 [arXiv:1504.04600 [astro-ph.CO]].
  • [28] K. Enqvist, S. Nurmi, S. Rusak and D. Weir, JCAP 02, 057 (2016) doi:10.1088/1475-7516/2016/02/057 [arXiv:1506.06895 [astro-ph.CO]].
  • [29] O. Lebedev, Prog. Part. Nucl. Phys. 120, 103881 (2021).
  • [30] J. F. Dufaux, G. N. Felder, L. Kofman, M. Peloso and D. Podolsky, JCAP 07, 006 (2006) doi:10.1088/1475-7516/2006/07/006 [arXiv:hep-ph/0602144 [hep-ph]].
  • [31] A. A. Abolhasani, H. Firouzjahi and M. M. Sheikh-Jabbari, Phys. Rev. D 81, 043524 (2010) doi:10.1103/PhysRevD.81.043524 [arXiv:0912.1021 [hep-th]].
  • [32] C. Cosme, D. G. Figueroa and N. Loayza, JCAP 05, 023 (2023) doi:10.1088/1475-7516/2023/05/023 [arXiv:2206.14721 [astro-ph.CO]].
  • [33] D. G. Figueroa, A. Florio, F. Torrenti and W. Valkenburg, JCAP 04, 035 (2021).
  • [34] D. G. Figueroa, A. Florio, F. Torrenti and W. Valkenburg, [arXiv:2102.01031 [astro-ph.CO]].
  • [35] L. Giusti, M. L. Paciello, C. Parrinello, S. Petrarca and B. Taglienti, Int. J. Mod. Phys. A 16, 3487-3534 (2001) doi:10.1142/S0217751X01004281 [arXiv:hep-lat/0104012 [hep-lat]].
  • [36] G. S. Bali, V. Bornyakov, M. Muller-Preussker and F. Pahl, Nucl. Phys. B Proc. Suppl. 42, 852-854 (1995) doi:10.1016/0920-5632(95)00401-T [arXiv:hep-lat/9412027 [hep-lat]].
  • [37] M. Schröck and H. Vogt, Comput. Phys. Commun. 184, 1907-1919 (2013) doi:10.1016/j.cpc.2013.03.021 [arXiv:1212.5221 [hep-lat]].
  • [38] J. Berges, K. Boguslavski, S. Schlichting and R. Venugopalan, Phys. Rev. D 89, no.11, 114007 (2014) doi:10.1103/PhysRevD.89.114007 [arXiv:1311.3005 [hep-ph]].
  • [39] J. Berges, A. Rothkopf and J. Schmidt, Phys. Rev. Lett. 101, 041603 (2008) doi:10.1103/PhysRevLett.101.041603 [arXiv:0803.0131 [hep-ph]].
  • [40] J. Berges and D. Sexty, Phys. Rev. Lett. 108, 161601 (2012) doi:10.1103/PhysRevLett.108.161601 [arXiv:1201.0687 [hep-ph]].
  • [41] Adam Paszke et al., CoRR, vol. abs/1912.01703, 2019. [Online]. Available: http://arxiv.org/abs/1912.01703
  • [42] J. Garcia-Bellido, D. G. Figueroa and J. Rubio, Phys. Rev. D 79, 063531 (2009) doi:10.1103/PhysRevD.79.063531 [arXiv:0812.4624 [hep-ph]].
  • [43] J. Berges, D. Gelfand and D. Sexty, Phys. Rev. D 89, no.2, 025001 (2014) doi:10.1103/PhysRevD.89.025001 [arXiv:1308.2180 [hep-ph]].