-
Transformer Circuit Faithfulness Metrics are not Robust
Authors:
Joseph Miller,
Bilal Chughtai,
William Saunders
Abstract:
Mechanistic interpretability work attempts to reverse engineer the learned algorithms present inside neural networks. One focus of this work has been to discover 'circuits' -- subgraphs of the full model that explain behaviour on specific tasks. But how do we measure the performance of such circuits? Prior work has attempted to measure circuit 'faithfulness' -- the degree to which the circuit repl…
▽ More
Mechanistic interpretability work attempts to reverse engineer the learned algorithms present inside neural networks. One focus of this work has been to discover 'circuits' -- subgraphs of the full model that explain behaviour on specific tasks. But how do we measure the performance of such circuits? Prior work has attempted to measure circuit 'faithfulness' -- the degree to which the circuit replicates the performance of the full model. In this work, we survey many considerations for designing experiments that measure circuit faithfulness by ablating portions of the model's computation. Concerningly, we find existing methods are highly sensitive to seemingly insignificant changes in the ablation methodology. We conclude that existing circuit faithfulness scores reflect both the methodological choices of researchers as well as the actual components of the circuit - the task a circuit is required to perform depends on the ablation used to test it. The ultimate goal of mechanistic interpretability work is to understand neural networks, so we emphasize the need for more clarity in the precise claims being made about circuits. We open source a library at https://github.com/UFO-101/auto-circuit that includes highly efficient implementations of a wide range of ablation methodologies and circuit discovery algorithms.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Investigating the Mass of the Black Hole and Possible Wind Outflow of the Accretion Disk in the Tidal Disruption Event AT2021ehb
Authors:
Xin Xiang,
Jon M. Miller,
Abderahmen Zoghbi,
Mark T. Reynolds,
David Bogensberger,
Lixin Dai,
Paul A. Draghis,
Jeremy J. Drake,
Olivier Godet,
Jimmy A. Irwin,
Michael C. Miller,
Brenna E. Mockler,
Richard Saxton,
Natalie Webb
Abstract:
Tidal disruption events (TDEs) can potentially probe low-mass black holes in host galaxies that might not adhere to bulge or stellar-dispersion relationships. At least initially, TDEs can also reveal super-Eddington accretion. X-ray spectroscopy can potentially constrain black hole masses, and reveal ionized outflows associated with super-Eddington accretion. Our analysis of XMM-Newton X-ray obser…
▽ More
Tidal disruption events (TDEs) can potentially probe low-mass black holes in host galaxies that might not adhere to bulge or stellar-dispersion relationships. At least initially, TDEs can also reveal super-Eddington accretion. X-ray spectroscopy can potentially constrain black hole masses, and reveal ionized outflows associated with super-Eddington accretion. Our analysis of XMM-Newton X-ray observations of the TDE AT2021ehb, around 300 days post-disruption, reveals a soft spectrum and can be fit with a combination of multi-color disk blackbody and power-law components. Using two independent disk models with properties suited to TDEs, we estimate a black hole mass at $M \simeq 10^{5.5}~M_{\odot}$, indicating AT2021ehb may expose the elusive low-mass end of the nuclear black hole population. These models offer simple yet robust characterization; more complicated models are not required, but provide important context and caveats in the limit of moderately sensitive data. If disk reflection is included, the disk flux is lower and inferred black hole masses are $\sim$ 0.35 dex higher. Simple wind formulations imply an extremely fast $v_{\mathrm{out}} = -0.2~c$ outflow and obviate a disk continuum component. Assuming a unity filling factor, such a wind implies an instantaneous mass outflow rate of $\dot{M} \simeq 5~M_{\odot}~{\rm yr}^{-1}$. Such a high rate suggests that the filling factor for the Ultra Fast Outflow (UFO) must be extremely low, and/or the UFO phase is ephemeral. We discuss the strengths and limitations of our analysis and avenues for future observations of TDEs.
△ Less
Submitted 5 July, 2024;
originally announced July 2024.
-
A criterion for slope 1 homological stability
Authors:
Mikala Ørsnes Jansen,
Jeremy Miller
Abstract:
We show that for nice enough $\mathbb{N}$-graded $\mathbb{E}_2$-algebras, a diagonal vanishing line in $\mathbb{E}_1$-homology of gives rise to slope $1$ homological stability. This is an integral version of a result by Kupers-Miller-Patzt.
We show that for nice enough $\mathbb{N}$-graded $\mathbb{E}_2$-algebras, a diagonal vanishing line in $\mathbb{E}_1$-homology of gives rise to slope $1$ homological stability. This is an integral version of a result by Kupers-Miller-Patzt.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
Comparison of 4.5PN and 2SF gravitational energy fluxes from quasicircular compact binaries
Authors:
Niels Warburton,
Barry Wardell,
David Trestini,
Quentin Henry,
Adam Pound,
Luc Blanchet,
Leanne Durkan,
Guillaume Faye,
Jeremy Miller
Abstract:
The past three years have seen two significant advances in models of gravitational waveforms emitted by quasicircular compact binaries in two regimes: the weak-field, post-Newtonian regime, in which the gravitational wave energy flux has now been calculated to fourth-and-a-half post-Newtonian order (4.5PN) [Phys. Rev. Lett. 131, 121402 (2023)]; and the small-mass-ratio, gravitational self-force re…
▽ More
The past three years have seen two significant advances in models of gravitational waveforms emitted by quasicircular compact binaries in two regimes: the weak-field, post-Newtonian regime, in which the gravitational wave energy flux has now been calculated to fourth-and-a-half post-Newtonian order (4.5PN) [Phys. Rev. Lett. 131, 121402 (2023)]; and the small-mass-ratio, gravitational self-force regime, in which the flux has now been calculated to second perturbative order in the mass ratio (2SF) [Phys. Rev. Lett. 127, 151102 (2021)]. We compare these results and find excellent agreement for the total flux, showing consistency between the two calculations at all available PN and SF orders. However, although the total fluxes agree, we find disagreements in the fluxes due to individual spherical-harmonic modes of the waveform, strongly suggesting the two waveforms might be in different asymptotic frames.
△ Less
Submitted 29 June, 2024;
originally announced July 2024.
-
Rapid Mid-Infrared Spectral-Timing with JWST. I. The prototypical black hole X-ray Binary GRS 1915+105 during a MIR-bright and X-ray-obscured state
Authors:
P. Gandhi,
E. S. Borowski,
J. Byrom,
R. I. Hynes,
T. J. Maccarone,
A. W. Shaw,
O. K. Adegoke,
D. Altamirano,
M. C. Baglio,
Y. Bhargava,
C. T. Britt,
D. A. H. Buckley,
D. J. K. Buisson,
P. Casella,
N. Castro Segura,
P. A. Charles,
J. M. Corral-Santana,
V. S. Dhillon,
R. Fender,
A. Gúrpide,
C. O. Heinke,
A. B. Igl,
C. Knigge,
S. Markoff,
G. Mastroserio
, et al. (22 additional authors not shown)
Abstract:
We present mid-infrared (MIR) spectral-timing measurements of the prototypical Galactic microquasar GRS 1915+105. The source was observed with the Mid-Infrared Instrument (MIRI) onboard JWST in June 2023 at a MIR luminosity L(MIR)~10^{36} erg/s exceeding past IR levels by about a factor of 10. By contrast, the X-ray flux is much fainter than the historical average, in the source's now-persistent '…
▽ More
We present mid-infrared (MIR) spectral-timing measurements of the prototypical Galactic microquasar GRS 1915+105. The source was observed with the Mid-Infrared Instrument (MIRI) onboard JWST in June 2023 at a MIR luminosity L(MIR)~10^{36} erg/s exceeding past IR levels by about a factor of 10. By contrast, the X-ray flux is much fainter than the historical average, in the source's now-persistent 'obscured' state. The MIRI low-resolution spectrum shows a plethora of emission lines, the strongest of which are consistent with recombination in the hydrogen Pfund (Pf) series and higher. Low amplitude (~1%) but highly significant peak-to-peak photometric variability is found on timescales of ~1,000 s. The brightest Pf(6-5) emission line lags the continuum. Though difficult to constrain accurately, this lag is commensurate with light-travel timescales across the outer accretion disc or with expected recombination timescales inferred from emission line diagnostics. Using the emission line as a bolometric indicator suggests a moderate (~5-30% Eddington) intrinsic accretion rate. Multiwavelength monitoring shows that JWST caught the source close in-time to unprecedentedly bright MIR and radio long-term flaring. Assuming a thermal bremsstrahlung origin for the MIRI continuum suggests an unsustainably high mass-loss rate during this time unless the wind remains bound, though other possible origins cannot be ruled out. PAH features previously detected with Spitzer are now less clear in the MIRI data, arguing for possible destruction of dust in the interim. These results provide a preview of new parameter space for exploring MIR spectral-timing in XRBs and other variable cosmic sources on rapid timescales.
△ Less
Submitted 26 June, 2024;
originally announced June 2024.
-
Elucidating Galaxy Population Properties Using a Model-Free Analysis of Quadruply Imaged Quasar Lenses From Large Surveys
Authors:
John Miller Jr,
Liliya L. R. Williams
Abstract:
The population of strong lensing galaxies is a sub-set of intermediate-redshift massive galaxies, whose population-level properties are not yet well understood. In the near future, thousands of multiply imaged systems are expected to be discovered by wide-field surveys like Rubin Observatory's Legacy Survey of Space and Time (LSST) and Euclid. With the soon-to-be robust population of quadruply len…
▽ More
The population of strong lensing galaxies is a sub-set of intermediate-redshift massive galaxies, whose population-level properties are not yet well understood. In the near future, thousands of multiply imaged systems are expected to be discovered by wide-field surveys like Rubin Observatory's Legacy Survey of Space and Time (LSST) and Euclid. With the soon-to-be robust population of quadruply lensed quasars, or quads, in mind, we introduce a novel technique to elucidate the empirical distribution of the galaxy population properties. Our re-imagining of the prevailing strong lensing analysis does not fit mass models to individual lenses, but instead starts with parametric models of many galaxy populations, which include generally ignored mass distribution complexities and exclude external shear for now. We construct many mock galaxy populations with different properties and obtain populations of quads from each of them. The mock `observed' population of quads is then compared to those from the mocks using a model-free analysis based on a 3D sub-space of directly observable quad image properties. The distance between two quad populations in the space of image properties is measured by a metric $η$, and the distance between their parent galaxy populations in the space of galaxy properties is measured by $ζ$. We find a well defined relation between $η$ and $ζ$. The discovered relation between the space of image properties and the space of galaxy properties allows for the observed galaxy population properties to be estimated from the properties of their quads, which will be conducted in a future paper.
△ Less
Submitted 21 June, 2024;
originally announced June 2024.
-
On Trojans in Refined Language Models
Authors:
Jayaram Raghuram,
George Kesidis,
David J. Miller
Abstract:
A Trojan in a language model can be inserted when the model is refined for a particular application such as determining the sentiment of product reviews. In this paper, we clarify and empirically explore variations of the data-poisoning threat model. We then empirically assess two simple defenses each for a different defense scenario. Finally, we provide a brief survey of related attacks and defen…
▽ More
A Trojan in a language model can be inserted when the model is refined for a particular application such as determining the sentiment of product reviews. In this paper, we clarify and empirically explore variations of the data-poisoning threat model. We then empirically assess two simple defenses each for a different defense scenario. Finally, we provide a brief survey of related attacks and defenses.
△ Less
Submitted 11 June, 2024;
originally announced June 2024.
-
On the Within-perfect Numbers
Authors:
Chung-Hang Kwan,
Steven J. Miller
Abstract:
Motivated by the works of Erdös, Pomerance, Wolke and Harman on the sum-of-divisor function $σ(n)$, we study the distribution of a special class of natural numbers closely related to (multiply) perfect numbers which we term `$(\ell;k)$-within-perfect numbers', where $\ell >1$ is a real number and $k: [1, \infty) \rightarrow (0, \infty)$ is an increasing and unbounded function.
Motivated by the works of Erdös, Pomerance, Wolke and Harman on the sum-of-divisor function $σ(n)$, we study the distribution of a special class of natural numbers closely related to (multiply) perfect numbers which we term `$(\ell;k)$-within-perfect numbers', where $\ell >1$ is a real number and $k: [1, \infty) \rightarrow (0, \infty)$ is an increasing and unbounded function.
△ Less
Submitted 10 June, 2024;
originally announced June 2024.
-
Tidal Disruption Events from Stripped Stars
Authors:
Brenna Mockler,
Monica Gallegos-Garcia,
Ylva Götberg,
Jon Miller,
Enrico Ramirez-Ruiz
Abstract:
Observations of tidal disruption events (TDEs) show signs of Nitrogen enrichment reminiscent of other astrophysical sources such as active galactic nuclei (AGN) and star-forming galaxies. Given that TDEs probe the gas from a single star, it is possible to test if the observed enrichment is consistent with expectations from the CNO cycle by looking at the observed Nitrogen/Carbon (N/C) abundance ra…
▽ More
Observations of tidal disruption events (TDEs) show signs of Nitrogen enrichment reminiscent of other astrophysical sources such as active galactic nuclei (AGN) and star-forming galaxies. Given that TDEs probe the gas from a single star, it is possible to test if the observed enrichment is consistent with expectations from the CNO cycle by looking at the observed Nitrogen/Carbon (N/C) abundance ratios. Given that $\approx 20\%$ of solar mass stars (and an even larger fraction of more massive stars) live in close binaries, it is worthwhile to also consider what TDEs from stars influenced by binary evolution would look like. We show here that TDEs from stars stripped of their Hydrogen-rich (and Nitrogen-poor) envelopes through previous binary-induced mass loss can produce much higher observable N/C enhancements than even TDEs from massive stars. Additionally, we predict that the time-dependence of the N/C abundance ratio in the mass fallback rate of stripped stars will follow the inverse behavior of main-sequence stars, enabling a more accurate characterization of the disrupted star.
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
Fundamental effective temperature measurements for eclipsing binary stars -- V. The circumbinary planet system EBLM J0608-59
Authors:
P. F. L. Maxted,
N. J. Miller,
D. Sebastian,
A. H. M. J. Triaud,
D. V. Martin,
A. Duck
Abstract:
EBLM J0608-59 / TOI-1338 / BEBOP-1 is a 12th-magnitude, F9V star in an eclipsing binary with a much fainter M-dwarf companion on a wide, eccentric orbit (P=14.6 d). The binary is orbited by two circumbinary planets: one transiting on a 95-day orbit and one non-transiting on a 215-day orbit. We have used high-precision photometry from the TESS mission combined with direct mass measurements for the…
▽ More
EBLM J0608-59 / TOI-1338 / BEBOP-1 is a 12th-magnitude, F9V star in an eclipsing binary with a much fainter M-dwarf companion on a wide, eccentric orbit (P=14.6 d). The binary is orbited by two circumbinary planets: one transiting on a 95-day orbit and one non-transiting on a 215-day orbit. We have used high-precision photometry from the TESS mission combined with direct mass measurements for the two stars published recently to measure the following model-independent radii: $R_1 = 1.32 \pm 0.02 R_{\odot}$, $R_2 = 0.309 \pm 0.004 R_{\odot}$. Using $R_1$ and the parallax from Gaia EDR3 we find that this star's angular diameter is $θ= 0.0309 \pm 0.0005$ mas. The apparent bolometric flux of the primary star corrected for both extinction and the contribution from the M-dwarf ($<0.4$%) is ${\mathcal F}_{\oplus,0} = (0.417\pm 0.005)\times10^{-9} {\rm \,erg\,cm}^{-2} {\rm \,s}^{-1}$. Hence, this F9V star has an effective temperature $T_{\rm eff,1} = 6031{\rm\,K} \pm 46{\rm \,K\,(rnd.)} \pm 10 {\rm \,K\,(sys.)}$. EBLM J0608-59 is an ideal benchmark star that can be added to the sample of such systems we are establishing for "end-to-end" tests of the stellar parameters measured by large-scale spectroscopic surveys.
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
CHEOPS in-flight performance: A comprehensive look at the first 3.5 years of operations
Authors:
A. Fortier,
A. E. Simon,
C. Broeg,
G. Olofsson,
A. Deline,
T. G. Wilson,
P. F. L. Maxted,
A. Brandeker,
A. Collier Cameron,
M. Beck,
A. Bekkelien,
N. Billot,
A. Bonfanti,
G. Bruno,
J. Cabrera,
L. Delrez,
B. -O. Demory,
D. Futyan,
H. -G. Florén,
M. N. Günther,
A. Heitzmann,
S. Hoyer,
K. G. Isaak,
S. G. Sousa,
M. Stalport
, et al. (106 additional authors not shown)
Abstract:
CHEOPS is a space telescope specifically designed to monitor transiting exoplanets orbiting bright stars. In September 2023, CHEOPS completed its nominal mission and remains in excellent operational conditions. The mission has been extended until the end of 2026. Scientific and instrumental data have been collected throughout in-orbit commissioning and nominal operations, enabling a comprehensive…
▽ More
CHEOPS is a space telescope specifically designed to monitor transiting exoplanets orbiting bright stars. In September 2023, CHEOPS completed its nominal mission and remains in excellent operational conditions. The mission has been extended until the end of 2026. Scientific and instrumental data have been collected throughout in-orbit commissioning and nominal operations, enabling a comprehensive analysis of the mission's performance. In this article, we present the results of this analysis with a twofold goal. First, we aim to inform the scientific community about the present status of the mission and what can be expected as the instrument ages. Secondly, we intend for this publication to serve as a legacy document for future missions, providing insights and lessons learned from the successful operation of CHEOPS. To evaluate the instrument performance in flight, we developed a comprehensive monitoring and characterisation programme. It consists of dedicated observations that allow us to characterise the instrument's response. In addition to the standard collection of nominal science and housekeeping data, these observations provide input for detecting, modelling, and correcting instrument systematics, discovering and addressing anomalies, and comparing the instrument's actual performance with expectations. The precision of the CHEOPS measurements has enabled the mission objectives to be met and exceeded. Careful modelling of the instrumental systematics allows the data quality to be significantly improved during the light curve analysis phase, resulting in more precise scientific measurements. CHEOPS is compliant with the driving scientific requirements of the mission. Although visible, the ageing of the instrument has not affected the mission's performance.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
Escape Velocity Mass of Abell S1063
Authors:
Alexander Rodriguez,
Christopher J. Miller,
Vitali Halenka,
Anthony Kremin
Abstract:
We measure the radius-velocity phase-space edge profile for Abell S1063 using galaxy redshifts from arXiv:1409.3507 and arXiv:2109.03305. Combined with a cosmological model and after accounting for interlopers and sampling effects, we infer the escape velocity profile. Using the Poisson equation, we then directly constrain the gravitational potential profile and find excellent agreement between th…
▽ More
We measure the radius-velocity phase-space edge profile for Abell S1063 using galaxy redshifts from arXiv:1409.3507 and arXiv:2109.03305. Combined with a cosmological model and after accounting for interlopers and sampling effects, we infer the escape velocity profile. Using the Poisson equation, we then directly constrain the gravitational potential profile and find excellent agreement between three different density models. For the NFW profile, we find log$_{10}$(M$_{200},{\rm crit}$)= $15.40^{+0.06}_{-0.12}$M$_{\odot}$, consistent to within $1σ$ of six recently published lensing masses. We argue that this consistency is due to the fact that the escape technique shares no common systematics with lensing other than radial binning. These masses are 2-4$σ$ lower than estimates using X-ray data, in addition to earlier velocity dispersion estimates. We measure the 1D velocity dispersion within r$_{200}$ to be $σ_{v} = 1477^{+87}_{-99}$ km/s, which combined with our escape velocity mass, brings the dispersion for AS1063 in-line with hydrodynamic cosmological simulations for the first time.
△ Less
Submitted 7 June, 2024; v1 submitted 3 June, 2024;
originally announced June 2024.
-
Upper Bounds for the Lowest First Zero in Families of Cuspidal Newforms
Authors:
Xueyiming Tang,
Steven J. Miller
Abstract:
Assuming the Generalized Riemann Hypothesis, the non-trivial zeros of $L$-functions lie on the critical line with the real part $1/2$. We find an upper bound of the lowest first zero in families of even cuspidal newforms of prime level tending to infinity. We obtain explicit bounds using the $n$-level densities and results towards the Katz-Sarnak density conjecture. We prove that as the level tend…
▽ More
Assuming the Generalized Riemann Hypothesis, the non-trivial zeros of $L$-functions lie on the critical line with the real part $1/2$. We find an upper bound of the lowest first zero in families of even cuspidal newforms of prime level tending to infinity. We obtain explicit bounds using the $n$-level densities and results towards the Katz-Sarnak density conjecture. We prove that as the level tends to infinity, there is at least one form with a normalized zero within $1/4$ of the average spacing. We also obtain the first-ever bounds on the percentage of forms in these families with a fixed number of zeros within a small distance near the central point.
△ Less
Submitted 18 May, 2024;
originally announced May 2024.
-
Partial bases and homological stability of $\operatorname{GL}_{n}(R)$ revisited
Authors:
Calista Bernard,
Jeremy Miller,
Robin J. Sroka
Abstract:
Let $R$ be a unital ring satisfying the invariant basis number property, that every stably free $R$-module is free, and that the complex of partial bases of every finite rank free module is Cohen--Macaulay. This class of rings includes every ring of stable rank $1$ (e.g. any local, semi-local or Artinian ring), every Euclidean domain, and every Dedekind domain $\mathcal{O}_S$ of arithmetic type wh…
▽ More
Let $R$ be a unital ring satisfying the invariant basis number property, that every stably free $R$-module is free, and that the complex of partial bases of every finite rank free module is Cohen--Macaulay. This class of rings includes every ring of stable rank $1$ (e.g. any local, semi-local or Artinian ring), every Euclidean domain, and every Dedekind domain $\mathcal{O}_S$ of arithmetic type where $|S| > 1$ and $S$ contains at least one non-complex place. Extending recent work of Galatius--Kupers--Randal-Williams and Kupers--Miller--Patzt, we prove that the sequence of general linear groups $\operatorname{GL}_n(R)$ satisfies slope-$1$ homological stability with $\mathbb{Z}[1/2]$-coefficients.
△ Less
Submitted 16 May, 2024;
originally announced May 2024.
-
Human-interpretable clustering of short-text using large language models
Authors:
Justin K. Miller,
Tristram J. Alexander
Abstract:
Large language models have seen extraordinary growth in popularity due to their human-like content generation capabilities. We show that these models can also be used to successfully cluster human-generated content, with success defined through the measures of distinctiveness and interpretability. This success is validated by both human reviewers and ChatGPT, providing an automated means to close…
▽ More
Large language models have seen extraordinary growth in popularity due to their human-like content generation capabilities. We show that these models can also be used to successfully cluster human-generated content, with success defined through the measures of distinctiveness and interpretability. This success is validated by both human reviewers and ChatGPT, providing an automated means to close the 'validation gap' that has challenged short-text clustering. Comparing the machine and human approaches we identify the biases inherent in each, and question the reliance on human-coding as the 'gold standard'. We apply our methodology to Twitter bios and find characteristic ways humans describe themselves, agreeing well with prior specialist work, but with interesting differences characteristic of the medium used to express identity.
△ Less
Submitted 12 May, 2024;
originally announced May 2024.
-
$v$-Palindromes: An Analogy to the Palindromes
Authors:
Chris Bispels,
Muhammet Boran,
Steven J. Miller,
Eliel Sosis,
Daniel Tsai
Abstract:
Around the year 2007, one of the authors, Tsai, accidentally discovered a property of the number $198$ he saw on the license plate of a car. Namely, if we take $198$ and its reversal $891$, which have prime factorizations $198 = 2\cdot 3^2\cdot 11$ and $891 = 3^4\cdot 11$ respectively, and sum the numbers appearing in each factorization getting $2+3+2+11 = 18$ and $3+4+11 = 18$, both sums are…
▽ More
Around the year 2007, one of the authors, Tsai, accidentally discovered a property of the number $198$ he saw on the license plate of a car. Namely, if we take $198$ and its reversal $891$, which have prime factorizations $198 = 2\cdot 3^2\cdot 11$ and $891 = 3^4\cdot 11$ respectively, and sum the numbers appearing in each factorization getting $2+3+2+11 = 18$ and $3+4+11 = 18$, both sums are $18$. Such numbers were later named $v$-palindromes because they can be viewed as an analogy to the usual palindromes. In this article, we introduce the concept of a $v$-palindrome in base $b$ and prove their existence for infinitely many bases. We also exhibit infinite families of $v$-palindromes in bases $p+1$ and $p^2+1$, for each odd prime $p$. Finally, we collect some conjectures and problems involving $v$-palindromes.
△ Less
Submitted 24 April, 2024;
originally announced May 2024.
-
Inhomogeneous wave kinetic equation and its hierarchy in polynomially weighted $L^\infty$ spaces
Authors:
Ioakeim Ampatzoglou,
Joseph K. Miller,
Nataša Pavlović,
Maja Tasković
Abstract:
Inspired by ideas stemming from the analysis of the Boltzmann equation, in this paper we expand well-posedness theory of the spatially inhomogeneous 4-wave kinetic equation, and also analyze an infinite hierarchy of PDE associated with this nonlinear equation. More precisely, we show global in time well-posedness of the spatially inhomogeneous 4-wave kinetic equation for polynomially decaying init…
▽ More
Inspired by ideas stemming from the analysis of the Boltzmann equation, in this paper we expand well-posedness theory of the spatially inhomogeneous 4-wave kinetic equation, and also analyze an infinite hierarchy of PDE associated with this nonlinear equation. More precisely, we show global in time well-posedness of the spatially inhomogeneous 4-wave kinetic equation for polynomially decaying initial data. For the associated infinite hierarchy, we construct global in time solutions using the solutions of the wave kinetic equation and the Hewitt-Savage theorem. Uniqueness of these solutions is proved by using a combinatorial board game argument tailored to this context, which allows us to control the factorial growth of the Dyson series.
△ Less
Submitted 6 May, 2024;
originally announced May 2024.
-
The influence of dark excitons on the electroabsorption spectrum of polyacetylene
Authors:
Jaspal Singh Bola,
Ryan M. Stolley,
Prashanna Poudel,
Joel S. Miller,
Christoph Boheme,
Z. Valy Vardeny
Abstract:
This study revisits the electroabsorption (EA) spectrum of polyacetylene, as functions of the electric field strength, isomerization degree, and light polarization states. The EA spectrum of $cis$-$(CH)_x$ reveals an oscillatory feature that follows the Stark shift-related first derivative of the materials absorption spectrum that contains v(0-1) and v(0-2) sidebands of the excited $C=C$ stretchin…
▽ More
This study revisits the electroabsorption (EA) spectrum of polyacetylene, as functions of the electric field strength, isomerization degree, and light polarization states. The EA spectrum of $cis$-$(CH)_x$ reveals an oscillatory feature that follows the Stark shift-related first derivative of the materials absorption spectrum that contains v(0-1) and v(0-2) sidebands of the excited $C=C$ stretching vibration that agrees well with the Raman spectrum. EA spectrum of $trans $-$(CH)_x$ does not match the first derivative of the materials absorption spectrum, and the phonon sideband frequency does not agree with the RS spectrum. EA spectrum of $trans $-$(CH)_x$ reveals a band below the first allowed $1B_u$ exciton. We interpret this feature as due to the electric field activated even-parity dark (forbidden) exciton, namely $mA_g$ ($m >1$), showing that the nonluminescent $trans $-$(CH)_x$ is due to the reverse order of the excited states, where a dark $mA_g$ exciton lies below the allowed $1B_u$ exciton. This agrees with the unusual phonon sideband in $trans $-$(CH)_x$ absorption, since the excited state attenuation caused by the fast internal conversion from $1B_u$ to $mA_g$ influences the apparent frequency that determines the phonon sideband. Consequently, from the EA and RS spectra we estimate the $1B_u$ lifetime in $trans $-$(CH)_x$ to be $\sim 30$ fs. Integrated EA spectrum of $trans $-$(CH)_x$ shows a traditional Huang-Rhys type series with a relaxation parameter, $S \sim 0.5$. This indicates that the EA spectrum of the $trans $ isomer is also determined by a Stark shift related to the first derivative of the absorption spectrum, but preferentially for the longest chains in the films chain lengths distribution. This is due to the $N^3$ response of the non-linear susceptibility, $χ^{(3)}$ ($\sim$EA), dependence on the chain length having $N$ monomers.
△ Less
Submitted 5 May, 2024;
originally announced May 2024.
-
Some Winnability Results for the Neighborhood and Group Labeling Lights Out Games
Authors:
Brittany Doherty,
Christian J. Miller,
Darren B. Parker
Abstract:
We look at both the \emph{group labeling lights out game} and the \emph{neighborhood lights out game}. Our main focus is to determine necessary and sufficient conditions for when the group labeling lights out game on path graphs, cycle graphs, and complete bipartite graphs can be won for every possible initial labeling. In the process of solving this problem, we demonstrate a new proof for when th…
▽ More
We look at both the \emph{group labeling lights out game} and the \emph{neighborhood lights out game}. Our main focus is to determine necessary and sufficient conditions for when the group labeling lights out game on path graphs, cycle graphs, and complete bipartite graphs can be won for every possible initial labeling. In the process of solving this problem, we demonstrate a new proof for when the neighborhood lights out game on complete bipartite graphs can be won for every possible initial labeling.
△ Less
Submitted 3 May, 2024;
originally announced May 2024.
-
The time evolution of fast flavor crossings in post-merger disks around a black hole remnant
Authors:
Payel Mukhopadhyay,
Jonah Miller,
Gail C. McLaughlin
Abstract:
We postprocess a three-dimensional general relativistic, full transport neutrino radiation magnetohydrodynamics simulation of the black hole--accretion disk--wind system thought to be a potential outcome of the GW170817 merger to investigate the presence of electron lepton number (ELN-XLN) crossings in the neutrino angular distribution. Neutrinos are evolved with an explicit Monte Carlo method and…
▽ More
We postprocess a three-dimensional general relativistic, full transport neutrino radiation magnetohydrodynamics simulation of the black hole--accretion disk--wind system thought to be a potential outcome of the GW170817 merger to investigate the presence of electron lepton number (ELN-XLN) crossings in the neutrino angular distribution. Neutrinos are evolved with an explicit Monte Carlo method and can interact with matter via emission, absorption, or scattering. Within the postprocessing framework, we find ubiquitous occurrence of ELN-XLN crossings at early times ($\sim$ 11ms) but this does not hold for later times in the simulation. At postmerger times of $ \sim$ 60 ms and beyond, ELN-XLN crossings are only present near the equator. We provide a detailed analysis of the neutrino radiation field to investigate the origin and time evolution of these crossings. Previous reports have suggested ubiquitous flavor crossings persisting throughout the simulation lifetime, albeit for different sets of conditions for the merger remnant, the treatment of hydrodynamics and neutrino transport. Even though we do not perform a direct comparison with other published works, we qualitatively assess the reasons for the difference with our results. The geometric structure and evolution of the ELN-XLN crossings found in our analysis, and by extension, fast flavor instabilities have important implications for heavy element nucleosynthesis in neutron star mergers.
△ Less
Submitted 27 April, 2024;
originally announced April 2024.
-
Hopf algebras, Steinberg modules, and the unstable cohomology of $SL_n(\mathbb Z)$ and $GL_n(\mathbb Z)$
Authors:
Avner Ash,
Jeremy Miller,
Peter Patzt
Abstract:
We prove that the direct sum of all homology groups of the integral general linear groups with Steinberg module coefficients form a commutative Hopf algebra, in particular a free graded commutative algebra. We use this to construct new infinite families of unstable cohomology classes of $SL_n(\mathbb Z)$.
We prove that the direct sum of all homology groups of the integral general linear groups with Steinberg module coefficients form a commutative Hopf algebra, in particular a free graded commutative algebra. We use this to construct new infinite families of unstable cohomology classes of $SL_n(\mathbb Z)$.
△ Less
Submitted 21 April, 2024;
originally announced April 2024.
-
Compressive Bayesian non-negative matrix factorization for mutational signatures analysis
Authors:
Alessandro Zito,
Jeffrey W. Miller
Abstract:
Non-negative matrix factorization (NMF) is widely used in many applications for dimensionality reduction. Inferring an appropriate number of factors for NMF is a challenging problem, and several approaches based on information criteria or sparsity-inducing priors have been proposed. However, inference in these models is often complicated and computationally challenging. In this paper, we introduce…
▽ More
Non-negative matrix factorization (NMF) is widely used in many applications for dimensionality reduction. Inferring an appropriate number of factors for NMF is a challenging problem, and several approaches based on information criteria or sparsity-inducing priors have been proposed. However, inference in these models is often complicated and computationally challenging. In this paper, we introduce a novel methodology for overfitted Bayesian NMF models using "compressive hyperpriors" that force unneeded factors down to negligible values while only imposing mild shrinkage on needed factors. The method is based on using simple semi-conjugate priors to facilitate inference, while setting the strength of the hyperprior in a data-dependent way to achieve this compressive property. We apply our method to mutational signatures analysis in cancer genomics, where we find that it outperforms state-of-the-art alternatives. In particular, we illustrate how our compressive hyperprior enables the use of biologically informed priors on the signatures, yielding significantly improved accuracy. We provide theoretical results establishing the compressive property, and we demonstrate the method in simulations and on real data from a breast cancer application.
△ Less
Submitted 16 April, 2024;
originally announced April 2024.
-
Curvature of Gaussian quantum states
Authors:
Harry J. D. Miller
Abstract:
The space of quantum states can be endowed with a metric structure using the second order derivatives of the relative entropy, giving rise to the so-called Kubo-Mori-Bogoliubov inner product. We explore its geometric properties on the submanifold of faithful, zero-displacement Gaussian states parameterised by their covariance matrices, deriving expressions for the geodesic equations, curvature ten…
▽ More
The space of quantum states can be endowed with a metric structure using the second order derivatives of the relative entropy, giving rise to the so-called Kubo-Mori-Bogoliubov inner product. We explore its geometric properties on the submanifold of faithful, zero-displacement Gaussian states parameterised by their covariance matrices, deriving expressions for the geodesic equations, curvature tensors and scalar curvature. Our analysis suggests that the curvature of the manifold is strictly monotonic with respect to the von Neumann entropy, and thus can be interpreted as a measure of state uncertainty. This provides supporting evidence for the Petz conjecture in continuous variable systems.
△ Less
Submitted 15 April, 2024;
originally announced April 2024.
-
Algebraic Proofs of Path Disconnectedness using Time-Dependent Barrier Functions
Authors:
Didier Henrion,
Jared Miller,
Mohab Safey El Din
Abstract:
Two subsets of a given set are path-disconnected if they lie in different connected components of the larger set. Verification of path-disconnectedness is essential in proving the infeasibility of motion planning and trajectory optimization algorithms. We formulate path-disconnectedness as the infeasibility of a single-integrator control task to move between an initial set and a target set in a su…
▽ More
Two subsets of a given set are path-disconnected if they lie in different connected components of the larger set. Verification of path-disconnectedness is essential in proving the infeasibility of motion planning and trajectory optimization algorithms. We formulate path-disconnectedness as the infeasibility of a single-integrator control task to move between an initial set and a target set in a sufficiently long time horizon. This control-infeasibility task is certified through the generation of a time-dependent barrier function that separates the initial and final sets. The existence of a time-dependent barrier function is a necessary and sufficient condition for path-disconnectedness under compactness conditions. Numerically, the search for a polynomial barrier function is formulated using the moment-sum-of-squares hierarchy of semidefinite programs. The barrier function proves path-disconnectedness at a sufficiently large polynomial degree. The computational complexity of these semidefinite programs can be reduced by elimination of the control variables. Disconnectedness proofs are synthesized for example systems.
△ Less
Submitted 10 April, 2024;
originally announced April 2024.
-
Peak Time-Windowed Risk Estimation of Stochastic Processes
Authors:
Jared Miller,
Niklas Schmid,
Matteo Tacchi,
Didier Henrion,
Roy S. Smith
Abstract:
This paper develops a method to upper-bound extreme-values of time-windowed risks for stochastic processes. Examples of such risks include the maximum average or 90% quantile of the current along a transmission line in any 5-minute window. This work casts the time-windowed risk analysis problem as an infinite-dimensional linear program in occupation measures. In particular, we employ the coherent…
▽ More
This paper develops a method to upper-bound extreme-values of time-windowed risks for stochastic processes. Examples of such risks include the maximum average or 90% quantile of the current along a transmission line in any 5-minute window. This work casts the time-windowed risk analysis problem as an infinite-dimensional linear program in occupation measures. In particular, we employ the coherent risk measures of the mean and the expected shortfall (conditional value at risk) to define the maximal time-windowed risk along trajectories. The infinite-dimensional linear program must then be truncated into finite-dimensional optimization problems, such as by using the moment-sum of squares hierarchy of semidefinite programs. The infinite-dimensional linear program will have the same optimal value as the original nonconvex risk estimation task under compactness and regularity assumptions, and the sequence of semidefinite programs will converge to the true value under additional properties of algebraic characterization. The scheme is demonstrated for risk analysis of example stochastic processes.
△ Less
Submitted 11 April, 2024; v1 submitted 10 April, 2024;
originally announced April 2024.
-
Nuclear uncertainties associated with the ejecta of a neutron-star black-hole accretion disk
Authors:
M. R. Mumpower,
T. M. Sprouse,
J. M. Miller,
K. A. Lund,
J. Cabrera Garcia,
N. Vassh,
G. C. McLaughlin,
R. Surman
Abstract:
The simulation of heavy element nucleosynthesis requires input from yet-to-be-measured nuclear properties. The uncertainty in the values of these off-stability nuclear properties propagates to uncertainties in the predictions of elemental and isotopic abundances. However, for any given astrophysical explosion, there are many different trajectories, i.e. temperature and density histories, experienc…
▽ More
The simulation of heavy element nucleosynthesis requires input from yet-to-be-measured nuclear properties. The uncertainty in the values of these off-stability nuclear properties propagates to uncertainties in the predictions of elemental and isotopic abundances. However, for any given astrophysical explosion, there are many different trajectories, i.e. temperature and density histories, experienced by outflowing material and thus different nuclear properties can come into play. We consider combined nucleosynthesis results from 460,000 trajectories from a neutron star-black hole accretion disk and the find spread in elemental predictions due solely to unknown nuclear properties to be a factor of a few. We analyze this relative spread in model predictions due to nuclear variations and conclude that the uncertainties can be attributed to a combination of properties in a given region of the abundance pattern. We calculate a cross-correlation between mass changes and abundance changes to show how variations among the properties of participating nuclei may be explored. Our results provide further impetus for measurements of multiple quantities on individual short-lived neutron-rich isotopes at modern experimental facilities.
△ Less
Submitted 3 April, 2024;
originally announced April 2024.
-
Integrated path stability selection
Authors:
Omar Melikechi,
Jeffrey W. Miller
Abstract:
Stability selection is a widely used method for improving the performance of feature selection algorithms. However, stability selection has been found to be highly conservative, resulting in low sensitivity. Further, the theoretical bound on the expected number of false positives, E(FP), is relatively loose, making it difficult to know how many false positives to expect in practice. In this paper,…
▽ More
Stability selection is a widely used method for improving the performance of feature selection algorithms. However, stability selection has been found to be highly conservative, resulting in low sensitivity. Further, the theoretical bound on the expected number of false positives, E(FP), is relatively loose, making it difficult to know how many false positives to expect in practice. In this paper, we introduce a novel method for stability selection based on integrating the stability paths rather than maximizing over them. This yields a tighter bound on E(FP), resulting in a feature selection criterion that has higher sensitivity in practice and is better calibrated in terms of matching the target E(FP). Our proposed method requires the same amount of computation as the original stability selection algorithm, and only requires the user to specify one input parameter, a target value for E(FP). We provide theoretical bounds on performance, and demonstrate the method on simulations and real data from cancer gene expression studies.
△ Less
Submitted 23 March, 2024;
originally announced March 2024.
-
The German Tank Problem with Multiple Factories
Authors:
Steven J. Miller,
Kishan Sharma,
Andrew K. Yang
Abstract:
During the Second World War, estimates of the number of tanks deployed by Germany were critically needed. The Allies adopted two methods to estimate this information: espionage and statistical analysis. The latter approach was far more successful and is as follows: assuming that the tanks are sequentially numbered starting from 1, if we observe $k$ serial numbers from an unknown total of $N$ tanks…
▽ More
During the Second World War, estimates of the number of tanks deployed by Germany were critically needed. The Allies adopted two methods to estimate this information: espionage and statistical analysis. The latter approach was far more successful and is as follows: assuming that the tanks are sequentially numbered starting from 1, if we observe $k$ serial numbers from an unknown total of $N$ tanks, with the highest observed number being $M$, then the best linear unbiased estimator for $N$ is $M(1+1/k)-1$. This is now known as the German Tank Problem. Suppose one wishes to estimate the productivity of a rival by inspecting captured or destroyed tanks, each with a unique serial number. In many situations, the original German Tank Problem is insufficient, since typically there are $l>1$ factories, and tanks produced by different factories may have serial numbers in disjoint ranges that are often far separated, let alone sequentially numbered starting from 1. We wish to estimate the total tank production across all of the factories. We construct an efficient procedure to estimate the total productivity and prove that our procedure effectively estimates $N$ when $\log l/\log k$ is sufficiently small, and is robust against both large and small gaps between factories. In the final section, we show that given information about the gaps, we can make a far better estimator that is also effective when we have a small number of samples. When the number of samples is small compared to the number of gaps, the Mean Squared Error of this new estimator is several orders of magnitude smaller than the one that assumes no information. This quantifies the importance of hiding such information if one wishes to conceal their productivity from a rival.
△ Less
Submitted 11 April, 2024; v1 submitted 21 March, 2024;
originally announced March 2024.
-
Gen-T: Table Reclamation in Data Lakes
Authors:
Grace Fan,
Roee Shraga,
Renée J. Miller
Abstract:
We introduce the problem of Table Reclamation. Given a Source Table and a large table repository, reclamation finds a set of tables that, when integrated, reproduce the source table as closely as possible. Unlike query discovery problems like Query-by-Example or by-Target, Table Reclamation focuses on reclaiming the data in the Source Table as fully as possible using real tables that may be incomp…
▽ More
We introduce the problem of Table Reclamation. Given a Source Table and a large table repository, reclamation finds a set of tables that, when integrated, reproduce the source table as closely as possible. Unlike query discovery problems like Query-by-Example or by-Target, Table Reclamation focuses on reclaiming the data in the Source Table as fully as possible using real tables that may be incomplete or inconsistent. To do this, we define a new measure of table similarity, called error-aware instance similarity, to measure how close a reclaimed table is to a Source Table, a measure grounded in instance similarity used in data exchange. Our search covers not only SELECT-PROJECT- JOIN queries, but integration queries with unions, outerjoins, and the unary operators subsumption and complementation that have been shown to be important in data integration and fusion. Using reclamation, a data scientist can understand if any tables in a repository can be used to exactly reclaim a tuple in the Source. If not, one can understand if this is due to differences in values or to incompleteness in the data. Our solution, Gen-T, performs table discovery to retrieve a set of candidate tables from the table repository, filters these down to a set of originating tables, then integrates these tables to reclaim the Source as closely as possible. We show that our solution, while approximate, is accurate, efficient and scalable in the size of the table repository with experiments on real data lakes containing up to 15K tables, where the average number of tuples varies from small (web tables) to extremely large (open data tables) up to 1M tuples.
△ Less
Submitted 22 March, 2024; v1 submitted 21 March, 2024;
originally announced March 2024.
-
Quantum tomography of molecules using ultrafast electron diffraction
Authors:
Jiayang Jiang,
Ming Zhang,
Aosheng Gu,
R. J. Dwayne Miller,
Zheng Li
Abstract:
We propose a quantum tomography (QT) approach to retrieve the temporally evolving reduced density matrix in elecotronic state basis, where the populations and coherence between ground state and excited state are reconstructed from the ultrafast electron diffraction signal. In order to showcase the capability of the proposed QT approach, we simulate the nuclear wavepacket dynamics and ultrafast ele…
▽ More
We propose a quantum tomography (QT) approach to retrieve the temporally evolving reduced density matrix in elecotronic state basis, where the populations and coherence between ground state and excited state are reconstructed from the ultrafast electron diffraction signal. In order to showcase the capability of the proposed QT approach, we simulate the nuclear wavepacket dynamics and ultrafast electron diffraction of photoexcited pyrrole molecules using ab initio quantum chemical CASSCF method. From simulated time-resolved diffraction data, we retrieve the evolving density matrix in a crude diabatic representation basis and reveal the symmetry of the excited pyrrole wavepacket. Our QT approach opens the route to make quantum version of "molecular movie" that covers the electronic degree of freedom, and equips ultrafast electron diffraction with the power to reveal the coherence between electronic states, relaxation and dynamics of population transfer.
△ Less
Submitted 8 March, 2024;
originally announced March 2024.
-
Proton Helicity GPDs from Lattice QCD
Authors:
Joshua Miller,
Shohini Bhattacharya,
Krzysztof Cichy,
Martha Constantinou,
Xiang Gao,
Andreas Metz,
Swagato Mukherjee,
Peter Petreczky,
Fernanda Steffens,
Yong Zhao
Abstract:
First lattice QCD calculations of $x$-dependent GPD have been performed in the (symmetric) Breit frame, where the momentum transfer is evenly divided between the initial and final hadron states. However, employing the asymmetric frame, we are able to obtain proton GPDs for multiple momentum transfers in a computationally efficient setup. In these proceedings, we focus on the helicity twist-2 GPD a…
▽ More
First lattice QCD calculations of $x$-dependent GPD have been performed in the (symmetric) Breit frame, where the momentum transfer is evenly divided between the initial and final hadron states. However, employing the asymmetric frame, we are able to obtain proton GPDs for multiple momentum transfers in a computationally efficient setup. In these proceedings, we focus on the helicity twist-2 GPD at zero skewness that gives access to the $\widetilde{H}$ GPD. We will cover the implementation of the asymmetric frame, its comparison to the Breit frame, and the dependence of the GPD on the squared four-momentum transfer, $-t$. The calculation is performed on an $N_f = 2+1+1$ ensemble of twisted mass fermions with a clover improvement. The mass of the pion for this ensemble is roughly 260 MeV.
△ Less
Submitted 8 March, 2024;
originally announced March 2024.
-
Maximizing Slice-Volumes of Semialgebraic Sets using Sum-of-Squares Programming
Authors:
Jared Miller,
Chiara Meroni,
Matteo Tacchi,
Mauricio Velasco
Abstract:
This paper presents an algorithm to maximize the volume of an affine slice through a given semialgebraic set. This slice-volume task is formulated as an infinite-dimensional linear program in continuous functions, inspired by prior work in volume computation of semialgebraic sets. A convergent sequence of upper-bounds to the maximal slice volume are computed using the moment-Sum-of-Squares hierarc…
▽ More
This paper presents an algorithm to maximize the volume of an affine slice through a given semialgebraic set. This slice-volume task is formulated as an infinite-dimensional linear program in continuous functions, inspired by prior work in volume computation of semialgebraic sets. A convergent sequence of upper-bounds to the maximal slice volume are computed using the moment-Sum-of-Squares hierarchy of semidefinite programs in increasing size. The computational complexity of this scheme can be reduced by utilizing topological structure (in dimensions 2, 3, 4, 8) and symmetry. This numerical convergence can be accelerated through the introduction of redundant Stokes-based constraints. Demonstrations of slice-volume calculation are performed on example sets.
△ Less
Submitted 7 March, 2024;
originally announced March 2024.
-
DART: Implicit Doppler Tomography for Radar Novel View Synthesis
Authors:
Tianshu Huang,
John Miller,
Akarsh Prabhakara,
Tao Jin,
Tarana Laroia,
Zico Kolter,
Anthony Rowe
Abstract:
Simulation is an invaluable tool for radio-frequency system designers that enables rapid prototyping of various algorithms for imaging, target detection, classification, and tracking. However, simulating realistic radar scans is a challenging task that requires an accurate model of the scene, radio frequency material properties, and a corresponding radar synthesis function. Rather than specifying…
▽ More
Simulation is an invaluable tool for radio-frequency system designers that enables rapid prototyping of various algorithms for imaging, target detection, classification, and tracking. However, simulating realistic radar scans is a challenging task that requires an accurate model of the scene, radio frequency material properties, and a corresponding radar synthesis function. Rather than specifying these models explicitly, we propose DART - Doppler Aided Radar Tomography, a Neural Radiance Field-inspired method which uses radar-specific physics to create a reflectance and transmittance-based rendering pipeline for range-Doppler images. We then evaluate DART by constructing a custom data collection platform and collecting a novel radar dataset together with accurate position and instantaneous velocity measurements from lidar-based localization. In comparison to state-of-the-art baselines, DART synthesizes superior radar range-Doppler images from novel views across all datasets and additionally can be used to generate high quality tomographic images.
△ Less
Submitted 6 March, 2024;
originally announced March 2024.
-
Targeted Variance Reduction: Robust Bayesian Optimization of Black-Box Simulators with Noise Parameters
Authors:
John Joshua Miller,
Simon Mak
Abstract:
The optimization of a black-box simulator over control parameters $\mathbf{x}$ arises in a myriad of scientific applications. In such applications, the simulator often takes the form $f(\mathbf{x},\boldsymbolθ)$, where $\boldsymbolθ$ are parameters that are uncertain in practice. Robust optimization aims to optimize the objective $\mathbb{E}[f(\mathbf{x},\boldsymbolΘ)]$, where…
▽ More
The optimization of a black-box simulator over control parameters $\mathbf{x}$ arises in a myriad of scientific applications. In such applications, the simulator often takes the form $f(\mathbf{x},\boldsymbolθ)$, where $\boldsymbolθ$ are parameters that are uncertain in practice. Robust optimization aims to optimize the objective $\mathbb{E}[f(\mathbf{x},\boldsymbolΘ)]$, where $\boldsymbolΘ \sim \mathcal{P}$ is a random variable that models uncertainty on $\boldsymbolθ$. For this, existing black-box methods typically employ a two-stage approach for selecting the next point $(\mathbf{x},\boldsymbolθ)$, where $\mathbf{x}$ and $\boldsymbolθ$ are optimized separately via different acquisition functions. As such, these approaches do not employ a joint acquisition over $(\mathbf{x},\boldsymbolθ)$, and thus may fail to fully exploit control-to-noise interactions for effective robust optimization. To address this, we propose a new Bayesian optimization method called Targeted Variance Reduction (TVR). The TVR leverages a novel joint acquisition function over $(\mathbf{x},\boldsymbolθ)$, which targets variance reduction on the objective within the desired region of improvement. Under a Gaussian process surrogate on $f$, the TVR acquisition can be evaluated in closed form, and reveals an insightful exploration-exploitation-precision trade-off for robust black-box optimization. The TVR can further accommodate a broad class of non-Gaussian distributions on $\mathcal{P}$ via a careful integration of normalizing flows. We demonstrate the improved performance of TVR over the state-of-the-art in a suite of numerical experiments and an application to the robust design of automobile brake discs under operational uncertainty.
△ Less
Submitted 6 March, 2024;
originally announced March 2024.
-
Data-Driven Superstabilizing Control under Quadratically-Bounded Errors-in-Variables Noise
Authors:
Jared Miller,
Tianyu Dai,
Mario Sznaier
Abstract:
The Error-in-Variables model of system identification/control involves nontrivial input and measurement corruption of observed data, resulting in generically nonconvex optimization problems. This paper performs full-state-feedback stabilizing control of all discrete-time linear systems that are consistent with observed data for which the input and measurement noise obey quadratic bounds. Instances…
▽ More
The Error-in-Variables model of system identification/control involves nontrivial input and measurement corruption of observed data, resulting in generically nonconvex optimization problems. This paper performs full-state-feedback stabilizing control of all discrete-time linear systems that are consistent with observed data for which the input and measurement noise obey quadratic bounds. Instances of such quadratic bounds include elementwise norm bounds (at each time sample), energy bounds (across the entire signal), and chance constraints arising from (sub)gaussian noise. Superstabilizing controllers are generated through the solution of a sum-of-squares hierarchy of semidefinite programs. A theorem of alternatives is employed to eliminate the input and measurement noise process, thus improving tractability.
△ Less
Submitted 17 May, 2024; v1 submitted 6 March, 2024;
originally announced March 2024.
-
Model Lakes
Authors:
Koyena Pal,
David Bau,
Renée J. Miller
Abstract:
Given a set of deep learning models, it can be hard to find models appropriate to a task, understand the models, and characterize how models are different one from another. Currently, practitioners rely on manually-written documentation to understand and choose models. However, not all models have complete and reliable documentation. As the number of machine learning models increases, this issue o…
▽ More
Given a set of deep learning models, it can be hard to find models appropriate to a task, understand the models, and characterize how models are different one from another. Currently, practitioners rely on manually-written documentation to understand and choose models. However, not all models have complete and reliable documentation. As the number of machine learning models increases, this issue of finding, differentiating, and understanding models is becoming more crucial. Inspired from research on data lakes, we introduce and define the concept of model lakes. We discuss fundamental research challenges in the management of large models. And we discuss what principled data management techniques can be brought to bear on the study of large model management.
△ Less
Submitted 4 March, 2024;
originally announced March 2024.
-
Test for Echo: X-ray Reflection Variability in the Seyfert-2 AGN NGC 4388
Authors:
B. Gediman,
J. M. Miller,
A. Zoghbi,
P. Draghis,
Z. Arzoumanian,
W. N . Brandt,
K. Gendreau
Abstract:
We report on a study of the narrow Fe K$α$ line and reflection spectrum in the well-known Seyfert-2 AGN, NGC 4388. X-ray spectra summed from two extensive NICER monitoring campaigns, separated by years, show strong evidence of variation in the direct continuum and reflected emission, but only small variations in the obscuring gas. Fits to the spectra from individual NICER observations find a stron…
▽ More
We report on a study of the narrow Fe K$α$ line and reflection spectrum in the well-known Seyfert-2 AGN, NGC 4388. X-ray spectra summed from two extensive NICER monitoring campaigns, separated by years, show strong evidence of variation in the direct continuum and reflected emission, but only small variations in the obscuring gas. Fits to the spectra from individual NICER observations find a strong, positive correlation between the power-law photon index, $Γ$, and direct flux that is commonly observed in unobscured AGN. A search for a reverberation lag between the direct and reflected spectra -- dominated by the narrow Fe K$α$ emission line -- measures a time scale of $t = 16.37^{+0.46}_{-0.38}$ days, or a characteristic radius of $r=1.374_{-0.032}^{+0.039}\times10^{-2}$ pc $=3.4_{-0.1}^{+0.1}\times10^4\;GM/c^2$. Only one cycle of this tentative lag is observed, but it is driven by a particularly sharp drop in the direct continuum that leads to the subsequent disappearance of the otherwise prominent Fe K$α$ line. Physically motivated fits to high-resolution Chandra spectra of NGC 4388 measure a line production radius of $r =2.9^{+1.2}_{-0.7}~\times 10^{4}~GM/c^{2}$, formally consistent with the tentative lag. The line profile also prefers a Compton-thick reflector, indicating an origin in the disk and/or thick clumps within a wind. We discuss the strengths and weaknesses of our analysis and methods for testing our results in future observations, and we note the potential for X-ray reverberation lags to constrain black hole masses in obscured Seyferts wherein the optical broad line region is not visible.
△ Less
Submitted 1 March, 2024;
originally announced March 2024.
-
Detailed Report on the Measurement of the Positive Muon Anomalous Magnetic Moment to 0.20 ppm
Authors:
D. P. Aguillard,
T. Albahri,
D. Allspach,
A. Anisenkov,
K. Badgley,
S. Baeßler,
I. Bailey,
L. Bailey,
V. A. Baranov,
E. Barlas-Yucel,
T. Barrett,
E. Barzi,
F. Bedeschi,
M. Berz,
M. Bhattacharya,
H. P. Binney,
P. Bloom,
J. Bono,
E. Bottalico,
T. Bowcock,
S. Braun,
M. Bressler,
G. Cantatore,
R. M. Carey,
B. C. K. Casey
, et al. (168 additional authors not shown)
Abstract:
We present details on a new measurement of the muon magnetic anomaly, $a_μ= (g_μ-2)/2$. The result is based on positive muon data taken at Fermilab's Muon Campus during the 2019 and 2020 accelerator runs. The measurement uses $3.1$ GeV$/c$ polarized muons stored in a $7.1$-m-radius storage ring with a $1.45$ T uniform magnetic field. The value of $ a_μ$ is determined from the measured difference b…
▽ More
We present details on a new measurement of the muon magnetic anomaly, $a_μ= (g_μ-2)/2$. The result is based on positive muon data taken at Fermilab's Muon Campus during the 2019 and 2020 accelerator runs. The measurement uses $3.1$ GeV$/c$ polarized muons stored in a $7.1$-m-radius storage ring with a $1.45$ T uniform magnetic field. The value of $ a_μ$ is determined from the measured difference between the muon spin precession frequency and its cyclotron frequency. This difference is normalized to the strength of the magnetic field, measured using Nuclear Magnetic Resonance (NMR). The ratio is then corrected for small contributions from beam motion, beam dispersion, and transient magnetic fields. We measure $a_μ= 116 592 057 (25) \times 10^{-11}$ (0.21 ppm). This is the world's most precise measurement of this quantity and represents a factor of $2.2$ improvement over our previous result based on the 2018 dataset. In combination, the two datasets yield $a_μ(\text{FNAL}) = 116 592 055 (24) \times 10^{-11}$ (0.20 ppm). Combining this with the measurements from Brookhaven National Laboratory for both positive and negative muons, the new world average is $a_μ$(exp) $ = 116 592 059 (22) \times 10^{-11}$ (0.19 ppm).
△ Less
Submitted 22 May, 2024; v1 submitted 23 February, 2024;
originally announced February 2024.
-
A jump operator on the Weihrauch degrees
Authors:
Uri Andrews,
Steffen Lempp,
Alberto Marcone,
Joseph S. Miller,
Manlio Valenti
Abstract:
A partial order $(P,\le)$ admits a jump operator if there is a map $j\colon P \to P$ that is strictly increasing and weakly monotone. Despite its name, the jump in the Weihrauch lattice fails to satisfy both of these properties: it is not degree-theoretic and there are functions $f$ such that $f\equiv_{\mathrm{W}} f'$. This raises the question: is there a jump operator in the Weihrauch lattice? We…
▽ More
A partial order $(P,\le)$ admits a jump operator if there is a map $j\colon P \to P$ that is strictly increasing and weakly monotone. Despite its name, the jump in the Weihrauch lattice fails to satisfy both of these properties: it is not degree-theoretic and there are functions $f$ such that $f\equiv_{\mathrm{W}} f'$. This raises the question: is there a jump operator in the Weihrauch lattice? We answer this question positively and provide an explicit definition for an operator on partial multi-valued functions that, when lifted to the Weihrauch degrees, induces a jump operator. This new operator, called the totalizing jump, can be characterized in terms of the total continuation, a well-known operator on computational problems. The totalizing jump induces an injective endomorphism of the Weihrauch degrees. We study some algebraic properties of the totalizing jump and characterize its behavior on some pivotal problems in the Weihrauch lattice.
△ Less
Submitted 20 February, 2024;
originally announced February 2024.
-
Spaceport Facility Location Planning within the US National Airspace System
Authors:
Haochen Wu,
Kevin R. Sun,
Jackson A. Miller,
Oliver Jia-Richards,
Max Z. Li
Abstract:
The burgeoning commercial space transportation industry necessitates an expansion of launch infrastructure to meet rising demands. However, future operations from these large-scale infrastructures can result in new impacts, particularly to air traffic operations. To rigorously reason about where such future spaceports might be located and what their impacts might be, we introduce a facility locati…
▽ More
The burgeoning commercial space transportation industry necessitates an expansion of launch infrastructure to meet rising demands. However, future operations from these large-scale infrastructures can result in new impacts, particularly to air traffic operations. To rigorously reason about where such future spaceports might be located and what their impacts might be, we introduce a facility location planning model for future US spaceports (SPFLP). Central considerations for the SPFLP include population density, space launch trajectories, and potential impacts to air traffic within the US National Airspace System (NAS). The SPFLP outputs a cost-optimal set of candidate locations for future spaceports while satisfying a range of operational constraints. By conducting sensitivity analyses on the SPFLP, we are able to examine differences in flight rerouting costs and optimal launch mission allocations. Our model and numerical experiments offer valuable insights for future spaceport site selection, contributing to the strategic development of commercial space transportation while keeping in mind the need to integrate these operations within the NAS.
△ Less
Submitted 17 February, 2024;
originally announced February 2024.
-
TAI-GAN: A Temporally and Anatomically Informed Generative Adversarial Network for early-to-late frame conversion in dynamic cardiac PET inter-frame motion correction
Authors:
Xueqi Guo,
Luyao Shi,
Xiongchao Chen,
Qiong Liu,
Bo Zhou,
Huidong Xie,
Yi-Hwa Liu,
Richard Palyo,
Edward J. Miller,
Albert J. Sinusas,
Lawrence H. Staib,
Bruce Spottiswoode,
Chi Liu,
Nicha C. Dvornek
Abstract:
Inter-frame motion in dynamic cardiac positron emission tomography (PET) using rubidium-82 (82-Rb) myocardial perfusion imaging impacts myocardial blood flow (MBF) quantification and the diagnosis accuracy of coronary artery diseases. However, the high cross-frame distribution variation due to rapid tracer kinetics poses a considerable challenge for inter-frame motion correction, especially for ea…
▽ More
Inter-frame motion in dynamic cardiac positron emission tomography (PET) using rubidium-82 (82-Rb) myocardial perfusion imaging impacts myocardial blood flow (MBF) quantification and the diagnosis accuracy of coronary artery diseases. However, the high cross-frame distribution variation due to rapid tracer kinetics poses a considerable challenge for inter-frame motion correction, especially for early frames where intensity-based image registration techniques often fail. To address this issue, we propose a novel method called Temporally and Anatomically Informed Generative Adversarial Network (TAI-GAN) that utilizes an all-to-one mapping to convert early frames into those with tracer distribution similar to the last reference frame. The TAI-GAN consists of a feature-wise linear modulation layer that encodes channel-wise parameters generated from temporal information and rough cardiac segmentation masks with local shifts that serve as anatomical information. Our proposed method was evaluated on a clinical 82-Rb PET dataset, and the results show that our TAI-GAN can produce converted early frames with high image quality, comparable to the real reference frames. After TAI-GAN conversion, the motion estimation accuracy and subsequent myocardial blood flow (MBF) quantification with both conventional and deep learning-based motion correction methods were improved compared to using the original frames.
△ Less
Submitted 14 February, 2024;
originally announced February 2024.
-
Measuring Sharpness in Grokking
Authors:
Jack Miller,
Patrick Gleeson,
Charles O'Neill,
Thang Bui,
Noam Levi
Abstract:
Neural networks sometimes exhibit grokking, a phenomenon where perfect or near-perfect performance is achieved on a validation set well after the same performance has been obtained on the corresponding training set. In this workshop paper, we introduce a robust technique for measuring grokking, based on fitting an appropriate functional form. We then use this to investigate the sharpness of transi…
▽ More
Neural networks sometimes exhibit grokking, a phenomenon where perfect or near-perfect performance is achieved on a validation set well after the same performance has been obtained on the corresponding training set. In this workshop paper, we introduce a robust technique for measuring grokking, based on fitting an appropriate functional form. We then use this to investigate the sharpness of transitions in training and validation accuracy under two settings. The first setting is the theoretical framework developed by Levi et al. (2023) where closed form expressions are readily accessible. The second setting is a two-layer MLP trained to predict the parity of bits, with grokking induced by the concealment strategy of Miller et al. (2023). We find that trends between relative grokking gap and grokking sharpness are similar in both settings when using absolute and relative measures of sharpness. Reflecting on this, we make progress toward explaining some trends and identify the need for further study to untangle the various mechanisms which influence the sharpness of grokking.
△ Less
Submitted 14 February, 2024;
originally announced February 2024.
-
The Shifting Impact of Recurrent Flooding on Transportation Accessibility: A Case Study of Affected Populations in The Hampton Roads Region
Authors:
Luwei Zeng,
T. Donna Chen,
John S. Miller,
Faria Tuz Zahura,
Jonathan L. Goodall
Abstract:
Accelerated sea level rise has resulted in recurrent flooding in coastal regions, increasingly impacting both transportation systems and local populations. Using the Hampton Roads region in Virginia as a case study, this study a. identifies hotspots with frequent, significant accessibility reduction for work and nonwork travel utilizing crowdsourced WAZE flood report data during the month of Augus…
▽ More
Accelerated sea level rise has resulted in recurrent flooding in coastal regions, increasingly impacting both transportation systems and local populations. Using the Hampton Roads region in Virginia as a case study, this study a. identifies hotspots with frequent, significant accessibility reduction for work and nonwork travel utilizing crowdsourced WAZE flood report data during the month of August over 5 years: 2018 to 2022; and b. examines the shifts in social vulnerability in populations residing in these hotspots over the 5 year period using 2016 and 2021 American Community Survey data. Results show that approximately 12 percent and 3 percent of the population of the region reside in hotspots experiencing significant recurrent flooding-induced accessibility reduction for work and nonwork trips. Social vulnerability analysis revealed that populations with greater socioeconomic and transportation vulnerabilities are more susceptible to recurrent flooding induced accessibility impacts in terms of both extent and frequency. Furthermore, a comparison of social vulnerability indices between 2016 and 2021 shows an increasing trend of social vulnerability for highly impacted zones, with low income, disabled, and households with young children having restricted ability to relocate from these zones. The findings reinforce the necessity for spatially and temporally disaggregated studies of climate event impacts. Furthermore, the longer term population trends highlight the importance of dynamic assessment of climate event impacts at different time scales.
△ Less
Submitted 10 February, 2024;
originally announced February 2024.
-
A Survey of a Random Matrix Model for a Family of Cusp Forms
Authors:
Owen Barrett,
Zoë X. Batterman,
Aditya Jambhale,
Steven J. Miller,
Akash L. Narayanan,
Kishan Sharma,
Chris Yao
Abstract:
The Katz-Sarnak philosophy states that statistics of zeros of $L$-function families near the central point as the conductors tend to infinity agree with those of eigenvalues of random matrix ensembles as the matrix size tends to infinity. While numerous results support this conjecture, S. J. Miller observed that for finite conductors, very different behavior can occur for zeros near the central po…
▽ More
The Katz-Sarnak philosophy states that statistics of zeros of $L$-function families near the central point as the conductors tend to infinity agree with those of eigenvalues of random matrix ensembles as the matrix size tends to infinity. While numerous results support this conjecture, S. J. Miller observed that for finite conductors, very different behavior can occur for zeros near the central point in elliptic curve families. This led to the excised model of Dueñez, Huynh, Keating, Miller, and Snaith, whose predictions for quadratic twists of a given elliptic curve are beautifully fit by the data. The key ingredients are relating the discretization of central values of the $L$-functions to excising matrices based on the value of the characteristic polynomials at 1 and using lower order terms (in statistics such as the one-level density and pair-correlation) to adjust the matrix size. We discuss recent successes by the authors in extending this model to a family of quadratic twists of finite conductor of a given holomorphic cuspidal newform of level an odd prime level. In particular, we predict very little repulsion for forms with weight greater than 2.
△ Less
Submitted 17 April, 2024; v1 submitted 28 January, 2024;
originally announced February 2024.
-
Assessing The Spatially Heterogeneous Impact of Recurrent Flooding On Accessibility: A Case Study of The Hampton Roads Region:Part 2 Transit Accessibility
Authors:
Luwei Zeng,
T. Donna Chen,
John S. Miller,
Jonathan L. Goodall,
Faria Tuz Zahura
Abstract:
Due to accelerated sea level rise and climate change, the transportation system is increasingly affected by recurrent flooding coastal regions, yet the cumulative travel disruption effects are not well understood. In Part 1 of this study, the accessibility impacts of recurrent flooding on the auto mode were examined. In this paper (Part 2 of the study), the impact of recurrent flooding on transit…
▽ More
Due to accelerated sea level rise and climate change, the transportation system is increasingly affected by recurrent flooding coastal regions, yet the cumulative travel disruption effects are not well understood. In Part 1 of this study, the accessibility impacts of recurrent flooding on the auto mode were examined. In this paper (Part 2 of the study), the impact of recurrent flooding on transit service accessibility was quantified with the aid of spatially and temporally disaggregated crowdsourced flood incident data from WAZE. A fixed route transit network is built for five time of day periods for 710 traffic analysis zones (TAZs), to capture the spatial and temporal variation of transit accessibility reduction due to recurrent flooding. Results show that the greatest transit accessibility reduction occurs during the morning peak hour, with individual TAZ transit accessibility reduction ranging from 0 to 88.2% for work trips (with an average of 6.4%) and ranging from 0 to 99.9% for non-work trips (with an average of 3.7%). Furthermore, social vulnerability analysis indicates that TAZs with a greater share of people with higher vulnerability in transportation and socioeconomic status are more likely to experience recurrent flooding-induced transit accessibility reduction. Results from this study reinforce the notion that transportation impacts under recurrent flooding are not uniformly experienced throughout a region, and this spatial and temporal variation translates to different impacts borne by various population groups. Disaggregate impact analysis like this study can support transportation engineers and planners to prioritize resources to ensure equitable transit accessibility under increasing climate disruptions.
△ Less
Submitted 12 January, 2024;
originally announced February 2024.
-
Assessing The Spatially Heterogeneous Transportation Impacts of Recurrent Flooding in The Hampton Roads Region: Part 1 Auto Accessibility
Authors:
Luwei Zeng,
T. Donna Chen,
John S. Miller,
Jonathan L. Goodall,
Faria Tuz Zahura
Abstract:
Recurrent flooding has increased rapidly in coastal regions due to sea level rise and climate change. A key metric for evaluating transportation system degradation is accessibility, yet the lack of temporally and spatially disaggregate data means that the impact of recurrent flooding on accessibility, and hence transportation system performance: is not well understood. Using crowdsourced WAZE floo…
▽ More
Recurrent flooding has increased rapidly in coastal regions due to sea level rise and climate change. A key metric for evaluating transportation system degradation is accessibility, yet the lack of temporally and spatially disaggregate data means that the impact of recurrent flooding on accessibility, and hence transportation system performance: is not well understood. Using crowdsourced WAZE flood incident data from the Hampton Roads region in Virginia, this study (Part 1) examines changes in the roadway network accessibility for travelers residing in 1,113 traffic analysis zones (TAZs) across five time of day periods. Additionally, a social vulnerability index framework is developed to understand the socioeconomic characteristics of TAZs that experience high accessibility reduction under recurrent flooding.
Results show that TAZs experience the most accessibility reduction under recurrent flooding during the morning peak period (6 to 9am) with large differences across different zones, ranging from 0 to 49.6 (percentage) for work trips (with population weighted mean reduction of 1.71 percent) and 0 to 87.9 (percentage) for nonwork trips (with population weighted mean reduction of 0.81 percent). Furthermore, the social vulnerability analysis showed that zones with higher percentages of lower socioeconomic status, unemployed, less educated, and limited English proficiency residents experience greater accessibility reduction for work trips. In contrast to previous studies that aggregate the effects of recurrent flooding across a city, these results demonstrate that there exists large spatial and temporal variation in recurrent floodings impacts on accessibility. This study also highlights the need to include social vulnerability analysis in assessing impacts of climate events, to ensure equitable outcomes as investments are made to create resilient transportation infrastructure.
△ Less
Submitted 12 January, 2024;
originally announced February 2024.
-
Universal Post-Training Reverse-Engineering Defense Against Backdoors in Deep Neural Networks
Authors:
Xi Li,
Hang Wang,
David J. Miller,
George Kesidis
Abstract:
A variety of defenses have been proposed against backdoors attacks on deep neural network (DNN) classifiers. Universal methods seek to reliably detect and/or mitigate backdoors irrespective of the incorporation mechanism used by the attacker, while reverse-engineering methods often explicitly assume one. In this paper, we describe a new detector that: relies on internal feature map of the defended…
▽ More
A variety of defenses have been proposed against backdoors attacks on deep neural network (DNN) classifiers. Universal methods seek to reliably detect and/or mitigate backdoors irrespective of the incorporation mechanism used by the attacker, while reverse-engineering methods often explicitly assume one. In this paper, we describe a new detector that: relies on internal feature map of the defended DNN to detect and reverse-engineer the backdoor and identify its target class; can operate post-training (without access to the training dataset); is highly effective for various incorporation mechanisms (i.e., is universal); and which has low computational overhead and so is scalable. Our detection approach is evaluated for different attacks on benchmark CIFAR-10 and CIFAR-100 image classifiers.
△ Less
Submitted 22 May, 2024; v1 submitted 3 February, 2024;
originally announced February 2024.
-
On the global in time existence and uniqueness of solutions to the Boltzmann hierarchy
Authors:
Ioakeim Ampatzoglou,
Joseph K. Miller,
Nataša Pavlović,
Maja Tasković
Abstract:
In this paper we establish the global in time existence and uniqueness of solutions to the Boltzmann hierarchy, a hierarchy of equations instrumental for the rigorous derivation of the Boltzmann equation from many particles. Inspired by available $L^{\infty}$-based a-priori estimate for solutions to the Boltzmann equation, we develop the polynomially weighted $L^\infty$ a-priori bounds for solutio…
▽ More
In this paper we establish the global in time existence and uniqueness of solutions to the Boltzmann hierarchy, a hierarchy of equations instrumental for the rigorous derivation of the Boltzmann equation from many particles. Inspired by available $L^{\infty}$-based a-priori estimate for solutions to the Boltzmann equation, we develop the polynomially weighted $L^\infty$ a-priori bounds for solutions to the Boltzmann hierarchy and handle the factorial growth of the number of terms in the Dyson's series by reorganizing the sum through a combinatorial technique known as the Klainerman-Machedon board game argument. This paper is the first work that exploits such a combinatorial technique in conjunction with an $L^{\infty}$-based estimate to prove uniqueness of the mild solutions to the Boltzmann hierarchy. Our proof of existence of global in time mild solutions to the Boltzmann hierarchy for admissible initial data is constructive and it employs known global in time solutions to the Boltzmann equation via a Hewitt-Savage type theorem.
△ Less
Submitted 1 February, 2024;
originally announced February 2024.
-
Uniform twisted homological stability
Authors:
Jeremy Miller,
Peter Patzt,
Dan Petersen,
Oscar Randal-Williams
Abstract:
We prove a homological stability theorem for families of discrete groups (e.g. mapping class groups, automorphism groups of free groups, braid groups) with coefficients in a sequence of irreducible algebraic representations of arithmetic groups. The novelty is that the stable range is independent of the choice of representation. Combined with earlier work of Bergström--Diaconu--Petersen--Westerlan…
▽ More
We prove a homological stability theorem for families of discrete groups (e.g. mapping class groups, automorphism groups of free groups, braid groups) with coefficients in a sequence of irreducible algebraic representations of arithmetic groups. The novelty is that the stable range is independent of the choice of representation. Combined with earlier work of Bergström--Diaconu--Petersen--Westerland this proves the Conrey--Farmer--Keating--Rubinstein--Snaith predictions for all moments of the family of quadratic $L$-functions over function fields, for sufficiently large odd prime powers.
△ Less
Submitted 8 February, 2024; v1 submitted 1 February, 2024;
originally announced February 2024.
-
Optimizing NILC Extractions of the Thermal Sunyaev-Zeldovich Effect with Deep Learning
Authors:
Cameron T. Pratt,
Zhijie Qu,
Joel N. Bregman,
Christopher J. Miller
Abstract:
All-sky maps of the thermal Sunyaev-Zel'dovich effect (SZ) tend to suffer from systematic features arising from the component separation techniques used to extract the signal. In this work, we investigate one of these methods known as needlet internal linear combination (NILC) and test its performance on simulated data. We show that NILC estimates are strongly affected by the choice of the spatial…
▽ More
All-sky maps of the thermal Sunyaev-Zel'dovich effect (SZ) tend to suffer from systematic features arising from the component separation techniques used to extract the signal. In this work, we investigate one of these methods known as needlet internal linear combination (NILC) and test its performance on simulated data. We show that NILC estimates are strongly affected by the choice of the spatial localization parameter ($Γ$), which controls a bias-variance trade-off. Typically, NILC extractions assume a fixed value of $Γ$ over the entire sky, but we show there exists an optimal $Γ$ that depends on the SZ signal strength and local contamination properties. Then we calculate the NILC solutions for multiple values of $Γ$ and feed the results into a neural network to predict the SZ signal. This extraction method, which we call Deep-NILC, is tested against a set of validation data, including recovered radial profiles of resolved systems. Our main result is that Deep-NILC offers significant improvements over choosing fixed values of $Γ$.
△ Less
Submitted 31 January, 2024;
originally announced February 2024.