-
A Spectral Atlas of Lyman Alpha Emitters at z = 5.7 and z = 6.6
Authors:
A. Songaila,
L. L. Cowie,
A. J. Barger,
E. M. Hu,
A. J. Taylor
Abstract:
We present two uniformly observed spectroscopic samples of Ly-alpha emitters (LAEs) (127 at z = 5.7 and 82 at z = 6.6), which we use to investigate the evolution of the LAE population at these redshifts. The observations cover a large field (44 sq. deg) in the North Ecliptic Pole (HEROES), as well as several smaller fields. We have a small number of exotic LAEs in the samples: double-peaked Ly-alpha profiles; very extended red wings; and one impressive lensed LAE cross. We also find three broad-line AGNs. We compare the Ly-alpha line width measurements at the two redshifts, finding that the lower-luminosity LAEs show a strong evolution of decreasing line width with increasing redshift, while the high-luminosity LAEs do not, with a transition luminosity of log L(Ly-alpha) = 43.25 erg s^-1. Thus, at z = 6.6, the high-luminosity LAEs may be producing large ionized bubbles themselves, or they may be residing in overdense galaxy sites that are producing such bubbles. In order to avoid losses in the red wing, the radius of the ionized bubble must be larger than 1 pMpc. The double-peaked LAEs also require transmission on the blue side. For the four at z = 6.6, we use models to estimate the proximity radii, R_a, where the ionizing flux of the galaxy is sufficient to make the surroundings have a low enough neutral fraction to pass the blue light. Since the required R_a are large, multiple ionizing sources in the vicinity may be needed.
Submitted 11 July, 2024;
originally announced July 2024.
-
Sulphur dioxide in the mid-infrared transmission spectrum of WASP-39b
Authors:
Diana Powell,
Adina D. Feinstein,
Elspeth K. H. Lee,
Michael Zhang,
Shang-Min Tsai,
Jake Taylor,
James Kirk,
Taylor Bell,
Joanna K. Barstow,
Peter Gao,
Jacob L. Bean,
Jasmina Blecic,
Katy L. Chubb,
Ian J. M. Crossfield,
Sean Jordan,
Daniel Kitzmann,
Sarah E. Moran,
Giuseppe Morello,
Julianne I. Moses,
Luis Welbanks,
Jeehyun Yang,
Xi Zhang,
Eva-Maria Ahrer,
Aaron Bello-Arufe,
Jonathan Brande
, et al. (48 additional authors not shown)
Abstract:
The recent inference of sulphur dioxide (SO$_2$) in the atmosphere of the hot ($\sim$1100 K), Saturn-mass exoplanet WASP-39b from near-infrared JWST observations suggests that photochemistry is a key process in high temperature exoplanet atmospheres. This is due to the low ($<$1 ppb) abundance of SO$_2$ under thermochemical equilibrium, compared to that produced from the photochemistry of H$_2$O and H$_2$S (1-10 ppm). However, the SO$_2$ inference was made from a single, small molecular feature in the transmission spectrum of WASP-39b at 4.05 $μ$m, and therefore the detection of other SO$_2$ absorption bands at different wavelengths is needed to better constrain the SO$_2$ abundance. Here we report the detection of SO$_2$ spectral features at 7.7 and 8.5 $μ$m in the 5-12 $μ$m transmission spectrum of WASP-39b measured by the JWST Mid-Infrared Instrument (MIRI) Low Resolution Spectrometer (LRS). Our observations suggest an abundance of SO$_2$ of 0.5-25 ppm (1$σ$ range), consistent with previous findings. In addition to SO$_2$, we find broad water vapour absorption features, as well as an unexplained decrease in the transit depth at wavelengths longer than 10 $μ$m. Fitting the spectrum with a grid of atmospheric forward models, we derive an atmospheric heavy element content (metallicity) for WASP-39b of $\sim$7.1-8.0 $\times$ solar and demonstrate that photochemistry shapes the spectra of WASP-39b across a broad wavelength range.
Submitted 10 July, 2024;
originally announced July 2024.
-
Representation growth of Fuchsian groups and modular forms
Authors:
Michael Larsen,
Jay Taylor,
Pham Huu Tiep
Abstract:
Let $Γ$ be a cocompact, oriented Fuchsian group which is not on an explicit finite list of possible exceptions and $q$ a sufficiently large prime power not divisible by the order of any non-trivial torsion element of $Γ$. Then $|\mathrm{Hom}(Γ,\mathrm{GL}_n(q))|\sim c_{q,n} q^{(1-χ(Γ))n^2}$, where $c_{q,n}$ is periodic in $n$. As a function of $q$, $c_{q,n}$ can be expressed as a Puiseux series in $1/q$ whose coefficients are periodic in $n$ and $q$. Moreover, this series is essentially the $q$-expansion of a meromorphic modular form of half-integral weight.
Submitted 11 July, 2024; v1 submitted 9 July, 2024;
originally announced July 2024.
-
DarkSide-20k sensitivity to light dark matter particles
Authors:
DarkSide-20k Collaboration,
F. Acerbi,
P. Adhikari,
P. Agnes,
I. Ahmad,
S. Albergo,
I. F. M. Albuquerque,
T. Alexander,
A. K. Alton,
P. Amaudruz,
M. Angiolilli,
E. Aprile,
R. Ardito,
M. Atzori Corona,
D. J. Auty,
M. Ave,
I. C. Avetisov,
O. Azzolini,
H. O. Back,
Z. Balmforth,
A. Barrado Olmedo,
P. Barrillon,
G. Batignani,
P. Bhowmick
, et al. (289 additional authors not shown)
Abstract:
The dual-phase liquid argon time projection chamber is presently one of the leading technologies to search for dark matter particles with masses below 10 GeV/c$^2$. This was demonstrated by the DarkSide-50 experiment with approximately 50 kg of low-radioactivity liquid argon as target material. The next generation experiment DarkSide-20k, currently under construction, will use 1,000 times more argon and is expected to start operation in 2027. Based on the DarkSide-50 experience, here we assess the DarkSide-20k sensitivity to models predicting light dark matter particles, including Weakly Interacting Massive Particles (WIMPs) and sub-GeV/c$^2$ particles interacting with electrons in argon atoms. With one year of data, a sensitivity improvement to dark matter interaction cross-sections by at least one order of magnitude with respect to DarkSide-50 is expected for all these models. A sensitivity to WIMP--nucleon interaction cross-sections below $1\times10^{-42}$ cm$^2$ is achievable for WIMP masses above 800 MeV/c$^2$. With 10 years exposure, the neutrino fog can be reached for WIMP masses around 5 GeV/c$^2$.
Submitted 8 July, 2024;
originally announced July 2024.
-
Whose Knowledge is Valued?: Epistemic Injustice in CSCW Applications
Authors:
Leah Hope Ajmani,
Jasmine C Foriest,
Jordan Taylor,
Kyle Pittman,
Sarah Gilbert,
Michael Ann Devito
Abstract:
Social computing scholars have long known that people do not interact with knowledge in straightforward ways, especially in digital environments. While policies around knowledge are essential for targeting misinformation, they are value-laden; in choosing how to present information, we undermine non-traditional -- often non-Western -- ways of knowing. Epistemic injustice is the systemic exclusion of certain people and methods from the knowledge canon. Epistemic injustice chips away at one's testimony and vocabulary until they are stripped of their due right to know and understand. In this paper, we articulate how epistemic injustice in sociotechnical applications leads to material harm. Inspired by a hybrid collaborative autoethnography of 14 CSCW practitioners, we present three cases of epistemic injustice in sociotechnical applications: online transgender healthcare, identity sensemaking on r/bisexual, and Indigenous ways of knowing on r/AskHistorians. We further explore signature tensions across our autoethnographic materials and relate them to previous CSCW research areas and personal non-technological experiences. We argue that epistemic injustice can serve as a unifying and intersectional lens for CSCW research by surfacing dimensions of epistemic community and power. Finally, we present a call to action of three changes the CSCW community should make to move toward its own goals of research justice. We call for CSCW researchers to center individual experiences, bolster communities, and remediate issues of epistemic power as a means towards epistemic justice. In sum, we recount, synthesize, and propose solutions for the various forms of epistemic injustice that CSCW sites of study -- including CSCW itself -- propagate.
Submitted 3 July, 2024;
originally announced July 2024.
-
BOWIE-ALIGN: How formation and migration histories of giant planets impact atmospheric compositions
Authors:
Anna B. T. Penzlin,
Richard A. Booth,
James Kirk,
James E. Owen,
Eva-Maria Ahrer,
Duncan A. Christie,
Alastair B. Claringbold,
Emma Esparza-Borges,
M. López-Morales,
N. J. Mayne,
Mason McCormack,
Annabella Meech,
Vatsal Panwar,
Diana Powell,
Denis E. Sergeev,
Jake Taylor,
Peter J. Wheatley,
Maria Zamyatina
Abstract:
Hot Jupiters present a unique opportunity for measuring how planet formation history shapes present-day atmospheric composition. However, due to the myriad pathways influencing composition, a well-constructed sample of planets is needed to determine whether formation history can be accurately traced back from atmospheric composition. To this end, the BOWIE-ALIGN survey will compare the compositions of 8 hot Jupiters around F stars, 4 with orbits aligned with the stellar rotation axis and 4 misaligned. Using the alignment as an indicator for planets that underwent disc migration or high-eccentricity migration, one can determine whether migration history produces notable differences in composition between the two samples of planets. This paper describes the planet formation model that motivates our observing programme. Our model traces the accretion of chemical components from the gas and dust in the disc over a broad parameter space to create a full, unbiased model sample from which we can estimate the range of final atmospheric compositions. For high metallicity atmospheres (O/H > 10 times solar), the C/O ratios of aligned and misaligned planets diverge, with aligned planets having lower C/O (< 0.25) due to the accretion of oxygen-rich silicates from the inner disc. However, silicates may rain out instead of releasing their oxygen into the atmosphere. This would significantly increase the C/O of aligned planets (C/O > 0.6), inverting the trend between the aligned and misaligned planets. Nevertheless, by comparing statistically significant samples of aligned and misaligned planets, we expect atmospheric composition to constrain how planets form.
Submitted 4 July, 2024; v1 submitted 3 July, 2024;
originally announced July 2024.
-
BOWIE-ALIGN: A JWST comparative survey of aligned vs misaligned hot Jupiters to test the dependence of atmospheric composition on migration history
Authors:
James Kirk,
Eva-Maria Ahrer,
Anna B. T. Penzlin,
James E. Owen,
Richard A. Booth,
Lili Alderson,
Duncan A. Christie,
Alastair B. Claringbold,
Emma Esparza-Borges,
Chloe E. Fisher,
Mercedes López-Morales,
N. J. Mayne,
Mason McCormack,
Annabella Meech,
Vatsal Panwar,
Diana Powell,
Jake Taylor,
Denis E. Sergeev,
Daniel Valentine,
Hannah R. Wakeford,
Peter J. Wheatley,
Maria Zamyatina
Abstract:
A primary objective of exoplanet atmosphere characterisation is to learn about planet formation and evolution; however, this is challenged by degeneracies. To determine whether differences in atmospheric composition can be reliably traced to differences in evolution, we are undertaking a new survey with JWST to compare the compositions of a sample of hot Jupiters that orbit F stars above the Kraft break with different orbital alignments. Under the assumption that aligned planets migrate through the inner disc, while misaligned planets migrate after disc dispersal, the act of migrating through the inner disc should lead to a measurable difference in the C/O between aligned and misaligned planets. We expect the amplitude and sign of this difference to depend on the amount of planetesimal accretion and whether silicates accreted from the inner disc release their oxygen. Here, we identify all known exoplanets that are suitable for testing this hypothesis, describe our JWST survey, and use noise simulations and atmospheric retrievals to estimate our survey's sensitivity. With the selected sample of four aligned and four misaligned hot Jupiters, we will be sensitive to the predicted differences in C/O between aligned and misaligned hot Jupiters for a wide range of model scenarios.
Submitted 3 July, 2024;
originally announced July 2024.
-
The infall region as a complementary probe to cluster abundance
Authors:
Charlie T. Mpetha,
James E. Taylor,
Yuba Amoura,
Roan Haggar
Abstract:
Galaxy cluster abundance measurements provide a classic test of cosmology. They are most sensitive to the evolved amplitude of fluctuations, usually expressed as $S_8 = σ_8\sqrt{Ω_{\rm m}/0.3}$. Thus, abundance constraints exhibit a strong degeneracy between $σ_8$ and $Ω_{\rm m}$, as do other similar low-redshift tests such as cosmic shear. The mass distribution in the infall region around galaxy clusters, where material is being accreted from the surrounding field, also exhibits a cosmological dependence, but in this case it is nearly orthogonal to the $S_8$ direction in the $Ω_{\rm m}$--$σ_8$ plane, making it highly complementary to halo abundance or cosmic shear studies. We explore how weak lensing measurements of the infall region might be used to complement abundance studies, considering three different tests. The splashback radius is a prominent feature of the infall region; we show that detection of this feature in lensing data from the Euclid survey could independently constrain $Ω_{\rm m}$ and $σ_8$ to $\pm 0.05$. Another feature, the depletion radius where the bias reaches a minimum, also shows cosmological dependence, though it is challenging to observe in practice. The strongest constraints come from direct measurements of the shear profile in the infall region at $2$--$4\,r_{200{\rm c}}$. Combining the latter with abundance constraints such as those reported from SRG/eROSITA should reduce the area of the error contours by an estimated factor of $1.2$ using a sample of clusters observed by the UNIONS survey, or a factor of $3$ using clusters observed by the Euclid Wide survey over a broader range of redshift.
Submitted 1 July, 2024;
originally announced July 2024.
-
Constraining cosmological parameters using the splashback radius of galaxy clusters
Authors:
Roan Haggar,
Yuba Amoura,
Charlie T. Mpetha,
James E. Taylor,
Kris Walker,
Chris Power
Abstract:
Cosmological parameters such as $Ω_{\rm{M}}$ and $σ_{8}$ can be measured indirectly using various methods, including galaxy cluster abundance and cosmic shear. These measurements constrain the composite parameter $S_{8}$, leading to degeneracy between $Ω_{\rm{M}}$ and $σ_{8}$. However, some structural properties of galaxy clusters also correlate with cosmological parameters, due to their dependence on a cluster's accretion history. In this work, we focus on the splashback radius, an observable cluster feature that represents a boundary between a cluster and the surrounding Universe. Using a suite of cosmological simulations with a range of values for $Ω_{\rm{M}}$ and $σ_{8}$, we show that the position of the splashback radius around cluster-mass halos is greater in cosmologies with smaller values of $Ω_{\rm{M}}$ or larger values of $σ_{8}$. This variation breaks the degeneracy between $Ω_{\rm{M}}$ and $σ_{8}$ that comes from measurements of the $S_{8}$ parameter. We also show that this variation is, in principle, measurable in observations. As the splashback radius can be determined from the same weak lensing analysis already used to estimate $S_{8}$, this new approach can tighten low-redshift constraints on cosmological parameters, either using existing data, or using upcoming data such as that from Euclid and LSST.
Submitted 25 June, 2024;
originally announced June 2024.
-
Transverse surfaces and pseudo-Anosov flows
Authors:
Michael P. Landry,
Yair N. Minsky,
Samuel J. Taylor
Abstract:
Let $\varphi$ be a transitive pseudo-Anosov flow on an oriented, compact $3$-manifold $M$, possibly with toral boundary. We characterize the surfaces in $M$ that are (almost) transverse to $\varphi$. When $\varphi$ has no perfect fits (e.g. $\varphi$ is the suspension flow of a pseudo-Anosov homeomorphism), we prove that any Thurston-norm minimizing surface $S$ that pairs nonnegatively with the closed orbits of $\varphi$ is almost transverse to $\varphi$, up to isotopy. This answers a question of Cooper--Long--Reid. Our main tool is a correspondence between surfaces that are almost transverse to $\varphi$ and those that are relatively carried by any associated veering triangulation. The correspondence also allows us to investigate the uniqueness of almost transverse position, to extend Mosher's Transverse Surface Theorem to the case with boundary, and more generally to characterize when relative homology classes represent Birkhoff surfaces.
Submitted 25 June, 2024;
originally announced June 2024.
-
OCCAM: Online Continuous Controller Adaptation with Meta-Learned Models
Authors:
Hersh Sanghvi,
Spencer Folk,
Camillo Jose Taylor
Abstract:
Control tuning and adaptation present a significant challenge to the usage of robots in diverse environments. It is often nontrivial to find a single set of control parameters by hand that work well across the broad array of environments and conditions that a robot might encounter. Automated adaptation approaches must utilize prior knowledge about the system while adapting to significant domain shifts to find new control parameters quickly. In this work, we present a general framework for online controller adaptation that deals with these challenges. We combine meta-learning with Bayesian recursive estimation to learn prior predictive models of system performance that quickly adapt to online data, even when there is significant domain shift. These predictive models can be used as cost functions within efficient sampling-based optimization routines to find new control parameters online that maximize system performance. Our framework is powerful and flexible enough to adapt controllers for four diverse systems: a simulated race car, a simulated quadrupedal robot, and a simulated and physical quadrotor.
Submitted 25 June, 2024;
originally announced June 2024.
-
Reconsidering the dynamical states of galaxy clusters using PCA and UMAP
Authors:
Roan Haggar,
Federico De Luca,
Marco De Petris,
Elizaveta Sazonova,
James E. Taylor,
Alexander Knebe,
Meghan E. Gray,
Frazer R. Pearce,
Ana Contreras-Santos,
Weiguang Cui,
Ulrike Kuchner,
Robert A. Mostoghiu Paun,
Chris Power
Abstract:
Numerous metrics exist to quantify the dynamical state of galaxy clusters, both observationally and within simulations. Many of these correlate strongly with one another, but it is not clear whether all of these measures probe the same intrinsic properties. In this work, we use two different statistical approaches -- principal component analysis (PCA) and uniform manifold approximation and projection (UMAP) -- to investigate which dynamical properties of a cluster are in fact the best descriptors of its dynamical state. We use measurements taken directly from The Three Hundred suite of galaxy cluster simulations, as well as morphological properties calculated using mock X-ray and SZ maps of the same simulated clusters. We find that four descriptions of dynamical state naturally arise, and although correlations exist between these, a given cluster can be "dynamically relaxed" according to all, none, or some of these four descriptions. These results demonstrate that it is highly important for future observational and theoretical studies to consider in which sense clusters are dynamically relaxed. Cluster dynamical states are complex and multi-dimensional, and so it is not meaningful to classify them simply as "relaxed" and "unrelaxed" based on a single linear scale.
Submitted 21 June, 2024;
originally announced June 2024.
-
Equivariant Vector Bundles with Connection on Drinfeld Symmetric Spaces
Authors:
James Taylor
Abstract:
For a finite extension $F$ of $\mathbb{Q}_p$ and $n \geq 1$, let $D$ be the division algebra over $F$ of invariant $1/n$ and let $G^0$ be the subgroup of $\text{GL}_n(F)$ of elements with norm $1$ determinant. We show that the action of $D^\times$ on the Drinfeld tower induces an equivalence of categories from finite dimensional smooth representations of $D^\times$ to $G^0$-finite $\text{GL}_n(F)$-equivariant vector bundles with connection on $Ω$, the $(n-1)$-dimensional Drinfeld symmetric space.
Submitted 20 June, 2024;
originally announced June 2024.
-
A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners
Authors:
Bowen Jiang,
Yangxinyu Xie,
Zhuoqun Hao,
Xiaomeng Wang,
Tanwi Mallick,
Weijie J. Su,
Camillo J. Taylor,
Dan Roth
Abstract:
This study introduces a hypothesis-testing framework to assess whether large language models (LLMs) possess genuine reasoning abilities or primarily depend on token bias. We go beyond evaluating LLMs on accuracy; rather, we aim to investigate their token bias in solving logical reasoning tasks. Specifically, we develop carefully controlled synthetic datasets, featuring conjunction fallacy and syllogistic problems. Our framework outlines a list of hypotheses where token biases are readily identifiable, with all null hypotheses assuming genuine reasoning capabilities of LLMs. The findings in this study suggest, with statistical guarantee, that most LLMs still struggle with logical reasoning. While they may perform well on classic problems, their success largely depends on recognizing superficial patterns with strong token bias, thereby raising concerns about their actual reasoning and generalization abilities.
Submitted 16 June, 2024;
originally announced June 2024.
-
Ringdown signatures of Kerr black holes immersed in a magnetic field
Authors:
Kate J. Taylor,
Adam Ritz
Abstract:
We analyze the quasinormal mode spectrum for Kerr black holes surrounded by an asymptotically uniform magnetic field, modeled with the Ernst-Wild geometry. A perturbative expansion in both the rotation parameter $a$ and the magnetic field $B$ allows separation of the perturbation equations, and we obtain the spectrum for a variety of scalar quasinormal modes over a range of parameters using the continued fraction method. We then interpolate the low-lying mode spectrum to construct an Ernst-Wild template for the ringdown, and use the LIGO-Virgo-KAGRA analysis tool pyRing to assess the impact of the magnetosphere on the extraction of ringdown signatures from several observed binary black hole mergers.
Submitted 13 June, 2024;
originally announced June 2024.
-
Pushing the limits of the Cosmic Origin Spectrograph (COS) with an optimized background correction
Authors:
Svea Hernandez,
Andrei Igoshev,
Jo Taylor,
David Sahnow,
Logan Jones
Abstract:
Observations utilizing the ultraviolet capabilities of the Cosmic Origin Spectrograph (COS) onboard the Hubble Space Telescope are of unique value to the astronomy community. Spectroscopy down to 900 Å with COS has enabled new science areas. However, contrary to the situation at longer wavelengths, these observations are limited by detector background noise. The background correction currently applied by the standard calibration pipeline (CalCOS) is not optimized for faint targets, limiting the scientific value of low signal-to-noise observations. In this work we investigate a possible dependence of the variations of the dark rate in both segments of the COS far-ultraviolet (FUV) detector on time, detector high voltage (HV), and solar activity. Through our analysis we identified a number of detector states (on a configuration basis, e.g., HV and segment) characterizing the spatial distribution of dark counts, and created superdarks to be used in an optimized 2-dimensional (2D) background correction. We have developed and tested Another COS Dark Correction (ACDC), a dedicated pipeline to perform a 2D background correction based on statistical methods, producing background-corrected and flux-calibrated spectra. While our testing of ACDC showed an average improvement in S/N values of ~10%, in a few cases the improvements in S/N reached 60% across the whole wavelength range of individual segments.
Submitted 10 June, 2024;
originally announced June 2024.
-
Data-Driven Switchback Experiments: Theoretical Tradeoffs and Empirical Bayes Designs
Authors:
Ruoxuan Xiong,
Alex Chin,
Sean J. Taylor
Abstract:
We study the design and analysis of switchback experiments conducted on a single aggregate unit. The design problem is to partition the continuous time space into intervals and switch treatments between intervals, in order to minimize the estimation error of the treatment effect. We show that the estimation error depends on four factors: carryover effects, periodicity, serially correlated outcomes, and impacts from simultaneous experiments. We derive a rigorous bias-variance decomposition and show the tradeoffs of the estimation error from these factors. The decomposition provides three new insights in choosing a design: First, balancing the periodicity between treated and control intervals reduces the variance; second, switching less frequently reduces the bias from carryover effects while increasing the variance from correlated outcomes, and vice versa; third, randomizing interval start and end points reduces both bias and variance from simultaneous experiments. Combining these insights, we propose a new empirical Bayes design approach. This approach uses prior data and experiments for designing future experiments. We illustrate this approach using real data from a ride-sharing platform, yielding a design that reduces MSE by 33% compared to the status quo design used on the platform.
Submitted 10 June, 2024;
originally announced June 2024.
-
Simultaneous retrieval of orbital phase resolved JWST/MIRI emission spectra of the hot Jupiter WASP-43b: evidence of water, ammonia and carbon monoxide
Authors:
Jingxuan Yang,
Mark Hammond,
Anjali A. A. Piette,
Jasmina Blecic,
Taylor J. Bell,
Patrick G. J. Irwin,
Vivien Parmentier,
Shang-Min Tsai,
Joanna K. Barstow,
Nicolas Crouzet,
Laura Kreidberg,
João M. Mendonça,
Jake Taylor,
Robin Baeyens,
Kazumasa Ohno,
Lucas Teinturier,
Matthew C. Nixon
Abstract:
Spectroscopic phase curves of hot Jupiters measure their emission spectra at multiple orbital phases, thus enabling detailed characterisation of their atmospheres. Precise constraints on the atmospheric composition of these exoplanets offer insights into their formation and evolution. We analyse four phase-resolved emission spectra of the hot Jupiter WASP-43b, generated from a phase curve observed with the MIRI/LRS onboard the JWST, to retrieve its atmospheric properties. Using a parametric 2D temperature model and assuming a chemically homogeneous atmosphere within the observed pressure region, we simultaneously fit the four spectra to constrain the abundances of atmospheric constituents, thereby yielding more precise constraints than previous work that analysed each spectrum independently. Our analysis reveals statistically significant evidence of NH3 (4$σ$) in a hot Jupiter's emission spectra for the first time, along with evidence of H2O (6.5$σ$), CO (3.1$σ$), and a non-detection of CH4. With our abundance constraints, we tentatively estimate the metallicity of WASP-43b at 0.6-6.5$\times$solar and its C/O ratio at 0.6-0.9. Our findings offer vital insights into the atmospheric conditions and formation history of WASP-43b by simultaneously constraining the abundances of carbon, oxygen, and nitrogen-bearing species.
Submitted 5 June, 2024;
originally announced June 2024.
-
Brain Morphology Normative modelling platform for abnormality and Centile estimation: Brain MoNoCle
Authors:
Bethany Little,
Nida Alyas,
Alexander Surtees,
Gavin P Winston,
John S Duncan,
David A Cousins,
John-Paul Taylor,
Peter Taylor,
Karoline Leiberg,
Yujiang Wang
Abstract:
Normative models of brain structure estimate the effects of covariates such as age and sex using large samples of healthy controls. These models can then be applied to smaller clinical cohorts to distinguish disease effects from other covariates. However, these advanced statistical modelling approaches can be difficult to access, and processing large healthy cohorts is computationally demanding. Thus, accessible platforms with pre-trained normative models are needed.
We present such a platform for brain morphology analysis as an open-source web application https://cnnplab.shinyapps.io/normativemodelshiny/, with six key features: (i) user-friendly web interface, (ii) individual and group outputs, (iii) multi-site analysis, (iv) regional and whole-brain analysis, (v) integration with existing tools, and (vi) featuring multiple morphology metrics.
Using a diverse sample of 3,276 healthy controls across 21 sites, we pre-trained normative models on various metrics. We validated the models with a small clinical sample of individuals with bipolar disorder, showing outputs that aligned closely with existing literature only after applying our normative modelling. Further validation with a cohort of temporal lobe epilepsy showed agreement with previous group-level findings and individual-level seizure lateralisation. Finally, with the ability to investigate multiple morphology measures in the same framework, we found that biological covariates are better explained in specific morphology measures, and for clinical applications, only some measures are sensitive to the disease process.
Our platform offers a comprehensive framework to analyse brain morphology in clinical and research settings. Validations confirm the superiority of normative models and the advantage of investigating a range of brain morphology metrics together.
Submitted 26 June, 2024; v1 submitted 3 June, 2024;
originally announced June 2024.
-
Multi-Modal and Multi-Agent Systems Meet Rationality: A Survey
Authors:
Bowen Jiang,
Yangxinyu Xie,
Xiaomeng Wang,
Weijie J. Su,
Camillo J. Taylor,
Tanwi Mallick
Abstract:
Rationality is the quality of being guided by reason, characterized by logical thinking and decision-making that align with evidence and logical rules. This quality is essential for effective problem-solving, as it ensures that solutions are well-founded and systematically derived. Despite the advancements of large language models (LLMs) in generating human-like text with remarkable accuracy, they present biases inherited from the training data, inconsistency across different contexts, and difficulty understanding complex scenarios involving multiple layers of context. Therefore, recent research attempts to leverage the strength of multiple agents working collaboratively with various types of data and tools for enhanced consistency and reliability. To that end, this paper aims to understand whether multi-modal and multi-agent systems are advancing toward rationality by surveying the state-of-the-art works, identifying advancements over single-agent and single-modal systems in terms of rationality, and discussing open problems and future directions. We maintain an open repository at https://github.com/bowen-upenn/MMMA_Rationality.
Submitted 18 June, 2024; v1 submitted 31 May, 2024;
originally announced June 2024.
-
Identifying and Fitting Eclipse Maps of Exoplanets with Cross-Validation
Authors:
Mark Hammond,
Neil T. Lewis,
Sasha Boone,
Xueqing Chen,
João M. Mendonça,
Vivien Parmentier,
Jake Taylor,
Taylor Bell,
Leonardo dos Santos,
Nicolas Crouzet,
Laura Kreidberg,
Michael Radica,
Michael Zhang
Abstract:
Eclipse mapping uses the shape of the eclipse of an exoplanet to measure its two-dimensional structure. Light curves are mostly composed of longitudinal information, with the latitudinal information only contained in the brief ingress and egress of the eclipse. This imbalance can lead to a spuriously confident map, where the longitudinal structure is constrained by out-of-eclipse data and the latitudinal structure is wrongly determined by the priors on the map. We present a new method to address this issue. The method tests for the presence of an eclipse mapping signal by using k-fold cross-validation to compare the performance of a simple mapping model to the null hypothesis of a uniform disk. If a signal is found, the method fits a map with more degrees of freedom, optimising its information content. The information content is varied by penalising the model likelihood by a factor proportional to the spatial entropy of the map, optimised by cross-validation. We demonstrate this method for simulated datasets then apply it to three observational datasets. The method identifies an eclipse mapping signal for JWST MIRI/LRS observations of WASP-43b but does not identify a signal for JWST NIRISS/SOSS observations of WASP-18b or Spitzer Space Telescope observations of HD 189733b. It is possible to fit eclipse maps to these datasets, but we suggest that these maps are overfitting the eclipse shape. We fit a new map with more spatial freedom to the WASP-43b dataset and show a flatter east-west structure than previously derived.
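The signal test described in this abstract, comparing a simple model to a null hypothesis by k-fold cross-validation, can be sketched on toy data. This is an illustration under stated assumptions, not the authors' eclipse-mapping code: a constant model stands in for the uniform disk, a straight-line fit stands in for the simple mapping model, and the synthetic data and all function names are hypothetical.

```python
import random


def fit_constant(xs, ys):
    """Null model: predict the training mean everywhere."""
    mean = sum(ys) / len(ys)
    return lambda x: mean


def fit_line(xs, ys):
    """Simple alternative model: ordinary least-squares straight line."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    return lambda x: my + slope * (x - mx)


def kfold_mse(xs, ys, fit, k=5):
    """Mean squared prediction error over k interleaved held-out folds."""
    n = len(xs)
    total = 0.0
    for fold in range(k):
        test_idx = set(range(fold, n, k))
        train_x = [xs[i] for i in range(n) if i not in test_idx]
        train_y = [ys[i] for i in range(n) if i not in test_idx]
        model = fit(train_x, train_y)
        total += sum((ys[i] - model(xs[i])) ** 2 for i in test_idx)
    return total / n


rng = random.Random(0)
xs = [i / 99 for i in range(100)]
# Synthetic data containing a real trend plus Gaussian noise (assumed values).
ys = [0.5 * x + rng.gauss(0.0, 0.05) for x in xs]

null_score = kfold_mse(xs, ys, fit_constant)
signal_score = kfold_mse(xs, ys, fit_line)
signal_found = signal_score < null_score
```

Because both models are scored only on data they were not fitted to, the more flexible model wins only when its extra freedom captures real structure rather than noise.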
Submitted 31 May, 2024;
originally announced May 2024.
-
Regularity-Conforming Neural Networks (ReCoNNs) for solving Partial Differential Equations
Authors:
Jamie M. Taylor,
David Pardo,
Judit Muñoz-Matute
Abstract:
Whilst the Universal Approximation Theorem guarantees the existence of approximations to Sobolev functions -- the natural function spaces for PDEs -- by Neural Networks (NNs) of sufficient size, low-regularity solutions may lead to poor approximations in practice. For example, classical fully-connected feed-forward NNs fail to approximate continuous functions whose gradient is discontinuous when employing strong formulations like in Physics Informed Neural Networks (PINNs). In this article, we propose the use of regularity-conforming neural networks, where a priori information on the regularity of solutions to PDEs can be employed to construct proper architectures. We illustrate the potential of such architectures via a two-dimensional (2D) transmission problem, where the solution may admit discontinuities in the gradient across interfaces, as well as power-like singularities at certain points. In particular, we formulate the weak transmission problem in a PINNs-like strong formulation with interface and continuity conditions. Such architectures are partially explainable; discontinuities are explicitly described, allowing the introduction of novel terms into the loss function. We demonstrate via several model problems in one and two dimensions the advantages of using regularity-conforming architectures in contrast to classical architectures. The ideas presented in this article easily extend to problems in higher dimensions.
Submitted 22 May, 2024;
originally announced May 2024.
-
Euclid. I. Overview of the Euclid mission
Authors:
Euclid Collaboration,
Y. Mellier,
Abdurro'uf,
J. A. Acevedo Barroso,
A. Achúcarro,
J. Adamek,
R. Adam,
G. E. Addison,
N. Aghanim,
M. Aguena,
V. Ajani,
Y. Akrami,
A. Al-Bahlawan,
A. Alavi,
I. S. Albuquerque,
G. Alestas,
G. Alguero,
A. Allaoui,
S. W. Allen,
V. Allevato,
A. V. Alonso-Tetilla,
B. Altieri,
A. Alvarez-Candal,
A. Amara,
L. Amendola
, et al. (1086 additional authors not shown)
Abstract:
The current standard model of cosmology successfully describes a variety of measurements, but the nature of its main ingredients, dark matter and dark energy, remains unknown. Euclid is a medium-class mission in the Cosmic Vision 2015-2025 programme of the European Space Agency (ESA) that will provide high-resolution optical imaging, as well as near-infrared imaging and spectroscopy, over about 14,000 deg^2 of extragalactic sky. In addition to accurate weak lensing and clustering measurements that probe structure formation over half of the age of the Universe, its primary probes for cosmology, these exquisite data will enable a wide range of science. This paper provides a high-level overview of the mission, summarising the survey characteristics, the various data-processing steps, and data products. We also highlight the main science objectives and expected performance.
Submitted 22 May, 2024;
originally announced May 2024.
-
Identifying Functionally Important Features with End-to-End Sparse Dictionary Learning
Authors:
Dan Braun,
Jordan Taylor,
Nicholas Goldowsky-Dill,
Lee Sharkey
Abstract:
Identifying the features learned by neural networks is a core challenge in mechanistic interpretability. Sparse autoencoders (SAEs), which learn a sparse, overcomplete dictionary that reconstructs a network's internal activations, have been used to identify these features. However, SAEs may learn more about the structure of the dataset than the computational structure of the network. There is therefore only indirect reason to believe that the directions found in these dictionaries are functionally important to the network. We propose end-to-end (e2e) sparse dictionary learning, a method for training SAEs that ensures the features learned are functionally important by minimizing the KL divergence between the output distributions of the original model and the model with SAE activations inserted. Compared to standard SAEs, e2e SAEs offer a Pareto improvement: They explain more network performance, require fewer total features, and require fewer simultaneously active features per datapoint, all with no cost to interpretability. We explore geometric and qualitative differences between e2e SAE features and standard SAE features. E2e dictionary learning brings us closer to methods that can explain network behavior concisely and accurately. We release our library for training e2e SAEs and reproducing our analysis at https://github.com/ApolloResearch/e2e_sae
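The training signal named in this abstract is a KL divergence between two output distributions. The sketch below computes that quantity for a single pair of logit vectors. It is not taken from the authors' library: the function names and the example logits are assumptions, and a real training loop would average this over tokens and add a sparsity penalty.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q):
    """KL(p || q) in nats for two discrete distributions of equal length."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)


# Hypothetical logits: the original model versus the model with SAE
# activations spliced in at some layer.
original_logits = [2.0, 0.5, -1.0]
patched_logits = [1.8, 0.7, -1.2]

loss = kl_divergence(softmax(original_logits), softmax(patched_logits))
```

The loss is zero only when the patched model reproduces the original output distribution exactly, which is what ties the learned dictionary to the network's behavior rather than to reconstruction error alone.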
Submitted 24 May, 2024; v1 submitted 17 May, 2024;
originally announced May 2024.
-
Challenges and Opportunities for Large-Scale Exploration with Air-Ground Teams using Semantics
Authors:
Fernando Cladera,
Ian D. Miller,
Zachary Ravichandran,
Varun Murali,
Jason Hughes,
M. Ani Hsieh,
C. J. Taylor,
Vijay Kumar
Abstract:
One common and desirable application of robots is exploring potentially hazardous and unstructured environments. Air-ground collaboration offers a synergistic approach to addressing such exploration challenges. In this paper, we demonstrate a system for large-scale exploration using a team of aerial and ground robots. Our system uses semantics as lingua franca, and relies on fully opportunistic communications. We highlight the unique challenges from this approach, explain our system architecture and showcase lessons learned during our experiments. All our code is open-source, encouraging researchers to use it and build upon it.
Submitted 12 May, 2024;
originally announced May 2024.
-
Larmor Power Limit for Cyclotron Radiation of Relativistic Particles in a Waveguide
Authors:
N. Buzinsky,
R. J. Taylor,
W. Byron,
W. DeGraw,
B. Dodson,
M. Fertl,
A. García,
A. P. Goodson,
B. Graner,
H. Harrington,
L. Hayen,
L. Malavasi,
D. McClain,
D. Melconian,
P. Müller,
E. Novitski,
N. S. Oblath,
R. G. H. Robertson,
G. Rybka,
G. Savard,
E. Smith,
D. D. Stancil,
D. W. Storm,
H. E. Swanson,
J. R. Tedeschi
, et al. (3 additional authors not shown)
Abstract:
Cyclotron radiation emission spectroscopy (CRES) is a modern technique for high-precision energy spectroscopy, in which the energy of a charged particle in a magnetic field is measured via the frequency of the emitted cyclotron radiation. The He6-CRES collaboration aims to use CRES to probe beyond the standard model physics at the TeV scale by performing high-resolution and low-background beta-decay spectroscopy of ${}^6\textrm{He}$ and ${}^{19}\textrm{Ne}$. Having demonstrated the first observation of individual, high-energy (0.1 -- 2.5 MeV) positrons and electrons via their cyclotron radiation, the experiment provides a novel window into the radiation of relativistic charged particles in a waveguide via the time-derivative (slope) of the cyclotron radiation frequency, $\mathrm{d}f_\textrm{c}/\mathrm{d}t$. We show that analytic predictions for the total cyclotron radiation power emitted by a charged particle in circular and rectangular waveguides are approximately consistent with the Larmor formula, each scaling with the Lorentz factor of the underlying $e^\pm$ as $γ^4$. This hypothesis is corroborated with experimental CRES slope data.
Submitted 10 May, 2024;
originally announced May 2024.
-
THRONE: An Object-based Hallucination Benchmark for the Free-form Generations of Large Vision-Language Models
Authors:
Prannay Kaul,
Zhizhong Li,
Hao Yang,
Yonatan Dukler,
Ashwin Swaminathan,
C. J. Taylor,
Stefano Soatto
Abstract:
Mitigating hallucinations in large vision-language models (LVLMs) remains an open problem. Recent benchmarks do not address hallucinations in open-ended free-form responses, which we term "Type I hallucinations". Instead, they focus on hallucinations responding to very specific question formats -- typically a multiple-choice response regarding a particular object or attribute -- which we term "Type II hallucinations". Additionally, such benchmarks often require external API calls to models which are subject to change. In practice, we observe that a reduction in Type II hallucinations does not lead to a reduction in Type I hallucinations but rather that the two forms of hallucinations are often anti-correlated. To address this, we propose THRONE, a novel object-based automatic framework for quantitatively evaluating Type I hallucinations in LVLM free-form outputs. We use public language models (LMs) to identify hallucinations in LVLM responses and compute informative metrics. By evaluating a large selection of recent LVLMs using public datasets, we show that an improvement in existing metrics does not lead to a reduction in Type I hallucinations, and that established benchmarks for measuring Type I hallucinations are incomplete. Finally, we provide a simple and effective data augmentation method to reduce Type I and Type II hallucinations as a strong baseline.
Submitted 8 May, 2024;
originally announced May 2024.
-
Neural network based deep learning analysis of semiconductor quantum dot qubits for automated control
Authors:
Jacob R. Taylor,
Sankar Das Sarma
Abstract:
Machine learning offers a largely unexplored avenue for improving noisy disordered devices in physics using automated algorithms. Through simulations that include disorder in physical devices, particularly quantum devices, there is potential to learn about disordered landscapes and subsequently tune devices based on those insights. In this work, we introduce a novel methodology that employs machine learning, specifically convolutional neural networks (CNNs), to discern the disorder landscape in the parameters of the disordered extended Hubbard model underlying the semiconductor quantum dot spin qubit architectures. This technique takes advantage of experimentally obtainable charge stability diagrams from neighboring quantum dot pairs, enabling the CNN to accurately identify disorder in each parameter of the extended Hubbard model. Remarkably, our CNN can process site-specific disorder in Hubbard parameters, including variations in hopping constants, on-site potentials (gate voltages), and both intra-site and inter-site Coulomb terms. This advancement facilitates the prediction of spatially dependent disorder across all parameters simultaneously with high accuracy ($R^2>0.994$) and fewer parameter constraints, marking a significant improvement over previous methods that were focused only on analyzing on-site potentials at low coupling. Furthermore, our approach allows for the tuning of five or more quantum dots at a time, effectively addressing the often-overlooked issue of crosstalk. Not only does our method streamline the tuning process, potentially enabling fully automated adjustments, but it also introduces a "no trust" verification method to rigorously validate the neural network's predictions. Ultimately, this work aims to lay the groundwork for generalizing our method to tackle a broad spectrum of physical problems.
Submitted 7 May, 2024;
originally announced May 2024.
-
A Bayesian joint longitudinal-survival model with a latent stochastic process for intensive longitudinal data
Authors:
Madeline R. Abbott,
Walter H. Dempsey,
Inbal Nahum-Shani,
Lindsey N. Potter,
David W. Wetter,
Cho Y. Lam,
Jeremy M. G. Taylor
Abstract:
The availability of mobile health (mHealth) technology has enabled increased collection of intensive longitudinal data (ILD). ILD have potential to capture rapid fluctuations in outcomes that may be associated with changes in the risk of an event. However, existing methods for jointly modeling longitudinal and event-time outcomes are not well-equipped to handle ILD due to the high computational cost. We propose a joint longitudinal and time-to-event model suitable for analyzing ILD. In this model, we summarize a multivariate longitudinal outcome as a smaller number of time-varying latent factors. These latent factors, which are modeled using an Ornstein-Uhlenbeck stochastic process, capture the risk of a time-to-event outcome in a parametric hazard model. We take a Bayesian approach to fit our joint model and conduct simulations to assess its performance. We use it to analyze data from an mHealth study of smoking cessation. We summarize the longitudinal self-reported intensity of nine emotions as the psychological states of positive and negative affect. These time-varying latent states capture the risk of the first smoking lapse after attempted quit. Understanding factors associated with smoking lapse is of keen interest to smoking cessation researchers.
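The latent factors in this abstract follow an Ornstein-Uhlenbeck (OU) process. As a minimal sketch of what such a latent trajectory looks like, the code below simulates a one-dimensional OU path using the exact transition distribution of the process. It is not the authors' model: the parameter names (`theta` for mean-reversion rate, `mu` for long-run mean, `sigma` for volatility) and all numeric values are assumptions, and the paper's multivariate factors and hazard model are not reproduced.

```python
import math
import random


def simulate_ou(x0, theta, mu, sigma, dt, n_steps, seed=0):
    """Simulate dX = theta * (mu - X) dt + sigma dW on a regular time grid.

    Uses the exact Gaussian transition of the OU process, so the step size
    dt introduces no discretization error. Hypothetical helper.
    """
    rng = random.Random(seed)
    decay = math.exp(-theta * dt)
    # Standard deviation of the increment over one step of length dt.
    step_sd = sigma * math.sqrt((1.0 - decay ** 2) / (2.0 * theta))
    path = [x0]
    for _ in range(n_steps):
        mean_next = mu + (path[-1] - mu) * decay
        path.append(mean_next + step_sd * rng.gauss(0.0, 1.0))
    return path


path = simulate_ou(x0=2.0, theta=1.0, mu=0.0, sigma=0.5, dt=0.1, n_steps=1000)
```

The mean-reverting pull toward `mu` is what makes the process a natural choice for quantities such as affect that fluctuate rapidly around a baseline.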
Submitted 30 April, 2024;
originally announced May 2024.
-
A new hybrid gadolinium nanoparticles-loaded polymeric material for neutron detection in rare event searches
Authors:
DarkSide-20k Collaboration,
F. Acerbi,
P. Adhikari,
P. Agnes,
I. Ahmad,
S. Albergo,
I. F. Albuquerque,
T. Alexander,
A. K. Alton,
P. Amaudruz,
M. Angiolilli,
E. Aprile,
R. Ardito,
M. Atzori Corona,
D. J. Auty,
M. Ave,
I. C. Avetisov,
O. Azzolini,
H. O. Back,
Z. Balmforth,
A. Barrado Olmedo,
P. Barrillon,
G. Batignani,
P. Bhowmick
, et al. (290 additional authors not shown)
Abstract:
Experiments aimed at direct searches for WIMP dark matter require highly effective reduction of backgrounds and control of any residual radioactive contamination. In particular, neutrons interacting with atomic nuclei represent an important class of backgrounds due to the expected similarity of a WIMP-nucleon interaction, so that such experiments often feature a dedicated neutron detector surrounding the active target volume. In the context of the development of the DarkSide-20k detector at INFN Gran Sasso National Laboratory (LNGS), several R&D projects were conceived and developed for the creation of a new hybrid material rich in both hydrogen and gadolinium nuclei to be employed as an essential element of the neutron detector. Thanks to its very high cross-section for neutron capture, gadolinium is one of the most widely used elements in neutron detectors, while the hydrogen-rich material is instrumental in efficiently moderating the neutrons. In this paper, results from one of these R&D projects are presented. In this effort the new hybrid material was obtained as a poly(methyl methacrylate) (PMMA) matrix, loaded with gadolinium oxide in the form of nanoparticles. We describe its realization, including all phases of design, purification, construction, characterization, and determination of mechanical properties of the new material.
Submitted 29 April, 2024;
originally announced April 2024.
-
Twisted nonlinear optics in monolayer van der Waals crystals
Authors:
Tenzin Norden,
Luis M. Martinez,
Nehan Tarefder,
Kevin W. C. Kwock,
Luke M. McClintock,
Nicholas Olsen,
Luke N. Holtzman,
Xiaoyang Zhu,
James C. Hone,
Jinkyoung Yoo,
Jian-Xin Zhu,
P. James Schuck,
Antoinette J. Taylor,
Rohit P. Prasankumar,
Wilton J. M. Kort-Kamp,
Prashant Padmanabhan
Abstract:
In addition to a plethora of emergent phenomena, the spatial topology of optical vortices enables an array of applications spanning communications to quantum photonics. Nonlinear optics is essential in this context, providing access to an infinitely large set of quantum states associated with the orbital angular momentum of light. Nevertheless, the realization of such processes has failed to keep pace with the ever-growing need to shrink the fundamental length-scale of photonic technologies to the nanometer regime. Here, we push the boundaries of vortex nonlinear optics to the ultimate limits of material dimensionality. By exploiting second and third-order frequency-mixing processes in semiconducting monolayers, we demonstrate the independent manipulation of the wavelength, orbital angular momentum, and spatial distribution of vortex light-fields. Due to the atomically-thin nature of the host quantum material, this control spans a broad spectral bandwidth in a highly-integrable platform, unconstrained by the traditional limits of bulk nonlinear optical materials. Our work heralds a new avenue for ultra-compact and scalable hybrid nanotechnologies empowered by twisted nonlinear light-matter interactions in van der Waals quantum nanomaterials.
Submitted 27 April, 2024; v1 submitted 22 April, 2024;
originally announced April 2024.
-
Long-form music generation with latent diffusion
Authors:
Zach Evans,
Julian D. Parker,
CJ Carr,
Zack Zukowski,
Josiah Taylor,
Jordi Pons
Abstract:
Audio-based generative models for music have seen great strides recently, but so far have not managed to produce full-length music tracks with coherent musical structure. We show that by training a generative model on long temporal contexts it is possible to produce long-form music of up to 4m45s. Our model consists of a diffusion-transformer operating on a highly downsampled continuous latent representation (latent rate of 21.5Hz). It obtains state-of-the-art generations according to metrics on audio quality and prompt alignment, and subjective tests reveal that it produces full-length music with coherent structure.
Submitted 16 April, 2024;
originally announced April 2024.
-
One-Click Upgrade from 2D to 3D: Sandwiched RGB-D Video Compression for Stereoscopic Teleconferencing
Authors:
Yueyu Hu,
Onur G. Guleryuz,
Philip A. Chou,
Danhang Tang,
Jonathan Taylor,
Rus Maxham,
Yao Wang
Abstract:
Stereoscopic video conferencing is still challenging due to the need to compress stereo RGB-D video in real-time. Though hardware implementations of standard video codecs such as H.264 / AVC and HEVC are widely available, they are not designed for stereoscopic videos and suffer from reduced quality and performance. Specific multiview or 3D extensions of these codecs are complex and lack efficient implementations. In this paper, we propose a new approach to upgrade a 2D video codec to support stereo RGB-D video compression, by wrapping it with a neural pre- and post-processor pair. The neural networks are end-to-end trained with an image codec proxy, and shown to work with a more sophisticated video codec. We also propose a geometry-aware loss function to improve rendering quality. We train the neural pre- and post-processors on a synthetic 4D people dataset, and evaluate it on both synthetic and real-captured stereo RGB-D videos. Experimental results show that the neural networks generalize well to unseen data and work out-of-box with various video codecs. Our approach saves about 30% bit-rate compared to a conventional video coding scheme and MV-HEVC at the same level of rendering quality from a novel view, without the need of a task-specific hardware upgrade.
Submitted 15 April, 2024;
originally announced April 2024.
-
4D Track Reconstruction on Free-Streaming Data at PANDA at FAIR
Authors:
Jenny Taylor,
Michael Papenbrock,
Tobias Stockmanns,
Ralf Kliemt,
Tord Johansson,
Adeel Akram,
Karin Schönning
Abstract:
A new generation of experiments is being developed, where the challenge of separating rare signal processes from background at high intensities requires a change of trigger paradigm. At the future PANDA experiment at FAIR, hardware triggers will be abandoned and instead a purely software-based system will be used. This requires novel reconstruction methods with the ability to process data from many events simultaneously.
A 4D tracking algorithm based on the cellular automaton has been developed which will utilize the timing information from detector signals. Simulation studies have been performed to test its performance on the foreseen free-streaming data from the PANDA detector. For this purpose, a quality assurance procedure for tracking on free-streaming data was implemented in the PANDA software. The studies show that at higher interaction rates, 4D tracking performs better than the 3D algorithm in terms of efficiency: 84% compared to 77%. Fake-track suppression is also greatly improved compared to 3D tracking, with roughly a 50% decrease in the ghost rate.
Submitted 19 February, 2024;
originally announced April 2024.
-
The Rise of Faint, Red AGN at $z>4$: A Sample of Little Red Dots in the JWST Extragalactic Legacy Fields
Authors:
Dale D. Kocevski,
Steven L. Finkelstein,
Guillermo Barro,
Anthony J. Taylor,
Antonello Calabrò,
Brivael Laloux,
Johannes Buchner,
Jonathan R. Trump,
Gene C. K. Leung,
Guang Yang,
Mark Dickinson,
Pablo G. Pérez-González,
Fabio Pacucci,
Kohei Inayoshi,
Rachel S. Somerville,
Elizabeth J. McGrath,
Hollis B. Akins,
Micaela B. Bagley,
Laura Bisigello,
Rebecca A. A. Bowler,
Adam Carnall,
Caitlin M. Casey,
Yingjie Cheng,
Nikko J. Cleri,
Luca Costantin
, et al. (32 additional authors not shown)
Abstract:
We present a sample of 341 "little red dots" (LRDs) spanning the redshift range $z\sim2-11$ using data from the CEERS, PRIMER, JADES, UNCOVER and NGDEEP surveys. These sources are likely heavily-reddened AGN that trace a previously-hidden phase of dust-obscured black hole growth in the early Universe. Unlike past use of color indices to identify LRDs, we employ continuum slope fitting using shifting bandpasses to sample the same rest-frame emission blueward and redward of the Balmer break. This approach allows us to identify LRDs over a wider redshift range and is less susceptible to contamination from galaxies with strong breaks that otherwise lack a rising red continuum. The redshift distribution of our sample increases at $z<8$ and then undergoes a rapid decline at $z\sim4.5$, which may tie the emergence, and obscuration, of these sources to the inside-out growth that galaxies experience during this epoch. We find that LRDs are 2-3 dex more numerous than bright quasars at $z\sim5-7$, but their number density is only 0.6-1 dex higher than X-ray and UV selected AGN at these redshifts. Within our sample, we have identified the first X-ray detected LRDs at $z=3.1$ and $z=4.66$. An X-ray spectral analysis confirms that these AGN are moderately obscured with $\log\,(N_{\rm H}/{\rm cm}^{-2})$ of $23.3^{+0.4}_{-1.3}$ and $22.72^{+0.13}_{-0.16}$. Our analysis reveals that reddened AGN emission dominates their rest-optical light, while the rest-UV originates from their host galaxies. We also present NIRSpec follow-up spectroscopy of 17 LRDs that show broad emission lines consistent with AGN activity. The confirmed AGN fraction of our sample is $71\%$ for sources with F444W$<26.5$. In addition, we find three LRDs with narrow blue-shifted Balmer absorption features in their spectra, suggesting an outflow of high-density, low ionization gas from near the central engine of these faint, red AGN.
Submitted 19 April, 2024; v1 submitted 4 April, 2024;
originally announced April 2024.
-
Direct Experimental Constraints on the Spatial Extent of a Neutrino Wavepacket
Authors:
Joseph Smolsky,
Kyle G Leach,
Ryan Abells,
Pedro Amaro,
Adrien Andoche,
Keith Borbridge,
Connor Bray,
Robin Cantor,
David Diercks,
Spencer Fretwell,
Stephan Friedrich,
Abigail Gillespie,
Mauro Guerra,
Ad Hall,
Cameron N Harris,
Jackson T Harris,
Calvin Hinkle,
Amii Lamm,
Leendert M Hayen,
Paul-Antoine Hervieux,
Geon-Bo Kim,
Inwook Kim,
Annika Lennarz,
Vincenzo Lordi,
Jorge Machado
, et al. (13 additional authors not shown)
Abstract:
Despite their high relative abundance in our Universe, neutrinos are the least understood fundamental particles of nature. They also provide a unique system to study quantum coherence and the wavelike nature of particles in fundamental systems due to their extremely weak interaction probabilities. In fact, the quantum properties of neutrinos emitted in experimentally relevant sources are virtually unknown and the spatial extent of the neutrino wavepacket is only loosely constrained by reactor neutrino oscillation data with a spread of 13 orders of magnitude. Here, we present the first direct limits of this quantity through a new experimental concept to extract the energy width, $σ_{\textrm{N},E}$, of the recoil daughter nucleus emitted in the nuclear electron capture (EC) decay of $^7$Be. The final state in the EC decay process contains a recoiling $^7$Li nucleus and an electron neutrino ($ν_e$) which are entangled at their creation. The $^7$Li energy spectrum is measured to high precision by directly embedding $^7$Be radioisotopes into a high resolution superconducting tunnel junction that is operated as a cryogenic sensor. The lower limit on the spatial uncertainty of the recoil daughter was found to be $σ_{\textrm{N}, x} \geq 6.2$\,pm, which implies the final-state system is localized at a scale more than a thousand times larger than the nucleus itself. From this measurement, the first direct lower limits on the spatial extent of the neutrino wavepacket were extracted using two different theoretical methods. These results have wide-reaching implications in several areas including the nature of spatial localization at sub-atomic scales, interpretation of neutrino physics data, and the potential reach of future large-scale experiments.
Submitted 30 April, 2024; v1 submitted 3 April, 2024;
originally announced April 2024.
-
Discovery and timing of ten new millisecond pulsars in the globular cluster Terzan 5
Authors:
P. V. Padmanabh,
S. M. Ransom,
P. C. C. Freire,
A. Ridolfi,
J. D. Taylor,
C. Choza,
C. J. Clark,
F. Abbate,
M. Bailes,
E. D. Barr,
S. Buchner,
M. Burgay,
M. E. DeCesar,
W. Chen,
A. Corongiu,
D. J. Champion,
A. Dutta,
M. Geyer,
J. W. T. Hessels,
M. Kramer,
A. Possenti,
I. H. Stairs,
B. W. Stappers,
V. Venkatraman Krishnan,
L. Vleeschower
, et al. (1 additional author not shown)
Abstract:
We report the discovery of ten new pulsars in the globular cluster Terzan 5 as part of the Transients and Pulsars with MeerKAT (TRAPUM) Large Survey Project. We observed Terzan 5 at L-band (856--1712 MHz) with the MeerKAT radio telescope for four hours on two epochs, and performed acceleration searches of 45 out of 288 tied-array beams covering the core of the cluster. We obtained phase-connected timing solutions for nine discoveries, covering nearly two decades of archival observations from the Green Bank Telescope for all but one. Highlights include PSR J1748$-$2446ao which is an eccentric ($e = 0.32$) wide-orbit (orbital period $P_{\rm b} = 57.55$ d) system. We were able to measure the rate of advance of periastron ($\dotω$) for this system allowing us to determine a total mass of $3.17 \pm \, 0.02\, \rm M_{\odot}$. With a minimum companion mass ($M_{\rm c}$) of $\sim 0.8\, \rm M_{\odot}$, PSR J1748$-$2446ao is a candidate double neutron star (DNS) system. If confirmed to be a DNS, it would be the fastest spinning pulsar ($P = 2.27$ ms) and the longest orbital period measured for any known DNS system. PSR J1748$-$2446ap has the second highest eccentricity for any recycled pulsar ($e \sim 0.905$) and for this system we can measure the total mass ($1.997 \pm 0.006\, \rm M_{\odot}$) and also estimate the individual pulsar and companion masses. PSR J1748$-$2446ar is an eclipsing redback (minimum $M_{\rm c} \sim 0.34\, \rm M_{\odot}$) system whose properties confirm it to be the counterpart to a previously published source identified in radio and X-ray imaging. With these discoveries, the total number of confirmed pulsars in Terzan 5 is 49, the highest for any globular cluster so far. These discoveries further enhance the rich set of pulsars known in Terzan 5 and provide scope for a deeper understanding of binary stellar evolution, cluster dynamics and ensemble population studies.
Submitted 19 June, 2024; v1 submitted 26 March, 2024;
originally announced March 2024.
-
Multi-Agent VQA: Exploring Multi-Agent Foundation Models in Zero-Shot Visual Question Answering
Authors:
Bowen Jiang,
Zhijun Zhuang,
Shreyas S. Shivakumar,
Dan Roth,
Camillo J. Taylor
Abstract:
This work explores the zero-shot capabilities of foundation models in Visual Question Answering (VQA) tasks. We propose an adaptive multi-agent system, named Multi-Agent VQA, to overcome the limitations of foundation models in object detection and counting by using specialized agents as tools. Unlike existing approaches, our study focuses on the system's performance without fine-tuning it on specific VQA datasets, making it more practical and robust in the open world. We present preliminary experimental results under zero-shot scenarios and highlight some failure cases, offering new directions for future research.
Submitted 21 March, 2024;
originally announced March 2024.
-
Testbeam analysis of biasing structures for irradiated hybrid pixel detectors
Authors:
Craig M. Buttar,
Yanyan Gao,
Ricardo González López,
Dzmitry Maneuski,
Emily Pender,
Quake Qin,
Adam G. Rennie,
Matthew Sullivan,
Jon T. Taylor,
Kenneth Wraight
Abstract:
Following the Phase-II upgrade during Long Shutdown 3 (LS3), the LHC aims to reach a peak instantaneous luminosity of $7.5\times 10^{34}$cm$^{-2}$s$^{-1}$, which corresponds to an average of around 200 inelastic proton-proton collisions per beam-crossing (every 25 ns). To cope with these conditions, the ATLAS Inner Detector will be replaced by a new all-silicon system -- the Inner Tracker (ITk). The ITk will be operational for more than ten years, during which time ATLAS is expected to record approximately 4000 fb$^{-1}$ of data. The ITk's pixel sub-system is based on hybrid pixel modules with new silicon sensors and readout chips. These studies focus on testbeam campaigns undertaken to study the spatial resolution and efficiencies of hybrid pixel detector modules based on the first large-structure prototype front-end readout chip -- the RD53A -- using planar silicon sensors. These devices have been irradiated to replicate the effect of the high radiation environment present during operation in the ATLAS detector. Results for devices using sensors with different punch-through bias structures and using different readout chips are summarised. Those with sensors incorporating a punch-through bias structure are found to exhibit systematically lower efficiency than those without, as a result of local areas of relative inefficiency around the punch-through dots. Despite this, all devices measured are found to satisfy the requirement of 97% efficiency at $V_\mathrm{bias}=400$ V after being irradiated to end-of-life fluence.
Submitted 5 March, 2024;
originally announced March 2024.
-
Quenching-driven equatorial depletion and limb asymmetries in hot Jupiter atmospheres: WASP-96b example
Authors:
Maria Zamyatina,
Duncan A. Christie,
Eric Hébrard,
Nathan J. Mayne,
Michael Radica,
Jake Taylor,
Harry Baskett,
Ben Moore,
Craig Lils,
Denis Sergeev,
Eva-Maria Ahrer,
James Manners,
Krisztian Kohary,
Adina D. Feinstein
Abstract:
Transport-induced quenching in hot Jupiter atmospheres is a process that determines the boundary between the part of the atmosphere at chemical equilibrium and the part of the atmosphere at thermochemical (but not photothermochemical) disequilibrium. The location of this boundary, the quench level, depends on the interplay between the dynamical and chemical timescales in the atmosphere, with quenching occurring when these timescales are equal. We explore the sensitivity of the quench level position to an increase in the planet's atmospheric metallicity using aerosol-free 3D GCM simulations of a hot Jupiter WASP-96b. We find that the temperature increase at pressures of $\sim$$10^{4}-10^{7}$ Pa that occurs when metallicity is increased could shift the position of the quench level to pressures dominated by the jet, and cause an equatorial depletion of $CH_4$, $NH_3$ and $HCN$. We discuss how such a depletion affects the planet's transmission spectrum, and how the analysis of the evening-morning limb asymmetries, especially within $\sim3-5 μm$, could help distinguish atmospheres of different metallicities that are at chemical equilibrium from those with the upper layers at thermochemical disequilibrium.
Submitted 22 February, 2024;
originally announced February 2024.
-
Situating Data Sets: Making Public Data Actionable for Housing Justice
Authors:
Anh-Ton Tran,
Grace Guo,
Jordan Taylor,
Katsuki Chan,
Elora Raymond,
Carl DiSalvo
Abstract:
Activists, governments, and academics regularly advocate for more open data. But how is data made open, and for whom is it made useful and usable? In this paper, we investigate and describe the work of making eviction data open to tenant organizers. We do this through an ethnographic description of ongoing work with a local housing activist organization. This work combines observation, direct participation in data work, and creating media artifacts, specifically digital maps. Our interpretation is grounded in D'Ignazio and Klein's Data Feminism, emphasizing standpoint theory. Through our analysis and discussion, we highlight how shifting positionalities from data intermediaries to data accomplices affects the design of data sets and maps. We provide HCI scholars with three design implications when situating data for grassroots organizers: becoming a domain beginner, striving for data actionability, and evaluating our design artifacts by the social relations they sustain rather than just their technical efficacy.
Submitted 19 February, 2024;
originally announced February 2024.
-
Cruising Queer HCI on the DL: A Literature Review of LGBTQ+ People in HCI
Authors:
Jordan Taylor,
Ellen Simpson,
Anh-Ton Tran,
Jed Brubaker,
Sarah Fox,
Haiyi Zhu
Abstract:
LGBTQ+ people have received increased attention in HCI research, paralleling a greater emphasis on social justice in recent years. However, there has not been a systematic review of how LGBTQ+ people are researched or discussed in HCI. In this work, we review all research mentioning LGBTQ+ people across the HCI venues of CHI, CSCW, DIS, and TOCHI. Since 2014, we find a linear growth in the number of papers substantially about LGBTQ+ people and an exponential increase in the number of mentions. Research about LGBTQ+ people tends to center experiences of being politicized, outside the norm, stigmatized, or highly vulnerable. LGBTQ+ people are typically mentioned as a marginalized group or an area of future research. We identify gaps and opportunities for (1) research about and (2) the discussion of LGBTQ+ in HCI and provide a dataset to facilitate future Queer HCI research.
Submitted 12 February, 2024;
originally announced February 2024.
-
Sandwiched Compression: Repurposing Standard Codecs with Neural Network Wrappers
Authors:
Onur G. Guleryuz,
Philip A. Chou,
Berivan Isik,
Hugues Hoppe,
Danhang Tang,
Ruofei Du,
Jonathan Taylor,
Philip Davidson,
Sean Fanello
Abstract:
We propose sandwiching standard image and video codecs between pre- and post-processing neural networks. The networks are jointly trained through a differentiable codec proxy to minimize a given rate-distortion loss. This sandwich architecture not only improves the standard codec's performance on its intended content, it can effectively adapt the codec to other types of image/video content and to other distortion measures. Essentially, the sandwich learns to transmit ``neural code images'' that optimize overall rate-distortion performance even when the overall problem is well outside the scope of the codec's design. Through a variety of examples, we apply the sandwich architecture to sources with different numbers of channels, higher resolution, higher dynamic range, and perceptual distortion measures. The results demonstrate substantial improvements (up to 9 dB gains or up to 30\% bitrate reductions) compared to alternative adaptations. We derive VQ equivalents for the sandwich, establish optimality properties, and design differentiable codec proxies approximating current standard codecs. We further analyze model complexity, visual quality under perceptual metrics, as well as sandwich configurations that offer interesting potentials in image/video compression and streaming.
Submitted 8 February, 2024;
originally announced February 2024.
-
Fast Timing-Conditioned Latent Audio Diffusion
Authors:
Zach Evans,
CJ Carr,
Josiah Taylor,
Scott H. Hawley,
Jordi Pons
Abstract:
Generating long-form 44.1kHz stereo audio from text prompts can be computationally demanding. Further, most previous works do not address the fact that music and sound effects naturally vary in duration. Our research focuses on the efficient generation of long-form, variable-length stereo music and sounds at 44.1kHz using text prompts with a generative model. Stable Audio is based on latent diffusion, with its latent defined by a fully-convolutional variational autoencoder. It is conditioned on text prompts as well as timing embeddings, allowing for fine control over both the content and length of the generated music and sounds. Stable Audio is capable of rendering stereo signals of up to 95 sec at 44.1kHz in 8 sec on an A100 GPU. Despite its compute efficiency and fast inference, it is one of the best in two public text-to-music and -audio benchmarks and, unlike state-of-the-art models, can generate music with structure and stereo sounds.
Submitted 13 May, 2024; v1 submitted 7 February, 2024;
originally announced February 2024.
-
Learning to Abstract Visuomotor Mappings using Meta-Reinforcement Learning
Authors:
Carlos A. Velazquez-Vargas,
Isaac Ray Christian,
Jordan A. Taylor,
Sreejan Kumar
Abstract:
We investigated the human capacity to acquire multiple visuomotor mappings for de novo skills. Using a grid navigation paradigm, we tested whether contextual cues, implemented as different "grid worlds", allow participants to learn two distinct key-mappings more efficiently. Our results indicate that when contextual information is provided, task performance is significantly better. The same held true for meta-reinforcement learning agents that differed in whether or not they received contextual information when performing the task. We evaluated their accuracy in predicting human performance in the task and analyzed their internal representations. The results indicate that contextual cues allow the formation of separate representations in space and time when using different visuomotor mappings, whereas their absence favors sharing one representation. While both strategies can allow learning of multiple visuomotor mappings, we showed that contextual cues provide a computational advantage in terms of how many mappings can be learned.
Submitted 5 February, 2024;
originally announced February 2024.
-
An introduction to graphical tensor notation for mechanistic interpretability
Authors:
Jordan K. Taylor
Abstract:
Graphical tensor notation is a simple way of denoting linear operations on tensors, originating from physics. Modern deep learning consists almost entirely of operations on or between tensors, so easily understanding tensor operations is quite important for understanding these systems. This is especially true when attempting to reverse-engineer the algorithms learned by a neural network in order to understand its behavior: a field known as mechanistic interpretability. It's often easy to get confused about which operations are happening between tensors and lose sight of the overall structure, but graphical tensor notation makes it easier to parse things at a glance and see interesting equivalences. The first half of this document introduces the notation and applies it to some decompositions (SVD, CP, Tucker, and tensor network decompositions), while the second half applies it to some existing foundational approaches for mechanistically understanding language models, loosely following ``A Mathematical Framework for Transformer Circuits'', then constructing an example ``induction head'' circuit in graphical tensor notation.
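As a rough illustration of the kind of contraction such diagrams depict, the sketch below rebuilds a matrix from its SVD factors with a single index contraction. It is not taken from the paper: the use of NumPy, the helper name `reconstruct_from_svd`, and the index labels `i`, `j`, `k` are all assumptions made for this example.

```python
import numpy as np


def reconstruct_from_svd(a):
    """Rebuild a matrix from its SVD factors with one contraction (illustrative helper)."""
    u, s, vh = np.linalg.svd(a, full_matrices=False)
    # Sum over the shared index k. In a tensor diagram, k would be the wire
    # joining the U, S and V^T nodes, while i and j remain as open legs.
    return np.einsum("ik,k,kj->ij", u, s, vh)


rng = np.random.default_rng(0)
a = rng.normal(size=(4, 3))
a_rebuilt = reconstruct_from_svd(a)
print(np.allclose(a, a_rebuilt))
```

Writing the subscripts out explicitly mirrors what the diagrams make visual: which indices are summed over and which are left free.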
Submitted 1 February, 2024;
originally announced February 2024.
-
Microscopic Model for Fractional Quantum Hall Nematics
Authors:
Songyang Pu,
Ajit C. Balram,
Joseph Taylor,
Eduardo Fradkin,
Zlatko Papić
Abstract:
Geometric fluctuations of the density mode in a fractional quantum Hall (FQH) state can give rise to a nematic FQH phase, a topological state with a spontaneously broken rotational symmetry. While experiments on FQH states in the second Landau level have reported signatures of putative FQH nematics in anisotropic transport, a realistic model for this state has been lacking. We show that the standard model of particles in the lowest Landau level interacting via the Coulomb potential realizes the FQH nematic transition, which is reached by a progressive reduction of the strength of the shortest-range Haldane pseudopotential. Using exact diagonalization and variational wave functions, we demonstrate that the FQH nematic transition occurs when the system's neutral gap closes in the long-wavelength limit while the charge gap remains open. We confirm the symmetry-breaking nature of the transition by demonstrating the existence of a "circular moat" potential in the manifold of states with broken rotational symmetry, while its geometric character is revealed through the strong fluctuations of the nematic susceptibility and Hall viscosity.
Submitted 9 June, 2024; v1 submitted 30 January, 2024;
originally announced January 2024.
-
Muted Features in the JWST NIRISS Transmission Spectrum of Hot-Neptune LTT 9779 b
Authors:
Michael Radica,
Louis-Philippe Coulombe,
Jake Taylor,
Loïc Albert,
Romain Allart,
Björn Benneke,
Nicolas B. Cowan,
Lisa Dang,
David Lafrenière,
Daniel Thorngren,
Étienne Artigau,
René Doyon,
Laura Flagg,
Doug Johnstone,
Stefan Pelletier,
Pierre-Alexis Roy
Abstract:
The hot-Neptune desert is one of the most sparsely populated regions of the exoplanet parameter space, and atmosphere observations of its few residents can provide insights into how such planets have managed to survive in such an inhospitable environment. Here, we present transmission observations of LTT 9779 b, the only known hot-Neptune to have retained a significant H/He-dominated atmosphere, taken with JWST NIRISS/SOSS. The 0.6-2.85$μ$m transmission spectrum shows evidence for muted spectral features, rejecting a perfectly flat line at >5$σ$. We explore water and methane-dominated atmosphere scenarios for LTT 9779 b's terminator, and retrieval analyses reveal a continuum of potential combinations of metallicity and cloudiness. Through comparisons to previous population synthesis works and our own interior structure modelling, we are able to constrain LTT 9779 b's atmosphere metallicity to 20-850x solar. Within this range of metallicity, our retrieval analyses prefer solutions with clouds at mbar pressures, regardless of whether the atmosphere is water- or methane-dominated -- though cloud-free atmospheres with metallicities >500x solar cannot be entirely ruled out. By comparing self-consistent atmosphere temperature profiles with cloud condensation curves, we find that silicate clouds can readily condense in the terminator region of LTT 9779 b. Advection of these clouds onto the day-side could explain the high day-side albedo previously inferred for this planet and be part of a feedback loop aiding the survival of LTT 9779 b's atmosphere in the hot-Neptune desert.
Submitted 27 January, 2024;
originally announced January 2024.
-
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Authors:
Taylor J. Bell,
Nicolas Crouzet,
Patricio E. Cubillos,
Laura Kreidberg,
Anjali A. A. Piette,
Michael T. Roman,
Joanna K. Barstow,
Jasmina Blecic,
Ludmila Carone,
Louis-Philippe Coulombe,
Elsa Ducrot,
Mark Hammond,
João M. Mendonça,
Julianne I. Moses,
Vivien Parmentier,
Kevin B. Stevenson,
Lucas Teinturier,
Michael Zhang,
Natalie M. Batalha,
Jacob L. Bean,
Björn Benneke,
Benjamin Charnay,
Katy L. Chubb,
Brice-Olivier Demory,
Peter Gao
, et al. (58 additional authors not shown)
Abstract:
Hot Jupiters are among the best-studied exoplanets, but it is still poorly understood how their chemical composition and cloud properties vary with longitude. Theoretical models predict that clouds may condense on the nightside and that molecular abundances can be driven out of equilibrium by zonal winds. Here we report a phase-resolved emission spectrum of the hot Jupiter WASP-43b measured from 5-12 $μ$m with JWST's Mid-Infrared Instrument (MIRI). The spectra reveal a large day-night temperature contrast (with average brightness temperatures of 1524$\pm$35 and 863$\pm$23 Kelvin, respectively) and evidence for water absorption at all orbital phases. Comparisons with three-dimensional atmospheric models show that both the phase curve shape and emission spectra strongly suggest the presence of nightside clouds which become optically thick to thermal emission at pressures greater than ~100 mbar. The dayside is consistent with a cloudless atmosphere above the mid-infrared photosphere. Contrary to expectations from equilibrium chemistry but consistent with disequilibrium kinetics models, methane is not detected on the nightside (2$σ$ upper limit of 1-6 parts per million, depending on model assumptions).
Submitted 23 January, 2024;
originally announced January 2024.
-
Pretraining and the Lasso
Authors:
Erin Craig,
Mert Pilanci,
Thomas Le Menestrel,
Balasubramanian Narasimhan,
Manuel Rivas,
Roozbeh Dehghannasiri,
Julia Salzman,
Jonathan Taylor,
Robert Tibshirani
Abstract:
Pretraining is a popular and powerful paradigm in machine learning. As an example, suppose one has a modest-sized dataset of images of cats and dogs, and plans to fit a deep neural network to classify them from the pixel features. With pretraining, we start with a neural network trained on a large corpus of images, consisting of not just cats and dogs but hundreds of other image types. Then we fix all of the network weights except for the top layer (which makes the final classification) and train (or "fine tune") those weights on our dataset. This often results in dramatically better performance than the network trained solely on our smaller dataset.
In this paper, we ask: "Can pretraining help the lasso?" We develop a framework for the lasso in which an overall model is fit to a large set of data, and then fine-tuned to a specific task on a smaller dataset. This latter dataset can be a subset of the original dataset, but does not need to be. We find that this framework has a wide variety of applications, including stratified models, multinomial targets, multi-response models, conditional average treatment estimation and even gradient boosting.
In the stratified model setting, the pretrained lasso pipeline estimates the coefficients common to all groups at the first stage, and then group specific coefficients at the second "fine-tuning" stage. We show that under appropriate assumptions, the support recovery rate of the common coefficients is superior to that of the usual lasso trained only on individual groups. This separate identification of common and individual coefficients can also be useful for scientific understanding.
Submitted 18 April, 2024; v1 submitted 23 January, 2024;
originally announced January 2024.
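The two-stage idea in the stratified setting can be illustrated with a small sketch. This is a simplified illustration and not the authors' implementation: it assumes the first stage is an ordinary lasso on the pooled data and the second stage is a per-group lasso fit to the residual left by the pooled model. The solver `lasso_cd`, the function `pretrained_lasso`, the penalty values and the synthetic data are all hypothetical choices made for this example.

```python
import numpy as np


def soft_threshold(z, t):
    """Soft-thresholding operator used by the lasso coordinate update."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)


def lasso_cd(X, y, lam, n_iter=200):
    """Minimise (1/2n)||y - Xb||^2 + lam*||b||_1 by cyclic coordinate descent."""
    n, p = X.shape
    b = np.zeros(p)
    col_sq = (X ** 2).sum(axis=0) / n
    r = y.astype(float)  # running residual y - Xb (astype returns a copy)
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue
            r += X[:, j] * b[j]            # remove feature j from the fit
            rho = X[:, j] @ r / n
            b[j] = soft_threshold(rho, lam) / col_sq[j]
            r -= X[:, j] * b[j]            # add the updated feature j back
    return b


def pretrained_lasso(X, y, groups, lam_common, lam_group):
    """Hypothetical two-stage fit: pooled lasso, then per-group lasso on residuals."""
    # Stage 1 ("pretraining"): coefficients shared by all groups.
    beta_common = lasso_cd(X, y, lam_common)
    offset = X @ beta_common
    # Stage 2 ("fine-tuning"): group-specific corrections to the pooled fit.
    beta_group = {}
    for g in np.unique(groups).tolist():
        mask = groups == g
        beta_group[g] = lasso_cd(X[mask], y[mask] - offset[mask], lam_group)
    return beta_common, beta_group


# Synthetic data (assumed for illustration): two groups share features 0 and 1;
# feature 5 matters only in group 0 and feature 7 only in group 1.
rng = np.random.default_rng(0)
n_per_group, p = 200, 20
X = rng.standard_normal((2 * n_per_group, p))
groups = np.repeat([0, 1], n_per_group)
signal = 2.0 * X[:, 0] - 1.5 * X[:, 1]
signal += np.where(groups == 0, 1.0 * X[:, 5], -1.0 * X[:, 7])
y = signal + 0.1 * rng.standard_normal(2 * n_per_group)

beta_common, beta_group = pretrained_lasso(X, y, groups, lam_common=0.1, lam_group=0.05)
print("common support:", np.flatnonzero(beta_common))
for g, b in beta_group.items():
    print("group", g, "support:", np.flatnonzero(b))
```

In this sketch the pooled fit absorbs part of each group-specific effect, so the second-stage coefficients are corrections relative to the pooled model rather than full group effects. The framework described in the abstract may weight the first-stage fit and its support differently.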