-
Exploring Human-LLM Conversations: Mental Models and the Originator of Toxicity
Authors:
Johannes Schneider,
Arianna Casanova Flores,
Anne-Catherine Kranz
Abstract:
This study explores real-world human interactions with large language models (LLMs) in diverse, unconstrained settings, in contrast to most prior research, which focuses on ethically trimmed models like ChatGPT for specific tasks. We aim to understand the originator of toxicity. Our findings show that although LLMs are rightfully accused of providing toxic content, it is mostly demanded or at least provoked by humans who actively seek such content. Our manual analysis of hundreds of conversations judged as toxic by the APIs of commercial vendors also raises questions about current practices regarding which user requests are refused. Furthermore, we conjecture based on multiple empirical indicators that humans exhibit a change of their mental model, switching from the mindset of interacting with a machine more towards interacting with a human.
Submitted 8 July, 2024;
originally announced July 2024.
-
Mass-Balance MRV for Carbon Dioxide Removal by Enhanced Rock Weathering: Methods, Simulation, and Inference
Authors:
Mark Baum,
Henry Liu,
Lily Schacht,
Jake Schneider,
Mary Yap
Abstract:
Carbon dioxide will likely need to be removed from the atmosphere to avoid significant future warming and climate change. Technologies are being developed to remove large quantities of carbon from the atmosphere. Enhanced rock weathering (ERW), where fine-grained silicate minerals are spread on soil, is a promising carbon removal method that can also support crop yields and maintain overall soil health. Quantifying the amount of carbon removed by ERW is crucial for understanding the potential of ERW globally and for building trust in commercial operations. However, reliable and scalable quantification in complex media like soil is challenging and there is not yet a consensus on the best method of doing so. Here we discuss mass-balance methods, where stocks of base cations in soil are monitored over time to infer the amount of inorganic carbon brought into solution by weathering reactions. First, we review the fundamental concepts of mass-balance methods and explain different ways of approaching the mass-balance problem. Then we discuss experimental planning and data collection, suggesting some best practices. Next, we present a software package designed to facilitate a range of tasks in ERW like uncertainty analysis, planning field trials, and validating statistical methods. Finally, we briefly review ways of estimating carbon removal using mass balance before discussing some advantages of Bayesian inference in this context and presenting an example Bayesian model. The model is fit to simulated data and recovers the correct answer with a clear representation of uncertainty.
Submitted 2 July, 2024;
originally announced July 2024.
-
Probing the connection between IceCube neutrinos and MOJAVE AGN
Authors:
R. Abbasi,
M. Ackermann,
J. Adams,
S. K. Agarwalla,
J. A. Aguilar,
M. Ahlers,
J. M. Alameddine,
N. M. Amin,
K. Andeen,
C. Argüelles,
Y. Ashida,
S. Athanasiadou,
L. Ausborm,
S. N. Axani,
X. Bai,
A. Balagopal V.,
M. Baricevic,
S. W. Barwick,
S. Bash,
V. Basu,
R. Bay,
J. J. Beatty,
J. Becker Tjus,
J. Beise,
C. Bellenghi
, et al. (399 additional authors not shown)
Abstract:
Active Galactic Nuclei (AGN) are prime candidate sources of the high-energy, astrophysical neutrinos detected by IceCube. This is demonstrated by the real-time multi-messenger detection of the blazar TXS 0506+056 and the recent evidence of neutrino emission from NGC 1068 from a separate time-averaged study. However, the production mechanism of the astrophysical neutrinos in AGN is not well established, an open question that correlation studies with photon observations can help resolve. For neutrinos produced due to photohadronic interactions in AGN, in addition to a correlation of neutrinos with high-energy photons, there would also be a correlation of neutrinos with photons emitted at radio wavelengths. In this work, we perform an in-depth stacking study of the correlation between 15 GHz radio observations of AGN reported in the MOJAVE XV catalog, and ten years of neutrino data from IceCube. We also use a time-dependent approach which improves the statistical power of the stacking analysis. No significant correlation was found in either analysis, and upper limits are reported. When compared to the IceCube diffuse flux, at 100 TeV and for a spectral index of 2.5, the upper limits derived are $\sim3\%$ and $\sim9\%$ for the time-averaged and time-dependent cases, respectively.
Submitted 1 July, 2024;
originally announced July 2024.
-
Search for a light sterile neutrino with 7.5 years of IceCube DeepCore data
Authors:
R. Abbasi,
M. Ackermann,
J. Adams,
S. K. Agarwalla,
J. A. Aguilar,
M. Ahlers,
J. M. Alameddine,
N. M. Amin,
K. Andeen,
C. Argüelles,
Y. Ashida,
S. Athanasiadou,
L. Ausborm,
S. N. Axani,
X. Bai,
A. Balagopal V.,
M. Baricevic,
S. W. Barwick,
S. Bash,
V. Basu,
R. Bay,
J. J. Beatty,
J. Becker Tjus,
J. Beise,
C. Bellenghi
, et al. (399 additional authors not shown)
Abstract:
We present a search for an eV-scale sterile neutrino using 7.5 years of data from the IceCube DeepCore detector. The analysis uses a sample of 21,914 events with energies between 5 and 150 GeV to search for sterile neutrinos through atmospheric muon neutrino disappearance. Improvements in event selection and treatment of systematic uncertainties provide greater statistical power compared to previous DeepCore sterile neutrino searches. Our results are compatible with the absence of mixing between active and sterile neutrino states, and we place constraints on the mixing matrix elements $|U_{μ4}|^2 < 0.0534$ and $|U_{τ4}|^2 < 0.0574$ at 90% CL under the assumption that $Δm^2_{41}\geq 1\;\mathrm{eV^2}$. These null results add to the growing tension between anomalous appearance results and constraints from disappearance searches in the 3+1 sterile neutrino landscape.
Submitted 1 July, 2024;
originally announced July 2024.
-
Adversarial Online Learning with Temporal Feedback Graphs
Authors:
Khashayar Gatmiry,
Jon Schneider
Abstract:
We study a variant of prediction with expert advice where the learner's action at round $t$ is only allowed to depend on losses on a specific subset of the rounds (where the structure of which rounds' losses are visible at time $t$ is provided by a directed "feedback graph" known to the learner). We present a novel learning algorithm for this setting based on a strategy of partitioning the losses across sub-cliques of this graph. We complement this with a lower bound that is tight in many practical settings, and which we conjecture to be within a constant factor of optimal. For the important class of transitive feedback graphs, we prove that this algorithm is efficiently implementable and obtains the optimal regret bound (up to a universal constant).
Submitted 29 June, 2024;
originally announced July 2024.
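For orientation, the classical full-feedback version of prediction with expert advice can be sketched with the standard Hedge (exponential weights) rule. This is a baseline illustration only, not the feedback-graph algorithm of the paper above: it assumes every past loss is visible to the learner, and the function name `hedge` and its parameters are hypothetical.

```python
import math

def hedge(loss_rounds, eta):
    """Run Hedge on a sequence of loss vectors with entries in [0, 1].

    Assumes full feedback: after each round, all experts' losses are seen.
    Returns (expected loss of the learner, regret against the best expert).
    """
    n_experts = len(loss_rounds[0])
    cumulative = [0.0] * n_experts
    learner_loss = 0.0
    for losses in loss_rounds:
        # Weights are exp(-eta * cumulative loss); subtracting the minimum
        # keeps the exponentials numerically stable.
        base = min(cumulative)
        weights = [math.exp(-eta * (c - base)) for c in cumulative]
        total = sum(weights)
        probs = [w / total for w in weights]
        # Expected loss of sampling an expert from probs this round.
        learner_loss += sum(p * l for p, l in zip(probs, losses))
        cumulative = [c + l for c, l in zip(cumulative, losses)]
    return learner_loss, learner_loss - min(cumulative)
```

With `eta = sqrt(8 ln N / T)` for `N` experts and `T` rounds, the textbook analysis bounds the regret by `sqrt(T ln N / 2)`. The setting described in the abstract is harder because the learner may only condition on the subset of past rounds that the feedback graph exposes.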
-
Complex Dynamics in Autobidding Systems
Authors:
Renato Paes Leme,
Georgios Piliouras,
Jon Schneider,
Kelly Spendlove,
Song Zuo
Abstract:
It has become the default in markets such as ad auctions for participants to bid in an auction through automated bidding agents (autobidders) which adjust bids over time to satisfy return-over-spend constraints. Despite the prominence of such systems for the internet economy, their resulting dynamical behavior is still not well understood. Although one might hope that such relatively simple systems would typically converge to the equilibria of their underlying auctions, we provide a plethora of results that show the emergence of complex behavior, such as bi-stability, periodic orbits and quasi-periodicity. We empirically observe how the market structure (expressed as motifs) qualitatively affects the behavior of the dynamics. We complement this with theoretical results showing that autobidding systems can simulate both linear dynamical systems and logical Boolean gates.
Submitted 1 July, 2024; v1 submitted 27 June, 2024;
originally announced June 2024.
-
Benchmarking M6 Competitors: An Analysis of Financial Metrics and Discussion of Incentives
Authors:
Matthew J. Schneider,
Rufus Rankin,
Prabir Burman,
Alexander Aue
Abstract:
The M6 Competition assessed the performance of competitors using a ranked probability score and an information ratio (IR). While these metrics do well at picking the winners in the competition, crucial questions remain for investors with longer-term incentives. To address these questions, we compare the competitors' performance to a number of conventional (long-only) and alternative indices using standard industry metrics. We apply factor models to the competitors' returns and show the difficulty for any competitor to demonstrate a statistically significant value-add above industry-standard benchmarks within the short timeframe of the competition. We also uncover that most competitors generated lower risk-adjusted returns and lower maximum drawdowns than randomly selected portfolios, and that most competitors could not generate significant out-performance in raw returns. We further introduce two new strategies by picking the competitors with the best (Superstars) and worst (Superlosers) recent performance and show that it is challenging to identify skill amongst investment managers. Overall, our findings highlight the difference in incentives for competitors over professional investors, where the upside of winning the competition dwarfs the potential downside of not winning to maximize fees over an extended period of time.
Submitted 27 June, 2024;
originally announced June 2024.
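The two risk metrics named in the abstract above, a risk-adjusted return ratio and maximum drawdown, can be illustrated with a minimal sketch. The definitions used here are common textbook forms and are assumptions: the exact M6 formula for the information ratio (for example, its use of log returns or any annualization) is not reproduced, and the function names are hypothetical.

```python
import statistics

def information_ratio(returns):
    """Mean per-period return divided by its sample standard deviation.

    Assumption: returns are already measured relative to the chosen benchmark.
    """
    return statistics.mean(returns) / statistics.stdev(returns)

def max_drawdown(returns):
    """Largest peak-to-trough loss of the compounded wealth curve, as a fraction."""
    wealth = 1.0
    peak = 1.0
    worst = 0.0
    for r in returns:
        wealth *= 1.0 + r            # compound the period return
        peak = max(peak, wealth)     # running maximum of wealth
        worst = max(worst, 1.0 - wealth / peak)
    return worst
```

For the return series `[0.1, -0.5, 0.2]`, the wealth curve peaks at 1.1 and falls to 0.55, so `max_drawdown` reports a drawdown of one half.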
-
Soft-QMIX: Integrating Maximum Entropy For Monotonic Value Function Factorization
Authors:
Wentse Chen,
Shiyu Huang,
Jeff Schneider
Abstract:
Multi-agent reinforcement learning (MARL) tasks often utilize a centralized training with decentralized execution (CTDE) framework. QMIX is a successful CTDE method that learns a credit assignment function to derive local value functions from a global value function, defining a deterministic local policy. However, QMIX is hindered by its poor exploration strategy. While maximum entropy reinforcement learning (RL) promotes better exploration through stochastic policies, QMIX's process of credit assignment conflicts with the maximum entropy objective and the decentralized execution requirement, making it unsuitable for maximum entropy RL. In this paper, we propose an enhancement to QMIX by incorporating an additional local Q-value learning method within the maximum entropy RL framework. Our approach constrains the local Q-value estimates to maintain the correct ordering of all actions. Due to the monotonicity of the QMIX value function, these updates ensure that locally optimal actions align with globally optimal actions. We theoretically prove the monotonic improvement and convergence of our method to an optimal solution. Experimentally, we validate our algorithm in matrix games and the Multi-Agent Particle Environment, and demonstrate state-of-the-art performance in SMAC-v2.
Submitted 19 June, 2024;
originally announced June 2024.
-
Planning with Adaptive World Models for Autonomous Driving
Authors:
Arun Balajee Vasudevan,
Neehar Peri,
Jeff Schneider,
Deva Ramanan
Abstract:
Motion planning is crucial for safe navigation in complex urban environments. Historically, motion planners (MPs) have been evaluated with procedurally-generated simulators like CARLA. However, such synthetic benchmarks do not capture real-world multi-agent interactions. nuPlan, a recently released MP benchmark, addresses this limitation by augmenting real-world driving logs with closed-loop simulation logic, effectively turning the fixed dataset into a reactive simulator. We analyze the characteristics of nuPlan's recorded logs and find that each city has its own unique driving behaviors, suggesting that robust planners must adapt to different environments. We learn to model such unique behaviors with BehaviorNet, a graph convolutional neural network (GCNN) that predicts reactive agent behaviors using features derived from recently-observed agent histories; intuitively, some aggressive agents may tailgate lead vehicles, while others may not. To model such phenomena, BehaviorNet predicts parameters of an agent's motion controller rather than predicting its spacetime trajectory (as most forecasters do). Finally, we present AdaptiveDriver, a model-predictive control (MPC) based planner that unrolls different world models conditioned on BehaviorNet's predictions. Our extensive experiments demonstrate that AdaptiveDriver achieves state-of-the-art results on the nuPlan closed-loop planning benchmark, reducing test error from 6.4% to 4.6%, even when applied to never-before-seen cities.
Submitted 15 June, 2024;
originally announced June 2024.
-
Embracing the Enemy
Authors:
Johannes Schneider,
Álvaro Delgado-Vega
Abstract:
We study repeated interactions between two power-hungry agents, the ``friend'' and the ``enemy,'' and a power-broker, the principal. All three care about the leading agent's policy choice. The principal cannot fully control leadership allocation, but has some influence and aligns more with the friend. After an initial cordon sanitaire breaks, the principal embraces the enemy, sometimes promising persistent support: she grants the enemy power in exchange for moderation, which benefits the friend who reciprocates. The closer the principal is to the friend, the more she desires to embrace the enemy, but the harder it is to uphold such promises.
Submitted 11 July, 2024; v1 submitted 14 June, 2024;
originally announced June 2024.
-
Data-driven Thermal Modeling for Electrically Excited Synchronous Motors -- A Supervised Machine Learning Approach
Authors:
Farzaneh Tatari,
Davis Trapp,
Jason Schneider,
Mohsen Mirza Aligoudarzi
Abstract:
This paper proposes a data-driven supervised machine learning (ML) approach for online thermal modeling of electrically excited synchronous motors (EESMs). EESMs are desired for EVs due to their high performance, efficiency, and durability at a relatively low cost. Therefore, obtaining precise EESM temperature estimates is important, because accurate online temperature estimation can improve EESM performance and guarantee its safety and reliability. In this study, in addition to the default input data, EESM loss data is leveraged to improve the performance of the proposed ML approach for thermal modeling. Exponentially weighted moving averages and standard deviations of the inputs are also incorporated in the learning process to account for the memory effect when building a dynamical thermal model. Using the experimental data of an EESM prototype, the performance of the ordinary least squares (OLS) method is evaluated through a complete training, testing and cross-validation process. Finally, simulation results provide the key performance metrics of OLS for EESM thermal modeling.
Submitted 12 June, 2024;
originally announced June 2024.
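The exponentially weighted features mentioned in the abstract above can be sketched as a simple recursion. This is an illustration under stated assumptions, not the authors' implementation: the smoothing convention `alpha = 2 / (span + 1)`, the initialization at the first sample, and the name `ewm_features` are all assumed here, and libraries such as pandas use slightly different default weighting.

```python
def ewm_features(x, span):
    """Exponentially weighted moving average and standard deviation of x.

    Assumed convention: alpha = 2 / (span + 1), state initialized at x[0].
    Returns two lists with the same length as x.
    """
    alpha = 2.0 / (span + 1.0)
    mean = x[0]
    var = 0.0
    means, stds = [], []
    for value in x:
        diff = value - mean
        incr = alpha * diff
        mean = mean + incr                         # recursive mean update
        var = (1.0 - alpha) * (var + diff * incr)  # recursive variance update
        means.append(mean)
        stds.append(var ** 0.5)
    return means, stds
```

Each raw input signal (for example a current or speed channel) would contribute its smoothed mean and standard deviation as extra regressors, which lets a static model such as OLS see a summary of recent history.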
-
IceCube Search for Neutrino Emission from X-ray Bright Seyfert Galaxies
Authors:
R. Abbasi,
M. Ackermann,
J. Adams,
S. K. Agarwalla,
J. A. Aguilar,
M. Ahlers,
J. M. Alameddine,
N. M. Amin,
K. Andeen,
C. Argüelles,
Y. Ashida,
S. Athanasiadou,
L. Ausborm,
S. N. Axani,
X. Bai,
A. Balagopal V.,
M. Baricevic,
S. W. Barwick,
S. Bash,
V. Basu,
R. Bay,
J. J. Beatty,
J. Becker Tjus,
J. Beise,
C. Bellenghi
, et al. (400 additional authors not shown)
Abstract:
The recent IceCube detection of TeV neutrino emission from the nearby active galaxy NGC 1068 suggests that active galactic nuclei (AGN) could make a sizable contribution to the diffuse flux of astrophysical neutrinos. The absence of TeV $γ$-rays from NGC 1068 indicates neutrino production in the vicinity of the supermassive black hole, where the high radiation density leads to $γ$-ray attenuation. Therefore, any potential neutrino emission from similar sources is not expected to correlate with high-energy $γ$-rays. Disk-corona models predict neutrino emission from Seyfert galaxies to correlate with keV X-rays, as they are tracers of coronal activity. Using through-going track events from the Northern Sky recorded by IceCube between 2011 and 2021, we report results from a search for individual and aggregated neutrino signals from 27 additional Seyfert galaxies that are contained in the BAT AGN Spectroscopic Survey (BASS). Besides the generic single power-law, we evaluate the spectra predicted by the disk-corona model. Assuming all sources to be intrinsically similar to NGC 1068, our findings constrain the collective neutrino emission from X-ray bright Seyfert galaxies in the Northern Hemisphere, but, at the same time, show excesses of neutrinos that could be associated with the objects NGC 4151 and CGCG 420-015. These excesses result in a 2.7$σ$ significance with respect to background expectations.
Submitted 11 June, 2024;
originally announced June 2024.
-
Rate-Preserving Reductions for Blackwell Approachability
Authors:
Christoph Dann,
Yishay Mansour,
Mehryar Mohri,
Jon Schneider,
Balasubramanian Sivan
Abstract:
Abernethy et al. (2011) showed that Blackwell approachability and no-regret learning are equivalent, in the sense that any algorithm that solves a specific Blackwell approachability instance can be converted to a sublinear regret algorithm for a specific no-regret learning instance, and vice versa. In this paper, we study a more fine-grained form of such reductions, and ask when this translation between problems preserves not only a sublinear rate of convergence, but also preserves the optimal rate of convergence. That is, in which cases does it suffice to find the optimal regret bound for a no-regret learning instance in order to find the optimal rate of convergence for a corresponding approachability instance?
We show that the reduction of Abernethy et al. (2011) does not preserve rates: their reduction may reduce a $d$-dimensional approachability instance $I_1$ with optimal convergence rate $R_1$ to a no-regret learning instance $I_2$ with optimal regret-per-round of $R_2$, with $R_{2}/R_{1}$ arbitrarily large (in particular, it is possible that $R_1 = 0$ and $R_{2} > 0$). On the other hand, we show that it is possible to tightly reduce any approachability instance to an instance of a generalized form of regret minimization we call improper $φ$-regret minimization (a variant of the $φ$-regret minimization of Gordon et al. (2008) where the transformation functions may map actions outside of the action set).
Finally, we characterize when linear transformations suffice to reduce improper $φ$-regret minimization problems to standard classes of regret minimization problems in a rate preserving manner. We prove that some improper $φ$-regret minimization instances cannot be reduced to either subclass of instance in this way, suggesting that approachability can capture some problems that cannot be phrased in the language of online learning.
Submitted 10 June, 2024;
originally announced June 2024.
-
Search for neutrino emission from hard X-ray AGN with IceCube
Authors:
R. Abbasi,
M. Ackermann,
J. Adams,
S. K. Agarwalla,
J. A. Aguilar,
M. Ahlers,
J. M. Alameddine,
N. M. Amin,
K. Andeen,
C. Argüelles,
Y. Ashida,
S. Athanasiadou,
L. Ausborm,
S. N. Axani,
X. Bai,
A. Balagopal V.,
M. Baricevic,
S. W. Barwick,
S. Bash,
V. Basu,
R. Bay,
J. J. Beatty,
J. Becker Tjus,
J. Beise,
C. Bellenghi
, et al. (401 additional authors not shown)
Abstract:
Active Galactic Nuclei (AGN) are promising candidate sources of high-energy astrophysical neutrinos since they provide environments rich in matter and photon targets where cosmic ray interactions may lead to the production of gamma rays and neutrinos. We searched for high-energy neutrino emission from AGN using the $\textit{Swift}$-BAT Spectroscopic Survey (BASS) catalog of hard X-ray sources and 12 years of IceCube muon track data. First, upon performing a stacked search, no significant emission was found. Second, we searched for neutrinos from a list of 43 candidate sources and found an excess from the direction of two sources, Seyfert galaxies NGC 1068 and NGC 4151. We observed NGC 1068 at flux $φ_{ν_μ+\barν_μ}$ = $4.02_{-1.52}^{+1.58} \times 10^{-11}$ TeV$^{-1}$ cm$^{-2}$ s$^{-1}$ normalized at 1 TeV, with power-law spectral index, $γ$ = 3.10$^{+0.26}_{-0.22}$, consistent with previous IceCube results. The observation of a neutrino excess from the direction of NGC 4151 is at a post-trial significance of 2.9$σ$. If interpreted as an astrophysical signal, the excess observed from NGC 4151 corresponds to a flux $φ_{ν_μ+\barν_μ}$ = $1.51_{-0.81}^{+0.99} \times 10^{-11}$ TeV$^{-1}$ cm$^{-2}$ s$^{-1}$ normalized at 1 TeV and $γ$ = 2.83$^{+0.35}_{-0.28}$.
Submitted 12 June, 2024; v1 submitted 10 June, 2024;
originally announced June 2024.
-
Hidden Holes: topological aspects of language models
Authors:
Stephen Fitz,
Peter Romero,
Jiyan Jonas Schneider
Abstract:
We explore the topology of representation manifolds arising in autoregressive neural language models trained on raw text data. In order to study their properties, we introduce tools from computational algebraic topology, which we use as a basis for a measure of topological complexity, that we call perforation.
Using this measure, we study the evolution of topological structure in GPT based large language models across depth and time during training. We then compare these to gated recurrent models, and show that the latter exhibit more topological complexity, with a distinct pattern of changes common to all natural languages but absent from synthetically generated data. The paper presents a detailed analysis of the representation manifolds derived by these models based on studying the shapes of vector clouds induced by them as they are conditioned on sentences from corpora of natural language text.
The methods developed in this paper are novel in the field and based on mathematical apparatus that might be unfamiliar to the target audience. To help with that we introduce the minimum necessary theory, and provide additional visualizations in the appendices.
The main contribution of the paper is a striking observation about the topological structure of the transformer as compared to LSTM based neural architectures. It suggests that further research into mathematical properties of these neural networks is necessary to understand the operation of large transformer language models. We hope this work inspires further explorations in this direction within the NLP community.
Submitted 9 June, 2024;
originally announced June 2024.
-
The PLATO Mission
Authors:
Heike Rauer,
Conny Aerts,
Juan Cabrera,
Magali Deleuil,
Anders Erikson,
Laurent Gizon,
Mariejo Goupil,
Ana Heras,
Jose Lorenzo-Alvarez,
Filippo Marliani,
Cesar Martin-Garcia,
J. Miguel Mas-Hesse,
Laurence O'Rourke,
Hugh Osborn,
Isabella Pagano,
Giampaolo Piotto,
Don Pollacco,
Roberto Ragazzoni,
Gavin Ramsay,
Stéphane Udry,
Thierry Appourchaux,
Willy Benz,
Alexis Brandeker,
Manuel Güdel,
Eduardo Janot-Pacheco
, et al. (801 additional authors not shown)
Abstract:
PLATO (PLAnetary Transits and Oscillations of stars) is ESA's M3 mission designed to detect and characterise extrasolar planets and perform asteroseismic monitoring of a large number of stars. PLATO will detect small planets (down to <2 R_(Earth)) around bright stars (<11 mag), including terrestrial planets in the habitable zone of solar-like stars. With the complement of radial velocity observations from the ground, planets will be characterised for their radius, mass, and age with high accuracy (5 %, 10 %, 10 % for an Earth-Sun combination respectively). PLATO will provide us with a large-scale catalogue of well-characterised small planets up to intermediate orbital periods, relevant for a meaningful comparison to planet formation theories and to better understand planet evolution. It will make possible comparative exoplanetology to place our Solar System planets in a broader context. In parallel, PLATO will study (host) stars using asteroseismology, allowing us to determine the stellar properties with high accuracy, substantially enhancing our knowledge of stellar structure and evolution.
The payload instrument consists of 26 cameras, each with a 12 cm aperture. For at least four years, the mission will perform high-precision photometric measurements. Here we review the science objectives, present PLATO's target samples and fields, provide an overview of expected core science performance as well as a description of the instrument and the mission profile at the beginning of the serial production of the flight cameras. PLATO is scheduled for launch at the end of 2026. This overview therefore provides a summary of the mission to the community in preparation for the upcoming operational phases.
Submitted 8 June, 2024;
originally announced June 2024.
-
Exploration of mass splitting and muon/tau mixing parameters for an eV-scale sterile neutrino with IceCube
Authors:
R. Abbasi,
M. Ackermann,
J. Adams,
S. K. Agarwalla,
J. A. Aguilar,
M. Ahlers,
J. M. Alameddine,
N. M. Amin,
K. Andeen,
C. Argüelles,
Y. Ashida,
S. Athanasiadou,
L. Ausborm,
S. N. Axani,
X. Bai,
A. Balagopal V.,
M. Baricevic,
S. W. Barwick,
S. Bash,
V. Basu,
R. Bay,
J. J. Beatty,
J. Becker Tjus,
J. Beise,
C. Bellenghi
, et al. (400 additional authors not shown)
Abstract:
We present the first three-parameter fit to a 3+1 sterile neutrino model using 7.634 years of data from the IceCube Neutrino Observatory on $ν_μ+\overline{ν}_μ$ charged-current interactions in the energy range 500-9976 GeV. Our analysis is sensitive to the mass-squared splitting between the heaviest and lightest mass state ($Δm_{41}^2$), the mixing matrix element connecting muon flavor to the fourth mass state ($|U_{\mu4}|^2$), and the element connecting tau flavor to the fourth mass state ($|U_{\tau4}|^2$). Predicted propagation effects in matter enhance the signature through a resonance as atmospheric neutrinos from the Northern Hemisphere traverse the Earth to the IceCube detector at the South Pole. The result is consistent with the no-sterile-neutrino hypothesis with a probability of 4.3 %. Profiling the likelihood of each parameter yields the 90 % confidence levels: $2.4\,\mathrm{eV}^{2} < Δm_{41}^2 < 9.6\,\mathrm{eV}^{2}$, $0.0081 < |U_{\mu4}|^2 < 0.10$, and $|U_{\tau4}|^2 < 0.035$, which narrows the allowed parameter space for $|U_{\tau4}|^2$. However, the primary result of this analysis is the first map of the 3+1 parameter space exploring the interdependence of $Δm_{41}^2$, $|U_{\mu4}|^2$, and $|U_{\tau4}|^2$.
Submitted 2 June, 2024;
originally announced June 2024.
-
What is Your Data Worth to GPT? LLM-Scale Data Valuation with Influence Functions
Authors:
Sang Keun Choe,
Hwijeen Ahn,
Juhan Bae,
Kewen Zhao,
Minsoo Kang,
Youngseog Chung,
Adithya Pratapa,
Willie Neiswanger,
Emma Strubell,
Teruko Mitamura,
Jeff Schneider,
Eduard Hovy,
Roger Grosse,
Eric Xing
Abstract:
Large language models (LLMs) are trained on a vast amount of human-written data, but data providers often remain uncredited. In response to this issue, data valuation (or data attribution), which quantifies the contribution or value of each data to the model output, has been discussed as a potential solution. Nevertheless, applying existing data valuation methods to recent LLMs and their vast training datasets has been largely limited by prohibitive compute and memory costs. In this work, we focus on influence functions, a popular gradient-based data valuation method, and significantly improve its scalability with an efficient gradient projection strategy called LoGra that leverages the gradient structure in backpropagation. We then provide a theoretical motivation of gradient projection approaches to influence functions to promote trust in the data valuation process. Lastly, we lower the barrier to implementing data valuation systems by introducing LogIX, a software package that can transform existing training code into data valuation code with minimal effort. In our data valuation experiments, LoGra achieves competitive accuracy against more expensive baselines while showing up to 6,500x improvement in throughput and 5x reduction in GPU memory usage when applied to Llama3-8B-Instruct and the 1B-token dataset.
Submitted 22 May, 2024;
originally announced May 2024.
-
Building a Simplistic Automatic Extruder: Instrument Development Opportunities for the Laboratory
Authors:
Stefanie Klisch,
Dylan Gilbert,
Emma Breaux,
Aliyah Dalier,
Sudipta Gupta,
Bruno Jakobi,
Gerald J. Schneider
Abstract:
A well-rounded introduction to work in a STEM laboratory is vital to scientific education. Besides the ability to use available instrumentation for sample characterization, students should also be imparted knowledge in the steps of instrument development and construction. These concepts can be taught using the example of lipid vesicle preparation via extrusion. Vesicle extrusion is a common technique that involves syringes pushing solutions through membrane filters and is used in fundamental studies on vesicles. Such research is important for a better understanding of biological phenomena and for drug development. Well-prepared samples are key to successful research. While the manual approach is very useful to acquire experience, automatic extrusion is more convenient, and automation often results in better reproducibility. These advantages can be combined in a simplistic automatic extruder that does not require advanced technical skills to be assembled. It can therefore be used by various groups, ranging from undergraduate to graduate students, using equipment typically available. Using this approach, students can acquire different skillsets including coding, testing, and advanced use of building materials based on their properties. Finally, the quality of the automatic extruder is verified.
Submitted 2 April, 2024;
originally announced May 2024.
-
Methods and stability tests associated with the sterile neutrino search using improved high-energy $ν_μ$ event reconstruction in IceCube
Authors:
IceCube Collaboration,
R. Abbasi,
M. Ackermann,
J. Adams,
S. K. Agarwalla,
J. A. Aguilar,
M. Ahlers,
J. M. Alameddine,
N. M. Amin,
K. Andeen,
C. Argüelles,
Y. Ashida,
S. Athanasiadou,
L. Ausborm,
S. N. Axani,
X. Bai,
A. Balagopal V.,
M. Baricevic,
S. W. Barwick,
S. Bash,
V. Basu,
R. Bay,
J. J. Beatty,
J. Becker Tjus,
J. Beise
, et al. (398 additional authors not shown)
Abstract:
We provide supporting details for the search for a 3+1 sterile neutrino using data collected over eleven years at the IceCube Neutrino Observatory. The analysis uses atmospheric muon-flavored neutrinos from 0.5 to 100 TeV that traverse the Earth to reach the IceCube detector, and finds a best-fit point at $\sin^2(2θ_{24}) = 0.16$ and $Δm^{2}_{41} = 3.5$ eV$^2$ with a goodness-of-fit p-value of 12% and consistency with the null hypothesis of no oscillations to sterile neutrinos with a p-value of 3.1%. Several improvements were made over past analyses, which are reviewed in this article, including upgrades to the reconstruction and the study of sources of systematic uncertainty. We provide details of the fit quality and discuss stability tests that split the data for separate samples, comparing results. We find that the fits are consistent between split data sets.
Submitted 13 May, 2024;
originally announced May 2024.
-
A search for an eV-scale sterile neutrino using improved high-energy $ν_μ$ event reconstruction in IceCube
Authors:
IceCube Collaboration,
R. Abbasi,
M. Ackermann,
J. Adams,
S. K. Agarwalla,
J. A. Aguilar,
M. Ahlers,
J. M. Alameddine,
N. M. Amin,
K. Andeen,
C. Argüelles,
Y. Ashida,
S. Athanasiadou,
L. Ausborm,
S. N. Axani,
X. Bai,
A. Balagopal V.,
M. Baricevic,
S. W. Barwick,
S. Bash,
V. Basu,
R. Bay,
J. J. Beatty,
J. Becker Tjus,
J. Beise
, et al. (398 additional authors not shown)
Abstract:
This Letter presents the result of a 3+1 sterile neutrino search using 10.7 years of IceCube data. We analyze atmospheric muon neutrinos that traverse the Earth with energies ranging from 0.5 to 100 TeV, incorporating significant improvements in modeling neutrino flux and detector response compared to earlier studies. Notably, for the first time, we categorize data into starting and through-going events, distinguishing neutrino interactions with vertices inside or outside the instrumented volume, to improve energy resolution. The best-fit point for a 3+1 model is found to be at $\sin^2(2θ_{24}) = 0.16$ and $Δm^{2}_{41} = 3.5$ eV$^2$, which agrees with previous iterations of this study. The result is consistent with the null hypothesis of no sterile neutrinos with a p-value of 3.1%.
Submitted 13 May, 2024;
originally announced May 2024.
-
Search for joint multimessenger signals from potential Galactic PeVatrons with HAWC and IceCube
Authors:
R. Alfaro,
C. Alvarez,
J. C. Arteaga-Velázquez,
D. Avila Rojas,
H. A. Ayala Solares,
R. Babu,
E. Belmont-Moreno,
K. S. Caballero-Mora,
T. Capistrán,
A. Carramiñana,
S. Casanova,
U. Cotti,
J. Cotzomi,
S. Coutiño de León,
E. De la Fuente,
D. Depaoli,
N. Di Lalla,
R. Diaz Hernandez,
J. C. Díaz-Vélez,
K. Engel,
T. Ergin,
K. L. Fan,
K. Fang,
N. Fraija,
S. Fraija
, et al. (469 additional authors not shown)
Abstract:
Galactic PeVatrons are sources that can accelerate cosmic rays to PeV energies. The high-energy cosmic rays are expected to interact with the surrounding ambient material or radiation, resulting in the production of gamma rays and neutrinos. To optimize for the detection of such associated production of gamma rays and neutrinos for a given source morphology and spectrum, a multi-messenger analysis that combines gamma rays and neutrinos is required. In this study, we use the Multi-Mission Maximum Likelihood framework (3ML) with IceCube Maximum Likelihood Analysis software (i3mla) and HAWC Accelerated Likelihood (HAL) to search for a correlation between 22 known gamma-ray sources from the third HAWC gamma-ray catalog and 14 years of IceCube track-like data. No significant neutrino emission from the direction of the HAWC sources was found. We report the best-fit gamma-ray model and 90% CL neutrino flux limit from the 22 sources. From the neutrino flux limit, we conclude that the gamma-ray emission from five of the sources cannot be produced purely from hadronic interactions. We report the limit for the fraction of gamma rays produced by hadronic interactions for these five sources.
Submitted 6 May, 2024;
originally announced May 2024.
-
Acceptance Tests of more than 10 000 Photomultiplier Tubes for the multi-PMT Digital Optical Modules of the IceCube Upgrade
Authors:
R. Abbasi,
M. Ackermann,
J. Adams,
S. K. Agarwalla,
J. A. Aguilar,
M. Ahlers,
J. M. Alameddine,
N. M. Amin,
K. Andeen,
C. Argüelles,
Y. Ashida,
S. Athanasiadou,
L. Ausborm,
S. N. Axani,
X. Bai,
A. Balagopal V.,
M. Baricevic,
S. W. Barwick,
S. Bash,
V. Basu,
R. Bay,
J. J. Beatty,
J. Becker Tjus,
J. Beise,
C. Bellenghi
, et al. (399 additional authors not shown)
Abstract:
More than 10,000 photomultiplier tubes (PMTs) with a diameter of 80 mm will be installed in multi-PMT Digital Optical Modules (mDOMs) of the IceCube Upgrade. These have been tested and pre-calibrated at two sites. A throughput of more than 1000 PMTs per week with both sites was achieved with a modular design of the testing facilities and highly automated testing procedures. The testing facilities can easily be adapted to other PMTs, such that they can, e.g., be re-used for testing the PMTs for IceCube-Gen2. Single photoelectron response, high voltage dependence, time resolution, prepulse, late pulse, afterpulse probabilities, and dark rates were measured for each PMT. We describe the design of the testing facilities, the testing procedures, and the results of the acceptance tests.
Submitted 20 June, 2024; v1 submitted 30 April, 2024;
originally announced April 2024.
-
Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data
Authors:
Fahim Tajwar,
Anikait Singh,
Archit Sharma,
Rafael Rafailov,
Jeff Schneider,
Tengyang Xie,
Stefano Ermon,
Chelsea Finn,
Aviral Kumar
Abstract:
Learning from preference labels plays a crucial role in fine-tuning large language models. There are several distinct approaches for preference fine-tuning, including supervised learning, on-policy reinforcement learning (RL), and contrastive learning. Different methods come with different implementation tradeoffs and performance differences, and existing empirical findings present different conclusions: for instance, some results show that online RL is quite important to attain good fine-tuning results, while others find (offline) contrastive or even purely supervised methods sufficient. This raises a natural question: what kind of approaches are important for fine-tuning with preference data and why? In this paper, we answer this question by performing a rigorous analysis of a number of fine-tuning techniques on didactic and full-scale LLM problems. Our main finding is that, in general, approaches that use on-policy sampling or attempt to push down the likelihood on certain responses (i.e., employ a "negative gradient") outperform offline and maximum likelihood objectives. We conceptualize our insights and unify methods that use on-policy sampling or negative gradient under a notion of mode-seeking objectives for categorical distributions. Mode-seeking objectives are able to alter probability mass on specific bins of a categorical distribution at a fast rate compared to maximum likelihood, allowing them to relocate masses across bins more effectively. Our analysis prescribes actionable insights for preference fine-tuning of LLMs and informs how data should be collected for maximal improvement.
Submitted 2 June, 2024; v1 submitted 22 April, 2024;
originally announced April 2024.
-
Full Shot Predictions for the DIII-D Tokamak via Deep Recurrent Networks
Authors:
Ian Char,
Youngseog Chung,
Joseph Abbate,
Egemen Kolemen,
Jeff Schneider
Abstract:
Although tokamaks are one of the most promising devices for realizing nuclear fusion as an energy source, there are still key obstacles when it comes to understanding the dynamics of the plasma and controlling it. As such, it is crucial that high quality models are developed to assist in overcoming these obstacles. In this work, we take an entirely data driven approach to learn such a model. In particular, we use historical data from the DIII-D tokamak to train a deep recurrent network that is able to predict the full time evolution of plasma discharges (or "shots"). Following this, we investigate how different training and inference procedures affect the quality and calibration of the shot predictions.
Submitted 17 April, 2024;
originally announced April 2024.
-
Vignetting Effects: a Tool to Characterize a Fourier Ptychographic Microscope
Authors:
John Meshreki,
Syed Muhammad Kazim,
Jan Philipp Schneider,
Michael Moeller,
Ivo Ihrke
Abstract:
Fourier Ptychographic Microscopy (FPM) is a recent technique to overcome the diffraction limit of a low numerical aperture (NA) objective lens by algorithmic post-processing of several lower resolved images. It can increase the space-bandwidth product of an optical system by computationally combining images captured under different illumination conditions. Vignetting determines the spatial extent of the bright field and dark field regions in the captured images that contain information about low and high frequency image content, respectively. State-of-the-art analyses treat vignetting as a nuisance that needs to be reduced or excluded from algorithmic consideration by means of ad-hoc decision rules [1]. In contrast, this work investigates vignetting effects as a tool to infer a range of properties of the optical system. To achieve this, we characterize the individual system components of the experimental setup and compare experimental data to both geometrical and wave-optical simulations. We demonstrate that using vignetting as an analytical tool enables the modeling of the geometric and coherence properties of the optical system as evidenced by the good agreement between our simulation and experiment. Moreover, our work investigates pupil aberrations in the FPM setup and enables their partial characterization, despite not yet encompassing all aspects.
Submitted 17 April, 2024;
originally announced April 2024.
-
Explainable Generative AI (GenXAI): A Survey, Conceptualization, and Research Agenda
Authors:
Johannes Schneider
Abstract:
Generative AI (GenAI) marked a shift from AI being able to recognize to AI being able to generate solutions for a wide variety of tasks. As the generated solutions and applications become increasingly more complex and multi-faceted, novel needs, objectives, and possibilities have emerged for explainability (XAI). In this work, we elaborate on why XAI has gained importance with the rise of GenAI and its challenges for explainability research. We also unveil novel and emerging desiderata that explanations should fulfill, covering aspects such as verifiability, interactivity, security, and cost. To this end, we focus on surveying existing works. Furthermore, we provide a taxonomy of relevant dimensions that allows us to better characterize existing XAI mechanisms and methods for GenAI. We discuss different avenues to ensure XAI, from training data to prompting. Our paper offers a short but concise technical background of GenAI for non-technical readers, focusing on text and images to better understand novel or adapted XAI techniques for GenAI. However, due to the vast array of works on GenAI, we decided to forego detailed aspects of XAI related to evaluation and usage of explanations. As such, the manuscript interests both technically oriented people and other disciplines, such as social scientists and information systems researchers. Our research roadmap provides more than ten directions for future investigation.
Submitted 15 April, 2024;
originally announced April 2024.
-
Explicit Sarkisov program for regular surfaces over arbitrary fields and applications
Authors:
Fabio Bernasconi,
Andrea Fanelli,
Julia Schneider,
Susanna Zimmermann
Abstract:
We prove the Sarkisov program for projective surfaces over excellent base rings, including the case of non-perfect base fields of characteristic p>0. We classify the Sarkisov links between Mori fibre spaces and their relations for regular surfaces, generalising work of Iskovskikh. As an application, we discuss rationality problems for regular surfaces and the structure of the plane Cremona group.
Submitted 4 April, 2024;
originally announced April 2024.
-
Dynamics of nano-scale assemblies of amphiphilic PEG-PDMS-PEG copolymers
Authors:
Sudipta Gupta,
Rasangi M. Perera,
Christopher J. Van Leeuwen,
Tianyu Li,
Laura Stingaciu,
Markus Bleuel,
Kunlun Hong,
Gerald J. Schneider
Abstract:
Micelles and vesicles are promising candidates in targeted drug/gene delivery, bioreactors, and templates for nanoparticle synthesis. We investigated the morphology and dynamics of PEG-PDMS-PEG triblock copolymer nano-scale assemblies regarding the membrane dynamics because the molecular dynamics of the membrane govern mechanical properties like the stability of a membrane. We studied the structure by cryogenic transmission electron microscopy, small-angle neutron scattering, and the dynamics by dynamic light scattering and neutron spin echo spectroscopy. We changed the length of the hydrophilic block to obtain micellar and vesicular systems. The vesicle has a membrane rigidity, $κ_η= 16 \pm 2 k_B T$, the same order of magnitude as the corresponding liposome value but one order of magnitude higher than polymeric interfaces in microemulsions. Hence, the height-height fluctuations of polymers in a polymersome seem much less than those measured for surfactants at an oil-water interface. Therefore, the polymersome is substantially more stable. The value is very close to liposomes, indicating a similar stability.
Submitted 29 March, 2024;
originally announced April 2024.
-
Ion-Mediated Structural Discontinuities in Phospholipid Vesicles
Authors:
Judith U. De Mel,
Sudipta Gupta,
Gerald J. Schneider
Abstract:
Despite intense research, methods for controlling soft matter's spontaneous self-assembly into well-defined layers remain a significant challenge. We observed ion-induced structural discontinuities of phospholipid vesicles that can be exploited for controlled self-assembly of soft materials. We used dynamic light scattering, zeta-potential measurement, cryo-electron microscopy, small-angle X-ray, and small-angle neutron scattering. All the experimental observations indicate that declining solvent quality and increasing osmotic pressure direct lipids to expel preferentially to the inner compartment. Upon reaching a critical concentration, excess lipids can form a new bilayer. This spontaneous self-assembly process causes simultaneous shrinkage of the aqueous core and expansion of the vesicle. This approach opens an intriguing path for controlling the self-assembly of bioinspired colloids, which can also serve as a vehicle to control the polymerization of multilayer polymeric systems.
Submitted 28 March, 2024;
originally announced March 2024.
-
Prediction and identification of point defect fingerprints in the X-ray photoelectron spectra of TiN$_x$
Authors:
Pavel Ondračka,
Pauline Kümmerl,
Marcus Hans,
Stanislav Mráz,
Daniel Primetzhofer,
David Holec,
Petr Vašina,
Jochen M. Schneider
Abstract:
We investigate the effect of selected N and Ti point defects in $B$1 TiN on N 1s and Ti 2p$_{3/2}$ binding energies (BE) by experiments and ab initio calculations. X-ray photoelectron spectroscopy (XPS) measurements of Ti-deficient TiN films reveal additional N 1s spectral components at lower binding energies. Ab initio calculations predict that these components are caused by either Ti vacancies, which induce a N 1s BE shift of $-0.53$ eV in its first N neighbors, and/or N tetrahedral interstitials, which have their N 1s BE shifted by $-1.18$ eV and also shift BE of their first N neighbors by $-0.53$ eV. However, the ab initio calculations also reveal that the tetrahedral N interstitial is unstable at room temperature. We, therefore, unambiguously attribute the detected signal to Ti vacancies. Furthermore, the vacancy concentration in Ti-deficient TiN was quantified with XPS supported by ab initio calculations. The largest BE shifts of $-1.53$, $-1.80$ and $-2.28$ eV for Ti 2p$_{3/2}$ electrons are predicted for the Ti tetrahedral, split (10$\overline{1}$)-aligned and split (111)-aligned interstitial atoms, respectively, and we, therefore, propose XPS could detect them. Other defects such as N vacancy or N split (10$\overline{1}$)-aligned interstitial introduce smaller N 1s and Ti 2p$_{3/2}$ BE shifts and are unlikely to be detectable experimentally. Our work highlights the potential of ab initio-guided XPS measurements in detecting and quantifying point defects in $B$1 TiN.
Submitted 28 March, 2024;
originally announced March 2024.
-
Using quantum computers in control: interval matrix properties
Authors:
Jan Schneider,
Julian Berberich
Abstract:
Quantum computing provides a powerful framework for tackling computational problems that are classically intractable. The goal of this paper is to explore the use of quantum computers for solving relevant problems in systems and control theory. In the recent literature, different quantum algorithms have been developed to tackle binary optimization, which plays an important role in various control-theoretic problems. As a prototypical example, we consider the verification of interval matrix properties such as non-singularity and stability on a quantum computer. We present a quantum algorithm solving these problems and we study its performance in simulation. Our results demonstrate that quantum computers provide a promising tool for control whose applicability to further computationally complex problems remains to be explored.
Submitted 26 March, 2024;
originally announced March 2024.
-
Governance of Generative Artificial Intelligence for Companies
Authors:
Johannes Schneider,
Rene Abraham,
Christian Meske
Abstract:
Generative Artificial Intelligence (GenAI), specifically large language models like ChatGPT, has swiftly entered organizations without adequate governance, posing both opportunities and risks. Despite extensive debates on GenAI's transformative nature and regulatory measures, limited research addresses organizational governance, encompassing technical and business perspectives. Our review paper fills this gap by surveying recent works with the purpose of developing a framework for GenAI governance within companies. This framework outlines the scope, objectives, and governance mechanisms tailored to harness business opportunities as well as mitigate risks associated with GenAI integration. Our research contributes a focused approach to GenAI governance, offering practical insights for companies navigating the challenges of GenAI adoption and highlighting research gaps.
Submitted 9 June, 2024; v1 submitted 5 February, 2024;
originally announced March 2024.
-
Tractable Joint Prediction and Planning over Discrete Behavior Modes for Urban Driving
Authors:
Adam Villaflor,
Brian Yang,
Huangyuan Su,
Katerina Fragkiadaki,
John Dolan,
Jeff Schneider
Abstract:
Significant progress has been made in training multimodal trajectory forecasting models for autonomous driving. However, effectively integrating these models with downstream planners and model-based control approaches is still an open problem. Although these models have conventionally been evaluated for open-loop prediction, we show that they can be used to parameterize autoregressive closed-loop models without retraining. We consider recent trajectory prediction approaches which leverage learned anchor embeddings to predict multiple trajectories, finding that these anchor embeddings can parameterize discrete and distinct modes representing high-level driving behaviors. We propose to perform fully reactive closed-loop planning over these discrete latent modes, allowing us to tractably model the causal interactions between agents at each step. We validate our approach on a suite of more dynamic merging scenarios, finding that our approach avoids the $\textit{frozen robot problem}$ which is pervasive in conventional planners. Our approach also outperforms the previous state-of-the-art in CARLA on challenging dense traffic scenarios when evaluated at realistic speeds.
Submitted 11 March, 2024;
originally announced March 2024.
-
Human vs. Machine: Behavioral Differences Between Expert Humans and Language Models in Wargame Simulations
Authors:
Max Lamparth,
Anthony Corso,
Jacob Ganz,
Oriana Skylar Mastro,
Jacquelyn Schneider,
Harold Trinkunas
Abstract:
To some, the advent of artificial intelligence (AI) promises better decision-making and increased military effectiveness while reducing the influence of human error and emotions. However, there is still debate about how AI systems, especially large language models (LLMs), behave compared to humans in high-stakes military decision-making scenarios with the potential for increased risks towards escalation and unnecessary conflicts. To test this potential and scrutinize the use of LLMs for such purposes, we use a new wargame experiment with 107 national security experts designed to look at crisis escalation in a fictional US-China scenario and compare human players to LLM-simulated responses in separate simulations. Wargames have a long history in the development of military strategy and the response of nations to threats or attacks. Here, we show a considerable high-level agreement in the LLM and human responses and significant quantitative and qualitative differences in individual actions and strategic tendencies. These differences depend on intrinsic biases in LLMs regarding the appropriate level of violence following strategic instructions, the choice of LLM, and whether the LLMs are tasked to decide for a team of players directly or first to simulate dialog between players. When simulating the dialog, the discussions lack quality and maintain a farcical harmony. The LLM simulations cannot account for human player characteristics, showing no significant difference even for extreme traits, such as "pacifist" or "aggressive sociopath". Our results motivate policymakers to be cautious before granting autonomy or following AI-based strategy recommendations.
Submitted 3 June, 2024; v1 submitted 5 March, 2024;
originally announced March 2024.
-
Observation of Seven Astrophysical Tau Neutrino Candidates with IceCube
Authors:
IceCube Collaboration,
R. Abbasi,
M. Ackermann,
J. Adams,
S. K. Agarwalla,
J. A. Aguilar,
M. Ahlers,
J. M. Alameddine,
N. M. Amin,
K. Andeen,
G. Anton,
C. Argüelles,
Y. Ashida,
S. Athanasiadou,
S. N. Axani,
X. Bai,
A. Balagopal V.,
M. Baricevic,
S. W. Barwick,
V. Basu,
R. Bay,
J. J. Beatty,
J. Becker Tjus,
J. Beise,
C. Bellenghi
, et al. (380 additional authors not shown)
Abstract:
We report on a measurement of astrophysical tau neutrinos with 9.7 years of IceCube data. Using convolutional neural networks trained on images derived from simulated events, seven candidate $ν_τ$ events were found with visible energies ranging from roughly 20 TeV to 1 PeV and a median expected parent $ν_τ$ energy of about 200 TeV. Considering backgrounds from astrophysical and atmospheric neutrinos, and muons from $π^\pm/K^\pm$ decays in atmospheric air showers, we obtain a total estimated background of about 0.5 events, dominated by non-$ν_τ$ astrophysical neutrinos. Thus, we rule out the absence of astrophysical $ν_τ$ at the $5σ$ level. The measured astrophysical $ν_τ$ flux is consistent with expectations based on previously published IceCube astrophysical neutrino flux measurements and neutrino oscillations.
Submitted 26 March, 2024; v1 submitted 4 March, 2024;
originally announced March 2024.
-
Improved modeling of in-ice particle showers for IceCube event reconstruction
Authors:
R. Abbasi,
M. Ackermann,
J. Adams,
S. K. Agarwalla,
J. A. Aguilar,
M. Ahlers,
J. M. Alameddine,
N. M. Amin,
K. Andeen,
G. Anton,
C. Argüelles,
Y. Ashida,
S. Athanasiadou,
L. Ausborm,
S. N. Axani,
X. Bai,
A. Balagopal V.,
M. Baricevic,
S. W. Barwick,
S. Bash,
V. Basu,
R. Bay,
J. J. Beatty,
J. Becker Tjus,
J. Beise
, et al. (394 additional authors not shown)
Abstract:
The IceCube Neutrino Observatory relies on an array of photomultiplier tubes to detect Cherenkov light produced by charged particles in the South Pole ice. IceCube data analyses depend on an in-depth characterization of the glacial ice, and on novel approaches in event reconstruction that utilize fast approximations of photoelectron yields. Here, a more accurate model is derived for event reconstruction that better captures our current knowledge of ice optical properties. When evaluated on a Monte Carlo simulation set, the median angular resolution for in-ice particle showers improves by over a factor of three compared to a reconstruction based on a simplified model of the ice. The most substantial improvement is obtained when including effects of birefringence due to the polycrystalline structure of the ice. When evaluated on data classified as particle showers in the high-energy starting events sample, a significantly improved description of the events is observed.
Submitted 22 April, 2024; v1 submitted 4 March, 2024;
originally announced March 2024.
-
Characterization of the Astrophysical Diffuse Neutrino Flux using Starting Track Events in IceCube
Authors:
R. Abbasi,
M. Ackermann,
J. Adams,
S. K. Agarwalla,
J. A. Aguilar,
M. Ahlers,
J. M. Alameddine,
N. M. Amin,
K. Andeen,
G. Anton,
C. Argüelles,
Y. Ashida,
S. Athanasiadou,
L. Ausborm,
S. N. Axani,
X. Bai,
A. Balagopal V.,
M. Baricevic,
S. W. Barwick,
S. Bash,
V. Basu,
R. Bay,
J. J. Beatty,
J. Becker Tjus,
J. Beise
, et al. (394 additional authors not shown)
Abstract:
A measurement of the diffuse astrophysical neutrino spectrum is presented using IceCube data collected from 2011-2022 (10.3 years). We developed novel detection techniques to search for events with a contained vertex and exiting track induced by muon neutrinos undergoing a charged-current interaction. Searching for these starting track events allows us to more effectively reject not only atmospheric muons but also atmospheric neutrino backgrounds in the southern sky, opening a new window to the sub-100 TeV astrophysical neutrino sky. The event selection is constructed using a dynamic starting track veto and machine learning algorithms. We use this data to measure the astrophysical diffuse flux as a single power law flux (SPL) with a best-fit spectral index of $γ= 2.58 ^{+0.10}_{-0.09}$ and per-flavor normalization of $φ^{\mathrm{Astro}}_{\mathrm{per-flavor}} = 1.68 ^{+0.19}_{-0.22} \times 10^{-18} \times \mathrm{GeV}^{-1} \mathrm{cm}^{-2} \mathrm{s}^{-1} \mathrm{sr}^{-1}$ (at 100 TeV). The sensitive energy range for this dataset is 3 - 550 TeV under the SPL assumption. This data was also used to measure the flux under a broken power law; however, we did not find any evidence of a low-energy cutoff.
Submitted 27 February, 2024;
originally announced February 2024.
-
Pareto-Optimal Algorithms for Learning in Games
Authors:
Eshwar Ram Arunachaleswaran,
Natalie Collina,
Jon Schneider
Abstract:
We study the problem of characterizing optimal learning algorithms for playing repeated games against an adversary with unknown payoffs. In this problem, the first player (called the learner) commits to a learning algorithm against a second player (called the optimizer), and the optimizer best-responds by choosing the optimal dynamic strategy for their (unknown but well-defined) payoff. Classic learning algorithms (such as no-regret algorithms) provide some counterfactual guarantees for the learner, but might perform much more poorly than other learning algorithms against particular optimizer payoffs.
In this paper, we introduce the notion of asymptotically Pareto-optimal learning algorithms. Intuitively, if a learning algorithm is Pareto-optimal, then there is no other algorithm which performs asymptotically at least as well against all optimizers and performs strictly better (by at least $Ω(T)$) against some optimizer. We show that well-known no-regret algorithms such as Multiplicative Weights and Follow The Regularized Leader are Pareto-dominated. However, while no-regret is not enough to ensure Pareto-optimality, we show that a strictly stronger property, no-swap-regret, is a sufficient condition for Pareto-optimality.
Proving these results requires us to address various technical challenges specific to repeated play, including the fact that there is no simple characterization of how optimizers who are rational in the long-term best-respond against a learning algorithm over multiple rounds of play. To address this, we introduce the idea of the asymptotic menu of a learning algorithm: the convex closure of all correlated distributions over strategy profiles that are asymptotically implementable by an adversary. We show that all no-swap-regret algorithms share the same asymptotic menu, implying that all no-swap-regret algorithms are ``strategically equivalent''.
Submitted 14 February, 2024;
originally announced February 2024.
-
Strategically-Robust Learning Algorithms for Bidding in First-Price Auctions
Authors:
Rachitesh Kumar,
Jon Schneider,
Balasubramanian Sivan
Abstract:
Learning to bid in repeated first-price auctions is a fundamental problem at the interface of game theory and machine learning, which has seen a recent surge in interest due to the transition of display advertising to first-price auctions. In this work, we propose a novel concave formulation for pure-strategy bidding in first-price auctions, and use it to analyze natural Gradient-Ascent-based algorithms for this problem. Importantly, our analysis goes beyond regret, which was the typical focus of past work, and also accounts for the strategic backdrop of online-advertising markets where bidding algorithms are deployed -- we provide the first guarantees of strategic-robustness and incentive-compatibility for Gradient Ascent.
Concretely, we show that our algorithms achieve $O(\sqrt{T})$ regret when the highest competing bids are generated adversarially, and show that no online algorithm can do better. We further prove that the regret reduces to $O(\log T)$ when the competition is stationary and stochastic, which drastically improves upon the previous best of $O(\sqrt{T})$. Moving beyond regret, we show that a strategic seller cannot exploit our algorithms to extract more revenue on average than is possible under the optimal mechanism. Finally, we prove that our algorithm is also incentive compatible -- it is a (nearly) dominant strategy for the buyer to report her values truthfully to the algorithm as a whole. Altogether, these guarantees make our algorithms the first to simultaneously achieve both optimal regret and strategic-robustness.
Submitted 7 July, 2024; v1 submitted 11 February, 2024;
originally announced February 2024.
-
Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous Driving and Zero-Shot Instruction Following
Authors:
Brian Yang,
Huangyuan Su,
Nikolaos Gkanatsios,
Tsung-Wei Ke,
Ayush Jain,
Jeff Schneider,
Katerina Fragkiadaki
Abstract:
Diffusion models excel at modeling complex and multimodal trajectory distributions for decision-making and control. Reward-gradient guided denoising has been recently proposed to generate trajectories that maximize both a differentiable reward function and the likelihood under the data distribution captured by a diffusion model. Reward-gradient guided denoising requires a differentiable reward function fitted to both clean and noised samples, limiting its applicability as a general trajectory optimizer. In this paper, we propose Diffusion-ES, a method that combines gradient-free optimization with trajectory denoising to optimize black-box non-differentiable objectives while staying in the data manifold. Diffusion-ES samples trajectories during evolutionary search from a diffusion model and scores them using a black-box reward function. It mutates high-scoring trajectories using a truncated diffusion process that applies a small number of noising and denoising steps, allowing for much more efficient exploration of the solution space. We show that Diffusion-ES achieves state-of-the-art performance on nuPlan, an established closed-loop planning benchmark for autonomous driving. Diffusion-ES outperforms existing sampling-based planners, reactive deterministic or diffusion-based policies, and reward-gradient guidance. Additionally, we show that unlike prior guidance methods, our method can optimize non-differentiable language-shaped reward functions generated by few-shot LLM prompting. When guided by a human teacher that issues instructions to follow, our method can generate novel, highly complex behaviors, such as aggressive lane weaving, which are not present in the training data. This allows us to solve the hardest nuPlan scenarios which are beyond the capabilities of existing trajectory optimization methods and driving policies.
Submitted 9 February, 2024;
originally announced February 2024.
-
Learning to Control Emulated Muscles in Real Robots: Towards Exploiting Bio-Inspired Actuator Morphology
Authors:
Pierre Schumacher,
Lorenz Krause,
Jan Schneider,
Dieter Büchler,
Georg Martius,
Daniel Haeufle
Abstract:
Recent studies have demonstrated the immense potential of exploiting muscle actuator morphology for natural and robust movement -- in simulation. Validation on real robotic hardware is still missing. In this study, we emulate muscle actuator properties on hardware in real-time, taking advantage of modern and affordable electric motors. We demonstrate that our setup can emulate a simplified muscle model on a real robot while being controlled by a learned policy. We improve upon an existing muscle model by deriving a damping rule that ensures that the model is not only performant and stable but also tuneable for the real hardware. Our policies are trained by reinforcement learning entirely in simulation, where we show that previously reported benefits of muscles extend to the case of quadruped locomotion and hopping: the learned policies are more robust and exhibit more regular gaits. Finally, we confirm that the learned policies can be executed on real hardware and show that sim-to-real transfer with real-time emulated muscles on a quadruped robot is possible. These results show that artificial muscles can be highly beneficial actuators for future generations of robust legged robots.
Submitted 7 February, 2024;
originally announced February 2024.
-
Contracting with a Learning Agent
Authors:
Guru Guruganesh,
Yoav Kolumbus,
Jon Schneider,
Inbal Talgam-Cohen,
Emmanouil-Vasileios Vlatakis-Gkaragkounis,
Joshua R. Wang,
S. Matthew Weinberg
Abstract:
Many real-life contractual relations differ completely from the clean, static model at the heart of principal-agent theory. Typically, they involve repeated strategic interactions of the principal and agent, taking place under uncertainty and over time. While appealing in theory, players seldom use complex dynamic strategies in practice, often preferring to circumvent complexity and approach uncertainty through learning. We initiate the study of repeated contracts with a learning agent, focusing on agents who achieve no-regret outcomes.
Optimizing against a no-regret agent is a known open problem in general games; we achieve an optimal solution to this problem for a canonical contract setting, in which the agent's choice among multiple actions leads to success/failure. The solution has a surprisingly simple structure: for some $α> 0$, initially offer the agent a linear contract with scalar $α$, then switch to offering a linear contract with scalar $0$. This switch causes the agent to ``free-fall'' through their action space and during this time provides the principal with non-zero reward at zero cost. Despite apparent exploitation of the agent, this dynamic contract can leave \emph{both} players better off compared to the best static contract. Our results generalize beyond success/failure, to arbitrary non-linear contracts which the principal rescales dynamically.
Finally, we quantify the dependence of our results on knowledge of the time horizon, and are the first to address this consideration in the study of strategizing against learning agents.
Submitted 29 January, 2024;
originally announced January 2024.
-
Citizen Science for IceCube: Name that Neutrino
Authors:
R. Abbasi,
M. Ackermann,
J. Adams,
S. K. Agarwalla,
J. A. Aguilar,
M. Ahlers,
J. M. Alameddine,
N. M. Amin,
K. Andeen,
G. Anton,
C. Argüelles,
Y. Ashida,
S. Athanasiadou,
L. Ausborm,
S. N. Axani,
X. Bai,
A. Balagopal V.,
M. Baricevic,
S. W. Barwick,
V. Basu,
R. Bay,
J. J. Beatty,
J. Becker Tjus,
J. Beise,
C. Bellenghi
, et al. (391 additional authors not shown)
Abstract:
Name that Neutrino is a citizen science project where volunteers aid in classification of events for the IceCube Neutrino Observatory, an immense particle detector at the geographic South Pole. From March 2023 to September 2023, volunteers classified videos produced from simulated data of both neutrino signal and background interactions. Name that Neutrino obtained more than 128,000 classifications by over 1,800 registered volunteers that were compared to results obtained by a deep neural network machine-learning algorithm. Possible improvements for both Name that Neutrino and the deep neural network are discussed.
Submitted 22 January, 2024;
originally announced January 2024.
-
Mixed identities for oligomorphic automorphism groups
Authors:
Manuel Bodirsky,
Jakob Schneider,
Andreas Thom
Abstract:
We study mixed identities for oligomorphic automorphism groups of countable relational structures. Our main result gives sufficient conditions for such a group to not admit a mixed identity without particular constants. We study numerous examples and prove in many cases that there cannot be a non-singular mixed identity.
Submitted 17 January, 2024;
originally announced January 2024.
-
Identifying Policy Gradient Subspaces
Authors:
Jan Schneider,
Pierre Schumacher,
Simon Guist,
Le Chen,
Daniel Häufle,
Bernhard Schölkopf,
Dieter Büchler
Abstract:
Policy gradient methods hold great potential for solving complex continuous control tasks. Still, their training efficiency can be improved by exploiting structure within the optimization problem. Recent work indicates that supervised learning can be accelerated by leveraging the fact that gradients lie in a low-dimensional and slowly-changing subspace. In this paper, we conduct a thorough evaluation of this phenomenon for two popular deep policy gradient methods on various simulated benchmark tasks. Our results demonstrate the existence of such gradient subspaces despite the continuously changing data distribution inherent to reinforcement learning. These findings reveal promising directions for future work on more efficient reinforcement learning, e.g., through improving parameter-space exploration or enabling second-order optimization.
Submitted 18 March, 2024; v1 submitted 12 January, 2024;
originally announced January 2024.
-
ML-On-Rails: Safeguarding Machine Learning Models in Software Systems A Case Study
Authors:
Hala Abdelkader,
Mohamed Abdelrazek,
Scott Barnett,
Jean-Guy Schneider,
Priya Rani,
Rajesh Vasa
Abstract:
Machine learning (ML), especially with the emergence of large language models (LLMs), has significantly transformed various industries. However, the transition from ML model prototyping to production use within software systems presents several challenges. These challenges primarily revolve around ensuring safety, security, and transparency, subsequently influencing the overall robustness and trustworthiness of ML models. In this paper, we introduce ML-On-Rails, a protocol designed to safeguard ML models, establish a well-defined endpoint interface for different ML tasks, and enable clear communication between ML providers and ML consumers (software engineers). ML-On-Rails enhances the robustness of ML models by incorporating detection capabilities to identify unique challenges specific to production ML. We evaluated the ML-On-Rails protocol through a real-world case study of the MoveReminder application. Through this evaluation, we emphasize the importance of safeguarding ML models in production.
Submitted 12 January, 2024;
originally announced January 2024.
-
Condensate evolution in the solar nebula inferred from combined Cr, Ti, and O isotope analyses of amoeboid olivine aggregates
Authors:
Christian A. Jansen,
Christoph Burkhardt,
Yves Marrocchi,
Jonas M. Schneider,
Elias Wölfer,
Thorsten Kleine
Abstract:
Refractory inclusions in chondritic meteorites, namely amoeboid olivine aggregates (AOAs) and Ca-Al-rich inclusions (CAIs), are among the first solids to have formed in the solar system. The isotopic composition of CAIs is distinct from bulk meteorites, which either results from extreme processing of presolar carriers in the CAI-forming region, or reflects an inherited heterogeneity from the Sun's parental molecular cloud. Amoeboid olivine aggregates are less refractory than CAIs and provide a record of how the isotopic composition of solid material in the disk may have changed in time and space. However, the isotopic composition of AOAs and how this composition relates to that of CAIs and later-formed solids is unknown. Here, using new O, Ti, and Cr isotopic data for eight AOAs from the Allende CV3 chondrite, we show that CAIs and AOAs share a common isotopic composition, indicating a close genetic link and formation from the same isotopic reservoir. Because AOAs are less refractory than CAIs, this observation is difficult to reconcile with a thermal processing origin of the isotope anomalies. Instead, the common isotopic composition of CAIs and AOAs is readily accounted for in a model in which the isotopic composition of infalling material from the Sun's parental molecular cloud changed over time. In this model, CAIs and AOAs record the isotopic composition of the early infall, while later-formed solids contain a larger fraction of the later, isotopically distinct infall. This model implies that CAIs and AOAs record the isotopic composition of the Sun and suggests that the nucleosynthetic isotope heterogeneity of the solar system is predominantly produced by mixing of solar nebula condensates, which acquired their distinct isotopic compositions as a result of time-varied infall from the protosolar cloud.
Submitted 12 January, 2024;
originally announced January 2024.
-
Escalation Risks from Language Models in Military and Diplomatic Decision-Making
Authors:
Juan-Pablo Rivera,
Gabriel Mukobi,
Anka Reuel,
Max Lamparth,
Chandler Smith,
Jacquelyn Schneider
Abstract:
Governments are increasingly considering integrating autonomous AI agents in high-stakes military and foreign-policy decision-making, especially with the emergence of advanced generative AI models like GPT-4. Our work aims to scrutinize the behavior of multiple AI agents in simulated wargames, specifically focusing on their predilection to take escalatory actions that may exacerbate multilateral conflicts. Drawing on political science and international relations literature about escalation dynamics, we design a novel wargame simulation and scoring framework to assess the escalation risks of actions taken by these agents in different scenarios. Contrary to prior studies, our research provides both qualitative and quantitative insights and focuses on large language models (LLMs). We find that all five studied off-the-shelf LLMs show forms of escalation and difficult-to-predict escalation patterns. We observe that models tend to develop arms-race dynamics, leading to greater conflict, and in rare cases, even to the deployment of nuclear weapons. Qualitatively, we also collect the models' reported reasonings for chosen actions and observe worrying justifications based on deterrence and first-strike tactics. Given the high stakes of military and foreign-policy contexts, we recommend further examination and cautious consideration before deploying autonomous language model agents for strategic military or diplomatic decision-making.
Submitted 7 January, 2024;
originally announced January 2024.
-
Decentralized Multi-Agent Active Search and Tracking when Targets Outnumber Agents
Authors:
Arundhati Banerjee,
Jeff Schneider
Abstract:
Multi-agent multi-target tracking has a wide range of applications, including wildlife patrolling, security surveillance or environment monitoring. Such algorithms often make restrictive assumptions: the number of targets and/or their initial locations may be assumed known, or agents may be pre-assigned to monitor disjoint partitions of the environment, reducing the burden of exploration. This also limits applicability when there are fewer agents than targets, since agents are unable to continuously follow the targets in their fields of view. Multi-agent tracking algorithms additionally assume inter-agent synchronization of observations, or the presence of a central controller to coordinate joint actions. Instead, we focus on the setting of decentralized multi-agent, multi-target, simultaneous active search-and-tracking with asynchronous inter-agent communication. Our proposed algorithm DecSTER uses a sequential Monte Carlo implementation of the probability hypothesis density filter for posterior inference combined with Thompson sampling for decentralized multi-agent decision making. We compare different action selection policies, focusing on scenarios where targets outnumber agents. In simulation, we demonstrate that DecSTER is robust to unreliable inter-agent communication and outperforms information-greedy baselines in terms of the Optimal Sub-Pattern Assignment (OSPA) metric for different numbers of targets and varying team sizes.
Submitted 9 January, 2024; v1 submitted 6 January, 2024;
originally announced January 2024.