subscribe to arXiv mailings

Towards a theory of learning dynamics in deep state space models

Authors: Jakub Smékal, Jimmy T. H. Smith, Michael Kleinman, Dan Biderman, Scott W. Linderman

Abstract: State space models (SSMs) have shown remarkable empirical performance on many long sequence modeling tasks, but a theoretical understanding of these models is still lacking. In this work, we study the learning dynamics of linear SSMs to understand how covariance structure in data, latent state size, and initialization affect the evolution of parameters throughout learning with gradient descent. We… ▽ More State space models (SSMs) have shown remarkable empirical performance on many long sequence modeling tasks, but a theoretical understanding of these models is still lacking. In this work, we study the learning dynamics of linear SSMs to understand how covariance structure in data, latent state size, and initialization affect the evolution of parameters throughout learning with gradient descent. We show that focusing on the learning dynamics in the frequency domain affords analytical solutions under mild assumptions, and we establish a link between one-dimensional SSMs and the dynamics of deep linear feed-forward networks. Finally, we analyze how latent state over-parameterization affects convergence time and describe future work in extending our results to the study of deep SSMs with nonlinear connections. This work is a step toward a theory of learning dynamics in deep state space models. △ Less

Submitted 9 July, 2024; originally announced July 2024.

arXiv:2407.07091 [pdf, other]

General Relativistic effects and the NIR variability of Sgr A* II: A systematic approach to temporal asymmetry

Authors: Sebastiano D. von Fellenberg, Gunther Witzel, Michi Bauboeck, Hui-Hsuan Chung, Nicola Marchili, Greg Martinez, Matteo Sadun-Bordoni, Guillaume Bourdarot, Tuan Do, Antonia Drescher, Giovanni Fazio, Frank Eisenhauer, Reinhard Genzel, Stefan Gillessen, Joseph L. Hora, Felix Mang, Thomas Ott, Howard A. Smith, Eduardo Ros, Diogo C. Ribeiro, Felix Widmann, S. P. Willner, J. Anton Zensus

Abstract: A systematic study, based on the third-moment structure function, of Sgr A*'s variability finds an exponential rise time $τ_{1,\rm{obs}}=14.8^{+0.4}_{-1.5}~\mathrm{minutes}$ and decay time $τ_{2,\rm{obs}}=13.1^{+1.3}_{-1.4}~\mathrm{minutes}$. This symmetry of the flux-density variability is consistent with earlier work, and we interpret it as caused by the dominance of Doppler boosting, as opposed… ▽ More A systematic study, based on the third-moment structure function, of Sgr A*'s variability finds an exponential rise time $τ_{1,\rm{obs}}=14.8^{+0.4}_{-1.5}~\mathrm{minutes}$ and decay time $τ_{2,\rm{obs}}=13.1^{+1.3}_{-1.4}~\mathrm{minutes}$. This symmetry of the flux-density variability is consistent with earlier work, and we interpret it as caused by the dominance of Doppler boosting, as opposed to gravitational lensing, in Sgr~A*'s light curve. A relativistic, semi-physical model of Sgr~A* confirms an inclination angle $i<45$ degrees. The model also shows that the emission of the intrinsic radiative process can have some asymmetry even though the observed emission does not. The third-moment structure function, which is a measure of the skewness of the light-curve increments, may be a useful summary statistic in other contexts of astronomy because it senses only temporal asymmetry, i.e., it averages to zero for any temporally symmetric signal. △ Less

Submitted 9 July, 2024; originally announced July 2024.

Comments: Accepted for publication in A&A letters

arXiv:2406.15845 [pdf, other]

Quantum geometry embedded in unitarity of evolution: revealing its impacts as quantum oscillation and dephasing in spin resonance and crystal bands

Authors: B. Q. Song, J. D. H. Smith, T. Jiang, Y. X. Yao, J. Wang

Abstract: Quantum Hall effects provide intuitive ways of revealing the topology in crystals, i.e., each quantized "step" represents a distinct topological state. Here, we seek a counterpart for "visualizing" quantum geometry, which is a broader concept. We show how geometry emerges in quantum as an intrinsic consequence of unitary evolution, independent of specific details or approximations, suggesting quan… ▽ More Quantum Hall effects provide intuitive ways of revealing the topology in crystals, i.e., each quantized "step" represents a distinct topological state. Here, we seek a counterpart for "visualizing" quantum geometry, which is a broader concept. We show how geometry emerges in quantum as an intrinsic consequence of unitary evolution, independent of specific details or approximations, suggesting quantum geometry may have widespread applicability. Indeed, we exemplify geometric observables, such as oscillation, dephasing, in spin and band scenarios. These phenomena are robust owing to the continuity of geometry, and can be tuned by geometric parameters. Anomalies, supported by both analytic and numerical solutions, underscore the advantages of adopting a geometric perspective, potentially yielding distinguishable experimental signatures. △ Less

Submitted 22 June, 2024; originally announced June 2024.

Comments: 5 pages, 3 figures

arXiv:2406.15379 [pdf, other]

CS1-LLM: Integrating LLMs into CS1 Instruction

Authors: Annapurna Vadaparty, Daniel Zingaro, David H. Smith IV, Mounika Padala, Christine Alvarado, Jamie Gorson Benario, Leo Porter

Abstract: The recent, widespread availability of Large Language Models (LLMs) like ChatGPT and GitHub Copilot may impact introductory programming courses (CS1) both in terms of what should be taught and how to teach it. Indeed, recent research has shown that LLMs are capable of solving the majority of the assignments and exams we previously used in CS1. In addition, professional software engineers are often… ▽ More The recent, widespread availability of Large Language Models (LLMs) like ChatGPT and GitHub Copilot may impact introductory programming courses (CS1) both in terms of what should be taught and how to teach it. Indeed, recent research has shown that LLMs are capable of solving the majority of the assignments and exams we previously used in CS1. In addition, professional software engineers are often using these tools, raising the question of whether we should be training our students in their use as well. This experience report describes a CS1 course at a large research-intensive university that fully embraces the use of LLMs from the beginning of the course. To incorporate the LLMs, the course was intentionally altered to reduce emphasis on syntax and writing code from scratch. Instead, the course now emphasizes skills needed to successfully produce software with an LLM. This includes explaining code, testing code, and decomposing large problems into small functions that are solvable by an LLM. In addition to frequent, formative assessments of these skills, students were given three large, open-ended projects in three separate domains (data science, image processing, and game design) that allowed them to showcase their creativity in topics of their choosing. In an end-of-term survey, students reported that they appreciated learning with the assistance of the LLM and that they interacted with the LLM in a variety of ways when writing code. We provide lessons learned for instructors who may wish to incorporate LLMs into their course. △ Less

Submitted 17 April, 2024; originally announced June 2024.

Comments: to be published in Proceedings of the 29th ACM conference on innovation and technology in computer science education (ITiCSE)

arXiv:2406.04147 [pdf, other]

Direct optimization of neoclassical ion transport in stellarator reactors

Authors: B. F. Lee, S. A. Lazerson, H. M. Smith, C. D. Beidler, N. A. Pablant

Abstract: We directly optimize stellarator neoclassical ion transport while holding neoclassical electron transport at a moderate level, creating a scenario favorable for impurity expulsion and retaining good ion confinement. Traditional neoclassical stellarator optimization has focused on minimizing $ε_\mathrm{eff}$, the geometric factor that characterizes the amount of radial transport due to particles in… ▽ More We directly optimize stellarator neoclassical ion transport while holding neoclassical electron transport at a moderate level, creating a scenario favorable for impurity expulsion and retaining good ion confinement. Traditional neoclassical stellarator optimization has focused on minimizing $ε_\mathrm{eff}$, the geometric factor that characterizes the amount of radial transport due to particles in the $1/ν$ regime. Under expected reactor-relevant conditions, core electrons will be in the $1/ν$ regime and core fuel ions will be in the $\sqrtν$ regime. Traditional optimizations thus minimize electron transport and rely on the radial electric field $\left(E_r\right)$ that develops to confine the ions. This often results in an inward-pointing $E_r$ that drives high-$Z$ impurities into the core, which may be troublesome in future reactors. In our optimizations, we increase the ratio of the thermal transport coefficients $L_{1 1}^{e}/L_{1 1}^{i}$, which previous work has shown can create an outward-pointing $E_r$. This effect is very beneficial for impurity expulsion. We obtain self-consistent density, temperature, and $E_r$ profiles at reactor-relevant conditions for optimized equilibria. These equilibria are expected to enjoy significantly improved impurity transport properties. We conclude by providing several directions of future research that may help further improve the presented optimization algorithm. △ Less

Submitted 6 June, 2024; originally announced June 2024.

arXiv:2406.03629 [pdf, ps, other]

Iterates of Quadratics and Monogenicity

Authors: Hanson Smith, Zack Wolske

Abstract: We investigate monogenicity and prime splitting in extensions generated by roots of iterated quadratic polynomials. Let $f(x)\in\mathbb{Z}[x]$ be an irreducible, monic, quadratic polynomial, and write $f^n(x)$ for the $n^{\text{th}}$ iterate. We obtain necessary and sufficient conditions for $f^n(x)$ to be monogenic for each $n$. We use this to construct multiple families where $f^n(x)$ is monogen… ▽ More We investigate monogenicity and prime splitting in extensions generated by roots of iterated quadratic polynomials. Let $f(x)\in\mathbb{Z}[x]$ be an irreducible, monic, quadratic polynomial, and write $f^n(x)$ for the $n^{\text{th}}$ iterate. We obtain necessary and sufficient conditions for $f^n(x)$ to be monogenic for each $n$. We use this to construct multiple families where $f^n(x)$ is monogenic for every $n>0$. △ Less

Submitted 5 June, 2024; originally announced June 2024.

Comments: 13 pages. Comments welcome!

MSC Class: 11R04; 11R11; 11R21; 37P05

arXiv:2406.01671 [pdf, other]

Multiwavelength Observations of Sgr A*. II. 2019 July 21 and 26

Authors: Joseph M. Michail, Farhad Yusef-Zadeh, Mark Wardle, Devaky Kunneriath, Joseph L. Hora, Howard Bushouse, Giovanni G. Fazio, Sera Markoff, Howard A. Smith

Abstract: We report on the final two days of a multiwavelength campaign of Sgr A* observing in the radio, submillimeter, infrared, and X-ray bands in July 2019. Sgr A* was remarkably active, showing multiple flaring events across the electromagnetic spectrum. We detect a transient $\sim35$-minute periodicity feature in Spitzer Space Telescope light curves on 21 July 2019. Time-delayed emission was detected… ▽ More We report on the final two days of a multiwavelength campaign of Sgr A* observing in the radio, submillimeter, infrared, and X-ray bands in July 2019. Sgr A* was remarkably active, showing multiple flaring events across the electromagnetic spectrum. We detect a transient $\sim35$-minute periodicity feature in Spitzer Space Telescope light curves on 21 July 2019. Time-delayed emission was detected in ALMA light curves, suggesting a hotspot within the accretion flow on a stable orbit. On the same night, we observe a decreased flux in the submillimeter light curve following an X-ray flare detected by the Chandra X-ray Observatory and model the feature with an adiabatically expanding synchrotron hotspot occulting the accretion flow. The event is produced by a plasma $0.55~R_{\text{S}}$ in radius with an electron spectrum $p=2.84$. It is threaded by a $\sim130$ Gauss magnetic field and expands at $0.6\%$ the speed of light. Finally, we reveal an unambiguous flare in the infrared, submillimeter, and radio, demonstrating that the variable emission is intrinsically linked. We jointly fit the radio and submillimeter light curves using an adiabatically expanding synchrotron hotspot and find it is produced by a plasma with an electron spectrum $p=0.59$, $187$ Gauss magnetic field, and radius $0.47~R_{\text{S}}$ that expands at $0.029c$. In both cases, the uncertainty in the appropriate lower and upper electron energy bounds may inflate the derived equipartition field strengths by a factor of 2 or more. Our results confirm that both synchrotron- and adiabatic-cooling processes are involved in the variable emission's evolution at submillimeter and infrared wavelengths. △ Less

Submitted 3 June, 2024; originally announced June 2024.

Comments: 24 pages, 13 figures, accepted to The Astrophysical Journal. Comments welcome! Paper I can be found here: arXiv:2107.09681

arXiv:2405.19860 [pdf, other]

Quasi-isodynamic stellarators with low turbulence as fusion reactor candidates

Authors: Alan G. Goodman, Pavlos Xanthopoulos, Gabriel G. Plunk, Håkan Smith, Carolin Nührenberg, Craig D. Beidler, Sophia A. Henneberg, Gareth Roberg-Clark, Michael Drevlak, Per Helander

Abstract: The stellarator is a type of fusion energy device that - if properly designed - could provide clean, safe, and abundant energy to the grid. To generate this energy, a stellarator must keep a hot mixture of charged particles (known as a plasma) sufficiently confined by using a fully shaped magnetic field. If this is achieved, the heat from fusion reactions within the plasma can be harvested as ener… ▽ More The stellarator is a type of fusion energy device that - if properly designed - could provide clean, safe, and abundant energy to the grid. To generate this energy, a stellarator must keep a hot mixture of charged particles (known as a plasma) sufficiently confined by using a fully shaped magnetic field. If this is achieved, the heat from fusion reactions within the plasma can be harvested as energy. We present a novel method for designing reactor-relevant stellarator magnetic fields, which combine several key physical properties. These include plasma stability, excellent confinement of the fast moving particles generated by fusion reactions, and reduction of the turbulence that is known to limit the performance of the most advanced stellarator experiment in the world, Wendelstein 7-X. △ Less

Submitted 30 May, 2024; originally announced May 2024.

Comments: 11 pages, 15 figures

arXiv:2405.19460 [pdf, other]

Evaluating Micro Parsons Problems as Exam Questions

Authors: Zihan Wu, David H. Smith IV

Abstract: Parsons problems are a type of programming activity that present learners with blocks of existing code and requiring them to arrange those blocks to form a program rather than write the code from scratch. Micro Parsons problems extend this concept by having students assemble segments of code to form a single line of code rather than an entire program. Recent investigations into micro Parsons probl… ▽ More Parsons problems are a type of programming activity that present learners with blocks of existing code and requiring them to arrange those blocks to form a program rather than write the code from scratch. Micro Parsons problems extend this concept by having students assemble segments of code to form a single line of code rather than an entire program. Recent investigations into micro Parsons problems have primarily focused on supporting learners leaving open the question of micro Parsons efficacy as an exam item and how students perceive it when preparing for exams. To fill this gap, we included a variety of micro Parsons problems on four exams in an introductory programming course taught in Python. We use Item Response Theory to investigate the difficulty of the micro Parsons problems as well as the ability of the questions to differentiate between high and low ability students. We then compare these results to results for related questions where students are asked to write a single line of code from scratch. Finally, we conduct a thematic analysis of the survey responses to investigate how students' perceptions of micro Parsons both when practicing for exams and as they appear on exams. △ Less

Submitted 29 May, 2024; originally announced May 2024.

Comments: This work is to appear in ITiCSE 2024. Both authors contributed equally to this research

ACM Class: K.3.2; K.3.1; H.5.2

arXiv:2405.12258 [pdf]

Scientific Hypothesis Generation by a Large Language Model: Laboratory Validation in Breast Cancer Treatment

Authors: Abbi Abdel-Rehim, Hector Zenil, Oghenejokpeme Orhobor, Marie Fisher, Ross J. Collins, Elizabeth Bourne, Gareth W. Fearnley, Emma Tate, Holly X. Smith, Larisa N. Soldatova, Ross D. King

Abstract: Large language models (LLMs) have transformed AI and achieved breakthrough performance on a wide range of tasks that require human intelligence. In science, perhaps the most interesting application of LLMs is for hypothesis formation. A feature of LLMs, which results from their probabilistic structure, is that the output text is not necessarily a valid inference from the training text. These are '… ▽ More Large language models (LLMs) have transformed AI and achieved breakthrough performance on a wide range of tasks that require human intelligence. In science, perhaps the most interesting application of LLMs is for hypothesis formation. A feature of LLMs, which results from their probabilistic structure, is that the output text is not necessarily a valid inference from the training text. These are 'hallucinations', and are a serious problem in many applications. However, in science, hallucinations may be useful: they are novel hypotheses whose validity may be tested by laboratory experiments. Here we experimentally test the use of LLMs as a source of scientific hypotheses using the domain of breast cancer treatment. We applied the LLM GPT4 to hypothesize novel pairs of FDA-approved non-cancer drugs that target the MCF7 breast cancer cell line relative to the non-tumorigenic breast cell line MCF10A. In the first round of laboratory experiments GPT4 succeeded in discovering three drug combinations (out of 12 tested) with synergy scores above the positive controls. These combinations were itraconazole + atenolol, disulfiram + simvastatin and dipyridamole + mebendazole. GPT4 was then asked to generate new combinations after considering its initial results. It then discovered three more combinations with positive synergy scores (out of four tested), these were disulfiram + fulvestrant, mebendazole + quinacrine and disulfiram + quinacrine. A limitation of GPT4 as a generator of hypotheses was that its explanations for them were formulaic and unconvincing. We conclude that LLMs are an exciting novel source of scientific hypotheses. △ Less

Submitted 5 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

Comments: 13 pages, 6 tables, 1 figure. Supplementary information available

arXiv:2405.07085 [pdf, other]

Optimised stellarators with a positive radial electric field

Authors: Per Helander, Alan G. Goodman, Craig D. Beidler, Michal Kuczyński, Håkan M. Smith

Abstract: We draw attention to an interesting possibility in the design and operation of stellarator fusion reactors, which has hitherto been considered unrealistic under burning-plasma conditions. Thanks to recent advances in stellarator optimisation theory, it appears possible to create a positive (outward-pointing) radial electric field in the plasma core by carefully tailoring the geometry of the magnet… ▽ More We draw attention to an interesting possibility in the design and operation of stellarator fusion reactors, which has hitherto been considered unrealistic under burning-plasma conditions. Thanks to recent advances in stellarator optimisation theory, it appears possible to create a positive (outward-pointing) radial electric field in the plasma core by carefully tailoring the geometry of the magnetic field. This electric field is likely to expel highly charged impurities from the centre of the plasma through neoclassical transport and thus eliminate, or at least mitigate, a long-standing problem in stellarator physics. Further out, the electric field is expected to suddenly change sign from positive to negative, thus creating a region of strongly sheared flow, which could locally suppress turbulent transport and enhance overall energy confinement. △ Less

Submitted 29 May, 2024; v1 submitted 11 May, 2024; originally announced May 2024.

Comments: 17 pages, 2 figures

arXiv:2405.06147 [pdf, other]

State-Free Inference of State-Space Models: The Transfer Function Approach

Authors: Rom N. Parnichkun, Stefano Massaroli, Alessandro Moro, Jimmy T. H. Smith, Ramin Hasani, Mathias Lechner, Qi An, Christopher Ré, Hajime Asama, Stefano Ermon, Taiji Suzuki, Atsushi Yamashita, Michael Poli

Abstract: We approach designing a state-space model for deep learning applications through its dual representation, the transfer function, and uncover a highly efficient sequence parallel inference algorithm that is state-free: unlike other proposed algorithms, state-free inference does not incur any significant memory or computational cost with an increase in state size. We achieve this using properties of… ▽ More We approach designing a state-space model for deep learning applications through its dual representation, the transfer function, and uncover a highly efficient sequence parallel inference algorithm that is state-free: unlike other proposed algorithms, state-free inference does not incur any significant memory or computational cost with an increase in state size. We achieve this using properties of the proposed frequency domain transfer function parametrization, which enables direct computation of its corresponding convolutional kernel's spectrum via a single Fast Fourier Transform. Our experimental results across multiple sequence lengths and state sizes illustrates, on average, a 35% training speed improvement over S4 layers -- parametrized in time-domain -- on the Long Range Arena benchmark, while delivering state-of-the-art downstream performances over other attention-free approaches. Moreover, we report improved perplexity in language modeling over a long convolutional Hyena baseline, by simply introducing our transfer function parametrization. Our code is available at https://github.com/ruke1ire/RTF. △ Less

Submitted 1 June, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

Comments: Resubmission 02/06/2024: Fixed minor typo of recurrent form RTF

arXiv:2404.07808 [pdf, other]

A broad linewidth, compact, millimeter-bright molecular emission line source near the Galactic Center

Authors: Adam Ginsburg, John Bally, Ashley T. Barnes, Cara Battersby, Nazar Budaiev, Natalie O. Butterfield, Paola Caselli, Laura Colzi, Katarzyna M. Dutkowska, Pablo García, Savannah Gramze, Jonathan D. Henshaw, Yue Hu, Desmond Jeff, Izaskun Jiménez-Serra, Jens Kauffmann, Ralf S. Klessen, Emily M. Levesque, Steven N. Longmore, Xing Lu, Elisabeth A. C. Mills, Mark R. Morris, Francisco Nogueras-Lara, Tomoharu Oka, Jaime E. Pineda , et al. (15 additional authors not shown)

Abstract: A compact source, G0.02467-0.0727, was detected in ALMA \threemm observations in continuum and very broad line emission. The continuum emission has a spectral index $α\approx3.3$, suggesting that the emission is from dust. The line emission is detected in several transitions of CS, SO, and SO$_2$ and exhibits a line width FWHM $\approx160$ \kms. The line profile appears Gaussian. The emission is w… ▽ More A compact source, G0.02467-0.0727, was detected in ALMA \threemm observations in continuum and very broad line emission. The continuum emission has a spectral index $α\approx3.3$, suggesting that the emission is from dust. The line emission is detected in several transitions of CS, SO, and SO$_2$ and exhibits a line width FWHM $\approx160$ \kms. The line profile appears Gaussian. The emission is weakly spatially resolved, coming from an area on the sky $\lesssim1"$ in diameter ($\lesssim10^4$ AU at the distance of the Galactic Center; GC). The centroid velocity is $v_{LSR}\approx40$-$50$ \kms, which is consistent with a location in the Galactic Center. With multiple SO lines detected, and assuming local thermodynamic equilibrium (LTE) conditions, $T_\mathrm{LTE} = 13$ K, which is colder than seen in typical GC clouds, though we cannot rule out low-density, subthermally excited, warmer gas. Despite the high velocity dispersion, no emission is observed from SiO, suggesting that there are no strong ($\gtrsim10~\mathrm{km~s}^{-1}$) shocks in the molecular gas. There are no detections at other wavelengths, including X-ray, infrared, and radio. We consider several explanations for the Millimeter Ultra-Broad Line Object (MUBLO), including protostellar outflow, explosive outflow, collapsing cloud, evolved star, stellar merger, high-velocity compact cloud, intermediate mass black hole, and background galaxy. Most of these conceptual models are either inconsistent with the data or do not fully explain it. The MUBLO is, at present, an observationally unique object. △ Less

Submitted 1 May, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

Comments: Accepted to ApJL

arXiv:2404.00786 [pdf, ps, other]

There and Back Again: A Netlist's Tale with Much Egraphin'

Authors: Gus Henry Smith, Zachary D. Sisco, Thanawat Techaumnuaiwit, Jingtao Xia, Vishal Canumalla, Andrew Cheung, Zachary Tatlock, Chandrakana Nandi, Jonathan Balkind

Abstract: EDA toolchains are notoriously unpredictable, incomplete, and error-prone; the generally-accepted remedy has been to re-imagine EDA tasks as compilation problems. However, any compiler framework we apply must be prepared to handle the wide range of EDA tasks, including not only compilation tasks like technology mapping and optimization (the "there"} in our title), but also decompilation tasks like… ▽ More EDA toolchains are notoriously unpredictable, incomplete, and error-prone; the generally-accepted remedy has been to re-imagine EDA tasks as compilation problems. However, any compiler framework we apply must be prepared to handle the wide range of EDA tasks, including not only compilation tasks like technology mapping and optimization (the "there"} in our title), but also decompilation tasks like loop rerolling (the "back again"). In this paper, we advocate for equality saturation -- a term rewriting framework -- as the framework of choice when building hardware toolchains. Through a series of case studies, we show how the needs of EDA tasks line up conspicuously well with the features equality saturation provides. △ Less

Submitted 31 March, 2024; originally announced April 2024.

arXiv:2403.14195 [pdf, other]

An Agnostic Biosignature Based on Modeling Panspermia and Terraformation

Authors: Harrison B. Smith, Lana Sinapayen

Abstract: A fundamental goal of astrobiology is to detect life outside of Earth. This proves to be an exceptional challenge outside of our solar system, where strong assumptions must be made about how life would manifest and interact with its planet. Such assumptions are required because of the lack of a consensus theory of living systems, or an understanding of the possible extent of planetary dynamics. He… ▽ More A fundamental goal of astrobiology is to detect life outside of Earth. This proves to be an exceptional challenge outside of our solar system, where strong assumptions must be made about how life would manifest and interact with its planet. Such assumptions are required because of the lack of a consensus theory of living systems, or an understanding of the possible extent of planetary dynamics. Here we explore a model of life spreading between planetary systems via panspermia and terraformation. Our model shows that as life propagates across the galaxy, correlations emerge between planetary characteristics and location, and can function as a population-scale agnostic biosignature. This biosignature is agnostic because it is independent of strong assumptions about any particular instantiation of life or planetary characteristic--by focusing on a specific hypothesis of what life may do, rather than what life may be. By clustering planets based on their observed characteristics, and examining the spatial extent of these clusters, we demonstrate (and evaluate) a way to prioritize specific planets for further observation--based on their potential for containing life. We consider obstacles that must be overcome to practically implement our approach, including identifying specific ways in which better understanding astrophysical and planetary processes would improve our ability to detect life. Finally, we consider how this model leads us to think in novel ways about hierarchies of life and planetary scale replication. △ Less

Submitted 21 March, 2024; originally announced March 2024.

Comments: 16 pages, 20 figures

arXiv:2403.08792 [pdf, other]

Realtime Facial Expression Recognition: Neuromorphic Hardware vs. Edge AI Accelerators

Authors: Heath Smith, James Seekings, Mohammadreza Mohammadi, Ramtin Zand

Abstract: The paper focuses on real-time facial expression recognition (FER) systems as an important component in various real-world applications such as social robotics. We investigate two hardware options for the deployment of FER machine learning (ML) models at the edge: neuromorphic hardware versus edge AI accelerators. Our study includes exhaustive experiments providing comparative analyses between the… ▽ More The paper focuses on real-time facial expression recognition (FER) systems as an important component in various real-world applications such as social robotics. We investigate two hardware options for the deployment of FER machine learning (ML) models at the edge: neuromorphic hardware versus edge AI accelerators. Our study includes exhaustive experiments providing comparative analyses between the Intel Loihi neuromorphic processor and four distinct edge platforms: Raspberry Pi-4, Intel Neural Compute Stick (NSC), Jetson Nano, and Coral TPU. The results obtained show that Loihi can achieve approximately two orders of magnitude reduction in power dissipation and one order of magnitude energy savings compared to Coral TPU which happens to be the least power-intensive and energy-consuming edge AI accelerator. These reductions in power and energy are achieved while the neuromorphic solution maintains a comparable level of accuracy with the edge accelerators, all within the real-time latency requirements. △ Less

Submitted 30 January, 2024; originally announced March 2024.

arXiv:2403.06050 [pdf, other]

Explaining Code with a Purpose: An Integrated Approach for Developing Code Comprehension and Prompting Skills

Authors: Paul Denny, David H. Smith IV, Max Fowler, James Prather, Brett A. Becker, Juho Leinonen

Abstract: Reading, understanding and explaining code have traditionally been important skills for novices learning programming. As large language models (LLMs) become prevalent, these foundational skills are more important than ever given the increasing need to understand and evaluate model-generated code. Brand new skills are also needed, such as the ability to formulate clear prompts that can elicit inten… ▽ More Reading, understanding and explaining code have traditionally been important skills for novices learning programming. As large language models (LLMs) become prevalent, these foundational skills are more important than ever given the increasing need to understand and evaluate model-generated code. Brand new skills are also needed, such as the ability to formulate clear prompts that can elicit intended code from an LLM. Thus, there is great interest in integrating pedagogical approaches for the development of both traditional coding competencies and the novel skills required to interact with LLMs. One effective way to develop and assess code comprehension ability is with ``Explain in plain English'' (EiPE) questions, where students succinctly explain the purpose of a fragment of code. However, grading EiPE questions has always been difficult given the subjective nature of evaluating written explanations and this has stifled their uptake. In this paper, we explore a natural synergy between EiPE questions and code-generating LLMs to overcome this limitation. We propose using an LLM to generate code based on students' responses to EiPE questions -- not only enabling EiPE responses to be assessed automatically, but helping students develop essential code comprehension and prompt crafting skills in parallel. We investigate this idea in an introductory programming course and report student success in creating effective prompts for solving EiPE questions. We also examine student perceptions of this activity and how it influences their views on the use of LLMs for aiding and assessing learning. △ Less

Submitted 9 March, 2024; originally announced March 2024.

Comments: Accepted to ITiCSE 2024

arXiv:2403.02519 [pdf, other]

Position operators in terms of converging finite-dimensional matrices: Exploring their interplay with geometry, transport, and gauge theory

Authors: B. Q. Song, J. D. H. Smith, J. Wang

Abstract: Position operator $\hat{r}$ appears as $i{\partial_p}$ in wave mechanics, while its matrix form is well known diverging in diagonals, causing serious difficulties in basis transformation, observable yielding, etc. We aim to find a convergent $r$-matrix (CRM) to improve the existing divergent $r$-matrix (DRM), and investigate its influence at both the conceptual and the application levels. Unlike t… ▽ More Position operator $\hat{r}$ appears as $i{\partial_p}$ in wave mechanics, while its matrix form is well known diverging in diagonals, causing serious difficulties in basis transformation, observable yielding, etc. We aim to find a convergent $r$-matrix (CRM) to improve the existing divergent $r$-matrix (DRM), and investigate its influence at both the conceptual and the application levels. Unlike the spin matrix, which affords a Lie algebra representation as the solution of $[s_i,s_j]=ε_{i,j,k}s_k$, the $r$-matrix cannot be a solution for $[\hat{r},p]=i\hbar$, namely Weyl algebra. Indeed: matrix representations of Weyl algebras prove not existing; thus, neither CRM nor DRM would afford a representation. Instead, the CRM should be viewed as a procedure of encoding $\hat{r}$ using matrices of arbitrary finite dimensions. Deriving CRM recognizes that the limited understanding about Weyl algebra has led to the divergence. A key modification is increasing the 1-st Weyl algebra (the familiar substitution $\hat{r}{\rightarrow}i{\partial_p}$) to the $N$-th Weyl algebra. Resolving the divergence makes $r$-matrix rigorously defined, and we are able to show $r$-matrix is distinct from a spin matrix in terms of its defining principles, transformation behavior, and the observable it yields. At the conceptual level, the CRM fills the logical gap between the $r$-matrix and the Berry connection; and helps to show that Bloch space $\mathcal{H}_B$ is incomplete for $\hat{r}$. At the application level, we focus on transport, and discover that the Hermitian matrix is not identical with the associative Hermitian operator, i.e., $r_{m,n}=r_{n,m}^*{\nLeftrightarrow}\hat{r}=\hat{r}^{\dagger}$. We also discuss how such a non-representation CRM can contribute to building a unified transport theory. △ Less

Submitted 4 March, 2024; originally announced March 2024.

Comments: 37 pages, 2 figures

arXiv:2402.11062 [pdf, other]

Absorption and Self-Absorption of [C II] and [O I] Far Infrared Lines Towards a Bright Bubble in the Nessie Infrared Dark Cloud

Authors: J. M. Jackson, J. S. Whitaker, E. T. Chambers, R. Simon, C. Guevara, D. Allingham, P. Patterson, N. Killerby-Smith, J. Askew, T. Vandenberg, H. A. Smith, P. Sanhueza, I. W. Stephens, L. Bonne, F. Polles, A. Schmiedeke, N. Honigh, M. Justen

Abstract: Using the upGREAT instrument on SOFIA, we have imaged [C II] 157.74 and [O I] 63.18 micron line emission from a bright photodissociation region (PDR) associated with an ionized ``bubble'' located in the Nessie Nebula, a filamentary infrared dark cloud. A comparison with ATCA data reveals a classic PDR structure, with a uniform progression from ionized gas, to photodissociated gas, and on to molecu… ▽ More Using the upGREAT instrument on SOFIA, we have imaged [C II] 157.74 and [O I] 63.18 micron line emission from a bright photodissociation region (PDR) associated with an ionized ``bubble'' located in the Nessie Nebula, a filamentary infrared dark cloud. A comparison with ATCA data reveals a classic PDR structure, with a uniform progression from ionized gas, to photodissociated gas, and on to molecular gas from the bubble's interior to its exterior. [O I] line emission from the bubble's PDR reveals self-absorption features. Toward a FIR-bright protostar, both [O I] and [C II] show an absorption feature at a velocity of $-18$ km/s, the same velocity as an unrelated foreground molecular cloud. Since the gas density in typical molecular clouds is well below the [O I] and [C II] critical densities, the excitation temperatures for both lines are low (~20 K). The Meudon models demonstrate that the surface of a molecular cloud, externally illuminated by a standard G_0 = 1 interstellar radiation field, can produce absorption features in both transitions. Thus, the commonly observed [O I] and [C II] self-absorption and absorption features plausibly arise from the subthermally excited, externally illuminated, photodissociated envelopes of molecular clouds. The luminous young stellar object AGAL337.916-00.477, located precisely where the expanding bubble strikes the Nessie filament, is associated with two shock tracers: NH3 (3,3) maser emission and SiO 2-1 emission, indicating interaction between the bubble and the filament. The interaction of the expanding bubble with its parental dense filament has triggered star formation. △ Less

Submitted 16 February, 2024; originally announced February 2024.

arXiv:2401.16526 [pdf, other]

doi 10.1145/3620665.3640387

FPGA Technology Mapping Using Sketch-Guided Program Synthesis

Authors: Gus Henry Smith, Ben Kushigian, Vishal Canumalla, Andrew Cheung, Steven Lyubomirsky, Sorawee Porncharoenwase, René Just, Gilbert Louis Bernstein, Zachary Tatlock

Abstract: FPGA technology mapping is the process of implementing a hardware design expressed in high-level HDL (hardware design language) code using the low-level, architecture-specific primitives of the target FPGA. As FPGAs become increasingly heterogeneous, achieving high performance requires hardware synthesis tools that better support mapping to complex, highly configurable primitives like digital sign… ▽ More FPGA technology mapping is the process of implementing a hardware design expressed in high-level HDL (hardware design language) code using the low-level, architecture-specific primitives of the target FPGA. As FPGAs become increasingly heterogeneous, achieving high performance requires hardware synthesis tools that better support mapping to complex, highly configurable primitives like digital signal processors (DSPs). Current tools support DSP mapping via handwritten special-case mapping rules, which are laborious to write, error-prone, and often overlook mapping opportunities. We introduce Lakeroad, a principled approach to technology mapping via sketch-guided program synthesis. Lakeroad leverages two techniques -- architecture-independent sketch templates and semantics extraction from HDL -- to provide extensible technology mapping with stronger correctness guarantees and higher coverage of mapping opportunities than state-of-the-art tools. Across representative microbenchmarks, Lakeroad produces 2--3.5$\times$ the number of optimal mappings compared to proprietary state-of-the-art tools and 6--44$\times$ the number of optimal mappings compared to popular open-source tools, while also providing correctness guarantees not given by any other tool. △ Less

Submitted 29 January, 2024; originally announced January 2024.

arXiv:2401.10759 [pdf, other]

Interactions with Prompt Problems: A New Way to Teach Programming with Large Language Models

Authors: James Prather, Paul Denny, Juho Leinonen, David H. Smith IV, Brent N. Reeves, Stephen MacNeil, Brett A. Becker, Andrew Luxton-Reilly, Thezyrie Amarouche, Bailey Kimmel

Abstract: Large Language Models (LLMs) have upended decades of pedagogy in computing education. Students previously learned to code through \textit{writing} many small problems with less emphasis on code reading and comprehension. Recent research has shown that free code generation tools powered by LLMs can solve introductory programming problems presented in natural language with ease. In this paper, we pr… ▽ More Large Language Models (LLMs) have upended decades of pedagogy in computing education. Students previously learned to code through \textit{writing} many small problems with less emphasis on code reading and comprehension. Recent research has shown that free code generation tools powered by LLMs can solve introductory programming problems presented in natural language with ease. In this paper, we propose a new way to teach programming with Prompt Problems. Students receive a problem visually, indicating how input should be transformed to output, and must translate that to a prompt for an LLM to decipher. The problem is considered correct when the code that is generated by the student prompt can pass all test cases. In this paper we present the design of this tool, discuss student interactions with it as they learn, and provide insights into this new class of programming problems as well as the design tools that integrate LLMs. △ Less

Submitted 19 January, 2024; originally announced January 2024.

Comments: accepted for CHI 2024

arXiv:2401.06205 [pdf, other]

Unsupervised detection of coordinated information operations in the wild

Authors: D. Hudson Smith, Carl Ehrett, Patrick L. Warren

Abstract: This paper introduces and tests an unsupervised method for detecting novel coordinated inauthentic information operations (CIOs) in realistic settings. This method uses Bayesian inference to identify groups of accounts that share similar account-level characteristics and target similar narratives. We solve the inferential problem using amortized variational inference, allowing us to efficiently in… ▽ More This paper introduces and tests an unsupervised method for detecting novel coordinated inauthentic information operations (CIOs) in realistic settings. This method uses Bayesian inference to identify groups of accounts that share similar account-level characteristics and target similar narratives. We solve the inferential problem using amortized variational inference, allowing us to efficiently infer group identities for millions of accounts. We validate this method using a set of five CIOs from three countries discussing four topics on Twitter. Our unsupervised approach increases detection power (area under the precision-recall curve) relative to a naive baseline (by a factor of 76 to 580), relative to the use of simple flags or narratives on their own (by a factor of 1.3 to 4.8), and comes quite close to a supervised benchmark. Our method is robust to observing only a small share of messaging on the topic, having only weak markers of inauthenticity, and to the CIO accounts making up a tiny share of messages and accounts on the topic. Although we evaluate the results on Twitter, the method is general enough to be applied in many social-media settings. △ Less

Submitted 11 January, 2024; originally announced January 2024.

Comments: 34 pages, 10 figures

arXiv:2401.00289 [pdf]

ASL Champ!: A Virtual Reality Game with Deep-Learning Driven Sign Recognition

Authors: Md Shahinur Alam, Jason Lamberton, Jianye Wang, Carly Leannah, Sarah Miller, Joseph Palagano, Myles de Bastion, Heather L. Smith, Melissa Malzkuhn, Lorna C. Quandt

Abstract: We developed an American Sign Language (ASL) learning platform in a Virtual Reality (VR) environment to facilitate immersive interaction and real-time feedback for ASL learners. We describe the first game to use an interactive teaching style in which users learn from a fluent signing avatar and the first implementation of ASL sign recognition using deep learning within the VR environment. Advanced… ▽ More We developed an American Sign Language (ASL) learning platform in a Virtual Reality (VR) environment to facilitate immersive interaction and real-time feedback for ASL learners. We describe the first game to use an interactive teaching style in which users learn from a fluent signing avatar and the first implementation of ASL sign recognition using deep learning within the VR environment. Advanced motion-capture technology powers an expressive ASL teaching avatar within an immersive three-dimensional environment. The teacher demonstrates an ASL sign for an object, prompting the user to copy the sign. Upon the user's signing, a third-party plugin executes the sign recognition process alongside a deep learning model. Depending on the accuracy of a user's sign production, the avatar repeats the sign or introduces a new one. We gathered a 3D VR ASL dataset from fifteen diverse participants to power the sign recognition model. The proposed deep learning model's training, validation, and test accuracy are 90.12%, 89.37%, and 86.66%, respectively. The functional prototype can teach sign language vocabulary and be successfully adapted as an interactive ASL learning platform in VR. △ Less

Submitted 30 December, 2023; originally announced January 2024.

Comments: 36 pages, 9 figures

arXiv:2312.13976 [pdf]

Anatomical basis of sex differences in human post-myocardial infarction ECG phenotypes identified by novel automated torso-cardiac 3D reconstruction

Authors: Hannah J. Smith, Blanca Rodriguez, Yuling Sang, Marcel Beetz, Robin Choudhury, Vicente Grau, Abhirup Banerjee

Abstract: The electrocardiogram (ECG) is routinely used in cardiology, though its interpretation is confounded by anatomical variability. A novel, automated computational pipeline enables quantification of torso-ventricular anatomy metrics from magnetic resonance imaging, and comparison to ECG characteristics. Sex and myocardial infarction differences are investigated based on 1051 healthy and 425 post-MI s… ▽ More The electrocardiogram (ECG) is routinely used in cardiology, though its interpretation is confounded by anatomical variability. A novel, automated computational pipeline enables quantification of torso-ventricular anatomy metrics from magnetic resonance imaging, and comparison to ECG characteristics. Sex and myocardial infarction differences are investigated based on 1051 healthy and 425 post-MI subjects from UK Biobank. Smaller ventricles in females explain ~50% of shorter QRS durations than in males, and contribute to lower STJ amplitudes in females (also due to more superior and posterior position). In females, torso-ventricular anatomy, particularly from larger BMI, is a stronger modulator of T wave amplitude reductions and left-deviated R axis angles in post-MI than in males. Thus, female MI phenotype is less reflective of pathology, and baseline STJ amplitudes and QRS durations are further from clinical thresholds. Therefore, quantification of anatomical sex-differences and impact on ECG in health and disease is critical to avoid clinical sex-bias. △ Less

Submitted 21 December, 2023; originally announced December 2023.

Comments: Paper under revision

arXiv:2312.04778 [pdf, other]

doi 10.1103/PhysRevB.109.144301

Quantum Liouville's theorem based on Haar measure

Authors: B. Q. Song, J. D. H. Smith, L. Luo, J. Wang

Abstract: Liouville theorem (LT) reveals robust incompressibility of distribution function in phase space, given arbitrary potentials. However, its quantum generalization, Wigner flow, is compressible, i.e., LT is only conditionally true (e.g., for perfect Harmonic potential). We develop quantum Liouville theorem (rigorous incompressibility) for arbitrary potentials (interacting or not) in Hamiltonians. Haa… ▽ More Liouville theorem (LT) reveals robust incompressibility of distribution function in phase space, given arbitrary potentials. However, its quantum generalization, Wigner flow, is compressible, i.e., LT is only conditionally true (e.g., for perfect Harmonic potential). We develop quantum Liouville theorem (rigorous incompressibility) for arbitrary potentials (interacting or not) in Hamiltonians. Haar measure, instead of symplectic measure dp^dq used in Wigner's scheme, plays a central role. The argument is based on general measure theory, independent of specific spaces or coordinates. Comparison of classical and quantum is made: for instance, we address why Haar measure and metric preservation do not work in the classical case. Applications of theorems in statistics, topological phase transition, ergodic theory, etc. are discussed. △ Less

Submitted 6 April, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

Comments: 9 pages, 1 figure

Journal ref: Phys. Rev. B 109, 144301 (2024)

arXiv:2312.02475 [pdf, other]

Accurate Machine Learning Predictions of Coercivity in High-Performance Permanent Magnets

Authors: Churna Bhandari, Gavin N. Nop, Jonathan D. H. Smith, Durga Paudyal

Abstract: Increased demand for high-performance permanent magnets in the electric vehicle and wind turbine industries has prompted the search for cost-effective alternatives. Nevertheless, the discovery of new magnetic materials with the desired intrinsic and extrinsic permanent magnet properties presents a significant challenge. Traditional density functional theory (DFT) accurately predicts intrinsic perm… ▽ More Increased demand for high-performance permanent magnets in the electric vehicle and wind turbine industries has prompted the search for cost-effective alternatives. Nevertheless, the discovery of new magnetic materials with the desired intrinsic and extrinsic permanent magnet properties presents a significant challenge. Traditional density functional theory (DFT) accurately predicts intrinsic permanent magnet properties such as magnetic moments, magneto-crystalline anisotropy constants, and exchange interactions. However, it cannot compute extrinsic macroscopic properties, such as coercivity ($H_c$), which are influenced by factors like microscopic defects and internal grain structures. Although micromagnetic simulation helps compute $H_c$, it overestimates the values almost by an order of magnitude due to Brown's paradox. To circumvent these limitations, we employ machine learning (ML) methods in an extensive database obtained from experiments, DFT calculations, and micromagnetic modeling. Our novel ML approach is computationally much faster than the micromagnetic simulation program, the mumax$^3$. We successfully utilize it to predict $H_c$ values for materials like cerium-doped $\mathrm{Nd}_2\mathrm{Fe}_{14}\mathrm{B}$, and subsequently compare the predicted values with experimental results. Remarkably, our ML model accurately identifies uniaxial magnetic anisotropy as the primary contributor to $H_c$. With DFT calculations, we predict the Nd-site dependent magnetic anisotropy behavior in $\mathrm{Nd}_2\mathrm{Fe}_{14}\mathrm{B}$, confirming $4f$-site planar and $4g$-site uniaxial to crystalline $c$-direction in good agreement with experiment. The Green's function atomic sphere approximation calculated a Curie temperature ($T_{\rm C}$) for $\mathrm{Nd}_2\mathrm{Fe}_{14}\mathrm{B}$ that also agrees well with experiment. △ Less

Submitted 11 December, 2023; v1 submitted 4 December, 2023; originally announced December 2023.

Comments: 17 pages, 11 figures

arXiv:2312.00828 [pdf, ps, other]

From affine to barycentric coordinates in polytopes

Authors: Anna B. Romanowska, Jonathan D. H. Smith, Anna Zamojska-Dzienio

Abstract: Each point of a simplex is expressed as a unique convex combination of the vertices. The coefficients in the combination are the barycentric coordinates of the point. For each point in a general convex polytope, there may be multiple representations, so its barycentric coordinates are not necessarily unique. There are various schemes to fix particular barycentric coordinates: Gibbs, Wachspress, ca… ▽ More Each point of a simplex is expressed as a unique convex combination of the vertices. The coefficients in the combination are the barycentric coordinates of the point. For each point in a general convex polytope, there may be multiple representations, so its barycentric coordinates are not necessarily unique. There are various schemes to fix particular barycentric coordinates: Gibbs, Wachspress, cartographic, etc. In this paper, a method for producing sparse barycentric coordinates in polytopes will be discussed. It uses a purely algebraic treatment of affine spaces and convex sets, with barycentric algebras. The method is based on a certain decomposition of each finite-dimensional convex polytope into a union of simplices of the same dimension. △ Less

Submitted 30 November, 2023; originally announced December 2023.

MSC Class: 08A99; 52A01; 52B99

arXiv:2311.14903 [pdf, other]

Code Generation Based Grading: Evaluating an Auto-grading Mechanism for "Explain-in-Plain-English" Questions

Authors: David H. Smith IV, Craig Zilles

Abstract: Comprehending and elucidating the purpose of code is often cited as being a key learning objective within introductory programming courses. To address this objective ``Explain-in-Plain-English'' questions, in which students are shown a segment of code and asked to provide an abstract description of the code's purpose, have been adopted. However, given EiPE questions require a natural language resp… ▽ More Comprehending and elucidating the purpose of code is often cited as being a key learning objective within introductory programming courses. To address this objective ``Explain-in-Plain-English'' questions, in which students are shown a segment of code and asked to provide an abstract description of the code's purpose, have been adopted. However, given EiPE questions require a natural language response, they often require manual grading which is time-consuming for course staff and delays feedback for students. With the advent of large language models (LLMs) capable of generating code, responses to EiPE questions can be used to generate code segments, the correctness of which can then be easily verified using test cases. We refer to this approach as "Code Generation Based Grading" (CGBG) and in this paper we explore its agreement with human graders using EiPE responses from past exams in an introductory programming course taught in Python. Overall, we find that CGBG achieves moderate agreement with human graders with the primary area of disagreement being its leniency with respect to low-level and line-by-line descriptions of code. △ Less

Submitted 24 November, 2023; originally announced November 2023.

arXiv:2310.19694 [pdf, other]

Convolutional State Space Models for Long-Range Spatiotemporal Modeling

Authors: Jimmy T. H. Smith, Shalini De Mello, Jan Kautz, Scott W. Linderman, Wonmin Byeon

Abstract: Effectively modeling long spatiotemporal sequences is challenging due to the need to model complex spatial correlations and long-range temporal dependencies simultaneously. ConvLSTMs attempt to address this by updating tensor-valued states with recurrent neural networks, but their sequential computation makes them slow to train. In contrast, Transformers can process an entire spatiotemporal sequen… ▽ More Effectively modeling long spatiotemporal sequences is challenging due to the need to model complex spatial correlations and long-range temporal dependencies simultaneously. ConvLSTMs attempt to address this by updating tensor-valued states with recurrent neural networks, but their sequential computation makes them slow to train. In contrast, Transformers can process an entire spatiotemporal sequence, compressed into tokens, in parallel. However, the cost of attention scales quadratically in length, limiting their scalability to longer sequences. Here, we address the challenges of prior methods and introduce convolutional state space models (ConvSSM) that combine the tensor modeling ideas of ConvLSTM with the long sequence modeling approaches of state space methods such as S4 and S5. First, we demonstrate how parallel scans can be applied to convolutional recurrences to achieve subquadratic parallelization and fast autoregressive generation. We then establish an equivalence between the dynamics of ConvSSMs and SSMs, which motivates parameterization and initialization strategies for modeling long-range dependencies. The result is ConvS5, an efficient ConvSSM variant for long-range spatiotemporal modeling. ConvS5 significantly outperforms Transformers and ConvLSTM on a long horizon Moving-MNIST experiment while training 3X faster than ConvLSTM and generating samples 400X faster than Transformers. In addition, ConvS5 matches or exceeds the performance of state-of-the-art methods on challenging DMLab, Minecraft and Habitat prediction benchmarks and enables new directions for modeling long spatiotemporal sequences. △ Less

Submitted 30 October, 2023; originally announced October 2023.

arXiv:2310.13407 [pdf]

Preserving your skies since 1988 -- Committee on Radio Astronomy Frequencies (CRAF) -- Periodic Review 2011-2021

Authors: Committee on Radio Astronomy Frequencies, Benjamin Winkel, Simon Garrington, Francesco Colomer, Waleed Madkour, Agnieszka Slowikowska, Pietro Bolli, Michael Lindqvist, José Antonio López-Pérez, Leif Morten Tangen, Ivan Thomas, Peter Thomasson, Roel Witvers, Joe McCauley, Marta Bautista, Miguel Bergano, Vladislavs Bezrukovs, Fabio Giovanardi, Hayo Hase, Karel Jiricka, Gyula I. G. Józsa, Juha Kallunki, Christophe Marqué, Derek McKay, Axel Murk , et al. (21 additional authors not shown)

Abstract: The Committee on Radio Astronomy Frequencies (CRAF) is an Expert Committee of the European Science Foundation. It aims to provide a cost-effective single voice on frequency protection issues for European radio astronomy observatories and research institutes, achieving a significantly greater impact than that achievable by individual national institutions. By working together, European observatorie… ▽ More The Committee on Radio Astronomy Frequencies (CRAF) is an Expert Committee of the European Science Foundation. It aims to provide a cost-effective single voice on frequency protection issues for European radio astronomy observatories and research institutes, achieving a significantly greater impact than that achievable by individual national institutions. By working together, European observatories and institutes can profit from synergy effects, cover many more topics, and learn from each other. CRAF was founded in 1988 and has since then been engaged with the International Telecommunication Union (ITU), in particular its Radiocommunication Sector (ITU-R), and the European Conference of Postal and Telecommunications Administrations (CEPT) and its European Communications Committee (ECC). This is the self-evaluation report prepared by CRAF for its periodic review of the years 2011-2021. △ Less

Submitted 20 October, 2023; originally announced October 2023.

Comments: 75 pages

arXiv:2310.10453 [pdf, other]

doi 10.1007/978-3-031-43895-0_70

On the Relevance of Temporal Features for Medical Ultrasound Video Recognition

Authors: D. Hudson Smith, John Paul Lineberger, George H. Baker

Abstract: Many medical ultrasound video recognition tasks involve identifying key anatomical features regardless of when they appear in the video suggesting that modeling such tasks may not benefit from temporal features. Correspondingly, model architectures that exclude temporal features may have better sample efficiency. We propose a novel multi-head attention architecture that incorporates these hypothes… ▽ More Many medical ultrasound video recognition tasks involve identifying key anatomical features regardless of when they appear in the video suggesting that modeling such tasks may not benefit from temporal features. Correspondingly, model architectures that exclude temporal features may have better sample efficiency. We propose a novel multi-head attention architecture that incorporates these hypotheses as inductive priors to achieve better sample efficiency on common ultrasound tasks. We compare the performance of our architecture to an efficient 3D CNN video recognition model in two settings: one where we expect not to require temporal features and one where we do. In the former setting, our model outperforms the 3D CNN - especially when we artificially limit the training data. In the latter, the outcome reverses. These results suggest that expressive time-independent models may be more effective than state-of-the-art video recognition models for some common ultrasound tasks in the low-data regime. △ Less

Submitted 16 October, 2023; originally announced October 2023.

Comments: 14 pages, 4 figures, published in MICCAI 23

Journal ref: International Conference on Medical Image Computing and Computer-Assisted Intervention, pp. 744-753. Cham: Springer Nature Switzerland, 2023

arXiv:2310.09972 [pdf, ps, other]

Octonions as Clifford-like algebras

Authors: Connor M. Depies, Jonathan D. H. Smith, Mitchell D. Ashburn

Abstract: The associative Cayley-Dickson algebras over the field of real numbers are also Clifford algebras. The alternative but nonassociative real Cayley-Dickson algebras, notably the octonions and split octonions, share with Clifford algebras an involutary anti-automorphism and a set of mutually anticommutative generators. On the basis of these similarities, we introduce Kingdon algebras: alternative Cli… ▽ More The associative Cayley-Dickson algebras over the field of real numbers are also Clifford algebras. The alternative but nonassociative real Cayley-Dickson algebras, notably the octonions and split octonions, share with Clifford algebras an involutary anti-automorphism and a set of mutually anticommutative generators. On the basis of these similarities, we introduce Kingdon algebras: alternative Clifford-like algebras over vector spaces equipped with a symmetric bilinear form. Over three-dimensional vector spaces, our construction quantizes an alternative non-associative analogue of the exterior algebra. The octonions and split octonions, along with other real generalized Cayley-Dickson algebras in Albert's sense, arise as Kingdon algebras. Our construction gives natural characterizations of the octonion and split octonion algebras by a universality property endowing them with a selected superalgebra structure. △ Less

Submitted 15 October, 2023; originally announced October 2023.

MSC Class: 17D05 (Primary); 17A35; 17A45 (Secondary)

arXiv:2310.07195 [pdf, other]

doi 10.1088/2058-9565/ad3f43

Bilayer Ion Trap Design for 2D Arrays

Authors: Gavin N. Nop, Jonathan D. H. Smith, Daniel Stick, Durga Paudyal

Abstract: Junctions are fundamental elements that support qubit locomotion in two-dimensional ion trap arrays and enhance connectivity in emerging trapped-ion quantum computers. In surface ion traps they have typically been implemented by shaping radio frequency (RF) electrodes in a single plane to minimize the disturbance to the pseudopotential. However, this method introduces issues related to RF lead rou… ▽ More Junctions are fundamental elements that support qubit locomotion in two-dimensional ion trap arrays and enhance connectivity in emerging trapped-ion quantum computers. In surface ion traps they have typically been implemented by shaping radio frequency (RF) electrodes in a single plane to minimize the disturbance to the pseudopotential. However, this method introduces issues related to RF lead routing that can increase power dissipation and the likelihood of voltage breakdown. Here, we propose and simulate a novel two-layer junction design incorporating two perpendicularly rotoreflected (rotated, then reflected) linear ion traps. The traps are vertically separated, and create a trapping potential between their respective planes. The orthogonal orientation of the RF electrodes of each trap relative to the other provides perpendicular axes of confinement that can be used to realize transport in two dimensions. While this design introduces manufacturing and operating challenges, as now two separate structures have to be precisely positioned relative to each other in the vertical direction and optical access from the top is obscured, it obviates the need to route RF leads below the top surface of the trap and eliminates the pseudopotential bumps that occur in typical junctions. In this paper the stability of idealized ion transfer in the new configuration is demonstrated, both by solving the Mathieu equation analytically to identify the stable regions and by numerically modeling ion dynamics. Our novel junction layout has the potential to enhance the flexibility of microfabricated ion trap control to enable large-scale trapped-ion quantum computing. △ Less

Submitted 9 May, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

Comments: 11 pages

Journal ref: Gavin N Nop et al 2024 Quantum Sci. Technol. 9 035015

arXiv:2308.12912 [pdf, ps, other]

Matter relative to quantum hypersurfaces

Authors: Philipp A. Hoehn, Andrea Russo, Alexander R. H. Smith

Abstract: We explore the canonical description of a scalar field as a parameterized field theory on an extended phase space that includes additional embedding fields that characterize spacetime hypersurfaces $\mathsf{X}$ relative to which the scalar field is described. This theory is quantized via the Dirac prescription and physical states of the theory are used to define conditional wave functionals… ▽ More We explore the canonical description of a scalar field as a parameterized field theory on an extended phase space that includes additional embedding fields that characterize spacetime hypersurfaces $\mathsf{X}$ relative to which the scalar field is described. This theory is quantized via the Dirac prescription and physical states of the theory are used to define conditional wave functionals $|ψ_φ[\mathsf{X}]\rangle$ interpreted as the state of the field relative to the hypersurface $\mathsf{X}$, thereby extending the Page-Wootters formalism to quantum field theory. It is shown that this conditional wave functional satisfies the Tomonaga-Schwinger equation, thus demonstrating the formal equivalence between this extended Page-Wootters formalism and standard quantum field theory. We also construct relational Dirac observables and define a quantum deparameterization of the physical Hilbert space leading to a relational Heisenberg picture, which are both shown to be unitarily equivalent to the Page-Wootters formalism. Moreover, by treating hypersurfaces as quantum reference frames, we extend recently developed quantum frame transformations to changes between classical and nonclassical hypersurfaces. This allows us to exhibit the transformation properties of a quantum field under a larger class of transformations, which leads to a frame-dependent particle creation effect. △ Less

Submitted 23 November, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

Comments: 21 pages, 3 figures. Comments welcome

arXiv:2308.11665 [pdf]

What it takes to solve the Origin(s) of Life: An integrated review of techniques

Authors: OoLEN, Silke Asche, Carla Bautista, David Boulesteix, Alexandre Champagne-Ruel, Cole Mathis, Omer Markovitch, Zhen Peng, Alyssa Adams, Avinash Vicholous Dass, Arnaud Buch, Eloi Camprubi, Enrico Sandro Colizzi, Stephanie Colón-Santos, Hannah Dromiack, Valentina Erastova, Amanda Garcia, Ghjuvan Grimaud, Aaron Halpern, Stuart A Harrison, Seán F. Jordan, Tony Z Jia, Amit Kahana, Artemy Kolchinsky, Odin Moron-Garcia , et al. (13 additional authors not shown)

Abstract: Understanding the origin(s) of life (OoL) is a fundamental challenge for science in the 21st century. Research on OoL spans many disciplines, including chemistry, physics, biology, planetary sciences, computer science, mathematics and philosophy. The sheer number of different scientific perspectives relevant to the problem has resulted in the coexistence of diverse tools, techniques, data, and sof… ▽ More Understanding the origin(s) of life (OoL) is a fundamental challenge for science in the 21st century. Research on OoL spans many disciplines, including chemistry, physics, biology, planetary sciences, computer science, mathematics and philosophy. The sheer number of different scientific perspectives relevant to the problem has resulted in the coexistence of diverse tools, techniques, data, and software in OoL studies. This has made communication between the disciplines relevant to the OoL extremely difficult because the interpretation of data, analyses, or standards of evidence can vary dramatically. Here, we hope to bridge this wide field of study by providing common ground via the consolidation of tools and techniques rather than positing a unifying view on how life emerges. We review the common tools and techniques that have been used significantly in OoL studies in recent years. In particular, we aim to identify which information is most relevant for comparing and integrating the results of experimental analyses into mathematical and computational models. This review aims to provide a baseline expectation and understanding of technical aspects of origins research, rather than being a primer on any particular topic. As such, it spans broadly -- from analytical chemistry to mathematical models -- and highlights areas of future work that will benefit from a multidisciplinary approach to tackling the mystery of life's origin. Ultimately, we hope to empower a new generation of OoL scientists by reviewing how they can investigate life's origin, rather than dictating how to think about the problem. △ Less

Submitted 24 August, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

arXiv:2308.11634 [pdf, ps, other]

Barycentric algebra and convex polygon coordinates

Authors: A. B. Romanowska, J. D. H. Smith, A. Zamojska-Dzienio

Abstract: Barycentric coordinates provide solutions to the problem of expressing an element of a compact convex set as a convex combination of a finite number of extreme points of the set. Various approaches to this problem have arisen, in various contexts. The most general solution, namely the Gibbs coordinates based on entropy maximization, actually work in the broader setting of barycentric algebras, whi… ▽ More Barycentric coordinates provide solutions to the problem of expressing an element of a compact convex set as a convex combination of a finite number of extreme points of the set. Various approaches to this problem have arisen, in various contexts. The most general solution, namely the Gibbs coordinates based on entropy maximization, actually work in the broader setting of barycentric algebras, which constitute semilattice-ordered systems of convex sets. These coordinates involve exponential functions. For convex polytopes, Wachspress coordinates offer solutions which only involve rational functions. The current paper focuses primarily on convex polygons in the plane. After summarizing the Gibbs and Wachspress coordinates, we identify where they agree, and provide comparisons between them when they do not. Within a general formalism for analyzing coordinate systems, we then introduce a direct sparse geometric approach based on chordal decompositions of polygons, along with a symmetrized version which creates what we call cartographic coordinates. We present comparisons between cartographic coordinates based on distinct chordal decompositions, and comparisons of cartographic coordinates with Gibbs and Wachspress coordinates. △ Less

Submitted 13 August, 2023; originally announced August 2023.

Comments: 62 pages, 8 figures

MSC Class: 51M20; 52A01; 52B99

arXiv:2306.11815 [pdf, ps, other]

Radical Dynamical Monogenicity

Authors: Hanson Smith

Abstract: Let $a$ be an integer and $p$ a prime so that $f(x)=x^p-a$ is irreducible. Write $f^n(x)$ to indicate the $n$-fold composition of $f(x)$ with itself. We study the monogenicity of number fields defined by roots of $f^n(x)$ and give necessary and sufficient conditions for a root of $f^n(x)$ to yield a power integral basis for each $n\geq 1$. Let $a$ be an integer and $p$ a prime so that $f(x)=x^p-a$ is irreducible. Write $f^n(x)$ to indicate the $n$-fold composition of $f(x)$ with itself. We study the monogenicity of number fields defined by roots of $f^n(x)$ and give necessary and sufficient conditions for a root of $f^n(x)$ to yield a power integral basis for each $n\geq 1$. △ Less

Submitted 11 August, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

Comments: 11 pages. Arguments and discussion clarified and expanded. Comments are welcome!

MSC Class: 11R04; 11R21; 37P05

arXiv:2306.09620 [pdf, other]

Above-threshold ionization at laser intensity greater than $10^{20}$ W/cm$^{2}$

Authors: A. Yandow, T. N. Ha, C. Aniculaesei, H. L. Smith, C. G. Richmond, M. M. Spinks, H. J. Quevedo, S. Bruce, M. Darilek, C. Chang, D. A. Garcia, E. Gaul, M. E. Donovan, B. M. Hegelich, T. Ditmire

Abstract: We present the first experimental observation of above-threshold ionization (ATI) electrons produced by ionization of the neon K-shell in a laser field where intensity exceeds 10$^{20}$ W/cm$^{2}$. An array of plastic scintillating calorimeter detectors was used to measure the high-energy electrons at four angles in the laser forward direction. Coarse energy resolution was obtained using aluminum… ▽ More We present the first experimental observation of above-threshold ionization (ATI) electrons produced by ionization of the neon K-shell in a laser field where intensity exceeds 10$^{20}$ W/cm$^{2}$. An array of plastic scintillating calorimeter detectors was used to measure the high-energy electrons at four angles in the laser forward direction. Coarse energy resolution was obtained using aluminum filters of several thicknesses to block lower-energy electrons. A threshold intensity around $2 \times 10^{20}$ W/cm$^{2}$ is observed for production of energetic ATI electrons in the laser forward direction, with maximum electron energy exceeding 10 MeV. L-shell electrons with energies < 1.4 MeV are scattered further forward along the laser direction than expected. We present comparisons of the measured total electron energies to the predictions of a Monte Carlo models employing the ADK-PPT ionization model and the Augst barrier suppression ionization model. △ Less

Submitted 16 June, 2023; originally announced June 2023.

Report number: LLNL-JRNL-849251

arXiv:2306.09611 [pdf, other]

Multi-MeV electrons from above-threshold ionization of the neon K-shell

Authors: A. Yandow, T. N. Ha, C. Aniculaesei, H. L. Smith, C. G. Richmond, M. M. Spinks, H. J. Quevedo, S. Bruce, M. Darilek, C. Chang, D. A. Garcia, E. Gaul, M. E. Donovan, B. M. Hegelich, T. Ditmire

Abstract: We present measurements of integrated electron energies produced by above-threshold ionization (ATI) of neon in a laser field with intensity exceeding 10$^{20}$ W/cm$^{2}$. We observe electrons with energy exceeding 10 MeV ejected in the laser forward direction above a threshold intensity of $2 \times 10^{20}$ W/cm$^{2}$. We compare to ATI models using both tunneling (ADK-PPT) and barrier suppress… ▽ More We present measurements of integrated electron energies produced by above-threshold ionization (ATI) of neon in a laser field with intensity exceeding 10$^{20}$ W/cm$^{2}$. We observe electrons with energy exceeding 10 MeV ejected in the laser forward direction above a threshold intensity of $2 \times 10^{20}$ W/cm$^{2}$. We compare to ATI models using both tunneling (ADK-PPT) and barrier suppression ionization and observe the onset of ATI at a higher threshold intensity than predicted by these models. △ Less

Submitted 16 June, 2023; originally announced June 2023.

Report number: LLNL-JRNL-849250

arXiv:2305.18983 [pdf, other]

doi 10.1109/LRA.2023.3337701

SO(2)-Equivariant Downwash Models for Close Proximity Flight

Authors: H. Smith, A. Shankar, J. Gielis, J. Blumenkamp, A. Prorok

Abstract: Multirotors flying in close proximity induce aerodynamic wake effects on each other through propeller downwash. Conventional methods have fallen short of providing adequate 3D force-based models that can be incorporated into robust control paradigms for deploying dense formations. Thus, learning a model for these downwash patterns presents an attractive solution. In this paper, we present a novel… ▽ More Multirotors flying in close proximity induce aerodynamic wake effects on each other through propeller downwash. Conventional methods have fallen short of providing adequate 3D force-based models that can be incorporated into robust control paradigms for deploying dense formations. Thus, learning a model for these downwash patterns presents an attractive solution. In this paper, we present a novel learning-based approach for modelling the downwash forces that exploits the latent geometries (i.e. symmetries) present in the problem. We demonstrate that when trained with only 5 minutes of real-world flight data, our geometry-aware model outperforms state-of-the-art baseline models trained with more than 15 minutes of data. In dense real-world flights with two vehicles, deploying our model online improves 3D trajectory tracking by nearly 36% on average (and vertical tracking by 56%). △ Less

Submitted 25 March, 2024; v1 submitted 30 May, 2023; originally announced May 2023.

Journal ref: Smith, H., Shankar, A., Gielis, J., Blumenkamp, J., & Prorok, A. IEEE Robotics and Automation Letters 9(2) (2024) 1174-1181

arXiv:2305.15422 [pdf]

doi 10.1145/3583781.3590245

Facial Expression Recognition at the Edge: CPU vs GPU vs VPU vs TPU

Authors: Mohammadreza Mohammadi, Heath Smith, Lareb Khan, Ramtin Zand

Abstract: Facial Expression Recognition (FER) plays an important role in human-computer interactions and is used in a wide range of applications. Convolutional Neural Networks (CNN) have shown promise in their ability to classify human facial expressions, however, large CNNs are not well-suited to be implemented on resource- and energy-constrained IoT devices. In this work, we present a hierarchical framewo… ▽ More Facial Expression Recognition (FER) plays an important role in human-computer interactions and is used in a wide range of applications. Convolutional Neural Networks (CNN) have shown promise in their ability to classify human facial expressions, however, large CNNs are not well-suited to be implemented on resource- and energy-constrained IoT devices. In this work, we present a hierarchical framework for developing and optimizing hardware-aware CNNs tuned for deployment at the edge. We perform a comprehensive analysis across various edge AI accelerators including NVIDIA Jetson Nano, Intel Neural Compute Stick, and Coral TPU. Using the proposed strategy, we achieved a peak accuracy of 99.49% when testing on the CK+ facial expression recognition dataset. Additionally, we achieved a minimum inference latency of 0.39 milliseconds and a minimum power consumption of 0.52 Watts. △ Less

Submitted 16 May, 2023; originally announced May 2023.

arXiv:2305.12034 [pdf, other]

Bayesian Safety Surveillance with Adaptive Bias Correction

Authors: Fan Bu, Martijn J. Schuemie, Akihiko Nishimura, Louisa H. Smith, Kristin Kostka, Thomas Falconer, Jody-Ann McLeggon, Patrick B. Ryan, George Hripcsak, Marc A. Suchard

Abstract: Post-market safety surveillance is an integral part of mass vaccination programs. Typically relying on sequential analysis of real-world health data as they accrue, safety surveillance is challenged by the difficulty of sequential multiple testing and by biases induced by residual confounding. The current standard approach based on the maximized sequential probability ratio test (MaxSPRT) fails to… ▽ More Post-market safety surveillance is an integral part of mass vaccination programs. Typically relying on sequential analysis of real-world health data as they accrue, safety surveillance is challenged by the difficulty of sequential multiple testing and by biases induced by residual confounding. The current standard approach based on the maximized sequential probability ratio test (MaxSPRT) fails to satisfactorily address these practical challenges and it remains a rigid framework that requires pre-specification of the surveillance schedule. We develop an alternative Bayesian surveillance procedure that addresses both challenges using a more flexible framework. We adopt a joint statistical modeling approach to sequentially estimate the effect of vaccine exposure on the adverse event of interest and correct for estimation bias by simultaneously analyzing a large set of negative control outcomes through a Bayesian hierarchical model. We then compute a posterior probability of the alternative hypothesis via Markov chain Monte Carlo sampling and use it for sequential detection of safety signals. Through an empirical evaluation using six US observational healthcare databases covering more than 360 million patients, we benchmark the proposed procedure against MaxSPRT on testing errors and estimation accuracy, under two epidemiological designs, the historical comparator and the self-controlled case series. We demonstrate that our procedure substantially reduces Type 1 error rates, maintains high statistical power, delivers fast signal detection, and provides considerably more accurate estimation. As an effort to promote open science, we present all empirical results in an R ShinyApp and provide full implementation of our method in the R package EvidenceSynthesis. △ Less

Submitted 19 May, 2023; originally announced May 2023.

arXiv:2305.09580 [pdf, other]

Generate Compilers from Hardware Models!

Authors: Gus Henry Smith, Ben Kushigian, Vishal Canumalla, Andrew Cheung, René Just, Zachary Tatlock

Abstract: Compiler backends should be automatically generated from hardware design language (HDL) models of the hardware they target. Generating compiler components directly from HDL can provide stronger correctness guarantees, ease development effort, and encourage hardware exploration. Past work has already championed this idea; here we argue that advances in program synthesis make the approach more feasi… ▽ More Compiler backends should be automatically generated from hardware design language (HDL) models of the hardware they target. Generating compiler components directly from HDL can provide stronger correctness guarantees, ease development effort, and encourage hardware exploration. Past work has already championed this idea; here we argue that advances in program synthesis make the approach more feasible. We present a concrete example by demonstrating how FPGA technology mappers can be automatically generated from SystemVerilog models of an FPGA's primitives using program synthesis. △ Less

Submitted 16 May, 2023; originally announced May 2023.

Comments: 3 pages, 2 figures, to be presented at the 2023 PLARCH Workshop at FCRC

arXiv:2304.09150 [pdf, other]

Constraints on Europa's water group torus from HST/COS observations

Authors: Lorenz Roth, H. Todd Smith, Kazuo Yoshioka, Tracy M. Becker, Aljona Blöcker, Nathaniel J. Cunningham, Nickolay Ivchenko, Kurt D. Retherford, Joachim Saur, Michael Velez, Fuminori Tsuchiya

Abstract: In-situ plasma measurements as well as remote mapping of energetic neutral atoms around Jupiter provide indirect evidence that an enhancement of neutral gas is present near the orbit of the moon Europa. Simulations suggest that such a neutral gas torus can be sustained by escape from Europa's atmosphere and consists primarily of molecular hydrogen, but the neutral gas torus has not yet been measur… ▽ More In-situ plasma measurements as well as remote mapping of energetic neutral atoms around Jupiter provide indirect evidence that an enhancement of neutral gas is present near the orbit of the moon Europa. Simulations suggest that such a neutral gas torus can be sustained by escape from Europa's atmosphere and consists primarily of molecular hydrogen, but the neutral gas torus has not yet been measured directly through emissions or in-situ. Here we present observations by the Cosmic Origins Spectrograph of the Hubble Space Telescope (HST/COS) from 2020 and 2021, which scanned the equatorial plane between 8 and 10 planetary radii west of Jupiter. No neutral gas emissions are detected. We derive upper limits on the emissions and compare these to modelled emissions from electron impact and resonant scattering using a Europa torus Monte Carlo model for the neutral gases. The comparison supports the previous findings that the torus is dilute and primarily consists of molecular hydrogen. A detection of sulfur ion emissions radially inward of the Europa orbit is consistent with emissions from the extended Io torus and with sulfur ion fractional abundances as previously detected. △ Less

Submitted 18 April, 2023; originally announced April 2023.

arXiv:2303.12741 [pdf, other]

A Method for Animating Children's Drawings of the Human Figure

Authors: Harrison Jesse Smith, Qingyuan Zheng, Yifei Li, Somya Jain, Jessica K. Hodgins

Abstract: Children's drawings have a wonderful inventiveness, creativity, and variety to them. We present a system that automatically animates children's drawings of the human figure, is robust to the variance inherent in these depictions, and is simple and straightforward enough for anyone to use. We demonstrate the value and broad appeal of our approach by building and releasing the Animated Drawings Demo… ▽ More Children's drawings have a wonderful inventiveness, creativity, and variety to them. We present a system that automatically animates children's drawings of the human figure, is robust to the variance inherent in these depictions, and is simple and straightforward enough for anyone to use. We demonstrate the value and broad appeal of our approach by building and releasing the Animated Drawings Demo, a freely available public website that has been used by millions of people around the world. We present a set of experiments exploring the amount of training data needed for fine-tuning, as well as a perceptual study demonstrating the appeal of a novel twisted perspective retargeting technique. Finally, we introduce the Amateur Drawings Dataset, a first-of-its-kind annotated dataset, collected via the public demo, containing over 178,000 amateur drawings and corresponding user-accepted character bounding boxes, segmentation masks, and joint location annotations. △ Less

Submitted 4 April, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

arXiv:2303.02161 [pdf]

Exploring Fundamental Particle Acceleration and Loss Processes in Heliophysics through an Orbiting X-ray Instrument in the Jovian System

Authors: W. Dunn, G. Berland, E. Roussos, G. Clark, P. Kollmann, D. Turner, C. Feldman, T. Stallard, G. Branduardi-Raymont, E. E. Woodfield, I. J. Rae, L. C. Ray, J. A. Carter, S. T. Lindsay, Z. Yao, R. Marshall, A. N. Jaynes A., Y. Ezoe, M. Numazawa, G. B. Hospodarsky, X. Wu, D. M. Weigt, C. M. Jackman, K. Mori, Q. Nénon , et al. (19 additional authors not shown)

Abstract: Jupiter's magnetosphere is considered to be the most powerful particle accelerator in the Solar System, accelerating electrons from eV to 70 MeV and ions to GeV energies. How electromagnetic processes drive energy and particle flows, producing and removing energetic particles, is at the heart of Heliophysics. Particularly, the 2013 Decadal Strategy for Solar and Space Physics was to "Discover and… ▽ More Jupiter's magnetosphere is considered to be the most powerful particle accelerator in the Solar System, accelerating electrons from eV to 70 MeV and ions to GeV energies. How electromagnetic processes drive energy and particle flows, producing and removing energetic particles, is at the heart of Heliophysics. Particularly, the 2013 Decadal Strategy for Solar and Space Physics was to "Discover and characterize fundamental processes that occur both within the heliosphere and throughout the universe". The Jovian system offers an ideal natural laboratory to investigate all of the universal processes highlighted in the previous Decadal. The X-ray waveband has been widely used to remotely study plasma across astrophysical systems. The majority of astrophysical emissions can be grouped into 5 X-ray processes: fluorescence, thermal/coronal, scattering, charge exchange and particle acceleration. The Jovian system offers perhaps the only system that presents a rich catalog of all of these X-ray emission processes and can also be visited in-situ, affording the special possibility to directly link fundamental plasma processes with their resulting X-ray signatures. This offers invaluable ground-truths for astrophysical objects beyond the reach of in-situ exploration (e.g. brown dwarfs, magnetars or galaxy clusters that map the cosmos). Here, we show how coupling in-situ measurements with in-orbit X-ray observations of Jupiter's radiation belts, Galilean satellites, Io Torus, and atmosphere addresses fundamental heliophysics questions with wide-reaching impact across helio- and astrophysics. New developments like miniaturized X-ray optics and radiation-tolerant detectors, provide compact, lightweight, wide-field X-ray instruments perfectly suited to the Jupiter system, enabling this exciting new possibility. △ Less

Submitted 2 March, 2023; originally announced March 2023.

Comments: A White Paper for the 2024-2033 Solar and Space Physics (Heliophysics) Decadal Survey

arXiv:2301.06773 [pdf, other]

doi 10.1103/PhysRevResearch.5.L032030

Critical gradient turbulence optimization toward a compact stellarator reactor concept

Authors: G. T. Roberg-Clark, G. G. Plunk, P. Xanthopoulos, C. Nührenberg, S. A. Henneberg, H. M. Smith

Abstract: Integrating turbulence into stellarator optimization is shown by targeting the onset for the ion-temperature-gradient mode, highlighting effects of parallel connection length, local magnetic shear, and flux surface expansion. The result is a compact quasihelically symmetric stellarator configuration, admitting a set of uncomplicated coils, with significantly reduced turbulent heat fluxes compared… ▽ More Integrating turbulence into stellarator optimization is shown by targeting the onset for the ion-temperature-gradient mode, highlighting effects of parallel connection length, local magnetic shear, and flux surface expansion. The result is a compact quasihelically symmetric stellarator configuration, admitting a set of uncomplicated coils, with significantly reduced turbulent heat fluxes compared to a known stellarator. The new configuration combines low values of neoclassical transport, good alpha particle confinement, and Mercier stability at a plasma beta of almost 2$\%$. △ Less

Submitted 6 October, 2023; v1 submitted 17 January, 2023; originally announced January 2023.

Comments: 5 pages, 5 figures. Phys. Rev. Research 5, L032030 (2023)

arXiv:2212.06441 [pdf, other]

Inverse Design of High-NA Metalens for Maskless Lithography

Authors: Haejun Chung, Feng Zhang, Hao Li, Owen D. Miller, Henry I. Smith

Abstract: We demonstrate an axisymmetric inverse-designed metalens to improve the performance of zone-plate-array lithography (ZPAL), one of the maskless lithography approaches, that offer a new paradigm for nanoscale research and industry. First, we derive a computational upper bound for a unit-cell-based axisymmetric metalens. Then, we demonstrate a fabrication-compatible inverse-designed metalens with 85… ▽ More We demonstrate an axisymmetric inverse-designed metalens to improve the performance of zone-plate-array lithography (ZPAL), one of the maskless lithography approaches, that offer a new paradigm for nanoscale research and industry. First, we derive a computational upper bound for a unit-cell-based axisymmetric metalens. Then, we demonstrate a fabrication-compatible inverse-designed metalens with 85.50\% transmission normalized focusing efficiency at 0.6 numerical aperture at 405nm wavelength; a higher efficiency than a theoretical gradient index lens design (79.98\%). We also demonstrate experimental validation for our axisymmetric inverse-designed metalens via electron beam lithography. Metalens-based maskless lithography may open a new way of achieving low-cost, large-area nanofabrication. △ Less

Submitted 13 December, 2022; originally announced December 2022.

arXiv:2212.04001 [pdf, other]

TweetDrought: A Deep-Learning Drought Impacts Recognizer based on Twitter Data

Authors: Beichen Zhang, Frank Schilder, Kelly Helm Smith, Michael J. Hayes, Sherri Harms, Tsegaye Tadesse

Abstract: Acquiring a better understanding of drought impacts becomes increasingly vital under a warming climate. Traditional drought indices describe mainly biophysical variables and not impacts on social, economic, and environmental systems. We utilized natural language processing and bidirectional encoder representation from Transformers (BERT) based transfer learning to fine-tune the model on the data f… ▽ More Acquiring a better understanding of drought impacts becomes increasingly vital under a warming climate. Traditional drought indices describe mainly biophysical variables and not impacts on social, economic, and environmental systems. We utilized natural language processing and bidirectional encoder representation from Transformers (BERT) based transfer learning to fine-tune the model on the data from the news-based Drought Impact Report (DIR) and then apply it to recognize seven types of drought impacts based on the filtered Twitter data from the United States. Our model achieved a satisfying macro-F1 score of 0.89 on the DIR test set. The model was then applied to California tweets and validated with keyword-based labels. The macro-F1 score was 0.58. However, due to the limitation of keywords, we also spot-checked tweets with controversial labels. 83.5% of BERT labels were correct compared to the keyword labels. Overall, the fine-tuned BERT-based recognizer provided proper predictions and valuable information on drought impacts. The interpretation and analysis of the model were consistent with experiential domain expertise. △ Less

Submitted 7 December, 2022; originally announced December 2022.

Comments: 5 pages (+3 in appendix), 5 figures in appendix, 2 tables (+1 in appendix), ICML Workshop on Tackling Climate Change with Machine Learning Workshop, 2021

arXiv:2211.09829 [pdf, other]

doi 10.1017/S002237782300065X

Constructing precisely quasi-isodynamic magnetic fields

Authors: Alan Goodman, Katia Camacho Mata, Sophia A Henneberg, Rogerio Jorge, Matt Landreman, Gabriel Plunk, Hakan Smith, Ralf Mackenbach, Per Helander

Abstract: We present a novel method for numerically finding quasi-isodynamic stellarator magnetic fields with excellent fast-particle confinement and extremely small neoclassical transport. The method works particularly well in configurations with only one field period. We examine the properties of these newfound quasi-isodynamic configurations, including their bootstrap currents, particle confinement, and… ▽ More We present a novel method for numerically finding quasi-isodynamic stellarator magnetic fields with excellent fast-particle confinement and extremely small neoclassical transport. The method works particularly well in configurations with only one field period. We examine the properties of these newfound quasi-isodynamic configurations, including their bootstrap currents, particle confinement, and available energy for trapped-electron driven turbulence, as well as the degree to which they change when a finite pressure profile is added. We finally discuss the differences between the magnetic axes of the optimized solutions and their respective initial conditions, and conclude with the prospects for future quasi-isodynamic optimization. △ Less

Submitted 17 November, 2022; originally announced November 2022.

Comments: 25 pages, 10 figures

Showing 1–50 of 621 results for author: Smith, H