Skip to main content

Showing 1–50 of 434 results for author: Katz, D

  1. Training Next Generation AI Users and Developers at NCSA

    Authors: Daniel S. Katz, Volodymyr Kindratenko, Olena Kindratenko, Priyam Mazumdar

    Abstract: This article focuses on training work carried out in artificial intelligence (AI) at the National Center for Supercomputing Applications (NCSA) at the University of Illinois Urbana-Champaign via a research experience for undergraduates (REU) program named FoDOMMaT. It also describes why we are interested in AI, and concludes by discussing what we've learned from running this program and its predec… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2404.10486  [pdf, other

    astro-ph.GA astro-ph.SR

    Discovery of a dormant 33 solar-mass black hole in pre-release Gaia astrometry

    Authors: Gaia Collaboration, P. Panuzzo, T. Mazeh, F. Arenou, B. Holl, E. Caffau, A. Jorissen, C. Babusiaux, P. Gavras, J. Sahlmann, U. Bastian, Ł. Wyrzykowski, L. Eyer, N. Leclerc, N. Bauchet, A. Bombrun, N. Mowlavi, G. M. Seabroke, D. Teyssier, E. Balbinot, A. Helmi, A. G. A. Brown, A. Vallenari, T. Prusti, J. H. J. de Bruijne , et al. (390 additional authors not shown)

    Abstract: Gravitational waves from black-hole merging events have revealed a population of extra-galactic BHs residing in short-period binaries with masses that are higher than expected based on most stellar evolution models - and also higher than known stellar-origin black holes in our Galaxy. It has been proposed that those high-mass BHs are the remnants of massive metal-poor stars. Gaia astrometry is exp… ▽ More

    Submitted 19 April, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: 23 pages, accepted fro publication in A&A Letters. New version with small fixes

  3. arXiv:2403.19394  [pdf, ps, other

    cs.CY q-bio.OT

    Cycling on the Freeway: The Perilous State of Open Source Neuroscience Software

    Authors: Britta U. Westner, Daniel R. McCloy, Eric Larson, Alexandre Gramfort, Daniel S. Katz, Arfon M. Smith, invited co-signees

    Abstract: Most scientists need software to perform their research (Barker et al., 2020; Carver et al., 2022; Hettrick, 2014; Hettrick et al., 2014; Switters and Osimo, 2019), and neuroscientists are no exception. Whether we work with reaction times, electrophysiological signals, or magnetic resonance imaging data, we rely on software to acquire, analyze, and statistically evaluate the raw data we obtain - o… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  4. arXiv:2403.16709  [pdf, other

    hep-lat

    The crossover line in the $(T, μ)$-phase diagram of QCD

    Authors: Jana N. Guenther, Szabolcs Borsányi, Zoltan Fodor, Ruben Kara, Sandor D. Katz, Paolo Parotto, Attila Pásztor, Claudia Ratti, Kalman K. Szabó

    Abstract: An efficient way to study the QCD phase diagram at small finite density is to extrapolate thermodynamical observables from imaginary chemical potential. The phase diagram features a crossover line starting from the transition temperature already determined at zero chemical potential. In this work we focus on the Taylor expansion of this line up to $μ^4$ contributions. We present the continuum extr… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Proceedings to Quark Matter Conference 2019

    Journal ref: Nucl.Phys.A 982 (2019) 303-306

  5. arXiv:2403.08963  [pdf, other

    astro-ph.GA

    Timing the Milky Way bar formation and the accompanying radial migration episode

    Authors: Misha Haywood, Sergey Khoperskov, Valeria Cerqui, Paola Di Matteo, David Katz, Owain Snaith

    Abstract: We derive the metallicity profile of the Milky Way low-$α$ disc population from 2 to 20 kpc from the Galactic centre in 1 Gyr age bins using the astroNN catalogue, and show that it is highly structured, with a plateau between 4 and 7 kpc and a break at 10-12 kpc. We argue that these features result from the two main bar resonances, the corotation and the Outer Lindblad Resonance (OLR), respectivel… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 20 pages, 23 figures. Submitted to A&A

  6. arXiv:2403.04972  [pdf, ps, other

    math.AC math.AG

    On Abelian extensions in mixed characteristic and ramification in codimension one

    Authors: Daniel Katz, Prashanth Sridhar

    Abstract: A theorem of Paul Roberts states that the integral closure of a regular local ring in a generically abelian extension is Cohen-Macaulay, provided the characteristic of the residue field does not divide the order of the Galois group. An example of Koh shows the conclusion is false in the modular case. After a modification to the statement concerning ramification over $p$ in codimension one, we give… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 21 pages

    MSC Class: 13B05

  7. The Gaia RVS benchmark stars II. A sample of stars selected for their Gaia high radial velocity

    Authors: E. Caffau, D. Katz, A. Gómez, P. Bonifacio, R. Lallement, P. Sartoretti, L. Sbordone, M. Spite, A. Mucciarelli, R. Ibata, L. Chemin, F. Thévenin, P. Panuzzo, N. Leclerc, P. François, H. -G. Ludwig, L. Monaco, M. Haywood, C. Soubiran

    Abstract: The Gaia satellite has already provided the astronomical community with three data releases, and the Radial Velocity Spectrometer (RVS) on board Gaia has provided the radial velocity for 33 million stars. When deriving the radial velocity from the RVS spectra, several stars are measured to have large values. To verify the credibility of these measurements, we selected some bright stars with the mo… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: Astronomy and Astrophysics - A\&A, In press

  8. arXiv:2402.02824  [pdf

    cs.SE

    FAIR-USE4OS: Guidelines for Creating Impactful Open-Source Software

    Authors: Raphael Sonabend, Hugo Gruson, Leo Wolansky, Agnes Kiragga, Daniel S. Katz

    Abstract: This paper extends the FAIR (Findable, Accessible, Interoperable, Reusable) guidelines to provide criteria for assessing if software conforms to best practices in open source. By adding 'USE' (User-Centered, Sustainable, Equitable), software development can adhere to open source best practice by incorporating user-input early on, ensuring front-end designs are accessible to all possible stakeholde… ▽ More

    Submitted 3 April, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  9. arXiv:2312.07711  [pdf, other

    cs.AI

    Leveraging Large Language Models to Build and Execute Computational Workflows

    Authors: Alejandro Duque, Abdullah Syed, Kastan V. Day, Matthew J. Berry, Daniel S. Katz, Volodymyr V. Kindratenko

    Abstract: The recent development of large language models (LLMs) with multi-billion parameters, coupled with the creation of user-friendly application programming interfaces (APIs), has paved the way for automatically generating and executing code in response to straightforward human queries. This paper explores how these emerging capabilities can be harnessed to facilitate complex scientific workflows, eli… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  10. arXiv:2312.07528  [pdf, other

    hep-lat nucl-th

    Continuum extrapolated high order baryon fluctuations

    Authors: Szabolcs Borsányi, Zoltán Fodor, Jana N. Guenther, Sándor D. Katz, Paolo Parotto, Attila Pásztor, Dávid Pesznyák, Kálmán K. Szabó, Chik Him Wong

    Abstract: Fluctuations play a key role in the study of QCD phases. Lattice QCD is a valuable tool to calculate them, but going to high orders is challenging. Up to the fourth order, continuum results are available since 2015. We present the first continuum results for sixth order baryon fluctuations for temperatures between $T=130 - 200$ MeV, and eighth order at $T=145$ MeV in a fixed volume. We show that f… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: 5 pages, 2 figures (main text) + 5 pages, 7 figures (supplemental material)

  11. arXiv:2310.06551  [pdf, other

    astro-ph.SR astro-ph.GA

    Gaia Focused Product Release: Sources from Service Interface Function image analysis -- Half a million new sources in omega Centauri

    Authors: Gaia Collaboration, K. Weingrill, A. Mints, J. Castañeda, Z. Kostrzewa-Rutkowska, M. Davidson, F. De Angeli, J. Hernández, F. Torra, M. Ramos-Lerate, C. Babusiaux, M. Biermann, C. Crowley, D. W. Evans, L. Lindegren, J. M. Martín-Fleitas, L. Palaversa, D. Ruz Mieres, K. Tisanić, A. G. A. Brown, A. Vallenari, T. Prusti, J. H. J. de Bruijne, F. Arenou, A. Barbier , et al. (378 additional authors not shown)

    Abstract: Gaia's readout window strategy is challenged by very dense fields in the sky. Therefore, in addition to standard Gaia observations, full Sky Mapper (SM) images were recorded for nine selected regions in the sky. A new software pipeline exploits these Service Interface Function (SIF) images of crowded fields (CFs), making use of the availability of the full two-dimensional (2D) information. This ne… ▽ More

    Submitted 8 November, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

    Journal ref: A&A 680, A35 (2023)

  12. arXiv:2310.06295  [pdf, other

    astro-ph.GA astro-ph.CO astro-ph.IM

    Gaia Focused Product Release: A catalogue of sources around quasars to search for strongly lensed quasars

    Authors: Gaia Collaboration, A. Krone-Martins, C. Ducourant, L. Galluccio, L. Delchambre, I. Oreshina-Slezak, R. Teixeira, J. Braine, J. -F. Le Campion, F. Mignard, W. Roux, A. Blazere, L. Pegoraro, A. G. A. Brown, A. Vallenari, T. Prusti, J. H. J. de Bruijne, F. Arenou, C. Babusiaux, A. Barbier, M. Biermann, O. L. Creevey, D. W. Evans, L. Eyer, R. Guerra , et al. (376 additional authors not shown)

    Abstract: Context. Strongly lensed quasars are fundamental sources for cosmology. The Gaia space mission covers the entire sky with the unprecedented resolution of $0.18$" in the optical, making it an ideal instrument to search for gravitational lenses down to the limiting magnitude of 21. Nevertheless, the previous Gaia Data Releases are known to be incomplete for small angular separations such as those ex… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: 35 pages, 60 figures, accepted for publication by Astronomy and Astrophysics

    Journal ref: A&A 685, A130 (2024)

  13. arXiv:2310.06051  [pdf, other

    astro-ph.SR

    Gaia Focused Product Release: Radial velocity time series of long-period variables

    Authors: Gaia Collaboration, Gaia Collaboration, M. Trabucchi, N. Mowlavi, T. Lebzelter, I. Lecoeur-Taibi, M. Audard, L. Eyer, P. García-Lario, P. Gavras, B. Holl, G. Jevardat de Fombelle, K. Nienartowicz, L. Rimoldini, P. Sartoretti, R. Blomme, Y. Frémat, O. Marchal, Y. Damerdji, A. G. A. Brown, A. Guerrier, P. Panuzzo, D. Katz, G. M. Seabroke, K. Benson , et al. (382 additional authors not shown)

    Abstract: The third Gaia Data Release (DR3) provided photometric time series of more than 2 million long-period variable (LPV) candidates. Anticipating the publication of full radial-velocity (RV) in DR4, this Focused Product Release (FPR) provides RV time series for a selection of LPVs with high-quality observations. We describe the production and content of the Gaia catalog of LPV RV time series, and the… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: 36 pages, 38 figures

  14. arXiv:2309.14571  [pdf, ps, other

    hep-ex hep-ph

    Software Citation in HEP: Current State and Recommendations for the Future

    Authors: Matthew Feickert, Daniel S. Katz, Mark S. Neubauer, Elizabeth Sexton-Kennedy, Graeme A. Stewart

    Abstract: In November 2022, the HEP Software Foundation and the Institute for Research and Innovation for Software in High-Energy Physics organized a workshop on the topic of Software Citation and Recognition in HEP. The goal of the workshop was to bring together different types of stakeholders whose roles relate to software citation, and the associated credit it provides, in order to engage the community i… ▽ More

    Submitted 4 January, 2024; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: 7 pages, 2 listings. Contribution to the Proceedings of the 26th International Conference on Computing in High Energy and Nuclear Physics (CHEP 2023)

  15. arXiv:2308.14954  [pdf

    cs.CY

    Transitioning ECP Software Technology into a Foundation for Sustainable Research Software

    Authors: Gregory R. Watson, Addi Malviya-Thakur, Daniel S. Katz, Elaine M. Raybourn, Bill Hoffman, Dana Robinson, John Kellerman, Clark Roundy

    Abstract: Research software plays a crucial role in advancing scientific knowledge, but ensuring its sustainability, maintainability, and long-term viability is an ongoing challenge. The Sustainable Research Software Institute (SRSI) Model has been designed to address the concerns, and presents a comprehensive framework designed to promote sustainable practices in the research software community. However th… ▽ More

    Submitted 30 August, 2023; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: 7 pages, 1 figure

    Report number: 200366

  16. arXiv:2308.14953  [pdf

    cs.CY

    An Open Community-Driven Model For Sustainable Research Software: Sustainable Research Software Institute

    Authors: Gregory R. Watson, Addi Malviya-Thakur, Daniel S. Katz, Elaine M. Raybourn, Bill Hoffman, Dana Robinson, John Kellerman, Clark Roundy

    Abstract: Research software plays a crucial role in advancing scientific knowledge, but ensuring its sustainability, maintainability, and long-term viability is an ongoing challenge. To address these concerns, the Sustainable Research Software Institute (SRSI) Model presents a comprehensive framework designed to promote sustainable practices in the research software community. This white paper provides an i… ▽ More

    Submitted 30 August, 2023; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: 13 pages, 1 figure

    Report number: 200363

  17. Research Software Engineering in 2030

    Authors: Daniel S. Katz, Simon Hettrick

    Abstract: This position paper for an invited talk on the "Future of eScience" discusses the Research Software Engineering Movement and where it might be in 2030. Because of the authors' experiences, it is aimed globally but with examples that focus on the United States and United Kingdom.

    Submitted 27 September, 2023; v1 submitted 15 August, 2023; originally announced August 2023.

    Comments: Invited paper for 2023 IEEE Conference on eScience

  18. arXiv:2308.07467  [pdf, ps, other

    cs.IT eess.SP math.CA

    Sequences with identical autocorrelation spectra

    Authors: Daniel J. Katz, Adeebur Rahman, Michael J Ward

    Abstract: Aperiodic autocorrelation measures the similarity between a finite-length sequence of complex numbers and translates of itself. Autocorrelation is important in communications, remote sensing, and scientific instrumentation. The autocorrelation function reports the aperiodic autocorrelation at every possible translation. Knowing the autocorrelation function of a sequence is equivalent to knowing th… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: 12 pages

    MSC Class: 94A12 42A05 42A38 42A85

  19. arXiv:2308.06105  [pdf, other

    hep-lat

    Can rooted staggered fermions describe nonzero baryon density at low temperatures?

    Authors: Szabolcs Borsanyi, Zoltan Fodor, Matteo Giordano, Jana N. Guenther, Sandor D. Katz, Attila Pasztor, Chik Him Wong

    Abstract: Research on the QCD phase diagram with lattice field theory methods is dominated by the use of rooted staggered fermions, as they are the computationally cheapest discretization available. We show that rooted staggered fermions at a nonzero baryochemical potential $μ_B$ predict a sharp rise in the baryon density at low temperatures and $μ_B \gtrsim 3 m_π/2$, where $m_π$ is the Goldstone pion mass.… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: 12 pages, 6 figures

  20. arXiv:2307.15657  [pdf, ps, other

    cs.IT cs.CR cs.DM math.CO math.NT

    Almost perfect nonlinear power functions with exponents expressed as fractions

    Authors: Daniel J. Katz, Kathleen R. O'Connor, Kyle Pacheco, Yakov Sapozhnikov

    Abstract: Let $F$ be a finite field, let $f$ be a function from $F$ to $F$, and let $a$ be a nonzero element of $F$. The discrete derivative of $f$ in direction $a$ is $Δ_a f \colon F \to F$ with $(Δ_a f)(x)=f(x+a)-f(x)$. The differential spectrum of $f$ is the multiset of cardinalities of all the fibers of all the derivatives $Δ_a f$ as $a$ runs through $F^*$. The function $f$ is almost perfect nonlinear (… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

    Comments: 30 pages

  21. arXiv:2307.14566  [pdf, ps, other

    cs.IT cs.DM eess.SP math.CO math.PR

    Limiting Moments of Autocorrelation Demerit Factors of Binary Sequences

    Authors: Daniel J. Katz, Miriam E. Ramirez

    Abstract: An aperiodic binary sequence of length $\ell$ is a doubly infinite sequence $f=\ldots,f_{-1},f_0,f_1,\ldots$ with $f_j \in \{-1,1\}$ when $0 \leq j < \ell$ and and $f_j=0$ otherwise. Various problems in engineering and natural science demand binary sequences that do not resemble translates of themselves. The autocorrelation of $f$ at shift $s$ is the dot product of $f$ with the sequence obtained b… ▽ More

    Submitted 11 June, 2024; v1 submitted 26 July, 2023; originally announced July 2023.

    Comments: 27 pages

    MSC Class: 60C05; 94A55; 05A99; 05A18; 05E18

  22. arXiv:2307.14281  [pdf, ps, other

    cs.IT cs.DM eess.SP math.CO math.PR

    Moments of Autocorrelation Demerit Factors of Binary Sequences

    Authors: Daniel J. Katz, Miriam E. Ramirez

    Abstract: Sequences with low aperiodic autocorrelation are used in communications and remote sensing for synchronization and ranging. The autocorrelation demerit factor of a sequence is the sum of the squared magnitudes of its autocorrelation values at every nonzero shift when we normalize the sequence to have unit Euclidean length. The merit factor, introduced by Golay, is the reciprocal of the demerit fac… ▽ More

    Submitted 7 June, 2024; v1 submitted 26 July, 2023; originally announced July 2023.

    Comments: 41 pages

    MSC Class: 60C05; 94A55; 05A99; 05A18; 05E18

  23. arXiv:2307.11383  [pdf, ps, other

    cs.SE

    Wanted: standards for automatic reproducibility of computational experiments

    Authors: Samuel Grayson, Reed Milewicz, Joshua Teves, Daniel S. Katz, Darko Marinov

    Abstract: Those seeking to reproduce a computational experiment often need to manually look at the code to see how to build necessary libraries, configure parameters, find data, and invoke the experiment; it is not automatic. Automatic reproducibility is a more stringent goal, but working towards it would benefit the community. This work discusses a machine-readable language for specifying how to execute a… ▽ More

    Submitted 21 July, 2023; originally announced July 2023.

    Comments: Submitted to SE4RS'23 Portland, OR

  24. arXiv:2307.11060  [pdf, ps, other

    cs.SE

    The Changing Role of RSEs over the Lifetime of Parsl

    Authors: Daniel S. Katz, Ben Clifford, Yadu Babuji, Kevin Hunter Kesling, Anna Woodard, Kyle Chard

    Abstract: This position paper describes the Parsl open source research software project and its various phases over seven years. It defines four types of research software engineers (RSEs) who have been important to the project in those phases; we believe this is also applicable to other research software projects.

    Submitted 20 July, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: 3 pages

  25. arXiv:2307.07630  [pdf, other

    astro-ph.SR

    Optical Studies of Seven Bright Southern Cataclysmic Variable Stars

    Authors: John R. Thorstensen, Chase K. Alvarado-Anderson, Abigail D. Burrows, Rowan M. Goebel-Bain, David C. Katz

    Abstract: We report spectroscopic observations of seven bright southern cataclysmic variable stars, collected on a single two-week observing run using the 1.9-m Radcliffe telescope at the South African Astronomical Observatory. We used radial velocity time series, in some cases in combination with other data, to determine or clarify orbital periods for five of them, namely ATO J061.1478-31.0634, BMAM-V547,… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

    Comments: 12 pages, 13 figures. Accepted for The Astronomical Journal

  26. arXiv:2306.14414  [pdf, ps, other

    math.NT cs.CR cs.IT math.CO

    Rationality of Four-Valued Families of Weil Sums of Binomials

    Authors: Daniel J. Katz, Allison E. Wong

    Abstract: We investigate the rationality of Weil sums of binomials of the form $W^{K,s}_u=\sum_{x \in K} ψ(x^s - u x)$, where $K$ is a finite field whose canonical additive character is $ψ$, and where $u$ is an element of $K^{\times}$ and $s$ is a positive integer relatively prime to $|K^\times|$, so that $x \mapsto x^s$ is a permutation of $K$. The Weil spectrum for $K$ and $s$, which is the family of valu… ▽ More

    Submitted 6 April, 2024; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: 33 pages

    MSC Class: 11T24; 11L05; 11L40; 11T22; 11G25; 11T71; 94A55; 94A60; 94B15

  27. arXiv:2306.11615  [pdf, other

    cs.DC

    Fine-grained Policy-driven I/O Sharing for Burst Buffers

    Authors: Ed Karrels, Lei Huang, Yuhong Kan, Ishank Arora, Yinzhi Wang, Daniel S. Katz, William D. Gropp, Zhao Zhang

    Abstract: A burst buffer is a common method to bridge the performance gap between the I/O needs of modern supercomputing applications and the performance of the shared file system on large-scale supercomputers. However, existing I/O sharing methods require resource isolation, offline profiling, or repeated execution that significantly limit the utilization and applicability of these systems. Here we present… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

  28. arXiv:2306.03126  [pdf, other

    astro-ph.GA astro-ph.SR

    Stragglers of the thick disc

    Authors: Valeria Cerqui, Misha Haywood, Paola Di Matteo, David Katz, Frédéric Royer

    Abstract: Young alpha-rich (YAR) stars have been detected in the past as outliers to the local age $\rm-$ [$α$/Fe] relation. These objects are enhanced in $α$-elements but apparently younger than typical thick disc stars. We study the global kinematics and chemical properties of YAR giant stars in APOGEE DR17 survey and show that they have properties similar to those of the standard thick disc stellar popul… ▽ More

    Submitted 17 July, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: 18 Pages, 20 Figures, 1 Table; accepted for publication in Astronomy & Astrophysics

    Journal ref: A&A 676, A108 (2023)

  29. arXiv:2305.07507  [pdf, other

    cs.CL

    LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development

    Authors: Ilias Chalkidis, Nicolas Garneau, Catalina Goanta, Daniel Martin Katz, Anders Søgaard

    Abstract: In this work, we conduct a detailed analysis on the performance of legal-oriented pre-trained language models (PLMs). We examine the interplay between their original objective, acquired knowledge, and legal language understanding capacities which we define as the upstream, probing, and downstream performance, respectively. We consider not only the models' size but also the pre-training corpora use… ▽ More

    Submitted 22 May, 2023; v1 submitted 12 May, 2023; originally announced May 2023.

    Comments: 9 pages, long paper at ACL 2023 proceedings

  30. Workflows Community Summit 2022: A Roadmap Revolution

    Authors: Rafael Ferreira da Silva, Rosa M. Badia, Venkat Bala, Debbie Bard, Peer-Timo Bremer, Ian Buckley, Silvina Caino-Lores, Kyle Chard, Carole Goble, Shantenu Jha, Daniel S. Katz, Daniel Laney, Manish Parashar, Frederic Suter, Nick Tyler, Thomas Uram, Ilkay Altintas, Stefan Andersson, William Arndt, Juan Aznar, Jonathan Bader, Bartosz Balis, Chris Blanton, Kelly Rosa Braghetto, Aharon Brodutch , et al. (80 additional authors not shown)

    Abstract: Scientific workflows have become integral tools in broad scientific computing use cases. Science discovery is increasingly dependent on workflows to orchestrate large and complex scientific experiments that range from execution of a cloud-based data preprocessing pipeline to multi-facility instrument-to-edge-to-HPC computational workflows. Given the changing landscape of scientific computing and t… ▽ More

    Submitted 31 March, 2023; originally announced April 2023.

    Report number: ORNL/TM-2023/2885

  31. Overcoming Challenges to Continuous Integration in HPC

    Authors: Todd Gamblin, Daniel S. Katz

    Abstract: Continuous integration (CI) has become a ubiquitous practice in modern software development, with major code hosting services offering free automation on popular platforms. CI offers major benefits, as it enables detecting bugs in code prior to committing changes. While high-performance computing (HPC) research relies heavily on software, HPC machines are not considered "common" platforms. This pr… ▽ More

    Submitted 29 March, 2023; originally announced March 2023.

  32. arXiv:2302.12039  [pdf, other

    cs.CL cs.AI

    Natural Language Processing in the Legal Domain

    Authors: Daniel Martin Katz, Dirk Hartung, Lauritz Gerlach, Abhik Jana, Michael J. Bommarito II

    Abstract: In this paper, we summarize the current state of the field of NLP & Law with a specific focus on recent technical and substantive developments. To support our analysis, we construct and analyze a nearly complete corpus of more than six hundred NLP & Law related papers published over the past decade. Our analysis highlights several major trends. Namely, we document an increasing number of papers wr… ▽ More

    Submitted 23 February, 2023; originally announced February 2023.

    Comments: 13 pages, 7 figures, 2 tables, online source and data

  33. arXiv:2302.11838  [pdf, other

    cs.IT cs.DS

    Minimum-Entropy Coupling Approximation Guarantees Beyond the Majorization Barrier

    Authors: Spencer Compton, Dmitriy Katz, Benjamin Qi, Kristjan Greenewald, Murat Kocaoglu

    Abstract: Given a set of discrete probability distributions, the minimum entropy coupling is the minimum entropy joint distribution that has the input distributions as its marginals. This has immediate relevance to tasks such as entropic causal inference for causal graph discovery and bounding mutual information between variables that we observe separately. Since finding the minimum entropy coupling is NP-H… ▽ More

    Submitted 23 February, 2023; originally announced February 2023.

    Comments: AISTATS 2023

  34. arXiv:2301.04408  [pdf, other

    cs.CL cs.AI cs.CY

    GPT as Knowledge Worker: A Zero-Shot Evaluation of (AI)CPA Capabilities

    Authors: Jillian Bommarito, Michael Bommarito, Daniel Martin Katz, Jessica Katz

    Abstract: The global economy is increasingly dependent on knowledge workers to meet the needs of public and private organizations. While there is no single definition of knowledge work, organizations and industry groups still attempt to measure individuals' capability to engage in it. The most comprehensive assessment of capability readiness for professional knowledge workers is the Uniform CPA Examination… ▽ More

    Submitted 11 January, 2023; originally announced January 2023.

    Comments: Source code and data available in online SI at https://github.com/mjbommar/gpt-as-knowledge-worker

  35. arXiv:2212.14402  [pdf, other

    cs.CL cs.AI cs.LG

    GPT Takes the Bar Exam

    Authors: Michael Bommarito II, Daniel Martin Katz

    Abstract: Nearly all jurisdictions in the United States require a professional license exam, commonly referred to as "the Bar Exam," as a precondition for law practice. To even sit for the exam, most jurisdictions require that an applicant completes at least seven years of post-secondary education, including three years at an accredited law school. In addition, most test-takers also undergo weeks to months… ▽ More

    Submitted 29 December, 2022; originally announced December 2022.

    Comments: Additional material available online at https://github.com/mjbommar/gpt-takes-the-bar-exam

  36. The phase spiral in Gaia DR3

    Authors: T. Antoja, P. Ramos, B. García-Conde, M. Bernet, C. F. P. Laporte, D. Katz

    Abstract: We aim to study the phase spiral in the Milky Way (MW) with Gaia DR3. We used an edge detection algorithm to find the border of the phase spiral, allowing us to robustly quantify its shape at different positions and for different selections. We calculated the time of onset of the phase-mixing by determining the different turns of the phase spiral and using the vertical frequencies from commonly us… ▽ More

    Submitted 25 May, 2023; v1 submitted 22 December, 2022; originally announced December 2022.

    Comments: version after proofs corrections

    Journal ref: A&A 673, A115 (2023)

  37. arXiv:2212.05081  [pdf, other

    hep-ex cs.LG physics.comp-ph

    FAIR AI Models in High Energy Physics

    Authors: Javier Duarte, Haoyang Li, Avik Roy, Ruike Zhu, E. A. Huerta, Daniel Diaz, Philip Harris, Raghav Kansal, Daniel S. Katz, Ishaan H. Kavoori, Volodymyr V. Kindratenko, Farouk Mokhtar, Mark S. Neubauer, Sang Eon Park, Melissa Quinnan, Roger Rusack, Zhizhen Zhao

    Abstract: The findable, accessible, interoperable, and reusable (FAIR) data principles provide a framework for examining, evaluating, and improving how data is shared to facilitate scientific discovery. Generalizing these principles to research software and other digital products is an active area of research. Machine learning (ML) models -- algorithms that have been trained on data without being explicitly… ▽ More

    Submitted 29 December, 2023; v1 submitted 9 December, 2022; originally announced December 2022.

    Comments: 34 pages, 9 figures, 10 tables

    Journal ref: Mach. Learn.: Sci. Technol. 4 (2023) 045062

  38. Giving RSEs a Larger Stage through the Better Scientific Software Fellowship

    Authors: William F. Godoy, Ritu Arora, Keith Beattie, David E. Bernholdt, Sarah E. Bratt, Daniel S. Katz, Ignacio Laguna, Amiya K. Maji, Addi Malviya Thakur, Rafael M. Mudafort, Nitin Sukhija, Damian Rouson, Cindy Rubio-González, Karan Vahi

    Abstract: The Better Scientific Software Fellowship (BSSwF) was launched in 2018 to foster and promote practices, processes, and tools to improve developer productivity and software sustainability of scientific codes. BSSwF's vision is to grow the community with practitioners, leaders, mentors, and consultants to increase the visibility of scientific software production and sustainability. Over the last fiv… ▽ More

    Submitted 14 November, 2022; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: submitted to Computing in Science & Engineering (CiSE), Special Issue on the Future of Research Software Engineers in the US

  39. arXiv:2210.08973  [pdf, ps, other

    cs.CY cs.HC cs.LG hep-ex

    FAIR for AI: An interdisciplinary and international community building perspective

    Authors: E. A. Huerta, Ben Blaiszik, L. Catherine Brinson, Kristofer E. Bouchard, Daniel Diaz, Caterina Doglioni, Javier M. Duarte, Murali Emani, Ian Foster, Geoffrey Fox, Philip Harris, Lukas Heinrich, Shantenu Jha, Daniel S. Katz, Volodymyr Kindratenko, Christine R. Kirkpatrick, Kati Lassila-Perini, Ravi K. Madduri, Mark S. Neubauer, Fotis E. Psomopoulos, Avik Roy, Oliver Rübel, Zhizhen Zhao, Ruike Zhu

    Abstract: A foundational set of findable, accessible, interoperable, and reusable (FAIR) principles were proposed in 2016 as prerequisites for proper data management and stewardship, with the goal of enabling the reusability of scholarly data. The principles were also meant to apply to other digital assets, at a high level, and over time, the FAIR guiding principles have been re-interpreted or extended to i… ▽ More

    Submitted 1 August, 2023; v1 submitted 30 September, 2022; originally announced October 2022.

    Comments: 10 pages, comments welcome!; v2: 12 pages, accepted to Scientific Data

    ACM Class: I.2.0; E.0

    Journal ref: Scientific Data 10, 487 (2023)

  40. Research Software Engineers: Career Entry Points and Training Gaps

    Authors: Ian A. Cosden, Kenton McHenry, Daniel S. Katz

    Abstract: As software has become more essential to research across disciplines, and as the recognition of this fact has grown, the importance of professionalizing the development and maintenance of this software has also increased. The community of software professionals who work on this software have come together under the title Research Software Engineer (RSE) over the last decade. This has led to the fo… ▽ More

    Submitted 15 March, 2023; v1 submitted 9 October, 2022; originally announced October 2022.

    Comments: Accepted by IEEE Computing in Science & Engineering (CiSE): Special Issue on the Future of Research Software Engineers in the US

  41. funcX: Federated Function as a Service for Science

    Authors: Zhuozhao Li, Ryan Chard, Yadu Babuji, Ben Galewsky, Tyler Skluzacek, Kirill Nagaitsev, Anna Woodard, Ben Blaiszik, Josh Bryan, Daniel S. Katz, Ian Foster, Kyle Chard

    Abstract: funcX is a distributed function as a service (FaaS) platform that enables flexible, scalable, and high performance remote function execution. Unlike centralized FaaS systems, funcX decouples the cloud-hosted management functionality from the edge-hosted execution functionality. funcX's endpoint software can be deployed, by users or administrators, on arbitrary laptops, clouds, clusters, and superc… ▽ More

    Submitted 23 September, 2022; originally announced September 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2005.04215

  42. arXiv:2208.05398  [pdf, other

    hep-lat hep-ph nucl-th

    Equation of state of a hot-and-dense quark gluon plasma: lattice simulations at real $μ_B$ vs. extrapolations

    Authors: Szabolcs Borsanyi, Zoltan Fodor, Matteo Giordano, Jana N. Guenther, Sandor D. Katz, Attila Pasztor, Chik Him Wong

    Abstract: The equation of state of the quark gluon plasma is a key ingredient of heavy ion phenomenology. In addition to the traditional Taylor method, several novel approximation schemes have been proposed with the aim of calculating it at finite baryon density. In order to gain a pragmatic understanding of the limits of these schemes, we compare them to direct results at $μ_B>0$, using reweighting techniq… ▽ More

    Submitted 12 August, 2022; v1 submitted 10 August, 2022; originally announced August 2022.

    Comments: 7 pages, 3 figures

  43. Gaia Data Release 3: Summary of the content and survey properties

    Authors: Gaia Collaboration, A. Vallenari, A. G. A. Brown, T. Prusti, J. H. J. de Bruijne, F. Arenou, C. Babusiaux, M. Biermann, O. L. Creevey, C. Ducourant, D. W. Evans, L. Eyer, R. Guerra, A. Hutton, C. Jordi, S. A. Klioner, U. L. Lammers, L. Lindegren, X. Luri, F. Mignard, C. Panem, D. Pourbaix, S. Randich, P. Sartoretti, C. Soubiran , et al. (431 additional authors not shown)

    Abstract: We present the third data release of the European Space Agency's Gaia mission, GDR3. The GDR3 catalogue is the outcome of the processing of raw data collected with the Gaia instruments during the first 34 months of the mission by the Gaia Data Processing and Analysis Consortium. The GDR3 catalogue contains the same source list, celestial positions, proper motions, parallaxes, and broad band photom… ▽ More

    Submitted 30 July, 2022; originally announced August 2022.

    Comments: 23 pages, 2 figures

  44. Gaia Data Release 3: Reflectance spectra of Solar System small bodies

    Authors: Gaia Collaboration, L. Galluccio, M. Delbo, F. De Angeli, T. Pauwels, P. Tanga, F. Mignard, A. Cellino, A. G. A. Brown, K. Muinonen, A. Penttila, S. Jordan, A. Vallenari, T. Prusti, J. H. J. de Bruijne, F. Arenou, C. Babusiaux, M. Biermann, O. L. Creevey, C. Ducourant, D. W. Evans, L. Eyer, R. Guerra, A. Hutton, C. Jordi , et al. (422 additional authors not shown)

    Abstract: The Gaia mission of the European Space Agency (ESA) has been routinely observing Solar System objects (SSOs) since the beginning of its operations in August 2014. The Gaia data release three (DR3) includes, for the first time, the mean reflectance spectra of a selected sample of 60 518 SSOs, primarily asteroids, observed between August 5, 2014, and May 28, 2017. Each reflectance spectrum was deriv… ▽ More

    Submitted 24 June, 2022; originally announced June 2022.

    Comments: 30 pages, 26 figures

  45. arXiv:2206.10986  [pdf, other

    astro-ph.SR astro-ph.GA

    Gaia Data Release 3: Properties of the line broadening parameter derived with the Radial Velocity Spectrometer (RVS)

    Authors: Y. Frémat, F. Royer, O. Marchal, R. Blomme, P. Sartoretti, A. Guerrier, P. Panuzzo, D. Katz, G. M. Seabroke, F. Thévenin, M. Cropper, K. Benson, Y. Damerdji, R. Haigron, A. Lobel, M. Smith, S. G. Baker, L. Chemin, M. David, C. Dolding, E. Gosset, K. Janßen, G. Jasniewicz, G. Plum, N. Samaras , et al. (16 additional authors not shown)

    Abstract: The third release of the Gaia catalogue contains the radial velocities for 33,812,183 stars having effective temperatures ranging from 3100 K to 14,500 K. The measurements are based on the comparison of the observed RVS spectrum (wavelength coverage: 846--870 nm, median resolving power: 11,500) to synthetic data broadened to the adequate Along-Scan Line Spread Function. The additional line-broaden… ▽ More

    Submitted 27 June, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: 19 pages, 17 figures, see https://www.cosmos.esa.int/web/gaia/dr3-papers Paper accepted for publication in Astronomy and Astrophysics on 23th June 2022

    Journal ref: A&A 674, A8 (2023)

  46. arXiv:2206.09044  [pdf, other

    cs.GT

    Universal Complexity Bounds Based on Value Iteration and Application to Entropy Games

    Authors: Xavier Allamigeon, Stéphane Gaubert, Ricardo D. Katz, Mateusz Skomra

    Abstract: We develop value iteration-based algorithms to solve in a unified manner different classes of combinatorial zero-sum games with mean-payoff type rewards. These algorithms rely on an oracle, evaluating the dynamic programming operator up to a given precision. We show that the number of calls to the oracle needed to determine exact optimal (positional) strategies is, up to a factor polynomial in the… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: 41 pages, 7 figures

  47. Gaia Data Release 3: Mapping the asymmetric disc of the Milky Way

    Authors: Gaia Collaboration, R. Drimmel, M. Romero-Gomez, L. Chemin, P. Ramos, E. Poggio, V. Ripepi, R. Andrae, R. Blomme, T. Cantat-Gaudin, A. Castro-Ginard, G. Clementini, F. Figueras, M. Fouesneau, Y. Fremat, K. Jardine, S. Khanna, A. Lobel, D. J. Marshall, T. Muraveva, A. G. A. Brown, A. Vallenari, T. Prusti, J. H. J. de Bruijne, F. Arenou , et al. (431 additional authors not shown)

    Abstract: With the most recent Gaia data release the number of sources with complete 6D phase space information (position and velocity) has increased to well over 33 million stars, while stellar astrophysical parameters are provided for more than 470 million sources, in addition to the identification of over 11 million variable stars. Using the astrophysical parameters and variability classifications provid… ▽ More

    Submitted 5 August, 2022; v1 submitted 13 June, 2022; originally announced June 2022.

    Comments: 35 pages, 27 figures, accepted for publication in A&A special Gaia DR3 issue. V2: abstract completed. V3: complete author list and link to data: https://drive.google.com/drive/u/1/folders/1yOJPjYmM7QK5XVsqaiSOTuwDQNti2LlZ

    Journal ref: A&A 674, A37 (2023)

  48. Gaia Data Release 3: Pulsations in main sequence OBAF-type stars

    Authors: Gaia Collaboration, J. De Ridder, V. Ripepi, C. Aerts, L. Palaversa, L. Eyer, B. Holl, M. Audard, L. Rimoldini, A. G. A. Brown, A. Vallenari, T. Prusti, J. H. J. de Bruijne, F. Arenou, C. Babusiaux, M. Biermann, O. L. Creevey, C. Ducourant, D. W. Evans, R. Guerra, A. Hutton, C. Jordi, S. A. Klioner, U. L. Lammers, L. Lindegren , et al. (423 additional authors not shown)

    Abstract: The third Gaia data release provides photometric time series covering 34 months for about 10 million stars. For many of those stars, a characterisation in Fourier space and their variability classification are also provided. This paper focuses on intermediate- to high-mass (IHM) main sequence pulsators M >= 1.3 Msun) of spectral types O, B, A, or F, known as beta Cep, slowly pulsating B (SPB), del… ▽ More

    Submitted 16 August, 2022; v1 submitted 13 June, 2022; originally announced June 2022.

    Journal ref: A&A 674, A36 (2023)

  49. arXiv:2206.05902  [pdf, other

    astro-ph.GA astro-ph.IM

    Gaia Data Release 3 Properties and validation of the radial velocities

    Authors: D. Katz, P. Sartoretti, A. Guerrier, P. Panuzzo, G. M. Seabroke, F. Thévenin, M. Cropper, K. Benson, R. Blomme, R. Haigron, O. Marchal, M. Smith, S. Baker, L. Chemin, Y. Damerdji, M. David, C. Dolding, Y. Frémat, E. Gosset, K. Janßen, G. Jasniewicz, A. Lobel, G. Plum, N. Samaras, O. Snaith , et al. (25 additional authors not shown)

    Abstract: Gaia Data Release 3 (Gaia DR3) contains the second release of the combined radial velocities. It is based on the spectra collected during the first 34 months of the nominal mission. The longer time baseline and the improvements of the pipeline made it possible to push the processing limit, from Grvs = 12 in Gaia DR2, to Grvs = 14 mag. In this article, we describe the new functionalities implemente… ▽ More

    Submitted 13 June, 2022; originally announced June 2022.

    Comments: Sumitted to A&A

    Journal ref: A&A 674, A5 (2023)

  50. arXiv:2206.05870  [pdf, other

    astro-ph.SR astro-ph.EP astro-ph.GA astro-ph.IM

    Gaia Data Release 3: A Golden Sample of Astrophysical Parameters

    Authors: Gaia Collaboration, O. L. Creevey, L. M. Sarro, A. Lobel, E. Pancino, R. Andrae, R. L. Smart, G. Clementini, U. Heiter, A. J. Korn, M. Fouesneau, Y. Frémat, F. De Angeli, A. Vallenari, D. L. Harrison, F. Thévenin, C. Reylé, R. Sordo, A. Garofalo, A. G. A. Brown, L. Eyer, T. Prusti, J. H. J. de Bruijne, F. Arenou, C. Babusiaux , et al. (423 additional authors not shown)

    Abstract: Gaia Data Release 3 (DR3) provides a wealth of new data products for the astronomical community to exploit, including astrophysical parameters for a half billion stars. In this work we demonstrate the high quality of these data products and illustrate their use in different astrophysical contexts. We query the astrophysical parameter tables along with other tables in Gaia DR3 to derive the samples… ▽ More

    Submitted 12 June, 2022; originally announced June 2022.

    Comments: 35 pages, (incl 6 pages references, acknowledgements, affiliations), 37 figures, A&A accepted

    Journal ref: A&A 674, A39 (2023)