subscribe to arXiv mailings

The little coadd that could: Estimating shear from coadded images

Authors: Robert Armstrong, Erin Sheldon, Eric Huff, Jim Bosch, Eli Rykoff, Rachel Mandelbaum, Arun Kannawadi, Peter Melchior, Robert Lupton, Matthew R. Becker, Yusra Al-Sayyed, The LSST Dark Energy Science Collaboration

Abstract: Upcoming wide field surveys will have many overlapping epochs of the same region of sky. The conventional wisdom is that in order to reduce the errors sufficiently for systematics-limited measurements, like weak lensing, we must do simultaneous fitting of all the epochs. Using current algorithms this will require a significant amount of computing time and effort. In this paper, we revisit the pote… ▽ More Upcoming wide field surveys will have many overlapping epochs of the same region of sky. The conventional wisdom is that in order to reduce the errors sufficiently for systematics-limited measurements, like weak lensing, we must do simultaneous fitting of all the epochs. Using current algorithms this will require a significant amount of computing time and effort. In this paper, we revisit the potential of using coadds for shear measurements. We show on a set of image simulations that the multiplicative shear bias can be constrained below the 0.1% level on coadds, which is sufficient for future lensing surveys. We see no significant differences between simultaneous fitting and coadded approaches for two independent shear codes: Metacalibration and BFD. One caveat of our approach is the assumption of a principled coadd, i.e. the PSF is mathematically well-defined for all the input images. This requires us to reject CCD images that do not fully cover the coadd region. We estimate that the number of epochs that must be rejected for a survey like LSST is on the order of 20%, resulting in a small loss in depth of less than 0.1 magnitudes. We also put forward a cell-based coaddition scheme that meets the above requirements for unbiased weak lensing shear estimation in the context of LSST. △ Less

Submitted 1 July, 2024; originally announced July 2024.

arXiv:2405.12195 [pdf, other]

Developers' Perceptions on the Impact of ChatGPT in Software Development: A Survey

Authors: Thiago S. Vaillant, Felipe Deveza de Almeida, Paulo Anselmo M. S. Neto, Cuiyun Gao, Jan Bosch, Eduardo Santana de Almeida

Abstract: As Large Language Models (LLMs), including ChatGPT and analogous systems, continue to advance, their robust natural language processing capabilities and diverse applications have garnered considerable attention. Nonetheless, despite the increasing acknowledgment of the convergence of Artificial Intelligence (AI) and Software Engineering (SE), there is a lack of studies involving the impact of this… ▽ More As Large Language Models (LLMs), including ChatGPT and analogous systems, continue to advance, their robust natural language processing capabilities and diverse applications have garnered considerable attention. Nonetheless, despite the increasing acknowledgment of the convergence of Artificial Intelligence (AI) and Software Engineering (SE), there is a lack of studies involving the impact of this convergence on the practices and perceptions of software developers. Understanding how software developers perceive and engage with AI tools, such as ChatGPT, is essential for elucidating the impact and potential challenges of incorporating AI-driven tools in the software development process. In this paper, we conducted a survey with 207 software developers to understand the impact of ChatGPT on software quality, productivity, and job satisfaction. Furthermore, the study delves into developers' expectations regarding future adaptations of ChatGPT, concerns about potential job displacement, and perspectives on regulatory interventions. △ Less

Submitted 20 May, 2024; originally announced May 2024.

Comments: 31 pages, 9 figures

ACM Class: D.2.0

arXiv:2403.09987 [pdf, other]

Trusting the Search: Unraveling Human Trust in Health Information from Google and ChatGPT

Authors: Xin Sun, Rongjun Ma, Xiaochang Zhao, Zhuying Li, Janne Lindqvist, Abdallah El Ali, Jos A. Bosch

Abstract: People increasingly rely on online sources for health information seeking due to their convenience and timeliness, traditionally using search engines like Google as the primary search agent. Recently, the emergence of generative Artificial Intelligence (AI) has made Large Language Model (LLM) powered conversational agents such as ChatGPT a viable alternative for health information search. However,… ▽ More People increasingly rely on online sources for health information seeking due to their convenience and timeliness, traditionally using search engines like Google as the primary search agent. Recently, the emergence of generative Artificial Intelligence (AI) has made Large Language Model (LLM) powered conversational agents such as ChatGPT a viable alternative for health information search. However, while trust is crucial for adopting the online health advice, the factors influencing people's trust judgments in health information provided by LLM-powered conversational agents remain unclear. To address this, we conducted a mixed-methods, within-subjects lab study (N=21) to explore how interactions with different agents (ChatGPT vs. Google) across three health search tasks influence participants' trust judgments of the search results as well as the search agents themselves. Our key findings showed that: (a) participants' trust levels in ChatGPT were significantly higher than Google in the context of health information seeking; (b) there is a significant correlation between trust in health-related information and trust in the search agent, however only for Google; (c) the type of search tasks did not affect participants' perceived trust; and (d) participants' prior knowledge, the style of information presentation, and the interactive manner of using search agents were key determinants of trust in the health-related information. Our study taps into differences in trust perceptions when using traditional search engines compared to LLM-powered conversational agents. We highlight the potential role LLMs play in health-related information-seeking contexts, where they excel as stepping stones for further search. We contribute key factors and considerations for ensuring effective and reliable personal health information seeking in the age of generative AI. △ Less

Submitted 14 March, 2024; originally announced March 2024.

Comments: 24 pages

ACM Class: F.2.2, I.2.7

arXiv:2403.00365 [pdf, other]

Can a Funny Chatbot Make a Difference? Infusing Humor into Conversational Agent for Behavioral Intervention

Authors: Xin Sun, Isabelle Teljeur, Zhuying Li, Jos A. Bosch

Abstract: Regular physical activity is crucial for reducing the risk of non-communicable disease (NCD). With NCDs on the rise globally, there is an urgent need for effective health interventions, with chatbots emerging as a viable and cost-effective option because of limited healthcare accessibility. Although health professionals often utilize behavior change techniques (BCTs) to boost physical activity lev… ▽ More Regular physical activity is crucial for reducing the risk of non-communicable disease (NCD). With NCDs on the rise globally, there is an urgent need for effective health interventions, with chatbots emerging as a viable and cost-effective option because of limited healthcare accessibility. Although health professionals often utilize behavior change techniques (BCTs) to boost physical activity levels and enhance client engagement and motivation by affiliative humor, the efficacy of humor in chatbot-delivered interventions is not well-understood. This study conducted a randomized controlled trial to examine the impact of the generative humorous communication style in a 10-day chatbot-delivered intervention for physical activity. It further investigated if user engagement and motivation act as mediators between the communication style and changes in physical activity levels. 66 participants engaged with the chatbots across three groups (humorous, non-humorous, and no-intervention) and responded to daily ecological momentary assessment questionnaires assessing engagement, motivation, and physical activity levels. Multilevel time series analyses revealed that an affiliative humorous communication style positively impacted physical activity levels over time, with user engagement acting as a mediator in this relationship, whereas motivation did not. These findings clarify the role of humorous communication style in chatbot-delivered physical activity interventions, offering valuable insights for future development of intelligent conversational agents incorporating humor. △ Less

Submitted 1 March, 2024; originally announced March 2024.

arXiv:2402.13387 [pdf]

DistriFS: A Platform and User Agnostic Approach to File Distribution

Authors: Julian Boesch

Abstract: In an age where the distribution of information is crucial, current file sharing solutions suffer significant deficiencies. Popular systems such as Google Drive, torrenting and IPFS suffer issues with compatibility, accessibility and censorship. This paper introduces DistriFS, a novel decentralized approach tailored for efficient and large-scale distribution of files. The architecture of DistriFS… ▽ More In an age where the distribution of information is crucial, current file sharing solutions suffer significant deficiencies. Popular systems such as Google Drive, torrenting and IPFS suffer issues with compatibility, accessibility and censorship. This paper introduces DistriFS, a novel decentralized approach tailored for efficient and large-scale distribution of files. The architecture of DistriFS is grounded in three foundational pillars: scalability, security, and seamless integration. The proposed server implementation harnesses the power of Golang, ensuring near-universal interoperability across operating systems and hardware. Moreover, the use of the HTTP protocol eliminates the need for additional software to access the network, ensuring compatibility across all major operating systems and facilitating effortless downloads. The design and efficacy of DistriFS represent a significant advancement in the realm of file distribution systems, offering a scalable and secure alternative to current centralized and decentralized models. △ Less

Submitted 20 February, 2024; originally announced February 2024.

arXiv:2402.06002 [pdf, other]

Every Datapoint Counts: Stellar Flares as a Case Study of Atmosphere Aided Studies of Transients in the LSST Era

Authors: Riley W. Clarke, James R. A. Davenport, John Gizis, Melissa L. Graham, Xiaolong Li, Willow Fortino, Ian Sullivan, Yusra Alsayyad, James Bosch, Robert A. Knop, Federica Bianco

Abstract: Due to their short timescale, stellar flares are a challenging target for the most modern synoptic sky surveys. The upcoming Vera C. Rubin Legacy Survey of Space and Time (LSST), a project designed to collect more data than any precursor survey, is unlikely to detect flares with more than one data point in its main survey. We developed a methodology to enable LSST studies of stellar flares, with a… ▽ More Due to their short timescale, stellar flares are a challenging target for the most modern synoptic sky surveys. The upcoming Vera C. Rubin Legacy Survey of Space and Time (LSST), a project designed to collect more data than any precursor survey, is unlikely to detect flares with more than one data point in its main survey. We developed a methodology to enable LSST studies of stellar flares, with a focus on flare temperature and temperature evolution, which remain poorly constrained compared to flare morphology. By leveraging the sensitivity expected from the Rubin system, Differential Chromatic Refraction can be used to constrain flare temperature from a single-epoch detection, which will enable statistical studies of flare temperatures and constrain models of the physical processes behind flare emission using the unprecedentedly high volume of data produced by Rubin over the 10-year LSST. We model the refraction effect as a function of the atmospheric column density, photometric filter, and temperature of the flare, and show that flare temperatures at or above ~4,000K can be constrained by a single g-band observation at airmass X > 1.2, given the minimum specified requirement on single-visit relative astrometric accuracy of LSST, and that a surprisingly large number of LSST observations is in fact likely be conducted at X > 1.2, in spite of image quality requirements pushing the survey to preferentially low X. Having failed to measure flare DCR in LSST precursor surveys, we make recommendations on survey design and data products that enable these studies in LSST and other future surveys. △ Less

Submitted 8 February, 2024; originally announced February 2024.

Comments: 18 pages, 16 figures, 1 table. Submitted to the Astrophysical Journal Supplement Series

arXiv:2402.00011 [pdf]

doi 10.1016/j.jss.2023.111945

Choosing the Right Path for AI Integration in Engineering Companies: A Strategic Guide

Authors: Rimma Dzhusupova, Jan Bosch, Helena Holmstrom Olsson

Abstract: The Engineering, Procurement and Construction (EPC) businesses operating within the energy sector are recognizing the increasing importance of Artificial Intelligence (AI). Many EPC companies and their clients have realized the benefits of applying AI to their businesses in order to reduce manual work, drive productivity, and streamline future operations of engineered installations in a highly com… ▽ More The Engineering, Procurement and Construction (EPC) businesses operating within the energy sector are recognizing the increasing importance of Artificial Intelligence (AI). Many EPC companies and their clients have realized the benefits of applying AI to their businesses in order to reduce manual work, drive productivity, and streamline future operations of engineered installations in a highly competitive industry. The current AI market offers various solutions and services to support this industry, but organizations must understand how to acquire AI technology in the most beneficial way based on their business strategy and available resources. This paper presents a framework for EPC companies in their transformation towards AI. Our work is based on examples of project execution of AI-based products development at one of the biggest EPC contractors worldwide and on insights from EPC vendor companies already integrating AI into their engineering solutions. The paper covers the entire life cycle of building AI solutions, from initial business understanding to deployment and further evolution. The framework identifies how various factors influence the choice of approach toward AI project development within large international engineering corporations. By presenting a practical guide for optimal approach selection, this paper contributes to the research in AI project management and organizational strategies for integrating AI technology into businesses. The framework might also help engineering companies choose the optimum AI approach to create business value. △ Less

Submitted 25 December, 2023; originally announced February 2024.

Report number: JSS_111945

Journal ref: The Journal of Systems & Software, 2023

arXiv:2310.09207 [pdf, other]

Four-Dimensional Computational Ultrasound Imaging of Brain Haemodynamics

Authors: Michael D. Brown, Bastian S. Generowicz, Stephanie Dijkhuizen, Sebastiaan K. E. Koekkoek, Christos Strydis, Johannes G. Bosch, Petros Arvanitis, Geert Springeling, Geert J. T. Leus, Chris I. De Zeeuw, Pieter Kruizinga

Abstract: Four-dimensional ultrasound imaging of complex biological systems such as the brain is technically challenging because of the spatiotemporal sampling requirements. We present computational ultrasound imaging (cUSi), a new imaging method that uses complex ultrasound fields that can be generated with simple hardware and a physical wave prediction model to alleviate the sampling constraints. cUSi all… ▽ More Four-dimensional ultrasound imaging of complex biological systems such as the brain is technically challenging because of the spatiotemporal sampling requirements. We present computational ultrasound imaging (cUSi), a new imaging method that uses complex ultrasound fields that can be generated with simple hardware and a physical wave prediction model to alleviate the sampling constraints. cUSi allows for high-resolution four-dimensional imaging of brain haemodynamics in awake and anesthetized mice. △ Less

Submitted 13 October, 2023; originally announced October 2023.

arXiv:2310.08995 [pdf, ps, other]

doi 10.1051/0004-6361/202346191

Scaling slowly rotating asteroids by stellar occultations

Authors: A. Marciniak, J. Ďurech, A. Choukroun, J. Hanuš, W. Ogłoza, R. Szakáts, L. Molnár, A. Pál, F. Monteiro, E. Frappa, W. Beisker, H. Pavlov, J. Moore, R. Adomavičienė, R. Aikawa, S. Andersson, P. Antonini, Y. Argentin, A. Asai, P. Assoignon, J. Barton, P. Baruffetti, K. L. Bath, R. Behrend, L. Benedyktowicz , et al. (154 additional authors not shown)

Abstract: As evidenced by recent survey results, majority of asteroids are slow rotators (P>12 h), but lack spin and shape models due to selection bias. This bias is skewing our overall understanding of the spins, shapes, and sizes of asteroids, as well as of their other properties. Also, diameter determinations for large (>60km) and medium-sized asteroids (between 30 and 60 km) often vary by over 30% for m… ▽ More As evidenced by recent survey results, majority of asteroids are slow rotators (P>12 h), but lack spin and shape models due to selection bias. This bias is skewing our overall understanding of the spins, shapes, and sizes of asteroids, as well as of their other properties. Also, diameter determinations for large (>60km) and medium-sized asteroids (between 30 and 60 km) often vary by over 30% for multiple reasons. Our long-term project is focused on a few tens of slow rotators with periods of up to 60 hours. We aim to obtain their full light curves and reconstruct their spins and shapes. We also precisely scale the models, typically with an accuracy of a few percent. We used wide sets of dense light curves for spin and shape reconstructions via light-curve inversion. Precisely scaling them with thermal data was not possible here because of poor infrared data: large bodies are too bright for WISE mission. Therefore, we recently launched a campaign among stellar occultation observers, to scale these models and to verify the shape solutions, often allowing us to break the mirror pole ambiguity. The presented scheme resulted in shape models for 16 slow rotators, most of them for the first time. Fitting them to stellar occultations resolved previous inconsistencies in size determinations. For around half of the targets, this fitting also allowed us to identify a clearly preferred pole solution, thus removing the ambiguity inherent to light-curve inversion. We also address the influence of the uncertainty of the shape models on the derived diameters. Overall, our project has already provided reliable models for around 50 slow rotators. Such well-determined and scaled asteroid shapes will, e.g. constitute a solid basis for density determinations when coupled with mass information. Spin and shape models continue to fill the gaps caused by various biases. △ Less

Submitted 13 October, 2023; originally announced October 2023.

Comments: Accepted to Astronomy & Astrophysics. 12 pages + appendices

Journal ref: A&A 679, A60 (2023)

arXiv:2309.02936 [pdf, other]

EdgeFL: A Lightweight Decentralized Federated Learning Framework

Authors: Hongyi Zhang, Jan Bosch, Helena Holmström Olsson

Abstract: Federated Learning (FL) has emerged as a promising approach for collaborative machine learning, addressing data privacy concerns. However, existing FL platforms and frameworks often present challenges for software engineers in terms of complexity, limited customization options, and scalability limitations. In this paper, we introduce EdgeFL, an edge-only lightweight decentralized FL framework, des… ▽ More Federated Learning (FL) has emerged as a promising approach for collaborative machine learning, addressing data privacy concerns. However, existing FL platforms and frameworks often present challenges for software engineers in terms of complexity, limited customization options, and scalability limitations. In this paper, we introduce EdgeFL, an edge-only lightweight decentralized FL framework, designed to overcome the limitations of centralized aggregation and scalability in FL deployments. By adopting an edge-only model training and aggregation approach, EdgeFL eliminates the need for a central server, enabling seamless scalability across diverse use cases. With a straightforward integration process requiring just four lines of code (LOC), software engineers can easily incorporate FL functionalities into their AI products. Furthermore, EdgeFL offers the flexibility to customize aggregation functions, empowering engineers to adapt them to specific needs. Based on the results, we demonstrate that EdgeFL achieves superior performance compared to existing FL platforms/frameworks. Our results show that EdgeFL reduces weights update latency and enables faster model evolution, enhancing the efficiency of edge devices. Moreover, EdgeFL exhibits improved classification accuracy compared to traditional centralized FL approaches. By leveraging EdgeFL, software engineers can harness the benefits of federated learning while overcoming the challenges associated with existing FL platforms/frameworks. △ Less

Submitted 6 September, 2023; originally announced September 2023.

arXiv:2308.08062 [pdf, other]

doi 10.1051/0004-6361/202346892

A large topographic feature on the surface of the trans-Neptunian object (307261) 2002 MS$_4$ measured from stellar occultations

Authors: F. L. Rommel, F. Braga-Ribas, J. L. Ortiz, B. Sicardy, P. Santos-Sanz, J. Desmars, J. I. B. Camargo, R. Vieira-Martins, M. Assafin, B. E. Morgado, R. C. Boufleur, G. Benedetti-Rossi, A. R. Gomes-Júnior, E. Fernández-Valenzuela, B. J. Holler, D. Souami, R. Duffard, G. Margoti, M. Vara-Lubiano, J. Lecacheux, J. L. Plouvier, N. Morales, A. Maury, J. Fabrega, P. Ceravolo , et al. (179 additional authors not shown)

Abstract: This work aims at constraining the size, shape, and geometric albedo of the dwarf planet candidate 2002 MS4 through the analysis of nine stellar occultation events. Using multichord detection, we also studied the object's topography by analyzing the obtained limb and the residuals between observed chords and the best-fitted ellipse. We predicted and organized the observational campaigns of nine st… ▽ More This work aims at constraining the size, shape, and geometric albedo of the dwarf planet candidate 2002 MS4 through the analysis of nine stellar occultation events. Using multichord detection, we also studied the object's topography by analyzing the obtained limb and the residuals between observed chords and the best-fitted ellipse. We predicted and organized the observational campaigns of nine stellar occultations by 2002 MS4 between 2019 and 2022, resulting in two single-chord events, four double-chord detections, and three events with three to up to sixty-one positive chords. Using 13 selected chords from the 8 August 2020 event, we determined the global elliptical limb of 2002 MS4. The best-fitted ellipse, combined with the object's rotational information from the literature, constrains the object's size, shape, and albedo. Additionally, we developed a new method to characterize topography features on the object's limb. The global limb has a semi-major axis of 412 $\pm$ 10 km, a semi-minor axis of 385 $\pm$ 17 km, and the position angle of the minor axis is 121 $^\circ$ $\pm$ 16$^\circ$. From this instantaneous limb, we obtained 2002 MS4's geometric albedo and the projected area-equivalent diameter. Significant deviations from the fitted ellipse in the northernmost limb are detected from multiple sites highlighting three distinct topographic features: one 11 km depth depression followed by a 25$^{+4}_{-5}$ km height elevation next to a crater-like depression with an extension of 322 $\pm$ 39 km and 45.1 $\pm$ 1.5 km deep. Our results present an object that is $\approx$138 km smaller in diameter than derived from thermal data, possibly indicating the presence of a so-far unknown satellite. However, within the error bars, the geometric albedo in the V-band agrees with the results published in the literature, even with the radiometric-derived albedo. △ Less

Submitted 23 August, 2023; v1 submitted 15 August, 2023; originally announced August 2023.

Journal ref: A&A 678, A167 (2023)

arXiv:2306.00332 [pdf, other]

doi 10.1103/PhysRevD.108.095014

Gravitational Wave Signatures of Gauged Baryon and Lepton Number

Authors: Jessica Bosch, Zoraida Delgado, Bartosz Fornal, Alejandra Leon

Abstract: We demonstrate that novel types of gravitational wave signatures arise in theories with new gauge symmetries broken at high energy scales. For concreteness, we focus on models with gauged baryon number and lepton number, in which neutrino masses are generated via the type I seesaw mechanism, leptogenesis occurs through the decay of a heavy right-handed neutrino, and one of the new baryonic fields… ▽ More We demonstrate that novel types of gravitational wave signatures arise in theories with new gauge symmetries broken at high energy scales. For concreteness, we focus on models with gauged baryon number and lepton number, in which neutrino masses are generated via the type I seesaw mechanism, leptogenesis occurs through the decay of a heavy right-handed neutrino, and one of the new baryonic fields is a good dark matter candidate. Depending on the scalar content of the theory, the gravitational wave spectrum consists of contributions from cosmic strings, domain walls, and first order phase transitions. We show that a characteristic double-peaked signal from domain walls or a sharp domain wall peak over a flat cosmic string background may be generated. Those new signatures are within the reach of future experiments, such as Cosmic Explorer, Einstein Telescope, DECIGO, Big Bang Observer, and LISA. △ Less

Submitted 1 June, 2023; originally announced June 2023.

Comments: 12 pages, 12 figures

Journal ref: Phys. Rev. D 108, 095014 (2023)

arXiv:2305.11225 [pdf, ps, other]

doi 10.3847/2041-8213/acd69f

Quasar Luminosity Function at z = 7

Authors: Yoshiki Matsuoka, Masafusa Onoue, Kazushi Iwasawa, Michael A. Strauss, Nobunari Kashikawa, Takuma Izumi, Tohru Nagao, Masatoshi Imanishi, Masayuki Akiyama, John D. Silverman, Naoko Asami, James Bosch, Hisanori Furusawa, Tomotsugu Goto, James E. Gunn, Yuichi Harikane, Hiroyuki Ikeda, Kohei Inayoshi, Rikako Ishimoto, Toshihiro Kawaguchi, Satoshi Kikuta, Kotaro Kohno, Yutaka Komiyama, Chien-Hsiu Lee, Robert H. Lupton , et al. (19 additional authors not shown)

Abstract: We present the quasar luminosity function (LF) at $z = 7$, measured with 35 spectroscopically confirmed quasars at $6.55 < z < 7.15$. The sample of 22 quasars from the Subaru High-$z$ Exploration of Low-Luminosity Quasars (SHELLQs) project, combined with 13 brighter quasars in the literature, covers an unprecedentedly wide range of rest-frame ultraviolet magnitudes over $-28 < M_{1450} < -23$. We… ▽ More We present the quasar luminosity function (LF) at $z = 7$, measured with 35 spectroscopically confirmed quasars at $6.55 < z < 7.15$. The sample of 22 quasars from the Subaru High-$z$ Exploration of Low-Luminosity Quasars (SHELLQs) project, combined with 13 brighter quasars in the literature, covers an unprecedentedly wide range of rest-frame ultraviolet magnitudes over $-28 < M_{1450} < -23$. We found that the binned LF flattens significantly toward the faint end populated by the SHELLQs quasars. A maximum likelihood fit to a double power-law model has a break magnitude $M^*_{1450} = -25.60^{+0.40}_{-0.30}$, a characteristic density $Φ^* = 1.35^{+0.47}_{-0.30}$ Gpc$^{-3}$ mag$^{-1}$, and a bright-end slope $β= -3.34^{+0.49}_{-0.57}$, when the faint-end slope is fixed to $α= -1.2$ as observed at $z \le 6$. The overall LF shape remains remarkably similar from $z = 4$ to $7$, while the amplitude decreases substantially toward higher redshifts, with a clear indication of an accelerating decline at $z \ge 6$. The estimated ionizing photon density, $10^{48.2 \pm 0.1}$ s$^{-1}$ Mpc$^{-3}$, is less than 1 % of the critical rate to keep the intergalactic medium ionized at $z = 7$, and thus indicates that quasars are not a major contributor to cosmic reionization. △ Less

Submitted 18 May, 2023; originally announced May 2023.

Comments: The Astrophysical Journal Letters, in press

arXiv:2305.08601 [pdf, other]

DevServOps: DevOps For Product-Oriented Product-Service Systems

Authors: Anas Dakkak, Jan Bosch, Helena Holmström Olsson

Abstract: For companies developing web-based applications, the Dev and the Ops refer to different groups with either operational or development focus. Therefore, DevOps help these companies streamline software development and operations activities by emphasizing the collaboration between the two groups. However, for companies producing software-intensive products, the Ops would refer to customers who use an… ▽ More For companies developing web-based applications, the Dev and the Ops refer to different groups with either operational or development focus. Therefore, DevOps help these companies streamline software development and operations activities by emphasizing the collaboration between the two groups. However, for companies producing software-intensive products, the Ops would refer to customers who use and operate the product. In addition, companies producing software-intensive products do not only offer products to customers but rather Product Service Systems (PSS), where product-related services play a key role in ensuring customer satisfaction besides their significant revenue contribution. Thus, the context of product-oriented PSS is very different from web-based applications, making it difficult to apply DevOps without considering the role of the services. Therefore, based on a two years participant observation case study conducted at a multinational telecommunications systems provider, we propose a new and novel approach called Development-Services-Operations (DevServOps) which incorporates services as a key player facilitating an end-to-end software flow toward customers in one direction and feedback toward developers in the other direction. Services become the glue that connects the Dev and the Ops, achieved by providing internal services to increase the precision of the development organization and external services to increase the speed of deployment and new content adoption on the customers' side. △ Less

Submitted 15 May, 2023; originally announced May 2023.

arXiv:2304.00703 [pdf, other]

Hyper Suprime-Cam Year 3 Results: Measurements of Clustering of SDSS-BOSS Galaxies, Galaxy-Galaxy Lensing and Cosmic Shear

Authors: Surhud More, Sunao Sugiyama, Hironao Miyatake, Markus Michael Rau, Masato Shirasaki, Xiangchong Li, Atsushi J. Nishizawa, Ken Osato, Tianqing Zhang, Masahiro Takada, Takashi Hamana, Ryuichi Takahashi, Roohi Dalal, Rachel Mandelbaum, Michael A. Strauss, Yosuke Kobayashi, Takahiro Nishimichi, Masamune Oguri, Wentao Luo, Arun Kannawadi, Bau-Ching Hsieh, Robert Armstrong, James Bosch, Yutaka Komiyama, Robert H. Lupton , et al. (9 additional authors not shown)

Abstract: We use the Sloan Digital Sky Survey (SDSS) BOSS galaxies and their overlap with approximately 416 sq. degree of deep $grizy$-band imaging from the Subaru Hyper Suprime-Cam Survey (HSC). We measure three two-point correlations that form the basis of the cosmological inference presented in our companion papers, Miyatake et al. and Sugiyama et al. We use three approximately volume limited subsamples… ▽ More We use the Sloan Digital Sky Survey (SDSS) BOSS galaxies and their overlap with approximately 416 sq. degree of deep $grizy$-band imaging from the Subaru Hyper Suprime-Cam Survey (HSC). We measure three two-point correlations that form the basis of the cosmological inference presented in our companion papers, Miyatake et al. and Sugiyama et al. We use three approximately volume limited subsamples of spectroscopic galaxies by their $i$-band magnitude from the SDSS-BOSS: LOWZ (0.1<z<0.35), CMASS1 (0.43<z<0.55) and CMASS2 (0.55<z<0.7), respectively. We present high signal-to-noise ratio measurements of the projected correlation functions of these galaxies, which is expected to be proportional to the matter correlation function times the bias of galaxies on large scales. In order to break the degeneracy between the amplitude of the matter correlation and the bias of these galaxies, we use the distortions of the shapes of galaxies in HSC due to weak gravitational lensing, to measure the galaxy-galaxy lensing signal, which probes the galaxy-matter cross-correlation of the SDSS-BOSS galaxies. We also measure the cosmic shear correlation functions from HSC galaxies which is related to the projected matter correlation function. We demonstrate the robustness of our measurements with a variety of systematic tests. Our use of a single sample of HSC source galaxies is crucial to calibrate any residual systematic biases in the inferred redshifts of our galaxies. We also describe the construction of a suite of mocks: i) spectroscopic galaxy catalogs which obey the clustering and abundance of each of the three SDSS-BOSS subsamples, and ii) galaxy shape catalogs which obey the footprint of the HSC survey and have been appropriately sheared by the large-scale structure expected in a $Λ$-CDM model. We use these mock catalogs to compute the covariance of each of our observables. △ Less

Submitted 16 November, 2023; v1 submitted 2 April, 2023; originally announced April 2023.

Comments: 31 pages, 24 figures, version accepted for publication in PRD together with other HSC Y3 weak lensing cosmology papers - see https://hsc-release.mtk.nao.ac.jp/doc/index.php/wly3/

arXiv:2304.00702 [pdf, other]

Hyper Suprime-Cam Year 3 Results: Cosmology from Cosmic Shear Two-point Correlation Functions

Authors: Xiangchong Li, Tianqing Zhang, Sunao Sugiyama, Roohi Dalal, Ryo Terasawa, Markus M. Rau, Rachel Mandelbaum, Masahiro Takada, Surhud More, Michael A. Strauss, Hironao Miyatake, Masato Shirasaki, Takashi Hamana, Masamune Oguri, Wentao Luo, Atsushi J. Nishizawa, Ryuichi Takahashi, Andrina Nicola, Ken Osato, Arun Kannawadi, Tomomi Sunayama, Robert Armstrong, James Bosch, Yutaka Komiyama, Robert H. Lupton , et al. (10 additional authors not shown)

Abstract: We perform a blinded cosmology analysis with cosmic shear two-point correlation functions (2PCFs) measured from more than 25 million galaxies in the Hyper Suprime-Cam three-year shear catalog in four tomographic redshift bins ranging from 0.3 to 1.5. After conservative masking and galaxy selection, the survey covers 416 deg$^2$ of the northern sky with an effective galaxy number density of 15 arcm… ▽ More We perform a blinded cosmology analysis with cosmic shear two-point correlation functions (2PCFs) measured from more than 25 million galaxies in the Hyper Suprime-Cam three-year shear catalog in four tomographic redshift bins ranging from 0.3 to 1.5. After conservative masking and galaxy selection, the survey covers 416 deg$^2$ of the northern sky with an effective galaxy number density of 15 arcmin$^{-2}$ over the four redshift bins. The 2PCFs adopted for cosmology analysis are measured in the angular range: $7.1 < θ/{\rm arcmin} < 56.6$ for $ξ_+$ and $31.2 <θ/{\rm arcmin} < 248$ for $ξ_-$, with a total signal-to-noise ratio of 26.6. We apply a conservative, wide, flat prior on the photometric redshift errors on the last two tomographic bins, and the relative magnitudes of the cosmic shear amplitude across four redshift bins allow us to calibrate the photometric redshift errors. With this flat prior on redshift errors, we find $Ω_{\rm m}=0.256_{-0.044}^{+0.056}$ and $S_8\equiv σ_8 \sqrt{Ω_{\rm m}/0.3}=0.769_{-0.034}^{+0.031}$ (both 68\% CI) for a flat $Λ$ cold dark matter cosmology. We find, after unblinding, that our constraint on $S_8$ is consistent with the Fourier space cosmic shear and the 3$\times$2pt analyses on the same HSC dataset. We carefully study the potential systematics from astrophysical and systematic model uncertainties in our fiducial analysis using synthetic data, and report no biases (including projection bias in the posterior space) greater than $0.5σ$ in the estimation of $S_8$. Our analysis hints that the mean redshifts of the two highest tomographic bins are higher than initially estimated. In addition, a number of consistency tests are conducted to assess the robustness of our analysis. Comparing our result with Planck-2018 cosmic microwave background observations, we find a ~$2σ$ tension for the $Λ$CDM model. △ Less

Submitted 30 November, 2023; v1 submitted 2 April, 2023; originally announced April 2023.

Comments: 38 pages, 32 figures, 4 tables (PRD in press.)

arXiv:2304.00701 [pdf, other]

doi 10.1103/PhysRevD.108.123519

Hyper Suprime-Cam Year 3 Results: Cosmology from Cosmic Shear Power Spectra

Authors: Roohi Dalal, Xiangchong Li, Andrina Nicola, Joe Zuntz, Michael A. Strauss, Sunao Sugiyama, Tianqing Zhang, Markus M. Rau, Rachel Mandelbaum, Masahiro Takada, Surhud More, Hironao Miyatake, Arun Kannawadi, Masato Shirasaki, Takanori Taniguchi, Ryuichi Takahashi, Ken Osato, Takashi Hamana, Masamune Oguri, Atsushi J. Nishizawa, Andrés A. Plazas Malagón, Tomomi Sunayama, David Alonso, Anže Slosar, Robert Armstrong , et al. (13 additional authors not shown)

Abstract: We measure weak lensing cosmic shear power spectra from the three-year galaxy shear catalog of the Hyper Suprime-Cam (HSC) Subaru Strategic Program imaging survey. The shear catalog covers $416 \ \mathrm{deg}^2$ of the northern sky, with a mean $i$-band seeing of 0.59 arcsec and an effective galaxy number density of 15 $\mathrm{arcmin}^{-2}$ within our adopted redshift range. With an $i$-band magn… ▽ More We measure weak lensing cosmic shear power spectra from the three-year galaxy shear catalog of the Hyper Suprime-Cam (HSC) Subaru Strategic Program imaging survey. The shear catalog covers $416 \ \mathrm{deg}^2$ of the northern sky, with a mean $i$-band seeing of 0.59 arcsec and an effective galaxy number density of 15 $\mathrm{arcmin}^{-2}$ within our adopted redshift range. With an $i$-band magnitude limit of 24.5 mag, and four tomographic redshift bins spanning $0.3 \leq z_{\mathrm{ph}} \leq 1.5$ based on photometric redshifts, we obtain a high-significance measurement of the cosmic shear power spectra, with a signal-to-noise ratio of approximately 26.4 in the multipole range $300<\ell<1800$. The accuracy of our power spectrum measurement is tested against realistic mock shear catalogs, and we use these catalogs to get a reliable measurement of the covariance of the power spectrum measurements. We use a robust blinding procedure to avoid confirmation bias, and model various uncertainties and sources of bias in our analysis, including point spread function systematics, redshift distribution uncertainties, the intrinsic alignment of galaxies and the modeling of the matter power spectrum. For a flat $Λ$CDM model, we find $S_8 \equiv σ_8 (Ω_m/0.3)^{0.5} =0.776^{+0.032}_{-0.033}$, which is in excellent agreement with the constraints from the other HSC Year 3 cosmology analyses, as well as those from a number of other cosmic shear experiments. This result implies a $\sim$$2σ$-level tension with the Planck 2018 cosmology. We study the effect that various systematic errors and modeling choices could have on this value, and find that they can shift the best-fit value of $S_8$ by no more than $\sim$$0.5σ$, indicating that our result is robust to such systematics. △ Less

Submitted 4 April, 2023; v1 submitted 2 April, 2023; originally announced April 2023.

Comments: 35 pages, 18 figures, 6 tables, for coordinated submission to PRD with other HSC Y3 weak lensing cosmology papers - see https://hsc-release.mtk.nao.ac.jp/doc/index.php/wly3/

Journal ref: Physical Review D, Volume 108, Issue 12, December 2023, article id.123519

arXiv:2303.03313 [pdf, other]

Data management and execution systems for the Rubin Observatory Science Pipelines

Authors: Nate B. Lust, Tim Jenness, James F. Bosch, Andrei Salnikov, Nathan M. Pease, Michelle Gower, Mikolaj Kowalik, Gregory P. Dubois-Felsmann, Fritz Mueller, Pim Schellart

Abstract: We present the Rubin Observatory system for data storage/retrieval and pipelined code execution. The layer for data storage and retrieval is named the Butler. It consists of a relational database, known as the registry, to keep track of metadata and relations, and a system to manage where the data is located, named the datastore. Together these systems create an abstraction layer that science algo… ▽ More We present the Rubin Observatory system for data storage/retrieval and pipelined code execution. The layer for data storage and retrieval is named the Butler. It consists of a relational database, known as the registry, to keep track of metadata and relations, and a system to manage where the data is located, named the datastore. Together these systems create an abstraction layer that science algorithms can be written against. This abstraction layer manages the complexities of the large data volumes expected and allows algorithms to be written independently, yet be tied together automatically into a coherent processing pipeline. This system consists of tools which execute these pipelines by transforming them into execution graphs which contain concrete data stored in the Butler. The pipeline infrastructure is designed to be scalable in nature, allowing execution on environments ranging from a laptop all the way up to multi-facility data centers. This presentation will focus on the data management aspects as well as an overview on the creation of pipelines and the corresponding execution graphs. △ Less

Submitted 6 March, 2023; originally announced March 2023.

Comments: 4 pages, submitted to Astronomical Data Analysis Software and Systems XXXII, October 2022

arXiv:2211.15795 [pdf, other]

Adding Workflow Management Flexibility to LSST Pipelines Execution

Authors: Michelle Gower, Mikolaj Kowalik, Nate B. Lust, James F. Bosch, Tim Jenness

Abstract: Data processing pipelines need to be executed at scales ranging from small runs up through large production data release runs resulting in millions of data products. As part of the Rubin Observatory's pipeline execution system, BPS is the abstraction layer that provides an interface to different Workflow Management Systems (WMS) such as HTCondor and PanDA. During the submission process, the pipeli… ▽ More Data processing pipelines need to be executed at scales ranging from small runs up through large production data release runs resulting in millions of data products. As part of the Rubin Observatory's pipeline execution system, BPS is the abstraction layer that provides an interface to different Workflow Management Systems (WMS) such as HTCondor and PanDA. During the submission process, the pipeline execution system interacts with the Data Butler to produce a science-oriented execution graph from algorithmic tasks. BPS converts this execution graph to a workflow graph and then uses a WMS-specific plugin to submit and manage the workflow. Here we will discuss the architectural design of this interface and report briefly on the recent production of the Data Preview 0.2 release and how the system is used by pipeline developers. △ Less

Submitted 28 November, 2022; originally announced November 2022.

Comments: 4 pages, submitted to Astronomical Data Analysis Software and Systems XXXII, October 2022

arXiv:2209.09253 [pdf, other]

doi 10.21105/astro.2209.09253

PSFs of coadded images

Authors: Rachel Mandelbaum, Mike Jarvis, Robert H. Lupton, James Bosch, Arun Kannawadi, Michael D. Murphy, Tianqing Zhang, the LSST Dark Energy Science Collaboration

Abstract: We provide a detailed exploration of the connection between choice of coaddition schemes and the point-spread function (PSF) of the resulting coadded images. In particular, we investigate what properties of the coaddition algorithm lead to the final coadded image having a well-defined PSF. The key elements of this discussion are as follows: 1. We provide an illustration of how linear coaddition… ▽ More We provide a detailed exploration of the connection between choice of coaddition schemes and the point-spread function (PSF) of the resulting coadded images. In particular, we investigate what properties of the coaddition algorithm lead to the final coadded image having a well-defined PSF. The key elements of this discussion are as follows: 1. We provide an illustration of how linear coaddition schemes can produce a coadd that lacks a well-defined PSF even for relatively simple scenarios and choices of weight functions. 2. We provide a more formal demonstration of the fact that a linear coadd only has a well-defined PSF in the case that either (a) each input image has the same PSF or (b) the coadd is produced with weights that are independent of the signal. 3. We discuss some reasons that two plausible nonlinear coaddition algorithms (median and clipped-mean) fail to produce a consistent PSF profile for stars. 4. We demonstrate that all nonlinear coaddition procedures fail to produce a well-defined PSF for extended objects. In the end, we conclude that, for any purpose where a well-defined PSF is desired, one should use a linear coaddition scheme with weights that do not correlate with the signal and are approximately uniform across typical objects of interest. △ Less

Submitted 2 February, 2023; v1 submitted 19 September, 2022; originally announced September 2022.

Comments: 13 pages, 4 figures; pedagogical article; v2 accepted for publication in the Open Journal of Astrophysics

Journal ref: The Open Journal of Astrophysics, Vol. 6, 2023

arXiv:2207.00222 [pdf, other]

Bayesian causal inference in automotive software engineering and online evaluation

Authors: Yuchu Liu, David Issa Mattos, Jan Bosch, Helena Holmström Olsson, Jonn Lantz

Abstract: Randomised field experiments, such as A/B testing, have long been the gold standard for evaluating software changes. In the automotive domain, running randomised field experiments is not always desired, possible, or even ethical. In the face of such limitations, we develop a framework BOAT (Bayesian causal modelling for ObvservAtional Testing), utilising observational studies in combination with B… ▽ More Randomised field experiments, such as A/B testing, have long been the gold standard for evaluating software changes. In the automotive domain, running randomised field experiments is not always desired, possible, or even ethical. In the face of such limitations, we develop a framework BOAT (Bayesian causal modelling for ObvservAtional Testing), utilising observational studies in combination with Bayesian causal inference, in order to understand real-world impacts from complex automotive software updates and help software development organisations arrive at causal conclusions. In this study, we present three causal inference models in the Bayesian framework and their corresponding cases to address three commonly experienced challenges of software evaluation in the automotive domain. We develop the BOAT framework with our industry collaborator, and demonstrate the potential of causal inference by conducting empirical studies on a large fleet of vehicles. Moreover, we relate the causal assumption theories to their implications in practise, aiming to provide a comprehensive guide on how to apply the causal models in automotive software engineering. We apply Bayesian propensity score matching for producing balanced control and treatment groups when we do not have access to the entire user base, Bayesian regression discontinuity design for identifying covariate dependent treatment assignments and the local treatment effect, and Bayesian difference-in-differences for causal inference of treatment effect overtime and implicitly control unobserved confounding factors. Each one of the demonstrative case has its grounds in practise, and is a scenario experienced when randomisation is not feasible. With the BOAT framework, we enable online software evaluation in the automotive domain without the need of a fully randomised experiment. △ Less

Submitted 1 July, 2022; originally announced July 2022.

Comments: In submission

arXiv:2206.14941 [pdf, other]

The Vera C. Rubin Observatory Data Butler and Pipeline Execution System

Authors: Tim Jenness, James F. Bosch, Nate B. Lust, Nathan M. Pease, Michelle Gower, Mikolaj Kowalik, Gregory P. Dubois-Felsmann, Fritz Mueller, Pim Schellart

Abstract: The Rubin Observatory's Data Butler is designed to allow data file location and file formats to be abstracted away from the people writing the science pipeline algorithms. The Butler works in conjunction with the workflow graph builder to allow pipelines to be constructed from the algorithmic tasks. These pipelines can be executed at scale using object stores and multi-node clusters, or on a lapto… ▽ More The Rubin Observatory's Data Butler is designed to allow data file location and file formats to be abstracted away from the people writing the science pipeline algorithms. The Butler works in conjunction with the workflow graph builder to allow pipelines to be constructed from the algorithmic tasks. These pipelines can be executed at scale using object stores and multi-node clusters, or on a laptop using a local file system. The Butler and pipeline system are now in daily use during Rubin construction and early operations. △ Less

Submitted 29 June, 2022; originally announced June 2022.

Comments: 14 pages, 3 figures, submitted to Proc SPIE 12189, "Software and Cyberinfrastructure for Astronomy VII", Montreal, CA, July 2022

arXiv:2203.09893 [pdf, other]

A Lightweight Instrument-Agnostic Model for Polyphonic Note Transcription and Multipitch Estimation

Authors: Rachel M. Bittner, Juan José Bosch, David Rubinstein, Gabriel Meseguer-Brocal, Sebastian Ewert

Abstract: Automatic Music Transcription (AMT) has been recognized as a key enabling technology with a wide range of applications. Given the task's complexity, best results have typically been reported for systems focusing on specific settings, e.g. instrument-specific systems tend to yield improved results over instrument-agnostic methods. Similarly, higher accuracy can be obtained when only estimating fram… ▽ More Automatic Music Transcription (AMT) has been recognized as a key enabling technology with a wide range of applications. Given the task's complexity, best results have typically been reported for systems focusing on specific settings, e.g. instrument-specific systems tend to yield improved results over instrument-agnostic methods. Similarly, higher accuracy can be obtained when only estimating frame-wise $f_0$ values and neglecting the harder note event detection. Despite their high accuracy, such specialized systems often cannot be deployed in the real-world. Storage and network constraints prohibit the use of multiple specialized models, while memory and run-time constraints limit their complexity. In this paper, we propose a lightweight neural network for musical instrument transcription, which supports polyphonic outputs and generalizes to a wide variety of instruments (including vocals). Our model is trained to jointly predict frame-wise onsets, multipitch and note activations, and we experimentally show that this multi-output structure improves the resulting frame-level note accuracy. Despite its simplicity, benchmark results show our system's note estimation to be substantially better than a comparable baseline, and its frame-level accuracy to be only marginally below those of specialized state-of-the-art AMT systems. With this work we hope to encourage the community to further investigate low-resource, instrument-agnostic AMT systems. △ Less

Submitted 12 May, 2022; v1 submitted 18 March, 2022; originally announced March 2022.

arXiv:2202.03014 [pdf]

doi 10.1148/radiol.2021210454

US Velocimetry in Participants with Aortoiliac Occlusive Disease

Authors: Stefan Engelhard, Majorie van Helvert, Jason Voorneveld, Johan G Bosch, Guillaume PR Lajoinie, Michel Versluis, Erik Groot Jebbink, Michel MPJ Reijnen

Abstract: The accurate quantification of blood flow in aortoiliac arteries is challenging but clinically relevant because local flow patterns can influence atherosclerotic disease. To investigate the feasibility and clinical application of two-dimensional blood flow quantification using high-frame-rate contrast-enhanced US (HFR-CEUS) and particle image velocimetry (PIV), or US velocimetry, in participants w… ▽ More The accurate quantification of blood flow in aortoiliac arteries is challenging but clinically relevant because local flow patterns can influence atherosclerotic disease. To investigate the feasibility and clinical application of two-dimensional blood flow quantification using high-frame-rate contrast-enhanced US (HFR-CEUS) and particle image velocimetry (PIV), or US velocimetry, in participants with aortoiliac stenosis. In this prospective study, participants with a recently diagnosed aortoiliac stenosis underwent HFR-CEUS measurements of the pre- and poststenotic vessel segments. Two-dimensional quantification of blood flow was achieved by performing PIV analysis, which was based on pairwise cross-correlation of the HFR-CEUS images. Visual inspection of the entire data set was performed by five observers to evaluate the ability of the technique to enable adequate visualization of blood flow. The contrast-to-background ratio and average vector correlation were calculated. In two participants who showed flow disturbances, the flow complexity and vorticity were calculated. Results: 35 participants were included. Visual scoring showed that flow quantification was achieved in 41 of 42 locations. In 25 locations, one or multiple issues occurred that limited optimal flow quantification, including loss of correlation during systole, shadow regions, a short vessel segment in the image plane, and loss of contrast during diastole. In the remaining 16 locations, optimal quantification was achieved. The contrast-to-background ratio was higher during systole than during diastole, whereas the vector correlation was lower. Flow complexity and vorticity were high in regions with disturbed flow. Blood flow quantification with US velocimetry is feasible in patients with an aortoiliac stenosis, but several challenges must be overcome before implementation into clinical practice. △ Less

Submitted 7 February, 2022; originally announced February 2022.

Journal ref: Radiology 301(2), 332-338 (2021)

arXiv:2202.02006 [pdf, other]

5G Network on Wings: A Deep Reinforcement Learning Approach to the UAV-based Integrated Access and Backhaul

Authors: Hongyi Zhang, Zhiqiang Qi, Jingya Li, Anders Aronsson, Jan Bosch, Helena Holmström Olsson

Abstract: Fast and reliable wireless communication has become a critical demand in human life. In the case of mission-critical (MC) scenarios, for instance, when natural disasters strike, providing ubiquitous connectivity becomes challenging by using traditional wireless networks. In this context, unmanned aerial vehicle (UAV) based aerial networks offer a promising alternative for fast, flexible, and relia… ▽ More Fast and reliable wireless communication has become a critical demand in human life. In the case of mission-critical (MC) scenarios, for instance, when natural disasters strike, providing ubiquitous connectivity becomes challenging by using traditional wireless networks. In this context, unmanned aerial vehicle (UAV) based aerial networks offer a promising alternative for fast, flexible, and reliable wireless communications. Due to unique characteristics such as mobility, flexible deployment, and rapid reconfiguration, drones can readily change location dynamically to provide on-demand communications to users on the ground in emergency scenarios. As a result, the usage of UAV base stations (UAV-BSs) has been considered an appropriate approach for providing rapid connection in MC scenarios. In this paper, we study how to control multiple UAV-BSs in both static and dynamic environments. We use a system-level simulator to model an MC scenario in which a macro BS of a cellular network is out of service and multiple UAV-BSs are deployed using integrated access and backhaul (IAB) technology to provide coverage for users in the disaster area. With the data collected from the system-level simulation, a deep reinforcement learning algorithm is developed to jointly optimize the three-dimensional placement of these multiple UAV-BSs, which adapt their 3-D locations to the on-ground user movement. The evaluation results show that the proposed algorithm can support the autonomous navigation of the UAV-BSs to meet the MC service requirements in terms of user throughput and drop rate. △ Less

Submitted 26 May, 2023; v1 submitted 4 February, 2022; originally announced February 2022.

arXiv:2201.03862 [pdf, other]

doi 10.5281/zenodo.7195671

Rubin-Euclid Derived Data Products: Initial Recommendations

Authors: Leanne P. Guy, Jean-Charles Cuillandre, Etienne Bachelet, Manda Banerji, Franz E. Bauer, Thomas Collett, Christopher J. Conselice, Siegfried Eggl, Annette Ferguson, Adriano Fontana, Catherine Heymans, Isobel M. Hook, Éric Aubourg, Hervé Aussel, James Bosch, Benoit Carry, Henk Hoekstra, Konrad Kuijken, Francois Lanusse, Peter Melchior, Joseph Mohr, Michele Moresco, Reiko Nakajima, Stéphane Paltani, Michael Troxel , et al. (95 additional authors not shown)

Abstract: This report is the result of a joint discussion between the Rubin and Euclid scientific communities. The work presented in this report was focused on designing and recommending an initial set of Derived Data products (DDPs) that could realize the science goals enabled by joint processing. All interested Rubin and Euclid data rights holders were invited to contribute via an online discussion forum… ▽ More This report is the result of a joint discussion between the Rubin and Euclid scientific communities. The work presented in this report was focused on designing and recommending an initial set of Derived Data products (DDPs) that could realize the science goals enabled by joint processing. All interested Rubin and Euclid data rights holders were invited to contribute via an online discussion forum and a series of virtual meetings. Strong interest in enhancing science with joint DDPs emerged from across a wide range of astrophysical domains: Solar System, the Galaxy, the Local Volume, from the nearby to the primaeval Universe, and cosmology. △ Less

Submitted 13 October, 2022; v1 submitted 11 January, 2022; originally announced January 2022.

Comments: Report of the Rubin-Euclid Derived Data Products Working Group, 78 pages, 11 figures

arXiv:2112.10808 [pdf, other]

Radial Distribution of the Dust Comae of Comets 45P/Honda-Mrkos-Pajduusáková and 46P/Wirtanen

Authors: C. Lejoly, W. Harris, N. Samarasinha, B. E. A. Mueller, E. Howell, J. Bodnarik, A. Springmann, T. Kareta, B. Sharkey, J. Noonan, L. R. Bedin, J. -G. Bosch, A. Brosio, E. Bryssinck, J. -B. de Vanssay, F. -J. Hambsch, O. Ivanova, V. Krushinsky, Z. -Y. Lin, F. Manzini, A. Maury, N. Moriya, P. Ochner, V. Oldani

Abstract: There was an unprecedented opportunity to study the inner dust coma environment, where the dust and gas are not entirely decoupled, of comets 45P/Honda-Mrkos-Pajduusáková (45P/HMP) from Dec. 26, 2016 - Mar. 15, 2017, and 46P/Wirtanen from Nov. 10, 2018 - Feb. 13, 2019, both in visible wavelengths. The radial profile slopes of these comets were measured in the R and HB-BC filters most representativ… ▽ More There was an unprecedented opportunity to study the inner dust coma environment, where the dust and gas are not entirely decoupled, of comets 45P/Honda-Mrkos-Pajduusáková (45P/HMP) from Dec. 26, 2016 - Mar. 15, 2017, and 46P/Wirtanen from Nov. 10, 2018 - Feb. 13, 2019, both in visible wavelengths. The radial profile slopes of these comets were measured in the R and HB-BC filters most representative of dust, and deviations from a radially expanding coma were identified as significant. The azimuthally averaged radial profile slope of comet 45P/HMP gradually changes from -1.81 $\pm$ 0.20 at 5.24 days pre-perihelion to -0.35 $\pm$ 0.16 at 74.41 days post perihelion. Contrastingly, the radial profile slope of 46P/Wirtanen stays fairly constant over the observed time period at -1.05 $\pm$ 0.05. Additionally, we find that the radial profile of 46P/Wirtanen is azimuthally dependent on the skyplane-projected solar position angle, while that of 45P/HMP is not. These results suggest that comet 45P/HMP and 46P/Wirtanen have vastly different coma dust environments and that their dust properties are distinct. As evident from these two comets, well-resolved inner comae are vital for detailed characterization of dust environments. △ Less

Submitted 20 December, 2021; originally announced December 2021.

Comments: 21 pages, 13 figures, to be published in the Planetary Science Journal

arXiv:2112.07313 [pdf, other]

Autonomous Navigation and Configuration of Integrated Access Backhauling for UAV Base Station Using Reinforcement Learning

Authors: Hongyi Zhang, Jingya Li, Zhiqiang Qi, Xingqin Lin, Anders Aronsson, Jan Bosch, Helena Holmström Olsson

Abstract: Fast and reliable connectivity is essential to enhancing situational awareness and operational efficiency for public safety mission-critical (MC) users. In emergency or disaster circumstances, where existing cellular network coverage and capacity may not be available to meet MC communication demands, deployable-network-based solutions such as cells-on-wheels/wings can be utilized swiftly to ensure… ▽ More Fast and reliable connectivity is essential to enhancing situational awareness and operational efficiency for public safety mission-critical (MC) users. In emergency or disaster circumstances, where existing cellular network coverage and capacity may not be available to meet MC communication demands, deployable-network-based solutions such as cells-on-wheels/wings can be utilized swiftly to ensure reliable connection for MC users. In this paper, we consider a scenario where a macro base station (BS) is destroyed due to a natural disaster and an unmanned aerial vehicle carrying BS (UAV-BS) is set up to provide temporary coverage for users in the disaster area. The UAV-BS is integrated into the mobile network using the 5G integrated access and backhaul (IAB) technology. We propose a framework and signalling procedure for applying machine learning to this use case. A deep reinforcement learning algorithm is designed to jointly optimize the access and backhaul antenna tilt as well as the three-dimensional location of the UAV-BS in order to best serve the on-ground MC users while maintaining a good backhaul connection. Our result shows that the proposed algorithm can autonomously navigate and configure the UAV-BS to improve the throughput and reduce the drop rate of MC users. △ Less

Submitted 13 May, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

arXiv:2111.12766 [pdf, ps, other]

doi 10.3847/1538-4365/ac3d31

Subaru High-z Exploration of Low-Luminosity Quasars (SHELLQs). XVI. 69 New Quasars at 5.8 < z < 7.0

Authors: Yoshiki Matsuoka, Kazushi Iwasawa, Masafusa Onoue, Takuma Izumi, Nobunari Kashikawa, Michael A. Strauss, Masatoshi Imanishi, Tohru Nagao, Masayuki Akiyama, John D. Silverman, Naoko Asami, James Bosch, Hisanori Furusawa, Tomotsugu Goto, James E. Gunn, Yuichi Harikane, Hiroyuki Ikeda, Rikako Ishimoto, Toshihiro Kawaguchi, Nanako Kato, Satoshi Kikuta, Kotaro Kohno, Yutaka Komiyama, Chien-Hsiu Lee, Robert H. Lupton , et al. (19 additional authors not shown)

Abstract: We present the spectroscopic discovery of 69 quasars at 5.8 < z < 7.0, drawn from the Hyper Suprime-Cam (HSC) Subaru Strategic Program (SSP) imaging survey data. This is the 16th publication from the Subaru High-z Exploration of Low-Luminosity Quasars (SHELLQs) project, and completes identification of all but the faintest candidates (i.e., i-band dropouts with zAB < 24 and y-band detections, and z… ▽ More We present the spectroscopic discovery of 69 quasars at 5.8 < z < 7.0, drawn from the Hyper Suprime-Cam (HSC) Subaru Strategic Program (SSP) imaging survey data. This is the 16th publication from the Subaru High-z Exploration of Low-Luminosity Quasars (SHELLQs) project, and completes identification of all but the faintest candidates (i.e., i-band dropouts with zAB < 24 and y-band detections, and z-band dropouts with yAB < 24) with Bayesian quasar probability Pq > 0.1 in the HSC-SSP third public data release (PDR3). The sample reported here also includes three quasars with Pq < 0.1 at z ~ 6.6, which we selected in an effort to completely cover the reddest point sources with simple color cuts. The number of high-z quasars discovered in SHELLQs has now grown to 162, including 23 type-II quasar candidates. This paper also presents identification of seven galaxies at 5.6 < z < 6.7, an [O III] emitter at z = 0.954, and 31 Galactic cool stars and brown dwarfs. High-z quasars and galaxies comprise 75 % and 16 % respectively of all the spectroscopic SHELLQs objects that pass our latest selection algorithm with the PDR3 photometry. That is, a total of 91 % of the objects lie at z > 5.6. This demonstrates that the algorithm has very high efficiency, even though we are probing an unprecedentedly low-luminosity population down to M1450 ~ -21 mag. △ Less

Submitted 24 November, 2021; originally announced November 2021.

Comments: Accepted for publication in The Astrophysical Journal Supplement Series

arXiv:2110.12915 [pdf]

Revealing unforeseen diagnostic image features with deep learning by detecting cardiovascular diseases from apical four-chamber ultrasounds

Authors: Li-Hsin Cheng, Pablo B. J. Bosch, Rutger F. H. Hofman, Timo B. Brakenhoff, Eline F. Bruggemans, Rob J. van der Geest, Eduard R. Holman

Abstract: Background. With the rise of highly portable, wireless, and low-cost ultrasound devices and automatic ultrasound acquisition techniques, an automated interpretation method requiring only a limited set of views as input could make preliminary cardiovascular disease diagnoses more accessible. In this study, we developed a deep learning (DL) method for automated detection of impaired left ventricular… ▽ More Background. With the rise of highly portable, wireless, and low-cost ultrasound devices and automatic ultrasound acquisition techniques, an automated interpretation method requiring only a limited set of views as input could make preliminary cardiovascular disease diagnoses more accessible. In this study, we developed a deep learning (DL) method for automated detection of impaired left ventricular (LV) function and aortic valve (AV) regurgitation from apical four-chamber (A4C) ultrasound cineloops and investigated which anatomical structures or temporal frames provided the most relevant information for the DL model to enable disease classification. Methods and Results. A4C ultrasounds were extracted from 3,554 echocardiograms of patients with either impaired LV function (n=928), AV regurgitation (n=738), or no significant abnormalities (n=1,888). Two convolutional neural networks (CNNs) were trained separately to classify the respective disease cases against normal cases. The overall classification accuracy of the impaired LV function detection model was 86%, and that of the AV regurgitation detection model was 83%. Feature importance analyses demonstrated that the LV myocardium and mitral valve were important for detecting impaired LV function, while the tip of the mitral valve anterior leaflet, during opening, was considered important for detecting AV regurgitation. Conclusion. The proposed method demonstrated the feasibility of a 3D CNN approach in detection of impaired LV function and AV regurgitation using A4C ultrasound cineloops. The current research shows that DL methods can exploit large training data to detect diseases in a different way than conventionally agreed upon methods, and potentially reveal unforeseen diagnostic image features. △ Less

Submitted 25 October, 2021; originally announced October 2021.

arXiv:2110.05580 [pdf, other]

vocadito: A dataset of solo vocals with $f_0$, note, and lyric annotations

Authors: Rachel M. Bittner, Katherine Pasalo, Juan José Bosch, Gabriel Meseguer-Brocal, David Rubinstein

Abstract: To compliment the existing set of datasets, we present a small dataset entitled vocadito, consisting of 40 short excerpts of monophonic singing, sung in 7 different languages by singers with varying of levels of training, and recorded on a variety of devices. We provide several types of annotations, including $f_0$, lyrics, and two different note annotations. All annotations were created by musici… ▽ More To compliment the existing set of datasets, we present a small dataset entitled vocadito, consisting of 40 short excerpts of monophonic singing, sung in 7 different languages by singers with varying of levels of training, and recorded on a variety of devices. We provide several types of annotations, including $f_0$, lyrics, and two different note annotations. All annotations were created by musicians. We provide an analysis of the differences between the two note annotations, and see that the agreement level is low, which has implications for evaluating vocal note estimation algorithms. We also analyze the relation between the $f_0$ and note annotations, and show that quantizing $f_0$ values in frequency does not provide a reasonable note estimate, reinforcing the difficulty of the note estimation task for singing voice. Finally, we provide baseline results from recent algorithms on vocadito for note and $f_0$ transcription. Vocadito is made freely available for public use. △ Less

Submitted 29 October, 2021; v1 submitted 11 October, 2021; originally announced October 2021.

arXiv:2109.12563 [pdf, other]

doi 10.1109/APSEC53868.2021.00031

Bayesian propensity score matching in automotive embedded software engineering

Authors: Yuchu Liu, David Issa Mattos, Jan Bosch, Helena Holmström Olsson, Jonn Lantz

Abstract: Randomised field experiments, such as A/B testing, have long been the gold standard for evaluating the value that new software brings to customers. However, running randomised field experiments is not always desired, possible or even ethical in the development of automotive embedded software. In the face of such restrictions, we propose the use of the Bayesian propensity score matching technique f… ▽ More Randomised field experiments, such as A/B testing, have long been the gold standard for evaluating the value that new software brings to customers. However, running randomised field experiments is not always desired, possible or even ethical in the development of automotive embedded software. In the face of such restrictions, we propose the use of the Bayesian propensity score matching technique for causal inference of observational studies in the automotive domain. In this paper, we present a method based on the Bayesian propensity score matching framework, applied in the unique setting of automotive software engineering. This method is used to generate balanced control and treatment groups from an observational online evaluation and estimate causal treatment effects from the software changes, even with limited samples in the treatment group. We exemplify the method with a proof-of-concept in the automotive domain. In the example, we have a larger control ($N_c=1100$) fleet of cars using the current software and a small treatment fleet ($N_t=38$), in which we introduce a new software variant. We demonstrate a scenario that shipping of a new software to all users is restricted, as a result, a fully randomised experiment could not be conducted. Therefore, we utilised the Bayesian propensity score matching method with 14 observed covariates as inputs. The results show more balanced groups, suitable for estimating causal treatment effects from the collected observational data. We describe the method in detail and share our configuration. Furthermore, we discuss how can such a method be used for online evaluation of new software utilising small groups of samples. △ Less

Submitted 26 September, 2021; originally announced September 2021.

Comments: To appear at the 28th Asia-Pacific Software Engineering Conference (APSEC 2021)

arXiv:2109.00463 [pdf, ps, other]

doi 10.1051/0004-6361/202140991

Properties of slowly rotating asteroids from the Convex Inversion Thermophysical Model

Authors: A. Marciniak, J. Ďurech, V. Alí-Lagoa, W. Ogłoza, R. Szakáts, T. G. Müller, L. Molnár, A. Pál, F. Monteiro, P. Arcoverde, R. Behrend, Z. Benkhaldoun, L. Bernasconi, J. Bosch, S. Brincat, L. Brunetto, M. Butkiewicz - Bąk, F. Del Freo, R. Duffard, M. Evangelista-Santana, G. Farroni, S. Fauvaud, M. Fauvaud, M. Ferrais, S. Geier , et al. (51 additional authors not shown)

Abstract: Results from the TESS mission showed that previous studies strngly underestimated the number of slow rotators, revealing the importance of studying those asteroids. For most slowly rotating asteroids (P > 12), no spin and shape model is available because of observation selection effects. This hampers determination of their thermal parameters and accurate sizes. We continue our campaign in minimi… ▽ More Results from the TESS mission showed that previous studies strngly underestimated the number of slow rotators, revealing the importance of studying those asteroids. For most slowly rotating asteroids (P > 12), no spin and shape model is available because of observation selection effects. This hampers determination of their thermal parameters and accurate sizes. We continue our campaign in minimising selection effects among main belt asteroids. Our targets are slow rotators with low light-curve amplitudes. The goal is to provide their scaled spin and shape models together with thermal inertia, albedo, and surface roughness to complete the statistics. Rich multi-apparition datasets of dense light curves are supplemented with data from Kepler and TESS. In addition to data in the visible range, we also use thermal data from infrared space observatories (IRAS, Akari and WISE) in a combined optimisation process using the Convex Inversion Thermophysical Model (CITPM). This novel method has so far been applied to only a few targets, and in this work we further validate the method. We present the models of 16 slow rotators. All provide good fits to both thermal and visible data. The obtained sizes are on average accurate at the 5% precision, with diameters in the range from 25 to 145 km. The rotation periods of our targets range from 11 to 59 hours, and the thermal inertia covers a wide range of values, from 2 to <400 SI units, not showing any correlation with the period. With this work we increase the sample of slow rotators with reliable spin and shape models and known thermal inertia by 40%. The thermal inertia values of our sample do not display a previously suggested increasing trend with rotation period, which might be due to their small skin depth. △ Less

Submitted 1 September, 2021; originally announced September 2021.

Comments: Accepted to Astronomy & Astrophysics. 10 pages + appendices

Journal ref: A&A 654, A87 (2021)

arXiv:2108.13045 [pdf, ps, other]

doi 10.1093/pasj/psab122

Third Data Release of the Hyper Suprime-Cam Subaru Strategic Program

Authors: Hiroaki Aihara, Yusra AlSayyad, Makoto Ando, Robert Armstrong, James Bosch, Eiichi Egami, Hisanori Furusawa, Junko Furusawa, Sumiko Harasawa, Yuichi Harikane, Bau-Ching Hsieh, Hiroyuki Ikeda, Kei Ito, Ikuru Iwata, Tadayuki Kodama, Michitaro Koike, Mitsuru Kokubo, Yutaka Komiyama, Xiangchong Li, Yongming Liang, Yen-Ting Lin, Robert H. Lupton, Nate B Lust, Lauren A. MacArthur, Ken Mawatari , et al. (42 additional authors not shown)

Abstract: The paper presents the third data release of Hyper Suprime-Cam Subaru Strategic Program (HSC-SSP), a wide-field multi-band imaging survey with the Subaru 8.2m telescope. HSC-SSP has three survey layers (Wide, Deep, and UltraDeep) with different area coverages and depths, designed to address a wide array of astrophysical questions. This third release from HSC-SSP includes data from 278 nights of ob… ▽ More The paper presents the third data release of Hyper Suprime-Cam Subaru Strategic Program (HSC-SSP), a wide-field multi-band imaging survey with the Subaru 8.2m telescope. HSC-SSP has three survey layers (Wide, Deep, and UltraDeep) with different area coverages and depths, designed to address a wide array of astrophysical questions. This third release from HSC-SSP includes data from 278 nights of observing time and covers about 670 square degrees in all five broad-band filters at the full depth ($\sim26$~mag at $5σ$) in the Wide layer. If we include partially observed area, the release covers 1,470 square degrees. The Deep and UltraDeep layers have $\sim80\%$ of the originally planned integration times, and are considered done, as we have slightly changed the observing strategy in order to compensate for various time losses. There are a number of updates in the image processing pipeline. Of particular importance is the change in the sky subtraction algorithm; we subtract the sky on small scales before the detection and measurement stages, which has significantly reduced false detections. Thanks to this and other updates, the overall quality of the processed data has improved since the previous release. However, there are limitations in the data (for example, the pipeline is not optimized for crowded fields), and we encourage the user to check the quality assurance plots as well as a list of known issues before exploiting the data. The data release website is https://hsc-release.mtk.nao.ac.jp/. △ Less

Submitted 30 August, 2021; originally announced August 2021.

Comments: 25 pages, 19 figures, submitted to PASJ. Data available at https://hsc-release.mtk.nao.ac.jp/

arXiv:2107.14782 [pdf, other]

Causal mediation analysis with mediator values below an assay limit

Authors: Ariel Chernofsky, Ronald J. Bosch, Judith J. Lok

Abstract: Causal indirect and direct effects provide an interpretable method for decomposing the total effect of an exposure on an outcome into the effect through a mediator and the effect through all other pathways. When the mediator is a biomarker, values can be subject to an assay lower limit. The mediator is affected by the treatment and is a putative cause of the outcome, so the assay lower limit prese… ▽ More Causal indirect and direct effects provide an interpretable method for decomposing the total effect of an exposure on an outcome into the effect through a mediator and the effect through all other pathways. When the mediator is a biomarker, values can be subject to an assay lower limit. The mediator is affected by the treatment and is a putative cause of the outcome, so the assay lower limit presents a compounded problem in mediation analysis. We propose three approaches to estimate indirect and direct effects with a mediator subject to an assay limit: 1. extrapolation 2. numerical optimization and integration of the observed likelihood and 3. the Monte Carlo Expectation Maximization (MCEM) algorithm. Since the described methods solely rely on the so-called Mediation Formula, they apply to most approaches to causal mediation analysis: natural, separable, and organic indirect and direct effects. A simulation study compares the estimation approaches to imputing with half the assay limit. Using HIV interruption study data from the AIDS Clinical Trials Group described in [Li et al. 2016, AIDS; Lok \& Bosch 2021, Epidemiology], we illustrate our methods by estimating the organic/pure indirect effect of a hypothetical HIV curative treatment on viral suppression mediated by two HIV persistence measures: cell-associated HIV-RNA (N = 124) and single copy plasma HIV-RNA (N = 96). △ Less

Submitted 30 July, 2021; originally announced July 2021.

arXiv:2107.02471 [pdf, other]

An architecture for enabling A/B experiments in automotive embedded software

Authors: Yuchu Liu, Jan Bosch, Helena Holmström Olsson, Jonn Lantz

Abstract: A/B experimentation is a known technique for data-driven product development and has demonstrated its value in web-facing businesses. With the digitalisation of the automotive industry, the focus in the industry is shifting towards software. For automotive embedded software to continuously improve, A/B experimentation is considered an important technique. However, the adoption of such a technique… ▽ More A/B experimentation is a known technique for data-driven product development and has demonstrated its value in web-facing businesses. With the digitalisation of the automotive industry, the focus in the industry is shifting towards software. For automotive embedded software to continuously improve, A/B experimentation is considered an important technique. However, the adoption of such a technique is not without challenge. In this paper, we present an architecture to enable A/B testing in automotive embedded software. The design addresses challenges that are unique to the automotive industry in a systematic fashion. Going from hypothesis to practice, our architecture was also applied in practice for running online experiments on a considerable scale. Furthermore, a case study approach was used to compare our proposal with state-of-practice in the automotive industry. We found our architecture design to be relevant and applicable in the efforts of adopting continuous A/B experiments in automotive embedded software. △ Less

Submitted 6 July, 2021; originally announced July 2021.

Comments: To appear in the 45th Annual IEEE Conference on Computers, Software and Applications (COMPSAC'2021)

arXiv:2107.02461 [pdf, other]

doi 10.1109/SEAA53835.2021.00046

Size matters? Or not: A/B testing with limited sample in automotive embedded software

Authors: Yuchu Liu, David Issa Mattos, Jan Bosch, Helena Holmström Olsson, Jonn Lantz

Abstract: A/B testing is gaining attention in the automotive sector as a promising tool to measure causal effects from software changes. Different from the web-facing businesses, where A/B testing has been well-established, the automotive domain often suffers from limited eligible users to participate in online experiments. To address this shortcoming, we present a method for designing balanced control and… ▽ More A/B testing is gaining attention in the automotive sector as a promising tool to measure causal effects from software changes. Different from the web-facing businesses, where A/B testing has been well-established, the automotive domain often suffers from limited eligible users to participate in online experiments. To address this shortcoming, we present a method for designing balanced control and treatment groups so that sound conclusions can be drawn from experiments with considerably small sample sizes. While the Balance Match Weighted method has been used in other domains such as medicine, this is the first paper to apply and evaluate it in the context of software development. Furthermore, we describe the Balance Match Weighted method in detail and we conduct a case study together with an automotive manufacturer to apply the group design method in a fleet of vehicles. Finally, we present our case study in the automotive software engineering domain, as well as a discussion on the benefits and limitations of the A/B group design method. △ Less

Submitted 10 November, 2021; v1 submitted 6 July, 2021; originally announced July 2021.

Comments: In proceedings of the 2021 47th Euromicro Conference on Software Engineering and Advanced Applications (SEAA)

arXiv:2104.07381 [pdf, other]

On the Assessment of Benchmark Suites for Algorithm Comparison

Authors: David Issa Mattos, Lucas Ruud, Jan Bosch, Helena Holmström Olsson

Abstract: Benchmark suites, i.e. a collection of benchmark functions, are widely used in the comparison of black-box optimization algorithms. Over the years, research has identified many desired qualities for benchmark suites, such as diverse topology, different difficulties, scalability, representativeness of real-world problems among others. However, while the topology characteristics have been subjected… ▽ More Benchmark suites, i.e. a collection of benchmark functions, are widely used in the comparison of black-box optimization algorithms. Over the years, research has identified many desired qualities for benchmark suites, such as diverse topology, different difficulties, scalability, representativeness of real-world problems among others. However, while the topology characteristics have been subjected to previous studies, there is no study that has statistically evaluated the difficulty level of benchmark functions, how well they discriminate optimization algorithms and how suitable is a benchmark suite for algorithm comparison. In this paper, we propose the use of an item response theory (IRT) model, the Bayesian two-parameter logistic model for multiple attempts, to statistically evaluate these aspects with respect to the empirical success rate of algorithms. With this model, we can assess the difficulty level of each benchmark, how well they discriminate different algorithms, the ability score of an algorithm, and how much information the benchmark suite adds in the estimation of the ability scores. We demonstrate the use of this model in two well-known benchmark suites, the Black-Box Optimization Benchmark (BBOB) for continuous optimization and the Pseudo Boolean Optimization (PBO) for discrete optimization. We found that most benchmark functions of BBOB suite have high difficulty levels (compared to the optimization algorithms) and low discrimination. For the PBO, most functions have good discrimination parameters but are often considered too easy. We discuss potential uses of IRT in benchmarking, including its use to improve the design of benchmark suites, to measure multiple aspects of the algorithms, and to design adaptive suites. △ Less

Submitted 15 April, 2021; originally announced April 2021.

Comments: In submission

arXiv:2104.04824 [pdf]

Ariel: Enabling planetary science across light-years

Authors: Giovanna Tinetti, Paul Eccleston, Carole Haswell, Pierre-Olivier Lagage, Jérémy Leconte, Theresa Lüftinger, Giusi Micela, Michel Min, Göran Pilbratt, Ludovic Puig, Mark Swain, Leonardo Testi, Diego Turrini, Bart Vandenbussche, Maria Rosa Zapatero Osorio, Anna Aret, Jean-Philippe Beaulieu, Lars Buchhave, Martin Ferus, Matt Griffin, Manuel Guedel, Paul Hartogh, Pedro Machado, Giuseppe Malaguti, Enric Pallé , et al. (293 additional authors not shown)

Abstract: Ariel, the Atmospheric Remote-sensing Infrared Exoplanet Large-survey, was adopted as the fourth medium-class mission in ESA's Cosmic Vision programme to be launched in 2029. During its 4-year mission, Ariel will study what exoplanets are made of, how they formed and how they evolve, by surveying a diverse sample of about 1000 extrasolar planets, simultaneously in visible and infrared wavelengths.… ▽ More Ariel, the Atmospheric Remote-sensing Infrared Exoplanet Large-survey, was adopted as the fourth medium-class mission in ESA's Cosmic Vision programme to be launched in 2029. During its 4-year mission, Ariel will study what exoplanets are made of, how they formed and how they evolve, by surveying a diverse sample of about 1000 extrasolar planets, simultaneously in visible and infrared wavelengths. It is the first mission dedicated to measuring the chemical composition and thermal structures of hundreds of transiting exoplanets, enabling planetary science far beyond the boundaries of the Solar System. The payload consists of an off-axis Cassegrain telescope (primary mirror 1100 mm x 730 mm ellipse) and two separate instruments (FGS and AIRS) covering simultaneously 0.5-7.8 micron spectral range. The satellite is best placed into an L2 orbit to maximise the thermal stability and the field of regard. The payload module is passively cooled via a series of V-Groove radiators; the detectors for the AIRS are the only items that require active cooling via an active Ne JT cooler. The Ariel payload is developed by a consortium of more than 50 institutes from 16 ESA countries, which include the UK, France, Italy, Belgium, Poland, Spain, Austria, Denmark, Ireland, Portugal, Czech Republic, Hungary, the Netherlands, Sweden, Norway, Estonia, and a NASA contribution. △ Less

Submitted 10 April, 2021; originally announced April 2021.

Comments: Ariel Definition Study Report, 147 pages. Reviewed by ESA Science Advisory Structure in November 2020. Original document available at: https://www.cosmos.esa.int/documents/1783156/3267291/Ariel_RedBook_Nov2020.pdf/

Report number: ESA/SCI(2020)1

arXiv:2104.02126 [pdf, other]

The survival-incorporated median versus the median in the survivors or in the always-survivors: What are we measuring? And why?

Authors: Qingyan Xiang, Ronald J. Bosch, Judith J. Lok

Abstract: Many clinical studies evaluate the benefit of a treatment based on both survival and other continuous/ordinal clinical outcomes, such as Quality of Life scores. In these studies, when subjects die before the follow-up assessment, the clinical outcomes become undefined and are truncated by death. Treating outcomes as "missing" or "censored" due to death can be misleading for treatment effect evalua… ▽ More Many clinical studies evaluate the benefit of a treatment based on both survival and other continuous/ordinal clinical outcomes, such as Quality of Life scores. In these studies, when subjects die before the follow-up assessment, the clinical outcomes become undefined and are truncated by death. Treating outcomes as "missing" or "censored" due to death can be misleading for treatment effect evaluation. We show that if we use the median in the survivors or in the always-survivors as estimands to summarize clinical outcomes, we may conclude that a trade-off exists between the probability of survival and good clinical outcomes, even in settings where both the probability of survival and the probability of any good clinical outcome are better for one treatment. Therefore, we advocate not always treating death as a mechanism through which clinical outcomes are missing, but rather as part of the outcome measure. To account for the survival status, we describe the survival-incorporated median as an alternative summary measure for outcomes in the presence of death. The survival-incorporated median is the threshold such that 50% of the population is alive with an outcome above that threshold. Through conceptual examples and an application to a prostate cancer treatment study, we show that the survival-incorporated median provides a simple and useful summary measure to inform clinical practice. △ Less

Submitted 3 September, 2023; v1 submitted 5 April, 2021; originally announced April 2021.

arXiv:2103.11879 [pdf, other]

Real-time End-to-End Federated Learning: An Automotive Case Study

Authors: Hongyi Zhang, Jan Bosch, Helena Holmström Olsson

Abstract: With the development and the increasing interests in ML/DL fields, companies are eager to apply Machine Learning/Deep Learning approaches to increase service quality and customer experience. Federated Learning was implemented as an effective model training method for distributing and accelerating time-consuming model training while protecting user data privacy. However, common Federated Learning a… ▽ More With the development and the increasing interests in ML/DL fields, companies are eager to apply Machine Learning/Deep Learning approaches to increase service quality and customer experience. Federated Learning was implemented as an effective model training method for distributing and accelerating time-consuming model training while protecting user data privacy. However, common Federated Learning approaches, on the other hand, use a synchronous protocol to conduct model aggregation, which is inflexible and unable to adapt to rapidly changing environments and heterogeneous hardware settings in real-world scenarios. In this paper, we present an approach to real-time end-to-end Federated Learning combined with a novel asynchronous model aggregation protocol. Our method is validated in an industrial use case in the automotive domain, focusing on steering wheel angle prediction for autonomous driving. Our findings show that asynchronous Federated Learning can significantly improve the prediction performance of local edge models while maintaining the same level of accuracy as centralized machine learning. Furthermore, by using a sliding training window, the approach can minimize communication overhead, accelerate model training speed and consume real-time streaming data, proving high efficiency when deploying ML/DL components to heterogeneous real-world embedded systems. △ Less

Submitted 13 September, 2021; v1 submitted 22 March, 2021; originally announced March 2021.

arXiv:2103.04095 [pdf, other]

On the experiences of adopting automated data validation in an industrial machine learning project

Authors: Lucy Ellen Lwakatare, Ellinor Rånge, Ivica Crnkovic, Jan Bosch

Abstract: Background: Data errors are a common challenge in machine learning (ML) projects and generally cause significant performance degradation in ML-enabled software systems. To ensure early detection of erroneous data and avoid training ML models using bad data, research and industrial practice suggest incorporating a data validation process and tool in ML system development process. Aim: The study i… ▽ More Background: Data errors are a common challenge in machine learning (ML) projects and generally cause significant performance degradation in ML-enabled software systems. To ensure early detection of erroneous data and avoid training ML models using bad data, research and industrial practice suggest incorporating a data validation process and tool in ML system development process. Aim: The study investigates the adoption of a data validation process and tool in industrial ML projects. The data validation process demands significant engineering resources for tool development and maintenance. Thus, it is important to identify the best practices for their adoption especially by development teams that are in the early phases of deploying ML-enabled software systems. Method: Action research was conducted at a large-software intensive organization in telecommunications, specifically within the analytics R\&D organization for an ML use case of classifying faults from returned hardware telecommunication devices. Results: Based on the evaluation results and learning from our action research, we identified three best practices, three benefits, and two barriers to adopting the data validation process and tool in ML projects. We also propose a data validation framework (DVF) for systematizing the adoption of a data validation process. Conclusions: The results show that adopting a data validation process and tool in ML projects is an effective approach of testing ML-enabled software systems. It requires having an overview of the level of data (feature, dataset, cross-dataset, data stream) at which certain data quality tests can be applied. △ Less

Submitted 6 March, 2021; originally announced March 2021.

Comments: SEIP 2021 Conference

arXiv:2101.01487 [pdf]

The use of incentives to promote Technical Debt management

Authors: Terese Besker, Antonio Martini, Jan Bosch

Abstract: When developing software, it is vitally important to keep the level of technical debt down since it is well established from several studies that technical debt can, e.g., lower the development productivity, decrease the developers' morale, and compromise the overall quality of the software. However, even if researchers and practitioners working in today's software development industry are quite f… ▽ More When developing software, it is vitally important to keep the level of technical debt down since it is well established from several studies that technical debt can, e.g., lower the development productivity, decrease the developers' morale, and compromise the overall quality of the software. However, even if researchers and practitioners working in today's software development industry are quite familiar with the concept of technical debt and its related negative consequences, there has been no empirical research focusing specifically on how software managers actively communicate and manage the need to keep the level of technical debt as low as possible. △ Less

Submitted 5 January, 2021; originally announced January 2021.

arXiv:2012.00594 [pdf, other]

HPM-Frame: A Decision Framework for Executing Software on Heterogeneous Platforms

Authors: Hugo Andrade, Ola Benderius, Christian Berger, Ivica Crnkovic, Jan Bosch

Abstract: Heterogeneous computing is one of the most important computational solutions to meet rapidly increasing demands on system performance. It typically allows the main flow of applications to be executed on a CPU while the most computationally intensive tasks are assigned to one or more accelerators, such as GPUs and FPGAs. The refactoring of systems for execution on such platforms is highly desired b… ▽ More Heterogeneous computing is one of the most important computational solutions to meet rapidly increasing demands on system performance. It typically allows the main flow of applications to be executed on a CPU while the most computationally intensive tasks are assigned to one or more accelerators, such as GPUs and FPGAs. The refactoring of systems for execution on such platforms is highly desired but also difficult to perform, mainly due the inherent increase in software complexity. After exploration, we have identified a current need for a systematic approach that supports engineers in the refactoring process -- from CPU-centric applications to software that is executed on heterogeneous platforms. In this paper, we introduce a decision framework that assists engineers in the task of refactoring software to incorporate heterogeneous platforms. It covers the software engineering lifecycle through five steps, consisting of questions to be answered in order to successfully address aspects that are relevant for the refactoring procedure. We evaluate the feasibility of the framework in two ways. First, we capture the practitioner's impressions, concerns and suggestions through a questionnaire. Then, we conduct a case study showing the step-by-step application of the framework using a computer vision application in the automotive domain. △ Less

Submitted 10 December, 2020; v1 submitted 1 December, 2020; originally announced December 2020.

Comments: Manuscript submitted to the Journal of Systems and Software

arXiv:2011.06044 [pdf, other]

A Gateway to Astronomical Image Processing: Vera C. RubinObservatory LSST Science Pipelines on AWS

Authors: Dino Bektesevic, Hsin-Fang Chiang, Kian-Tat Lim, Todd L. Miller, Greg Thain, Tim Jenness, James Bosch, Andrei Salnikov, Andrew Connolly

Abstract: The Legacy Survey of Space and Time, operated by the Vera C. Rubin Observatory, is a 10-year astronomical survey due to start operations in 2022 that will image half the sky every three nights. LSST will produce ~20TB of raw data per night which will be calibrated and analyzed in almost real time. Given the volume of LSST data, the traditional subset-download-process paradigm of data reprocessing… ▽ More The Legacy Survey of Space and Time, operated by the Vera C. Rubin Observatory, is a 10-year astronomical survey due to start operations in 2022 that will image half the sky every three nights. LSST will produce ~20TB of raw data per night which will be calibrated and analyzed in almost real time. Given the volume of LSST data, the traditional subset-download-process paradigm of data reprocessing faces significant challenges. We describe here, the first steps towards a gateway for astronomical science that would enable astronomers to analyze images and catalogs at scale. In this first step we focus on executing the Rubin LSST Science Pipelines, a collection of image and catalog processing algorithms, on Amazon Web Services (AWS). We describe our initial impressions on the performance, scalability and cost of deploying such a system in the cloud. △ Less

Submitted 11 November, 2020; originally announced November 2020.

Comments: 5 pages, 6 figures, Gateways 2020 Conference see: https://osf.io/2rqfb/

ACM Class: D.0

arXiv:2010.06342 [pdf, other]

doi 10.1364/OE.412540

Resampling the transmission matrix in an aberration-corrected Bessel mode basis

Authors: Pritam Pai, Jeroen Bosch, Allard P. Mosk

Abstract: The study of the optical transmission matrix (TM) of a sample reveals important statistics of light transport through it. The accuracy of the statistics depends strongly on the orthogonality and completeness of the basis in which the TM is measured. While conventional experimental methods suffer from sampling effects and optical aberrations, we use a basis of Bessel modes of the first kind to fait… ▽ More The study of the optical transmission matrix (TM) of a sample reveals important statistics of light transport through it. The accuracy of the statistics depends strongly on the orthogonality and completeness of the basis in which the TM is measured. While conventional experimental methods suffer from sampling effects and optical aberrations, we use a basis of Bessel modes of the first kind to faithfully recover the singular values, eigenvalues and eigenmodes of light propagation through a finite thickness of air. △ Less

Submitted 6 December, 2020; v1 submitted 13 October, 2020; originally announced October 2020.

arXiv:2010.03783 [pdf, other]

Statistical Models for the Analysis of Optimization Algorithms with Benchmark Functions

Authors: David Issa Mattos, Jan Bosch, Helena Holmström Olsson

Abstract: Frequentist statistical methods, such as hypothesis testing, are standard practice in papers that provide benchmark comparisons. Unfortunately, these methods have often been misused, e.g., without testing for their statistical test assumptions or without controlling for family-wise errors in multiple group comparisons, among several other problems. Bayesian Data Analysis (BDA) addresses many of th… ▽ More Frequentist statistical methods, such as hypothesis testing, are standard practice in papers that provide benchmark comparisons. Unfortunately, these methods have often been misused, e.g., without testing for their statistical test assumptions or without controlling for family-wise errors in multiple group comparisons, among several other problems. Bayesian Data Analysis (BDA) addresses many of the previously mentioned shortcomings but its use is not widely spread in the analysis of empirical data in the evolutionary computing community. This paper provides three main contributions. First, we motivate the need for utilizing Bayesian data analysis and provide an overview of this topic. Second, we discuss the practical aspects of BDA to ensure that our models are valid and the results transparent. Finally, we provide five statistical models that can be used to answer multiple research questions. The online appendix provides a step-by-step guide on how to perform the analysis of the models discussed in this paper, including the code for the statistical models, the data transformations and the discussed tables and figures. △ Less

Submitted 15 May, 2021; v1 submitted 8 October, 2020; originally announced October 2020.

Journal ref: IEEE Transactions on Evolutionary Computation (DOI:10.1109/TEVC.2021.3081167)

arXiv:2010.01075 [pdf, other]

doi 10.1038/s41566-021-00789-9

Scattering invariant modes of light in complex media

Authors: Pritam Pai, Jeroen Bosch, Matthias Kühmayer, Stefan Rotter, Allard P. Mosk

Abstract: Random scattering of light in disordered media is an intriguing phenomenon of fundamental relevance to various applications. While techniques such as wavefront shaping and transmission matrix measurements have enabled remarkable progress for advanced imaging concepts, the most successful strategy to obtain clear images through a disordered medium remains the filtering of ballistic light. Ballistic… ▽ More Random scattering of light in disordered media is an intriguing phenomenon of fundamental relevance to various applications. While techniques such as wavefront shaping and transmission matrix measurements have enabled remarkable progress for advanced imaging concepts, the most successful strategy to obtain clear images through a disordered medium remains the filtering of ballistic light. Ballistic photons with a scattering-free propagation are, however, exponentially rare and no method so far can increase their proportion. To address these limitations, we introduce and experimentally implement here a new set of optical states that we term Scattering Invariant Modes (SIMs), whose transmitted field pattern is the same, irrespective of whether they scatter through a disordered sample or propagate ballistically through a homogeneous medium. We observe SIMs that are only weakly attenuated in dense scattering media, and show in simulations that their correlations with the ballistic light can be used to improve imaging inside scattering materials. △ Less

Submitted 25 January, 2021; v1 submitted 2 October, 2020; originally announced October 2020.

Journal ref: Nature Photonics 15, 431 (2021)

arXiv:2009.03066 [pdf, other]

doi 10.1016/j.parco.2020.102664

Asynchronous Runtime with Distributed Manager for Task-based Programming Models

Authors: Jaume Bosch, Carlos Álvarez, Daniel Jiménez-González, Xavier Martorell, Eduard Ayguadé

Abstract: Parallel task-based programming models, like OpenMP, allow application developers to easily create a parallel version of their sequential codes. The standard OpenMP 4.0 introduced the possibility of describing a set of data dependences per task that the runtime uses to order the tasks execution. This order is calculated using shared graphs, which are updated by all threads in exclusive access usin… ▽ More Parallel task-based programming models, like OpenMP, allow application developers to easily create a parallel version of their sequential codes. The standard OpenMP 4.0 introduced the possibility of describing a set of data dependences per task that the runtime uses to order the tasks execution. This order is calculated using shared graphs, which are updated by all threads in exclusive access using synchronization mechanisms (locks) to ensure the dependence management correctness. The contention in the access to these structures becomes critical in many-core systems because several threads may be wasting computation resources waiting their turn. This paper proposes an asynchronous management of the runtime structures, like task dependence graphs, suitable for task-based programming model runtimes. In such organization, the threads request actions to the runtime instead of doing them directly. The requests are then handled by a distributed runtime manager (DDAST) which does not require dedicated resources. Instead, the manager uses the idle threads to modify the runtime structures. The paper also presents an implementation, analysis and performance evaluation of such runtime organization. The performance results show that the proposed asynchronous organization outperforms the speedup obtained by the original runtime for different benchmarks and different many-core architectures. △ Less

Submitted 8 September, 2020; v1 submitted 7 September, 2020; originally announced September 2020.

Comments: 2020 Parallel Computing

Journal ref: Parallel Computing, Volume 97, 2020

arXiv:2005.08712 [pdf, other]

Refactoring Software in the Automotive Domain for Execution on Heterogeneous Platforms

Authors: Hugo Andrade, Ivica Crnkovic, Jan Bosch

Abstract: The most important way to achieve higher performance in computer systems is through heterogeneous computing, i.e., by adopting hardware platforms containing more than one type of processor, such as CPUs, GPUs, and FPGAs. Several types of algorithms can be executed significantly faster on a heterogeneous platform. However, migrating CPU-executable software to other types of execution platforms pose… ▽ More The most important way to achieve higher performance in computer systems is through heterogeneous computing, i.e., by adopting hardware platforms containing more than one type of processor, such as CPUs, GPUs, and FPGAs. Several types of algorithms can be executed significantly faster on a heterogeneous platform. However, migrating CPU-executable software to other types of execution platforms poses a number of challenges to software engineering. Significant efforts are required in such type of migration, particularly for re-architecting and re-implementing the software. Further, optimizing it in terms of performance and other runtime properties can be very challenging, making the process complex, expensive, and error-prone. Therefore, a systematic approach based on explicit and justified architectural decisions is needed for a successful refactoring process from a homogeneous to a heterogeneous platform. In this paper, we propose a decision framework that supports engineers when refactoring software systems to accommodate heterogeneous platforms. It includes the assessment of important factors in order to minimize the risk of recurrent problems in the process. Through a set of questions, practitioners are able to formulate answers that will help in making appropriate architectural decisions to accommodate heterogeneous platforms. The contents of the framework have been developed and evolved based on discussions with architects and developers in the automotive domain. △ Less

Submitted 18 May, 2020; originally announced May 2020.

Comments: Accepted for publication at the 2020 IEEE 44th Annual Computer Software and Applications Conference (COMPSAC)

Showing 1–50 of 97 results for author: Bosch, J