Skip to main content

Showing 1–50 of 241 results for author: Hewitt, J

  1. Learning Translations via Matrix Completion

    Authors: Derry Wijaya, Brendan Callahan, John Hewitt, Jie Gao, Xiao Ling, Marianna Apidianaki, Chris Callison-Burch

    Abstract: Bilingual Lexicon Induction is the task of learning word translations without bilingual parallel corpora. We model this task as a matrix completion problem, and present an effective and extendable framework for completing the matrix. This method harnesses diverse bilingual and monolingual signals, each of which may be incomplete or noisy. Our model achieves state-of-the-art performance for both hi… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: This is a late posting of an old paper as Google Scholar somehow misses indexing the ACL anthology version of the paper

    ACM Class: I.2.7

    Journal ref: Volume: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, Year: 2017, Pages: 1452-1463

  2. arXiv:2406.08549  [pdf, other

    astro-ph.CO astro-ph.IM

    Investigating Mutual Coupling in the Hydrogen Epoch of Reionization Array and Mitigating its Effects on the 21-cm Power Spectrum

    Authors: E. Rath, R. Pascua, A. T. Josaitis, A. Ewall-Wice, N. Fagnoni, E. de Lera Acedo, Z. E. Martinot, Z. Abdurashidova, T. Adams, J. E. Aguirre, R. Baartman, A. P. Beardsley, L. M. Berkhout, G. Bernardi, T. S. Billings, J. D. Bowman, P. Bull, J. Burba, R. Byrne, S. Carey, K. -F. Chen, S. Choudhuri, T. Cox, D. R. DeBoer, M. Dexter , et al. (56 additional authors not shown)

    Abstract: Interferometric experiments designed to detect the highly redshifted 21-cm signal from neutral hydrogen are producing increasingly stringent constraints on the 21-cm power spectrum, but some k-modes remain systematics-dominated. Mutual coupling is a major systematic that must be overcome in order to detect the 21-cm signal, and simulations that reproduce effects seen in the data can guide strategi… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 19 pages, 12 figures, submitted to MNRAS

  3. arXiv:2405.02111  [pdf, ps, other

    physics.geo-ph physics.flu-dyn

    Modelling the evolution of an ice sheet's weathering crust

    Authors: Tilly Woods, Ian J. Hewitt

    Abstract: The weathering crust is a layer of porous ice that can form at the surface of an ice sheet. It grows and decays in response changing weather and climate conditions, affecting the albedo, the melt rate, and the transport of meltwater across the surface. To understand this behaviour, we seek time-dependent solutions to a continuum, thermodynamic model for the porosity, temperature and thickness of t… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: 36 pages, 11 figures

  4. arXiv:2402.08659  [pdf, other

    astro-ph.CO astro-ph.IM

    A demonstration of the effect of fringe-rate filtering in the Hydrogen Epoch of Reionization Array delay power spectrum pipeline

    Authors: Hugh Garsden, Philip Bull, Mike Wilensky, Zuhra Abdurashidova, Tyrone Adams, James E. Aguirre, Paul Alexander, Zaki S. Ali, Rushelle Baartman, Yanga Balfour, Adam P. Beardsley, Lindsay M. Berkhout, Gianni Bernardi, Tashalee S. Billings, Judd D. Bowman, Richard F. Bradley, Jacob Burba, Steven Carey, Chris L. Carilli, Kai-Feng Chen, Carina Cheng, Samir Choudhuri, David R. DeBoer, Eloy de Lera Acedo, Matt Dexter , et al. (72 additional authors not shown)

    Abstract: Radio interferometers targeting the 21cm brightness temperature fluctuations at high redshift are subject to systematic effects that operate over a range of different timescales. These can be isolated by designing appropriate Fourier filters that operate in fringe-rate (FR) space, the Fourier pair of local sidereal time (LST). Applications of FR filtering include separating effects that are correl… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 21 pages, 18 figures, submitted to Monthly Notices of the Royal Astronomical Society

  5. arXiv:2402.06155  [pdf, other

    cs.CL

    Model Editing with Canonical Examples

    Authors: John Hewitt, Sarah Chen, Lanruo Lora Xie, Edward Adams, Percy Liang, Christopher D. Manning

    Abstract: We introduce model editing with canonical examples, a setting in which (1) a single learning example is provided per desired behavior, (2) evaluation is performed exclusively out-of-distribution, and (3) deviation from an initial model is strictly limited. A canonical example is a simple instance of good behavior, e.g., The capital of Mauritius is Port Louis) or bad behavior, e.g., An aspect of re… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  6. arXiv:2401.10057  [pdf, other

    stat.ME physics.soc-ph stat.AP

    A method for characterizing disease emergence curves from paired pathogen detection and serology data

    Authors: Joshua Hewitt, Grete Wilson-Henjum, Derek T. Collins, Jourdan M. Ringenberg, Christopher A. Quintanal, Robert Pleszewski, Jeffrey C. Chandler, Thomas J. DeLiberto, Kim M. Pepin

    Abstract: Wildlife disease surveillance programs and research studies track infection and identify risk factors for wild populations, humans, and agriculture. Often, several types of samples are collected from individuals to provide more complete information about an animal's infection history. Methods that jointly analyze multiple data streams to study disease emergence and drivers of infection via epidemi… ▽ More

    Submitted 13 May, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: 22 pages, 5 figures, 1 table

  7. Hydrogen Epoch of Reionization Array (HERA) Phase II Deployment and Commissioning

    Authors: Lindsay M. Berkhout, Daniel C. Jacobs, Zuhra Abdurashidova, Tyrone Adams, James E. Aguirre, Paul Alexander, Zaki S. Ali, Rushelle Baartman, Yanga Balfour, Adam P. Beardsley, Gianni Bernardi, Tashalee S. Billings, Judd D. Bowman, Richard F. Bradley, Philip Bull, Jacob Burba, Steven Carey, Chris L. Carilli, Kai-Feng Chen, Carina Cheng, Samir Choudhuri, David R. DeBoer, Eloy de Lera Acedo, Matt Dexter, Joshua S. Dillon , et al. (71 additional authors not shown)

    Abstract: This paper presents the design and deployment of the Hydrogen Epoch of Reionization Array (HERA) phase II system. HERA is designed as a staged experiment targeting 21 cm emission measurements of the Epoch of Reionization. First results from the phase I array are published as of early 2022, and deployment of the phase II system is nearing completion. We describe the design of the phase II system an… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Journal ref: PASP 2024 136 045002

  8. arXiv:2312.10944  [pdf

    cs.CV

    From Whole-slide Image to Biomarker Prediction: A Protocol for End-to-End Deep Learning in Computational Pathology

    Authors: Omar S. M. El Nahhas, Marko van Treeck, Georg Wölflein, Michaela Unger, Marta Ligero, Tim Lenz, Sophia J. Wagner, Katherine J. Hewitt, Firas Khader, Sebastian Foersch, Daniel Truhn, Jakob Nikolas Kather

    Abstract: Hematoxylin- and eosin (H&E) stained whole-slide images (WSIs) are the foundation of diagnosis of cancer. In recent years, development of deep learning-based methods in computational pathology enabled the prediction of biomarkers directly from WSIs. However, accurately linking tissue phenotype to biomarkers at scale remains a crucial challenge for democratizing complex biomarkers in precision onco… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  9. arXiv:2312.09763  [pdf, other

    astro-ph.IM

    matvis: A matrix-based visibility simulator for fast forward modelling of many-element 21 cm arrays

    Authors: Piyanat Kittiwisit, Steven G. Murray, Hugh Garsden, Philip Bull, Christopher Cain, Aaron R. Parsons, Jackson Sipple, Zara Abdurashidova, Tyrone Adams, James E. Aguirre, Paul Alexander, Zaki S. Ali, Rushelle Baartman, Yanga Balfour, Adam P. Beardsley, Lindsay M. Berkhout, Gianni Bernardi, Tashalee S. Billings, Judd D. Bowman, Richard F. Bradley, Jacob Burba, Steven Carey, Chris L. Carilli, Kai-Feng Chen, Carina Cheng , et al. (73 additional authors not shown)

    Abstract: Detection of the faint 21 cm line emission from the Cosmic Dawn and Epoch of Reionisation will require not only exquisite control over instrumental calibration and systematics to achieve the necessary dynamic range of observations but also validation of analysis techniques to demonstrate their statistical properties and signal loss characteristics. A key ingredient in achieving this is the ability… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: 25 pages, 20 figures, submitted to RAS Techniques and Instruments, matvis is publicly available at https://github.com/HERA-Team/matvis

  10. arXiv:2312.03697  [pdf, other

    astro-ph.IM astro-ph.CO

    Bayesian estimation of cross-coupling and reflection systematics in 21cm array visibility data

    Authors: Geoff G. Murphy, Philip Bull, Mario G. Santos, Zara Abdurashidova, Tyrone Adams, James E. Aguirre, Paul Alexander, Zaki S. Ali, Rushelle Baartman, Yanga Balfour, Adam P. Beardsley, Gianni Bernardi, Tashalee Billings, Judd D. Bowman, Richard F. Bradley, Jacob Burba, Christopher Cain, Steven Carey, Chris L. Carilli, Carina Cheng, David R. DeBoer, Eloy de Lera Acedo, Matt Dexter, Joshua S. Dillon, Nico Eksteen , et al. (54 additional authors not shown)

    Abstract: Observations with radio arrays that target the 21-cm signal originating from the early Universe suffer from a variety of systematic effects. An important class of these are reflections and spurious couplings between antennas. We apply a Hamiltonian Monte Carlo sampler to the modelling and mitigation of these systematics in simulated Hydrogen Epoch of Reionisation Array (HERA) data. This method all… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: 19 pages, 14 figures, submitted to MNRAS

  11. arXiv:2311.10711  [pdf, other

    astro-ph.IM astro-ph.CO

    Direct Optimal Mapping Image Power Spectrum and its Window Functions

    Authors: Zhilei Xu, Honggeun Kim, Jacqueline N. Hewitt, Kai-Feng Chen, Nicholas S. Kern, Eleanor Rath, Ruby Byrne, Adélie Gorce, Robert Pascua, Zachary E. Martinot, Joshua S. Dillon, Bryna J. Hazelton, Adrian Liu, Miguel F. Morales, Zara Abdurashidova, Tyrone Adams, James E. Aguirre, Paul Alexander, Zaki S. Ali, Rushelle Baartman, Yanga Balfour, Adam P. Beardsley, Gianni Bernardi, Tashalee S. Billings, Judd D. Bowman , et al. (57 additional authors not shown)

    Abstract: The key to detecting neutral hydrogen during the epoch of reionization (EoR) is to separate the cosmological signal from the dominating foreground radiation. We developed direct optimal mapping (DOM) to map interferometric visibilities; it contains only linear operations, with full knowledge of point spread functions from visibilities to images. Here, we demonstrate a fast Fourier transform-based… ▽ More

    Submitted 5 July, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

    Comments: Published in ApJ

  12. arXiv:2310.19901  [pdf, other

    astro-ph.HE

    Detection of large-scale synchrotron radiation from the molecular envelope of the Sgr B cloud complex at the Galactic center

    Authors: F. Yusef-Zadeh, M. Wardle, R. Arendt, J. W. Hewitt, Y. Hu, A. Lazarian, N. Kassim, S. Hyman, I. Heywood

    Abstract: We present highly sensitive measurements taken with MeerKAT at 1280 MHz as well as archival GBT, MWA and VLA images at 333, 88 and 74 MHz. We report the detection of synchrotron radio emission from the infrared dark cloud (IRDC) associated with the halo of the Sgr B complex on a scale of ~60 pc. A strong spatial correlation between low-frequency radio continuum emission and dense molecular gas, co… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: 10 pages, 4 figures, MN (in press)

  13. arXiv:2310.12751  [pdf, other

    cs.CL

    Character-level Chinese Backpack Language Models

    Authors: Hao Sun, John Hewitt

    Abstract: The Backpack is a Transformer alternative shown to improve interpretability in English language modeling by decomposing predictions into a weighted sum of token sense components. However, Backpacks' reliance on token-defined meaning raises questions as to their potential for languages other than English, a language for which subword tokenization provides a reasonable approximation for lexical item… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: BlackboxNLP 2023 Camera-Ready

  14. arXiv:2310.01693  [pdf, other

    cs.CL

    Closing the Curious Case of Neural Text Degeneration

    Authors: Matthew Finlayson, John Hewitt, Alexander Koller, Swabha Swayamdipta, Ashish Sabharwal

    Abstract: Despite their ubiquity in language generation, it remains unknown why truncation sampling heuristics like nucleus sampling are so effective. We provide a theoretical explanation for the effectiveness of the truncation sampling by proving that truncation methods that discard tokens below some probability threshold (the most common type of truncation) can guarantee that all sampled tokens have nonze… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

    MSC Class: 68T50 ACM Class: I.2.7

  15. arXiv:2308.00162  [pdf, other

    cond-mat.soft physics.geo-ph

    Soft matter physics of the ground beneath our feet

    Authors: Anne Voigtländer, Morgane Houssais, Karol A. Bacik, Ian C. Bourg, Justin C. Burton, Karen E. Daniels, Sujit S. Datta, Emanuela Del Gado, Nakul S. Deshpande, Olivier Devauchelle, Behrooz Ferdowsi, Rachel Glade, Lucas Goehring, Ian J. Hewitt, Douglas Jerolmack, Ruben Juanes, Arshad Kudrolli, Ching-Yao Lai, Wei Li, Claire Masteller, Kavinda Nissanka, Allan M. Rubin, Howard A. Stone, Jenny Suckale, Nathalie M. Vriend , et al. (2 additional authors not shown)

    Abstract: Inspired by presentations by the authors during a workshop organized at the Princeton Center for Theoretical Science (PCTS) in January 2022, we present a perspective on some of the outstanding questions related to the "physics of the ground beneath our feet." These identified challenges are intrinsically shared with the field of Soft Matter but also have unique aspects when the natural environment… ▽ More

    Submitted 31 July, 2023; originally announced August 2023.

    Comments: Perspective Paper, 30 pages, 15 figures

  16. arXiv:2307.12826  [pdf, other

    astro-ph.CO astro-ph.IM

    The Impact of Beam Variations on Power Spectrum Estimation for 21 cm Cosmology II: Mitigation of Foreground Systematics for HERA

    Authors: Honggeun Kim, Nicholas S. Kern, Jacqueline N. Hewitt, Bang D. Nhan, Joshua S. Dillon, Eloy de Lera Acedo, Scott B. C. Dynes, Nivedita Mahesh, Nicolas Fagnoni, David R. DeBoer

    Abstract: One key challenge in detecting 21 cm cosmological signal at z > 6 is to separate the cosmological signal from foreground emission. This can be studied in a power spectrum space where the foreground is confined to low delay modes whereas the cosmological signal can spread out to high delay modes. When there is a calibration error, however, chromaticity of gain errors propagates to the power spectru… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: Accepted for publication in ApJ

  17. The Third Fermi Large Area Telescope Catalog of Gamma-ray Pulsars

    Authors: David A. Smith, Philippe Bruel, Colin J. Clark, Lucas Guillemot, Matthew T. Kerr, Paul Ray, Soheila Abdollahi, Marco Ajello, Luca Baldini, Jean Ballet, Matthew Baring, Cees Bassa, Josefa Becerra Gonzalez, Ronaldo Bellazzini, Alessandra Berretta, Bhaswati Bhattacharyya, Elisabetta Bissaldi, Raffaella Bonino, Eugenio Bottacini, Johan Bregeon, Marta Burgay, Toby Burnett, Rob Cameron, Fernando Camilo, Regina Caputo , et al. (134 additional authors not shown)

    Abstract: We present 294 pulsars found in GeV data from the Large Area Telescope (LAT) on the Fermi Gamma-ray Space Telescope. Another 33 millisecond pulsars (MSPs) discovered in deep radio searches of LAT sources will likely reveal pulsations once phase-connected rotation ephemerides are achieved. A further dozen optical and/or X-ray binary systems co-located with LAT sources also likely harbor gamma-ray M… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: 142 pages. Accepted by the Astrophysical Journal Supplement

  18. arXiv:2307.03172  [pdf, other

    cs.CL

    Lost in the Middle: How Language Models Use Long Contexts

    Authors: Nelson F. Liu, Kevin Lin, John Hewitt, Ashwin Paranjape, Michele Bevilacqua, Fabio Petroni, Percy Liang

    Abstract: While recent language models have the ability to take long contexts as input, relatively little is known about how well they use longer context. We analyze the performance of language models on two tasks that require identifying relevant information in their input contexts: multi-document question answering and key-value retrieval. We find that performance can degrade significantly when changing t… ▽ More

    Submitted 20 November, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: 18 pages, 16 figures. Accepted for publication in Transactions of the Association for Computational Linguistics (TACL), 2023

  19. arXiv:2305.16765  [pdf, other

    cs.CL

    Backpack Language Models

    Authors: John Hewitt, John Thickstun, Christopher D. Manning, Percy Liang

    Abstract: We present Backpacks: a new neural architecture that marries strong modeling performance with an interface for interpretability and control. Backpacks learn multiple non-contextual sense vectors for each word in a vocabulary, and represent a word in a sequence as a context-dependent, non-negative linear combination of sense vectors in this sequence. We find that, after training, sense vectors spec… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: ACL 2023 Camera-Ready

  20. arXiv:2304.05153  [pdf

    cs.CV cs.AI

    Regression-based Deep-Learning predicts molecular biomarkers from pathology slides

    Authors: Omar S. M. El Nahhas, Chiara M. L. Loeffler, Zunamys I. Carrero, Marko van Treeck, Fiona R. Kolbinger, Katherine J. Hewitt, Hannah S. Muti, Mara Graziani, Qinghe Zeng, Julien Calderaro, Nadina Ortiz-Brüchle, Tanwei Yuan, Michael Hoffmeister, Hermann Brenner, Alexander Brobeil, Jorge S. Reis-Filho, Jakob Nikolas Kather

    Abstract: Deep Learning (DL) can predict biomarkers from cancer histopathology. Several clinically approved applications use this technology. Most approaches, however, predict categorical labels, whereas biomarkers are often continuous measurements. We hypothesized that regression-based DL outperforms classification-based DL. Therefore, we developed and evaluated a new self-supervised attention-based weakly… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

  21. Fermi-GBM Discovery of GRB 221009A: An Extraordinarily Bright GRB from Onset to Afterglow

    Authors: S. Lesage, P. Veres, M. S. Briggs, A. Goldstein, D. Kocevski, E. Burns, C. A. Wilson-Hodge, P. N. Bhat, D. Huppenkothen, C. L. Fryer, R. Hamburg, J. Racusin, E. Bissaldi, W. H. Cleveland, S. Dalessi, C. Fletcher, M. M. Giles, B. A. Hristov, C. M. Hui, B. Mailyan, C. Malacaria, S. Poolakkil, O. J. Roberts, A. von Kienlin, J. Wood , et al. (115 additional authors not shown)

    Abstract: We report the discovery of GRB 221009A, the highest flux gamma-ray burst ever observed by the Fermi Gamma-ray Burst Monitor (GBM). This GRB has continuous prompt emission lasting more than 600 seconds which smoothly transitions to afterglow visible in the GBM energy range (8 keV--40 MeV), and total energetics higher than any other burst in the GBM sample. By using a variety of new and existing ana… ▽ More

    Submitted 12 July, 2023; v1 submitted 24 March, 2023; originally announced March 2023.

    Comments: 26 pages 7 figures - accepted for publication in ApJL

  22. arXiv:2302.07969  [pdf, other

    astro-ph.CO astro-ph.IM

    Search for the Epoch of Reionisation with HERA: Upper Limits on the Closure Phase Delay Power Spectrum

    Authors: Pascal M. Keller, Bojan Nikolic, Nithyanandan Thyagarajan, Chris L. Carilli, Gianni Bernardi, Ntsikelelo Charles, Landman Bester, Oleg M. Smirnov, Nicholas S. Kern, Joshua S. Dillon, Bryna J. Hazelton, Miguel F. Morales, Daniel C. Jacobs, Aaron R. Parsons, Zara Abdurashidova, Tyrone Adams, James E. Aguirre, Paul Alexander, Zaki S. Ali, Rushelle Baartman, Yanga Balfour, Adam P. Beardsley, Tashalee S. Billings, Judd D. Bowman, Richard F. Bradley , et al. (58 additional authors not shown)

    Abstract: Radio interferometers aiming to measure the power spectrum of the redshifted 21 cm line during the Epoch of Reionisation (EoR) need to achieve an unprecedented dynamic range to separate the weak signal from overwhelming foreground emissions. Calibration inaccuracies can compromise the sensitivity of these measurements to the effect that a detection of the EoR is precluded. An alternative to standa… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

    Comments: 16 pages, 14 figures, accepted for publication by MNRAS

  23. A Machine Learning Approach for Player and Position Adjusted Expected Goals in Football (Soccer)

    Authors: James H. Hewitt, Oktay Karakuş

    Abstract: Football is a very result-driven industry, with goals being rarer than in most sports, so having further parameters to judge the performance of teams and individuals is key. Expected Goals (xG) allow further insight than just a scoreline. To tackle the need for further analysis in football, this paper uses machine learning applications that are developed and applied to Football Event data. From th… ▽ More

    Submitted 2 May, 2023; v1 submitted 19 January, 2023; originally announced January 2023.

    Comments: 16 pages, 8 tables, 6 figures

  24. arXiv:2212.07961  [pdf, other

    physics.data-an math.AT physics.ao-ph

    Topological Data Analysis Detects Percolation Thresholds in Arctic Melt-Pond Evolution

    Authors: Wilfred Offord, Michael Coughlan, Ian J. Hewitt, Heather A. Harrington, Gillian Grindstaff

    Abstract: During the summer melt period, ponds form on the surface of Arctic sea ice as it melts, with important consequences for ice evolution and marine ecology. Due to the ice-albedo feedback, these melt ponds experience uneven heat absorption, and exhibit complex patterns, which has motivated the development of modelling and data analysis to understand their particular dynamics. We provide a multiscale… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

    Comments: 13 pages, 13 figures

    MSC Class: 86A40

  25. arXiv:2212.03419  [pdf, other

    cs.CL cs.LG

    JamPatoisNLI: A Jamaican Patois Natural Language Inference Dataset

    Authors: Ruth-Ann Armstrong, John Hewitt, Christopher Manning

    Abstract: JamPatoisNLI provides the first dataset for natural language inference in a creole language, Jamaican Patois. Many of the most-spoken low-resource languages are creoles. These languages commonly have a lexicon derived from a major world language and a distinctive grammar reflecting the languages of the original speakers and the process of language birth by creolization. This gives them a distincti… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.

    Comments: 14 pages, 3 figures, Findings of EMNLP 2022

    ACM Class: I.2.7

  26. arXiv:2210.16421  [pdf, other

    astro-ph.IM astro-ph.CO

    The Impact of Beam Variations on Power Spectrum Estimation for 21-cm Cosmology I: Simulations of Foreground Contamination for HERA

    Authors: Honggeun Kim, Bang D. Nhan, Jacqueline N. Hewitt, Nicholas S. Kern, Joshua S. Dillon, Eloy de Lera Acedo, Scott Dynes, Nivedita Mahesh, Nicolas Fagnoni, David R. DeBoer

    Abstract: Detecting cosmological signals from the Epoch of Reionization (EoR) requires high-precision calibration to isolate the cosmological signals from foreground emission. In radio interferometery, perturbed primary beams of antenna elements can disrupt the precise calibration, which results in contaminating the foreground-free region, or the EoR window, in the cylindrically averaged power spectrum. For… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: Accepted for publication in ApJ

  27. arXiv:2210.15191  [pdf, other

    cs.CL

    Truncation Sampling as Language Model Desmoothing

    Authors: John Hewitt, Christopher D. Manning, Percy Liang

    Abstract: Long samples of text from neural language models can be of poor quality. Truncation sampling algorithms--like top-$p$ or top-$k$ -- address this by setting some words' probabilities to zero at each step. This work provides framing for the aim of truncation, and an improved algorithm for that aim. We propose thinking of a neural language model as a mixture of a true distribution and a smoothing dis… ▽ More

    Submitted 27 October, 2022; originally announced October 2022.

    Comments: Findings of EMNLP, + small fixes

  28. arXiv:2210.14927  [pdf, other

    astro-ph.IM astro-ph.CO

    Characterization Of Inpaint Residuals In Interferometric Measurements of the Epoch Of Reionization

    Authors: Michael Pagano, Jing Liu, Adrian Liu, Nicholas S. Kern, Aaron Ewall-Wice, Philip Bull, Robert Pascua, Siamak Ravanbakhsh, Zara Abdurashidova, Tyrone Adams, James E. Aguirre, Paul Alexander, Zaki S. Ali, Rushelle Baartman, Yanga Balfour, Adam P. Beardsley, Gianni Bernardi, Tashalee S. Billings, Judd D. Bowman, Richard F. Bradley, Jacob Burba, Steven Carey, Chris L. Carilli, Carina Cheng, David R. DeBoer , et al. (53 additional authors not shown)

    Abstract: Radio Frequency Interference (RFI) is one of the systematic challenges preventing 21cm interferometric instruments from detecting the Epoch of Reionization. To mitigate the effects of RFI on data analysis pipelines, numerous inpaint techniques have been developed to restore RFI corrupted data. We examine the qualitative and quantitative errors introduced into the visibilities and power spectrum du… ▽ More

    Submitted 20 February, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

    Comments: 21 pages, 13 figures

  29. arXiv:2210.04912  [pdf, other

    astro-ph.CO astro-ph.GA astro-ph.IM

    Improved Constraints on the 21 cm EoR Power Spectrum and the X-Ray Heating of the IGM with HERA Phase I Observations

    Authors: The HERA Collaboration, Zara Abdurashidova, Tyrone Adams, James E. Aguirre, Paul Alexander, Zaki S. Ali, Rushelle Baartman, Yanga Balfour, Rennan Barkana, Adam P. Beardsley, Gianni Bernardi, Tashalee S. Billings, Judd D. Bowman, Richard F. Bradley, Daniela Breitman, Philip Bull, Jacob Burba, Steve Carey, Chris L. Carilli, Carina Cheng, Samir Choudhuri, David R. DeBoer, Eloy de Lera Acedo, Matt Dexter, Joshua S. Dillon , et al. (70 additional authors not shown)

    Abstract: We report the most sensitive upper limits to date on the 21 cm epoch of reionization power spectrum using 94 nights of observing with Phase I of the Hydrogen Epoch of Reionization Array (HERA). Using similar analysis techniques as in previously reported limits (HERA Collaboration 2022a), we find at 95% confidence that $Δ^2(k = 0.34$ $h$ Mpc$^{-1}$) $\leq 457$ mK$^2$ at $z = 7.9$ and that… ▽ More

    Submitted 19 January, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: 57 pages, 37 figures. Updated to match the accepted ApJ version. Corresponding author: Joshua S. Dillon

    Journal ref: 2023 ApJ 945 124

  30. arXiv:2210.03721  [pdf, other

    astro-ph.CO astro-ph.IM

    Impact of instrument and data characteristics in the interferometric reconstruction of the 21 cm power spectrum

    Authors: Adélie Gorce, Samskruthi Ganjam, Adrian Liu, Steven G. Murray, Zara Abdurashidova, Tyrone Adams, James E. Aguirre, Paul Alexander, Zaki S. Ali, Rushelle Baartman, Yanga Balfour, Adam P. Beardsley, Gianni Bernardi, Tashalee S. Billings, Judd D. Bowman, Richard F. Bradley, Philip Bull, Jacob Burba, Steven Carey, Chris L. Carilli, Carina Cheng, David R. DeBoer, Eloy de Lera Acedo, Matt Dexter, Joshua S. Dillon , et al. (53 additional authors not shown)

    Abstract: Combining the visibilities measured by an interferometer to form a cosmological power spectrum is a complicated process. In a delay-based analysis, the mapping between instrumental and cosmological space is not a one-to-one relation. Instead, neighbouring modes contribute to the power measured at one point, with their respective contributions encoded in the window functions. To better understand t… ▽ More

    Submitted 11 January, 2023; v1 submitted 7 October, 2022; originally announced October 2022.

    Comments: 18 pages, 19 figures, accepted for publication in MNRAS

  31. arXiv:2206.10033  [pdf, other

    cs.CV

    Test Time Transform Prediction for Open Set Histopathological Image Recognition

    Authors: Adrian Galdran, Katherine J. Hewitt, Narmin L. Ghaffari, Jakob N. Kather, Gustavo Carneiro, Miguel A. González Ballester

    Abstract: Tissue typology annotation in Whole Slide histological images is a complex and tedious, yet necessary task for the development of computational pathology models. We propose to address this problem by applying Open Set Recognition techniques to the task of jointly classifying tissue that belongs to a set of annotated classes, e.g. clinically relevant tissue categories, while rejecting in test time… ▽ More

    Submitted 27 June, 2022; v1 submitted 20 June, 2022; originally announced June 2022.

    Comments: Accepted to MICCAI 2022

  32. Search for new cosmic-ray acceleration sites within the 4FGL catalog Galactic plane sources

    Authors: Fermi-LAT Collaboration, S. Abdollahi, F. Acero, M. Ackermann, L. Baldini, J. Ballet, G. Barbiellini, D. Bastieri, R. Bellazzini, B. Berenji, A. Berretta, E. Bissaldi, R. D. Blandford, R. Bonino, P. Bruel, S. Buson, R. A. Cameron, R. Caputo, P. A. Caraveo, D. Castro, G. Chiaro, N. Cibrario, S. Ciprini, J. Coronado-Blázquez, M. Crnogorcevic , et al. (95 additional authors not shown)

    Abstract: Cosmic rays are mostly composed of protons accelerated to relativistic speeds. When those protons encounter interstellar material, they produce neutral pions which in turn decay into gamma rays. This offers a compelling way to identify the acceleration sites of protons. A characteristic hadronic spectrum, with a low-energy break around 200 MeV, was detected in the gamma-ray spectra of four Superno… ▽ More

    Submitted 6 May, 2022; originally announced May 2022.

    Comments: Accepted for publication in The Astrophysical Journal

  33. arXiv:2204.06021  [pdf, other

    astro-ph.CO astro-ph.IM

    Direct Optimal Mapping for 21cm Cosmology: A Demonstration with the Hydrogen Epoch of Reionization Array

    Authors: Zhilei Xu, Jacqueline N. Hewitt, Kai-Feng Chen, Honggeun Kim, Joshua S. Dillon, Nicholas S. Kern, Miguel F. Morales, Bryna J. Hazelton, Ruby Byrne, Nicolas Fagnoni, Eloy de Lera Acedo, Zara Abdurashidova, Tyrone Adams, James E. Aguirre, Paul Alexander, Zaki S. Ali, Rushelle Baartman, Yanga Balfour, Adam P. Beardsley, Gianni Bernardi, Tashalee S. Billings, Judd D. Bowman, Richard F. Bradley, Philip Bull, Jacob Burba , et al. (56 additional authors not shown)

    Abstract: Motivated by the desire for wide-field images with well-defined statistical properties for 21cm cosmology, we implement an optimal mapping pipeline that computes a maximum likelihood estimator for the sky using the interferometric measurement equation. We demonstrate this direct optimal mapping with data from the Hydrogen Epoch of Reionization (HERA) Phase I observations. After validating the pipe… ▽ More

    Submitted 26 October, 2022; v1 submitted 12 April, 2022; originally announced April 2022.

    Comments: 16 pages, 10 figures, 2 tables, published on ApJ

  34. A Gamma-ray Pulsar Timing Array Constrains the Nanohertz Gravitational Wave Background

    Authors: M. Ajello, W. B. Atwood, L. Baldini, J. Ballet, G. Barbiellini, D. Bastieri, R. Bellazzini, A. Berretta, B. Bhattacharyya, E. Bissaldi, R. D. Blandford, E. Bloom, R. Bonino, P. Bruel, R. Buehler, E. Burns, S. Buson, R. A. Cameron, P. A. Caraveo, E. Cavazzuti, N. Cibrario, S. Ciprini, C. J. Clark, I. Cognard, J. Coronado-Blázquez , et al. (107 additional authors not shown)

    Abstract: After large galaxies merge, their central supermassive black holes are expected to form binary systems whose orbital motion generates a gravitational wave background (GWB) at nanohertz frequencies. Searches for this background utilize pulsar timing arrays, which perform long-term monitoring of millisecond pulsars (MSPs) at radio wavelengths. We use 12.5 years of Fermi Large Area Telescope data to… ▽ More

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: 3 figures in the main text. 3 figures and 8 tables are in the supplementary material

  35. Incremental Fermi Large Area Telescope Fourth Source Catalog

    Authors: Fermi-LAT collaboration, :, Soheila Abdollahi, Fabio Acero, Luca Baldini, Jean Ballet, Denis Bastieri, Ronaldo Bellazzini, Bijan Berenji, Alessandra Berretta, Elisabetta Bissaldi, Roger D. Blandford, Elliott Bloom, Raffaella Bonino, Ari Brill, Richard J. Britto, Philippe Bruel, Toby H. Burnett, Sara Buson, Rob A. Cameron, Regina Caputo, Patrizia A. Caraveo, Daniel Castro, Sylvain Chaty, Teddy C. Cheung , et al. (116 additional authors not shown)

    Abstract: We present an incremental version (4FGL-DR3, for Data Release 3) of the fourth Fermi-LAT catalog of gamma-ray sources. Based on the first twelve years of science data in the energy range from 50 MeV to 1 TeV, it contains 6658 sources. The analysis improves on that used for the 4FGL catalog over eight years of data: more sources are fit with curved spectra, we introduce a more robust spectral param… ▽ More

    Submitted 10 May, 2022; v1 submitted 26 January, 2022; originally announced January 2022.

    Comments: accepted in ApJS; follow-up paper to 1902.10045

    Journal ref: ApJS 260, 53 (2022)

  36. arXiv:2201.01103  [pdf, other

    physics.flu-dyn cond-mat.soft

    Bendocapillary Instability of Liquid in a Flexible-Walled Channel

    Authors: Alexander T. Bradley, Ian J. Hewitt, Dominic Vella

    Abstract: We study the bendocapillary instability of a liquid droplet that part fills a flexible walled channel. Inspired by experiments in which a `weaving' pattern emerges as droplets of liquid are condensed slowly into deformable microchannels, we develop a mathematical model of this instability. We describe equilibria of the system, and use a combination of numerical methods, and asymptotic analysis in… ▽ More

    Submitted 7 December, 2022; v1 submitted 4 January, 2022; originally announced January 2022.

    Comments: 23 pages, 8 figures

    Journal ref: J. Fluid Mech. 955, A26 (2023)

  37. arXiv:2111.05593  [pdf, other

    math.NA physics.geo-ph

    Numerical approximation of viscous contact problems applied to glacial sliding

    Authors: Gonzalo G. de Diego, Patrick E. Farrell, Ian J. Hewitt

    Abstract: Viscous contact problems describe the time evolution of fluid flows in contact with a surface from which they can detach and reattach. These problems are of particular importance in glaciology, where they arise in the study of grounding lines and subglacial cavities. In this work, we propose a novel numerical method for solving viscous contact problems based on a mixed formulation with Lagrange mu… ▽ More

    Submitted 21 January, 2022; v1 submitted 10 November, 2021; originally announced November 2021.

    MSC Class: 86A40; 65K15 ACM Class: G.1.8; G.1.10

  38. arXiv:2109.12733  [pdf, other

    astro-ph.IM astro-ph.CO

    Automated Detection of Antenna Malfunctions in Large-N Interferometers: A Case Study with the Hydrogen Epoch of Reionization Array

    Authors: Dara Storer, Joshua S. Dillon, Daniel C. Jacobs, Miguel F. Morales, Bryna J. Hazelton, Aaron Ewall-Wice, Zara Abdurashidova, James E. Aguirre, Paul Alexander, Zaki S. Ali, Yanga Balfour, Adam P. Beardsley, Gianni Bernardi, Tashalee S. Billings, Judd D. Bowman, Richard F. Bradley, Philip Bull, Jacob Burba, Steven Carey, Chris L. Carilli, Carina Cheng, David R. DeBoer, Eloy de Lera Acedo, Matt Dexter, Scott Dynes , et al. (53 additional authors not shown)

    Abstract: We present a framework for identifying and flagging malfunctioning antennas in large radio interferometers. We outline two distinct categories of metrics designed to detect outliers along known failure modes of large arrays: cross-correlation metrics, based on all antenna pairs, and auto-correlation metrics, based solely on individual antennas. We define and motivate the statistical framework for… ▽ More

    Submitted 4 May, 2022; v1 submitted 26 September, 2021; originally announced September 2021.

    Comments: 31 pages, 17 figures

    Journal ref: Radio Science, vol. 57, no. 1, 2022

  39. arXiv:2109.09234  [pdf, other

    cs.CL

    Conditional probing: measuring usable information beyond a baseline

    Authors: John Hewitt, Kawin Ethayarajh, Percy Liang, Christopher D. Manning

    Abstract: Probing experiments investigate the extent to which neural representations make properties -- like part-of-speech -- predictable. One suggests that a representation encodes a property if probing that representation produces higher accuracy than probing a baseline representation like non-contextual word embeddings. Instead of using baselines as a point of comparison, we're interested in measuring i… ▽ More

    Submitted 19 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021 + typo fixes

  40. arXiv:2108.07282  [pdf, other

    astro-ph.CO astro-ph.GA hep-th

    HERA Phase I Limits on the Cosmic 21-cm Signal: Constraints on Astrophysics and Cosmology During the Epoch of Reionization

    Authors: The HERA Collaboration, Zara Abdurashidova, James E. Aguirre, Paul Alexander, Zaki Ali, Yanga Balfour, Rennan Barkana, Adam Beardsley, Gianni Bernardi, Tashalee Billings, Judd Bowman, Richard Bradley, Phillip Bull, Jacob Burba, Steven Carey, Christopher Carilli, Carina Cheng, David DeBoer, Matthew Dexter, Eloy de Lera Acedo, Joshua Dillon, John Ely, Aaron Ewall-Wice, Nicolas Fagnoni, Anastasia Fialkov , et al. (59 additional authors not shown)

    Abstract: Recently, the Hydrogen Epoch of Reionization Array (HERA) collaboration has produced the experiment's first upper limits on the power spectrum of 21-cm fluctuations at z~8 and 10. Here, we use several independent theoretical models to infer constraints on the intergalactic medium (IGM) and galaxies during the epoch of reionization (EoR) from these limits. We find that the IGM must have been heated… ▽ More

    Submitted 20 December, 2022; v1 submitted 16 August, 2021; originally announced August 2021.

    Comments: 40 pages, 19 figures, accepted to ApJ

  41. arXiv:2108.07258  [pdf, other

    cs.LG cs.AI cs.CY

    On the Opportunities and Risks of Foundation Models

    Authors: Rishi Bommasani, Drew A. Hudson, Ehsan Adeli, Russ Altman, Simran Arora, Sydney von Arx, Michael S. Bernstein, Jeannette Bohg, Antoine Bosselut, Emma Brunskill, Erik Brynjolfsson, Shyamal Buch, Dallas Card, Rodrigo Castellon, Niladri Chatterji, Annie Chen, Kathleen Creel, Jared Quincy Davis, Dora Demszky, Chris Donahue, Moussa Doumbouya, Esin Durmus, Stefano Ermon, John Etchemendy, Kawin Ethayarajh , et al. (89 additional authors not shown)

    Abstract: AI is undergoing a paradigm shift with the rise of models (e.g., BERT, DALL-E, GPT-3) that are trained on broad data at scale and are adaptable to a wide range of downstream tasks. We call these models foundation models to underscore their critically central yet incomplete character. This report provides a thorough account of the opportunities and risks of foundation models, ranging from their cap… ▽ More

    Submitted 12 July, 2022; v1 submitted 16 August, 2021; originally announced August 2021.

    Comments: Authored by the Center for Research on Foundation Models (CRFM) at the Stanford Institute for Human-Centered Artificial Intelligence (HAI). Report page with citation guidelines: https://crfm.stanford.edu/report.html

  42. arXiv:2108.05572  [pdf, ps, other

    astro-ph.HE astro-ph.IM

    Hybrid cosmic ray measurements using the IceAct telescopes in coincidence with the IceCube and IceTop detectors

    Authors: Larissa Paul, Matthias Plum, Merlin Schaufel, Thomas Bretz, Giang Do, John W. Hewitt, Frank Maslowski, Florian Rehbein, Johannes Schäfer, Adrian Zink

    Abstract: IceAct is a proposed surface array of compact (50 cm diameter) and cost-effective Imaging Air Cherenkov Telescopes installed at the site of the IceCube Neutrino Observatory at the geographic South Pole. Since January 2019, two IceAct telescope demonstrators, featuring 61 silicon pho- tomultiplier (SiPM) pixels have been taking data in the center of the IceTop surface array during the austral winte… ▽ More

    Submitted 12 August, 2021; originally announced August 2021.

    Comments: Presented at the 37th International Cosmic Ray Conference (ICRC 2021). See arXiv:2107.06966 for all IceCube contributions

    Report number: PoS-ICRC2021-276

  43. arXiv:2108.02263  [pdf, other

    astro-ph.CO astro-ph.GA

    First Results from HERA Phase I: Upper Limits on the Epoch of Reionization 21 cm Power Spectrum

    Authors: The HERA Collaboration, Zara Abdurashidova, James E. Aguirre, Paul Alexander, Zaki S. Ali, Yanga Balfour, Adam P. Beardsley, Gianni Bernardi, Tashalee S. Billings, Judd D. Bowman, Richard F. Bradley, Philip Bull, Jacob Burba, Steve Carey, Chris L. Carilli, Carina Cheng, David R. DeBoer, Matt Dexter, Eloy de Lera Acedo, Taylor Dibblee-Barkman, Joshua S. Dillon, John Ely, Aaron Ewall-Wice, Nicolas Fagnoni, Randall Fritz , et al. (52 additional authors not shown)

    Abstract: We report upper-limits on the Epoch of Reionization (EoR) 21 cm power spectrum at redshifts 7.9 and 10.4 with 18 nights of data ($\sim36$ hours of integration) from Phase I of the Hydrogen Epoch of Reionization Array (HERA). The Phase I data show evidence for systematics that can be largely suppressed with systematic models down to a dynamic range of $\sim10^9$ with respect to the peak foreground… ▽ More

    Submitted 4 August, 2021; originally announced August 2021.

    Comments: Accepted to ApJ. https://reionization.org/science/public-data-release-1/

  44. arXiv:2108.00046  [pdf, other

    math.NA

    On the finite element approximation of a semicoercive Stokes variational inequality arising in glaciology

    Authors: Gonzalo G. de Diego, Patrick E. Farrell, Ian J. Hewitt

    Abstract: Stokes variational inequalities arise in the formulation of glaciological problems involving contact. We consider the problem of a two-dimensional marine ice sheet with a grounding line, although the analysis presented here is extendable to other contact problems in glaciology, such as that of subglacial cavitation. The analysis of this problem and its discretisation is complicated by the nonlinea… ▽ More

    Submitted 11 October, 2022; v1 submitted 30 July, 2021; originally announced August 2021.

    MSC Class: 65N12; 65N15; 65N30; 86A40

  45. Catalog of Long-Term Transient Sources in the First 10 Years of Fermi-LAT Data

    Authors: L. Baldini, J. Ballet, D. Bastieri, J. Becerra Gonzalez, R. Bellazzini, A. Berretta, E. Bissaldi, R. D. Blandford, E. D. Bloom, R. Bonino, E. Bottacini, P. Bruel, S. Buson, R. A. Cameron, P. A. Caraveo, E. Cavazzuti, S. Chen, G. Chiaro, D. Ciangottini, S. Ciprini, P. Cristarella Orestano, M. Crnogorcevic, S. Cutini, F. D'Ammando, P. de la Torre Luque , et al. (90 additional authors not shown)

    Abstract: We present the first Fermi Large Area Telescope (LAT) catalog of long-term $γ$-ray transient sources (1FLT). This comprises sources that were detected on monthly time intervals during the first decade of Fermi-LAT operations. The monthly time scale allows us to identify transient and variable sources that were not yet reported in other Fermi-LAT catalogs. The monthly datasets were analyzed using a… ▽ More

    Submitted 31 May, 2021; originally announced June 2021.

    Comments: 41 pages, 17 figures, 7 tables; Accepted by ApJS on 24 May 2021; Contact Authors: I. Mereu, S. Cutini, E. Cavazzuti, G. Tosti

  46. arXiv:2104.12268  [pdf, ps, other

    math.CO

    The DP Color Function of Joins and Vertex-Gluings of Graphs

    Authors: Jack Becker, Jade Hewitt, Hemanshu Kaul, Michael Maxfield, Jeffrey A. Mudrock, David Spivey, Seth Thomason, Tim Wagstrom

    Abstract: DP-coloring (also called correspondence coloring) is a generalization of list coloring that has been widely studied in recent years after its introduction by Dvořák and Postle in 2015. As the analogue of the chromatic polynomial $P(G,m)$, the DP color function of a graph $G$, denoted $P_{DP}(G,m)$, counts the minimum number of DP-colorings over all possible $m$-fold covers. Chromatic polynomials f… ▽ More

    Submitted 1 July, 2022; v1 submitted 25 April, 2021; originally announced April 2021.

    Comments: 26 pages, 1 figure

    MSC Class: 05C15; 05C30; 05C69

  47. arXiv:2104.12240  [pdf, other

    astro-ph.CO astro-ph.IM

    Effects of model incompleteness on the drift-scan calibration of radio telescopes

    Authors: Bharat K. Gehlot, Daniel C. Jacobs, Judd D. Bowman, Nivedita Mahesh, Steven G. Murray, Matthew Kolopanis, Adam P. Beardsley, Zara Abdurashidova, James E. Aguirre, Paul Alexander, Zaki S. Ali, Yanga Balfour, Gianni Bernardi, Tashalee S. Billings, Richard F. Bradley, Phil Bull, Jacob Burba, Steve Carey, Chris L. Carilli, Carina Cheng, David R. DeBoer, Matt Dexter, Eloy de Lera Acedo, Joshua S. Dillon, John Ely , et al. (54 additional authors not shown)

    Abstract: Precision calibration poses challenges to experiments probing the redshifted 21-cm signal of neutral hydrogen from the Cosmic Dawn and Epoch of Reionization (z~30-6). In both interferometric and global signal experiments, systematic calibration is the leading source of error. Though many aspects of calibration have been studied, the overlap between the two types of instruments has received less at… ▽ More

    Submitted 15 July, 2021; v1 submitted 25 April, 2021; originally announced April 2021.

    Comments: 16 pages, 13 figures, 1 table; accepted for publication in MNRAS main journal

  48. arXiv:2104.10115  [pdf, other

    physics.flu-dyn cond-mat.soft

    Droplet trapping in bendotaxis caused by contact angle hysteresis

    Authors: Alexander T. Bradley, Ian J. Hewitt, Dominic Vella

    Abstract: Passive droplet transport mechanisms, in which continuous external energy input is not required for motion, have received significant attention in recent years. Experimental studies of such mechanisms often ignore, or use careful treatments to minimize, contact angle hysteresis, which can impede droplet motion, or even arrest it completely. Here, we consider the effect of contact angle hysteresis… ▽ More

    Submitted 6 January, 2022; v1 submitted 20 April, 2021; originally announced April 2021.

    Journal ref: Phys. Rev. Fluids 6 114003 (2021)

  49. arXiv:2104.09635  [pdf, other

    cs.CL

    Refining Targeted Syntactic Evaluation of Language Models

    Authors: Benjamin Newman, Kai-Siang Ang, Julia Gong, John Hewitt

    Abstract: Targeted syntactic evaluation of subject-verb number agreement in English (TSE) evaluates language models' syntactic knowledge using hand-crafted minimal pairs of sentences that differ only in the main verb's conjugation. The method evaluates whether language models rate each grammatical sentence as more likely than its ungrammatical counterpart. We identify two distinct goals for TSE. First, eval… ▽ More

    Submitted 19 April, 2021; originally announced April 2021.

    Comments: 14 pages, 5 figures, 3 tables. To appear at NAACL 2021

    ACM Class: I.2.7

  50. Validation of the HERA Phase I Epoch of Reionization 21 cm Power Spectrum Software Pipeline

    Authors: James E. Aguirre, Steven G. Murray, Robert Pascua, Zachary E. Martinot, Jacob Burba, Joshua S. Dillon, Daniel C. Jacobs, Nicholas S. Kern, Piyanat Kittiwisit, Matthew Kolopanis, Adam Lanman, Adrian Liu, Lily Whitler, Zara Abdurashidova, Paul Alexander, Zaki S. Ali, Yanga Balfour, Adam P. Beardsley, Gianni Bernardi, Tashalee S. Billings, Judd D. Bowman, Richard F. Bradley, Philip Bull, Steve Carey, Chris L. Carilli , et al. (51 additional authors not shown)

    Abstract: We describe the validation of the HERA Phase I software pipeline by a series of modular tests, building up to an end-to-end simulation. The philosophy of this approach is to validate the software and algorithms used in the Phase I upper limit analysis on wholly synthetic data satisfying the assumptions of that analysis, not addressing whether the actual data meet these assumptions. We discuss the… ▽ More

    Submitted 19 April, 2021; originally announced April 2021.

    Comments: 32 pages, 20 figures. Submitted to the Astrophysical Journal