Skip to main content

Showing 1–50 of 126 results for author: McCarthy, D

  1. arXiv:2404.17678  [pdf, ps, other

    math.NT math.CA

    Splitting Hypergeometric Functions over Roots of Unity

    Authors: Dermot McCarthy, Mohit Tripathi

    Abstract: We examine hypergeometric functions in the finite field, p-adic and classical settings. In each setting, we prove a formula which splits the hypergeometric function into a sum of lower order functions whose arguments differ by roots of unity. We provide multiple applications of these results, including new reduction and summation formulas for finite field hypergeometric functions, along with class… ▽ More

    Submitted 1 July, 2024; v1 submitted 26 April, 2024; originally announced April 2024.

    Comments: Updated proof of Theorem 2.7

    MSC Class: 11T24; 11S80; 33E50; 33C20; 11F11; 11G05

  2. arXiv:2404.02127  [pdf, other

    cs.CL cs.AI cs.LG

    FLawN-T5: An Empirical Examination of Effective Instruction-Tuning Data Mixtures for Legal Reasoning

    Authors: Joel Niklaus, Lucia Zheng, Arya D. McCarthy, Christopher Hahn, Brian M. Rosen, Peter Henderson, Daniel E. Ho, Garrett Honke, Percy Liang, Christopher Manning

    Abstract: Instruction tuning is an important step in making language models useful for direct user interaction. However, many legal tasks remain out of reach for most open LLMs and there do not yet exist any large scale instruction datasets for the domain. This critically limits research in this application area. In this work, we curate LawInstruct, a large legal instruction dataset, covering 17 jurisdictio… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    MSC Class: 68T50 ACM Class: I.2

  3. arXiv:2403.00908  [pdf, other

    astro-ph.EP astro-ph.SR

    JWST/NIRCam Imaging of Young Stellar Objects III: Detailed Imaging of the Nebular Environment Around the HL Tau Disk

    Authors: Camryn Mullin, Ruobing Dong, Jarron Leisenring, Gabriele Cugno, Thomas Greene, Doug Johnstone, Michael R. Meyer, Kevin R. Wagner, Schuyler G. Wolff, Martha Boyer, Scott Horner, Klaus Hodapp, Don McCarthy, George Rieke, Marcia Rieke, Erick Young

    Abstract: As part of the James Webb Space Telescope (JWST) Guaranteed Time Observation (GTO) program "Direct Imaging of YSOs" (program ID 1179), we use JWST NIRCam's direct imaging mode in F187N, F200W, F405N, and F410M to perform high contrast observations of the circumstellar structures surrounding the protostar HL Tau. The data reveal the known stellar envelope, outflow cavity, and streamers, but do not… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 13 pages, 6 figures, 2 tables, accepted to AAS Astronomical Journal

  4. arXiv:2402.09634  [pdf, other

    physics.ins-det

    Stimulated Secondary Emission of Single Photon Avalanche Diodes

    Authors: Kurtis Raymond, Fabrice Retière, Harry Lewis, Andrea Capra, Duncan McCarthy, Austin de St Croix, Giacomo Gallina, Joe McLaughlin, Juliette Martin, Nicolas Massacret, Paolo Agnes, Ryan Underwood, Seraphim Koulosousas, Peter Margetak

    Abstract: Large-area next-generation physics experiments rely on using Silicon Photo-Multiplier (SiPM) devices to detect single photons, which trigger charge avalanches. The noise mechanism of external cross-talk occurs when secondary photons produced during a charge avalanche escape from an SiPM and trigger other devices within a detector system. This work presents measured spectra of the secondary photons… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: 15 pages, 7 figures

  5. arXiv:2401.02834  [pdf, other

    astro-ph.EP astro-ph.SR

    JWST/NIRCam Imaging of Young Stellar Objects. II. Deep Constraints on Giant Planets and a Planet Candidate Outside of the Spiral Disk Around SAO 206462

    Authors: Gabriele Cugno, Jarron Leisenring, Kevin R. Wagner, Camryn Mullin, Roubing Dong, Thomas Greene, Doug Johnstone, Michael R. Meyer, Schuyler G. Wolff, Charles Beichman, Martha Boyer, Scott Horner, Klaus Hodapp, Doug Kelly, Don McCarthy, Thomas Roellig, George Rieke, Marcia Rieke, John Stansberry, Erick Young

    Abstract: We present JWST/NIRCam F187N, F200W, F405N and F410M direct imaging data of the disk surrounding SAO 206462. Previous images show a very structured disk, with a pair of spiral arms thought to be launched by one or more external perturbers. The spiral features are visible in three of the four filters, with the non-detection in F410M due to the large detector saturation radius. We detect with a sign… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: 18 pages, 8 figures, 3 tables

  6. arXiv:2401.02830  [pdf, other

    astro-ph.EP astro-ph.SR

    JWST/NIRCam Imaging of Young Stellar Objects. I. Constraints on Planets Exterior to The Spiral Disk Around MWC 758

    Authors: Kevin Wagner, Jarron Leisenring, Gabriele Cugno, Camryn Mullin, Ruobing Dong, Schuyler G. Wolff, Thomas Greene, Doug Johnstone, Michael R. Meyer, Charles Beichman, Martha Boyer, Scott Horner, Klaus Hodapp, Doug Kelly, Don McCarthy, Tom Roellig, George Rieke, Marcia Rieke, Michael Sitko, John Stansberry, Erick Young

    Abstract: MWC 758 is a young star hosting a spiral protoplanetary disk. The spirals are likely companion-driven, and two previously-identified candidate companions have been identified -- one at the end the Southern spiral arm at ~0.6 arcsec, and one interior to the gap at ~0.1 arcsec. With JWST/NIRCam, we provide new images of the disk and constraints on planets exterior to ~1". We detect the two-armed spi… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: Accepted for publication in AJ

  7. arXiv:2312.00092  [pdf, other

    cs.CV

    Mixture of Gaussian-distributed Prototypes with Generative Modelling for Interpretable and Trustworthy Image Recognition

    Authors: Chong Wang, Yuanhong Chen, Fengbei Liu, Yuyuan Liu, Davis James McCarthy, Helen Frazer, Gustavo Carneiro

    Abstract: Prototypical-part methods, e.g., ProtoPNet, enhance interpretability in image recognition by linking predictions to training prototypes, thereby offering intuitive insights into their decision-making. Existing methods, which rely on a point-based learning of prototypes, typically face two critical issues: 1) the learned prototypes have limited representation power and are not suitable to detect Ou… ▽ More

    Submitted 5 June, 2024; v1 submitted 30 November, 2023; originally announced December 2023.

  8. arXiv:2311.02135  [pdf, ps, other

    math.CO math.NT

    Transitive subtournaments of $k$-th power Paley digraphs and improved lower bounds for Ramsey numbers

    Authors: Dermot McCarthy, Mason Springfield

    Abstract: Let $k \geq 2$ be an even integer. Let $q$ be a prime power such that $q \equiv k+1 \pmod {2k}$. We define the $\textit{k-th power Paley digraph}$ of order $q$, $G_k(q)$, as the graph with vertex set $\mathbb{F}_q$ where $a \to b$ is an edge if and only if $b-a$ is a $k$-th power residue. This generalizes the (k=2) Paley Tournament. We provide a formula, in terms of finite field hypergeometric fun… ▽ More

    Submitted 8 November, 2023; v1 submitted 3 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: text overlap with arXiv:2006.14716

  9. arXiv:2310.13678  [pdf, other

    cs.CL cs.AI cs.LG

    Long-Form Speech Translation through Segmentation with Finite-State Decoding Constraints on Large Language Models

    Authors: Arya D. McCarthy, Hao Zhang, Shankar Kumar, Felix Stahlberg, Ke Wu

    Abstract: One challenge in speech translation is that plenty of spoken content is long-form, but short units are necessary for obtaining high-quality translations. To address this mismatch, we adapt large language models (LLMs) to split long ASR transcripts into segments that can be independently translated so as to maximize the overall translation quality. We overcome the tendency of hallucination in LLMs… ▽ More

    Submitted 23 October, 2023; v1 submitted 20 October, 2023; originally announced October 2023.

    Comments: accepted to the Findings of EMNLP 2023. arXiv admin note: text overlap with arXiv:2212.09895

  10. arXiv:2308.08084  [pdf

    physics.chem-ph

    Architecture Optimization Dramatically Improves Reverse Bias Stability in Perovskite Solar Cells: A Role of Polymer Hole Transport Layers

    Authors: Fangyuan Jiang, Yangwei Shi, Tanka R. Rana, Daniel Morales, Isaac Gould, Declan P. McCarthy, Joel Smith, Grey Christoforo, Hannah Contreras, Stephen Barlow, Aditya D. Mohite, Henry Snaith, Seth R. Marder, J. Devin MacKenzie, Michael D. McGehee, David S. Ginger

    Abstract: We report that device architecture engineering has a substantial impact on the reverse bias instability that has been reported as a critical issue in commercializing perovskite solar cells. We demonstrate breakdown voltages exceeding -15 V in typical pin structured perovskite solar cells via two steps: i) using polymer hole transporting materials; ii) using a more electrochemically stable gold ele… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

  11. arXiv:2308.01522  [pdf, ps, other

    math.NT

    The number of $\mathbb{F}_q$-points on diagonal hypersurfaces with monomial deformation

    Authors: Dermot McCarthy

    Abstract: We consider the family of diagonal hypersurfaces with monomial deformation $$D_{d, λ, h}: x_1^d + x_2^d \dots + x_n^d - d λ\, x_1^{h_1} x_2^{h_2} \dots x_n^{h_n}=0$$ where $d = h_1+h_2 +\dots + h_n$ with $\gcd(h_1, h_2, \dots h_n)=1$. We first provide a formula for the number of $\mathbb{F}_{q}$-points on $D_{d, λ, h}$ in terms of Gauss and Jacobi sums. This generalizes a result of Koblitz, which… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

  12. arXiv:2306.03042  [pdf, other

    cs.LG cs.AI

    SERT: A Transfomer Based Model for Spatio-Temporal Sensor Data with Missing Values for Environmental Monitoring

    Authors: Amin Shoari Nejad, Rocío Alaiz-Rodríguez, Gerard D. McCarthy, Brian Kelleher, Anthony Grey, Andrew Parnell

    Abstract: Environmental monitoring is crucial to our understanding of climate change, biodiversity loss and pollution. The availability of large-scale spatio-temporal data from sources such as sensors and satellites allows us to develop sophisticated models for forecasting and understanding key drivers. However, the data collected from sensors often contain missing values due to faulty equipment or maintena… ▽ More

    Submitted 9 June, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: 11 pages, 7 figures

  13. arXiv:2305.12068  [pdf, other

    eess.IV cs.AI cs.CV

    Technical outlier detection via convolutional variational autoencoder for the ADMANI breast mammogram dataset

    Authors: Hui Li, Carlos A. Pena Solorzano, Susan Wei, Davis J. McCarthy

    Abstract: The ADMANI datasets (annotated digital mammograms and associated non-image datasets) from the Transforming Breast Cancer Screening with AI programme (BRAIx) run by BreastScreen Victoria in Australia are multi-centre, large scale, clinically curated, real-world databases. The datasets are expected to aid in the development of clinically relevant Artificial Intelligence (AI) algorithms for breast ca… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

  14. The James Webb Space Telescope Mission

    Authors: Jonathan P. Gardner, John C. Mather, Randy Abbott, James S. Abell, Mark Abernathy, Faith E. Abney, John G. Abraham, Roberto Abraham, Yasin M. Abul-Huda, Scott Acton, Cynthia K. Adams, Evan Adams, David S. Adler, Maarten Adriaensen, Jonathan Albert Aguilar, Mansoor Ahmed, Nasif S. Ahmed, Tanjira Ahmed, Rüdeger Albat, Loïc Albert, Stacey Alberts, David Aldridge, Mary Marsha Allen, Shaune S. Allen, Martin Altenburg , et al. (983 additional authors not shown)

    Abstract: Twenty-six years ago a small committee report, building on earlier studies, expounded a compelling and poetic vision for the future of astronomy, calling for an infrared-optimized space telescope with an aperture of at least $4m$. With the support of their governments in the US, Europe, and Canada, 20,000 people realized that vision as the $6.5m$ James Webb Space Telescope. A generation of astrono… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

    Comments: Accepted by PASP for the special issue on The James Webb Space Telescope Overview, 29 pages, 4 figures

  15. arXiv:2302.07912  [pdf, other

    cs.CL

    Meeting the Needs of Low-Resource Languages: The Value of Automatic Alignments via Pretrained Models

    Authors: Abteen Ebrahimi, Arya D. McCarthy, Arturo Oncevay, Luis Chiruzzo, John E. Ortega, Gustavo A. Giménez-Lugo, Rolando Coto-Solano, Katharina Kann

    Abstract: Large multilingual models have inspired a new class of word alignment methods, which work well for the model's pretraining languages. However, the languages most in need of automatic alignment are low-resource and, thus, not typically included in the pretraining data. In this work, we ask: How do modern aligners perform on unseen languages, and are they better than traditional methods? We contribu… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

    Comments: EACL 2023

  16. arXiv:2301.13418  [pdf, other

    cs.CV cs.AI cs.LG

    BRAIxDet: Learning to Detect Malignant Breast Lesion with Incomplete Annotations

    Authors: Yuanhong Chen, Yuyuan Liu, Chong Wang, Michael Elliott, Chun Fung Kwok, Carlos Pena-Solorzano, Yu Tian, Fengbei Liu, Helen Frazer, Davis J. McCarthy, Gustavo Carneiro

    Abstract: Methods to detect malignant lesions from screening mammograms are usually trained with fully annotated datasets, where images are labelled with the localisation and classification of cancerous lesions. However, real-world screening mammogram datasets commonly have a subset that is fully annotated and another subset that is weakly annotated with just the global classification (i.e., without lesion… ▽ More

    Submitted 2 April, 2024; v1 submitted 31 January, 2023; originally announced January 2023.

    Comments: Under Review

  17. arXiv:2301.04011  [pdf, other

    cs.CV

    Learning Support and Trivial Prototypes for Interpretable Image Classification

    Authors: Chong Wang, Yuyuan Liu, Yuanhong Chen, Fengbei Liu, Yu Tian, Davis J. McCarthy, Helen Frazer, Gustavo Carneiro

    Abstract: Prototypical part network (ProtoPNet) methods have been designed to achieve interpretable classification by associating predictions with a set of training prototypes, which we refer to as trivial prototypes because they are trained to lie far from the classification boundary in the feature space. Note that it is possible to make an analogy between ProtoPNet and support vector machine (SVM) given t… ▽ More

    Submitted 22 October, 2023; v1 submitted 8 January, 2023; originally announced January 2023.

    Comments: ICCV 2023, Code: https://github.com/cwangrun/ST-ProtoPNet

  18. arXiv:2212.09895  [pdf, other

    cs.CL

    Improved Long-Form Spoken Language Translation with Large Language Models

    Authors: Arya D. McCarthy, Hao Zhang, Shankar Kumar, Felix Stahlberg, Axel H. Ng

    Abstract: A challenge in spoken language translation is that plenty of spoken content is long-form, but short units are necessary for obtaining high-quality translations. To address this mismatch, we fine-tune a general-purpose, large language model to split long ASR transcripts into segments that can be independently translated so as to maximize the overall translation quality. We compare to several segmen… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.

  19. arXiv:2211.16858  [pdf, other

    cs.CL

    A Major Obstacle for NLP Research: Let's Talk about Time Allocation!

    Authors: Katharina Kann, Shiran Dudy, Arya D. McCarthy

    Abstract: The field of natural language processing (NLP) has grown over the last few years: conferences have become larger, we have published an incredible amount of papers, and state-of-the-art research has been implemented in a large variety of customer-facing products. However, this paper argues that we have been less successful than we should have been and reflects on where and how the field fails to ta… ▽ More

    Submitted 30 November, 2022; originally announced November 2022.

    Comments: To appear at EMNLP 2022

  20. arXiv:2209.15100  [pdf

    cond-mat.mtrl-sci physics.app-ph

    Ethylenediamine Addition Improves Performance and Suppresses Phase Instabilities in Mixed-Halide Perovskites

    Authors: Margherita Taddei, Joel A. Smith, Benjamin M. Gallant, Suer Zhou, Robert J. E. Westbrook, Yangwei Shi, Jian Wang, James N. Drysdale, Declan P. McCarthy, Stephen Barlow, Seth R. Marder, Henry J. Snaith, David S. Ginger

    Abstract: We show that adding ethylenediamine (EDA) to perovskite precursor solution improves the photovoltaic device performance and material stability of high-bromide-content, methylammonium-free, formamidinium cesium lead halide perovskites FA1-xCsxPb(I1-yBry)3 which are currently of interest for perovskite-on-Si tandem solar cells. Using spectroscopy and hyperspectral microscopy, we show that the additi… ▽ More

    Submitted 29 September, 2022; originally announced September 2022.

  21. Knowledge Distillation to Ensemble Global and Interpretable Prototype-Based Mammogram Classification Models

    Authors: Chong Wang, Yuanhong Chen, Yuyuan Liu, Yu Tian, Fengbei Liu, Davis J. McCarthy, Michael Elliott, Helen Frazer, Gustavo Carneiro

    Abstract: State-of-the-art (SOTA) deep learning mammogram classifiers, trained with weakly-labelled images, often rely on global models that produce predictions with limited interpretability, which is a key barrier to their successful translation into clinical practice. On the other hand, prototype-based models improve interpretability by associating predictions with training image prototypes, but they are… ▽ More

    Submitted 8 January, 2023; v1 submitted 26 September, 2022; originally announced September 2022.

    Comments: MICCAI 2022

  22. arXiv:2209.10478  [pdf, other

    cs.CV

    Multi-view Local Co-occurrence and Global Consistency Learning Improve Mammogram Classification Generalisation

    Authors: Yuanhong Chen, Hu Wang, Chong Wang, Yu Tian, Fengbei Liu, Michael Elliott, Davis J. McCarthy, Helen Frazer, Gustavo Carneiro

    Abstract: When analysing screening mammograms, radiologists can naturally process information across two ipsilateral views of each breast, namely the cranio-caudal (CC) and mediolateral-oblique (MLO) views. These multiple related images provide complementary diagnostic information and can improve the radiologist's classification accuracy. Unfortunately, most existing deep learning systems, trained with glob… ▽ More

    Submitted 21 September, 2022; originally announced September 2022.

    Comments: MICCAI 2022

  23. arXiv:2209.06880  [pdf, other

    stat.AP

    Vector Time Series Modelling of Turbidity in Dublin Bay

    Authors: Amin Shoari Nejad, Gerard D. McCarthy, Brian Kelleher, Anthony Grey, Andrew Parnell

    Abstract: Turbidity is commonly monitored as an important water quality index. Human activities, such as dredging and dumping operations, can disrupt turbidity levels and should be monitored and analyzed for possible effects. In this paper, we model the variations of turbidity in Dublin Bay over space and time to investigate the effects of dumping and dredging while controlling for the effect of wind speed… ▽ More

    Submitted 14 September, 2022; originally announced September 2022.

    Comments: 11 pages, 9 figures

  24. arXiv:2205.03608  [pdf, other

    cs.CL

    UniMorph 4.0: Universal Morphology

    Authors: Khuyagbaatar Batsuren, Omer Goldman, Salam Khalifa, Nizar Habash, Witold Kieraś, Gábor Bella, Brian Leonard, Garrett Nicolai, Kyle Gorman, Yustinus Ghanggo Ate, Maria Ryskina, Sabrina J. Mielke, Elena Budianskaya, Charbel El-Khaissi, Tiago Pimentel, Michael Gasser, William Lane, Mohit Raj, Matt Coler, Jaime Rafael Montoya Samame, Delio Siticonatzi Camaiteri, Benoît Sagot, Esaú Zumaeta Rojas, Didier López Francis, Arturo Oncevay , et al. (71 additional authors not shown)

    Abstract: The Universal Morphology (UniMorph) project is a collaborative effort providing broad-coverage instantiated normalized morphological inflection tables for hundreds of diverse world languages. The project comprises two major thrusts: a language-independent feature schema for rich morphological annotation and a type-level resource of annotated data in diverse languages realizing that schema. This pa… ▽ More

    Submitted 19 June, 2022; v1 submitted 7 May, 2022; originally announced May 2022.

    Comments: LREC 2022; The first two authors made equal contributions

  25. Carbon monoxide emission lines reveal an inverted atmosphere in the ultra hot Jupiter WASP-33 b consistent with an eastward hot spot

    Authors: Lennart van Sluijs, Jayne L. Birkby, Joshua Lothringer, Elspeth K. H. Lee, Ian J. M. Crossfield, Vivien Parmentier, Matteo Brogi, Craig Kulesa, Don McCarthy, David Charbonneau

    Abstract: We report the first detection of CO emission at high spectral resolution in the day-side infrared thermal spectrum of an exoplanet. These emission lines, found in the atmosphere of the transiting ultra hot Jupiter (UHJ) WASP-33 b, provide unambiguous evidence of its thermal inversion. Using spectra from the MMT Exoplanet Atmosphere Survey (MEASURE, $R\sim15,000$), covering pre- and post-eclipse ph… ▽ More

    Submitted 26 April, 2023; v1 submitted 24 March, 2022; originally announced March 2022.

    Comments: 24 pages, 21 figures, accepted to MNRAS

    Journal ref: MNRAS, Volume 522, Issue 2, June 2023, Pages, 2145-2170

  26. arXiv:2203.08909  [pdf, other

    cs.CL

    Morphological Processing of Low-Resource Languages: Where We Are and What's Next

    Authors: Adam Wiemerslage, Miikka Silfverberg, Changbing Yang, Arya D. McCarthy, Garrett Nicolai, Eliana Colunga, Katharina Kann

    Abstract: Automatic morphological processing can aid downstream natural language processing applications, especially for low-resource languages, and assist language documentation efforts for endangered languages. Having long been multilingual, the field of computational morphology is increasingly moving towards approaches suitable for languages with minimal or no annotated resources. First, we survey recent… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: Findings of ACL 2022

  27. arXiv:2203.08850  [pdf, other

    cs.CL

    Pre-Trained Multilingual Sequence-to-Sequence Models: A Hope for Low-Resource Language Translation?

    Authors: En-Shiun Annie Lee, Sarubi Thillainathan, Shravan Nayak, Surangika Ranathunga, David Ifeoluwa Adelani, Ruisi Su, Arya D. McCarthy

    Abstract: What can pre-trained multilingual sequence-to-sequence models like mBART contribute to translating low-resource languages? We conduct a thorough empirical experiment in 10 languages to ascertain this, considering five factors: (1) the amount of fine-tuning data, (2) the noise in the fine-tuning data, (3) the amount of pre-training data in the model, (4) the impact of domain mismatch, and (5) langu… ▽ More

    Submitted 30 April, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

    Comments: Accepted to Findings of ACL 2022

  28. arXiv:2112.06733  [pdf, other

    cs.CL

    Measuring Context-Word Biases in Lexical Semantic Datasets

    Authors: Qianchu Liu, Diana McCarthy, Anna Korhonen

    Abstract: State-of-the-art pretrained contextualized models (PCM) eg. BERT use tasks such as WiC and WSD to evaluate their word-in-context representations. This inherently assumes that performance in these tasks reflect how well a model represents the coupled word and context semantics. We question this assumption by presenting the first quantitative analysis on the context-word interaction being tested in… ▽ More

    Submitted 8 December, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

    Comments: EMNLP 2022 main conference long paper

  29. arXiv:2104.08639  [pdf, other

    cs.CL

    AM2iCo: Evaluating Word Meaning in Context across Low-Resource Languages with Adversarial Examples

    Authors: Qianchu Liu, Edoardo M. Ponti, Diana McCarthy, Ivan Vulić, Anna Korhonen

    Abstract: Capturing word meaning in context and distinguishing between correspondences and variations across languages is key to building successful multilingual and cross-lingual text representation models. However, existing multilingual evaluation datasets that evaluate lexical semantics "in-context" have various limitations. In particular, 1) their language coverage is restricted to high-resource languag… ▽ More

    Submitted 19 September, 2021; v1 submitted 17 April, 2021; originally announced April 2021.

    Comments: EMNLP 2021 long paper

  30. arXiv:2102.06898  [pdf, ps, other

    econ.TH math.FA

    Expected utility theory on mixture spaces without the completeness axiom

    Authors: David McCarthy, Kalle Mikkola, Teruji Thomas

    Abstract: A mixture preorder is a preorder on a mixture space (such as a convex set) that is compatible with the mixing operation. In decision theoretic terms, it satisfies the central expected utility axiom of strong independence. We consider when a mixture preorder has a multi-representation that consists of real-valued, mixture-preserving functions. If it does, it must satisfy the mixture continuity axio… ▽ More

    Submitted 13 February, 2021; originally announced February 2021.

    Comments: 29 pages

    MSC Class: 06F20 (Primary) 46A20; 46A40; 46A55 (Secondary)

  31. arXiv:2101.10245  [pdf, other

    cs.HC

    AirWare: Utilizing Embedded Audio and Infrared Signals for In-Air Hand-Gesture Recognition

    Authors: Nibhrat Lohia, Raunak Mundada, Arya D. McCarthy, Eric C. Larson

    Abstract: We introduce AirWare, an in-air hand-gesture recognition system that uses the already embedded speaker and microphone in most electronic devices, together with embedded infrared proximity sensors. Gestures identified by AirWare are performed in the air above a touchscreen or a mobile phone. AirWare utilizes convolutional neural networks to classify a large vocabulary of hand gestures using multi-m… ▽ More

    Submitted 25 January, 2021; originally announced January 2021.

  32. arXiv:2101.06303  [pdf, ps, other

    math.NT

    Hypergeometric Functions over Finite Fields and Modular Forms: A Survey and New Conjectures

    Authors: Madeline Locus Dawsey, Dermot McCarthy

    Abstract: Hypergeometric functions over finite fields were introduced by Greene in the 1980s as a finite field analogue of classical hypergeometric series. These functions, and their generalizations, naturally lend themselves to, and have been widely used in, character sum evaluations and counting points on algebraic varieties. More interestingly, perhaps, are their links to Fourier coefficients of modular… ▽ More

    Submitted 15 January, 2021; originally announced January 2021.

    Journal ref: From Operator Theory to Orthogonal Polynomials, Combinatorics, and Number Theory. Operator Theory: Advances and Applications, Birkhauser (2021), 41-56

  33. arXiv:2012.09976  [pdf

    cs.LG

    Handling uncertainty using features from pathology: opportunities in primary care data for developing high risk cancer survival methods

    Authors: Goce Ristanoski, Jon Emery, Javiera Martinez-Gutierrez, Damien Mccarthy, Uwe Aickelin

    Abstract: More than 144 000 Australians were diagnosed with cancer in 2019. The majority will first present to their GP symptomatically, even for cancer for which screening programs exist. Diagnosing cancer in primary care is challenging due to the non-specific nature of cancer symptoms and its low prevalence. Understanding the epidemiology of cancer symptoms and patterns of presentation in patient's medica… ▽ More

    Submitted 17 December, 2020; originally announced December 2020.

    Comments: 14th Australasian Conference on Health Informatics and Knowledge Management HIKM 2021

  34. arXiv:2006.14716  [pdf, ps, other

    math.NT math.CO

    Generalized Paley graphs and their complete subgraphs of orders three and four

    Authors: Madeline Locus Dawsey, Dermot McCarthy

    Abstract: Let $k \geq 2$ be an integer. Let $q$ be a prime power such that $q \equiv 1 \pmod {k}$ if $q$ is even, or, $q \equiv 1 \pmod {2k}$ if $q$ is odd. The generalized Paley graph of order $q$, $G_k(q)$, is the graph with vertex set $\mathbb{F}_q$ where $ab$ is an edge if and only if ${a-b}$ is a $k$-th power residue. We provide a formula, in terms of finite field hypergeometric functions, for the numb… ▽ More

    Submitted 25 June, 2020; originally announced June 2020.

    MSC Class: Primary: 05C30; 11T24; Secondary: 05C55; 11F11

    Journal ref: Res. Math. Sci. 8: 18 (2021)

  35. arXiv:2005.00970  [pdf, other

    cs.CL

    Unsupervised Morphological Paradigm Completion

    Authors: Huiming Jin, Liwei Cai, Yihui Peng, Chen Xia, Arya D. McCarthy, Katharina Kann

    Abstract: We propose the task of unsupervised morphological paradigm completion. Given only raw text and a lemma list, the task consists of generating the morphological paradigms, i.e., all inflected forms, of the lemmas. From a natural language processing (NLP) perspective, this is a challenging unsupervised task, and high-performing systems have the potential to improve tools for low-resource languages or… ▽ More

    Submitted 20 May, 2020; v1 submitted 2 May, 2020; originally announced May 2020.

    Comments: Accepted by ACL 2020

  36. arXiv:2005.00626  [pdf, other

    cs.CL

    Predicting Declension Class from Form and Meaning

    Authors: Adina Williams, Tiago Pimentel, Arya D. McCarthy, Hagen Blix, Eleanor Chodroff, Ryan Cotterell

    Abstract: The noun lexica of many natural languages are divided into several declension classes with characteristic morphological properties. Class membership is far from deterministic, but the phonological form of a noun and/or its meaning can often provide imperfect clues. Here, we investigate the strength of those clues. More specifically, we operationalize this by measuring how much information, in bits… ▽ More

    Submitted 28 May, 2020; v1 submitted 1 May, 2020; originally announced May 2020.

    Comments: 14 pages, 2 figures, the is the camera-ready version accepted at the 2020 Annual Conference of the Association for Computational Linguistics (ACL 2020)

  37. arXiv:2002.12231  [pdf, other

    eess.AS cs.CL cs.SD

    SkinAugment: Auto-Encoding Speaker Conversions for Automatic Speech Translation

    Authors: Arya D. McCarthy, Liezl Puzon, Juan Pino

    Abstract: We propose autoencoding speaker conversion for training data augmentation in automatic speech translation. This technique directly transforms an audio sequence, resulting in audio synthesized to resemble another speaker's voice. Our method compares favorably to SpecAugment on English$\to$French and English$\to$Romanian automatic speech translation (AST) tasks as well as on a low-resource English a… ▽ More

    Submitted 27 February, 2020; originally announced February 2020.

    Comments: Accepted to ICASSP 2020

  38. arXiv:1911.00872  [pdf, ps, other

    econ.TH

    Aggregation for potentially infinite populations without continuity or completeness

    Authors: David McCarthy, Kalle Mikkola, Teruji Thomas

    Abstract: We present an abstract social aggregation theorem. Society, and each individual, has a preorder that may be interpreted as expressing values or beliefs. The preorders are allowed to violate both completeness and continuity, and the population is allowed to be infinite. The preorders are only assumed to be represented by functions with values in partially ordered vector spaces, and whose product ha… ▽ More

    Submitted 3 November, 2019; originally announced November 2019.

    Comments: 27 pages

  39. The SIGMORPHON 2019 Shared Task: Morphological Analysis in Context and Cross-Lingual Transfer for Inflection

    Authors: Arya D. McCarthy, Ekaterina Vylomova, Shijie Wu, Chaitanya Malaviya, Lawrence Wolf-Sonkin, Garrett Nicolai, Christo Kirov, Miikka Silfverberg, Sabrina J. Mielke, Jeffrey Heinz, Ryan Cotterell, Mans Hulden

    Abstract: The SIGMORPHON 2019 shared task on cross-lingual transfer and contextual analysis in morphology examined transfer learning of inflection between 100 language pairs, as well as contextual lemmatization and morphosyntactic description in 66 languages. The first task evolves past years' inflection tasks by examining transfer of morphological inflection knowledge from a high-resource language to a low… ▽ More

    Submitted 25 February, 2020; v1 submitted 24 October, 2019; originally announced October 2019.

    Comments: Presented at SIGMORPHON 2019

    Journal ref: Proceedings of the 16th Workshop on Computational Research in Phonetics, Phonology, and Morphology (2019) 229-244

  40. arXiv:1910.01531  [pdf, other

    cs.CL

    Modeling Color Terminology Across Thousands of Languages

    Authors: Arya D. McCarthy, Winston Wu, Aaron Mueller, Bill Watson, David Yarowsky

    Abstract: There is an extensive history of scholarship into what constitutes a "basic" color term, as well as a broadly attested acquisition sequence of basic color terms across many languages, as articulated in the seminal work of Berlin and Kay (1969). This paper employs a set of diverse measures on massively cross-linguistic data to operationalize and critique the Berlin and Kay color term hypotheses. Co… ▽ More

    Submitted 3 October, 2019; originally announced October 2019.

    Comments: Accepted for presentation at EMNLP-IJCNLP 2019

  41. arXiv:1909.09237  [pdf, other

    cs.CL

    Improved Variational Neural Machine Translation by Promoting Mutual Information

    Authors: Arya D. McCarthy, Xian Li, Jiatao Gu, Ning Dong

    Abstract: Posterior collapse plagues VAEs for text, especially for conditional text generation with strong autoregressive decoders. In this work, we address this problem in variational neural machine translation by explicitly promoting mutual information between the latent variables and the data. Our model extends the conditional variational autoencoder (CVAE) with two new ingredients: first, we propose a m… ▽ More

    Submitted 19 September, 2019; originally announced September 2019.

  42. arXiv:1909.06515  [pdf, other

    cs.CL cs.SD eess.AS

    Harnessing Indirect Training Data for End-to-End Automatic Speech Translation: Tricks of the Trade

    Authors: Juan Pino, Liezl Puzon, Jiatao Gu, Xutai Ma, Arya D. McCarthy, Deepak Gopinath

    Abstract: For automatic speech translation (AST), end-to-end approaches are outperformed by cascaded models that transcribe with automatic speech recognition (ASR), then translate with machine translation (MT). A major cause of the performance gap is that, while existing AST corpora are small, massive datasets exist for both the ASR and MT subsystems. In this work, we evaluate several data augmentation and… ▽ More

    Submitted 22 October, 2019; v1 submitted 13 September, 2019; originally announced September 2019.

    Comments: IWSLT 2019

  43. arXiv:1906.05906  [pdf, other

    cs.CL

    Meaning to Form: Measuring Systematicity as Information

    Authors: Tiago Pimentel, Arya D. McCarthy, Damián E. Blasi, Brian Roark, Ryan Cotterell

    Abstract: A longstanding debate in semiotics centers on the relationship between linguistic signs and their corresponding semantics: is there an arbitrary relationship between a word form and its meaning, or does some systematic phenomenon pervade? For instance, does the character bigram \textit{gl} have any systematic relationship to the meaning of words like \textit{glisten}, \textit{gleam} and \textit{gl… ▽ More

    Submitted 26 July, 2019; v1 submitted 13 June, 2019; originally announced June 2019.

    Comments: Accepted for publication at ACL 2019

  44. An Exact No Free Lunch Theorem for Community Detection

    Authors: Arya D. McCarthy, Tongfei Chen, Seth Ebner

    Abstract: A precondition for a No Free Lunch theorem is evaluation with a loss function which does not assume a priori superiority of some outputs over others. A previous result for community detection by Peel et al. (2017) relies on a mismatch between the loss function and the problem domain. The loss function computes an expectation over only a subset of the universe of possible outputs; thus, it is only… ▽ More

    Submitted 24 March, 2019; originally announced March 2019.

    Journal ref: Complex Networks and Their Applications VIII. COMPLEX NETWORKS 2019. Studies in Computational Intelligence, vol 881

  45. arXiv:1901.01354  [pdf, other

    cs.SI physics.soc-ph

    Metrics matter in community detection

    Authors: Arya D. McCarthy, Tongfei Chen, Rachel Rudinger, David W. Matula

    Abstract: We present a critical evaluation of normalized mutual information (NMI) as an evaluation metric for community detection. NMI exaggerates the leximin method's performance on weak communities: Does leximin, in finding the trivial singletons clustering, truly outperform eight other community detection methods? Three NMI improvements from the literature are AMI, rrNMI, and cNMI. We show equivalences u… ▽ More

    Submitted 4 January, 2019; originally announced January 2019.

    Journal ref: Complex Networks and Their Applications VIII. COMPLEX NETWORKS 2019. Studies in Computational Intelligence, vol 881

  46. arXiv:1810.11101  [pdf, other

    cs.CL

    UniMorph 2.0: Universal Morphology

    Authors: Christo Kirov, Ryan Cotterell, John Sylak-Glassman, Géraldine Walther, Ekaterina Vylomova, Patrick Xia, Manaal Faruqui, Sabrina J. Mielke, Arya D. McCarthy, Sandra Kübler, David Yarowsky, Jason Eisner, Mans Hulden

    Abstract: The Universal Morphology UniMorph project is a collaborative effort to improve how NLP handles complex morphology across the world's languages. The project releases annotated morphological data using a universal tagset, the UniMorph schema. Each inflected form is associated with a lemma, which typically carries its underlying lexical meaning, and a bundle of morphological features from our schema.… ▽ More

    Submitted 25 February, 2020; v1 submitted 25 October, 2018; originally announced October 2018.

    Comments: LREC 2018

  47. arXiv:1810.07125  [pdf, other

    cs.CL

    The CoNLL--SIGMORPHON 2018 Shared Task: Universal Morphological Reinflection

    Authors: Ryan Cotterell, Christo Kirov, John Sylak-Glassman, Géraldine Walther, Ekaterina Vylomova, Arya D. McCarthy, Katharina Kann, Sabrina J. Mielke, Garrett Nicolai, Miikka Silfverberg, David Yarowsky, Jason Eisner, Mans Hulden

    Abstract: The CoNLL--SIGMORPHON 2018 shared task on supervised learning of morphological generation featured data sets from 103 typologically diverse languages. Apart from extending the number of languages involved in earlier supervised tasks of generating inflected forms, this year the shared task also featured a new second task which asked participants to inflect words in sentential context, similar to a… ▽ More

    Submitted 25 February, 2020; v1 submitted 16 October, 2018; originally announced October 2018.

    Comments: CoNLL 2018. arXiv admin note: text overlap with arXiv:1706.09031

  48. Marrying Universal Dependencies and Universal Morphology

    Authors: Arya D. McCarthy, Miikka Silfverberg, Ryan Cotterell, Mans Hulden, David Yarowsky

    Abstract: The Universal Dependencies (UD) and Universal Morphology (UniMorph) projects each present schemata for annotating the morphosyntactic details of language. Each project also provides corpora of annotated text in many languages - UD at the token level and UniMorph at the type level. As each corpus is built by different annotators, language-specific decisions hinder the goal of universal schemata. Wi… ▽ More

    Submitted 15 October, 2018; originally announced October 2018.

    Comments: UDW18

    Journal ref: Proceedings of the Second Workshop on Universal Dependencies (2018) 91-101

  49. Freezing Subnetworks to Analyze Domain Adaptation in Neural Machine Translation

    Authors: Brian Thompson, Huda Khayrallah, Antonios Anastasopoulos, Arya D. McCarthy, Kevin Duh, Rebecca Marvin, Paul McNamee, Jeremy Gwinnup, Tim Anderson, Philipp Koehn

    Abstract: To better understand the effectiveness of continued training, we analyze the major components of a neural machine translation system (the encoder, decoder, and each embedding space) and consider each component's contribution to, and capacity for, domain adaptation. We find that freezing any single component during continued training has minimal impact on performance, and that performance is surpri… ▽ More

    Submitted 15 January, 2019; v1 submitted 13 September, 2018; originally announced September 2018.

    Comments: presented at WMT 2018. Please cite using the bib entry from here: http://www.statmt.org/wmt18/bib/WMT013.bib

    Journal ref: Proceedings of the Third Conference on Machine Translation: Research Papers (2018) 124-132

  50. arXiv:1807.03883  [pdf, ps, other

    math.NT

    Apéry-like numbers and families of newforms with complex multiplication

    Authors: Alexis Gomez, Dermot McCarthy, Dylan Young

    Abstract: Using Hecke characters, we construct two infinite families of newforms with complex multiplication, one by $\mathbb{Q}(\sqrt{-3})$ and the other by $\mathbb{Q}(\sqrt{-2})$. The values of the $p$-th Fourier coefficients of all the forms in each family can be described by a single formula, which we provide explicitly. This allows us to establish a formula relating the $p$-th Fourier coefficients of… ▽ More

    Submitted 10 July, 2018; originally announced July 2018.