Skip to main content

Showing 1–10 of 10 results for author: Pitler, E

  1. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  2. arXiv:2205.12253  [pdf, other

    cs.CL

    Evaluating the Impact of Model Scale for Compositional Generalization in Semantic Parsing

    Authors: Linlu Qiu, Peter Shaw, Panupong Pasupat, Tianze Shi, Jonathan Herzig, Emily Pitler, Fei Sha, Kristina Toutanova

    Abstract: Despite their strong performance on many tasks, pre-trained language models have been shown to struggle on out-of-distribution compositional generalization. Meanwhile, recent work has shown considerable improvements on many NLP tasks from model scaling. Can scaling up model size also improve compositional generalization in semantic parsing? We evaluate encoder-decoder models up to 11B parameters a… ▽ More

    Submitted 24 October, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

    Comments: EMNLP 2022

  3. arXiv:2010.06032  [pdf, other

    cs.CL

    Measuring and Reducing Gendered Correlations in Pre-trained Models

    Authors: Kellie Webster, Xuezhi Wang, Ian Tenney, Alex Beutel, Emily Pitler, Ellie Pavlick, Jilin Chen, Ed Chi, Slav Petrov

    Abstract: Pre-trained models have revolutionized natural language understanding. However, researchers have found they can encode artifacts undesired in many applications, such as professions correlating with one gender more than another. We explore such gendered correlations as a case study for how to address unintended correlations in pre-trained models. We define metrics and reveal that it is possible for… ▽ More

    Submitted 2 March, 2021; v1 submitted 12 October, 2020; originally announced October 2020.

  4. arXiv:2006.08881  [pdf, ps, other

    cs.CL

    Scalable Cross Lingual Pivots to Model Pronoun Gender for Translation

    Authors: Kellie Webster, Emily Pitler

    Abstract: Machine translation systems with inadequate document understanding can make errors when translating dropped or neutral pronouns into languages with gendered pronouns (e.g., English). Predicting the underlying gender of these pronouns is difficult since it is not marked textually and must instead be inferred from coreferent mentions in the context. We propose a novel cross-lingual pivoting techniqu… ▽ More

    Submitted 15 June, 2020; originally announced June 2020.

  5. arXiv:2004.11999  [pdf, other

    cs.CL

    Syntactic Data Augmentation Increases Robustness to Inference Heuristics

    Authors: Junghyun Min, R. Thomas McCoy, Dipanjan Das, Emily Pitler, Tal Linzen

    Abstract: Pretrained neural models such as BERT, when fine-tuned to perform natural language inference (NLI), often show high accuracy on standard datasets, but display a surprising lack of sensitivity to word order on controlled challenge sets. We hypothesize that this issue is not primarily caused by the pretrained model's limitations, but rather by the paucity of crowdsourced NLI examples that might conv… ▽ More

    Submitted 24 April, 2020; originally announced April 2020.

    Comments: ACL 2020

  6. arXiv:2004.11997  [pdf, other

    cs.CL

    New Protocols and Negative Results for Textual Entailment Data Collection

    Authors: Samuel R. Bowman, Jennimaria Palomaki, Livio Baldini Soares, Emily Pitler

    Abstract: Natural language inference (NLI) data has proven useful in benchmarking and, especially, as pretraining data for tasks requiring language understanding. However, the crowdsourcing protocol that was used to collect this data has known issues and was not explicitly optimized for either of these purposes, so it is likely far from ideal. We propose four alternative protocols, each aimed at improving e… ▽ More

    Submitted 29 September, 2020; v1 submitted 24 April, 2020; originally announced April 2020.

    Comments: To appear at EMNLP 2020

  7. arXiv:1909.00109  [pdf, ps, other

    cs.CL cs.LG

    Giving BERT a Calculator: Finding Operations and Arguments with Reading Comprehension

    Authors: Daniel Andor, Luheng He, Kenton Lee, Emily Pitler

    Abstract: Reading comprehension models have been successfully applied to extractive text answers, but it is unclear how best to generalize these models to abstractive numerical answers. We enable a BERT-based reading comprehension model to perform lightweight numerical reasoning. We augment the model with a predefined set of executable 'programs' which encompass simple arithmetic as well as extraction. Rath… ▽ More

    Submitted 12 September, 2019; v1 submitted 30 August, 2019; originally announced September 2019.

  8. arXiv:1906.05416  [pdf, other

    cs.CL

    Synthetic QA Corpora Generation with Roundtrip Consistency

    Authors: Chris Alberti, Daniel Andor, Emily Pitler, Jacob Devlin, Michael Collins

    Abstract: We introduce a novel method of generating synthetic question answering corpora by combining models of question generation and answer extraction, and by filtering the results to ensure roundtrip consistency. By pretraining on the resulting corpora we obtain significant improvements on SQuAD2 and NQ, establishing a new state-of-the-art on the latter. Our synthetic data generation models, for both qu… ▽ More

    Submitted 12 June, 2019; originally announced June 2019.

  9. arXiv:1805.08237  [pdf, other

    cs.CL

    Morphosyntactic Tagging with a Meta-BiLSTM Model over Context Sensitive Token Encodings

    Authors: Bernd Bohnet, Ryan McDonald, Goncalo Simoes, Daniel Andor, Emily Pitler, Joshua Maynez

    Abstract: The rise of neural networks, and particularly recurrent neural networks, has produced significant advances in part-of-speech tagging accuracy. One characteristic common among these models is the presence of rich initial word encodings. These encodings typically are composed of a recurrent character-based representation with learned and pre-trained word embeddings. However, these encodings do not c… ▽ More

    Submitted 21 May, 2018; originally announced May 2018.

    Journal ref: ACL 2018

  10. arXiv:1708.00214  [pdf, other

    cs.CL cs.NE

    Natural Language Processing with Small Feed-Forward Networks

    Authors: Jan A. Botha, Emily Pitler, Ji Ma, Anton Bakalov, Alex Salcianu, David Weiss, Ryan McDonald, Slav Petrov

    Abstract: We show that small and shallow feed-forward neural networks can achieve near state-of-the-art results on a range of unstructured and structured language processing tasks while being considerably cheaper in memory and computational requirements than deep recurrent models. Motivated by resource-constrained environments like mobile phones, we showcase simple techniques for obtaining such small neural… ▽ More

    Submitted 1 August, 2017; originally announced August 2017.

    Comments: EMNLP 2017 short paper

    MSC Class: 68T50 ACM Class: I.2.7