Skip to main content

Showing 1–33 of 33 results for author: Lazaridou, A

  1. arXiv:2406.00179  [pdf, other

    cs.CL cs.AI

    Long-Span Question-Answering: Automatic Question Generation and QA-System Ranking via Side-by-Side Evaluation

    Authors: Bernd Bohnet, Kevin Swersky, Rosanne Liu, Pranjal Awasthi, Azade Nova, Javier Snaider, Hanie Sedghi, Aaron T Parisi, Michael Collins, Angeliki Lazaridou, Orhan Firat, Noah Fiedel

    Abstract: We explore the use of long-context capabilities in large language models to create synthetic reading comprehension data from entire books. Previous efforts to construct such datasets relied on crowd-sourcing, but the emergence of transformers with a context size of 1 million or more tokens now enables entirely automatic approaches. Our objective is to test the capabilities of LLMs to analyze, unde… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

  2. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  3. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  4. arXiv:2307.05741  [pdf, other

    cs.CL

    Towards Robust and Efficient Continual Language Learning

    Authors: Adam Fisch, Amal Rannen-Triki, Razvan Pascanu, Jörg Bornschein, Angeliki Lazaridou, Elena Gribovskaya, Marc'Aurelio Ranzato

    Abstract: As the application space of language models continues to evolve, a natural question to ask is how we can quickly adapt models to new tasks. We approach this classic question from a continual learning perspective, in which we aim to continue fine-tuning models trained on past tasks on new tasks, with the goal of "transferring" relevant knowledge. However, this strategy also runs the risk of doing m… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

  5. arXiv:2211.11747  [pdf, other

    cs.LG cs.CV

    NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision Research

    Authors: Jorg Bornschein, Alexandre Galashov, Ross Hemsley, Amal Rannen-Triki, Yutian Chen, Arslan Chaudhry, Xu Owen He, Arthur Douillard, Massimo Caccia, Qixuang Feng, Jiajun Shen, Sylvestre-Alvise Rebuffi, Kitty Stacpoole, Diego de las Casas, Will Hawkins, Angeliki Lazaridou, Yee Whye Teh, Andrei A. Rusu, Razvan Pascanu, Marc'Aurelio Ranzato

    Abstract: A shared goal of several machine learning communities like continual learning, meta-learning and transfer learning, is to design algorithms and models that efficiently and robustly adapt to unseen tasks. An even more ambitious goal is to build models that never stop adapting, and that become increasingly more efficient through time by suitably transferring the accrued knowledge. Beyond the study o… ▽ More

    Submitted 16 May, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

  6. arXiv:2205.11388  [pdf, other

    cs.CL cs.LG

    StreamingQA: A Benchmark for Adaptation to New Knowledge over Time in Question Answering Models

    Authors: Adam Liška, Tomáš Kočiský, Elena Gribovskaya, Tayfun Terzi, Eren Sezener, Devang Agrawal, Cyprien de Masson d'Autume, Tim Scholtes, Manzil Zaheer, Susannah Young, Ellen Gilsenan-McMahon, Sophia Austin, Phil Blunsom, Angeliki Lazaridou

    Abstract: Knowledge and language understanding of models evaluated through question answering (QA) has been usually studied on static snapshots of knowledge, like Wikipedia. However, our world is dynamic, evolves over time, and our models' knowledge becomes outdated. To study how semi-parametric QA models and their underlying parametric language models (LMs) adapt to evolving knowledge, we construct a new l… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

  7. arXiv:2203.05115  [pdf, other

    cs.CL cs.LG

    Internet-augmented language models through few-shot prompting for open-domain question answering

    Authors: Angeliki Lazaridou, Elena Gribovskaya, Wojciech Stokowiec, Nikolai Grigorev

    Abstract: In this work, we aim to capitalize on the unique few-shot capabilities of large-scale language models (LSLMs) to overcome some of their challenges with respect to grounding to factual and up-to-date information. Motivated by semi-parametric language models (LMs), which ground their decisions in external retrieved evidence, we use few-shot prompting to learn to condition LMs on information returned… ▽ More

    Submitted 23 May, 2022; v1 submitted 9 March, 2022; originally announced March 2022.

  8. arXiv:2112.11446  [pdf, other

    cs.CL cs.AI

    Scaling Language Models: Methods, Analysis & Insights from Training Gopher

    Authors: Jack W. Rae, Sebastian Borgeaud, Trevor Cai, Katie Millican, Jordan Hoffmann, Francis Song, John Aslanides, Sarah Henderson, Roman Ring, Susannah Young, Eliza Rutherford, Tom Hennigan, Jacob Menick, Albin Cassirer, Richard Powell, George van den Driessche, Lisa Anne Hendricks, Maribeth Rauh, Po-Sen Huang, Amelia Glaese, Johannes Welbl, Sumanth Dathathri, Saffron Huang, Jonathan Uesato, John Mellor , et al. (55 additional authors not shown)

    Abstract: Language modelling provides a step towards intelligent communication systems by harnessing large repositories of written human knowledge to better predict and understand the world. In this paper, we present an analysis of Transformer-based language model performance across a wide range of model scales -- from models with tens of millions of parameters up to a 280 billion parameter model called Gop… ▽ More

    Submitted 21 January, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

    Comments: 120 pages

  9. arXiv:2110.14241  [pdf, other

    cs.LG cs.AI cs.CL cs.MA stat.ML

    Dynamic population-based meta-learning for multi-agent communication with natural language

    Authors: Abhinav Gupta, Marc Lanctot, Angeliki Lazaridou

    Abstract: In this work, our goal is to train agents that can coordinate with seen, unseen as well as human partners in a multi-agent communication environment involving natural language. Previous work using a single set of agents has shown great progress in generalizing to known partners, however it struggles when coordinating with unfamiliar agents. To mitigate that, recent work explored the use of populat… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

    Comments: Accepted at NeurIPS 2021

  10. arXiv:2102.01951  [pdf, other

    cs.CL cs.AI

    Mind the Gap: Assessing Temporal Generalization in Neural Language Models

    Authors: Angeliki Lazaridou, Adhiguna Kuncoro, Elena Gribovskaya, Devang Agrawal, Adam Liska, Tayfun Terzi, Mai Gimenez, Cyprien de Masson d'Autume, Tomas Kocisky, Sebastian Ruder, Dani Yogatama, Kris Cao, Susannah Young, Phil Blunsom

    Abstract: Our world is open-ended, non-stationary, and constantly evolving; thus what we talk about and how we talk about it change over time. This inherent dynamic nature of language contrasts with the current static language modelling paradigm, which trains and evaluates models on utterances from overlapping time periods. Despite impressive recent progress, we demonstrate that Transformer-XL language mode… ▽ More

    Submitted 26 October, 2021; v1 submitted 3 February, 2021; originally announced February 2021.

    Comments: To appear as a Spotlight at NeurIPS 2021

  11. arXiv:2101.10276  [pdf, other

    cs.LG cs.AI cs.MA

    Emergent Communication under Competition

    Authors: Michael Noukhovitch, Travis LaCroix, Angeliki Lazaridou, Aaron Courville

    Abstract: The literature in modern machine learning has only negative results for learning to communicate between competitive agents using standard RL. We introduce a modified sender-receiver game to study the spectrum of partially-competitive scenarios and show communication can indeed emerge in a competitive setting. We empirically demonstrate three key takeaways for future research. First, we show that c… ▽ More

    Submitted 25 January, 2021; originally announced January 2021.

    Comments: To be presented at AAMAS 2021

  12. arXiv:2010.10380  [pdf, other

    cs.LG cs.AI cs.MA

    Negotiating Team Formation Using Deep Reinforcement Learning

    Authors: Yoram Bachrach, Richard Everett, Edward Hughes, Angeliki Lazaridou, Joel Z. Leibo, Marc Lanctot, Michael Johanson, Wojciech M. Czarnecki, Thore Graepel

    Abstract: When autonomous agents interact in the same environment, they must often cooperate to achieve their goals. One way for agents to cooperate effectively is to form a team, make a binding agreement on a joint plan, and execute it. However, when agents are self-interested, the gains from team formation must be allocated appropriately to incentivize agreement. Various approaches for multi-agent negotia… ▽ More

    Submitted 20 October, 2020; originally announced October 2020.

    ACM Class: I.2.6

    Journal ref: Artificial Intelligence 288 (2020): 103356

  13. arXiv:2006.02419  [pdf, other

    cs.CL cs.AI

    Emergent Multi-Agent Communication in the Deep Learning Era

    Authors: Angeliki Lazaridou, Marco Baroni

    Abstract: The ability to cooperate through language is a defining feature of humans. As the perceptual, motory and planning capabilities of deep artificial networks increase, researchers are studying whether they also can develop a shared language to interact. From a scientific perspective, understanding the conditions under which language evolves in communities of deep agents and its emergent features can… ▽ More

    Submitted 14 July, 2020; v1 submitted 3 June, 2020; originally announced June 2020.

    Comments: Added some more references and discussion

  14. arXiv:2005.07064  [pdf, other

    cs.CL cs.AI cs.LG

    Multi-agent Communication meets Natural Language: Synergies between Functional and Structural Language Learning

    Authors: Angeliki Lazaridou, Anna Potapenko, Olivier Tieleman

    Abstract: We present a method for combining multi-agent communication and traditional data-driven approaches to natural language learning, with an end goal of teaching agents to communicate with humans in natural language. Our starting point is a language model that has been trained on generic, not task-specific language data. We then place this model in a multi-agent self-play environment that generates ta… ▽ More

    Submitted 14 May, 2020; originally announced May 2020.

    Comments: to appear at ACL 2020

  15. arXiv:2004.10151  [pdf, other

    cs.CL cs.AI cs.LG

    Experience Grounds Language

    Authors: Yonatan Bisk, Ari Holtzman, Jesse Thomason, Jacob Andreas, Yoshua Bengio, Joyce Chai, Mirella Lapata, Angeliki Lazaridou, Jonathan May, Aleksandr Nisnevich, Nicolas Pinto, Joseph Turian

    Abstract: Language understanding research is held back by a failure to relate language to the physical world it describes and to the social interactions it facilitates. Despite the incredible effectiveness of language processing models to tackle tasks after being trained on text alone, successful linguistic communication relies on a shared experience of the world. It is this shared experience that makes utt… ▽ More

    Submitted 1 November, 2020; v1 submitted 21 April, 2020; originally announced April 2020.

    Comments: Empirical Methods in Natural Language Processing (EMNLP), 2020

  16. arXiv:1912.06208  [pdf, other

    cs.CL cs.NE

    Shaping representations through communication: community size effect in artificial learning systems

    Authors: Olivier Tieleman, Angeliki Lazaridou, Shibl Mourad, Charles Blundell, Doina Precup

    Abstract: Motivated by theories of language and communication that explain why communities with large numbers of speakers have, on average, simpler languages with more regularity, we cast the representation learning problem in terms of learning to communicate. Our starting point sees the traditional autoencoder setup as a single encoder with a fixed decoder partner that must learn to communicate. Generalizi… ▽ More

    Submitted 12 December, 2019; originally announced December 2019.

    Comments: NeurIPS 2019 workshop on visually grounded interaction and language

  17. arXiv:1912.05676  [pdf, other

    cs.MA cs.CL cs.LG

    Biases for Emergent Communication in Multi-agent Reinforcement Learning

    Authors: Tom Eccles, Yoram Bachrach, Guy Lever, Angeliki Lazaridou, Thore Graepel

    Abstract: We study the problem of emergent communication, in which language arises because speakers and listeners must communicate information in order to solve tasks. In temporally extended reinforcement learning domains, it has proved hard to learn such communication without centralized training of agents, due in part to a difficult joint exploration problem. We introduce inductive biases for positive sig… ▽ More

    Submitted 11 December, 2019; originally announced December 2019.

    Comments: Accepted at NeurIPS 2019

  18. arXiv:1901.11373  [pdf, other

    cs.LG cs.CL stat.ML

    Learning and Evaluating General Linguistic Intelligence

    Authors: Dani Yogatama, Cyprien de Masson d'Autume, Jerome Connor, Tomas Kocisky, Mike Chrzanowski, Lingpeng Kong, Angeliki Lazaridou, Wang Ling, Lei Yu, Chris Dyer, Phil Blunsom

    Abstract: We define general linguistic intelligence as the ability to reuse previously acquired knowledge about a language's lexicon, syntax, semantics, and pragmatic conventions to adapt to new tasks quickly. Using this definition, we analyze state-of-the-art natural language understanding models and conduct an extensive empirical investigation to evaluate them against these criteria through a series of ex… ▽ More

    Submitted 31 January, 2019; originally announced January 2019.

  19. arXiv:1810.08647  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    Social Influence as Intrinsic Motivation for Multi-Agent Deep Reinforcement Learning

    Authors: Natasha Jaques, Angeliki Lazaridou, Edward Hughes, Caglar Gulcehre, Pedro A. Ortega, DJ Strouse, Joel Z. Leibo, Nando de Freitas

    Abstract: We propose a unified mechanism for achieving coordination and communication in Multi-Agent Reinforcement Learning (MARL), through rewarding agents for having causal influence over other agents' actions. Causal influence is assessed using counterfactual reasoning. At each timestep, an agent simulates alternate actions that it could have taken, and computes their effect on the behavior of other agen… ▽ More

    Submitted 18 June, 2019; v1 submitted 19 October, 2018; originally announced October 2018.

  20. arXiv:1804.03984  [pdf, other

    cs.AI cs.CL cs.LG cs.MA

    Emergence of Linguistic Communication from Referential Games with Symbolic and Pixel Input

    Authors: Angeliki Lazaridou, Karl Moritz Hermann, Karl Tuyls, Stephen Clark

    Abstract: The ability of algorithms to evolve or learn (compositional) communication protocols has traditionally been studied in the language evolution literature through the use of emergent communication tasks. Here we scale up this research by using contemporary deep learning methods and by training reinforcement-learning neural network agents on referential communication games. We extend previous work, i… ▽ More

    Submitted 11 April, 2018; originally announced April 2018.

    Comments: To appear at ICLR 2018

  21. arXiv:1804.03980  [pdf, other

    cs.AI cs.CL cs.LG cs.MA

    Emergent Communication through Negotiation

    Authors: Kris Cao, Angeliki Lazaridou, Marc Lanctot, Joel Z Leibo, Karl Tuyls, Stephen Clark

    Abstract: Multi-agent reinforcement learning offers a way to study how communication could emerge in communities of agents needing to solve specific problems. In this paper, we study the emergence of communication in the negotiation environment, a semi-cooperative model of agent interaction. We introduce two communication protocols -- one grounded in the semantics of the game, and one which is \textit{a pri… ▽ More

    Submitted 11 April, 2018; originally announced April 2018.

    Comments: Published as a conference paper at ICLR 2018

  22. arXiv:1804.02341  [pdf, other

    cs.AI cs.CL cs.LG cs.NE

    Compositional Obverter Communication Learning From Raw Visual Input

    Authors: Edward Choi, Angeliki Lazaridou, Nando de Freitas

    Abstract: One of the distinguishing aspects of human language is its compositionality, which allows us to describe complex environments with limited vocabulary. Previously, it has been shown that neural network agents can learn to communicate in a highly structured, possibly compositional language based on disentangled input (e.g. hand- engineered features). Humans, however, do not learn to communicate base… ▽ More

    Submitted 6 April, 2018; originally announced April 2018.

    Comments: Published as a conference paper at ICLR 2018

  23. arXiv:1711.00832  [pdf, other

    cs.AI cs.GT cs.LG cs.MA

    A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning

    Authors: Marc Lanctot, Vinicius Zambaldi, Audrunas Gruslys, Angeliki Lazaridou, Karl Tuyls, Julien Perolat, David Silver, Thore Graepel

    Abstract: To achieve general intelligence, agents must learn how to interact with others in a shared environment: this is the challenge of multiagent reinforcement learning (MARL). The simplest form is independent reinforcement learning (InRL), where each agent treats its experience as part of its (non-stationary) environment. In this paper, we first observe that policies learned using InRL can overfit to t… ▽ More

    Submitted 7 November, 2017; v1 submitted 2 November, 2017; originally announced November 2017.

    Comments: Camera-ready copy of NIPS 2017 paper, including appendix

  24. arXiv:1707.08172  [pdf, other

    cs.CL

    The RepEval 2017 Shared Task: Multi-Genre Natural Language Inference with Sentence Representations

    Authors: Nikita Nangia, Adina Williams, Angeliki Lazaridou, Samuel R. Bowman

    Abstract: This paper presents the results of the RepEval 2017 Shared Task, which evaluated neural network sentence representation learning models on the Multi-Genre Natural Language Inference corpus (MultiNLI) recently introduced by Williams et al. (2017). All of the five participating teams beat the bidirectional LSTM (BiLSTM) and continuous bag of words baselines reported in Williams et al.. The best sing… ▽ More

    Submitted 25 July, 2017; originally announced July 2017.

    Comments: 10 pages, 1 figure, 6 tables, in Proceedings of The Second Workshop on Evaluating Vector Space Representations for NLP (RepEval 2017)

  25. arXiv:1701.08954  [pdf, ps, other

    cs.LG cs.AI cs.CL

    CommAI: Evaluating the first steps towards a useful general AI

    Authors: Marco Baroni, Armand Joulin, Allan Jabri, Germàn Kruszewski, Angeliki Lazaridou, Klemen Simonic, Tomas Mikolov

    Abstract: With machine learning successfully applied to new daunting problems almost every day, general AI starts looking like an attainable goal. However, most current research focuses instead on important but narrow applications, such as image classification or machine translation. We believe this to be largely due to the lack of objective ways to measure progress towards broad machine intelligence. In or… ▽ More

    Submitted 27 March, 2017; v1 submitted 31 January, 2017; originally announced January 2017.

    Comments: Published in ICLR 2017 Workshop Track

  26. arXiv:1612.07182  [pdf, other

    cs.CL cs.CV cs.GT cs.LG cs.MA

    Multi-Agent Cooperation and the Emergence of (Natural) Language

    Authors: Angeliki Lazaridou, Alexander Peysakhovich, Marco Baroni

    Abstract: The current mainstream approach to train natural language systems is to expose them to large amounts of text. This passive learning is problematic if we are interested in developing interactive machines, such as conversational agents. We propose a framework for language learning that relies on multi-agent communication. We study this learning in the context of referential games. In these games, a… ▽ More

    Submitted 5 March, 2017; v1 submitted 21 December, 2016; originally announced December 2016.

    Comments: Accepted at ICLR 2017

  27. arXiv:1606.06031  [pdf, other

    cs.CL cs.AI cs.LG

    The LAMBADA dataset: Word prediction requiring a broad discourse context

    Authors: Denis Paperno, Germán Kruszewski, Angeliki Lazaridou, Quan Ngoc Pham, Raffaella Bernardi, Sandro Pezzelle, Marco Baroni, Gemma Boleda, Raquel Fernández

    Abstract: We introduce LAMBADA, a dataset to evaluate the capabilities of computational models for text understanding by means of a word prediction task. LAMBADA is a collection of narrative passages sharing the characteristic that human subjects are able to guess their last word if they are exposed to the whole passage, but not if they only see the last sentence preceding the target word. To succeed on LAM… ▽ More

    Submitted 20 June, 2016; originally announced June 2016.

    Comments: 10 pages, Accepted as a long paper for ACL 2016

  28. arXiv:1605.07133  [pdf, other

    cs.CL cs.CV cs.LG

    Towards Multi-Agent Communication-Based Language Learning

    Authors: Angeliki Lazaridou, Nghia The Pham, Marco Baroni

    Abstract: We propose an interactive multimodal framework for language learning. Instead of being passively exposed to large amounts of natural text, our learners (implemented as feed-forward neural networks) engage in cooperative referential games starting from a tabula rasa setup, and thus develop their own language from the need to communicate in order to succeed at the game. Preliminary experiments provi… ▽ More

    Submitted 23 May, 2016; originally announced May 2016.

    Comments: 9 pages, manuscript under submission

  29. arXiv:1603.02618  [pdf, other

    cs.CL cs.CV

    The red one!: On learning to refer to things based on their discriminative properties

    Authors: Angeliki Lazaridou, Nghia The Pham, Marco Baroni

    Abstract: As a first step towards agents learning to communicate about their visual environment, we propose a system that, given visual representations of a referent (cat) and a context (sofa), identifies their discriminative attributes, i.e., properties that distinguish them (has_tail). Moreover, despite the lack of direct supervision at the attribute level, the model learns to assign plausible attributes… ▽ More

    Submitted 23 May, 2016; v1 submitted 8 March, 2016; originally announced March 2016.

    Comments: Accepted as an ACL-short sumbmission

  30. arXiv:1506.03500  [pdf, other

    cs.CV cs.CL

    Unveiling the Dreams of Word Embeddings: Towards Language-Driven Image Generation

    Authors: Angeliki Lazaridou, Dat Tien Nguyen, Raffaella Bernardi, Marco Baroni

    Abstract: We introduce language-driven image generation, the task of generating an image visualizing the semantic contents of a word embedding, e.g., given the word embedding of grasshopper, we generate a natural image of a grasshopper. We implement a simple method based on two mapping functions. The first takes as input a word embedding (as produced, e.g., by the word2vec toolkit) and maps it onto a high-l… ▽ More

    Submitted 23 November, 2015; v1 submitted 10 June, 2015; originally announced June 2015.

    Comments: A 6-page version to appear at the Multimodal Machine Learning NIPS 2015 Workshop

  31. arXiv:1501.02714  [pdf, other

    cs.CL cs.CV

    From Visual Attributes to Adjectives through Decompositional Distributional Semantics

    Authors: Angeliki Lazaridou, Georgiana Dinu, Adam Liska, Marco Baroni

    Abstract: As automated image analysis progresses, there is increasing interest in richer linguistic annotation of pictures, with attributes of objects (e.g., furry, brown...) attracting most attention. By building on the recent "zero-shot learning" approach, and paying attention to the linguistic nature of attributes as noun modifiers, and specifically adjectives, we show that it is possible to tag images w… ▽ More

    Submitted 24 March, 2015; v1 submitted 12 January, 2015; originally announced January 2015.

    Comments: accepted at Transactions of the Association for Computational Linguistics (TACL), 3/2015

  32. arXiv:1501.02598  [pdf, other

    cs.CL cs.CV cs.LG

    Combining Language and Vision with a Multimodal Skip-gram Model

    Authors: Angeliki Lazaridou, Nghia The Pham, Marco Baroni

    Abstract: We extend the SKIP-GRAM model of Mikolov et al. (2013a) by taking visual information into account. Like SKIP-GRAM, our multimodal models (MMSKIP-GRAM) build vector-based word representations by learning to predict linguistic contexts in text corpora. However, for a restricted set of words, the models are also exposed to visual representations of the objects they denote (extracted from natural imag… ▽ More

    Submitted 12 March, 2015; v1 submitted 12 January, 2015; originally announced January 2015.

    Comments: accepted at NAACL 2015, camera ready version, 11 pages

  33. arXiv:1412.6568  [pdf, other

    cs.CL cs.LG

    Improving zero-shot learning by mitigating the hubness problem

    Authors: Georgiana Dinu, Angeliki Lazaridou, Marco Baroni

    Abstract: The zero-shot paradigm exploits vector-based word representations extracted from text corpora with unsupervised methods to learn general mapping functions from other feature spaces onto word space, where the words associated to the nearest neighbours of the mapped vectors are used as their linguistic labels. We show that the neighbourhoods of the mapped elements are strongly polluted by hubs, vect… ▽ More

    Submitted 15 April, 2015; v1 submitted 19 December, 2014; originally announced December 2014.