Showing 1–15 of 15 results for author: Bailey, P

  1. arXiv:2403.08295  [pdf, other]

    cs.CL cs.AI

    Gemma: Open Models Based on Gemini Research and Technology

    Authors: Gemma Team, Thomas Mesnard, Cassidy Hardin, Robert Dadashi, Surya Bhupatiraju, Shreya Pathak, Laurent Sifre, Morgane Rivière, Mihir Sanjay Kale, Juliette Love, Pouya Tafti, Léonard Hussenot, Pier Giuseppe Sessa, Aakanksha Chowdhery, Adam Roberts, Aditya Barua, Alex Botev, Alex Castro-Ros, Ambrose Slone, Amélie Héliou, Andrea Tacchetti, Anna Bulanova, Antonia Paterson, Beth Tsai, Bobak Shahriari, et al. (83 additional authors not shown)

    Abstract: This work introduces Gemma, a family of lightweight, state-of-the-art open models built from the research and technology used to create Gemini models. Gemma models demonstrate strong performance across academic benchmarks for language understanding, reasoning, and safety. We release two sizes of models (2 billion and 7 billion parameters), and provide both pretrained and fine-tuned checkpoints. Ge…

    Submitted 16 April, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

  2. arXiv:2312.11805  [pdf, other]

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee, et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr…

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  3. arXiv:2305.10403  [pdf, other]

    cs.CL cs.AI

    PaLM 2 Technical Report

    Authors: Rohan Anil, Andrew M. Dai, Orhan Firat, Melvin Johnson, Dmitry Lepikhin, Alexandre Passos, Siamak Shakeri, Emanuel Taropa, Paige Bailey, Zhifeng Chen, Eric Chu, Jonathan H. Clark, Laurent El Shafey, Yanping Huang, Kathy Meier-Hellstern, Gaurav Mishra, Erica Moreira, Mark Omernick, Kevin Robinson, Sebastian Ruder, Yi Tay, Kefan Xiao, Yuanzhong Xu, Yujing Zhang, Gustavo Hernandez Abrego, et al. (103 additional authors not shown)

    Abstract: We introduce PaLM 2, a new state-of-the-art language model that has better multilingual and reasoning capabilities and is more compute-efficient than its predecessor PaLM. PaLM 2 is a Transformer-based model trained using a mixture of objectives. Through extensive evaluations on English and multilingual language, and reasoning tasks, we demonstrate that PaLM 2 has significantly improved quality on…

    Submitted 13 September, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

  4. arXiv:2212.09248  [pdf, other]

    cs.CL cs.SE

    Natural Language to Code Generation in Interactive Data Science Notebooks

    Authors: Pengcheng Yin, Wen-Ding Li, Kefan Xiao, Abhishek Rao, Yeming Wen, Kensen Shi, Joshua Howland, Paige Bailey, Michele Catasta, Henryk Michalewski, Alex Polozov, Charles Sutton

    Abstract: Computational notebooks, such as Jupyter notebooks, are interactive computing environments that are ubiquitous among data scientists to perform data wrangling and analytic tasks. To measure the performance of AI pair programmers that automatically synthesize programs for those tasks given natural language (NL) intents from users, we build ARCADE, a benchmark of 1082 code generation problems using…

    Submitted 19 December, 2022; originally announced December 2022.

    Comments: 46 pages, 32 figures

  5. arXiv:2208.03443  [pdf, other]

    cs.HC

    Imagining Future Digital Assistants at Work: A Study of Task Management Needs

    Authors: Yonchanok Khaokaew, Indigo Holcombe-James, Mohammad Saiedur Rahaman, Jonathan Liono, Johanne R. Trippas, Damiano Spina, Nicholas Belkin, Peter Bailey, Paul N. Bennett, Yongli Ren, Mark Sanderson, Falk Scholer, Ryen W. White, Flora D. Salim

    Abstract: Digital Assistants (DAs) can support workers in the workplace and beyond. However, target user needs are not fully understood, and the functions that workers would ideally want a DA to support require further study. A richer understanding of worker needs could help inform the design of future DAs. We investigate user needs of future workplace DAs using data from a user study of 40 workers over a f…

    Submitted 6 August, 2022; originally announced August 2022.

    Comments: 59 pages

  6. arXiv:2110.02482  [pdf, other]

    cs.GT cs.MA

    $O\left(1/T\right)$ Time-Average Convergence in a Generalization of Multiagent Zero-Sum Games

    Authors: James P. Bailey

    Abstract: We introduce a generalization of zero-sum network multiagent matrix games and prove that alternating gradient descent converges to the set of Nash equilibria at rate $O(1/T)$ for this set of games. Alternating gradient descent obtains this convergence guarantee while using fixed learning rates that are four times larger than the optimistic variant of gradient descent. Experimentally, we show with…

    Submitted 5 October, 2021; originally announced October 2021.

  7. arXiv:2110.02134  [pdf, other]

    cs.GT cs.MA

    Stochastic Multiplicative Weights Updates in Zero-Sum Games

    Authors: James P. Bailey, Sai Ganesh Nagarajan, Georgios Piliouras

    Abstract: We study agents competing against each other in a repeated network zero-sum game while applying the multiplicative weights update (MWU) algorithm with fixed learning rates. In our implementation, agents select their strategies probabilistically in each iteration and update their weights/strategies using the realized vector payoff of all strategies, i.e., stochastic MWU with full information. We sh…

    Submitted 5 October, 2021; originally announced October 2021.

  8. arXiv:2108.04381  [pdf, ps, other]

    cs.GT cs.MA

    Conditions for Stability in Strategic Matching

    Authors: James P. Bailey, Craig A. Tovey

    Abstract: We consider the stability of matchings when individuals strategically submit preference information to a publicly known algorithm. Most pure Nash equilibria of the ensuing game yield a matching that is unstable with respect to the individuals' sincere preferences. We introduce a well-supported minimal dishonesty constraint, and obtain conditions under which every pure Nash equilibrium yields a mat…

    Submitted 9 August, 2021; originally announced August 2021.

  9. arXiv:2103.12954  [pdf, ps, other]

    math.OC cs.LG eess.SY

    Convergence Analysis of Nonconvex Distributed Stochastic Zeroth-order Coordinate Method

    Authors: Shengjun Zhang, Yunlong Dong, Dong Xie, Lisha Yao, Colleen P. Bailey, Shengli Fu

    Abstract: This paper investigates the stochastic distributed nonconvex optimization problem of minimizing a global cost function formed by the summation of $n$ local cost functions. We solve this problem using zeroth-order (ZO) information exchange. In this paper, we propose a ZO distributed primal-dual coordinate method (ZODIAC) to solve the stochastic optimization problem. Agents approximate thei…

    Submitted 13 October, 2021; v1 submitted 23 March, 2021; originally announced March 2021.

  10. arXiv:2003.12863  [pdf, other]

    cs.RO cs.LG eess.SY

    Obstacle Avoidance and Navigation Utilizing Reinforcement Learning with Reward Shaping

    Authors: Daniel Zhang, Colleen P. Bailey

    Abstract: In this paper, we investigate the obstacle avoidance and navigation problem in the robotic control area. To solve this problem, we propose revised Deep Deterministic Policy Gradient (DDPG) and Proximal Policy Optimization (PPO) algorithms with an improved reward shaping technique. We compare the performance of the original DDPG and PPO with the revised versions of both on simulations with a re…

    Submitted 9 April, 2020; v1 submitted 28 March, 2020; originally announced March 2020.

  11. arXiv:2003.08525   

    eess.SP cs.CV

    Extremal Region Analysis based Deep Learning Framework for Detecting Defects

    Authors: Zelin Deng, Xiaolong Yan, Shengjun Zhang, Colleen P. Bailey

    Abstract: A maximally stable extremal region (MSER) analysis-based convolutional neural network (CNN) framework for unified defect detection is proposed in this paper. Our proposed framework utilizes the generality and stability of MSER to generate the desired defect candidates. Then a specifically trained binary CNN classifier is applied to the defect candidates to produce the final defect set. Defect dataset…

    Submitted 22 May, 2020; v1 submitted 18 March, 2020; originally announced March 2020.

    Comments: Unsatisfied with results

  12. arXiv:1907.04392  [pdf, other]

    cs.GT math.DS math.OC

    Finite Regret and Cycles with Fixed Step-Size via Alternating Gradient Descent-Ascent

    Authors: James P. Bailey, Gauthier Gidel, Georgios Piliouras

    Abstract: Gradient descent is arguably one of the most popular online optimization methods with a wide array of applications. However, the standard implementation where agents simultaneously update their strategies yields several undesirable properties; strategies diverge away from equilibrium and regret grows over time. In this paper, we eliminate these negative properties by introducing a different implem…

    Submitted 9 July, 2019; originally announced July 2019.

    Comments: 15 pages

  13. arXiv:1905.04532  [pdf, other]

    cs.GT cs.LG cs.MA

    Fast and Furious Learning in Zero-Sum Games: Vanishing Regret with Non-Vanishing Step Sizes

    Authors: James P. Bailey, Georgios Piliouras

    Abstract: We show for the first time, to our knowledge, that it is possible to reconcile in online learning in zero-sum games two seemingly contradictory objectives: vanishing time-average regret and non-vanishing step sizes. This phenomenon, that we coin "fast and furious" learning in games, sets a new benchmark about what is possible both in max-min optimization as well as in multi-agent systems. Our ana…

    Submitted 11 May, 2019; originally announced May 2019.

  14. arXiv:1903.10630  [pdf, other]

    cs.CL cs.AI cs.LG stat.ML

    Diversifying Reply Suggestions using a Matching-Conditional Variational Autoencoder

    Authors: Budhaditya Deb, Peter Bailey, Milad Shokouhi

    Abstract: We consider the problem of diversifying automated reply suggestions for a commercial instant-messaging (IM) system (Skype). Our conversation model is a standard matching based information retrieval architecture, which consists of two parallel encoders to project messages and replies into a common feature representation. During inference, we select replies from a fixed response set using nearest ne…

    Submitted 25 March, 2019; originally announced March 2019.

  15. arXiv:1903.01720  [pdf, ps, other]

    cs.GT cs.LG cs.MA

    Multi-Agent Learning in Network Zero-Sum Games is a Hamiltonian System

    Authors: James P. Bailey, Georgios Piliouras

    Abstract: Zero-sum games are natural, if informal, analogues of closed physical systems where no energy/utility can enter or exit. This analogy can be extended even further if we consider zero-sum network (polymatrix) games where multiple agents interact in a closed economy. Typically, (network) zero-sum games are studied from the perspective of Nash equilibria. Nevertheless, this comes in contrast with the…

    Submitted 5 March, 2019; originally announced March 2019.