Showing 1–42 of 42 results for author: Bailis, P

  1. arXiv:2407.16831  [pdf, ps, other]

    cs.AI

    Networks of Networks: Complexity Class Principles Applied to Compound AI Systems Design

    Authors: Jared Quincy Davis, Boris Hanin, Lingjiao Chen, Peter Bailis, Ion Stoica, Matei Zaharia

    Abstract: As practitioners seek to surpass the current reliability and quality frontier of monolithic models, Compound AI Systems consisting of many language model inference calls are increasingly employed. In this work, we construct systems, which we call Networks of Networks (NoNs) organized around the distinction between generating a proposed answer and verifying its correctness, a fundamental concept in…

    Submitted 23 July, 2024; originally announced July 2024.

  2. arXiv:2403.02419  [pdf, other]

    cs.LG cs.AI cs.CL eess.SY

    Are More LLM Calls All You Need? Towards Scaling Laws of Compound Inference Systems

    Authors: Lingjiao Chen, Jared Quincy Davis, Boris Hanin, Peter Bailis, Ion Stoica, Matei Zaharia, James Zou

    Abstract: Many recent state-of-the-art results in language tasks were achieved using compound systems that perform multiple Language Model (LM) calls and aggregate their responses. However, there is little understanding of how the number of LM calls - e.g., when asking the LM to answer each question multiple times and taking a majority vote - affects such a compound system's performance. In this paper, we i…

    Submitted 4 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  3. arXiv:2402.02057  [pdf, other]

    cs.LG cs.CL

    Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

    Authors: Yichao Fu, Peter Bailis, Ion Stoica, Hao Zhang

    Abstract: Autoregressive decoding of large language models (LLMs) is memory bandwidth bounded, resulting in high latency and significant wastes of the parallel processing power of modern accelerators. Existing methods for accelerating LLM decoding often require a draft model (e.g., speculative decoding), which is nontrivial to obtain and unable to generalize. In this paper, we introduce Lookahead decoding,…

    Submitted 3 February, 2024; originally announced February 2024.

  4. arXiv:2310.07177  [pdf, other]

    cs.AI cs.CL cs.LG

    Online Speculative Decoding

    Authors: Xiaoxuan Liu, Lanxiang Hu, Peter Bailis, Alvin Cheung, Zhijie Deng, Ion Stoica, Hao Zhang

    Abstract: Speculative decoding is a pivotal technique to accelerate the inference of large language models (LLMs) by employing a smaller draft model to predict the target model's outputs. However, its efficacy can be limited due to the low predictive accuracy of the draft model, particularly when faced with diverse text inputs and a significant capability gap between the draft and target models. We introduc…

    Submitted 9 June, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

  5. arXiv:2208.13068  [pdf, other]

    cs.DB cs.DC

    Apiary: A DBMS-Integrated Transactional Function-as-a-Service Framework

    Authors: Peter Kraft, Qian Li, Kostis Kaffes, Athinagoras Skiadopoulos, Deeptaanshu Kumar, Danny Cho, Jason Li, Robert Redmond, Nathan Weckwerth, Brian Xia, Peter Bailis, Michael Cafarella, Goetz Graefe, Jeremy Kepner, Christos Kozyrakis, Michael Stonebraker, Lalith Suresh, Xiangyao Yu, Matei Zaharia

    Abstract: Developers increasingly use function-as-a-service (FaaS) platforms for data-centric applications that perform low-latency and transactional operations on data, such as for microservices or web serving. Unfortunately, existing FaaS platforms support these applications poorly because they physically and logically separate application logic, executed in cloud functions, from data management, done in…

    Submitted 30 June, 2023; v1 submitted 27 August, 2022; originally announced August 2022.

    Comments: 14 pages, 13 figures, 3 tables. Preprint

  6. arXiv:2201.05797  [pdf, other]

    cs.DB

    Finding Label and Model Errors in Perception Data With Learned Observation Assertions

    Authors: Daniel Kang, Nikos Arechiga, Sudeep Pillai, Peter Bailis, Matei Zaharia

    Abstract: ML is being deployed in complex, real-world scenarios where errors have impactful consequences. In these systems, thorough testing of the ML pipelines is critical. A key component in ML deployment pipelines is the curation of labeled training data. Common practice in the ML literature assumes that labels are the ground truth. However, in our experience in a large autonomous vehicle development cen…

    Submitted 15 January, 2022; originally announced January 2022.

    Journal ref: SIGMOD 2022

  7. Accelerating Approximate Aggregation Queries with Expensive Predicates

    Authors: Daniel Kang, John Guibas, Peter Bailis, Tatsunori Hashimoto, Yi Sun, Matei Zaharia

    Abstract: Researchers and industry analysts are increasingly interested in computing aggregation queries over large, unstructured datasets with selective predicates that are computed using expensive deep neural networks (DNNs). As these DNNs are expensive and because many applications can tolerate approximate answers, analysts are interested in accelerating these queries via approximations. Unfortunately, s…

    Submitted 13 August, 2021; originally announced August 2021.

    Journal ref: PVLDB, 14(11): 2341-2354, 2021

  8. arXiv:2107.12525  [pdf, ps, other]

    math.ST cs.DB cs.LG stat.ML

    Proof: Accelerating Approximate Aggregation Queries with Expensive Predicates

    Authors: Daniel Kang, John Guibas, Peter Bailis, Tatsunori Hashimoto, Yi Sun, Matei Zaharia

    Abstract: Given a dataset $\mathcal{D}$, we are interested in computing the mean of a subset of $\mathcal{D}$ which matches a predicate. ABae leverages stratified sampling and proxy models to efficiently compute this statistic given a sampling budget $N$. In this document, we theoretically analyze ABae and show that the MSE of the estimate decays at rate $O(N_1^{-1} + N_2^{-1} + N_1^{1/2}N_2^{-3/2})$, where…

    Submitted 28 July, 2021; v1 submitted 26 July, 2021; originally announced July 2021.

  9. arXiv:2102.08622  [pdf, other]

    cs.LG stat.ML

    Sinkhorn Label Allocation: Semi-Supervised Classification via Annealed Self-Training

    Authors: Kai Sheng Tai, Peter Bailis, Gregory Valiant

    Abstract: Self-training is a standard approach to semi-supervised learning where the learner's own predictions on unlabeled data are used as supervision during training. In this paper, we reinterpret this label assignment process as an optimal transportation problem between examples and classes, wherein the cost of assigning an example to a class is mediated by the current predictions of the classifier. Thi…

    Submitted 11 June, 2021; v1 submitted 17 February, 2021; originally announced February 2021.

    Comments: ICML 2021 camera ready version

  10. arXiv:2009.04540  [pdf, other]

    cs.DB

    Semantic Indexes for Machine Learning-based Queries over Unstructured Data

    Authors: Daniel Kang, John Guibas, Peter Bailis, Tatsunori Hashimoto, Matei Zaharia

    Abstract: Unstructured data (e.g., video or text) is now commonly queried by using computationally expensive deep neural networks or human labelers to produce structured information, e.g., object types and positions in video. To accelerate queries, many recent systems (e.g., BlazeIt, NoScope, Tahoma, SUPG, etc.) train a query-specific proxy model to approximate a large target labelers (i.e., these expensive…

    Submitted 6 January, 2022; v1 submitted 9 September, 2020; originally announced September 2020.

    Journal ref: SIGMOD 2022

  11. Approximate Partition Selection for Big-Data Workloads using Summary Statistics

    Authors: Kexin Rong, Yao Lu, Peter Bailis, Srikanth Kandula, Philip Levis

    Abstract: Many big-data clusters store data in large partitions that support access at a coarse, partition-level granularity. As a result, approximate query processing via row-level sampling is inefficient, often requiring reads of many partitions. In this work, we seek to answer queries quickly and approximately by reading a subset of the data partitions and combining partial answers in a weighted manner w…

    Submitted 24 August, 2020; originally announced August 2020.

  12. arXiv:2008.09983  [pdf, other]

    cs.LG cs.DB stat.ML

    Leveraging Organizational Resources to Adapt Models to New Data Modalities

    Authors: Sahaana Suri, Raghuveer Chanda, Neslihan Bulut, Pradyumna Narayana, Yemao Zeng, Peter Bailis, Sugato Basu, Girija Narlikar, Christopher Re, Abishek Sethi

    Abstract: As applications in large organizations evolve, the machine learning (ML) models that power them must adapt the same predictive tasks to newly arising data modalities (e.g., a new video content launch in a social media application requires existing text or image models to extend to video). To solve this problem, organizations typically create ML pipelines from scratch. However, this fails to utiliz…

    Submitted 23 August, 2020; originally announced August 2020.

    Journal ref: PVLDB, 13(12): 3396-3410, 2020

  13. arXiv:2007.13005  [pdf, other]

    cs.DB cs.CV

    Jointly Optimizing Preprocessing and Inference for DNN-based Visual Analytics

    Authors: Daniel Kang, Ankit Mathur, Teja Veeramacheneni, Peter Bailis, Matei Zaharia

    Abstract: While deep neural networks (DNNs) are an increasingly popular way to query large corpora of data, their significant runtime remains an active area of research. As a result, researchers have proposed systems and optimizations to reduce these costs by allowing users to trade off accuracy and speed. In this work, we examine end-to-end DNN execution in visual analytics systems on modern accelerators.…

    Submitted 25 July, 2020; originally announced July 2020.

  14. arXiv:2007.00077  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Similarity Search for Efficient Active Learning and Search of Rare Concepts

    Authors: Cody Coleman, Edward Chou, Julian Katz-Samuels, Sean Culatana, Peter Bailis, Alexander C. Berg, Robert Nowak, Roshan Sumbaly, Matei Zaharia, I. Zeki Yalniz

    Abstract: Many active learning and search approaches are intractable for large-scale industrial settings with billions of unlabeled examples. Existing approaches search globally for the optimal examples to label, scaling linearly or even quadratically with the unlabeled data. In this paper, we improve the computational efficiency of active learning and search methods by restricting the candidate pool for la…

    Submitted 22 July, 2021; v1 submitted 30 June, 2020; originally announced July 2020.

  15. arXiv:2006.03779  [pdf, other]

    stat.ML cs.LG

    Chromatic Learning for Sparse Datasets

    Authors: Vladimir Feinberg, Peter Bailis

    Abstract: Learning over sparse, high-dimensional data frequently necessitates the use of specialized methods such as the hashing trick. In this work, we design a highly scalable alternative approach that leverages the low degree of feature co-occurrences present in many practical settings. This approach, which we call Chromatic Learning (CL), obtains a low-dimensional dense feature representation by perform…

    Submitted 6 June, 2020; originally announced June 2020.

    Comments: 15 pages, 8 figures, under review

  16. arXiv:2004.00827  [pdf, other]

    cs.DB

    Approximate Selection with Guarantees using Proxies

    Authors: Daniel Kang, Edward Gan, Peter Bailis, Tatsunori Hashimoto, Matei Zaharia

    Abstract: Due to the falling costs of data acquisition and storage, researchers and industry analysts often want to find all instances of rare events in large datasets. For instance, scientists can cheaply capture thousands of hours of video, but are limited by the need to manually inspect long videos to identify relevant objects and events. To reduce this cost, recent work proposes to use cheap proxy model…

    Submitted 3 January, 2022; v1 submitted 2 April, 2020; originally announced April 2020.

    Journal ref: PVLDB 2020

  17. arXiv:2003.01668  [pdf, other]

    cs.AI cs.LG

    Model Assertions for Monitoring and Improving ML Models

    Authors: Daniel Kang, Deepti Raghavan, Peter Bailis, Matei Zaharia

    Abstract: ML models are increasingly deployed in settings with real world interactions such as vehicles, but unfortunately, these models can fail in systematic ways. To prevent errors, ML engineering teams monitor and continuously improve these models. We propose a new abstraction, model assertions, that adapts the classical use of program assertions as a way to monitor and improve ML models. Model assertio…

    Submitted 11 March, 2020; v1 submitted 3 March, 2020; originally announced March 2020.

    Journal ref: MLSys 2020

  18. arXiv:2002.03063  [pdf, other]

    cs.DB

    Storyboard: Optimizing Precomputed Summaries for Aggregation

    Authors: Edward Gan, Peter Bailis, Moses Charikar

    Abstract: An emerging class of data systems partition their data and precompute approximate summaries (i.e., sketches and samples) for each segment to reduce query costs. They can then aggregate and combine the segment summaries to estimate results without scanning the raw data. However, given limited storage space each summary introduces approximation errors that affect query accuracy. For instance, system…

    Submitted 7 February, 2020; originally announced February 2020.

  19. arXiv:1910.01500  [pdf, other]

    cs.LG cs.PF stat.ML

    MLPerf Training Benchmark

    Authors: Peter Mattson, Christine Cheng, Cody Coleman, Greg Diamos, Paulius Micikevicius, David Patterson, Hanlin Tang, Gu-Yeon Wei, Peter Bailis, Victor Bittorf, David Brooks, Dehao Chen, Debojyoti Dutta, Udit Gupta, Kim Hazelwood, Andrew Hock, Xinyuan Huang, Atsushi Ike, Bill Jia, Daniel Kang, David Kanter, Naveen Kumar, Jeffery Liao, Guokai Ma, Deepak Narayanan, et al. (12 additional authors not shown)

    Abstract: Machine learning (ML) needs industry-standard performance benchmarks to support design and competitive evaluation of the many emerging software and hardware solutions for ML. But ML training presents three unique benchmarking challenges absent from other domains: optimizations that improve training throughput can increase the time to solution, training is stochastic and time to solution exhibits h…

    Submitted 2 March, 2020; v1 submitted 2 October, 2019; originally announced October 2019.

    Comments: MLSys 2020

  20. arXiv:1906.11829  [pdf, other]

    cs.LG stat.ML

    Selection via Proxy: Efficient Data Selection for Deep Learning

    Authors: Cody Coleman, Christopher Yeh, Stephen Mussmann, Baharan Mirzasoleiman, Peter Bailis, Percy Liang, Jure Leskovec, Matei Zaharia

    Abstract: Data selection methods, such as active learning and core-set selection, are useful tools for machine learning on large datasets. However, they can be prohibitively expensive to apply in deep learning because they depend on feature representations that need to be learned. In this work, we show that we can greatly improve the computational efficiency by using a small proxy model to perform data sele…

    Submitted 26 October, 2020; v1 submitted 26 June, 2019; originally announced June 2019.

    Comments: ICLR 2020

  21. arXiv:1906.01974  [pdf, other]

    cs.DB cs.LG

    Willump: A Statistically-Aware End-to-end Optimizer for Machine Learning Inference

    Authors: Peter Kraft, Daniel Kang, Deepak Narayanan, Shoumik Palkar, Peter Bailis, Matei Zaharia

    Abstract: Systems for ML inference are widely deployed today, but they typically optimize ML inference workloads using techniques designed for conventional data serving workloads and miss critical opportunities to leverage the statistical nature of ML. In this paper, we present Willump, an optimizer for ML inference that introduces two statistically-motivated optimizations targeting ML applications whose pe…

    Submitted 5 March, 2020; v1 submitted 3 June, 2019; originally announced June 2019.

    Journal ref: Proceedings of Machine Learning and Systems 2020, 147-159

  22. arXiv:1905.02304  [pdf, other]

    cs.LG cs.DB stat.ML

    CrossTrainer: Practical Domain Adaptation with Loss Reweighting

    Authors: Justin Chen, Edward Gan, Kexin Rong, Sahaana Suri, Peter Bailis

    Abstract: Domain adaptation provides a powerful set of model training techniques given domain-specific training data and supplemental data with unknown relevance. The techniques are useful when users need to develop models with data from varying sources, of varying quality, or from different time ranges. We build CrossTrainer, a system for practical domain adaptation. CrossTrainer utilizes loss reweighting,…

    Submitted 6 May, 2019; originally announced May 2019.

  23. arXiv:1904.03257  [pdf, ps, other]

    cs.LG cs.DB cs.DC cs.SE stat.ML

    MLSys: The New Frontier of Machine Learning Systems

    Authors: Alexander Ratner, Dan Alistarh, Gustavo Alonso, David G. Andersen, Peter Bailis, Sarah Bird, Nicholas Carlini, Bryan Catanzaro, Jennifer Chayes, Eric Chung, Bill Dally, Jeff Dean, Inderjit S. Dhillon, Alexandros Dimakis, Pradeep Dubey, Charles Elkan, Grigori Fursin, Gregory R. Ganger, Lise Getoor, Phillip B. Gibbons, Garth A. Gibson, Joseph E. Gonzalez, Justin Gottschlich, Song Han, Kim Hazelwood, et al. (44 additional authors not shown)

    Abstract: Machine learning (ML) techniques are enjoying rapidly increasing adoption. However, designing and implementing the systems that support ML models in real-world deployments remains a significant obstacle, in large part due to the radically different development and deployment profile of modern ML methods, and the range of practical concerns that come with broader adoption. We propose to foster a ne…

    Submitted 1 December, 2019; v1 submitted 29 March, 2019; originally announced April 2019.

  24. arXiv:1901.11399  [pdf, other]

    cs.CV cs.LG stat.ML

    Equivariant Transformer Networks

    Authors: Kai Sheng Tai, Peter Bailis, Gregory Valiant

    Abstract: How can prior knowledge on the transformation invariances of a domain be incorporated into the architecture of a neural network? We propose Equivariant Transformers (ETs), a family of differentiable image-to-image mappings that improve the robustness of models towards pre-defined continuous transformation groups. Through the use of specially-derived canonical coordinate systems, ETs incorporate fu…

    Submitted 24 May, 2019; v1 submitted 25 January, 2019; originally announced January 2019.

    Comments: ICML 2019

  25. arXiv:1810.01937  [pdf, other]

    cs.LG cs.AI stat.ML

    LIT: Block-wise Intermediate Representation Training for Model Compression

    Authors: Animesh Koratana, Daniel Kang, Peter Bailis, Matei Zaharia

    Abstract: Knowledge distillation (KD) is a popular method for reducing the computational overhead of deep network inference, in which the output of a teacher model is used to train a smaller, faster student model. Hint training (i.e., FitNets) extends KD by regressing a student model's intermediate representation to a teacher model's intermediate representation. In this work, we introduce bLock-wise Interme…

    Submitted 1 October, 2018; originally announced October 2018.

  26. arXiv:1806.01427  [pdf, other]

    cs.LG stat.ML

    Analysis of DAWNBench, a Time-to-Accuracy Machine Learning Performance Benchmark

    Authors: Cody Coleman, Daniel Kang, Deepak Narayanan, Luigi Nardi, Tian Zhao, Jian Zhang, Peter Bailis, Kunle Olukotun, Chris Re, Matei Zaharia

    Abstract: Researchers have proposed hardware, software, and algorithmic optimizations to improve the computational performance of deep learning. While some of these optimizations perform the same operations faster (e.g., increasing GPU clock speed), many others modify the semantics of the training procedure (e.g., reduced precision), and can impact the final model's accuracy on unseen data. Due to a lack of…

    Submitted 1 December, 2019; v1 submitted 4 June, 2018; originally announced June 2018.

  27. arXiv:1805.01046  [pdf, other]

    cs.DB

    BlazeIt: Optimizing Declarative Aggregation and Limit Queries for Neural Network-Based Video Analytics

    Authors: Daniel Kang, Peter Bailis, Matei Zaharia

    Abstract: Recent advances in neural networks (NNs) have enabled automatic querying of large volumes of video data with high accuracy. While these deep NNs can produce accurate annotations of an object's position and type in video, they are computationally expensive and require complex, imperative deployment code to answer queries. Prior work uses approximate filtering to reduce the cost of video analytics,…

    Submitted 9 December, 2019; v1 submitted 2 May, 2018; originally announced May 2018.

    Journal ref: PVLDB 2020

  28. arXiv:1803.09835  [pdf, other]

    cs.DB

    Locality-Sensitive Hashing for Earthquake Detection: A Case Study of Scaling Data-Driven Science

    Authors: Kexin Rong, Clara E. Yoon, Karianne J. Bergen, Hashem Elezabi, Peter Bailis, Philip Levis, Gregory C. Beroza

    Abstract: In this work, we report on a novel application of Locality Sensitive Hashing (LSH) to seismic data at scale. Based on the high waveform similarity between reoccurring earthquakes, our application identifies potential earthquakes by searching for similar time series segments via LSH. However, a straightforward implementation of this LSH-enabled application has difficulty scaling beyond 3 months of…

    Submitted 23 July, 2018; v1 submitted 26 March, 2018; originally announced March 2018.

  29. arXiv:1803.01969  [pdf, other]

    cs.DB

    Moment-Based Quantile Sketches for Efficient High Cardinality Aggregation Queries

    Authors: Edward Gan, Jialin Ding, Kai Sheng Tai, Vatsal Sharan, Peter Bailis

    Abstract: Interactive analytics increasingly involves querying for quantiles over sub-populations of high cardinality datasets. Data processing engines such as Druid and Spark use mergeable summaries to estimate quantiles, but summary merge times can be a bottleneck during aggregation. We show how a compact and efficiently mergeable quantile sketch can support aggregation workloads. This data structure, whi…

    Submitted 13 July, 2018; v1 submitted 5 March, 2018; originally announced March 2018.

    Comments: Technical Report for paper to be published in VLDB 2018

  30. arXiv:1711.02305  [pdf, other]

    cs.LG cs.DS stat.ML

    Sketching Linear Classifiers over Data Streams

    Authors: Kai Sheng Tai, Vatsal Sharan, Peter Bailis, Gregory Valiant

    Abstract: We introduce a new sub-linear space sketch, the Weight-Median Sketch, for learning compressed linear classifiers over data streams while supporting the efficient recovery of large-magnitude weights in the model. This enables memory-limited execution of several statistical analyses over streams, including online feature selection, streaming data explanation, relative deltoid detection, and stream…

    Submitted 6 April, 2018; v1 submitted 7 November, 2017; originally announced November 2017.

    Comments: Full version of paper appearing at SIGMOD 2018 with more detailed proofs of theoretical results. Code available at https://github.com/stanford-futuredata/wmsketch

  31. DROP: Dimensionality Reduction Optimization for Time Series

    Authors: Sahaana Suri, Peter Bailis

    Abstract: Dimensionality reduction is a critical step in scaling machine learning pipelines. Principal component analysis (PCA) is a standard tool for dimensionality reduction, but performing PCA over a full dataset can be prohibitively expensive. As a result, theoretical work has studied the effectiveness of iterative, stochastic PCA methods that operate over data samples. However, termination conditions f…

    Submitted 23 August, 2020; v1 submitted 1 August, 2017; originally announced August 2017.

    Journal ref: DEEM'19: Proceedings of the 3rd International Workshop on Data Management for End-to-End Machine Learning (2019)

  32. arXiv:1706.08146  [pdf, other]

    cs.LG cs.AI stat.ML

    Compressed Factorization: Fast and Accurate Low-Rank Factorization of Compressively-Sensed Data

    Authors: Vatsal Sharan, Kai Sheng Tai, Peter Bailis, Gregory Valiant

    Abstract: What learning algorithms can be run directly on compressively-sensed data? In this work, we consider the question of accurately and efficiently computing low-rank matrix or tensor factorizations given data compressed via random projections. We examine the approach of first performing factorization in the compressed domain, and then reconstructing the original high-dimensional factors from the reco…

    Submitted 27 May, 2019; v1 submitted 25 June, 2017; originally announced June 2017.

    Comments: Updates for ICML'19 camera-ready

  33. arXiv:1706.01449  [pdf, other]

    cs.IR cs.DB cs.DS cs.PF

    To Index or Not to Index: Optimizing Exact Maximum Inner Product Search

    Authors: Firas Abuzaid, Geet Sethi, Peter Bailis, Matei Zaharia

    Abstract: Exact Maximum Inner Product Search (MIPS) is an important task that is widely pertinent to recommender systems and high-dimensional similarity search. The brute-force approach to solving exact MIPS is computationally expensive, thus spurring recent development of novel indexes and pruning techniques for this task. In this paper, we show that a hardware-efficient brute-force approach, blocked matri…

    Submitted 14 March, 2019; v1 submitted 5 June, 2017; originally announced June 2017.

    Comments: 12 pages, 8 figures, 2 tables

  34. arXiv:1705.07538  [pdf, other]

    cs.LG cs.DB stat.ML

    Infrastructure for Usable Machine Learning: The Stanford DAWN Project

    Authors: Peter Bailis, Kunle Olukotun, Christopher Re, Matei Zaharia

    Abstract: Despite incredible recent advances in machine learning, building machine learning applications remains prohibitively time-consuming and expensive for all but the best-trained, best-funded engineering organizations. This expense comes not from a need for new and improved statistical models but instead from a lack of systems and tools for supporting end-to-end machine learning application developmen…

    Submitted 8 June, 2017; v1 submitted 21 May, 2017; originally announced May 2017.

  35. arXiv:1703.02529  [pdf, other]

    cs.DB cs.CV

    NoScope: Optimizing Neural Network Queries over Video at Scale

    Authors: Daniel Kang, John Emmons, Firas Abuzaid, Peter Bailis, Matei Zaharia

    Abstract: Recent advances in computer vision, in the form of deep neural networks, have made it possible to query increasing volumes of video data with high accuracy. However, neural network inference is computationally expensive at scale: applying a state-of-the-art object detector in real time (i.e., 30+ frames per second) to a single video requires a $4000 GPU. In response, we present NoScope, a system for…

    Submitted 8 August, 2017; v1 submitted 7 March, 2017; originally announced March 2017.

    Comments: PVLDB 2017

  36. ASAP: Prioritizing Attention via Time Series Smoothing

    Authors: Kexin Rong, Peter Bailis

    Abstract: Time series visualization of streaming telemetry (i.e., charting of key metrics such as server load over time) is increasingly prevalent in modern data platforms and applications. However, many existing systems simply plot the raw data streams as they arrive, often obscuring large-scale trends due to small-scale noise. We propose an alternative: to better prioritize end users' attention, smooth ti…

    Submitted 19 September, 2017; v1 submitted 2 March, 2017; originally announced March 2017.

    Journal ref: Proc. VLDB Endow. Vol. 10, No. 11 pages 1358-1369, 2017

  37. arXiv:1603.00567  [pdf, other]

    cs.DB

    MacroBase: Prioritizing Attention in Fast Data

    Authors: Peter Bailis, Edward Gan, Samuel Madden, Deepak Narayanan, Kexin Rong, Sahaana Suri

    Abstract: As data volumes continue to rise, manual inspection is becoming increasingly untenable. In response, we present MacroBase, a data analytics engine that prioritizes end-user attention in high-volume fast data streams. MacroBase enables efficient, accurate, and modular analyses that highlight and aggregate important and unusual behavior, acting as a search engine for fast data. MacroBase is able to…

    Submitted 24 March, 2017; v1 submitted 1 March, 2016; originally announced March 2016.

    Comments: SIGMOD 2017

  38. arXiv:1510.07092  [pdf, other]

    cs.DB

    Asynchronous Complex Analytics in a Distributed Dataflow Architecture

    Authors: Joseph E. Gonzalez, Peter Bailis, Michael I. Jordan, Michael J. Franklin, Joseph M. Hellerstein, Ali Ghodsi, Ion Stoica

    Abstract: Scalable distributed dataflow systems have recently experienced widespread adoption, with commodity dataflow engines such as Hadoop and Spark, and even commodity SQL engines routinely supporting increasingly sophisticated analytics tasks (e.g., support vector machines, logistic regression, collaborative filtering). However, these systems' synchronous (often Bulk Synchronous Parallel) dataflow exec…

    Submitted 23 October, 2015; originally announced October 2015.

  39. arXiv:1409.3809  [pdf, other]

    cs.DB

    The Missing Piece in Complex Analytics: Low Latency, Scalable Model Management and Serving with Velox

    Authors: Daniel Crankshaw, Peter Bailis, Joseph E. Gonzalez, Haoyuan Li, Zhao Zhang, Michael J. Franklin, Ali Ghodsi, Michael I. Jordan

    Abstract: To support complex data-intensive applications such as personalized recommendations, targeted advertising, and intelligent services, the data management community has focused heavily on the design of systems to support training complex models on large datasets. Unfortunately, the design of these systems largely ignores a critical component of the overall analytics process: the deployment and servi…

    Submitted 1 December, 2014; v1 submitted 12 September, 2014; originally announced September 2014.

  40. arXiv:1402.2237  [pdf, other]

    cs.DB

    Coordination Avoidance in Database Systems (Extended Version)

    Authors: Peter Bailis, Alan Fekete, Michael J. Franklin, Ali Ghodsi, Joseph M. Hellerstein, Ion Stoica

    Abstract: Minimizing coordination, or blocking communication between concurrently executing operations, is key to maximizing scalability, availability, and high performance in database systems. However, uninhibited coordination-free execution can compromise application correctness, or consistency. When is coordination necessary for correctness? The classic use of serializable transactions is sufficient to m…

    Submitted 30 October, 2014; v1 submitted 10 February, 2014; originally announced February 2014.

    Comments: Extended version of paper appearing in PVLDB Vol. 8, No. 3

  41. arXiv:1302.0309  [pdf, other]

    cs.DB

    Highly Available Transactions: Virtues and Limitations (Extended Version)

    Authors: Peter Bailis, Aaron Davidson, Alan Fekete, Ali Ghodsi, Joseph M. Hellerstein, Ion Stoica

    Abstract: To minimize network latency and remain online during server failures and network partitions, many modern distributed data storage systems eschew transactional functionality, which provides strong semantic guarantees for groups of multiple operations over multiple data items. In this work, we consider the problem of providing Highly Available Transactions (HATs): transactional guarantees that do no…

    Submitted 6 October, 2013; v1 submitted 1 February, 2013; originally announced February 2013.

    Comments: Extended version of "Highly Available Transactions: Virtues and Limitations" to appear in VLDB 2014

  42. arXiv:1204.6082  [pdf, other]

    cs.DB cs.DC

    Probabilistically Bounded Staleness for Practical Partial Quorums

    Authors: Peter Bailis, Shivaram Venkataraman, Michael J. Franklin, Joseph M. Hellerstein, Ion Stoica

    Abstract: Data store replication results in a fundamental trade-off between operation latency and data consistency. In this paper, we examine this trade-off in the context of quorum-replicated data stores. Under partial, or non-strict quorum replication, a data store waits for responses from a subset of replicas before answering a query, without guaranteeing that read and write replica sets intersect. As de…

    Submitted 26 April, 2012; originally announced April 2012.

    Comments: VLDB 2012

    Journal ref: Proceedings of the VLDB Endowment (PVLDB), Vol. 5, No. 8, pp. 776-787 (2012)