Skip to main content

Showing 1–44 of 44 results for author: Rabusseau, G

  1. arXiv:2407.07802  [pdf, other

    cs.LG cs.AI cs.CL

    ROSA: Random Subspace Adaptation for Efficient Fine-Tuning

    Authors: Marawan Gamal Abdel Hameed, Aristides Milios, Siva Reddy, Guillaume Rabusseau

    Abstract: Model training requires significantly more memory, compared with inference. Parameter efficient fine-tuning (PEFT) methods provide a means of adapting large models to downstream tasks using less memory. However, existing methods such as adapters, prompt tuning or low-rank adaptation (LoRA) either introduce latency overhead at inference time or achieve subpar downstream performance compared with fu… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  2. arXiv:2406.10426  [pdf, other

    cs.LG

    Towards Neural Scaling Laws for Foundation Models on Temporal Graphs

    Authors: Razieh Shirzadkhani, Tran Gia Bao Ngo, Kiarash Shamsi, Shenyang Huang, Farimah Poursafaei, Poupak Azad, Reihaneh Rabbany, Baris Coskunuzer, Guillaume Rabusseau, Cuneyt Gurcan Akcora

    Abstract: The field of temporal graph learning aims to learn from evolving network data to forecast future interactions. Given a collection of observed temporal graphs, is it possible to predict the evolution of an unseen network from the same domain? To answer this question, we first present the Temporal Graph Scaling (TGS) dataset, a large collection of temporal graphs consisting of eighty-four ERC20 toke… ▽ More

    Submitted 26 June, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

    Comments: 17 pages, 15 figures, preprint version

  3. arXiv:2406.09639  [pdf, other

    cs.LG cs.SI

    TGB 2.0: A Benchmark for Learning on Temporal Knowledge Graphs and Heterogeneous Graphs

    Authors: Julia Gastinger, Shenyang Huang, Mikhail Galkin, Erfan Loghmani, Ali Parviz, Farimah Poursafaei, Jacob Danovitch, Emanuele Rossi, Ioannis Koutis, Heiner Stuckenschmidt, Reihaneh Rabbany, Guillaume Rabusseau

    Abstract: Multi-relational temporal graphs are powerful tools for modeling real-world data, capturing the evolving and interconnected nature of entities over time. Recently, many novel models are proposed for ML on such graphs intensifying the need for robust evaluation and standardized benchmark datasets. However, the availability of such resources remains scarce and evaluation faces added complexity due t… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 27 pages, 8 figures

  4. arXiv:2406.05045  [pdf, other

    cs.LG

    A Tensor Decomposition Perspective on Second-order RNNs

    Authors: Maude Lizaire, Michael Rizvi-Martel, Marawan Gamal Abdel Hameed, Guillaume Rabusseau

    Abstract: Second-order Recurrent Neural Networks (2RNNs) extend RNNs by leveraging second-order interactions for sequence modelling. These models are provably more expressive than their first-order counterparts and have connections to well-studied models from formal language theory. However, their large parameter tensor makes computations intractable. To circumvent this issue, one approach known as MIRNN co… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted at ICML 2024. Camera ready version

  5. arXiv:2406.02749  [pdf, other

    cs.DS

    Efficient Leverage Score Sampling for Tensor Train Decomposition

    Authors: Vivek Bharadwaj, Beheshteh T. Rakhshan, Osman Asif Malik, Guillaume Rabusseau

    Abstract: Tensor Train~(TT) decomposition is widely used in the machine learning and quantum physics communities as a popular tool to efficiently compress high-dimensional tensor data. In this paper, we propose an efficient algorithm to accelerate computing the TT decomposition with the Alternating Least Squares (ALS) algorithm relying on exact leverage scores sampling. For this purpose, we propose a data s… ▽ More

    Submitted 5 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  6. arXiv:2403.09728  [pdf, other

    cs.CL cs.AI cs.CC

    Simulating Weighted Automata over Sequences and Trees with Transformers

    Authors: Michael Rizvi, Maude Lizaire, Clara Lacroce, Guillaume Rabusseau

    Abstract: Transformers are ubiquitous models in the natural language processing (NLP) community and have shown impressive empirical successes in the past few years. However, little is understood about how they reason and the limits of their computational capabilities. These models do not process data sequentially, and yet outperform sequential neural models such as RNNs. Recent work has shown that these mod… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  7. arXiv:2310.20498  [pdf, other

    cs.LG cond-mat.stat-mech quant-ph stat.ML

    Generative Learning of Continuous Data by Tensor Networks

    Authors: Alex Meiburg, Jing Chen, Jacob Miller, Raphaëlle Tihon, Guillaume Rabusseau, Alejandro Perdomo-Ortiz

    Abstract: Beyond their origin in modeling many-body quantum systems, tensor networks have emerged as a promising class of models for solving machine learning problems, notably in unsupervised generative learning. While possessing many desirable features arising from their quantum-inspired nature, tensor network generative models have previously been largely restricted to binary or categorical data, limiting… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: 21 pages, 15 figures

  8. arXiv:2310.04292  [pdf, other

    cs.LG

    Towards Foundational Models for Molecular Learning on Large-Scale Multi-Task Datasets

    Authors: Dominique Beaini, Shenyang Huang, Joao Alex Cunha, Zhiyi Li, Gabriela Moisescu-Pareja, Oleksandr Dymov, Samuel Maddrell-Mander, Callum McLean, Frederik Wenkel, Luis Müller, Jama Hussein Mohamud, Ali Parviz, Michael Craig, Michał Koziarski, Jiarui Lu, Zhaocheng Zhu, Cristian Gabellini, Kerstin Klaser, Josef Dean, Cas Wognum, Maciej Sypetkowski, Guillaume Rabusseau, Reihaneh Rabbany, Jian Tang, Christopher Morris , et al. (10 additional authors not shown)

    Abstract: Recently, pre-trained foundation models have enabled significant advancements in multiple fields. In molecular machine learning, however, where datasets are often hand-curated, and hence typically small, the lack of datasets with labeled features, and codebases to manage those datasets, has hindered the development of foundation models. In this work, we present seven novel datasets categorized by… ▽ More

    Submitted 18 October, 2023; v1 submitted 6 October, 2023; originally announced October 2023.

  9. arXiv:2307.01026  [pdf, other

    cs.LG cs.AI

    Temporal Graph Benchmark for Machine Learning on Temporal Graphs

    Authors: Shenyang Huang, Farimah Poursafaei, Jacob Danovitch, Matthias Fey, Weihua Hu, Emanuele Rossi, Jure Leskovec, Michael Bronstein, Guillaume Rabusseau, Reihaneh Rabbany

    Abstract: We present the Temporal Graph Benchmark (TGB), a collection of challenging and diverse benchmark datasets for realistic, reproducible, and robust evaluation of machine learning models on temporal graphs. TGB datasets are of large scale, spanning years in duration, incorporate both node and edge-level prediction tasks and cover a diverse set of domains including social, trade, transaction, and tran… ▽ More

    Submitted 27 September, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

    Comments: 20 pages, 7 figures, 7 tables, accepted at NeurIPS 2023 Datasets and Benchmarks Track

  10. arXiv:2306.00135  [pdf, other

    cs.FL

    Optimal Approximate Minimization of One-Letter Weighted Finite Automata

    Authors: Clara Lacroce, Borja Balle, Prakash Panangaden, Guillaume Rabusseau

    Abstract: In this paper, we study the approximate minimization problem of weighted finite automata (WFAs): to compute the best possible approximation of a WFA given a bound on the number of states. By reformulating the problem in terms of Hankel matrices, we leverage classical results on the approximation of Hankel operators, namely the celebrated Adamyan-Arov-Krein (AAK) theory. We solve the optimal spec… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

    Comments: 32 pages. arXiv admin note: substantial text overlap with arXiv:2102.06860

  11. arXiv:2305.08750  [pdf, other

    cs.LG

    Fast and Attributed Change Detection on Dynamic Graphs with Density of States

    Authors: Shenyang Huang, Jacob Danovitch, Guillaume Rabusseau, Reihaneh Rabbany

    Abstract: How can we detect traffic disturbances from international flight transportation logs or changes to collaboration dynamics in academic networks? These problems can be formulated as detecting anomalous change points in a dynamic graph. Current solutions do not scale well to large real-world graphs, lack robustness to large amounts of node additions/deletions, and overlook changes in node attributes.… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: in PAKDD 2023, 18 pages, 12 figures

  12. arXiv:2302.01204  [pdf, other

    cs.LG

    Laplacian Change Point Detection for Single and Multi-view Dynamic Graphs

    Authors: Shenyang Huang, Samy Coulombe, Yasmeen Hitti, Reihaneh Rabbany, Guillaume Rabusseau

    Abstract: Dynamic graphs are rich data structures that are used to model complex relationships between entities over time. In particular, anomaly detection in temporal graphs is crucial for many real world applications such as intrusion identification in network systems, detection of ecosystem disturbances and detection of epidemic outbreaks. In this paper, we focus on change point detection in dynamic grap… ▽ More

    Submitted 2 February, 2023; originally announced February 2023.

    Comments: 30 pages, 15 figures, extended version of previous paper "Laplacian Change Point Detection for Dynamic Graphs" with novel material. arXiv admin note: substantial text overlap with arXiv:2007.01229

  13. arXiv:2211.02255  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Spectral Regularization: an Inductive Bias for Sequence Modeling

    Authors: Kaiwen Hou, Guillaume Rabusseau

    Abstract: Various forms of regularization in learning tasks strive for different notions of simplicity. This paper presents a spectral regularization technique, which attaches a unique inductive bias to sequence modeling based on an intuitive concept of simplicity defined in the Chomsky hierarchy. From fundamental connections between Hankel matrices and regular grammars, we propose to use the trace norm of… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

    Comments: LearnAut paper in 2022 (https://learnaut22.github.io/programme.html#abstract-20)

  14. arXiv:2206.03923  [pdf, other

    cs.LG cs.AI

    Sequential Density Estimation via Nonlinear Continuous Weighted Finite Automata

    Authors: Tianyu Li, Bogdan Mazoure, Guillaume Rabusseau

    Abstract: Weighted finite automata (WFAs) have been widely applied in many fields. One of the classic problems for WFAs is probability distribution estimation over sequences of discrete symbols. Although WFAs have been extended to deal with continuous input data, namely continuous WFAs (CWFAs), it is still unclear how to approximate density functions over sequences of continuous random variables using WFA-b… ▽ More

    Submitted 12 December, 2022; v1 submitted 8 June, 2022; originally announced June 2022.

  15. arXiv:2206.00172  [pdf, ps, other

    cs.FL

    Towards an AAK Theory Approach to Approximate Minimization in the Multi-Letter Case

    Authors: Clara Lacroce, Prakash Panangaden, Guillaume Rabusseau

    Abstract: We study the approximate minimization problem of weighted finite automata (WFAs): given a WFA, we want to compute its optimal approximation when restricted to a given size. We reformulate the problem as a rank-minimization task in the spectral norm, and propose a framework to apply Adamyan-Arov-Krein (AAK) theory to the approximation problem. This approach has already been successfully applied to… ▽ More

    Submitted 31 May, 2022; originally announced June 2022.

    Comments: LearnAut 2022

  16. arXiv:2205.11691  [pdf, other

    cs.LG cs.AI

    High-Order Pooling for Graph Neural Networks with Tensor Decomposition

    Authors: Chenqing Hua, Guillaume Rabusseau, Jian Tang

    Abstract: Graph Neural Networks (GNNs) are attracting growing attention due to their effectiveness and flexibility in modeling a variety of graph-structured data. Exiting GNN architectures usually adopt simple pooling operations (eg. sum, average, max) when aggregating messages from a local neighborhood for updating node representation or pooling node representations from the entire graph to compute the gra… ▽ More

    Submitted 20 October, 2022; v1 submitted 23 May, 2022; originally announced May 2022.

  17. arXiv:2110.13970  [pdf, other

    cs.LG stat.ML

    Rademacher Random Projections with Tensor Networks

    Authors: Beheshteh T. Rakhshan, Guillaume Rabusseau

    Abstract: Random projection (RP) have recently emerged as popular techniques in the machine learning community for their ability in reducing the dimension of very high-dimensional tensors. Following the work in [30], we consider a tensorized random projection relying on Tensor Train (TT) decomposition where each element of the core tensors is drawn from a Rademacher distribution. Our theoretical results rev… ▽ More

    Submitted 2 February, 2022; v1 submitted 26 October, 2021; originally announced October 2021.

  18. arXiv:2106.11827  [pdf, other

    cs.LG

    Lower and Upper Bounds on the VC-Dimension of Tensor Network Models

    Authors: Behnoush Khavari, Guillaume Rabusseau

    Abstract: Tensor network methods have been a key ingredient of advances in condensed matter physics and have recently sparked interest in the machine learning community for their ability to compactly represent very high-dimensional objects. Tensor network methods can for example be used to efficiently learn linear models in exponentially large feature spaces [Stoudenmire and Schwab, 2016]. In this work, we… ▽ More

    Submitted 22 June, 2021; originally announced June 2021.

  19. arXiv:2106.02965  [pdf, ps, other

    cs.LG cs.FL

    Extracting Weighted Automata for Approximate Minimization in Language Modelling

    Authors: Clara Lacroce, Prakash Panangaden, Guillaume Rabusseau

    Abstract: In this paper we study the approximate minimization problem for language modelling. We assume we are given some language model as a black box. The objective is to obtain a weighted finite automaton (WFA) that fits within a given size constraint and which mimics the behaviour of the original model while minimizing some notion of distance between the black box and the extracted WFA. We provide an al… ▽ More

    Submitted 23 July, 2021; v1 submitted 5 June, 2021; originally announced June 2021.

    Comments: Full version of ICGI 2020/21 paper, authors are listed in alphabetical order

  20. arXiv:2102.06860  [pdf, ps, other

    cs.FL

    Optimal Spectral-Norm Approximate Minimization of Weighted Finite Automata

    Authors: Borja Balle, Clara Lacroce, Prakash Panangaden, Doina Precup, Guillaume Rabusseau

    Abstract: We address the approximate minimization problem for weighted finite automata (WFAs) with weights in $\mathbb{R}$, over a one-letter alphabet: to compute the best possible approximation of a WFA given a bound on the number of states. This work is grounded in Adamyan-Arov-Krein Approximation theory, a remarkable collection of results on the approximation of Hankel operators. In addition to its intri… ▽ More

    Submitted 17 May, 2021; v1 submitted 12 February, 2021; originally announced February 2021.

    Comments: Full version of ICALP2021 paper, authors are listed in alphabetical order

  21. arXiv:2101.10249  [pdf, other

    cs.LG cs.AI econ.EM

    Assessing the Impact: Does an Improvement to a Revenue Management System Lead to an Improved Revenue?

    Authors: Greta Laage, Emma Frejinger, Andrea Lodi, Guillaume Rabusseau

    Abstract: Airlines and other industries have been making use of sophisticated Revenue Management Systems to maximize revenue for decades. While improving the different components of these systems has been the focus of numerous studies, estimating the impact of such improvements on the revenue has been overlooked in the literature despite its practical importance. Indeed, quantifying the benefit of a change… ▽ More

    Submitted 16 June, 2021; v1 submitted 13 January, 2021; originally announced January 2021.

  22. arXiv:2010.10653  [pdf, other

    cs.LG quant-ph

    Quantum Tensor Networks, Stochastic Processes, and Weighted Automata

    Authors: Siddarth Srinivasan, Sandesh Adhikary, Jacob Miller, Guillaume Rabusseau, Byron Boots

    Abstract: Modeling joint probability distributions over sequences has been studied from many perspectives. The physics community developed matrix product states, a tensor-train decomposition for probabilistic modeling, motivated by the need to tractably model many-body systems. But similar models have also been studied in the stochastic processes and weighted automata literature, with little work on how the… ▽ More

    Submitted 20 October, 2020; originally announced October 2020.

  23. arXiv:2010.10029  [pdf, other

    cs.LG cs.FL

    Connecting Weighted Automata, Tensor Networks and Recurrent Neural Networks through Spectral Learning

    Authors: Tianyu Li, Doina Precup, Guillaume Rabusseau

    Abstract: In this paper, we present connections between three models used in different research fields: weighted finite automata~(WFA) from formal languages and linguistics, recurrent neural networks used in machine learning, and tensor networks which encompasses a set of optimization techniques for high-order tensors used in quantum physics and numerical analysis. We first present an intrinsic relation bet… ▽ More

    Submitted 6 January, 2022; v1 submitted 19 October, 2020; originally announced October 2020.

    Comments: Accepted as a journal paper in Machine Learning Journal. arXiv admin note: text overlap with arXiv:1807.01406

  24. arXiv:2010.04003  [pdf, other

    cs.LG cs.AI stat.ML

    A Theoretical Analysis of Catastrophic Forgetting through the NTK Overlap Matrix

    Authors: Thang Doan, Mehdi Bennani, Bogdan Mazoure, Guillaume Rabusseau, Pierre Alquier

    Abstract: Continual learning (CL) is a setting in which an agent has to learn from an incoming stream of data during its entire lifetime. Although major advances have been made in the field, one recurring problem which remains unsolved is that of Catastrophic Forgetting (CF). While the issue has been extensively studied empirically, little attention has been paid from a theoretical angle. In this paper, we… ▽ More

    Submitted 25 February, 2021; v1 submitted 7 October, 2020; originally announced October 2020.

    Comments: Accepted to AISTATS 2021. Keywords: continual learning, catastrophic forgetting, NTK regime, orthgonal gradient descent

    Journal ref: Proceedings of the 24th International Conference on Artificial Intelligence and Statistics (AISTATS 2021)

  25. arXiv:2008.05437  [pdf, other

    cs.LG stat.ML

    Adaptive Learning of Tensor Network Structures

    Authors: Meraj Hashemizadeh, Michelle Liu, Jacob Miller, Guillaume Rabusseau

    Abstract: Tensor Networks (TN) offer a powerful framework to efficiently represent very high-dimensional objects. TN have recently shown their potential for machine learning applications and offer a unifying view of common tensor decomposition models such as Tucker, tensor train (TT) and tensor ring (TR). However, identifying the best tensor network structure from data for a given task is challenging. In th… ▽ More

    Submitted 22 June, 2021; v1 submitted 12 August, 2020; originally announced August 2020.

  26. arXiv:2007.01229  [pdf, other

    cs.LG cs.SI stat.ML

    Laplacian Change Point Detection for Dynamic Graphs

    Authors: Shenyang Huang, Yasmeen Hitti, Guillaume Rabusseau, Reihaneh Rabbany

    Abstract: Dynamic and temporal graphs are rich data structures that are used to model complex relationships between entities over time. In particular, anomaly detection in temporal graphs is crucial for many real world applications such as intrusion identification in network systems, detection of ecosystem disturbances and detection of epidemic outbreaks. In this paper, we focus on change point detection in… ▽ More

    Submitted 2 July, 2020; originally announced July 2020.

    Comments: in KDD 2020, 10 pages

  27. arXiv:2003.05101  [pdf, other

    cs.LG cs.DS stat.ML

    Tensorized Random Projections

    Authors: Beheshteh T. Rakhshan, Guillaume Rabusseau

    Abstract: We introduce a novel random projection technique for efficiently reducing the dimension of very high-dimensional tensors. Building upon classical results on Gaussian random projections and Johnson-Lindenstrauss transforms~(JLT), we propose two tensorized random projection maps relying on the tensor train~(TT) and CP decomposition format, respectively. The two maps offer very low memory requirement… ▽ More

    Submitted 10 March, 2020; originally announced March 2020.

  28. arXiv:2003.01181  [pdf, other

    cs.LG cs.CV stat.ML

    RandomNet: Towards Fully Automatic Neural Architecture Design for Multimodal Learning

    Authors: Stefano Alletto, Shenyang Huang, Vincent Francois-Lavet, Yohei Nakata, Guillaume Rabusseau

    Abstract: Almost all neural architecture search methods are evaluated in terms of performance (i.e. test accuracy) of the model structures that it finds. Should it be the only metric for a good autoML approach? To examine aspects beyond performance, we propose a set of criteria aimed at evaluating the core of autoML problem: the amount of human intervention required to deploy these methods into real world s… ▽ More

    Submitted 2 March, 2020; originally announced March 2020.

    Comments: 6 pages, 1 figures

  29. arXiv:2003.01039  [pdf, other

    cs.LG quant-ph stat.ML

    Tensor Networks for Probabilistic Sequence Modeling

    Authors: Jacob Miller, Guillaume Rabusseau, John Terilla

    Abstract: Tensor networks are a powerful modeling framework developed for computational many-body physics, which have only recently been applied within machine learning. In this work we utilize a uniform matrix product state (u-MPS) model for probabilistic modeling of sequence data. We first show that u-MPS enable sequence-level parallelism, with length-n sequences able to be evaluated in depth O(log n). We… ▽ More

    Submitted 23 April, 2021; v1 submitted 2 March, 2020; originally announced March 2020.

    Comments: 18 pages, 2 figures; v4 conference version; v3 link to code for experiments; v2 major revision with new main result on regular expression sampling. International Conference on Artificial Intelligence and Statistics. PMLR, 2021

  30. arXiv:2002.02863  [pdf, other

    cs.LG stat.ML

    Representation of Reinforcement Learning Policies in Reproducing Kernel Hilbert Spaces

    Authors: Bogdan Mazoure, Thang Doan, Tianyu Li, Vladimir Makarenkov, Joelle Pineau, Doina Precup, Guillaume Rabusseau

    Abstract: We propose a general framework for policy representation for reinforcement learning tasks. This framework involves finding a low-dimensional embedding of the policy on a reproducing kernel Hilbert space (RKHS). The usage of RKHS based methods allows us to derive strong theoretical guarantees on the expected return of the reconstructed policy. Such guarantees are typically lacking in black-box mode… ▽ More

    Submitted 15 October, 2020; v1 submitted 7 February, 2020; originally announced February 2020.

  31. arXiv:1911.05010  [pdf, other

    cs.AI cs.LG

    Efficient Planning under Partial Observability with Unnormalized Q Functions and Spectral Learning

    Authors: Tianyu Li, Bogdan Mazoure, Doina Precup, Guillaume Rabusseau

    Abstract: Learning and planning in partially-observable domains is one of the most difficult problems in reinforcement learning. Traditional methods consider these two problems as independent, resulting in a classical two-stage paradigm: first learn the environment dynamics and then plan accordingly. This approach, however, disconnects the two problems and can consequently lead to algorithms that are sample… ▽ More

    Submitted 21 November, 2019; v1 submitted 12 November, 2019; originally announced November 2019.

  32. arXiv:1909.06686  [pdf, other

    cs.LG stat.ML

    Neural Architecture Search for Class-incremental Learning

    Authors: Shenyang Huang, Vincent François-Lavet, Guillaume Rabusseau

    Abstract: In class-incremental learning, a model learns continuously from a sequential data stream in which new classes occur. Existing methods often rely on static architectures that are manually crafted. These methods can be prone to capacity saturation because a neural network's ability to generalize to new concepts is limited by its fixed capacity. To understand how to expand a continual learner, we foc… ▽ More

    Submitted 14 September, 2019; originally announced September 2019.

    Comments: 8 pages, 10 Figures

  33. arXiv:1812.07627  [pdf, other

    cs.LG cs.AI stat.ML

    Clustering-Oriented Representation Learning with Attractive-Repulsive Loss

    Authors: Kian Kenyon-Dean, Andre Cianflone, Lucas Page-Caccia, Guillaume Rabusseau, Jackie Chi Kit Cheung, Doina Precup

    Abstract: The standard loss function used to train neural network classifiers, categorical cross-entropy (CCE), seeks to maximize accuracy on the training data; building useful representations is not a necessary byproduct of this objective. In this work, we propose clustering-oriented representation learning (COREL) as an alternative to CCE in the context of a generalized attractive-repulsive loss framework… ▽ More

    Submitted 18 December, 2018; originally announced December 2018.

    Comments: AAAI 2019 Workshop on Network Interpretability for Deep Learning (9 pages)

    MSC Class: 62H30

  34. arXiv:1810.07468  [pdf, other

    stat.ML cs.LG

    Hierarchical Methods of Moments

    Authors: Matteo Ruffini, Guillaume Rabusseau, Borja Balle

    Abstract: Spectral methods of moments provide a powerful tool for learning the parameters of latent variable models. Despite their theoretical appeal, the applicability of these methods to real data is still limited due to a lack of robustness to model misspecification. In this paper we present a hierarchical approach to methods of moments to circumvent such limitations. Our method is based on replacing the… ▽ More

    Submitted 17 October, 2018; originally announced October 2018.

    Comments: NIPS 2017

  35. arXiv:1809.04988  [pdf, other

    cs.LG cs.AI stat.ML

    Sequential Coordination of Deep Models for Learning Visual Arithmetic

    Authors: Eric Crawford, Guillaume Rabusseau, Joelle Pineau

    Abstract: Achieving machine intelligence requires a smooth integration of perception and reasoning, yet models developed to date tend to specialize in one or the other; sophisticated manipulation of symbols acquired from rich perceptual spaces has so far proved elusive. Consider a visual arithmetic task, where the goal is to carry out simple arithmetical algorithms on digits presented under natural conditio… ▽ More

    Submitted 13 September, 2018; originally announced September 2018.

  36. arXiv:1807.01406  [pdf, other

    cs.LG cs.FL stat.ML

    Connecting Weighted Automata and Recurrent Neural Networks through Spectral Learning

    Authors: Guillaume Rabusseau, Tianyu Li, Doina Precup

    Abstract: In this paper, we unravel a fundamental connection between weighted finite automata~(WFAs) and second-order recurrent neural networks~(2-RNNs): in the case of sequences of discrete symbols, WFAs and 2-RNNs with linear activation functions are expressively equivalent. Motivated by this result, we build upon a recent extension of the spectral learning algorithm to vector-valued WFAs and propose the… ▽ More

    Submitted 7 April, 2019; v1 submitted 3 July, 2018; originally announced July 2018.

    Comments: AISTATS 2019

  37. arXiv:1806.08297  [pdf, other

    cs.FL cs.LG stat.ML

    Learning Graph Weighted Models on Pictures

    Authors: Philip Amortila, Guillaume Rabusseau

    Abstract: Graph Weighted Models (GWMs) have recently been proposed as a natural generalization of weighted automata over strings and trees to arbitrary families of labeled graphs (and hypergraphs). A GWM generically associates a labeled graph with a tensor network and computes a value by successive contractions directed by its edges. In this paper, we consider the problem of learning GWMs defined over the g… ▽ More

    Submitted 2 December, 2018; v1 submitted 21 June, 2018; originally announced June 2018.

    Comments: International Conference on Grammatical Inference 2018 (v2: camera-ready)

  38. arXiv:1712.09520  [pdf, other

    cs.LG stat.ML

    Tensor Regression Networks with various Low-Rank Tensor Approximations

    Authors: Xingwei Cao, Guillaume Rabusseau

    Abstract: Tensor regression networks achieve high compression rate of neural networks while having slight impact on performances. They do so by imposing low tensor rank structure on the weight matrices of fully connected layers. In recent years, tensor regression networks have been investigated from the perspective of their compressive power, however, the regularization effect of enforcing low-rank tensor s… ▽ More

    Submitted 28 November, 2018; v1 submitted 27 December, 2017; originally announced December 2017.

  39. arXiv:1709.07796  [pdf, other

    stat.ML cs.AI cs.LG

    On overfitting and asymptotic bias in batch reinforcement learning with partial observability

    Authors: Vincent Francois-Lavet, Guillaume Rabusseau, Joelle Pineau, Damien Ernst, Raphael Fonteneau

    Abstract: This paper provides an analysis of the tradeoff between asymptotic bias (suboptimality with unlimited data) and overfitting (additional suboptimality due to limited data) in the context of reinforcement learning with partial observability. Our theoretical analysis formally characterizes that while potentially increasing the asymptotic bias, a smaller state representation decreases the risk of over… ▽ More

    Submitted 6 February, 2019; v1 submitted 22 September, 2017; originally announced September 2017.

    Comments: Accepted at the Journal of Artificial Intelligence Research (JAIR) - 31 pages

  40. arXiv:1709.04380  [pdf, other

    cs.FL cs.AI cs.CL cs.LG

    Neural Network Based Nonlinear Weighted Finite Automata

    Authors: Tianyu Li, Guillaume Rabusseau, Doina Precup

    Abstract: Weighted finite automata (WFA) can expressively model functions defined over strings but are inherently linear models. Given the recent successes of nonlinear models in machine learning, it is natural to wonder whether ex-tending WFA to the nonlinear setting would be beneficial. In this paper, we propose a novel model of neural network based nonlinearWFA model (NL-WFA) along with a learning algori… ▽ More

    Submitted 21 December, 2017; v1 submitted 13 September, 2017; originally announced September 2017.

    Comments: AISTATS 2018

  41. arXiv:1602.06863  [pdf, other

    cs.LG

    Higher-Order Low-Rank Regression

    Authors: Guillaume Rabusseau, Hachem Kadri

    Abstract: This paper proposes an efficient algorithm (HOLRR) to handle regression tasks where the outputs have a tensor structure. We formulate the regression problem as the minimization of a least square criterion under a multilinear rank constraint, a difficult non convex problem. HOLRR computes efficiently an approximate solution of this problem, with solid theoretical guarantees. A kernel extension is a… ▽ More

    Submitted 22 February, 2016; originally announced February 2016.

    Comments: submitted to ICML 2016

  42. arXiv:1511.01442  [pdf, other

    cs.LG cs.FL

    Low-Rank Approximation of Weighted Tree Automata

    Authors: Guillaume Rabusseau, Borja Balle, Shay B. Cohen

    Abstract: We describe a technique to minimize weighted tree automata (WTA), a powerful formalisms that subsumes probabilistic context-free grammars (PCFGs) and latent-variable PCFGs. Our method relies on a singular value decomposition of the underlying Hankel matrix defined by the WTA. Our main theoretical result is an efficient algorithm for computing the SVD of an infinite Hankel matrix implicitly represe… ▽ More

    Submitted 24 December, 2015; v1 submitted 4 November, 2015; originally announced November 2015.

    Comments: To appear in AISTATS 2016

  43. arXiv:1404.7533  [pdf, other

    cs.FL math.CO

    Recognizable Series on Hypergraphs

    Authors: Raphaël Bailly, François Denis, Guillaume Rabusseau

    Abstract: We introduce the notion of Hypergraph Weighted Model (HWM) that generically associates a tensor network to a hypergraph and then computes a value by tensor contractions directed by its hyperedges. A series r defined on a hypergraph family is said to be recognizable if there exists a HWM that computes it. This model generalizes the notion of rational series on strings and trees. We prove some prope… ▽ More

    Submitted 16 October, 2014; v1 submitted 29 April, 2014; originally announced April 2014.

  44. arXiv:1403.4224  [pdf, other

    cs.LG

    Learning Negative Mixture Models by Tensor Decompositions

    Authors: Guillaume Rabusseau, François Denis

    Abstract: This work considers the problem of estimating the parameters of negative mixture models, i.e. mixture models that possibly involve negative weights. The contributions of this paper are as follows. (i) We show that every rational probability distributions on strings, a representation which occurs naturally in spectral learning, can be computed by a negative mixture of at most two probabilistic auto… ▽ More

    Submitted 19 September, 2014; v1 submitted 17 March, 2014; originally announced March 2014.