Skip to main content

Showing 1–42 of 42 results for author: Maggioni, M

  1. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  2. arXiv:2402.08412  [pdf, other

    stat.ML cs.LG math.DS math.ST

    Interacting Particle Systems on Networks: joint inference of the network and the interaction kernel

    Authors: Quanjun Lang, Xiong Wang, Fei Lu, Mauro Maggioni

    Abstract: Modeling multi-agent systems on networks is a fundamental challenge in a wide variety of disciplines. We jointly infer the weight matrix of the network and the interaction kernel, which determine respectively which agents interact with which others and the rules of such interactions from data consisting of multiple trajectories. The estimator we propose leads naturally to a non-convex optimization… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 53 pages, 17 figures

    MSC Class: 62F12; 82C22

  3. arXiv:2402.07250  [pdf, other

    cs.LG cs.AI cs.CE

    DIMON: Learning Solution Operators of Partial Differential Equations on a Diffeomorphic Family of Domains

    Authors: Minglang Yin, Nicolas Charon, Ryan Brody, Lu Lu, Natalia Trayanova, Mauro Maggioni

    Abstract: The solution of a PDE over varying initial/boundary conditions on multiple domains is needed in a wide variety of applications, but it is computationally expensive if the solution is computed de novo whenever the initial/boundary conditions of the domain change. We introduce a general operator learning framework, called DIffeomorphic Mapping Operator learNing (DIMON) to learn approximate PDE solut… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

  4. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  5. arXiv:2312.08338  [pdf, other

    cs.CV

    Global Latent Neural Rendering

    Authors: Thomas Tanay, Matteo Maggioni

    Abstract: A recent trend among generalizable novel view synthesis methods is to learn a rendering operator acting over single camera rays. This approach is promising because it removes the need for explicit volumetric rendering, but it effectively treats target images as collections of independent pixels. Here, we propose to learn a global rendering operator acting over all camera rays jointly. We show that… ▽ More

    Submitted 8 March, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: Accepted at CVPR 2024

  6. arXiv:2305.10403  [pdf, other

    cs.CL cs.AI

    PaLM 2 Technical Report

    Authors: Rohan Anil, Andrew M. Dai, Orhan Firat, Melvin Johnson, Dmitry Lepikhin, Alexandre Passos, Siamak Shakeri, Emanuel Taropa, Paige Bailey, Zhifeng Chen, Eric Chu, Jonathan H. Clark, Laurent El Shafey, Yanping Huang, Kathy Meier-Hellstern, Gaurav Mishra, Erica Moreira, Mark Omernick, Kevin Robinson, Sebastian Ruder, Yi Tay, Kefan Xiao, Yuanzhong Xu, Yujing Zhang, Gustavo Hernandez Abrego , et al. (103 additional authors not shown)

    Abstract: We introduce PaLM 2, a new state-of-the-art language model that has better multilingual and reasoning capabilities and is more compute-efficient than its predecessor PaLM. PaLM 2 is a Transformer-based model trained using a mixture of objectives. Through extensive evaluations on English and multilingual language, and reasoning tasks, we demonstrate that PaLM 2 has significantly improved quality on… ▽ More

    Submitted 13 September, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

  7. arXiv:2304.00898  [pdf, other

    cs.CV eess.IV

    Tunable Convolutions with Parametric Multi-Loss Optimization

    Authors: Matteo Maggioni, Thomas Tanay, Francesca Babiloni, Steven McDonagh, Aleš Leonardis

    Abstract: Behavior of neural networks is irremediably determined by the specific loss and data used during training. However it is often desirable to tune the model at inference time based on external factors such as preferences of the user or dynamic characteristics of the data. This is especially important to balance the perception-distortion trade-off of ill-posed image-to-image translation tasks. In thi… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: CVPR 2023

  8. arXiv:2303.18139  [pdf, other

    cs.CV

    Efficient View Synthesis and 3D-based Multi-Frame Denoising with Multiplane Feature Representations

    Authors: Thomas Tanay, Aleš Leonardis, Matteo Maggioni

    Abstract: While current multi-frame restoration methods combine information from multiple input images using 2D alignment techniques, recent advances in novel view synthesis are paving the way for a new paradigm relying on volumetric scene representations. In this work, we introduce the first 3D-based multi-frame denoising method that significantly outperforms its 2D-based counterparts with lower computatio… ▽ More

    Submitted 5 April, 2023; v1 submitted 31 March, 2023; originally announced March 2023.

    Comments: Accepted at CVPR 2023

  9. arXiv:2212.00746  [pdf, other

    cs.IT cs.LG math.OC stat.ML

    Learning Transition Operators From Sparse Space-Time Samples

    Authors: Christian Kümmerle, Mauro Maggioni, Sui Tang

    Abstract: We consider the nonlinear inverse problem of learning a transition operator $\mathbf{A}$ from partial observations at different times, in particular from sparse observations of entries of its powers $\mathbf{A},\mathbf{A}^2,\cdots,\mathbf{A}^{T}$. This Spatio-Temporal Transition Operator Recovery problem is motivated by the recent interest in learning time-varying graph signals that are driven by… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

    Comments: 34 pages, 12 figures

  10. arXiv:2208.02758  [pdf, other

    cs.LG cs.MA math.DS math.NA

    Learning Interaction Variables and Kernels from Observations of Agent-Based Systems

    Authors: Jinchao Feng, Mauro Maggioni, Patrick Martin, Ming Zhong

    Abstract: Dynamical systems across many disciplines are modeled as interacting particles or agents, with interaction rules that depend on a very small number of variables (e.g. pairwise distances, pairwise differences of phases, etc...), functions of the state of pairs of agents. Yet, these interaction rules can generate self-organized dynamics, with complex emergent behaviors (clustering, flocking, swarmin… ▽ More

    Submitted 4 August, 2022; originally announced August 2022.

  11. arXiv:2207.05242  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Unsupervised learning of observation functions in state-space models by nonparametric moment methods

    Authors: Qingci An, Yannis Kevrekidis, Fei Lu, Mauro Maggioni

    Abstract: We investigate the unsupervised learning of non-invertible observation functions in nonlinear state-space models. Assuming abundant data of the observation process along with the distribution of the state process, we introduce a nonparametric generalized moment method to estimate the observation function via constrained regression. The major challenge comes from the non-invertibility of the observ… ▽ More

    Submitted 11 July, 2022; originally announced July 2022.

    MSC Class: 62G05; 68Q32; 62M15

  12. Model-Based Image Signal Processors via Learnable Dictionaries

    Authors: Marcos V. Conde, Steven McDonagh, Matteo Maggioni, Aleš Leonardis, Eduardo Pérez-Pellitero

    Abstract: Digital cameras transform sensor RAW readings into RGB images by means of their Image Signal Processor (ISP). Computational photography tasks such as image denoising and colour constancy are commonly performed in the RAW domain, in part due to the inherent hardware design, but also due to the appealing simplicity of noise statistics that result from the direct sensor readings. Despite this, the av… ▽ More

    Submitted 10 January, 2022; originally announced January 2022.

    Comments: AAAI 2022

    Journal ref: Vol. 36 No. 1: AAAI-22 Technical Tracks 1 (2022) 481-489

  13. arXiv:2108.11894  [pdf, other

    astro-ph.EP astro-ph.IM cs.AI cs.LG math.NA

    Machine Learning for Discovering Effective Interaction Kernels between Celestial Bodies from Ephemerides

    Authors: Ming Zhong, Jason Miller, Mauro Maggioni

    Abstract: Building accurate and predictive models of the underlying mechanisms of celestial motion has inspired fundamental developments in theoretical physics. Candidate theories seek to explain observations and predict future positions of planets, stars, and other astronomical bodies as faithfully as possible. We use a data-driven learning approach, extending that developed in Lu et al. ($2019$) and exten… ▽ More

    Submitted 26 August, 2021; originally announced August 2021.

    ACM Class: G.1; I.2

  14. arXiv:2106.10070  [pdf, other

    cs.CV cs.LG

    Residual Contrastive Learning for Image Reconstruction: Learning Transferable Representations from Noisy Images

    Authors: Nanqing Dong, Matteo Maggioni, Yongxin Yang, Eduardo Pérez-Pellitero, Ales Leonardis, Steven McDonagh

    Abstract: This paper is concerned with contrastive learning (CL) for low-level image restoration and enhancement tasks. We propose a new label-efficient learning paradigm based on residuals, residual contrastive learning (RCL), and derive an unsupervised visual representation learning framework, suitable for low-level vision tasks with noisy inputs. While supervised image reconstruction aims to minimize res… ▽ More

    Submitted 27 April, 2022; v1 submitted 18 June, 2021; originally announced June 2021.

    Comments: Accepted by IJCAI 2022

  15. arXiv:2105.08629  [pdf, other

    eess.IV cs.CV cs.LG

    Fast Camera Image Denoising on Mobile GPUs with Deep Learning, Mobile AI 2021 Challenge: Report

    Authors: Andrey Ignatov, Kim Byeoung-su, Radu Timofte, Angeline Pouget, Fenglong Song, Cheng Li, Shuai Xiao, Zhongqian Fu, Matteo Maggioni, Yibin Huang, Shen Cheng, Xin Lu, Yifeng Zhou, Liangyu Chen, Donghao Liu, Xiangyu Zhang, Haoqiang Fan, Jian Sun, Shuaicheng Liu, Minsu Kwon, Myungje Lee, Jaeyoon Yoo, Changbeom Kang, Shinjo Wang, Bin Huang , et al. (7 additional authors not shown)

    Abstract: Image denoising is one of the most critical problems in mobile photo processing. While many solutions have been proposed for this task, they are usually working with synthetic data and are too computationally expensive to run on mobile devices. To address this problem, we introduce the first Mobile AI challenge, where the target is to develop an end-to-end deep learning-based image denoising solut… ▽ More

    Submitted 17 May, 2021; originally announced May 2021.

    Comments: Mobile AI 2021 Workshop and Challenges: https://ai-benchmark.com/workshops/mai/2021/. arXiv admin note: substantial text overlap with arXiv:2105.07809, arXiv:2105.07825

  16. arXiv:2105.04663  [pdf, other

    cs.DC cs.LG

    GSPMD: General and Scalable Parallelization for ML Computation Graphs

    Authors: Yuanzhong Xu, HyoukJoong Lee, Dehao Chen, Blake Hechtman, Yanping Huang, Rahul Joshi, Maxim Krikun, Dmitry Lepikhin, Andy Ly, Marcello Maggioni, Ruoming Pang, Noam Shazeer, Shibo Wang, Tao Wang, Yonghui Wu, Zhifeng Chen

    Abstract: We present GSPMD, an automatic, compiler-based parallelization system for common machine learning computations. It allows users to write programs in the same way as for a single device, then give hints through a few annotations on how to distribute tensors, based on which GSPMD will parallelize the computation. Its representation of partitioning is simple yet general, allowing it to express differ… ▽ More

    Submitted 23 December, 2021; v1 submitted 10 May, 2021; originally announced May 2021.

  17. arXiv:2104.10781  [pdf, other

    eess.IV cs.CV

    NTIRE 2021 Challenge on Quality Enhancement of Compressed Video: Methods and Results

    Authors: Ren Yang, Radu Timofte, Jing Liu, Yi Xu, Xinjian Zhang, Minyi Zhao, Shuigeng Zhou, Kelvin C. K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy, Xin Li, Fanglong Liu, He Zheng, Lielin Jiang, Qi Zhang, Dongliang He, Fu Li, Qingqing Dang, Yibin Huang, Matteo Maggioni, Zhongqian Fu, Shuai Xiao, Cheng li, Thomas Tanay , et al. (47 additional authors not shown)

    Abstract: This paper reviews the first NTIRE challenge on quality enhancement of compressed video, with a focus on the proposed methods and results. In this challenge, the new Large-scale Diverse Video (LDV) dataset is employed. The challenge has three tracks. Tracks 1 and 2 aim at enhancing the videos compressed by HEVC at a fixed QP, while Track 3 is designed for enhancing the videos compressed by x265 at… ▽ More

    Submitted 31 August, 2022; v1 submitted 21 April, 2021; originally announced April 2021.

    Comments: Corrected the MOS values in Table 2, and corrected some minor typos

  18. arXiv:2104.02120  [pdf, other

    stat.ML cs.LG math.DS

    Nonlinear model reduction for slow-fast stochastic systems near unknown invariant manifolds

    Authors: Felix X. -F. Ye, Sichen Yang, Mauro Maggioni

    Abstract: We introduce a nonlinear stochastic model reduction technique for high-dimensional stochastic dynamical systems that have a low-dimensional invariant effective manifold with slow dynamics, and high-dimensional, large fast modes. Given only access to a black box simulator from which short bursts of simulation can be obtained, we design an algorithm that outputs an estimate of the invariant manifold… ▽ More

    Submitted 24 October, 2023; v1 submitted 5 April, 2021; originally announced April 2021.

  19. arXiv:2102.00327  [pdf, other

    cs.LG math.ST

    Learning Interaction Kernels for Agent Systems on Riemannian Manifolds

    Authors: Mauro Maggioni, Jason Miller, Hongda Qiu, Ming Zhong

    Abstract: Interacting agent and particle systems are extensively used to model complex phenomena in science and engineering. We consider the problem of learning interaction kernels in these dynamical systems constrained to evolve on Riemannian manifolds from given trajectory data. The models we consider are based on interaction kernels depending on pairwise Riemannian distances between agents, with agents i… ▽ More

    Submitted 5 March, 2021; v1 submitted 30 January, 2021; originally announced February 2021.

  20. arXiv:2101.05119  [pdf, ps, other

    stat.ML cs.LG math.ST

    Multiscale regression on unknown manifolds

    Authors: Wenjing Liao, Mauro Maggioni, Stefano Vigogna

    Abstract: We consider the regression problem of estimating functions on $\mathbb{R}^D$ but supported on a $d$-dimensional manifold $ \mathcal{M} \subset \mathbb{R}^D $ with $ d \ll D $. Drawing ideas from multi-resolution analysis and nonlinear approximation, we construct low-dimensional coordinates on $\mathcal{M}$ at multiple scales, and perform multiscale regression by local polynomial fitting. We propos… ▽ More

    Submitted 13 January, 2021; originally announced January 2021.

  21. arXiv:2010.11081  [pdf, other

    eess.IV cs.CV

    Anatomically-Informed Deep Learning on Contrast-Enhanced Cardiac MRI for Scar Segmentation and Clinical Feature Extraction

    Authors: Haley G. Abramson, Dan M. Popescu, Rebecca Yu, Changxin Lai, Julie K. Shade, Katherine C. Wu, Mauro Maggioni, Natalia A. Trayanova

    Abstract: Visualizing disease-induced scarring and fibrosis in the heart on cardiac magnetic resonance (CMR) imaging with contrast enhancement (LGE) is paramount in characterizing disease progression and quantifying pathophysiological substrates of arrhythmias. However, segmentation and scar/fibrosis identification from LGE-CMR is an intensive manual process prone to large inter-observer variability. Here,… ▽ More

    Submitted 8 January, 2021; v1 submitted 21 October, 2020; originally announced October 2020.

    Comments: Haley G. Abramson and Dan M. Popescu contributed equally to this work

  22. Diagnosing and Preventing Instabilities in Recurrent Video Processing

    Authors: Thomas Tanay, Aivar Sootla, Matteo Maggioni, Puneet K. Dokania, Philip Torr, Ales Leonardis, Gregory Slabaugh

    Abstract: Recurrent models are a popular choice for video enhancement tasks such as video denoising or super-resolution. In this work, we focus on their stability as dynamical systems and show that they tend to fail catastrophically at inference time on long video sequences. To address this issue, we (1) introduce a diagnostic tool which produces input sequences optimized to trigger instabilities and that c… ▽ More

    Submitted 11 March, 2023; v1 submitted 10 October, 2020; originally announced October 2020.

    Journal ref: in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 45, no. 2, pp. 1594-1605, 1 Feb. 2023

  23. arXiv:2010.03729  [pdf, other

    stat.ML cs.LG math.DS math.ST

    Learning Theory for Inferring Interaction Kernels in Second-Order Interacting Agent Systems

    Authors: Jason Miller, Sui Tang, Ming Zhong, Mauro Maggioni

    Abstract: Modeling the complex interactions of systems of particles or agents is a fundamental scientific and mathematical problem that is studied in diverse fields, ranging from physics and biology, to economics and machine learning. In this work, we describe a very general second-order, heterogeneous, multivariable, interacting agent model, with an environment, that encompasses a wide variety of known sys… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

    Comments: 68 pages

    MSC Class: 62Gxx; 37Nxx; 68Txx

  24. arXiv:2005.04117  [pdf, other

    cs.CV eess.IV

    NTIRE 2020 Challenge on Real Image Denoising: Dataset, Methods and Results

    Authors: Abdelrahman Abdelhamed, Mahmoud Afifi, Radu Timofte, Michael S. Brown, Yue Cao, Zhilu Zhang, Wangmeng Zuo, Xiaoling Zhang, Jiye Liu, Wendong Chen, Changyuan Wen, Meng Liu, Shuailin Lv, Yunchao Zhang, Zhihong Pan, Baopu Li, Teng Xi, Yanwen Fan, Xiyu Yu, Gang Zhang, Jingtuo Liu, Junyu Han, Errui Ding, Songhyun Yu, Bumjun Park , et al. (65 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2020 challenge on real image denoising with focus on the newly introduced dataset, the proposed methods and their results. The challenge is a new version of the previous NTIRE 2019 challenge on real image denoising that was based on the SIDD benchmark. This challenge is based on a newly collected validation and testing image datasets, and hence, named SIDD+. This chall… ▽ More

    Submitted 8 May, 2020; originally announced May 2020.

  25. arXiv:1912.11123  [pdf, other

    cs.LG math.DS nlin.AO stat.ML

    Data-driven Discovery of Emergent Behaviors in Collective Dynamics

    Authors: Mauro Maggioni, Jason Miller, Ming Zhong

    Abstract: Particle- and agent-based systems are a ubiquitous modeling tool in many disciplines. We consider the fundamental problem of inferring interaction kernels from observations of agent-based dynamical systems given observations of trajectories, in particular for collective dynamical systems exhibiting emergent behaviors with complicated interaction kernels, in a nonparametric fashion, and for kernels… ▽ More

    Submitted 30 March, 2020; v1 submitted 23 December, 2019; originally announced December 2019.

  26. arXiv:1912.03413  [pdf, other

    cs.DC cs.AR cs.PF

    Dissecting the Graphcore IPU Architecture via Microbenchmarking

    Authors: Zhe Jia, Blake Tillman, Marco Maggioni, Daniele Paolo Scarpazza

    Abstract: This report focuses on the architecture and performance of the Intelligence Processing Unit (IPU), a novel, massively parallel platform recently introduced by Graphcore and aimed at Artificial Intelligence/Machine Learning (AI/ML) workloads. We dissect the IPU's performance behavior using microbenchmarks that we crafted for the purpose. We study the IPU's memory organization and performance. We st… ▽ More

    Submitted 6 December, 2019; originally announced December 2019.

    Comments: 91 pages, 21 figures

  27. arXiv:1911.10581  [pdf, other

    cs.CV

    Pixel Adaptive Filtering Units

    Authors: Filippos Kokkinos, Ioannis Marras, Matteo Maggioni, Gregory Slabaugh, Stefanos Zafeiriou

    Abstract: State-of-the-art methods for computer vision rely heavily on the translation equivariance and spatial sharing properties of convolutional layers without explicitly taking into consideration the input content. Modern techniques employ deep sophisticated architectures in order to circumvent this issue. In this work, we propose a Pixel Adaptive Filtering Unit (PAFU) which introduces a differentiable… ▽ More

    Submitted 24 November, 2019; originally announced November 2019.

  28. arXiv:1910.04832  [pdf, other

    stat.ML cs.LG math.ST

    Learning interaction kernels in heterogeneous systems of agents from multiple trajectories

    Authors: Fei Lu, Mauro Maggioni, Sui Tang

    Abstract: Systems of interacting particles or agents have wide applications in many disciplines such as Physics, Chemistry, Biology and Economics. These systems are governed by interaction laws, which are often unknown: estimating them from observation data is a fundamental task that can provide meaningful insights and accurate predictions of the behaviour of the agents. In this paper, we consider the inver… ▽ More

    Submitted 14 July, 2020; v1 submitted 10 October, 2019; originally announced October 2019.

    Comments: 63 pages, revised various places

    MSC Class: 62GXX

  29. arXiv:1905.12989  [pdf, other

    cs.LG math.ST stat.ML

    Learning by Active Nonlinear Diffusion

    Authors: Mauro Maggioni, James M. Murphy

    Abstract: This article proposes an active learning method for high dimensional data, based on intrinsic data geometries learned through diffusion processes on graphs. Diffusion distances are used to parametrize low-dimensional structures on the dataset, which allow for high-accuracy labelings of the dataset with only a small number of carefully chosen labels. The geometric structure of the data suggests reg… ▽ More

    Submitted 30 May, 2019; originally announced May 2019.

    Comments: 20 pages, 10 figures

  30. arXiv:1903.07486  [pdf, other

    cs.DC

    Dissecting the NVidia Turing T4 GPU via Microbenchmarking

    Authors: Zhe Jia, Marco Maggioni, Jeffrey Smith, Daniele Paolo Scarpazza

    Abstract: In 2019, the rapid rate at which GPU manufacturers refresh their designs, coupled with their reluctance to disclose microarchitectural details, is still a hurdle for those software designers who want to extract the highest possible performance. Last year, these very reasons motivated us to dissect the Volta GPU architecture using microbenchmarks. The introduction in August 2018 of Turing, NVidia… ▽ More

    Submitted 18 March, 2019; originally announced March 2019.

    Comments: 65 pages

  31. arXiv:1902.05402  [pdf, other

    cs.CV cs.LG stat.ML

    Spectral-Spatial Diffusion Geometry for Hyperspectral Image Clustering

    Authors: James M. Murphy, Mauro Maggioni

    Abstract: An unsupervised learning algorithm to cluster hyperspectral image (HSI) data is proposed that exploits spatially-regularized random walks. Markov diffusions are defined on the space of HSI spectra with transitions constrained to near spatial neighbors. The explicit incorporation of spatial regularity into the diffusion construction leads to smoother random processes that are more adapted for unsup… ▽ More

    Submitted 8 February, 2019; originally announced February 2019.

  32. Nonparametric inference of interaction laws in systems of agents from trajectory data

    Authors: Fei Lu, Mauro Maggioni, Sui Tang, Ming Zhong

    Abstract: Inferring the laws of interaction between particles and agents in complex dynamical systems from observational data is a fundamental challenge in a wide variety of disciplines. We propose a non-parametric statistical learning approach to estimate the governing laws of distance-based interactions, with no reference or assumption about their analytical form, from data consisting trajectories of inte… ▽ More

    Submitted 23 March, 2019; v1 submitted 14 December, 2018; originally announced December 2018.

  33. arXiv:1810.06702  [pdf, other

    stat.ML cs.LG

    Learning by Unsupervised Nonlinear Diffusion

    Authors: Mauro Maggioni, James M. Murphy

    Abstract: This paper proposes and analyzes a novel clustering algorithm that combines graph-based diffusion geometry with techniques based on density and mode estimation. The proposed method is suitable for data generated from mixtures of distributions with densities that are both multimodal and have nonlinear shapes. A crucial aspect of this algorithm is the use of time of a data-adapted diffusion process… ▽ More

    Submitted 29 December, 2018; v1 submitted 15 October, 2018; originally announced October 2018.

    Comments: 40 Pages, 17 Figures

  34. arXiv:1804.06826  [pdf, other

    cs.DC cs.PF

    Dissecting the NVIDIA Volta GPU Architecture via Microbenchmarking

    Authors: Zhe Jia, Marco Maggioni, Benjamin Staiger, Daniele P. Scarpazza

    Abstract: Every year, novel NVIDIA GPU designs are introduced. This rapid architectural and technological progression, coupled with a reluctance by manufacturers to disclose low-level details, makes it difficult for even the most proficient GPU software designers to remain up-to-date with the technological advances at a microarchitectural level. To address this dearth of public, microarchitectural-level inf… ▽ More

    Submitted 18 April, 2018; originally announced April 2018.

    Comments: Technical report. First Edition. April 18th, 2018. 66 pages

  35. arXiv:1708.02469  [pdf, other

    cs.LG

    Multiscale Strategies for Computing Optimal Transport

    Authors: Samuel Gerber, Mauro Maggioni

    Abstract: This paper presents a multiscale approach to efficiently compute approximate optimal transport plans between point sets. It is particularly well-suited for point sets that are in high-dimensions, but are close to being intrinsically low-dimensional. The approach is based on an adaptive multiscale decomposition of the point sets. The multiscale decomposition yields a sequence of optimal transport p… ▽ More

    Submitted 8 August, 2017; originally announced August 2017.

    Comments: Accepted to JMLR

    Journal ref: Journal of Machine Learning Research 18 (2017): 1-32

  36. arXiv:1704.07961  [pdf, other

    cs.CV

    Unsupervised Clustering and Active Learning of Hyperspectral Images with Nonlinear Diffusion

    Authors: James M. Murphy, Mauro Maggioni

    Abstract: The problem of unsupervised learning and segmentation of hyperspectral images is a significant challenge in remote sensing. The high dimensionality of hyperspectral data, presence of substantial noise, and overlap of classes all contribute to the difficulty of automatically clustering and segmenting hyperspectral images. We propose an unsupervised learning technique called spectral-spatial diffusi… ▽ More

    Submitted 15 October, 2018; v1 submitted 25 April, 2017; originally announced April 2017.

    Comments: 17 pages, 22 figures, 3 tables. IEEE accepted version

  37. arXiv:1611.01179  [pdf, other

    stat.ML cs.IT math.ST

    Adaptive Geometric Multiscale Approximations for Intrinsically Low-dimensional Data

    Authors: Wenjing Liao, Mauro Maggioni

    Abstract: We consider the problem of efficiently approximating and encoding high-dimensional data sampled from a probability distribution $ρ$ in $\mathbb{R}^D$, that is nearly supported on a $d$-dimensional set $\mathcal{M}$ - for example supported on a $d$-dimensional Riemannian manifold. Geometric Multi-Resolution Analysis (GMRA) provides a robust and computationally efficient procedure to construct low-d… ▽ More

    Submitted 18 July, 2017; v1 submitted 3 November, 2016; originally announced November 2016.

  38. arXiv:1506.03410  [pdf, other

    stat.ML cs.LG

    Sparse Projection Oblique Randomer Forests

    Authors: Tyler M. Tomita, James Browne, Cencheng Shen, Jaewon Chung, Jesse L. Patsolic, Benjamin Falk, Jason Yim, Carey E. Priebe, Randal Burns, Mauro Maggioni, Joshua T. Vogelstein

    Abstract: Decision forests, including Random Forests and Gradient Boosting Trees, have recently demonstrated state-of-the-art performance in a variety of machine learning settings. Decision forests are typically ensembles of axis-aligned decision trees; that is, trees that split only along feature dimensions. In contrast, many recent extensions to decision forests are based on axis-oblique splits. Unfortuna… ▽ More

    Submitted 3 October, 2019; v1 submitted 10 June, 2015; originally announced June 2015.

    Comments: 31 pages; submitted to Journal of Machine Learning Research for review

    MSC Class: 68T10 ACM Class: I.5.2

    Journal ref: Journal of Machine Learning Research 21(104), 1-39, 2020

  39. arXiv:1410.0719  [pdf, other

    math.NA cs.CV cs.IT cs.LG math.OC math.ST

    Proceedings of the second "international Traveling Workshop on Interactions between Sparse models and Technology" (iTWIST'14)

    Authors: L. Jacques, C. De Vleeschouwer, Y. Boursier, P. Sudhakar, C. De Mol, A. Pizurica, S. Anthoine, P. Vandergheynst, P. Frossard, C. Bilen, S. Kitic, N. Bertin, R. Gribonval, N. Boumal, B. Mishra, P. -A. Absil, R. Sepulchre, S. Bundervoet, C. Schretter, A. Dooms, P. Schelkens, O. Chabiron, F. Malgouyres, J. -Y. Tourneret, N. Dobigeon , et al. (42 additional authors not shown)

    Abstract: The implicit objective of the biennial "international - Traveling Workshop on Interactions between Sparse models and Technology" (iTWIST) is to foster collaboration between international scientific teams by disseminating ideas through both specific oral/poster presentations and free discussions. For its second edition, the iTWIST workshop took place in the medieval and picturesque town of Namur in… ▽ More

    Submitted 9 October, 2014; v1 submitted 2 October, 2014; originally announced October 2014.

    Comments: 69 pages, 24 extended abstracts, iTWIST'14 website: http://sites.google.com/site/itwist14

  40. arXiv:1212.1143  [pdf, other

    cs.AI eess.SY math.OC stat.ML

    Multiscale Markov Decision Problems: Compression, Solution, and Transfer Learning

    Authors: Jake Bouvrie, Mauro Maggioni

    Abstract: Many problems in sequential decision making and stochastic control often have natural multiscale structure: sub-tasks are assembled together to accomplish complex goals. Systematically inferring and leveraging hierarchical structure, particularly beyond a single level of abstraction, has remained a longstanding challenge. We describe a fast multiscale procedure for repeatedly compressing, or homog… ▽ More

    Submitted 5 December, 2012; originally announced December 2012.

    Comments: 86 pages, 15 figures

  41. arXiv:1204.3337  [pdf, ps, other

    cs.IT cs.DS

    Approximation of Points on Low-Dimensional Manifolds Via Random Linear Projections

    Authors: Mark A. Iwen, Mauro Maggioni

    Abstract: This paper considers the approximate reconstruction of points, x \in R^D, which are close to a given compact d-dimensional submanifold, M, of R^D using a small number of linear measurements of x. In particular, it is shown that a number of measurements of x which is independent of the extrinsic dimension D suffices for highly accurate reconstruction of a given x with high probability. Furthermore,… ▽ More

    Submitted 15 April, 2012; originally announced April 2012.

  42. arXiv:1105.4924  [pdf, ps, other

    math.MG cs.DS stat.ML

    Multiscale Geometric Methods for Data Sets II: Geometric Multi-Resolution Analysis

    Authors: William K. Allard, Guangliang Chen, Mauro Maggioni

    Abstract: Data sets are often modeled as point clouds in $R^D$, for $D$ large. It is often assumed that the data has some interesting low-dimensional structure, for example that of a $d$-dimensional manifold $M$, with $d$ much smaller than $D$. When $M$ is simply a linear subspace, one may exploit this assumption for encoding efficiently the data by projecting onto a dictionary of $d$ vectors in $R^D$ (for… ▽ More

    Submitted 7 September, 2011; v1 submitted 24 May, 2011; originally announced May 2011.

    Comments: Re-formatted using AMS style