Showing 1–50 of 86 results for author: Chen, R T

  1. arXiv:2406.04713  [pdf, other]

    cs.LG cond-mat.mtrl-sci cs.AI physics.comp-ph stat.ML

    FlowMM: Generating Materials with Riemannian Flow Matching

    Authors: Benjamin Kurt Miller, Ricky T. Q. Chen, Anuroop Sriram, Brandon M Wood

    Abstract: Crystalline materials are a fundamental component in next-generation technologies, yet modeling their distribution presents unique computational challenges. Of the plausible arrangements of atoms in a periodic lattice only a vanishingly small percentage are thermodynamically stable, which is a key indicator of the materials that can be experimentally realized. Two fundamental tasks in this area ar…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: https://github.com/facebookresearch/flowmm

    Journal ref: ICML 2024

  2. arXiv:2406.00288  [pdf, other]

    cs.LG stat.ML

    Neural Optimal Transport with Lagrangian Costs

    Authors: Aram-Alexandre Pooladian, Carles Domingo-Enrich, Ricky T. Q. Chen, Brandon Amos

    Abstract: We investigate the optimal transport problem between probability measures when the underlying cost function is understood to satisfy a least action principle, also known as a Lagrangian cost. These generalizations are useful when connecting observations from a physical system where the transport dynamics are influenced by the geometry of the system, such as obstacles (e.g., incorporating barrier f…

    Submitted 31 May, 2024; originally announced June 2024.

    Comments: UAI 2024

  3. arXiv:2405.04795  [pdf, other]

    cs.LG

    Variational Schrödinger Diffusion Models

    Authors: Wei Deng, Weijian Luo, Yixin Tan, Marin Biloš, Yu Chen, Yuriy Nevmyvaka, Ricky T. Q. Chen

    Abstract: Schrödinger bridge (SB) has emerged as the go-to method for optimizing transportation plans in diffusion models. However, SB requires estimating the intractable forward score functions, inevitably resulting in the costly implicit training loss based on simulated trajectories. To improve the scalability while preserving efficient transportation plans, we leverage variational inference to linearize…

    Submitted 19 June, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: ICML 2024

  4. arXiv:2404.08851  [pdf]

    physics.optics physics.app-ph

    Mid-infrared 2D nonredundant optical phased array of mirror emitters in an InGaAs/InP platform

    Authors: Jason Midkiff, Po-Yu Hsiao, Patrick T. Camp, Ray T. Chen

    Abstract: The extension of photonic technologies such as lidar and free-space optical communications from the traditional visible and near-infrared wavelengths to longer wavelengths can improve performance in adverse environments such as haze, fog, smoke, or strong solar background. Non-mechanical beam steerers will be a critical component of the low size, weight, and power modules needed for the portable o…

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: Main document: 16 pages, 11 figures; Supplement: 3 pages, 2 figures

  5. arXiv:2404.08764  [pdf, other]

    physics.chem-ph

    Leveraging Normalizing Flows for Orbital-Free Density Functional Theory

    Authors: Alexandre de Camargo, Ricky T. Q. Chen, Rodrigo A. Vargas-Hernández

    Abstract: Orbital-free density functional theory (OF-DFT) for real-space systems has historically depended on Lagrange optimization techniques, primarily due to the inability of previously proposed electron density ansatze to ensure the normalization constraint. This study illustrates how leveraging contemporary generative models, notably normalizing flows (NFs), can surmount this challenge. We pioneer a La…

    Submitted 18 April, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

    Comments: 6 pages, 4 Figures, (SI: 15 pages, 4 figures)

  6. arXiv:2403.14806  [pdf, other]

    cs.ET physics.app-ph physics.optics

    Photonic-Electronic Integrated Circuits for High-Performance Computing and AI Accelerators

    Authors: Shupeng Ning, Hanqing Zhu, Chenghao Feng, Jiaqi Gu, Zhixing Jiang, Zhoufeng Ying, Jason Midkiff, Sourabh Jain, May H. Hlaing, David Z. Pan, Ray T. Chen

    Abstract: In recent decades, the demand for computational power has surged, particularly with the rapid expansion of artificial intelligence (AI). As we navigate the post-Moore's law era, the limitations of traditional electrical digital computing, including process bottlenecks and power consumption issues, are propelling the search for alternative computing paradigms. Among various emerging technologies, i…

    Submitted 11 July, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

  7. arXiv:2403.01329  [pdf, other]

    cs.LG cs.AI cs.CV

    Bespoke Non-Stationary Solvers for Fast Sampling of Diffusion and Flow Models

    Authors: Neta Shaul, Uriel Singer, Ricky T. Q. Chen, Matthew Le, Ali Thabet, Albert Pumarola, Yaron Lipman

    Abstract: This paper introduces Bespoke Non-Stationary (BNS) Solvers, a solver distillation approach to improve sample efficiency of Diffusion and Flow models. BNS solvers are based on a family of non-stationary solvers that provably subsumes existing numerical ODE solvers and consequently demonstrate considerable improvement in sample approximation (PSNR) over these baselines. Compared to model distillatio…

    Submitted 2 March, 2024; originally announced March 2024.

  8. arXiv:2401.08390  [pdf, other]

    physics.data-an physics.ins-det physics.plasm-ph

    Physics-informed Meta-instrument for eXperiments (PiMiX) with applications to fusion energy

    Authors: Zhehui Wang, Shanny Lin, Miles Teng-Levy, Pinghan Chu, Bradley T. Wolfe, Chun-Shang Wong, Christopher S. Campbell, Xin Yue, Liyuan Zhang, Derek Aberle, Mariana Alvarado Alvarez, David Broughton, Ray T. Chen, Baolian Cheng, Feng Chu, Eric R. Fossum, Mark A. Foster, Chengkun Huang, Velat Kilic, Karl Krushelnick, Wenting Li, Eric Loomis, Thomas Schmidt Jr., Sky K. Sjue, Chris Tomkins, et al. (2 additional authors not shown)

    Abstract: Data-driven methods (DDMs), such as deep neural networks, offer a generic approach to integrated data analysis (IDA), integrated diagnostic-to-control (IDC) workflows through data fusion (DF), which includes multi-instrument data fusion (MIDF), multi-experiment data fusion (MXDF), and simulation-experiment data fusion (SXDF). These features make DDMs attractive to nuclear fusion energy and power p…

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: 24 pages, 46 references, and 13 Figures. Manuscript extended a recent presentation in the 29th IAEA Fusion Energy Conference (FEC), London, UK, Oct. 16 - 21, 2023

    Report number: Los Alamos National Laboratory report number LA-UR-24-20196

  9. arXiv:2401.03228  [pdf, other]

    stat.ML cs.LG

    Reflected Schrödinger Bridge for Constrained Generative Modeling

    Authors: Wei Deng, Yu Chen, Nicole Tianjiao Yang, Hengrong Du, Qi Feng, Ricky T. Q. Chen

    Abstract: Diffusion models have become the go-to method for large-scale generative models in real-world applications. These applications often involve data distributions confined within bounded domains, typically requiring ad-hoc thresholding techniques for boundary enforcement. Reflected diffusion models (Lou23) aim to enhance generalizability by generating the data distribution through a backward process…

    Submitted 6 January, 2024; originally announced January 2024.

  10. arXiv:2312.05250  [pdf, other]

    cs.LG cs.AI math.OC stat.ML

    TaskMet: Task-Driven Metric Learning for Model Learning

    Authors: Dishank Bansal, Ricky T. Q. Chen, Mustafa Mukadam, Brandon Amos

    Abstract: Deep learning models are often deployed in downstream tasks that the training procedure may not be aware of. For example, models solely trained to achieve accurate predictions may struggle to perform well on downstream tasks because seemingly small prediction errors may incur drastic task errors. The standard end-to-end learning approach is to make the task loss differentiable or to introduce a di…

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2023

  11. arXiv:2312.02027  [pdf, other]

    math.OC cs.LG math.NA math.PR stat.ML

    Stochastic Optimal Control Matching

    Authors: Carles Domingo-Enrich, Jiequn Han, Brandon Amos, Joan Bruna, Ricky T. Q. Chen

    Abstract: Stochastic optimal control, which has the goal of driving the behavior of noisy systems, is broadly applicable in science, engineering and artificial intelligence. Our work introduces Stochastic Optimal Control Matching (SOCM), a novel Iterative Diffusion Optimization (IDO) technique for stochastic optimal control that stems from the same philosophy as the conditional score matching loss for diffu…

    Submitted 28 June, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

  12. arXiv:2311.13518  [pdf, other]

    physics.chem-ph quant-ph

    Orbital-Free Density Functional Theory with Continuous Normalizing Flows

    Authors: Alexandre de Camargo, Ricky T. Q. Chen, Rodrigo A. Vargas-Hernández

    Abstract: Orbital-free density functional theory (OF-DFT) provides an alternative approach for calculating the molecular electronic energy, relying solely on the electron density. In OF-DFT, the ground-state density is optimized variationally to minimize the total energy functional while satisfying the normalization constraint. In this work, we introduce a novel approach by parameterizing the electroni…

    Submitted 22 November, 2023; originally announced November 2023.

    Comments: 6 pages, 3 figures

  13. arXiv:2311.13443  [pdf, other]

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Guided Flows for Generative Modeling and Decision Making

    Authors: Qinqing Zheng, Matt Le, Neta Shaul, Yaron Lipman, Aditya Grover, Ricky T. Q. Chen

    Abstract: Classifier-free guidance is a key component for enhancing the performance of conditional generative models across diverse tasks. While it has previously demonstrated remarkable improvements for the sample quality, it has only been exclusively employed for diffusion models. In this paper, we integrate classifier-free guidance into Flow Matching (FM) models, an alternative simulation-free approach t…

    Submitted 7 December, 2023; v1 submitted 22 November, 2023; originally announced November 2023.

  14. arXiv:2311.05726  [pdf, other]

    physics.ins-det cs.LG physics.data-an

    Neural Network Methods for Radiation Detectors and Imaging

    Authors: S. Lin, S. Ning, H. Zhu, T. Zhou, C. L. Morris, S. Clayton, M. Cherukara, R. T. Chen, Z. Wang

    Abstract: Recent advances in image data processing through machine learning and especially deep neural networks (DNNs) allow for new optimization and performance-enhancement schemes for radiation detectors and imaging hardware through data-endowed artificial intelligence. We give an overview of data generation at photon sources, deep learning-based methods for image processing tasks, and hardware solutions…

    Submitted 9 November, 2023; originally announced November 2023.

    Report number: LA-UR-23-32395

  15. arXiv:2310.19075  [pdf, other]

    cs.LG cs.AI cs.CV

    Bespoke Solvers for Generative Flow Models

    Authors: Neta Shaul, Juan Perez, Ricky T. Q. Chen, Ali Thabet, Albert Pumarola, Yaron Lipman

    Abstract: Diffusion or flow-based models are powerful generative paradigms that are notoriously hard to sample as samples are defined as solutions to high-dimensional Ordinary or Stochastic Differential Equations (ODEs/SDEs) which require a large Number of Function Evaluations (NFE) to approximate well. Existing methods to alleviate the costly sampling process include model distillation and designing dedica…

    Submitted 29 October, 2023; originally announced October 2023.

  16. arXiv:2310.04432  [pdf, other]

    cs.CV cs.AI cs.LG

    Training-free Linear Image Inverses via Flows

    Authors: Ashwini Pokle, Matthew J. Muckley, Ricky T. Q. Chen, Brian Karrer

    Abstract: Solving inverse problems without any training involves using a pretrained generative model and making appropriate modifications to the generation process to avoid finetuning of the generative model. While recent methods have explored the use of diffusion models, they still require the manual tuning of many hyperparameters for different inverse problems. In this work, we propose a training-free met…

    Submitted 10 March, 2024; v1 submitted 25 September, 2023; originally announced October 2023.

    Comments: 40 pages, 30 figures. Added additional qualitative results in the appendix

  17. arXiv:2310.02679  [pdf, other]

    cs.LG cs.AI stat.CO stat.ME stat.ML

    Diffusion Generative Flow Samplers: Improving learning signals through partial trajectory optimization

    Authors: Dinghuai Zhang, Ricky T. Q. Chen, Cheng-Hao Liu, Aaron Courville, Yoshua Bengio

    Abstract: We tackle the problem of sampling from intractable high-dimensional density functions, a fundamental task that often appears in machine learning and statistics. We extend recent sampling-based approaches that leverage controlled stochastic processes to model approximate samples from these target densities. The main drawback of these approaches is that the training objective requires full trajector…

    Submitted 9 March, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: Accepted by ICLR 2024

  18. arXiv:2310.02233  [pdf, other]

    stat.ML cs.LG math.OC

    Generalized Schrödinger Bridge Matching

    Authors: Guan-Horng Liu, Yaron Lipman, Maximilian Nickel, Brian Karrer, Evangelos A. Theodorou, Ricky T. Q. Chen

    Abstract: Modern distribution matching algorithms for training diffusion or flow models directly prescribe the time evolution of the marginal distributions between two boundary distributions. In this work, we consider a generalized distribution matching setup, where these marginals are only implicitly described as a solution to some task-specific objective function. The problem setup, known as the Generaliz…

    Submitted 18 April, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: ICLR 2024 Camera Ready

  19. arXiv:2306.06626  [pdf, other]

    cs.LG stat.ML

    On Kinetic Optimal Probability Paths for Generative Models

    Authors: Neta Shaul, Ricky T. Q. Chen, Maximilian Nickel, Matt Le, Yaron Lipman

    Abstract: Recent successful generative models are trained by fitting a neural network to an a-priori defined tractable probability density path taking noise to training examples. In this paper we investigate the space of Gaussian probability paths, which includes diffusion paths as an instance, and look for an optimal member in some useful sense. In particular, minimizing the Kinetic Energy (KE) of a path i…

    Submitted 11 June, 2023; originally announced June 2023.

  20. arXiv:2305.19592  [pdf]

    physics.optics cs.AI cs.AR cs.ET

    Integrated multi-operand optical neurons for scalable and hardware-efficient deep learning

    Authors: Chenghao Feng, Jiaqi Gu, Hanqing Zhu, Rongxing Tang, Shupeng Ning, May Hlaing, Jason Midkiff, Sourabh Jain, David Z. Pan, Ray T. Chen

    Abstract: The optical neural network (ONN) is a promising hardware platform for next-generation neuromorphic computing due to its high parallelism, low latency, and low energy consumption. However, previous integrated photonic tensor cores (PTCs) consume numerous single-operand optical modulators for signal and weight encoding, leading to large area costs and high propagation loss to implement large tensor…

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: 19 pages, 10 figures

  21. arXiv:2305.19533  [pdf, other]

    cs.ET cs.AR physics.optics

    Lightening-Transformer: A Dynamically-operated Optically-interconnected Photonic Transformer Accelerator

    Authors: Hanqing Zhu, Jiaqi Gu, Hanrui Wang, Zixuan Jiang, Zhekai Zhang, Rongxing Tang, Chenghao Feng, Song Han, Ray T. Chen, David Z. Pan

    Abstract: The wide adoption and significant computing resource of attention-based transformers, e.g., Vision Transformers and large language models (LLM), have driven the demand for efficient hardware accelerators. There is a growing interest in exploring photonics as an alternative technology to digital electronics due to its high energy efficiency and ultra-fast processing speed. Photonic accelerators hav…

    Submitted 31 December, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: Published as a conference paper in HPCA 2024. Received the Reproducibility Badges at IEEE. Our implementation is available at https://github.com/zhuhanqing/Lightening-Transformer

  22. arXiv:2305.19505  [pdf, other]

    cs.ET cs.LG physics.optics

    M3ICRO: Machine Learning-Enabled Compact Photonic Tensor Core based on PRogrammable Multi-Operand Multimode Interference

    Authors: Jiaqi Gu, Hanqing Zhu, Chenghao Feng, Zixuan Jiang, Ray T. Chen, David Z. Pan

    Abstract: Photonic computing shows promise for transformative advancements in machine learning (ML) acceleration, offering ultra-fast speed, massive parallelism, and high energy efficiency. However, current photonic tensor core (PTC) designs based on standard optical components hinder scalability and compute density due to their large spatial footprint. To address this, we propose an ultra-compact PTC using…

    Submitted 28 December, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: 12 pages. Accepted to APL Machine Learning 2023

  23. arXiv:2304.14772  [pdf, other]

    cs.LG

    Multisample Flow Matching: Straightening Flows with Minibatch Couplings

    Authors: Aram-Alexandre Pooladian, Heli Ben-Hamu, Carles Domingo-Enrich, Brandon Amos, Yaron Lipman, Ricky T. Q. Chen

    Abstract: Simulation-free methods for training continuous-time generative models construct probability paths that go between noise distributions and individual data samples. Recent works, such as Flow Matching, derived paths that are optimal for each data sample. However, these algorithms rely on independent data and noise samples, and do not exploit underlying structure in the data distribution for constru…

    Submitted 24 May, 2023; v1 submitted 28 April, 2023; originally announced April 2023.

  24. arXiv:2302.05793  [pdf, other]

    cs.LG cs.AI stat.CO stat.ML

    Distributional GFlowNets with Quantile Flows

    Authors: Dinghuai Zhang, Ling Pan, Ricky T. Q. Chen, Aaron Courville, Yoshua Bengio

    Abstract: Generative Flow Networks (GFlowNets) are a new family of probabilistic samplers where an agent learns a stochastic policy for generating complex combinatorial structure through a series of decision-making steps. Despite being inspired from reinforcement learning, the current GFlowNet framework is relatively limited in its applicability and cannot handle stochasticity in the reward function. In thi…

    Submitted 17 February, 2024; v1 submitted 11 February, 2023; originally announced February 2023.

    Comments: Accepted by TMLR

  25. arXiv:2302.03660  [pdf, other]

    cs.LG cs.AI stat.ML

    Flow Matching on General Geometries

    Authors: Ricky T. Q. Chen, Yaron Lipman

    Abstract: We propose Riemannian Flow Matching (RFM), a simple yet powerful framework for training continuous normalizing flows on manifolds. Existing methods for generative modeling on manifolds either require expensive simulation, are inherently unable to scale to high dimensions, or use approximations for limiting quantities that result in biased training objectives. Riemannian Flow Matching bypasses thes…

    Submitted 26 February, 2024; v1 submitted 7 February, 2023; originally announced February 2023.

    Journal ref: ICLR 2024

  26. arXiv:2301.04754  [pdf]

    physics.med-ph physics.ins-det physics.optics

    A Point-of-Care Biosensor for Rapid Detection and Differentiation of COVID-19 Virus (SARS-CoV-2) and Influenza Virus Using Subwavelength Grating Micro-ring Resonator

    Authors: Shupeng Ning, Hao-Chen Chang, Kang-Chieh Fan, Po-yu Hsiao, Chenghao Feng, Devan Shoemaker, Ray T. Chen

    Abstract: In the context of continued spread of coronavirus disease 2019 (COVID-19) caused by SARS-CoV-2 and the emergence of new variants, the demand for rapid, accurate, and frequent detection is increasing. Besides, the new predominant strain, Omicron variant, manifests more similar clinical features to those of other common respiratory infections. The concurrent detection of multiple potential pathogens…

    Submitted 11 January, 2023; originally announced January 2023.

  27. arXiv:2212.13659  [pdf, other]

    cs.LG stat.ML

    Latent Discretization for Continuous-time Sequence Compression

    Authors: Ricky T. Q. Chen, Matthew Le, Matthew Muckley, Maximilian Nickel, Karen Ullrich

    Abstract: Neural compression offers a domain-agnostic approach to creating codecs for lossy or lossless compression via deep generative models. For sequence compression, however, most deep sequence models have costs that scale with the sequence length rather than the sequence complexity. In this work, we instead treat data sequences as observations from an underlying continuous-time process and learn how to…

    Submitted 27 December, 2022; originally announced December 2022.

  28. arXiv:2210.02747  [pdf, other]

    cs.LG cs.AI stat.ML

    Flow Matching for Generative Modeling

    Authors: Yaron Lipman, Ricky T. Q. Chen, Heli Ben-Hamu, Maximilian Nickel, Matt Le

    Abstract: We introduce a new paradigm for generative modeling built on Continuous Normalizing Flows (CNFs), allowing us to train CNFs at unprecedented scale. Specifically, we present the notion of Flow Matching (FM), a simulation-free approach for training CNFs based on regressing vector fields of fixed conditional probability paths. Flow Matching is compatible with a general family of Gaussian probability…

    Submitted 8 February, 2023; v1 submitted 6 October, 2022; originally announced October 2022.

  29. arXiv:2210.01741  [pdf, other]

    cs.LG

    Neural Conservation Laws: A Divergence-Free Perspective

    Authors: Jack Richter-Powell, Yaron Lipman, Ricky T. Q. Chen

    Abstract: We investigate the parameterization of deep neural networks that by design satisfy the continuity equation, a fundamental conservation law. This is enabled by the observation that any solution of the continuity equation can be represented as a divergence-free vector field. We hence propose building divergence-free neural networks through the concept of differential forms, and with the aid of autom…

    Submitted 11 December, 2022; v1 submitted 4 October, 2022; originally announced October 2022.

    Journal ref: NeurIPS 2022

  30. arXiv:2210.00999  [pdf, other]

    cs.LG cs.AI stat.ML

    Latent State Marginalization as a Low-cost Approach for Improving Exploration

    Authors: Dinghuai Zhang, Aaron Courville, Yoshua Bengio, Qinqing Zheng, Amy Zhang, Ricky T. Q. Chen

    Abstract: While the maximum entropy (MaxEnt) reinforcement learning (RL) framework -- often touted for its exploration and robustness capabilities -- is usually motivated from a probabilistic perspective, the use of deep probabilistic models has not gained much traction in practice due to their inherent complexity. In this work, we propose the adoption of latent variable policies within the MaxEnt framework…

    Submitted 10 February, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: Accepted by ICLR 2023

  31. arXiv:2209.10098  [pdf, other]

    cs.ET cs.LG physics.optics

    NeurOLight: A Physics-Agnostic Neural Operator Enabling Parametric Photonic Device Simulation

    Authors: Jiaqi Gu, Zhengqi Gao, Chenghao Feng, Hanqing Zhu, Ray T. Chen, Duane S. Boning, David Z. Pan

    Abstract: Optical computing is an emerging technology for next-generation efficient artificial intelligence (AI) due to its ultra-high speed and efficiency. Electromagnetic field simulation is critical to the design, optimization, and validation of photonic devices and circuits. However, costly numerical simulation significantly hinders the scalability and turn-around time in the photonic circuit design loo…

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: 13 pages. Accepted to NeurIPS 2022

  32. arXiv:2209.02606  [pdf, other]

    cs.LG cs.AI stat.ML

    Unifying Generative Models with GFlowNets and Beyond

    Authors: Dinghuai Zhang, Ricky T. Q. Chen, Nikolay Malkin, Yoshua Bengio

    Abstract: There are many frameworks for deep generative modeling, each often presented with their own specific training algorithms and inference methods. Here, we demonstrate the connections between existing deep generative models and the recently introduced GFlowNet framework, a probabilistic inference machine which treats sampling as a decision-making process. This analysis sheds light on their overlappin…

    Submitted 30 January, 2023; v1 submitted 6 September, 2022; originally announced September 2022.

    Comments: expanded version of the ICML 2022 workshop paper

  33. arXiv:2207.09442  [pdf, other]

    cs.RO cs.CV cs.LG math.OC

    Theseus: A Library for Differentiable Nonlinear Optimization

    Authors: Luis Pineda, Taosha Fan, Maurizio Monge, Shobha Venkataraman, Paloma Sodhi, Ricky T. Q. Chen, Joseph Ortiz, Daniel DeTone, Austin Wang, Stuart Anderson, Jing Dong, Brandon Amos, Mustafa Mukadam

    Abstract: We present Theseus, an efficient application-agnostic open source library for differentiable nonlinear least squares (DNLS) optimization built on PyTorch, providing a common framework for end-to-end structured learning in robotics and vision. Existing DNLS implementations are application specific and do not always incorporate many ingredients important for efficiency. Theseus is application-agnost…

    Submitted 18 January, 2023; v1 submitted 19 July, 2022; originally announced July 2022.

    Comments: Advances in Neural Information Processing Systems (NeurIPS), 2022

  34. arXiv:2207.07754  [pdf]

    physics.optics physics.app-ph

    Lab-on-a-Chip Optical Biosensor Platform: Micro Ring Resonator Integrated with Near-Infrared Fourier Transform Spectrometer

    Authors: Kyoung Min Yoo, May Hlaing, Sourabh Jain, James Fan, Yue An, Ray T. Chen

    Abstract: A micro-ring-resonator (MRR) optical biosensor based on the evanescent field sensing mechanism has been extensively studied due to its high sensitivity and compact device size. However, a suitable on-chip integrated spectrometer device has to be demonstrated for the lab-on-a-chip applications, which can read the resonance wavelength shift from MRR biosensors based on minuscule changes in refractiv…

    Submitted 15 July, 2022; originally announced July 2022.

    Comments: 23 pages, 9 figures including supplementary

  35. arXiv:2207.04711  [pdf, other]

    stat.ML cs.LG

    Matching Normalizing Flows and Probability Paths on Manifolds

    Authors: Heli Ben-Hamu, Samuel Cohen, Joey Bose, Brandon Amos, Aditya Grover, Maximilian Nickel, Ricky T. Q. Chen, Yaron Lipman

    Abstract: Continuous Normalizing Flows (CNFs) are a class of generative models that transform a prior distribution to a model distribution by solving an ordinary differential equation (ODE). We propose to train CNFs on manifolds by minimizing probability path divergence (PPD), a novel family of divergences between the probability density path generated by the CNF and a target probability density path. PPD i…

    Submitted 11 July, 2022; originally announced July 2022.

    Comments: ICML 2022

  36. arXiv:2203.06832  [pdf, other]

    cs.LG stat.ML

    Semi-Discrete Normalizing Flows through Differentiable Tessellation

    Authors: Ricky T. Q. Chen, Brandon Amos, Maximilian Nickel

    Abstract: Mapping between discrete and continuous distributions is a difficult task and many have had to resort to heuristical approaches. We propose a tessellation-based approach that directly learns quantization boundaries in a continuous space, complete with exact likelihood evaluations. This is done through constructing normalizing flows on convex polytopes parameterized using a simple homeomorphism wit…

    Submitted 11 December, 2022; v1 submitted 13 March, 2022; originally announced March 2022.

    Journal ref: NeurIPS 2022

  37. arXiv:2201.08248  [pdf]

    physics.app-ph physics.optics

    Packaging-enhanced optical fiber-chip interconnect with enlarged grating coupler and multimode fiber

    Authors: Chao Wang, Chingwen Chang, Jason Midkiff, Aref Asghari, James Fan, Jianying Zhou, Xiaochuan Xu, Huiping Tian, Ray T. Chen

    Abstract: Optical I/O plays a crucial role in the lifespan of lab-on-a-chip systems, from preliminary testing to operation in the target environment. However, due to the precise alignments required, efficient and reliable fiber-to-chip connections remain challenging, yielding inconsistent test results and unstable packaged performance. To overcome this issue, for use in single mode on-chip systems, we propo…

    Submitted 18 January, 2022; originally announced January 2022.

    Comments: 11 pages, 5 figures

  38. arXiv:2112.08703  [pdf, other]

    cs.ET physics.optics

    ADEPT: Automatic Differentiable DEsign of Photonic Tensor Cores

    Authors: Jiaqi Gu, Hanqing Zhu, Chenghao Feng, Zixuan Jiang, Mingjie Liu, Shuhan Zhang, Ray T. Chen, David Z. Pan

    Abstract: Photonic tensor cores (PTCs) are essential building blocks for optical artificial intelligence (AI) accelerators based on programmable photonic integrated circuits. PTCs can achieve ultra-fast and efficient tensor operations for neural network (NN) acceleration. Current PTC designs are either manually constructed or based on matrix decomposition theory, which lacks the adaptability to meet various…

    Submitted 3 May, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

    Comments: Accepted to ACM/IEEE Design Automation Conference (DAC), 2022

  39. arXiv:2112.08512  [pdf, other]

    cs.ET cs.LG physics.optics

    ELight: Enabling Efficient Photonic In-Memory Neurocomputing with Life Enhancement

    Authors: Hanqing Zhu, Jiaqi Gu, Chenghao Feng, Mingjie Liu, Zixuan Jiang, Ray T. Chen, David Z. Pan

    Abstract: With the recent advances in optical phase change material (PCM), photonic in-memory neurocomputing has demonstrated its superiority in optical neural network (ONN) designs with near-zero static power consumption, time-of-light latency, and compact footprint. However, photonic tensor cores require massive hardware reuse to implement large matrix multiplication due to the limited single-core scale.…

    Submitted 15 December, 2021; originally announced December 2021.

    Comments: 7 pages, 8 figures, accepted by ASPDAC 2022

  40. arXiv:2112.07027  [pdf]

    physics.optics physics.app-ph physics.ins-det

    Dual-Polarization Bandwidth-Bridged On-Chip Bandpass Sampling Fourier Transform Spectrometer from Visible to Near-Infrared

    Authors: Kyoung Min Yoo, Ray T. Chen

    Abstract: The on-chip broadband optical spectrometers which cover the entire tissue transparency window (λ=650-1050 nm) with high resolution are highly demanded for the miniaturized bio-sensing and bio-imaging applications. Here, we propose a novel type of spatial heterodyne Fourier transform spectrometer (SHFTS) integrated with a sub-wavelength grating coupler (SWGC) for the dual-polarization bandpass samp…

    Submitted 13 December, 2021; originally announced December 2021.

    Comments: 48 pages, 6 figures, 14 supportive figures

  41. arXiv:2111.06705  [pdf]

    cs.ET cs.LG physics.app-ph physics.optics

    A compact butterfly-style silicon photonic-electronic neural chip for hardware-efficient deep learning

    Authors: Chenghao Feng, Jiaqi Gu, Hanqing Zhu, Zhoufeng Ying, Zheng Zhao, David Z. Pan, Ray T. Chen

    Abstract: The optical neural network (ONN) is a promising hardware platform for next-generation neurocomputing due to its high parallelism, low latency, and low energy consumption. Previous ONN architectures are mainly designed for general matrix multiplication (GEMM), leading to unnecessarily large area cost and high control complexity. Here, we move beyond classical GEMM-based ONNs and propose an optical…

    Submitted 17 July, 2022; v1 submitted 11 November, 2021; originally announced November 2021.

    Comments: 17 pages, 5 figures

  42. arXiv:2110.14807  [pdf, other]

    cs.LG cs.ET physics.optics

    L2ight: Enabling On-Chip Learning for Optical Neural Networks via Efficient in-situ Subspace Optimization

    Authors: Jiaqi Gu, Hanqing Zhu, Chenghao Feng, Zixuan Jiang, Ray T. Chen, David Z. Pan

    Abstract: Silicon-photonics-based optical neural network (ONN) is a promising hardware platform that could represent a paradigm shift in efficient AI with its CMOS-compatibility, flexibility, ultra-low execution latency, and high energy efficiency. In-situ training on the online programmable photonic chips is appealing but still encounters challenging issues in on-chip implementability, scalability, and eff…

    Submitted 27 October, 2021; originally announced October 2021.

    Comments: 10 pages. Accepted to NeurIPS 2021

  43. arXiv:2108.11430  [pdf, other]

    cs.LG cs.ET

    Towards Memory-Efficient Neural Networks via Multi-Level in situ Generation

    Authors: Jiaqi Gu, Hanqing Zhu, Chenghao Feng, Mingjie Liu, Zixuan Jiang, Ray T. Chen, David Z. Pan

    Abstract: Deep neural networks (DNN) have shown superior performance in a variety of tasks. As they rapidly evolve, their escalating computation and memory demands make it challenging to deploy them on resource-constrained edge devices. Though extensive efficient accelerator designs, from traditional electronics to emerging photonics, have been successfully demonstrated, they are still bottlenecked by expen…

    Submitted 5 September, 2021; v1 submitted 25 August, 2021; originally announced August 2021.

    Comments: Accepted by International Conference on Computer Vision (ICCV) 2021

  44. arXiv:2103.12604  [pdf, other]

    quant-ph physics.chem-ph physics.comp-ph

    Fully differentiable optimization protocols for non-equilibrium steady states

    Authors: Rodrigo A. Vargas-Hernández, Ricky T. Q. Chen, Kenneth A. Jung, Paul Brumer

    Abstract: In the case of quantum systems interacting with multiple environments, the time-evolution of the reduced density matrix is described by the Liouvillian. For a variety of physical observables, the long-time limit or steady state solution is needed for the computation of desired physical observables. For inverse design or optimal control of such systems, the common approaches are based on brute-forc…

    Submitted 23 November, 2021; v1 submitted 23 March, 2021; originally announced March 2021.

    Comments: Main work 10 pages and 5 Figures. Supplemental Material 12 pages, 1 Figure, 3 Tables

  45. arXiv:2102.06559  [pdf, other]

    stat.ML cs.LG

    Infinitely Deep Bayesian Neural Networks with Stochastic Differential Equations

    Authors: Winnie Xu, Ricky T. Q. Chen, Xuechen Li, David Duvenaud

    Abstract: We perform scalable approximate inference in continuous-depth Bayesian neural networks. In this model class, uncertainty about separate weights in each layer gives hidden units that follow a stochastic differential equation. We demonstrate gradient-based stochastic variational inference in this infinite-parameter setting, producing arbitrarily-flexible approximate posteriors. We also derive a nove…

    Submitted 30 January, 2022; v1 submitted 12 February, 2021; originally announced February 2021.

  46. arXiv:2012.11148  [pdf, other]

    cs.ET cs.LG physics.optics

    Efficient On-Chip Learning for Optical Neural Networks Through Power-Aware Sparse Zeroth-Order Optimization

    Authors: Jiaqi Gu, Chenghao Feng, Zheng Zhao, Zhoufeng Ying, Ray T. Chen, David Z. Pan

    Abstract: Optical neural networks (ONNs) have demonstrated record-breaking potential in high-performance neuromorphic computing due to their ultra-high execution speed and low energy consumption. However, current learning protocols fail to provide scalable and efficient solutions to photonic circuit optimization in practical applications. In this work, we propose a novel on-chip learning framework to releas…

    Submitted 5 September, 2021; v1 submitted 21 December, 2020; originally announced December 2020.

    Comments: 7 pages of content, 2 pages of references, 6 figures, 4 tables, accepted to Association for the Advancement of Artificial Intelligence (AAAI) 2021

  47. arXiv:2012.05942  [pdf, other]

    cs.LG math.OC

    Convex Potential Flows: Universal Probability Distributions with Optimal Transport and Convex Optimization

    Authors: Chin-Wei Huang, Ricky T. Q. Chen, Christos Tsirigotis, Aaron Courville

    Abstract: Flow-based models are powerful tools for designing probabilistic models with tractable density. This paper introduces Convex Potential Flows (CP-Flow), a natural and efficient parameterization of invertible models inspired by the optimal transport (OT) theory. CP-Flows are the gradient map of a strongly convex neural potential function. The convexity implies invertibility and allows us to resort t…

    Submitted 23 February, 2021; v1 submitted 10 December, 2020; originally announced December 2020.

  48. arXiv:2011.12808  [pdf, other]

    quant-ph physics.chem-ph physics.comp-ph

    Inverse design of dissipative quantum steady-states with implicit differentiation

    Authors: Rodrigo A. Vargas-Hernández, Ricky T. Q. Chen, Kenneth A. Jung, Paul Brumer

    Abstract: Inverse design of a property that depends on the steady-state of an open quantum system is commonly done by grid-search type of methods. In this paper we present a new methodology that allows us to compute the gradient of the steady-state of an open quantum system with respect to any parameter of the Hamiltonian using the implicit differentiation theorem. As an example, we present a simulation of…

    Submitted 25 November, 2020; originally announced November 2020.

    Comments: 6 pages, 2 figures, accepted for publication in Third Workshop on Machine Learning and the Physical Sciences (NeurIPS 2020), Vancouver, Canada

  49. arXiv:2011.04803  [pdf, other]

    cs.LG

    Self-Tuning Stochastic Optimization with Curvature-Aware Gradient Filtering

    Authors: Ricky T. Q. Chen, Dami Choi, Lukas Balles, David Duvenaud, Philipp Hennig

    Abstract: Standard first-order stochastic optimization algorithms base their updates solely on the average mini-batch gradient, and it has been shown that tracking additional quantities such as the curvature can help de-sensitize common hyperparameters. Based on this intuition, we explore the use of exact per-sample Hessian-vector products and gradients to construct optimizers that are self-tuning and hyper…

    Submitted 9 November, 2020; originally announced November 2020.

  50. arXiv:2011.04583  [pdf, other]

    cs.LG

    Neural Spatio-Temporal Point Processes

    Authors: Ricky T. Q. Chen, Brandon Amos, Maximilian Nickel

    Abstract: We propose a new class of parameterizations for spatio-temporal point processes which leverage Neural ODEs as a computational method and enable flexible, high-fidelity models of discrete events that are localized in continuous time and space. Central to our approach is a combination of continuous-time neural networks with two novel neural architectures, i.e., Jump and Attentive Continuous-time Nor…

    Submitted 17 March, 2021; v1 submitted 9 November, 2020; originally announced November 2020.

    Journal ref: ICLR 2021