-
Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs
Authors:
Hao-Tien Lewis Chiang,
Zhuo Xu,
Zipeng Fu,
Mithun George Jacob,
Tingnan Zhang,
Tsang-Wei Edward Lee,
Wenhao Yu,
Connor Schenck,
David Rendleman,
Dhruv Shah,
Fei Xia,
Jasmine Hsu,
Jonathan Hoech,
Pete Florence,
Sean Kirmani,
Sumeet Singh,
Vikas Sindhwani,
Carolina Parada,
Chelsea Finn,
Peng Xu,
Sergey Levine,
Jie Tan
Abstract:
An elusive goal in navigation research is to build an intelligent agent that can understand multimodal instructions, including natural language and images, and perform useful navigation. To achieve this, we study a widely useful category of navigation tasks we call Multimodal Instruction Navigation with demonstration Tours (MINT), in which the environment prior is provided through a previously recorded demonstration video. Recent advances in Vision Language Models (VLMs) have shown a promising path toward this goal, as they demonstrate capabilities in perceiving and reasoning about multimodal inputs. However, VLMs are typically trained to predict textual output, and how best to utilize them in navigation is an open research question. To solve MINT, we present Mobility VLA, a hierarchical Vision-Language-Action (VLA) navigation policy that combines the environment understanding and common sense reasoning power of long-context VLMs with a robust low-level navigation policy based on topological graphs. The high-level policy consists of a long-context VLM that takes the demonstration tour video and the multimodal user instruction as input to find the goal frame in the tour video. Next, a low-level policy uses the goal frame and an offline constructed topological graph to generate robot actions at every timestep. We evaluated Mobility VLA in an 836m^2 real-world environment and show that Mobility VLA has high end-to-end success rates on previously unsolved multimodal instructions such as "Where should I return this?" while holding a plastic bin. A video demonstrating Mobility VLA can be found here: https://youtu.be/-Tof__Q8_5s
Submitted 12 July, 2024; v1 submitted 10 July, 2024;
originally announced July 2024.
-
Drone-Based Antenna Beam Calibration in the High Arctic
Authors:
Lawrence Herman,
Christopher Barbarie,
Mohan Agrawal,
Vlad Calinescu,
Simon Chen,
H. Cynthia Chiang,
Cherie K. Day,
Eamon Egan,
Stephen Fay,
Kit Gerodias,
Maya Goss,
Michael Hétu,
Daniel C. Jacobs,
Marc-Olivier R. Lalonde,
Francis McGee,
Loïc Miara,
John Orlowski-Scherer,
Jonathan Sievers
Abstract:
The development of low-frequency radio astronomy experiments for detecting 21-cm line emission from hydrogen presents new opportunities for creative solutions to the challenge of characterizing an antenna beam pattern. The Array of Long Baseline Antennas for Taking Radio Observations from the Seventy-ninth parallel (ALBATROS) is a new radio interferometer sited in the Canadian high Arctic that aims to map Galactic foregrounds at frequencies below $\sim$30 MHz. We present PteroSoar, a custom-built hexacopter outfitted with a transmitter, that will be used to characterize the beam patterns of ALBATROS and other experiments. The PteroSoar drone hardware is motivated by the need for user-servicing at remote sites and environmental factors that are unique to the high Arctic. In particular, magnetic heading is unreliable because the magnetic field lines near the north pole are almost vertical. We therefore implement moving baseline real time kinematic (RTK) positioning with two GPS units to obtain heading solutions with $\sim$1$^\circ$ accuracy. We present a preliminary beam map of an ALBATROS antenna, thus demonstrating successful PteroSoar operation in the high Arctic.
Submitted 30 June, 2024;
originally announced July 2024.
-
Feature Aggregation with Latent Generative Replay for Federated Continual Learning of Socially Appropriate Robot Behaviours
Authors:
Nikhil Churamani,
Saksham Checker,
Hao-Tien Lewis Chiang,
Hatice Gunes
Abstract:
For widespread real-world applications, it is beneficial for robots to explore Federated Learning (FL) settings where several robots, deployed in parallel, can learn independently while also sharing their learning with each other. This work explores a simulated living room environment where robots need to learn the social appropriateness of their actions. We propose Federated Root (FedRoot), a novel weight aggregation strategy which disentangles feature learning across clients from individual task-based learning. Adapting popular FL strategies to use FedRoot instead, we present a novel FL benchmark for learning the social appropriateness of different robot actions in diverse social configurations. FedRoot-based methods offer competitive performance compared to others while offering sizeable (up to 86% for CPU usage and up to 72% for GPU usage) reduction in resource consumption. Furthermore, real-world interactions require social robots to dynamically adapt to changing environmental and task settings. To facilitate this, we propose Federated Latent Generative Replay (FedLGR), a novel Federated Continual Learning (FCL) strategy that uses FedRoot-based weight aggregation and embeds each client with a generator model for pseudo-rehearsal of learnt feature embeddings to mitigate forgetting in a resource-efficient manner. Our benchmark results demonstrate that FedRoot-based FCL methods outperform other methods while also offering sizeable (up to 84% for CPU usage and up to 92% for GPU usage) reduction in resource consumption, with FedLGR providing the best results across evaluations.
Submitted 16 March, 2024;
originally announced May 2024.
-
First Constraints on the Epoch of Reionization Using the non-Gaussianity of the Kinematic Sunyaev-Zel'dovich Effect from the South Pole Telescope and Herschel-SPIRE Observations
Authors:
S. Raghunathan,
P. A. R. Ade,
A. J. Anderson,
B. Ansarinejad,
M. Archipley,
J. E. Austermann,
L. Balkenhol,
J. A. Beall,
K. Benabed,
A. N. Bender,
B. A. Benson,
F. Bianchini,
L. E. Bleem,
J. Bock,
F. R. Bouchet,
L. Bryant,
E. Camphuis,
J. E. Carlstrom,
T. W. Cecil,
C. L. Chang,
P. Chaubal,
H. C. Chiang,
P. M. Chichura,
T. -L. Chou,
R. Citron
, et al. (97 additional authors not shown)
Abstract:
We report results from an analysis aimed at detecting the trispectrum of the kinematic Sunyaev-Zel'dovich (kSZ) effect by combining data from the South Pole Telescope (SPT) and Herschel-SPIRE experiments over a 100 ${\rm deg}^{2}$ field. The SPT observations combine data from the previous and current surveys, namely SPTpol and SPT-3G, to achieve depths of 4.5, 3, and 16 $μ{\rm K-arcmin}$ in bands centered at 95, 150, and 220 GHz. For SPIRE, we include data from the 600 and 857 GHz bands. We reconstruct the velocity-induced large-scale correlation of the small-scale kSZ signal with a quadratic estimator that uses two cosmic microwave background (CMB) temperature maps, constructed by optimally combining data from all the frequency bands. We reject the null hypothesis of a zero trispectrum at the $10.3σ$ level. However, the measured trispectrum contains contributions from both the kSZ and other undesired components, such as CMB lensing and astrophysical foregrounds, with kSZ being sub-dominant. We use the Agora simulations to estimate the expected signal from CMB lensing and astrophysical foregrounds. After accounting for the contributions from CMB lensing and foreground signals, we do not detect an excess kSZ-only trispectrum and use this non-detection to set constraints on reionization. By applying a prior based on observations of the Gunn-Peterson trough, we obtain an upper limit on the duration of reionization of $Δz_{\rm re, 50} < 4.5$ (95% C.L.). We find these constraints are fairly robust to foreground assumptions. This trispectrum measurement is independent of, but consistent with, Planck's optical depth measurement. This result is the first constraint on the epoch of reionization using the non-Gaussian nature of the kSZ signal.
Submitted 4 March, 2024;
originally announced March 2024.
-
Extremal quantiles of intermediate orders under two-way clustering
Authors:
Harold D. Chiang,
Ryutah Kato,
Yuya Sasaki
Abstract:
This paper investigates extremal quantiles under two-way cluster dependence. We demonstrate that the limiting distribution of the unconditional intermediate order quantiles in the tails converges to a Gaussian distribution. This is remarkable as two-way cluster dependence entails potential non-Gaussianity in general, but extremal quantiles do not suffer from this issue. Building upon this result, we extend our analysis to extremal quantile regressions of intermediate order.
Submitted 4 March, 2024; v1 submitted 29 February, 2024;
originally announced February 2024.
-
SPT Clusters with DES and HST Weak Lensing. II. Cosmological Constraints from the Abundance of Massive Halos
Authors:
S. Bocquet,
S. Grandis,
L. E. Bleem,
M. Klein,
J. J. Mohr,
T. Schrabback,
T. M. C. Abbott,
P. A. R. Ade,
M. Aguena,
A. Alarcon,
S. Allam,
S. W. Allen,
O. Alves,
A. Amon,
A. J. Anderson,
J. Annis,
B. Ansarinejad,
J. E. Austermann,
S. Avila,
D. Bacon,
M. Bayliss,
J. A. Beall,
K. Bechtol,
M. R. Becker,
A. N. Bender
, et al. (171 additional authors not shown)
Abstract:
We present cosmological constraints from the abundance of galaxy clusters selected via the thermal Sunyaev-Zel'dovich (SZ) effect in South Pole Telescope (SPT) data with a simultaneous mass calibration using weak gravitational lensing data from the Dark Energy Survey (DES) and the Hubble Space Telescope (HST). The cluster sample is constructed from the combined SPT-SZ, SPTpol ECS, and SPTpol 500d surveys, and comprises 1,005 confirmed clusters in the redshift range $0.25-1.78$ over a total sky area of 5,200 deg$^2$. We use DES Year 3 weak-lensing data for 688 clusters with redshifts $z<0.95$ and HST weak-lensing data for 39 clusters with $0.6<z<1.7$. The weak-lensing measurements enable robust mass measurements of sample clusters and allow us to empirically constrain the SZ observable--mass relation. For a flat $Λ$CDM cosmology, and marginalizing over the sum of massive neutrinos, we measure $Ω_\mathrm{m}=0.286\pm0.032$, $σ_8=0.817\pm0.026$, and the parameter combination $σ_8\,(Ω_\mathrm{m}/0.3)^{0.25}=0.805\pm0.016$. Our measurement of $S_8\equivσ_8\,\sqrt{Ω_\mathrm{m}/0.3}=0.795\pm0.029$ and the constraint from Planck CMB anisotropies (2018 TT,TE,EE+lowE) differ by $1.1σ$. In combination with that Planck dataset, we place a 95% upper limit on the sum of neutrino masses $\sum m_ν<0.18$ eV. When additionally allowing the dark energy equation of state parameter $w$ to vary, we obtain $w=-1.45\pm0.31$ from our cluster-based analysis. In combination with Planck data, we measure $w=-1.34^{+0.22}_{-0.15}$, or a $2.2σ$ difference with a cosmological constant. We use the cluster abundance to measure $σ_8$ in five redshift bins between 0.25 and 1.8, and we find the results to be consistent with structure growth as predicted by the $Λ$CDM model fit to Planck primary CMB data.
Submitted 21 June, 2024; v1 submitted 4 January, 2024;
originally announced January 2024.
-
Data Science for Social Good
Authors:
Ahmed Abbasi,
Roger H. L. Chiang,
Jennifer J. Xu
Abstract:
Data science has been described as the fourth paradigm for scientific discovery. The latest wave of data science research, pertaining to machine learning and artificial intelligence (AI), is growing exponentially and garnering millions of annual citations. However, this growth has been accompanied by a diminishing emphasis on social good challenges: our analysis reveals that the proportion of data science research focusing on social good is less than it has ever been. At the same time, the proliferation of machine learning and generative AI has sparked debates about the socio-technical prospects and challenges associated with data science for human flourishing, organizations, and society. Against this backdrop, we present a framework for "data science for social good" (DSSG) research that considers the interplay between relevant data science research genres, social good challenges, and different levels of socio-technical abstraction. We perform an analysis of the literature to empirically demonstrate the paucity of work on DSSG in information systems (and other related disciplines) and highlight current impediments. We then use our proposed framework to introduce the articles appearing in the special issue. We hope that this article and the special issue will spur future DSSG research and help reverse the alarming trend across data science research over the past 30-plus years in which social good challenges are garnering proportionately less attention with each passing day.
Submitted 2 November, 2023;
originally announced November 2023.
-
Multi-objective Non-intrusive Hearing-aid Speech Assessment Model
Authors:
Hsin-Tien Chiang,
Szu-Wei Fu,
Hsin-Min Wang,
Yu Tsao,
John H. L. Hansen
Abstract:
Without the need for a clean reference, non-intrusive speech assessment methods have attracted great attention for objective evaluations. While deep learning models have been used to develop non-intrusive speech assessment methods with promising results, there is limited research on hearing-impaired subjects. This study proposes a multi-objective non-intrusive hearing-aid speech assessment model, called HASA-Net Large, which predicts speech quality and intelligibility scores based on input speech signals and specified hearing-loss patterns. Our experiments showed that the utilization of pre-trained self-supervised learning (SSL) models leads to a significant boost in speech quality and intelligibility predictions compared to using spectrograms as input. Additionally, we examined three distinct fine-tuning approaches that resulted in further performance improvements. Furthermore, we demonstrated that incorporating SSL models resulted in greater transferability to an out-of-distribution (OOD) dataset. Finally, this study introduces HASA-Net Large, which is a non-invasive approach for evaluating speech quality and intelligibility. HASA-Net Large utilizes raw waveforms and hearing-loss patterns to accurately predict speech quality and intelligibility levels for individuals with normal and impaired hearing and demonstrates superior prediction performance and transferability.
Submitted 15 November, 2023;
originally announced November 2023.
-
Galaxy Clusters Discovered via the Thermal Sunyaev-Zel'dovich Effect in the 500-square-degree SPTpol Survey
Authors:
L. E. Bleem,
M. Klein,
T. M. C. Abbott,
P. A. R. Ade,
M. Aguena,
O. Alves,
A. J. Anderson,
F. Andrade-Oliveira,
B. Ansarinejad,
M. Archipley,
M. L. N. Ashby,
J. E. Austermann,
D. Bacon,
J. A. Beall,
A. N. Bender,
B. A. Benson,
F. Bianchini,
S. Bocquet,
D. Brooks,
D. L. Burke,
M. Calzadilla,
J. E. Carlstrom,
A. Carnero Rosell,
J. Carretero,
C. L. Chang
, et al. (103 additional authors not shown)
Abstract:
We present a catalog of 689 galaxy cluster candidates detected at significance $ξ>4$ via their thermal Sunyaev-Zel'dovich (SZ) effect signature in 95 and 150 GHz data from the 500-square-degree SPTpol survey. We use optical and infrared data from the Dark Energy Camera and the Wide-field Infrared Survey Explorer (WISE) and Spitzer satellites to confirm 544 of these candidates as clusters with $\sim94\%$ purity. The sample has an approximately redshift-independent mass threshold at redshift $z>0.25$ and spans $1.5 \times 10^{14} < M_{500c} < 9.1 \times 10^{14}$ $M_\odot/h_{70}$ and $0.03<z\lesssim1.6$ in mass and redshift, respectively; 21% of the confirmed clusters are at $z>1$. We use external radio data from the Sydney University Molonglo Sky Survey (SUMSS) to estimate contamination to the SZ signal from synchrotron sources. The contamination reduces the recovered $ξ$ by a median value of 0.032, or $\sim0.8\%$ of the $ξ=4$ threshold value, and $\sim7\%$ of candidates have a predicted contamination greater than $Δξ= 1$. With the exception of a small number of systems $(<1\%)$, an analysis of clusters detected in single-frequency 95 and 150 GHz data shows no significant contamination of the SZ signal by emission from dusty or synchrotron sources. This cluster sample will be a key component in upcoming astrophysical and cosmological analyses of clusters. The SPTpol millimeter-wave maps and associated data products used to produce this sample are available at https://pole.uchicago.edu/public/data/sptpol_500d_clusters/index.html, and the NASA LAMBDA website. An interactive sky server with the SPTpol maps and Dark Energy Survey data release 2 images is also available at NCSA https://skyviewer.ncsa.illinois.edu.
Submitted 8 February, 2024; v1 submitted 13 November, 2023;
originally announced November 2023.
-
De-novo Chemical Reaction Generation by Means of Temporal Convolutional Neural Networks
Authors:
Andrei Buin,
Hung Yi Chiang,
S. Andrew Gadsden,
Faraz A. Alderson
Abstract:
We present here a combination of two networks, Recurrent Neural Networks (RNN) and Temporal Convolutional Neural Networks (TCN), in de novo reaction generation using the novel Reaction SMILES-like representation of reactions (CGRSmiles) with atom mapping directly incorporated. Recurrent Neural Networks are known for their autoregressive properties and are frequently used in language modelling with direct application to SMILES generation. The relatively novel TCNs possess similar properties with a wide receptive field while obeying the causality required for natural language processing (NLP). The combination of both latent representations expressed through TCN and RNN results in an overall better performance compared to RNN alone. Additionally, it is shown that different fine-tuning protocols have a profound impact on the generative scope of the model when applied to a dataset of interest via transfer learning.
Submitted 1 November, 2023; v1 submitted 26 October, 2023;
originally announced October 2023.
-
Towards Inferring Users' Impressions of Robot Performance in Navigation Scenarios
Authors:
Qiping Zhang,
Nathan Tsoi,
Booyeon Choi,
Jie Tan,
Hao-Tien Lewis Chiang,
Marynel Vázquez
Abstract:
Human impressions of robot performance are often measured through surveys. As a more scalable and cost-effective alternative, we study the possibility of predicting people's impressions of robot behavior using non-verbal behavioral cues and machine learning techniques. To this end, we first contribute the SEAN TOGETHER Dataset consisting of observations of an interaction between a person and a mobile robot in a Virtual Reality simulation, together with impressions of robot performance provided by users on a 5-point scale. Second, we contribute analyses of how well humans and supervised learning techniques can predict perceived robot performance based on different combinations of observation types (e.g., facial, spatial, and map features). Our results show that facial expressions alone provide useful information about human impressions of robot performance; but in the navigation scenarios we tested, spatial features are the most critical piece of information for this inference task. Also, when evaluating results as binary classification (rather than multiclass classification), the F1-Score of human predictions and machine learning models more than doubles, showing that both are better at telling the directionality of robot performance than predicting exact performance ratings. Based on our findings, we provide guidelines for implementing these prediction models in real-world navigation scenarios.
Submitted 17 October, 2023;
originally announced October 2023.
-
Simulating the Detection of the Global 21 cm Signal with MIST for Different Models of the Soil and Beam Directivity
Authors:
Raul A. Monsalve,
Christian H. Bye,
Jonathan L. Sievers,
Vadym Bidula,
Ricardo Bustos,
H. Cynthia Chiang,
Xinze Guo,
Ian Hendricksen,
Francis McGee,
F. Patricio Mena,
Garima Prabhakar,
Oscar Restrepo,
Nithyanandan Thyagarajan
Abstract:
The Mapper of the IGM Spin Temperature (MIST) is a new ground-based, single-antenna, radio experiment attempting to detect the global 21 cm signal from the Dark Ages and Cosmic Dawn. A significant challenge in this measurement is the frequency-dependence, or chromaticity, of the antenna beam directivity. MIST observes with the antenna above the soil and without a metal ground plane, and the beam directivity is sensitive to the electrical characteristics of the soil. In this paper, we use simulated observations with MIST to study how the detection of the global 21 cm signal from Cosmic Dawn is affected by the soil and the MIST beam directivity. We simulate observations using electromagnetic models of the directivity computed for single- and two-layer models of the soil. We test the recovery of the Cosmic Dawn signal with and without beam chromaticity correction applied to the simulated data. We find that our single-layer soil models enable a straightforward recovery of the signal even without chromaticity correction. Two-layer models increase the beam chromaticity and make the recovery more challenging. However, for the model in which the bottom soil layer has a lower electrical conductivity than the top layer, the signal can be recovered even without chromaticity correction. For the other two-layer models, chromaticity correction is necessary for the recovery of the signal and the accuracy requirements for the soil parameters vary between models. These results will be used as a guideline to select observation sites that are favorable for the detection of the Cosmic Dawn signal.
Submitted 23 May, 2024; v1 submitted 10 October, 2023;
originally announced October 2023.
-
Efficient Low-rank Backpropagation for Vision Transformer Adaptation
Authors:
Yuedong Yang,
Hung-Yueh Chiang,
Guihong Li,
Diana Marculescu,
Radu Marculescu
Abstract:
The increasing scale of vision transformers (ViT) has made the efficient fine-tuning of these large models for specific needs a significant challenge in various applications. This issue originates from the computationally demanding matrix multiplications required during the backpropagation process through linear layers in ViT. In this paper, we tackle this problem by proposing a new Low-rank BackPropagation via Walsh-Hadamard Transformation (LBP-WHT) method. Intuitively, LBP-WHT projects the gradient into a low-rank space and carries out backpropagation. This approach substantially reduces the computation needed for adapting ViT, as matrix multiplication in the low-rank space is far less resource-intensive. We conduct extensive experiments with different models (ViT, hybrid convolution-ViT model) on multiple datasets to demonstrate the effectiveness of our method. For instance, when adapting an EfficientFormer-L1 model on CIFAR100, our LBP-WHT achieves 10.4% higher accuracy than the state-of-the-art baseline, while requiring 9 MFLOPs less computation. As the first work to accelerate ViT adaptation with low-rank backpropagation, our LBP-WHT method is complementary to many prior efforts and can be combined with them for better performance.
Submitted 26 September, 2023;
originally announced September 2023.
-
Mapper of the IGM spin temperature: instrument overview
Authors:
R. A. Monsalve,
C. Altamirano,
V. Bidula,
R. Bustos,
C. H. Bye,
H. C. Chiang,
M. Diaz,
B. Fernandez,
X. Guo,
I. Hendricksen,
E. Hornecker,
F. Lucero,
H. Mani,
F. McGee,
F. P. Mena,
M. Pessoa,
G. Prabhakar,
O. Restrepo,
J. L. Sievers,
N. Thyagarajan
Abstract:
The observation of the global 21 cm signal produced by neutral hydrogen gas in the intergalactic medium (IGM) during the Dark Ages, Cosmic Dawn, and Epoch of Reionization requires measurements with extremely well-calibrated wideband radiometers. We describe the design and characterization of the Mapper of the IGM Spin Temperature (MIST), which is a new ground-based, single-antenna, global 21 cm experiment. The design of MIST was guided by the objectives of avoiding systematics from an antenna ground plane and cables around the antenna, as well as maximizing the instrument's on-sky efficiency and portability for operations at remote sites. We have built two MIST instruments, which observe in the range 25-105 MHz. For the 21 cm signal, this frequency range approximately corresponds to redshifts 55.5 > z > 12.5, encompassing the Dark Ages and Cosmic Dawn. The MIST antenna is a horizontal blade dipole of 2.42 m in length, 60 cm in width, and 52 cm in height above the ground. This antenna operates without a metal ground plane. The instruments run on 12 V batteries and have a maximum power consumption of 17 W. The batteries and electronics are contained in a single receiver box located under the antenna. We present the characterization of the instruments using electromagnetic simulations and lab measurements. We also show sample sky measurements from recent observations at remote sites in California, Nevada, and the Canadian High Arctic. These measurements indicate that the instruments perform as expected. Detailed analyses of the sky measurements are left for future work.
Submitted 23 May, 2024; v1 submitted 6 September, 2023;
originally announced September 2023.
-
On the Inconsistency of Cluster-Robust Inference and How Subsampling Can Fix It
Authors:
Harold D. Chiang,
Yuya Sasaki,
Yulong Wang
Abstract:
Conventional methods of cluster-robust inference are inconsistent in the presence of unignorably large clusters. We formalize this claim by establishing a necessary and sufficient condition for the consistency of the conventional methods. We find that this condition for consistency is rejected for a majority of empirical research papers. In this light, we propose a novel score subsampling method that achieves uniform size control over a broad class of data generating processes, including those for which the conventional method fails. Simulation studies support these claims. With real data used by an empirical paper, we showcase that the conventional methods conclude significance while our proposed method concludes insignificance.
Submitted 23 March, 2024; v1 submitted 19 August, 2023;
originally announced August 2023.
-
Study on the Correlation between Objective Evaluations and Subjective Speech Quality and Intelligibility
Authors:
Hsin-Tien Chiang,
Kuo-Hsuan Hung,
Szu-Wei Fu,
Heng-Cheng Kuo,
Ming-Hsueh Tsai,
Yu Tsao
Abstract:
Subjective tests are the gold standard for evaluating speech quality and intelligibility; however, they are time-consuming and expensive. Thus, objective measures that align with human perceptions are crucial. This study evaluates the correlation between commonly used objective measures and subjective speech quality and intelligibility using a Chinese speech dataset. Moreover, new objective measures are proposed that combine current objective measures using deep learning techniques to predict subjective quality and intelligibility. The proposed deep learning model reduces the amount of training data without significantly affecting prediction performance. We analyzed the deep learning model to understand how objective measures reflect subjective quality and intelligibility. We also explored the impact of including subjective speech quality ratings on speech intelligibility prediction. Our findings offer valuable insights into the relationship between objective measures and human perceptions.
Submitted 10 October, 2023; v1 submitted 10 July, 2023;
originally announced July 2023.
-
Principles and Guidelines for Evaluating Social Robot Navigation Algorithms
Authors:
Anthony Francis,
Claudia Pérez-D'Arpino,
Chengshu Li,
Fei Xia,
Alexandre Alahi,
Rachid Alami,
Aniket Bera,
Abhijat Biswas,
Joydeep Biswas,
Rohan Chandra,
Hao-Tien Lewis Chiang,
Michael Everett,
Sehoon Ha,
Justin Hart,
Jonathan P. How,
Haresh Karnan,
Tsang-Wei Edward Lee,
Luis J. Manso,
Reuth Mirksy,
Sören Pirk,
Phani Teja Singamaneni,
Peter Stone,
Ada V. Taylor,
Peter Trautman,
Nathan Tsoi
, et al. (6 additional authors not shown)
Abstract:
A major challenge to deploying robots widely is navigation in human-populated environments, commonly referred to as social robot navigation. While the field of social navigation has advanced tremendously in recent years, the fair evaluation of algorithms that tackle social navigation remains hard because it involves not just robotic agents moving in static environments but also dynamic human agents and their perceptions of the appropriateness of robot behavior. In contrast, clear, repeatable, and accessible benchmarks have accelerated progress in fields like computer vision, natural language processing and traditional robot navigation by enabling researchers to fairly compare algorithms, revealing limitations of existing solutions and illuminating promising new directions. We believe the same approach can benefit social navigation. In this paper, we pave the road towards common, widely accessible, and repeatable benchmarking criteria to evaluate social robot navigation. Our contributions include (a) a definition of a socially navigating robot as one that respects the principles of safety, comfort, legibility, politeness, social competency, agent understanding, proactivity, and responsiveness to context, (b) guidelines for the use of metrics, development of scenarios, benchmarks, datasets, and simulators to evaluate social navigation, and (c) a design of a social navigation metrics framework to make it easier to compare results from different simulators, robots and datasets.
Submitted 19 September, 2023; v1 submitted 29 June, 2023;
originally announced June 2023.
-
Shilling Black-box Review-based Recommender Systems through Fake Review Generation
Authors:
Hung-Yun Chiang,
Yi-Syuan Chen,
Yun-Zhu Song,
Hong-Han Shuai,
Jason S. Chang
Abstract:
Review-Based Recommender Systems (RBRSs) have attracted increasing research interest due to their ability to alleviate well-known cold-start problems. RBRSs utilize reviews to construct user and item representations. However, in this paper, we argue that such a reliance on reviews may instead expose systems to the risk of being shilled. To explore this possibility, we propose the first generation-based model for shilling attacks against RBRSs. Specifically, we learn a fake review generator through reinforcement learning, which maliciously promotes items by forcing prediction shifts after adding generated reviews to the system. By introducing auxiliary rewards to increase text fluency and diversity with the aid of pre-trained language models and aspect predictors, the generated reviews can be effective for shilling with high fidelity. Experimental results demonstrate that the proposed framework can successfully attack three different kinds of RBRSs on the Amazon corpus with three domains and the Yelp corpus. Furthermore, human studies also show that the generated reviews are fluent and informative. Finally, equipped with Attack Review Generators (ARGs), RBRSs with adversarial training are much more robust to malicious reviews.
Submitted 27 June, 2023;
originally announced June 2023.
-
Language to Rewards for Robotic Skill Synthesis
Authors:
Wenhao Yu,
Nimrod Gileadi,
Chuyuan Fu,
Sean Kirmani,
Kuang-Huei Lee,
Montse Gonzalez Arenas,
Hao-Tien Lewis Chiang,
Tom Erez,
Leonard Hasenclever,
Jan Humplik,
Brian Ichter,
Ted Xiao,
Peng Xu,
Andy Zeng,
Tingnan Zhang,
Nicolas Heess,
Dorsa Sadigh,
Jie Tan,
Yuval Tassa,
Fei Xia
Abstract:
Large language models (LLMs) have demonstrated exciting progress in acquiring diverse new capabilities through in-context learning, ranging from logical reasoning to code-writing. Robotics researchers have also explored using LLMs to advance the capabilities of robotic control. However, since low-level robot actions are hardware-dependent and underrepresented in LLM training corpora, existing efforts in applying LLMs to robotics have largely treated LLMs as semantic planners or relied on human-engineered control primitives to interface with the robot. On the other hand, reward functions are shown to be flexible representations that can be optimized for control policies to achieve diverse tasks, while their semantic richness makes them suitable to be specified by LLMs. In this work, we introduce a new paradigm that harnesses this realization by utilizing LLMs to define reward parameters that can be optimized to accomplish a variety of robotic tasks. Using reward as the intermediate interface generated by LLMs, we can effectively bridge the gap between high-level language instructions or corrections and low-level robot actions. Meanwhile, combining this with a real-time optimizer, MuJoCo MPC, empowers an interactive behavior creation experience where users can immediately observe the results and provide feedback to the system. To systematically evaluate the performance of our proposed method, we designed a total of 17 tasks for a simulated quadruped robot and a dexterous manipulator robot. We demonstrate that our proposed method reliably tackles 90% of the designed tasks, while a baseline using primitive skills as the interface with Code-as-policies achieves 50% of the tasks. We further validated our method on a real robot arm where complex manipulation skills such as non-prehensile pushing emerge through our interactive system.
Submitted 16 June, 2023; v1 submitted 14 June, 2023;
originally announced June 2023.
-
Resonant orbits of rotating black holes beyond circularity: Discontinuity along parameter shift
Authors:
Che-Yu Chen,
Hsu-Wen Chiang,
Avani Patel
Abstract:
According to general relativity, an isolated black hole in vacuum shall be described by the Kerr metric, whose geodesic equations are integrable. The violation of integrability leads to chaos for particles moving around the black hole. This chaotic dynamics could leave imprints on the associated gravitational waveform and could be tested with upcoming observations. In this paper, we investigate the chaotic orbital dynamics induced by the violation of a certain spacetime symmetry, the circularity. Specifically, we focus on the resonant orbits of a particular noncircular spacetime as an example and find that they form chains of Birkhoff islands on Poincaré surfaces of section. We compare the island structures with those generated in typical nonintegrable but circular spacetimes. The islands of stability induced by noncircularity appear asymmetric on the most common Poincaré surface of section at the equatorial plane. The asymmetric patterns of islands vary discontinuously when the spacetime parameters transit through integrable regions. The origin of such features is explained in the context of perturbation analysis by considering the orbits associated with stable fixed points on the section. Possible observational implications about testing circularity through gravitational wave detection are discussed.
Submitted 15 September, 2023; v1 submitted 14 June, 2023;
originally announced June 2023.
-
Simultaneous Millimeter-wave, Gamma-ray, and Optical Monitoring of the Blazar PKS 2326-502 During a Flaring State
Authors:
J. C. Hood II,
A. Simpson,
A. McDaniel,
A. Foster,
P. A. R. Ade,
M. Ajello,
A. J. Anderson,
J. E. Austermann,
J. A. Beall,
A. N. Bender,
B. A. Benson,
F. Bianchini,
L. E. Bleem,
J. E. Carlstrom,
C. L. Chang,
P. Chaubal,
H. C. Chiang,
T-L. Chou,
R. Citron,
C. Corbett Moran,
T. M. Crawford,
A. T. Crites,
T. de Haan,
M. A. Dobbs,
W. Everett
, et al. (44 additional authors not shown)
Abstract:
Including millimeter-wave (mm-wave) data in multi-wavelength studies of the variability of active galactic nuclei (AGN) can provide insights into AGN physics that are not easily accessible at other wavelengths. We demonstrate in this work the potential of cosmic microwave background (CMB) telescopes to provide long-term, high-cadence mm-wave AGN monitoring over large fractions of sky. We report on a pilot study using data from the SPTpol instrument on the South Pole Telescope (SPT), which was designed to observe the CMB at arcminute and larger angular scales. Between 2013 and 2016, SPTpol was used primarily to observe a single 500 deg^2 field, covering the entire field several times per day with detectors sensitive to radiation in bands centered at 95 and 150 GHz. We use SPT 150 GHz observations to create AGN light curves, and we compare these mm-wave light curves to those at other wavelengths, in particular gamma-ray and optical. In this Letter, we focus on a single source, PKS 2326-502, which has extensive, day-timescale monitoring data in gamma-ray, optical, and now mm-wave between 2013 and 2016. We find PKS 2326-502 to be in a flaring state in the first two years of this monitoring, and we present a search for evidence of correlated variability between mm-wave, optical R band, and gamma-ray observations. This pilot study is paving the way for AGN monitoring with current and upcoming CMB experiments such as SPT-3G, Simons Observatory, and CMB-S4, including multi-wavelength studies with facilities such as VRO-LSST.
Submitted 28 February, 2023;
originally announced February 2023.
-
Bender-Knuth involutions on linear extensions of posets
Authors:
Judy Hsin-Hui Chiang,
Anh Trong Nam Hoang,
Matthew Kendall,
Ryan Lynch,
Son Nguyen,
Benjamin Przybocki,
Janabel Xia
Abstract:
We study the permutation group $\mathcal{BK}_P$ generated by Bender-Knuth moves on linear extensions of a poset $P$, an analog of the Berenstein-Kirillov group on column-strict tableaux. We explore the group relations, with an emphasis on identifying posets $P$ for which the cactus relations hold in $\mathcal{BK}_P$. We also examine $\mathcal{BK}_P$ as a subgroup of the symmetric group $\mathfrak{S}_{\mathcal{L}(P)}$ on the set of linear extensions of $P$ with the focus on analyzing posets $P$ for which $\mathcal{BK}_P = \mathfrak{S}_{\mathcal{L}(P)}$.
Submitted 24 March, 2024; v1 submitted 23 February, 2023;
originally announced February 2023.
-
Regression adjustment in randomized controlled trials with many covariates
Authors:
Harold D Chiang,
Yukitoshi Matsushita,
Taisuke Otsu
Abstract:
This paper is concerned with estimation and inference on average treatment effects in randomized controlled trials when researchers observe potentially many covariates. By employing Neyman's (1923) finite population perspective, we propose a bias-corrected regression adjustment estimator using cross-fitting, and show that the proposed estimator has favorable properties over existing alternatives. For inference, we derive the first and second order terms in the stochastic component of the regression adjustment estimators, study higher order properties of the existing inference methods, and propose a bias-corrected version of the HC3 standard error. The proposed methods readily extend to stratified experiments with large strata. Simulation studies show our cross-fitted estimator, combined with the bias-corrected HC3, delivers precise point estimates and robust size controls over a wide range of DGPs. To illustrate, the proposed methods are applied to a real dataset on randomized experiments of incentives and services for college achievement following Angrist, Lang, and Oreopoulos (2009).
Submitted 13 November, 2023; v1 submitted 1 February, 2023;
originally announced February 2023.
-
On Using The Two-Way Cluster-Robust Standard Errors
Authors:
Harold D Chiang,
Yuya Sasaki
Abstract:
Thousands of papers have reported two-way cluster-robust (TWCR) standard errors. However, the recent econometrics literature points out the potential non-gaussianity of two-way cluster sample means, and thus the invalidity of inference based on the TWCR standard errors. Fortunately, simulation studies nonetheless show that gaussianity is common rather than exceptional. This paper provides theoretical support for this encouraging observation. Specifically, we derive a novel central limit theorem for two-way clustered triangular arrays that justifies the use of the TWCR standard errors under very mild and interpretable conditions. We, therefore, hope that this paper will provide a theoretical justification for the legitimacy of most, if not all, of the thousands of empirical papers that have used the TWCR standard errors. We provide practical guidance on when a researcher can employ the TWCR standard errors.
Submitted 31 January, 2023;
originally announced January 2023.
-
MobileTL: On-device Transfer Learning with Inverted Residual Blocks
Authors:
Hung-Yueh Chiang,
Natalia Frumkin,
Feng Liang,
Diana Marculescu
Abstract:
Transfer learning on edge is challenging due to limited on-device resources. Existing work addresses this issue by training a subset of parameters or adding model patches. Developed with inference in mind, Inverted Residual Blocks (IRBs) split a convolutional layer into depthwise and pointwise convolutions, leading to more stacking layers, e.g., convolution, normalization, and activation layers. Though they are efficient for inference, IRBs require that additional activation maps are stored in memory for training weights for convolution layers and scales for normalization layers. As a result, their high memory cost prohibits training IRBs on resource-limited edge devices, making them unsuitable in the context of transfer learning. To address this issue, we present MobileTL, a memory and computationally efficient on-device transfer learning method for models built with IRBs. MobileTL trains the shifts for internal normalization layers to avoid storing activation maps for the backward pass. Also, MobileTL approximates the backward computation of the activation layer (e.g., Hard-Swish and ReLU6) as a signed function which enables storing a binary mask instead of activation maps for the backward pass. MobileTL fine-tunes a few top blocks (close to output) rather than propagating the gradient through the whole network to reduce the computation cost. Our method reduces memory usage by 46% and 53% for MobileNetV2 and V3 IRBs, respectively. For MobileNetV3, we observe a 36% reduction in floating-point operations (FLOPs) when fine-tuning 5 blocks, while only incurring a 0.6% accuracy reduction on CIFAR10. Extensive experiments on multiple datasets demonstrate that our method is Pareto-optimal (best accuracy under given hardware constraints) compared to prior work in transfer learning for edge devices.
Submitted 8 April, 2023; v1 submitted 5 December, 2022;
originally announced December 2022.
-
Antenna characterization for the HIRAX experiment
Authors:
Emily R. Kuhn,
Benjamin R. B. Saliwanchik,
Kevin Bandura,
Michele Bianco,
H. Cynthia Chiang,
Devin Crichton,
Meiling Deng,
Sindhu Gaddam,
Kit Gerodias,
Austin Gumba,
Maile Harris,
Kavilan Moodley,
V. Mugundhan,
Laura Newburgh,
Jeffrey Peterson,
Elizabeth Pieters,
Anna R. Polish,
Alexandre Refregier,
Ajith Sampath,
Mario G. Santos,
Onkabetse Sengate,
Jonathan Sievers,
Ema Smith,
Will Tyndall,
Anthony Walters
, et al. (2 additional authors not shown)
Abstract:
The Hydrogen Intensity and Real-time Analysis eXperiment (HIRAX) aims to improve constraints on the dark energy equation of state through measurements of large-scale structure at high redshift ($0.8<z<2.5$), while serving as a state-of-the-art fast radio burst detector. Bright galactic foregrounds contaminate the 400--800~MHz HIRAX frequency band, so meeting the science goals will require precise instrument characterization. In this paper we describe characterization of the HIRAX antenna, focusing on measurements of the antenna beam and antenna noise temperature.
Beam measurements of the current HIRAX antenna design were performed in an anechoic chamber and compared to simulations. We report measurement techniques and results, which find a broad and symmetric antenna beam for $\nu<650$~MHz, and elevated cross-polarization levels and beam asymmetries for $\nu>700$~MHz. Noise temperature measurements of the HIRAX feeds were performed in a custom apparatus built at Yale. In this system, identical loads, one cryogenic and the other at room temperature, are used to take a differential (Y-factor) measurement from which the noise of the system is inferred. Several measurement sets have been conducted using the system, involving CHIME feeds as well as four of the HIRAX active feeds. These measurements give the first noise temperature measurements of the HIRAX feed, revealing a $\sim$60~K noise temperature (relative to a 30~K target) with 40~K peak-to-peak frequency-dependent features, and provide the first demonstration of feed repeatability. Both findings inform current and future feed designs.
Submitted 25 July, 2022;
originally announced July 2022.
-
Eikonal quasinormal modes and photon orbits of deformed Schwarzschild black holes
Authors:
Che-Yu Chen,
Hsu-Wen Chiang,
Jie-Shiun Tsao
Abstract:
The geometric optics approximation provides an interpretation for eikonal correspondence that, in black-hole-containing spacetimes, connects high-frequency black hole quasinormal modes with closed photon orbits around said black hole. This correspondence has been identified explicitly for Schwarzschild, Reissner-Nordström, Kerr, and Kerr-Newman black holes, the violation of which can be a potential hint toward physics beyond General Relativity. Notably, the aforementioned black hole spacetimes have sufficient symmetries such that both the geodesic equations and the master wave equations are separable. The identification of the correspondence seems to largely rely on these symmetries. One naturally asks how the eikonal correspondence would appear if the spacetime were less symmetric. For a pioneering work in this direction, we consider in this paper a deformed Schwarzschild spacetime retaining only axisymmetry and stationarity. We show that up to the first order of spacetime deformations the eikonal correspondence manifests through the definition of the \textit{averaged} radius of trapped photon orbits along their one period. This averaged radius overlaps the potential peak in the master wave equation, which can be defined up to the first order of spacetime deformations, allowing the explicit identification of the eikonal correspondence.
Submitted 30 August, 2022; v1 submitted 5 May, 2022;
originally announced May 2022.
-
Boosting Self-Supervised Embeddings for Speech Enhancement
Authors:
Kuo-Hsuan Hung,
Szu-wei Fu,
Huan-Hsin Tseng,
Hsin-Tien Chiang,
Yu Tsao,
Chii-Wann Lin
Abstract:
Self-supervised learning (SSL) representation for speech has achieved state-of-the-art (SOTA) performance on several downstream tasks. However, there remains room for improvement in speech enhancement (SE) tasks. In this study, we used a cross-domain feature to solve the problem that SSL embeddings may lack fine-grained information to regenerate speech signals. By integrating the SSL representation and spectrogram, the result can be significantly boosted. We further study the relationship between the noise robustness of SSL representation via clean-noisy distance (CN distance) and the layer importance for SE. Consequently, we found that SSL representations with lower noise robustness are more important. Furthermore, our experiments on the VCTK-DEMAND dataset demonstrated that fine-tuning an SSL representation with an SE model can outperform the SOTA SSL-based SE methods in PESQ, CSIG and COVL without invoking complicated network architectures. In later experiments, the CN distance in SSL embeddings was observed to increase after fine-tuning. These results verify our expectations and may help design SE-related SSL training in the future.
Submitted 5 July, 2022; v1 submitted 7 April, 2022;
originally announced April 2022.
-
Asteroid Measurements at Millimeter Wavelengths with the South Pole Telescope
Authors:
P. M. Chichura,
A. Foster,
C. Patel,
N. Ossa-Jaen,
P. A. R. Ade,
Z. Ahmed,
A. J. Anderson,
M. Archipley,
J. E. Austermann,
J. S. Avva,
L. Balkenhol,
P. S. Barry,
R. Basu Thakur,
J. A. Beall,
K. Benabed,
A. N. Bender,
B. A. Benson,
F. Bianchini,
L. E. Bleem,
F. R. Bouchet,
L. Bryant,
K. Byrum,
J. E. Carlstrom,
F. W. Carter,
T. W. Cecil
, et al. (119 additional authors not shown)
Abstract:
We present the first measurements of asteroids in millimeter wavelength (mm) data from the South Pole Telescope (SPT), which is used primarily to study the cosmic microwave background (CMB). We analyze maps of two $\sim270$ deg$^2$ sky regions near the ecliptic plane, each observed with the SPTpol camera $\sim100$ times over one month. We subtract the mean of all maps of a given field, removing static sky signal, and then average the mean-subtracted maps at known asteroid locations. We detect three asteroids, (324) Bamberga, (13) Egeria, and (22) Kalliope, with signal-to-noise ratios (S/N) of 11.2, 10.4, and 6.1, respectively, at 2.0 mm (150 GHz); we also detect (324) Bamberga with S/N of 4.1 at 3.2 mm (95 GHz). We place constraints on these asteroids' effective emissivities, brightness temperatures, and light curve modulation amplitude. Our flux density measurements of (324) Bamberga and (13) Egeria roughly agree with predictions, while our measurements of (22) Kalliope suggest lower flux, corresponding to effective emissivities of $0.66 \pm 0.11$ at 2.0 mm and $<0.47$ at 3.2 mm. We predict the asteroids detectable in other SPT datasets and find good agreement with detections of (772) Tanete and (1093) Freda in recent data from the SPT-3G camera, which has $\sim10 \times$ the mapping speed of SPTpol. This work is the first focused analysis of asteroids in data from CMB surveys, and it demonstrates we can repurpose historic and future datasets for asteroid studies. Future SPT measurements can help constrain the distribution of surface properties over a larger asteroid population.
Submitted 21 April, 2023; v1 submitted 2 February, 2022;
originally announced February 2022.
-
Standard errors for two-way clustering with serially correlated time effects
Authors:
Harold D Chiang,
Bruce E Hansen,
Yuya Sasaki
Abstract:
We propose improved standard errors and an asymptotic distribution theory for two-way clustered panels. Our proposed estimator and theory allow for arbitrary serial dependence in the common time effects, which is excluded by existing two-way methods, including the popular two-way cluster standard errors of Cameron, Gelbach, and Miller (2011) and the cluster bootstrap of Menzel (2021). Our asymptotic distribution theory is the first which allows for this level of inter-dependence among the observations. Under weak regularity conditions, we demonstrate that the least squares estimator is asymptotically normal, our proposed variance estimator is consistent, and t-ratios are asymptotically standard normal, permitting conventional inference. We present simulation evidence that confidence intervals constructed with our proposed standard errors obtain superior coverage performance relative to existing methods. We illustrate the relevance of the proposed method in an empirical application to a standard Fama-French three-factor regression.
Submitted 13 December, 2023; v1 submitted 26 January, 2022;
originally announced January 2022.
-
In-flight gain monitoring of SPIDER's transition-edge sensor arrays
Authors:
J. P. Filippini,
A. E. Gambrel,
A. S. Rahlin,
E. Y. Young,
P. A. R. Ade,
M. Amiri,
S. J. Benton,
A. S. Bergman,
R. Bihary,
J. J. Bock,
J. R. Bond,
J. A. Bonetti,
S. A. Bryan,
H. C. Chiang,
C. R. Contaldi,
O. Dore,
A. J. Duivenvoorden,
H. K. Eriksen,
M. Farhang,
A. A. Fraisse,
K. Freese,
M. Galloway,
N. N. Gandilo,
K. Ganga,
R. Gualtieri
, et al. (45 additional authors not shown)
Abstract:
Experiments deploying large arrays of transition-edge sensors (TESs) often require a robust method to monitor gain variations with minimal loss of observing time. We propose a sensitive and non-intrusive method for monitoring variations in TES responsivity using small square waves applied to the TES bias. We construct an estimator for a TES's small-signal power response from its electrical response that is exact in the limit of strong electrothermal feedback. We discuss the application and validation of this method using flight data from SPIDER, a balloon-borne telescope that observes the polarization of the cosmic microwave background with more than 2000 TESs. This method may prove useful for future balloon- and space-based instruments, where observing time and ground control bandwidth are limited.
Submitted 16 June, 2022; v1 submitted 1 December, 2021;
originally announced December 2021.
-
Rubin Science Platform on Google: the story so far
Authors:
William O'Mullane,
Frossie Economou,
Flora Huang,
Dan Speck,
Hsin-Fang Chiang,
Melissa L. Graham,
Russ Allbery,
Christine Banek,
Jonathan Sick,
Adam J. Thornton,
Jess Masciarelli,
Kian-Tat Lim,
Fritz Mueller,
Sergey Padolski,
Tim Jenness,
K. Simon Krughoff,
Michelle Gower,
Leanne P. Guy,
Gregory P. Dubois-Felsmann
Abstract:
We describe Rubin Observatory's experience with offering a data access facility (and associated services including our Science Platform) deployed on Google Cloud infrastructure as part of our pre-Operations Data Preview program.
Submitted 29 November, 2021;
originally announced November 2021.
-
HASA-net: A non-intrusive hearing-aid speech assessment network
Authors:
Hsin-Tien Chiang,
Yi-Chiao Wu,
Cheng Yu,
Tomoki Toda,
Hsin-Min Wang,
Yih-Chun Hu,
Yu Tsao
Abstract:
Without the need for a clean reference, non-intrusive speech assessment methods have attracted great attention for objective evaluations. Recently, deep neural network (DNN) models have been applied to build non-intrusive speech assessment approaches and have been shown to provide promising performance. However, most DNN-based approaches are designed for normal-hearing listeners without considering hearing-loss factors. In this study, we propose a DNN-based hearing aid speech assessment network (HASA-Net), formed by a bidirectional long short-term memory (BLSTM) model, to predict speech quality and intelligibility scores simultaneously according to input speech signals and specified hearing-loss patterns. To the best of our knowledge, HASA-Net is the first work to incorporate quality and intelligibility assessments utilizing a unified DNN-based non-intrusive model for hearing aids. Experimental results show that the predicted speech quality and intelligibility scores of HASA-Net are highly correlated with two well-known intrusive hearing-aid evaluation metrics, the hearing aid speech quality index (HASQI) and the hearing aid speech perception index (HASPI), respectively.
Submitted 10 November, 2021;
originally announced November 2021.
-
A Simulation-Based Method for Correcting Mode Coupling in CMB Angular Power Spectra
Authors:
J. S. -Y. Leung,
J. Hartley,
J. M. Nagy,
C. B. Netterfield,
J. A. Shariff,
P. A. R. Ade,
M. Amiri,
S. J. Benton,
A. S. Bergman,
R. Bihary,
J. J. Bock,
J. R. Bond,
J. A. Bonetti,
S. A. Bryan,
H. C. Chiang,
C. R. Contaldi,
O. Doré,
A. J. Duivenvoorden,
H. K. Eriksen,
M. Farhang,
J. P. Filippini,
A. A. Fraisse,
K. Freese,
M. Galloway,
A. E. Gambrel, et al. (45 additional authors not shown)
Abstract:
Modern CMB analysis pipelines regularly employ complex time-domain filters, beam models, masking, and other techniques during the production of sky maps and their corresponding angular power spectra. However, these processes can generate couplings between multipoles from the same spectrum and from different spectra, in addition to the typical power attenuation. Within the context of pseudo-$C_\ell$ based, MASTER-style analyses, the net effect of the time-domain filtering is commonly approximated by a multiplicative transfer function, $F_{\ell}$, that can fail to capture mode mixing and is dependent on the spectrum of the signal. To address these shortcomings, we have developed a simulation-based spectral correction approach that constructs a two-dimensional transfer matrix, $J_{\ell\ell'}$, which contains information about mode mixing in addition to mode attenuation. We demonstrate the application of this approach on data from the first flight of the SPIDER balloon-borne CMB experiment.
Submitted 21 April, 2022; v1 submitted 1 November, 2021;
originally announced November 2021.
-
Dyadic double/debiased machine learning for analyzing determinants of free trade agreements
Authors:
Harold D Chiang,
Yukun Ma,
Joel Rodrigue,
Yuya Sasaki
Abstract:
This paper presents novel methods and theories for estimation and inference about parameters in econometric models using machine learning for nuisance parameter estimation when data are dyadic. We propose a dyadic cross fitting method to remove over-fitting biases under arbitrary dyadic dependence. Together with the use of Neyman orthogonal scores, this novel cross fitting method enables root-$n$ consistent estimation and inference that is robust to dyadic dependence. We illustrate an application of our general framework to high-dimensional network link formation models. With this method applied to empirical data of international economic networks, we reexamine determinants of free trade agreements (FTA) viewed as links formed in the dyad composed of world economies. We document that standard methods may lead to misleading conclusions for numerous classic determinants of FTA formation due to biased point estimates or standard errors which are too small.
Submitted 19 December, 2022; v1 submitted 8 October, 2021;
originally announced October 2021.
-
The Hydrogen Intensity and Real-time Analysis eXperiment: 256-Element Array Status and Overview
Authors:
Devin Crichton,
Moumita Aich,
Adam Amara,
Kevin Bandura,
Bruce A. Bassett,
Carlos Bengaly,
Pascale Berner,
Shruti Bhatporia,
Martin Bucher,
Tzu-Ching Chang,
H. Cynthia Chiang,
Jean-Francois Cliche,
Carolyn Crichton,
Romeel Dave,
Dirk I. L. de Villiers,
Matt A. Dobbs,
Aaron M. Ewall-Wice,
Scott Eyono,
Christopher Finlay,
Sindhu Gaddam,
Ken Ganga,
Kevin G. Gayley,
Kit Gerodias,
Tim Gibbon,
Austin Gumba, et al. (75 additional authors not shown)
Abstract:
The Hydrogen Intensity and Real-time Analysis eXperiment (HIRAX) is a radio interferometer array currently in development, with an initial 256-element array to be deployed at the South African Radio Astronomy Observatory (SARAO) Square Kilometer Array (SKA) site in South Africa. Each of the 6 m, $f/0.23$ dishes will be instrumented with dual-polarisation feeds operating over a frequency range of 400-800 MHz. Through intensity mapping of the 21 cm emission line of neutral hydrogen, HIRAX will provide a cosmological survey of the distribution of large-scale structure over the redshift range of $0.775 < z < 2.55$ over $\sim$15,000 square degrees of the southern sky. The statistical power of such a survey is sufficient to produce $\sim$7 percent constraints on the dark energy equation of state parameter when combined with measurements from the Planck satellite. Additionally, HIRAX will provide a highly competitive platform for radio transient and HI absorber science while enabling a multitude of cross-correlation studies. In this paper, we describe the science goals of the experiment, give an overview of the design and status of the sub-components of the telescope system, and describe the expected performance of the initial 256-element array as well as the planned future expansion to the final, 1024-element array.
Submitted 17 January, 2022; v1 submitted 28 September, 2021;
originally announced September 2021.
-
Inference in high-dimensional regression models without the exact or $L^p$ sparsity
Authors:
Jooyoung Cha,
Harold D. Chiang,
Yuya Sasaki
Abstract:
This paper proposes a new method of inference in high-dimensional regression models and high-dimensional IV regression models. Estimation is based on a combined use of the orthogonal greedy algorithm, high-dimensional Akaike information criterion, and double/debiased machine learning. The method of inference for any low-dimensional subvector of high-dimensional parameters is based on a root-$N$ asymptotic normality, which is shown to hold without requiring the exact sparsity condition or the $L^p$ sparsity condition. Simulation studies demonstrate superior finite-sample performance of this proposed method over those based on the LASSO or the random forest, especially under less sparse models. We illustrate an application to production analysis with a panel of Chilean firms.
Submitted 31 December, 2022; v1 submitted 21 August, 2021;
originally announced August 2021.
-
FOX-NAS: Fast, On-device and Explainable Neural Architecture Search
Authors:
Chia-Hsiang Liu,
Yu-Shin Han,
Yuan-Yao Sung,
Yi Lee,
Hung-Yueh Chiang,
Kai-Chiang Wu
Abstract:
Neural architecture search can discover neural networks with good performance, and One-Shot approaches are prevalent. One-Shot approaches typically require a supernet with weight sharing and predictors that predict the performance of an architecture. However, previous methods take a long time to generate performance predictors and are thus inefficient. To this end, we propose FOX-NAS, which consists of fast and explainable predictors based on simulated annealing and multivariate regression. Our method is quantization-friendly and can be efficiently deployed to the edge. The experiments on different hardware show that FOX-NAS models outperform some other popular neural network architectures. For example, FOX-NAS matches MobileNetV2 and EfficientNet-Lite0 accuracy with 240% and 40% less latency on the edge CPU. FOX-NAS is the 3rd place winner of the 2020 Low-Power Computer Vision Challenge (LPCVC), DSP classification track. See all evaluation results at https://lpcv.ai/competitions/2020. Search code and pre-trained models are released at https://github.com/great8nctu/FOX-NAS.
Submitted 14 August, 2021;
originally announced August 2021.
-
Multiway empirical likelihood
Authors:
Harold D Chiang,
Yukitoshi Matsushita,
Taisuke Otsu
Abstract:
This paper develops a general methodology to conduct statistical inference for observations indexed by multiple sets of entities. We propose a novel multiway empirical likelihood statistic that converges to a chi-square distribution under the non-degenerate case, where corresponding Hoeffding type decomposition is dominated by linear terms. Our methodology is related to the notion of jackknife empirical likelihood but the leave-out pseudo values are constructed by leaving columns or rows. We further develop a modified version of our multiway empirical likelihood statistic, which converges to a chi-square distribution regardless of the degeneracy, and discover its desirable higher-order property compared to the t-ratio by the conventional Eicker-White type variance estimator. The proposed methodology is illustrated by several important statistical problems, such as bipartite network, generalized estimating equations, and three-way observations.
Submitted 6 December, 2023; v1 submitted 10 August, 2021;
originally announced August 2021.
-
A generic unitary black-hole evaporation model based on first principles
Authors:
Kuan-Yu Chen,
Pisin Chen,
Hsu-Wen Chiang,
Dong-Han Yeom
Abstract:
Based on the discretized horizon picture, we introduce a macroscopic effective model of the horizon area quanta that encapsulates the features necessary for black holes to evaporate consistently. The price to pay is the introduction of a "hidden sector" that represents our lack of knowledge about the final destination of the black hole entropy. We focus on the peculiar form of the interaction between this hidden sector and the black hole enforced by the self-consistency. Despite the expressive power of the model, we arrive at several qualitative statements. Furthermore, we identify these statements as features inside the microscopic density of states of the horizon quanta, with the dimension of the configuration space being associated with the area per quantum in Planck units, a UV cutoff proportional to the amount of excess entropy relative to Bekenstein's law at the end of evaporation, and a zero-frequency-pole-like structure corresponding, similarly, to the amount of excess entropy in the IR limit. We then relate this nearly-zero-frequency structure to the soft hairs proposed by Strominger et al., and argue that we should consider deviating away from the zero frequency limit for soft hairs to participate in the black hole evaporation.
Submitted 8 August, 2021;
originally announced August 2021.
-
Adaptive Learning Rate and Momentum for Training Deep Neural Networks
Authors:
Zhiyong Hao,
Yixuan Jiang,
Huihua Yu,
Hsiao-Dong Chiang
Abstract:
Recent progress on deep learning relies heavily on the quality and efficiency of training algorithms. In this paper, we develop a fast training method motivated by the nonlinear Conjugate Gradient (CG) framework. We propose the Conjugate Gradient with Quadratic line-search (CGQ) method. On the one hand, a quadratic line-search determines the step size according to the current loss landscape. On the other hand, the momentum factor is dynamically updated in computing the conjugate gradient parameter (like Polak-Ribiere). Theoretical results ensuring the convergence of our method in strongly convex settings are developed. Experiments on image classification datasets show that our method yields faster convergence than other local solvers and has better generalization capability (test set accuracy). One major advantage of the proposed method is that tedious hand tuning of hyperparameters like the learning rate and momentum is avoided.
Submitted 26 July, 2021; v1 submitted 22 June, 2021;
originally announced June 2021.
-
Scene Transformer: A unified architecture for predicting multiple agent trajectories
Authors:
Jiquan Ngiam,
Benjamin Caine,
Vijay Vasudevan,
Zhengdong Zhang,
Hao-Tien Lewis Chiang,
Jeffrey Ling,
Rebecca Roelofs,
Alex Bewley,
Chenxi Liu,
Ashish Venugopal,
David Weiss,
Ben Sapp,
Zhifeng Chen,
Jonathon Shlens
Abstract:
Predicting the motion of multiple agents is necessary for planning in dynamic environments. This task is challenging for autonomous driving since agents (e.g. vehicles and pedestrians) and their associated behaviors may be diverse and influence one another. Most prior work has focused on predicting independent futures for each agent based on all past motion, and planning against these independent predictions. However, planning against independent predictions can make it challenging to represent the future interaction possibilities between different agents, leading to sub-optimal planning. In this work, we formulate a model for predicting the behavior of all agents jointly, producing consistent futures that account for interactions between agents. Inspired by recent language modeling approaches, we use a masking strategy as the query to our model, enabling one to invoke a single model to predict agent behavior in many ways, such as potentially conditioned on the goal or full future trajectory of the autonomous vehicle or the behavior of other agents in the environment. Our model architecture employs attention to combine features across road elements, agent interactions, and time steps. We evaluate our approach on autonomous driving datasets for both marginal and joint motion prediction, and achieve state of the art performance across two popular datasets. Through combining a scene-centric approach, agent permutation equivariant model, and a sequence masking strategy, we show that our model can unify a variety of motion prediction tasks from joint motion predictions to conditioned prediction.
Submitted 4 March, 2022; v1 submitted 15 June, 2021;
originally announced June 2021.
-
Maximizing Extractable Value from Automated Market Makers
Authors:
Massimo Bartoletti,
James Hsin-yu Chiang,
Alberto Lluch-Lafuente
Abstract:
Automated Market Makers (AMMs) are decentralized applications that allow users to exchange crypto-tokens without the need for a matching exchange order. AMMs are one of the most successful DeFi use cases: indeed, major AMM platforms process a daily volume of transactions worth USD billions. Despite their popularity, AMMs are well-known to suffer from transaction-ordering issues: adversaries can influence the ordering of user transactions, and possibly front-run them with their own, to extract value from AMMs, to the detriment of users. We devise an effective procedure to construct a strategy through which an adversary can maximize the value extracted from user transactions.
Submitted 19 July, 2022; v1 submitted 2 June, 2021;
originally announced June 2021.
-
Spin Wave Interference Detection via Inverse Spin Hall Effect
Authors:
Michael Balynskiy,
Howard Chiang,
David Gutierrez,
Alexander Khitun
Abstract:
In this letter, we present experimental data demonstrating spin wave interference detection using the inverse spin Hall effect (ISHE). Two coherent spin waves are excited in a yttrium-iron garnet (YIG) waveguide by continuous microwave signals. The initial phase difference between the spin waves is controlled by the external phase shifter. The ISHE voltage is detected at distances of 2 mm and 4 mm away from the spin wave generating antennae by an attached Pt layer. Experimental data show ISHE voltage oscillation as a function of the phase difference between the two interfering spin waves. This experiment demonstrates an intriguing possibility of using ISHE in spin wave logic circuits to convert spin wave phase into an electric signal.
Submitted 6 May, 2021;
originally announced May 2021.
-
Learning from 2D: Contrastive Pixel-to-Point Knowledge Transfer for 3D Pretraining
Authors:
Yueh-Cheng Liu,
Yu-Kai Huang,
Hung-Yueh Chiang,
Hung-Ting Su,
Zhe-Yu Liu,
Chin-Tang Chen,
Ching-Yu Tseng,
Winston H. Hsu
Abstract:
Most 3D neural networks are trained from scratch owing to the lack of large-scale labeled 3D datasets. In this paper, we present a novel 3D pretraining method by leveraging 2D networks learned from rich 2D datasets. We propose the contrastive pixel-to-point knowledge transfer to effectively utilize the 2D information by mapping the pixel-level and point-level features into the same embedding space. Due to the heterogeneous nature between 2D and 3D networks, we introduce the back-projection function to align the features between 2D and 3D to make the transfer possible. Additionally, we devise an upsampling feature projection layer to increase the spatial resolution of high-level 2D feature maps, which enables learning fine-grained 3D representations. With a pretrained 2D network, the proposed pretraining process requires no additional 2D or 3D labeled data, further alleviating the expensive 3D data annotation cost. To the best of our knowledge, we are the first to exploit existing 2D trained weights to pretrain 3D deep neural networks. Our intensive experiments show that the 3D models pretrained with 2D knowledge boost the performances of 3D networks across various real-world 3D downstream tasks.
Submitted 27 December, 2021; v1 submitted 10 April, 2021;
originally announced April 2021.
-
The XFaster Power Spectrum and Likelihood Estimator for the Analysis of Cosmic Microwave Background Maps
Authors:
A. E. Gambrel,
A. S. Rahlin,
X. Song,
C. R. Contaldi,
P. A. R. Ade,
M. Amiri,
S. J. Benton,
A. S. Bergman,
R. Bihary,
J. J. Bock,
J. R. Bond,
J. A. Bonetti,
S. A. Bryan,
H. C. Chiang,
A. J. Duivenvoorden,
H. K. Eriksen,
M. Farhang,
J. P. Filippini,
A. A. Fraisse,
K. Freese,
M. Galloway,
N. N. Gandilo,
R. Gualtieri,
J. E. Gudmundsson,
M. Halpern, et al. (42 additional authors not shown)
Abstract:
We present the XFaster analysis package. XFaster is a fast, iterative angular power spectrum estimator based on a diagonal approximation to the quadratic Fisher matrix estimator. XFaster uses Monte Carlo simulations to compute noise biases and filter transfer functions and is thus a hybrid of both Monte Carlo and quadratic estimator methods. In contrast to conventional pseudo-$C_\ell$ based methods, the algorithm described here requires a minimal number of simulations, and does not require them to be precisely representative of the data to estimate accurate covariance matrices for the bandpowers. The formalism works with polarization-sensitive observations and also data sets with identical, partially overlapping, or independent survey regions. The method was first implemented for the analysis of BOOMERanG data, and also used as part of the Planck analysis. Here, we describe the full, publicly available analysis package, written in Python, as developed for the analysis of data from the 2015 flight of the SPIDER instrument. The package includes extensions for self-consistently estimating null spectra and for estimating fits for Galactic foreground contributions. We show results from the extensive validation of XFaster using simulations, and its application to the SPIDER data set.
Submitted 24 May, 2021; v1 submitted 2 April, 2021;
originally announced April 2021.
-
A Constraint on Primordial $B$-Modes from the First Flight of the SPIDER Balloon-Borne Telescope
Authors:
SPIDER Collaboration,
P. A. R. Ade,
M. Amiri,
S. J. Benton,
A. S. Bergman,
R. Bihary,
J. J. Bock,
J. R. Bond,
J. A. Bonetti,
S. A. Bryan,
H. C. Chiang,
C. R. Contaldi,
O. Doré,
A. J. Duivenvoorden,
H. K. Eriksen,
M. Farhang,
J. P. Filippini,
A. A. Fraisse,
K. Freese,
M. Galloway,
A. E. Gambrel,
N. N. Gandilo,
K. Ganga,
R. Gualtieri,
J. E. Gudmundsson, et al. (46 additional authors not shown)
Abstract:
We present the first linear polarization measurements from the 2015 long-duration balloon flight of SPIDER, an experiment designed to map the polarization of the cosmic microwave background (CMB) on degree angular scales. Results from these measurements include maps and angular power spectra from observations of 4.8% of the sky at 95 and 150 GHz, along with the results of internal consistency tests on these data. While the polarized CMB anisotropy from primordial density perturbations is the dominant signal in this region of sky, Galactic dust emission is also detected with high significance; Galactic synchrotron emission is found to be negligible in the SPIDER bands. We employ two independent foreground-removal techniques in order to explore the sensitivity of the cosmological result to the assumptions made by each. The primary method uses a dust template derived from Planck data to subtract the Galactic dust signal. A second approach, employing a joint analysis of SPIDER and Planck data in the harmonic domain, assumes a modified-blackbody model for the spectral energy distribution of the dust with no constraint on its spatial morphology. Using a likelihood that jointly samples the template amplitude and $r$ parameter space, we derive 95% upper limits on the primordial tensor-to-scalar ratio from Feldman-Cousins and Bayesian constructions, finding $r<0.11$ and $r<0.19$, respectively. Roughly half the uncertainty in $r$ derives from noise associated with the template subtraction. New data at 280 GHz from SPIDER's second flight will complement the Planck polarization maps, providing powerful measurements of the polarized Galactic dust emission.
Submitted 24 March, 2021;
originally announced March 2021.
-
Algorithmic subsampling under multiway clustering
Authors:
Harold D. Chiang,
Jiatong Li,
Yuya Sasaki
Abstract:
This paper proposes a novel method of algorithmic subsampling (data sketching) for multiway cluster dependent data. We establish a new uniform weak law of large numbers and a new central limit theorem for the multiway algorithmic subsample means. Consequently, we discover an additional advantage of the algorithmic subsampling that it allows for robustness against potential degeneracy, and even non-Gaussian degeneracy, of the asymptotic distribution under multiway clustering. Simulation studies support this novel result, and demonstrate that inference with the algorithmic subsampling entails more accuracy than that without the algorithmic subsampling. Applying these basic asymptotic theories, we derive the consistency and the asymptotic normality for the multiway algorithmic subsampling generalized method of moments estimator and for the multiway algorithmic subsampling M-estimator. We illustrate an application to scanner data.
Submitted 30 October, 2022; v1 submitted 28 February, 2021;
originally announced March 2021.
-
A theory of Automated Market Makers in DeFi
Authors:
Massimo Bartoletti,
James Hsin-yu Chiang,
Alberto Lluch-Lafuente
Abstract:
Automated market makers (AMMs) are one of the most prominent decentralized finance (DeFi) applications. AMMs allow users to trade different types of crypto-tokens, without the need to find a counter-party. There are several implementations and models for AMMs, featuring a variety of sophisticated economic mechanisms. We present a theory of AMMs. The core of our theory is an abstract operational model of the interactions between users and AMMs, which can be concretised by instantiating the economic mechanisms. We exploit our theory to formally prove a set of fundamental properties of AMMs, characterizing both structural and economic aspects. We do this by abstracting from the actual economic mechanisms used in implementations, and identifying sufficient conditions which ensure the relevant properties. Notably, we devise a general solution to the arbitrage problem, the main game-theoretic foundation behind the economic mechanisms of AMMs.
Submitted 16 December, 2022; v1 submitted 22 February, 2021;
originally announced February 2021.
-
Linear programming approach to nonparametric inference under shape restrictions: with an application to regression kink designs
Authors:
Harold D. Chiang,
Kengo Kato,
Yuya Sasaki,
Takuya Ura
Abstract:
We develop a novel method of constructing confidence bands for nonparametric regression functions under shape constraints. This method can be implemented via linear programming, and it is thus computationally appealing. We illustrate the use of our proposed method with an application to the regression kink design (RKD). Econometric analyses based on the RKD often suffer from wide confidence intervals due to slow convergence rates of nonparametric derivative estimators. We demonstrate that economic models and structures motivate shape restrictions, which in turn contribute to shrinking the confidence interval for an analysis of the causal effects of unemployment insurance benefits on unemployment durations.
Submitted 12 February, 2021;
originally announced February 2021.