Comparison of methods for mediation analysis with multiple correlated mediators

Authors: Mary Appah, D. Leann Long, George Howard, Melissa J. Smith

Abstract: Various methods have emerged for conducting mediation analyses with multiple correlated mediators, each with distinct strengths and limitations. However, a comparative evaluation of these methods is lacking, providing the motivation for this paper. This study examines six mediation analysis methods for multiple correlated mediators that provide insights to the contributors for health disparities. We assessed the performance of each method in identifying joint or path-specific mediation effects in the context of binary outcome variables varying mediator types and levels of residual correlation between mediators. Through comprehensive simulations, the performance of six methods in estimating joint and/or path-specific mediation effects was assessed rigorously using a variety of metrics including bias, mean squared error, coverage and width of the 95$\%$ confidence intervals. Subsequently, these methods were applied to the REasons for Geographic And Racial Differences in Stroke (REGARDS) study, where differing conclusions were obtained depending on the mediation method employed. This evaluation provides valuable guidance for researchers grappling with complex multi-mediator scenarios, enabling them to select an optimal mediation method for their research question and dataset.

Submitted 23 June, 2024; originally announced June 2024.

arXiv:2405.14930 [pdf, other]

AstroPT: Scaling Large Observation Models for Astronomy

Authors: Michael J. Smith, Ryan J. Roberts, Eirini Angeloudi, Marc Huertas-Company

Abstract: This work presents AstroPT, an autoregressive pretrained transformer developed with astronomical use-cases in mind. The AstroPT models presented here have been pretrained on 8.6 million $512 \times 512$ pixel $grz$-band galaxy postage stamp observations from the DESI Legacy Survey DR8. We train a selection of foundation models of increasing size from 1 million to 2.1 billion parameters, and find that AstroPT follows a similar saturating log-log scaling law to textual models. We also find that the models' performances on downstream tasks as measured by linear probing improves with model size up to the model parameter saturation point. We believe that collaborative community development paves the best route towards realising an open source `Large Observation Model' -- a model trained on data taken from the observational sciences at the scale seen in natural language processing. To this end, we release the source code, weights, and dataset for AstroPT under the MIT license, and invite potential collaborators to join us in collectively building and researching these models.

Submitted 23 May, 2024; originally announced May 2024.

Comments: 12 pages, 4 figures, 1 table. Code available at https://github.com/Smith42/astroPT

arXiv:2401.01916 [pdf, other]

AstroLLaMA-Chat: Scaling AstroLLaMA with Conversational and Diverse Datasets

Authors: Ernest Perkowski, Rui Pan, Tuan Dung Nguyen, Yuan-Sen Ting, Sandor Kruk, Tong Zhang, Charlie O'Neill, Maja Jablonska, Zechang Sun, Michael J. Smith, Huiling Liu, Kevin Schawinski, Kartheik Iyer, Ioana Ciucă for UniverseTBD

Abstract: We explore the potential of enhancing LLM performance in astronomy-focused question-answering through targeted, continual pre-training. By employing a compact 7B-parameter LLaMA-2 model and focusing exclusively on a curated set of astronomy corpora -- comprising abstracts, introductions, and conclusions -- we achieve notable improvements in specialized topic comprehension. While general LLMs like GPT-4 excel in broader question-answering scenarios due to superior reasoning capabilities, our findings suggest that continual pre-training with limited resources can still enhance model performance on specialized topics. Additionally, we present an extension of AstroLLaMA: the fine-tuning of the 7B LLaMA model on a domain-specific conversational dataset, culminating in the release of the chat-enabled AstroLLaMA for community use. Comprehensive quantitative benchmarking is currently in progress and will be detailed in an upcoming full paper. The model, AstroLLaMA-Chat, is now available at https://huggingface.co/universeTBD, providing the first open-source conversational AI tool tailored for the astronomy community.

Submitted 5 January, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

Comments: 4 pages, 1 figure, model is available at https://huggingface.co/universeTBD, published in RNAAS

arXiv:2310.18703 [pdf, other]

Detangling the role of climate in vegetation productivity with an explainable convolutional neural network

Authors: Ricardo Barros Lourenço, Michael J. Smith, Sylvia Smullin, Umangi Jain, Alemu Gonsamo, Arthur Ouaknine

Abstract: Forests of the Earth are a vital carbon sink while providing an essential habitat for biodiversity. Vegetation productivity (VP) is a critical indicator of carbon uptake in the atmosphere. The leaf area index is a crucial vegetation index used in VP estimation. This work proposes to predict the leaf area index (LAI) using climate variables to better understand future productivity dynamics; our approach leverages the capacities of the V-Net architecture for spatiotemporal LAI prediction. Preliminary results are well-aligned with established quality standards of LAI products estimated from Earth observation data. We hope that this work serves as a robust foundation for subsequent research endeavours, particularly for the incorporation of prediction attribution methodologies, which hold promise for elucidating the underlying climate change drivers of global vegetation productivity.

Submitted 28 October, 2023; originally announced October 2023.

Comments: 7 pages, 2 figures, submitted to Tackling Climate Change with Machine Learning at NeurIPS 2023

arXiv:2309.07207 [pdf, other]

EarthPT: a time series foundation model for Earth Observation

Authors: Michael J. Smith, Luke Fleming, James E. Geach

Abstract: We introduce EarthPT -- an Earth Observation (EO) pretrained transformer. EarthPT is a 700 million parameter decoding transformer foundation model trained in an autoregressive self-supervised manner and developed specifically with EO use-cases in mind. We demonstrate that EarthPT is an effective forecaster that can accurately predict future pixel-level surface reflectances across the 400-2300 nm range well into the future. For example, forecasts of the evolution of the Normalised Difference Vegetation Index (NDVI) have a typical error of approximately 0.05 (over a natural range of -1 -> 1) at the pixel level over a five month test set horizon, out-performing simple phase-folded models based on historical averaging. We also demonstrate that embeddings learnt by EarthPT hold semantically meaningful information and could be exploited for downstream tasks such as highly granular, dynamic land use classification. Excitingly, we note that the abundance of EO data provides us with -- in theory -- quadrillions of training tokens. Therefore, if we assume that EarthPT follows neural scaling laws akin to those derived for Large Language Models (LLMs), there is currently no data-imposed limit to scaling EarthPT and other similar `Large Observation Models.'

Submitted 11 January, 2024; v1 submitted 13 September, 2023; originally announced September 2023.

Comments: 7 pages, 4 figures, accepted to NeurIPS CCAI workshop at https://www.climatechange.ai/papers/neurips2023/2 . Code available at https://github.com/aspiaspace/EarthPT

arXiv:2308.07253 [pdf, other]

Path-specific causal decomposition analysis with multiple correlated mediator variables

Authors: Melissa J. Smith, Leslie A. McClure, D. Leann Long

Abstract: A causal decomposition analysis allows researchers to determine whether the difference in a health outcome between two groups can be attributed to a difference in each group's distribution of one or more modifiable mediator variables. With this knowledge, researchers and policymakers can focus on designing interventions that target these mediator variables. Existing methods for causal decomposition analysis either focus on one mediator variable or assume that each mediator variable is conditionally independent given the group label and the mediator-outcome confounders. In this paper, we propose a flexible causal decomposition analysis method that can accommodate multiple correlated and interacting mediator variables, which are frequently seen in studies of health behaviors and studies of environmental pollutants. We extend a Monte Carlo-based causal decomposition analysis method to this setting by using a multivariate mediator model that can accommodate any combination of binary and continuous mediator variables. Furthermore, we state the causal assumptions needed to identify both joint and path-specific decomposition effects through each mediator variable. To illustrate the reduction in bias and confidence interval width of the decomposition effects under our proposed method, we perform a simulation study. We also apply our approach to examine whether differences in smoking status and dietary inflammation score explain any of the Black-White differences in incident diabetes using data from a national cohort study.

Submitted 14 August, 2023; originally announced August 2023.

arXiv:2308.06831 [pdf]

Population-average mediation analysis for zero-inflated count outcomes

Authors: Andrew Sims, D. Leann Long, Hemant K. Tiwari, Jinhong Cui, Dustin M. Long, Todd M. Brown, Melissa J. Smith, Emily B. Levitan

Abstract: Mediation analysis is an increasingly popular statistical method for explaining causal pathways to inform intervention. While methods have increased, there is still a dearth of robust mediation methods for count outcomes with excess zeroes. Current mediation methods addressing this issue are computationally intensive, biased, or challenging to interpret. To overcome these limitations, we propose a new mediation methodology for zero-inflated count outcomes using the marginalized zero-inflated Poisson (MZIP) model and the counterfactual approach to mediation. This novel work gives population-average mediation effects whose variance can be estimated rapidly via the delta method. This methodology is extended to cases with exposure-mediator interactions. We apply this novel methodology to explore if diabetes diagnosis can explain BMI differences in healthcare utilization and test model performance via simulations comparing the proposed MZIP method to existing zero-inflated and Poisson methods. We find that our proposed method minimizes bias and computation time compared to alternative approaches while allowing for straightforward interpretations.

Submitted 13 August, 2023; originally announced August 2023.

Comments: 34 pages, 2 figures, 4 tables, 49 pages of Supplemental material, 2 supplemental figures

arXiv:2303.07329 [pdf, other]

Application of targeted maximum likelihood estimation in public health and epidemiological studies: a systematic review

Authors: Matthew J. Smith, Rachael V. Phillips, Miguel Angel Luque-Fernandez, Camille Maringe

Abstract: The Targeted Maximum Likelihood Estimation (TMLE) statistical data analysis framework integrates machine learning, statistical theory, and statistical inference to provide a least biased, efficient and robust strategy for estimation and inference of a variety of statistical and causal parameters. We describe and evaluate the epidemiological applications that have benefited from recent methodological developments. We conducted a systematic literature review in PubMed for articles that applied any form of TMLE in observational studies. We summarised the epidemiological discipline, geographical location, expertise of the authors, and TMLE methods over time. We used the Roadmap of Targeted Learning and Causal Inference to extract key methodological aspects of the publications. We showcase the contributions to the literature of these TMLE results. Of the 81 publications included, 25% originated from the University of California at Berkeley, where the framework was first developed by Professor Mark van der Laan. By the first half of 2022, 70% of the publications originated from outside the United States and explored up to 7 different epidemiological disciplines in 2021-22. Double-robustness, bias reduction and model misspecification were the main motivations that drew researchers towards the TMLE framework. Through time, a wide variety of methodological, tutorial and software-specific articles were cited, owing to the constant growth of methodological developments around TMLE.
There is a clear dissemination trend of the TMLE framework to various epidemiological disciplines and to increasing numbers of geographical areas. The availability of R packages, publication of tutorial papers, and involvement of methodological experts in applied publications have contributed to an exponential increase in the number of studies that understood the benefits, and adoption, of TMLE.

Submitted 13 March, 2023; originally announced March 2023.

Comments: 42 pages, 2 figures, 2 tables

arXiv:2302.12537 [pdf, other]

Why Target Networks Stabilise Temporal Difference Methods

Authors: Mattie Fellows, Matthew J. A. Smith, Shimon Whiteson

Abstract: Integral to recent successes in deep reinforcement learning has been a class of temporal difference methods that use infrequently updated target values for policy evaluation in a Markov Decision Process. Yet a complete theoretical explanation for the effectiveness of target networks remains elusive. In this work, we provide an analysis of this popular class of algorithms, to finally answer the question: `why do target networks stabilise TD learning'? To do so, we formalise the notion of a partially fitted policy evaluation method, which describes the use of target networks and bridges the gap between fitted methods and semigradient temporal difference algorithms. Using this framework we are able to uniquely characterise the so-called deadly triad - the use of TD updates with (nonlinear) function approximation and off-policy data - which often leads to nonconvergent algorithms. This insight leads us to conclude that the use of target networks can mitigate the effects of poor conditioning in the Jacobian of the TD update. Instead, we show that under mild regularity conditions and a well tuned target network update frequency, convergence can be guaranteed even in the extremely challenging off-policy sampling and nonlinear function approximation setting.

Submitted 11 August, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

Comments: Found a small error in Appendix (Proposition 1, Appendix B3, penultimate line) that affects results presented in the original submission. These have been fixed and this version is the one accepted at ICML 2023

Journal ref: ICML 2023

arXiv:2302.08091 [pdf, other]

Do We Still Need Clinical Language Models?

Authors: Eric Lehman, Evan Hernandez, Diwakar Mahajan, Jonas Wulff, Micah J. Smith, Zachary Ziegler, Daniel Nadler, Peter Szolovits, Alistair Johnson, Emily Alsentzer

Abstract: Although recent advances in scaling large language models (LLMs) have resulted in improvements on many NLP tasks, it remains unclear whether these models trained primarily with general web text are the right tool in highly specialized, safety critical domains such as clinical text. Recent results have suggested that LLMs encode a surprising amount of medical knowledge. This raises an important question regarding the utility of smaller domain-specific language models. With the success of general-domain LLMs, is there still a need for specialized clinical models? To investigate this question, we conduct an extensive empirical analysis of 12 language models, ranging from 220M to 175B parameters, measuring their performance on 3 different clinical tasks that test their ability to parse and reason over electronic health records. As part of our experiments, we train T5-Base and T5-Large models from scratch on clinical notes from MIMIC III and IV to directly investigate the efficiency of clinical tokens. We show that relatively small specialized clinical models substantially outperform all in-context learning approaches, even when finetuned on limited annotated data. Further, we find that pretraining on clinical tokens allows for smaller, more parameter-efficient models that either match or outperform much larger language models trained on general text. We release the code and the models used under the PhysioNet Credentialed Health Data license and data use agreement.

Submitted 16 February, 2023; originally announced February 2023.

arXiv:2211.14556 [pdf, other]

Multiple imputation for logistic regression models: incorporating an interaction

Authors: Matthew J. Smith, Matteo Quartagno, Edmund Njeru Njagi

Abstract: Background: Multiple imputation is often used to reduce bias and gain efficiency when there is missing data. The most appropriate imputation method depends on the model the analyst is interested in fitting. Several imputation approaches have been proposed for when this model is a logistic regression model with an interaction term that contains a binary partially observed variable; however, it is not clear which performs best under certain parameter settings. Methods: Using 1000 simulations, each with 10,000 observations, under six data-generating mechanisms (DGM), we investigate the performance of four methods: (i) 'passive imputation', (ii) 'just another variable' (JAV), (iii) 'stratify-impute-append' (SIA), and (iv) 'substantive model compatible fully conditional specification' (SMCFCS). The application of each method is shown in an empirical example using England-based cancer registry data. Results: SMCFCS and SIA showed the least biased estimate of the coefficients for the fully, and partially, observed variable and the interaction term. SMCFCS and SIA showed good coverage and low relative error for all DGMs. SMCFCS had a large bias when there was a low prevalence of the fully observed variable in the interaction. SIA performed poorly when the fully observed variable in the interaction had a continuous underlying form. Conclusion: SMCFCS and SIA give consistent estimation for logistic regression models with an interaction term when data are missing at random, and either can be used in most analyses.
SMCFCS performed better than SIA when the fully observed variable in the interaction had an underlying continuous form. Researchers should be cautious when using SMCFCS when there is a low prevalence of the fully observed variable in the interaction.

Submitted 26 November, 2022; originally announced November 2022.

Comments: 26 pages, 9 figures, 4 tables

arXiv:2211.03796 [pdf, other]

doi 10.1098/rsos.221454

Astronomia ex machina: a history, primer, and outlook on neural networks in astronomy

Authors: Michael J. Smith, James E. Geach

Abstract: In this review, we explore the historical development and future prospects of artificial intelligence (AI) and deep learning in astronomy. We trace the evolution of connectionism in astronomy through its three waves, from the early use of multilayer perceptrons, to the rise of convolutional and recurrent neural networks, and finally to the current era of unsupervised and generative deep learning methods. With the exponential growth of astronomical data, deep learning techniques offer an unprecedented opportunity to uncover valuable insights and tackle previously intractable problems. As we enter the anticipated fourth wave of astronomical connectionism, we argue for the adoption of GPT-like foundation models fine-tuned for astronomical applications. Such models could harness the wealth of high-quality, multimodal astronomical data to serve state-of-the-art downstream tasks. To keep pace with advancements driven by Big Tech, we propose a collaborative, open-source approach within the astronomy community to develop and maintain these foundation models, fostering a symbiotic relationship between AI and astronomy that capitalizes on the unique strengths of both fields.

Submitted 12 May, 2023; v1 submitted 7 November, 2022; originally announced November 2022.

Comments: 75 pages, 327 references, 32 figures. Review accepted in Royal Society Open Science

arXiv:2206.15310 [pdf, other]

The Delta-Method and Influence Function in Medical Statistics: a Reproducible Tutorial

Authors: Rodrigo Zepeda-Tello, Michael Schomaker, Camille Maringe, Matthew J. Smith, Aurelien Belot, Bernard Rachet, Mireille E. Schnitzer, Miguel Angel Luque-Fernandez

Abstract: Approximate statistical inference via determination of the asymptotic distribution of a statistic is routinely used for inference in applied medical statistics (e.g. to estimate the standard error of the marginal or conditional risk ratio). One method for variance estimation is the classical Delta-method but there is a knowledge gap as this method is not routinely included in training for applied medical statistics and its uses are not widely understood. Given that a smooth function of an asymptotically normal estimator is also asymptotically normally distributed, the Delta-method allows approximating the large-sample variance of a function of an estimator with known large-sample properties. In a more general setting, it is a technique for approximating the variance of a functional (i.e., an estimand) that takes a function as an input and applies another function to it (e.g. the expectation function). Specifically, we may approximate the variance of the function using the functional Delta-method based on the influence function (IF). The IF explores how a functional $φ(θ)$ changes in response to small perturbations in the sample distribution of the estimator and allows computing the empirical standard error of the distribution of the functional. The ongoing development of new methods and techniques may pose a challenge for applied statisticians who are interested in mastering the application of these methods. In this tutorial, we review the use of the classical and functional Delta-method and their links to the IF from a practical perspective.
We illustrate the methods using a cancer epidemiology example and we provide reproducible and commented code in R and Python using symbolic programming. The code can be accessed at https://github.com/migariane/DeltaMethodInfluenceFunction

Submitted 30 June, 2022; originally announced June 2022.

arXiv:2204.02840 [pdf, other]

Asymptotics of the meta-atom: plane wave scattering by a single Helmholtz resonator

Authors: M. J. A. Smith, P. A. Cotterill, D. Nigro, W. J. Parnell, I. D. Abrahams

Abstract: Using a combination of multipole methods and the method of matched asymptotics, we present a solution procedure for acoustic plane wave scattering by a single Helmholtz resonator in two dimensions. Closed-form representations for the multipole scattering coefficients of the resonator are derived, valid at low frequencies, with three fundamental configurations examined in detail: the thin-walled, moderately thick-walled, and very thick-walled limits. Additionally, we examine the impact of dissipation for very thick-walled resonators, and also numerically evaluate the scattering, absorption, and extinction cross sections (efficiencies) for representative resonators in all three wall thickness regimes. In general, we observe strong enhancement in both the scattered fields and cross sections at the Helmholtz resonance frequencies. As expected, dissipation is shown to shift the resonance frequency, reduce the amplitude of the field, and reduce the extinction efficiency at the fundamental Helmholtz resonance. Finally, we confirm results in the literature on Willis-like coupling effects for this resonator design, and crucially, connect these findings to earlier works by the authors on two-dimensional arrays of resonators, deducing that depolarisability effects (off-diagonal terms) for a single resonator do not ensure the existence of Willis coupling effects (bianisotropy) in bulk.

Submitted 6 April, 2022; originally announced April 2022.

Comments: 24 pages, 7 figures

arXiv:2202.09943 [pdf, other]

Two-dimensional Helmholtz resonator arrays. Part II. Matched asymptotic expansions for specially-scaled resonators

Authors: M. J. A. Smith, I. D. Abrahams

Abstract: We present a solution method which combines the method of matched asymptotics with the method of multipole expansions to determine the band structure of cylindrical Helmholtz resonator arrays in two dimensions. The resonator geometry is considered in the limit as the wall thickness becomes very large compared with the aperture width (the specially-scaled limit). In this regime, the existing treatment in Part I, with updated parameters, is found to return spurious spectral behaviour. We derive a regularised system which overcomes this issue and also derive compact asymptotic descriptions for the low-frequency dispersion equation in this setting. In the specially-scaled limit, our asymptotic dispersion equation not only recovers the first band surface but also extends to high, but still subwavelength, frequencies. A homogenisation treatment is outlined for describing the effective bulk modulus and effective density tensor of the resonator array for all wall thicknesses. We demonstrate that specially-scaled resonators are able to achieve exceptionally low Helmholtz resonant frequencies, and present closed-form expressions for determining these explicitly. We anticipate that the analytical expressions and the formulation outlined here may prove useful in industrial and other applications.

Submitted 20 February, 2022; originally announced February 2022.

Comments: 27 pages, 10 figures

arXiv:2202.09941 [pdf, other]

Two-dimensional Helmholtz resonator arrays. Part I. Matched asymptotic expansions for thick- and thin-walled resonators

Authors: M. J. A. Smith, I. D. Abrahams

Abstract: We present a novel multipole formulation for computing the band structures of two-dimensional arrays of cylindrical Helmholtz resonators. This formulation is derived by combining existing multipole methods for arrays of ideal cylinders with the method of matched asymptotic expansions. We construct asymptotically close representations for the dispersion equations of the first band surface, correcting and extending an established lowest-order (isotropic) result in the literature for thin-walled resonator arrays. The descriptions we obtain for the first band are accurate over a relatively broad frequency and Bloch vector range and not simply in the long-wavelength and low-frequency regime, as is the case in many classical treatments. Crucially, we are able to capture features of the first band, such as low-frequency anisotropy, over a broad range of filling fractions, wall thicknesses, and aperture angles. In addition to describing the first band we use our formulation to compute the first band gap for both thick- and thin-walled resonators, and find that thicker resonator walls correspond to both a narrowing of the first band gap and an increase in the central band gap frequency.

Submitted 20 February, 2022; originally announced February 2022.

Comments: 28 pages, 11 figures

arXiv:2111.01713 [pdf, other]

doi 10.1093/mnras/stac130

Realistic galaxy image simulation via score-based generative models

Authors: Michael J. Smith, James E. Geach, Ryan A. Jackson, Nikhil Arora, Connor Stone, Stéphane Courteau

Abstract: We show that a Denoising Diffusion Probabilistic Model (DDPM), a class of score-based generative model, can be used to produce realistic mock images that mimic observations of galaxies. Our method is tested with Dark Energy Spectroscopic Instrument (DESI) grz imaging of galaxies from the Photometry and Rotation curve OBservations from Extragalactic Surveys (PROBES) sample and galaxies selected from the Sloan Digital Sky Survey. Subjectively, the generated galaxies are highly realistic when compared with samples from the real dataset. We quantify the similarity by borrowing from the deep generative learning literature, using the 'Fréchet Inception Distance' to test for subjective and morphological similarity. We also introduce the 'Synthetic Galaxy Distance' metric to compare the emergent physical properties (such as total magnitude, colour and half light radius) of a ground truth parent and synthesised child dataset. We argue that the DDPM approach produces sharper and more realistic images than other generative methods such as Adversarial Networks (with the downside of more costly inference), and could be used to produce large samples of synthetic observations tailored to a specific imaging survey. We demonstrate two potential uses of the DDPM: (1) accurate in-painting of occluded data, such as satellite trails, and (2) domain transfer, where new input images can be processed to mimic the properties of the DDPM training set. Here we 'DESI-fy' cartoon images as a proof of concept for domain transfer. Finally, we suggest potential applications for score-based approaches that could motivate further research on this topic within the astronomical community.

Submitted 31 January, 2022; v1 submitted 2 November, 2021; originally announced November 2021.

Comments: 11 pages, 8 figures. Code: https://github.com/smith42/astroddpm . Follow the Twitter bot @ThisIsNotAnApod for DDPM-generated APODs

arXiv:2103.15787 [pdf, other]

Meeting in the notebook: a notebook-based environment for micro-submissions in data science collaborations

Authors: Micah J. Smith, Jürgen Cito, Kalyan Veeramachaneni

Abstract: Developers in data science and other domains frequently use computational notebooks to create exploratory analyses and prototype models. However, they often struggle to incorporate existing software engineering tooling into these notebook-based workflows, leading to fragile development processes. We introduce Assemblé, a new development environment for collaborative data science projects, in which promising code fragments of data science pipelines can be contributed as pull requests to an upstream repository entirely from within JupyterLab, abstracting away low-level version control tool usage. We describe the design and implementation of Assemblé and report on a user study of 23 data scientists.

Submitted 29 March, 2021; originally announced March 2021.

arXiv:2012.09920 [pdf, other]

Tutorial: Introduction to computational causal inference using reproducible Stata, R and Python code

Authors: Matthew J. Smith, Camille Maringe, Bernard Rachet, Mohammad A. Mansournia, Paul N. Zivich, Stephen R. Cole, Miguel Angel Luque-Fernandez

Abstract: The purpose of many health studies is to estimate the effect of an exposure on an outcome. It is not always ethical to assign an exposure to individuals in randomised controlled trials; instead, observational data and appropriate study design must be used. There are major challenges with observational studies, one of which is confounding, which can lead to biased estimates of the causal effects. Controlling for confounding is commonly performed by simple adjustment for measured confounders, although this is often not enough. Recent advances in the field of causal inference have dealt with confounding by building on classical standardisation methods. However, these recent advances have progressed quickly with a relative paucity of computational-oriented applied tutorials, contributing to some confusion in the use of these methods among applied researchers. In this tutorial, we show the computational implementation of different causal inference estimators from a historical perspective where different estimators were developed to overcome the limitations of the previous one. Furthermore, we also briefly introduce the potential outcomes framework, illustrate the use of different methods using an example from the health care setting, and most importantly, we provide reproducible and commented code in Stata, R and Python for researchers to apply in their own observational study. The code can be accessed at https://github.com/migariane/TutorialCausalInferenceEstimators

Submitted 21 December, 2020; v1 submitted 17 December, 2020; originally announced December 2020.

arXiv:2012.07816 [pdf, other]

doi 10.1145/3479575

Enabling Collaborative Data Science Development with the Ballet Framework

Authors: Micah J. Smith, Jürgen Cito, Kelvin Lu, Kalyan Veeramachaneni

Abstract: While the open-source software development model has led to successful large-scale collaborations in building software systems, data science projects are frequently developed by individuals or small teams. We describe challenges to scaling data science collaborations and present a conceptual framework and ML programming model to address them. We instantiate these ideas in Ballet, a lightweight framework for collaborative, open-source data science through a focus on feature engineering, and an accompanying cloud-based development environment. Using our framework, collaborators incrementally propose feature definitions to a repository which are each subjected to an ML performance evaluation and can be automatically merged into an executable feature engineering pipeline. We leverage Ballet to conduct a case study analysis of an income prediction problem with 27 collaborators, and discuss implications for future designers of collaborative projects.

Submitted 22 October, 2021; v1 submitted 14 December, 2020; originally announced December 2020.

Journal ref: Proc. ACM Hum.-Comput. Interact. 5, CSCW2, Article 431 (October 2021), 39 pages

arXiv:2010.10777 [pdf, other]

AutoML to Date and Beyond: Challenges and Opportunities

Authors: Shubhra Kanti Karmaker Santu, Md. Mahadi Hassan, Micah J. Smith, Lei Xu, ChengXiang Zhai, Kalyan Veeramachaneni

Abstract: As big data becomes ubiquitous across domains, and more and more stakeholders aspire to make the most of their data, demand for machine learning tools has spurred researchers to explore the possibilities of automated machine learning (AutoML). AutoML tools aim to make machine learning accessible for non-machine learning experts (domain experts), to improve the efficiency of machine learning, and to accelerate machine learning research. But although automation and efficiency are among AutoML's main selling points, the process still requires human involvement at a number of vital steps, including understanding the attributes of domain-specific data, defining prediction problems, creating a suitable training data set, and selecting a promising machine learning technique. These steps often require a prolonged back-and-forth that makes this process inefficient for domain experts and data scientists alike, and keeps so-called AutoML systems from being truly automatic. In this review article, we introduce a new classification system for AutoML systems, using a seven-tiered schematic to distinguish these systems based on their level of autonomy. We begin by describing what an end-to-end machine learning pipeline actually looks like, and which subtasks of the machine learning pipeline have been automated so far. We highlight those subtasks which are still done manually - generally by a data scientist - and explain how this limits domain experts' access to machine learning. Next, we introduce our novel level-based taxonomy for AutoML systems and define each level according to the scope of automation support provided. Finally, we lay out a roadmap for the future, pinpointing the research required to further automate the end-to-end machine learning pipeline and discussing important challenges that stand in the way of this ambitious goal.

Submitted 19 May, 2021; v1 submitted 21 October, 2020; originally announced October 2020.

Comments: 35 pages, survey article, 3 figures

ACM Class: I.2

arXiv:2010.00622 [pdf, other]

doi 10.1093/mnras/stab424

Pix2Prof: fast extraction of sequential information from galaxy imagery via a deep natural language 'captioning' model

Authors: Michael J. Smith, Nikhil Arora, Connor Stone, Stéphane Courteau, James E. Geach

Abstract: We present 'Pix2Prof', a deep learning model that can eliminate any manual steps taken when extracting galaxy profiles. We argue that a galaxy profile of any sort is conceptually similar to a natural language image caption. This idea allows us to leverage image captioning methods from the field of natural language processing, and so we design Pix2Prof as a float sequence 'captioning' model suitable for galaxy profile inference. We demonstrate the technique by approximating a galaxy surface brightness (SB) profile fitting method that contains several manual steps. Pix2Prof processes $\sim$1 image per second on an Intel Xeon E5 2650 v3 CPU, improving on the speed of the manual interactive method by more than two orders of magnitude. Crucially, Pix2Prof requires no manual interaction, and since galaxy profile estimation is an embarrassingly parallel problem, we can further increase the throughput by running many Pix2Prof instances simultaneously. In perspective, Pix2Prof would take under an hour to infer profiles for $10^5$ galaxies on a single NVIDIA DGX-2 system. A single human expert would take approximately two years to complete the same task. Automated methodology such as this will accelerate the analysis of the next generation of large area sky surveys expected to yield hundreds of millions of targets. In such instances, all manual approaches, even those involving a large number of experts, will be impractical.

Submitted 28 April, 2021; v1 submitted 1 October, 2020; originally announced October 2020.

Comments: Accepted for publication in MNRAS. 10 pages, and 8 figures. Code: https://github.com/Smith42/pix2prof

arXiv:2009.13464 [pdf, other]

doi 10.1098/rspa.2020.0360

On the Wiener-Hopf solution of water-wave interaction with a submerged elastic or poroelastic plate

Authors: M. J. A. Smith, M. A. Peter, I. D. Abrahams, M. H. Meylan

Abstract: A solution to the problem of water-wave scattering by a semi-infinite submerged thin elastic plate, which is either porous or non-porous, is presented using the Wiener-Hopf technique. The derivation of the Wiener-Hopf equation is rather different from that which is used traditionally in water-waves problems, and it leads to the required equations directly. It is also shown how the solution can be computed straightforwardly using Cauchy-type integrals, which avoids the need to find the roots of the highly non-trivial dispersion equations. We illustrate the method with some numerical computations, focusing on the evolution of an incident wave pulse which illustrates the existence of two transmitted waves in the submerged plate system. The effect of the porosity is studied, and it is shown to influence the shorter-wavelength pulse much more strongly than the longer-wavelength pulse.

Submitted 28 September, 2020; originally announced September 2020.

Comments: 25 pages, 6 figures

Journal ref: Proc. R. Soc. A. 476:2020.0360

arXiv:2009.08470 [pdf, other]

doi 10.1093/mnras/staa3670

AstroVaDEr: Astronomical Variational Deep Embedder for Unsupervised Morphological Classification of Galaxies and Synthetic Image Generation

Authors: Ashley Spindler, James E. Geach, Michael J. Smith

Abstract: We present AstroVaDEr, a variational autoencoder designed to perform unsupervised clustering and synthetic image generation using astronomical imaging catalogues. The model is a convolutional neural network that learns to embed images into a low dimensional latent space, and simultaneously optimises a Gaussian Mixture Model (GMM) on the embedded vectors to cluster the training data. By utilising variational inference, we are able to use the learned GMM as a statistical prior on the latent space to facilitate random sampling and generation of synthetic images. We demonstrate AstroVaDEr's capabilities by training it on gray-scaled gri images from the Sloan Digital Sky Survey, using a sample of galaxies that are classified by Galaxy Zoo 2. An unsupervised clustering model is found which separates galaxies based on learned morphological features such as axis ratio, surface brightness profile, orientation and the presence of companions. We use the learned mixture model to generate synthetic images of galaxies based on the morphological profiles of the Gaussian components. AstroVaDEr succeeds in producing a morphological classification scheme from unlabelled data, but unexpectedly places high importance on the presence of companion objects, demonstrating the importance of human interpretation. The network is scalable and flexible, allowing for larger datasets to be classified, or different kinds of imaging data. We also demonstrate the generative properties of the model, which allow for realistic synthetic images of galaxies to be sampled from the learned classification scheme. These can be used to create synthetic image catalogs or to perform image processing tasks such as deblending.

Submitted 20 November, 2020; v1 submitted 17 September, 2020; originally announced September 2020.

Comments: Accepted in MNRAS. 23 Pages, 16 Figures. GitHub: https://github.com/AshleySpindler/AstroVaDEr-Public

arXiv:2005.12862 [pdf, other]

Geometrical and Mechanical Characterisation of Hollow Thermoplastic Microspheres for Syntactic Foam Applications

Authors: Matthew E. Curd, Neil F. Morrison, Michael J. A. Smith, Parmesh Gajjar, Zeshan Yousaf, William Parnell

Abstract: Recently, hollow thermoplastic microspheres, such as Expancel made by Nouryon, have emerged as an innovative filler material for use in polymer-matrix composites. The resulting all-polymer syntactic foam takes on excellent damage tolerance properties, strong recoverability under large strains, and favourable energy dissipation characteristics. Despite finding increasing usage in various industries and applications, including in coatings, films, sealants, packaging, composites for microfluidics, medical ultrasonics and cementitious composites, there is a near-complete absence of statistical geometrical information for Expancel microspheres. Further, their mechanical properties have not yet been reported. In this work we characterise the geometrical quantities of two classes of Expancel thermoplastic microspheres using X-ray computed tomography, focused ion beam and electron microscopy. We also observe the spatial distribution of microspheres within a polyurethane-matrix syntactic foam. We show that the volume-weighted polydisperse shell diameter in both classes of microsphere follows a normal distribution. Interestingly, polydispersity of the shell wall thickness is not observed and in particular the shell thickness is not correlated to the shell diameter. We employ the measured geometrical information in analytical micromechanical techniques in the small strain regime to determine, for the first time, estimates of the Young's modulus and Poisson's ratio of the microsphere shell material. Our results contribute to potential future improvements in the design and fabrication of syntactic foams that employ thermoplastic microspheres. Given the breadth of fields which utilise thermoplastic microspheres, we anticipate that our results, together with the methods used, will be of use in a much broader context in future materials research.

Submitted 20 January, 2021; v1 submitted 1 May, 2020; originally announced May 2020.

Comments: 19 pages, 7 figures, 3 tables

arXiv:1905.08942 [pdf, other]

doi 10.1145/3318464.3386146

The Machine Learning Bazaar: Harnessing the ML Ecosystem for Effective System Development

Authors: Micah J. Smith, Carles Sala, James Max Kanter, Kalyan Veeramachaneni

Abstract: As machine learning is applied more widely, data scientists often struggle to find or create end-to-end machine learning systems for specific tasks. The proliferation of libraries and frameworks and the complexity of the tasks have led to the emergence of "pipeline jungles" - brittle, ad hoc ML systems. To address these problems, we introduce the Machine Learning Bazaar, a new framework for developing machine learning and automated machine learning software systems. First, we introduce ML primitives, a unified API and specification for data processing and ML components from different software libraries. Next, we compose primitives into usable ML pipelines, abstracting away glue code, data flow, and data storage. We further pair these pipelines with a hierarchy of AutoML strategies - Bayesian optimization and bandit learning. We use these components to create a general-purpose, multi-task, end-to-end AutoML system that provides solutions to a variety of data modalities (image, text, graph, tabular, relational, etc.) and problem types (classification, regression, anomaly detection, graph matching, etc.). We demonstrate 5 real-world use cases and 2 case studies of our approach. Finally, we present an evaluation suite of 456 real-world ML tasks and describe the characteristics of 2.5 million pipelines searched over this task suite.

Submitted 7 April, 2020; v1 submitted 21 May, 2019; originally announced May 2019.

Comments: To appear in SIGMOD '20

Journal ref: In Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data (SIGMOD '20). Association for Computing Machinery, New York, NY, USA, 785-800

arXiv:1904.10286 [pdf, other]

doi 10.1093/mnras/stz2886

Generative deep fields: arbitrarily sized, random synthetic astronomical images through deep learning

Authors: Michael J. Smith, James E. Geach

Abstract: Generative Adversarial Networks (GANs) are a class of artificial neural network that can produce realistic, but artificial, images that resemble those in a training set. In typical GAN architectures these images are small, but a variant known as Spatial-GANs (SGANs) can generate arbitrarily large images, provided training images exhibit some level of periodicity. Deep extragalactic imaging surveys meet this criterion due to the cosmological tenet of isotropy. Here we train an SGAN to generate images resembling the iconic Hubble Space Telescope eXtreme Deep Field (XDF). We show that the properties of 'galaxies' in generated images have a high level of fidelity with galaxies in the real XDF in terms of abundance, morphology, magnitude distributions and colours. As a demonstration we have generated a 7.6-billion pixel 'generative deep field' spanning 1.45 degrees. The technique can be generalised to any appropriate imaging training set, offering a new purely data-driven approach for producing realistic mock surveys and synthetic data at scale, in astrophysics and beyond.

Submitted 23 April, 2019; originally announced April 2019.

Comments: Submitted to MNRAS. Comments welcome. Code available at https://github.com/Smith42/XDF-GAN and 7.6-billion pixel GDF viewable at http://star.herts.ac.uk/~jgeach/gdf

arXiv:1903.00397 [pdf]

Compression properties of polymeric syntactic foam composites under cyclic loading

Authors: Z. Yousaf, M. J. A. Smith, P. Potluri, W. J. Parnell

Abstract: Syntactic foams are composite materials frequently used in applications requiring the properties of low density and high damage tolerance. In the present work, polymer-based syntactic foams were studied under cyclic compression in order to investigate their compressibility, recoverability, energy dissipation and damage tolerance. These syntactic foams were manufactured by adding hollow polymer microspheres of various sizes and wall thicknesses into a polyurethane matrix. The associated loading and unloading curves during cyclic testing were recorded, revealing the viscoelastic nature of the materials. SEM images of the samples were obtained in order to study potential damage mechanisms during compression. It was observed that these syntactic foams exhibit high elastic recovery and energy dissipation over a wide range of compressional strains and that the addition of polymer microspheres mitigates the damage under compressional loading.

Submitted 1 March, 2019; originally announced March 2019.

Comments: 25 pages, 13 figures

arXiv:1902.05009 [pdf, other]

doi 10.1145/3290605.3300911

ATMSeer: Increasing Transparency and Controllability in Automated Machine Learning

Authors: Qianwen Wang, Yao Ming, Zhihua Jin, Qiaomu Shen, Dongyu Liu, Micah J. Smith, Kalyan Veeramachaneni, Huamin Qu

Abstract: To relieve the pain of manually selecting machine learning algorithms and tuning hyperparameters, automated machine learning (AutoML) methods have been developed to automatically search for good models. Due to the huge model search space, it is impossible to try all models. Users tend to distrust automatic results and increase the search budget as much as they can, thereby undermining the efficiency of AutoML. To address these issues, we design and implement ATMSeer, an interactive visualization tool that supports users in refining the search space of AutoML and analyzing the results. To guide the design of ATMSeer, we derive a workflow of using AutoML based on interviews with machine learning experts. A multi-granularity visualization is proposed to enable users to monitor the AutoML process, analyze the searched models, and refine the search space in real time. We demonstrate the utility and usability of ATMSeer through two case studies, expert interviews, and a user study with 13 end users.

Submitted 13 February, 2019; originally announced February 2019.

Comments: Published in the ACM Conference on Human Factors in Computing Systems (CHI), 2019, Glasgow, Scotland UK

Journal ref: In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems (CHI '19). Association for Computing Machinery, New York, NY, USA, Paper 681, 1-12

arXiv:1811.10219 [pdf, other]

doi 10.1109/JLT.2019.2920844

NumBAT: The integrated, open source Numerical Brillouin Analysis Tool

Authors: Björn C. P. Sturmberg, Kokou B. Dossou, Michael J. A. Smith, Blair Morrison, Christopher G. Poulton, Michael J. Steel

Abstract: We describe NumBAT, an open-source software tool for modelling stimulated Brillouin scattering in waveguides of arbitrary cross-section. It provides rapid calculation of optical and elastic dispersion relations, field profiles and gain with an easy-to-use Python front end. Additionally, we provide an open and extensible set of standard problems and reference materials to facilitate the bench-marking of NumBAT against subsequent tools. Such a resource is needed to help settle discrepancies between existing formulations and implementations, and to facilitate comparison between results in the literature. The resulting standardised testing framework will allow the community to gain confidence in new algorithms and will provide a common tool for the comparison of experimental designs of opto-acoustic waveguides.

Submitted 26 November, 2018; originally announced November 2018.

Comments: 17 pages, 18 figures

Journal ref: Journal of Lightwave Technology, vol. 37, no. 15, pp. 3791-3804, 2019

arXiv:1811.07193 [pdf, other]

Modelling hollow thermoplastic syntactic foams under high-strain compressive loading

Authors: Michael J. A. Smith, Zeshan Yousaf, Prasad Potluri, William J. Parnell

Abstract: The mechanical response of syntactic foams comprising hollow thermoplastic microspheres (HTMs) embedded in a polyurethane matrix was experimentally examined under uniaxial compressive strain. Phenomenological strain energy models were subsequently developed to capture both the axial stress-strain and transverse strain response of the foams. HTM syntactic foams were found to exhibit increased small-strain stiffness with reduced density, revealing a highly tuneable and extremely lightweight syntactic foam blend for applications. The foams were also found to become strongly compressible at large strains and possess a high threshold for plastic deformation, making them a robust alternative to hollow glass microsphere syntactic foams. The non-standard transverse strain relationship exhibited by HTM syntactic foams at high filling fractions was captured by Ogden-type strain energy models. The thermal characteristics of these syntactic foams were also explored with Differential Scanning Calorimetry testing which showed that HTMs have a negligible impact on the thermal characteristics of the matrix.

Submitted 9 March, 2021; v1 submitted 17 November, 2018; originally announced November 2018.

Comments: 31 pages, 14 figures, 3 tables

arXiv:1809.08033 [pdf, other]

doi 10.1364/OL.44.001407

Stimulated Brillouin Scattering in layered media: nanoscale enhancement of silicon

Authors: M. J. A. Smith, C. Wolff, C. G. Poulton, C. M. de Sterke

Abstract: We report a theoretical study of Stimulated Brillouin Scattering (SBS) in general anisotropic media, incorporating the effects of both acoustic strain and local rotation in all calculations. We apply our general theoretical framework to compute the SBS gain for layered media with periodic length scales smaller than all optical and acoustic wavelengths, where such composites behave like homogeneous anisotropic media. We theoretically predict that a layered medium comprising nanometre-thin layers of silicon and As$_2$S$_3$ glass possesses a bulk SBS gain of $1.28 \times 10^{-9} \, \mathrm{W}^{-1} \, \mathrm{m}$. This is more than 500 times larger than the gain coefficient of silicon, and substantially larger than the gain of As$_2$S$_3$. The enhancement is due to a combination of roto-optic, photoelastic, and artificial photoelastic contributions in the composite structure.

Submitted 27 November, 2018; v1 submitted 21 September, 2018; originally announced September 2018.

Comments: 5 pages, 3 figures

arXiv:1712.09112 [pdf, other]

doi 10.1103/PhysRevLett.121.103902

Decoupling the energy and momentum of photons in the quasistatic limit

Authors: M J A Smith, P Y Chen

Abstract: We theoretically show that the frequency and momentum of a photon are not necessarily proportional to one another at low frequencies in photonic crystals comprising materials with positive- and negative-valued material properties. We rigorously determine closed-form conditions for the light cone to emanate from points other than the origin of $k$ space, ultimately decoupling the first band from the origin and demonstrating light propagation at zero energy with nonzero crystal momentum. We also numerically show that first bands can originate from an arbitrary Bloch coordinate as well as from multiple coordinates simultaneously.

Submitted 6 September, 2018; v1 submitted 25 December, 2017; originally announced December 2017.

Comments: 20 pages, 5 figures

Journal ref: Phys. Rev. Lett. 121, 103902 (2018)

arXiv:1712.05427 [pdf, other]

doi 10.1098/rspa.2017.0864

Reflection from a multi-species material and its transmitted effective wavenumber

Authors: Artur L. Gower, Michael J. A. Smith, William J. Parnell, Ian David Abrahams

Abstract: We formally deduce closed-form expressions for the transmitted effective wavenumber of a material comprising multiple types of inclusions or particles (multi-species), dispersed in a uniform background medium. The expressions, derived here for the first time, are valid for moderate volume fractions and without restriction on the frequency. We show that the multi-species effective wavenumber is not a straightforward extension of expressions for a single species. Comparisons are drawn with state-of-the-art models in acoustics by presenting numerical results for a concrete and a water-oil emulsion in two dimensions. The limit of when one species is much smaller than the other is also discussed and we determine the background medium felt by the larger species in this limit. Surprisingly, we show that the answer is not the intuitive result predicted by self-consistent multiple scattering theories. The derivation presented here applies to the scalar wave equation with cylindrical or spherical inclusions, with any distribution of sizes, densities, and wave speeds. The reflection coefficient associated with a half-space of multi-species cylindrical inclusions is also formally derived.

Submitted 6 March, 2018; v1 submitted 14 December, 2017; originally announced December 2017.

Comments: The supplementary material has simpler self-contained formulas, it is an ancillary file and can be downloaded from a link on the right. The code to reproduce all graphs is provided in https://github.com/arturgower/EffectiveWaves.jl

MSC Class: 78-02; 82D02

arXiv:1702.01581 [pdf, other]

doi 10.1103/PhysRevB.96.064114

Enhanced acousto-optic properties in layered media

Authors: M. J. A. Smith, C. Wolff, M. Lapine, C. G. Poulton, C. Martijn de Sterke

Abstract: We present a rigorous procedure for evaluating the photoelastic coefficients of a layered medium where the periodicity is smaller than the wavelengths of all optical and acoustic fields. Analytical expressions are given for the coefficients of a composite material comprising thin layers of optically isotropic materials. These coefficients include artificial contributions that are unique to structured media and arise from the optical and mechanical contrast between the constituents. Using numerical examples, we demonstrate that the acousto-optic properties of layered structures can be enhanced beyond those of the constituent materials. Furthermore, we show that the acousto-optic response can be tuned as desired.

Submitted 28 June, 2017; v1 submitted 6 February, 2017; originally announced February 2017.

Comments: 9 pages, 4 figures

Journal ref: Phys. Rev. B 96, 064114 (2017)

arXiv:1610.08171 [pdf, other]

doi 10.4204/EPTCS.227.6

MELA: Modelling in Ecology with Location Attributes

Authors: Ludovica Luisa Vissat, Jane Hillston, Glenn Marion, Matthew J. Smith

Abstract: Ecology studies the interactions between individuals, species and the environment. The ability to predict the dynamics of ecological systems would support the design and monitoring of control strategies and would help to address pressing global environmental issues. It is also important to plan for efficient use of natural resources and maintenance of critical ecosystem services. The mathematical modelling of ecological systems often includes nontrivial specifications of processes that influence the birth, death, development and movement of individuals in the environment, that take into account both biotic and abiotic interactions. To assist in the specification of such models, we introduce MELA, a process algebra for Modelling in Ecology with Location Attributes. Process algebras allow the modeller to describe concurrent systems in a high-level language. A key feature of concurrent systems is that they are composed of agents that can progress simultaneously but also interact - a good match to ecological systems. MELA aims to provide ecologists with a straightforward yet flexible tool for modelling ecological systems, with particular emphasis on the description of space and the environment. Here we present four example MELA models, illustrating the different spatial arrangements which can be accommodated and demonstrating the use of MELA in epidemiological and predator-prey scenarios.

Submitted 26 October, 2016; originally announced October 2016.

Comments: In Proceedings QAPL'16, arXiv:1610.07696

Journal ref: EPTCS 227, 2016, pp. 82-97

arXiv:1608.04962 [pdf, other]

doi 10.1364/OE.24.025148

Stimulated Brillouin scattering enhancement in silicon inverse opal waveguides

Authors: M. J. A. Smith, C. Wolff, C. Martijn de Sterke, M. Lapine, B. T. Kuhlmey, C. G. Poulton

Abstract: Silicon is an ideal material for on-chip applications; however, its poor acoustic properties limit its performance for important optoacoustic applications, particularly for Stimulated Brillouin Scattering (SBS). We theoretically show that silicon inverse opals exhibit a strongly improved acoustic performance that enhances the bulk SBS gain coefficient by more than two orders of magnitude. We also design a waveguide that incorporates silicon inverse opals and which has SBS gain values that are comparable with chalcogenide glass waveguides. This research opens new directions for opto-acoustic applications in on-chip material systems.

Submitted 17 August, 2016; originally announced August 2016.

Comments: 4 pages, 6 figures

arXiv:1608.02827 [pdf, other]

Evaluation and regularization of generalized Eisenstein series and application to 2D cylindrical harmonic sums

Authors: Parry Y. Chen, Michael J. A. Smith, Ross C. McPhedran

Abstract: In the study of periodic media, conditionally convergent series are frequently encountered and their regularization is crucial for applications. We derive an identity that regularizes two-dimensional generalized Eisenstein series for all Bravais lattices, yielding physically meaningful values. We also obtain explicit forms for the generalized series in terms of conventional Eisenstein series, enabling their closed-form evaluation for important high symmetry lattices. Results are then used to obtain representations for the related cylindrical harmonic sums, which are also given for all Bravais lattices. Finally, we treat displaced lattices of high symmetry, expressing them in terms of origin-centered lattices via geometric multi-set identities. These identities apply to all classes of two-dimensional sums, allowing sums to be evaluated over each constituent of a unit cell that possesses multiple inclusions.

Submitted 9 August, 2016; originally announced August 2016.

Comments: 25 pages, 3 figures

arXiv:1606.03193 [pdf, other]

doi 10.1364/JOSAB.33.002162

Stimulated Brillouin scattering in metamaterials

Authors: M. J. A. Smith, B. T. Kuhlmey, C. Martijn de Sterke, C. Wolff, M. Lapine, C. G. Poulton

Abstract: We compute the SBS gain for a metamaterial comprising a cubic lattice of dielectric spheres suspended in a background dielectric material. Theoretical methods are presented to calculate the optical, acoustic, and opto-acoustic parameters that describe the SBS properties of the material at long wavelengths. Using the electromagnetic and strain energy densities we accurately characterise the optical and acoustic properties of the metamaterial. From a combination of energy density methods and perturbation theory, we recover the appropriate terms of the photoelastic tensor for the metamaterial. We demonstrate that electrostriction is not necessarily the dominant mechanism in the enhancement and suppression of the SBS gain coefficient in a metamaterial, and that other parameters, such as the Brillouin linewidth, can dominate instead. Examples are presented that exhibit an order of magnitude enhancement in the SBS gain as well as perfect suppression.

Submitted 10 June, 2016; originally announced June 2016.

Comments: 11 pages, 14 figures

arXiv:1603.08506 [pdf, ps, other]

doi 10.1051/0004-6361/201528012

Feature-tailored spectroscopic analysis of the SNR Puppis A in X-rays

Authors: G. J. M. Luna, M. J. S. Smith, G. Dubner, E. Giacani, G. Castelletti

Abstract: We introduce a distinct method to perform spatially-resolved spectral analysis of astronomical sources with highly structured X-ray emission. The method measures the surface brightness of neighbouring pixels to adaptively size and shape each region, so that the spectra from the bright and faint filamentary structures evident in the broadband images can be extracted. As a test case, we present the spectral analysis of the complete X-ray emitting plasma in the supernova remnant Puppis A observed with XMM-Newton and Chandra. Given the angular size of Puppis A, many pointings with different observational configurations have to be combined, presenting a challenge to any method of spatially-resolved spectroscopy. From the fit of a plane-parallel shocked plasma model we find that temperature, absorption column, ionization time scale, emission measure and elemental abundances of O, Ne, Mg, Si, S and Fe are smoothly distributed in the remnant. Some regions with overabundances of O-Ne-Mg, previously characterized as ejecta material, were automatically selected by our method, proving the excellent response of the technique. This method is an advantageous tool for the exploitation of archival X-ray data.

Submitted 28 March, 2016; originally announced March 2016.

Comments: Accepted in Astronomy & Astrophysics

Journal ref: A&A 590, A70 (2016)

arXiv:1603.02497 [pdf, ps, other]

Transit times and mean ages for nonautonomous and autonomous compartmental systems

Authors: Martin Rasmussen, Alan Hastings, Matthew J. Smith, Folashade B. Agusto, Benito M. Chen-Charpentier, Forrest M. Hoffman, Jiang Jiang, Katherine E. O. Todd-Brown, Ying Wang, Ying-Ping Wang, Yiqi Luo

Abstract: We develop a theory for transit times and mean ages for nonautonomous compartmental systems. Using the McKendrick-von Förster equation, we show that the mean ages of mass in a compartmental system satisfy a linear nonautonomous ordinary differential equation that is exponentially stable. We then define a nonautonomous version of transit time as the mean age of mass leaving the compartmental system at a particular time and show that our nonautonomous theory generalises the autonomous case. We apply these results to study a nine-dimensional nonautonomous compartmental system modeling the terrestrial carbon cycle, which is a modification of the Carnegie-Ames-Stanford approach (CASA) model, and we demonstrate that the nonautonomous versions of transit time and mean age differ significantly from the autonomous quantities when calculated for that model.

Submitted 8 March, 2016; originally announced March 2016.

MSC Class: 34A30; 34D05

arXiv:1602.08132 [pdf, ps, other]

doi 10.1109/CISP.2011.6100685

Adaptive Frequency Cepstral Coefficients for Word Mispronunciation Detection

Authors: Zhenhao Ge, Sudhendu R. Sharma, Mark J. T. Smith

Abstract: Systems based on automatic speech recognition (ASR) technology can provide important functionality in computer assisted language learning applications. This is a young but growing area of research motivated by the large number of students studying foreign languages. Here we propose a Hidden Markov Model (HMM)-based method to detect mispronunciations. Exploiting the specific dialog scripting employed in language learning software, HMMs are trained for different pronunciations. New adaptive features have been developed and obtained through an adaptive warping of the frequency scale prior to computing the cepstral coefficients. The optimization criterion used for the warping function is to maximize separation of two major groups of pronunciations (native and non-native) in terms of classification rate. Experimental results show that the adaptive frequency scale yields a better coefficient representation leading to higher classification rates in comparison with conventional HMMs using Mel-frequency cepstral coefficients.

Submitted 25 February, 2016; originally announced February 2016.

Comments: 4th International Congress on Image and Signal Processing (CISP) 2011

arXiv:1602.08128 [pdf, ps, other]

doi 10.1117/12.884155

PCA Method for Automated Detection of Mispronounced Words

Authors: Zhenhao Ge, Sudhendu R. Sharma, Mark J. T. Smith

Abstract: This paper presents a method for detecting mispronunciations with the aim of improving Computer Assisted Language Learning (CALL) tools used by foreign language learners. The algorithm is based on Principal Component Analysis (PCA). It is hierarchical, with each successive step refining the estimate to classify the test word as being either mispronounced or correct. Preprocessing before detection, like normalization and time-scale modification, is implemented to guarantee uniformity of the feature vectors input to the detection system. The performance using various features, including spectrograms and Mel-Frequency Cepstral Coefficients (MFCCs), is compared and evaluated. Best results were obtained using MFCCs, achieving up to 99% accuracy in word verification and 93% in native/non-native classification. Compared with Hidden Markov Models (HMMs), which are used pervasively in recognition applications, this particular approach is computationally efficient and effective when training data is limited.

Submitted 25 February, 2016; originally announced February 2016.

Comments: SPIE Defense, Security, and Sensing

arXiv:1602.08045 [pdf, other]

doi 10.1117/12.919235

PCA/LDA Approach for Text-Independent Speaker Recognition

Authors: Zhenhao Ge, Sudhendu R. Sharma, Mark J. T. Smith

Abstract: Various algorithms for text-independent speaker recognition have been developed through the decades, aiming to improve both accuracy and efficiency. This paper presents a novel PCA/LDA-based approach that is faster than traditional statistical model-based methods and achieves competitive results. First, the performance based on only PCA and only LDA is measured; then a mixed model, taking advantage of both methods, is introduced. A subset of the TIMIT corpus, composed of 200 male speakers, is used for enrollment, validation and testing. The best results achieve 100%, 96% and 95% classification rates at population levels of 50, 100 and 200, using 39-dimensional MFCC features with delta and double delta. These results are based on 12-second text-independent speech for training and 4-second data for test. These are comparable to the conventional MFCC-GMM methods, but require significantly less time to train and operate.

Submitted 25 February, 2016; originally announced February 2016.

Comments: Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series

arXiv:1602.05292 [pdf, other]

Authorship Attribution Using a Neural Network Language Model

Authors: Zhenhao Ge, Yufang Sun, Mark J. T. Smith

Abstract: In practice, training language models for individual authors is often expensive because of limited data resources. In such cases, Neural Network Language Models (NNLMs) generally outperform the traditional non-parametric N-gram models. Here we investigate the performance of a feed-forward NNLM on an authorship attribution problem, with moderate author set size and relatively limited data. We also consider how the text topics impact performance. Compared with a well-constructed N-gram baseline method with Kneser-Ney smoothing, the proposed method achieves nearly 2.5% reduction in perplexity and increases author classification accuracy by 3.43% on average, given as few as 5 test sentences. The performance is very competitive with the state of the art in terms of accuracy and demand on test data. The source code, preprocessed datasets, a detailed description of the methodology and results are available at https://github.com/zge/authorship-attribution.

Submitted 16 February, 2016; originally announced February 2016.

Comments: Proceedings of the 30th AAAI Conference on Artificial Intelligence (AAAI'16)

arXiv:1602.03222 [pdf, other]

doi 10.1364/OL.41.002338

Metamaterial control of stimulated Brillouin scattering

Authors: M. J. A. Smith, B. T. Kuhlmey, C. Martijn de Sterke, C. Wolff, M. Lapine, C. G. Poulton

Abstract: Using full opto-acoustic numerical simulations, we demonstrate enhancement and suppression of the SBS gain in a metamaterial comprising a subwavelength cubic array of dielectric spheres suspended in a dielectric background material. We develop a general theoretical framework and present several numerical examples using technologically important materials. For As$_2$S$_3$ spheres in silicon, we achieve a gain enhancement of more than an order of magnitude compared to pure silicon, and for GaAs spheres in silicon, full suppression is obtained. The gain for As$_2$S$_3$ glass can also be strongly suppressed by embedding silica spheres. The constituent terms of the gain coefficient are shown to depend in a complex way on the filling fraction. We find that electrostriction is the dominant effect behind the control of SBS in bulk media.

Submitted 9 May, 2016; v1 submitted 9 February, 2016; originally announced February 2016.

Comments: 5 pages, 5 figures

Journal ref: Opt. Lett. 41(10), 2338-2341 (2016)

arXiv:1504.04932 [pdf, ps, other]

Electrostriction enhancement in metamaterials

Authors: M. J. A. Smith, B. T. Kuhlmey, C. Martijn de Sterke, C. Wolff, M. Lapine, C. G. Poulton

Abstract: We demonstrate a controllable enhancement in the electrostrictive properties of a medium using dilute composite artificial materials. Analytical expressions for the composite electrostriction are derived and used to show that enhancement, tunability and suppression can be achieved through a careful choice of constituent materials. Numerical examples with Ag, As$_2$S$_3$, Si and SiO$_2$ demonstrate that even in a non-resonant regime, artificial materials can bring more than a threefold enhancement in the electrostriction.

Submitted 29 May, 2015; v1 submitted 20 April, 2015; originally announced April 2015.

Comments: 8 pages, 5 figures, to appear in Physical Review B

arXiv:1410.0393 [pdf, other]

doi 10.1098/rspa.2014.0746

Trapped Modes and Steered Dirac Cones in Platonic Crystals

Authors: R. C. McPhedran, A. B. Movchan, N. V. Movchan, M. Brun, M. J. A. Smith

Abstract: This paper discusses the properties of flexural waves obeying the biharmonic equation, propagating in a thin plate pinned at doubly-periodic sets of points. The emphases are on the properties of dispersion surfaces having the Dirac cone topology, and on the related topic of trapped modes in plates with a finite set (cluster) of pinned points. The Dirac cone topologies we exhibit have at least two cones touching at a point in the reciprocal lattice, augmented by another band passing through the point. We show that the Dirac cones can be steered along symmetry lines in the Brillouin zone by varying the aspect ratio of rectangular lattices of pins, and that, as the cones are moved, the involved band surfaces tilt. We link Dirac points with a parabolic profile in their neighbourhood, and the characteristic of this parabolic profile decides the direction of propagation of the trapped mode in finite clusters.

Submitted 1 October, 2014; originally announced October 2014.

Comments: 21 pages, 12 figures

arXiv:1401.6601 [pdf, ps, other]

doi 10.1063/1.4871694

Model reduction for slow-fast stochastic systems with metastable behaviour

Authors: Maria Bruna, S. Jonathan Chapman, Matthew J. Smith

Abstract: The quasi-steady-state approximation (or stochastic averaging principle) is a useful tool in the study of multiscale stochastic systems, giving a practical method by which to reduce the number of degrees of freedom in a model. The method is extended here to slow-fast systems in which the fast variables exhibit metastable behaviour. The key parameter that determines the form of the reduced model is the ratio of the timescale for the switching of the fast variables between metastable states to the timescale for the evolution of the slow variables. The method is illustrated with two examples: one from biochemistry (a fast-species-mediated chemical switch coupled to a slower-varying species), and one from ecology (a predator-prey system). Numerical simulations of each model reduction are compared with those of the full system.

Submitted 22 April, 2014; v1 submitted 25 January, 2014; originally announced January 2014.

arXiv:1312.6775 [pdf, other]

doi 10.1103/PhysRevD.89.072002

Updated measurements of absolute $D^+$ and $D^0$ hadronic branching fractions and $σ(e^+e^-\to D\overline{D})$ at $E_\mathrm{cm} = 3774$ MeV

Authors: CLEO Collaboration, G. Bonvicini, D. Cinabro, M. J. Smith, P. Zhou, P. Naik, J. Rademacker, K. W. Edwards, R. A. Briere, H. Vogel, J. L. Rosner, J. P. Alexander, D. G. Cassel, R. Ehrlich, L. Gibbons, S. W. Gray, D. L. Hartill, B. K. Heltsley, D. L. Kreinick, V. E. Kuznetsov, J. R. Patterson, D. Peterson, D. Riley, A. Ryd, A. J. Sadoff, X. Shi, et al. (44 additional authors not shown)

Abstract: Utilizing the full CLEO-c data sample of 818 pb$^{-1}$ of $e^+e^-$ data taken at the $ψ(3770)$ resonance, we update our measurements of absolute hadronic branching fractions of charged and neutral $D$ mesons. We previously reported results from subsets of these data. Using a double tag technique we obtain branching fractions for three $D^0$ and six $D^+$ modes, including the reference branching fractions $\mathcal{B} (D^0\to K^-π^+)=(3.934 \pm 0.021 \pm 0.061)\%$ and $\mathcal{B} (D^+ \to K^- π^+π^+)=(9.224 \pm 0.059 \pm 0.157)\%$. The uncertainties are statistical and systematic, respectively. In these measurements we include the effects of final-state radiation by allowing for additional unobserved photons in the final state, and the systematic errors include our estimates of the uncertainties of these effects. Furthermore, using an independent measurement of the luminosity, we obtain the cross sections $σ(e^+e^-\to D^0\overline{D}{}^0)=(3.607\pm 0.017 \pm 0.056) \ \mathrm{nb}$ and $σ(e^+e^-\to D^+D^-)=(2.882\pm 0.018 \pm 0.042) \ \mathrm{nb}$ at a center of mass energy, $E_\mathrm{cm} = 3774 \pm 1$ MeV.

Submitted 20 August, 2014; v1 submitted 24 December, 2013; originally announced December 2013.

Comments: 12 pages, 2 figures, and 6 tables. The Fig. 2 in this version corrects errors in plotting the fits in the original Fig. 2. Correcting these errors does not affect any result or text. We thank Dr. Maurice Garcia-Sciveres for pointing out this mistake

Report number: CLNS 13/2087, CLEO 13-02

Journal ref: Phys. Rev. D 89, 072002 (2014)
