-
A framework for developing a knowledge management platform
Authors:
Marie Lisandra Zepeda Mendoza,
Sonali Agarwal,
James A. Blackshaw,
Vanesa Bol,
Audrey Fazzi,
Filippo Fiorini,
Amy Louise Foreman,
Nancy George,
Brett R. Johnson,
Brian Martin,
Dave McComb,
Euphemia Mutasa-Gottgens,
Helen Parkinson,
Martin Romacker,
Rolf Russell,
Valérien Ségard,
Shawn Zheng Kai Tan,
Wei Kheng Teh,
F. P. Winstanley,
Benedict Wong,
Adrian M. Smith
Abstract:
Knowledge management (KM) involves collecting, organizing, storing, and disseminating information to improve decision-making, innovation, and performance. Implementing KM at scale has become essential for organizations to effectively leverage vast accessible data. This paper is a compilation of concepts that emerged from KM workshops hosted by EMBL-EBI, attended by SMEs and industry. We provide gu…
▽ More
Knowledge management (KM) involves collecting, organizing, storing, and disseminating information to improve decision-making, innovation, and performance. Implementing KM at scale has become essential for organizations to effectively leverage vast accessible data. This paper is a compilation of concepts that emerged from KM workshops hosted by EMBL-EBI, attended by SMEs and industry. We provide guidance on envisioning, executing, evaluating, and evolving knowledge management platforms. We emphasize essential considerations such as setting knowledge domain boundaries and measuring success, as well as the importance of making knowledge accessible for downstream applications and non-computational users and highlights necessary personal and organizational skills for success. We stress the importance of collaboration and the need for convergence on shared principles and commitment to provide or seek resources to advance KM. The community is invited to join the journey of KM and contribute to the advancement of the field by applying and improving on the guidelines described.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
Random Subspace Local Projections
Authors:
Viet Hoang Dinh,
Didier Nibbering,
Benjamin Wong
Abstract:
We show how random subspace methods can be adapted to estimating local projections with many controls. Random subspace methods have their roots in the machine learning literature and are implemented by averaging over regressions estimated over different combinations of subsets of these controls. We document three key results: (i) Our approach can successfully recover the impulse response functions…
▽ More
We show how random subspace methods can be adapted to estimating local projections with many controls. Random subspace methods have their roots in the machine learning literature and are implemented by averaging over regressions estimated over different combinations of subsets of these controls. We document three key results: (i) Our approach can successfully recover the impulse response functions across Monte Carlo experiments representative of different macroeconomic settings and identification schemes. (ii) Our results suggest that random subspace methods are more accurate than other dimension reduction methods if the underlying large dataset has a factor structure similar to typical macroeconomic datasets such as FRED-MD. (iii) Our approach leads to differences in the estimated impulse response functions relative to benchmark methods when applied to two widely studied empirical applications.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
Distributional Refinement Network: Distributional Forecasting via Deep Learning
Authors:
Benjamin Avanzi,
Eric Dong,
Patrick J. Laub,
Bernard Wong
Abstract:
A key task in actuarial modelling involves modelling the distributional properties of losses. Classic (distributional) regression approaches like Generalized Linear Models (GLMs; Nelder and Wedderburn, 1972) are commonly used, but challenges remain in developing models that can (i) allow covariates to flexibly impact different aspects of the conditional distribution, (ii) integrate developments in…
▽ More
A key task in actuarial modelling involves modelling the distributional properties of losses. Classic (distributional) regression approaches like Generalized Linear Models (GLMs; Nelder and Wedderburn, 1972) are commonly used, but challenges remain in developing models that can (i) allow covariates to flexibly impact different aspects of the conditional distribution, (ii) integrate developments in machine learning and AI to maximise the predictive power while considering (i), and, (iii) maintain a level of interpretability in the model to enhance trust in the model and its outputs, which is often compromised in efforts pursuing (i) and (ii). We tackle this problem by proposing a Distributional Refinement Network (DRN), which combines an inherently interpretable baseline model (such as GLMs) with a flexible neural network-a modified Deep Distribution Regression (DDR; Li et al., 2019) method. Inspired by the Combined Actuarial Neural Network (CANN; Schelldorfer and W{\''u}thrich, 2019), our approach flexibly refines the entire baseline distribution. As a result, the DRN captures varying effects of features across all quantiles, improving predictive performance while maintaining adequate interpretability. Using both synthetic and real-world data, we demonstrate the DRN's superior distributional forecasting capacity. The DRN has the potential to be a powerful distributional regression model in actuarial science and beyond.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
Challenges for Responsible AI Design and Workflow Integration in Healthcare: A Case Study of Automatic Feeding Tube Qualification in Radiology
Authors:
Anja Thieme,
Abhijith Rajamohan,
Benjamin Cooper,
Heather Groombridge,
Robert Simister,
Barney Wong,
Nicholas Woznitza,
Mark Ames Pinnock,
Maria Teodora Wetscherek,
Cecily Morrison,
Hannah Richardson,
Fernando Pérez-García,
Stephanie L. Hyland,
Shruthi Bannur,
Daniel C. Castro,
Kenza Bouzid,
Anton Schwaighofer,
Mercy Ranjit,
Harshita Sharma,
Matthew P. Lungren,
Ozan Oktay,
Javier Alvarez-Valle,
Aditya Nori,
Stephen Harris,
Joseph Jacob
Abstract:
Nasogastric tubes (NGTs) are feeding tubes that are inserted through the nose into the stomach to deliver nutrition or medication. If not placed correctly, they can cause serious harm, even death to patients. Recent AI developments demonstrate the feasibility of robustly detecting NGT placement from Chest X-ray images to reduce risks of sub-optimally or critically placed NGTs being missed or delay…
▽ More
Nasogastric tubes (NGTs) are feeding tubes that are inserted through the nose into the stomach to deliver nutrition or medication. If not placed correctly, they can cause serious harm, even death to patients. Recent AI developments demonstrate the feasibility of robustly detecting NGT placement from Chest X-ray images to reduce risks of sub-optimally or critically placed NGTs being missed or delayed in their detection, but gaps remain in clinical practice integration. In this study, we present a human-centered approach to the problem and describe insights derived following contextual inquiry and in-depth interviews with 15 clinical stakeholders. The interviews helped understand challenges in existing workflows, and how best to align technical capabilities with user needs and expectations. We discovered the trade-offs and complexities that need consideration when choosing suitable workflow stages, target users, and design configurations for different AI proposals. We explored how to balance AI benefits and risks for healthcare staff and patients within broader organizational and medical-legal constraints. We also identified data issues related to edge cases and data biases that affect model training and evaluation; how data documentation practices influence data preparation and labelling; and how to measure relevant AI outcomes reliably in future evaluations. We discuss how our work informs design and development of AI applications that are clinically useful, ethical, and acceptable in real-world healthcare services.
△ Less
Submitted 8 May, 2024;
originally announced May 2024.
-
Cost-Effective Methodology for Complex Tuning Searches in HPC: Navigating Interdependencies and Dimensionality
Authors:
Adrian Perez Dieguez,
Min Choi,
Mahmut Okyay,
Mauro Del Ben,
Bryan M. Wong,
Khaled Z. Ibrahim
Abstract:
Tuning searches are pivotal in High-Performance Computing (HPC), addressing complex optimization challenges in computational applications. The complexity arises not only from finely tuning parameters within routines but also potential interdependencies among them, rendering traditional optimization methods inefficient. Instead of scrutinizing interdependencies among parameters and routines, practi…
▽ More
Tuning searches are pivotal in High-Performance Computing (HPC), addressing complex optimization challenges in computational applications. The complexity arises not only from finely tuning parameters within routines but also potential interdependencies among them, rendering traditional optimization methods inefficient. Instead of scrutinizing interdependencies among parameters and routines, practitioners often face the dilemma of conducting independent tuning searches for each routine, thereby overlooking interdependence, or pursuing a more resource-intensive joint search for all routines. This decision is driven by the consideration that some interdependence analysis and high-dimensional decomposition techniques in literature may be prohibitively expensive in HPC tuning searches. Our methodology adapts and refines these methods to ensure computational feasibility while maximizing performance gains in real-world scenarios. Our methodology leverages a cost-effective interdependence analysis to decide whether to merge several tuning searches into a joint search or conduct orthogonal searches. Tested on synthetic functions with varying levels of parameter interdependence, our methodology efficiently explores the search space. In comparison to Bayesian-optimization-based full independent or fully joint searches, our methodology suggested an optimized breakdown of independent and merged searches that led to final configurations up to 8% more accurate, reducing the search time by up to 95%. When applied to GPU-offloaded Real-Time Time-Dependent Density Functional Theory (RT-TDDFT), an application in computational materials science that challenges modern HPC autotuners, our methodology achieved an effective tuning search. Its adaptability and efficiency extend beyond RT-TDDFT, making it valuable for related applications in HPC.
△ Less
Submitted 12 March, 2024;
originally announced March 2024.
-
The Ramsey numbers for trees of order $n$ with maximum degree at least $n-5$ versus the wheel graph of order nine
Authors:
Zhi Yee Chng,
Thomas Britz,
Ta Sheng Tan,
Kok Bin Wong
Abstract:
The Ramsey numbers $R(T_n,W_8)$ are determined for each tree graph $T_n$ of order $n\geq 7$ and maximum degree $Δ(T_n)$ equal to either $n-4$ or $n-5$. These numbers indicate strong support for the conjecture, due to Chen, Zhang and Zhang and to Hafidh and Baskoro, that $R(T_n,W_m) = 2n-1$ for each tree graph $T_n$ of order $n\geq m-1$ with $Δ(T_n)\leq n-m+2$ when $m\geq 4$ is even.
The Ramsey numbers $R(T_n,W_8)$ are determined for each tree graph $T_n$ of order $n\geq 7$ and maximum degree $Δ(T_n)$ equal to either $n-4$ or $n-5$. These numbers indicate strong support for the conjecture, due to Chen, Zhang and Zhang and to Hafidh and Baskoro, that $R(T_n,W_m) = 2n-1$ for each tree graph $T_n$ of order $n\geq m-1$ with $Δ(T_n)\leq n-m+2$ when $m\geq 4$ is even.
△ Less
Submitted 4 March, 2024;
originally announced March 2024.
-
On the evolution of data breach reporting patterns and frequency in the United States: a cross-state analysis
Authors:
Benjamin Avanzi,
Xingyun Tan,
Greg Taylor,
Bernard Wong
Abstract:
Understanding the emergence of data breaches is crucial for cyber insurance. However, analyses of data breach frequency trends in the current literature lead to contradictory conclusions. We put forward that those discrepancies may be (at least partially) due to inconsistent data collection standards, as well as reporting patterns, over time and space. We set out to carefully control both. In this…
▽ More
Understanding the emergence of data breaches is crucial for cyber insurance. However, analyses of data breach frequency trends in the current literature lead to contradictory conclusions. We put forward that those discrepancies may be (at least partially) due to inconsistent data collection standards, as well as reporting patterns, over time and space. We set out to carefully control both. In this paper, we conduct a joint analysis of state Attorneys General's publications on data breaches across eight states (namely, California, Delaware, Indiana, Maine, Montana, North Dakota, Oregon, and Washington), all of which are subject to established data collection standards-namely, state data breach (mandatory) notification laws. Thanks to our explicit recognition of these notification laws, we are capable of modelling frequency of breaches in a consistent and comparable way over time. Hence, we are able to isolate and capture the complexities of reporting patterns, adequately estimate IBNRs, and yield a highly reliable assessment of historical frequency trends in data breaches. Our analysis also provides a comprehensive comparison of data breach frequency across the eight U.S. states, extending knowledge on state-specific differences in cyber risk, which has not been extensively discussed in the current literature. Furthermore, we uncover novel features not previously discussed in the literature, such as differences in cyber risk frequency trends between large and small data breaches. Overall, we find that the reporting delays are lengthening. We also elicit commonalities and heterogeneities in reporting patterns across states, severity levels, and time periods. After adequately estimating IBNRs, we find that frequency is relatively stable before 2020 and increasing after 2020. This is consistent across states. Implications of our findings for cyber insurance are discussed.
△ Less
Submitted 30 June, 2024; v1 submitted 7 October, 2023;
originally announced October 2023.
-
Active Anomaly Detection in Confined Spaces Using Ergodic Traversal of Directed Region Graphs
Authors:
Benjamin Wong,
Tyler M. Paine,
Santosh Devasia,
Ashis G. Banerjee
Abstract:
We provide the first step toward developing a hierarchical control-estimation framework to actively plan robot trajectories for anomaly detection in confined spaces. The space is represented globally using a directed region graph, where a region is a landmark that needs to be visited (inspected). We devise a fast mixing Markov chain to find an ergodic route that traverses this graph so that the re…
▽ More
We provide the first step toward developing a hierarchical control-estimation framework to actively plan robot trajectories for anomaly detection in confined spaces. The space is represented globally using a directed region graph, where a region is a landmark that needs to be visited (inspected). We devise a fast mixing Markov chain to find an ergodic route that traverses this graph so that the region visitation frequency is proportional to its anomaly detection uncertainty, while satisfying the edge directionality (region transition) constraint(s). Preliminary simulation results show fast convergence to the ergodic solution and confident estimation of the presence of anomalies in the inspected regions.
△ Less
Submitted 1 October, 2023;
originally announced October 2023.
-
Accelerating Quantum Optimal Control of Multi-Qubit Systems with Symmetry-Based Hamiltonian Transformations
Authors:
Xian Wang,
Mahmut Sait Okyay,
Anshuman Kumar,
Bryan M. Wong
Abstract:
We present a novel, computationally efficient approach to accelerate quantum optimal control calculations of large multi-qubit systems used in a variety of quantum computing applications. By leveraging the intrinsic symmetry of finite groups, the Hilbert space can be decomposed and the Hamiltonians block-diagonalized to enable extremely fast quantum optimal control calculations. Our approach reduc…
▽ More
We present a novel, computationally efficient approach to accelerate quantum optimal control calculations of large multi-qubit systems used in a variety of quantum computing applications. By leveraging the intrinsic symmetry of finite groups, the Hilbert space can be decomposed and the Hamiltonians block-diagonalized to enable extremely fast quantum optimal control calculations. Our approach reduces the Hamiltonian size of an $n$-qubit system from 2^n by 2^n to O(n by n) or O((2^n / n) by (2^n / n)) under Sn or Dn symmetry, respectively. Most importantly, this approach reduces the computational runtime of qubit optimal control calculations by orders of magnitude while maintaining the same accuracy as the conventional method. As prospective applications, we show that (1) symmetry-protected subspaces can be potential platforms for quantum error suppression and simulation of other quantum Hamiltonians, and (2) Lie-Trotter-Suzuki decomposition approaches can generalize our method to a general variety of multi-qubit systems.
△ Less
Submitted 3 October, 2023; v1 submitted 11 September, 2023;
originally announced September 2023.
-
Empirical Study of Straggler Problem in Parameter Server on Iterative Convergent Distributed Machine Learning
Authors:
Benjamin Wong
Abstract:
The purpose of this study is to test the effectiveness of current straggler mitigation techniques over different important iterative convergent machine learning(ML) algorithm including Matrix Factorization (MF), Multinomial Logistic Regression (MLR), and Latent Dirichlet Allocation (LDA) . The experiment was conducted to implemented using the FlexPS system, which is the latest system implementatio…
▽ More
The purpose of this study is to test the effectiveness of current straggler mitigation techniques over different important iterative convergent machine learning(ML) algorithm including Matrix Factorization (MF), Multinomial Logistic Regression (MLR), and Latent Dirichlet Allocation (LDA) . The experiment was conducted to implemented using the FlexPS system, which is the latest system implementation that employ parameter server architecture. The experiment employed the Bulk Synchronous Parallel (BSP) computational model to examine the straggler problem in Parameter Server on Iterative Convergent Distributed Machine Learning. Moreover, the current research analyzes the experimental arrangement of the parameter server strategy concerning the parallel learning problems by injecting universal straggler patterns and executing latest mitigation techniques. The findings of the study are significant in that as they will provide the necessary platform for conducting further research into the problem and allow the researcher to compare different methods for various applications. The outcome is therefore expected to facilitate the development of new techniques coupled with new perspectives in addressing this problem.
△ Less
Submitted 28 July, 2023;
originally announced August 2023.
-
On the Particle and Field nature of $γ^μ$ matrices in the Dirac Equation and the Nature's intrinsic fifth force
Authors:
B. T. T. Wong
Abstract:
The Dirac equation is a cornerstone of modern particle physics, which integrates special relativity and quantum mechanics into a consistent framework, yielding the prediction of electron and its antiparticle counterpart, positron. The Dirac equation also lays the foundation of quantum electrodynamics, such that QED phenomenon is supported by fundamental Dirac Algebras calculation. In this article,…
▽ More
The Dirac equation is a cornerstone of modern particle physics, which integrates special relativity and quantum mechanics into a consistent framework, yielding the prediction of electron and its antiparticle counterpart, positron. The Dirac equation also lays the foundation of quantum electrodynamics, such that QED phenomenon is supported by fundamental Dirac Algebras calculation. In this article, we will introduce new perspectives of the $γ^μ$ matrix in the Dirac Algebra, by realizing the $γ^μ$ matrices are actual formal quantum fields, the excitation of $γ^μ$ fields correspond to a new particle with both boson and fermion nature. Thus, we show that $γ^μ$ is a particle in nature, and can be referred as the nature's intrinsic fifth force. The $γ^μ$ field also serves as the boson-fermion connector in QED interaction.
△ Less
Submitted 7 September, 2023; v1 submitted 31 July, 2023;
originally announced August 2023.
-
Velocity-gauge real-time time-dependent density functional tight-binding for large-scale condensed matter systems
Authors:
Qiang Xu,
Mauro Del Ben,
Mahmut Sait Okyay,
Min Choi,
Khaled Z. Ibrahim,
Bryan M. Wong
Abstract:
We present a new velocity-gauge real-time, time-dependent density functional tight-binding (VG-rtTDDFTB) implementation in the open-source DFTB+ software package (https://dftbplus.org) for probing electronic excitations in large, condensed matter systems. Our VG-rtTDDFTB approach enables real-time electron dynamics simulations of large, periodic, condensed matter systems containing thousands of at…
▽ More
We present a new velocity-gauge real-time, time-dependent density functional tight-binding (VG-rtTDDFTB) implementation in the open-source DFTB+ software package (https://dftbplus.org) for probing electronic excitations in large, condensed matter systems. Our VG-rtTDDFTB approach enables real-time electron dynamics simulations of large, periodic, condensed matter systems containing thousands of atoms with a favorable computational scaling as a function of system size. We provide computational details and benchmark calculations to demonstrate its accuracy and computational parallelizability on a variety of large material systems. As a representative example, we calculate laser-induced electron dynamics in a 512-atom amorphous silicon supercell to highlight the large periodic systems that can be examined with our implementation. Taken together, our VG-rtTDDFTB approach enables new electron dynamics simulations of complex systems that require large periodic supercells, such as crystal defects, complex surfaces, nanowires, and amorphous materials.
△ Less
Submitted 21 May, 2024; v1 submitted 18 August, 2023;
originally announced August 2023.
-
Enhancing Optimization Performance: A Novel Hybridization of Gaussian Crunching Search and Powell's Method for Derivative-Free Optimization
Authors:
Benny Wong
Abstract:
This research paper presents a novel approach to enhance optimization performance through the hybridization of Gaussian Crunching Search (GCS) and Powell's Method for derivative-free optimization. While GCS has shown promise in overcoming challenges faced by traditional derivative-free optimization methods [1], it may not always excel in finding the local minimum. On the other hand, some tradition…
▽ More
This research paper presents a novel approach to enhance optimization performance through the hybridization of Gaussian Crunching Search (GCS) and Powell's Method for derivative-free optimization. While GCS has shown promise in overcoming challenges faced by traditional derivative-free optimization methods [1], it may not always excel in finding the local minimum. On the other hand, some traditional methods may have better performance in this regard. However, GCS demonstrates its strength in escaping the trap of local minima and approaching the global minima. Through experimentation, we discovered that by combining GCS with certain traditional derivative-free optimization methods, we can significantly boost performance while retaining the respective advantages of each method. This hybrid approach opens up new possibilities for optimizing complex systems and finding optimal solutions in a range of applications.
△ Less
Submitted 8 August, 2023;
originally announced August 2023.
-
A new derivative-free optimization method: Gaussian Crunching Search
Authors:
Benny Wong
Abstract:
Optimization methods are essential in solving complex problems across various domains. In this research paper, we introduce a novel optimization method called Gaussian Crunching Search (GCS). Inspired by the behaviour of particles in a Gaussian distribution, GCS aims to efficiently explore the solution space and converge towards the global optimum. We present a comprehensive analysis of GCS, inclu…
▽ More
Optimization methods are essential in solving complex problems across various domains. In this research paper, we introduce a novel optimization method called Gaussian Crunching Search (GCS). Inspired by the behaviour of particles in a Gaussian distribution, GCS aims to efficiently explore the solution space and converge towards the global optimum. We present a comprehensive analysis of GCS, including its working mechanism, and potential applications. Through experimental evaluations and comparisons with existing optimization methods, we highlight the advantages and strengths of GCS. This research paper serves as a valuable resource for researchers, practitioners, and students interested in optimization, providing insights into the development and potential of Gaussian Crunching Search as a new and promising approach.
△ Less
Submitted 24 July, 2023;
originally announced July 2023.
-
Anatomy of High-Performance GEMM with Online Fault Tolerance on GPUs
Authors:
Shixun Wu,
Yujia Zhai,
Jinyang Liu,
Jiajun Huang,
Zizhe Jian,
Bryan M. Wong,
Zizhong Chen
Abstract:
General Matrix Multiplication (GEMM) is a crucial algorithm for various applications such as machine learning and scientific computing, and an efficient GEMM implementation is essential for the performance of these systems. While researchers often strive for faster performance by using large compute platforms, the increased scale of these systems can raise concerns about hardware and software reli…
▽ More
General Matrix Multiplication (GEMM) is a crucial algorithm for various applications such as machine learning and scientific computing, and an efficient GEMM implementation is essential for the performance of these systems. While researchers often strive for faster performance by using large compute platforms, the increased scale of these systems can raise concerns about hardware and software reliability. In this paper, we present a design for a high-performance GEMM with algorithm-based fault tolerance for use on GPUs. We describe fault-tolerant designs for GEMM at the thread, warp, and threadblock levels, and also provide a baseline GEMM implementation that is competitive with or faster than the state-of-the-art, proprietary cuBLAS GEMM. We present a kernel fusion strategy to overlap and mitigate the memory latency due to fault tolerance with the original GEMM computation. To support a wide range of input matrix shapes and reduce development costs, we present a template-based approach for automatic code generation for both fault-tolerant and non-fault-tolerant GEMM implementations. We evaluate our work on NVIDIA Tesla T4 and A100 server GPUs. Experimental results demonstrate that our baseline GEMM presents comparable or superior performance compared to the closed-source cuBLAS. The fault-tolerant GEMM incurs only a minimal overhead (8.89\% on average) compared to cuBLAS even with hundreds of errors injected per minute. For irregularly shaped inputs, the code generator-generated kernels show remarkable speedups of $160\% \sim 183.5\%$ and $148.55\% \sim 165.12\%$ for fault-tolerant and non-fault-tolerant GEMMs, outperforming cuBLAS by up to $41.40\%$.
△ Less
Submitted 1 May, 2023;
originally announced May 2023.
-
Disentangling Structural Breaks in Factor Models for Macroeconomic Data
Authors:
Bonsoo Koo,
Benjamin Wong,
Ze-Yu Zhong
Abstract:
Through a routine normalization of the factor variance, standard methods for estimating factor models in macroeconomics do not distinguish between breaks of the factor variance and factor loadings. We argue that it is important to distinguish between structural breaks in the factor variance and loadings within factor models commonly employed in macroeconomics as both can lead to markedly different…
▽ More
Through a routine normalization of the factor variance, standard methods for estimating factor models in macroeconomics do not distinguish between breaks of the factor variance and factor loadings. We argue that it is important to distinguish between structural breaks in the factor variance and loadings within factor models commonly employed in macroeconomics as both can lead to markedly different interpretations when viewed via the lens of the underlying dynamic factor model. We then develop a projection-based decomposition that leads to two standard and easy-to-implement Wald tests to disentangle structural breaks in the factor variance and factor loadings. Applying our procedure to U.S. macroeconomic data, we find evidence of both types of breaks associated with the Great Moderation and the Great Recession. Through our projection-based decomposition, we estimate that the Great Moderation is associated with an over 60% reduction in the total factor variance, highlighting the relevance of disentangling breaks in the factor structure.
△ Less
Submitted 3 June, 2024; v1 submitted 28 February, 2023;
originally announced March 2023.
-
Generalized Standard Model with higher-order derivatives under Rotor Mechanism and its Quantization
Authors:
B. T. T. Wong
Abstract:
The Standard Model is the paradigm of particle physics which gives an accurate theory for fundamental particle interactions. However, the extension of Standard Model with higher-order derivatives is not a well-studied subject. This paper is a follow-up work of the previous study of the generalized Abelian gauge field theory and Yang-Mills theory under rotor mechanism of order $n$ of higher order d…
▽ More
The Standard Model is the paradigm of particle physics which gives an accurate theory for fundamental particle interactions. However, the extension of Standard Model with higher-order derivatives is not a well-studied subject. This paper is a follow-up work of the previous study of the generalized Abelian gauge field theory and Yang-Mills theory under rotor mechanism of order $n$ of higher order derivatives, and we apply it to the Standard Model of particle physics. Rotor mechanism on scalar field and Dirac field is also studied. We will study the quantization of the rotored Standard Model using path integral approach. We also inherit the previous result from the path integral quantization of generalized Abelian gauge field and apply it to our non-Abelian case. Then we carry out the generalized BRST quantization and prove the existence of the Slavnov-Taylor Identities of the rotor model. Finally, we discuss the possibility of rotor model on taming the infinities arise from the self-energy correction of the Higgs boson in high spacetime dimension, thus this provides a partial solution and new insights to the Hierarchy problem.
△ Less
Submitted 29 June, 2023; v1 submitted 10 January, 2023;
originally announced January 2023.
-
Machine Learning with High-Cardinality Categorical Features in Actuarial Applications
Authors:
Benjamin Avanzi,
Greg Taylor,
Melantha Wang,
Bernard Wong
Abstract:
High-cardinality categorical features are pervasive in actuarial data (e.g. occupation in commercial property insurance). Standard categorical encoding methods like one-hot encoding are inadequate in these settings.
In this work, we present a novel _Generalised Linear Mixed Model Neural Network_ ("GLMMNet") approach to the modelling of high-cardinality categorical features. The GLMMNet integrate…
▽ More
High-cardinality categorical features are pervasive in actuarial data (e.g. occupation in commercial property insurance). Standard categorical encoding methods like one-hot encoding are inadequate in these settings.
In this work, we present a novel _Generalised Linear Mixed Model Neural Network_ ("GLMMNet") approach to the modelling of high-cardinality categorical features. The GLMMNet integrates a generalised linear mixed model in a deep learning framework, offering the predictive power of neural networks and the transparency of random effects estimates, the latter of which cannot be obtained from the entity embedding models. Further, its flexibility to deal with any distribution in the exponential dispersion (ED) family makes it widely applicable to many actuarial contexts and beyond.
We illustrate and compare the GLMMNet against existing approaches in a range of simulation experiments as well as in a real-life insurance case study. Notably, we find that the GLMMNet often outperforms or at least performs comparably with an entity embedded neural network, while providing the additional benefit of transparency, which is particularly valuable in practical applications.
Importantly, while our model was motivated by actuarial applications, it can have wider applicability. The GLMMNet would suit any applications that involve high-cardinality categorical variables and where the response cannot be sufficiently modelled by a Gaussian distribution.
△ Less
Submitted 30 January, 2023;
originally announced January 2023.
-
A light-induced Weyl semiconductor-to-metal transition mediated by Peierls instability
Authors:
H. Ning,
O. Mehio,
C. Lian,
X. Li,
E. Zoghlin,
P. Zhou,
B. Cheng,
S. D. Wilson,
B. M. Wong,
D. Hsieh
Abstract:
Elemental tellurium is a strongly spin-orbit coupled Peierls-distorted semiconductor whose band structure features topologically protected Weyl nodes. Using time-dependent density functional theory calculations, we show that impulsive optical excitation can be used to transiently control the amplitude of the Peierls distortion, realizing a mechanism to switch tellurium between three states: Weyl s…
▽ More
Elemental tellurium is a strongly spin-orbit coupled Peierls-distorted semiconductor whose band structure features topologically protected Weyl nodes. Using time-dependent density functional theory calculations, we show that impulsive optical excitation can be used to transiently control the amplitude of the Peierls distortion, realizing a mechanism to switch tellurium between three states: Weyl semiconductor, Weyl metal and non-Weyl metal. Further, we present experimental evidence of this inverse-Peierls distortion using time-resolved optical second harmonic generation measurements. These results provide a pathway to multifunctional ultrafast Weyl devices and introduce Peierls systems as viable hosts of light-induced topological transitions.
△ Less
Submitted 2 November, 2022;
originally announced November 2022.
-
Bergman representative coordinate, constant holomorphic curvature and a multidimensional generalization of Carathéodory's theorem
Authors:
Robert Xin Dong,
Bun Wong
Abstract:
By using the Bergman representative coordinate and Calabi's diastasis, we extend a theorem of Lu to bounded pseudoconvex domains whose Bergman metric is incomplete with constant holomorphic sectional curvature. We characterize such domains that are biholomorphic to a ball possibly less a relatively closed pluripolar set. We also provide a multidimensional generalization of Carathéodory's theorem o…
▽ More
By using the Bergman representative coordinate and Calabi's diastasis, we extend a theorem of Lu to bounded pseudoconvex domains whose Bergman metric is incomplete with constant holomorphic sectional curvature. We characterize such domains that are biholomorphic to a ball possibly less a relatively closed pluripolar set. We also provide a multidimensional generalization of Carathéodory's theorem on the continuous extension of the biholomorphisms up to the closures. In particular, sufficient conditions are given, in terms of the Bergman kernel, for the boundary of a biholomorphic ball to be a topological sphere.
△ Less
Submitted 28 February, 2023; v1 submitted 18 September, 2022;
originally announced September 2022.
-
FOLIO: Natural Language Reasoning with First-Order Logic
Authors:
Simeng Han,
Hailey Schoelkopf,
Yilun Zhao,
Zhenting Qi,
Martin Riddell,
Wenfei Zhou,
James Coady,
David Peng,
Yujie Qiao,
Luke Benson,
Lucy Sun,
Alex Wardle-Solano,
Hannah Szabo,
Ekaterina Zubova,
Matthew Burtell,
Jonathan Fan,
Yixin Liu,
Brian Wong,
Malcolm Sailor,
Ansong Ni,
Linyong Nan,
Jungo Kasai,
Tao Yu,
Rui Zhang,
Alexander R. Fabbri
, et al. (10 additional authors not shown)
Abstract:
Large language models (LLMs) have achieved remarkable performance on a variety of natural language understanding tasks. However, existing benchmarks are inadequate in measuring the complex logical reasoning capabilities of a model. We present FOLIO, a human-annotated, logically complex and diverse dataset for reasoning in natural language (NL), equipped with first-order logic (FOL) annotations. FO…
▽ More
Large language models (LLMs) have achieved remarkable performance on a variety of natural language understanding tasks. However, existing benchmarks are inadequate in measuring the complex logical reasoning capabilities of a model. We present FOLIO, a human-annotated, logically complex and diverse dataset for reasoning in natural language (NL), equipped with first-order logic (FOL) annotations. FOLIO consists of 1,430 examples (unique conclusions), each paired with one of 487 sets of premises used to deductively reason for the validity of each conclusion. The logical correctness of the premises and conclusions is ensured by their FOL annotations, which are automatically verified by an FOL inference engine. In addition to the main NL reasoning task, NL-FOL pairs in FOLIO constitute a new NL-FOL translation dataset. Our experiments on FOLIO systematically evaluate the FOL reasoning ability of supervised fine-tuning on medium-sized language models. For both NL reasoning and NL-FOL translation, we benchmark multiple state-of-the-art language models. Our results show that a subset of FOLIO presents a challenge for one of the most capable {Large Language Model (LLM)} publicly available, GPT-4.
△ Less
Submitted 17 May, 2024; v1 submitted 2 September, 2022;
originally announced September 2022.
-
Human-Assisted Robotic Detection of Foreign Object Debris Inside Confined Spaces of Marine Vessels Using Probabilistic Mapping
Authors:
Benjamin Wong,
Wade Marquette,
Nikolay Bykov,
Tyler M. Paine,
Ashis G. Banerjee
Abstract:
Many complex vehicular systems, such as large marine vessels, contain confined spaces like water tanks, which are critical for the safe functioning of the vehicles. It is particularly hazardous for humans to inspect such spaces due to limited accessibility, poor visibility, and unstructured configuration. While robots provide a viable alternative, they encounter the same set of challenges in reali…
▽ More
Many complex vehicular systems, such as large marine vessels, contain confined spaces like water tanks, which are critical for the safe functioning of the vehicles. It is particularly hazardous for humans to inspect such spaces due to limited accessibility, poor visibility, and unstructured configuration. While robots provide a viable alternative, they encounter the same set of challenges in realizing robust autonomy. In this work, we specifically address the problem of detecting foreign object debris (FODs) left inside the confined spaces using a visual mapping-based system that relies on Mahalanobis distance-driven comparisons between the nominal and online maps for local outlier identification. Simulation trials show extremely high recall but low precision for the outlier identification method. The assistance of remote humans is, therefore, taken to deal with the precision problem by going over the close-up robot camera images of the outlier regions. An online survey is conducted to show the usefulness of this assistance process. Physical experiments are also reported on a GPU-enabled mobile robot platform inside a scaled-down, prototype tank to demonstrate the feasibility of the FOD detection system.
△ Less
Submitted 31 August, 2022; v1 submitted 1 July, 2022;
originally announced July 2022.
-
Ensemble distributional forecasting for insurance loss reserving
Authors:
Benjamin Avanzi,
Yanfeng Li,
Bernard Wong,
Alan Xian
Abstract:
Loss reserving generally focuses on identifying a single model that can generate superior predictive performance. However, different loss reserving models specialise in capturing different aspects of loss data. This is recognised in practice in the sense that results from different models are often considered, and sometimes combined. For instance, actuaries may take a weighted average of the predi…
▽ More
Loss reserving generally focuses on identifying a single model that can generate superior predictive performance. However, different loss reserving models specialise in capturing different aspects of loss data. This is recognised in practice in the sense that results from different models are often considered, and sometimes combined. For instance, actuaries may take a weighted average of the prediction outcomes from various loss reserving models, often based on subjective assessments.
In this paper, we propose a systematic framework to objectively combine (i.e. ensemble) multiple _stochastic_ loss reserving models such that the strengths offered by different models can be utilised effectively. Our framework contains two main innovations compared to existing literature and practice. Firstly, our criteria model combination considers the full distributional properties of the ensemble and not just the central estimate - which is of particular importance in the reserving context. Secondly, our framework is that it is tailored for the features inherent to reserving data. These include, for instance, accident, development, calendar, and claim maturity effects. Crucially, the relative importance and scarcity of data across accident periods renders the problem distinct from the traditional ensembling techniques in statistical learning.
Our framework is illustrated with a complex synthetic dataset. In the results, the optimised ensemble outperforms both (i) traditional model selection strategies, and (ii) an equally weighted ensemble. In particular, the improvement occurs not only with central estimates but also relevant quantiles, such as the 75th percentile of reserves (typically of interest to both insurers and regulators). The framework developed in this paper can be implemented thanks to an R package, `ADLP`, which is available from CRAN.
△ Less
Submitted 3 June, 2024; v1 submitted 17 June, 2022;
originally announced June 2022.
-
The study of conformal geometry and its exact solution of the geodesic deviation equation
Authors:
B. T. T. Wong
Abstract:
In this paper, the geometric properties of the conformal metric are studied and its exact solution of the geodesic deviation equation is presented. We also find out the stress-energy tensor of this geometry and compare it with the usual prefect-fluid case, obtaining an equation of state as $P = -\frac{1}{3}ρ$ in 4D space-time dimension. Finally, the low-energy regime of the metric is studied, in w…
▽ More
In this paper, the geometric properties of the conformal metric are studied and its exact solution of the geodesic deviation equation is presented. We also find out the stress-energy tensor of this geometry and compare it with the usual prefect-fluid case, obtaining an equation of state as $P = -\frac{1}{3}ρ$ in 4D space-time dimension. Finally, the low-energy regime of the metric is studied, in which we obtain the stress-energy tensor proportional to the projection tensor.
△ Less
Submitted 31 March, 2022;
originally announced April 2022.
-
The complete metric study of effective Dirac algebra
Authors:
B. T. T. Wong
Abstract:
Following our work from the previous paper about the study of effective Dirac algebra and the metric of the simple, special case of relativistic hydrogen atom, this paper gives the complete metric study defined by the effective Dirac algebra in the Dirac and Weyl presentation, showing that relativistic electromagnetic interaction gives the correction of the flat background metric $η_{μν}$, thus cu…
▽ More
Following our work from the previous paper about the study of effective Dirac algebra and the metric of the simple, special case of relativistic hydrogen atom, this paper gives the complete metric study defined by the effective Dirac algebra in the Dirac and Weyl presentation, showing that relativistic electromagnetic interaction gives the correction of the flat background metric $η_{μν}$, thus curving spacetime. The curved metric can be nicely broken down into two parts, the pure correction on the flat spacetime metric and the projection tensor. We find that the curved metric is independent of the representation chosen.
△ Less
Submitted 28 March, 2022;
originally announced April 2022.
-
Improved Band Gaps and Structural Properties from Wannier-Fermi-Löwdin Self-Interaction Corrections for Periodic Systems
Authors:
Ravindra Shinde,
Sharma S. R. K. C. Yamijala,
Bryan M. Wong
Abstract:
The accurate prediction of band gaps and structural properties in periodic systems continues to be one of the central goals of electronic structure theory. However, band gaps obtained from popular exchange-correlation functionals (such as LDA and PBE) are severely underestimated partly due to the spurious self-interaction error (SIE) inherent to these functionals. In this work, we present a new fo…
▽ More
The accurate prediction of band gaps and structural properties in periodic systems continues to be one of the central goals of electronic structure theory. However, band gaps obtained from popular exchange-correlation functionals (such as LDA and PBE) are severely underestimated partly due to the spurious self-interaction error (SIE) inherent to these functionals. In this work, we present a new formulation and implementation of Wannier function-derived Fermi-Löwdin (WFL) orbitals for correcting the SIE in periodic systems. Since our approach utilizes a variational minimization of the self-interaction energy with respect to the Wannier charge centers, it is computationally more efficient than the HSE hybrid functional and other self-interaction corrections that require a large number of transformation matrix elements. Calculations on several (17 in total) prototypical molecular solids, semiconductors, and wide-bandgap materials show that our WFL self-interaction correction approach gives better band gaps and bulk moduli compared to semilocal functionals, largely due to the partial removal of self-interaction errors.
△ Less
Submitted 16 March, 2022;
originally announced March 2022.
-
High-Temperature Decomposition of Diisopropyl Methylphosphonate (DIMP) on Alumina: Mechanistic Predictions from Ab Initio Molecular Dynamics
Authors:
Sohag Biswas,
Bryan M. Wong
Abstract:
The enhanced degradation of organophosphorous-based chemical warfare agents (CWAs) on metal-oxide surfaces holds immense promise for neutralization efforts; however, the underlying mechanisms in this process remain poorly understood. We utilize large-scale quantum calculations for the first time to probe the high-temperature degradation of diisopropyl methylphosphonate (DIMP), a nerve agent simula…
▽ More
The enhanced degradation of organophosphorous-based chemical warfare agents (CWAs) on metal-oxide surfaces holds immense promise for neutralization efforts; however, the underlying mechanisms in this process remain poorly understood. We utilize large-scale quantum calculations for the first time to probe the high-temperature degradation of diisopropyl methylphosphonate (DIMP), a nerve agent simulant. Our Born-Oppenheimer molecular dynamics (BOMD) calculations show that the $γ$-Al$_2$O$_3$ surface shows immense promise for quickly adsorbing and destroying CWAs. We find that the alumina surface quickly adsorbs DIMP at all temperatures, and subsequent decomposition of DIMP proceeds via a propene elimination. Our BOMD calculations are complemented with metadynamics simulations to produce free energy paths, which show that the activation barrier decreases with temperature and DIMP readily decomposes on $γ$-Al$_2$O$_3$. Our first-principle BOMD and metadynamics simulations provide crucial diagnostics for sarin decomposition models and mechanistic information for examining CWA decomposition reactions on other candidate metal oxide surfaces.
△ Less
Submitted 15 March, 2022;
originally announced March 2022.
-
HADOKEN: An Open-Source Software Package for Predicting Electron Confinement Effects in Various Nanowire Geometries and Configurations
Authors:
Bryan M. Wong,
Cameron Chevalier
Abstract:
We present an open-source software package, HADOKEN (High-level Algorithms to Design, Optimize, and Keep Electrons in Nanowires), for predicting electron confinement/localization effects in nanowires with various geometries, {arbitrary number of concentric shell layers,} doping densities, and external boundary conditions. The HADOKEN code is written in the MATLAB programming environments to aid in…
▽ More
We present an open-source software package, HADOKEN (High-level Algorithms to Design, Optimize, and Keep Electrons in Nanowires), for predicting electron confinement/localization effects in nanowires with various geometries, {arbitrary number of concentric shell layers,} doping densities, and external boundary conditions. The HADOKEN code is written in the MATLAB programming environments to aid in its readability and general accessibility to both users and practitioners. We provide several examples and outputs on a variety of different nanowire geometries, boundary conditions, and doping densities to demonstrate the capabilities of the HADOKEN software package. As such, the use of this predictive and versatile tool by both experimentalists and theorists could lead to further advances in both understanding and tailoring electron confinement effects in these nanosystems.
△ Less
Submitted 11 March, 2022; v1 submitted 10 March, 2022;
originally announced March 2022.
-
On the surplus management of funds with assets and liabilities in presence of solvency requirements
Authors:
Benjamin Avanzi,
Ping Chen,
Lars Frederik Brandt Henriksen,
Bernard Wong
Abstract:
In this paper we consider a company whose assets and liabilities evolve according to a correlated bivariate geometric Brownian motion, such as in Gerber and Shiu (2003). We determine what dividend strategy maximises the expected present value of dividends until ruin in two cases: (i) when shareholders won't cover surplus shortfalls and a solvency constraint (as in Paulsen, 2003) is consequently im…
▽ More
In this paper we consider a company whose assets and liabilities evolve according to a correlated bivariate geometric Brownian motion, such as in Gerber and Shiu (2003). We determine what dividend strategy maximises the expected present value of dividends until ruin in two cases: (i) when shareholders won't cover surplus shortfalls and a solvency constraint (as in Paulsen, 2003) is consequently imposed, and (ii) when shareholders are always to fund any capital deficiency with capital (asset) injections. In the latter case, ruin will never occur and the objective is to maximise the difference between dividends and capital injections.
Developing and using appropriate verification lemmas, we show that the optimal dividend strategy is, in both cases, of barrier type. Both value functions are derived in closed form. Furthermore, the barrier is defined on the ratio of assets to liabilities, which mimics some of the dividend strategies that can be observed in practice by insurance companies. Existence and uniqueness of the optimal strategies are shown. Results are illustrated.
△ Less
Submitted 5 August, 2022; v1 submitted 9 March, 2022;
originally announced March 2022.
-
AssistQ: Affordance-centric Question-driven Task Completion for Egocentric Assistant
Authors:
Benita Wong,
Joya Chen,
You Wu,
Stan Weixian Lei,
Dongxing Mao,
Difei Gao,
Mike Zheng Shou
Abstract:
A long-standing goal of intelligent assistants such as AR glasses/robots has been to assist users in affordance-centric real-world scenarios, such as "how can I run the microwave for 1 minute?". However, there is still no clear task definition and suitable benchmarks. In this paper, we define a new task called Affordance-centric Question-driven Task Completion, where the AI assistant should learn…
▽ More
A long-standing goal of intelligent assistants such as AR glasses/robots has been to assist users in affordance-centric real-world scenarios, such as "how can I run the microwave for 1 minute?". However, there is still no clear task definition and suitable benchmarks. In this paper, we define a new task called Affordance-centric Question-driven Task Completion, where the AI assistant should learn from instructional videos to provide step-by-step help in the user's view. To support the task, we constructed AssistQ, a new dataset comprising 531 question-answer samples from 100 newly filmed instructional videos. We also developed a novel Question-to-Actions (Q2A) model to address the AQTC task and validate it on the AssistQ dataset. The results show that our model significantly outperforms several VQA-related baselines while still having large room for improvement. We expect our task and dataset to advance Egocentric AI Assistant's development. Our project page is available at: https://showlab.github.io/assistq/.
△ Less
Submitted 20 July, 2022; v1 submitted 8 March, 2022;
originally announced March 2022.
-
Detection and treatment of outliers for multivariate robust loss reserving
Authors:
Benjamin Avanzi,
Mark Lavender,
Greg Taylor,
Bernard Wong
Abstract:
Traditional techniques for calculating outstanding claim liabilities such as the chain ladder are notoriously at risk of being distorted by outliers in past claims data. Unfortunately, the literature in robust methods of reserving is scant, with notable exceptions such as Verdonck and Debruyne (2011) and Verdonck and Van Wouwe (2011). In this paper, we put forward two alternative robust bivariate…
▽ More
Traditional techniques for calculating outstanding claim liabilities such as the chain ladder are notoriously at risk of being distorted by outliers in past claims data. Unfortunately, the literature in robust methods of reserving is scant, with notable exceptions such as Verdonck and Debruyne (2011) and Verdonck and Van Wouwe (2011). In this paper, we put forward two alternative robust bivariate chain-ladder techniques to extend the approach of Verdonck and Van Wouwe (2011). The first technique is based on Adjusted Outlyingness (Hubert and Van der Veeken, 2008) and explicitly incorporates skewness into the analysis whilst providing a unique measure of outlyingness for each observation. The second technique is based on bagdistance (Hubert et al., 2016) which is derived from the bagplot however is able to provide a unique measure of outlyingness and a means to adjust outlying observations based on this measure.
Furthermore, we extend our robust bivariate chain-ladder approach to an N-dimensional framework. The implementation of the methods, especially beyond bivariate, is not trivial. This is illustrated on a trivariate data set from Australian general insurers, and results under the different outlier detection and treatment mechanisms are compared.
△ Less
Submitted 15 June, 2023; v1 submitted 8 March, 2022;
originally announced March 2022.
-
On the impact of outliers in loss reserving
Authors:
Benjamin Avanzi,
Mark Lavender,
Greg Taylor,
Bernard Wong
Abstract:
The sensitivity of loss reserving techniques to outliers in the data or deviations from model assumptions is a well known challenge. It has been shown that the popular chain-ladder reserving approach is at significant risk to such aberrant observations in that reserve estimates can be significantly shifted in the presence of even one outlier. As a consequence the chain-ladder reserving technique i…
▽ More
The sensitivity of loss reserving techniques to outliers in the data or deviations from model assumptions is a well known challenge. It has been shown that the popular chain-ladder reserving approach is at significant risk to such aberrant observations in that reserve estimates can be significantly shifted in the presence of even one outlier. As a consequence the chain-ladder reserving technique is non-robust. In this paper we investigate the sensitivity of reserves and mean squared errors of prediction under Mack's Model (Mack, 1993). This is done through the derivation of impact functions which are calculated by taking the first derivative of the relevant statistic of interest with respect to an observation. We also provide and discuss the impact functions for quantiles when total reserves are assumed to be lognormally distributed. Additionally, comparisons are made between the impact functions for individual accident year reserves under Mack's Model and the Bornhuetter-Ferguson methodology. It is shown that the impact of incremental claims on these statistics of interest varies widely throughout a loss triangle and is heavily dependent on other cells in the triangle.
Results are illustrated using data from a Belgian non-life insurer.
△ Less
Submitted 20 June, 2023; v1 submitted 28 February, 2022;
originally announced March 2022.
-
Generalized Yang-Mills Theory under Rotor Mechanism
Authors:
B. T. T. Wong
Abstract:
This paper follows the previous work on generalized abelian gauge field theory of higher-order derivatives under rotor model and extends the study to the most generalized non-abelian case. We find that the rotor mechanism from the abelian case applies nicely to the non-abelian case under the Lorentz gauge condition. Under the rotor mechanism, the gauge field transforms as…
▽ More
This paper follows the previous work on generalized abelian gauge field theory of higher-order derivatives under rotor model and extends the study to the most generalized non-abelian case. We find that the rotor mechanism from the abelian case applies nicely to the non-abelian case under the Lorentz gauge condition. Under the rotor mechanism, the gauge field transforms as $T_μ^a \rightarrow \Box^n T_μ^a$. When the order of field derivative is $n=0$, this restores back to the original Yang-Mills action. Our work gives an extensive generalization of the Yang-Mills theory with higher-order field derivatives. We also compute the equation of motion and Noether's current of the generalized non-abelian gauge field theory. Finally, we study the dynamic instability issue of the theory by the Ostrogradsky construction and the analysis of the 00-component of the energy-momentum tensor.
△ Less
Submitted 31 March, 2022; v1 submitted 28 January, 2022;
originally announced February 2022.
-
Quantization of Generalized Abelian Gauge Field Theory under Rotor Model
Authors:
B. T. T. Wong
Abstract:
This paper is a follow-up work of the previous study of the generalized abelian gauge field theory under rotor model of order $n$ of higher order derivatives. We will study the quantization of this theory using path integral approach and find out the Feynman propagator (2-point correlation function) of this generalized theory. We also investigate the generalized Proca action under rotor model and…
▽ More
This paper is a follow-up work of the previous study of the generalized abelian gauge field theory under rotor model of order $n$ of higher order derivatives. We will study the quantization of this theory using path integral approach and find out the Feynman propagator (2-point correlation function) of this generalized theory. We also investigate the generalized Proca action under rotor model and derive the Feynman propagator for the massive case.
△ Less
Submitted 29 August, 2021;
originally announced September 2021.
-
Bergman-Calabi diastasis and Kähler metric of constant holomorphic sectional curvature
Authors:
Robert Xin Dong,
Bun Wong
Abstract:
We prove that for a bounded domain in $\mathbb C^n$ with the Bergman metric of constant holomorphic sectional curvature being biholomorphic to a ball is equivalent to the hyperconvexity or the exhaustiveness of the Bergman-Calabi diastasis. By finding its connection with the Bergman representative coordinate, we give explicit formulas of the Bergman-Calabi diastasis and show that it has bounded gr…
▽ More
We prove that for a bounded domain in $\mathbb C^n$ with the Bergman metric of constant holomorphic sectional curvature being biholomorphic to a ball is equivalent to the hyperconvexity or the exhaustiveness of the Bergman-Calabi diastasis. By finding its connection with the Bergman representative coordinate, we give explicit formulas of the Bergman-Calabi diastasis and show that it has bounded gradient. In particular, we prove that any bounded domain whose Bergman metric has constant holomorphic sectional curvature is Lu Qi-Keng. We also extend a theorem of Lu towards the incomplete situation and characterize pseudoconvex domains that are biholomorphic to a ball possibly less a relatively closed pluripolar set.
△ Less
Submitted 28 September, 2021; v1 submitted 2 September, 2021;
originally announced September 2021.
-
The Theory of Fundamental Duality, Quantum Dualiton and Topological Dual Invariance
Authors:
B. T. T. Wong
Abstract:
Fundamental duality is a concept which refers to two irreducible, heterogeneous principles which are in opposite and complementary of each other. The complementary principle in quantum mechanics is also praised by Bohr. This important concept is known to appear in a lot of places in our physical universe, however a rigorous mathematical definition and physics theory has not yet ever developed in a…
▽ More
Fundamental duality is a concept which refers to two irreducible, heterogeneous principles which are in opposite and complementary of each other. The complementary principle in quantum mechanics is also praised by Bohr. This important concept is known to appear in a lot of places in our physical universe, however a rigorous mathematical definition and physics theory has not yet ever developed in a formal way. In this paper, we establish a formalism for fundamental duality and study its various properties and theorems. One of the most profound results is that we establish a relation between dual invariance and topological invariance, and we find that the topological Chern-Simons form is a dual invariant action. Finally we apply the concept of duality to study dual state oscillation, and predict a theoretical new matter of state of dualiton, which is the particle excitation of the dual field by second quantization. This new exotic quasi-particle state is expected to have an impact in particle physics and condensed matter physics.
△ Less
Submitted 28 January, 2023; v1 submitted 30 June, 2021;
originally announced August 2021.
-
Stochastic loss reserving with mixture density neural networks
Authors:
Muhammed Taher Al-Mudafer,
Benjamin Avanzi,
Greg Taylor,
Bernard Wong
Abstract:
Neural networks offer a versatile, flexible and accurate approach to loss reserving. However, such applications have focused primarily on the (important) problem of fitting accurate central estimates of the outstanding claims. In practice, properties regarding the variability of outstanding claims are equally important (e.g., quantiles for regulatory purposes).
In this paper we fill this gap by…
▽ More
Neural networks offer a versatile, flexible and accurate approach to loss reserving. However, such applications have focused primarily on the (important) problem of fitting accurate central estimates of the outstanding claims. In practice, properties regarding the variability of outstanding claims are equally important (e.g., quantiles for regulatory purposes).
In this paper we fill this gap by applying a Mixture Density Network ("MDN") to loss reserving. The approach combines a neural network architecture with a mixture Gaussian distribution to achieve simultaneously an accurate central estimate along with flexible distributional choice. Model fitting is done using a rolling-origin approach. Our approach consistently outperforms the classical over-dispersed model both for central estimates and quantiles of interest, when applied to a wide range of simulated environments of various complexity and specifications.
We further extend the MDN approach by proposing two extensions. Firstly, we present a hybrid GLM-MDN approach called "ResMDN". This hybrid approach balances the tractability and ease of understanding of a traditional GLM model on one hand, with the additional accuracy and distributional flexibility provided by the MDN on the other. We show that it can successfully improve the errors of the baseline ccODP, although there is generally a loss of performance when compared to the MDN in the examples we considered. Secondly, we allow for explicit projection constraints, so that actuarial judgement can be directly incorporated in the modelling process.
Throughout, we focus on aggregate loss triangles, and show that our methodologies are tractable, and that they out-perform traditional approaches even with relatively limited amounts of data. We use both simulated data -- to validate properties, and real data -- to illustrate and ascertain practicality of the approaches.
△ Less
Submitted 17 August, 2021;
originally announced August 2021.
-
The effective Dirac algebra by gauge field interaction in relativistic electrodynamics
Authors:
B. T. T. Wong
Abstract:
Conventional relativistic electrodynamics is set on flat Minkowski spacetime, where all computable quantities are calculated from the flat metric $η_{μν}$. We can redefine the metric of spacetime from the Dirac algebra. In this paper, we study how an electrodynamic interaction can alter the normal gamma matrix to an effective one and result in a shift in the metric perturbatively. The curvature pr…
▽ More
Conventional relativistic electrodynamics is set on flat Minkowski spacetime, where all computable quantities are calculated from the flat metric $η_{μν}$. We can redefine the metric of spacetime from the Dirac algebra. In this paper, we study how an electrodynamic interaction can alter the normal gamma matrix to an effective one and result in a shift in the metric perturbatively. The curvature properties inferred from the curved metric are also investigated. We also study how the spin operator is changed under the interaction that contribute to an effective spin operator and how the spin of an electron will be slightly deviated from $1/2$. Then we perform canonical quantization of the effective Dirac algebra. Finally we apply our results to the relativistic hydrogen case and demonstrate how such system curves the spacetime metric.
△ Less
Submitted 20 March, 2022; v1 submitted 4 May, 2021;
originally announced June 2021.
-
Approximate Bayesian Computation for an Explicit-Duration Hidden Markov Model of COVID-19 Hospital Trajectories
Authors:
Gian Marco Visani,
Alexandra Hope Lee,
Cuong Nguyen,
David M. Kent,
John B. Wong,
Joshua T. Cohen,
Michael C. Hughes
Abstract:
We address the problem of modeling constrained hospital resources in the midst of the COVID-19 pandemic in order to inform decision-makers of future demand and assess the societal value of possible interventions. For broad applicability, we focus on the common yet challenging scenario where patient-level data for a region of interest are not available. Instead, given daily admissions counts, we mo…
▽ More
We address the problem of modeling constrained hospital resources in the midst of the COVID-19 pandemic in order to inform decision-makers of future demand and assess the societal value of possible interventions. For broad applicability, we focus on the common yet challenging scenario where patient-level data for a region of interest are not available. Instead, given daily admissions counts, we model aggregated counts of observed resource use, such as the number of patients in the general ward, in the intensive care unit, or on a ventilator. In order to explain how individual patient trajectories produce these counts, we propose an aggregate count explicit-duration hidden Markov model, nicknamed the ACED-HMM, with an interpretable, compact parameterization. We develop an Approximate Bayesian Computation approach that draws samples from the posterior distribution over the model's transition and duration parameters given aggregate counts from a specific location, thus adapting the model to a region or individual hospital site of interest. Samples from this posterior can then be used to produce future forecasts of any counts of interest. Using data from the United States and the United Kingdom, we show our mechanistic approach provides competitive probabilistic forecasts for the future even as the dynamics of the pandemic shift. Furthermore, we show how our model provides insight about recovery probabilities or length of stay distributions, and we suggest its potential to answer challenging what-if questions about the societal value of possible interventions.
△ Less
Submitted 28 July, 2021; v1 submitted 28 April, 2021;
originally announced May 2021.
-
Generalized Abelian Gauge Field Theory under Rotor Model
Authors:
B. T. T. Wong
Abstract:
Gauge field theory with rank-one field $T_μ$ is a quantum field theory that describes the interaction of elementary spin-1 particles, of which being massless to preserve gauge symmetry. In this paper, we give a generalized, extended study of abelian gauge field theory under successive rotor model in general $D$-dimensional flat spacetime for spin-1 particles in the context of higher order derivati…
▽ More
Gauge field theory with rank-one field $T_μ$ is a quantum field theory that describes the interaction of elementary spin-1 particles, of which being massless to preserve gauge symmetry. In this paper, we give a generalized, extended study of abelian gauge field theory under successive rotor model in general $D$-dimensional flat spacetime for spin-1 particles in the context of higher order derivatives. We establish a theorem that $n$ rotor contributes to the $\Box^n T^μ$ fields in the integration-by-parts formalism of the action. This corresponds to the transformation of gauge field $T^μ \rightarrow \Box^n T^μ$ and gauge field strength $G_{μν}\rightarrow \Box^n G_{μν} $ in the action. The $n=0$ case restores back to the standard abelian gauge field theory. The equation of motion and Noether's conserved current of the theory are also studied.
△ Less
Submitted 16 August, 2021; v1 submitted 19 March, 2021;
originally announced April 2021.
-
Forecasting COVID-19 Counts At A Single Hospital: A Hierarchical Bayesian Approach
Authors:
Alexandra Hope Lee,
Panagiotis Lymperopoulos,
Joshua T. Cohen,
John B. Wong,
Michael C. Hughes
Abstract:
We consider the problem of forecasting the daily number of hospitalized COVID-19 patients at a single hospital site, in order to help administrators with logistics and planning. We develop several candidate hierarchical Bayesian models which directly capture the count nature of data via a generalized Poisson likelihood, model time-series dependencies via autoregressive and Gaussian process latent…
▽ More
We consider the problem of forecasting the daily number of hospitalized COVID-19 patients at a single hospital site, in order to help administrators with logistics and planning. We develop several candidate hierarchical Bayesian models which directly capture the count nature of data via a generalized Poisson likelihood, model time-series dependencies via autoregressive and Gaussian process latent processes, and share statistical strength across related sites. We demonstrate our approach on public datasets for 8 hospitals in Massachusetts, U.S.A. and 10 hospitals in the United Kingdom. Further prospective evaluation compares our approach favorably to baselines currently used by stakeholders at 3 related hospitals to forecast 2-week-ahead demand by rescaling state-level forecasts.
△ Less
Submitted 14 April, 2021;
originally announced April 2021.
-
Upper Extremity Load Reduction for Lower LimbExoskeleton Trajectory Generation Using AnkleTorque Minimization
Authors:
Yik Ben Wong,
Yawen Chen,
Kam Fai Elvis Tsang,
Winnie Suk Wai Leung,
Ling Shi
Abstract:
Recently, the lower limb exoskeletons which providemobility for paraplegic patients to support their daily life havedrawn much attention. However, the pilots are required to applyexcessive force through a pair of crutches to maintain balanceduring walking. This paper proposes a novel gait trajectorygeneration algorithm for exoskeleton locomotion on flat groundand stair which aims to minimize the f…
▽ More
Recently, the lower limb exoskeletons which providemobility for paraplegic patients to support their daily life havedrawn much attention. However, the pilots are required to applyexcessive force through a pair of crutches to maintain balanceduring walking. This paper proposes a novel gait trajectorygeneration algorithm for exoskeleton locomotion on flat groundand stair which aims to minimize the force applied by the pilotwithout increasing the degree of freedom (DoF) of the system.First, the system is modelled as a five-link mechanism dynam-ically for torque computing. Then, an optimization approachis used to generate the trajectory minimizing the ankle torquewhich is correlated to the supporting force. Finally, experimentis conducted to compare the different gait generation algorithmsthrough measurement of ground reaction force (GRF) appliedon the crutches
△ Less
Submitted 9 November, 2020;
originally announced November 2020.
-
SynthETIC: an individual insurance claim simulator with feature control
Authors:
Benjamin Avanzi,
Gregory Clive Taylor,
Melantha Wang,
Bernard Wong
Abstract:
Recent years have seen rapid increase in the application of machine learning to insurance loss reserving. They yield most value when applied to large data sets, such as individual claims, or large claim triangles. In short, they are likely to be useful in the analysis of any data set whose volume is sufficient to obscure a naked-eye view of its features. Unfortunately, such large data sets are in…
▽ More
Recent years have seen rapid increase in the application of machine learning to insurance loss reserving. They yield most value when applied to large data sets, such as individual claims, or large claim triangles. In short, they are likely to be useful in the analysis of any data set whose volume is sufficient to obscure a naked-eye view of its features. Unfortunately, such large data sets are in short supply in the actuarial literature. Accordingly, one needs to turn to synthetic data. Although the ultimate objective of these methods is application to real data, the use of synthetic data containing features commonly observed in real data is also to be encouraged.
While there are a number of claims simulators in existence, each valuable within its own context, the inclusion of a number of desirable (but complicated) data features requires further development. Accordingly, in this paper we review those desirable features, and propose a new simulator of individual claim experience called `SynthETIC`.
Our simulator is publicly available, open source, and fills a gap in the non-life actuarial toolkit. The simulator specifically allows for desirable (but optionally complicated) data features typically occurring in practice, such as variations in rates of settlements and development patterns; as with superimposed inflation, and various discontinuities, and also enables various dependencies between variables. The user has full control of the mechanics of the evolution of an individual claim. As a result, the complexity of the data set generated (meaning the level of difficulty of analysis) may be dialled anywhere from extremely simple to extremely complex.
△ Less
Submitted 25 August, 2021; v1 submitted 13 August, 2020;
originally announced August 2020.
-
6 nm super-resolution optical transmission and scattering spectroscopic imaging of carbon nanotubes using a nanometer-scale white light source
Authors:
Xuezhi Ma,
Qiushi Liu,
Ning Yu,
Da Xu,
Sanggon Kim,
Zebin Liu,
Kaili Jiang,
Bryan M. Wong,
Ruoxue Yan,
Ming Liu
Abstract:
Optical hyperspectral imaging based on absorption and scattering of photons at the visible and adjacent frequencies denotes one of the most informative and inclusive characterization methods in material research. Unfortunately, restricted by the diffraction limit of light, it is unable to resolve the nanoscale inhomogeneity in light-matter interactions, which is diagnostic of the local modulation…
▽ More
Optical hyperspectral imaging based on absorption and scattering of photons at the visible and adjacent frequencies denotes one of the most informative and inclusive characterization methods in material research. Unfortunately, restricted by the diffraction limit of light, it is unable to resolve the nanoscale inhomogeneity in light-matter interactions, which is diagnostic of the local modulation in material structure and properties. Moreover, many nanomaterials have highly anisotropic optical properties that are outstandingly appealing yet hard to characterize through conventional optical methods. Therefore, there has been a pressing demand in the diverse fields including electronics, photonics, physics, and materials science to extend the optical hyperspectral imaging into the nanometer length scale. In this work, we report a super-resolution hyperspectral imaging technique that simultaneously measures optical absorption and scattering spectra with the illumination from a tungsten-halogen lamp. We demonstrated sub-5 nm spatial resolution in both visible and near-infrared wavelengths (415 to 980 nm) for the hyperspectral imaging of strained single-walled carbon nanotubes (SWNT) and reconstructed true-color images to reveal the longitudinal and transverse optical transition-induced light absorption and scattering in the SWNTs. This is the first time transverse optical absorption in SWNTs were clearly observed experimentally. The new technique provides rich near-field spectroscopic information that had made it possible to analyze the spatial modulation of band-structure along a single SWNT induced through strain engineering.
△ Less
Submitted 11 March, 2021; v1 submitted 8 June, 2020;
originally announced June 2020.
-
On the optimality of joint periodic and extraordinary dividend strategies
Authors:
Benjamin Avanzi,
Hayden Lau,
Bernard Wong
Abstract:
In this paper, we model the cash surplus (or equity) of a risky business with a Brownian motion. Owners can take cash out of the surplus in the form of "dividends", subject to transaction costs. However, if the surplus hits 0 then ruin occurs and the business cannot operate any more.
We consider two types of dividend distributions: (i) periodic, regular ones (that is, dividends can be paid only…
▽ More
In this paper, we model the cash surplus (or equity) of a risky business with a Brownian motion. Owners can take cash out of the surplus in the form of "dividends", subject to transaction costs. However, if the surplus hits 0 then ruin occurs and the business cannot operate any more.
We consider two types of dividend distributions: (i) periodic, regular ones (that is, dividends can be paid only at countable many points in time, according to a specific arrival process); and (ii) extraordinary dividend payments that can be made immediately at any time (that is, the dividend decision time space is continuous and matches that of the surplus process). Both types of dividends attract proportional transaction costs, and extraordinary distributions also attracts fixed transaction costs, a realistic feature. A dividend strategy that involves both types of distributions (periodic and extraordinary) is qualified as "hybrid".
We determine which strategies (either periodic, immediate, or hybrid) are optimal, that is, we show which are the strategies that maximise the expected present value of dividends paid until ruin, net of transaction costs. Sometimes, a liquidation strategy (which pays out all monies and stops the process) is optimal. Which strategy is optimal depends on the profitability of the business, and the level of (proportional and fixed) transaction costs. Results are illustrated.
△ Less
Submitted 2 December, 2020; v1 submitted 1 June, 2020;
originally announced June 2020.
-
Twisted Mazur pattern satellite knots and bordered Floer theory
Authors:
Ina Petkova,
Biji Wong
Abstract:
We use bordered Floer theory to study properties of twisted Mazur pattern satellite knots $Q_{n}(K)$. We prove that $Q_n(K)$ is not Floer homologically thin, with two exceptions. We calculate the 3-genus of $Q_{n}(K)$ in terms of the twisting parameter $n$ and the 3-genus of the companion $K$, and we determine when $Q_n(K)$ is fibered. As an application to our results on Floer thickness and 3-genu…
▽ More
We use bordered Floer theory to study properties of twisted Mazur pattern satellite knots $Q_{n}(K)$. We prove that $Q_n(K)$ is not Floer homologically thin, with two exceptions. We calculate the 3-genus of $Q_{n}(K)$ in terms of the twisting parameter $n$ and the 3-genus of the companion $K$, and we determine when $Q_n(K)$ is fibered. As an application to our results on Floer thickness and 3-genus, we verify the Cosmetic Surgery Conjecture for many of these satellite knots.
△ Less
Submitted 19 March, 2021; v1 submitted 26 May, 2020;
originally announced May 2020.
-
On unbalanced data and common shock models in stochastic loss reserving
Authors:
Benjamin Avanzi,
Gregory Clive Taylor,
Phuong Anh Vu,
Bernard Wong
Abstract:
Introducing common shocks is a popular dependence modelling approach, with some recent applications in loss reserving. The main advantage of this approach is the ability to capture structural dependence coming from known relationships. In addition, it helps with the parsimonious construction of correlation matrices of large dimensions. However, complications arise in the presence of "unbalanced da…
▽ More
Introducing common shocks is a popular dependence modelling approach, with some recent applications in loss reserving. The main advantage of this approach is the ability to capture structural dependence coming from known relationships. In addition, it helps with the parsimonious construction of correlation matrices of large dimensions. However, complications arise in the presence of "unbalanced data", that is, when (expected) magnitude of observations over a single triangle, or between triangles, can vary substantially. Specifically, if a single common shock is applied to all of these cells, it can contribute insignificantly to the larger values and/or swamp the smaller ones, unless careful adjustments are made. This problem is further complicated in applications involving negative claim amounts. In this paper, we address this problem in the loss reserving context using a common shock Tweedie approach for unbalanced data. We show that the solution not only provides a much better balance of the common shock proportions relative to the unbalanced data, but it is also parsimonious. Finally, the common shock Tweedie model also provides distributional tractability.
△ Less
Submitted 17 May, 2020; v1 submitted 7 May, 2020;
originally announced May 2020.
-
On the modelling of multivariate counts with Cox processes and dependent shot noise intensities
Authors:
Benjamin Avanzi,
Gregory Clive Taylor,
Bernard Wong,
Xinda Yang
Abstract:
In this paper, we develop a method to model and estimate several, _dependent_ count processes, using granular data. Specifically, we develop a multivariate Cox process with shot noise intensities to jointly model the arrival process of counts (e.g. insurance claims). The dependency structure is introduced via multivariate shot noise _intensity_ processes which are connected with the help of Lévy c…
▽ More
In this paper, we develop a method to model and estimate several, _dependent_ count processes, using granular data. Specifically, we develop a multivariate Cox process with shot noise intensities to jointly model the arrival process of counts (e.g. insurance claims). The dependency structure is introduced via multivariate shot noise _intensity_ processes which are connected with the help of Lévy copulas. In aggregate, our approach allows for (i) over-dispersion and auto-correlation within each line of business; (ii) realistic features involving time-varying, known covariates; and (iii) parsimonious dependence between processes without requiring simultaneous primary (e.g. accidents) events.
The explicit incorporation of time-varying, known covariates can accommodate characteristics of real data and hence facilitate implementation in practice. In an insurance context, these could be changes in policy volumes over time, as well as seasonality patterns and trends, which may explain some of the relationship (dependence) between multiple claims processes, or at least help tease out those relationships.
Finally, we develop a filtering algorithm based on the reversible-jump Markov Chain Monte Carlo (RJMCMC) method to estimate the latent stochastic intensities and illustrate model calibration using real data from the AUSI data set.
△ Less
Submitted 3 December, 2020; v1 submitted 23 April, 2020;
originally announced April 2020.
-
A multivariate evolutionary generalised linear model framework with adaptive estimation for claims reserving
Authors:
Benjamin Avanzi,
Gregory Clive Taylor,
Phuong Anh Vu,
Bernard Wong
Abstract:
In this paper, we develop a multivariate evolutionary generalised linear model (GLM) framework for claims reserving, which allows for dynamic features of claims activity in conjunction with dependency across business lines to accurately assess claims reserves. We extend the traditional GLM reserving framework on two fronts: GLM fixed factors are allowed to evolve in a recursive manner, and depende…
▽ More
In this paper, we develop a multivariate evolutionary generalised linear model (GLM) framework for claims reserving, which allows for dynamic features of claims activity in conjunction with dependency across business lines to accurately assess claims reserves. We extend the traditional GLM reserving framework on two fronts: GLM fixed factors are allowed to evolve in a recursive manner, and dependence is incorporated in the specification of these factors using a common shock approach.
We consider factors that evolve across accident years in conjunction with factors that evolve across calendar years. This two-dimensional evolution of factors is unconventional as a traditional evolutionary model typically considers the evolution in one single time dimension. This creates challenges for the estimation process, which we tackle in this paper. We develop the formulation of a particle filtering algorithm with parameter learning procedure. This is an adaptive estimation approach which updates evolving factors of the framework recursively over time.
We implement and illustrate our model with a simulated data set, as well as a set of real data from a Canadian insurer.
△ Less
Submitted 15 April, 2020;
originally announced April 2020.
-
Optimal periodic dividend strategies for spectrally negative Lévy processes with fixed transaction costs
Authors:
Benjamin Avanzi,
Hayden Lau,
Bernard Wong
Abstract:
Maximising dividends is one classical stability criterion in actuarial risk theory. Motivated by the fact that dividends are paid periodically in real life, $\textit{periodic}$ dividend strategies were recently introduced (Albrecher, Gerber and Shiu, 2011). In this paper, we incorporate fixed transaction costs into the model and study the optimal periodic dividend strategy with fixed transaction c…
▽ More
Maximising dividends is one classical stability criterion in actuarial risk theory. Motivated by the fact that dividends are paid periodically in real life, $\textit{periodic}$ dividend strategies were recently introduced (Albrecher, Gerber and Shiu, 2011). In this paper, we incorporate fixed transaction costs into the model and study the optimal periodic dividend strategy with fixed transaction costs for spectrally negative Lévy processes.
The value function of a periodic $(b_u,b_l)$ strategy is calculated by means of exiting identities and Itô's excusion when the surplus process is of unbounded variation. We show that a sufficient condition for optimality is that the Lévy measure admits a density which is completely monotonic. Under such assumptions, a periodic $(b_u,b_l)$ strategy is confirmed to be optimal.
Results are illustrated.
△ Less
Submitted 3 December, 2020; v1 submitted 3 April, 2020;
originally announced April 2020.