A framework for developing a knowledge management platform

Authors: Marie Lisandra Zepeda Mendoza, Sonali Agarwal, James A. Blackshaw, Vanesa Bol, Audrey Fazzi, Filippo Fiorini, Amy Louise Foreman, Nancy George, Brett R. Johnson, Brian Martin, Dave McComb, Euphemia Mutasa-Gottgens, Helen Parkinson, Martin Romacker, Rolf Russell, Valérien Ségard, Shawn Zheng Kai Tan, Wei Kheng Teh, F. P. Winstanley, Benedict Wong, Adrian M. Smith

Abstract: Knowledge management (KM) involves collecting, organizing, storing, and disseminating information to improve decision-making, innovation, and performance. Implementing KM at scale has become essential for organizations to effectively leverage vast accessible data. This paper is a compilation of concepts that emerged from KM workshops hosted by EMBL-EBI, attended by SMEs and industry. We provide guidance on envisioning, executing, evaluating, and evolving knowledge management platforms. We emphasize essential considerations such as setting knowledge domain boundaries and measuring success, as well as the importance of making knowledge accessible for downstream applications and non-computational users, and we highlight the personal and organizational skills necessary for success. We stress the importance of collaboration and the need for convergence on shared principles and commitment to provide or seek resources to advance KM. The community is invited to join the journey of KM and contribute to the advancement of the field by applying and improving on the guidelines described.

Submitted 18 June, 2024; originally announced June 2024.

Comments: 18 pages, 1 figure

arXiv:2406.01002 [pdf, ps, other]

Random Subspace Local Projections

Authors: Viet Hoang Dinh, Didier Nibbering, Benjamin Wong

Abstract: We show how random subspace methods can be adapted to estimating local projections with many controls. Random subspace methods have their roots in the machine learning literature and are implemented by averaging over regressions estimated over different combinations of subsets of these controls. We document three key results: (i) Our approach can successfully recover the impulse response functions across Monte Carlo experiments representative of different macroeconomic settings and identification schemes. (ii) Our results suggest that random subspace methods are more accurate than other dimension reduction methods if the underlying large dataset has a factor structure similar to typical macroeconomic datasets such as FRED-MD. (iii) Our approach leads to differences in the estimated impulse response functions relative to benchmark methods when applied to two widely studied empirical applications.

Submitted 3 June, 2024; originally announced June 2024.
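
The averaging step described in the abstract above can be sketched in a few lines. This is only an illustration under stated assumptions, not the authors' estimator: the function names (`ols`, `random_subspace_lp`), the subset size `k`, the number of draws, and the synthetic data are all hypothetical, and the regression is plain OLS solved through the normal equations.

```python
import random

def ols(X, y):
    """Solve the normal equations (X'X) b = X'y by Gauss-Jordan elimination."""
    p = len(X[0])
    # Augmented matrix [X'X | X'y]
    A = [[sum(r[i] * r[j] for r in X) for j in range(p)]
         + [sum(r[i] * yi for r, yi in zip(X, y))] for i in range(p)]
    for c in range(p):
        piv = max(range(c, p), key=lambda r: abs(A[r][c]))  # partial pivoting
        A[c], A[piv] = A[piv], A[c]
        for r in range(p):
            if r != c:
                f = A[r][c] / A[c][c]
                A[r] = [a - f * b for a, b in zip(A[r], A[c])]
    return [A[i][p] / A[i][i] for i in range(p)]

def random_subspace_lp(y, x, W, h, k, draws=100, seed=0):
    """Average the horizon-h coefficient on x over random k-subsets of the controls W."""
    rng = random.Random(seed)
    T = len(y) - h
    total = 0.0
    for _ in range(draws):
        cols = rng.sample(range(len(W[0])), k)  # one random subspace of controls
        X = [[1.0, x[t]] + [W[t][j] for j in cols] for t in range(T)]
        total += ols(X, y[h:])[1]  # local projection: y_{t+h} on x_t and controls
    return total / draws

# Hypothetical noise-free data: the true response of y to x at horizon 0 is 2.0.
rng = random.Random(1)
T = 60
x = [rng.gauss(0, 1) for _ in range(T)]
W = [[rng.gauss(0, 1) for _ in range(3)] for _ in range(T)]
y = [2.0 * x[t] + 0.5 * W[t][0] - W[t][1] + 0.3 * W[t][2] for t in range(T)]

beta_full = random_subspace_lp(y, x, W, h=0, k=3, draws=5)   # every draw uses all controls
beta_sub = random_subspace_lp(y, x, W, h=0, k=2, draws=50)   # averages over 2-control subsets
```

With `k` equal to the number of controls, every draw is the full regression, so the average reduces to ordinary OLS; smaller `k` trades some omitted-variable bias for lower variance, which is the trade-off the random subspace idea exploits.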

arXiv:2406.00998 [pdf, other]

Distributional Refinement Network: Distributional Forecasting via Deep Learning

Authors: Benjamin Avanzi, Eric Dong, Patrick J. Laub, Bernard Wong

Abstract: A key task in actuarial modelling involves modelling the distributional properties of losses. Classic (distributional) regression approaches like Generalized Linear Models (GLMs; Nelder and Wedderburn, 1972) are commonly used, but challenges remain in developing models that can (i) allow covariates to flexibly impact different aspects of the conditional distribution, (ii) integrate developments in machine learning and AI to maximise the predictive power while considering (i), and, (iii) maintain a level of interpretability in the model to enhance trust in the model and its outputs, which is often compromised in efforts pursuing (i) and (ii). We tackle this problem by proposing a Distributional Refinement Network (DRN), which combines an inherently interpretable baseline model (such as GLMs) with a flexible neural network, a modified Deep Distribution Regression (DDR; Li et al., 2019) method. Inspired by the Combined Actuarial Neural Network (CANN; Schelldorfer and Wüthrich, 2019), our approach flexibly refines the entire baseline distribution. As a result, the DRN captures varying effects of features across all quantiles, improving predictive performance while maintaining adequate interpretability. Using both synthetic and real-world data, we demonstrate the DRN's superior distributional forecasting capacity. The DRN has the potential to be a powerful distributional regression model in actuarial science and beyond.

Submitted 3 June, 2024; originally announced June 2024.

MSC Class: 91G70; 91G60; 62P05; 91B30

arXiv:2405.05299 [pdf, other]

Challenges for Responsible AI Design and Workflow Integration in Healthcare: A Case Study of Automatic Feeding Tube Qualification in Radiology

Authors: Anja Thieme, Abhijith Rajamohan, Benjamin Cooper, Heather Groombridge, Robert Simister, Barney Wong, Nicholas Woznitza, Mark Ames Pinnock, Maria Teodora Wetscherek, Cecily Morrison, Hannah Richardson, Fernando Pérez-García, Stephanie L. Hyland, Shruthi Bannur, Daniel C. Castro, Kenza Bouzid, Anton Schwaighofer, Mercy Ranjit, Harshita Sharma, Matthew P. Lungren, Ozan Oktay, Javier Alvarez-Valle, Aditya Nori, Stephen Harris, Joseph Jacob

Abstract: Nasogastric tubes (NGTs) are feeding tubes that are inserted through the nose into the stomach to deliver nutrition or medication. If not placed correctly, they can cause serious harm, even death to patients. Recent AI developments demonstrate the feasibility of robustly detecting NGT placement from Chest X-ray images to reduce risks of sub-optimally or critically placed NGTs being missed or delayed in their detection, but gaps remain in clinical practice integration. In this study, we present a human-centered approach to the problem and describe insights derived following contextual inquiry and in-depth interviews with 15 clinical stakeholders. The interviews helped understand challenges in existing workflows, and how best to align technical capabilities with user needs and expectations. We discovered the trade-offs and complexities that need consideration when choosing suitable workflow stages, target users, and design configurations for different AI proposals. We explored how to balance AI benefits and risks for healthcare staff and patients within broader organizational and medical-legal constraints. We also identified data issues related to edge cases and data biases that affect model training and evaluation; how data documentation practices influence data preparation and labelling; and how to measure relevant AI outcomes reliably in future evaluations. We discuss how our work informs design and development of AI applications that are clinically useful, ethical, and acceptable in real-world healthcare services.

Submitted 8 May, 2024; originally announced May 2024.

ACM Class: H.5.m; I.2.m

arXiv:2403.08131 [pdf, other]

Cost-Effective Methodology for Complex Tuning Searches in HPC: Navigating Interdependencies and Dimensionality

Authors: Adrian Perez Dieguez, Min Choi, Mahmut Okyay, Mauro Del Ben, Bryan M. Wong, Khaled Z. Ibrahim

Abstract: Tuning searches are pivotal in High-Performance Computing (HPC), addressing complex optimization challenges in computational applications. The complexity arises not only from finely tuning parameters within routines but also from potential interdependencies among them, rendering traditional optimization methods inefficient. Instead of scrutinizing interdependencies among parameters and routines, practitioners often face the dilemma of conducting independent tuning searches for each routine, thereby overlooking interdependence, or pursuing a more resource-intensive joint search for all routines. This decision is driven by the consideration that some interdependence analysis and high-dimensional decomposition techniques in literature may be prohibitively expensive in HPC tuning searches. Our methodology adapts and refines these methods to ensure computational feasibility while maximizing performance gains in real-world scenarios. Our methodology leverages a cost-effective interdependence analysis to decide whether to merge several tuning searches into a joint search or conduct orthogonal searches. Tested on synthetic functions with varying levels of parameter interdependence, our methodology efficiently explores the search space. In comparison to Bayesian-optimization-based fully independent or fully joint searches, our methodology suggested an optimized breakdown of independent and merged searches that led to final configurations up to 8% more accurate, reducing the search time by up to 95%.
When applied to GPU-offloaded Real-Time Time-Dependent Density Functional Theory (RT-TDDFT), an application in computational materials science that challenges modern HPC autotuners, our methodology achieved an effective tuning search. Its adaptability and efficiency extend beyond RT-TDDFT, making it valuable for related applications in HPC.

Submitted 12 March, 2024; originally announced March 2024.

arXiv:2403.02593 [pdf, ps, other]

The Ramsey numbers for trees of order $n$ with maximum degree at least $n-5$ versus the wheel graph of order nine

Authors: Zhi Yee Chng, Thomas Britz, Ta Sheng Tan, Kok Bin Wong

Abstract: The Ramsey numbers $R(T_n,W_8)$ are determined for each tree graph $T_n$ of order $n\geq 7$ and maximum degree $Δ(T_n)$ equal to either $n-4$ or $n-5$. These numbers indicate strong support for the conjecture, due to Chen, Zhang and Zhang and to Hafidh and Baskoro, that $R(T_n,W_m) = 2n-1$ for each tree graph $T_n$ of order $n\geq m-1$ with $Δ(T_n)\leq n-m+2$ when $m\geq 4$ is even.

Submitted 4 March, 2024; originally announced March 2024.

MSC Class: 05C55; 05D10

arXiv:2310.04786 [pdf, other]

On the evolution of data breach reporting patterns and frequency in the United States: a cross-state analysis

Authors: Benjamin Avanzi, Xingyun Tan, Greg Taylor, Bernard Wong

Abstract: Understanding the emergence of data breaches is crucial for cyber insurance. However, analyses of data breach frequency trends in the current literature lead to contradictory conclusions. We put forward that those discrepancies may be (at least partially) due to inconsistent data collection standards, as well as reporting patterns, over time and space. We set out to carefully control both. In this paper, we conduct a joint analysis of state Attorneys General's publications on data breaches across eight states (namely, California, Delaware, Indiana, Maine, Montana, North Dakota, Oregon, and Washington), all of which are subject to established data collection standards, namely state data breach (mandatory) notification laws. Thanks to our explicit recognition of these notification laws, we are capable of modelling frequency of breaches in a consistent and comparable way over time. Hence, we are able to isolate and capture the complexities of reporting patterns, adequately estimate IBNRs, and yield a highly reliable assessment of historical frequency trends in data breaches. Our analysis also provides a comprehensive comparison of data breach frequency across the eight U.S. states, extending knowledge on state-specific differences in cyber risk, which has not been extensively discussed in the current literature. Furthermore, we uncover novel features not previously discussed in the literature, such as differences in cyber risk frequency trends between large and small data breaches. Overall, we find that the reporting delays are lengthening.
We also elicit commonalities and heterogeneities in reporting patterns across states, severity levels, and time periods. After adequately estimating IBNRs, we find that frequency is relatively stable before 2020 and increasing after 2020. This is consistent across states. Implications of our findings for cyber insurance are discussed.

Submitted 30 June, 2024; v1 submitted 7 October, 2023; originally announced October 2023.

MSC Class: 91G70; 62P05; 91B30 (Primary)

arXiv:2310.00588 [pdf, other]

Active Anomaly Detection in Confined Spaces Using Ergodic Traversal of Directed Region Graphs

Authors: Benjamin Wong, Tyler M. Paine, Santosh Devasia, Ashis G. Banerjee

Abstract: We provide the first step toward developing a hierarchical control-estimation framework to actively plan robot trajectories for anomaly detection in confined spaces. The space is represented globally using a directed region graph, where a region is a landmark that needs to be visited (inspected). We devise a fast mixing Markov chain to find an ergodic route that traverses this graph so that the region visitation frequency is proportional to its anomaly detection uncertainty, while satisfying the edge directionality (region transition) constraint(s). Preliminary simulation results show fast convergence to the ergodic solution and confident estimation of the presence of anomalies in the inspected regions.

Submitted 1 October, 2023; originally announced October 2023.
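
The idea of a Markov chain whose visitation frequency is proportional to a per-region uncertainty can be illustrated with a standard Metropolis-Hastings construction. This is a swapped-in textbook technique, not the paper's fast mixing chain, and it assumes an undirected region graph (symmetric neighbor lists), so it does not handle the edge directionality constraints mentioned in the abstract. The graph, the weights, and the function name `metropolis_chain` are hypothetical.

```python
def metropolis_chain(neighbors, weight):
    """Transition matrix whose stationary distribution is proportional to weight.

    Assumes an undirected graph: j in neighbors[i] iff i in neighbors[j].
    """
    n = len(neighbors)
    P = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in neighbors[i]:
            # Propose j uniformly among i's neighbors, then accept with the
            # Metropolis-Hastings ratio for target weights and these proposals.
            accept = min(1.0, (weight[j] * len(neighbors[i]))
                         / (weight[i] * len(neighbors[j])))
            P[i][j] = accept / len(neighbors[i])
        P[i][i] = 1.0 - sum(P[i])  # remain in place when the proposal is rejected
    return P

# Hypothetical ring of four regions with assumed uncertainty weights.
neighbors = [[1, 3], [0, 2], [1, 3], [0, 2]]
uncertainty = [0.4, 0.3, 0.2, 0.1]
P = metropolis_chain(neighbors, uncertainty)

# One step of the chain started from the target weights should return them.
after_step = [sum(uncertainty[i] * P[i][j] for i in range(4)) for j in range(4)]
```

Because the chain satisfies detailed balance with respect to the weights, a long route sampled from `P` visits each region with frequency proportional to its uncertainty.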

arXiv:2309.05884 [pdf, ps, other]

doi 10.1116/5.0162455

Accelerating Quantum Optimal Control of Multi-Qubit Systems with Symmetry-Based Hamiltonian Transformations

Authors: Xian Wang, Mahmut Sait Okyay, Anshuman Kumar, Bryan M. Wong

Abstract: We present a novel, computationally efficient approach to accelerate quantum optimal control calculations of large multi-qubit systems used in a variety of quantum computing applications. By leveraging the intrinsic symmetry of finite groups, the Hilbert space can be decomposed and the Hamiltonians block-diagonalized to enable extremely fast quantum optimal control calculations. Our approach reduces the Hamiltonian size of an $n$-qubit system from $2^n \times 2^n$ to $O(n \times n)$ or $O((2^n/n) \times (2^n/n))$ under $S_n$ or $D_n$ symmetry, respectively. Most importantly, this approach reduces the computational runtime of qubit optimal control calculations by orders of magnitude while maintaining the same accuracy as the conventional method. As prospective applications, we show that (1) symmetry-protected subspaces can be potential platforms for quantum error suppression and simulation of other quantum Hamiltonians, and (2) Lie-Trotter-Suzuki decomposition approaches can generalize our method to a general variety of multi-qubit systems.

Submitted 3 October, 2023; v1 submitted 11 September, 2023; originally announced September 2023.

Comments: 15 pages, 5 figures. This article may be downloaded for personal use only. Any other use requires prior permission of the author and AIP Publishing. This article appeared in AVS Quantum Science and may be found at https://doi.org/10.1116/5.0162455. Find the PDF of the Supplementary Material in the source files

Journal ref: AVS Quantum Sci. 5, 043801 (2023)

arXiv:2308.15482 [pdf, other]

Empirical Study of Straggler Problem in Parameter Server on Iterative Convergent Distributed Machine Learning

Authors: Benjamin Wong

Abstract: The purpose of this study is to test the effectiveness of current straggler mitigation techniques on several important iterative convergent machine learning (ML) algorithms, including Matrix Factorization (MF), Multinomial Logistic Regression (MLR), and Latent Dirichlet Allocation (LDA). The experiment was implemented using the FlexPS system, which is the latest system implementation that employs a parameter server architecture. The experiment employed the Bulk Synchronous Parallel (BSP) computational model to examine the straggler problem in Parameter Server on Iterative Convergent Distributed Machine Learning. Moreover, the current research analyzes the experimental arrangement of the parameter server strategy concerning the parallel learning problems by injecting universal straggler patterns and executing the latest mitigation techniques. The findings of the study are significant in that they will provide the necessary platform for conducting further research into the problem and allow the researcher to compare different methods for various applications. The outcome is therefore expected to facilitate the development of new techniques coupled with new perspectives in addressing this problem.

Submitted 28 July, 2023; originally announced August 2023.

Comments: 6 pages, 8 figures

ACM Class: H.2.4
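
Why a straggler matters under the Bulk Synchronous Parallel model can be shown with a toy timing model: every iteration ends at a barrier, so its duration is set by the slowest worker. The sketch below is a hypothetical illustration with made-up timings; it does not use FlexPS or reproduce the paper's experiments.

```python
def bsp_total_time(worker_times):
    """Total wall time under BSP.

    worker_times[i][w] is the compute time of worker w in iteration i.
    Each iteration waits at a barrier for its slowest worker.
    """
    return sum(max(iteration) for iteration in worker_times)

# Three iterations on four workers, each taking 1.0 time unit (assumed values).
baseline = [[1.0, 1.0, 1.0, 1.0] for _ in range(3)]

# Same workload, but one worker is delayed to 5.0 units in the second iteration.
with_straggler = [row[:] for row in baseline]
with_straggler[1][2] = 5.0

t_base = bsp_total_time(baseline)
t_slow = bsp_total_time(with_straggler)
```

A single slow worker in one iteration delays every other worker in that iteration, which is the effect that straggler mitigation techniques try to reduce.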

arXiv:2308.13523 [pdf, other]

On the Particle and Field nature of $γ^μ$ matrices in the Dirac Equation and the Nature's intrinsic fifth force

Authors: B. T. T. Wong

Abstract: The Dirac equation is a cornerstone of modern particle physics, which integrates special relativity and quantum mechanics into a consistent framework, yielding the prediction of the electron and its antiparticle counterpart, the positron. The Dirac equation also lays the foundation of quantum electrodynamics, such that QED phenomena are supported by fundamental Dirac algebra calculations. In this article, we introduce a new perspective on the $γ^μ$ matrices in the Dirac algebra: by realizing that the $γ^μ$ matrices are actual formal quantum fields, the excitations of the $γ^μ$ fields correspond to a new particle with both boson and fermion nature. Thus, we show that $γ^μ$ is a particle in nature, and can be referred to as nature's intrinsic fifth force. The $γ^μ$ field also serves as the boson-fermion connector in QED interaction.

Submitted 7 September, 2023; v1 submitted 31 July, 2023; originally announced August 2023.

Comments: 15 pages, 2 figures

arXiv:2308.09782 [pdf, other]

doi 10.1021/acs.jctc.3c00689

Velocity-gauge real-time time-dependent density functional tight-binding for large-scale condensed matter systems

Authors: Qiang Xu, Mauro Del Ben, Mahmut Sait Okyay, Min Choi, Khaled Z. Ibrahim, Bryan M. Wong

Abstract: We present a new velocity-gauge real-time, time-dependent density functional tight-binding (VG-rtTDDFTB) implementation in the open-source DFTB+ software package (https://dftbplus.org) for probing electronic excitations in large, condensed matter systems. Our VG-rtTDDFTB approach enables real-time electron dynamics simulations of large, periodic, condensed matter systems containing thousands of atoms with a favorable computational scaling as a function of system size. We provide computational details and benchmark calculations to demonstrate its accuracy and computational parallelizability on a variety of large material systems. As a representative example, we calculate laser-induced electron dynamics in a 512-atom amorphous silicon supercell to highlight the large periodic systems that can be examined with our implementation. Taken together, our VG-rtTDDFTB approach enables new electron dynamics simulations of complex systems that require large periodic supercells, such as crystal defects, complex surfaces, nanowires, and amorphous materials.

Submitted 21 May, 2024; v1 submitted 18 August, 2023; originally announced August 2023.

Comments: 23 pages, 6 figures

Journal ref: J. Chem. Theory Comput., 19, 22, 7989-7997 (2024)

arXiv:2308.04649 [pdf]

Enhancing Optimization Performance: A Novel Hybridization of Gaussian Crunching Search and Powell's Method for Derivative-Free Optimization

Authors: Benny Wong

Abstract: This research paper presents a novel approach to enhance optimization performance through the hybridization of Gaussian Crunching Search (GCS) and Powell's Method for derivative-free optimization. While GCS has shown promise in overcoming challenges faced by traditional derivative-free optimization methods [1], it may not always excel in finding the local minimum. On the other hand, some traditional methods may have better performance in this regard. However, GCS demonstrates its strength in escaping the trap of local minima and approaching the global minima. Through experimentation, we discovered that by combining GCS with certain traditional derivative-free optimization methods, we can significantly boost performance while retaining the respective advantages of each method. This hybrid approach opens up new possibilities for optimizing complex systems and finding optimal solutions in a range of applications.

Submitted 8 August, 2023; originally announced August 2023.

Comments: 8 pages

arXiv:2307.14359 [pdf]

A new derivative-free optimization method: Gaussian Crunching Search

Authors: Benny Wong

Abstract: Optimization methods are essential in solving complex problems across various domains. In this research paper, we introduce a novel optimization method called Gaussian Crunching Search (GCS). Inspired by the behaviour of particles in a Gaussian distribution, GCS aims to efficiently explore the solution space and converge towards the global optimum. We present a comprehensive analysis of GCS, including its working mechanism and potential applications. Through experimental evaluations and comparisons with existing optimization methods, we highlight the advantages and strengths of GCS. This research paper serves as a valuable resource for researchers, practitioners, and students interested in optimization, providing insights into the development and potential of Gaussian Crunching Search as a new and promising approach.

Submitted 24 July, 2023; originally announced July 2023.

Comments: 7 pages

arXiv:2305.01024 [pdf, other]

doi 10.1145/3577193.3593715

Anatomy of High-Performance GEMM with Online Fault Tolerance on GPUs

Authors: Shixun Wu, Yujia Zhai, Jinyang Liu, Jiajun Huang, Zizhe Jian, Bryan M. Wong, Zizhong Chen

Abstract: General Matrix Multiplication (GEMM) is a crucial algorithm for various applications such as machine learning and scientific computing, and an efficient GEMM implementation is essential for the performance of these systems. While researchers often strive for faster performance by using large compute platforms, the increased scale of these systems can raise concerns about hardware and software reliability. In this paper, we present a design for a high-performance GEMM with algorithm-based fault tolerance for use on GPUs. We describe fault-tolerant designs for GEMM at the thread, warp, and threadblock levels, and also provide a baseline GEMM implementation that is competitive with or faster than the state-of-the-art, proprietary cuBLAS GEMM. We present a kernel fusion strategy to overlap and mitigate the memory latency due to fault tolerance with the original GEMM computation. To support a wide range of input matrix shapes and reduce development costs, we present a template-based approach for automatic code generation for both fault-tolerant and non-fault-tolerant GEMM implementations. We evaluate our work on NVIDIA Tesla T4 and A100 server GPUs. Experimental results demonstrate that our baseline GEMM presents comparable or superior performance compared to the closed-source cuBLAS. The fault-tolerant GEMM incurs only a minimal overhead (8.89\% on average) compared to cuBLAS even with hundreds of errors injected per minute.
For irregularly shaped inputs, the code generator-generated kernels show remarkable speedups of $160\% \sim 183.5\%$ and $148.55\% \sim 165.12\%$ for fault-tolerant and non-fault-tolerant GEMMs, outperforming cuBLAS by up to $41.40\%$.

Submitted 1 May, 2023; originally announced May 2023.

Comments: 11 pages, 2023 International Conference on Supercomputing
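
Algorithm-based fault tolerance for matrix multiplication is commonly explained through the classic checksum encoding: append a column-sum row to A and a row-sum column to B, multiply, and compare the checksums of the result. The sketch below shows that textbook scheme on small integer matrices. It is not the paper's GPU kernel design; the function names and the `inject` parameter are hypothetical, and integers are used so that exact comparison is valid (floating-point data would need a tolerance).

```python
def matmul(A, B):
    """Plain triple-loop matrix product on lists of lists."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def abft_matmul(A, B, inject=None):
    """Checksum-protected product with detection and single-error correction.

    inject=(i, j, delta) simulates a fault by corrupting C[i][j] after the multiply.
    Returns (C, bad_rows, bad_cols).
    """
    # Encode: extra row of column sums on A, extra column of row sums on B.
    Ac = A + [[sum(col) for col in zip(*A)]]
    Bc = [row + [sum(row)] for row in B]
    C = matmul(Ac, Bc)  # last row and last column of C carry the checksums
    if inject:
        i, j, delta = inject
        C[i][j] += delta
    m, n = len(A), len(B[0])
    # Verify: each data row and column must match its checksum entry.
    bad_rows = [i for i in range(m) if sum(C[i][:n]) != C[i][n]]
    bad_cols = [j for j in range(n) if sum(C[i][j] for i in range(m)) != C[m][j]]
    if len(bad_rows) == 1 and len(bad_cols) == 1:
        i, j = bad_rows[0], bad_cols[0]
        C[i][j] -= sum(C[i][:n]) - C[i][n]  # subtract the row-checksum discrepancy
    return [row[:n] for row in C[:m]], bad_rows, bad_cols

A = [[1, 2], [3, 4]]
B = [[5, 6], [7, 8]]
clean, rows_ok, cols_ok = abft_matmul(A, B)
fixed, rows_hit, cols_hit = abft_matmul(A, B, inject=(0, 1, 7))
```

A single corrupted entry shows up as exactly one mismatching row and one mismatching column, which locates it; the size of the row mismatch gives the correction.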

arXiv:2303.00178 [pdf, other]

Disentangling Structural Breaks in Factor Models for Macroeconomic Data

Authors: Bonsoo Koo, Benjamin Wong, Ze-Yu Zhong

Abstract: Through a routine normalization of the factor variance, standard methods for estimating factor models in macroeconomics do not distinguish between breaks of the factor variance and factor loadings. We argue that it is important to distinguish between structural breaks in the factor variance and loadings within factor models commonly employed in macroeconomics as both can lead to markedly different interpretations when viewed via the lens of the underlying dynamic factor model. We then develop a projection-based decomposition that leads to two standard and easy-to-implement Wald tests to disentangle structural breaks in the factor variance and factor loadings. Applying our procedure to U.S. macroeconomic data, we find evidence of both types of breaks associated with the Great Moderation and the Great Recession. Through our projection-based decomposition, we estimate that the Great Moderation is associated with an over 60% reduction in the total factor variance, highlighting the relevance of disentangling breaks in the factor structure.

Submitted 3 June, 2024; v1 submitted 28 February, 2023; originally announced March 2023.

arXiv:2301.12944 [pdf, other]

doi 10.1016/j.aop.2023.169399

Generalized Standard Model with higher-order derivatives under Rotor Mechanism and its Quantization

Authors: B. T. T. Wong

Abstract: The Standard Model is the paradigm of particle physics which gives an accurate theory for fundamental particle interactions. However, the extension of the Standard Model with higher-order derivatives is not a well-studied subject. This paper is a follow-up work of the previous study of the generalized Abelian gauge field theory and Yang-Mills theory under rotor mechanism of order $n$ of higher order derivatives, and we apply it to the Standard Model of particle physics. The rotor mechanism on the scalar field and Dirac field is also studied. We study the quantization of the rotored Standard Model using the path integral approach. We also inherit the previous result from the path integral quantization of the generalized Abelian gauge field and apply it to our non-Abelian case. Then we carry out the generalized BRST quantization and prove the existence of the Slavnov-Taylor Identities of the rotor model. Finally, we discuss the possibility of the rotor model taming the infinities that arise from the self-energy correction of the Higgs boson in high spacetime dimension, which provides a partial solution and new insights into the Hierarchy problem.

Submitted 29 June, 2023; v1 submitted 10 January, 2023; originally announced January 2023.

Comments: 55 pages, 15 figures

Report number: Vol 457, 169399

Journal ref: Annals of Physics, 2023

arXiv:2301.12710 [pdf, other]

Machine Learning with High-Cardinality Categorical Features in Actuarial Applications

Authors: Benjamin Avanzi, Greg Taylor, Melantha Wang, Bernard Wong

Abstract: High-cardinality categorical features are pervasive in actuarial data (e.g. occupation in commercial property insurance). Standard categorical encoding methods like one-hot encoding are inadequate in these settings. In this work, we present a novel _Generalised Linear Mixed Model Neural Network_ ("GLMMNet") approach to the modelling of high-cardinality categorical features. The GLMMNet integrates a generalised linear mixed model in a deep learning framework, offering the predictive power of neural networks and the transparency of random effects estimates, the latter of which cannot be obtained from the entity embedding models. Further, its flexibility to deal with any distribution in the exponential dispersion (ED) family makes it widely applicable to many actuarial contexts and beyond. We illustrate and compare the GLMMNet against existing approaches in a range of simulation experiments as well as in a real-life insurance case study. Notably, we find that the GLMMNet often outperforms or at least performs comparably with an entity embedded neural network, while providing the additional benefit of transparency, which is particularly valuable in practical applications. Importantly, while our model was motivated by actuarial applications, it can have wider applicability. The GLMMNet would suit any applications that involve high-cardinality categorical variables and where the response cannot be sufficiently modelled by a Gaussian distribution.

Submitted 30 January, 2023; originally announced January 2023.

MSC Class: 91G70; 91G60; 62P05

arXiv:2211.01501 [pdf, other]

doi 10.1103/PhysRevB.106.205118

A light-induced Weyl semiconductor-to-metal transition mediated by Peierls instability

Authors: H. Ning, O. Mehio, C. Lian, X. Li, E. Zoghlin, P. Zhou, B. Cheng, S. D. Wilson, B. M. Wong, D. Hsieh

Abstract: Elemental tellurium is a strongly spin-orbit coupled Peierls-distorted semiconductor whose band structure features topologically protected Weyl nodes. Using time-dependent density functional theory calculations, we show that impulsive optical excitation can be used to transiently control the amplitude of the Peierls distortion, realizing a mechanism to switch tellurium between three states: Weyl semiconductor, Weyl metal and non-Weyl metal. Further, we present experimental evidence of this inverse-Peierls distortion using time-resolved optical second harmonic generation measurements. These results provide a pathway to multifunctional ultrafast Weyl devices and introduce Peierls systems as viable hosts of light-induced topological transitions.

Submitted 2 November, 2022; originally announced November 2022.

Comments: 7 pages main text, 4 figures, 11 pages supplementary information

Journal ref: Phys. Rev. B 106, 205118 (2022)

arXiv:2209.08741 [pdf, ps, other]

Bergman representative coordinate, constant holomorphic curvature and a multidimensional generalization of Carathéodory's theorem

Authors: Robert Xin Dong, Bun Wong

Abstract: By using the Bergman representative coordinate and Calabi's diastasis, we extend a theorem of Lu to bounded pseudoconvex domains whose Bergman metric is incomplete with constant holomorphic sectional curvature. We characterize such domains that are biholomorphic to a ball possibly less a relatively closed pluripolar set. We also provide a multidimensional generalization of Carathéodory's theorem on the continuous extension of the biholomorphisms up to the closures. In particular, sufficient conditions are given, in terms of the Bergman kernel, for the boundary of a biholomorphic ball to be a topological sphere.

Submitted 28 February, 2023; v1 submitted 18 September, 2022; originally announced September 2022.

Comments: This is a revised version (26 pages) of the previous manuscript. The results were completed in 2022

MSC Class: Primary 32F45; Secondary 32H10; 32T05; 32D20

arXiv:2209.00840 [pdf, other]

FOLIO: Natural Language Reasoning with First-Order Logic

Authors: Simeng Han, Hailey Schoelkopf, Yilun Zhao, Zhenting Qi, Martin Riddell, Wenfei Zhou, James Coady, David Peng, Yujie Qiao, Luke Benson, Lucy Sun, Alex Wardle-Solano, Hannah Szabo, Ekaterina Zubova, Matthew Burtell, Jonathan Fan, Yixin Liu, Brian Wong, Malcolm Sailor, Ansong Ni, Linyong Nan, Jungo Kasai, Tao Yu, Rui Zhang, Alexander R. Fabbri , et al. (10 additional authors not shown)

Abstract: Large language models (LLMs) have achieved remarkable performance on a variety of natural language understanding tasks. However, existing benchmarks are inadequate in measuring the complex logical reasoning capabilities of a model. We present FOLIO, a human-annotated, logically complex and diverse dataset for reasoning in natural language (NL), equipped with first-order logic (FOL) annotations. FOLIO consists of 1,430 examples (unique conclusions), each paired with one of 487 sets of premises used to deductively reason for the validity of each conclusion. The logical correctness of the premises and conclusions is ensured by their FOL annotations, which are automatically verified by an FOL inference engine. In addition to the main NL reasoning task, NL-FOL pairs in FOLIO constitute a new NL-FOL translation dataset. Our experiments on FOLIO systematically evaluate the FOL reasoning ability of supervised fine-tuning on medium-sized language models. For both NL reasoning and NL-FOL translation, we benchmark multiple state-of-the-art language models. Our results show that a subset of FOLIO presents a challenge for one of the most capable Large Language Models (LLMs) publicly available, GPT-4.

Submitted 17 May, 2024; v1 submitted 2 September, 2022; originally announced September 2022.

arXiv:2207.00681 [pdf, other]

Human-Assisted Robotic Detection of Foreign Object Debris Inside Confined Spaces of Marine Vessels Using Probabilistic Mapping

Authors: Benjamin Wong, Wade Marquette, Nikolay Bykov, Tyler M. Paine, Ashis G. Banerjee

Abstract: Many complex vehicular systems, such as large marine vessels, contain confined spaces like water tanks, which are critical for the safe functioning of the vehicles. It is particularly hazardous for humans to inspect such spaces due to limited accessibility, poor visibility, and unstructured configuration. While robots provide a viable alternative, they encounter the same set of challenges in realizing robust autonomy. In this work, we specifically address the problem of detecting foreign object debris (FODs) left inside the confined spaces using a visual mapping-based system that relies on Mahalanobis distance-driven comparisons between the nominal and online maps for local outlier identification. Simulation trials show extremely high recall but low precision for the outlier identification method. The assistance of remote humans is, therefore, taken to deal with the precision problem by going over the close-up robot camera images of the outlier regions. An online survey is conducted to show the usefulness of this assistance process. Physical experiments are also reported on a GPU-enabled mobile robot platform inside a scaled-down, prototype tank to demonstrate the feasibility of the FOD detection system.

Submitted 31 August, 2022; v1 submitted 1 July, 2022; originally announced July 2022.

arXiv:2206.08541 [pdf, other]

Ensemble distributional forecasting for insurance loss reserving

Authors: Benjamin Avanzi, Yanfeng Li, Bernard Wong, Alan Xian

Abstract: Loss reserving generally focuses on identifying a single model that can generate superior predictive performance. However, different loss reserving models specialise in capturing different aspects of loss data. This is recognised in practice in the sense that results from different models are often considered, and sometimes combined. For instance, actuaries may take a weighted average of the prediction outcomes from various loss reserving models, often based on subjective assessments. In this paper, we propose a systematic framework to objectively combine (i.e. ensemble) multiple _stochastic_ loss reserving models such that the strengths offered by different models can be utilised effectively. Our framework contains two main innovations compared to existing literature and practice. Firstly, our criteria for model combination consider the full distributional properties of the ensemble and not just the central estimate, which is of particular importance in the reserving context. Secondly, our framework is tailored for the features inherent to reserving data. These include, for instance, accident, development, calendar, and claim maturity effects. Crucially, the relative importance and scarcity of data across accident periods renders the problem distinct from the traditional ensembling techniques in statistical learning. Our framework is illustrated with a complex synthetic dataset. In the results, the optimised ensemble outperforms both (i) traditional model selection strategies, and (ii) an equally weighted ensemble. In particular, the improvement occurs not only with central estimates but also relevant quantiles, such as the 75th percentile of reserves (typically of interest to both insurers and regulators). The framework developed in this paper can be implemented thanks to an R package, `ADLP`, which is available from CRAN.

Submitted 3 June, 2024; v1 submitted 17 June, 2022; originally announced June 2022.

MSC Class: 91G70; 91G60; 62P05; 91B30

arXiv:2204.02188 [pdf, ps, other]

The study of conformal geometry and its exact solution of the geodesic deviation equation

Authors: B. T. T. Wong

Abstract: In this paper, the geometric properties of the conformal metric are studied and its exact solution of the geodesic deviation equation is presented. We also find the stress-energy tensor of this geometry and compare it with the usual perfect-fluid case, obtaining an equation of state $P = -\frac{1}{3}ρ$ in 4D space-time dimension. Finally, the low-energy regime of the metric is studied, in which we obtain the stress-energy tensor proportional to the projection tensor.

Submitted 31 March, 2022; originally announced April 2022.

Comments: 12 pages

arXiv:2204.00425 [pdf, ps, other]

The complete metric study of effective Dirac algebra

Authors: B. T. T. Wong

Abstract: Following our work from the previous paper about the study of effective Dirac algebra and the metric of the simple, special case of relativistic hydrogen atom, this paper gives the complete metric study defined by the effective Dirac algebra in the Dirac and Weyl presentation, showing that relativistic electromagnetic interaction gives the correction of the flat background metric $η_{μν}$, thus curving spacetime. The curved metric can be nicely broken down into two parts, the pure correction on the flat spacetime metric and the projection tensor. We find that the curved metric is independent of the representation chosen.

Submitted 28 March, 2022; originally announced April 2022.

arXiv:2203.08953 [pdf, ps, other]

doi 10.1088/1361-648X/abc407

Improved Band Gaps and Structural Properties from Wannier-Fermi-Löwdin Self-Interaction Corrections for Periodic Systems

Authors: Ravindra Shinde, Sharma S. R. K. C. Yamijala, Bryan M. Wong

Abstract: The accurate prediction of band gaps and structural properties in periodic systems continues to be one of the central goals of electronic structure theory. However, band gaps obtained from popular exchange-correlation functionals (such as LDA and PBE) are severely underestimated partly due to the spurious self-interaction error (SIE) inherent to these functionals. In this work, we present a new formulation and implementation of Wannier function-derived Fermi-Löwdin (WFL) orbitals for correcting the SIE in periodic systems. Since our approach utilizes a variational minimization of the self-interaction energy with respect to the Wannier charge centers, it is computationally more efficient than the HSE hybrid functional and other self-interaction corrections that require a large number of transformation matrix elements. Calculations on several (17 in total) prototypical molecular solids, semiconductors, and wide-bandgap materials show that our WFL self-interaction correction approach gives better band gaps and bulk moduli compared to semilocal functionals, largely due to the partial removal of self-interaction errors.

Submitted 16 March, 2022; originally announced March 2022.

Comments: Accepted by Journal of Physics: Condensed Matter

Journal ref: Journal of Physics: Condensed Matter, 33, 115501 (2021)

arXiv:2203.08035 [pdf, other]

doi 10.1021/acs.jpcc.1c05632

High-Temperature Decomposition of Diisopropyl Methylphosphonate (DIMP) on Alumina: Mechanistic Predictions from Ab Initio Molecular Dynamics

Authors: Sohag Biswas, Bryan M. Wong

Abstract: The enhanced degradation of organophosphorous-based chemical warfare agents (CWAs) on metal-oxide surfaces holds immense promise for neutralization efforts; however, the underlying mechanisms in this process remain poorly understood. We utilize large-scale quantum calculations for the first time to probe the high-temperature degradation of diisopropyl methylphosphonate (DIMP), a nerve agent simulant. Our Born-Oppenheimer molecular dynamics (BOMD) calculations show that the $γ$-Al$_2$O$_3$ surface shows immense promise for quickly adsorbing and destroying CWAs. We find that the alumina surface quickly adsorbs DIMP at all temperatures, and subsequent decomposition of DIMP proceeds via a propene elimination. Our BOMD calculations are complemented with metadynamics simulations to produce free energy paths, which show that the activation barrier decreases with temperature and DIMP readily decomposes on $γ$-Al$_2$O$_3$. Our first-principle BOMD and metadynamics simulations provide crucial diagnostics for sarin decomposition models and mechanistic information for examining CWA decomposition reactions on other candidate metal oxide surfaces.

Submitted 15 March, 2022; originally announced March 2022.

Comments: Accepted by the Journal of Physical Chemistry C

Journal ref: Journal of Physical Chemistry C, 125, 21922-21932 (2021)

arXiv:2203.05233 [pdf, other]

doi 10.1016/j.cpc.2022.108299

HADOKEN: An Open-Source Software Package for Predicting Electron Confinement Effects in Various Nanowire Geometries and Configurations

Authors: Bryan M. Wong, Cameron Chevalier

Abstract: We present an open-source software package, HADOKEN (High-level Algorithms to Design, Optimize, and Keep Electrons in Nanowires), for predicting electron confinement/localization effects in nanowires with various geometries, arbitrary number of concentric shell layers, doping densities, and external boundary conditions. The HADOKEN code is written in the MATLAB programming environment to aid in its readability and general accessibility to both users and practitioners. We provide several examples and outputs on a variety of different nanowire geometries, boundary conditions, and doping densities to demonstrate the capabilities of the HADOKEN software package. As such, the use of this predictive and versatile tool by both experimentalists and theorists could lead to further advances in both understanding and tailoring electron confinement effects in these nanosystems.

Submitted 11 March, 2022; v1 submitted 10 March, 2022; originally announced March 2022.

Comments: Accepted by Computer Physics Communications

ACM Class: J.2; I.6

Journal ref: Computer Physics Communications 274, 108299 (2022)

arXiv:2203.05139 [pdf, other]

doi 10.1080/03461238.2022.2116725

On the surplus management of funds with assets and liabilities in presence of solvency requirements

Authors: Benjamin Avanzi, Ping Chen, Lars Frederik Brandt Henriksen, Bernard Wong

Abstract: In this paper we consider a company whose assets and liabilities evolve according to a correlated bivariate geometric Brownian motion, such as in Gerber and Shiu (2003). We determine what dividend strategy maximises the expected present value of dividends until ruin in two cases: (i) when shareholders won't cover surplus shortfalls and a solvency constraint (as in Paulsen, 2003) is consequently imposed, and (ii) when shareholders are always to fund any capital deficiency with capital (asset) injections. In the latter case, ruin will never occur and the objective is to maximise the difference between dividends and capital injections. Developing and using appropriate verification lemmas, we show that the optimal dividend strategy is, in both cases, of barrier type. Both value functions are derived in closed form. Furthermore, the barrier is defined on the ratio of assets to liabilities, which mimics some of the dividend strategies that can be observed in practice by insurance companies. Existence and uniqueness of the optimal strategies are shown. Results are illustrated.

Submitted 5 August, 2022; v1 submitted 9 March, 2022; originally announced March 2022.

MSC Class: 93E20; 91G70; 62P05; 91B30

arXiv:2203.04203 [pdf, other]

AssistQ: Affordance-centric Question-driven Task Completion for Egocentric Assistant

Authors: Benita Wong, Joya Chen, You Wu, Stan Weixian Lei, Dongxing Mao, Difei Gao, Mike Zheng Shou

Abstract: A long-standing goal of intelligent assistants such as AR glasses/robots has been to assist users in affordance-centric real-world scenarios, such as "how can I run the microwave for 1 minute?". However, there is still no clear task definition and suitable benchmarks. In this paper, we define a new task called Affordance-centric Question-driven Task Completion, where the AI assistant should learn from instructional videos to provide step-by-step help in the user's view. To support the task, we constructed AssistQ, a new dataset comprising 531 question-answer samples from 100 newly filmed instructional videos. We also developed a novel Question-to-Actions (Q2A) model to address the AQTC task and validate it on the AssistQ dataset. The results show that our model significantly outperforms several VQA-related baselines while still having large room for improvement. We expect our task and dataset to advance Egocentric AI Assistant's development. Our project page is available at: https://showlab.github.io/assistq/.

Submitted 20 July, 2022; v1 submitted 8 March, 2022; originally announced March 2022.

Comments: Accepted by ECCV 2022. Equal contribution: Benita Wong, Joya Chen, You Wu; Corresponding author: Mike Zheng Shou

arXiv:2203.03874 [pdf, other]

doi 10.1017/S1748499523000155

Detection and treatment of outliers for multivariate robust loss reserving

Authors: Benjamin Avanzi, Mark Lavender, Greg Taylor, Bernard Wong

Abstract: Traditional techniques for calculating outstanding claim liabilities such as the chain ladder are notoriously at risk of being distorted by outliers in past claims data. Unfortunately, the literature in robust methods of reserving is scant, with notable exceptions such as Verdonck and Debruyne (2011) and Verdonck and Van Wouwe (2011). In this paper, we put forward two alternative robust bivariate chain-ladder techniques to extend the approach of Verdonck and Van Wouwe (2011). The first technique is based on Adjusted Outlyingness (Hubert and Van der Veeken, 2008) and explicitly incorporates skewness into the analysis whilst providing a unique measure of outlyingness for each observation. The second technique is based on bagdistance (Hubert et al., 2016), which is derived from the bagplot but is able to provide a unique measure of outlyingness and a means to adjust outlying observations based on this measure. Furthermore, we extend our robust bivariate chain-ladder approach to an N-dimensional framework. The implementation of the methods, especially beyond bivariate, is not trivial. This is illustrated on a trivariate data set from Australian general insurers, and results under the different outlier detection and treatment mechanisms are compared.

Submitted 15 June, 2023; v1 submitted 8 March, 2022; originally announced March 2022.

MSC Class: 91G70; 62P05

arXiv:2203.00184 [pdf, other]

On the impact of outliers in loss reserving

Authors: Benjamin Avanzi, Mark Lavender, Greg Taylor, Bernard Wong

Abstract: The sensitivity of loss reserving techniques to outliers in the data or deviations from model assumptions is a well known challenge. It has been shown that the popular chain-ladder reserving approach is at significant risk to such aberrant observations in that reserve estimates can be significantly shifted in the presence of even one outlier. As a consequence the chain-ladder reserving technique is non-robust. In this paper we investigate the sensitivity of reserves and mean squared errors of prediction under Mack's Model (Mack, 1993). This is done through the derivation of impact functions which are calculated by taking the first derivative of the relevant statistic of interest with respect to an observation. We also provide and discuss the impact functions for quantiles when total reserves are assumed to be lognormally distributed. Additionally, comparisons are made between the impact functions for individual accident year reserves under Mack's Model and the Bornhuetter-Ferguson methodology. It is shown that the impact of incremental claims on these statistics of interest varies widely throughout a loss triangle and is heavily dependent on other cells in the triangle. Results are illustrated using data from a Belgian non-life insurer.

Submitted 20 June, 2023; v1 submitted 28 February, 2022; originally announced March 2022.

MSC Class: 91G70; 62P05

arXiv:2202.03151 [pdf, ps, other]

doi 10.1016/j.nuclphysb.2022.115765

Generalized Yang-Mills Theory under Rotor Mechanism

Authors: B. T. T. Wong

Abstract: This paper follows the previous work on generalized abelian gauge field theory of higher-order derivatives under rotor model and extends the study to the most generalized non-abelian case. We find that the rotor mechanism from the abelian case applies nicely to the non-abelian case under the Lorentz gauge condition. Under the rotor mechanism, the gauge field transforms as $T_μ^a \rightarrow \Box^n T_μ^a$. When the order of field derivative is $n=0$, this restores back to the original Yang-Mills action. Our work gives an extensive generalization of the Yang-Mills theory with higher-order field derivatives. We also compute the equation of motion and Noether's current of the generalized non-abelian gauge field theory. Finally, we study the dynamic instability issue of the theory by the Ostrogradsky construction and the analysis of the 00-component of the energy-momentum tensor.

Submitted 31 March, 2022; v1 submitted 28 January, 2022; originally announced February 2022.

Comments: 17 pages

Journal ref: Nuclear Physics B 978 (2022) 115765

arXiv:2109.03609 [pdf, ps, other]

doi 10.1007/s10773-022-04994-2

Quantization of Generalized Abelian Gauge Field Theory under Rotor Model

Authors: B. T. T. Wong

Abstract: This paper is a follow-up work of the previous study of the generalized abelian gauge field theory under rotor model of order $n$ of higher order derivatives. We will study the quantization of this theory using path integral approach and find out the Feynman propagator (2-point correlation function) of this generalized theory. We also investigate the generalized Proca action under rotor model and derive the Feynman propagator for the massive case.

Submitted 29 August, 2021; originally announced September 2021.

Comments: 10 pages

Journal ref: International Journal of Theoretical Physics. 61: 80. (2022)

arXiv:2109.01282 [pdf, ps, other]

doi 10.4310/PAMQ.2022.v18.n2.a6

Bergman-Calabi diastasis and Kähler metric of constant holomorphic sectional curvature

Authors: Robert Xin Dong, Bun Wong

Abstract: We prove that for a bounded domain in $\mathbb C^n$ with the Bergman metric of constant holomorphic sectional curvature being biholomorphic to a ball is equivalent to the hyperconvexity or the exhaustiveness of the Bergman-Calabi diastasis. By finding its connection with the Bergman representative coordinate, we give explicit formulas of the Bergman-Calabi diastasis and show that it has bounded gradient. In particular, we prove that any bounded domain whose Bergman metric has constant holomorphic sectional curvature is Lu Qi-Keng. We also extend a theorem of Lu towards the incomplete situation and characterize pseudoconvex domains that are biholomorphic to a ball possibly less a relatively closed pluripolar set.

Submitted 28 September, 2021; v1 submitted 2 September, 2021; originally announced September 2021.

Comments: 17 pages. Final revised version to appear in Pure and Applied Mathematics Quarterly (Special Issue in honor of Joseph J. Kohn)

MSC Class: Primary 32F45; Secondary 32T05; 32Q05; 32D20

Journal ref: Pure and Applied Mathematics Quarterly Volume 18 (2022) Number 2, 481-502, Special issue in honor of Joseph J. Kohn on the occasion of his 90th birthday

arXiv:2108.08221 [pdf, other]

The Theory of Fundamental Duality, Quantum Dualiton and Topological Dual Invariance

Authors: B. T. T. Wong

Abstract: Fundamental duality is a concept which refers to two irreducible, heterogeneous principles which are in opposite and complementary of each other. The complementary principle in quantum mechanics is also praised by Bohr. This important concept is known to appear in a lot of places in our physical universe, however a rigorous mathematical definition and physics theory has not yet ever developed in a formal way. In this paper, we establish a formalism for fundamental duality and study its various properties and theorems. One of the most profound results is that we establish a relation between dual invariance and topological invariance, and we find that the topological Chern-Simons form is a dual invariant action. Finally we apply the concept of duality to study dual state oscillation, and predict a theoretical new matter of state of dualiton, which is the particle excitation of the dual field by second quantization. This new exotic quasi-particle state is expected to have an impact in particle physics and condensed matter physics.

Submitted 28 January, 2023; v1 submitted 30 June, 2021; originally announced August 2021.

Comments: 158 pages

arXiv:2108.07924 [pdf, other]

doi 10.1016/j.insmatheco.2022.03.010

Stochastic loss reserving with mixture density neural networks

Authors: Muhammed Taher Al-Mudafer, Benjamin Avanzi, Greg Taylor, Bernard Wong

Abstract: Neural networks offer a versatile, flexible and accurate approach to loss reserving. However, such applications have focused primarily on the (important) problem of fitting accurate central estimates of the outstanding claims. In practice, properties regarding the variability of outstanding claims are equally important (e.g., quantiles for regulatory purposes). In this paper we fill this gap by applying a Mixture Density Network ("MDN") to loss reserving. The approach combines a neural network architecture with a mixture Gaussian distribution to achieve simultaneously an accurate central estimate along with flexible distributional choice. Model fitting is done using a rolling-origin approach. Our approach consistently outperforms the classical over-dispersed model both for central estimates and quantiles of interest, when applied to a wide range of simulated environments of various complexity and specifications. We further extend the MDN approach by proposing two extensions. Firstly, we present a hybrid GLM-MDN approach called "ResMDN". This hybrid approach balances the tractability and ease of understanding of a traditional GLM model on one hand, with the additional accuracy and distributional flexibility provided by the MDN on the other. We show that it can successfully improve the errors of the baseline ccODP, although there is generally a loss of performance when compared to the MDN in the examples we considered. Secondly, we allow for explicit projection constraints, so that actuarial judgement can be directly incorporated in the modelling process.
Throughout, we focus on aggregate loss triangles, and show that our methodologies are tractable, and that they out-perform traditional approaches even with relatively limited amounts of data. We use both simulated data -- to validate properties, and real data -- to illustrate and ascertain practicality of the approaches.

Submitted 17 August, 2021; originally announced August 2021.

MSC Class: 91G79; 91G60; 62P05

arXiv:2106.06398 [pdf, ps, other]

doi 10.1142/S021773232250050X

The effective Dirac algebra by gauge field interaction in relativistic electrodynamics

Authors: B. T. T. Wong

Abstract: Conventional relativistic electrodynamics is set on flat Minkowski spacetime, where all computable quantities are calculated from the flat metric $η_{μν}$. We can redefine the metric of spacetime from the Dirac algebra. In this paper, we study how an electrodynamic interaction can alter the normal gamma matrix to an effective one and result in a shift in the metric perturbatively. The curvature properties inferred from the curved metric are also investigated. We also study how the spin operator is changed under the interaction that contribute to an effective spin operator and how the spin of an electron will be slightly deviated from $1/2$. Then we perform canonical quantization of the effective Dirac algebra. Finally we apply our results to the relativistic hydrogen case and demonstrate how such system curves the spacetime metric.

Submitted 20 March, 2022; v1 submitted 4 May, 2021; originally announced June 2021.

Comments: 21 pages, 0 figure

Journal ref: Modern Physics Letters A. Vol 37, No. 08. 2250050 (2022)

arXiv:2105.00773 [pdf, other]

Approximate Bayesian Computation for an Explicit-Duration Hidden Markov Model of COVID-19 Hospital Trajectories

Authors: Gian Marco Visani, Alexandra Hope Lee, Cuong Nguyen, David M. Kent, John B. Wong, Joshua T. Cohen, Michael C. Hughes

Abstract: We address the problem of modeling constrained hospital resources in the midst of the COVID-19 pandemic in order to inform decision-makers of future demand and assess the societal value of possible interventions. For broad applicability, we focus on the common yet challenging scenario where patient-level data for a region of interest are not available. Instead, given daily admissions counts, we model aggregated counts of observed resource use, such as the number of patients in the general ward, in the intensive care unit, or on a ventilator. In order to explain how individual patient trajectories produce these counts, we propose an aggregate count explicit-duration hidden Markov model, nicknamed the ACED-HMM, with an interpretable, compact parameterization. We develop an Approximate Bayesian Computation approach that draws samples from the posterior distribution over the model's transition and duration parameters given aggregate counts from a specific location, thus adapting the model to a region or individual hospital site of interest. Samples from this posterior can then be used to produce future forecasts of any counts of interest. Using data from the United States and the United Kingdom, we show our mechanistic approach provides competitive probabilistic forecasts for the future even as the dynamics of the pandemic shift.
Furthermore, we show how our model provides insight about recovery probabilities or length of stay distributions, and we suggest its potential to answer challenging what-if questions about the societal value of possible interventions.

Submitted 28 July, 2021; v1 submitted 28 April, 2021; originally announced May 2021.

Comments: To appear in the Proceedings of the Machine Learning for Healthcare (MLHC) conference, 2021. 20 pages, 7 figures and 1 table. 26 additional pages of supplementary material

arXiv:2104.14472 [pdf, ps, other]

doi 10.1142/S0217732321501947

Generalized Abelian Gauge Field Theory under Rotor Model

Authors: B. T. T. Wong

Abstract: Gauge field theory with rank-one field $T_μ$ is a quantum field theory that describes the interaction of elementary spin-1 particles, of which being massless to preserve gauge symmetry. In this paper, we give a generalized, extended study of abelian gauge field theory under successive rotor model in general $D$-dimensional flat spacetime for spin-1 particles in the context of higher order derivatives. We establish a theorem that $n$ rotor contributes to the $\Box^n T^μ$ fields in the integration-by-parts formalism of the action. This corresponds to the transformation of gauge field $T^μ \rightarrow \Box^n T^μ$ and gauge field strength $G_{μν}\rightarrow \Box^n G_{μν} $ in the action. The $n=0$ case restores back to the standard abelian gauge field theory. The equation of motion and Noether's conserved current of the theory are also studied.

Submitted 16 August, 2021; v1 submitted 19 March, 2021; originally announced April 2021.

Comments: 17 pages, 0 figure. Accepted to publish on Mod. Phys. Lett. A

Journal ref: Mod. Phys. Lett. A, Vol. 36, No. 27, 2150194 (2021)

arXiv:2104.09327 [pdf, other]

Forecasting COVID-19 Counts At A Single Hospital: A Hierarchical Bayesian Approach

Authors: Alexandra Hope Lee, Panagiotis Lymperopoulos, Joshua T. Cohen, John B. Wong, Michael C. Hughes

Abstract: We consider the problem of forecasting the daily number of hospitalized COVID-19 patients at a single hospital site, in order to help administrators with logistics and planning. We develop several candidate hierarchical Bayesian models which directly capture the count nature of data via a generalized Poisson likelihood, model time-series dependencies via autoregressive and Gaussian process latent processes, and share statistical strength across related sites. We demonstrate our approach on public datasets for 8 hospitals in Massachusetts, U.S.A. and 10 hospitals in the United Kingdom. Further prospective evaluation compares our approach favorably to baselines currently used by stakeholders at 3 related hospitals to forecast 2-week-ahead demand by rescaling state-level forecasts.

Submitted 14 April, 2021; originally announced April 2021.

Comments: In ICLR 2021 Workshop on Machine Learning for Preventing and Combating Pandemics

arXiv:2011.04237 [pdf, other]

Upper Extremity Load Reduction for Lower Limb Exoskeleton Trajectory Generation Using Ankle Torque Minimization

Authors: Yik Ben Wong, Yawen Chen, Kam Fai Elvis Tsang, Winnie Suk Wai Leung, Ling Shi

Abstract: Recently, the lower limb exoskeletons which provide mobility for paraplegic patients to support their daily life have drawn much attention. However, the pilots are required to apply excessive force through a pair of crutches to maintain balance during walking. This paper proposes a novel gait trajectory generation algorithm for exoskeleton locomotion on flat ground and stair which aims to minimize the force applied by the pilot without increasing the degree of freedom (DoF) of the system. First, the system is modelled as a five-link mechanism dynamically for torque computing. Then, an optimization approach is used to generate the trajectory minimizing the ankle torque which is correlated to the supporting force. Finally, experiment is conducted to compare the different gait generation algorithms through measurement of ground reaction force (GRF) applied on the crutches.

Submitted 9 November, 2020; originally announced November 2020.

Comments: 8 pages, 7 figures, ICARCV

arXiv:2008.05693 [pdf, other]

doi 10.1016/j.insmatheco.2021.06.004

SynthETIC: an individual insurance claim simulator with feature control

Authors: Benjamin Avanzi, Gregory Clive Taylor, Melantha Wang, Bernard Wong

Abstract: Recent years have seen rapid increase in the application of machine learning to insurance loss reserving. They yield most value when applied to large data sets, such as individual claims, or large claim triangles. In short, they are likely to be useful in the analysis of any data set whose volume is sufficient to obscure a naked-eye view of its features. Unfortunately, such large data sets are in short supply in the actuarial literature. Accordingly, one needs to turn to synthetic data. Although the ultimate objective of these methods is application to real data, the use of synthetic data containing features commonly observed in real data is also to be encouraged. While there are a number of claims simulators in existence, each valuable within its own context, the inclusion of a number of desirable (but complicated) data features requires further development. Accordingly, in this paper we review those desirable features, and propose a new simulator of individual claim experience called `SynthETIC`. Our simulator is publicly available, open source, and fills a gap in the non-life actuarial toolkit. The simulator specifically allows for desirable (but optionally complicated) data features typically occurring in practice, such as variations in rates of settlements and development patterns; as with superimposed inflation, and various discontinuities, and also enables various dependencies between variables. The user has full control of the mechanics of the evolution of an individual claim.
As a result, the complexity of the data set generated (meaning the level of difficulty of analysis) may be dialled anywhere from extremely simple to extremely complex.

Submitted 25 August, 2021; v1 submitted 13 August, 2020; originally announced August 2020.

MSC Class: 91G70; 91G60; 62P05

arXiv:2006.04903 [pdf]

doi 10.1038/s41467-021-27216-5

6 nm super-resolution optical transmission and scattering spectroscopic imaging of carbon nanotubes using a nanometer-scale white light source

Authors: Xuezhi Ma, Qiushi Liu, Ning Yu, Da Xu, Sanggon Kim, Zebin Liu, Kaili Jiang, Bryan M. Wong, Ruoxue Yan, Ming Liu

Abstract: Optical hyperspectral imaging based on absorption and scattering of photons at the visible and adjacent frequencies denotes one of the most informative and inclusive characterization methods in material research. Unfortunately, restricted by the diffraction limit of light, it is unable to resolve the nanoscale inhomogeneity in light-matter interactions, which is diagnostic of the local modulation in material structure and properties. Moreover, many nanomaterials have highly anisotropic optical properties that are outstandingly appealing yet hard to characterize through conventional optical methods. Therefore, there has been a pressing demand in the diverse fields including electronics, photonics, physics, and materials science to extend the optical hyperspectral imaging into the nanometer length scale. In this work, we report a super-resolution hyperspectral imaging technique that simultaneously measures optical absorption and scattering spectra with the illumination from a tungsten-halogen lamp. We demonstrated sub-5 nm spatial resolution in both visible and near-infrared wavelengths (415 to 980 nm) for the hyperspectral imaging of strained single-walled carbon nanotubes (SWNT) and reconstructed true-color images to reveal the longitudinal and transverse optical transition-induced light absorption and scattering in the SWNTs. This is the first time transverse optical absorption in SWNTs were clearly observed experimentally.
The new technique provides rich near-field spectroscopic information that had made it possible to analyze the spatial modulation of band-structure along a single SWNT induced through strain engineering.

Submitted 11 March, 2021; v1 submitted 8 June, 2020; originally announced June 2020.

Comments: 4 Figures

arXiv:2006.00717 [pdf, other]

doi 10.1016/j.ejor.2021.04.033

On the optimality of joint periodic and extraordinary dividend strategies

Authors: Benjamin Avanzi, Hayden Lau, Bernard Wong

Abstract: In this paper, we model the cash surplus (or equity) of a risky business with a Brownian motion. Owners can take cash out of the surplus in the form of "dividends", subject to transaction costs. However, if the surplus hits 0 then ruin occurs and the business cannot operate any more. We consider two types of dividend distributions: (i) periodic, regular ones (that is, dividends can be paid only at countable many points in time, according to a specific arrival process); and (ii) extraordinary dividend payments that can be made immediately at any time (that is, the dividend decision time space is continuous and matches that of the surplus process). Both types of dividends attract proportional transaction costs, and extraordinary distributions also attracts fixed transaction costs, a realistic feature. A dividend strategy that involves both types of distributions (periodic and extraordinary) is qualified as "hybrid". We determine which strategies (either periodic, immediate, or hybrid) are optimal, that is, we show which are the strategies that maximise the expected present value of dividends paid until ruin, net of transaction costs. Sometimes, a liquidation strategy (which pays out all monies and stops the process) is optimal. Which strategy is optimal depends on the profitability of the business, and the level of (proportional and fixed) transaction costs. Results are illustrated.

Submitted 2 December, 2020; v1 submitted 1 June, 2020; originally announced June 2020.

MSC Class: 93E20; 91G70; 62P05; 91B30

arXiv:2005.12795 [pdf, ps, other]

Twisted Mazur pattern satellite knots and bordered Floer theory

Authors: Ina Petkova, Biji Wong

Abstract: We use bordered Floer theory to study properties of twisted Mazur pattern satellite knots $Q_{n}(K)$. We prove that $Q_n(K)$ is not Floer homologically thin, with two exceptions. We calculate the 3-genus of $Q_{n}(K)$ in terms of the twisting parameter $n$ and the 3-genus of the companion $K$, and we determine when $Q_n(K)$ is fibered. As an application to our results on Floer thickness and 3-genus, we verify the Cosmetic Surgery Conjecture for many of these satellite knots.

Submitted 19 March, 2021; v1 submitted 26 May, 2020; originally announced May 2020.

Comments: 40 pages, 11 figures, 16 tables. Improved exposition, corrected some mistakes, modified statement of Theorem 1.0.7. To appear in Michigan Mathematical Journal

MSC Class: Primary: 57K18; 57K30; Secondary: 57R58

arXiv:2005.03500 [pdf, other]

doi 10.1017/S1748499520000196

On unbalanced data and common shock models in stochastic loss reserving

Authors: Benjamin Avanzi, Gregory Clive Taylor, Phuong Anh Vu, Bernard Wong

Abstract: Introducing common shocks is a popular dependence modelling approach, with some recent applications in loss reserving. The main advantage of this approach is the ability to capture structural dependence coming from known relationships. In addition, it helps with the parsimonious construction of correlation matrices of large dimensions. However, complications arise in the presence of "unbalanced data", that is, when (expected) magnitude of observations over a single triangle, or between triangles, can vary substantially. Specifically, if a single common shock is applied to all of these cells, it can contribute insignificantly to the larger values and/or swamp the smaller ones, unless careful adjustments are made. This problem is further complicated in applications involving negative claim amounts. In this paper, we address this problem in the loss reserving context using a common shock Tweedie approach for unbalanced data. We show that the solution not only provides a much better balance of the common shock proportions relative to the unbalanced data, but it is also parsimonious. Finally, the common shock Tweedie model also provides distributional tractability.

Submitted 17 May, 2020; v1 submitted 7 May, 2020; originally announced May 2020.

MSC Class: 91G70; 91G60; 62P05; 62H12

Journal ref: Ann. actuar. sci. 15 (2021) 173-203

arXiv:2004.11169 [pdf, other]

doi 10.1016/j.insmatheco.2021.01.002

On the modelling of multivariate counts with Cox processes and dependent shot noise intensities

Authors: Benjamin Avanzi, Gregory Clive Taylor, Bernard Wong, Xinda Yang

Abstract: In this paper, we develop a method to model and estimate several, _dependent_ count processes, using granular data. Specifically, we develop a multivariate Cox process with shot noise intensities to jointly model the arrival process of counts (e.g. insurance claims). The dependency structure is introduced via multivariate shot noise _intensity_ processes which are connected with the help of Lévy copulas. In aggregate, our approach allows for (i) over-dispersion and auto-correlation within each line of business; (ii) realistic features involving time-varying, known covariates; and (iii) parsimonious dependence between processes without requiring simultaneous primary (e.g. accidents) events. The explicit incorporation of time-varying, known covariates can accommodate characteristics of real data and hence facilitate implementation in practice. In an insurance context, these could be changes in policy volumes over time, as well as seasonality patterns and trends, which may explain some of the relationship (dependence) between multiple claims processes, or at least help tease out those relationships. Finally, we develop a filtering algorithm based on the reversible-jump Markov Chain Monte Carlo (RJMCMC) method to estimate the latent stochastic intensities and illustrate model calibration using real data from the AUSI data set.

Submitted 3 December, 2020; v1 submitted 23 April, 2020; originally announced April 2020.

MSC Class: 91G70; 91G60; 62P05; 62H12

arXiv:2004.06880 [pdf, other]

doi 10.1016/j.insmatheco.2020.04.007

A multivariate evolutionary generalised linear model framework with adaptive estimation for claims reserving

Authors: Benjamin Avanzi, Gregory Clive Taylor, Phuong Anh Vu, Bernard Wong

Abstract: In this paper, we develop a multivariate evolutionary generalised linear model (GLM) framework for claims reserving, which allows for dynamic features of claims activity in conjunction with dependency across business lines to accurately assess claims reserves. We extend the traditional GLM reserving framework on two fronts: GLM fixed factors are allowed to evolve in a recursive manner, and dependence is incorporated in the specification of these factors using a common shock approach. We consider factors that evolve across accident years in conjunction with factors that evolve across calendar years. This two-dimensional evolution of factors is unconventional as a traditional evolutionary model typically considers the evolution in one single time dimension. This creates challenges for the estimation process, which we tackle in this paper. We develop the formulation of a particle filtering algorithm with parameter learning procedure. This is an adaptive estimation approach which updates evolving factors of the framework recursively over time. We implement and illustrate our model with a simulated data set, as well as a set of real data from a Canadian insurer.

Submitted 15 April, 2020; originally announced April 2020.

Comments: Accepted for publication in Insurance: Mathematics and Economics

MSC Class: 91G70; 91G60; 62P05; 62H12

Journal ref: Insurance: Mathematics and Economics, Volume 93, July 2020, Pages 50-71

arXiv:2004.01838 [pdf, ps, other]

doi 10.1080/03461238.2020.1869069

Optimal periodic dividend strategies for spectrally negative Lévy processes with fixed transaction costs

Authors: Benjamin Avanzi, Hayden Lau, Bernard Wong

Abstract: Maximising dividends is one classical stability criterion in actuarial risk theory. Motivated by the fact that dividends are paid periodically in real life, $\textit{periodic}$ dividend strategies were recently introduced (Albrecher, Gerber and Shiu, 2011). In this paper, we incorporate fixed transaction costs into the model and study the optimal periodic dividend strategy with fixed transaction costs for spectrally negative Lévy processes. The value function of a periodic $(b_u,b_l)$ strategy is calculated by means of exiting identities and Itô's excursion when the surplus process is of unbounded variation. We show that a sufficient condition for optimality is that the Lévy measure admits a density which is completely monotonic. Under such assumptions, a periodic $(b_u,b_l)$ strategy is confirmed to be optimal. Results are illustrated.

Submitted 3 December, 2020; v1 submitted 3 April, 2020; originally announced April 2020.

MSC Class: 60G51; 93E20; 91B30

Showing 1–50 of 137 results for author: Wong, B