subscribe to arXiv mailings

Topologically nontrivial $1/3$-magnetization plateau state in a spin-1/2 trimer chain

Authors: Y. Y. Han, B. C. Yu, Z. Du, L. S. Ling, L. Zhang, W. Tong, C. Y. Xi, J. L. Zhang, T. Shang, Li Pi, Long Ma

Abstract: Topologically nontrivial Haldane phase is theoretically proposed to be realized in the 1/3-magnetization ($M$) plateau of spin-1/2 trimer systems. However, the spin excitation gap, typical characteristic of Haldane phase, is not yet experimentally verified. Here, we report the nuclear magnetic resonance investigations into the low-energy spin dynamics in the $S=1/2$ spin-trimer antiferromagnetic c… ▽ More Topologically nontrivial Haldane phase is theoretically proposed to be realized in the 1/3-magnetization ($M$) plateau of spin-1/2 trimer systems. However, the spin excitation gap, typical characteristic of Haldane phase, is not yet experimentally verified. Here, we report the nuclear magnetic resonance investigations into the low-energy spin dynamics in the $S=1/2$ spin-trimer antiferromagnetic chain compound Na$_2$Cu$_3$Ge$_{4-x}$Si$_{x}$O$_{12}$ ($x=0, 0.1\sim1.5$). In the parent compound ($x=0$), the spin-lattice relaxation rate (1/$T_1$) shows significantly different temperature dependence when the external magnetic field is increased above the critical field of $μ_0$$H_{c}$ = 29 T. The spin excitation gap is evidenced from the thermally activated behavior of $1/T_1(T)$ in the 1/3-$M$ plateau state. By substituting Ge$^{4+}$ with Si$^{4+}$, the critical field for the 1/3-$M$ plateau significantly decreases, e.g. $μ_0H_{c}=17$ T in $x=1.0$ samples, which results from the suppressed inter-trimer coupling $J_2$. The gapped spin excitation is confirmed again above 17 T, whose size shows temperature-dependent behavior for $μ_0H\geq25.72$ T. These observations provide further insights into the Haldane physics. △ Less

Submitted 3 July, 2024; originally announced July 2024.

Comments: 6 pages, 4 figures

arXiv:2407.03347 [pdf, other]

Chebyshev Spectral Neural Networks for Solving Partial Differential Equations

Authors: Pengsong Yin, Shuo Ling, Wenjun Ying

Abstract: The purpose of this study is to utilize the Chebyshev spectral method neural network(CSNN) model to solve differential equations. This approach employs a single-layer neural network wherein Chebyshev spectral methods are used to construct neurons satisfying boundary conditions. The study uses a feedforward neural network model and error backpropagation principles, utilizing automatic differentiati… ▽ More The purpose of this study is to utilize the Chebyshev spectral method neural network(CSNN) model to solve differential equations. This approach employs a single-layer neural network wherein Chebyshev spectral methods are used to construct neurons satisfying boundary conditions. The study uses a feedforward neural network model and error backpropagation principles, utilizing automatic differentiation (AD) to compute the loss function. This method avoids the need to solve non-sparse linear systems, making it convenient for algorithm implementation and solving high-dimensional problems. The unique sampling method and neuron architecture significantly enhance the training efficiency and accuracy of the neural network. Furthermore, multiple networks enables the Chebyshev spectral method to handle equations on more complex domains. The numerical efficiency and accuracy of the CSNN model are investigated through testing on elliptic partial differential equations, and it is compared with the well-known Physics-Informed Neural Network(PINN) method. △ Less

Submitted 6 June, 2024; originally announced July 2024.

arXiv:2406.01592 [pdf, other]

Text-guided Controllable Mesh Refinement for Interactive 3D Modeling

Authors: Yun-Chun Chen, Selena Ling, Zhiqin Chen, Vladimir G. Kim, Matheus Gadelha, Alec Jacobson

Abstract: We propose a novel technique for adding geometric details to an input coarse 3D mesh guided by a text prompt. Our method is composed of three stages. First, we generate a single-view RGB image conditioned on the input coarse geometry and the input text prompt. This single-view image generation step allows the user to pre-visualize the result and offers stronger conditioning for subsequent multi-vi… ▽ More We propose a novel technique for adding geometric details to an input coarse 3D mesh guided by a text prompt. Our method is composed of three stages. First, we generate a single-view RGB image conditioned on the input coarse geometry and the input text prompt. This single-view image generation step allows the user to pre-visualize the result and offers stronger conditioning for subsequent multi-view generation. Second, we use our novel multi-view normal generation architecture to jointly generate six different views of the normal images. The joint view generation reduces inconsistencies and leads to sharper details. Third, we optimize our mesh with respect to all views and generate a fine, detailed geometry as output. The resulting method produces an output within seconds and offers explicit user control over the coarse structure, pose, and desired details of the resulting 3D mesh. Project page: https://text-mesh-refinement.github.io. △ Less

Submitted 3 June, 2024; originally announced June 2024.

Comments: Project page: https://text-mesh-refinement.github.io

arXiv:2405.15057 [pdf, other]

Characterization of Nearly Self-Orthogonal Quasi-Twisted Codes and Related Quantum Codes

Authors: Martianus Frederic Ezerman, Markus Grassl, San Ling, Ferruh Özbudak, Buket Özkaya

Abstract: Quasi-twisted codes are used here as the classical ingredients in the so-called Construction X for quantum error-control codes. The construction utilizes nearly self-orthogonal codes to design quantum stabilizer codes. We expand the choices of the inner product to also cover the symplectic and trace-symplectic inner products, in addition to the original Hermitian one. A refined lower bound on the… ▽ More Quasi-twisted codes are used here as the classical ingredients in the so-called Construction X for quantum error-control codes. The construction utilizes nearly self-orthogonal codes to design quantum stabilizer codes. We expand the choices of the inner product to also cover the symplectic and trace-symplectic inner products, in addition to the original Hermitian one. A refined lower bound on the minimum distance of the resulting quantum codes is established and illustrated. We report numerous record breaking quantum codes from our randomized search for inclusion in the updated online database. △ Less

Submitted 23 May, 2024; originally announced May 2024.

Comments: 18 pages, 8 tables; see also http://codetables.de This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

arXiv:2405.14659 [pdf, ps, other]

Albanese fibrations of surfaces with low slope

Authors: Songbo Ling, Xin Lu

Abstract: Let $S$ be a minimal irregular surface of general type, whose Albanese map induces a fibration $f:\,S \to C$ of genus $g$.We prove a linear upper bound on the genus $g$ if $K_S^2\leq 4χ(\mathcal{O}_S)$. Examples are constructed showing that the above linear upper bound is sharp. We also give a characterization of the Albanese fibrations reaching the above upper bound when $χ(\mathcal{O}_S)\geq 5$.… ▽ More Let $S$ be a minimal irregular surface of general type, whose Albanese map induces a fibration $f:\,S \to C$ of genus $g$.We prove a linear upper bound on the genus $g$ if $K_S^2\leq 4χ(\mathcal{O}_S)$. Examples are constructed showing that the above linear upper bound is sharp. We also give a characterization of the Albanese fibrations reaching the above upper bound when $χ(\mathcal{O}_S)\geq 5$.On the other hand, we will construct a sequence of surfaces $S_n$ of general type with $K_{S_n}^2/χ(\mathcal{O}_{S_n})>4$ and with an Albanese fibration $f_n$, such that the genus $g_n$ of a general fiber of $f_n$ increases quadratically with $χ(\mathcal{O}_{S_n})$,and that $K_{S_n}^2/χ(\mathcal{O}_{S_n})$ can be arbitrarily close to $4$. △ Less

Submitted 28 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

Comments: Add a characterization of the Albanese fibrations reaching the above upper bound. Comments are welcome!

arXiv:2405.13284 [pdf, other]

Sub-kiloparsec scaling relations between hot gas, dense gas and star formation rate in five nearby star-forming galaxies

Authors: Chunyi Zhang, Junfeng Wang, Qing-Hua Tan, Yu Gao, Shuting Ling, Xiaoyu Xu

Abstract: Based on the newly acquired dense gas observations from the JCMT MALATANG survey and X-ray data from Chandra, we explore the correlation between hot gas and HCN $J=4 \rightarrow 3$, HCO$^+\ J=4 \rightarrow 3$ emission for the first time at sub-kiloparsec scale of five nearby star-forming galaxies, namely M82, M83, IC 342, NGC 253, and NGC 6946. We find that both HCN $J=4 \rightarrow 3$ and HCO… ▽ More Based on the newly acquired dense gas observations from the JCMT MALATANG survey and X-ray data from Chandra, we explore the correlation between hot gas and HCN $J=4 \rightarrow 3$, HCO$^+\ J=4 \rightarrow 3$ emission for the first time at sub-kiloparsec scale of five nearby star-forming galaxies, namely M82, M83, IC 342, NGC 253, and NGC 6946. We find that both HCN $J=4 \rightarrow 3$ and HCO$^+\ J=4 \rightarrow 3$ line luminosity show a statistically significant correlation with the 0.5${-}$2 keV X-ray emission of the diffuse hot gas ($L_{\rm 0.5 - 2\,keV}^{\rm gas}$). The Bayesian regression analysis gives the best fit of ${\rm log}(L_{\rm 0.5-2\,keV}^{\rm gas} /{\rm erg\,s^{-1}})=2.39\,{\rm log}(L'_{\rm HCN(4-3)} /{\rm K\,km\,s^{-1}\,pc^{2}})+24.83$ and ${\rm log}(L_{\rm 0.5-2\,keV}^{\rm gas} /{\rm erg\,s^{-1}})=2.48\,{\rm log}(L'_{\rm HCO^{+}(4-3)} /{\rm K\,km\,s^{-1}\,pc^{2}})+23.84$, with dispersion of $\thicksim$0.69 dex and 0.54 dex, respectively. At the sub-kiloparsec scale, we find that the power-law index of the $L_{\rm 0.5 - 2\,keV}^{\rm gas}$ ${-}$ star formation rate (SFR) relation is ${\rm log}(L_{\rm 0.5-2\,keV}^{\rm gas} /{\rm erg\,s^{-1}})=1.80\,{\rm log} ({\rm SFR} /M_\odot\,{\rm yr}^{-1})+39.16$, deviated from previous linear relations at global scale. This implies that the global property of hot gas significantly differs from individual resolved regions, which is influenced by the local physical conditions close to the sites of star formation. △ Less

Submitted 21 May, 2024; originally announced May 2024.

Comments: 10 pages, 3figures, accepted for publication in the ApJ Letters. Dedicated to Prof. Yu Gao, who initiated this work

arXiv:2404.15242 [pdf, other]

A Hybrid Kernel-Free Boundary Integral Method with Operator Learning for Solving Parametric Partial Differential Equations In Complex Domains

Authors: Shuo Ling, Liwei Tan, Wenjun Ying

Abstract: The Kernel-Free Boundary Integral (KFBI) method presents an iterative solution to boundary integral equations arising from elliptic partial differential equations (PDEs). This method effectively addresses elliptic PDEs on irregular domains, including the modified Helmholtz, Stokes, and elasticity equations. The rapid evolution of neural networks and deep learning has invigorated the exploration of… ▽ More The Kernel-Free Boundary Integral (KFBI) method presents an iterative solution to boundary integral equations arising from elliptic partial differential equations (PDEs). This method effectively addresses elliptic PDEs on irregular domains, including the modified Helmholtz, Stokes, and elasticity equations. The rapid evolution of neural networks and deep learning has invigorated the exploration of numerical PDEs. An increasing interest is observed in deep learning approaches that seamlessly integrate mathematical principles for investigating numerical PDEs. We propose a hybrid KFBI method, integrating the foundational principles of the KFBI method with the capabilities of deep learning. This approach, within the framework of the boundary integral method, designs a network to approximate the solution operator for the corresponding integral equations by mapping the parameters, inhomogeneous terms and boundary information of PDEs to the boundary density functions, which can be regarded as the solution of the integral equations. The models are trained using data generated by the Cartesian grid-based KFBI algorithm, exhibiting robust generalization capabilities. It accurately predicts density functions across diverse boundary conditions and parameters within the same class of equations. Experimental results demonstrate that the trained model can directly infer the boundary density function with satisfactory precision, obviating the need for iterative steps in solving boundary integral equations. Furthermore, applying the inference results of the model as initial values for iterations is also reasonable; this approach can retain the inherent second-order accuracy of the KFBI method while accelerating the traditional KFBI approach by reducing about 50% iterations. △ Less

Submitted 23 April, 2024; originally announced April 2024.

Comments: 30 pages,6 figures

arXiv:2404.04993 [pdf, ps, other]

On Linear Codes Whose Hermitian Hulls are MD

Authors: Gaojun Luo, Lin Sok, Martianus Frederic Ezerman, San Ling

Abstract: Hermitian hulls of linear codes are interesting for theoretical and practical reasons alike. In terms of recent application, linear codes whose hulls meet certain conditions have been utilized as ingredients to construct entanglement-assisted quantum error correcting codes. This family of quantum codes is often seen as a generalization of quantum stabilizer codes. Theoretically, compared with the… ▽ More Hermitian hulls of linear codes are interesting for theoretical and practical reasons alike. In terms of recent application, linear codes whose hulls meet certain conditions have been utilized as ingredients to construct entanglement-assisted quantum error correcting codes. This family of quantum codes is often seen as a generalization of quantum stabilizer codes. Theoretically, compared with the Euclidean setup, the Hermitian case is much harder to deal with. Hermitian hulls of MDS linear codes with low dimensions have been explored, mostly from generalized Reed-Solomon codes. Characterizing Hermitian hulls which themselves are MDS appears to be more involved and has not been extensively studied. This paper introduces some tools to study linear codes whose Hermitian hulls are MDS. Using the tools, we then propose explicit constructions of such codes. We consider Hermitian hulls of both Reed-Solomon and non Reed-Solomon types of linear MDS codes. We demonstrate that, given the same Hermitian hull dimensions, the codes from our constructions have dimensions which are larger than those in the literature. △ Less

Submitted 7 April, 2024; originally announced April 2024.

arXiv:2402.18699 [pdf, other]

Articulated Object Manipulation with Coarse-to-fine Affordance for Mitigating the Effect of Point Cloud Noise

Authors: Suhan Ling, Yian Wang, Shiguang Wu, Yuzheng Zhuang, Tianyi Xu, Yu Li, Chang Liu, Hao Dong

Abstract: 3D articulated objects are inherently challenging for manipulation due to the varied geometries and intricate functionalities associated with articulated objects.Point-level affordance, which predicts the per-point actionable score and thus proposes the best point to interact with, has demonstrated excellent performance and generalization capabilities in articulated object manipulation. However, a… ▽ More 3D articulated objects are inherently challenging for manipulation due to the varied geometries and intricate functionalities associated with articulated objects.Point-level affordance, which predicts the per-point actionable score and thus proposes the best point to interact with, has demonstrated excellent performance and generalization capabilities in articulated object manipulation. However, a significant challenge remains: while previous works use perfect point cloud generated in simulation, the models cannot directly apply to the noisy point cloud in the real-world. To tackle this challenge, we leverage the property of real-world scanned point cloud that, the point cloud becomes less noisy when the camera is closer to the object. Therefore, we propose a novel coarse-to-fine affordance learning pipeline to mitigate the effect of point cloud noise in two stages. In the first stage, we learn the affordance on the noisy far point cloud which includes the whole object to propose the approximated place to manipulate. Then, we move the camera in front of the approximated place, scan a less noisy point cloud containing precise local geometries for manipulation, and learn affordance on such point cloud to propose fine-grained final actions. The proposed method is thoroughly evaluated both using large-scale simulated noisy point clouds mimicking real-world scans, and in the real world scenarios, with superiority over existing methods, demonstrating the effectiveness in tackling the noisy real-world point cloud problem. △ Less

Submitted 7 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

Comments: ICRA 2024

arXiv:2402.15572 [pdf, other]

doi 10.1145/3610977.3634973

Improving Explainable Object-induced Model through Uncertainty for Automated Vehicles

Authors: Shihong Ling, Yue Wan, Xiaowei Jia, Na Du

Abstract: The rapid evolution of automated vehicles (AVs) has the potential to provide safer, more efficient, and comfortable travel options. However, these systems face challenges regarding reliability in complex driving scenarios. Recent explainable AV architectures neglect crucial information related to inherent uncertainties while providing explanations for actions. To overcome such challenges, our stud… ▽ More The rapid evolution of automated vehicles (AVs) has the potential to provide safer, more efficient, and comfortable travel options. However, these systems face challenges regarding reliability in complex driving scenarios. Recent explainable AV architectures neglect crucial information related to inherent uncertainties while providing explanations for actions. To overcome such challenges, our study builds upon the "object-induced" model approach that prioritizes the role of objects in scenes for decision-making and integrates uncertainty assessment into the decision-making process using an evidential deep learning paradigm with a Beta prior. Additionally, we explore several advanced training strategies guided by uncertainty, including uncertainty-guided data reweighting and augmentation. Leveraging the BDD-OIA dataset, our findings underscore that the model, through these enhancements, not only offers a clearer comprehension of AV decisions and their underlying reasoning but also surpasses existing baselines across a broad range of scenarios. △ Less

Submitted 23 February, 2024; originally announced February 2024.

Comments: In Proceedings of the 2024 ACM / IEEE International Conference on Human-Robot Interaction (HRI '24), March 11--14, 2024, Boulder, CO, USA. ACM, New York, NY, USA, 9 pages

arXiv:2402.14516 [pdf, ps, other]

Upper bounds on the genus of Albanese fibrations

Authors: Songbo Ling, Xin Lu

Abstract: Let $S$ be a minimal irregular surface of general type, whose Albanese map induces a hyperelliptic fibration $f:\,S \to B$ of genus $g$.We prove a quadratic upper bound on the genus $g$, i.e., $g\leq h\big(χ(\mathcal{O}_S)\big)$, where $h$ is a quadratic function. We also construct examples showing that the quadratic upper bounds can not be improved to the linear ones. In the special case when… ▽ More Let $S$ be a minimal irregular surface of general type, whose Albanese map induces a hyperelliptic fibration $f:\,S \to B$ of genus $g$.We prove a quadratic upper bound on the genus $g$, i.e., $g\leq h\big(χ(\mathcal{O}_S)\big)$, where $h$ is a quadratic function. We also construct examples showing that the quadratic upper bounds can not be improved to the linear ones. In the special case when $p_g(S)=q(S)=1$, we show that $g\leq 14$. △ Less

Submitted 22 February, 2024; originally announced February 2024.

Comments: 16 pages

MSC Class: 14J29; 14J10; 14D06

arXiv:2402.03979 [pdf, other]

Cross Entropy versus Label Smoothing: A Neural Collapse Perspective

Authors: Li Guo, Keith Ross, Zifan Zhao, George Andriopoulos, Shuyang Ling, Yufeng Xu, Zixuan Dong

Abstract: Label smoothing loss is a widely adopted technique to mitigate overfitting in deep neural networks. This paper studies label smoothing from the perspective of Neural Collapse (NC), a powerful empirical and theoretical framework which characterizes model behavior during the terminal phase of training. We first show empirically that models trained with label smoothing converge faster to neural colla… ▽ More Label smoothing loss is a widely adopted technique to mitigate overfitting in deep neural networks. This paper studies label smoothing from the perspective of Neural Collapse (NC), a powerful empirical and theoretical framework which characterizes model behavior during the terminal phase of training. We first show empirically that models trained with label smoothing converge faster to neural collapse solutions and attain a stronger level of neural collapse. Additionally, we show that at the same level of NC1, models under label smoothing loss exhibit intensified NC2. These findings provide valuable insights into the performance benefits and enhanced model calibration under label smoothing loss. We then leverage the unconstrained feature model to derive closed-form solutions for the global minimizers for both loss functions and further demonstrate that models under label smoothing have a lower conditioning number and, therefore, theoretically converge faster. Our study, combining empirical evidence and theoretical results, not only provides nuanced insights into the differences between label smoothing and cross-entropy losses, but also serves as an example of how the powerful neural collapse framework can be used to improve our understanding of DNNs. △ Less

Submitted 6 February, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

arXiv:2401.04941 [pdf, ps, other]

Griesmer Bound and Constructions of Linear Codes in $b$-Symbol Metric

Authors: Gaojun Luo, Martianus Frederic Ezerman, Cem Güneri, San Ling, Ferruh Özbudak

Abstract: The $b$-symbol metric is a generalization of the Hamming metric. Linear codes, in the $b$-symbol metric, have been used in the read channel whose outputs consist of $b$ consecutive symbols. The Griesmer bound outperforms the Singleton bound for $\mathbb{F}_q$-linear codes in the Hamming metric, when $q$ is fixed and the length is large enough. This scenario is also applicable in the $b$-symbol met… ▽ More The $b$-symbol metric is a generalization of the Hamming metric. Linear codes, in the $b$-symbol metric, have been used in the read channel whose outputs consist of $b$ consecutive symbols. The Griesmer bound outperforms the Singleton bound for $\mathbb{F}_q$-linear codes in the Hamming metric, when $q$ is fixed and the length is large enough. This scenario is also applicable in the $b$-symbol metric. Shi, Zhu, and Helleseth recently made a conjecture on cyclic codes in the $b$-symbol metric. In this paper, we present the $b$-symbol Griesmer bound for linear codes by concatenating linear codes and simplex codes. Based on cyclic codes and extended cyclic codes, we propose two families of distance-optimal linear codes with respect to the $b$-symbol Griesmer bound. △ Less

Submitted 10 January, 2024; originally announced January 2024.

arXiv:2312.11115 [pdf, other]

Bounds and Constructions of Quantum Locally Recoverable Codes from Quantum CSS Codes

Authors: Gaojun Luo, Bocong Chen, Martianus Frederic Ezerman, San Ling

Abstract: Classical locally recoverable codes (LRCs) have become indispensable in distributed storage systems. They provide efficient recovery in terms of localized errors. Quantum LRCs have very recently been introduced for their potential application in quantum data storage. In this paper, we use classical LRCs to investigate quantum LRCs. We prove that the parameters of quantum LRCs are bounded by their… ▽ More Classical locally recoverable codes (LRCs) have become indispensable in distributed storage systems. They provide efficient recovery in terms of localized errors. Quantum LRCs have very recently been introduced for their potential application in quantum data storage. In this paper, we use classical LRCs to investigate quantum LRCs. We prove that the parameters of quantum LRCs are bounded by their classical counterparts. We deduce the bounds on the parameters of quantum LRCs from the bounds on the parameters of the classical ones. We establish a characterization of optimal pure quantum LRCs based on classical codes with specific properties. Using well-crafted classical LRCs as ingredients in the construction of quantum CSS codes, we offer the first construction of several families of optimal pure quantum LRCs. △ Less

Submitted 18 December, 2023; originally announced December 2023.

arXiv:2312.09482 [pdf, ps, other]

An open problem and a conjecture on binary linear complementary pairs of codes

Authors: Shitao Li, Minjia Shi, San Ling

Abstract: The existence of $q$-ary linear complementary pairs (LCPs) of codes with $q> 2$ has been completely characterized so far. This paper gives a characterization for the existence of binary LCPs of codes. As a result, we solve an open problem proposed by Carlet $et~al.$ (IEEE Trans. Inf. Theory 65(3): 1694-1704, 2019) and a conjecture proposed by Choi $et~al.$ (Cryptogr. Commun. 15(2): 469-486, 2023). The existence of $q$-ary linear complementary pairs (LCPs) of codes with $q> 2$ has been completely characterized so far. This paper gives a characterization for the existence of binary LCPs of codes. As a result, we solve an open problem proposed by Carlet $et~al.$ (IEEE Trans. Inf. Theory 65(3): 1694-1704, 2019) and a conjecture proposed by Choi $et~al.$ (Cryptogr. Commun. 15(2): 469-486, 2023). △ Less

Submitted 14 December, 2023; originally announced December 2023.

arXiv:2311.18670 [pdf, ps, other]

Local Geometry Determines Global Landscape in Low-rank Factorization for Synchronization

Authors: Shuyang Ling

Abstract: The orthogonal group synchronization problem, which focuses on recovering orthogonal group elements from their corrupted pairwise measurements, encompasses examples such as high-dimensional Kuramoto model on general signed networks, $\mathbb{Z}_2$-synchronization, community detection under stochastic block models, and orthogonal Procrustes problem. The semidefinite relaxation (SDR) has proven its… ▽ More The orthogonal group synchronization problem, which focuses on recovering orthogonal group elements from their corrupted pairwise measurements, encompasses examples such as high-dimensional Kuramoto model on general signed networks, $\mathbb{Z}_2$-synchronization, community detection under stochastic block models, and orthogonal Procrustes problem. The semidefinite relaxation (SDR) has proven its power in solving this problem; however, its expensive computational costs impede its widespread practical applications. We consider the Burer-Monteiro factorization approach to the orthogonal group synchronization, an effective and scalable low-rank factorization to solve large scale SDPs. Despite the significant empirical successes of this factorization approach, it is still a challenging task to understand when the nonconvex optimization landscape is benign, i.e., the optimization landscape possesses only one local minimizer, which is also global. In this work, we demonstrate that if the degree of freedom within the factorization exceeds twice the condition number of the ``Laplacian" (certificate matrix) at the global minimizer, the optimization landscape is absent of spurious local minima. Our main theorem is purely algebraic and versatile, and it seamlessly applies to all the aforementioned examples: the nonconvex landscape remains benign under almost identical condition that enables the success of the SDR. Additionally, we illustrate that the Burer-Monteiro factorization is robust to ``monotone adversaries", mirroring the resilience of the SDR. In other words, introducing ``favorable" adversaries into the data will not result in the emergence of new spurious local minimizers. △ Less

Submitted 30 November, 2023; originally announced November 2023.

arXiv:2311.08156 [pdf, other]

Improved Spectral Bound for Quasi-Cyclic Codes

Authors: Gaojun Luo, Martianus Frederic Ezerman, San Ling, Buket Özkaya

Abstract: Spectral bounds form a powerful tool to estimate the minimum distances of quasi-cyclic codes. They generalize the defining set bounds of cyclic codes to those of quasi-cyclic codes. Based on the eigenvalues of quasi-cyclic codes and the corresponding eigenspaces, we provide an improved spectral bound for quasi-cyclic codes. Numerical results verify that the improved bound outperforms the Jensen bo… ▽ More Spectral bounds form a powerful tool to estimate the minimum distances of quasi-cyclic codes. They generalize the defining set bounds of cyclic codes to those of quasi-cyclic codes. Based on the eigenvalues of quasi-cyclic codes and the corresponding eigenspaces, we provide an improved spectral bound for quasi-cyclic codes. Numerical results verify that the improved bound outperforms the Jensen bound in almost all cases. Based on the improved bound, we propose a general construction of quasi-cyclic codes with excellent designed minimum distances. For the quasi-cyclic codes produced by this general construction, the improved spectral bound is always sharper than the Jensen bound. △ Less

Submitted 14 November, 2023; originally announced November 2023.

arXiv:2309.13271 [pdf, other]

Secure Inter-domain Routing and Forwarding via Verifiable Forwarding Commitments

Authors: Xiaoliang Wang, Zhuotao Liu, Qi Li, Yangfei Guo, Sitong Ling, Jiangou Zhan, Yi Xu, Ke Xu, Jianping Wu

Abstract: The Internet inter-domain routing system is vulnerable. On the control plane, the de facto Border Gateway Protocol (BGP) does not have built-in mechanisms to authenticate routing announcements, so an adversary can announce virtually arbitrary paths to hijack network traffic; on the data plane, it is difficult to ensure that actual forwarding path complies with the control plane decisions. The comm… ▽ More The Internet inter-domain routing system is vulnerable. On the control plane, the de facto Border Gateway Protocol (BGP) does not have built-in mechanisms to authenticate routing announcements, so an adversary can announce virtually arbitrary paths to hijack network traffic; on the data plane, it is difficult to ensure that actual forwarding path complies with the control plane decisions. The community has proposed significant research to secure the routing system. Yet, existing secure BGP protocols (e.g., BGPsec) are not incrementally deployable, and existing path authorization protocols are not compatible with the current Internet routing infrastructure. In this paper, we propose FC-BGP, the first secure Internet inter-domain routing system that can simultaneously authenticate BGP announcements and validate data plane forwarding in an efficient and incrementally-deployable manner. FC-BGP is built upon a novel primitive, name Forwarding Commitment, to certify an AS's routing intent on its directly connected hops. We analyze the security benefits of FC-BGP in the Internet at different deployment rates. Further, we implement a prototype of FC-BGP and extensively evaluate it over a large-scale overlay network with 100 virtual machines deployed globally. The results demonstrate that FC-BGP saves roughly 55% of the overhead required to validate BGP announcements compared with BGPsec, and meanwhile FC-BGP introduces a small overhead for building a globally-consistent view on the desirable forwarding paths. △ Less

Submitted 8 November, 2023; v1 submitted 23 September, 2023; originally announced September 2023.

Comments: 16 pages, 17 figures

arXiv:2309.09725 [pdf, ps, other]

Neural Collapse for Unconstrained Feature Model under Cross-entropy Loss with Imbalanced Data

Authors: Wanli Hong, Shuyang Ling

Abstract: Recent years have witnessed the huge success of deep neural networks (DNNs) in various tasks of computer vision and text processing. Interestingly, these DNNs with massive number of parameters share similar structural properties on their feature representation and last-layer classifier at terminal phase of training (TPT). Specifically, if the training data are balanced (each class shares the same… ▽ More Recent years have witnessed the huge success of deep neural networks (DNNs) in various tasks of computer vision and text processing. Interestingly, these DNNs with massive number of parameters share similar structural properties on their feature representation and last-layer classifier at terminal phase of training (TPT). Specifically, if the training data are balanced (each class shares the same number of samples), it is observed that the feature vectors of samples from the same class converge to their corresponding in-class mean features and their pairwise angles are the same. This fascinating phenomenon is known as Neural Collapse (N C), first termed by Papyan, Han, and Donoho in 2019. Many recent works manage to theoretically explain this phenomenon by adopting so-called unconstrained feature model (UFM). In this paper, we study the extension of N C phenomenon to the imbalanced data under cross-entropy loss function in the context of unconstrained feature model. Our contribution is multi-fold compared with the state-of-the-art results: (a) we show that the feature vectors exhibit collapse phenomenon, i.e., the features within the same class collapse to the same mean vector; (b) the mean feature vectors no longer form an equiangular tight frame. Instead, their pairwise angles depend on the sample size; (c) we also precisely characterize the sharp threshold on which the minority collapse (the feature vectors of the minority groups collapse to one single vector) will take place; (d) finally, we argue that the effect of the imbalance in datasize diminishes as the sample size grows. Our results provide a complete picture of the N C under the cross-entropy loss for the imbalanced data. Numerical experiments confirm our theoretical analysis. △ Less

Submitted 24 October, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

Comments: 38 pages, 10 figures

arXiv:2309.07369 [pdf, other]

Hybrid Attention-based Encoder-decoder Model for Efficient Language Model Adaptation

Authors: Shaoshi Ling, Guoli Ye, Rui Zhao, Yifan Gong

Abstract: Attention-based encoder-decoder (AED) speech recognition model has been widely successful in recent years. However, the joint optimization of acoustic model and language model in end-to-end manner has created challenges for text adaptation. In particular, effectively, quickly and inexpensively adapting text has become a primary concern for deploying AED systems in industry. To address this issue,… ▽ More Attention-based encoder-decoder (AED) speech recognition model has been widely successful in recent years. However, the joint optimization of acoustic model and language model in end-to-end manner has created challenges for text adaptation. In particular, effectively, quickly and inexpensively adapting text has become a primary concern for deploying AED systems in industry. To address this issue, we propose a novel model, the hybrid attention-based encoder-decoder (HAED) speech recognition model that preserves the modularity of conventional hybrid automatic speech recognition systems. Our HAED model separates the acoustic and language models, allowing for the use of conventional text-based language model adaptation techniques. We demonstrate that the proposed HAED model yields 21\% Word Error Rate (WER) improvements in relative when out-of-domain text data is used for language model adaptation, and with only a minor degradation in WER on a general test set compared with conventional AED model. △ Less

Submitted 13 September, 2023; originally announced September 2023.

arXiv:2309.05269 [pdf, other]

UniKG: A Benchmark and Universal Embedding for Large-Scale Knowledge Graphs

Authors: Yide Qiu, Shaoxiang Ling, Tong Zhang, Bo Huang, Zhen Cui

Abstract: Irregular data in real-world are usually organized as heterogeneous graphs (HGs) consisting of multiple types of nodes and edges. To explore useful knowledge from real-world data, both the large-scale encyclopedic HG datasets and corresponding effective learning methods are crucial, but haven't been well investigated. In this paper, we construct a large-scale HG benchmark dataset named UniKG from… ▽ More Irregular data in real-world are usually organized as heterogeneous graphs (HGs) consisting of multiple types of nodes and edges. To explore useful knowledge from real-world data, both the large-scale encyclopedic HG datasets and corresponding effective learning methods are crucial, but haven't been well investigated. In this paper, we construct a large-scale HG benchmark dataset named UniKG from Wikidata to facilitate knowledge mining and heterogeneous graph representation learning. Overall, UniKG contains more than 77 million multi-attribute entities and 2000 diverse association types, which significantly surpasses the scale of existing HG datasets. To perform effective learning on the large-scale UniKG, two key measures are taken, including (i) the semantic alignment strategy for multi-attribute entities, which projects the feature description of multi-attribute nodes into a common embedding space to facilitate node aggregation in a large receptive field; (ii) proposing a novel plug-and-play anisotropy propagation module (APM) to learn effective multi-hop anisotropy propagation kernels, which extends methods of large-scale homogeneous graphs to heterogeneous graphs. These two strategies enable efficient information propagation among a tremendous number of multi-attribute entities and meantimes adaptively mine multi-attribute association through the multi-hop aggregation in large-scale HGs. We set up a node classification task on our UniKG dataset, and evaluate multiple baseline methods which are constructed by embedding our APM into large-scale homogenous graph learning methods. Our UniKG dataset and the baseline codes have been released at https://github.com/Yide-Qiu/UniKG. △ Less

Submitted 11 September, 2023; originally announced September 2023.

Comments: 9 pages, 4 figures

arXiv:2309.04477 [pdf, other]

High pressure behaviour of the magnetic van der Waals molecular framework Ni(NCS)$_2$

Authors: Madeleine Geers, David M. Jarvis, Cheng Liu, Siddharth S. Saxena, Jem Pitcairn, Emily Myatt, Sebastian A. Hallweger, Silva M. Kronawitter, Gregor Kieslich, Sanliang Ling, Andrew B. Cairns, Dominik Daisenberger, Oscar Fabelo, Laura Cañadillas-Delgado, Matthew J. Cliffe

Abstract: Two-dimensional materials offer a unique range of magnetic, electronic and mechanical properties which can be controlled by external stimuli. Pressure is a particularly important stimulus, as it can be achieved readily and can produce large responses, especially in low-dimensional materials. In this paper we explore the pressure-dependence of the structural and magnetic properties of a two-dimensi… ▽ More Two-dimensional materials offer a unique range of magnetic, electronic and mechanical properties which can be controlled by external stimuli. Pressure is a particularly important stimulus, as it can be achieved readily and can produce large responses, especially in low-dimensional materials. In this paper we explore the pressure-dependence of the structural and magnetic properties of a two-dimensional van der Waals (vdW) molecular framework antiferromagnet with ferromagnetic layers, Ni(NCS)$_2$, up to 8.4 kbar. Through a combination of X-ray and neutron diffraction analysis, we find that Ni(NCS)$_2$ is significantly more compressible than comparable vdW metal halides, and its response is anisotropic not only out of the plane, but also within the layers. Using bulk magnetisation and neutron diffraction data, we show that the ambient layered antiferromagnetic phase is maintained up to the largest investigated pressure, but with an enhanced Néel temperature, $T_\mathrm{N}$, ($ΔT_\mathrm{N} / T_\mathrm{N} = +19$ %) and a large pressure sensitivity ($Q = \frac{1}{T_\mathrm{N}} \frac{\mathrm{d}T_\mathrm{N}}{\mathrm{d}P} = +2.3$ % kbar$^{-1}$), one of the larger values of magnetic pressure responsiveness for a vdW material. Density functional theory calculations suggest that this is due to increasing three-dimensionality. These results provide some of the first insights into the pressure response of molecular framework vdW magnets and suggest investigation of other molecular framework vdW magnets might uncover contenders for future pressure-switchable devices. △ Less

Submitted 4 October, 2023; v1 submitted 3 August, 2023; originally announced September 2023.

Comments: 10 pages, 7 figures

arXiv:2309.04305 [pdf, other]

A Construction of Asymptotically Optimal Cascaded CDC Schemes via Combinatorial Designs

Authors: Yingjie Cheng, Gaojun Luo, Xiwang Cao, Martianus Frederic Ezerman, San Ling

Abstract: A coded distributed computing (CDC) system aims to reduce the communication load in the MapReduce framework. Such a system has $K$ nodes, $N$ input files, and $Q$ Reduce functions. Each input file is mapped by $r$ nodes and each Reduce function is computed by $s$ nodes. The objective is to achieve the maximum multicast gain. There are known CDC schemes that achieve optimal communication load. In s… ▽ More A coded distributed computing (CDC) system aims to reduce the communication load in the MapReduce framework. Such a system has $K$ nodes, $N$ input files, and $Q$ Reduce functions. Each input file is mapped by $r$ nodes and each Reduce function is computed by $s$ nodes. The objective is to achieve the maximum multicast gain. There are known CDC schemes that achieve optimal communication load. In some prominent known schemes, however, $N$ and $Q$ grow too fast in terms of $K$, greatly reducing their gains in practical scenarios. To mitigate the situation, some asymptotically optimal cascaded CDC schemes with $r=s$ have been proposed by using symmetric designs. In this paper, we put forward new asymptotically optimal cascaded CDC schemes with $r=s$ by using $1$-designs. Compared with earlier schemes from symmetric designs, ours have much smaller computation loads while keeping the other relevant parameters the same. We also obtain new asymptotically optimal cascaded CDC schemes with more flexible parameters compared with previously best-performing schemes. △ Less

Submitted 8 September, 2023; originally announced September 2023.

arXiv:2309.03808 [pdf, other]

Improved theoretical guarantee for rank aggregation via spectral method

Authors: Ziliang Samuel Zhong, Shuyang Ling

Abstract: Given pairwise comparisons between multiple items, how to rank them so that the ranking matches the observations? This problem, known as rank aggregation, has found many applications in sports, recommendation systems, and other web applications. As it is generally NP-hard to find a global ranking that minimizes the mismatch (known as the Kemeny optimization), we focus on the Erdös-Rényi outliers (… ▽ More Given pairwise comparisons between multiple items, how to rank them so that the ranking matches the observations? This problem, known as rank aggregation, has found many applications in sports, recommendation systems, and other web applications. As it is generally NP-hard to find a global ranking that minimizes the mismatch (known as the Kemeny optimization), we focus on the Erdös-Rényi outliers (ERO) model for this ranking problem. Here, each pairwise comparison is a corrupted copy of the true score difference. We investigate spectral ranking algorithms that are based on unnormalized and normalized data matrices. The key is to understand their performance in recovering the underlying scores of each item from the observed data. This reduces to deriving an entry-wise perturbation error bound between the top eigenvectors of the unnormalized/normalized data matrix and its population counterpart. By using the leave-one-out technique, we provide a sharper $\ell_{\infty}$-norm perturbation bound of the eigenvectors and also derive an error bound on the maximum displacement for each item, with only $Ω(n\log n)$ samples. Our theoretical analysis improves upon the state-of-the-art results in terms of sample complexity, and our numerical experiments confirm these theoretical findings. △ Less

Submitted 10 September, 2023; v1 submitted 7 September, 2023; originally announced September 2023.

Comments: 29 pages, 6 figures

arXiv:2307.08721 [pdf, other]

doi 10.1609/icwsm.v18i1.31382

Where Did the President Visit Last Week? Detecting Celebrity Trips from News Articles

Authors: Kai Peng, Ying Zhang, Shuai Ling, Zhaoru Ke, Haipeng Zhang

Abstract: Celebrities' whereabouts are of pervasive importance. For instance, where politicians go, how often they visit, and who they meet, come with profound geopolitical and economic implications. Although news articles contain travel information of celebrities, it is not possible to perform large-scale and network-wise analysis due to the lack of automatic itinerary detection tools. To design such tools… ▽ More Celebrities' whereabouts are of pervasive importance. For instance, where politicians go, how often they visit, and who they meet, come with profound geopolitical and economic implications. Although news articles contain travel information of celebrities, it is not possible to perform large-scale and network-wise analysis due to the lack of automatic itinerary detection tools. To design such tools, we have to overcome difficulties from the heterogeneity among news articles: 1)One single article can be noisy, with irrelevant people and locations, especially when the articles are long. 2)Though it may be helpful if we consider multiple articles together to determine a particular trip, the key semantics are still scattered across different articles intertwined with various noises, making it hard to aggregate them effectively. 3)Over 20% of the articles refer to the celebrities' trips indirectly, instead of using the exact celebrity names or location names, leading to large portions of trips escaping regular detecting algorithms. We model text content across articles related to each candidate location as a graph to better associate essential information and cancel out the noises. Besides, we design a special pooling layer based on attention mechanism and node similarity, reducing irrelevant information from longer articles. To make up the missing information resulted from indirect mentions, we construct knowledge sub-graphs for named entities (person, organization, facility, etc.). Specifically, we dynamically update embeddings of event entities like the G7 summit from news descriptions since the properties (date and location) of the event change each time, which is not captured by the pre-trained event representations. The proposed CeleTrip jointly trains these modules, which outperforms all baseline models and achieves 82.53% in the F1 metric. △ Less

Submitted 9 October, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

Comments: Accepted to ICWSM 2024, 12 pages

arXiv:2307.08234 [pdf, other]

Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition

Authors: Shaoshi Ling, Yuxuan Hu, Shuangbei Qian, Guoli Ye, Yao Qian, Yifan Gong, Ed Lin, Michael Zeng

Abstract: Most end-to-end (E2E) speech recognition models are composed of encoder and decoder blocks that perform acoustic and language modeling functions. Pretrained large language models (LLMs) have the potential to improve the performance of E2E ASR. However, integrating a pretrained language model into an E2E speech recognition model has shown limited benefits due to the mismatches between text-based LL… ▽ More Most end-to-end (E2E) speech recognition models are composed of encoder and decoder blocks that perform acoustic and language modeling functions. Pretrained large language models (LLMs) have the potential to improve the performance of E2E ASR. However, integrating a pretrained language model into an E2E speech recognition model has shown limited benefits due to the mismatches between text-based LLMs and those used in E2E ASR. In this paper, we explore an alternative approach by adapting a pretrained LLMs to speech. Our experiments on fully-formatted E2E ASR transcription tasks across various domains demonstrate that our approach can effectively leverage the strengths of pretrained LLMs to produce more readable ASR transcriptions. Our model, which is based on the pretrained large language models with either an encoder-decoder or decoder-only structure, surpasses strong ASR models such as Whisper, in terms of recognition error rate, considering formats like punctuation and capitalization as well. △ Less

Submitted 2 August, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

arXiv:2307.04209 [pdf, other]

Sharper Asymptotically Optimal CDC Schemes via Combinatorial Designs

Authors: Yingjie Cheng, Gaojun Luo, Xiwang Cao, Martianus Frederic Ezerman, San Ling

Abstract: Coded distributed computing (CDC) was introduced to greatly reduce the communication load for MapReduce computing systems. Such a system has $K$ nodes, $N$ input files, and $Q$ Reduce functions. Each input file is mapped by $r$ nodes and each Reduce function is computed by $s$ nodes. The architecture must allow for coding techniques that achieve the maximum multicast gain. Some CDC schemes that ac… ▽ More Coded distributed computing (CDC) was introduced to greatly reduce the communication load for MapReduce computing systems. Such a system has $K$ nodes, $N$ input files, and $Q$ Reduce functions. Each input file is mapped by $r$ nodes and each Reduce function is computed by $s$ nodes. The architecture must allow for coding techniques that achieve the maximum multicast gain. Some CDC schemes that achieve optimal communication load have been proposed before. The parameters $N$ and $Q$ in those schemes, however, grow too fast with respect to $K$ to be of great practical value. To improve the situation, researchers have come up with some asymptotically optimal cascaded CDC schemes with $s+r=K$ from symmetric designs. In this paper, we propose new asymptotically optimal cascaded CDC schemes. Akin to known schemes, ours have $r+s=K$ and make use of symmetric designs as construction tools. Unlike previous schemes, ours have much smaller communication loads, given the same set of parameters $K$, $r$, $N$, and $Q$. We also expand the construction tools to include almost difference sets. Using them, we have managed to construct a new asymptotically optimal cascaded CDC scheme. △ Less

Submitted 9 July, 2023; originally announced July 2023.

arXiv:2306.14082 [pdf, ps, other]

doi 10.1103/PhysRevB.107.245134

High-field NMR study of the spin correlations in the spin-cluster mineral Na$_2$Cu$_3$O(SO$_4$)$_3$

Authors: Long Ma, J. X. Li, L. S. Ling, Y. Y. Han, L. Zhang, L. Hu, W. Tong, C. Y. Xi, Li Pi

Abstract: We report NMR study on the spin correlations in the spin-cluster based mineral Na$_2$Cu$_3$O(SO$_4$)$_3$ with magnetic fields ranged from 1 T to 33 T. The long-range magnetic order is observed from both the sudden spectral broadening at $T_N$ and critical slowing down behavior in the temperature dependence of spin-lattice relaxation rates ($1/T_1(T)$). The hump behavior of $1/T_1(T)$ persists to… ▽ More We report NMR study on the spin correlations in the spin-cluster based mineral Na$_2$Cu$_3$O(SO$_4$)$_3$ with magnetic fields ranged from 1 T to 33 T. The long-range magnetic order is observed from both the sudden spectral broadening at $T_N$ and critical slowing down behavior in the temperature dependence of spin-lattice relaxation rates ($1/T_1(T)$). The hump behavior of $1/T_1(T)$ persists to $μ_0H=7.25$ T, above which a spin excitation gap is observed from the thermally activated temperature dependence of $1/T_1$. The gap size shows a linear field dependence, whose slope and intercept respectively yield an effective magnetic moment of 2.54 $μ_B$ and a 0.94 meV spin excitation gap under zero magnetic field. These results indicate the existence of short-range order and prominent easy-plane spin anisotropy, which are important for understanding the spin excitation spectrum in A$_2$Cu$_3$O(SO$_4$)$_3$. △ Less

Submitted 24 June, 2023; originally announced June 2023.

Comments: 5 pages, 4 figures

Journal ref: Phys. Rev. B 107, 245134 (2023)

arXiv:2305.03841 [pdf, other]

doi 10.1088/1475-7516/2023/07/055

Phenomenology of wavelike vector dark matter nonminimally coupled to gravity

Authors: Hong-Yi Zhang, Siyang Ling

Abstract: We study three astrophysical/cosmological consequences of nonminimal couplings to gravity in wavelike vector dark matter. In the nonrelativistic limit, the nonminimal coupling with the lowest mass dimension leads to effective self-interactions that affect the mass-radius relation of vector solitons, growth of linear perturbations during structure formation, and the speed of gravitational waves (GW… ▽ More We study three astrophysical/cosmological consequences of nonminimal couplings to gravity in wavelike vector dark matter. In the nonrelativistic limit, the nonminimal coupling with the lowest mass dimension leads to effective self-interactions that affect the mass-radius relation of vector solitons, growth of linear perturbations during structure formation, and the speed of gravitational waves (GWs). Based on the success of cold dark matter on large-scale perturbations and the current limits on GW speed, we constrain the dark matter mass and nonminimal coupling strength to be within the range $|ξ_1| / m^2 \ll 10^{50} \mathrm{eV^{-2}}$ and $-3\times 10^{46} \mathrm{eV^{-2}} \lesssim ξ_2 / m^2 \lesssim 8 \times 10^{48} \mathrm{eV^{-2}}$. △ Less

Submitted 5 May, 2023; originally announced May 2023.

Comments: 13 pages + appendices, 2 figures

arXiv:2305.03442 [pdf, other]

Repair of Reed-Solomon Codes in the Presence of Erroneous Nodes

Authors: Stanislav Kruglik, Gaojun Luo, Wilton Kim, Shubhransh Singhvi, Han Mao Kiah, San Ling, Huaxiong Wang

Abstract: We consider the repair scheme of Guruswami-Wootters for the Reed-Solomon code and ask: can we correctly repair a failed node in the presence of erroneous nodes? Equivalently, we consider the collection of downloaded traces as a code and investigate its code-distance properties. We propose three lower bounds on its minimum distance and study methods to efficiently correct errors close to these boun… ▽ More We consider the repair scheme of Guruswami-Wootters for the Reed-Solomon code and ask: can we correctly repair a failed node in the presence of erroneous nodes? Equivalently, we consider the collection of downloaded traces as a code and investigate its code-distance properties. We propose three lower bounds on its minimum distance and study methods to efficiently correct errors close to these bounds. △ Less

Submitted 5 May, 2023; originally announced May 2023.

Comments: Accepted to IEEE International Symposium on Information Theory 2023

arXiv:2304.04869 [pdf, other]

doi 10.1088/1538-3873/acd1b5

The James Webb Space Telescope Mission

Authors: Jonathan P. Gardner, John C. Mather, Randy Abbott, James S. Abell, Mark Abernathy, Faith E. Abney, John G. Abraham, Roberto Abraham, Yasin M. Abul-Huda, Scott Acton, Cynthia K. Adams, Evan Adams, David S. Adler, Maarten Adriaensen, Jonathan Albert Aguilar, Mansoor Ahmed, Nasif S. Ahmed, Tanjira Ahmed, Rüdeger Albat, Loïc Albert, Stacey Alberts, David Aldridge, Mary Marsha Allen, Shaune S. Allen, Martin Altenburg , et al. (983 additional authors not shown)

Abstract: Twenty-six years ago a small committee report, building on earlier studies, expounded a compelling and poetic vision for the future of astronomy, calling for an infrared-optimized space telescope with an aperture of at least $4m$. With the support of their governments in the US, Europe, and Canada, 20,000 people realized that vision as the $6.5m$ James Webb Space Telescope. A generation of astrono… ▽ More Twenty-six years ago a small committee report, building on earlier studies, expounded a compelling and poetic vision for the future of astronomy, calling for an infrared-optimized space telescope with an aperture of at least $4m$. With the support of their governments in the US, Europe, and Canada, 20,000 people realized that vision as the $6.5m$ James Webb Space Telescope. A generation of astronomers will celebrate their accomplishments for the life of the mission, potentially as long as 20 years, and beyond. This report and the scientific discoveries that follow are extended thank-you notes to the 20,000 team members. The telescope is working perfectly, with much better image quality than expected. In this and accompanying papers, we give a brief history, describe the observatory, outline its objectives and current observing program, and discuss the inventions and people who made it possible. We cite detailed reports on the design and the measured performance on orbit. △ Less

Submitted 10 April, 2023; originally announced April 2023.

Comments: Accepted by PASP for the special issue on The James Webb Space Telescope Overview, 29 pages, 4 figures

arXiv:2302.04390 [pdf, other]

doi 10.1007/JHEP05(2023)181

Cosmological gravitational particle production of massive spin-2 particles

Authors: Edward W. Kolb, Siyang Ling, Andrew J. Long, Rachel A. Rosen

Abstract: The phenomenon of cosmological gravitational particle production (CGPP) is expected to occur during the period of inflation and the transition into a hot big bang cosmology. Particles may be produced even if they only couple directly to gravity, and so CGPP provides a natural explanation for the origin of dark matter. In this work we study the gravitational production of massive spin-2 particles a… ▽ More The phenomenon of cosmological gravitational particle production (CGPP) is expected to occur during the period of inflation and the transition into a hot big bang cosmology. Particles may be produced even if they only couple directly to gravity, and so CGPP provides a natural explanation for the origin of dark matter. In this work we study the gravitational production of massive spin-2 particles assuming two different couplings to matter. We evaluate the full system of mode equations, including the helicity-0 modes, and by solving them numerically we calculate the spectrum and abundance of massive spin-2 particles that results from inflation on a hilltop potential. We conclude that CGPP might provide a viable mechanism for the generation of massive spin-2 particle dark matter during inflation, and we identify the favorable region of parameter space in terms of the spin-2 particle's mass and the reheating temperature. As a secondary product of our work, we identify the conditions under which such theories admit ghost or gradient instabilities, and we thereby derive a generalization of the Higuchi bound to Friedmann-Robertson-Walker (FRW) spacetimes. △ Less

Submitted 8 February, 2023; originally announced February 2023.

Comments: 46 pages + references, 9 figures

arXiv:2302.00419 [pdf, other]

For the Underrepresented in Gender Bias Research: Chinese Name Gender Prediction with Heterogeneous Graph Attention Network

Authors: Zihao Pan, Kai Peng, Shuai Ling, Haipeng Zhang

Abstract: Achieving gender equality is an important pillar for humankind's sustainable future. Pioneering data-driven gender bias research is based on large-scale public records such as scientific papers, patents, and company registrations, covering female researchers, inventors and entrepreneurs, and so on. Since gender information is often missing in relevant datasets, studies rely on tools to infer gende… ▽ More Achieving gender equality is an important pillar for humankind's sustainable future. Pioneering data-driven gender bias research is based on large-scale public records such as scientific papers, patents, and company registrations, covering female researchers, inventors and entrepreneurs, and so on. Since gender information is often missing in relevant datasets, studies rely on tools to infer genders from names. However, available open-sourced Chinese gender-guessing tools are not yet suitable for scientific purposes, which may be partially responsible for female Chinese being underrepresented in mainstream gender bias research and affect their universality. Specifically, these tools focus on character-level information while overlooking the fact that the combinations of Chinese characters in multi-character names, as well as the components and pronunciations of characters, convey important messages. As a first effort, we design a Chinese Heterogeneous Graph Attention (CHGAT) model to capture the heterogeneity in component relationships and incorporate the pronunciations of characters. Our model largely surpasses current tools and also outperforms the state-of-the-art algorithm. Last but not least, the most popular Chinese name-gender dataset is single-character based with far less female coverage from an unreliable source, naturally hindering relevant studies. We open-source a more balanced multi-character dataset from an official source together with our code, hoping to help future research promoting gender equality. △ Less

Submitted 1 February, 2023; originally announced February 2023.

Comments: 8 pages, 4 figures

arXiv:2301.03880 [pdf, other]

Non-collinear magnetism in the post-perovskite thiocyanate frameworks CsM(NCS)$_3$

Authors: Madeleine Geers, Jie Yie Lee, Sanliang Ling, Oscar Fabelo, Laura Cañadillas-Delgado, Matthew J. Cliffe

Abstract: AMX$_3$ compounds are structurally diverse, a notable example being the post-perovskite structure which adopts a two-dimensional framework with corner- and edge-sharing octahedra. Few molecular post-perovskites are known and of these, none have reported magnetic structures. Here we report the synthesis, structure and magnetic properties of molecular post-perovskites: CsNi(NCS)$_3$, a thiocyanate f… ▽ More AMX$_3$ compounds are structurally diverse, a notable example being the post-perovskite structure which adopts a two-dimensional framework with corner- and edge-sharing octahedra. Few molecular post-perovskites are known and of these, none have reported magnetic structures. Here we report the synthesis, structure and magnetic properties of molecular post-perovskites: CsNi(NCS)$_3$, a thiocyanate framework, and two new isostructural analogues CsCo(NCS)$_3$ and CsMn(NCS)$_3$. Magnetisation measurements show that all three compounds undergo magnetic order. CsNi(NCS)$_3$ (Curie temperature, $T_\mathrm{C} = 8.5(1)\;$K) and CsCo(NCS)$_3$ ($T_\mathrm{C} = 6.7(1)\;$K) order as weak ferromagnets. On the other hand, CsMn(NCS)$_3$ orders as an antiferromagnet (Néel temperature, $T_\mathrm{N}=16.8(8)\;$K). Neutron diffraction data of CsNi(NCS)$_3$ and CsMn(NCS)$_3$, show that both are non-collinear magnets. These results suggest molecular frameworks are fruitful ground for realising the spin textures required for the next generation of information technology. △ Less

Submitted 7 February, 2023; v1 submitted 10 January, 2023; originally announced January 2023.

Comments: 9 pages, 5 figures

arXiv:2211.03392 [pdf, ps, other]

A tight upper bound on the number of non-zero weights of a quasi-cyclic code

Authors: Xiaoxiao Li, Minjia Shi, San Ling

Abstract: Let $\mathcal{C}$ be a quasi-cyclic code of index $l(l\geq2)$. Let $G$ be the subgroup of the automorphism group of $\mathcal{C}$ generated by $ρ^l$ and the scalar multiplications of $\mathcal{C}$, where $ρ$ denotes the standard cyclic shift. In this paper, we find an explicit formula of orbits of $G$ on $\mathcal{C}\setminus \{\mathbf{0}\}$. Consequently, an explicit upper bound on the number of… ▽ More Let $\mathcal{C}$ be a quasi-cyclic code of index $l(l\geq2)$. Let $G$ be the subgroup of the automorphism group of $\mathcal{C}$ generated by $ρ^l$ and the scalar multiplications of $\mathcal{C}$, where $ρ$ denotes the standard cyclic shift. In this paper, we find an explicit formula of orbits of $G$ on $\mathcal{C}\setminus \{\mathbf{0}\}$. Consequently, an explicit upper bound on the number of nonzero weights of $\mathcal{C}$ is immediately derived and a necessary and sufficient condition for codes meeting the bound is exhibited. If $\mathcal{C}$ is a one-generator quasi-cyclic code, a tighter upper bound on the number of nonzero weights of $\mathcal{C}$ is obtained by considering a larger automorphism subgroup which is generated by the multiplier, $ρ^l$ and the scalar multiplications of $\mathcal{C}$. In particular, we list some examples to show the bounds are tight. Our main result improves and generalizes some of the results in \cite{M2}. △ Less

Submitted 6 November, 2023; v1 submitted 7 November, 2022; originally announced November 2022.

arXiv:2211.03280 [pdf, other]

Multimodal Learning for Non-small Cell Lung Cancer Prognosis

Authors: Yujiao Wu, Yaxiong Wang, Xiaoshui Huang, Fan Yang, Sai Ho Ling, Steven Weidong Su

Abstract: This paper focuses on the task of survival time analysis for lung cancer. Although much progress has been made in this problem in recent years, the performance of existing methods is still far from satisfactory. Traditional and some deep learning-based survival time analyses for lung cancer are mostly based on textual clinical information such as staging, age, histology, etc. Unlike existing metho… ▽ More This paper focuses on the task of survival time analysis for lung cancer. Although much progress has been made in this problem in recent years, the performance of existing methods is still far from satisfactory. Traditional and some deep learning-based survival time analyses for lung cancer are mostly based on textual clinical information such as staging, age, histology, etc. Unlike existing methods that predicting on the single modality, we observe that a human clinician usually takes multimodal data such as text clinical data and visual scans to estimate survival time. Motivated by this, in this work, we contribute a smart cross-modality network for survival analysis network named Lite-ProSENet that simulates a human's manner of decision making. Extensive experiments were conducted using data from 422 NSCLC patients from The Cancer Imaging Archive (TCIA). The results show that our Lite-ProSENet outperforms favorably again all comparison methods and achieves the new state of the art with the 89.3% on concordance. The code will be made publicly available. △ Less

Submitted 6 November, 2022; originally announced November 2022.

Comments: 11 pages, 6 figures, Multimodal learning, NSCLC, Survival analysis, Transformer

arXiv:2210.14909 [pdf]

Automated Diagnosis of Cardiovascular Diseases from Cardiac Magnetic Resonance Imaging Using Deep Learning Models: A Review

Authors: Mahboobeh Jafari, Afshin Shoeibi, Marjane Khodatars, Navid Ghassemi, Parisa Moridian, Niloufar Delfan, Roohallah Alizadehsani, Abbas Khosravi, Sai Ho Ling, Yu-Dong Zhang, Shui-Hua Wang, Juan M. Gorriz, Hamid Alinejad Rokny, U. Rajendra Acharya

Abstract: In recent years, cardiovascular diseases (CVDs) have become one of the leading causes of mortality globally. CVDs appear with minor symptoms and progressively get worse. The majority of people experience symptoms such as exhaustion, shortness of breath, ankle swelling, fluid retention, and other symptoms when starting CVD. Coronary artery disease (CAD), arrhythmia, cardiomyopathy, congenital heart… ▽ More In recent years, cardiovascular diseases (CVDs) have become one of the leading causes of mortality globally. CVDs appear with minor symptoms and progressively get worse. The majority of people experience symptoms such as exhaustion, shortness of breath, ankle swelling, fluid retention, and other symptoms when starting CVD. Coronary artery disease (CAD), arrhythmia, cardiomyopathy, congenital heart defect (CHD), mitral regurgitation, and angina are the most common CVDs. Clinical methods such as blood tests, electrocardiography (ECG) signals, and medical imaging are the most effective methods used for the detection of CVDs. Among the diagnostic methods, cardiac magnetic resonance imaging (CMR) is increasingly used to diagnose, monitor the disease, plan treatment and predict CVDs. Coupled with all the advantages of CMR data, CVDs diagnosis is challenging for physicians due to many slices of data, low contrast, etc. To address these issues, deep learning (DL) techniques have been employed to the diagnosis of CVDs using CMR data, and much research is currently being conducted in this field. This review provides an overview of the studies performed in CVDs detection using CMR images and DL techniques. The introduction section examined CVDs types, diagnostic methods, and the most important medical imaging techniques. In the following, investigations to detect CVDs using CMR images and the most significant DL methods are presented. Another section discussed the challenges in diagnosing CVDs from CMR data. Next, the discussion section discusses the results of this review, and future work in CVDs diagnosis from CMR images and DL techniques are outlined. The most important findings of this study are presented in the conclusion section. △ Less

Submitted 26 October, 2022; originally announced October 2022.

arXiv:2210.14611 [pdf]

Automatic Diagnosis of Myocarditis Disease in Cardiac MRI Modality using Deep Transformers and Explainable Artificial Intelligence

Authors: Mahboobeh Jafari, Afshin Shoeibi, Navid Ghassemi, Jonathan Heras, Sai Ho Ling, Amin Beheshti, Yu-Dong Zhang, Shui-Hua Wang, Roohallah Alizadehsani, Juan M. Gorriz, U. Rajendra Acharya, Hamid Alinejad Rokny

Abstract: Myocarditis is a significant cardiovascular disease (CVD) that poses a threat to the health of many individuals by causing damage to the myocardium. The occurrence of microbes and viruses, including the likes of HIV, plays a crucial role in the development of myocarditis disease (MCD). The images produced during cardiac magnetic resonance imaging (CMRI) scans are low contrast, which can make it ch… ▽ More Myocarditis is a significant cardiovascular disease (CVD) that poses a threat to the health of many individuals by causing damage to the myocardium. The occurrence of microbes and viruses, including the likes of HIV, plays a crucial role in the development of myocarditis disease (MCD). The images produced during cardiac magnetic resonance imaging (CMRI) scans are low contrast, which can make it challenging to diagnose cardiovascular diseases. In other hand, checking numerous CMRI slices for each CVD patient can be a challenging task for medical doctors. To overcome the existing challenges, researchers have suggested the use of artificial intelligence (AI)-based computer-aided diagnosis systems (CADS). The presented paper outlines a CADS for the detection of MCD from CMR images, utilizing deep learning (DL) methods. The proposed CADS consists of several steps, including dataset, preprocessing, feature extraction, classification, and post-processing. First, the Z-Alizadeh dataset was selected for the experiments. Subsequently, the CMR images underwent various preprocessing steps, including denoising, resizing, as well as data augmentation (DA) via CutMix and MixUp techniques. In the following, the most current deep pre-trained and transformer models are used for feature extraction and classification on the CMR images. The findings of our study reveal that transformer models exhibit superior performance in detecting MCD as opposed to pre-trained architectures. In terms of DL architectures, the Turbulence Neural Transformer (TNT) model exhibited impressive accuracy, reaching 99.73% utilizing a 10-fold cross-validation approach. Additionally, to pinpoint areas of suspicion for MCD in CMRI images, the Explainable-based Grad Cam method was employed. △ Less

Submitted 1 December, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

arXiv:2209.03251 [pdf, other]

Explicit Low-Bandwidth Evaluation Schemes for Weighted Sums of Reed-Solomon-Coded Symbols

Authors: Han Mao Kiah, Wilton Kim, Stanislav Kruglik, San Ling, Huaxiong Wang

Abstract: Motivated by applications in distributed storage, distributed computing, and homomorphic secret sharing, we study communication-efficient schemes for computing linear combinations of coded symbols. Specifically, we design low-bandwidth schemes that evaluate the weighted sum of $\ell$ coded symbols in a codeword $\pmb{c}\in\mathbb{F}^n$, when we are given access to $d$ of the remaining components i… ▽ More Motivated by applications in distributed storage, distributed computing, and homomorphic secret sharing, we study communication-efficient schemes for computing linear combinations of coded symbols. Specifically, we design low-bandwidth schemes that evaluate the weighted sum of $\ell$ coded symbols in a codeword $\pmb{c}\in\mathbb{F}^n$, when we are given access to $d$ of the remaining components in $\pmb{c}$. Formally, suppose that $\mathbb{F}$ is a field extension of $\mathbb{B}$ of degree $t$. Let $\pmb{c}$ be a codeword in a Reed-Solomon code of dimension $k$ and our task is to compute the weighted sum of $\ell$ coded symbols. In this paper, for some $s<t$, we provide an explicit scheme that performs this task by downloading $d(t-s)$ sub-symbols in $\mathbb{B}$ from $d$ available nodes, whenever $d\geq \ell|\mathbb{B}|^s-\ell+k$. In many cases, our scheme outperforms previous schemes in the literature. Furthermore, we provide a characterization of evaluation schemes for general linear codes. Then in the special case of Reed-Solomon codes, we use this characterization to derive a lower bound for the evaluation bandwidth. △ Less

Submitted 7 May, 2023; v1 submitted 7 September, 2022; originally announced September 2022.

Comments: Accepted to 2023 IEEE International Symposium on Information Theory

arXiv:2207.05647 [pdf, other]

doi 10.1007/s11128-023-04211-x

How Much Entanglement Does a Quantum Code Need?

Authors: Gaojun Luo, Martianus Frederic Ezerman, Markus Grassl, San Ling

Abstract: In the setting of entanglement-assisted quantum error-correcting codes (EAQECCs), the sender and the receiver have access to pre-shared entanglement. Such codes promise better information rates or improved error handling properties. Entanglement incurs costs and must be judiciously calibrated in designing quantum codes with good performance, relative to their deployment parameters. Revisiting kn… ▽ More In the setting of entanglement-assisted quantum error-correcting codes (EAQECCs), the sender and the receiver have access to pre-shared entanglement. Such codes promise better information rates or improved error handling properties. Entanglement incurs costs and must be judiciously calibrated in designing quantum codes with good performance, relative to their deployment parameters. Revisiting known constructions, we devise tools from classical coding theory to better understand how the amount of entanglement can be varied. We present three new propagation rules and discuss how each of them affects the error handling. Tables listing the parameters of the best performing qubit and qutrit EAQECCs that we can explicitly construct are supplied for reference and comparison. △ Less

Submitted 5 September, 2022; v1 submitted 12 July, 2022; originally announced July 2022.

Journal ref: Quantum Information Processing, vol. 23, article 4, 2024

arXiv:2206.14204 [pdf, other]

doi 10.1007/JHEP09(2022)216

An analytic evaluation of gravitational particle production of fermions via Stokes phenomenon

Authors: Soichiro Hashiba, Siyang Ling, Andrew J. Long

Abstract: The phenomenon of gravitational particle production can take place for quantum fields in curved spacetime. The abundance and energy spectrum of gravitationally produced particles is typically calculated by solving the field's mode equations on a time-dependent background metric. For purposes of studying dark matter production in an inflationary cosmology, these mode equations are often solved nume… ▽ More The phenomenon of gravitational particle production can take place for quantum fields in curved spacetime. The abundance and energy spectrum of gravitationally produced particles is typically calculated by solving the field's mode equations on a time-dependent background metric. For purposes of studying dark matter production in an inflationary cosmology, these mode equations are often solved numerically, which is computationally intensive, especially for the rapidly-oscillating high-momentum modes. However, these same modes are amenable to analytic evaluation via the Exact Wentzel-Kramers-Brillouin (EWKB) method, where gravitational particle production is a manifestation of the Stokes phenomenon. These analytic techniques have been used in the past to study gravitational particle production for spin-0 bosons. We extend the earlier work to study gravitational production of spin-1/2 and spin-3/2 fermions. We derive an analytic expression for the connection matrix (valid to all orders in perturbations) that relates Bogoliubov coefficients across a Stokes line connecting a merged pair of simple turning points. By comparing the analytic approximation with a direct numerical integration of the mode equations, we demonstrate an excellent agreement and highlight the utility of the Stokes phenomenon formalism applied to fermions. We discuss the implications for an analytic understanding of catastrophic particle production due to vanishing sound speed, which can occur for a spin-3/2 Rarita-Schwinger field. △ Less

Submitted 28 June, 2022; originally announced June 2022.

Comments: 33 pages + appendices and references, 10 figures

arXiv:2206.11233 [pdf]

doi 10.3389/fnmol.2022.999605

Automatic autism spectrum disorder detection using artificial intelligence methods with MRI neuroimaging: A review

Authors: Parisa Moridian, Navid Ghassemi, Mahboobeh Jafari, Salam Salloum-Asfar, Delaram Sadeghi, Marjane Khodatars, Afshin Shoeibi, Abbas Khosravi, Sai Ho Ling, Abdulhamit Subasi, Roohallah Alizadehsani, Juan M. Gorriz, Sara A Abdulla, U. Rajendra Acharya

Abstract: Autism spectrum disorder (ASD) is a brain condition characterized by diverse signs and symptoms that appear in early childhood. ASD is also associated with communication deficits and repetitive behavior in affected individuals. Various ASD detection methods have been developed, including neuroimaging modalities and psychological tests. Among these methods, magnetic resonance imaging (MRI) imaging… ▽ More Autism spectrum disorder (ASD) is a brain condition characterized by diverse signs and symptoms that appear in early childhood. ASD is also associated with communication deficits and repetitive behavior in affected individuals. Various ASD detection methods have been developed, including neuroimaging modalities and psychological tests. Among these methods, magnetic resonance imaging (MRI) imaging modalities are of paramount importance to physicians. Clinicians rely on MRI modalities to diagnose ASD accurately. The MRI modalities are non-invasive methods that include functional (fMRI) and structural (sMRI) neuroimaging methods. However, diagnosing ASD with fMRI and sMRI for specialists is often laborious and time-consuming; therefore, several computer-aided design systems (CADS) based on artificial intelligence (AI) have been developed to assist specialist physicians. Conventional machine learning (ML) and deep learning (DL) are the most popular schemes of AI used for diagnosing ASD. This study aims to review the automated detection of ASD using AI. We review several CADS that have been developed using ML techniques for the automated diagnosis of ASD using MRI modalities. There has been very limited work on the use of DL techniques to develop automated diagnostic models for ASD. A summary of the studies developed using DL is provided in the Supplementary Appendix. Then, the challenges encountered during the automated diagnosis of ASD using MRI and AI techniques are described in detail. Additionally, a graphical comparison of studies using ML and DL to diagnose ASD automatically is discussed. We suggest future approaches to detecting ASDs using AI techniques and MRI neuroimaging. △ Less

Submitted 6 October, 2022; v1 submitted 20 June, 2022; originally announced June 2022.

Journal ref: Moridian, et. al., Automatic autism spectrum disorder detection using artificial intelligence methods with MRI neuroimaging: A review, Frontiers in Molecular Neuroscience, Volume 15, 2022

arXiv:2206.09782 [pdf, ps, other]

Entanglement-Assisted and Subsystem Quantum Codes: New Propagation Rules and Constructions

Authors: Gaojun Luo, Martianus Frederic Ezerman, San Ling

Abstract: This paper proposes new propagation rules on quantum codes in the entanglement-assisted and in quantum subsystem scenarios. The rules lead to new families of such quantum codes whose parameters are demonstrably optimal. To obtain the results, we devise tools to puncture and shorten codes in ways that ensure their Hermitian hulls have certain desirable properties. More specifically, we give a gener… ▽ More This paper proposes new propagation rules on quantum codes in the entanglement-assisted and in quantum subsystem scenarios. The rules lead to new families of such quantum codes whose parameters are demonstrably optimal. To obtain the results, we devise tools to puncture and shorten codes in ways that ensure their Hermitian hulls have certain desirable properties. More specifically, we give a general framework to construct $k$-dimensional generalized Reed-Solomon codes whose Hermitian hulls are $(k-1)$-dimensional maximum distance separable codes. △ Less

Submitted 20 June, 2022; originally announced June 2022.

arXiv:2205.13599 [pdf, other]

VectorAdam for Rotation Equivariant Geometry Optimization

Authors: Selena Ling, Nicholas Sharp, Alec Jacobson

Abstract: The Adam optimization algorithm has proven remarkably effective for optimization problems across machine learning and even traditional tasks in geometry processing. At the same time, the development of equivariant methods, which preserve their output under the action of rotation or some other transformation, has proven to be important for geometry problems across these domains. In this work, we ob… ▽ More The Adam optimization algorithm has proven remarkably effective for optimization problems across machine learning and even traditional tasks in geometry processing. At the same time, the development of equivariant methods, which preserve their output under the action of rotation or some other transformation, has proven to be important for geometry problems across these domains. In this work, we observe that Adam $-$ when treated as a function that maps initial conditions to optimized results $-$ is not rotation equivariant for vector-valued parameters due to per-coordinate moment updates. This leads to significant artifacts and biases in practice. We propose to resolve this deficiency with VectorAdam, a simple modification which makes Adam rotation-equivariant by accounting for the vector structure of optimization variables. We demonstrate this approach on problems in machine learning and traditional geometric optimization, showing that equivariant VectorAdam resolves the artifacts and biases of traditional Adam when applied to vector-valued data, with equivalent or even improved rates of convergence. △ Less

Submitted 13 November, 2022; v1 submitted 26 May, 2022; originally announced May 2022.

Comments: 10 pages, 9 figures

arXiv:2205.07565 [pdf, other]

A Framework to Map VMAF with the Probability of Just Noticeable Difference between Video Encoding Recipes

Authors: Jingwen Zhu, Suiyi Ling, Yoann Baveye, Patrick Le Callet

Abstract: Just Noticeable Difference (JND) model developed based on Human Vision System (HVS) through subjective studies is valuable for many multimedia use cases. In the streaming industries, it is commonly applied to reach a good balance between compression efficiency and perceptual quality when selecting video encoding recipes. Nevertheless, recent state-of-the-art deep learning based JND prediction mode… ▽ More Just Noticeable Difference (JND) model developed based on Human Vision System (HVS) through subjective studies is valuable for many multimedia use cases. In the streaming industries, it is commonly applied to reach a good balance between compression efficiency and perceptual quality when selecting video encoding recipes. Nevertheless, recent state-of-the-art deep learning based JND prediction model relies on large-scale JND ground truth that is expensive and time consuming to collect. Most of the existing JND datasets contain limited number of contents and are limited to a certain codec (e.g., H264). As a result, JND prediction models that were trained on such datasets are normally not agnostic to the codecs. To this end, in order to decouple encoding recipes and JND estimation, we propose a novel framework to map the difference of objective Video Quality Assessment (VQA) scores, i.e., VMAF, between two given videos encoded with different encoding recipes from the same content to the probability of having just noticeable difference between them. The proposed probability mapping model learns from DCR test data, which is significantly cheaper compared to standard JND subjective test. As we utilize objective VQA metric (e.g., VMAF that trained with contents encoded with different codecs) as proxy to estimate JND, our model is agnostic to codecs and computationally efficient. Throughout extensive experiments, it is demonstrated that the proposed model is able to estimate JND values efficiently. △ Less

Submitted 20 May, 2022; v1 submitted 16 May, 2022; originally announced May 2022.

arXiv:2205.02448 [pdf, ps, other]

doi 10.1088/0256-307X/39/10/107501

Incommensurate magnetic order in Sm$_3$BWO$_9$ with the distorted kagome lattice

Authors: K. Y. Zeng, F. Y. Song, L. S. Ling, W. Tong, Shiliang Li, Z. M. Tian, Long Ma, Li Pi

Abstract: We investigate the magnetic ground state of Sm$_3$BWO$_9$ with the distorted kagome lattice. A magnetic phase transition is identified at $T_N=0.75$ K from the temperature dependence of specific heat. From $^{11}$B nuclear magnetic resonance (NMR) measurements, an incommensurate magnetic order is shown by the double-horn type spectra under a $c$-axis magnetic field. While, absence of line splittin… ▽ More We investigate the magnetic ground state of Sm$_3$BWO$_9$ with the distorted kagome lattice. A magnetic phase transition is identified at $T_N=0.75$ K from the temperature dependence of specific heat. From $^{11}$B nuclear magnetic resonance (NMR) measurements, an incommensurate magnetic order is shown by the double-horn type spectra under a $c$-axis magnetic field. While, absence of line splitting is observed for field oriented within the $ab$-plane, indicating the incommensurate modulation of the internal field strictly along $c$-axis. From the spin dynamics, the critical slowing down behavior is observed in the temperature dependence of $1/T_1$ with $μ_0H\perp c$-axis, which is completely absent in that with $μ_0H||c$-axis. Based on the local symmetry of $^{11}$B sites, we analyze the hyperfine coupling tensors and propose two constraints on the possible magnetic structure. The single ion anisotropy should play an important role in the determination of the contrasting ground states of Sm$_3$BWO$_9$ and Pr$_3$BWO$_9$. △ Less

Submitted 28 September, 2022; v1 submitted 5 May, 2022; originally announced May 2022.

Comments: 7 pages, 5 figures

Journal ref: Chin. Phys. Lett. 39, 107501 (2022)

arXiv:2112.13725 [pdf, other]

Near-Optimal Bounds for Generalized Orthogonal Procrustes Problem via Generalized Power Method

Authors: Shuyang Ling

Abstract: Given multiple point clouds, how to find the rigid transform (rotation, reflection, and shifting) such that these point clouds are well aligned? This problem, known as the generalized orthogonal Procrustes problem (GOPP), has found numerous applications in statistics, computer vision, and imaging science. While one commonly-used method is finding the least squares estimator, it is generally an NP-… ▽ More Given multiple point clouds, how to find the rigid transform (rotation, reflection, and shifting) such that these point clouds are well aligned? This problem, known as the generalized orthogonal Procrustes problem (GOPP), has found numerous applications in statistics, computer vision, and imaging science. While one commonly-used method is finding the least squares estimator, it is generally an NP-hard problem to obtain the least squares estimator exactly due to the notorious nonconvexity. In this work, we apply the semidefinite programming (SDP) relaxation and the generalized power method to solve this generalized orthogonal Procrustes problem. In particular, we assume the data are generated from a signal-plus-noise model: each observed point cloud is a noisy copy of the same unknown point cloud transformed by an unknown orthogonal matrix and also corrupted by additive Gaussian noise. We show that the generalized power method (equivalently alternating minimization algorithm) with spectral initialization converges to the unique global optimum to the SDP relaxation, provided that the signal-to-noise ratio is high. Moreover, this limiting point is exactly the least squares estimator and also the maximum likelihood estimator. In addition, we derive a block-wise estimation error for each orthogonal matrix and the underlying point cloud. Our theoretical bound is near-optimal in terms of the information-theoretic limit (only loose by a factor of the dimension and a log factor). Our results significantly improve the state-of-the-art results on the tightness of the SDP relaxation for the generalized orthogonal Procrustes problem, an open problem posed by Bandeira, Khoo, and Singer in 2014. △ Less

Submitted 27 December, 2021; originally announced December 2021.

arXiv:2112.05644 [pdf, other]

doi 10.1111/cgf.14357

Roominoes: Generating Novel 3D Floor Plans From Existing 3D Rooms

Authors: Kai Wang, Xianghao Xu, Leon Lei, Selena Ling, Natalie Lindsay, Angel X. Chang, Manolis Savva, Daniel Ritchie

Abstract: Realistic 3D indoor scene datasets have enabled significant recent progress in computer vision, scene understanding, autonomous navigation, and 3D reconstruction. But the scale, diversity, and customizability of existing datasets is limited, and it is time-consuming and expensive to scan and annotate more. Fortunately, combinatorics is on our side: there are enough individual rooms in existing 3D… ▽ More Realistic 3D indoor scene datasets have enabled significant recent progress in computer vision, scene understanding, autonomous navigation, and 3D reconstruction. But the scale, diversity, and customizability of existing datasets is limited, and it is time-consuming and expensive to scan and annotate more. Fortunately, combinatorics is on our side: there are enough individual rooms in existing 3D scene datasets, if there was but a way to recombine them into new layouts. In this paper, we propose the task of generating novel 3D floor plans from existing 3D rooms. We identify three sub-tasks of this problem: generation of 2D layout, retrieval of compatible 3D rooms, and deformation of 3D rooms to fit the layout. We then discuss different strategies for solving the problem, and design two representative pipelines: one uses available 2D floor plans to guide selection and deformation of 3D rooms; the other learns to retrieve a set of compatible 3D rooms and combine them into novel layouts. We design a set of metrics that evaluate the generated results with respect to each of the three subtasks and show that different methods trade off performance on these subtasks. Finally, we survey downstream tasks that benefit from generated 3D scenes and discuss strategies in selecting the methods most appropriate for the demands of these tasks. △ Less

Submitted 10 December, 2021; originally announced December 2021.

Comments: Symposium on Geometry Processing (SGP) 2021

Journal ref: Computer Graphics Forum, 40: 57-69 (2021)

arXiv:2110.06956 [pdf, other]

Considering user agreement in learning to predict the aesthetic quality

Authors: Suiyi Ling, Andreas Pastor, Junle Wang, Patrick Le Callet

Abstract: How to robustly rank the aesthetic quality of given images has been a long-standing ill-posed topic. Such challenge stems mainly from the diverse subjective opinions of different observers about the varied types of content. There is a growing interest in estimating the user agreement by considering the standard deviation of the scores, instead of only predicting the mean aesthetic opinion score. N… ▽ More How to robustly rank the aesthetic quality of given images has been a long-standing ill-posed topic. Such challenge stems mainly from the diverse subjective opinions of different observers about the varied types of content. There is a growing interest in estimating the user agreement by considering the standard deviation of the scores, instead of only predicting the mean aesthetic opinion score. Nevertheless, when comparing a pair of contents, few studies consider how confident are we regarding the difference in the aesthetic scores. In this paper, we thus propose (1) a re-adapted multi-task attention network to predict both the mean opinion score and the standard deviation in an end-to-end manner; (2) a brand-new confidence interval ranking loss that encourages the model to focus on image-pairs that are less certain about the difference of their aesthetic scores. With such loss, the model is encouraged to learn the uncertainty of the content that is relevant to the diversity of observers' opinions, i.e., user disagreement. Extensive experiments have demonstrated that the proposed multi-task aesthetic model achieves state-of-the-art performance on two different types of aesthetic datasets, i.e., AVA and TMGA. △ Less

Submitted 13 October, 2021; originally announced October 2021.

Comments: 5 pages

MSC Class: 68T07 ACM Class: I.4.0

arXiv:2110.04056 [pdf, ps, other]

Improving Pseudo-label Training For End-to-end Speech Recognition Using Gradient Mask

Authors: Shaoshi Ling, Chen Shen, Meng Cai, Zejun Ma

Abstract: In the recent trend of semi-supervised speech recognition, both self-supervised representation learning and pseudo-labeling have shown promising results. In this paper, we propose a novel approach to combine their ideas for end-to-end speech recognition model. Without any extra loss function, we utilize the Gradient Mask to optimize the model when training on pseudo-label. This method forces the s… ▽ More In the recent trend of semi-supervised speech recognition, both self-supervised representation learning and pseudo-labeling have shown promising results. In this paper, we propose a novel approach to combine their ideas for end-to-end speech recognition model. Without any extra loss function, we utilize the Gradient Mask to optimize the model when training on pseudo-label. This method forces the speech recognition model to predict from the masked input to learn strong acoustic representation and make training robust to label noise. In our semi-supervised experiments, the method can improve the model performance when training on pseudo-label and our method achieved competitive results comparing with other semi-supervised approaches on the Librispeech 100 hours experiments. △ Less

Submitted 8 October, 2021; originally announced October 2021.

Showing 1–50 of 195 results for author: Ling, S