subscribe to arXiv mailings

ProSpect: Prompt Spectrum for Attribute-Aware Personalization of Diffusion Models

Authors: Yuxin Zhang, Weiming Dong, Fan Tang, Nisha Huang, Haibin Huang, Chongyang Ma, Tong-Yee Lee, Oliver Deussen, Changsheng Xu

Abstract: Personalizing generative models offers a way to guide image generation with user-provided references. Current personalization methods can invert an object or concept into the textual conditioning space and compose new natural sentences for text-to-image diffusion models. However, representing and editing specific visual attributes such as material, style, and layout remains a challenge, leading to… ▽ More Personalizing generative models offers a way to guide image generation with user-provided references. Current personalization methods can invert an object or concept into the textual conditioning space and compose new natural sentences for text-to-image diffusion models. However, representing and editing specific visual attributes such as material, style, and layout remains a challenge, leading to a lack of disentanglement and editability. To address this problem, we propose a novel approach that leverages the step-by-step generation process of diffusion models, which generate images from low to high frequency information, providing a new perspective on representing, generating, and editing images. We develop the Prompt Spectrum Space P*, an expanded textual conditioning space, and a new image representation method called \sysname. ProSpect represents an image as a collection of inverted textual token embeddings encoded from per-stage prompts, where each prompt corresponds to a specific generation stage (i.e., a group of consecutive steps) of the diffusion model. Experimental results demonstrate that P* and ProSpect offer better disentanglement and controllability compared to existing methods. We apply ProSpect in various personalized attribute-aware image generation applications, such as image-guided or text-driven manipulations of materials, style, and layout, achieving previously unattainable results from a single image input without fine-tuning the diffusion models. Our source code is available athttps://github.com/zyxElsa/ProSpect. △ Less

Submitted 7 December, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

arXiv:2305.11081 [pdf, other]

Contrastive State Augmentations for Reinforcement Learning-Based Recommender Systems

Authors: Zhaochun Ren, Na Huang, Yidan Wang, Pengjie Ren, Jun Ma, Jiahuan Lei, Xinlei Shi, Hengliang Luo, Joemon M Jose, Xin Xin

Abstract: Learning reinforcement learning (RL)-based recommenders from historical user-item interaction sequences is vital to generate high-reward recommendations and improve long-term cumulative benefits. However, existing RL recommendation methods encounter difficulties (i) to estimate the value functions for states which are not contained in the offline training data, and (ii) to learn effective state re… ▽ More Learning reinforcement learning (RL)-based recommenders from historical user-item interaction sequences is vital to generate high-reward recommendations and improve long-term cumulative benefits. However, existing RL recommendation methods encounter difficulties (i) to estimate the value functions for states which are not contained in the offline training data, and (ii) to learn effective state representations from user implicit feedback due to the lack of contrastive signals. In this work, we propose contrastive state augmentations (CSA) for the training of RL-based recommender systems. To tackle the first issue, we propose four state augmentation strategies to enlarge the state space of the offline data. The proposed method improves the generalization capability of the recommender by making the RL agent visit the local state regions and ensuring the learned value functions are similar between the original and augmented states. For the second issue, we propose introducing contrastive signals between augmented states and the state randomly sampled from other sessions to improve the state representation learning further. To verify the effectiveness of the proposed CSA, we conduct extensive experiments on two publicly accessible datasets and one dataset collected from a real-life e-commerce platform. We also conduct experiments on a simulated environment as the online evaluation setting. Experimental results demonstrate that CSA can effectively improve recommendation performance. △ Less

Submitted 18 May, 2023; originally announced May 2023.

arXiv:2305.09893 [pdf, other]

Integrating Multiple Sources Knowledge for Class Asymmetry Domain Adaptation Segmentation of Remote Sensing Images

Authors: Kuiliang Gao, Anzhu Yu, Xiong You, Wenyue Guo, Ke Li, Ningbo Huang

Abstract: In the existing unsupervised domain adaptation (UDA) methods for remote sensing images (RSIs) semantic segmentation, class symmetry is an widely followed ideal assumption, where the source and target RSIs have exactly the same class space. In practice, however, it is often very difficult to find a source RSI with exactly the same classes as the target RSI. More commonly, there are multiple source… ▽ More In the existing unsupervised domain adaptation (UDA) methods for remote sensing images (RSIs) semantic segmentation, class symmetry is an widely followed ideal assumption, where the source and target RSIs have exactly the same class space. In practice, however, it is often very difficult to find a source RSI with exactly the same classes as the target RSI. More commonly, there are multiple source RSIs available. To this end, a novel class asymmetry RSIs domain adaptation method with multiple sources is proposed in this paper, which consists of four key components. Firstly, a multi-branch segmentation network is built to learn an expert for each source RSI. Secondly, a novel collaborative learning method with the cross-domain mixing strategy is proposed, to supplement the class information for each source while achieving the domain adaptation of each source-target pair. Thirdly, a pseudo-label generation strategy is proposed to effectively combine strengths of different experts, which can be flexibly applied to two cases where the source class union is equal to or includes the target class set. Fourthly, a multiview-enhanced knowledge integration module is developed for the high-level knowledge routing and transfer from multiple domains to target predictions. △ Less

Submitted 16 May, 2023; originally announced May 2023.

Comments: 17 pages, 10 figures

arXiv:2305.05464 [pdf, other]

Style-A-Video: Agile Diffusion for Arbitrary Text-based Video Style Transfer

Authors: Nisha Huang, Yuxin Zhang, Weiming Dong

Abstract: Large-scale text-to-video diffusion models have demonstrated an exceptional ability to synthesize diverse videos. However, due to the lack of extensive text-to-video datasets and the necessary computational resources for training, directly applying these models for video stylization remains difficult. Also, given that the noise addition process on the input content is random and destructive, fulfi… ▽ More Large-scale text-to-video diffusion models have demonstrated an exceptional ability to synthesize diverse videos. However, due to the lack of extensive text-to-video datasets and the necessary computational resources for training, directly applying these models for video stylization remains difficult. Also, given that the noise addition process on the input content is random and destructive, fulfilling the style transfer task's content preservation criteria is challenging. This paper proposes a zero-shot video stylization method named Style-A-Video, which utilizes a generative pre-trained transformer with an image latent diffusion model to achieve a concise text-controlled video stylization. We improve the guidance condition in the denoising process, establishing a balance between artistic expression and structure preservation. Furthermore, to decrease inter-frame flicker and avoid the formation of additional artifacts, we employ a sampling optimization and a temporal consistency module. Extensive experiments show that we can attain superior content preservation and stylistic performance while incurring less consumption than previous solutions. Code will be available at https://github.com/haha-lisa/Style-A-Video. △ Less

Submitted 9 May, 2023; originally announced May 2023.

arXiv:2305.01830 [pdf, ps, other]

Finite-time and fixed-time consensus control of multi-agent systems driven by parabolic partial differential equations

Authors: Xu-hui Wang, Xue-song Li, Nan-jing Huang

Abstract: This paper focuses on the study of the finite-time consensus (FTC) and fixed-time consensus (FXC) issues of multi-agent systems (MASs) driven by parabolic partial differential equations (PDEs). Compared with the study in the existing literature, the topic of FTC and FXC control is first embodied in MASs driven by parabolic PDEs. Based on the Lyapunov theorems, the FTC and FXC controllers are devis… ▽ More This paper focuses on the study of the finite-time consensus (FTC) and fixed-time consensus (FXC) issues of multi-agent systems (MASs) driven by parabolic partial differential equations (PDEs). Compared with the study in the existing literature, the topic of FTC and FXC control is first embodied in MASs driven by parabolic PDEs. Based on the Lyapunov theorems, the FTC and FXC controllers are devised to ensure that the MASs converge to a stable state with external disturbance. Furthermore, we simplify the controllers to guarantee the FTC and FXC of MASs without external disturbance. Finally, two illustrative examples are given to verify the feasibility of controllers. △ Less

Submitted 2 May, 2023; originally announced May 2023.

arXiv:2302.14749 [pdf, other]

doi 10.3847/2041-8213/acbf45

Simultaneous Millimeter-wave, Gamma-ray, and Optical Monitoring of the Blazar PKS 2326-502 During a Flaring State

Authors: J. C. Hood II, A. Simpson, A. McDaniel, A. Foster, P. A. R. Ade, M. Ajello, A. J. Anderson, J. E. Austermann, J. A. Beall, A. N. Bender, B. A. Benson, F. Bianchini, L. E. Bleem, J. E. Carlstrom, C. L. Chang, P. Chaubal, H. C. Chiang, T-L. Chou, R. Citron, C. Corbett Moran, T. M. Crawford, A. T. Crites, T. de Haan, M. A. Dobbs, W. Everett , et al. (44 additional authors not shown)

Abstract: Including millimeter-wave (mm-wave) data in multi-wavelength studies of the variability of active galactic nuclei (AGN) can provide insights into AGN physics that are not easily accessible at other wavelengths. We demonstrate in this work the potential of cosmic microwave background (CMB) telescopes to provide long-term, high-cadence mm-wave AGN monitoring over large fractions of sky. We report on… ▽ More Including millimeter-wave (mm-wave) data in multi-wavelength studies of the variability of active galactic nuclei (AGN) can provide insights into AGN physics that are not easily accessible at other wavelengths. We demonstrate in this work the potential of cosmic microwave background (CMB) telescopes to provide long-term, high-cadence mm-wave AGN monitoring over large fractions of sky. We report on a pilot study using data from the SPTpol instrument on the South Pole Telescope (SPT), which was designed to observe the CMB at arcminute and larger angular scales. Between 2013 and 2016, SPTpol was used primarily to observe a single 500 deg^2 field, covering the entire field several times per day with detectors sensitive to radiation in bands centered at 95 and 150 GHz. We use SPT 150 GHz observations to create AGN light curves, and we compare these mm-wave light curves to those at other wavelengths, in particular gamma-ray and optical. In this Letter, we focus on a single source, PKS 2326-502, which has extensive, day-timescale monitoring data in gamma-ray, optical, and now mm-wave between 2013 and 2016. We find PKS 2326-502 to be in a flaring state in the first two years of this monitoring, and we present a search for evidence of correlated variability between mm-wave, optical R band, and gamma-ray observations. This pilot study is paving the way for AGN monitoring with current and upcoming CMB experiments such as SPT-3G, Simons Observatory, and CMB-S4, including multi-wavelength studies with facilities such as VRO-LSST. △ Less

Submitted 28 February, 2023; originally announced February 2023.

Comments: 9 pages, 3 figures, accepted to Astrophysical Journal Letters

arXiv:2302.11797 [pdf, other]

Region-Aware Diffusion for Zero-shot Text-driven Image Editing

Authors: Nisha Huang, Fan Tang, Weiming Dong, Tong-Yee Lee, Changsheng Xu

Abstract: Image manipulation under the guidance of textual descriptions has recently received a broad range of attention. In this study, we focus on the regional editing of images with the guidance of given text prompts. Different from current mask-based image editing methods, we propose a novel region-aware diffusion model (RDM) for entity-level image editing, which could automatically locate the region of… ▽ More Image manipulation under the guidance of textual descriptions has recently received a broad range of attention. In this study, we focus on the regional editing of images with the guidance of given text prompts. Different from current mask-based image editing methods, we propose a novel region-aware diffusion model (RDM) for entity-level image editing, which could automatically locate the region of interest and replace it following given text prompts. To strike a balance between image fidelity and inference speed, we design the intensive diffusion pipeline by combing latent space diffusion and enhanced directional guidance. In addition, to preserve image content in non-edited regions, we introduce regional-aware entity editing to modify the region of interest and preserve the out-of-interest region. We validate the proposed RDM beyond the baseline methods through extensive qualitative and quantitative experiments. The results show that RDM outperforms the previous approaches in terms of visual quality, overall harmonization, non-editing region content preservation, and text-image semantic consistency. The codes are available at https://github.com/haha-lisa/RDM-Region-Aware-Diffusion-Model. △ Less

Submitted 23 February, 2023; originally announced February 2023.

arXiv:2212.11191 [pdf, other]

Separating MAX 2-AND, MAX DI-CUT and MAX CUT

Authors: Joshua Brakensiek, Neng Huang, Aaron Potechin, Uri Zwick

Abstract: Assuming the Unique Games Conjecture (UGC), the best approximation ratio that can be obtained in polynomial time for the MAX CUT problem is $α_{\text{CUT}}\simeq 0.87856$, obtained by the celebrated SDP-based approximation algorithm of Goemans and Williamson. The currently best approximation algorithm for MAX DI-CUT, i.e., the MAX CUT problem in directed graphs, achieves a ratio of about… ▽ More Assuming the Unique Games Conjecture (UGC), the best approximation ratio that can be obtained in polynomial time for the MAX CUT problem is $α_{\text{CUT}}\simeq 0.87856$, obtained by the celebrated SDP-based approximation algorithm of Goemans and Williamson. The currently best approximation algorithm for MAX DI-CUT, i.e., the MAX CUT problem in directed graphs, achieves a ratio of about $0.87401$, leaving open the question whether MAX DI-CUT can be approximated as well as MAX CUT. We obtain a slightly improved algorithm for MAX DI-CUT and a new UGC-hardness result for it, showing that $0.87446\le α_{\text{DI-CUT}}\le 0.87461$, where $α_{\text{DI-CUT}}$ is the best approximation ratio that can be obtained in polynomial time for MAX DI-CUT under UGC. The new upper bound separates MAX DI-CUT from MAX CUT, resolving a question raised by Feige and Goemans. A natural generalization of MAX DI-CUT is the MAX 2-AND problem in which each constraint is of the form $z_1\land z_2$, where $z_1$ and $z_2$ are literals, i.e., variables or their negations (In MAX DI-CUT each constraint is of the form $\bar{x}_1\land x_2$, where $x_1$ and $x_2$ are variables.) Austrin separated MAX 2-AND from MAX CUT by showing that $α_{\text{2AND}} < 0.87435$ and conjectured that MAX 2-AND and MAX DI-CUT have the same approximation ratio. Our new lower bound on MAX DI-CUT refutes this conjecture, completing the separation of the three problems MAX 2-AND, MAX DI-CUT and MAX CUT. We also obtain a new lower bound for MAX 2-AND, showing that $0.87414\le α_{\text{2AND}}\le 0.87435$. Our upper bound on MAX DI-CUT is achieved via a simple, analytical proof. The lower bounds on MAX DI-CUT and MAX 2-AND (the new approximation algorithms) use experimentally-discovered distributions of rounding functions which are then verified via computer-assisted proofs. △ Less

Submitted 12 April, 2023; v1 submitted 21 December, 2022; originally announced December 2022.

Comments: 39 pages, 5 figures, 7 tables

arXiv:2212.08366 [pdf, ps, other]

Stochastic differential variational inequalities with applications

Authors: Yao-Jia Zhang, Tao Chen, Nan-jing Huang, Xue-song Li

Abstract: In this paper, we introduce and study a stochastic differential variational inequality (SDVI) which consists of a stochastic differential equation and a stochastic variational inequality. We obtain the existence and uniqueness of the solutions for SDVI by using the iteration method and Gronwall's inequality. Moreover, we show the convergence of Euler scheme for solving SDVI under some mild conditi… ▽ More In this paper, we introduce and study a stochastic differential variational inequality (SDVI) which consists of a stochastic differential equation and a stochastic variational inequality. We obtain the existence and uniqueness of the solutions for SDVI by using the iteration method and Gronwall's inequality. Moreover, we show the convergence of Euler scheme for solving SDVI under some mild conditions. Finally, we apply the obtained results to solve the electrical circuits with diodes and the collapse of the bridge problems in stochastic environment. △ Less

Submitted 16 December, 2022; originally announced December 2022.

arXiv:2212.05642 [pdf, other]

doi 10.1103/PhysRevD.108.023510

A Measurement of the CMB Temperature Power Spectrum and Constraints on Cosmology from the SPT-3G 2018 TT/TE/EE Data Set

Authors: L. Balkenhol, D. Dutcher, A. Spurio Mancini, A. Doussot, K. Benabed, S. Galli, P. A. R. Ade, A. J. Anderson, B. Ansarinejad, M. Archipley, A. N. Bender, B. A. Benson, F. Bianchini, L. E. Bleem, F. R. Bouchet, L. Bryant, E. Camphuis, J. E. Carlstrom, T. W. Cecil, C. L. Chang, P. Chaubal, P. M. Chichura, T. -L. Chou, A. Coerver, T. M. Crawford , et al. (62 additional authors not shown)

Abstract: We present a sample-variance-limited measurement of the temperature power spectrum ($TT$) of the cosmic microwave background (CMB) using observations of a $\sim\! 1500 \,\mathrm{deg}^2$ field made by SPT-3G in 2018. We report multifrequency power spectrum measurements at 95, 150, and 220GHz covering the angular multipole range $750 \leq \ell < 3000$. We combine this $TT$ measurement with the publi… ▽ More We present a sample-variance-limited measurement of the temperature power spectrum ($TT$) of the cosmic microwave background (CMB) using observations of a $\sim\! 1500 \,\mathrm{deg}^2$ field made by SPT-3G in 2018. We report multifrequency power spectrum measurements at 95, 150, and 220GHz covering the angular multipole range $750 \leq \ell < 3000$. We combine this $TT$ measurement with the published polarization power spectrum measurements from the 2018 observing season and update their associated covariance matrix to complete the SPT-3G 2018 $TT/TE/EE$ data set. This is the first analysis to present cosmological constraints from SPT $TT$, $TE$, and $EE$ power spectrum measurements jointly. We blind the cosmological results and subject the data set to a series of consistency tests at the power spectrum and parameter level. We find excellent agreement between frequencies and spectrum types and our results are robust to the modeling of astrophysical foregrounds. We report results for $Λ$CDM and a series of extensions, drawing on the following parameters: the amplitude of the gravitational lensing effect on primary power spectra $A_\mathrm{L}$, the effective number of neutrino species $N_{\mathrm{eff}}$, the primordial helium abundance $Y_{\mathrm{P}}$, and the baryon clumping factor due to primordial magnetic fields $b$. We find that the SPT-3G 2018 $T/TE/EE$ data are well fit by $Λ$CDM with a probability-to-exceed of $15\%$. For $Λ$CDM, we constrain the expansion rate today to $H_0 = 68.3 \pm 1.5\,\mathrm{km\,s^{-1}\,Mpc^{-1}}$ and the combined structure growth parameter to $S_8 = 0.797 \pm 0.042$. The SPT-based results are effectively independent of Planck, and the cosmological parameter constraints from either data set are within $<1\,σ$ of each other. (abridged) △ Less

Submitted 27 July, 2023; v1 submitted 11 December, 2022; originally announced December 2022.

Comments: 35 Pages, 17 Figures, 11 Tables

arXiv:2212.05248 [pdf, ps, other]

Stochastic Linear-quadratic Control Problems with Affine Constraints

Authors: Zhun Gou, Nan-jing Huang, Xian-jun Long, Jian-hao Kang

Abstract: This paper investigates the stochastic linear-quadratic control problems with affine constraints, in which both equality and inequality constraints are involved. With the help of the Pontryagin maximum principle and Lagrangian duality theory, the dual problem of original problem is established and the state feedback form of the solution to the optimal control problem is obtained. Under the Slater… ▽ More This paper investigates the stochastic linear-quadratic control problems with affine constraints, in which both equality and inequality constraints are involved. With the help of the Pontryagin maximum principle and Lagrangian duality theory, the dual problem of original problem is established and the state feedback form of the solution to the optimal control problem is obtained. Under the Slater condition, the equivalence is proved between the solutions to the original problem and the ones of the dual problem, and the KKT condition is also provided for solving original problem. Especially, a new sufficient condition is given for the invertibility assumption, which ensures the uniqueness of the solutions to the dual problem. △ Less

Submitted 15 April, 2024; v1 submitted 10 December, 2022; originally announced December 2022.

arXiv:2212.01271 [pdf, other]

Protecting the quantum interference of cat states by phase-space compression

Authors: Xiaozhou Pan, Jonathan Schwinger, Ni-Ni Huang, Pengtao Song, Weipin Chua, Fumiya Hanamura, Atharv Joshi, Fernando Valadares, Radim Filip, Yvonne Y. Gao

Abstract: Cat states, with their unique phase-space interference properties, are ideal candidates for understanding fundamental principles of quantum mechanics and performing key quantum information processing tasks. However, they are highly susceptible to photon loss, which inevitably diminishes their quantum non-Gaussian features. Here, we protect these non-Gaussian features against photon loss by compres… ▽ More Cat states, with their unique phase-space interference properties, are ideal candidates for understanding fundamental principles of quantum mechanics and performing key quantum information processing tasks. However, they are highly susceptible to photon loss, which inevitably diminishes their quantum non-Gaussian features. Here, we protect these non-Gaussian features against photon loss by compressing the phase-space distribution of a cat state. We achieve this compression with a deterministic technique based on the echo conditional displacement operation in a circuit QED device. We present a versatile technique for creating robust non-Gaussian continuous-variable resource states in a highly linear bosonic mode and manipulating their phase-space distribution to achieve enhanced resilience against photon loss. Compressed cat states offer an attractive avenue for obtaining new insights into quantum foundations and quantum metrology, and for developing inherently more protected bosonic codewords for quantum error correction. △ Less

Submitted 2 December, 2022; originally announced December 2022.

Comments: 12 pages, 9 figures

arXiv:2211.13203 [pdf, other]

Inversion-Based Style Transfer with Diffusion Models

Authors: Yuxin Zhang, Nisha Huang, Fan Tang, Haibin Huang, Chongyang Ma, Weiming Dong, Changsheng Xu

Abstract: The artistic style within a painting is the means of expression, which includes not only the painting material, colors, and brushstrokes, but also the high-level attributes including semantic elements, object shapes, etc. Previous arbitrary example-guided artistic image generation methods often fail to control shape changes or convey elements. The pre-trained text-to-image synthesis diffusion prob… ▽ More The artistic style within a painting is the means of expression, which includes not only the painting material, colors, and brushstrokes, but also the high-level attributes including semantic elements, object shapes, etc. Previous arbitrary example-guided artistic image generation methods often fail to control shape changes or convey elements. The pre-trained text-to-image synthesis diffusion probabilistic models have achieved remarkable quality, but it often requires extensive textual descriptions to accurately portray attributes of a particular painting. We believe that the uniqueness of an artwork lies precisely in the fact that it cannot be adequately explained with normal language. Our key idea is to learn artistic style directly from a single painting and then guide the synthesis without providing complex textual descriptions. Specifically, we assume style as a learnable textual description of a painting. We propose an inversion-based style transfer method (InST), which can efficiently and accurately learn the key information of an image, thus capturing and transferring the artistic style of a painting. We demonstrate the quality and efficiency of our method on numerous paintings of various artists and styles. Code and models are available at https://github.com/zyxElsa/InST. △ Less

Submitted 20 March, 2023; v1 submitted 23 November, 2022; originally announced November 2022.

Comments: accepted by CVPR 2023

arXiv:2211.10682 [pdf, other]

DiffStyler: Controllable Dual Diffusion for Text-Driven Image Stylization

Authors: Nisha Huang, Yuxin Zhang, Fan Tang, Chongyang Ma, Haibin Huang, Yong Zhang, Weiming Dong, Changsheng Xu

Abstract: Despite the impressive results of arbitrary image-guided style transfer methods, text-driven image stylization has recently been proposed for transferring a natural image into a stylized one according to textual descriptions of the target style provided by the user. Unlike the previous image-to-image transfer approaches, text-guided stylization progress provides users with a more precise and intui… ▽ More Despite the impressive results of arbitrary image-guided style transfer methods, text-driven image stylization has recently been proposed for transferring a natural image into a stylized one according to textual descriptions of the target style provided by the user. Unlike the previous image-to-image transfer approaches, text-guided stylization progress provides users with a more precise and intuitive way to express the desired style. However, the huge discrepancy between cross-modal inputs/outputs makes it challenging to conduct text-driven image stylization in a typical feed-forward CNN pipeline. In this paper, we present DiffStyler, a dual diffusion processing architecture to control the balance between the content and style of the diffused results. The cross-modal style information can be easily integrated as guidance during the diffusion process step-by-step. Furthermore, we propose a content image-based learnable noise on which the reverse denoising process is based, enabling the stylization results to better preserve the structure information of the content image. We validate the proposed DiffStyler beyond the baseline methods through extensive qualitative and quantitative experiments. Code is available at \url{https://github.com/haha-lisa/Diffstyler}. △ Less

Submitted 18 December, 2023; v1 submitted 19 November, 2022; originally announced November 2022.

arXiv:2211.04646 [pdf, other]

doi 10.1038/s41535-023-00562-x

CrRhAs: a member of a large family of metallic kagome antiferromagnets

Authors: Y. N. Huang, Harald O. Jeschke, Igor I. Mazin

Abstract: Kagome lattice materials are an important platform for highly frustrated magnetism as well as for a plethora of phenomena resulting from flat bands, Dirac cones and van Hove singularities in their electronic structures. We study the little known metallic magnet CrRhAs, which belongs to a vast family of materials that include $3d$, $4f$ and $5f$ magnetic elements, as well as numerous nonmagnetic me… ▽ More Kagome lattice materials are an important platform for highly frustrated magnetism as well as for a plethora of phenomena resulting from flat bands, Dirac cones and van Hove singularities in their electronic structures. We study the little known metallic magnet CrRhAs, which belongs to a vast family of materials that include $3d$, $4f$ and $5f$ magnetic elements, as well as numerous nonmagnetic metals and insulators. Using noncollinear spin density functional calculations (mostly spin spirals), we extract a model magnetic Hamiltonian for CrRhAs. While it is dominated by an antiferromagnetic second nearest neighbor coupling in the kagome plane, the metallic nature of the compound leads to numerous nonzero longer range couplings and to important ring exchange terms. We analyze this Hamiltonian and find unusual ground states which are dominated by nearly isolated antiferromagnetic triangles that adopt 120$^\circ$ order either with positive or with negative vector chirality. We discuss the connection to the few known experimental facts about CrRhAs. Finally, we give a brief survey of other interesting magnetic members of this family of kagome compounds. △ Less

Submitted 8 November, 2022; originally announced November 2022.

Comments: 16 pages

Journal ref: npj Quantum Mater. 8, 32 (2023)

arXiv:2211.03231 [pdf, other]

A Spectral Analysis of Graph Neural Networks on Dense and Sparse Graphs

Authors: Luana Ruiz, Ningyuan Huang, Soledad Villar

Abstract: In this work we propose a random graph model that can produce graphs at different levels of sparsity. We analyze how sparsity affects the graph spectra, and thus the performance of graph neural networks (GNNs) in node classification on dense and sparse graphs. We compare GNNs with spectral methods known to provide consistent estimators for community detection on dense graphs, a closely related tas… ▽ More In this work we propose a random graph model that can produce graphs at different levels of sparsity. We analyze how sparsity affects the graph spectra, and thus the performance of graph neural networks (GNNs) in node classification on dense and sparse graphs. We compare GNNs with spectral methods known to provide consistent estimators for community detection on dense graphs, a closely related task. We show that GNNs can outperform spectral methods on sparse graphs, and illustrate these results with numerical examples on both synthetic and real graphs. △ Less

Submitted 13 September, 2023; v1 submitted 6 November, 2022; originally announced November 2022.

Comments: Extended version of ICASSP 2024 submission

arXiv:2210.15083 [pdf, other]

Deep Learning is Provably Robust to Symmetric Label Noise

Authors: Carey E. Priebe, Ningyuan Huang, Soledad Villar, Cong Mu, Li Chen

Abstract: Deep neural networks (DNNs) are capable of perfectly fitting the training data, including memorizing noisy data. It is commonly believed that memorization hurts generalization. Therefore, many recent works propose mitigation strategies to avoid noisy data or correct memorization. In this work, we step back and ask the question: Can deep learning be robust against massive label noise without any mi… ▽ More Deep neural networks (DNNs) are capable of perfectly fitting the training data, including memorizing noisy data. It is commonly believed that memorization hurts generalization. Therefore, many recent works propose mitigation strategies to avoid noisy data or correct memorization. In this work, we step back and ask the question: Can deep learning be robust against massive label noise without any mitigation? We provide an affirmative answer for the case of symmetric label noise: We find that certain DNNs, including under-parameterized and over-parameterized models, can tolerate massive symmetric label noise up to the information-theoretic threshold. By appealing to classical statistical theory and universal consistency of DNNs, we prove that for multiclass classification, $L_1$-consistent DNN classifiers trained under symmetric label noise can achieve Bayes optimality asymptotically if the label noise probability is less than $\frac{K-1}{K}$, where $K \ge 2$ is the number of classes. Our results show that for symmetric label noise, no mitigation is necessary for $L_1$-consistent estimators. We conjecture that for general label noise, mitigation strategies that make use of the noisy data will outperform those that ignore the noisy data. △ Less

Submitted 26 October, 2022; originally announced October 2022.

arXiv:2209.13360 [pdf, other]

Draw Your Art Dream: Diverse Digital Art Synthesis with Multimodal Guided Diffusion

Authors: Nisha Huang, Fan Tang, Weiming Dong, Changsheng Xu

Abstract: Digital art synthesis is receiving increasing attention in the multimedia community because of engaging the public with art effectively. Current digital art synthesis methods usually use single-modality inputs as guidance, thereby limiting the expressiveness of the model and the diversity of generated results. To solve this problem, we propose the multimodal guided artwork diffusion (MGAD) model,… ▽ More Digital art synthesis is receiving increasing attention in the multimedia community because of engaging the public with art effectively. Current digital art synthesis methods usually use single-modality inputs as guidance, thereby limiting the expressiveness of the model and the diversity of generated results. To solve this problem, we propose the multimodal guided artwork diffusion (MGAD) model, which is a diffusion-based digital artwork generation approach that utilizes multimodal prompts as guidance to control the classifier-free diffusion model. Additionally, the contrastive language-image pretraining (CLIP) model is used to unify text and image modalities. Extensive experimental results on the quality and quantity of the generated digital art paintings confirm the effectiveness of the combination of the diffusion model and multimodal guidance. Code is available at https://github.com/haha-lisa/MGAD-multimodal-guided-artwork-diffusion. △ Less

Submitted 28 September, 2022; v1 submitted 27 September, 2022; originally announced September 2022.

Comments: Accepted by ACM MM 2022

arXiv:2209.12054 [pdf, other]

From Local to Global: Spectral-Inspired Graph Neural Networks

Authors: Ningyuan Huang, Soledad Villar, Carey E. Priebe, Da Zheng, Chengyue Huang, Lin Yang, Vladimir Braverman

Abstract: Graph Neural Networks (GNNs) are powerful deep learning methods for Non-Euclidean data. Popular GNNs are message-passing algorithms (MPNNs) that aggregate and combine signals in a local graph neighborhood. However, shallow MPNNs tend to miss long-range signals and perform poorly on some heterophilous graphs, while deep MPNNs can suffer from issues like over-smoothing or over-squashing. To mitigate… ▽ More Graph Neural Networks (GNNs) are powerful deep learning methods for Non-Euclidean data. Popular GNNs are message-passing algorithms (MPNNs) that aggregate and combine signals in a local graph neighborhood. However, shallow MPNNs tend to miss long-range signals and perform poorly on some heterophilous graphs, while deep MPNNs can suffer from issues like over-smoothing or over-squashing. To mitigate such issues, existing works typically borrow normalization techniques from training neural networks on Euclidean data or modify the graph structures. Yet these approaches are not well-understood theoretically and could increase the overall computational complexity. In this work, we draw inspirations from spectral graph embedding and propose $\texttt{PowerEmbed}$ -- a simple layer-wise normalization technique to boost MPNNs. We show $\texttt{PowerEmbed}$ can provably express the top-$k$ leading eigenvectors of the graph operator, which prevents over-smoothing and is agnostic to the graph topology; meanwhile, it produces a list of representations ranging from local features to global signals, which avoids over-squashing. We apply $\texttt{PowerEmbed}$ in a wide range of simulated and real graphs and demonstrate its competitive performance, particularly for heterophilous graphs. △ Less

Submitted 4 November, 2022; v1 submitted 24 September, 2022; originally announced September 2022.

Comments: Accepted for publication at the NeurIPS 2022 GLFrontiers Workshop

arXiv:2208.14069 [pdf, ps, other]

Variance-Based Bregman Extragradient Algorithm with Line Search for Solving Stochastic Variational Inequalities

Authors: Xian-Jun Long, Yue-Hong He, Nan-Jing Huang

Abstract: The main purpose of this paper is to propose a variance-based Bregman extragradient algorithm with line search for solving stochastic variational inequalities, which is robust with respect an unknown Lipschitz constant. We prove the almost sure convergence of the algorithm by a more concise and effective method instead of using the supermartingale convergence theorem. Furthermore, we obtain not on… ▽ More The main purpose of this paper is to propose a variance-based Bregman extragradient algorithm with line search for solving stochastic variational inequalities, which is robust with respect an unknown Lipschitz constant. We prove the almost sure convergence of the algorithm by a more concise and effective method instead of using the supermartingale convergence theorem. Furthermore, we obtain not only the convergence rate $\mathcal{O}(1/k)$ with the gap function when $X$ is bounded, but also the same convergence rate in terms of the natural residual function when $X$ is unbounded. Under the Minty variational inequality condition, we derive the iteration complexity $\mathcal{O}(1/\varepsilon)$ and the oracle complexity $\mathcal{O}(1/\varepsilon^2)$ in both cases. Finally, some numerical results demonstrate the superiority of the proposed algorithm. △ Less

Submitted 30 August, 2022; originally announced August 2022.

arXiv:2208.07499 [pdf, ps, other]

doi 10.1137/22M1515884

On GSOR, the Generalized Successive Overrelaxation Method for Double Saddle-Point Problems

Authors: Na Huang, Yu-Hong Dai, Dominique Orban, Michael A. Saunders

Abstract: We consider the generalized successive overrelaxation (GSOR) method for solving a class of block three-by-three saddle-point problems. Based on the necessary and sufficient conditions for all roots of a real cubic polynomial to have modulus less than one, we derive convergence results under reasonable assumptions. We also analyze a class of block lower triangular preconditioners induced from GSOR… ▽ More We consider the generalized successive overrelaxation (GSOR) method for solving a class of block three-by-three saddle-point problems. Based on the necessary and sufficient conditions for all roots of a real cubic polynomial to have modulus less than one, we derive convergence results under reasonable assumptions. We also analyze a class of block lower triangular preconditioners induced from GSOR and derive explicit and sharp spectral bounds for the preconditioned matrices. We report numerical experiments on test problems from the liquid crystal director model and the coupled Stokes-Darcy flow, demonstrating the usefulness of GSOR. △ Less

Submitted 15 August, 2022; originally announced August 2022.

Report number: G-2022-35 MSC Class: 65F10; 65F50

Journal ref: SIAM Journal on Scientific Computing, 2023

arXiv:2207.11937 [pdf]

doi 10.1103/PhysRevD.107.042004

A measurement of the mean central optical depth of galaxy clusters via the pairwise kinematic Sunyaev-Zel'dovich effect with SPT-3G and DES

Authors: E. Schiappucci, F. Bianchini, M. Aguena, M. Archipley, L. Balkenhol, L. E. Bleem, P. Chaubal, T. M. Crawford, S. Grandis, Y. Omori, C. L. Reichardt, E. Rozo, E. S. Rykoff, C. To, T. M. C. Abbott, P. A. R. Ade, O. Alves, A. J. Anderson, F. Andrade-Oliveira, J. Annis, J. S. Avva, D. Bacon, K. Benabed, A. N. Bender, B. A. Benson , et al. (117 additional authors not shown)

Abstract: We infer the mean optical depth of a sample of optically-selected galaxy clusters from the Dark Energy Survey (DES) via the pairwise kinematic Sunyaev-Zel'dovich (kSZ) effect. The pairwise kSZ signal between pairs of clusters drawn from the DES Year-3 cluster catalog is detected at $4.1 σ$ in cosmic microwave background (CMB) temperature maps from two years of observations with the SPT-3G camera o… ▽ More We infer the mean optical depth of a sample of optically-selected galaxy clusters from the Dark Energy Survey (DES) via the pairwise kinematic Sunyaev-Zel'dovich (kSZ) effect. The pairwise kSZ signal between pairs of clusters drawn from the DES Year-3 cluster catalog is detected at $4.1 σ$ in cosmic microwave background (CMB) temperature maps from two years of observations with the SPT-3G camera on the South Pole Telescope. After cuts, there are 24,580 clusters in the $\sim 1,400$ deg$^2$ of the southern sky observed by both experiments. We infer the mean optical depth of the cluster sample with two techniques. The optical depth inferred from the pairwise kSZ signal is $\barτ_e = (2.97 \pm 0.73) \times 10^{-3}$, while that inferred from the thermal SZ signal is $\barτ_e = (2.51 \pm 0.55^{\text{stat}} \pm 0.15^{\rm syst}) \times 10^{-3}$. The two measures agree at $0.6 σ$. We perform a suite of systematic checks to test the robustness of the analysis. △ Less

Submitted 16 June, 2023; v1 submitted 25 July, 2022; originally announced July 2022.

arXiv:2206.13163 [pdf, other]

Endowing Language Models with Multimodal Knowledge Graph Representations

Authors: Ningyuan Huang, Yash R. Deshpande, Yibo Liu, Houda Alberts, Kyunghyun Cho, Clara Vania, Iacer Calixto

Abstract: We propose a method to make natural language understanding models more parameter efficient by storing knowledge in an external knowledge graph (KG) and retrieving from this KG using a dense index. Given (possibly multilingual) downstream task data, e.g., sentences in German, we retrieve entities from the KG and use their multimodal representations to improve downstream task performance. We use the… ▽ More We propose a method to make natural language understanding models more parameter efficient by storing knowledge in an external knowledge graph (KG) and retrieving from this KG using a dense index. Given (possibly multilingual) downstream task data, e.g., sentences in German, we retrieve entities from the KG and use their multimodal representations to improve downstream task performance. We use the recently released VisualSem KG as our external knowledge repository, which covers a subset of Wikipedia and WordNet entities, and compare a mix of tuple-based and graph-based algorithms to learn entity and relation representations that are grounded on the KG multimodal information. We demonstrate the usefulness of the learned entity representations on two downstream tasks, and show improved performance on the multilingual named entity recognition task by $0.3\%$--$0.7\%$ F1, while we achieve up to $2.5\%$ improvement in accuracy on the visual sense disambiguation task. All our code and data are available in: \url{https://github.com/iacercalixto/visualsem-kg}. △ Less

Submitted 27 June, 2022; originally announced June 2022.

Comments: 14 pages with appendix, 2 figures, 15 tables

MSC Class: 68T50 ACM Class: I.2.7; I.2.10; I.2.4

arXiv:2206.12401 [pdf, other]

doi 10.1145/3534678.3539392

Debiasing Learning for Membership Inference Attacks Against Recommender Systems

Authors: Zihan Wang, Na Huang, Fei Sun, Pengjie Ren, Zhumin Chen, Hengliang Luo, Maarten de Rijke, Zhaochun Ren

Abstract: Learned recommender systems may inadvertently leak information about their training data, leading to privacy violations. We investigate privacy threats faced by recommender systems through the lens of membership inference. In such attacks, an adversary aims to infer whether a user's data is used to train the target recommender. To achieve this, previous work has used a shadow recommender to derive… ▽ More Learned recommender systems may inadvertently leak information about their training data, leading to privacy violations. We investigate privacy threats faced by recommender systems through the lens of membership inference. In such attacks, an adversary aims to infer whether a user's data is used to train the target recommender. To achieve this, previous work has used a shadow recommender to derive training data for the attack model, and then predicts the membership by calculating difference vectors between users' historical interactions and recommended items. State-of-the-art methods face two challenging problems: (1) training data for the attack model is biased due to the gap between shadow and target recommenders, and (2) hidden states in recommenders are not observational, resulting in inaccurate estimations of difference vectors. To address the above limitations, we propose a Debiasing Learning for Membership Inference Attacks against recommender systems (DL-MIA) framework that has four main components: (1) a difference vector generator, (2) a disentangled encoder, (3) a weight estimator, and (4) an attack model. To mitigate the gap between recommenders, a variational auto-encoder (VAE) based disentangled encoder is devised to identify recommender invariant and specific features. To reduce the estimation bias, we design a weight estimator, assigning a truth-level score for each difference vector to indicate estimation accuracy. We evaluate DL-MIA against both general recommenders and sequential recommenders on three real-world datasets. Experimental results show that DL-MIA effectively alleviates training and estimation biases simultaneously, and achieves state-of-the-art attack performance. △ Less

Submitted 28 June, 2022; v1 submitted 24 June, 2022; originally announced June 2022.

Comments: Accepted by KDD 2022

arXiv:2206.02951 [pdf, ps, other]

doi 10.13140/RG.2.2.19916.08327

A semi-conjugate gradient method for solving unsymmetric positive definite linear systems

Authors: Na Huang, Yu-Hong Dai, Dominique Orban, Michael A Saunders

Abstract: The conjugate gradient (CG) method is a classic Krylov subspace method for solving symmetric positive definite linear systems. We introduce an analogous semi-conjugate gradient (SCG) method for unsymmetric positive definite linear systems. Unlike CG, SCG requires the solution of a lower triangular linear system to produce each semi-conjugate direction. We prove that SCG is theoretically equivalent… ▽ More The conjugate gradient (CG) method is a classic Krylov subspace method for solving symmetric positive definite linear systems. We introduce an analogous semi-conjugate gradient (SCG) method for unsymmetric positive definite linear systems. Unlike CG, SCG requires the solution of a lower triangular linear system to produce each semi-conjugate direction. We prove that SCG is theoretically equivalent to the full orthogonalization method (FOM), which is based on the Arnoldi process and converges in a finite number of steps. Because SCG's triangular system increases in size each iteration, we study a sliding window implementation (SWI) to improve efficiency, and show that the directions produced are still locally semi-conjugate. A counterexample illustrates that SWI is different from the direct incomplete orthogonalization method (DIOM), which is FOM with a sliding window. Numerical experiments from the convection-diffusion equation and other applications show that SCG is robust and that the sliding window implementation SWI allows SCG to solve large systems efficiently. △ Less

Submitted 8 June, 2022; v1 submitted 6 June, 2022; originally announced June 2022.

Report number: G-2022-25 MSC Class: 15A06; 65F10; 65F25; 65F50

arXiv:2205.14299 [pdf, other]

Deep Learning with Label Noise: A Hierarchical Approach

Authors: Li Chen, Ningyuan Huang, Cong Mu, Hayden S. Helm, Kate Lytvynets, Weiwei Yang, Carey E. Priebe

Abstract: Deep neural networks are susceptible to label noise. Existing methods to improve robustness, such as meta-learning and regularization, usually require significant change to the network architecture or careful tuning of the optimization procedure. In this work, we propose a simple hierarchical approach that incorporates a label hierarchy when training the deep learning models. Our approach requires… ▽ More Deep neural networks are susceptible to label noise. Existing methods to improve robustness, such as meta-learning and regularization, usually require significant change to the network architecture or careful tuning of the optimization procedure. In this work, we propose a simple hierarchical approach that incorporates a label hierarchy when training the deep learning models. Our approach requires no change of the network architecture or the optimization procedure. We investigate our hierarchical network through a wide range of simulated and real datasets and various label noise types. Our hierarchical approach improves upon regular deep neural networks in learning with label noise. Combining our hierarchical approach with pre-trained models achieves state-of-the-art performance in real-world noisy datasets. △ Less

Submitted 27 May, 2022; originally announced May 2022.

Comments: 8 pages, 7 figures

arXiv:2204.13863 [pdf, other]

Indoor 3-Dimensional Visible Light Positioning: Error Metric and LED Layout Optimization

Authors: Jiaojiao Xu, Nuo Huang, Chen Gong

Abstract: We consider 3-dimensional (3D) visible light positioning (VLP) based on smartphone camera in an indoor scenario. Based on the positioning model in the quantized pixel-domain, we characterize the 3D normalized positioning error metric (NPEM) through the partial derivative of the positioning function, and evaluate the NPEM for horizontal and non-horizontal receiver camera positions. Moreover, under… ▽ More We consider 3-dimensional (3D) visible light positioning (VLP) based on smartphone camera in an indoor scenario. Based on the positioning model in the quantized pixel-domain, we characterize the 3D normalized positioning error metric (NPEM) through the partial derivative of the positioning function, and evaluate the NPEM for horizontal and non-horizontal receiver camera positions. Moreover, under horizontal receiver terminal position, we explore the relationship between the NPEM and the light-emitting diode (LED) cell layout, approximate the relationship between the NPEM and the number of LEDs captured by the camera, and evaluate the approximation accuracy according to the simulated positioning error. Based on the approximation results, we optimize the LED transmitter cell layout to minimize NPEM assuming structured square cell layouts with certain distance parameters. △ Less

Submitted 28 April, 2022; originally announced April 2022.

arXiv:2204.02931 [pdf, other]

doi 10.1021/acs.nanolett.2c00212

Observation of giant surface second harmonic generation coupled to nematic orders in the van der Waals antiferromagnet FePS$_3$

Authors: Zhuoliang Ni, Nan Huang, Amanda V. Haglund, David G. Mandrus, Liang Wu

Abstract: Second harmonic generation has been applied to study lattice, electronic and magnetic proprieties in atomically thin materials. However, inversion symmetry breaking is usually required for the materials to generate a large signal. In this work, we report a giant second-harmonic generation that arises below the Néel temperature in few-layer centrosymmetric FePS$_3$. Layer-dependent study indicates… ▽ More Second harmonic generation has been applied to study lattice, electronic and magnetic proprieties in atomically thin materials. However, inversion symmetry breaking is usually required for the materials to generate a large signal. In this work, we report a giant second-harmonic generation that arises below the Néel temperature in few-layer centrosymmetric FePS$_3$. Layer-dependent study indicates the detected signal is from the second-order nonlinearity of the surface. The magnetism-induced surface second-harmonic response is two orders of magnitude larger than those reported in other magnetic systems, with the surface nonlinear susceptibility reaching 0.08--0.13 nm$^2$/V in 2 L--5 L samples. By combing linear dichroism and second harmonic generation experiments, we further confirm the giant second-harmonic generation is coupled to nematic orders formed by the three possible Zigzag antiferromagnetic domains. Our study shows that the surface second-harmonic generation is also a sensitive tool to study antiferromagnetic states in centrosymmetric atomically thin materials. △ Less

Submitted 6 April, 2022; originally announced April 2022.

Comments: to appear in Nano Letters

Journal ref: Nano Lett. 22, 8, 3283-3288 (2022)

arXiv:2203.16567 [pdf, other]

doi 10.1103/PhysRevD.106.042011

Searching for axion-like time-dependent cosmic birefringence with data from SPT-3G

Authors: K. R. Ferguson, A. J. Anderson, N. Whitehorn, P. A. R. Ade, M. Archipley, J. S. Avva, L. Balkenhol, K. Benabed, A. N. Bender, B. A. Benson, F. Bianchini, L. E. Bleem, F. R. Bouchet, L. Bryant, E. Camphuis, J. E. Carlstrom, T. W. Cecil, C. L. Chang, P. Chaubal, P. M. Chichura, T. -L. Chou, T. M. Crawford, A. Cukierman, C. Daley, T. de Haan , et al. (56 additional authors not shown)

Abstract: Ultralight axionlike particles (ALPs) are compelling dark matter candidates because of their potential to resolve small-scale discrepancies between $Λ$CDM predictions and cosmological observations. Axion-photon coupling induces a polarization rotation in linearly polarized photons traveling through an ALP field; thus, as the local ALP dark matter field oscillates in time, distant static polarized… ▽ More Ultralight axionlike particles (ALPs) are compelling dark matter candidates because of their potential to resolve small-scale discrepancies between $Λ$CDM predictions and cosmological observations. Axion-photon coupling induces a polarization rotation in linearly polarized photons traveling through an ALP field; thus, as the local ALP dark matter field oscillates in time, distant static polarized sources will appear to oscillate with a frequency proportional to the ALP mass. We use observations of the cosmic microwave background from SPT-3G, the current receiver on the South Pole Telescope, to set upper limits on the value of the axion-photon coupling constant $g_{φγ}$ over the approximate mass range $10^{-22} - 10^{-19}$ eV, corresponding to oscillation periods from 12 hours to 100 days. For periods between 1 and 100 days ($4.7 \times 10^{-22} \text{ eV} \leq m_φ\leq 4.7 \times 10^{-20} \text{ eV}$), where the limit is approximately constant, we set a median 95% C.L. upper limit on the amplitude of on-sky polarization rotation of 0.071 deg. Assuming that dark matter comprises a single ALP species with a local dark matter density of $0.3\text{ GeV/cm}^3$, this corresponds to $g_{φγ} < 1.18 \times 10^{-12}\text{ GeV}^{-1} \times \left( \frac{m_φ}{1.0 \times 10^{-21} \text{ eV}} \right)$. These new limits represent an improvement over the previous strongest limits set using the same effect by a factor of ~3.8. △ Less

Submitted 29 August, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

Comments: 16 pages, 5 figures. Accepted for publication in Physical Review D

Journal ref: Phys. Rev. D 106, 042011 (2022)

arXiv:2202.04405 [pdf, other]

Time-Frequency Mask Aware Bi-directional LSTM: A Deep Learning Approach for Underwater Acoustic Signal Separation

Authors: Jie Chen, Chang Liu, Jiawu Xie, Jie An, Nan Huang

Abstract: The underwater acoustic signals separation is a key technique for the underwater communications. The existing methods are mostly model-based, and could not accurately characterise the practical underwater acoustic communication environment. They are only suitable for binary signal separation, but cannot handle multivariate signal separation. On the other hand, the recurrent neural network (RNN) sh… ▽ More The underwater acoustic signals separation is a key technique for the underwater communications. The existing methods are mostly model-based, and could not accurately characterise the practical underwater acoustic communication environment. They are only suitable for binary signal separation, but cannot handle multivariate signal separation. On the other hand, the recurrent neural network (RNN) shows powerful capability in extracting the features of the temporal sequences. Inspired by this, in this paper, we present a data-driven approach for underwater acoustic signals separation using deep learning technology. We use the Bi-directional Long Short-Term Memory (Bi-LSTM) to explore the features of Time-Frequency (T-F) mask, and propose a T-F mask aware Bi-LSTM for signal separation. Taking advantage of the sparseness of the T-F image, the designed Bi-LSTM network is able to extract the discriminative features for separation, which further improves the separation performance. In particular, this method breaks through the limitations of the existing methods, not only achieves good results in multivariate separation, but also effectively separates signals when mixed with 40dB Gaussian noise signals. The experimental results show that this method can achieve a $97\%$ guarantee ratio (PSR), and the average similarity coefficient of the multivariate signal separation is stable above 0.8 under high noise conditions. △ Less

Submitted 9 February, 2022; originally announced February 2022.

Comments: 28 pages, 14 figures

arXiv:2202.01406 [pdf, other]

doi 10.3847/1538-4357/ac89ec

Asteroid Measurements at Millimeter Wavelengths with the South Pole Telescope

Authors: P. M. Chichura, A. Foster, C. Patel, N. Ossa-Jaen, P. A. R. Ade, Z. Ahmed, A. J. Anderson, M. Archipley, J. E. Austermann, J. S. Avva, L. Balkenhol, P. S. Barry, R. Basu Thakur, J. A. Beall, K. Benabed, A. N. Bender, B. A. Benson, F. Bianchini, L. E. Bleem, F. R. Bouchet, L. Bryant, K. Byrum, J. E. Carlstrom, F. W. Carter, T. W. Cecil , et al. (119 additional authors not shown)

Abstract: We present the first measurements of asteroids in millimeter wavelength (mm) data from the South Pole Telescope (SPT), which is used primarily to study the cosmic microwave background (CMB). We analyze maps of two $\sim270$ deg$^2$ sky regions near the ecliptic plane, each observed with the SPTpol camera $\sim100$ times over one month. We subtract the mean of all maps of a given field, removing st… ▽ More We present the first measurements of asteroids in millimeter wavelength (mm) data from the South Pole Telescope (SPT), which is used primarily to study the cosmic microwave background (CMB). We analyze maps of two $\sim270$ deg$^2$ sky regions near the ecliptic plane, each observed with the SPTpol camera $\sim100$ times over one month. We subtract the mean of all maps of a given field, removing static sky signal, and then average the mean-subtracted maps at known asteroid locations. We detect three asteroids$\text{ -- }$(324) Bamberga, (13) Egeria, and (22) Kalliope$\text{ -- }$with signal-to-noise ratios (S/N) of 11.2, 10.4, and 6.1, respectively, at 2.0 mm (150 GHz); we also detect (324) Bamberga with S/N of 4.1 at 3.2 mm (95 GHz). We place constraints on these asteroids' effective emissivities, brightness temperatures, and light curve modulation amplitude. Our flux density measurements of (324) Bamberga and (13) Egeria roughly agree with predictions, while our measurements of (22) Kalliope suggest lower flux, corresponding to effective emissivities of $0.66 \pm 0.11$ at 2.0 mm and $<0.47$ at 3.2mm. We predict the asteroids detectable in other SPT datasets and find good agreement with detections of (772) Tanete and (1093) Freda in recent data from the SPT-3G camera, which has $\sim10 \times$ the mapping speed of SPTpol. This work is the first focused analysis of asteroids in data from CMB surveys, and it demonstrates we can repurpose historic and future datasets for asteroid studies. Future SPT measurements can help constrain the distribution of surface properties over a larger asteroid population. △ Less

Submitted 21 April, 2023; v1 submitted 2 February, 2022; originally announced February 2022.

Comments: 21 pages, 9 figures

Journal ref: 2022 ApJ 936 173

arXiv:2201.07083 [pdf, other]

doi 10.1109/ICASSP39728.2021.9413523

A Short Tutorial on The Weisfeiler-Lehman Test And Its Variants

Authors: Ningyuan Huang, Soledad Villar

Abstract: Graph neural networks are designed to learn functions on graphs. Typically, the relevant target functions are invariant with respect to actions by permutations. Therefore the design of some graph neural network architectures has been inspired by graph-isomorphism algorithms. The classical Weisfeiler-Lehman algorithm (WL) -- a graph-isomorphism test based on color refinement -- became relevant to t… ▽ More Graph neural networks are designed to learn functions on graphs. Typically, the relevant target functions are invariant with respect to actions by permutations. Therefore the design of some graph neural network architectures has been inspired by graph-isomorphism algorithms. The classical Weisfeiler-Lehman algorithm (WL) -- a graph-isomorphism test based on color refinement -- became relevant to the study of graph neural networks. The WL test can be generalized to a hierarchy of higher-order tests, known as $k$-WL. This hierarchy has been used to characterize the expressive power of graph neural networks, and to inspire the design of graph neural network architectures. A few variants of the WL hierarchy appear in the literature. The goal of this short note is pedagogical and practical: We explain the differences between the WL and folklore-WL formulations, with pointers to existing discussions in the literature. We illuminate the differences between the formulations by visualizing an example. △ Less

Submitted 1 November, 2022; v1 submitted 18 January, 2022; originally announced January 2022.

Journal ref: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021

arXiv:2201.06995 [pdf, ps, other]

Improved Receivers for Optical Wireless OFDM: An Information Theoretic Perspective

Authors: Xiaozhen Liu, Jing Zhou, Nuo Huang, Wenyi Zhang

Abstract: We consider performance enhancement of asymmetrically-clipped optical orthogonal frequency division multiplexing (ACO-OFDM) and related optical OFDM schemes, which are variations of OFDM in intensity-modulated optical wireless communications. Unlike most existing studies on specific designs of improved receivers, this paper investigates information theoretic limits of all possible receivers. For i… ▽ More We consider performance enhancement of asymmetrically-clipped optical orthogonal frequency division multiplexing (ACO-OFDM) and related optical OFDM schemes, which are variations of OFDM in intensity-modulated optical wireless communications. Unlike most existing studies on specific designs of improved receivers, this paper investigates information theoretic limits of all possible receivers. For independent and identically distributed complex Gaussian inputs, we obtain an exact characterization of information rate of ACO-OFDM with improved receivers for all SNRs. It is proved that the high-SNR gain of improved receivers asymptotically achieve 1/4 bits per channel use, which is equivalent to 3 dB in electrical SNR or 1.5 dB in optical SNR; as the SNR decreases, the maximum achievable SNR gain of improved receivers decreases monotonically to a non-zero low-SNR limit, corresponding to an information rate gain of 36.3%. For practically used constellations, we derive an upper bound on the gain of improved receivers. Numerical results demonstrate that the upper bound can be approached to within 1 dB in optical SNR by combining existing improved receivers and coded modulation. We also show that our information theoretic analyses can be extended to Flip-OFDM and PAM-DMT. Our results imply that, for the considered schemes, improved receivers may reduce the gap to channel capacity significantly at low-to-moderate SNR. △ Less

Submitted 4 May, 2022; v1 submitted 18 January, 2022; originally announced January 2022.

Comments: 15 pages, 17 figures.To appear in IEEE Transactions on Communications

arXiv:2201.06233 [pdf, ps, other]

Robust equilibrium strategy for mean-variance-skewness portfolio selection problem

Authors: Jian-hao Kang, Nan-jing Huang, Zhihao Hu, Ben-Zhang Yang

Abstract: This paper considers a robust time-consistent mean-variance-skewness portfolio selection problem for an ambiguity-averse investor by taking into account wealth-dependent risk aversion and wealth-dependent skewness preference as well as model uncertainty. The robust equilibrium investment strategy and corresponding equilibrium value function are characterized for such a problem by employing an exte… ▽ More This paper considers a robust time-consistent mean-variance-skewness portfolio selection problem for an ambiguity-averse investor by taking into account wealth-dependent risk aversion and wealth-dependent skewness preference as well as model uncertainty. The robust equilibrium investment strategy and corresponding equilibrium value function are characterized for such a problem by employing an extended Hamilton-Jacobi-Bellman-Isaacs (HJBI) system via a game theoretic approach. Furthermore, the robust equilibrium investment strategy and corresponding equilibrium value function are obtained in semi-closed form for a special robust time-consistent mean-variance-skewness portfolio selection problem. Finally, some numerical experiments are provided to indicate several new findings concerned with the robust equilibrium investment strategy and the utility losses. △ Less

Submitted 17 January, 2022; originally announced January 2022.

arXiv:2201.03859 [pdf, other]

On Exploring Pose Estimation as an Auxiliary Learning Task for Visible-Infrared Person Re-identification

Authors: Yunqi Miao, Nianchang Huang, Xiao Ma, Qiang Zhang, Jungong Han

Abstract: Visible-infrared person re-identification (VI-ReID) has been challenging due to the existence of large discrepancies between visible and infrared modalities. Most pioneering approaches reduce intra-class variations and inter-modality discrepancies by learning modality-shared and ID-related features. However, an explicit modality-shared cue, i.e., body keypoints, has not been fully exploited in VI-… ▽ More Visible-infrared person re-identification (VI-ReID) has been challenging due to the existence of large discrepancies between visible and infrared modalities. Most pioneering approaches reduce intra-class variations and inter-modality discrepancies by learning modality-shared and ID-related features. However, an explicit modality-shared cue, i.e., body keypoints, has not been fully exploited in VI-ReID. Additionally, existing feature learning paradigms imposed constraints on either global features or partitioned feature stripes, which neglect the prediction consistency of global and part features. To address the above problems, we exploit Pose Estimation as an auxiliary learning task to assist the VI-ReID task in an end-to-end framework. By jointly training these two tasks in a mutually beneficial manner, our model learns higher quality modality-shared and ID-related features. On top of it, the learnings of global features and local features are seamlessly synchronized by Hierarchical Feature Constraint (HFC), where the former supervises the latter using the knowledge distillation strategy. Experimental results on two benchmark VI-ReID datasets show that the proposed method consistently improves state-of-the-art methods by significant margins. Specifically, our method achieves nearly 20$\%$ mAP improvements against the state-of-the-art method on the RegDB dataset. Our intriguing findings highlight the usage of auxiliary task learning in VI-ReID. △ Less

Submitted 23 February, 2022; v1 submitted 11 January, 2022; originally announced January 2022.

arXiv:2112.15286 [pdf, ps, other]

A new class of differential quasivariational inequalities with an application to a quasistatic viscoelastic frictional contact problem

Authors: Xu Chu, Tao Chen, Nan-jing Huang, Yi-bin Xiao

Abstract: The overarching goal of this paper is to introduce and investigate a new nonlinear system driven by a nonlinear differential equation, a history-dependent quasivariational inequality, and a parabolic variational inequality in Banach spaces. Such a system can be used to model quasistatic frictional contact problems for viscoelastic materials with long memory, damage and wear. By using the Banach fi… ▽ More The overarching goal of this paper is to introduce and investigate a new nonlinear system driven by a nonlinear differential equation, a history-dependent quasivariational inequality, and a parabolic variational inequality in Banach spaces. Such a system can be used to model quasistatic frictional contact problems for viscoelastic materials with long memory, damage and wear. By using the Banach fixed point theorem, we prove an existence and uniqueness theorem of solution for such a system under some mild conditions. As a novel application, we obtain a unique solvability of a quasistatic viscoelastic frictional contact problem with long memory, damage and wear. △ Less

Submitted 30 December, 2021; originally announced December 2021.

arXiv:2111.14224 [pdf]

doi 10.1109/LPT.2022.3185171

Scattered Image Reconstruction at Near-infrared Based on Spatial Modulation Instability

Authors: Yuan Liao, Lin Li, Zhaolu Wang, Nan Huang, Hongjun Liu

Abstract: We present a method of near-infrared image reconstruction based on spatial modulation instability in a photorefractive strontium barium niobate crystal. The conditions that lead to the formation of modulation instability at near-infrared are discussed depending on the theory of modulation instability gain. Experimental results of scattered image reconstruction at the 1064 nm wavelength show the ma… ▽ More We present a method of near-infrared image reconstruction based on spatial modulation instability in a photorefractive strontium barium niobate crystal. The conditions that lead to the formation of modulation instability at near-infrared are discussed depending on the theory of modulation instability gain. Experimental results of scattered image reconstruction at the 1064 nm wavelength show the maximum cross-correlation coefficient and cross-correlation gain are 0.57 and 2.09 respectively. This method is expected to be an aid for near-infrared imaging technologies. △ Less

Submitted 13 April, 2022; v1 submitted 28 November, 2021; originally announced November 2021.

arXiv:2111.02921 [pdf, ps, other]

Map-Assisted Constellation Design for mmWave WDM with OAM in Short-Range LOS Environment

Authors: Yuan Wang, Chen Gong, Nuo Huang, Zhengyuan Xu

Abstract: We consider a system that integrates positioning and single-user millimeter wave (mmWave) communication, where the communication part adopts wavelength division multiplexing (WDM) and orbital angular momentum (OAM). This paper addresses the multi-dimensional constellation design in shortrange line-of-sight (LOS) environment, with stable communication links. We propose a map-assisted method to quan… ▽ More We consider a system that integrates positioning and single-user millimeter wave (mmWave) communication, where the communication part adopts wavelength division multiplexing (WDM) and orbital angular momentum (OAM). This paper addresses the multi-dimensional constellation design in shortrange line-of-sight (LOS) environment, with stable communication links. We propose a map-assisted method to quantify the system parameters based on positions and reduce real-time computing overhead. We explore the possibility of using a few patterns in the maps, and investigate its performance loss. We first investigate the features of OAM beams, and find that the link gain ratio between any two sub-channels remains unchanged at some postions. Then, we prove that a fixed constellation can be adopted for the positions where the link gain matrices are sufficiently close to be proportional. Moreover, we prove that the system can adopt a fixed power vector to generate a multidimensional constellation if the difference between fixed power vector and optimal power vector is small. Finally, we figure out that the constellation design for all receiver locations can be represented by a few constellation sets. △ Less

Submitted 11 October, 2022; v1 submitted 4 November, 2021; originally announced November 2021.

arXiv:2110.13799 [pdf, other]

Neural PPO-Clip Attains Global Optimality: A Hinge Loss Perspective

Authors: Nai-Chieh Huang, Ping-Chun Hsieh, Kuo-Hao Ho, Hsuan-Yu Yao, Kai-Chun Hu, Liang-Chun Ouyang, I-Chen Wu

Abstract: Policy optimization is a fundamental principle for designing reinforcement learning algorithms, and one example is the proximal policy optimization algorithm with a clipped surrogate objective (PPO-Clip), which has been popularly used in deep reinforcement learning due to its simplicity and effectiveness. Despite its superior empirical performance, PPO-Clip has not been justified via theoretical p… ▽ More Policy optimization is a fundamental principle for designing reinforcement learning algorithms, and one example is the proximal policy optimization algorithm with a clipped surrogate objective (PPO-Clip), which has been popularly used in deep reinforcement learning due to its simplicity and effectiveness. Despite its superior empirical performance, PPO-Clip has not been justified via theoretical proof up to date. In this paper, we establish the first global convergence rate of PPO-Clip under neural function approximation. We identify the fundamental challenges of analyzing PPO-Clip and address them with the two core ideas: (i) We reinterpret PPO-Clip from the perspective of hinge loss, which connects policy improvement with solving a large-margin classification problem with hinge loss and offers a generalized version of the PPO-Clip objective. (ii) Based on the above viewpoint, we propose a two-step policy improvement scheme, which facilitates the convergence analysis by decoupling policy search from the complex neural policy parameterization with the help of entropic mirror descent and a regression-based policy update scheme. Moreover, our theoretical results provide the first characterization of the effect of the clipping mechanism on the convergence of PPO-Clip. Through experiments, we empirically validate the reinterpretation of PPO-Clip and the generalized objective with various classifiers on various RL benchmark tasks. △ Less

Submitted 31 August, 2022; v1 submitted 26 October, 2021; originally announced October 2021.

Comments: 33 pages, 1 figure

arXiv:2110.05110 [pdf, ps, other]

Asset Prices with Investor Protection and Survival Analysis of Shareholders in the Cross-Sectional Economy

Authors: Jia Yue, Ming-Hui Wang, Nan-Jing Huang, Ben-Zhang Yang

Abstract: In this paper, we consider a dynamic asset pricing model in a cross-sectional economy with two firms where a controlling shareholder cannot divert output in one firm with perfect investor protection for minority shareholders and where he can divert a fraction of output in the other firm with imperfect protection. After obtaining the parameters of asset prices by solving the shareholders' consumpti… ▽ More In this paper, we consider a dynamic asset pricing model in a cross-sectional economy with two firms where a controlling shareholder cannot divert output in one firm with perfect investor protection for minority shareholders and where he can divert a fraction of output in the other firm with imperfect protection. After obtaining the parameters of asset prices by solving the shareholders' consumption-portfolio problems in equilibrium, our model features the effect of investor protection and cross-section in the economy. Furthermore, some survival analysis of the shareholders is presented and sufficient conditions on extinction of the shareholders are given in either firm. Our numerical results are in line with some empirical evidence: (i) poorer investor protection in the cross-sectional economy enables the controlling shareholder to hold less shares of the firm with perfect protection and more shares of the firm with imperfect protection, decreases stock gross returns of both firms, increases stock volatilities of both firms, and decreases interest rates of the economy; (ii) compared with the economy with the single relative firm, for the firm with perfect protection, cross-section enables the controlling shareholder to hold less shares, decreases stock returns, increases stock volatilities slightly and decreases interest rates, while for the firm with imperfect protection, cross-section enables the controlling shareholder to hold more shares, increases stock returns and volatilities and increases interest rates. △ Less

Submitted 11 October, 2021; originally announced October 2021.

arXiv:2110.02349 [pdf, other]

doi 10.1103/PhysRevLett.127.187201

Direct imaging of antiferromagnetic domains and anomalous layer-dependent mirror symmetry breaking in atomically thin MnPS$_3$

Authors: Zhuoliang Ni, Huiqin Zhang, David Hopper, Amanda V. Haglund, Nan Huang, Deep Jariwala, Lee Bassett, David G. Mandrus, Eugene J. Mele, Charles L. Kane, Liang Wu

Abstract: We have developed a sensitive cryogenic second-harmonic generation microscopy to study a van der Waals antiferromagnet MnPS$_3$. We find that long-range Néel antiferromagnetic order develops from the bulk crystal down to the bilayer, while it is absent in the monolayer. Before entering the long-range antiferromagnetic ordered phase in all samples, an upturn of the second harmonic generation below… ▽ More We have developed a sensitive cryogenic second-harmonic generation microscopy to study a van der Waals antiferromagnet MnPS$_3$. We find that long-range Néel antiferromagnetic order develops from the bulk crystal down to the bilayer, while it is absent in the monolayer. Before entering the long-range antiferromagnetic ordered phase in all samples, an upturn of the second harmonic generation below 200 K indicates the formation of the short-range order and magneto-elastic coupling. We also directly image the two antiphase (180$^{\circ}$) antiferromagnetic domains and thermally-induced domain switching down to bilayer. An anomalous mirror symmetry breaking shows up in samples thinner than ten layers for the temperature both above and below the Néel temperature, which indicates a structural change in few-layer samples. Minimal change of the second harmonic generation polar patterns in strain tuning experiments indicate that the symmetry crossover at ten layers is most likely an intrinsic property of MnPS$_3$ instead of an extrinsic origin of substrate-induced strain. Our results show that second harmonic generation microscopy is a direct tool for studying antiferromagnetic domains in atomically thin materials, and opens a new way to study two-dimensional antiferromagnets. △ Less

Submitted 5 October, 2021; originally announced October 2021.

Comments: To appear in Phys. Rev. Lett

Journal ref: Phys. Rev. Lett. 127, 187201 (2021)

arXiv:2110.01568 [pdf]

doi 10.1021/acsnano.2c06253

Electron-beam induced emergence of mesoscopic ordering in layered MnPS$_{3}$

Authors: Kevin M. Roccapriore, Nan Huang, Mark P. Oxley, Vinit Sharma, Timothy Taylor, Swagata Acharya, Dimitar Pashov, Mikhail I. Katsnelson, David Mandrus, Janice L. Musfeldt, Sergei V. Kalinin

Abstract: Ordered mesoscale structures in 2D materials induced by small misorientations have opened pathways for a wide variety of novel electronic, ferroelectric, and quantum phenomena. Until now, the only mechanism to induce this periodic ordering was via mechanical rotations between the layers, with the periodicity of the resulting moiré pattern being directly related to twist angle. Here we report a fun… ▽ More Ordered mesoscale structures in 2D materials induced by small misorientations have opened pathways for a wide variety of novel electronic, ferroelectric, and quantum phenomena. Until now, the only mechanism to induce this periodic ordering was via mechanical rotations between the layers, with the periodicity of the resulting moiré pattern being directly related to twist angle. Here we report a fundamentally new mechanism for emergence of mesoscopic periodic patterns in multilayer sulfur-containing metal phosphorous trichalcogenide, MnPS$_{3}$, induced by the electron beam. The formation under the beam of periodic hexagonal patterns with several characteristic length scales, nucleation and transitions between the phases, and local dynamics are demonstrated. The associated mechanisms are attributed to the relative contraction of the layers caused by beam-induced sulphur vacancy formation with subsequent ordering and lattice parameter change. As a result, the plasmonic response of the system is locally altered, suggesting an element of control over plasmon resonances by electron beam patterning. We pose that harnessing this phenomenon provides both insight into fundamental physics of quantum materials and opens a pathway towards device applications by enabling controlled periodic potentials on the atomic scale. △ Less

Submitted 29 September, 2022; v1 submitted 4 October, 2021; originally announced October 2021.

Comments: Electron microscopy data and analysis codes are freely available here: https://github.com/kevinroccapriore/MnPS3

arXiv:2108.11687 [pdf, other]

Numerical Study on Beam-based Alignment of SXFEL Undulator Lattice

Authors: Liang Xu, Nanshun Huang, Qingmin Zhang, Duan Gu, Haixiao Deng

Abstract: The undulator line of the Shanghai soft X-ray Free-electron Laser facility (SXFEL) has very tight tolerances on the straightness of the electron beam trajectory. However, the beam trajectory cannot meet the lasing requirements due to the influence of beam position, launch angle and quadrupole offsets. Traditional mechanical alignment can only control the rms of offsets to about 100 $μ$m, which is… ▽ More The undulator line of the Shanghai soft X-ray Free-electron Laser facility (SXFEL) has very tight tolerances on the straightness of the electron beam trajectory. However, the beam trajectory cannot meet the lasing requirements due to the influence of beam position, launch angle and quadrupole offsets. Traditional mechanical alignment can only control the rms of offsets to about 100 $μ$m, which is far from reaching the requirement. Further orbit correction can be achieved by beam-based alignment (BBA) method based on electron energy variations. K modulation is used to determine whether the beam passes through the quadrupole magnetic center, and the Dispersion-Free Steering (DFS) method is used to calculate the offsets of quadrupole and BPM. In this paper, a detailed result of simulation is presented which demonstrates that the beam trajectory with rms and standard deviation ($σ$) less than 10 $μ$m can be obtained. △ Less

Submitted 26 August, 2021; originally announced August 2021.

arXiv:2107.07465 [pdf, other]

doi 10.1016/j.nima.2021.165774

Measurement of undulator section wakefield at the SXFEL test facility

Authors: He Liu, Hanxiang Yang, Nanshun Huang, Liang Xu, Zenggong Jiang, Duan Gu, Haixiao Deng, Bo Liu

Abstract: In free electron laser facilities, almost every kind of device will generate wakefield when an electron beam passes through it. Most of the wakefields are undesired and have a negative effect on the electron beam, which means a decrease of FEL performance. As for the SXFEL test facility, the sophisticated layout and the cumulative effect of such a long undulator section lead to an obvious wakefiel… ▽ More In free electron laser facilities, almost every kind of device will generate wakefield when an electron beam passes through it. Most of the wakefields are undesired and have a negative effect on the electron beam, which means a decrease of FEL performance. As for the SXFEL test facility, the sophisticated layout and the cumulative effect of such a long undulator section lead to an obvious wakefield, which is strong enough that can not be ignored. Based on two deflecting cavities at the entrance and the exit of the undulator section with corresponding profile monitors, we measured the wakefield of the undulator section. In this paper, we give the theoretical and simulation results of resistive wall wakefields which agree well with each other. In addition, the experimental and the simulation results of the overall undulator wakefield are given showing small difference. In order to explore the impact of this wakefield on FEL lasing, we give the simulation results of FEL with and without wakefield for comparison. There is almost no impact on 44 nm FEL in stage-1 of cascaded EEHG-HGHG mode, while the impact on 8.8 nm FEL in stage-2 becomes critical decreasing the pulse energy and peak power by 42% and 27% and broadening the bandwidth. △ Less

Submitted 29 August, 2021; v1 submitted 15 July, 2021; originally announced July 2021.

arXiv:2107.01502 [pdf, other]

doi 10.1007/978-3-030-32226-7_33

Pulmonary Vessel Segmentation based on Orthogonal Fused U-Net++ of Chest CT Images

Authors: Hejie Cui, Xinglong Liu, Ning Huang

Abstract: Pulmonary vessel segmentation is important for clinical diagnosis of pulmonary diseases, while is also challenging due to the complicated structure. In this work, we present an effective framework and refinement process of pulmonary vessel segmentation from chest computed tomographic (CT) images. The key to our approach is a 2.5D segmentation network applied from three orthogonal axes, which prese… ▽ More Pulmonary vessel segmentation is important for clinical diagnosis of pulmonary diseases, while is also challenging due to the complicated structure. In this work, we present an effective framework and refinement process of pulmonary vessel segmentation from chest computed tomographic (CT) images. The key to our approach is a 2.5D segmentation network applied from three orthogonal axes, which presents a robust and fully automated pulmonary vessel segmentation result with lower network complexity and memory usage compared to 3D networks. The slice radius is introduced to convolve the adjacent information of the center slice and the multi-planar fusion optimizes the presentation of intra- and inter- slice features. Besides, the tree-like structure of the pulmonary vessel is extracted in the post-processing process, which is used for segmentation refining and pruning. In the evaluation experiments, three fusion methods are tested and the most promising one is compared with the state-of-the-art 2D and 3D structures on 300 cases of lung images randomly selected from LIDC dataset. Our method outperforms other network structures by a large margin and achieves by far the highest average DICE score of 0.9272 and precision of 0.9310, as per our knowledge from the pulmonary vessel segmentation models available in the literature. △ Less

Submitted 3 July, 2021; originally announced July 2021.

Comments: Published in Medical Image Computing and Computer Assisted Intervention (MICCAI 2019)

MSC Class: 68T45; 68T07 ACM Class: I.2.10; J.3

arXiv:2107.00719 [pdf, other]

doi 10.1109/BIBM52615.2021.9669729

Toward Drug-Target Interaction Prediction via Ensemble Modeling and Transfer Learning

Authors: Po-Yu Kao, Shu-Min Kao, Nan-Lan Huang, Yen-Chu Lin

Abstract: Drug-target interaction (DTI) prediction plays a crucial role in drug discovery, and deep learning approaches have achieved state-of-the-art performance in this field. We introduce an ensemble of deep learning models (EnsembleDLM) for DTI prediction. EnsembleDLM only uses the sequence information of chemical compounds and proteins, and it aggregates the predictions from multiple deep neural networ… ▽ More Drug-target interaction (DTI) prediction plays a crucial role in drug discovery, and deep learning approaches have achieved state-of-the-art performance in this field. We introduce an ensemble of deep learning models (EnsembleDLM) for DTI prediction. EnsembleDLM only uses the sequence information of chemical compounds and proteins, and it aggregates the predictions from multiple deep neural networks. This approach not only achieves state-of-the-art performance in Davis and KIBA datasets but also reaches cutting-edge performance in the cross-domain applications across different bio-activity types and different protein classes. We also demonstrate that EnsembleDLM achieves a good performance (Pearson correlation coefficient and concordance index > 0.8) in the new domain with approximately 50% transfer learning data, i.e., the training set has twice as much data as the test set. △ Less

Submitted 18 November, 2021; v1 submitted 2 July, 2021; originally announced July 2021.

Comments: 8 pages, 1 figure, 10 tables

arXiv:2106.11202 [pdf, other]

doi 10.3847/1538-4365/ac374f

The Design and Integrated Performance of SPT-3G

Authors: J. A. Sobrin, A. J. Anderson, A. N. Bender, B. A. Benson, D. Dutcher, A. Foster, N. Goeckner-Wald, J. Montgomery, A. Nadolski, A. Rahlin, P. A. R. Ade, Z. Ahmed, E. Anderes, M. Archipley, J. E. Austermann, J. S. Avva, K. Aylor, L. Balkenhol, P. S. Barry, R. Basu Thakur, K. Benabed, F. Bianchini, L. E. Bleem, F. R. Bouchet, L. Bryant , et al. (98 additional authors not shown)

Abstract: SPT-3G is the third survey receiver operating on the South Pole Telescope dedicated to high-resolution observations of the cosmic microwave background (CMB). Sensitive measurements of the temperature and polarization anisotropies of the CMB provide a powerful dataset for constraining cosmology. Additionally, CMB surveys with arcminute-scale resolution are capable of detecting galaxy clusters, mill… ▽ More SPT-3G is the third survey receiver operating on the South Pole Telescope dedicated to high-resolution observations of the cosmic microwave background (CMB). Sensitive measurements of the temperature and polarization anisotropies of the CMB provide a powerful dataset for constraining cosmology. Additionally, CMB surveys with arcminute-scale resolution are capable of detecting galaxy clusters, millimeter-wave bright galaxies, and a variety of transient phenomena. The SPT-3G instrument provides a significant improvement in mapping speed over its predecessors, SPT-SZ and SPTpol. The broadband optics design of the instrument achieves a 430 mm diameter image plane across observing bands of 95 GHz, 150 GHz, and 220 GHz, with 1.2 arcmin FWHM beam response at 150 GHz. In the receiver, this image plane is populated with 2690 dual-polarization, tri-chroic pixels (~16000 detectors) read out using a 68X digital frequency-domain multiplexing readout system. In 2018, SPT-3G began a multiyear survey of 1500 deg$^{2}$ of the southern sky. We summarize the unique optical, cryogenic, detector, and readout technologies employed in SPT-3G, and we report on the integrated performance of the instrument. △ Less

Submitted 25 February, 2022; v1 submitted 21 June, 2021; originally announced June 2021.

Comments: 25 pages, 11 figures. Accepted for publication in ApJS

Report number: FERMILAB-PUB-21-291-AE

Journal ref: ApJS 258 42 (2022)

arXiv:2105.04744 [pdf, ps, other]

On Ekeland's variational principle for interval-valued functions with applications

Authors: Chuang-liang Zhang, Nan-jing Huang

Abstract: In this paper, we obtain a version of Ekeland's variational principle for interval-value functions by means of the Dancs-Hegedus-Medvegyev theorem [14]. We also derive two versions of Ekeland's variational principle involving the generalized Hukuhara Gateaux differentiability of interval-valued functions as well as a version of Ekeland's variational principle for interval-valued bifunctions. Final… ▽ More In this paper, we obtain a version of Ekeland's variational principle for interval-value functions by means of the Dancs-Hegedus-Medvegyev theorem [14]. We also derive two versions of Ekeland's variational principle involving the generalized Hukuhara Gateaux differentiability of interval-valued functions as well as a version of Ekeland's variational principle for interval-valued bifunctions. Finally, we apply these new versions of Ekeland's variational principle to fixed point theorems, to interval-valued optimization problems, to the interval-valued Mountain Pass Theorem, to noncooperative interval-valued games, and to interval-valued optimal control problems described by interval-valued differential equations. △ Less

Submitted 10 May, 2021; originally announced May 2021.

arXiv:2104.13528 [pdf, ps, other]

A Linear-quadratic Mean-Field Stochastic Stackelberg Differential Game with Random Exit Time

Authors: Zhun Gou, Nan-jing Huang, Ming-hui Wang

Abstract: In this paper, we investigate a new model of a linear-quadratic mean-field stochastic Stackelberg differential game with one leader and two followers, in which the leader is allowed to stop her strategy at a random time. Our overarching goal is to find the Stackelberg solution of the leader and followers for such a model. By employing the backward induction method, the state equation is divided in… ▽ More In this paper, we investigate a new model of a linear-quadratic mean-field stochastic Stackelberg differential game with one leader and two followers, in which the leader is allowed to stop her strategy at a random time. Our overarching goal is to find the Stackelberg solution of the leader and followers for such a model. By employing the backward induction method, the state equation is divided into two-stage equations. Moreover, by using the maximum principle and the verification theorem, the Stackelberg solution is obtained for such a model. △ Less

Submitted 6 June, 2021; v1 submitted 27 April, 2021; originally announced April 2021.

arXiv:2104.11543 [pdf, other]

doi 10.1109/TIP.2022.3214092

Middle-level Fusion for Lightweight RGB-D Salient Object Detection

Authors: Nianchang Huang, Qiang Zhang, Jungong Han

Abstract: Most existing lightweight RGB-D salient object detection (SOD) models are based on two-stream structure or single-stream structure. The former one first uses two sub-networks to extract unimodal features from RGB and depth images, respectively, and then fuses them for SOD. While, the latter one directly extracts multi-modal features from the input RGB-D images and then focuses on exploiting cross-… ▽ More Most existing lightweight RGB-D salient object detection (SOD) models are based on two-stream structure or single-stream structure. The former one first uses two sub-networks to extract unimodal features from RGB and depth images, respectively, and then fuses them for SOD. While, the latter one directly extracts multi-modal features from the input RGB-D images and then focuses on exploiting cross-level complementary information. However, two-stream structure based models inevitably require more parameters and single-stream structure based ones cannot well exploit the cross-modal complementary information since they ignore the modality difference. To address these issues, we propose to employ the middle-level fusion structure for designing lightweight RGB-D SOD model in this paper, which first employs two sub-networks to extract low- and middle-level unimodal features, respectively, and then fuses those extracted middle-level unimodal features for extracting corresponding high-level multi-modal features in the subsequent sub-network. Different from existing models, this structure can effectively exploit the cross-modal complementary information and significantly reduce the network's parameters, simultaneously. Therefore, a novel lightweight SOD model is designed, which contains a information-aware multi-modal feature fusion (IMFF) module for effectively capturing the cross-modal complementary information and a lightweight feature-level and decision-level feature fusion (LFDF) module for aggregating the feature-level and the decision-level saliency information in different stages with less parameters. Our proposed model has only 3.9M parameters and runs at 33 FPS. The experimental results on several benchmark datasets verify the effectiveness and superiority of the proposed method over some state-of-the-art methods. △ Less

Submitted 5 June, 2021; v1 submitted 23 April, 2021; originally announced April 2021.

Comments: 11 pages, 6 figures

Showing 51–100 of 217 results for author: Huang, N