-
Magnetospheric physics of magnetars
Authors:
H. Tong
Abstract:
Several aspects of the magnetospheric physics of magnetars are summarized, including: GeV and hard X-ray emissions of magnetars, timing behaviors during magnetar outbursts (soft X-ray observations), optical/IR observations of magnetars, radio emission of magnetars, and accreting magnetars. A unified picture for pulsars and magnetars is adopted, especially wind braking of magnetars, magnetar + fallback disk systems, twisted dipole magnetic field, and accreting low magnetic field magnetars. It is pointed out that magnetars are related to a broad range of astrophysical phenomena.
Submitted 10 September, 2023;
originally announced September 2023.
-
DeepScaler: Holistic Autoscaling for Microservices Based on Spatiotemporal GNN with Adaptive Graph Learning
Authors:
Chunyang Meng,
Shijie Song,
Haogang Tong,
Maolin Pan,
Yang Yu
Abstract:
Autoscaling functions provide the foundation for achieving elasticity in the modern cloud computing paradigm. They enable dynamic provisioning or de-provisioning of resources for cloud software services and applications without human intervention to adapt to workload fluctuations. However, autoscaling microservices is challenging due to various factors. In particular, complex, time-varying service dependencies are difficult to quantify accurately and can lead to cascading effects when allocating resources. This paper presents DeepScaler, a deep learning-based holistic autoscaling approach for microservices that focuses on coping with service dependencies to optimize service-level agreement (SLA) assurance and cost efficiency. DeepScaler employs (i) an expectation-maximization-based learning method to adaptively generate affinity matrices revealing service dependencies and (ii) an attention-based graph convolutional network to extract spatio-temporal features of microservices by aggregating neighbors' information of graph-structural data. Thus DeepScaler can capture more potential service dependencies and accurately estimate the resource requirements of all services under dynamic workloads. This allows DeepScaler to reconfigure the resources of the interacting services simultaneously in one resource provisioning operation, avoiding the cascading effect caused by service dependencies. Experimental results demonstrate that our method implements a more effective autoscaling mechanism for microservices that not only allocates resources accurately but also adapts to dependency changes, significantly reducing SLA violations by an average of 41% at lower costs.
Submitted 2 September, 2023;
originally announced September 2023.
-
Ensuring User-side Fairness in Dynamic Recommender Systems
Authors:
Hyunsik Yoo,
Zhichen Zeng,
Jian Kang,
Ruizhong Qiu,
David Zhou,
Zhining Liu,
Fei Wang,
Charlie Xu,
Eunice Chan,
Hanghang Tong
Abstract:
User-side group fairness is crucial for modern recommender systems, aiming to alleviate performance disparities among user groups defined by sensitive attributes like gender, race, or age. In the ever-evolving landscape of user-item interactions, continual adaptation to newly collected data is crucial for recommender systems to stay aligned with the latest user preferences. However, we observe that such continual adaptation often exacerbates performance disparities. This necessitates a thorough investigation into user-side fairness in dynamic recommender systems, an area that has been unexplored in the literature. This problem is challenging due to distribution shifts, frequent model updates, and non-differentiability of ranking metrics. To our knowledge, this paper presents the first principled study on ensuring user-side fairness in dynamic recommender systems. We start with theoretical analyses on fine-tuning vs. retraining, showing that the best practice is incremental fine-tuning with restart. Guided by our theoretical analyses, we propose FAir Dynamic rEcommender (FADE), an end-to-end fine-tuning framework to dynamically ensure user-side fairness over time. To overcome the non-differentiability of recommendation metrics in the fairness loss, we further introduce Differentiable Hit (DH) as an improvement over the recent NeuralNDCG method, not only alleviating its gradient vanishing issue but also achieving higher efficiency. In addition, we address the instability issue of the fairness loss by leveraging the competing nature between the recommendation loss and the fairness loss. Through extensive experiments on real-world datasets, we demonstrate that FADE effectively and efficiently reduces performance disparities with little sacrifice in the overall recommendation performance.
Submitted 31 March, 2024; v1 submitted 29 August, 2023;
originally announced August 2023.
-
Class-Imbalanced Graph Learning without Class Rebalancing
Authors:
Zhining Liu,
Ruizhong Qiu,
Zhichen Zeng,
Hyunsik Yoo,
David Zhou,
Zhe Xu,
Yada Zhu,
Kommy Weldemariam,
Jingrui He,
Hanghang Tong
Abstract:
Class imbalance is prevalent in real-world node classification tasks and poses great challenges for graph learning models. Most existing studies are rooted in a class-rebalancing (CR) perspective and address class imbalance with class-wise reweighting or resampling. In this work, we approach the root cause of class-imbalance bias from a topological paradigm. Specifically, we theoretically reveal two fundamental phenomena in the graph topology that greatly exacerbate the predictive bias stemming from class imbalance. On this basis, we devise a lightweight topological augmentation framework BAT to mitigate the class-imbalance bias without class rebalancing. Being orthogonal to CR, BAT can function as an efficient plug-and-play module that can be seamlessly combined with and significantly boost existing CR techniques. Systematic experiments on real-world imbalanced graph learning tasks show that BAT can deliver up to 46.27% performance gain and up to 72.74% bias reduction over existing techniques. Code, examples, and documentation are available at https://github.com/ZhiningLiu1998/BAT.
Submitted 19 May, 2024; v1 submitted 27 August, 2023;
originally announced August 2023.
-
Calliope-Net: Automatic Generation of Graph Data Facts via Annotated Node-link Diagrams
Authors:
Qing Chen,
Nan Chen,
Wei Shuai,
Guande Wu,
Zhe Xu,
Hanghang Tong,
Nan Cao
Abstract:
Graph or network data are widely studied in both data mining and visualization communities to review the relationship among different entities and groups. The data facts derived from graph visual analysis are important to help understand the social structures of complex data, especially for data journalism. However, it is challenging for data journalists to discover graph data facts and manually organize correlated facts around a meaningful topic due to the complexity of graph data and the difficulty of interpreting graph narratives. Therefore, we present an automatic graph facts generation system, Calliope-Net, which consists of a fact discovery module, a fact organization module, and a visualization module. It creates annotated node-link diagrams with facts automatically discovered and organized from network data. A novel layout algorithm is designed to present meaningful and visually appealing annotated graphs. We evaluate the proposed system with two case studies and an in-lab user study. The results show that Calliope-Net can benefit users in discovering and understanding graph data facts with visually pleasing annotated visualizations.
Submitted 11 August, 2023;
originally announced August 2023.
-
Adaptive Bitrate Video Semantic Communication over Wireless Networks
Authors:
Wentao Gong,
Haonan Tong,
Sihua Wang,
Zhaohui Yang,
Xinxin He,
Changchuan Yin
Abstract:
This paper investigates adaptive bitrate (ABR) video semantic communication over wireless networks. In the considered model, video sensing devices must transmit video semantic information to an edge server, to facilitate ubiquitous video sensing services such as road environment monitoring at the edge server in autonomous driving scenarios. However, due to the varying wireless network conditions, it is challenging to guarantee both low transmission delay and high semantic accuracy at the same time if devices continuously transmit video semantic information at a fixed bitrate. To address this challenge, we develop an adaptive bitrate video semantic communication (ABRVSC) system, in which devices adaptively adjust the bitrate of video semantic information according to network conditions. Specifically, we first define the quality of experience (QoE) for video semantic communication. Subsequently, a swin transformer-based semantic codec is proposed to extract semantic information while considering the influence of QoE. Then, we propose an Actor-Critic based ABR algorithm for the semantic codec to enhance the robustness of the proposed ABRVSC scheme against network variations. Simulation results demonstrate that at low bitrates, the mean intersection over union (MIoU) of the proposed ABRVSC scheme is nearly twice that of the traditional scheme. Moreover, the proposed ABRVSC scheme, which increases the QoE in video semantic communication by 36.57%, exhibits more robustness against network variations compared to both the fixed bitrate schemes and traditional ABR schemes.
Submitted 1 August, 2023;
originally announced August 2023.
-
On the nature of long period radio pulsar GPM J1839$-$10: death line and pulse width
Authors:
H. Tong
Abstract:
Recently, another long period radio pulsar, GPM J1839$-$10, was reported, similar to GLEAM-X J162759.5$-$523504.3. Previously, the energy budget and rotational evolution of long period radio pulsars had been considered. This time, the death line and pulse width for neutron star and white dwarf pulsars are investigated. The pulse width is included as the second criterion for neutron star and white dwarf pulsars. It is found that: (1) PSR J0250+5854, PSR J0901$-$4046, etc. should be normal radio pulsars. They have narrow pulse widths and lie near the radio emission death line. (2) The two long period radio pulsars GLEAM-X J162759.5$-$523504.3 and GPM J1839$-$10 are unlikely to be normal radio pulsars. Their possible pulse widths are relatively large, and they lie far below the fiducial death line on the $P-\dot{P}$ diagram. (3) GLEAM-X J162759.5$-$523504.3 and GPM J1839$-$10 may be magnetars or white dwarf radio pulsars. At present, there are many parameters and uncertainties in both of these possibilities.
Submitted 11 October, 2023; v1 submitted 27 July, 2023;
originally announced July 2023.
-
Scientific Objectives of the Hot Universe Baryon Surveyor (HUBS) Mission
Authors:
Joel Bregman,
Renyue Cen,
Yang Chen,
Wei Cui,
Taotao Fang,
Fulai Guo,
Edmund Hodges-Kluck,
Rui Huang,
Luis C. Ho,
Li Ji,
Suoqing Ji,
Xi Kang,
Xiaoyu Lai,
Hui Li,
Jiangtao Li,
Miao Li,
Xiangdong Li,
Yuan Li,
Zhaosheng Li,
Guiyun Liang,
Helei Liu,
Wenhao Liu,
Fangjun Lu,
Junjie Mao,
Gabriele Ponti
, et al. (29 additional authors not shown)
Abstract:
The Hot Universe Baryon Surveyor (HUBS) is a proposed space-based X-ray telescope for detecting X-ray emissions from the hot gas content in our universe. With its unprecedented spatially-resolved high-resolution spectroscopy and large field of view, the HUBS mission will be uniquely qualified to measure the physical and chemical properties of the hot gas in the interstellar medium, the circumgalactic medium, the intergalactic medium, and the intracluster medium. These measurements will be valuable for two key scientific goals of HUBS, namely to unravel the AGN and stellar feedback physics that governs the formation and evolution of galaxies, and to probe the baryon budget and multi-phase states from galactic to cosmological scales. In addition to these two goals, the HUBS mission will also help us solve some problems in the fields of galaxy clusters, AGNs, diffuse X-ray backgrounds, supernova remnants, and compact objects. This paper discusses the perspective of advancing these fields using the HUBS telescope.
Submitted 11 July, 2023;
originally announced July 2023.
-
Privacy-Preserving Graph Machine Learning from Data to Computation: A Survey
Authors:
Dongqi Fu,
Wenxuan Bao,
Ross Maciejewski,
Hanghang Tong,
Jingrui He
Abstract:
In graph machine learning, data collection, sharing, and analysis often involve multiple parties, each of which may require varying levels of data security and privacy. To this end, preserving privacy is of great importance in protecting sensitive information. In the era of big data, the relationships among data entities have become unprecedentedly complex, and more applications utilize advanced data structures (i.e., graphs) that can support network structures and relevant attribute information. To date, many graph-based AI models have been proposed (e.g., graph neural networks) for various domain tasks, like computer vision and natural language processing. In this paper, we focus on reviewing privacy-preserving techniques of graph machine learning. We systematically review related works from the data to the computational aspects. We first review methods for generating privacy-preserving graph data. Then we describe methods for transmitting privacy-preserved information (e.g., graph model parameters) to realize the optimization-based computation when data sharing among multiple parties is risky or impossible. In addition to discussing relevant theoretical methodology and software tools, we also discuss current challenges and highlight several possible future research opportunities for privacy-preserving graph machine learning. Finally, we envision a unified and comprehensive secure graph machine learning system.
Submitted 10 July, 2023;
originally announced July 2023.
-
Noisy Positive-Unlabeled Learning with Self-Training for Speculative Knowledge Graph Reasoning
Authors:
Ruijie Wang,
Baoyu Li,
Yichen Lu,
Dachun Sun,
Jinning Li,
Yuchen Yan,
Shengzhong Liu,
Hanghang Tong,
Tarek F. Abdelzaher
Abstract:
This paper studies the speculative reasoning task on real-world knowledge graphs (KGs) that contain both the \textit{false negative issue} (i.e., potential true facts being excluded) and the \textit{false positive issue} (i.e., unreliable or outdated facts being included). State-of-the-art methods fall short in speculative reasoning ability, as they assume the correctness of a fact is solely determined by its presence in the KG, making them vulnerable to false negative/positive issues. The new reasoning task is formulated as a noisy Positive-Unlabeled learning problem. We propose a variational framework, namely nPUGraph, that jointly estimates the correctness of both collected and uncollected facts (which we call the \textit{label posterior}) and updates model parameters during training. The label posterior estimation facilitates speculative reasoning from two perspectives. First, it improves the robustness of a label posterior-aware graph encoder against false positive links. Second, it identifies missing facts to provide high-quality grounds for reasoning. They are unified in a simple yet effective self-training procedure. Empirically, extensive experiments on three benchmark KGs and one Twitter dataset with various degrees of false negative/positive cases demonstrate the effectiveness of nPUGraph.
Submitted 12 June, 2023;
originally announced June 2023.
-
BeMap: Balanced Message Passing for Fair Graph Neural Network
Authors:
Xiao Lin,
Jian Kang,
Weilin Cong,
Hanghang Tong
Abstract:
Fairness in graph neural networks has been actively studied recently. However, existing works often do not explicitly consider the role of message passing in introducing or amplifying the bias. In this paper, we first investigate the problem of bias amplification in message passing. We empirically and theoretically demonstrate that message passing could amplify the bias when the 1-hop neighbors from different demographic groups are unbalanced. Guided by such analyses, we propose BeMap, a fair message passing method, that leverages a balance-aware sampling strategy to balance the number of the 1-hop neighbors of each node among different demographic groups. Extensive experiments on node classification demonstrate the efficacy of BeMap in mitigating bias while maintaining classification accuracy. The code is available at https://github.com/xiaolin-cs/BeMap.
Submitted 8 March, 2024; v1 submitted 6 June, 2023;
originally announced June 2023.
-
Reconstructing Graph Diffusion History from a Single Snapshot
Authors:
Ruizhong Qiu,
Dingsu Wang,
Lei Ying,
H. Vincent Poor,
Yifang Zhang,
Hanghang Tong
Abstract:
Diffusion on graphs is ubiquitous with numerous high-impact applications. In these applications, complete diffusion histories play an essential role in terms of identifying dynamical patterns, reflecting on precaution actions, and forecasting intervention effects. Despite their importance, complete diffusion histories are rarely available and are highly challenging to reconstruct due to ill-posedness, explosive search space, and scarcity of training data. To date, few methods exist for diffusion history reconstruction. They are exclusively based on the maximum likelihood estimation (MLE) formulation and require knowing the true diffusion parameters. In this paper, we study an even harder problem, namely reconstructing Diffusion history from A single SnapsHot (DASH), where we seek to reconstruct the history from only the final snapshot without knowing the true diffusion parameters. We start with theoretical analyses that reveal a fundamental limitation of the MLE formulation. We prove: (a) estimation error of diffusion parameters is unavoidable due to NP-hardness of diffusion parameter estimation, and (b) the MLE formulation is sensitive to estimation error of diffusion parameters. To overcome the inherent limitation of the MLE formulation, we propose a novel barycenter formulation: finding the barycenter of the posterior distribution of histories, which is provably stable against the estimation error of diffusion parameters. We further develop an effective solver named DIffusion hiTting Times with Optimal proposal (DITTO) by reducing the problem to estimating posterior expected hitting times via the Metropolis--Hastings Markov chain Monte Carlo method (M--H MCMC) and employing an unsupervised graph neural network to learn an optimal proposal to accelerate the convergence of M--H MCMC. We conduct extensive experiments to demonstrate the efficacy of the proposed method.
Submitted 31 May, 2024; v1 submitted 1 June, 2023;
originally announced June 2023.
-
Networked Time Series Imputation via Position-aware Graph Enhanced Variational Autoencoders
Authors:
Dingsu Wang,
Yuchen Yan,
Ruizhong Qiu,
Yada Zhu,
Kaiyu Guan,
Andrew J Margenot,
Hanghang Tong
Abstract:
Multivariate time series (MTS) imputation has been widely studied in recent years. Existing methods can be divided into two main groups: (1) deep recurrent or generative models that primarily focus on time series features, and (2) graph neural network (GNN) based models that utilize the topological information from the inherent graph structure of MTS as relational inductive bias for imputation. Nevertheless, these methods either neglect topological information or assume the graph structure is fixed and accurately known. Thus, they fail to fully utilize the graph dynamics for precise imputation in more challenging MTS data such as networked time series (NTS), where the underlying graph is constantly changing and might have missing edges. In this paper, we propose a novel approach to overcome these limitations. First, we define the problem of imputation over NTS which contains missing values in both node time series features and graph structures. Then, we design a new model named PoGeVon which leverages a variational autoencoder (VAE) to predict missing values over both node time series features and graph structures. In particular, we propose a new node position embedding based on random walk with restart (RWR) in the encoder with provably higher expressive power compared with message-passing based GNNs. We further design a decoder with 3-stage predictions from the perspective of multi-task learning to impute missing values in both time series and graph structures reciprocally. Experimental results demonstrate the effectiveness of our model over baselines.
Submitted 26 June, 2023; v1 submitted 29 May, 2023;
originally announced May 2023.
-
Logical Entity Representation in Knowledge-Graphs for Differentiable Rule Learning
Authors:
Chi Han,
Qizheng He,
Charles Yu,
Xinya Du,
Hanghang Tong,
Heng Ji
Abstract:
Probabilistic logical rule learning has shown great strength in logical rule mining and knowledge graph completion. It learns logical rules to predict missing edges by reasoning on existing edges in the knowledge graph. However, previous efforts have largely been limited to only modeling chain-like Horn clauses such as $R_1(x,z)\land R_2(z,y)\Rightarrow H(x,y)$. This formulation overlooks additional contextual information from neighboring sub-graphs of entity variables $x$, $y$ and $z$. Intuitively, there is a large gap here, as local sub-graphs have been found to provide important information for knowledge graph completion. Inspired by these observations, we propose Logical Entity RePresentation (LERP) to encode contextual information of entities in the knowledge graph. A LERP is designed as a vector of probabilistic logical functions on the entity's neighboring sub-graph. It is an interpretable representation while allowing for differentiable optimization. We can then incorporate LERP into probabilistic logical rule learning to learn more expressive rules. Empirical results demonstrate that with LERP, our model outperforms other rule learning methods in knowledge graph completion and is comparable or even superior to state-of-the-art black-box methods. Moreover, we find that our model can discover a more expressive family of logical rules. LERP can also be further combined with embedding learning methods like TransE to make it more interpretable.
Submitted 22 May, 2023;
originally announced May 2023.
-
Local Optima Correlation Assisted Adaptive Operator Selection
Authors:
Jiyuan Pei,
Hao Tong,
Jialin Liu,
Yi Mei,
Xin Yao
Abstract:
For solving combinatorial optimisation problems with metaheuristics, different search operators are applied for sampling new solutions in the neighbourhood of a given solution. It is important to understand the relationship between operators for various purposes, e.g., adaptively deciding when to use which operator to find optimal solutions efficiently. However, it is difficult to theoretically analyse this relationship, especially in the complex solution space of combinatorial optimisation problems. In this paper, we propose to empirically analyse the relationship between operators in terms of the correlation between their local optima and develop a measure for quantifying their relationship. The comprehensive analyses on a wide range of capacitated vehicle routing problem benchmark instances show that there is a consistent pattern in the correlation between commonly used operators. Based on this newly proposed local optima correlation metric, we propose a novel approach for adaptively selecting among the operators during the search process. The core intention is to improve search efficiency by preventing wasting computational resources on exploring neighbourhoods where the local optima have already been reached. Experiments on randomly generated instances and commonly used benchmark datasets are conducted. Results show that the proposed approach outperforms commonly used adaptive operator selection methods.
Submitted 3 May, 2023;
originally announced May 2023.
-
Existence and stability of solitary waves to the rotation-Camassa-Holm equation
Authors:
Hao Tong,
Shaojie Yang
Abstract:
In this paper, we investigate the existence and stability of solitary waves to the rotation-Camassa-Holm equation, which can be considered as a model in shallow water for long-crested waves propagating near the equator under the effect of the Coriolis force due to the Earth's rotation. We prove the existence of solitary waves by performing a phase plane analysis. Moreover, utilizing the approach proposed by Grillakis-Shatah-Strauss, we prove the stability of solitary waves.
Submitted 19 May, 2024; v1 submitted 30 April, 2023;
originally announced May 2023.
-
SRL-Assisted AFM: Generating Planar Unstructured Quadrilateral Meshes with Supervised and Reinforcement Learning-Assisted Advancing Front Method
Authors:
Hua Tong,
Kuanren Qian,
Eni Halilaj,
Yongjie Jessica Zhang
Abstract:
High-quality mesh generation is the foundation of accurate finite element analysis. Due to the vast interior vertices search space and complex initial boundaries, mesh generation for complicated domains requires substantial manual processing and has long been considered the most challenging and time-consuming bottleneck of the entire modeling and analysis process. In this paper, we present a novel…
▽ More
High-quality mesh generation is the foundation of accurate finite element analysis. Due to the vast interior vertices search space and complex initial boundaries, mesh generation for complicated domains requires substantial manual processing and has long been considered the most challenging and time-consuming bottleneck of the entire modeling and analysis process. In this paper, we present a novel computational framework named ``SRL-assisted AFM" for meshing planar geometries by combining the advancing front method with neural networks that select reference vertices and update the front boundary using ``policy networks." These deep neural networks are trained using a unique pipeline that combines supervised learning with reinforcement learning to iteratively improve mesh quality. First, we generate different initial boundaries by randomly sampling points in a square domain and connecting them sequentially. These boundaries are used for obtaining input meshes and extracting training datasets in the supervised learning module. We then iteratively improve the reinforcement learning model performance with reward functions designed for special requirements, such as improving the mesh quality and controlling the number and distribution of extraordinary points. Our proposed supervised learning neural networks achieve an accuracy higher than 98% on predicting commercial software. The final reinforcement learning neural networks automatically generate high-quality quadrilateral meshes for complex planar domains with sharp features and boundary layers.
Submitted 30 April, 2023;
originally announced May 2023.
-
Neutron-proton effective mass splitting in neutron-rich matter
Authors:
Sibo Wang,
Hui Tong,
Qiang Zhao,
Chencan Wang,
Peter Ring,
Jie Meng
Abstract:
Nucleon effective masses in neutron-rich matter are studied with the relativistic Brueckner-Hartree-Fock (RBHF) theory in the full Dirac space. The neutron and proton effective masses for symmetric nuclear matter are 0.80 times rest mass, which agrees well with the empirical values. In neutron-rich matter, the effective mass of the neutron is found larger than that of the proton, and the neutron-proton effective mass splittings at the empirical saturation density are predicted as $0.187α$ with $α$ being the isospin asymmetry parameter. The result is compared to other ab initio calculations and is consistent with the constraints from the nuclear reaction and structure measurements, such as the nucleon-nucleus scattering, the giant resonances of $^{208}$Pb, and the Hugenholtz-Van Hove theorem with systematics of nuclear symmetry energy and its slope. The predictions of the neutron-proton effective mass splitting from the RBHF theory in the full Dirac space might be helpful to constrain the isovector parameters in phenomenological density functionals.
Submitted 20 September, 2023; v1 submitted 26 April, 2023;
originally announced April 2023.
-
Neural Multi-network Diffusion towards Social Recommendation
Authors:
Boxin Du,
Lihui Liu,
Jiejun Xu,
Fei Wang,
Hanghang Tong
Abstract:
Graph Neural Networks (GNNs) have been widely applied to a variety of real-world applications, such as social recommendation. However, existing GNN-based models for social recommendation suffer from serious problems of generalization and oversmoothing, because of the underexplored negative sampling method and the direct adoption of off-the-shelf GNN models. In this paper, we propose a succinct multi-network GNN-based neural model (NeMo) for social recommendation. Compared with the existing methods, the proposed model explores a generative negative sampling strategy, and leverages both the positive and negative user-item interactions for users' interest propagation. The experiments show that NeMo outperforms the state-of-the-art baselines on various real-world benchmark datasets (e.g., by up to 38.8% in terms of NDCG@15).
Submitted 11 April, 2023;
originally announced April 2023.
-
FairGen: Towards Fair Graph Generation
Authors:
Lecheng Zheng,
Dawei Zhou,
Hanghang Tong,
Jiejun Xu,
Yada Zhu,
Jingrui He
Abstract:
There have been tremendous efforts over the past decades dedicated to the generation of realistic graphs in a variety of domains, ranging from social networks to computer networks, from gene regulatory networks to online transaction networks. Despite the remarkable success, the vast majority of these works are unsupervised in nature and are typically trained to minimize the expected graph reconstruction loss, which would result in the representation disparity issue in the generated graphs, i.e., the protected groups (often minorities) contribute less to the objective and thus suffer from systematically higher errors. In this paper, we aim to tailor graph generation to downstream mining tasks by leveraging label information and user-preferred parity constraints. In particular, we start from the investigation of representation disparity in the context of graph generative models. To mitigate the disparity, we propose a fairness-aware graph generative model named FairGen. Our model jointly trains a label-informed graph generation module and a fair representation learning module by progressively learning the behaviors of the protected and unprotected groups, from the `easy' concepts to the `hard' ones. In addition, we propose a generic context sampling strategy for graph generative models, which is proven to be capable of fairly capturing the contextual information of each group with a high probability. Experimental results on seven real-world data sets, including web-based graphs, demonstrate that FairGen (1) obtains performance on par with state-of-the-art graph generative models across nine network properties, (2) mitigates the representation disparity issues in the generated graphs, and (3) substantially boosts the model performance by up to 17% in downstream tasks via data augmentation.
Submitted 16 December, 2023; v1 submitted 30 March, 2023;
originally announced March 2023.
-
A Novel Two-Layer Codebook Based Near-Field Beam Training for Intelligent Reflecting Surface
Authors:
Tao Wang,
Jie Lv,
Haonan Tong,
Changsheng You,
Changchuan Yin
Abstract:
In this paper, we study codebook-based near-field beam training for intelligent reflecting surface (IRS) aided wireless systems. In the considered model, the near-field beam training is critical to focus signals at the location of user equipment (UE) to obtain prominent IRS array gain. However, existing codebook schemes cannot achieve low training overhead and high receiving power simultaneously. To tackle this issue, a novel two-layer codebook based beam training scheme is proposed. The layer-1 codebook is designed based on the omnidirectionality of a random-phase beam pattern, which estimates the UE distance with training overhead equivalent to that of one DFT codeword. Then, based on the estimated UE distance, the layer-2 codebook is generated to scan candidate UE locations and obtain the optimal codeword for IRS beamforming. Numerical results show that compared with benchmarks, the proposed two-layer beam training scheme achieves more accurate UE distance and angle estimation, higher data rate, and smaller training overhead.
Submitted 18 April, 2023; v1 submitted 13 March, 2023;
originally announced March 2023.
-
KHAN: Knowledge-Aware Hierarchical Attention Networks for Accurate Political Stance Prediction
Authors:
Yunyong Ko,
Seongeun Ryu,
Soeun Han,
Youngseung Jeon,
Jaehoon Kim,
Sohyun Park,
Kyungsik Han,
Hanghang Tong,
Sang-Wook Kim
Abstract:
The political stance prediction for news articles has been widely studied to mitigate the echo chamber effect -- people fall into their thoughts and reinforce their pre-existing beliefs. The previous works for the political stance problem focus on (1) identifying political factors that could reflect the political stance of a news article and (2) capturing those factors effectively. Despite their empirical successes, they are not sufficiently justified in terms of how effective their identified factors are in the political stance prediction. Motivated by this, in this work, we conduct a user study to investigate important factors in political stance prediction, and observe that the context and tone of a news article (implicit) and external knowledge for real-world entities appearing in the article (explicit) are important in determining its political stance. Based on this observation, we propose a novel knowledge-aware approach to political stance prediction (KHAN), employing (1) hierarchical attention networks (HAN) to learn the relationships among words and sentences at three different levels and (2) knowledge encoding (KE) to incorporate external knowledge for real-world entities into the process of political stance prediction. Also, to take into account the subtle and important difference between opposite political stances, we build two independent political knowledge graphs (KG) (i.e., KG-lib and KG-con) by ourselves and learn to fuse the different political knowledge. Through extensive evaluations on three real-world datasets, we demonstrate the superiority of KHAN in terms of (1) accuracy, (2) efficiency, and (3) effectiveness.
Submitted 4 April, 2023; v1 submitted 23 February, 2023;
originally announced February 2023.
-
Do We Really Need Complicated Model Architectures For Temporal Networks?
Authors:
Weilin Cong,
Si Zhang,
Jian Kang,
Baichuan Yuan,
Hao Wu,
Xin Zhou,
Hanghang Tong,
Mehrdad Mahdavi
Abstract:
Recurrent neural network (RNN) and self-attention mechanism (SAM) are the de facto methods to extract spatial-temporal information for temporal graph learning. Interestingly, we found that although both RNN and SAM could lead to a good performance, in practice neither of them is always necessary. In this paper, we propose GraphMixer, a conceptually and technically simple architecture that consists of three components: (1) a link-encoder that is only based on multi-layer perceptrons (MLP) to summarize the information from temporal links, (2) a node-encoder that is only based on neighbor mean-pooling to summarize node information, and (3) an MLP-based link classifier that performs link prediction based on the outputs of the encoders. Despite its simplicity, GraphMixer attains an outstanding performance on temporal link prediction benchmarks with faster convergence and better generalization performance. These results motivate us to rethink the importance of simpler model architecture.
Submitted 22 February, 2023;
originally announced February 2023.
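The neighbor mean-pooling idea named in the abstract above can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the function name `mean_pool_neighbors`, the dictionary-based graph representation, and the fallback for isolated nodes are all assumptions made for illustration.

```python
from collections import defaultdict


def mean_pool_neighbors(features, edges):
    """Average the feature vectors of each node's 1-hop neighbors.

    features: dict mapping node -> list of floats (all the same length)
    edges: iterable of undirected (u, v) pairs
    Isolated nodes keep a copy of their own features (an assumption of
    this sketch, not something stated in the abstract).
    """
    neighbors = defaultdict(set)
    for u, v in edges:
        neighbors[u].add(v)
        neighbors[v].add(u)

    pooled = {}
    for node, own in features.items():
        nbrs = list(neighbors.get(node, ()))
        if not nbrs:
            pooled[node] = list(own)
            continue
        # Component-wise mean over the neighbors' feature vectors.
        pooled[node] = [
            sum(features[n][d] for n in nbrs) / len(nbrs)
            for d in range(len(own))
        ]
    return pooled


features = {0: [1.0, 0.0], 1: [3.0, 2.0], 2: [5.0, 4.0], 3: [7.0, 7.0]}
edges = [(0, 1), (0, 2)]
pooled = mean_pool_neighbors(features, edges)
```

Here node 0 receives the average of nodes 1 and 2, while the isolated node 3 simply keeps its own vector.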
-
STERLING: Synergistic Representation Learning on Bipartite Graphs
Authors:
Baoyu Jing,
Yuchen Yan,
Kaize Ding,
Chanyoung Park,
Yada Zhu,
Huan Liu,
Hanghang Tong
Abstract:
A fundamental challenge of bipartite graph representation learning is how to extract informative node embeddings. Self-Supervised Learning (SSL) is a promising paradigm to address this challenge. Most recent bipartite graph SSL methods are based on contrastive learning which learns embeddings by discriminating positive and negative node pairs. Contrastive learning usually requires a large number of negative node pairs, which could lead to computational burden and semantic errors. In this paper, we introduce a novel synergistic representation learning model (STERLING) to learn node embeddings without negative node pairs. STERLING preserves the unique local and global synergies in bipartite graphs. The local synergies are captured by maximizing the similarity of the inter-type and intra-type positive node pairs, and the global synergies are captured by maximizing the mutual information of co-clusters. Theoretical analysis demonstrates that STERLING could improve the connectivity between different node types in the embedding space. Extensive empirical evaluation on various benchmark datasets and tasks demonstrates the effectiveness of STERLING for extracting node embeddings.
Submitted 10 February, 2024; v1 submitted 24 January, 2023;
originally announced February 2023.
-
Sum-Rate Maximization for Active RIS-Aided Downlink RSMA System
Authors:
Xinhao Li,
Tao Wang,
Haonan Tong,
Zhaohui Yang,
Yijie Mao,
Changchuan Yin
Abstract:
In this paper, the problem of sum-rate maximization for an active reconfigurable intelligent surface (RIS) assisted downlink rate-splitting multiple access (RSMA) transmission system is studied. In the considered model, the active RIS is deployed to overcome severe power attenuation, which is caused by the cumulative product of RIS incidence path loss and the reflection path loss. Since the active RIS can adjust both the phase and the amplitude of the incident signal simultaneously, the RIS control scheme requires delicate design to improve RSMA communication performance. To address this issue, a sum-rate maximization problem is formulated to jointly optimize the beamforming vectors, rate allocation vector, and RIS precoding matrix. To solve this non-convex sum-rate maximization problem, an iterative algorithm based on fractional programming (FP) and quadratic constraint quadratic programming (QCQP) is proposed. In particular, the proposed algorithm firstly decomposes the original problem into two subproblems, namely, 1) beamforming and rate allocation optimization and 2) active RIS precoding optimization. The corresponding variables of the two subproblems are optimized through sequential convex approximation (SCA) and block coordinate descent (BCD), respectively. Numerical results show that the proposed active RIS-aided RSMA system could increase the sum-rate by up to 45% over the conventional passive RIS-aided RSMA system with the same energy consumption.
Submitted 30 January, 2023;
originally announced January 2023.
-
Concept Discovery for Fast Adaptation
Authors:
Shengyu Feng,
Hanghang Tong
Abstract:
The advances in deep learning have enabled machine learning methods to outperform human beings in various areas, but it remains a great challenge for a well-trained model to quickly adapt to a new task. One promising solution to realize this goal is through meta-learning, also known as learning to learn, which has achieved promising results in few-shot learning. However, current approaches are still enormously different from human beings' learning process, especially in the ability to extract structural and transferable knowledge. This drawback makes current meta-learning frameworks non-interpretable and hard to extend to more complex tasks. We tackle this problem by introducing concept discovery to the few-shot learning problem, where we achieve more effective adaptation by meta-learning the structure among the data features, leading to a composite representation of the data. Our proposed method Concept-Based Model-Agnostic Meta-Learning (COMAML) has been shown to achieve consistent improvements in the structured data for both synthesized datasets and real-world datasets.
Submitted 9 April, 2023; v1 submitted 18 January, 2023;
originally announced January 2023.
-
Properties of $^{208}$Pb predicted from the relativistic equation of state in the full Dirac space
Authors:
Hui Tong,
Jing Gao,
Chencan Wang,
Sibo Wang
Abstract:
Relativistic Brueckner-Hartree-Fock (RBHF) theory in the full Dirac space allows one to determine uniquely the momentum dependence of scalar and vector components of the single-particle potentials. In order to extend this new method from nuclear matter to finite nuclei, as a first step, properties of $^{208}$Pb are explored by using the microscopic equation of state for asymmetric nuclear matter and a liquid droplet model. The neutron and proton density distributions, the binding energies, the neutron and proton radii, and the neutron skin thickness in $^{208}$Pb are calculated. In order to further compare the charge densities predicted from the RBHF theory in the full Dirac space with the experimental charge densities, the differential cross sections and the electric charge form factors in the elastic electron-nucleus scattering are obtained by using the phase-shift analysis method. The results from the RBHF theory are in good agreement with the experimental data. In addition, the uncertainty arising from variations of the surface term parameter $f_0$ in the liquid droplet model is also discussed.
Submitted 29 December, 2022;
originally announced December 2022.
-
$K_0^\ast(1430)$ Twist-2 Distribution Amplitude and $B_s,D_s \to K_0^\ast(1430)$ Transition Form Factors
Authors:
Dong Huang,
Tao Zhong,
Hai-Bing Fu,
Zai-Hui Wu,
Xing-Gang Wu,
Hong Tong
Abstract:
Based on the scenario that the $K_0^\ast(1430)$ is viewed as the ground state of $s\bar{q}$ or $q\bar{s}$, we study the $K_0^\ast(1430)$ leading-twist distribution amplitude (DA) $φ_{2;K_0^\ast}(x,μ)$ with the QCD sum rules in the framework of background field theory. A more reasonable sum rule formula for $ξ$-moments $\langleξ^n\rangle_{2;K_0^\ast}$ is suggested, which eliminates the influence brought by the fact that the sum rule of $\langleξ^0_p\rangle_{3;K_0^\ast}$ cannot be normalized in the whole Borel region. More accurate values of the first ten $ξ$-moments, $\langleξ^n\rangle_{2;K_0^\ast} (n = 1,2,\cdots,10)$, are evaluated. A new light-cone harmonic oscillator (LCHO) model for the $K_0^\ast(1430)$ leading-twist DA is established for the first time. By fitting the resulting values of $\langleξ^n\rangle_{2;K_0^\ast} (n = 1,2,\cdots,10)$ via the least squares method, the behavior of the $K_0^\ast(1430)$ leading-twist DA described with the LCHO model is determined. Further, by adopting the light-cone QCD sum rules, we calculate the $B_s,D_s \to K_0^\ast(1430)$ transition form factors and branching fractions of the semileptonic decays $B_s,D_s \to K_0^\ast(1430) \ell ν_\ell$. The corresponding numerical results can be used to extract the Cabibbo-Kobayashi-Maskawa matrix elements by combining the relevant experimental data in the future.
Submitted 1 August, 2023; v1 submitted 11 November, 2022;
originally announced November 2022.
-
GENIUS: A Novel Solution for Subteam Replacement with Clustering-based Graph Neural Network
Authors:
Chuxuan Hu,
Qinghai Zhou,
Hanghang Tong
Abstract:
Subteam replacement is defined as finding the optimal candidate set of people who can best function as an unavailable subset of members (i.e., subteam) for certain reasons (e.g., conflicts of interests, employee churn), given a team of people embedded in a social network working on the same task. Prior investigations on this problem incorporate graph kernel as the optimal criteria for measuring the similarity between the new optimized team and the original team. However, the increasingly abundant social networks reveal fundamental limitations of existing methods, including (1) the graph kernel-based approaches are powerless to capture the key intrinsic correlations among node features, (2) they generally search over the entire network for every member to be replaced, making it extremely inefficient as the network grows, and (3) the requirement of equal-sized replacement for the unavailable subteam can be inapplicable due to limited hiring budget. In this work, we address the limitations in the state-of-the-art for subteam replacement by (1) proposing GENIUS, a novel clustering-based graph neural network (GNN) framework that can capture team network knowledge for flexible subteam replacement, and (2) equipping the proposed GENIUS with self-supervised positive team contrasting training scheme to improve the team-level representation learning and unsupervised node clusters to prune candidates for fast computation. Through extensive empirical evaluations, we demonstrate the efficacy of the proposed method (1) effectiveness: being able to select better candidate members that significantly increase the similarity between the optimized and original teams, and (2) efficiency: achieving more than 600 times speed-up in average running time.
Submitted 11 November, 2022; v1 submitted 8 November, 2022;
originally announced November 2022.
-
A note on the anti-glitch of magnetar SGR 1935+2154
Authors:
H. Tong
Abstract:
The magnetar SGR 1935+2154 is reported to have an anti-glitch, accompanied by fast radio bursts and transient pulsed radio emission. In the wind braking model, this triplet event indicates that (1) SGR 1935+2154 does not have a strong particle wind and can be approximated by magnetic dipole braking in the persistent state; (2) its anti-glitch is due to an enhanced particle wind, similar to the first anti-glitch in magnetars; (3) its transient pulsed radio emission may be due to a decreasing emission beam during the outburst; (4) the enhanced particle acceleration potential and pulsar death line may not be the dominant factor.
Submitted 20 December, 2022; v1 submitted 2 November, 2022;
originally announced November 2022.
-
Nuclear Matter and Neutron Stars from Relativistic Brueckner-Hartree-Fock Theory
Authors:
Hui Tong,
Chencan Wang,
Sibo Wang
Abstract:
The momentum and isospin dependence of the single-particle potential for the in-medium nucleon are the key quantities in the Relativistic Brueckner-Hartree-Fock (RBHF) theory. It depends on how to extract the scalar and the vector components of the single-particle potential inside nuclear matter. In contrast to the RBHF calculations in the Dirac space with the positive-energy states (PESs) only, the single-particle potential can be determined in a unique way by the RBHF theory together with the negative-energy states (NESs), i.e., the RBHF theory in the full Dirac space. The saturation properties of symmetric and asymmetric nuclear matter in the full Dirac space are systematically investigated based on the realistic Bonn nucleon-nucleon potentials. In order to further specify the importance of the calculations in the full Dirac space, the neutron star properties are investigated. The direct URCA process in neutron star cooling will happen at density $ρ_{\rm{DURCA}}=0.43,~0.48,~0.52$ fm$^{-3}$ with the proton fractions $Y_{p,\rm{DURCA}}=0.13$. The radii of a $1.4M_\odot$ neutron star are predicted as $R_{1.4M_\odot}=11.97,~12.13,~12.27$ km, and their tidal deformabilities are $Λ_{1.4M_\odot}=376,~405,~433$ for potential Bonn A, B, C. Compared with the results obtained in the Dirac space with PESs only, the full-Dirac-space RBHF calculation predicts the softest symmetry energy, which would be more favored by the gravitational-wave (GW) detection from GW170817. Furthermore, the results from full-Dirac-space RBHF theory are consistent with the recent astronomical observations of massive neutron stars and simultaneous mass-radius measurement.
Submitted 27 October, 2022;
originally announced October 2022.
-
JuryGCN: Quantifying Jackknife Uncertainty on Graph Convolutional Networks
Authors:
Jian Kang,
Qinghai Zhou,
Hanghang Tong
Abstract:
Graph Convolutional Network (GCN) has exhibited strong empirical performance in many real-world applications. The vast majority of existing works on GCN primarily focus on the accuracy while ignoring how confident or uncertain a GCN is with respect to its predictions. Despite being a cornerstone of trustworthy graph mining, uncertainty quantification on GCN has not been well studied and the scarce existing efforts either fail to provide deterministic quantification or have to change the training procedure of GCN by introducing additional parameters or architectures. In this paper, we propose the first frequentist-based approach named JuryGCN in quantifying the uncertainty of GCN, where the key idea is to quantify the uncertainty of a node as the width of confidence interval by a jackknife estimator. Moreover, we leverage the influence functions to estimate the change in GCN parameters without re-training to scale up the computation. The proposed JuryGCN is capable of quantifying uncertainty deterministically without modifying the GCN architecture or introducing additional parameters. We perform extensive experimental evaluation on real-world datasets in the tasks of both active learning and semi-supervised node classification, which demonstrate the efficacy of the proposed method.
Submitted 12 October, 2022;
originally announced October 2022.
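The idea of reading uncertainty off the width of a jackknife confidence interval can be illustrated on a much simpler estimator than a GCN. The sketch below uses the classical leave-one-out jackknife standard error with a normal-approximation interval; it is not the JuryGCN procedure (which, per the abstract, relies on influence functions to avoid re-training), and the function name `jackknife_interval` and the default `z=1.96` are assumptions for illustration.

```python
import math


def jackknife_interval(samples, estimator, z=1.96):
    """Return (low, high) for a normal-approximation jackknife interval.

    samples: list of observations
    estimator: function mapping a list of observations to a float
    z: normal quantile (1.96 corresponds to roughly 95% coverage)
    """
    n = len(samples)
    if n < 2:
        raise ValueError("need at least two samples for a jackknife estimate")

    # Leave-one-out estimates: recompute the estimator with sample i removed.
    loo = [estimator(samples[:i] + samples[i + 1:]) for i in range(n)]
    loo_mean = sum(loo) / n

    # Classical jackknife standard error.
    se = math.sqrt((n - 1) / n * sum((t - loo_mean) ** 2 for t in loo))

    point = estimator(samples)
    return point - z * se, point + z * se


def mean(xs):
    return sum(xs) / len(xs)


low, high = jackknife_interval([1.0, 2.0, 3.0, 4.0], mean)
width = high - low  # a wider interval signals a less certain estimate
```

For the sample mean, the jackknife standard error coincides with the usual sample standard deviation divided by the square root of the sample size, which gives a convenient sanity check.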
-
Image Segmentation Semantic Communication over Internet of Vehicles
Authors:
Qiang Pan,
Haonan Tong,
Jie Lv,
Tao Luo,
Zhilong Zhang,
Changchuan Yin,
Jianfeng Li
Abstract:
In this paper, the problem of semantic-based efficient image transmission is studied over the Internet of Vehicles (IoV). In the considered model, a vehicle shares a massive amount of visual data perceived by its visual sensors to assist other vehicles in making driving decisions. However, it is hard to maintain highly reliable visual data transmission due to the limited spectrum resources. To tackle this problem, a semantic communication approach is introduced to reduce the amount of transmitted data while ensuring semantic-level accuracy. In particular, an image segmentation semantic communication (ISSC) system is proposed, which can extract the semantic features from the perceived images and transmit the features to the receiving vehicle that reconstructs the image segmentations. The ISSC system consists of an encoder and a decoder at the transmitter and the receiver, respectively. To accurately extract the image semantic features, the ISSC system encoder employs a Swin Transformer based multi-scale semantic feature extractor. Then, to resist the wireless noise and reconstruct the image segmentation, a semantic feature decoder and a reconstructor are designed at the receiver. Simulation results show that the proposed ISSC system can reconstruct the image segmentation accurately with a high compression ratio, and can achieve robust transmission performance against channel noise, especially at low signal-to-noise ratio (SNR). In terms of mean Intersection over Union (mIoU), the ISSC system can achieve an increase of 75% compared to the baselines using traditional coding methods.
Submitted 11 October, 2022;
originally announced October 2022.
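The mIoU metric quoted in the abstract above is commonly computed as the per-class intersection over union, averaged over classes. The following is a minimal sketch on flattened label lists; the function name `mean_iou` and the choice to skip classes absent from both prediction and target are assumptions of this sketch, and the paper may use a different convention.

```python
def mean_iou(pred, target, num_classes):
    """Mean Intersection over Union for two equal-length label sequences.

    pred, target: sequences of integer class labels (one per pixel)
    num_classes: number of classes, labeled 0 .. num_classes - 1
    Classes that appear in neither sequence are skipped (an assumption).
    """
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")

    ious = []
    for c in range(num_classes):
        # Pixels where both agree on class c, and pixels where either says c.
        inter = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        if union > 0:
            ious.append(inter / union)

    if not ious:
        raise ValueError("no class is present in pred or target")
    return sum(ious) / len(ious)


score = mean_iou([0, 0, 1, 1], [0, 1, 1, 1], num_classes=2)
```

In this toy case class 0 has IoU 1/2 and class 1 has IoU 2/3, so the mean is 7/12.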
-
Improved High-Probability Regret for Adversarial Bandits with Time-Varying Feedback Graphs
Authors:
Haipeng Luo,
Hanghang Tong,
Mengxiao Zhang,
Yuheng Zhang
Abstract:
We study high-probability regret bounds for adversarial $K$-armed bandits with time-varying feedback graphs over $T$ rounds. For general strongly observable graphs, we develop an algorithm that achieves the optimal regret $\widetilde{\mathcal{O}}((\sum_{t=1}^Tα_t)^{1/2}+\max_{t\in[T]}α_t)$ with high probability, where $α_t$ is the independence number of the feedback graph at round $t$. Compared to the best existing result [Neu, 2015] which only considers graphs with self-loops for all nodes, our result not only holds more generally, but importantly also removes any $\text{poly}(K)$ dependence that can be prohibitively large for applications such as contextual bandits. Furthermore, we also develop the first algorithm that achieves the optimal high-probability regret bound for weakly observable graphs, which even improves the best expected regret bound of [Alon et al., 2015] by removing the $\mathcal{O}(\sqrt{KT})$ term with a refined analysis. Our algorithms are based on the online mirror descent framework, but importantly with an innovative combination of several techniques. Notably, while earlier works use optimistic biased loss estimators for achieving high-probability bounds, we find it important to use a pessimistic one for nodes without self-loop in a strongly observable graph.
Submitted 29 January, 2023; v1 submitted 4 October, 2022;
originally announced October 2022.
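To get a feel for the bound quoted in the abstract above, the sketch below evaluates its leading term $(\sum_{t=1}^T α_t)^{1/2}+\max_{t\in[T]}α_t$ for a given sequence of independence numbers. It ignores the logarithmic factors hidden by $\widetilde{\mathcal{O}}$ and any constants, so the numbers are only a scale comparison. The function name `graph_regret_scale` is hypothetical.

```python
import math


def graph_regret_scale(alphas):
    """Leading term sqrt(sum_t alpha_t) + max_t alpha_t, log factors dropped.

    alphas[t] is the independence number of the feedback graph at round t.
    """
    if not alphas:
        raise ValueError("need at least one round")
    return math.sqrt(sum(alphas)) + max(alphas)


K, T = 100, 10_000
# Rich feedback: every round's graph has independence number 2.
print(graph_regret_scale([2] * T))
# Pure bandit feedback (self-loops only): independence number is K each round.
print(graph_regret_scale([K] * T))
```

With self-loops only, the independence number equals $K$ in every round and the term becomes $\sqrt{KT}+K$; denser feedback graphs shrink it.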
-
Improved Algorithms for Neural Active Learning
Authors:
Yikun Ban,
Yuheng Zhang,
Hanghang Tong,
Arindam Banerjee,
Jingrui He
Abstract:
We improve the theoretical and empirical performance of neural-network(NN)-based active learning algorithms for the non-parametric streaming setting. In particular, we introduce two regret metrics by minimizing the population loss that are more suitable in active learning than the one used in state-of-the-art (SOTA) related work. Then, the proposed algorithm leverages the powerful representation of NNs for both exploitation and exploration, has the query decision-maker tailored for $k$-class classification problems with the performance guarantee, utilizes the full feedback, and updates parameters in a more practical and efficient manner. These careful designs lead to an instance-dependent regret upper bound, roughly improving by a multiplicative factor $O(\log T)$ and removing the curse of input dimensionality. Furthermore, we show that the algorithm can achieve the same performance as the Bayes-optimal classifier in the long run under the hard-margin setting in classification problems. In the end, we use extensive experiments to evaluate the proposed algorithm and SOTA baselines, to show the improved empirical performance.
Submitted 16 January, 2023; v1 submitted 2 October, 2022;
originally announced October 2022.
-
Solving Coupled Differential Equation Groups Using PINO-CDE
Authors:
Wenhao Ding,
Qing He,
Hanghang Tong,
Qingjing Wang,
Ping Wang
Abstract:
As a fundamental mathematical tool in many engineering disciplines, coupled differential equation groups are widely used to model complex structures containing multiple physical quantities. Engineers constantly adjust structural parameters at the design stage, which requires a highly efficient solver. The rise of deep learning technologies has offered new perspectives on this task. Unfortunately, existing black-box models suffer from poor accuracy and robustness, while the advanced methodologies of single-output operator regression cannot deal with multiple quantities simultaneously. To address these challenges, we propose PINO-CDE, a deep learning framework for solving coupled differential equation groups (CDEs), along with an equation normalization algorithm for performance enhancement. Based on the theory of physics-informed neural operator (PINO), PINO-CDE uses a single network for all quantities in a set of CDEs, instead of training dozens, or even hundreds, of networks as in the existing literature. We demonstrate the flexibility and feasibility of PINO-CDE for one toy example and two engineering applications: vehicle-track coupled dynamics (VTCD) and reliability assessment for a four-storey building (uncertainty propagation). The performance of VTCD indicates that PINO-CDE outperforms existing software and deep learning-based methods in terms of efficiency and precision, respectively. For the uncertainty propagation task, PINO-CDE provides higher-resolution results in less than a quarter of the time incurred when using the probability density evolution method (PDEM). This framework integrates engineering dynamics and deep learning technologies and may reveal a new concept for solving CDEs and uncertainty propagation.
Submitted 23 June, 2023; v1 submitted 1 October, 2022;
originally announced October 2022.
-
Retrieval Based Time Series Forecasting
Authors:
Baoyu Jing,
Si Zhang,
Yada Zhu,
Bin Peng,
Kaiyu Guan,
Andrew Margenot,
Hanghang Tong
Abstract:
Time series data appears in a variety of applications such as smart transportation and environmental monitoring. One of the fundamental problems for time series analysis is time series forecasting. Despite the success of recent deep time series forecasting methods, they require sufficient observation of historical values to make accurate forecasts. In other words, the ratio of the output length (or forecasting horizon) to the sum of the input and output lengths should be low enough (e.g., 0.3). As the ratio increases (e.g., to 0.8), the uncertainty for the forecasting accuracy increases significantly. In this paper, we show both theoretically and empirically that the uncertainty could be effectively reduced by retrieving relevant time series as references. In the theoretical analysis, we first quantify the uncertainty and show its connections to the Mean Squared Error (MSE). Then we prove that models with references are easier to learn than models without references since the retrieved references could reduce the uncertainty. To empirically demonstrate the effectiveness of the retrieval based time series forecasting models, we introduce a simple yet effective two-stage method, called ReTime, consisting of a relational retrieval and a content synthesis. We also show that ReTime can be easily adapted to the spatial-temporal time series and time series imputation settings. Finally, we evaluate ReTime on real-world datasets to demonstrate its effectiveness.
Submitted 27 September, 2022;
originally announced September 2022.
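The ratio discussed in the abstract above (output length over the sum of input and output lengths) is easy to compute for a concrete forecasting setup. The sketch below is only an illustration of that definition; the function name `horizon_ratio` and the example window lengths are assumptions, not values from the paper.

```python
def horizon_ratio(input_len, output_len):
    """Ratio of the forecasting horizon to the total window length."""
    if input_len < 0 or output_len < 0 or input_len + output_len == 0:
        raise ValueError("lengths must be non-negative and not both zero")
    return output_len / (input_len + output_len)


# 70 observed steps, 30 forecast steps: the "low enough" regime (0.3).
print(horizon_ratio(70, 30))
# 20 observed steps, 80 forecast steps: the high-uncertainty regime (0.8).
print(horizon_ratio(20, 80))
```

A ratio near 0 means a long history and a short horizon; a ratio near 1 means the model must forecast far ahead from very little observed data.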
-
The population properties of spinning black holes using Gravitational-wave Transient Catalog 3
Authors:
Hui Tong,
Shanika Galaudage,
Eric Thrane
Abstract:
Binary black holes formed via different pathways are predicted to have distinct spin properties. Measuring these properties with gravitational waves provides an opportunity to unveil the origins of binary black holes. Recent work draws conflicting conclusions regarding the spin distribution observed by LIGO--Virgo--KAGRA (LVK). Some analyses suggest that a fraction of the observed black-hole spin vectors are significantly misaligned (by $>90^\circ$) relative to the orbital angular momentum. This has been interpreted to mean that some binaries in the LVK dataset are assembled dynamically in dense stellar environments. Other analyses find support for a sub-population of binaries with negligible spin and no evidence for significantly misaligned spin -- a result consistent with the field formation scenario. In this work, we study the spin properties of binary black holes in the third LVK gravitational-wave transient catalog. We find that there is insufficient data to resolve the existence of a sub-population of binaries with negligible black-hole spin (the presence of this sub-population is supported by a modest Bayes factor of 1.7). We find modest support for the existence of mergers with extreme spin tilt angles $> 90^\circ$ (the presence of extreme-tilt binaries is favored by a Bayes factor of 10.1). Only one thing is clear: at least some of the LVK binaries formed in the field. At most $89\%$ of binaries are assembled dynamically ($99\%$ credibility), though the true branching fraction could be much lower, even negligible.
Submitted 13 October, 2022; v1 submitted 6 September, 2022;
originally announced September 2022.
-
ARIEL: Adversarial Graph Contrastive Learning
Authors:
Shengyu Feng,
Baoyu Jing,
Yada Zhu,
Hanghang Tong
Abstract:
Contrastive learning is an effective unsupervised method in graph representation learning, and the key component of contrastive learning lies in the construction of positive and negative samples. Previous methods usually utilize the proximity of nodes in the graph as the principle. Recently, the data-augmentation-based contrastive learning method has advanced to show great power in the visual domain, and some works extended this method from images to graphs. However, unlike data augmentation on images, data augmentation on graphs is far less intuitive, and it is much harder to provide high-quality contrastive samples, which leaves much room for improvement. In this work, by introducing an adversarial graph view for data augmentation, we propose a simple but effective method, Adversarial Graph Contrastive Learning (ARIEL), to extract informative contrastive samples within reasonable constraints. We develop a new technique called information regularization for stable training and use subgraph sampling for scalability. We generalize our method from node-level contrastive learning to the graph level by treating each graph instance as a super-node. ARIEL consistently outperforms the current graph contrastive learning methods for both node-level and graph-level classification tasks on real-world datasets. We further demonstrate that ARIEL is more robust in the face of adversarial attacks.
Submitted 5 February, 2024; v1 submitted 14 August, 2022;
originally announced August 2022.
-
Carnap's problem for intuitionistic propositional logic
Authors:
Haotian Tong,
Dag Westerståhl
Abstract:
We show that intuitionistic propositional logic is \emph{Carnap categorical}: the only interpretation of the connectives consistent with the intuitionistic consequence relation is the standard interpretation. This holds relative to the most well-known semantics with respect to which intuitionistic logic is sound and complete; among them Kripke semantics, Beth semantics, Dragalin semantics, and topological semantics. It also holds for algebraic semantics, although categoricity in that case is different in kind from categoricity relative to possible worlds style semantics.
Submitted 25 December, 2022; v1 submitted 29 July, 2022;
originally announced July 2022.
-
Privacy-preserving Graph Analytics: Secure Generation and Federated Learning
Authors:
Dongqi Fu,
Jingrui He,
Hanghang Tong,
Ross Maciejewski
Abstract:
Directly motivated by security-related applications from the Homeland Security Enterprise, we focus on the privacy-preserving analysis of graph data, which provides the crucial capacity to represent rich attributes and relationships. In particular, we discuss two directions, namely privacy-preserving graph generation and federated graph learning, which can jointly enable the collaboration among multiple parties each possessing private graph data. For each direction, we identify both "quick wins" and "hard problems". Towards the end, we demonstrate a user interface that can facilitate model explanation, interpretation, and visualization. We believe that the techniques developed in these directions will significantly enhance the capabilities of the Homeland Security Enterprise to tackle and mitigate the various security risks.
Submitted 30 June, 2022;
originally announced July 2022.
-
ICME 2022 Few-shot LOGO detection top 9 solution
Authors:
Ka Ho Tong,
Ka Wai Cheung,
Xiaochuan Yu
Abstract:
The ICME-2022 few-shot logo detection competition was held in May 2022. Participants were required to develop a single model to detect logos by handling tiny logo instances, similar brands, and adversarial images at the same time, with limited annotations. Our team achieved ranks 16 and 11 in the first and second rounds of the competition, respectively, with a final rank of 9th. This technical report summarizes the major techniques we used in this competition, and potential improvements.
Submitted 22 June, 2022;
originally announced June 2022.
-
Geometric Matrix Completion via Sylvester Multi-Graph Neural Network
Authors:
Boxin Du,
Changhe Yuan,
Fei Wang,
Hanghang Tong
Abstract:
Despite the success of Sylvester equation empowered methods on various graph mining applications, such as semi-supervised label learning and network alignment, there also exist several limitations. The Sylvester equation's inability to model non-linear relations and its inflexibility in tuning towards different tasks restrict its performance. In this paper, we propose an end-to-end neural framework, SYMGNN, which consists of a multi-network neural aggregation module and a prior multi-network association incorporation learning module. The proposed framework inherits the key ideas of the Sylvester equation, and meanwhile generalizes it to overcome the aforementioned limitations. Empirical evaluations on real-world datasets show that the instantiations of SYMGNN overall outperform the baselines in the geometric matrix completion task, and its low-rank instantiation could further reduce the memory consumption by 16.98\% on average.
Submitted 19 June, 2022;
originally announced June 2022.
-
Exploring universal characteristics of neutron star matter with relativistic \textit{ab initio} equations of state
Authors:
Sibo Wang,
Chencan Wang,
Hui Tong
Abstract:
Starting from the relativistic realistic nucleon-nucleon ($NN$) interactions, a newly developed relativistic \textit{ab initio} method, i.e., the relativistic Brueckner-Hartree-Fock (RBHF) theory in the full Dirac space, is employed to study neutron star properties. First, the one-to-one correspondence relation between gravitational redshift and mass is established and used to infer the mass of isolated neutron stars in combination with gravitational redshift measurements. Next, the ratio of the moment of inertia $I$ to $MR^2$ as a function of the compactness $M/R$ is obtained, which is consistent with the universal relations in the literature. The moment of inertia for the $1.338M_\odot$ pulsar PSR J0737-3039A, $I_{1.338M_\odot}$, is predicted to be $1.356\times10^{45}$, $1.381\times10^{45}$, and $1.407\times10^{45}\ \mathrm{g~cm^2}$ by the RBHF theory in the full Dirac space with $NN$ interactions Bonn A, B, and C, respectively. Finally, the quadrupole moment of a neutron star is calculated under the slow-rotation and small-tidal-deformation approximation. The equations of state constructed by the RBHF theory in the full Dirac space, together with those by the projection method and momentum-independence approximation, conform to universal $I$-Love-$Q$ relations as well. By combining the tidal deformability from GW170817 and the universal relations from relativistic \textit{ab initio} methods, the moment of inertia of a neutron star with 1.4 solar masses is also deduced as $I_{1.4M_\odot}=1.22^{+0.40}_{-0.25}\times 10^{45}\ \mathrm{g~cm^2}$.
Submitted 25 October, 2022; v1 submitted 17 June, 2022;
originally announced June 2022.
-
Schema-Guided Event Graph Completion
Authors:
Hongwei Wang,
Zixuan Zhang,
Sha Li,
Jiawei Han,
Yizhou Sun,
Hanghang Tong,
Joseph P. Olive,
Heng Ji
Abstract:
We tackle a new task, event graph completion, which aims to predict missing event nodes for event graphs. Existing link prediction or graph completion methods have difficulty dealing with event graphs because they are usually designed for a single large graph such as a social network or a knowledge graph, rather than multiple small dynamic event graphs. Moreover, they can only predict missing edges rather than missing nodes. In this work, we propose to utilize event schema, a template that describes the stereotypical structure of event graphs, to address the above issues. Our schema-guided event graph completion approach first maps an instance event graph to a subgraph of the schema graph by a heuristic subgraph matching algorithm. Then it predicts whether a candidate event node in the schema graph should be added to the instantiated schema subgraph by characterizing two types of local topology of the schema graph: neighbors of the candidate node and the subgraph, and paths that connect the candidate node and the subgraph. These two modules are later combined together for the final prediction. We also propose a self-supervised strategy to construct training samples, as well as an inference algorithm that is specifically designed to complete event graphs. Extensive experimental results on four datasets demonstrate that our proposed method achieves state-of-the-art performance, with 4.3% to 19.4% absolute F1 gains over the best baseline method on the four datasets.
Submitted 6 June, 2022;
originally announced June 2022.
-
CoNSoLe: Convex Neural Symbolic Learning
Authors:
Haoran Li,
Yang Weng,
Hanghang Tong
Abstract:
Learning the underlying equation from data is a fundamental problem in many disciplines. Recent advances rely on Neural Networks (NNs) but do not provide theoretical guarantees in obtaining the exact equations owing to the non-convexity of NNs. In this paper, we propose Convex Neural Symbolic Learning (CoNSoLe) to seek convexity under mild conditions. The main idea is to decompose the recovering process into two steps and convexify each step. In the first step of searching for the right symbols, we convexify deep Q-learning. The key is to maintain double convexity for both the negative Q-function and the negative reward function in each iteration, leading to provable convexity of the negative optimal Q-function to learn the true symbol connections. Conditioned on the exact searching result, we construct a Locally Convex equation Learner (LoCaL) neural network to convexify the estimation of symbol coefficients. With such a design, we quantify a large region with strict convexity in the loss surface of LoCaL for commonly used physical functions. Finally, we demonstrate the superior performance of the CoNSoLe framework over the state-of-the-art on a diverse set of datasets.
Submitted 12 October, 2022; v1 submitted 1 June, 2022;
originally announced June 2022.
-
COIN: Co-Cluster Infomax for Bipartite Graphs
Authors:
Baoyu Jing,
Yuchen Yan,
Yada Zhu,
Hanghang Tong
Abstract:
Bipartite graphs are powerful data structures to model interactions between two types of nodes, which have been used in a variety of applications, such as recommender systems, information retrieval, and drug discovery. A fundamental challenge for bipartite graphs is how to learn informative node embeddings. Despite the success of recent self-supervised learning methods on bipartite graphs, their objectives discriminate instance-wise positive and negative node pairs, which could contain cluster-level errors. In this paper, we introduce a novel co-cluster infomax (COIN) framework, which captures the cluster-level information by maximizing the mutual information of co-clusters. Different from previous infomax methods which estimate mutual information by neural networks, COIN could easily calculate mutual information. Besides, COIN is an end-to-end co-clustering method which can be trained jointly with other objective functions and optimized via back-propagation. Furthermore, we also provide theoretical analysis for COIN. We theoretically prove that COIN is able to effectively increase the mutual information of node embeddings and COIN is upper-bounded by the prior distributions of nodes. We extensively evaluate the proposed COIN framework on various benchmark datasets and tasks to demonstrate the effectiveness of COIN.
Submitted 2 November, 2022; v1 submitted 31 May, 2022;
originally announced June 2022.
-
PINO-MBD: Physics-informed Neural Operator for Solving Coupled ODEs in Multi-body Dynamics
Authors:
Wenhao Ding,
Qing He,
Hanghang Tong,
Ping Wang
Abstract:
In multi-body dynamics, the motion of a complicated physical object is described as a coupled ordinary differential equation system with multiple unknown solutions. Engineers need to constantly adjust the object to meet requirements at the design stage, where a highly efficient solver is needed. The rise of machine learning-based partial differential equation solvers can meet this need. These solvers can be classified into two categories: approximating the solution function (Physics-informed neural network) and learning the solution operator (Neural operator). The recently proposed physics-informed neural operator (PINO) gains advantages from both categories by embedding physics equations into the loss function of a neural operator. Following this state-of-the-art concept, we propose the physics-informed neural operator for coupled ODEs in multi-body dynamics (PINO-MBD), which learns the mapping between parameter spaces and solution spaces. Once PINO-MBD is trained, only one forward pass of the network is required to obtain the solutions for a new instance with different parameters. To handle the difficulty that coupled ODEs contain multiple solutions (instead of only one in normal PDE problems), two new physics embedding methods are also proposed. The experimental results on the classic vehicle-track coupled dynamics problem show state-of-the-art performance not only on solutions but also on the first and second derivatives of solutions.
Submitted 22 May, 2022;
originally announced May 2022.
-
SUGER: A Subgraph-based Graph Convolutional Network Method for Bundle Recommendation
Authors:
Zhenning Zhang,
Boxin Du,
Hanghang Tong
Abstract:
Bundle recommendation is an emerging research direction in recommender systems with a focus on recommending customized bundles of items for users. Although Graph Neural Networks (GNNs) have been applied to this problem and achieve superior performance, existing methods underexplore graph-level GNN methods, which exhibit great potential in traditional recommender systems. Furthermore, they usually lack transferability from one domain with sufficient supervision to another domain which might suffer from the label scarcity issue. In this work, we propose a subgraph-based Graph Neural Network model, SUGER, for bundle recommendation to handle these limitations. SUGER generates heterogeneous subgraphs around the user-bundle pairs, and then maps those subgraphs to the users' preference predictions via neural relational graph propagation. Experimental results show that SUGER significantly outperforms the state-of-the-art baselines in both the basic and the transfer bundle recommendation problems.
Submitted 5 May, 2022;
originally announced May 2022.
-
Trustworthy Graph Neural Networks: Aspects, Methods and Trends
Authors:
He Zhang,
Bang Wu,
Xingliang Yuan,
Shirui Pan,
Hanghang Tong,
Jian Pei
Abstract:
Graph neural networks (GNNs) have emerged as a series of competent graph learning methods for diverse real-world scenarios, ranging from daily applications like recommendation systems and question answering to cutting-edge technologies such as drug discovery in life sciences and n-body simulation in astrophysics. However, task performance is not the only requirement for GNNs. Performance-oriented GNNs have exhibited potential adverse effects like vulnerability to adversarial attacks, unexplainable discrimination against disadvantaged groups, or excessive resource consumption in edge computing environments. To avoid these unintentional harms, it is necessary to build competent GNNs characterised by trustworthiness. To this end, we propose a comprehensive roadmap to build trustworthy GNNs from the view of the various computing technologies involved. In this survey, we introduce basic concepts and comprehensively summarise existing efforts for trustworthy GNNs from six aspects, including robustness, explainability, privacy, fairness, accountability, and environmental well-being. Additionally, we highlight the intricate cross-aspect relations between the above six aspects of trustworthy GNNs. Finally, we present a thorough overview of trending directions for facilitating the research and industrialisation of trustworthy GNNs.
Submitted 21 February, 2024; v1 submitted 15 May, 2022;
originally announced May 2022.