-
Galaxy Mergers in the Epoch of Reionization I: A JWST Study of Pair Fractions, Merger Rates, and Stellar Mass Accretion Rates at $z = 4.5-11.5$
Authors:
Qiao Duan,
Christopher J. Conselice,
Qiong Li,
Duncan Austin,
Thomas Harvey,
Nathan J. Adams,
Kenneth J. Duncan,
James Trussler,
Leonardo Ferreira,
Lewi Westcott,
Honor Harris,
Rogier A. Windhorst,
Benne W. Holwerda,
Thomas J. Broadhurst,
Dan Coe,
Seth H. Cohen,
Simon P. Driver,
Brenda Frye,
Norman A. Grogin,
Nimish P. Hathi,
Rolf A. Jansen,
Anton M. Koekemoer,
Madeline A. Marshall,
Mario Nonino,
Rafael Ortiz III
, et al. (7 additional authors not shown)
Abstract:
We present a full analysis of galaxy major merger pair fractions, merger rates, and mass accretion rates, thus uncovering the role of mergers in galaxy formation at the earliest previously unexplored epoch of $4.5<z<11.5$. We target galaxies with masses $\log_{10}(\mathrm{M}_*/\mathrm{M}_\odot) = 8.0 - 10.0$, utilizing data from eight JWST Cycle-1 fields (CEERS, JADES GOODS-S, NEP-TDF, NGDEEP, GLASS, El-Gordo, SMACS-0723, MACS-0416), covering an unmasked area of 189.36 $\mathrm{arcmin}^2$. We develop a new probabilistic pair-counting methodology that integrates full photometric redshift posteriors and corrects for detection incompleteness to quantify close pairs with physical projected separations between 20 and 50 kpc. Our analysis reveals an increase in pair fractions up to $z = 8$, reaching $0.211 \pm 0.065$, followed by a statistically flat evolution to $z = 11.5$. We find that the galaxy merger rate increases from the local Universe up to $z = 6$ and then stabilizes at a value of $\sim 6$ Gyr$^{-1}$ up to $z = 11.5$. We fit both a power-law and a power-law + exponential model to our pair fraction and merger rate redshift evolution, finding that the latter model describes the trends more accurately, particularly at $z = 8.0 - 11.5$. In addition, we measure that the average galaxy increases its stellar mass due to mergers by a factor of $2.77 \pm 0.99$ from redshift $z = 10.5$ to $z = 5.0$. Lastly, we investigate the impact of mergers on galaxy stellar mass growth, revealing that mergers contribute $71 \pm 25\%$ as much to galaxy stellar mass increases as star formation from gas. This indicates that mergers drive about half of galaxy assembly at high redshift.
Submitted 12 July, 2024;
originally announced July 2024.
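The closing claim of the abstract above ("mergers contribute $71 \pm 25\%$ as much ... as star formation", hence "about half of galaxy assembly") can be checked with a line of arithmetic. The sketch below is only an illustration under two assumptions that are not stated in the abstract: that mergers and star formation are the only two growth channels, and that the quoted uncertainty can be bracketed by evaluating the endpoints of the ratio. The function name `merger_share` is hypothetical.

```python
def merger_share(ratio: float) -> float:
    """Share of total stellar mass growth due to mergers.

    `ratio` is merger-driven growth divided by star-formation-driven growth.
    Assumes (for illustration only) that these are the only two channels.
    """
    return ratio / (1.0 + ratio)


# Central value and simple endpoint bracket for 0.71 +/- 0.25.
central = merger_share(0.71)
low = merger_share(0.71 - 0.25)
high = merger_share(0.71 + 0.25)

print(f"merger share of growth: {central:.2f} ({low:.2f} to {high:.2f})")
```

Under these assumptions the merger share comes out somewhat below one half at the central value and approaches one half at the upper end of the quoted range, which is consistent with the abstract's "about half" reading.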
-
UIFV: Data Reconstruction Attack in Vertical Federated Learning
Authors:
Jirui Yang,
Peng Chen,
Zhihui Lu,
Qiang Duan,
Yubing Bao
Abstract:
Vertical Federated Learning (VFL) facilitates collaborative machine learning without the need for participants to share raw private data. However, recent studies have revealed privacy risks where adversaries might reconstruct sensitive features through data leakage during the learning process. Although data reconstruction methods based on gradient or model information are somewhat effective, they reveal limitations in VFL application scenarios. This is because these traditional methods heavily rely on specific model structures and/or have strict limitations on application scenarios. To address this, our study introduces the Unified InverNet Framework into VFL, which yields a novel and flexible approach (dubbed UIFV) that leverages intermediate feature data to reconstruct original data, instead of relying on gradients or model details. The intermediate feature data is the feature exchanged by different participants during the inference phase of VFL. Experiments on four datasets demonstrate that our methods significantly outperform state-of-the-art techniques in attack precision. Our work exposes severe privacy vulnerabilities within VFL systems that pose real threats to practical VFL applications and thus confirms the necessity of further enhancing privacy protection in the VFL architecture.
Submitted 18 June, 2024;
originally announced June 2024.
-
EPOCHS Paper X: Environmental effects on Galaxy Formation and Protocluster Galaxy candidates at $4.5<z<10$ from JWST observations
Authors:
Qiong Li,
Christopher J. Conselice,
Florian Sarron,
Tom Harvey,
Duncan Austin,
Nathan Adams,
James A. A. Trussler,
Qiao Duan,
Leonardo Ferreira,
Lewi Westcott,
Honor Harris,
Hervé Dole,
Norman A. Grogin,
Brenda Frye,
Anton M. Koekemoer,
Clayton Robertson,
Rogier A. Windhorst,
Maria del Carmen Polletta,
Nimish P. Hathi
Abstract:
In this paper we describe our search for galaxy protocluster candidates at $4.5< z < 10$ and explore the environmental and physical properties of their member galaxies identified through JWST wide-field surveys within the CEERS, JADES, and PEARLS NEP-TDF fields. Combining with HST data, we identify 2948 robust $z>4.5$ candidates within an area of 185.4 arcmin$^2$. We determine nearest neighbour statistics and galaxy environments. We find that high-$z$ galaxies in overdense environments exhibit higher star formation activity compared to those in underdense regions. Galaxies in dense environments have a slightly increased SFR at a given mass compared with galaxies in the lower density environments. At the high mass end we also find a gradual flattening of the $M_{\star}$-SFR slope. We find that galaxies in high-density regions often have redder UV slopes than those in low-density regions, suggesting more dust extinction, weaker Lyman-alpha emission and / or a higher damped Lyman-alpha absorption. We also find that the mass-size relation remains consistent and statistically similar across all environments. Furthermore, we quantitatively assess the probability of a galaxy belonging to a protocluster candidate. In total, we identified 26 overdensities at $z=5-7$ and estimate their dark matter halo masses. We find that all protocluster candidates could evolve into clusters with $M_{\rm halo} > 10^{14}M_{\odot}$ at $z = 0$, thereby supporting the theoretical and simulation predictions of cluster formation. Notably, this marks an early search for protocluster candidates in JWST wide field based on photometric data, providing valuable candidates to study cosmic structure formation at the early stages.
Submitted 27 May, 2024;
originally announced May 2024.
-
C3LLM: Conditional Multimodal Content Generation Using Large Language Models
Authors:
Zixuan Wang,
Qinkai Duan,
Yu-Wing Tai,
Chi-Keung Tang
Abstract:
We introduce C3LLM (Conditioned-on-Three-Modalities Large Language Models), a novel framework combining three tasks of video-to-audio, audio-to-text, and text-to-audio together. C3LLM adapts the Large Language Model (LLM) structure as a bridge for aligning different modalities, synthesizing the given conditional information, and making multimodal generation in a discrete manner. Our contributions are as follows. First, we adapt a hierarchical structure for audio generation tasks with pre-trained audio codebooks. Specifically, we train the LLM to generate audio semantic tokens from the given conditions, and further use a non-autoregressive transformer to generate different levels of acoustic tokens in layers to better enhance the fidelity of the generated audio. Second, based on the intuition that LLMs were originally designed for discrete tasks with the next-word prediction method, we use the discrete representation for audio generation and compress their semantic meanings into acoustic tokens, similar to adding "acoustic vocabulary" to LLM. Third, our method combines the previous tasks of audio understanding, video-to-audio generation, and text-to-audio generation together into one unified model, providing more versatility in an end-to-end fashion. Our C3LLM achieves improved results through various automated evaluation metrics, providing better semantic alignment compared to previous methods.
Submitted 25 May, 2024;
originally announced May 2024.
-
Automated Metaheuristic Algorithm Design with Autoregressive Learning
Authors:
Qi Zhao,
Tengfei Liu,
Bai Yan,
Qiqi Duan,
Jian Yang,
Yuhui Shi
Abstract:
Automated design of metaheuristic algorithms offers an attractive avenue to reduce human effort and gain enhanced performance beyond human intuition. Current automated methods design algorithms within a fixed structure and operate from scratch. This leaves a clear gap: they neither fully explore the potential of the metaheuristic family nor benefit from prior design experience. To bridge the gap, this paper proposes an autoregressive learning-based designer for automated design of metaheuristic algorithms. Our designer formulates metaheuristic algorithm design as a sequence generation task, and harnesses an autoregressive generative network to handle the task. This offers two advances. First, through autoregressive inference, the designer generates algorithms with diverse lengths and structures, making it possible to fully explore the potential of the metaheuristic family. Second, prior design knowledge learned and accumulated in neurons of the designer can be retrieved for designing algorithms for future problems, paving the way to continual design of algorithms for open-ended problem-solving. Extensive experiments on numerical benchmarks and real-world problems reveal that the proposed designer generates algorithms that outperform all human-created baselines on 24 out of 25 test problems. The generated algorithms display various structures and behaviors, reasonably fitting for different problem-solving contexts. Code will be released after paper publication.
Submitted 6 May, 2024;
originally announced May 2024.
-
EPOCHS III: Unbiased UV continuum slopes at 6.5<z<13 from combined PEARLS GTO and public JWST NIRCam imaging
Authors:
Duncan Austin,
Christopher J. Conselice,
Nathan J. Adams,
Thomas Harvey,
Qiao Duan,
James Trussler,
Qiong Li,
Ignas Juodzbalis,
Katherine Ormerod,
Leonardo Ferreira,
Lewi Westcott,
Honor Harris,
Stephen M. Wilkins,
Rachana Bhatawdekar,
Joseph Caruana,
Dan Coe,
Seth H. Cohen,
Simon P. Driver,
Jordan C. J. D'Silva,
Brenda Frye,
Lukas J. Furtak,
Norman A. Grogin,
Nimish P. Hathi,
Benne W. Holwerda,
Rolf A. Jansen
, et al. (12 additional authors not shown)
Abstract:
We present an analysis of rest-frame UV continuum slopes, $β$, using a sample of 1011 galaxies at $6.5<z<13$ from the EPOCHS photometric sample collated from the GTO PEARLS and public ERS/GTO/GO (JADES, CEERS, NGDEEP, GLASS) JWST NIRCam imaging across $178.9~\mathrm{arcmin}^2$ of unmasked blank sky. We correct our UV slopes for the photometric error coupling bias using $200,000$ power law SEDs for each $β=\{-1,-1.5,-2,-2.5,-3\}$ in each field, finding biases as large as $Δβ\simeq-0.55$ for the lowest SNR galaxies in our sample. Additionally, we simulate the impact of rest-UV line emission (including Ly$α$) and damped Ly$α$ systems on our measured $β$, finding biases as large as $0.5-0.6$ for the most extreme systems. We find a decreasing trend with redshift of $β=-1.51\pm0.08-(0.097\pm0.010)\times z$, with potential evidence for Pop.~III stars or top-heavy initial mass functions (IMFs) in a subsample of 68 $β+σ_β<-2.8$ galaxies. At $z\simeq11.5$, we measure an extremely blue $β(M_{\mathrm{UV}}=-19)=-2.73\pm0.06$, deviating from simulations, indicative of low-metallicity galaxies with non-zero Lyman continuum escape fractions $f_{\mathrm{esc, LyC}}\gtrsim0$ and minimal dust content. The observed steepening of $\mathrm{d}β/\mathrm{d}\log_{10}(M_{\star}/\mathrm{M}_{\odot})$ from $0.22\pm0.02$ at $z=7$ to $0.81\pm0.13$ at $z=11.5$ implies that dust produced in core-collapse supernovae (SNe) at early times may be ejected via outflows from low mass galaxies. We also observe a flatter $\mathrm{d}β/\mathrm{d}M_{\mathrm{UV}}=0.03\pm0.02$ at $z=7$ and a shallower $\mathrm{d}β/\mathrm{d}\log_{10}(M_{\star} / \mathrm{M}_{\odot})$ at $z<11$ than seen by HST, unveiling a new population of low mass, faint, galaxies reddened by dust produced in the stellar winds of asymptotic giant branch (AGB) stars or carbon-rich Wolf-Rayet binaries.
Submitted 16 April, 2024;
originally announced April 2024.
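The redshift trend quoted in the abstract above, $β=-1.51\pm0.08-(0.097\pm0.010)\times z$, is a straight line in $z$, so it is easy to evaluate at the redshifts the abstract mentions. The sketch below uses only the central values of the fit and ignores the quoted uncertainties; the function name `beta_fit` is hypothetical and not taken from the paper. Note that the abstract's $β(M_{\mathrm{UV}}=-19)=-2.73\pm0.06$ at $z\simeq11.5$ is measured at fixed UV magnitude, so it need not coincide with the sample-wide trend evaluated here.

```python
def beta_fit(z: float, intercept: float = -1.51, slope: float = -0.097) -> float:
    """Central-value linear trend of UV slope with redshift: beta = intercept + slope * z."""
    return intercept + slope * z


# Evaluate the trend at the two redshifts highlighted in the abstract.
for z in (7.0, 11.5):
    print(f"z = {z:4.1f}  ->  beta = {beta_fit(z):.3f}")
```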
-
LAECIPS: Large Vision Model Assisted Adaptive Edge-Cloud Collaboration for IoT-based Perception System
Authors:
Shijing Hu,
Ruijun Deng,
Xin Du,
Zhihui Lu,
Qiang Duan,
Yi He,
Shih-Chia Huang,
Jie Wu
Abstract:
Recent large vision models (e.g., SAM) have great potential to facilitate intelligent perception with high accuracy. Yet resource constraints in the IoT environment tend to prevent such large vision models from being deployed locally, and running them remotely incurs considerable inference latency, making it difficult to support real-time applications such as autonomous driving and robotics. Edge-cloud collaboration with large-small model co-inference offers a promising approach to achieving high inference accuracy and low latency. However, existing edge-cloud collaboration methods are tightly coupled with the model architecture and cannot adapt to the dynamic data drifts in heterogeneous IoT environments. To address these issues, we propose LAECIPS, a new edge-cloud collaboration framework. In LAECIPS, both the large vision model on the cloud and the lightweight model on the edge are plug-and-play. We design an edge-cloud collaboration strategy based on hard input mining, optimized for both high accuracy and low latency. We propose to update the edge model and its collaboration strategy with the cloud under the supervision of the large vision model, so as to adapt to the dynamic IoT data streams. Theoretical analysis of LAECIPS proves its feasibility. Experiments conducted in a robotic semantic segmentation system using real-world datasets show that LAECIPS outperforms its state-of-the-art competitors in accuracy, latency, and communication overhead while having better adaptability to dynamic environments.
Submitted 16 April, 2024;
originally announced April 2024.
-
Dust Extinction Measures for $z\sim 8$ Galaxies using Machine Learning on JWST Imaging
Authors:
Kwan Lin Kristy Fu,
Christopher J. Conselice,
Leonardo Ferreira,
Thomas Harvey,
Qiao Duan,
Nathan Adams,
Duncan Austin
Abstract:
We present the results of a machine learning study to measure the dust content of galaxies observed with JWST at z > 6 through the use of trained neural networks based on high-resolution IllustrisTNG simulations. Dust is an important unknown in the evolution and observability of distant galaxies and is degenerate with other stellar population features through spectral energy fitting. As such, we develop and test a new SED-independent machine learning method to predict dust attenuation and sSFR of high redshift (z > 6) galaxies. Simulated galaxies were constructed using the IllustrisTNG model, with a variety of dust contents parameterized by E(B-V) and A(V) values, then used to train Convolutional Neural Network (CNN) models using supervised learning through a regression model. We demonstrate that within the context of these simulations, our single and multi-band models are able to predict dust content of distant galaxies to within a 1$σ$ dispersion of A(V) $\sim 0.1$. Applied to spectroscopically confirmed z > 6 galaxies from the JADES and CEERS programs, our models predicted attenuation values of A(V) < 0.7 for all systems, with a low average (A(V) = 0.28). Our CNN predictions show larger dust attenuation but lower amounts of star formation compared to SED fitted values. Both results show that distant galaxies with confirmed spectroscopy are not extremely dusty, although this sample is potentially significantly biased. We discuss these issues and present ideas on how to accurately measure dust features at the highest redshifts using a combination of machine learning and SED fitting.
Submitted 27 March, 2024;
originally announced March 2024.
-
Ray Theory of Waves
Authors:
K. F. Ren,
M. Yang,
Q. Duan,
C. Rozé,
C. Zhang,
X. Han
Abstract:
In order to deal with the interaction of an electromagnetic wave with large homogeneous objects of arbitrary shape with smooth surfaces, we develop the ray theory of waves (RTW), which is composed of the vectorial complex ray model (VCRM) and VCRM-based singularity theory. By introducing the wavefront curvature as an intrinsic property of rays, VCRM makes it possible to predict the amplitude and the phase of the field at any point rigorously in the sense of the ray model. Its combination with the singularity theory remedies the discontinuity in the ray model. In this letter, the wavefront equation, the key physical law of VCRM describing the relation between the wavefront curvatures of the incident wave and the refracted/reflected wave, is derived for the most general case of three-dimensional scattering. The strategy of the calculation scheme in RTW is described. Typical applications to the prediction of the rainbow patterns of a spheroidal drop are presented. The comparison to a rigorous numerical method, the multilevel fast multipole algorithm, shows that RTW can predict the scattered field very quickly and precisely, even in the vicinity of caustics.
Submitted 19 March, 2024;
originally announced March 2024.
-
EPOCHS IV: SED Modelling Assumptions and their impact on the Stellar Mass Function at 6.5 < z < 13.5 using PEARLS and public JWST observations
Authors:
Thomas Harvey,
Christopher Conselice,
Nathan J. Adams,
Duncan Austin,
Ignas Juodzbalis,
James Trussler,
Qiong Li,
Katherine Ormerod,
Leonardo Ferreira,
Qiao Duan,
Lewi Westcott,
Honor Harris,
Rachana Bhatawdekar,
Dan Coe,
Seth H. Cohen,
Joseph Caruana,
Cheng Cheng,
Simon P. Driver,
Brenda Frye,
Lukas J. Furtak,
Norman A. Grogin,
Nimish P. Hathi,
Benne W. Holwerda,
Rolf A. Jansen,
Anton M. Koekemoer
, et al. (10 additional authors not shown)
Abstract:
We utilize deep JWST NIRCam observations for the first direct constraints on the Galaxy Stellar Mass Function (GSMF) at z>10. Our EPOCHS v1 sample includes 1120 galaxy candidates at 6.5<z<13.5 taken from a consistent reduction and analysis of publicly available deep JWST NIRCam data covering the PEARLS, CEERS, GLASS, JADES GOODS-S, NGDEEP, and SMACS0723 surveys, totalling 187 arcmin$^2$. We investigate the impact of SED fitting methods, assumed star formation histories (SFH), dust laws, and priors on galaxy masses and the resultant GSMF. Whilst our fiducial GSMF agrees with the literature at z<13.5, we find that the assumed SFH model has a large impact on the GSMF and stellar mass density (SMD), finding a 0.75 dex increase in the SMD at z=10.5 between a flexible non-parametric and standard parametric SFH. Overall, we find a flatter SMD evolution at z > 9 than some studies predict, suggesting a rapid buildup of stellar mass in the early Universe. We find no incompatibility between our results and those of standard cosmological models, as suggested previously, although the most massive galaxies may require a high star formation efficiency. We find that the 'Little Red Dot' galaxies dominate the z=7 GSMF at high masses, necessitating a better understanding of the relative contributions of AGN and stellar emission. We show that assuming a theoretically motivated top-heavy IMF reduces stellar mass by 0.5 dex without affecting fit quality, but our results remain consistent with existing cosmological models with a standard IMF.
Submitted 6 March, 2024;
originally announced March 2024.
-
Control water waves by metagratings
Authors:
Linkang Han,
Qilin Duan,
Junliang Duan,
Shan Zhu,
Shiming Chen,
Yuhang Yin,
Huanyang Chen
Abstract:
Metasurfaces and metagratings offer new platforms for electromagnetic wave control with significant responses. However, metasurfaces based on abrupt phase change and resonant structures suffer from high loss and face other challenges when applied to water waves, so they are not ideal for water wave control. We have discovered that non-resonant metagratings exhibit promising effects in water wave control. Leveraging the similarity between bridges and metagratings, we have developed a water wave metagrating model inspired by the Luoyang Bridge in ancient China. We conducted theoretical calculations and simulations on the metagrating and derived the equivalent anisotropic model of the metagrating. This model provides evidence that the metagrating can control water waves and achieve unidirectional surface water wave propagation. The accuracy of our theory is strongly supported by the clear observation of unidirectional propagation in simulations and in experiments conducted using a reduced version of the metagrating. This is the first water wave metagrating experiment, and the first time that unidirectional propagation of water waves has been seen in such an experiment. By combining complex gratings with real bridges, we explore the physics embedded in an ancient structure, the Luoyang Bridge, which is of great significance for water wave metagrating design, as well as for the development and preservation of ancient bridges.
Submitted 27 November, 2023;
originally announced November 2023.
-
Superconductivity and Charge-density-wave-like Transition in Th2Cu4As5
Authors:
Qing-Chen Duan,
Shao-Hua Liu,
Bai-Zhuo Li,
Jiao-Jiao Meng,
Wu-Zhang Yang,
Yi Liu,
Yi-Qiang Lin,
Si-Qi Wu,
Jia-Yi Lu,
Jin-Ke Bao,
Yu-Sen Xiao,
Xin-Yu Zhao,
Yu-Xue Mei,
Yu-Ping Sun,
Dan Yu,
Shu-Gang Tan,
Qiang Jing,
Rui-Dan Zhong,
Yong-Liang Chen,
Yong Zhao,
Zhi Ren,
Cao Wang,
Guang-Han Cao
Abstract:
We report the synthesis, crystal structure, and physical properties of a novel ternary compound, Th$_2$Cu$_4$As$_5$. The material crystallizes in a tetragonal structure with lattice parameters $a=4.0716(1)$ Å and $c=24.8131(4)$ Å. Its structure can be described as an alternating stacking of fluorite-type Th$_2$As$_2$ layers with antifluorite-type double-layered Cu$_4$As$_3$ slabs. Measurements of electrical resistivity, magnetic susceptibility and specific heat reveal that Th$_2$Cu$_4$As$_5$ undergoes a bulk superconducting transition at 4.2 K. Moreover, all these physical quantities exhibit anomalies at 48 K, where the Hall coefficient changes sign. These findings suggest a charge-density-wave-like (CDW) transition, making Th$_2$Cu$_4$As$_5$ a rare example for studying the interplay between CDW and superconductivity.
Submitted 22 November, 2023;
originally announced November 2023.
-
Observation of a Topological Phase Transition in Random Coaxial Cable Structures with Chiral Symmetry
Authors:
D. M. Whittaker,
Maxine M. McCarthy,
Qingqing Duan
Abstract:
We report an experimental study of the disordered Su-Schrieffer-Heeger (SSH) model, implemented in a system of coaxial cables, whose radio frequency properties map on to the SSH Hamiltonian. By measuring multiple chains with random hopping terms, we demonstrate the presence of a topologically protected state, with frequency variation of less than 0.2% over the ensemble. Connecting the ends of the chains to form loops, we observe a topological phase transition, characterised by the closure of the band gap and the appearance of states which are delocalised, despite the strong disorder.
Submitted 18 November, 2023;
originally announced November 2023.
-
Distributed Evolution Strategies with Multi-Level Learning for Large-Scale Black-Box Optimization
Authors:
Qiqi Duan,
Chang Shao,
Guochen Zhou,
Minghan Zhang,
Qi Zhao,
Yuhui Shi
Abstract:
In the post-Moore era, the main performance gains of black-box optimizers increasingly depend on parallelism, especially for large-scale optimization (LSO). Here we propose to parallelize the well-established covariance matrix adaptation evolution strategy (CMA-ES) and in particular one of its latest LSO variants, called limited-memory CMA-ES (LM-CMA). To achieve efficiency while approximating its powerful invariance property, we present a multilevel learning-based meta-framework for distributed LM-CMA. Owing to its hierarchically organized structure, Meta-ES is well-suited to implement our distributed meta-framework, wherein the outer-ES controls strategy parameters while all parallel inner-ESs run the serial LM-CMA with different settings. For the distribution mean update of the outer-ES, both the elitist and multi-recombination strategy are used in parallel to avoid stagnation and regression, respectively. To exploit spatiotemporal information, the global step-size adaptation combines Meta-ES with the parallel cumulative step-size adaptation. After each isolation time, our meta-framework employs both the structure and parameter learning strategy to combine aligned evolution paths for CMA reconstruction. Experiments on a set of large-scale benchmarking functions with memory-intensive evaluations, arguably reflecting many data-driven optimization problems, validate the benefits (e.g., effectiveness w.r.t. solution quality, and adaptability w.r.t. second-order learning) and costs of our meta-framework.
Submitted 2 November, 2023; v1 submitted 8 October, 2023;
originally announced October 2023.
-
Adding Value to JWST Spectra and Photometry: Stellar Population and Star Formation Properties of Spectroscopically Confirmed JADES and CEERS Galaxies at $z > 7$
Authors:
Qiao Duan,
Christopher J. Conselice,
Qiong Li,
Thomas Harvey,
Duncan Austin,
Katherine Ormerod,
James Trussler,
Nathan Adams
Abstract:
In this paper, we discuss measurements of the stellar population and star forming properties for 43 spectroscopically confirmed publicly available high-redshift $z > 7$ JWST galaxies in the JADES and CEERS observational programs. We carry out a thorough study investigating the relationship between spectroscopic features and photometrically derived ones, including from spectral energy distribution…
▽ More
In this paper, we discuss measurements of the stellar population and star forming properties for 43 spectroscopically confirmed publicly available high-redshift $z > 7$ JWST galaxies in the JADES and CEERS observational programs. We carry out a thorough study investigating the relationship between spectroscopic features and photometrically derived ones, including from spectral energy distribution (SED) fitting of models, as well as morphological and structural properties. We find that the star formation rates (SFRs) measured from H$β$ line emission are higher than those estimated from Bayesian SED fitting and UV luminosity, with ratios SFR$_{Hβ}$/ SFR$_{UV}$ ranging from 2~13. This is a sign that the star formation history is consistently rising given the timescales of H$β$ vs UV star formation probes. In addition, we investigate how well equivalent widths (EWs) of H$β$ $λ$4861, [O III] $λ$4959, and [O III] $λ$5007 can be measured from photometry, finding that on average the EW derived from photometric excesses in filters is 30% smaller than the direct spectroscopic measurement. We also discover that a stack of the line emitting galaxies shows a distinct morphology after subtracting imaging that contains only the continuum. This gives us a first view of the line or ionized gas emission from $z > 7$ galaxies, demonstrating that this material has a similar distribution, statistically, as the continuum. We also compare the derived SFRs and stellar masses for both parametric and non-parametric star formation histories, where we find that 35% of our sample formed at least 30% of their stellar mass in recent (< 10 Myr) starburst events.
Submitted 26 September, 2023;
originally announced September 2023.
-
TKwinFormer: Top k Window Attention in Vision Transformers for Feature Matching
Authors:
Yun Liao,
Yide Di,
Hao Zhou,
Kaijun Zhu,
Mingyu Lu,
Yijia Zhang,
Qing Duan,
Junhui Liu
Abstract:
Local feature matching remains a challenging task, primarily due to difficulties in matching sparse keypoints and low-texture regions. The key to solving this problem lies in effectively and accurately integrating global and local information. To achieve this goal, we introduce an innovative local feature matching method called TKwinFormer. Our approach employs a multi-stage matching strategy to optimize the efficiency of information interaction. Furthermore, we propose a novel attention mechanism called Top K Window Attention, which facilitates global information interaction through window tokens prior to patch-level matching, resulting in improved matching accuracy. Additionally, we design an attention block to enhance attention between channels. Experimental results demonstrate that TKwinFormer outperforms state-of-the-art methods on various benchmarks. Code is available at: https://github.com/LiaoYun0x0/TKwinFormer.
Submitted 29 August, 2023;
originally announced August 2023.
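The idea of restricting global interaction to the most relevant windows can be illustrated with a small sketch. The abstract does not say how window tokens are formed or scored, so the mean pooling, the dot-product similarity, and the helper names `window_tokens` and `top_k_windows` below are all assumptions made for illustration, not the TKwinFormer implementation.

```python
from typing import List

Vector = List[float]


def window_tokens(windows: List[List[Vector]]) -> List[Vector]:
    """Summarize each window by the mean of its patch features (assumed pooling)."""
    tokens = []
    for patches in windows:
        dim = len(patches[0])
        tokens.append([sum(p[d] for p in patches) / len(patches) for d in range(dim)])
    return tokens


def top_k_windows(tokens: List[Vector], k: int) -> List[List[int]]:
    """For each window token, return the indices of the k most similar window tokens.

    Similarity is a plain dot product (an assumption); a window may select itself.
    """
    selected = []
    for query in tokens:
        scores = [sum(a * b for a, b in zip(query, key)) for key in tokens]
        ranked = sorted(range(len(tokens)), key=lambda j: scores[j], reverse=True)
        selected.append(ranked[:k])
    return selected


if __name__ == "__main__":
    # Three toy windows, one 2-D patch feature each.
    toy = [[[1.0, 0.0]], [[0.0, 1.0]], [[0.9, 0.1]]]
    print(top_k_windows(window_tokens(toy), k=2))
```

In a full model, patch-level attention would then presumably be computed only within the selected windows, which is where the cost saving over dense global attention would come from.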
-
A LiDAR-Inertial SLAM Tightly-Coupled with Dropout-Tolerant GNSS Fusion for Autonomous Mine Service Vehicles
Authors:
Yusheng Wang,
Yidong Lou,
Weiwei Song,
Bing Zhan,
Feihuang Xia,
Qigeng Duan
Abstract:
Multi-modal sensor integration has become a crucial prerequisite for real-world navigation systems. Recent studies have reported successful deployment of such systems in many fields. However, it is still challenging for navigation tasks in mine scenes due to satellite signal dropouts, degraded perception, and observation degeneracy. To solve this problem, we propose a LiDAR-inertial odometry method in this paper, utilizing both Kalman filter and graph optimization. The front-end consists of multiple parallel running LiDAR-inertial odometries, where the laser points, IMU, and wheel odometer information are tightly fused in an error-state Kalman filter. Instead of the commonly used feature points, we employ surface elements for registration. The back-end constructs a pose graph and jointly optimizes the pose estimation results from inertial, LiDAR odometry, and global navigation satellite system (GNSS). Since the vehicle has a long operation time inside the tunnel, the largely accumulated drift may not be fully corrected by the GNSS measurements. We hereby leverage a loop closure based re-initialization process to achieve full alignment. In addition, the system robustness is improved through handling data loss, stream consistency, and estimation error. The experimental results show that our system has a good tolerance to the long-period degeneracy with the cooperation of different LiDARs and surfel registration, achieving meter-level accuracy even for tens of minutes running during GNSS dropouts.
Submitted 22 August, 2023;
originally announced August 2023.
-
CAME: Contrastive Automated Model Evaluation
Authors:
Ru Peng,
Qiuyang Duan,
Haobo Wang,
Jiachen Ma,
Yanbo Jiang,
Yongjun Tu,
Xiu Jiang,
Junbo Zhao
Abstract:
The Automated Model Evaluation (AutoEval) framework entertains the possibility of evaluating a trained machine learning model without resorting to a labeled testing set. Despite the promise and some decent results, the existing AutoEval methods heavily rely on computing distribution shifts between the unlabelled testing set and the training set. We believe this reliance on the training set becomes another obstacle in shipping this technology to real-world ML development. In this work, we propose Contrastive Automatic Model Evaluation (CAME), a novel AutoEval framework that removes the training set from the loop. The core idea of CAME is based on a theoretical analysis which links the model performance with a contrastive loss. Further, with extensive empirical validation, we manage to set up a predictable relationship between the two, simply by deducing on the unlabeled/unseen testing set. The resulting framework CAME establishes new SOTA results for AutoEval by surpassing prior work significantly.
Submitted 21 August, 2023;
originally announced August 2023.
-
JIANG: Chinese Open Foundation Language Model
Authors:
Qinhua Duan,
Wenchao Gu,
Yujia Chen,
Wenxin Mao,
Zewen Tian,
Hui Cao
Abstract:
Large language model technology has advanced to the point of showcasing capabilities that come close to those of human beings across various tasks. This achievement has garnered significant interest from companies and scientific research institutions, leading to substantial investments in the research and development of these models. While numerous large models have emerged during this period, the majority of them have been trained primarily on English data. Although they exhibit decent performance in other languages, such as Chinese, their potential remains limited due to factors like vocabulary design and training corpus. Consequently, their ability to fully express their capabilities in Chinese falls short. To address this issue, we introduce the model named JIANG (Chinese pinyin of ginger) specifically designed for the Chinese language. We have gathered a substantial Chinese corpus to train the model and have also optimized its structure. The extensive experimental results demonstrate the excellent performance of our model.
Submitted 1 August, 2023;
originally announced August 2023.
-
Well-posedness of regular solutions for 3-D full compressible Navier-Stokes equations with degenerate viscosities and heat conductivity
Authors:
Qin Duan,
Zhouping Xin,
Shengguo Zhu
Abstract:
For the degenerate viscous and heat conductive compressible fluids, the momentum equations and the energy equation are degenerate both in the time evolution and spatial dissipation when vacuum appears, and then the physical entropy S behaves singularly, which makes it challenging to study the corresponding well-posedness of regular solutions with high order regularities of S near the vacuum. In this paper, for the physically important case that the coefficients of viscosities and heat conductivity depend on the absolute temperature θ in a power law of Chapman-Enskog, we identify a class of initial data admitting a local-in-time regular solution with far field vacuum to the Cauchy problem of the 3-D full CNS, and such a solution possesses the uniformly high order regularities for S near the vacuum. The key idea here is to study the vacuum problem in terms of the mass density ρ, velocity u and S instead of (ρ, u, θ), which makes it possible to compare the orders of the degeneracy of the time evolution and the spatial dissipations near the vacuum in terms of the powers of ρ. However, for heat conductive fluids, both a degenerate spatial dissipation and a source term related to \triangle ρ^{γ-1} will appear in the time evolution equation for S, which makes it formidable to study the propagation of regularities of S. Fortunately, based on some elaborate analysis of the intrinsic degenerate-singular structures of the 3-D full CNS, we can choose proper weights to control the behaviors of (ρ, u, S) by introducing an enlarged reformulated system, which includes a singular parabolic system for u, and one degenerate-singular parabolic equation for S. Then one can carry out a series of weighted energy estimates carefully designed for this reformulated system, which provides an effective propagation mechanism for S's high order regularities near the vacuum.
Submitted 29 April, 2024; v1 submitted 13 July, 2023;
originally announced July 2023.
-
MedFMC: A Real-world Dataset and Benchmark For Foundation Model Adaptation in Medical Image Classification
Authors:
Dequan Wang,
Xiaosong Wang,
Lilong Wang,
Mengzhang Li,
Qian Da,
Xiaoqiang Liu,
Xiangyu Gao,
Jun Shen,
Junjun He,
Tian Shen,
Qi Duan,
Jie Zhao,
Kang Li,
Yu Qiao,
Shaoting Zhang
Abstract:
Foundation models, often pre-trained with large-scale data, have achieved paramount success in jump-starting various vision and language applications. Recent advances further enable adapting foundation models in downstream tasks efficiently using only a few training samples, e.g., in-context learning. Yet, the application of such learning paradigms in medical image analysis remains scarce due to the shortage of publicly accessible data and benchmarks. In this paper, we aim at approaches adapting the foundation models for medical image classification and present a novel dataset and benchmark for the evaluation, i.e., examining the overall performance of accommodating the large-scale foundation models downstream on a set of diverse real-world clinical tasks. We collect five sets of medical imaging data from multiple institutes targeting a variety of real-world clinical tasks (22,349 images in total), i.e., thoracic diseases screening in X-rays, pathological lesion tissue screening, lesion detection in endoscopy images, neonatal jaundice evaluation, and diabetic retinopathy grading. Results of multiple baseline methods are demonstrated using the proposed dataset from both accuracy and cost-effective perspectives.
Submitted 15 June, 2023;
originally announced June 2023.
-
Cooperative Coevolution for Non-Separable Large-Scale Black-Box Optimization: Convergence Analyses and Distributed Accelerations
Authors:
Qiqi Duan,
Chang Shao,
Guochen Zhou,
Haobin Yang,
Qi Zhao,
Yuhui Shi
Abstract:
Given the ubiquity of non-separable optimization problems in the real world, in this paper we analyze and extend the large-scale version of the well-known cooperative coevolution (CC), a divide-and-conquer black-box optimization framework, on non-separable functions. First, we reveal empirical reasons for when decomposition-based methods are preferred or not in practice on some non-separable large-scale problems, which have not been clearly pointed out in many previous CC papers. Then, we formalize CC to a continuous-game model via simplification, but without losing its essential property. Different from previous evolutionary game theory for CC, our new model provides a much simpler but useful viewpoint to analyze its convergence, since only the pure Nash equilibrium concept is needed and more general fitness landscapes can be explicitly considered. Based on convergence analyses, we propose a hierarchical decomposition strategy for better generalization, as for any decomposition, there is a risk of getting trapped into a suboptimal Nash equilibrium. Finally, we use powerful distributed computing to accelerate it under the recent multi-level learning framework, which combines the fine-tuning ability from decomposition with the invariance property of CMA-ES. Experiments on a set of high-dimensional test functions validate both its search performance and scalability (w.r.t. CPU cores) on a clustering computing platform with 400 CPU cores.
Submitted 14 May, 2024; v1 submitted 11 April, 2023;
originally announced April 2023.
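The divide-and-conquer structure of cooperative coevolution can be sketched in a few lines: variables are split into groups, and each group is improved in turn while the remaining variables stay fixed at the current best ("context") vector. The sketch below is a toy illustration only. The Gaussian random-perturbation subproblem solver, the test function `coupled_quadratic`, and all parameter names are assumptions, and it does not include the hierarchical decomposition, CMA-ES, or distributed acceleration described in the abstract.

```python
import random
from typing import Callable, List, Sequence, Tuple


def coupled_quadratic(x: Sequence[float]) -> float:
    """A non-separable toy objective: neighbouring variables are coupled."""
    own = sum(v * v for v in x)
    coupling = sum((x[i] - x[i + 1]) ** 2 for i in range(len(x) - 1))
    return own + coupling


def cooperative_coevolution(
    f: Callable[[Sequence[float]], float],
    x0: Sequence[float],
    groups: Sequence[Sequence[int]],
    rounds: int = 20,
    trials: int = 30,
    step: float = 0.5,
    seed: int = 0,
) -> Tuple[List[float], float]:
    """Cycle through variable groups, perturbing one group at a time.

    Only improving candidates are accepted, so the returned value is never
    worse than f(x0).
    """
    rng = random.Random(seed)
    best = list(x0)
    best_val = f(best)
    for _ in range(rounds):
        for group in groups:
            for _ in range(trials):
                candidate = list(best)  # other groups stay at the context vector
                for i in group:
                    candidate[i] += rng.gauss(0.0, step)
                value = f(candidate)
                if value < best_val:
                    best, best_val = candidate, value
    return best, best_val


if __name__ == "__main__":
    start = [3.0, -2.0, 1.5, 4.0]
    solution, value = cooperative_coevolution(coupled_quadratic, start, [[0, 1], [2, 3]])
    print(coupled_quadratic(start), "->", value)
```

Because the toy objective couples variables across the two groups, each group is optimized against a moving context, which is the situation where a fixed decomposition can stall at a suboptimal equilibrium.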
-
AutoOptLib: Tailoring Metaheuristic Optimizers via Automated Algorithm Design
Authors:
Qi Zhao,
Bai Yan,
Taiwei Hu,
Xianglong Chen,
Qiqi Duan,
Jian Yang,
Yuhui Shi
Abstract:
Metaheuristics are prominent gradient-free optimizers for solving hard problems that do not meet the rigorous mathematical assumptions of analytical solvers. Canonical manual optimizer design can be laborious, untraceable, and error-prone, and human experts are not always available. This gives rise to increasing interest and demand in automating the optimizer design process. In response, this paper proposes AutoOptLib, the first platform for accessible automated design of metaheuristic optimizers. AutoOptLib leverages computing resources to conceive, build up, and verify the design choices of the optimizers. It requires much less labor resources and expertise than manual design, democratizing satisfactory metaheuristic optimizers to a much broader range of researchers and practitioners. Furthermore, by fully exploring the design choices with computing resources, AutoOptLib has the potential to surpass human experience, subsequently gaining enhanced performance compared with human problem-solving. To realize the automated design, AutoOptLib provides 1) a rich library of metaheuristic components for continuous, discrete, and permutation problems; 2) a flexible algorithm representation for evolving diverse algorithm structures; 3) different design objectives and techniques for different optimization scenarios; and 4) a graphic user interface for accessibility and practicability. AutoOptLib is fully written in Matlab/Octave; its source code and documentation are available at https://github.com/qz89/AutoOpt and https://AutoOpt.readthedocs.io/, respectively.
Submitted 14 November, 2023; v1 submitted 11 March, 2023;
originally announced March 2023.
-
Automated Design of Metaheuristic Algorithms: A Survey
Authors:
Qi Zhao,
Qiqi Duan,
Bai Yan,
Shi Cheng,
Yuhui Shi
Abstract:
Metaheuristics have gained great success in academia and practice because their search logic can be applied to any problem with available solution representation, solution quality evaluation, and certain notions of locality. Manually designing metaheuristic algorithms for solving a target problem is criticized for being laborious, error-prone, and requiring intensive specialized knowledge. This gives rise to increasing interest in automated design of metaheuristic algorithms. With computing power to fully explore potential design choices, the automated design could reach and even surpass human-level design and could make high-performance algorithms accessible to a much wider range of researchers and practitioners. This paper presents a broad picture of automated design of metaheuristic algorithms, by conducting a survey on the common grounds and representative techniques in terms of design space, design strategies, performance evaluation strategies, and target problems in this field.
Submitted 21 February, 2024; v1 submitted 11 March, 2023;
originally announced March 2023.
-
Factoring integers with sublinear resources on a superconducting quantum processor
Authors:
Bao Yan,
Ziqi Tan,
Shijie Wei,
Haocong Jiang,
Weilong Wang,
Hong Wang,
Lan Luo,
Qianheng Duan,
Yiting Liu,
Wenhao Shi,
Yangyang Fei,
Xiangdong Meng,
Yu Han,
Zheng Shan,
Jiachen Chen,
Xuhao Zhu,
Chuanyu Zhang,
Feitong Jin,
Hekang Li,
Chao Song,
Zhen Wang,
Zhi Ma,
H. Wang,
Gui-Lu Long
Abstract:
Shor's algorithm has seriously challenged information security based on public key cryptosystems. However, to break the widely used RSA-2048 scheme, one needs millions of physical qubits, which is far beyond current technical capabilities. Here, we report a universal quantum algorithm for integer factorization by combining the classical lattice reduction with a quantum approximate optimization algorithm (QAOA). The number of qubits required is $O(\log N/\log\log N)$, which is sublinear in the bit length of the integer $N$, making it the most qubit-saving factorization algorithm to date. We demonstrate the algorithm experimentally by factoring integers up to 48 bits with 10 superconducting qubits, the largest integer factored on a quantum device. We estimate that a quantum circuit with 372 physical qubits and a depth of thousands is necessary to challenge RSA-2048 using our algorithm. Our study shows great promise in expediting the application of current noisy quantum computers, and paves the way to factor large integers of realistic cryptographic significance.
Submitted 23 December, 2022;
originally announced December 2022.
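To get a feel for the sublinear scaling, the short calculation below evaluates n / log2(n) for an n-bit integer with an adjustable prefactor. The abstract gives only the asymptotic form, so the prefactor of 2 and the base-2 logarithm are assumptions chosen for illustration. With those assumed constants, the 2048-bit value coincides with the 372 qubits quoted in the abstract. The same constants are not claimed to reproduce the 10-qubit, 48-bit experiment.

```python
import math


def qubit_estimate(bit_length: int, prefactor: float = 2.0) -> int:
    """Evaluate floor(prefactor * n / log2(n)) for an n-bit integer.

    Both the prefactor and the logarithm base are illustrative assumptions;
    the abstract states only the O(log N / log log N) scaling.
    """
    if bit_length < 2:
        raise ValueError("bit_length must be at least 2")
    return math.floor(prefactor * bit_length / math.log2(bit_length))


if __name__ == "__main__":
    # For RSA-2048: log2(2048) = 11, so 2 * 2048 / 11 is about 372.4.
    print(qubit_estimate(2048))
```

The point of the exercise is the growth rate: doubling the bit length to 4096 raises this estimate by less than a factor of two, since the denominator also grows.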
-
PyPop7: A Pure-Python Library for Population-Based Black-Box Optimization
Authors:
Qiqi Duan,
Guochen Zhou,
Chang Shao,
Zhuowei Wang,
Mingyang Feng,
Yuwei Huang,
Yajing Tan,
Yijun Yang,
Qi Zhao,
Yuhui Shi
Abstract:
In this paper, we present an open-source pure-Python library called PyPop7 for black-box optimization (BBO). As population-based methods (e.g., evolutionary algorithms, swarm intelligence, and pattern search) become increasingly popular for BBO, the design goal of PyPop7 is to provide a unified API and elegant implementations for them, particularly in challenging high-dimensional scenarios. Since these population-based methods easily suffer from the notorious curse of dimensionality owing to random sampling as one of core operations for most of them, recently various improvements and enhancements have been proposed to alleviate this issue more or less mainly via exploiting possible problem structures: such as, decomposition of search distribution or space, low-memory approximation, low-rank metric learning, variance reduction, ensemble of random subspaces, model self-adaptation, and fitness smoothing. These novel sampling strategies could better exploit different problem structures in high-dimensional search space and therefore they often result in faster rates of convergence and/or better qualities of solution for large-scale BBO. Now PyPop7 has covered many of these important advances on a set of well-established BBO algorithm families and also provided an open-access interface to adding the latest or missed black-box optimizers for further functionality extensions. Its well-designed source code (under GPL-3.0 license) and full-fledged online documents (under CC-BY 4.0 license) have been freely available at https://github.com/Evolutionary-Intelligence/pypop and https://pypop.readthedocs.io, respectively.
Submitted 5 July, 2024; v1 submitted 11 December, 2022;
originally announced December 2022.
-
Contextual Learning in Fourier Complex Field for VHR Remote Sensing Images
Authors:
Yan Zhang,
Xiyuan Gao,
Qingyan Duan,
Jiaxu Leng,
Xiao Pu,
Xinbo Gao
Abstract:
Very high-resolution (VHR) remote sensing (RS) image classification is the fundamental task for RS image analysis and understanding. Recently, transformer-based models demonstrated outstanding potential for learning high-order contextual relationships from natural images with general resolution (224x224 pixels) and achieved remarkable results on general image classification tasks. However, the complexity of the naive transformer grows quadratically with the increase in image size, which prevents transformer-based models from being applied to VHR RS image (500x500 pixels) classification and other computationally expensive downstream tasks. To this end, we propose to decompose the expensive self-attention (SA) into real and imaginary parts via discrete Fourier transform (DFT) and therefore propose an efficient complex self-attention (CSA) mechanism. Benefiting from the conjugated symmetric property of DFT, CSA is capable of modeling the high-order contextual information with less than half the computations of naive SA. To overcome the gradient explosion in Fourier complex field, we replace the Softmax function with the carefully designed Logmax function to normalize the attention map of CSA and stabilize the gradient propagation. By stacking various layers of CSA blocks, we propose the Fourier Complex Transformer (FCT) model to learn global contextual information from VHR aerial images in a hierarchical manner. Universal experiments conducted on commonly used RS classification data sets demonstrate the effectiveness and efficiency of FCT, especially on very high-resolution RS images.
Submitted 28 October, 2022;
originally announced October 2022.
-
Combined Federated and Split Learning in Edge Computing for Ubiquitous Intelligence in Internet of Things: State of the Art and Future Directions
Authors:
Qiang Duan,
Shijing Hu,
Ruijun Deng,
Zhihui Lu
Abstract:
Federated learning (FL) and split learning (SL) are two emerging collaborative learning methods that may greatly facilitate ubiquitous intelligence in Internet of Things (IoT). Federated learning enables machine learning (ML) models locally trained using private data to be aggregated into a global model. Split learning allows different portions of an ML model to be collaboratively trained on different workers in a learning framework. Federated learning and split learning, each with unique advantages and respective limitations, may complement each other toward ubiquitous intelligence in IoT. Therefore, the combination of federated learning and split learning recently became an active research area attracting extensive interest. In this article, we review the latest developments in federated learning and split learning and present a survey on the state-of-the-art technologies for combining these two learning methods in an edge computing-based IoT environment. We also identify some open problems and discuss possible directions for future research in this area with a hope to further arouse the research community's interest in this emerging field.
Submitted 19 July, 2022;
originally announced July 2022.
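The aggregation step of federated learning mentioned in the abstract can be sketched as a weighted average of client parameters, in the style of the widely used FedAvg scheme. This is a minimal illustration that assumes flat parameter vectors and weighting by local dataset size. The function name `federated_average` is hypothetical, and the survey itself covers many variants beyond this one.

```python
from typing import List, Sequence


def federated_average(
    client_weights: Sequence[Sequence[float]],
    client_sizes: Sequence[int],
) -> List[float]:
    """Combine client parameter vectors into one global vector.

    Each client is weighted by its share of the total number of local
    training samples (FedAvg-style weighting, assumed here).
    """
    if len(client_weights) != len(client_sizes) or not client_weights:
        raise ValueError("need one size per client and at least one client")
    total = sum(client_sizes)
    if total <= 0:
        raise ValueError("total sample count must be positive")
    dim = len(client_weights[0])
    global_weights = [0.0] * dim
    for weights, size in zip(client_weights, client_sizes):
        if len(weights) != dim:
            raise ValueError("all clients must share the same parameter shape")
        for i, value in enumerate(weights):
            global_weights[i] += value * size / total
    return global_weights


if __name__ == "__main__":
    # Two clients; the second holds three times as much data as the first.
    print(federated_average([[1.0, 2.0], [3.0, 4.0]], [1, 3]))
```

Only parameters and sample counts cross the network in this sketch. The private training data stays on each client, which is the property that makes the approach attractive for IoT settings.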
-
Anomalous ferromagnetic behavior in the orthorhombic Li$_3$Co$_2$SbO$_6$
Authors:
Qianhui Duan,
Huanpeng Bu,
Vladimir Pomjakushin,
Hubertus Luetkens,
Yuke Li,
Jinkui Zhao,
Jason S. Gardner,
Hanjie Guo
Abstract:
Monoclinic Li$_3$Co$_2$SbO$_6$ has been proposed as a Kitaev spin liquid candidate and investigated intensively, whereas the properties of its polymorph, the orthorhombic phase, are less known. Here we report the magnetic properties of the orthorhombic Li$_3$Co$_2$SbO$_6$ as revealed by dc and ac magnetic susceptibility, muon spin relaxation ($μ$SR) and neutron diffraction measurements. Successive magnetic transitions at (115, 89 and 71) K were observed in the low field dc susceptibility measurements. The transitions below $T_N$ (= 115 K) are suppressed in higher applied fields. However, zero field ac susceptibility measurements reveal distinct frequency independent transitions at about (114, 107, 97, 79 and 71) K. A long range magnetic ordered state was confirmed by specific heat, $μ$SR and neutron diffraction measurements, all indicating a single transition at about 115 K. The discrepancy between different measurements is attributed to possible stacking faults and/or local disorders of the ferromagnetic zig-zag chains, resulting in ferromagnetic boundaries within the overall antiferromagnetic matrix.
Submitted 22 June, 2022;
originally announced June 2022.
-
On regular solutions for three-dimensional full compressible Navier-Stokes equations with degenerate viscosities and far field vacuum
Authors:
Qin Duan,
Zhouping Xin,
Shengguo Zhu
Abstract:
In this paper, the Cauchy problem for the three-dimensional (3-D) full compressible Navier-Stokes equations (CNS) with zero thermal conductivity is considered. First, when shear and bulk viscosity coefficients both depend on the absolute temperature $θ$ in a power law ($θ^ν$ with $ν>0$) of Chapman-Enskog, based on some elaborate analysis of this system's intrinsic singular structures, we identify one class of initial data admitting a local-in-time regular solution with far field vacuum in terms of the mass density $ρ$, velocity $u$ and entropy $S$. Furthermore, it is shown that within the life span of such a regular solution, the velocity stays in an inhomogeneous Sobolev space, i.e., $u\in H^3(\mathbb{R}^3)$, $S$ has uniformly finite lower and upper bounds in the whole space, and the laws of conservation of total mass, momentum and total energy are all satisfied. Note that due to the appearance of the vacuum, the momentum equations are degenerate both in the time evolution and viscous stress tensor, and the physical entropy for polytropic gases behaves singularly, which makes the study of the corresponding well-posedness challenging. For proving the existence, we first introduce an enlarged reformulated structure by considering some new variables, which can transfer the degeneracies of the full CNS to the possible singularities of some special source terms related to $S$, and then carry out some singularly weighted energy estimates carefully designed for this reformulated system.
Submitted 11 February, 2022;
originally announced February 2022.
-
Reinforcement learning for multi-item retrieval in the puzzle-based storage system
Authors:
Jing He,
Xinglu Liu,
Qiyao Duan,
Wai Kin Victor Chan,
Mingyao Qi
Abstract:
Nowadays, fast delivery services have created the need for high-density warehouses. The puzzle-based storage system is a practical way to enhance the storage density, but it faces difficulties in the retrieval process. In this work, a deep reinforcement learning algorithm, specifically the Double&Dueling Deep Q Network, is developed to solve the multi-item retrieval problem in the system with general settings, where multiple desired items, escorts, and I/O points are placed randomly. Additionally, we propose a general compact integer programming model to evaluate the solution quality. Extensive numerical experiments demonstrate that the reinforcement learning approach can yield high-quality solutions and outperforms three related state-of-the-art heuristic algorithms. Furthermore, a conversion algorithm and a decomposition framework are proposed to handle simultaneous movement and large-scale instances respectively, thus improving the applicability of the PBS system.
Submitted 5 February, 2022;
originally announced February 2022.
-
Generalized rainbow patterns of oblate drops simulated by a ray model in three dimensions
Authors:
Qingwei Duan,
F. Onofri,
Xiang'e Han,
Kuan Fang Ren
Abstract:
The scattering patterns near the primary rainbow of oblate drops are simulated by extending the vectorial complex ray model (VCRM) [1] to three-dimensional (3D) calculations. With the curvature of the wavefront as an intrinsic property of a ray, this advanced ray model permits, in principle, the prediction of the amplitudes and phases of all emergent rays with a rigorous algebraic formalism. This letter reports a breakthrough of VCRM for 3D scattering with a line-by-line triangulation interpolation algorithm that allows the total complex amplitude of the scattered field to be calculated. This makes it possible to simulate not only the skeleton (geometrical rainbow angles, hyperbolic-umbilic caustics), but also the coarse (Airy bows, lattice) and fine (ripple fringes) structures of the generalized rainbow patterns (GRPs) of oblate drops. The simulated results are found to be in good qualitative and quantitative agreement with experimental scattering patterns for drops of different aspect ratios. The physical interpretation of the GRPs is also given. This work opens up promising perspectives for simulating and understanding the 3D scattering of large particles of any shape with a smooth surface by VCRM.
Submitted 27 September, 2021;
originally announced September 2021.
-
Dual Optimization for Kolmogorov Model Learning Using Enhanced Gradient Descent
Authors:
Qiyou Duan,
Hadi Ghauch,
Taejoon Kim
Abstract:
Data representation techniques have made a substantial contribution to advancing data processing and machine learning (ML). Improving predictive power was the focus of previous representation techniques, which unfortunately perform rather poorly on interpretability, that is, on extracting underlying insights from the data. Recently, the Kolmogorov model (KM) was studied, which is an interpretable and predictable representation approach to learning the underlying probabilistic structure of a set of random variables. The existing KM learning algorithms using semi-definite relaxation with randomization (SDRwR) or discrete monotonic optimization (DMO) have, however, limited utility for big data applications because they do not scale well computationally. In this paper, we propose a computationally scalable KM learning algorithm, based on regularized dual optimization combined with an enhanced gradient descent (GD) method. To make our method more scalable to large-dimensional problems, we propose two acceleration schemes, namely, the eigenvalue decomposition (EVD) elimination strategy and an approximate EVD algorithm. Furthermore, a thresholding technique that exploits the error bound analysis and leverages the normalized Minkowski $\ell_1$-norm is provided for selecting the number of iterations of the approximate EVD algorithm. When applied to big data applications, it is demonstrated that the proposed method can achieve comparable training/prediction performance with significantly reduced computational complexity: roughly two orders of magnitude improvement in terms of the time overhead, compared to the existing KM learning algorithms. Furthermore, it is shown that the accuracy of logical relation mining for interpretability using the proposed KM learning algorithm exceeds $80\%$.
Submitted 20 May, 2022; v1 submitted 11 July, 2021;
originally announced July 2021.
-
Hybrid Supervision Learning for Pathology Whole Slide Image Classification
Authors:
Jiahui Li,
Wen Chen,
Xiaodi Huang,
Zhiqiang Hu,
Qi Duan,
Hongsheng Li,
Dimitris N. Metaxas,
Shaoting Zhang
Abstract:
Weak supervision learning on classification labels has demonstrated high performance in various tasks, while a few pixel-level fine annotations are also affordable. A natural question is whether combining pixel-level (e.g., segmentation) and image-level (e.g., classification) annotation can bring further improvement. In computational pathology, however, this is difficult: the high resolution of whole slide images makes end-to-end training of a classification model hard, which has hindered past research on weak or hybrid supervision learning. To handle this problem, we propose a hybrid supervision learning framework for this kind of high-resolution image, with sufficient image-level coarse annotations and a few pixel-level fine labels. This framework, when applied to training a patch model, can carefully make use of coarse image-level labels to refine the generated pixel-level pseudo labels. A complete strategy is proposed to suppress pixel-level false positives and false negatives. A large hybrid annotated dataset is used to evaluate the effectiveness of hybrid supervision learning. By extracting pixel-level pseudo labels from samples that initially carry only image-level labels, we achieve 5.2% higher specificity than training purely on existing labels while retaining 100% sensitivity, in the task of classifying images as positive or negative.
Submitted 25 October, 2021; v1 submitted 2 July, 2021;
originally announced July 2021.
-
Superconductivity in ThMo2Si2C with Mo2C Square Net
Authors:
Zichen Liu,
Baizhuo Li,
Yusen Xiao,
Qingchen Duan,
Yanwei Cui,
YuXue Mei,
Qian Tao,
Shuli Wei,
Shugang Tan,
Qiang Jing,
Qing Lu,
Yuping Sun,
Yunyan Liu,
Shenggui Fu,
Hao Jiang,
Zhi Ren,
Zhu'an Xu,
Cao Wang,
Guanghan Cao
Abstract:
We report the superconductivity of a new quaternary compound ThMo$_2$Si$_2$C, synthesized with the arc-melting technique. The compound crystallizes in a tetragonal CeCr$_2$Si$_2$C-type structure with cell parameters of $a$ = 4.2296 Å and $c$ = 5.3571 Å. Interlayer Si-Si covalent bonding is suggested by the atomic distance. The electrical resistivity and magnetic susceptibility measurements indicate a Pauli-paramagnetic metal with dominant electron-electron scattering in the normal state. Bulk superconductivity at 2.2 K is demonstrated with a dimensionless specific-heat jump of $ΔC/γ_{\rm n}T$ = 0.98. The superconducting parameters of the critical magnetic fields, coherence length, penetration depth, and superconducting energy gap are given.
Submitted 20 April, 2021;
originally announced April 2021.
-
Spatio-temporal quantile regression analysis revealing more nuanced patterns of climate change: a study of long-term daily temperature in Australia
Authors:
Qibin Duan,
Clare A. McGrory,
Glenn Brown,
Kerrie Mengersen,
You-Gan Wang
Abstract:
Climate change is commonly associated with an overall increase in mean temperature over a defined past time period. Many studies consider temperature trends at the global scale, but the literature lacks in-depth analysis of the temperature trends across Australia in recent decades. In addition to heterogeneity in mean and median values, daily Australian temperature data suffer from quasi-periodic heterogeneity in variance. However, this issue has largely been overlooked in climate research. A contribution of this article is that we propose a joint model of quantile regression and variability. By accounting appropriately for the heterogeneity in these types of data, our analysis reveals that daily maximum temperature is warming by 0.21 degrees Celsius per decade and daily minimum temperature by 0.13 degrees Celsius per decade. However, our modeling also shows that nuanced patterns of climate change depend on location, season, and the percentiles of the temperature series over Australia.
Submitted 9 March, 2021;
originally announced March 2021.
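The entry above builds on quantile regression. As background only, the sketch below shows the standard check (pinball) loss that quantile regression minimizes, and confirms on a toy sample that its minimizer is an empirical quantile. It does not reproduce the paper's joint model of quantiles and variability; the temperature values are made up for illustration.

```python
def pinball_loss(values, q_hat, tau):
    """Mean check loss of the candidate quantile q_hat at level tau."""
    total = 0.0
    for v in values:
        residual = v - q_hat
        # Under-predictions are weighted by tau, over-predictions by (1 - tau).
        total += tau * residual if residual >= 0 else (tau - 1.0) * residual
    return total / len(values)


# Hypothetical daily maximum temperatures in degrees Celsius (illustration only).
temps = [18.2, 19.5, 21.0, 22.4, 23.1, 24.8, 26.0, 27.3, 29.9, 31.5]
tau = 0.75

# Search over the sample points: the loss is piecewise linear and convex,
# so a minimizer can always be found among the observations.
best = min(temps, key=lambda c: pinball_loss(temps, c, tau))
```

In a regression setting the constant `q_hat` is replaced by a function of covariates (for example time and location), and the same loss is minimized over its parameters.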
-
Domain Private and Agnostic Feature for Modality Adaptive Face Recognition
Authors:
Yingguo Xu,
Lei Zhang,
Qingyan Duan
Abstract:
Heterogeneous face recognition is a challenging task due to the large modality discrepancy and insufficient cross-modal samples. Most existing works focus on discriminative feature transformation, metric learning and cross-modal face synthesis. However, the fact that cross-modal faces are always coupled by domain (modality) and identity information has received little attention. Therefore, how to learn and utilize the domain-private feature and domain-agnostic feature for modality adaptive face recognition is the focus of this work. Specifically, this paper proposes a Feature Aggregation Network (FAN), which includes a disentangled representation module (DRM), a feature fusion module (FFM) and an adaptive penalty metric (APM) learning session. First, in DRM, two subnetworks, i.e., a domain-private network and a domain-agnostic network, are specially designed for learning modality features and identity features, respectively. Second, in FFM, the identity features are fused with domain features to achieve cross-modal bi-directional identity feature transformation, which, to a large extent, further disentangles the modality information and identity information. Third, considering that the distribution imbalance between easy and hard pairs in cross-modal datasets increases the risk of model bias, identity-preserving guided metric learning with adaptive hard-pair penalization is proposed in our FAN. The proposed APM also guarantees cross-modality intra-class compactness and inter-class separation. Extensive experiments on benchmark cross-modal face datasets show that our FAN outperforms SOTA methods.
Submitted 9 August, 2020;
originally announced August 2020.
-
Enhanced Beam Alignment for Millimeter Wave MIMO Systems: A Kolmogorov Model
Authors:
Qiyou Duan,
Taejoon Kim,
Hadi Ghauch
Abstract:
We present an enhanced approach to beam alignment in millimeter wave (mmWave) multiple-input multiple-output (MIMO) systems, based on a modification of the machine learning-based criterion, called the Kolmogorov model (KM), previously applied to the beam alignment problem. Unlike the previous KM, whose computational complexity is not scalable with the size of the problem, a new approach, centered on discrete monotonic optimization (DMO), is proposed, leading to significantly reduced complexity. We also present a Kolmogorov-Smirnov (KS) criterion for the advanced hypothesis testing, which, unlike the frequency estimation (FE) method developed for the conventional KM, does not require any subjective threshold setting. Simulation results that demonstrate the efficacy of the proposed KM learning for mmWave beam alignment are presented.
Submitted 26 July, 2020;
originally announced July 2020.
-
Predication of Inflection Point and Outbreak Size of COVID-19 in New Epicentres
Authors:
Qibin Duan,
Jinran Wu,
Gaojun Wu,
You-Gan Wang
Abstract:
The coronavirus disease 2019 (COVID-19) had caused more than 8 million infections as of mid-June 2020. Recently, Brazil has become a new epicentre of COVID-19, while India and the African region are potential epicentres. This study aims to predict the inflection point and outbreak size of these new/potential epicentres at the early phase of the epidemics by borrowing information from more 'mature' curves from other countries. We fitted the cumulative cases to the well-known sigmoid growth curves to describe the epidemic trends under mixed-effect models, using the four-parameter logistic model after power transformations. The African region is predicted to have the largest total outbreak size of 3.9 million cases (2.2 to 6 million), and the inflection will come around September 13, 2020. Brazil and India are predicted to have a similar final outbreak size of around 2.5 million cases (1.1 to 4.3 million), with the inflection points arriving on June 23 and July 26, respectively. We conclude that in Brazil, India, and the African region the COVID-19 epidemics have not yet passed their inflection points; these regions could potentially overtake the USA in terms of outbreak size.
Submitted 15 July, 2020;
originally announced July 2020.
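The entry above refers to a four-parameter logistic growth curve and its inflection point. The sketch below uses one common parameterization (lower asymptote, upper asymptote, growth rate, midpoint), which is an assumption here since the listing does not state the exact form; it also omits the power transformation and mixed effects. The upper asymptote of 2.5 million echoes the outbreak size quoted for Brazil and India, while the rate and midpoint are made-up values.

```python
import math


def logistic4(t, lower, upper, rate, t_mid):
    """Four-parameter logistic curve; t_mid is the inflection point."""
    return lower + (upper - lower) / (1.0 + math.exp(-rate * (t - t_mid)))


# Hypothetical parameters: rate and midpoint are illustrative, not fitted values.
days = list(range(0, 201))
cumulative = [logistic4(t, 0.0, 2.5e6, 0.08, 100.0) for t in days]

# Daily new cases are the increments of the cumulative curve;
# they peak at the inflection point of the cumulative curve.
daily = [cumulative[i + 1] - cumulative[i] for i in range(len(cumulative) - 1)]
peak_day = max(range(len(daily)), key=daily.__getitem__)
```

With these parameters the largest daily increment falls on the interval adjacent to day 100, which is the sense in which the inflection point marks the peak of the epidemic curve.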
-
On the vanishing dissipation limit for the incompressible MHD equations on bounded domains
Authors:
Qin Duan,
Yuelong Xiao,
Zhouping Xin
Abstract:
In this paper, we investigate the solvability, regularity and the vanishing dissipation limit of solutions to the three-dimensional viscous magneto-hydrodynamic (MHD) equations in bounded domains. On the boundary, the velocity field fulfills a Navier-slip condition, while the magnetic field satisfies the insulating condition. It is shown that the initial-boundary problem has a global weak solution for a general smooth domain. More importantly, for a flat domain, we establish the uniform local well-posedness of the strong solution with higher order uniform regularity and the asymptotic convergence with a rate to the solution of the ideal MHD as the dissipation tends to zero.
Submitted 6 July, 2020;
originally announced July 2020.
-
Polymerase/nicking enzyme powered dual-template multi-cycled G-triplex machine for HIV-1 determination
Authors:
Qiuyue Duan,
Qi Yan,
Yuqi Huang,
Wenxiu Zhang,
Shuhui Zhao,
Gang Yi
Abstract:
We propose a highly efficient dual-template multi-cycled DNA nanomachine driven by a polymerase and a nicking enzyme. The reaction system consists simply of two templates (T1, T2) and two enzymes (KF polymerase, Nb.BbvCI). The two templates are similar in structure (X-X-Y, Y-Y-C): a primer recognition region, a primer analogue generation region, and an output region (3' to 5'), with a nicking site between each pair of adjacent regions. The output of T1 is the primer of T2, and a G-rich fragment (G3) is designed as the final product. In the presence of HIV-1, numerous G3 fragments are generated owing to the multi-cycled amplification strategy and form a G-triplex/ThT complex after the addition of thioflavin T (ThT), which greatly enhances the fluorescence intensity and serves as the signal reporter in this label-free sensing strategy. A dynamic response range of 50 fM-2 nM for HIV-1 gene detection can be achieved with this multi-cycled G-triplex machine, and, owing to the high-efficiency amplification strategy, the enzymatic reaction can be completed within 45 minutes, followed by fluorescence measurement. In addition, other targets can be analyzed by replacing the template sequence. This strategy therefore has application potential for trace biomarker analysis.
Submitted 28 June, 2020;
originally announced June 2020.
-
The Panacea Threat Intelligence and Active Defense Platform
Authors:
Adam Dalton,
Ehsan Aghaei,
Ehab Al-Shaer,
Archna Bhatia,
Esteban Castillo,
Zhuo Cheng,
Sreekar Dhaduvai,
Qi Duan,
Md Mazharul Islam,
Younes Karimi,
Amir Masoumzadeh,
Brodie Mather,
Sashank Santhanam,
Samira Shaikh,
Tomek Strzalkowski,
Bonnie J. Dorr
Abstract:
We describe Panacea, a system that supports natural language processing (NLP) components for active defenses against social engineering attacks. We deploy a pipeline of human language technology, including Ask and Framing Detection, Named Entity Recognition, Dialogue Engineering, and Stylometry. Panacea processes modern message formats through a plug-in architecture to accommodate innovative approaches for message analysis, knowledge representation and dialogue generation. The novelty of the Panacea system is that it uses NLP for cyber defense and engages the attacker using bots, both to elicit evidence for attribution and to waste the attacker's time and resources.
Submitted 20 April, 2020;
originally announced April 2020.
-
SenseCare: A Research Platform for Medical Image Informatics and Interactive 3D Visualization
Authors:
Qi Duan,
Guotai Wang,
Rui Wang,
Chao Fu,
Xinjun Li,
Na Wang,
Yechong Huang,
Xiaodi Huang,
Tao Song,
Liang Zhao,
Xinglong Liu,
Qing Xia,
Zhiqiang Hu,
Yinan Chen,
Shaoting Zhang
Abstract:
Clinical research on smart health has an increasing demand for intelligent and clinic-oriented medical image computing algorithms and platforms that support various applications. To this end, we have developed the SenseCare research platform, which is designed to facilitate translational research on intelligent diagnosis and treatment planning in various clinical scenarios. To enable clinical research with Artificial Intelligence (AI), SenseCare provides a range of AI toolkits for different tasks, including image segmentation, registration, and lesion and landmark detection, for image modalities ranging from radiology to pathology. In addition, SenseCare is clinic-oriented and supports a wide range of clinical applications such as diagnosis and surgical planning for lung cancer, pelvic tumor, coronary artery disease, etc. SenseCare provides several appealing functions and features such as advanced 3D visualization, concurrent and efficient web-based access, fast data synchronization and high data security, multi-center deployment, support for collaborative research, etc. In this report, we present an overview of SenseCare as an efficient platform providing comprehensive toolkits and high extensibility for intelligent image analysis and clinical research in different application scenarios. We also summarize the research outcomes of collaboration with multiple hospitals.
Submitted 2 September, 2022; v1 submitted 2 April, 2020;
originally announced April 2020.
-
Large-scale Gastric Cancer Screening and Localization Using Multi-task Deep Neural Network
Authors:
Hong Yu,
Xiaofan Zhang,
Lingjun Song,
Liren Jiang,
Xiaodi Huang,
Wen Chen,
Chenbin Zhang,
Jiahui Li,
Jiji Yang,
Zhiqiang Hu,
Qi Duan,
Wanyuan Chen,
Xianglei He,
Jinshuang Fan,
Weihai Jiang,
Li Zhang,
Chengmin Qiu,
Minmin Gu,
Weiwei Sun,
Yangqiong Zhang,
Guangyin Peng,
Weiwei Shen,
Guohui Fu
Abstract:
Gastric cancer is one of the most common cancers and ranks third among the leading causes of cancer death. Biopsy of the gastric mucosa is a standard procedure in gastric cancer screening. However, manual pathological inspection is labor-intensive and time-consuming. Moreover, it is challenging for an automated algorithm to locate the small lesion regions in a gigapixel whole-slide image and make the decision correctly. To tackle these issues, we collected a large-scale whole-slide image dataset with detailed lesion region annotation and designed a whole-slide image analysis framework consisting of 3 networks, which can not only determine the screening result but also present the suspicious areas to the pathologist for reference. Experiments demonstrated that our proposed framework achieves a sensitivity of 97.05% and a specificity of 92.72% in the screening task and a Dice coefficient of 0.8331 in the segmentation task. Furthermore, we tested our best model in a real-world scenario on 10,315 whole-slide images collected from 4 medical centers.
Submitted 19 September, 2020; v1 submitted 8 October, 2019;
originally announced October 2019.
-
Coherence Statistics of Structured Random Ensembles and Support Detection Bounds for OMP
Authors:
Qiyou Duan,
Taejoon Kim,
Lin Dai,
Erik Perrins
Abstract:
A structured random matrix ensemble that maintains constant modulus entries and unit-norm columns, often called a random phase-rotated (RPR) matrix, is considered in this paper. We analyze the coherence statistics of RPR measurement matrices and apply them to acquire probabilistic performance guarantees of orthogonal matching pursuit (OMP) for support detection (SD). It is revealed via numerical simulations that the SD performance guarantee provides a tight characterization, especially when the signal is sparse.
Submitted 17 September, 2019;
originally announced September 2019.
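The entry above concerns the coherence of random phase-rotated (RPR) matrices. As a minimal sketch, the code below builds a matrix with constant-modulus entries and unit-norm columns and computes its mutual coherence, the largest absolute inner product between distinct columns. Drawing the phases independently and uniformly on [0, 2π) is an assumption made here for illustration; the listing does not state the phase distribution, and the matrix size is arbitrary.

```python
import cmath
import math
import random


def rpr_matrix(rows, cols, rng):
    """rows x cols matrix with entries exp(j*theta)/sqrt(rows).

    Phases are i.i.d. uniform on [0, 2*pi) (an assumption of this sketch).
    """
    scale = 1.0 / math.sqrt(rows)
    return [
        [scale * cmath.exp(1j * rng.uniform(0.0, 2.0 * math.pi)) for _ in range(cols)]
        for _ in range(rows)
    ]


def coherence(matrix):
    """Largest absolute inner product between two distinct columns."""
    rows, cols = len(matrix), len(matrix[0])
    worst = 0.0
    for i in range(cols):
        for k in range(i + 1, cols):
            inner = sum(matrix[r][i].conjugate() * matrix[r][k] for r in range(rows))
            worst = max(worst, abs(inner))
    return worst


rng = random.Random(0)
A = rpr_matrix(16, 32, rng)
mu = coherence(A)
column_norms = [math.sqrt(sum(abs(A[r][c]) ** 2 for r in range(16))) for c in range(32)]
```

A smaller coherence generally makes it easier for greedy methods such as OMP to identify the correct support, which is why its statistics matter for the guarantees the abstract mentions.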
-
Signet Ring Cell Detection With a Semi-supervised Learning Framework
Authors:
Jiahui Li,
Shuang Yang,
Xiaodi Huang,
Qian Da,
Xiaoqun Yang,
Zhiqiang Hu,
Qi Duan,
Chaofu Wang,
Hongsheng Li
Abstract:
Signet ring cell carcinoma is a rare type of adenocarcinoma with poor prognosis. Early detection greatly improves patients' survival rate. However, pathologists can only detect signet ring cells visually under the microscope. This procedure is not only laborious but also prone to omission. An automatic and accurate signet ring cell detection solution is thus important but has not been investigated before. In this paper, we take the first step and present a semi-supervised learning framework for the signet ring cell detection problem. Self-training is proposed to deal with the challenge of incomplete annotations, and cooperative-training is adapted to explore the unlabeled regions. Combining the two techniques, our semi-supervised learning framework can make better use of both labeled and unlabeled data. Experiments on large real clinical data demonstrate the effectiveness of our design. Our framework achieves accurate signet ring cell detection and can be readily applied in clinical trials. The dataset will be released soon to facilitate the development of the area.
Submitted 8 July, 2019;
originally announced July 2019.
-
BoostGAN for Occlusive Profile Face Frontalization and Recognition
Authors:
Qingyan Duan,
Lei Zhang
Abstract:
Many factors affect human face recognition, such as pose, occlusion, illumination, age, etc. First and foremost are large pose and occlusion problems, which can even result in more than 10% performance degradation. Pose-invariant feature representation and face frontalization with generative adversarial networks (GAN) have been widely used to solve the pose problem. However, the synthesis and recognition of occluded profile faces is still an uninvestigated problem. To address this issue, in this paper, we aim to contribute an effective solution for recognizing occluded profile faces, even when facial keypoint regions (e.g. eyes, nose, etc.) are corrupted. Specifically, we propose a boosting Generative Adversarial Network (BoostGAN) for de-occlusion, frontalization, and recognition of faces. Under the assumption that facial occlusion is partial and incomplete, multiple patch-occluded images are fed as inputs for boosting knowledge such as identity and texture information. A new aggregation structure composed of a deep GAN for coarse face synthesis and a shallow boosting net for fine face generation is further designed. Exhaustive experiments demonstrate that the proposed approach not only presents clear, perceptually photo-realistic results but also shows state-of-the-art recognition performance for occluded profile faces.
Submitted 26 February, 2019;
originally announced February 2019.
-
Representation Learning for Heterogeneous Information Networks via Embedding Events
Authors:
Guoji Fu,
Bo Yuan,
Qiqi Duan,
Xin Yao
Abstract:
Network representation learning (NRL) has been widely used to help analyze large-scale networks through mapping original networks into a low-dimensional vector space. However, existing NRL methods ignore the impact of properties of relations on the object relevance in heterogeneous information networks (HINs). To tackle this issue, this paper proposes a new NRL framework, called Event2vec, for HINs to consider both quantities and properties of relations during the representation learning process. Specifically, an event (i.e., a complete semantic unit) is used to represent the relation among multiple objects, and both event-driven first-order and second-order proximities are defined to measure the object relevance according to the quantities and properties of relations. We theoretically prove how event-driven proximities can be preserved in the embedding space by Event2vec, which utilizes event embeddings to facilitate learning the object embeddings. Experimental studies demonstrate the advantages of Event2vec over state-of-the-art algorithms on four real-world datasets and three network analysis tasks (including network reconstruction, link prediction, and node classification).
Submitted 12 February, 2019; v1 submitted 29 January, 2019;
originally announced January 2019.
-
Machine Learning Promoting Extreme Simplification of Spectroscopy Equipment
Authors:
Jianchao Lee,
Qiannan Duan,
Sifan Bi,
Ruen Luo,
Yachao Lian,
Hanqiang Liu,
Ruixing Tian,
Jiayuan Chen,
Guodong Ma,
Jinhong Gao,
Zhaoyi Xu
Abstract:
Spectroscopic measurement is one of the main pathways for exploring and understanding nature. Rapidly advancing artificial intelligence now seems set to reshape how it is done. The algorithms contained in huge neural networks are capable of replacing many of the expensive and complex components of spectroscopic instruments. In this work, we present a smart machine learning strategy for the measurement of absorbance curves, and also provide an initial verification that exceedingly simplified equipment is sufficient to meet the needs of this strategy. Further, owing to its simplicity, the setup is expected to spread into many scientific areas in a variety of forms.
Submitted 13 September, 2019; v1 submitted 5 August, 2018;
originally announced August 2018.
-
Fitting Laguerre tessellation approximations to tomographic image data
Authors:
Aaron Spettl,
Tim Brereton,
Qibin Duan,
Thomas Werz,
Carl E. Krill III,
Dirk P. Kroese,
Volker Schmidt
Abstract:
The analysis of polycrystalline materials benefits greatly from accurate quantitative descriptions of their grain structures. Laguerre tessellations approximate such grain structures very well. However, it is a quite challenging problem to fit a Laguerre tessellation to tomographic data, as a high-dimensional optimization problem with many local minima must be solved. In this paper, we formulate a version of this optimization problem that can be solved quickly using the cross-entropy method, a robust stochastic optimization technique that can avoid becoming trapped in local minima. We demonstrate the effectiveness of our approach by applying it to both artificially generated and experimentally produced tomographic data.
Submitted 24 November, 2015; v1 submitted 6 August, 2015;
originally announced August 2015.
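The entry above fits Laguerre tessellations to grain structures. As background only, the sketch below shows the defining rule of a Laguerre (power) tessellation: a point belongs to the generator with the smallest power distance, the squared Euclidean distance minus the generator's weight. The generators and weights are made up, and the cross-entropy fitting procedure from the paper is not reproduced.

```python
def laguerre_cell(point, generators):
    """Index of the generator (center, weight) with the smallest power distance."""

    def power_distance(generator):
        center, weight = generator
        return sum((p - c) ** 2 for p, c in zip(point, center)) - weight

    return min(range(len(generators)), key=lambda i: power_distance(generators[i]))


# Two hypothetical generators in 3D: (center, weight).
generators = [((0.0, 0.0, 0.0), 0.0), ((2.0, 0.0, 0.0), 1.5)]

# The midpoint is equidistant from both centers, so the larger weight wins.
midpoint_cell = laguerre_cell((1.0, 0.0, 0.0), generators)
near_origin_cell = laguerre_cell((0.1, 0.0, 0.0), generators)
```

With all weights equal this rule reduces to an ordinary Voronoi tessellation; the weights are what allow cells of different sizes, which is useful for describing grains.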