Skip to main content

Showing 1–50 of 117 results for author: Shang, S

  1. arXiv:2407.03007  [pdf, other

    cs.CL cs.AI

    What Affects the Stability of Tool Learning? An Empirical Study on the Robustness of Tool Learning Frameworks

    Authors: Chengrui Huang, Zhengliang Shi, Yuntao Wen, Xiuying Chen, Peng Han, Shen Gao, Shuo Shang

    Abstract: Tool learning methods have enhanced the ability of large language models (LLMs) to interact with real-world applications. Many existing works fine-tune LLMs or design prompts to enable LLMs to select appropriate tools and correctly invoke them to meet user requirements. However, it is observed in previous works that the performance of tool learning varies from tasks, datasets, training settings, a… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 19 pages, 9 figures

  2. arXiv:2407.01909  [pdf, other

    cs.CL cs.SD eess.AS

    Pinyin Regularization in Error Correction for Chinese Speech Recognition with Large Language Models

    Authors: Zhiyuan Tang, Dong Wang, Shen Huang, Shidong Shang

    Abstract: Recent studies have demonstrated the efficacy of large language models (LLMs) in error correction for automatic speech recognition (ASR). However, much of the research focuses on the English language. This paper redirects the attention to Chinese. Firstly, we construct a specialized benchmark dataset aimed at error correction for Chinese ASR with 724K hypotheses-transcription pairs, named the Chin… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Interspeech 2024

  3. arXiv:2407.00993  [pdf, other

    cs.AI cs.CL

    Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents

    Authors: Shihan Deng, Weikai Xu, Hongda Sun, Wei Liu, Tao Tan, Jianfeng Liu, Ang Li, Jian Luan, Bin Wang, Rui Yan, Shuo Shang

    Abstract: With the remarkable advancements of large language models (LLMs), LLM-based agents have become a research hotspot in human-computer interaction. However, there is a scarcity of benchmarks available for LLM-based mobile agents. Benchmarking these agents generally faces three main challenges: (1) The inefficiency of UI-only operations imposes limitations to task evaluation. (2) Specific instructions… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  4. arXiv:2406.19966  [pdf, other

    cs.CL

    Simulating Financial Market via Large Language Model based Agents

    Authors: Shen Gao, Yuntao Wen, Minghang Zhu, Jianing Wei, Yuhan Cheng, Qunzi Zhang, Shuo Shang

    Abstract: Most economic theories typically assume that financial market participants are fully rational individuals and use mathematical models to simulate human behavior in financial markets. However, human behavior is often not entirely rational and is challenging to predict accurately with mathematical models. In this paper, we propose \textbf{A}gent-based \textbf{S}imulated \textbf{F}inancial \textbf{M}… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  5. arXiv:2406.15223  [pdf

    cond-mat.mtrl-sci cond-mat.stat-mech physics.comp-ph

    Thermodynamic modeling of the LiCl-KCl-LaCl$_3$ system with Bayesian model selection and uncertainty quantification

    Authors: Rushi Gong, Shun-Li Shang, Vitaliy G. Goncharov, Xiaofeng Guo, Zi-Kui Liu

    Abstract: Chloride molten salts are increasingly recognized for their applications in pyroprocessing techniques for the separation of lanthanides. Understanding the thermodynamic properties of these molten salts is essential to optimize the separation process. Several thermodynamic models, including the associate model, the two-sublattice ionic model, and the modified quasichemical model with quadruplet app… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 36 pages, 7 figures, 7 tables. To be submitted to peer-reviewed journal

  6. arXiv:2406.12123  [pdf, other

    cs.RO cs.AI cs.LG

    ChatEMG: Synthetic Data Generation to Control a Robotic Hand Orthosis for Stroke

    Authors: Jingxi Xu, Runsheng Wang, Siqi Shang, Ava Chen, Lauren Winterbottom, To-Liang Hsu, Wenxi Chen, Khondoker Ahmed, Pedro Leandro La Rotta, Xinyue Zhu, Dawn M. Nilsen, Joel Stein, Matei Ciocarlie

    Abstract: Intent inferral on a hand orthosis for stroke patients is challenging due to the difficulty of data collection from impaired subjects. Additionally, EMG signals exhibit significant variations across different conditions, sessions, and subjects, making it hard for classifiers to generalize. Traditional approaches require a large labeled dataset from the new condition, session, or subject to train i… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 8 pages

  7. arXiv:2405.18113  [pdf, other

    cs.CL cs.AI

    Facilitating Multi-Role and Multi-Behavior Collaboration of Large Language Models for Online Job Seeking and Recruiting

    Authors: Hongda Sun, Hongzhan Lin, Haiyu Yan, Chen Zhu, Yang Song, Xin Gao, Shuo Shang, Rui Yan

    Abstract: The emergence of online recruitment services has revolutionized the traditional landscape of job seeking and recruitment, necessitating the development of high-quality industrial applications to improve person-job fitting. Existing methods generally rely on modeling the latent semantics of resumes and job descriptions and learning a matching function between them. Inspired by the powerful role-pla… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  8. arXiv:2405.09445  [pdf

    cond-mat.mtrl-sci physics.comp-ph

    Revisiting first-principles thermodynamics by quasiharmonic approach: Application to study thermal expansion of additively-manufactured Inconel 625

    Authors: Shun-Li Shang, Rushi Gong, Michael C. Gao, Darren C. Pagan, Zi-Kui Liu

    Abstract: An innovative method is developed for accurate determination of thermodynamic properties as a function of temperature by revisiting the density functional theory (DFT) based quasiharmonic approach (QHA). The present methodology individually evaluates the contributions from static total energy, phonon, and thermal electron to free energy for increased efficiency and accuracy. The Akaike information… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: This manuscript includes both the main text and the supplementary material, but without the supplementary Excel file

  9. arXiv:2405.03654  [pdf, other

    cs.CR cs.AI

    Can LLMs Deeply Detect Complex Malicious Queries? A Framework for Jailbreaking via Obfuscating Intent

    Authors: Shang Shang, Xinqiang Zhao, Zhongjiang Yao, Yepeng Yao, Liya Su, Zijing Fan, Xiaodan Zhang, Zhengwei Jiang

    Abstract: To demonstrate and address the underlying maliciousness, we propose a theoretical hypothesis and analytical approach, and introduce a new black-box jailbreak attack methodology named IntentObfuscator, exploiting this identified flaw by obfuscating the true intentions behind user prompts.This approach compels LLMs to inadvertently generate restricted content, bypassing their built-in content securi… ▽ More

    Submitted 7 May, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

  10. arXiv:2404.19098  [pdf

    cond-mat.mtrl-sci

    Investigation of ideal shear strength of dilute binary and ternary Ni-based alloys using first-principles calculations, CALPHAD modeling and correlation analysis

    Authors: Shuang Lin, Shun-Li Shang, John D. Shimanek, Yi Wang, Allison M. Beese, Zi-Kui Liu

    Abstract: In the present work, the ideal shear strength (Tis) of dilute Ni34XZ ternary alloys (X or Z = Al, Co, Cr, Fe, Mn, Mo, Nb, Si, Ti) are predicted by first-principles calculations based on density functional theory (DFT) in terms of pure alias shear deformations. The results show that within the concentration up to 8.3% of the alloying elements, Tis increases with composition in binary systems with M… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 10 Figures Submitted to Computational Materials Science on April 25th

    ACM Class: I.6.6; J.6

  11. Global prediction of nuclear charge density distributions using deep neural network

    Authors: Tian Shuai Shang, Hui Hui Xie, Jian Li, Haozhao Liang

    Abstract: A deep neural network (DNN) has been developed to generate the distributions of nuclear charge density, utilizing the training data from the relativistic density functional theory and incorporating available experimental charge radii of 1014 nuclei into the loss function. The DNN achieved a root-mean-square (rms) deviation of 0.0193 fm for charge radii on its validation set. Furthermore, the DNN c… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 11 pages, 5 figures

    Journal ref: Phys. Rev. C 110 (2024) 014308

  12. arXiv:2404.06311  [pdf, other

    cs.IR

    DRE: Generating Recommendation Explanations by Aligning Large Language Models at Data-level

    Authors: Shen Gao, Yifan Wang, Jiabao Fang, Lisi Chen, Peng Han, Shuo Shang

    Abstract: Recommendation systems play a crucial role in various domains, suggesting items based on user behavior.However, the lack of transparency in presenting recommendations can lead to user confusion. In this paper, we introduce Data-level Recommendation Explanation (DRE), a non-intrusive explanation framework for black-box recommendation models.Different from existing methods, DRE does not require any… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 5 pages, 2 figures

  13. arXiv:2404.05569  [pdf, other

    cs.AI cs.CL cs.MA

    360$^\circ$REA: Towards A Reusable Experience Accumulation with 360° Assessment for Multi-Agent System

    Authors: Shen Gao, Hao Li, Chengrui Huang, Quan Tu, Zhiliang Tian, Minlie Huang, Shuo Shang

    Abstract: Large language model agents have demonstrated remarkable advancements across various complex tasks. Recent works focus on optimizing the agent team or employing self-reflection to iteratively solve complex tasks. Since these agents are all based on the same LLM, only conducting self-evaluation or removing underperforming agents does not substantively enhance the capability of the agents. We argue… ▽ More

    Submitted 26 June, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

  14. arXiv:2404.00719  [pdf

    cond-mat.supr-con physics.comp-ph quant-ph

    Revealing Symmetry-broken Superconducting Configurations by Density Functional Theory

    Authors: Zi-Kui Liu, Shun-Li Shang

    Abstract: A coherent theory for both conventional and unconventional superconductors is currently lacking. Here we show that the electron charge densities of Al, YBa2Cu3O7 (YBCO), and LaH10 along with Pb and Nb3Sn share the same feature of electron charge gains in their respective superconducting configurations (SCCs) predicted by first-principles calculations based on the density functional theory (DFT). I… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

  15. arXiv:2403.19084  [pdf

    cond-mat.mtrl-sci

    MaterialsMap: A CALPHAD-Based Tool to Design Composition Pathways through feasibility map for Desired Dissimilar Materials, demonstrated with RSW Joining of Ag-Al-Cu

    Authors: Hui Sun, Bo Pan, Zhening Yang, Adam M. Krajewski, Brandon Bocklund, Shun-Li Shang, Jingjing Li, Allison M. Beese, Zi-Kui Liu

    Abstract: Assembly of dissimilar metals can be achieved by different methods, for example, casting, welding, and additive manufacturing (AM). However, undesired phases formed in liquid-phase assembling processes due to solute segregation during solidification diminish mechanical and other properties of the processed parts. In the present work, an open-source software named MaterialsMap, has been developed b… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  16. arXiv:2403.17755  [pdf, other

    cs.AI cs.CR cs.CV

    DataCook: Crafting Anti-Adversarial Examples for Healthcare Data Copyright Protection

    Authors: Sihan Shang, Jiancheng Yang, Zhenglong Sun, Pascal Fua

    Abstract: In the realm of healthcare, the challenges of copyright protection and unauthorized third-party misuse are increasingly significant. Traditional methods for data copyright protection are applied prior to data distribution, implying that models trained on these data become uncontrollable. This paper introduces a novel approach, named DataCook, designed to safeguard the copyright of healthcare data… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  17. arXiv:2403.06831  [pdf, other

    cs.CV

    HDRTransDC: High Dynamic Range Image Reconstruction with Transformer Deformation Convolution

    Authors: Shuaikang Shang, Xuejing Kang, Anlong Ming

    Abstract: High Dynamic Range (HDR) imaging aims to generate an artifact-free HDR image with realistic details by fusing multi-exposure Low Dynamic Range (LDR) images. Caused by large motion and severe under-/over-exposure among input LDR images, HDR imaging suffers from ghosting artifacts and fusion distortions. To address these critical issues, we propose an HDR Transformer Deformation Convolution (HDRTran… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  18. arXiv:2403.05217  [pdf, other

    cs.CL cs.AI cs.IR

    Harnessing Multi-Role Capabilities of Large Language Models for Open-Domain Question Answering

    Authors: Hongda Sun, Yuxuan Liu, Chengwei Wu, Haiyu Yan, Cheng Tai, Xin Gao, Shuo Shang, Rui Yan

    Abstract: Open-domain question answering (ODQA) has emerged as a pivotal research spotlight in information systems. Existing methods follow two main paradigms to collect evidence: (1) The \textit{retrieve-then-read} paradigm retrieves pertinent documents from an external corpus; and (2) the \textit{generate-then-read} paradigm employs large language models (LLMs) to generate relevant documents. However, nei… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: TheWebConf 2024 (WWW 2024) oral, code repo: https://github.com/EthanLeo-LYX/LLMQA

  19. arXiv:2403.03102  [pdf, other

    cs.CL cs.AI

    "In Dialogues We Learn": Towards Personalized Dialogue Without Pre-defined Profiles through In-Dialogue Learning

    Authors: Chuanqi Cheng, Quan Tu, Wei Wu, Shuo Shang, Cunli Mao, Zhengtao Yu, Rui Yan

    Abstract: Personalized dialogue systems have gained significant attention in recent years for their ability to generate responses in alignment with different personas. However, most existing approaches rely on pre-defined personal profiles, which are not only time-consuming and labor-intensive to create but also lack flexibility. We propose In-Dialogue Learning (IDL), a fine-tuning framework that enhances t… ▽ More

    Submitted 12 March, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  20. arXiv:2403.02181  [pdf, other

    cs.CL cs.AI cs.LG

    Not All Layers of LLMs Are Necessary During Inference

    Authors: Siqi Fan, Xin Jiang, Xiang Li, Xuying Meng, Peng Han, Shuo Shang, Aixin Sun, Yequan Wang, Zhongyuan Wang

    Abstract: Due to the large number of parameters, the inference phase of Large Language Models (LLMs) is resource-intensive. However, not all requests posed to LLMs are equally difficult to handle. Through analysis, we show that for some tasks, LLMs can achieve results comparable to the final output at some intermediate layers. That is, not all layers of LLMs are necessary during inference. If we can predict… ▽ More

    Submitted 9 July, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  21. arXiv:2403.00832  [pdf, other

    cs.IR cs.AI

    Explainable Session-based Recommendation via Path Reasoning

    Authors: Yang Cao, Shuo Shang, Jun Wang, Wei Zhang

    Abstract: This paper explores providing explainability for session-based recommendation (SR) by path reasoning. Current SR models emphasize accuracy but lack explainability, while traditional path reasoning prioritizes knowledge graph exploration, ignoring sequential patterns present in the session history. Therefore, we propose a generalized hierarchical reinforcement learning framework for SR, which impro… ▽ More

    Submitted 28 February, 2024; originally announced March 2024.

  22. arXiv:2403.00705  [pdf

    cond-mat.mtrl-sci

    First-principles Investigation of Thermodynamic Properties of CrNbO4 and CrTaO4

    Authors: Shuang Lin, Shun-Li Shang, Allison M. Beese, Zi-Kui Liu

    Abstract: In the present study, the DFT+U method was employed to predict the thermodynamic properties of Cr2O3, Nb2O5, and Ta2O5. Results were benchmarked with experimental data showing high accuracy, except for the negative thermal expansion (NTE) of Nb2O5, which is attributed to its polymorphic complexity. Additionally, we extended our analysis to rutile-type oxides CrNbO4 and CrTaO4, examining their entr… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  23. arXiv:2402.18425  [pdf

    cond-mat.mtrl-sci

    Predicting Phase Transitions in PbTiO$_3$ using Zentropy

    Authors: Nigel Lee En Hew, Shun-Li Shang, Zi-Kui Liu

    Abstract: According to conventional X-ray measurements, PbTiO$_3$ undergoes a phase transition from a tetragonal ferroelectric phase to a cubic paraelectric phase at 763 K. However, x-ray absorption fine-structure (XAFS) measurements indicate that PbTiO$_3$ is tetragonal even after the phase transition has occurred. The difference in these results concerns the length scales accessible to each measurement te… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 21 pages, 7 Figures, 1 Table, TMS 2024

  24. Revisiting thermodynamics in (LiF, NaF, KF, CrF2)-CrF3 by first-principles calculations and CALPHAD modeling

    Authors: Rushi Gong, Shun-Li Shang, Yi Wang, Jorge Paz Soldan Palma, Hojong Kim, Zi-Kui Liu

    Abstract: The thermodynamic description of the (LiF, NaF, KF, CrF2)-CrF3 systems has been revisited, aiming for a better understanding of the effects of Cr on the FLiNaK molten salt. First-principles calculations based on density functional theory (DFT) were performed to determine the electronic and structural properties of each compound, including the formation enthalpy, volume, and bulk modulus. DFT-based… ▽ More

    Submitted 28 February, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

  25. arXiv:2401.15484  [pdf, other

    cs.RO

    R$\times$R: Rapid eXploration for Reinforcement Learning via Sampling-based Reset Distributions and Imitation Pre-training

    Authors: Gagan Khandate, Tristan L. Saidi, Siqi Shang, Eric T. Chang, Yang Liu, Seth Dennis, Johnson Adams, Matei Ciocarlie

    Abstract: We present a method for enabling Reinforcement Learning of motor control policies for complex skills such as dexterous manipulation. We posit that a key difficulty for training such policies is the difficulty of exploring the problem state space, as the accessible and useful regions of this space form a complex structure along manifolds of the original high-dimensional state space. This work prese… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: 20 pages, 14 figures, submitted to Autonomous Robots, RSS 2023 Special Issue. arXiv admin note: substantial text overlap with arXiv:2303.03486

  26. arXiv:2401.14586  [pdf

    cond-mat.mtrl-sci

    Additively manufactured Ni-20Cr to V functionally graded material: computational predictions and experimental verification of phase formations

    Authors: Beril Tonyali, Hui Sun, Brandon Bocklund, John Paul Borgonia, Richard A. Otis, Shun-Li Shang, Zi-Kui Liu, Allison M. Beese

    Abstract: A database for the Cr-Ni-V system was constructed by modeling the binary Cr-V and ternary Cr-Ni-V systems using the CALPHAD approach aided by density functional theory (DFT)-based first-principles calculations and ab initio molecular dynamics (AIMD) simulations. To validate this new database, a functionally graded material (FGM) using Ni-20Cr and elemental V was fabricated using directed energy de… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  27. arXiv:2401.03212  [pdf, other

    cond-mat.soft physics.comp-ph

    Viscosity bounds in liquids with different structure and bonding types

    Authors: M. Withington, H. L. Devereux, C. Cockrell, A. M. Elena, I. T. Todorov, Z. K. Liu, S. L. Shang, J. S. McCloy, P. A. Bingham, K. Trachenko

    Abstract: Recently, it was realised that liquid viscosity has a lower bound which is nearly constant for all liquids and is governed by fundamental physical constants. This was supported by experimental data in noble and molecular liquids. Here, we perform large-scale molecular dynamics simulations to ascertain this bound in two other important liquid types: the ionic molten salt system LiF and metallic Pb.… ▽ More

    Submitted 6 January, 2024; originally announced January 2024.

    Comments: 6 pages, 7 figures

    Journal ref: Phys. Rev. B 109, 094205,19 March 2024

  28. arXiv:2312.16369  [pdf, ps, other

    math.QA math.RT

    Allison-Benkart-Gao functor and the free non-unital alternative algebras

    Authors: Shikui Shang

    Abstract: Let $k$ be a field of characteristic $0$. We introduce a pair of adjoint functors, Allison-Benkart-Gao functor $\mathcal{ABG}$ and Berman-Moody functor $\mathcal{BM}$, between the category of non-unital alternative algebras over $k$ and the category ${\text{\bf Lie}_{\text{R}}}$ of Lie algebras with appropriate $sl_3(k)$-module structures. Surprisingly, when $A$ is a non-unital alternative algebra… ▽ More

    Submitted 30 December, 2023; v1 submitted 26 December, 2023; originally announced December 2023.

  29. arXiv:2312.01871  [pdf, other

    cs.CV

    FeaInfNet: Diagnosis in Medical Image with Feature-Driven Inference and Visual Explanations

    Authors: Yitao Peng, Lianghua He, Die Hu, Yihang Liu, Longzhen Yang, Shaohua Shang

    Abstract: Interpretable deep learning models have received widespread attention in the field of image recognition. Due to the unique multi-instance learning of medical images and the difficulty in identifying decision-making regions, many interpretability models that have been proposed still have problems of insufficient accuracy and interpretability in medical image disease diagnosis. To solve these proble… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

  30. arXiv:2310.20226  [pdf, ps, other

    math.NT math.CO

    Value sets of non-permutation polynomials over the residue class rings of integers

    Authors: Shikui Shang

    Abstract: In this paper, we study the value sets of non-permutation polynomial functions over the residue class ring $\mathbb{Z}/m\mathbb{Z}$. When $m=p^r$ is a power of some prime $p$, an upper bound is given for the size of the value set of a polynomial function which is not a permutation. We also show that this upper bound can be achieved by some integral polynomials. Finally, we generalize the results f… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

  31. arXiv:2310.18659  [pdf, other

    cs.AI cs.CL

    DetermLR: Augmenting LLM-based Logical Reasoning from Indeterminacy to Determinacy

    Authors: Hongda Sun, Weikai Xu, Wei Liu, Jian Luan, Bin Wang, Shuo Shang, Ji-Rong Wen, Rui Yan

    Abstract: Recent advances in large language models (LLMs) have revolutionized the landscape of reasoning tasks. To enhance the capabilities of LLMs to emulate human reasoning, prior studies have focused on modeling reasoning steps using various thought structures like chains, trees, or graphs. However, LLM-based reasoning still encounters the following challenges: (1) Limited adaptability of preset structur… ▽ More

    Submitted 26 May, 2024; v1 submitted 28 October, 2023; originally announced October 2023.

    Comments: Accepted at ACL 2024 Main, Code repo: https://github.com/XiaoMi/DetermLR

  32. arXiv:2310.15523  [pdf, other

    cs.LG cs.AI

    Generative and Contrastive Paradigms Are Complementary for Graph Self-Supervised Learning

    Authors: Yuxiang Wang, Xiao Yan, Chuang Hu, Fangcheng Fu, Wentao Zhang, Hao Wang, Shuo Shang, Jiawei Jiang

    Abstract: For graph self-supervised learning (GSSL), masked autoencoder (MAE) follows the generative paradigm and learns to reconstruct masked graph edges or node features. Contrastive Learning (CL) maximizes the similarity between augmented views of the same graph and is widely used for GSSL. However, MAE and CL are considered separately in existing works for GSSL. We observe that the MAE and CL paradigms… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

  33. arXiv:2310.12519  [pdf, ps, other

    math.MG

    The unimodular equivalence of sublattices in an $n$-dimensional lattice

    Authors: Shikui Shang

    Abstract: In this paper, we study the unimodular equivalence of sublattices in an $n$-dimensional lattice. A recursive procedure is given to compute the cardinalities of the unimodular equivalent classes with the indices which are powers of a prime $p$. We also show that these are integral polynomials in $p$. When $n=2$, the explicit formulae of the cardinalities are presented depending on the prime decompo… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

  34. arXiv:2310.10992  [pdf, other

    cs.SD eess.AS

    A High Fidelity and Low Complexity Neural Audio Coding

    Authors: Wenzhe Liu, Wei Xiao, Meng Wang, Shan Yang, Yupeng Shi, Yuyong Kang, Dan Su, Shidong Shang, Dong Yu

    Abstract: Audio coding is an essential module in the real-time communication system. Neural audio codecs can compress audio samples with a low bitrate due to the strong modeling and generative capabilities of deep neural networks. To address the poor high-frequency expression and high computational cost and storage consumption, we proposed an integrated framework that utilizes a neural network to model wide… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

  35. arXiv:2310.06527  [pdf

    cond-mat.stat-mech cond-mat.mtrl-sci

    Zentropy theory for accurate prediction of free energy, volume, and thermal expansion without fitting parameters

    Authors: Zi-Kui Liu, Nigel L. E. Hew, Shun-Li Shang

    Abstract: Based on statistical mechanics, a macroscopically homogeneous system, i.e., a single phase in the present context, is composed of many independent configurations that the system embraces. The macroscopical properties of the system are determined by the properties and statistical probabilities of those configurations with respect to external conditions. The volume of a single phase is thus the weig… ▽ More

    Submitted 7 November, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: arXiv admin note: text overlap with arXiv:2309.09823

    Journal ref: Microstructures 4, 2024009 (2024)

  36. arXiv:2308.16385  [pdf, other

    cs.LG cs.AI

    BenchTemp: A General Benchmark for Evaluating Temporal Graph Neural Networks

    Authors: Qiang Huang, Jiawei Jiang, Xi Susie Rao, Ce Zhang, Zhichao Han, Zitao Zhang, Xin Wang, Yongjun He, Quanqing Xu, Yang Zhao, Chuang Hu, Shuo Shang, Bo Du

    Abstract: To handle graphs in which features or connectivities are evolving over time, a series of temporal graph neural networks (TGNNs) have been proposed. Despite the success of these TGNNs, the previous TGNN evaluations reveal several limitations regarding four critical issues: 1) inconsistent datasets, 2) inconsistent evaluation pipelines, 3) lacking workload diversity, and 4) lacking efficient compari… ▽ More

    Submitted 30 August, 2023; originally announced August 2023.

    Comments: 28 pages, 23 figures, 27 tables. Submitted to the Conference on Neural Information Processing Systems 2023 Track on Datasets and Benchmarks

  37. arXiv:2308.13180  [pdf

    cond-mat.mtrl-sci cond-mat.supr-con

    Electronic-grade epitaxial (111) KTaO3 heterostructures

    Authors: Jieun Kim, Muqing Yu, Jung-Woo Lee, Shun-Li Shang, Gi-Yeop Kim, Pratap Pal, Jinsol Seo, Neil Campbell, Kitae Eom, Ranjani Ramachandran, Mark S. Rzchowski, Sang Ho Oh, Si-Young Choi, Zi-Kui Liu, Jeremy Levy, Chang-Beom Eom

    Abstract: KTaO3 has recently attracted attention as a model system to study the interplay of quantum paraelectricity, spin-orbit coupling, and superconductivity. However, the high and low vapor pressures of potassium and tantalum present processing challenges to creating interfaces clean enough to reveal the intrinsic quantum properties. Here, we report superconducting heterostructures based on electronic-g… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

  38. arXiv:2308.10278  [pdf, other

    cs.CL

    CharacterChat: Learning towards Conversational AI with Personalized Social Support

    Authors: Quan Tu, Chuanqi Chen, Jinpeng Li, Yanran Li, Shuo Shang, Dongyan Zhao, Ran Wang, Rui Yan

    Abstract: In our modern, fast-paced, and interconnected world, the importance of mental well-being has grown into a matter of great urgency. However, traditional methods such as Emotional Support Conversations (ESC) face challenges in effectively addressing a diverse range of individual personalities. In response, we introduce the Social Support Conversation (S2Conv) framework. It comprises a series of supp… ▽ More

    Submitted 20 August, 2023; originally announced August 2023.

    Comments: 10 pages, 6 figures, 5 tables

  39. arXiv:2308.05837  [pdf

    cond-mat.mtrl-sci

    Predictions and correlation analyses of Ellingham diagrams in binary oxides

    Authors: Shun-Li Shang, Shuang Lin, Michael C. Gao, Darrell G. Schlom, Zi-Kui Liu

    Abstract: Knowing oxide-forming ability is vital to gain desired or avoid deleterious oxides formation through tuning oxidizing environment and materials chemistry. Here, we have conducted a comprehensive thermodynamic analysis of 137 binary oxides using the presently predicted Ellingham diagrams. It is found that the active elements to form oxides easily are the f-block elements (lanthanides and actinides)… ▽ More

    Submitted 10 August, 2023; originally announced August 2023.

  40. arXiv:2307.14554  [pdf, ps, other

    math.PR

    Large deviation principle for stochastic reaction-diffusion equations with super-linear drift on $\mathbb{R}$ driven by space-time white noise

    Authors: Yue Li, Shijie Shang, Jianliang Zhai

    Abstract: In this paper, we consider stochastic reaction-diffusion equations with super-linear drift on the real line $\mathbb{R}$ driven by space-time white noise. A Freidlin-Wentzell large deviation principle is established by a modified weak convergence method on the space $C([0,T], C_{tem}(\mathbb{R}))$. Obtaining the main result in this paper is challenging due to the setting of unbounded domain, the s… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

    MSC Class: 60H15; 60F10

  41. arXiv:2307.13581  [pdf, other

    cond-mat.mtrl-sci cs.LG

    Comparing Forward and Inverse Design Paradigms: A Case Study on Refractory High-Entropy Alloys

    Authors: Arindam Debnath, Lavanya Raman, Wenjie Li, Adam M. Krajewski, Marcia Ahn, Shuang Lin, Shunli Shang, Allison M. Beese, Zi-Kui Liu, Wesley F. Reinhart

    Abstract: The rapid design of advanced materials is a topic of great scientific interest. The conventional, ``forward'' paradigm of materials design involves evaluating multiple candidates to determine the best candidate that matches the target properties. However, recent advances in the field of deep learning have given rise to the possibility of an ``inverse'' design paradigm for advanced materials, where… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

  42. arXiv:2305.19739  [pdf, ps, other

    math.PR

    Transportation cost inequalities for stochastic reaction diffusion equations on the whole line $\mathbb{R}$

    Authors: Yue Li, Shijie Shang, Tusheng Zhang

    Abstract: In this paper, we established quadratic transportation cost inequalities for solutions of stochastic reaction diffusion equations driven by multiplicative space-time white noise on the whole line $\mathbb{R}$. Since the space variable is defined on the unbounded domain $\mathbb{R}$, the inequalities are proved under a weighted $L^2$-norm and a weighted uniform metric in the so called $L^2_{tem}$,… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

    MSC Class: 60H15 (Primary) 35R60 (Secondary)

  43. arXiv:2305.19085  [pdf, ps, other

    math.AG

    Hard Lefschetz theorems for free line bundles

    Authors: Jiajun Hu, Shijie Shang, Jian Xiao

    Abstract: We introduce a partial positivity notion for algebraic maps via the defect of semismallness. This positivity notion is modeled on $m$-positivity in the analytic setting and $m$-ampleness in the geometric setting. Using this positivity condition for algebraic maps, we establish Kähler packages, that is, Hard Lefschetz theorems and Hodge-Riemann bilinear relations, for the complete intersections of… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: 14 pages; comments welcome!

  44. arXiv:2305.05599  [pdf, other

    cs.SD cs.HC eess.AS

    Inter-SubNet: Speech Enhancement with Subband Interaction

    Authors: Jun Chen, Wei Rao, Zilin Wang, Jiuxin Lin, Zhiyong Wu, Yannan Wang, Shidong Shang, Helen Meng

    Abstract: Subband-based approaches process subbands in parallel through the model with shared parameters to learn the commonality of local spectrums for noise reduction. In this way, they have achieved remarkable results with fewer parameters. However, in some complex environments, the lack of global spectral information has a negative impact on the performance of these subband-based approaches. To this end… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: Accepted by ICASSP 2023

  45. arXiv:2304.06875  [pdf, other

    cs.CL cs.LG

    nanoLM: an Affordable LLM Pre-training Benchmark via Accurate Loss Prediction across Scales

    Authors: Yiqun Yao, Siqi fan, Xiusheng Huang, Xuezhi Fang, Xiang Li, Ziyi Ni, Xin Jiang, Xuying Meng, Peng Han, Shuo Shang, Kang Liu, Aixin Sun, Yequan Wang

    Abstract: As language models scale up, it becomes increasingly expensive to verify research ideas because conclusions on small models do not trivially transfer to large ones. A possible solution is to establish a generic system that accurately predicts certain metrics for large models without training them. Existing scaling laws require hyperparameter search on the largest models, limiting their predicative… ▽ More

    Submitted 6 April, 2024; v1 submitted 13 April, 2023; originally announced April 2023.

    Comments: This is a modified and extended version of our previous Mu-scaling work released in April 2023 (see v1)

  46. arXiv:2303.08714  [pdf, other

    cs.CV

    ResDiff: Combining CNN and Diffusion Model for Image Super-Resolution

    Authors: Shuyao Shang, Zhengyang Shan, Guangxing Liu, LunQian Wang, XingHua Wang, Zekai Zhang, Jinglin Zhang

    Abstract: Adapting the Diffusion Probabilistic Model (DPM) for direct image super-resolution is wasteful, given that a simple Convolutional Neural Network (CNN) can recover the main low-frequency content. Therefore, we present ResDiff, a novel Diffusion Probabilistic Model based on Residual structure for Single Image Super-Resolution (SISR). ResDiff utilizes a combination of a CNN, which restores primary lo… ▽ More

    Submitted 2 February, 2024; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: 9 pages, 5 figures

  47. arXiv:2303.07704  [pdf, other

    eess.AS cs.SD

    TEA-PSE 3.0: Tencent-Ethereal-Audio-Lab Personalized Speech Enhancement System For ICASSP 2023 DNS Challenge

    Authors: Yukai Ju, Jun Chen, Shimin Zhang, Shulin He, Wei Rao, Weixin Zhu, Yannan Wang, Tao Yu, Shidong Shang

    Abstract: This paper introduces the Unbeatable Team's submission to the ICASSP 2023 Deep Noise Suppression (DNS) Challenge. We expand our previous work, TEA-PSE, to its upgraded version -- TEA-PSE 3.0. Specifically, TEA-PSE 3.0 incorporates a residual LSTM after squeezed temporal convolution network (S-TCN) to enhance sequence modeling capabilities. Additionally, the local-global representation (LGR) struct… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: Accepted by ICASSP 2023

  48. arXiv:2303.03486  [pdf, other

    cs.RO

    Sampling-based Exploration for Reinforcement Learning of Dexterous Manipulation

    Authors: Gagan Khandate, Siqi Shang, Eric T. Chang, Tristan Luca Saidi, Yang Liu, Seth Matthew Dennis, Johnson Adams, Matei Ciocarlie

    Abstract: In this paper, we present a novel method for achieving dexterous manipulation of complex objects, while simultaneously securing the object without the use of passive support surfaces. We posit that a key difficulty for training such policies in a Reinforcement Learning framework is the difficulty of exploring the problem state space, as the accessible regions of this space form a complex structure… ▽ More

    Submitted 23 May, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: 10 pages, 7 figures, accepted at Robotics Science & Systems 2023

  49. arXiv:2212.12116  [pdf, other

    cs.CV cs.AI

    Unpaired Overwater Image Defogging Using Prior Map Guided CycleGAN

    Authors: Yaozong Mo, Chaofeng Li, Wenqi Ren, Shaopeng Shang, Wenwu Wang, Xiao-jun Wu

    Abstract: Deep learning-based methods have achieved significant performance for image defogging. However, existing methods are mainly developed for land scenes and perform poorly when dealing with overwater foggy images, since overwater scenes typically contain large expanses of sky and water. In this work, we propose a Prior map Guided CycleGAN (PG-CycleGAN) for defogging of images with overwater scenes. T… ▽ More

    Submitted 22 December, 2022; originally announced December 2022.

  50. arXiv:2212.12096  [pdf

    cond-mat.mtrl-sci physics.app-ph

    Silicon-doped $β$-Ga$_2$O$_3$ films grown at 1 $μ$m/h by suboxide molecular-beam epitaxy

    Authors: Kathy Azizie, Felix V. E. Hensling, Cameron A. Gorsak, Yunjo Kim, Daniel M. Dryden, M. K. Indika Senevirathna, Selena Coye, Shun-Li Shang, Jacob Steele, Patrick Vogt, Nicholas A. Parker, Yorick A. Birkhölzer, Jonathan P. McCandless, Debdeep Jena, Huili G. Xing, Zi-Kui Liu, Michael D. Williams, Andrew J. Green, Kelson Chabak, Adam T. Neal, Shin Mou, Michael O. Thompson, Hari P. Nair, Darrell G. Schlom

    Abstract: We report the use of suboxide molecular-beam epitaxy (S-MBE) to grow $β$-Ga$_2$O$_3$ at a growth rate of ~1 $μ$m/h with control of the silicon doping concentration from 5x10$^{16}$ to 10$^{19}$ cm$^{-3}$. In S-MBE, pre-oxidized gallium in the form of a molecular beam that is 99.98\% Ga$_2$O, i.e., gallium suboxide, is supplied. Directly supplying Ga2O to the growth surface bypasses the rate-limiti… ▽ More

    Submitted 22 December, 2022; originally announced December 2022.

    Comments: 19 pages, 7 figures, 2 tables, 2 pages supplementary materials