Showing 1–50 of 103 results for author: Zuo, S

  1. arXiv:2406.19350  [pdf, other]

    cs.GT

    Complex Dynamics in Autobidding Systems

    Authors: Renato Paes Leme, Georgios Piliouras, Jon Schneider, Kelly Spendlove, Song Zuo

    Abstract: It has become the default in markets such as ad auctions for participants to bid in an auction through automated bidding agents (autobidders) which adjust bids over time to satisfy return-over-spend constraints. Despite the prominence of such systems for the internet economy, their resulting dynamical behavior is still not well understood. Although one might hope that such relatively simple system…

    Submitted 1 July, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2406.16694  [pdf, other]

    cs.CL

    Task Oriented In-Domain Data Augmentation

    Authors: Xiao Liang, Xinyu Hu, Simiao Zuo, Yeyun Gong, Qiang Lou, Yi Liu, Shao-Lun Huang, Jian Jiao

    Abstract: Large Language Models (LLMs) have shown superior performance in various applications and fields. To achieve better performance on specialized domains such as law and advertisement, LLMs are often continually pre-trained on in-domain data. However, existing approaches suffer from two major issues. First, in-domain data are scarce compared with general domain-agnostic data. Second, data used for contin…

    Submitted 24 June, 2024; originally announced June 2024.

  3. arXiv:2406.15740  [pdf, other]

    astro-ph.IM physics.ins-det

    The FRB-searching pipeline of the Tianlai Cylinder Pathfinder Array

    Authors: Zijie Yu, Furen Deng, Shijie Sun, Chenhui Niu, Jixia Li, Fengquan Wu, Wei-Yang Wang, Yougang Wang, Shifan Zuo, Lin Shu, Jie Hao, Xiaohui Liu, Reza Ansari, Ue-Li Pen, Albert Stebbins, Peter Timbie, Xuelei Chen

    Abstract: This paper presents the design, calibration, and survey strategy of the Fast Radio Burst (FRB) digital backend and its real-time data processing pipeline employed in the Tianlai Cylinder Pathfinder array. The array, consisting of three parallel cylindrical reflectors and equipped with 96 dual-polarization feeds, is a radio interferometer array designed for conducting drift scans of the northern ce…

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: 27 pages, 21 figures, 7 tables, RAA accepted

  4. arXiv:2406.11409  [pdf, other]

    cs.CL cs.AI

    CodeGemma: Open Code Models Based on Gemma

    Authors: CodeGemma Team, Heri Zhao, Jeffrey Hui, Joshua Howland, Nam Nguyen, Siqi Zuo, Andrea Hu, Christopher A. Choquette-Choo, Jingyue Shen, Joe Kelley, Kshitij Bansal, Luke Vilnis, Mateo Wirth, Paul Michel, Peter Choy, Pratik Joshi, Ravin Kumar, Sarmad Hashmi, Shubham Agrawal, Zhitao Gong, Jane Fine, Tris Warkentin, Ale Jakse Hartman, Bin Ni, Kathy Korevec, et al. (2 additional authors not shown)

    Abstract: This paper introduces CodeGemma, a collection of specialized open code models built on top of Gemma, capable of a variety of code and natural language generation tasks. We release three model variants. CodeGemma 7B pretrained (PT) and instruction-tuned (IT) variants have remarkably resilient natural language understanding, excel in mathematical reasoning, and match code capabilities of other open…

    Submitted 18 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: v1: 11 pages, 4 figures, 5 tables. v2: Update metadata

  5. arXiv:2406.07023  [pdf, other]

    cs.CV

    LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection

    Authors: Jiahua Xu, Si Zuo, Chenfeng Wei, Wei Zhou

    Abstract: With the rapid proliferation of autonomous driving, there has been a heightened focus on the research of lidar-based 3D semantic segmentation and object detection methodologies, aiming to ensure the safety of traffic participants. In recent decades, learning-based approaches have emerged, demonstrating remarkable performance gains in comparison to conventional algorithms. However, the segmentation…

    Submitted 11 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

  6. arXiv:2406.03211  [pdf, ps, other]

    nucl-th hep-ph

    Study of hybrid stars with nonstrange quark matter cores

    Authors: Cheng-Ming Li, He-Rui Zheng, Shu-Yu Zuo, Ya-Peng Zhao, Fei Wang, Yong-Feng Huang

    Abstract: In this work, under the hypothesis that quark matter may not be strange [Phys. Rev. Lett. 120, 222001 (2018)], we adopt a modification of the coupling constant of the four-quark scalar interaction $G\rightarrow G_1+G_2\langle\bar{\psi}\psi\rangle$ in the 2-flavor Nambu-Jona-Lasinio model to study nonstrange hybrid stars. According to lattice QCD simulation results of the critical temperature at zero chemi…

    Submitted 23 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: 11 pages, 10 figures

  7. arXiv:2405.20642  [pdf, other]

    cs.LG stat.ML

    Principal-Agent Multitasking: the Uniformity of Optimal Contracts and its Efficient Learning via Instrumental Regression

    Authors: Shiliang Zuo

    Abstract: This work studies the multitasking principal-agent problem. I first show a ``uniformity'' result. Specifically, when the tasks are perfect substitutes, and the agent's cost function is homogeneous to a certain degree, then the optimal contract only depends on the marginal utility of each task and the degree of homogeneity. I then study a setting where the marginal utility of each task is unknown s…

    Submitted 31 May, 2024; originally announced May 2024.

  8. arXiv:2405.20631  [pdf, ps, other]

    cs.GT

    Optimizing Contracts in Principal-Agent Team Production

    Authors: Shiliang Zuo

    Abstract: I study a principal-agent team production model. The principal hires a team of agents to participate in a common production task. The exact effort of each agent is unobservable and unverifiable, but the total production outcome (e.g. the total revenue) can be observed. The principal incentivizes the agents to exert effort through contracts. Specifically, the principal promises that each agent rece…

    Submitted 31 May, 2024; originally announced May 2024.

  9. arXiv:2404.03476  [pdf, other]

    cs.GT

    A Reduction from Multi-Parameter to Single-Parameter Bayesian Contract Design

    Authors: Matteo Castiglioni, Junjie Chen, Minming Li, Haifeng Xu, Song Zuo

    Abstract: The main result of this paper is an almost approximation-preserving polynomial-time reduction from the most general multi-parameter Bayesian contract design (BCD) to single-parameter BCD. That is, for any multi-parameter BCD instance $I^M$, we construct a single-parameter instance $I^S$ such that any $β$-approximate contract (resp. menu of contracts) of $I^S$ can in turn be converted to a $(β-ε)$-…

    Submitted 4 April, 2024; originally announced April 2024.

  10. arXiv:2403.13374  [pdf, other]

    cs.LG cs.AI cs.CR

    Byzantine-resilient Federated Learning With Adaptivity to Data Heterogeneity

    Authors: Shiyuan Zuo, Xingrun Yan, Rongfei Fan, Han Hu, Hangguan Shan, Tony Q. S. Quek

    Abstract: This paper deals with federated learning (FL) in the presence of malicious Byzantine attacks and data heterogeneity. A novel Robust Average Gradient Algorithm (RAGA) is proposed, which leverages the geometric median for aggregation and can freely select the round number for local updating. Different from most existing resilient approaches, which perform convergence analysis based on strongly-conve…

    Submitted 27 March, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  11. arXiv:2403.07143  [pdf, ps, other]

    cs.GT cs.LG

    New Perspectives in Online Contract Design

    Authors: Shiliang Zuo

    Abstract: This work studies the repeated principal-agent problem from an online learning perspective. The principal's goal is to learn the optimal contract that maximizes her utility through repeated interactions, without prior knowledge of the agent's type (i.e., the agent's cost and production functions). This work contains three technical results. First, learning linear contracts with binary outcomes is…

    Submitted 22 May, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  12. arXiv:2402.13417  [pdf, other]

    cs.IR

    Unlocking the `Why' of Buying: Introducing a New Dataset and Benchmark for Purchase Reason and Post-Purchase Experience

    Authors: Tao Chen, Siqi Zuo, Cheng Li, Mingyang Zhang, Qiaozhu Mei, Michael Bendersky

    Abstract: Explanations are crucial for enhancing user trust and understanding within modern recommendation systems. To build truly explainable systems, we need high-quality datasets that elucidate why users make choices. While previous efforts have focused on extracting users' post-purchase sentiment in reviews, they ignore the reasons behind the decision to buy. In our work, we propose a novel purchase r…

    Submitted 20 February, 2024; originally announced February 2024.

  13. arXiv:2401.13986  [pdf, other]

    cs.CL cs.AI cs.LG

    Towards Consistent Natural-Language Explanations via Explanation-Consistency Finetuning

    Authors: Yanda Chen, Chandan Singh, Xiaodong Liu, Simiao Zuo, Bin Yu, He He, Jianfeng Gao

    Abstract: Large language models (LLMs) often generate convincing, fluent explanations. However, different from humans, they often generate inconsistent explanations on different inputs. For example, an LLM may generate the explanation "all birds can fly" when answering the question "Can sparrows fly?" but meanwhile answer "no" to the related question "Can penguins fly?". Explanations should be consistent ac…

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: arXiv admin note: text overlap with arXiv:2307.08678

  14. arXiv:2312.07145  [pdf, other]

    cs.LG stat.ML

    Contextual Bandits with Online Neural Regression

    Authors: Rohan Deb, Yikun Ban, Shiliang Zuo, Jingrui He, Arindam Banerjee

    Abstract: Recent works have shown a reduction from contextual bandits to online regression under a realizability assumption [Foster and Rakhlin, 2020, Foster and Krishnamurthy, 2021]. In this work, we investigate the use of neural networks for such online regression and associated Neural Contextual Bandits (NeuCBs). Using existing results for wide networks, one can readily show a ${\mathcal{O}}(\sqrt{T})$ r…

    Submitted 12 December, 2023; originally announced December 2023.

  15. arXiv:2312.01064  [pdf, other]

    astro-ph.IM astro-ph.CO

    Application of Regularization Methods in the Sky Map Reconstruction of the Tianlai Cylinder Pathfinder Array

    Authors: Kaifeng Yu, Shifan Zuo, Fengquan Wu, Yougang Wang, Xuelei Chen

    Abstract: The Tianlai cylinder pathfinder is a radio interferometer array to test 21 cm intensity mapping techniques in the post-reionization era. It works in passive drift scan mode to survey the sky visible in the northern hemisphere. To deal with the large instantaneous field of view and the spherical sky, we decompose the drift scan data into m-modes, which are linearly related to the sky intensity. The…

    Submitted 2 December, 2023; originally announced December 2023.

    Comments: 17 pages, 14 figures

  16. arXiv:2311.10679  [pdf, other]

    cs.GT

    Non-uniform Bid-scaling and Equilibria for Different Auctions: An Empirical Study

    Authors: Yuan Deng, Jieming Mao, Vahab Mirrokni, Yifeng Teng, Song Zuo

    Abstract: In recent years, the growing adoption of autobidding has motivated the study of auction design with value-maximizing auto-bidders. It is known that under mild assumptions, uniform bid-scaling is an optimal bidding strategy in truthful auctions, e.g., Vickrey-Clarke-Groves auction (VCG), and the price of anarchy for VCG is $2$. However, for other auction formats like First-Price Auction (FPA) and G…

    Submitted 17 November, 2023; originally announced November 2023.

  17. arXiv:2310.17602  [pdf, other]

    astro-ph.IM astro-ph.CO

    Simulation-based Inference of Reionization Parameters from 3D Tomographic 21 cm Light-cone Images -- II: Application of Solid Harmonic Wavelet Scattering Transform

    Authors: Xiaosheng Zhao, Yi Mao, Shifan Zuo, Benjamin D. Wandelt

    Abstract: The information regarding how the intergalactic medium is reionized by astrophysical sources is contained in the tomographic three-dimensional 21 cm images from the epoch of reionization. In Zhao et al. (2022a) ("Paper I"), we demonstrated for the first time that density estimation likelihood-free inference (DELFI) can be applied efficiently to perform a Bayesian inference of the reionization para…

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: 19 pages, 10 figures, 7 tables. Submitted to ApJ. Comments welcome

  18. arXiv:2310.16336  [pdf, other]

    cs.LG stat.ML

    SMURF-THP: Score Matching-based UnceRtainty quantiFication for Transformer Hawkes Process

    Authors: Zichong Li, Yanbo Xu, Simiao Zuo, Haoming Jiang, Chao Zhang, Tuo Zhao, Hongyuan Zha

    Abstract: Transformer Hawkes process models have been shown to be successful in modeling event sequence data. However, most of the existing training methods rely on maximizing the likelihood of event sequences, which involves calculating some intractable integral. Moreover, the existing methods fail to provide uncertainty quantification for model predictions, e.g., confidence intervals for the predicted event's…

    Submitted 24 October, 2023; originally announced October 2023.

  19. arXiv:2310.13855  [pdf, other]

    cs.CL cs.AI

    Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing

    Authors: Xinyu Hu, Pengfei Tang, Simiao Zuo, Zihan Wang, Bowen Song, Qiang Lou, Jian Jiao, Denis Charles

    Abstract: Large language models (LLMs) have made impressive progress in natural language processing. These models rely on proper human instructions (or prompts) to generate suitable responses. However, the potential of LLMs is not fully harnessed by commonly-used prompting methods: many human-in-the-loop algorithms employ ad-hoc procedures for prompt selection; while auto prompt generation approaches are e…

    Submitted 20 October, 2023; originally announced October 2023.

  20. arXiv:2310.10826  [pdf, ps, other]

    cs.GT econ.TH

    Mechanism Design for Large Language Models

    Authors: Paul Duetting, Vahab Mirrokni, Renato Paes Leme, Haifeng Xu, Song Zuo

    Abstract: We investigate auction mechanisms for AI-generated content, focusing on applications like ad creative generation. In our model, agents' preferences over stochastically generated content are encoded as large language models (LLMs). We propose an auction format that operates on a token-by-token basis, and allows LLM agents to influence content creation through single dimensional bids. We formulate t…

    Submitted 2 July, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: WWW'24 Best Paper

  21. arXiv:2310.10810  [pdf, other]

    cs.LG

    Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Stable Algorithms

    Authors: Alexander Bukharin, Yan Li, Yue Yu, Qingru Zhang, Zhehui Chen, Simiao Zuo, Chao Zhang, Songan Zhang, Tuo Zhao

    Abstract: Multi-Agent Reinforcement Learning (MARL) has shown promising results across several domains. Despite this promise, MARL policies often lack robustness and are therefore sensitive to small changes in their environment. This presents a serious concern for the real world deployment of MARL algorithms, where the testing environment may slightly differ from the training environment. In this work we sh…

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: 33 pages, 10 figures

  22. arXiv:2310.03105  [pdf, other]

    cs.GT

    Efficiency of the Generalized Second-Price Auction for Value Maximizers

    Authors: Yuan Deng, Mohammad Mahdian, Jieming Mao, Vahab Mirrokni, Hanrui Zhang, Song Zuo

    Abstract: We study the price of anarchy of the generalized second-price auction where bidders are value maximizers (i.e., autobidders). We show that in general the price of anarchy can be as bad as $0$. For comparison, the price of anarchy of running VCG is $1/2$ in the autobidding world. We further show a fine-grained price of anarchy with respect to the discount factors (i.e., the ratios of click probabi…

    Submitted 4 October, 2023; originally announced October 2023.

  23. arXiv:2309.17411  [pdf, other]

    eess.SY

    Resilient Model-Free Asymmetric Bipartite Consensus for Nonlinear Multi-Agent Systems against DoS Attacks

    Authors: Yi Zhang, Yichao Wang, Junbo Zhao, Shan Zuo

    Abstract: In this letter, we study a unified resilient asymmetric bipartite consensus (URABC) problem for nonlinear multi-agent systems with both cooperative and antagonistic interactions under denial-of-service (DoS) attacks. We first prove that the URABC problem is solved by stabilizing the neighborhood asymmetric bipartite consensus error. Then, we develop a distributed compact form dynamic linearizatio…

    Submitted 29 September, 2023; originally announced September 2023.

  24. arXiv:2309.17301  [pdf, other]

    eess.SY

    Distributed Resilient Control of DC Microgrids Under Generally Unbounded FDI Attacks

    Authors: Yichao Wang, Mohamadamin Rajabinezhad, Omar A. Beg, Shan Zuo

    Abstract: Due to the nature of distributed secondary control paradigm, DC microgrids are prone to malicious cyber-physical attacks, which could be unbounded to maximize their damage. Existing resilient secondary control methods addressing unbounded attacks require that the first time derivatives of cyber-physical attack signals be bounded. The secondary defense strategy presented in this letter relaxes such a…

    Submitted 29 September, 2023; originally announced September 2023.

  25. arXiv:2309.17253  [pdf, other]

    eess.SY

    Secondary Defense Strategies of AC Microgrids Against Generally Unbounded Attacks

    Authors: Yichao Wang, Mohamadamin Rajabinezhad, Shan Zuo

    Abstract: This paper develops fully distributed attack-resilient secondary defense strategies for AC microgrids, addressing more generally unbounded attacks on control input channels than those addressed in existing literature. The secondary control of local inverter includes consensus-based voltage and current regulators utilizing relative information from neighboring inverters. This distributed control…

    Submitted 29 September, 2023; originally announced September 2023.

  26. arXiv:2308.16896  [pdf, other]

    cs.CV cs.AI cs.LG

    PointOcc: Cylindrical Tri-Perspective View for Point-based 3D Semantic Occupancy Prediction

    Authors: Sicheng Zuo, Wenzhao Zheng, Yuanhui Huang, Jie Zhou, Jiwen Lu

    Abstract: Semantic segmentation in autonomous driving has been undergoing an evolution from sparse point segmentation to dense voxel segmentation, where the objective is to predict the semantic occupancy of each voxel in the concerned 3D space. The dense nature of the prediction space has rendered existing efficient 2D-projection-based methods (e.g., bird's eye view, range view, etc.) ineffective, as they c…

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: Code is available at https://github.com/wzzheng/PointOcc

  27. arXiv:2308.10427  [pdf, other]

    cs.LG cs.CR cs.DC

    Federated Learning Robust to Byzantine Attacks: Achieving Zero Optimality Gap

    Authors: Shiyuan Zuo, Rongfei Fan, Han Hu, Ning Zhang, Shimin Gong

    Abstract: In this paper, we propose a robust aggregation method for federated learning (FL) that can effectively tackle malicious Byzantine attacks. At each user, model parameter is firstly updated by multiple steps, which is adjustable over iterations, and then pushed to the aggregation center directly. This decreases the number of interactions between the aggregation center and users, allows each user to…

    Submitted 20 August, 2023; originally announced August 2023.

  28. arXiv:2308.09082  [pdf, other]

    cs.LG

    Over-the-Air Computation Aided Federated Learning with the Aggregation of Normalized Gradient

    Authors: Rongfei Fan, Xuming An, Shiyuan Zuo, Han Hu

    Abstract: Over-the-air computation is a communication-efficient solution for federated learning (FL). In such a system, iterative procedure is performed: Local gradient of private loss function is updated, amplified and then transmitted by every mobile device; the server receives the aggregated gradient all-at-once, generates and then broadcasts updated model parameters to every mobile device. In terms of a…

    Submitted 2 September, 2023; v1 submitted 17 August, 2023; originally announced August 2023.

  29. arXiv:2308.09072  [pdf, other]

    cs.LG

    Joint Power Control and Data Size Selection for Over-the-Air Computation Aided Federated Learning

    Authors: Xuming An, Rongfei Fan, Shiyuan Zuo, Han Hu, Hai Jiang, Ning Zhang

    Abstract: Federated learning (FL) has emerged as an appealing machine learning approach to deal with massive raw data generated at multiple mobile devices, which needs to aggregate the training model parameter of every mobile device at one base station (BS) iteratively. For parameter aggregating in FL, over-the-air computation is a spectrum-efficient solution, which allows all mobile devices to transmit t…

    Submitted 17 August, 2023; originally announced August 2023.

  30. arXiv:2308.04931  [pdf, other]

    astro-ph.IM astro-ph.CO

    A simulation of calibration and map-making errors of the Tianlai cylinder pathfinder array

    Authors: Kaifeng Yu, Fengquan Wu, Shifan Zuo, Jixia Li, Shijie Sun, Yougang Wang, Xuelei Chen

    Abstract: The Tianlai cylinder array is a pathfinder for developing and testing 21cm intensity mapping techniques. In this paper, we use numerical simulation to assess how its measurement is affected by thermal noise and the errors in calibration and map-making process, and the error in the sky map reconstructed from a drift scan survey. Here we consider only the single frequency, unpolarized case. The beam…

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: 25 pages, 18 figures, RAA accepted

    Journal ref: Research in Astronomy and Astrophysics, 23, 105008 (2023)

  31. arXiv:2307.13903  [pdf, ps, other]

    cs.LG stat.ML

    Corruption-Robust Lipschitz Contextual Search

    Authors: Shiliang Zuo

    Abstract: I study the problem of learning a Lipschitz function with corrupted binary signals. The learner tries to learn a $L$-Lipschitz function $f: [0,1]^d \rightarrow [0, L]$ that the adversary chooses. There is a total of $T$ rounds. In each round $t$, the adversary selects a context vector $x_t$ in the input space, and the learner makes a guess to the true function value $f(x_t)$ and receives a binary…

    Submitted 1 February, 2024; v1 submitted 25 July, 2023; originally announced July 2023.

    Comments: Accepted at ALT 2024

  32. arXiv:2307.09530  [pdf, other]

    astro-ph.IM astro-ph.CO

    3D ScatterNet: Inference from 21 cm Light-cones

    Authors: Xiaosheng Zhao, Shifan Zuo, Yi Mao

    Abstract: The Square Kilometre Array (SKA) will have the sensitivity to take the 3D light-cones of the 21 cm signal from the epoch of reionization. This signal, however, is highly non-Gaussian and cannot be fully interpreted by the traditional statistic using power spectrum. In this work, we introduce the 3D ScatterNet that combines the normalizing flows with solid harmonic wavelet scattering transform, a…

    Submitted 13 July, 2023; originally announced July 2023.

    Comments: 9 pages, 4 figures, 2 tables. Accepted to ICML 2023 Machine Learning for Astrophysics workshop. Comments and suggestions are welcome

  33. arXiv:2306.17413  [pdf, other]

    cs.IR

    DeepTagger: Knowledge Enhanced Named Entity Recognition for Web-Based Ads Queries

    Authors: Simiao Zuo, Pengfei Tang, Xinyu Hu, Qiang Lou, Jian Jiao, Denis Charles

    Abstract: Named entity recognition (NER) is a crucial task for online advertisement. State-of-the-art solutions leverage pre-trained language models for this task. However, three major challenges remain unresolved: web queries differ from natural language, on which pre-trained models are trained; web queries are short and lack contextual information; and labeled data for NER is scarce. We propose DeepTagger…

    Submitted 30 June, 2023; originally announced June 2023.

  34. arXiv:2306.09217  [pdf, other]

    astro-ph.IM eess.IV

    Map Reconstruction of radio observations with Conditional Invertible Neural Networks

    Authors: Haolin Zhang, Shifan Zuo, Le Zhang

    Abstract: In radio astronomy, the challenge of reconstructing a sky map from time ordered data (TOD) is known as an inverse problem. Standard map-making techniques and gridding algorithms are commonly employed to address this problem, each offering its own benefits such as producing minimum-variance maps. However, these approaches also carry limitations such as computational inefficiency and numerical insta…

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: Accepted for publication in Research in Astronomy and Astrophysics (RAA); 20 pages, 10 figures

  35. arXiv:2306.06554  [pdf, other]

    cs.GT

    Bayesian Calibrated Click-Through Auction

    Authors: Junjie Chen, Minming Li, Haifeng Xu, Song Zuo

    Abstract: We study information design in click-through auctions, in which the bidders/advertisers bid for winning an opportunity to show their ads but only pay for realized clicks. The payment may or may not happen, and its probability is called the click-through rate (CTR). This auction format is widely used in the industry of online advertising. Bidders have private values, whereas the seller has private…

    Submitted 20 April, 2024; v1 submitted 10 June, 2023; originally announced June 2023.

    Comments: add more explanations, details and discussions, use a new template

  36. arXiv:2306.05285  [pdf, other]

    eess.SP cs.LG

    Unsupervised Statistical Feature-Guided Diffusion Model for Sensor-based Human Activity Recognition

    Authors: Si Zuo, Vitor Fortes Rey, Sungho Suh, Stephan Sigg, Paul Lukowicz

    Abstract: Human activity recognition (HAR) from on-body sensors is a core functionality in many AI applications: from personal health, through sports and wellness to Industry 4.0. A key problem holding up progress in wearable sensor-based HAR, compared to other ML areas, such as computer vision, is the unavailability of diverse and labeled training data. Particularly, while there are innumerable annotated i…

    Submitted 19 May, 2024; v1 submitted 30 May, 2023; originally announced June 2023.

  37. arXiv:2306.03109  [pdf, other]

    q-bio.QM cs.LG physics.chem-ph

    Machine Learning Force Fields with Data Cost Aware Training

    Authors: Alexander Bukharin, Tianyi Liu, Shengjie Wang, Simiao Zuo, Weihao Gao, Wen Yan, Tuo Zhao

    Abstract: Machine learning force fields (MLFF) have been proposed to accelerate molecular dynamics (MD) simulation, which finds widespread applications in chemistry and biomedical research. Even for the most data-efficient MLFFs, reaching chemical accuracy can require hundreds of frames of force and energy labels generated by expensive quantum mechanical algorithms, which may scale as $O(n^3)$ to $O(n^7)$,…

    Submitted 5 June, 2023; originally announced June 2023.

  38. arXiv:2305.06405  [pdf, other]

    astro-ph.CO astro-ph.GA

    FAST drift scan survey for HI intensity mapping: I. preliminary data analysis

    Authors: Yichao Li, Yougang Wang, Furen Deng, Wenxiu Yang, Wenkai Hu, Diyang Liu, Xinyang Zhao, Shifan Zuo, Shuanghao Shu, Jixia Li, Peter Timbie, Reza Ansari, Olivier Perdereau, Albert Stebbins, Laura Wolz, Fengquan Wu, Xin Zhang, Xuelei Chen

    Abstract: This work presents the initial results of the drift-scan observation for the neutral hydrogen (HI) intensity mapping survey with the Five-hundred-meter Aperture Spherical radio Telescope (FAST). The data analyzed in this work were collected in night observations from 2019 through 2021. The primary findings are based on 28 hours of drift-scan observation carried out over seven nights in 2021, which…

    Submitted 10 May, 2023; originally announced May 2023.

    Comments: 26 pages, 26 figures, and 4 tables

  39. arXiv:2304.13108  [pdf, other]

    astro-ph.IM

    Detecting HI Galaxies with Deep Neural Networks in the Presence of Radio Frequency Interference

    Authors: Ruxi Liang, Furen Deng, Zepei Yang, Chunming Li, Feiyu Zhao, Botao Yang, Shuanghao Shu, Wenxiu Yang, Shifan Zuo, Yichao Li, Yougang Wang, Xuelei Chen

    Abstract: In neutral hydrogen (HI) galaxy survey, a significant challenge is to identify and extract the HI galaxy signal from observational data contaminated by radio frequency interference (RFI). For a drift-scan survey, or more generally a survey of a spatially continuous region, in the time-ordered spectral data, the HI galaxies and RFI all appear as regions which extend an area in the time-frequency wa…

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: 17 pages, 9 figures, 1 tables. Accepted for publication in RAA

  40. arXiv:2303.07943  [pdf, other]

    astro-ph.IM astro-ph.CO astro-ph.GA

    SKA Science Data Challenge 2: analysis and results

    Authors: P. Hartley, A. Bonaldi, R. Braun, J. N. H. S. Aditya, S. Aicardi, L. Alegre, A. Chakraborty, X. Chen, S. Choudhuri, A. O. Clarke, J. Coles, J. S. Collinson, D. Cornu, L. Darriba, M. Delli Veneri, J. Forbrich, B. Fraga, A. Galan, J. Garrido, F. Gubanov, H. Håkansson, M. J. Hardcastle, C. Heneka, D. Herranz, K. M. Hess, et al. (83 additional authors not shown)

    Abstract: The Square Kilometre Array Observatory (SKAO) will explore the radio sky to new depths in order to conduct transformational science. SKAO data products made available to astronomers will be correspondingly large and complex, requiring the application of advanced analysis techniques to extract key science findings. To this end, SKAO is conducting a series of Science Data Challenges, each designed t…

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: Under review by MNRAS; 28 pages, 16 figures

  41. Nonextensive effects on QCD chiral phase diagram and baryon-number fluctuations within Polyakov-Nambu-Jona-Lasinio model

    Authors: Ya-Peng Zhao, Chao-Yong Wang, Shu-Yu Zuo, Cheng-Ming Li

    Abstract: In this paper, a version of the Polyakov-Nambu-Jona-Lasinio (PNJL) model based on nonextensive statistical mechanics is presented. This new statistics summarizes all possible factors that violate the assumptions of the Boltzmann-Gibbs (BG) statistics to a dimensionless nonextensivity parameter $q$, and when $q$ tends to 1, it returns to the BG case. Within the nonextensive PNJL model, we found tha…

    Submitted 23 February, 2023; originally announced February 2023.

  42. arXiv:2302.00377  [pdf, ps, other]

    cs.GT

    Autobidding Auctions in the Presence of User Costs

    Authors: Yuan Deng, Jieming Mao, Vahab Mirrokni, Hanrui Zhang, Song Zuo

    Abstract: We study autobidding ad auctions with user costs, where each bidder is value-maximizing subject to a return-over-investment (ROI) constraint, and the seller aims to maximize the social welfare taking into consideration the user's cost of viewing an ad. We show that in the worst case, the approximation ratio of social welfare by running the vanilla VCG auctions with user costs could be as bad as 0. To…

    Submitted 1 February, 2023; originally announced February 2023.

  43. Resilient Containment Control of Heterogeneous Multi-Agent Systems Against Unbounded Sensor and Actuator Attacks

    Authors: Shan Zuo, Yi Zhang, Yichao Wang

    Abstract: Accurate local state measurement is important to ensure the reliable operation of distributed multi-agent systems (MAS). Existing fault-tolerant control strategies generally assume the sensor faults to be bounded and uncorrelated. In this paper, we study the ramifications of allowing the sensor attack injections to be unbounded and correlated. These malicious sensor attacks may bypass the conventi…

    Submitted 18 January, 2023; originally announced January 2023.

  44. arXiv:2212.08136  [pdf, other]

    cs.CL cs.LG

    Efficient Long Sequence Modeling via State Space Augmented Transformer

    Authors: Simiao Zuo, Xiaodong Liu, Jian Jiao, Denis Charles, Eren Manavoglu, Tuo Zhao, Jianfeng Gao

    Abstract: Transformer models have achieved superior performance in various natural language processing tasks. However, the quadratic computational cost of the attention mechanism limits its practicality for long sequences. There are existing attention variants that improve the computational efficiency, but they have limited ability to effectively compute global information. In parallel to Transformer models…

    Submitted 15 December, 2022; originally announced December 2022.

  45. arXiv:2212.05577  [pdf, other]

    stat.ME

    Mediation analysis with the mediator and outcome missing not at random

    Authors: Shuozhi Zuo, Debashis Ghosh, Peng Ding, Fan Yang

    Abstract: Mediation analysis is widely used for investigating direct and indirect causal pathways through which an effect arises. However, many mediation analysis studies are challenged by missingness in the mediator and outcome. In general, when the mediator and outcome are missing not at random, the direct and indirect effects are not identifiable without further assumptions. In this work, we study the id…

    Submitted 22 September, 2023; v1 submitted 11 December, 2022; originally announced December 2022.

  46. arXiv:2210.01351  [pdf, other]

    cs.CL cs.AI cs.LG

    Less is More: Task-aware Layer-wise Distillation for Language Model Compression

    Authors: Chen Liang, Simiao Zuo, Qingru Zhang, Pengcheng He, Weizhu Chen, Tuo Zhao

    Abstract: Layer-wise distillation is a powerful tool to compress large models (i.e., teacher models) into small ones (i.e., student models). The student distills knowledge from the teacher by mimicking the hidden representations of the teacher at every intermediate layer. However, layer-wise distillation is difficult. Since the student has a smaller model capacity than the teacher, it is often under-fitted.…

    Submitted 5 June, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: Proceedings of ICML 2023

  47. arXiv:2209.07584  [pdf, other]

    cs.IR cs.LG

    Context-Aware Query Rewriting for Improving Users' Search Experience on E-commerce Websites

    Authors: Simiao Zuo, Qingyu Yin, Haoming Jiang, Shaohui Xi, Bing Yin, Chao Zhang, Tuo Zhao

    Abstract: E-commerce queries are often short and ambiguous. Consequently, query understanding often uses query rewriting to disambiguate user-input queries. While using e-commerce search tools, users tend to enter multiple searches, which we call context, before purchasing. These history searches contain contextual insights about users' true shopping intents. Therefore, modeling such contextual information…

    Submitted 24 September, 2022; v1 submitted 15 September, 2022; originally announced September 2022.

  48. arXiv:2209.07499  [pdf, other]

    cs.LG

    DiP-GNN: Discriminative Pre-Training of Graph Neural Networks

    Authors: Simiao Zuo, Haoming Jiang, Qingyu Yin, Xianfeng Tang, Bing Yin, Tuo Zhao

    Abstract: Graph neural network (GNN) pre-training methods have been proposed to enhance the power of GNNs. Specifically, a GNN is first pre-trained on a large-scale unlabeled graph and then fine-tuned on a separate small labeled graph for downstream applications, such as node classification. One popular pre-training method is to mask out a proportion of the edges, and a GNN is trained to recover them. Howev…

    Submitted 15 September, 2022; originally announced September 2022.

  49. arXiv:2209.07303  [pdf, other]

    cs.LG cs.CR stat.ML

    Differentially Private Estimation of Hawkes Process

    Authors: Simiao Zuo, Tianyi Liu, Tuo Zhao, Hongyuan Zha

    Abstract: Point process models are of great importance in real world applications. In certain critical applications, estimation of point process models involves large amounts of sensitive personal data from users. Privacy concerns naturally arise which have not been addressed in the existing literature. To bridge this glaring gap, we propose the first general differentially private estimation procedure for…

    Submitted 15 September, 2022; originally announced September 2022.

  50. arXiv:2208.14675  [pdf, other]

    astro-ph.CO astro-ph.IM

    A Semi-blind PCA-based Foreground Subtraction Method for 21 cm Intensity Mapping

    Authors: Shifan Zuo, Xuelei Chen, Yi Mao

    Abstract: The Principal Component Analysis (PCA) method and the Singular Value Decomposition (SVD) method are widely used for foreground subtraction in 21 cm intensity mapping experiments. We show their equivalence, and point out that the condition for completely clean separation of foregrounds and cosmic 21 cm signal using the PCA/SVD is unrealistic. We propose a PCA-based foreground subtraction method, du…

    Submitted 1 February, 2023; v1 submitted 31 August, 2022; originally announced August 2022.

    Comments: Comments welcome

    Journal ref: ApJ, 2023, vol. 945, id. 38