Skip to main content

Showing 1–50 of 353 results for author: Ni, Y

  1. arXiv:2407.09457  [pdf, other

    astro-ph.SR astro-ph.GA physics.space-ph

    How coronal mass ejections are influenced by the morphology and toroidal flux of their source magnetic flux ropes?

    Authors: J. H. Guo, L. Linan, S. Poedts, Y. Guo, B. Schmieder, A. Lani, Y. W. Ni, M. Brchnelova, B. Perri, T. Baratashvili, S. T. Li, P. F. Chen

    Abstract: Coronal mass ejections (CMEs) stand as intense eruptions of magnetized plasma from the Sun, playing a pivotal role in driving significant changes of the heliospheric environment. Deducing the properties of CMEs from their progenitors in solar source regions is crucial for space weather forecasting. Deducing the properties of CMEs from their progenitors in solar source regions is crucial for space… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 11 pages, 10 figrues, accepted for publication by A&A

  2. arXiv:2407.06584  [pdf, other

    cs.RO

    HiLMa-Res: A General Hierarchical Framework via Residual RL for Combining Quadrupedal Locomotion and Manipulation

    Authors: Xiaoyu Huang, Qiayuan Liao, Yiming Ni, Zhongyu Li, Laura Smith, Sergey Levine, Xue Bin Peng, Koushil Sreenath

    Abstract: This work presents HiLMa-Res, a hierarchical framework leveraging reinforcement learning to tackle manipulation tasks while performing continuous locomotion using quadrupedal robots. Unlike most previous efforts that focus on solving a specific task, HiLMa-Res is designed to be general for various loco-manipulation tasks that require quadrupedal robots to maintain sustained mobility. The novel des… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: IROS 2024

  3. arXiv:2406.15252  [pdf, other

    cs.CV cs.AI

    VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation

    Authors: Xuan He, Dongfu Jiang, Ge Zhang, Max Ku, Achint Soni, Sherman Siu, Haonan Chen, Abhranil Chandra, Ziyan Jiang, Aaran Arulraj, Kai Wang, Quy Duc Do, Yuansheng Ni, Bohan Lyu, Yaswanth Narsupalli, Rongqi Fan, Zhiheng Lyu, Yuchen Lin, Wenhu Chen

    Abstract: The recent years have witnessed great advances in video generation. However, the development of automatic video metrics is lagging significantly behind. None of the existing metric is able to provide reliable scores over generated videos. The main barrier is the lack of large-scale human-annotated dataset. In this paper, we release VideoFeedback, the first large-scale dataset containing human-prov… ▽ More

    Submitted 24 June, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

  4. arXiv:2406.11168  [pdf, other

    math.OC cs.LG

    Two-Timescale Optimization Framework for Decentralized Linear-Quadratic Optimal Control

    Authors: Lechen Feng, Yuan-Hua Ni, Xuebo Zhang

    Abstract: This study investigates a decentralized linear-quadratic optimal control problem, and several approximate separable constrained optimization problems are formulated for the first time based on the selection of sparsity promoting functions. First, for the optimization problem with weighted $\ell_1$ sparsity promoting function, a two-timescale algorithm is adopted that is based on the BSUM (Block Su… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  5. arXiv:2406.10318  [pdf, other

    cs.CV cs.AI

    Creating a Lens of Chinese Culture: A Multimodal Dataset for Chinese Pun Rebus Art Understanding

    Authors: Tuo Zhang, Tiantian Feng, Yibin Ni, Mengqin Cao, Ruying Liu, Katharine Butler, Yanjun Weng, Mi Zhang, Shrikanth S. Narayanan, Salman Avestimehr

    Abstract: Large vision-language models (VLMs) have demonstrated remarkable abilities in understanding everyday content. However, their performance in the domain of art, particularly culturally rich art forms, remains less explored. As a pearl of human wisdom and creativity, art encapsulates complex cultural narratives and symbolism. In this paper, we offer the Pun Rebus Art Dataset, a multimodal dataset for… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  6. arXiv:2406.05862  [pdf, other

    cs.CL cs.AI cs.CV

    II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models

    Authors: Ziqiang Liu, Feiteng Fang, Xi Feng, Xinrun Du, Chenhao Zhang, Zekun Wang, Yuelin Bai, Qixuan Zhao, Liyang Fan, Chengguang Gan, Hongquan Lin, Jiaming Li, Yuansheng Ni, Haihong Wu, Yaswanth Narsupalli, Zhigang Zheng, Chengming Li, Xiping Hu, Ruifeng Xu, Xiaojun Chen, Min Yang, Jiaheng Liu, Ruibo Liu, Wenhao Huang, Ge Zhang , et al. (1 additional authors not shown)

    Abstract: The rapid advancements in the development of multimodal large language models (MLLMs) have consistently led to new breakthroughs on various benchmarks. In response, numerous challenging and comprehensive benchmarks have been proposed to more accurately assess the capabilities of MLLMs. However, there is a dearth of exploration of the higher-order perceptual capabilities of MLLMs. To fill this gap,… ▽ More

    Submitted 11 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

    Comments: 100 pages, 82 figures, add citations

  7. arXiv:2406.04485  [pdf, other

    cs.AI cs.CV

    GenAI Arena: An Open Evaluation Platform for Generative Models

    Authors: Dongfu Jiang, Max Ku, Tianle Li, Yuansheng Ni, Shizhuo Sun, Rongqi Fan, Wenhu Chen

    Abstract: Generative AI has made remarkable strides to revolutionize fields such as image and video generation. These advancements are driven by innovative algorithms, architecture, and data. However, the rapid proliferation of generative models has highlighted a critical gap: the absence of trustworthy evaluation metrics. Current automatic assessments such as FID, CLIP, FVD, etc often fail to capture the n… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 9 pages,7 figures

  8. arXiv:2406.02803  [pdf, other

    cs.DC

    DRust: Language-Guided Distributed Shared Memory with Fine Granularity, Full Transparency, and Ultra Efficiency

    Authors: Haoran Ma, Yifan Qiao, Shi Liu, Shan Yu, Yuanjiang Ni, Qingda Lu, Jiesheng Wu, Yiying Zhang, Miryung Kim, Harry Xu

    Abstract: Despite being a powerful concept, distributed shared memory (DSM) has not been made practical due to the extensive synchronization needed between servers to implement memory coherence. This paper shows a practical DSM implementation based on the insight that the ownership model embedded in programming languages such as Rust automatically constrains the order of read and write, providing opportunit… ▽ More

    Submitted 27 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  9. arXiv:2406.02664  [pdf, other

    astro-ph.GA astro-ph.CO astro-ph.HE

    Discrepancies Between JWST Observations and Simulations of Quenched Massive Galaxies at $z > 3$: A Comparative Study With IllustrisTNG and ASTRID

    Authors: Emma Jane Weller, Fabio Pacucci, Yueying Ni, Lars Hernquist, Minjung Park

    Abstract: Recent JWST observations have uncovered an unexpectedly large population of massive quiescent galaxies at $z>3$. Using the cosmological simulations IllustrisTNG and ASTRID, we identify analogous galaxies and investigate their abundance, formation, quenching mechanisms, and post-quenching evolution for stellar masses $9.5 < \log_{10}{(M_\star/{\rm M}_\odot)} < 12$. We apply three different quenchin… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Submitted to The Astrophysical Journal. 13 pages, 12 figures

  10. arXiv:2406.01574  [pdf, other

    cs.CL

    MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark

    Authors: Yubo Wang, Xueguang Ma, Ge Zhang, Yuansheng Ni, Abhranil Chandra, Shiguang Guo, Weiming Ren, Aaran Arulraj, Xuan He, Ziyan Jiang, Tianle Li, Max Ku, Kai Wang, Alex Zhuang, Rongqi Fan, Xiang Yue, Wenhu Chen

    Abstract: In the age of large-scale language models, benchmarks like the Massive Multitask Language Understanding (MMLU) have been pivotal in pushing the boundaries of what AI can achieve in language comprehension and reasoning across diverse domains. However, as models continue to improve, their performance on these benchmarks has begun to plateau, making it increasingly difficult to discern differences in… ▽ More

    Submitted 23 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  11. arXiv:2406.01255  [pdf, other

    cs.LG cs.AI

    On the Nonlinearity of Layer Normalization

    Authors: Yunhao Ni, Yuxin Guo, Junlong Jia, Lei Huang

    Abstract: Layer normalization (LN) is a ubiquitous technique in deep learning but our theoretical understanding to it remains elusive. This paper investigates a new theoretical direction for LN, regarding to its nonlinearity and representation capacity. We investigate the representation capacity of a network with layerwise composition of linear and LN transformations, referred to as LN-Net. We theoretically… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 42 pages, accepted to ICML 2024

  12. arXiv:2405.18203  [pdf, other

    cs.CL

    IAPT: Instruction-Aware Prompt Tuning for Large Language Models

    Authors: Wei Zhu, Aaron Xuxiang Tian, Congrui Yin, Yuan Ni, Xiaoling Wang, Guotong Xie

    Abstract: Soft prompt tuning is a widely studied parameter-efficient fine-tuning method. However, it has a clear drawback: many soft tokens must be inserted into the input sequences to guarantee downstream performance. As a result, soft prompt tuning is less considered than Low-rank adaptation (LoRA) in the large language modeling (LLM) era. In this work, we propose a novel prompt tuning method, Instruction… ▽ More

    Submitted 7 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted by ACL-2024

  13. arXiv:2405.14051  [pdf, ps, other

    cs.LG math.ST

    A Concentration Inequality for Maximum Mean Discrepancy (MMD)-based Statistics and Its Application in Generative Models

    Authors: Yijin Ni, Xiaoming Huo

    Abstract: Maximum Mean Discrepancy (MMD) is a probability metric that has found numerous applications in machine learning. In this work, we focus on its application in generative models, including the minimum MMD estimator, Generative Moment Matching Network (GMMN), and Generative Adversarial Network (GAN). In these cases, MMD is part of an objective function in a minimization or min-max optimization proble… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  14. arXiv:2405.10343  [pdf, other

    q-bio.BM cs.AI cs.LG

    UniCorn: A Unified Contrastive Learning Approach for Multi-view Molecular Representation Learning

    Authors: Shikun Feng, Yuyan Ni, Minghao Li, Yanwen Huang, Zhi-Ming Ma, Wei-Ying Ma, Yanyan Lan

    Abstract: Recently, a noticeable trend has emerged in developing pre-trained foundation models in the domains of CV and NLP. However, for molecular pre-training, there lacks a universal model capable of effectively applying to various categories of molecular tasks, since existing prevalent pre-training methods exhibit effectiveness for specific types of downstream tasks. Furthermore, the lack of profound un… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

  15. arXiv:2405.07542  [pdf, other

    cs.CL

    EMS-SD: Efficient Multi-sample Speculative Decoding for Accelerating Large Language Models

    Authors: Yunsheng Ni, Chuanjian Liu, Yehui Tang, Kai Han, Yunhe Wang

    Abstract: Speculative decoding emerges as a pivotal technique for enhancing the inference speed of Large Language Models (LLMs). Despite recent research aiming to improve prediction efficiency, multi-sample speculative decoding has been overlooked due to varying numbers of accepted tokens within a batch in the verification phase. Vanilla method adds padding tokens in order to ensure that the number of new t… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  16. arXiv:2405.02940  [pdf

    cond-mat.mes-hall

    Spherulite-enhanced Macroscopic Polarization in Molecular Ferroelectric Films from Vacuum Deposition

    Authors: Bibek Tiwari, Yuanyuan Ni, Jackson Savage, Ellen Daugherty, Bharat Giri, Xin Li, Xiaoshan Xu

    Abstract: Proton-transfer type molecular ferroelectrics hold great application potential due to their large spontaneous polarizations, high Curie temperatures, and small switching fields. However, it is puzzling that preparation of quasi-2D films with macroscopic ferroelectric behaviors has only been reported in few molecular ferroelectrics. To resolve this puzzle, we studied the effect of microstructures o… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

  17. arXiv:2404.18911  [pdf, other

    cs.CL cs.LG

    Kangaroo: Lossless Self-Speculative Decoding via Double Early Exiting

    Authors: Fangcheng Liu, Yehui Tang, Zhenhua Liu, Yunsheng Ni, Kai Han, Yunhe Wang

    Abstract: Speculative decoding has demonstrated its effectiveness in accelerating the inference of large language models while maintaining a consistent sampling distribution. However, the conventional approach of training a separate draft model to achieve a satisfactory token acceptance rate can be costly. Drawing inspiration from early exiting, we propose a novel self-speculative decoding framework \emph{K… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  18. arXiv:2404.05148  [pdf, other

    stat.ME stat.ML

    Generalized Criterion for Identifiability of Additive Noise Models Using Majorization

    Authors: Aramayis Dallakyan, Yang Ni

    Abstract: The discovery of causal relationships from observational data is very challenging. Many recent approaches rely on complexity or uncertainty concepts to impose constraints on probability distributions, aiming to identify specific classes of directed acyclic graph (DAG) models. In this paper, we introduce a novel identifiability criterion for DAGs that places constraints on the conditional variances… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  19. arXiv:2404.04949  [pdf, other

    cs.CL cs.CE

    SilverSight: A Multi-Task Chinese Financial Large Language Model Based on Adaptive Semantic Space Learning

    Authors: Yuhang Zhou, Zeping Li, Siyu Tian, Yuchen Ni, Sen Liu, Guangnan Ye, Hongfeng Chai

    Abstract: Large language models (LLMs) are increasingly being applied across various specialized fields, leveraging their extensive knowledge to empower a multitude of scenarios within these domains. However, each field encompasses a variety of specific tasks that require learning, and the diverse, heterogeneous data across these domains can lead to conflicts during model task transfer. In response to this… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: 17 pages, 17 figures

  20. arXiv:2404.00521  [pdf, other

    cs.LG cs.CV

    CHAIN: Enhancing Generalization in Data-Efficient GANs via lipsCHitz continuity constrAIned Normalization

    Authors: Yao Ni, Piotr Koniusz

    Abstract: Generative Adversarial Networks (GANs) significantly advanced image generation but their performance heavily depends on abundant training data. In scenarios with limited data, GANs often struggle with discriminator overfitting and unstable training. Batch Normalization (BN), despite being known for enhancing generalization and training stability, has rarely been used in the discriminator of Data-E… ▽ More

    Submitted 1 June, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

    Comments: Accepted by CVPR 2024. 26 pages. Improve Lemma 3.1 - Prop. 3.1 logic flow. Code: https://github.com/MaxwellYaoNi/CHAIN

  21. arXiv:2403.17372  [pdf, other

    cs.IR

    An Empirical Study of Training ID-Agnostic Multi-modal Sequential Recommenders

    Authors: Youhua Li, Hanwen Du, Yongxin Ni, Yuanqi He, Junchen Fu, Xiangyan Liu, Qi Guo

    Abstract: Sequential Recommendation (SR) aims to predict future user-item interactions based on historical interactions. While many SR approaches concentrate on user IDs and item IDs, the human perception of the world through multi-modal signals, like text and images, has inspired researchers to delve into constructing SR from multi-modal information without using IDs. However, the complexity of multi-modal… ▽ More

    Submitted 30 March, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: An Empirical Study of Training ID-Agnostic Multi-modal Sequential Recommenders

  22. arXiv:2403.16797  [pdf, other

    eess.SY

    Privacy Preservation by Intermittent Transmission in Cooperative LQG Control Systems

    Authors: Wenhao Lin, Yuqing Ni, Wen Yang, Chao Yang

    Abstract: In this paper, we study a cooperative linear quadratic Gaussian (LQG) control system with a single user and a server. In this system, the user runs a process and employs the server to meet the needs of computation. However, the user regards its state trajectories as privacy. Therefore, we propose a privacy scheme, in which the user sends data to the server intermittently. By this scheme, the serve… ▽ More

    Submitted 28 March, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

  23. arXiv:2403.15983  [pdf, other

    stat.ME

    Bayesian segmented Gaussian copula factor model for single-cell sequencing data

    Authors: Junsouk Choi, Hee Cheol Chung, Irina Gaynanova, Yang Ni

    Abstract: Single-cell sequencing technologies have significantly advanced molecular and cellular biology, offering unprecedented insights into cellular heterogeneity by allowing for the measurement of gene expression at an individual cell level. However, the analysis of such data is challenged by the prevalence of low counts due to dropout events and the skewed nature of the data distribution, which convent… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

  24. arXiv:2403.14027  [pdf, other

    cs.CV

    EcoSense: Energy-Efficient Intelligent Sensing for In-Shore Ship Detection through Edge-Cloud Collaboration

    Authors: Wenjun Huang, Hanning Chen, Yang Ni, Arghavan Rezvani, Sanggeon Yun, Sungheon Jeon, Eric Pedley, Mohsen Imani

    Abstract: Detecting marine objects inshore presents challenges owing to algorithmic intricacies and complexities in system deployment. We propose a difficulty-aware edge-cloud collaborative sensing system that splits the task into object localization and fine-grained classification. Objects are classified either at the edge or within the cloud, based on their estimated difficulty. The framework comprises a… ▽ More

    Submitted 26 March, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  25. arXiv:2403.12987  [pdf, other

    q-bio.BM cs.LG

    Rethinking Specificity in SBDD: Leveraging Delta Score and Energy-Guided Diffusion

    Authors: Bowen Gao, Minsi Ren, Yuyan Ni, Yanwen Huang, Bo Qiang, Zhi-Ming Ma, Wei-Ying Ma, Yanyan Lan

    Abstract: In the field of Structure-based Drug Design (SBDD), deep learning-based generative models have achieved outstanding performance in terms of docking score. However, further study shows that the existing molecular generative methods and docking scores both have lacked consideration in terms of specificity, which means that generated molecules bind to almost every protein pocket with high affinity. T… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  26. arXiv:2403.10648  [pdf, other

    astro-ph.CO astro-ph.GA

    Debiasing with Diffusion: Probabilistic reconstruction of Dark Matter fields from galaxies with CAMELS

    Authors: Victoria Ono, Core Francisco Park, Nayantara Mudur, Yueying Ni, Carolina Cuesta-Lazaro, Francisco Villaescusa-Navarro

    Abstract: Galaxies are biased tracers of the underlying cosmic web, which is dominated by dark matter components that cannot be directly observed. Galaxy formation simulations can be used to study the relationship between dark matter density fields and galaxy distributions. However, this relationship can be sensitive to assumptions in cosmology and astrophysical processes embedded in the galaxy formation mo… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  27. arXiv:2403.08108  [pdf, other

    cs.CV

    TaskCLIP: Extend Large Vision-Language Model for Task Oriented Object Detection

    Authors: Hanning Chen, Wenjun Huang, Yang Ni, Sanggeon Yun, Fei Wen, Hugo Latapie, Mohsen Imani

    Abstract: Task-oriented object detection aims to find objects suitable for accomplishing specific tasks. As a challenging task, it requires simultaneous visual data processing and reasoning under ambiguous semantics. Recent solutions are mainly all-in-one models. However, the object detection backbones are pre-trained without text supervision. Thus, to incorporate task requirements, their intricate models u… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  28. arXiv:2403.05763  [pdf, other

    cs.AR cs.AI cs.LG

    HDReason: Algorithm-Hardware Codesign for Hyperdimensional Knowledge Graph Reasoning

    Authors: Hanning Chen, Yang Ni, Ali Zakeri, Zhuowen Zou, Sanggeon Yun, Fei Wen, Behnam Khaleghi, Narayan Srinivasa, Hugo Latapie, Mohsen Imani

    Abstract: In recent times, a plethora of hardware accelerators have been put forth for graph learning applications such as vertex classification and graph classification. However, previous works have paid little attention to Knowledge Graph Completion (KGC), a task that is well-known for its significantly higher algorithm complexity. The state-of-the-art KGC solutions based on graph convolution neural netwo… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  29. arXiv:2403.03944  [pdf, other

    stat.AP

    MR.RGM: An R Package for Fitting Bayesian Multivariate Bidirectional Mendelian Randomization Networks

    Authors: Bitan Sarkar, Yang Ni

    Abstract: Motivation: Mendelian randomization (MR) infers causal relationships between exposures and outcomes using genetic variants as instrumental variables. Typically, MR considers only a pair of exposure and outcome at a time, limiting its capability of capturing the entire causal network. We overcome this limitation by developing 'MR.RGM' (Mendelian randomization via reciprocal graphical model), a fast… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  30. arXiv:2402.13779  [pdf, other

    cs.LG cs.AI q-bio.BM

    Contextual Molecule Representation Learning from Chemical Reaction Knowledge

    Authors: Han Tang, Shikun Feng, Bicheng Lin, Yuyan Ni, JIngjing Liu, Wei-Ying Ma, Yanyan Lan

    Abstract: In recent years, self-supervised learning has emerged as a powerful tool to harness abundant unlabelled data for representation learning and has been broadly adopted in diverse areas. However, when applied to molecular representation learning (MRL), prevailing techniques such as masked sub-unit reconstruction often fall short, due to the high degree of freedom in the possible combinations of atoms… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: Preprint. Under Review

  31. arXiv:2402.12713  [pdf, ps, other

    cs.CL

    Are LLMs Rational Investors? A Study on Detecting and Reducing the Financial Bias in LLMs

    Authors: Yuhang Zhou, Yuchen Ni, Yunhui Gan, Zhangyue Yin, Xiang Liu, Jian Zhang, Sen Liu, Xipeng Qiu, Guangnan Ye, Hongfeng Chai

    Abstract: Large Language Models (LLMs) are increasingly adopted in financial analysis for interpreting complex market data and trends. However, their use is challenged by intrinsic biases (e.g., risk-preference bias) and a superficial understanding of market intricacies, necessitating a thorough assessment of their financial insight. To address these issues, we introduce Financial Bias Indicators (FBI), a f… ▽ More

    Submitted 1 July, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

  32. arXiv:2402.11223  [pdf, other

    cs.LG

    HEAL: Brain-inspired Hyperdimensional Efficient Active Learning

    Authors: Yang Ni, Zhuowen Zou, Wenjun Huang, Hanning Chen, William Youngwoo Chung, Samuel Cho, Ranganath Krishnan, Pietro Mercati, Mohsen Imani

    Abstract: Drawing inspiration from the outstanding learning capability of our human brains, Hyperdimensional Computing (HDC) emerges as a novel computing paradigm, and it leverages high-dimensional vector presentation and operations for brain-like lightweight Machine Learning (ML). Practical deployments of HDC have significantly enhanced the learning efficiency compared to current deep ML methods on a broad… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

  33. arXiv:2402.06079  [pdf, other

    q-bio.GN cs.AI cs.LG

    DiscDiff: Latent Diffusion Model for DNA Sequence Generation

    Authors: Zehui Li, Yuhao Ni, William A V Beardall, Guoxuan Xia, Akashaditya Das, Guy-Bart Stan, Yiren Zhao

    Abstract: This paper introduces a novel framework for DNA sequence generation, comprising two key components: DiscDiff, a Latent Diffusion Model (LDM) tailored for generating discrete DNA sequences, and Absorb-Escape, a post-training algorithm designed to refine these sequences. Absorb-Escape enhances the realism of the generated sequences by correcting `round errors' inherent in the conversion process betw… ▽ More

    Submitted 17 April, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: Different from the prior work "Latent Diffusion Model for DNA Sequence Generation" (arXiv:2310.06150), we updated the evaluation framework and compared the DiscDiff with other methods comprehensively. In addition, a post-training framework is proposed to increase the quality of generated sequences

  34. arXiv:2402.03584  [pdf, other

    astro-ph.SR astro-ph.HE

    Helium-deficient ER UMa-type dwarf nova below the period minimum with a hot secondary

    Authors: Youngdae Lee, Dae-Sik Moon, Sang Chul Kim, Hong Soo Park, Yuan Qi Ni

    Abstract: We present the discovery of a peculiar dwarf nova KSP-OT-201712a using high-cadence, multi-color observations made with the Korea Microlensing Telescope Network. KSP-OT-201712a exhibits a rare presence of outbursts during standstills as well as strong H$α$ emission for a dwarf nova below the period minimum with an orbital period of 58.75 $\pm$ 0.02 minutes. The outburst cycles are ~ 6.6 days withi… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 12 pages, 5 figures, accepted for publication in ApJ

  35. arXiv:2402.02791  [pdf, other

    cs.CL cs.AI cs.LG

    Rethinking Optimization and Architecture for Tiny Language Models

    Authors: Yehui Tang, Fangcheng Liu, Yunsheng Ni, Yuchuan Tian, Zheyuan Bai, Yi-Qi Hu, Sichao Liu, Shangling Jui, Kai Han, Yunhe Wang

    Abstract: The power of large language models (LLMs) has been demonstrated through numerous data and computing resources. However, the application of language models on mobile devices is facing huge challenge on the computation and memory costs, that is, tiny language models with high performance are urgently required. Limited by the highly complex training process, there are many details for optimizing lang… ▽ More

    Submitted 6 February, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  36. arXiv:2402.02043  [pdf, other

    cs.LG cs.AI cs.NI

    A Plug-in Tiny AI Module for Intelligent and Selective Sensor Data Transmission

    Authors: Wenjun Huang, Arghavan Rezvani, Hanning Chen, Yang Ni, Sanggeon Yun, Sungheon Jeong, Mohsen Imani

    Abstract: Applications in the Internet of Things (IoT) utilize machine learning to analyze sensor-generated data. However, a major challenge lies in the lack of targeted intelligence in current sensing systems, leading to vast data generation and increased computational and communication costs. To address this challenge, we propose a novel sensing module to equip sensing frameworks with intelligent data tra… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

    Comments: 14 pages, 6 figures

  37. arXiv:2402.00395  [pdf, other

    cs.AR eess.SP

    ONE-SA: Enabling Nonlinear Operations in Systolic Arrays for Efficient and Flexible Neural Network Inference

    Authors: Ruiqi Sun, Yinchen Ni, Xin He, Jie Zhao, An Zou

    Abstract: The computation and memory-intensive nature of DNNs limits their use in many mobile and embedded contexts. Application-specific integrated circuit (ASIC) hardware accelerators employ matrix multiplication units (such as the systolic arrays) and dedicated nonlinear function units to speed up DNN computations. A close examination of these ASIC accelerators reveals that the designs are often speciali… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: Accepted to DATE 2024

  38. arXiv:2401.16608  [pdf, other

    astro-ph.GA astro-ph.CO

    The evolution of galaxy morphology from redshift z=6 to 3: Mock JWST observations of galaxies in the ASTRID simulation

    Authors: Patrick LaChance, Rupert Croft, Yueying Ni, Nianyi Chen, Tiziana Di Matteo, Simeon Bird

    Abstract: We present mock JWST observations for more than 215,000 different galaxies from the Astrid simulation with $3 \leq z \leq 6$. The mock observations are made using the BPASS stellar SED model, and a simple dust model. They are then viewed through NIRCam filters, convolved with a PSF, have noise added, and are drizzled together to emulate the Cosmic Evolution Early Release Science (CEERS) survey. We… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 17 pages, 14 figures

  39. arXiv:2401.10400  [pdf, other

    math.OC cs.IT

    Auto-Calibration and Biconvex Compressive Sensing with Applications to Parallel MRI

    Authors: Yuan Ni, Thomas Strohmer

    Abstract: We study an auto-calibration problem in which a transform-sparse signal is compressive-sensed by multiple sensors in parallel with unknown sensing parameters. The problem has an important application in pMRI reconstruction, where explicit coil calibrations are often difficult and costly to achieve in practice, but nevertheless a fundamental requirement for high-precision reconstructions. Most auto… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: Keywords: Self-calibration, Compressive sensing, Convex optimization, Random matrices, Parallel MRI

  40. arXiv:2401.08914  [pdf, other

    astro-ph.GA

    Simulated host galaxy analogs of high-z quasars observed with JWST

    Authors: Sabrina Berger, Madeline A. Marshall, J. Stuart B. Wyithe, Tiziana di Matteo, Yueying Ni, Stephen M. Wilkins

    Abstract: The hosts of two low-luminosity high-z quasars, J2255+0251 and J2236+0032, were recently detected using JWST's NIRCam instrument. These represent the first high-z quasar host galaxy stellar detections and open a new window into studying high-z quasars. We examine the implications of the measured properties of J2255+0251 and J2236+0032 within the context of the hydrodynamic simulation BlueTides at… ▽ More

    Submitted 18 April, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted to MNRAS. 15 pages, 11 figures

  41. arXiv:2401.02034  [pdf, other

    cs.CL

    Text2MDT: Extracting Medical Decision Trees from Medical Texts

    Authors: Wei Zhu, Wenfeng Li, Xing Tian, Pengfei Wang, Xiaoling Wang, Jin Chen, Yuanbin Wu, Yuan Ni, Guotong Xie

    Abstract: Knowledge of the medical decision process, which can be modeled as medical decision trees (MDTs), is critical to build clinical decision support systems. However, the current MDT construction methods rely heavily on time-consuming and laborious manual annotation. In this work, we propose a novel task, Text2MDT, to explore the automatic extraction of MDTs from medical texts such as medical guidelin… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

  42. arXiv:2401.01286  [pdf, other

    cs.CL cs.AI cs.CV cs.HC cs.LG

    A Comprehensive Study of Knowledge Editing for Large Language Models

    Authors: Ningyu Zhang, Yunzhi Yao, Bozhong Tian, Peng Wang, Shumin Deng, Mengru Wang, Zekun Xi, Shengyu Mao, Jintian Zhang, Yuansheng Ni, Siyuan Cheng, Ziwen Xu, Xin Xu, Jia-Chen Gu, Yong Jiang, Pengjun Xie, Fei Huang, Lei Liang, Zhiqiang Zhang, Xiaowei Zhu, Jun Zhou, Huajun Chen

    Abstract: Large Language Models (LLMs) have shown extraordinary capabilities in understanding and generating text that closely mirrors human communication. However, a primary limitation lies in the significant computational demands during training, arising from their extensive parameterization. This challenge is further intensified by the dynamic nature of the world, necessitating frequent updates to LLMs t… ▽ More

    Submitted 28 March, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

    Comments: Ongoing work; 52 pages, 282 citations; benchmark is available at https://huggingface.co/datasets/zjunlp/KnowEdit code is available at https://github.com/zjunlp/EasyEdit paper list is available at https://github.com/zjunlp/KnowledgeEditingPapers

  43. arXiv:2312.14263  [pdf, other

    astro-ph.GA

    z~2 dual AGN host galaxies are disky: stellar kinematics in the ASTRID Simulation

    Authors: Ekaterine Dadiani, Tiziana Di Matteo, Nianyi Chen, Patrick Lachance, Yue Shen, Yu-Ching Chen, Rupert Croft, Yueying Ni, Simeon Bird

    Abstract: We study dual AGN host galaxy morphologies at $z=2$ using the ASTRID simulation, selecting black hole (BH) pairs with small separation ($Δr<30\rm{kpc}$), high mass ($M_{\text{BH,12}}>10^7M_\odot$), and luminosity ($L_{\text{bol,12}}>10^{43}\rm{erg/s}$). We kinematically decompose (using MORDOR) $\sim1000$ dual AGN hosts into standard components - a `disk' (thin and thick disk, pseudo-bulge) and 'b… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 15 pages, 12 figures, submitted to the Open Journal of Astrophysics

  44. arXiv:2312.09602  [pdf, other

    cs.IR

    Multi-Modality is All You Need for Transferable Recommender Systems

    Authors: Youhua Li, Hanwen Du, Yongxin Ni, Pengpeng Zhao, Qi Guo, Fajie Yuan, Xiaofang Zhou

    Abstract: ID-based Recommender Systems (RecSys), where each item is assigned a unique identifier and subsequently converted into an embedding vector, have dominated the designing of RecSys. Though prevalent, such ID-based paradigm is not suitable for developing transferable RecSys and is also susceptible to the cold-start issue. In this paper, we unleash the boundaries of the ID-based paradigm and propose a… ▽ More

    Submitted 18 December, 2023; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: ICDE'24 Accepted

  45. arXiv:2312.09183  [pdf, other

    astro-ph.GA

    MAGICS I. The First Few Orbits Encode the Fate of Seed Massive Black Hole Pairs

    Authors: Nianyi Chen, Diptajyoti Mukherjee, Tiziana Di Matteo, Yueying Ni, Simeon Bird, Rupert Croft

    Abstract: The elusive massive black hole (MBH) seeds stand to be revealed by the Laser Space Antenna Interferometer through mergers. As an aftermath of galaxy mergers, MBH coalescence is a vastly multi-scale process connected to galaxy formation. We introduce the "Massive black hole Assembly in Galaxies Informed by Cosmological Simulations" (MAGICS) suite, with galaxy/MBH properties and orbits recovered fro… ▽ More

    Submitted 25 April, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: 21 pages, 20 Figures. Submitted to The Open Journal of Astrophysics. Comments welcome!

  46. arXiv:2312.05860  [pdf, other

    cond-mat.str-el

    Hund's coupling driven interorbital entanglement in orbital-selective Mott phase

    Authors: Yuekun Niu, Yu Ni, Haishan Zhang, Liang Qiu, Jianli Wang, Leiming Chen, Yun Song, Shiping Feng

    Abstract: We examine the orbital-selective Mott transition in the non-hybridized two-band Hubbard model using the dynamical mean-field theory. We find that the orbital-selective Mott transition could be depicted by the local quantum state fidelity. Additionally, within the orbital-selective Mott phase, the combined characteristics of the two orbitals lead to the presence of interorbital entanglement, which… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: 6 pages, 3 figures

  47. arXiv:2312.04426  [pdf, other

    astro-ph.SR astro-ph.GA astro-ph.HE

    SN2023ixf in Messier 101: the twilight years of the progenitor as seen by Pan-STARRS

    Authors: Conor L. Ransome, V. Ashley Villar, Anna Tartaglia, Sebastian Javier Gonzalez, Wynn V. Jacobson-Galán, Charles D. Kilpatrick, Raffaella Margutti, Ryan J. Foley, Matthew Grayling, Yuan Qi Ni, Ricardo Yarza, Christine Ye, Katie Auchettl, Thomas de Boer, Kenneth C. Chambers, David A. Coulter, Maria R. Drout, Diego Farias, Christa Gall, Hua Gao, Mark E. Huber, Adaeze L. Ibik, David O. Jones, Nandita Khetan, Chien-Cheng Lin , et al. (6 additional authors not shown)

    Abstract: The nearby type II supernova, SN2023ixf in M101 exhibits signatures of early-time interaction with circumstellar material in the first week post-explosion. This material may be the consequence of prior mass loss suffered by the progenitor which possibly manifested in the form of a detectable pre-supernova outburst. We present an analysis of the long-baseline pre-explosion photometric data in $g$,… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: 19 pages, 8 figures, 1 table

  48. arXiv:2311.16502  [pdf, other

    cs.CL cs.AI cs.CV

    MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI

    Authors: Xiang Yue, Yuansheng Ni, Kai Zhang, Tianyu Zheng, Ruoqi Liu, Ge Zhang, Samuel Stevens, Dongfu Jiang, Weiming Ren, Yuxuan Sun, Cong Wei, Botao Yu, Ruibin Yuan, Renliang Sun, Ming Yin, Boyuan Zheng, Zhenzhu Yang, Yibo Liu, Wenhao Huang, Huan Sun, Yu Su, Wenhu Chen

    Abstract: We introduce MMMU: a new benchmark designed to evaluate multimodal models on massive multi-discipline tasks demanding college-level subject knowledge and deliberate reasoning. MMMU includes 11.5K meticulously collected multimodal questions from college exams, quizzes, and textbooks, covering six core disciplines: Art & Design, Business, Science, Health & Medicine, Humanities & Social Science, and… ▽ More

    Submitted 13 June, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: CVPR 2024 Oral

  49. arXiv:2311.14531  [pdf, other

    astro-ph.SR

    Formation of a long filament through the connection of two filament segments observed by CHASE

    Authors: H. T. Li, X. Cheng, Y. W. Ni, C. Li, S. H. Rao, J. H. Guo, M. D. Ding, P. F. Chen

    Abstract: We present imaging and spectroscopic diagnostics of a long filament during its formation with the observations from the Chinese H$α$ Solar Explorer and Solar Dynamics Observatory. The seed filament first appeared at about 05:00 UT on 2022 September 13. Afterwards, it grew gradually and connected to another filament segment nearby, building up a long filament at about 20:00 UT on the same day. The… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

    Comments: 11 pages, 6 figures, Accepted for publication in ApJL

  50. arXiv:2311.13432  [pdf, other

    astro-ph.SR physics.plasm-ph physics.space-ph

    Modelling the propagation of coronal mass ejections with COCONUT: implementation of the Regularized Biot-Savart Laws flux rope model

    Authors: Jinhan Guo, L. Linan, S. Poedts, Y. Guo, A. Lani, B. Schmieder, M. Brchnelova, B. Perri, T. Baratashvili, Y. W. Ni, P. F. Chen

    Abstract: Context: Coronal mass ejections (CMEs) are rapid eruptions of magnetized plasma that occur on the Sun, which are known as the main drivers of adverse space weather. Accurately tracking their evolution in the heliosphere in numerical models is of utmost importance for space weather forecasting. Aims: The main objective of this paper is to implement the Regularized Biot-Savart Laws (RBSL) method in… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

    Comments: 14 pages, 8 figures, accepted for publication in A&A