Showing 1–50 of 73 results for author: Yan, A

  1. arXiv:2407.08486  [pdf, ps, other]

    nlin.SI

    Noncommutative nonisospectral Toda and Lotka-Volterra lattices, and matrix discrete Painlevé equations

    Authors: Anhui Yan, Chunxia Li

    Abstract: The noncommutative analogues of the nonisospectral Toda and Lotka-Volterra lattices are proposed and studied by performing nonisospectral deformations on the matrix orthogonal polynomials and matrix symmetric orthogonal polynomials without specific weight functions, respectively. Under stationary reductions, matrix discrete Painlevé I and matrix asymmetric discrete Painlevé I equations are derived…

    Submitted 11 July, 2024; originally announced July 2024.

  2. arXiv:2407.00541  [pdf]

    cs.CL cs.AI cs.IR

    Answering real-world clinical questions using large language model based systems

    Authors: Yen Sia Low, Michael L. Jackson, Rebecca J. Hyde, Robert E. Brown, Neil M. Sanghavi, Julian D. Baldwin, C. William Pike, Jananee Muralidharan, Gavin Hui, Natasha Alexander, Hadeel Hassan, Rahul V. Nene, Morgan Pike, Courtney J. Pokrzywa, Shivam Vedak, Adam Paul Yan, Dong-han Yao, Amy R. Zipursky, Christina Dinh, Philip Ballentine, Dan C. Derieg, Vladimir Polony, Rehan N. Chawdry, Jordan Davies, Brigham B. Hyde, et al. (2 additional authors not shown)

    Abstract: Evidence to guide healthcare decisions is often limited by a lack of relevant and trustworthy literature as well as difficulty in contextualizing existing research for a specific patient. Large language models (LLMs) could potentially address both challenges by either summarizing published literature or generating new studies based on real-world data (RWD). We evaluated the ability of five LLM-bas…

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: 28 pages (2 figures, 3 tables) inclusive of 8 pages of supplemental materials (4 supplemental figures and 4 supplemental tables)

  3. arXiv:2406.04744  [pdf, other]

    cs.CL

    CRAG -- Comprehensive RAG Benchmark

    Authors: Xiao Yang, Kai Sun, Hao Xin, Yushi Sun, Nikita Bhalla, Xiangsen Chen, Sajal Choudhary, Rongze Daniel Gui, Ziran Will Jiang, Ziyu Jiang, Lingkun Kong, Brian Moran, Jiaqi Wang, Yifan Ethan Xu, An Yan, Chenyu Yang, Eting Yuan, Hanwen Zha, Nan Tang, Lei Chen, Nicolas Scheffer, Yue Liu, Nirav Shah, Rakesh Wanga, Anuj Kumar, et al. (2 additional authors not shown)

    Abstract: Retrieval-Augmented Generation (RAG) has recently emerged as a promising solution to alleviate Large Language Model (LLM)'s deficiency in lack of knowledge. Existing RAG datasets, however, do not adequately represent the diverse and dynamic nature of real-world Question Answering (QA) tasks. To bridge this gap, we introduce the Comprehensive RAG Benchmark (CRAG), a factual question answering bench…

    Submitted 7 June, 2024; originally announced June 2024.

  4. arXiv:2405.15161  [pdf, other]

    cs.CR cs.CV

    Are You Copying My Prompt? Protecting the Copyright of Vision Prompt for VPaaS via Watermark

    Authors: Huali Ren, Anli Yan, Chong-zhi Gao, Hongyang Yan, Zhenxin Zhang, Jin Li

    Abstract: Visual Prompt Learning (VPL) differs from traditional fine-tuning methods in reducing significant resource consumption by avoiding updating pre-trained model parameters. Instead, it focuses on learning an input perturbation, a visual prompt, added to downstream task data for making predictions. Since learning generalizable prompts requires expert design and creation, which is technically demanding…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 11 pages, 7 figures

  5. arXiv:2405.01769  [pdf, other]

    cs.CL

    A Survey on Large Language Models for Critical Societal Domains: Finance, Healthcare, and Law

    Authors: Zhiyu Zoey Chen, Jing Ma, Xinlu Zhang, Nan Hao, An Yan, Armineh Nourbakhsh, Xianjun Yang, Julian McAuley, Linda Petzold, William Yang Wang

    Abstract: In the fast-evolving domain of artificial intelligence, large language models (LLMs) such as GPT-3 and GPT-4 are revolutionizing the landscapes of finance, healthcare, and law: domains characterized by their reliance on professional expertise, challenging data acquisition, high-stakes, and stringent regulatory compliance. This survey offers a detailed exploration of the methodologies, applications…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 35 pages, 6 figures

  6. arXiv:2404.16425  [pdf, other]

    astro-ph.HE

    Soft X-ray prompt emission from a high-redshift gamma-ray burst EP240315a

    Authors: Y. Liu, H. Sun, D. Xu, D. S. Svinkin, J. Delaunay, N. R. Tanvir, H. Gao, C. Zhang, Y. Chen, X. -F. Wu, B. Zhang, W. Yuan, J. An, G. Bruni, D. D. Frederiks, G. Ghirlanda, J. -W. Hu, A. Li, C. -K. Li, J. -D. Li, D. B. Malesani, L. Piro, G. Raman, R. Ricci, E. Troja, et al. (170 additional authors not shown)

    Abstract: Long gamma-ray bursts (GRBs) are believed to originate from core collapse of massive stars. High-redshift GRBs can probe the star formation and reionization history of the early universe, but their detection remains rare. Here we report the detection of a GRB triggered in the 0.5--4 keV band by the Wide-field X-ray Telescope (WXT) on board the Einstein Probe (EP) mission, designated as EP240315a,…

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 41 pages, 8 figures, 7 tables

  7. arXiv:2404.16375  [pdf, other]

    cs.CV cs.AI cs.CL

    List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs

    Authors: An Yan, Zhengyuan Yang, Junda Wu, Wanrong Zhu, Jianwei Yang, Linjie Li, Kevin Lin, Jianfeng Wang, Julian McAuley, Jianfeng Gao, Lijuan Wang

    Abstract: Set-of-Mark (SoM) Prompting unleashes the visual grounding capability of GPT-4V, by enabling the model to associate visual objects with tags inserted on the image. These tags, marked with alphanumerics, can be indexed via text tokens for easy reference. Despite the extraordinary performance from GPT-4V, we observe that other Multimodal Large Language Models (MLLMs) struggle to understand these vis…

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: Preprint

  8. arXiv:2404.10284  [pdf, other]

    math.CO

    Log-concavity in Combinatorics

    Authors: Alan Yan

    Abstract: We survey some of the mechanisms used to prove that naturally defined sequences in combinatorics are log-concave. Among these mechanisms are Alexandrov's inequality for mixed discriminants, the Alexandrov-Fenchel inequality for mixed volumes, Lorentzian polynomials, and the Hard Lefschetz theorem. We use these mechanisms to prove some new log-concavity and extremal results related to partially ord…

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: written in 2022-2023 as an undergraduate thesis

  9. arXiv:2403.03952  [pdf, other]

    cs.IR

    Bridging Language and Items for Retrieval and Recommendation

    Authors: Yupeng Hou, Jiacheng Li, Zhankui He, An Yan, Xiusi Chen, Julian McAuley

    Abstract: This paper introduces BLaIR, a series of pretrained sentence embedding models specialized for recommendation scenarios. BLaIR is trained to learn correlations between item metadata and potential natural language context, which is useful for retrieving and recommending items. To pretrain BLaIR, we collect Amazon Reviews 2023, a new dataset comprising over 570 million reviews and 48 million items fr…

    Submitted 6 March, 2024; originally announced March 2024.

  10. arXiv:2312.15617  [pdf, other]

    cs.CR cs.CV

    GanFinger: GAN-Based Fingerprint Generation for Deep Neural Network Ownership Verification

    Authors: Huali Ren, Anli Yan, Xiaojun Ren, Pei-Gen Ye, Chong-zhi Gao, Zhili Zhou, Jin Li

    Abstract: Deep neural networks (DNNs) are extensively employed in a wide range of application scenarios. Generally, training a commercially viable neural network requires significant amounts of data and computing resources, and it is easy for unauthorized users to use the networks illegally. Therefore, network ownership verification has become one of the most crucial steps in safeguarding digital assets. To…

    Submitted 25 December, 2023; originally announced December 2023.

    Comments: 9 pages, 6 figures

  11. arXiv:2312.12027  [pdf, other]

    physics.atom-ph

    A continuous cold rubidium atomic beam with enhanced flux and tunable velocity

    Authors: Shengzhe Wang, Zhixin Meng, Peiqiang Yan, Yuanxing Liu, Yanying Feng

    Abstract: We present a cold atomic beam source based on a two-dimensional (2D)+ magneto-optical trap (MOT), capable of generating a continuous cold beam of 87Rb atoms with a flux up to 4.3 × 10^9 atoms/s, a mean velocity of 10.96(2.20) m/s, and a transverse temperature of 16.90(1.56) μK. Investigating the influence of high cooling laser intensity, we observe a significant population loss of atoms to hyperfine…

    Submitted 19 December, 2023; originally announced December 2023.

  12. arXiv:2312.06964  [pdf, other]

    astro-ph.IM hep-ex physics.ins-det

    Ground Calibration Result of the Lobster Eye Imager for Astronomy

    Authors: Huaqing Cheng, Zhixing Ling, Chen Zhang, Xiaojin Sun, Shengli Sun, Yuan Liu, Yanfeng Dai, Zhenqing Jia, Haiwu Pan, Wenxin Wang, Donghua Zhao, Yifan Chen, Zhiwei Cheng, Wei Fu, Yixiao Han, Junfei Li, Zhengda Li, Xiaohao Ma, Yulong Xue, Ailiang Yan, Qiang Zhang, Yusa Wang, Xiongtao Yang, Zijian Zhao, Weimin Yuan

    Abstract: We report on results of the on-ground X-ray calibration of the Lobster Eye Imager for Astronomy (LEIA), an experimental space wide-field (18.6 × 18.6 square degrees) X-ray telescope built from novel lobster eye micro-pore optics. LEIA was successfully launched on July 27, 2022 onboard the SATech-01 satellite. To achieve full characterisation of its performance before launch, a series of tests and ca…

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: 24 pages, 13 figures. Submitted to Experimental Astronomy

  13. arXiv:2311.07562  [pdf, other]

    cs.CV cs.AI

    GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation

    Authors: An Yan, Zhengyuan Yang, Wanrong Zhu, Kevin Lin, Linjie Li, Jianfeng Wang, Jianwei Yang, Yiwu Zhong, Julian McAuley, Jianfeng Gao, Zicheng Liu, Lijuan Wang

    Abstract: We present MM-Navigator, a GPT-4V-based agent for the smartphone graphical user interface (GUI) navigation task. MM-Navigator can interact with a smartphone screen as human users, and determine subsequent actions to fulfill given instructions. Our findings demonstrate that large multimodal models (LMMs), specifically GPT-4V, excel in zero-shot GUI navigation through its advanced screen interpretat…

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: Work in progress

  14. arXiv:2311.01361  [pdf, other]

    cs.CV cs.CL

    GPT-4V(ision) as a Generalist Evaluator for Vision-Language Tasks

    Authors: Xinlu Zhang, Yujie Lu, Weizhi Wang, An Yan, Jun Yan, Lianke Qin, Heng Wang, Xifeng Yan, William Yang Wang, Linda Ruth Petzold

    Abstract: Automatically evaluating vision-language tasks is challenging, especially when it comes to reflecting human judgments due to limitations in accounting for fine-grained details. Although GPT-4V has shown promising results in various multi-modal tasks, leveraging GPT-4V as a generalist evaluator for these tasks has not yet been systematically explored. We comprehensively validate GPT-4V's capabiliti…

    Submitted 2 November, 2023; originally announced November 2023.

  15. arXiv:2310.16639  [pdf, other]

    cs.CV cs.LG

    Driving through the Concept Gridlock: Unraveling Explainability Bottlenecks in Automated Driving

    Authors: Jessica Echterhoff, An Yan, Kyungtae Han, Amr Abdelraouf, Rohit Gupta, Julian McAuley

    Abstract: Concept bottleneck models have been successfully used for explainable machine learning by encoding information within the model with a set of human-defined concepts. In the context of human-assisted or autonomous driving, explainability models can help user acceptance and understanding of decisions made by the autonomous vehicle, which can be used to rationalize and explain driver or vehicle behav…

    Submitted 26 October, 2023; v1 submitted 25 October, 2023; originally announced October 2023.

  16. arXiv:2310.14088  [pdf, other]

    cs.CL

    MedEval: A Multi-Level, Multi-Task, and Multi-Domain Medical Benchmark for Language Model Evaluation

    Authors: Zexue He, Yu Wang, An Yan, Yao Liu, Eric Y. Chang, Amilcare Gentili, Julian McAuley, Chun-Nan Hsu

    Abstract: Curated datasets for healthcare are often limited due to the need of human annotations from experts. In this paper, we present MedEval, a multi-level, multi-task, and multi-domain medical benchmark to facilitate the development of language models for healthcare. MedEval is comprehensive and consists of data from several healthcare systems and spans 35 human body regions from 8 examination modaliti…

    Submitted 14 November, 2023; v1 submitted 21 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023. Camera-ready version: updated IRB, added more evaluation results on LLMs such as GPT4, LLaMa2, and LLaMa2-chat

  17. arXiv:2310.13997  [pdf]

    cond-mat.mtrl-sci

    Topological Magnetoresistance of Magnetic Skyrmionic Bubbles

    Authors: Fei Li, Hao Nie, Yu Zhao, Zhihe Zhao, Juntao Huo, Hongxian Shen, Sida Jiang, Renjie Chen, Aru Yan, S-W Cheong, Weixing Xia, Lunyong Zhang, Jianfei Sun

    Abstract: Magnetic skyrmions offer promising prospects for constructing future energy-efficient and high-density information technology, leading to extensive explorations of new skyrmionic materials recently. The topological Hall effect has been widely adopted as a distinctive marker of skyrmion emergence. Alternately, here we propose a novel signature of skyrmion state by quantitatively investigating the m…

    Submitted 21 October, 2023; originally announced October 2023.

    Comments: 17 pages, 5 figures, submitted

  18. arXiv:2310.03182  [pdf, other]

    cs.CV cs.CL cs.LG

    Robust and Interpretable Medical Image Classifiers via Concept Bottleneck Models

    Authors: An Yan, Yu Wang, Yiwu Zhong, Zexue He, Petros Karypis, Zihan Wang, Chengyu Dong, Amilcare Gentili, Chun-Nan Hsu, Jingbo Shang, Julian McAuley

    Abstract: Medical image classification is a critical problem for healthcare, with the potential to alleviate the workload of doctors and facilitate diagnoses of patients. However, two challenges arise when deploying deep learning models to real-world healthcare applications. First, neural models tend to learn spurious correlations instead of desired features, which could fall short when generalizing to new…

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: 18 pages, 12 figures

  19. arXiv:2309.13434  [pdf, ps, other]

    math.CO math.MG

    The extremals of the Kahn-Saks inequality

    Authors: Ramon van Handel, Alan Yan, Xinmeng Zeng

    Abstract: A classical result of Kahn and Saks states that given any partially ordered set with two distinguished elements, the number of linear extensions in which the ranks of the distinguished elements differ by $k$ is log-concave as a function of $k$. The log-concave sequences that can arise in this manner prove to exhibit a much richer structure, however, than is evident from log-concavity alone. The ma…

    Submitted 30 June, 2024; v1 submitted 23 September, 2023; originally announced September 2023.

    Comments: 30 pages, 1 figure; minor corrections and clarifications

    MSC Class: 06A07; 52A39; 52A40; 52B05

  20. arXiv:2308.03685  [pdf, other]

    cs.CV

    Learning Concise and Descriptive Attributes for Visual Recognition

    Authors: An Yan, Yu Wang, Yiwu Zhong, Chengyu Dong, Zexue He, Yujie Lu, William Wang, Jingbo Shang, Julian McAuley

    Abstract: Recent advances in foundation models present new opportunities for interpretable visual recognition -- one can first query Large Language Models (LLMs) to obtain a set of attributes that describe each class, then apply vision-language models to classify images via these attributes. Pioneering work shows that querying thousands of attributes can achieve performance competitive with image features.…

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: ICCV 2023

  21. Effective Hamiltonian approach to the quantum phase transitions in the extended Jaynes-Cummings model

    Authors: H. T. Cui, Y. A. Yan, M. Qin, X. X. Yi

    Abstract: The study of phase transitions in dissipative quantum systems based on the Liouvillian is often hindered by the difficulty of constructing a time-local master equation when the system-environment coupling is strong. To address this issue, the complex discretization approximation for the environment is proposed to study the quantum phase transition in the extended Jaynes-Cummings model with an infin…

    Submitted 6 April, 2024; v1 submitted 25 July, 2023; originally announced July 2023.

    Comments: 12 pages, published version

    Journal ref: Phys. Rev. A 109, 042202 (2024)

  22. arXiv:2307.03691  [pdf, other]

    cs.CL cs.AI cs.IR

    Comparing Apples to Apples: Generating Aspect-Aware Comparative Sentences from User Reviews

    Authors: Jessica Echterhoff, An Yan, Julian McAuley

    Abstract: It is time-consuming to find the best product among many similar alternatives. Comparative sentences can help to contrast one item from others in a way that highlights important features of an item that stand out. Given reviews of one or multiple items and relevant item features, we generate comparative review sentences to aid users to find the best fit. Specifically, our model consists of three s…

    Submitted 23 July, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

  23. arXiv:2305.14895  [pdf, other]

    astro-ph.IM hep-ex physics.ins-det

    The Lobster Eye Imager for Astronomy Onboard the SATech-01 Satellite

    Authors: Z. X. Ling, X. J. Sun, C. Zhang, S. L. Sun, G. Jin, S. N. Zhang, X. F. Zhang, J. B. Chang, F. S. Chen, Y. F. Chen, Z. W. Cheng, W. Fu, Y. X. Han, H. Li, J. F. Li, Y. Li, Z. D. Li, P. R. Liu, Y. H. Lv, X. H. Ma, Y. J. Tang, C. B. Wang, R. J. Xie, Y. L. Xue, A. L. Yan, et al. (101 additional authors not shown)

    Abstract: The Lobster Eye Imager for Astronomy (LEIA), a pathfinder of the Wide-field X-ray Telescope of the Einstein Probe (EP) mission, was successfully launched onboard the SATech-01 satellite of the Chinese Academy of Sciences on 27 July 2022. In this paper, we introduce the design and on-ground test results of the LEIA instrument. Using state-of-the-art Micro-Pore Optics (MPO), a wide field-of-view (Fo…

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: Accepted by RAA

  24. arXiv:2305.08300  [pdf, other]

    cs.CL

    "Nothing Abnormal": Disambiguating Medical Reports via Contrastive Knowledge Infusion

    Authors: Zexue He, An Yan, Amilcare Gentili, Julian McAuley, Chun-Nan Hsu

    Abstract: Sharing medical reports is essential for patient-centered care. A recent line of work has focused on automatically generating reports with NLP methods. However, different audiences have different purposes when writing/reading medical reports -- for example, healthcare professionals care more about pathology, whereas patients are more concerned with the diagnosis ("Is there any abnormality?"). The…

    Submitted 14 May, 2023; originally announced May 2023.

    Comments: Accepted to AAAI 2023. 13 pages including 4-page supplementary materials

  25. arXiv:2303.11107  [pdf]

    physics.app-ph cond-mat.mes-hall

    Landauer-QFLPS model for mixed Schottky-Ohmic contact two-dimensional transistors

    Authors: Zhao-Yi Yan, Zhan Hou, Kan-Hao Xue, Tian Lu, Ruiting Zhao, Junying Xue, Fan Wu, Minghao Shao, Jianlan Yan, Anzhi Yan, Zhenze Wang, Penghui Shen, Mingyue Zhao, Xiangshui Miao, Zhaoyang Lin, Houfang Liu, He Tian, Yi Yang, Tian-Ling Ren

    Abstract: Two-dimensional material-based field effect transistors (2DM-FETs) are playing a revolutionary role in electronic devices. However, after years of development, no device model can match the Pao-Sah model for standard silicon-based transistors in terms of physical accuracy and computational efficiency to support large-scale integrated circuit design. One remaining critical obstacle is the contacts…

    Submitted 20 March, 2023; originally announced March 2023.

  26. arXiv:2303.06926  [pdf]

    cond-mat.mtrl-sci

    An efficient model algorithm for two-dimensional field-effect transistors

    Authors: Zhao-Yi Yan, Zhan Hou, Fan Wu, Ruiting Zhao, Jianlan Yan, Anzhi Yan, Zhenze Wang, Kan-Hao Xue, Houfang Liu, He Tian, Yi Yang, Tian-Ling Ren

    Abstract: Two-dimensional materials-based field-effect transistors (2DM-FETs) exhibit both ambipolar and unipolar transport types. To physically and compactly cover both cases, we put forward a quasi-Fermi-level phase space (QFLPS) approach to model the ambipolar effect in our previous work. This work aims to further improve the QFLPS model's numerical aspect so that the model can be implanted into the stan…

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: 17 pages, 9 figures

  27. arXiv:2303.06584  [pdf, other]

    quant-ph

    Effective Hamiltonian approach to the exact dynamics of open system by complex discretization approximation for environment

    Authors: H. T. Cui, Y. A. Yan, M. Qin, X. X. Yi

    Abstract: The discretization approximation method commonly used to simulate the open dynamics of system coupled to the environment in continuum often suffers from the recurrence. To address this issue, this paper proposes a novel generalization of the discretization approximation method in the complex plane using complex Gauss quadratures. The effective Hamiltonian can be constructed by this way, which is n…

    Submitted 27 May, 2024; v1 submitted 12 March, 2023; originally announced March 2023.

    Comments: Title is changed. A significant improvement. The discussion about the open dynamics of AAH model is completely rewritten

  28. arXiv:2211.10007  [pdf, other]

    astro-ph.HE astro-ph.IM

    First wide field-of-view X-ray observations by a lobster eye focusing telescope in orbit

    Authors: C. Zhang, Z. X. Ling, X. J. Sun, S. L. Sun, Y. Liu, Z. D. Li, Y. L. Xue, Y. F. Chen, Y. F. Dai, Z. Q. Jia, H. Y. Liu, X. F. Zhang, Y. H. Zhang, S. N. Zhang, F. S. Chen, Z. W. Cheng, W. Fu, Y. X. Han, H. Li, J. F. Li, Y. Li, P. R. Liu, X. H. Ma, Y. J. Tang, C. B. Wang, et al. (53 additional authors not shown)

    Abstract: As a novel X-ray focusing technology, lobster eye micro-pore optics (MPO) feature both a wide observing field of view and true imaging capability, promising sky monitoring with significantly improved sensitivity and spatial resolution in soft X-rays. Since first proposed by Angel (1979), the optics have been extensively studied, developed and trialed over the past decades. In this Letter, we repor…

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: 11 pages, 4 figures. Accepted for publication in Astrophysical Journal Letters

  29. arXiv:2210.05836  [pdf, other]

    cs.CL

    CLIP also Understands Text: Prompting CLIP for Phrase Understanding

    Authors: An Yan, Jiacheng Li, Wanrong Zhu, Yujie Lu, William Yang Wang, Julian McAuley

    Abstract: Contrastive Language-Image Pretraining (CLIP) efficiently learns visual concepts by pre-training with natural language supervision. CLIP and its visual encoder have been explored on various vision and language tasks and achieve strong zero-shot or transfer learning performance. However, the application of its text encoder solely for text understanding has been less explored. In this paper, we find…

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: Work in progress

  30. arXiv:2210.03765  [pdf, other]

    cs.CL cs.AI

    Visualize Before You Write: Imagination-Guided Open-Ended Text Generation

    Authors: Wanrong Zhu, An Yan, Yujie Lu, Wenda Xu, Xin Eric Wang, Miguel Eckstein, William Yang Wang

    Abstract: Recent advances in text-to-image synthesis make it possible to visualize machine imaginations for a given context. On the other hand, when generating text, human writers are gifted at creative visualization, which enhances their writings by forming imaginations as blueprints before putting down the stories in words. Inspired by such a cognitive process, we ask the natural question of whether we ca…

    Submitted 14 February, 2023; v1 submitted 7 October, 2022; originally announced October 2022.

    Comments: EACL 2023

  31. arXiv:2209.00196  [pdf, other]

    eess.IV physics.optics

    Group frame neural network of moving object ghost imaging combined with frame merging algorithm

    Authors: Da Chen, Shan-Guo Feng, Hua-Hua Wang, Jia-Ning Cao, Zhi-Wei Zhang, Zhi-Xin Yang, Ao Yan, Lu Gao, Ze Zhang

    Abstract: The nature of multiple samples to extract correlation information limits the applications of ghost imaging of moving objects. A novel multi-to-one neural network is proposed and the concept of "batch frame" is introduced to improve the serial imaging method. The neural network extracts more correlation information from a small number of samples, thus reducing the sampling ratio of the ghost imagin…

    Submitted 31 August, 2022; originally announced September 2022.

    Comments: 12 pages, 7 figures

  32. arXiv:2207.00422  [pdf, other]

    cs.IR cs.AI cs.CV

    Personalized Showcases: Generating Multi-Modal Explanations for Recommendations

    Authors: An Yan, Zhankui He, Jiacheng Li, Tianyang Zhang, Julian McAuley

    Abstract: Existing explanation models generate only text for recommendations but still struggle to produce diverse contents. In this paper, to further enrich explanations, we propose a new task named personalized showcases, in which we provide both textual and visual information to explain our recommendations. Specifically, we first select a personalized image set that is the most relevant to a user's inter…

    Submitted 6 April, 2023; v1 submitted 29 June, 2022; originally announced July 2022.

    Comments: Accepted to SIGIR-23, with additional dataset details. Code and data: https://github.com/zzxslp/Gest

  33. AdS$_3$/AdS$_2$ degression of Fronsdal fields

    Authors: A. N. Yan

    Abstract: We analyze the Kaluza-Klein type procedure in AdS$_3$ space called the dimensional degression. The topological theory of the Fronsdal field in AdS$_3$ is reformulated in terms of the fields propagating in AdS$_2$. We find that the Fronsdal field in AdS$_3$ leads to finitely many Kaluza-Klein modes. Namely, the obtained spectrum is the massive Klein-Gordon and Proca fields in AdS$_2$. The result is…

    Submitted 3 July, 2022; v1 submitted 11 May, 2022; originally announced May 2022.

    Comments: v2: 24 pages, Appendix E added, typos fixed

  34. arXiv:2202.09651  [pdf, other]

    math.OC math.ST

    An Oracle Gradient Regularized Newton Method for Quadratic Measurements Regression

    Authors: Jun Fan, Jie Sun, Ailing Yan, Shenglong Zhou

    Abstract: Recently, recovering an unknown signal from quadratic measurements has gained popularity because it includes many interesting applications as special cases such as phase retrieval, fusion frame phase retrieval, and positive operator-valued measure. In this paper, by employing the least squares approach to reconstruct the signal, we establish the non-asymptotic statistical property showing that the…

    Submitted 15 March, 2023; v1 submitted 19 February, 2022; originally announced February 2022.

  35. arXiv:2109.12242  [pdf, other]

    cs.CL

    Weakly Supervised Contrastive Learning for Chest X-Ray Report Generation

    Authors: An Yan, Zexue He, Xing Lu, Jiang Du, Eric Chang, Amilcare Gentili, Julian McAuley, Chun-Nan Hsu

    Abstract: Radiology report generation aims at generating descriptive text from radiology images automatically, which may present an opportunity to improve radiology reporting and interpretation. A typical setting consists of training encoder-decoder models on image-report pairs with a cross entropy loss, which struggles to generate informative sentences for clinical diagnoses since normal findings dominate…

    Submitted 24 September, 2021; originally announced September 2021.

    Comments: Findings of EMNLP 2021

  36. arXiv:2109.07650  [pdf]

    math.OC

    Study on S2 Flow Path Design and Three-dimensional Numerical Simulation Parameter Calibration in Axial Compressor

    Authors: Aobo Yang, An Yan, Jiang Chen

    Abstract: Aerodynamic design process of multi-stage axial flow compressor usually uses the way that combines the S2 flow design and three-dimensional CFD numerical simulation analysis. Based on Mr. Wu Zhonghua's "Three-dimensional flow theory", aiming at the S2 flow design matching parameters and the three-dimensional CFD numerical simulation data, through autonomous programming, the S2 design parameter…

    Submitted 15 September, 2021; originally announced September 2021.

  37. arXiv:2106.05970  [pdf, other]

    cs.CL cs.AI cs.CV

    ImaginE: An Imagination-Based Automatic Evaluation Metric for Natural Language Generation

    Authors: Wanrong Zhu, Xin Eric Wang, An Yan, Miguel Eckstein, William Yang Wang

    Abstract: Automatic evaluations for natural language generation (NLG) conventionally rely on token-level or embedding-level comparisons with text references. This differs from human language processing, for which visual imagination often improves comprehension. In this work, we propose ImaginE, an imagination-based automatic evaluation metric for natural language generation. With the help of StableDiffusion…

    Submitted 14 February, 2023; v1 submitted 10 June, 2021; originally announced June 2021.

    Comments: EACL 2023

  38. arXiv:2105.05722  [pdf, other]

    hep-th

    AdS$_3$/AdS$_2$ degression of massless particles

    Authors: K. B. Alkalaev, A. N. Yan

    Abstract: We study a 3d/2d dimensional degression which is a Kaluza-Klein type mechanism in AdS$_3$ space foliated into AdS$_2$ hypersurfaces. It is shown that an AdS$_3$ massless particle of spin $s=1,2,...,\infty$ degresses into a couple of AdS$_2$ particles of equal energies $E=s$. Note that the Kaluza-Klein spectra in higher dimensions are always infinite. To formulate the AdS$_3$/AdS$_2$ degression we…

    Submitted 29 September, 2021; v1 submitted 12 May, 2021; originally announced May 2021.

    Comments: 39 pages; v2: minor edits and notational improvements in Section 3 and Appendix B, typos fixed; v3: minor corrections; v4: comments added, JHEP version

  39. arXiv:2102.01860  [pdf, other]

    cs.CV cs.CL

    L2C: Describing Visual Differences Needs Semantic Understanding of Individuals

    Authors: An Yan, Xin Eric Wang, Tsu-Jui Fu, William Yang Wang

    Abstract: Recent advances in language and vision push forward the research of captioning a single image to describing visual differences between image pairs. Suppose there are two images, I_1 and I_2, and the task is to generate a description W_{1,2} comparing them, existing methods directly model { I_1, I_2 } -> W_{1,2} mapping without the semantic understanding of individuals. In this paper, we introduce…

    Submitted 2 February, 2021; originally announced February 2021.

    Comments: EACL-2021 short

  40. arXiv:2007.00229  [pdf, other

    cs.CL cs.AI cs.CV

    Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation

    Authors: Wanrong Zhu, Xin Eric Wang, Tsu-Jui Fu, An Yan, Pradyumna Narayana, Kazoo Sone, Sugato Basu, William Yang Wang

    Abstract: One of the most challenging topics in Natural Language Processing (NLP) is visually-grounded language understanding and reasoning. Outdoor vision-and-language navigation (VLN) is such a task where an agent follows natural language instructions and navigates a real-life urban environment. Due to the lack of human-annotated instructions that illustrate intricate urban scenes, outdoor VLN remains a c…

    Submitted 3 February, 2021; v1 submitted 1 July, 2020; originally announced July 2020.

    Comments: EACL 2021

  41. arXiv:2006.00545  [pdf, other

    cs.RO cs.CV

    Motion2Vec: Semi-Supervised Representation Learning from Surgical Videos

    Authors: Ajay Kumar Tanwani, Pierre Sermanet, Andy Yan, Raghav Anand, Mariano Phielipp, Ken Goldberg

    Abstract: Learning meaningful visual representations in an embedding space can facilitate generalization in downstream tasks such as action segmentation and imitation. In this paper, we learn a motion-centric representation of surgical video demonstrations by grouping them into action segments/sub-goals/options in a semi-supervised manner. We present Motion2Vec, an algorithm that learns a deep embedding fea…

    Submitted 31 May, 2020; originally announced June 2020.

    Comments: IEEE International Conference on Robotics and Automation (ICRA), 2020

  42. arXiv:2004.03361  [pdf

    cond-mat.mes-hall physics.optics

    Continuously tuning the refractive indices by restructuring materials on the 20-1000 atoms scale: improving anti-reflection coating designs

    Authors: Jacob Poole, Aidong Yan, Paul Ohodnicki, Kevin Chen

    Abstract: We demonstrate the capability of block-copolymer templating to tune the refractive indices of functional oxides over a broad range by structuring materials on the 20-1000 atoms scale, with simple one-pot synthesis. The presented method is then combined with genetic algorithm-based optimization to explore its application for anti-reflection coating design. Merging these techniques allows for the re…

    Submitted 4 April, 2020; originally announced April 2020.

    Comments: 10 pages, 5 figures, 2 tables, and a couple of equations. arXiv admin note: substantial text overlap with arXiv:1504.08346

  43. arXiv:2002.07108  [pdf, other

    cond-mat.mtrl-sci physics.optics

    The ultrafast onset of exciton formation in 2D semiconductors

    Authors: Chiara Trovatello, Florian Katsch, Nicholas J. Borys, Malte Selig, Kaiyuan Yao, Rocio Borrego-Varillas, Francesco Scotognella, Ilka Kriegel, Aiming Yan, Alex Zettl, P. James Schuck, Andreas Knorr, Giulio Cerullo, Stefano Dal Conte

    Abstract: The equilibrium and non-equilibrium optical properties of single-layer transition metal dichalcogenides (TMDs) are determined by strongly bound excitons. Exciton relaxation dynamics in TMDs have been extensively studied by time-domain optical spectroscopies. However, the formation dynamics of excitons following non-resonant photoexcitation of free electron-hole pairs have been challenging to direc…

    Submitted 17 February, 2020; originally announced February 2020.

  44. arXiv:1910.11301  [pdf, other

    cs.CL cs.CV cs.RO

    Cross-Lingual Vision-Language Navigation

    Authors: An Yan, Xin Eric Wang, Jiangtao Feng, Lei Li, William Yang Wang

    Abstract: Commanding a robot to navigate with natural language instructions is a long-term goal for grounded language understanding and robotics. But the dominant language is English, according to previous studies on vision-language navigation (VLN). To go beyond English and serve people speaking different languages, we collect a bilingual Room-to-Room (BL-R2R) dataset, extending the original benchmark with…

    Submitted 5 December, 2020; v1 submitted 24 October, 2019; originally announced October 2019.

  45. arXiv:1910.00727  [pdf, other

    cs.LG cs.CV stat.ML

    Analyzing and Improving Neural Networks by Generating Semantic Counterexamples through Differentiable Rendering

    Authors: Lakshya Jain, Varun Chandrasekaran, Uyeong Jang, Wilson Wu, Andrew Lee, Andy Yan, Steven Chen, Somesh Jha, Sanjit A. Seshia

    Abstract: Even as deep neural networks (DNNs) have achieved remarkable success on vision-related tasks, their performance is brittle to transformations in the input. Of particular interest are semantic transformations that model changes that have a basis in the physical world, such as rotations, translations, changes in lighting or camera pose. In this paper, we show how differentiable rendering can be util…

    Submitted 17 July, 2020; v1 submitted 1 October, 2019; originally announced October 2019.

  46. arXiv:1908.09972  [pdf, other

    cs.IR cs.LG

    CosRec: 2D Convolutional Neural Networks for Sequential Recommendation

    Authors: An Yan, Shuo Cheng, Wang-Cheng Kang, Mengting Wan, Julian McAuley

    Abstract: Sequential patterns play an important role in building modern recommender systems. To this end, several recommender systems have been built on top of Markov Chains and Recurrent Models (among others). Although these sequential models have proven successful at a range of tasks, they still struggle to uncover complex relationships nested in user purchase histories. In this paper, we argue that model…

    Submitted 26 August, 2019; originally announced August 2019.

    Comments: To appear in CIKM-2019, code at https://github.com/zzxslp/CosRec

  47. arXiv:1907.03827  [pdf, other

    cs.CY cs.AI cs.LG stat.AP

    FairST: Equitable Spatial and Temporal Demand Prediction for New Mobility Systems

    Authors: An Yan, Bill Howe

    Abstract: Emerging transportation modes, including car-sharing, bike-sharing, and ride-hailing, are transforming urban mobility but have been shown to reinforce socioeconomic inequities. Spatiotemporal demand prediction models for these new mobility regimes must therefore consider fairness as a first-class design requirement. We present FairST, a fairness-aware model for predicting demand for new mobility s…

    Submitted 21 June, 2019; originally announced July 2019.

  48. arXiv:1906.08833  [pdf, other

    physics.med-ph eess.IV physics.optics

    Sample phase gradient and fringe phase shift in dual phase grating X-ray interferometry

    Authors: Aimin Yan, Xizeng Wu, Hong Liu

    Abstract: One of the key tasks in grating based x-ray phase contrast imaging is to accurately retrieve local phase gradients of a sample from measured intensity fringe shifts. To fulfill this task in dual phase grating interferometry, one needs to know the exact mathematical relationship between the two. In this work, using intuitive analysis of the sample-generated fringe shifts based on the beat pattern f…

    Submitted 20 June, 2019; originally announced June 2019.

    Comments: submitted to Optics Express for review

  49. arXiv:1906.01587  [pdf, other

    physics.med-ph

    Clarification on Generalized Lau condition for X-ray interferometers based on dual phase gratings

    Authors: Aimin Yan, Xizeng Wu, Hong Liu

    Abstract: To implement dual phase grating x-ray interferometry with x-ray tubes, one needs to incorporate an absorbing source grating. In order to attain good fringe visibility, the period of a source grating should be subject to a stringent condition. In literature some authors claim that the Lau-condition in Talbot-Lau interferometry can be literally transferred to dual phase grating interferometry. In th…

    Submitted 4 June, 2019; originally announced June 2019.

    Comments: submitted to Optics Express

  50. Cellular reproduction number, generation time and growth rate differ between human- and avian-adapted influenza strains

    Authors: Ada W. C. Yan, Jie Zhou, Catherine A. A. Beauchemin, Colin A. Russell, Wendy S. Barclay, Steven Riley

    Abstract: When analysing in vitro data, growth kinetics of influenza strains are often compared by computing their growth rates, which are sometimes used as proxies for fitness. However, analogous to mechanistic epidemic models, the growth rate can be defined as a function of two parameters: the basic reproduction number (the average number of cells each infected cell infects) and the mean generation time (…

    Submitted 19 March, 2019; originally announced March 2019.

    Comments: 24 pages, 5 figures, 8 supplementary figures, 1 supplementary table

    Report number: RIKEN-iTHEMS-Report-19

    MSC Class: 92C60; 92D15; 92D30

    Journal ref: Epidemics, 33, Dec. (2020) 100406