Skip to main content

Showing 1–50 of 653 results for author: Lee, N

  1. arXiv:2407.09043  [pdf, other

    cs.AI

    Molecule Language Model with Augmented Pairs and Expertise Transfer

    Authors: Namkyeong Lee, Siddhartha Laghuvarapu, Chanyoung Park, Jimeng Sun

    Abstract: Understanding the molecules and their textual descriptions via molecule language models (MoLM) recently got a surge of interest among researchers. However, unique challenges exist in the field of MoLM due to 1) a limited amount of molecule-text paired data and 2) missing expertise that occurred due to the specialized areas of focus among the experts. To this end, we propose AMOLE, which 1) augment… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: ACL 2024 Workshop on Languages and Molecule

  2. arXiv:2406.15524  [pdf, other

    cs.CL cs.LG

    Rethinking Pruning Large Language Models: Benefits and Pitfalls of Reconstruction Error Minimization

    Authors: Sungbin Shin, Wonpyo Park, Jaeho Lee, Namhoon Lee

    Abstract: This work suggests fundamentally rethinking the current practice of pruning large language models (LLMs). The way it is done is by divide and conquer: split the model into submodels, sequentially prune them, and reconstruct predictions of the dense counterparts on small calibration data one at a time; the final model is obtained simply by putting the resulting sparse submodels together. While this… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  3. arXiv:2406.09948  [pdf, other

    cs.CL

    BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages

    Authors: Junho Myung, Nayeon Lee, Yi Zhou, Jiho Jin, Rifki Afina Putri, Dimosthenis Antypas, Hsuvas Borkakoty, Eunsu Kim, Carla Perez-Almendros, Abinew Ali Ayele, Víctor Gutiérrez-Basulto, Yazmín Ibáñez-García, Hwaran Lee, Shamsuddeen Hassan Muhammad, Kiwoong Park, Anar Sabuhi Rzayev, Nina White, Seid Muhie Yimam, Mohammad Taher Pilehvar, Nedjma Ousidhoum, Jose Camacho-Collados, Alice Oh

    Abstract: Large language models (LLMs) often lack culture-specific knowledge of daily life, especially across diverse regions and non-English languages. Existing benchmarks for evaluating LLMs' cultural sensitivities are limited to a single language or collected from online sources such as Wikipedia, which do not reflect the mundane everyday lifestyles of diverse regions. That is, information about the food… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  4. arXiv:2406.06424  [pdf, other

    cs.CV

    Margin-aware Preference Optimization for Aligning Diffusion Models without Reference

    Authors: Jiwoo Hong, Sayak Paul, Noah Lee, Kashif Rasul, James Thorne, Jongheon Jeong

    Abstract: Modern alignment techniques based on human preferences, such as RLHF and DPO, typically employ divergence regularization relative to the reference model to ensure training stability. However, this often limits the flexibility of models during alignment, especially when there is a clear distributional discrepancy between the preference data and the reference model. In this paper, we focus on the al… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Preprint

  5. arXiv:2406.05761  [pdf, other

    cs.CL

    The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models

    Authors: Seungone Kim, Juyoung Suk, Ji Yong Cho, Shayne Longpre, Chaeeun Kim, Dongkeun Yoon, Guijin Son, Yejin Cho, Sheikh Shafayat, Jinheon Baek, Sue Hyun Park, Hyeonbin Hwang, Jinkyung Jo, Hyowon Cho, Haebin Shin, Seongyun Lee, Hanseok Oh, Noah Lee, Namgyu Ho, Se June Joo, Miyoung Ko, Yoonjoo Lee, Hyungjoo Chae, Jamin Shin, Joel Jang , et al. (7 additional authors not shown)

    Abstract: As language models (LMs) become capable of handling a wide range of tasks, their evaluation is becoming as challenging as their development. Most generation benchmarks currently assess LMs using abstract evaluation criteria like helpfulness and harmlessness, which often lack the flexibility and granularity of human assessment. Additionally, these benchmarks tend to focus disproportionately on spec… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Work in Progress

  6. arXiv:2406.00925  [pdf, other

    hep-th

    Dimers for Type D Relativistic Toda Model

    Authors: Kimyeong Lee, Norton Lee

    Abstract: We construct dimer graphs for type D relativistic Toda models by introducing impurities to the $Y^{2N,0}$ square dimer graphs. By properly placing the impurities and change of canonical variables assigned to the 1-loops on the dimer graph, we introduce the "folding" of the graphs and get the type D relativistic Toda lattice Hamiltonian and monodromy matrix.

    Submitted 23 June, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

    Comments: 25+6 pages, 14 figures, add citation

    Report number: KIAS-P24038, CGP24008

  7. arXiv:2405.08614  [pdf, other

    eess.SP

    FDD Massive MIMO: How to Optimally Combine UL Pilot and Limited DL CSI Feedback?

    Authors: Jungyeon Kim, Jinseok Choi, Jeonghun Park, Ahmed Alkhateeb, Namyoon Lee

    Abstract: In frequency-division duplexing (FDD) multiple-input multiple-output (MIMO) systems, obtaining accurate downlink channel state information (CSI) for precoding is vastly challenging due to the tremendous feedback overhead with the growing number of antennas. Utilizing uplink pilots for downlink CSI estimation is a promising approach that can eliminate CSI feedback. However, the downlink CSI estimat… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 13 pages, 10 figures

  8. arXiv:2404.14276  [pdf, other

    stat.ML cs.LG

    A Bayesian Approach for Prioritising Driving Behaviour Investigations in Telematic Auto Insurance Policies

    Authors: Mark McLeod, Bernardo Perez-Orozco, Nika Lee, Davide Zilli

    Abstract: Automotive insurers increasingly have access to telematic information via black-box recorders installed in the insured vehicle, and wish to identify undesirable behaviour which may signify increased risk or uninsured activities. However, identification of such behaviour with machine learning is non-trivial, and results are far from perfect, requiring human investigation to verify suspected cases.… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: International Congress of Actuaries (2023)

  9. arXiv:2404.09959  [pdf, other

    hep-ph hep-ex

    NNLO QCD corrections to polarized semi-inclusive DIS

    Authors: Saurav Goyal, Roman N. Lee, Sven-Olaf Moch, Vaibhav Pathak, Narayan Rana, V. Ravindran

    Abstract: Polarized semi-inclusive deep-inelastic scattering (SIDIS) is a key process in the quest for a resolution of the proton spin puzzle. We present the complete results for the polarized SIDIS process at next-to-next-to-leading order (NNLO) in perturbative quantum chromodynamics. Our analytical results include all partonic channels for the scattering of polarized leptons off hadrons and a spin-average… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 6 pages, 2 figures; 1 ancillary file

  10. arXiv:2404.03655  [pdf, other

    astro-ph.CO

    Magnetic fields from small-scale primordial perturbations

    Authors: Nanoom Lee, Yacine Ali-Haimoud

    Abstract: Weak magnetic fields must have existed in the early Universe, as they were sourced by the cross product of electron density and temperature gradients through the Biermann-battery mechanism. In this paper we calculate the magnetic fields generated at cosmic dawn by a variety of small-scale primordial perturbations, carefully computing the evolution of electron density and temperature fluctuations,… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: 12 pages, 6 figures

  11. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  12. Generalized Calogero-Moser system and supergroup gauge origami

    Authors: Taro Kimura, Norton Lee

    Abstract: We study the integrability and the Bethe/Gauge correspondence of the Generalized Calogero-Moser system proposed by Berntson, Langmann and Lenells which we call the elliptic quadruple Calogero-Moser system (eqCM). We write down the Dunkl operators which give commuting Hamiltonians of the quantum integrable system. We identify the gauge theory in correspondence is a supergroup version of the gauge o… ▽ More

    Submitted 30 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 28+4 pages. hyperlink fixed, add reference. arXiv admin note: text overlap with arXiv:1908.04928

    Report number: CGP24006

    Journal ref: Nucl.Phys.B1005(2024)116604

  13. arXiv:2403.18932  [pdf, other

    cs.CL cs.AI

    Measuring Political Bias in Large Language Models: What Is Said and How It Is Said

    Authors: Yejin Bang, Delong Chen, Nayeon Lee, Pascale Fung

    Abstract: We propose to measure political bias in LLMs by analyzing both the content and style of their generated content regarding political issues. Existing benchmarks and measures focus on gender and racial biases. However, political bias exists in LLMs and can lead to polarization and other harms in downstream applications. In order to provide transparency to users, we advocate that there should be fine… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 16 pages

  14. arXiv:2403.16372  [pdf, other

    cs.LG cs.DC eess.SP

    SignSGD with Federated Voting

    Authors: Chanho Park, H. Vincent Poor, Namyoon Lee

    Abstract: Distributed learning is commonly used for accelerating model training by harnessing the computational capabilities of multiple-edge devices. However, in practical applications, the communication delay emerges as a bottleneck due to the substantial information exchange required between workers and a central parameter server. SignSGD with majority voting (signSGD-MV) is an effective distributed lear… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  15. arXiv:2403.15692  [pdf, other

    cs.IT eess.SP

    Block Orthogonal Sparse Superposition Codes for $ \sf{L}^3 $ Communications: Low Error Rate, Low Latency, and Low Power Consumption

    Authors: Donghwa Han, Bowhyung Lee, Min Jang, Donghun Lee, Seho Myung, Namyoon Lee

    Abstract: Block orthogonal sparse superposition (BOSS) code is a class of joint coded modulation methods, which can closely achieve the finite-blocklength capacity with a low-complexity decoder at a few coding rates under Gaussian channels. However, for fading channels, the code performance degrades considerably because coded symbols experience different channel fading effects. In this paper, we put forth n… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

  16. arXiv:2403.15042  [pdf, other

    cs.CL

    LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

    Authors: Nicholas Lee, Thanakul Wattanawong, Sehoon Kim, Karttikeya Mangalam, Sheng Shen, Gopala Anumanchipali, Michael W. Mahoney, Kurt Keutzer, Amir Gholami

    Abstract: Pretrained large language models (LLMs) are currently state-of-the-art for solving the vast majority of natural language processing tasks. While many real-world applications still require fine-tuning to reach satisfactory levels of performance, many of them are in the low-data regime, making fine-tuning challenging. To address this, we propose LLM2LLM, a targeted and iterative data augmentation st… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: Our code is available at https://github.com/SqueezeAILab/LLM2LLM

  17. arXiv:2403.11762  [pdf, other

    cs.IT eess.SP

    Full-Duplex MU-MIMO Systems with Coarse Quantization: How Many Bits Do We Need?

    Authors: Seunghyeong Yoo, Seokjun Park, Mintaek Oh, Namyoon Lee, Jinseok Choi

    Abstract: This paper investigates full-duplex (FD) multi-user multiple-input multiple-output (MU-MIMO) system design with coarse quantization. We first analyze the impact of self-interference (SI) on quantization in FD single-input single-output systems. The analysis elucidates that the minimum required number of analog-to-digital converter (ADC) bits is logarithmically proportional to the ratio of total re… ▽ More

    Submitted 18 March, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  18. arXiv:2403.11096  [pdf, other

    eess.SP

    Modeling and Coverage Analysis of K-Tier Integrated Satellite-Terrestrial Downlink Networks

    Authors: Jungbin Yim, Jeonghun Park, Namyoon Lee

    Abstract: Integrated satellite-terrestrial networks (ISTNs) can significantly expand network coverage while diminishing reliance on terrestrial infrastructure. Despite the enticing potential of ISTNs, there is no comprehensive mathematical performance analysis framework for these emerging networks. In this paper, we introduce a tractable approach to analyze the downlink coverage performance of multi-tier IS… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: 13 pages, 9 figures

  19. arXiv:2403.11094  [pdf, other

    eess.SP

    Nonlinear Self-Interference Cancellation With Learnable Orthonormal Polynomials for Full-Duplex Wireless Systems

    Authors: Hyowon Lee, Jungyeon Kim, Geon Choi, Ian P. Roberts, Jinseok Choi, Namyoon Lee

    Abstract: Nonlinear self-interference cancellation (SIC) is essential for full-duplex communication systems, which can offer twice the spectral efficiency of traditional half-duplex systems. The challenge of nonlinear SIC is similar to the classic problem of system identification in adaptive filter theory, whose crux lies in identifying the optimal nonlinear basis functions for a nonlinear system. This beco… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: 13 pages, total 16 figures

  20. arXiv:2403.07821  [pdf, other

    cs.SE

    Augmenting Interpolation-Based Model Checking with Auxiliary Invariants (Extended Version)

    Authors: Dirk Beyer, Po-Chun Chien, Nian-Ze Lee

    Abstract: Software model checking is a challenging problem, and generating relevant invariants is a key factor in proving the safety properties of a program. Program invariants can be obtained by various approaches, including lightweight procedures based on data-flow analysis and intensive techniques using Craig interpolation. Although data-flow analysis runs efficiently, it often produces invariants that a… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  21. arXiv:2403.07691  [pdf, other

    cs.CL cs.AI

    ORPO: Monolithic Preference Optimization without Reference Model

    Authors: Jiwoo Hong, Noah Lee, James Thorne

    Abstract: While recent preference alignment algorithms for language models have demonstrated promising results, supervised fine-tuning (SFT) remains imperative for achieving successful convergence. In this paper, we study the crucial role of SFT within the context of preference alignment, emphasizing that a minor penalty for the disfavored generation style is sufficient for preference-aligned SFT. Building… ▽ More

    Submitted 14 March, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: Preprint

  22. arXiv:2403.05389  [pdf, other

    physics.chem-ph

    Multi-reference coupled cluster theory using the normal ordered exponential ansatz

    Authors: Alexander Gunasekera, Nicholas Lee, David P. Tew

    Abstract: Properly spin-adapted coupled-cluster theory for general open-shell configurations remains an elusive goal in electronic structure theory. In this contribution we examine Lindgren's normal-ordered exponential ansatz using spin-free excitation operators, with the aid of automatic equation generation software. We present a size-extensive reformulation of the unlinked working equations, and analyse t… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 9 pages, 2 figures

  23. arXiv:2402.13889  [pdf, other

    hep-th math-ph math.DG math.QA nlin.SI

    Bispectral duality and separation of variables from surface defect transition

    Authors: Saebyeok Jeong, Norton Lee

    Abstract: We study two types of surface observables $-$ the $\mathbf{Q}$-observables and the $\mathbf{H}$-observables $-$ of the 4d $\mathcal{N}=2$ $A_1$-quiver $U(N)$ gauge theory obtained by coupling a 2d $\mathcal{N}=(2,2)$ gauged linear sigma model. We demonstrate that the transition between the two surface defects manifests as a Fourier transformation between the surface observables. Utilizing the resu… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 62+11 pages; 10 figures

    Report number: CERN-TH-2024-024, CGP24003

  24. arXiv:2402.13888  [pdf, other

    hep-th math-ph math.DG math.QA nlin.SI

    di-Langlands correspondence and extended observables

    Authors: Saebyeok Jeong, Norton Lee, Nikita Nekrasov

    Abstract: We explore the $\textit{difference Langlands correspondence}$ using the four dimensional ${\mathcal{N}}=2$ super-QCD. Surface defects and surface observables play the crucial role. As an application, we give the first construction of the full set of quantum integrals, i.e. commuting differential operators, such that the partition function of the so-called regular monodromy surface defect is their… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 50+11 pages

    Report number: CERN-TH-2023-220, CGP24002

  25. arXiv:2402.09903  [pdf, ps, other

    math.CO

    Enumeration of multiplex juggling card sequences using generalized q-derivatives

    Authors: Yumin Cho, Jaehyun Kim, Jang Soo Kim, Nakyung Lee

    Abstract: In 2019, Butler, Choi, Kim, and Seo introduced a new type of juggling card that represents multiplex juggling patterns in a natural bijective way. They conjectured a formula for the generating function for the number of multiplex juggling cards with capacity 2. In this paper we prove their conjecture. More generally, we find an explicit formula for the generating function with any capacity. We als… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 17 pages, 4 figures

  26. arXiv:2402.09155  [pdf, ps, other

    eess.SP cs.IT

    Joint and Robust Beamforming Framework for Integrated Sensing and Communication Systems

    Authors: Jinseok Choi, Jeonghun Park, Namyoon Lee, Ahmed Alkhateeb

    Abstract: Integrated sensing and communication (ISAC) is widely recognized as a fundamental enabler for future wireless communications. In this paper, we present a joint communication and radar beamforming framework for maximizing a sum spectral efficiency (SE) while guaranteeing desired radar performance with imperfect channel state information (CSI) in multi-user and multi-target ISAC systems. To this end… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: submitted for possible IEEE publication

  27. arXiv:2402.08858  [pdf, other

    physics.chem-ph physics.comp-ph quant-ph

    Spin-coupled molecular orbitals: chemical intuition meets quantum chemistry

    Authors: Daniel Marti-Dafcik, Nicholas Lee, Hugh G. A. Burton, David P. Tew

    Abstract: Molecular orbital theory is powerful both as a conceptual tool for understanding chemical bonding, and as a theoretical framework for ab initio quantum chemistry. Despite its undoubted success, MO theory has well documented shortcomings, most notably that it fails to correctly describe diradical states and homolytic bond fission. In this contribution, we introduce a generalised MO theory that incl… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 11 pages, 5 figures

  28. arXiv:2402.07381  [pdf, other

    cs.IT

    RIS-Empowered LEO Satellite Networks for 6G: Promising Usage Scenarios and Future Directions

    Authors: Mesut Toka, Byungju Lee, Jaehyup Seong, Aryan Kaushik, Juhwan Lee, Jungwoo Lee, Namyoon Lee, Wonjae Shin, H. Vincent Poor

    Abstract: Low-Earth orbit (LEO) satellite systems have been deemed a promising key enabler for current 5G and the forthcoming 6G wireless networks. Such LEO satellite constellations can provide worldwide three-dimensional coverage, high data rate, and scalability, thus enabling truly ubiquitous connectivity. On the other hand, another promising technology, reconfigurable intelligent surfaces (RISs), has eme… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

    Comments: 18 pages, 5 figures, Paper accepted by IEEE Communications Magazine

  29. arXiv:2402.04248  [pdf, other

    cs.LG

    Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks

    Authors: Jongho Park, Jaeseung Park, Zheyang Xiong, Nayoung Lee, Jaewoong Cho, Samet Oymak, Kangwook Lee, Dimitris Papailiopoulos

    Abstract: State-space models (SSMs), such as Mamba (Gu & Dao, 2023), have been proposed as alternatives to Transformer networks in language modeling, by incorporating gating, convolutions, and input-dependent token selection to mitigate the quadratic cost of multi-head attention. Although SSMs exhibit competitive performance, their in-context learning (ICL) capabilities, a remarkable emergent property of mo… ▽ More

    Submitted 25 April, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: Changes in v2: experiments on formal language ICL and explorations of width vs. depth on ICL; code repo available (24 pages, 10 figures)

  30. arXiv:2402.01340  [pdf, ps, other

    cs.LG cs.CR eess.SP

    SignSGD with Federated Defense: Harnessing Adversarial Attacks through Gradient Sign Decoding

    Authors: Chanho Park, Namyoon Lee

    Abstract: Distributed learning is an effective approach to accelerate model training using multiple workers. However, substantial communication delays emerge between workers and a parameter server due to massive costs associated with communicating gradients. SignSGD with majority voting (signSGD-MV) is a simple yet effective optimizer that reduces communication costs through one-bit quantization, yet the co… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  31. arXiv:2401.05193  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Experiment Planning with Function Approximation

    Authors: Aldo Pacchiano, Jonathan N. Lee, Emma Brunskill

    Abstract: We study the problem of experiment planning with function approximation in contextual bandit problems. In settings where there is a significant overhead to deploying adaptive algorithms -- for example, when the execution of the data collection policies is required to be distributed, or a human in the loop is needed to implement these policies -- producing in advance a set of policies for data coll… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: 10 pages main

  32. arXiv:2401.04724  [pdf, other

    quant-ph physics.app-ph

    A parametrically programmable delay line for microwave photons

    Authors: Takuma Makihara, Nathan Lee, Yudan Guo, Wenyan Guan, Amir H. Safavi-Naeini

    Abstract: Delay lines capable of storing quantum information are crucial for advancing quantum repeaters and hardware efficient quantum computers. Traditionally, they are physically realized as extended systems that support wave propagation, such as waveguides. But such delay lines typically provide limited control over the propagating fields. Here, we introduce a parametrically addressed delay line (PADL)… ▽ More

    Submitted 11 January, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

    Comments: 13 pages, 9 figures; v2: minor update of references

  33. arXiv:2312.13289  [pdf, other

    cond-mat.mtrl-sci cs.LG

    Stoichiometry Representation Learning with Polymorphic Crystal Structures

    Authors: Namkyeong Lee, Heewoong Noh, Gyoung S. Na, Tianfan Fu, Jimeng Sun, Chanyoung Park

    Abstract: Despite the recent success of machine learning (ML) in materials science, its success heavily relies on the structural description of crystal, which is itself computationally demanding and occasionally unattainable. Stoichiometry descriptors can be an alternative approach, which reveals the ratio between elements involved to form a certain compound without any structural information. However, it i… ▽ More

    Submitted 17 November, 2023; originally announced December 2023.

    Comments: NeurIPS 2023 AI4Science Workshop

  34. arXiv:2312.13133  [pdf, other

    hep-th

    New dimer integrable systems and defects in five dimensional gauge theory

    Authors: Norton Lee

    Abstract: We study the relation between the quantum integrable systems derived from the dimer graphs and five dimensional $\mathcal{N}=1$ supersymmetric gauge theories on $S^1 \times \mathbb{R}^4$. We construct integrable systems based on new dimer graphs obtained from modification of hexagon dimer diagram. We study the gauge theories in correspondence to the newly proposed integrable systems. By examining… ▽ More

    Submitted 16 June, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: 45+13 pages, 12 figures, correct typos, add citation

    Report number: CGP-23022

  35. arXiv:2312.06985  [pdf, ps, other

    eess.SP

    Ergodic Secrecy Rate Analysis for LEO Satellite Downlink Networks

    Authors: Daeun Kim, Namyoon Lee

    Abstract: Satellite networks are recognized as an effective solution to ensure seamless connectivity worldwide, catering to a diverse range of applications. However, the broad coverage and broadcasting nature of satellite networks also expose them to security challenges. Despite these challenges, there is a lack of analytical understanding addressing the secrecy performance of these networks. This paper pre… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  36. arXiv:2312.04511  [pdf, other

    cs.CL

    An LLM Compiler for Parallel Function Calling

    Authors: Sehoon Kim, Suhong Moon, Ryan Tabrizi, Nicholas Lee, Michael W. Mahoney, Kurt Keutzer, Amir Gholami

    Abstract: The reasoning capabilities of the recent LLMs enable them to execute external function calls to overcome their inherent limitations, such as knowledge cutoffs, poor arithmetic skills, or lack of access to private data. This development has allowed LLMs to select and coordinate multiple functions based on the context to tackle more complex problems. However, current methods for function calling oft… ▽ More

    Submitted 4 June, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: ICML 2024

  37. arXiv:2312.03901  [pdf, other

    cs.CY

    Redrawing the 2012 map of the Maryland congressional districts

    Authors: Noah Lee, Hyunwoo Park, Sangho Shim

    Abstract: Gerrymandering is the practice of drawing biased electoral maps that manipulate the voter population to gain an advantage. The most recent time gerrymandering became an issue was 2019 when the U.S. Federal Supreme Court decided that the court does not have the authority to dictate how to draw the district map and state legislators are the ones who should come up with an electoral district plan. We… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: 8 pages, to be submitted to IISE 2024 Annual Conference Proceedings

    MSC Class: 90

  38. arXiv:2312.03684  [pdf

    cond-mat.str-el cond-mat.mes-hall

    Spontaneous Chirality Flipping in an Orthogonal Spin-Charge Ordered Topological Magnet

    Authors: H. Miao, J. Bouaziz, G. Fabbris, W. R. Meier, F. Z. Yang, H. X. Li, C. Nelson, E. Vescovo, S. Zhang, A. Christianson, H. N. Lee, Y. Zhang, C. D. Batista, S. Blügel

    Abstract: The asymmetric distribution of chiral objects with opposite chirality is of great fundamental interests ranging from molecular biology to particle physics. In quantum materials, chiral states can build on inversion-symmetry-breaking lattice structures or emerge from spontaneous magnetic ordering induced by competing interactions. Although the handedness of a chiral state can be changed through ext… ▽ More

    Submitted 19 February, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: Supplementary materials are available from the corresponding author upon request

  39. arXiv:2312.03055  [pdf, other

    astro-ph.HE

    Front-row seat of the recent R Aqr periastron passage: X-ray multi-epoch spectral and spatial analysis

    Authors: A. Sacchi, M. Karovska, J. Raymond, V. Kashyap, T. J. Gaetz, W. Hack, J. Kennea, N. Lee, A. J Mioduszewski, M. J Claussen

    Abstract: We report on the X-ray spectral and spatial evolution of the Symbiotic star R Aqr. Through a multi-epoch observational campaign performed with Chandra between 2017 and 2022, we study the X-ray emission of this binary system, composed of an evolved red giant star and a white dwarf (WD). This analysis is particularly timely as the WD approached the periastron in late 2018/early 2019, thus mass trans… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: 14 pages, 9 figures, 3 tables. Accepted for publication in ApJ

  40. arXiv:2311.18172  [pdf, other

    cs.IT eess.SP

    Multi-Rate Variable-Length CSI Compression for FDD Massive MIMO

    Authors: Bumsu Park, Heedong Do, Namyoon Lee

    Abstract: For frequency-division-duplexing (FDD) systems, channel state information (CSI) should be fed back from the user terminal to the base station. This feedback overhead becomes problematic as the number of antennas grows. To alleviate this issue, we propose a flexible CSI compression method using variational autoencoder (VAE) with an entropy bottleneck structure, which can support multi-rate and vari… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  41. arXiv:2311.17539  [pdf, other

    cs.LG math.OC stat.ML

    Critical Influence of Overparameterization on Sharpness-aware Minimization

    Authors: Sungbin Shin, Dongyeop Lee, Maksym Andriushchenko, Namhoon Lee

    Abstract: Training an overparameterized neural network can yield minimizers of different generalization capabilities despite the same level of training loss. Meanwhile, with evidence that suggests a strong correlation between the sharpness of minima and their generalization errors, increasing efforts have been made to develop optimization methods to explicitly find flat minima as more generalizable solution… ▽ More

    Submitted 19 June, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

  42. arXiv:2311.13807  [pdf, other

    astro-ph.SR astro-ph.GA astro-ph.IM

    rrlfe: Software for Generating and Applying Metallicity Calibrations for RR Lyrae Variable Stars Across a Wide Range of Phases and Temperatures

    Authors: Eckhart Spalding, Ronald Wilhelm, Nathan De Lee, Stacy Long, Timothy C. Beers, Vinicius M. Placco, John Kielkopf, Young Sun Lee, Joshua Pepper, Kenneth Carrell

    Abstract: RR Lyrae stars play a central role in tracing phase-space structures within the Milky Way because they are easy to identify, are relatively luminous, and are found in large numbers in the Galactic bulge, disk, and halo. In this work, we present a new set of spectroscopic metallicity calibrations that use the equivalent widths of the Ca II K and Balmer H-gamma and H-delta lines to calculate metalli… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

    Comments: Published

    Journal ref: Monthly Notices of the Royal Astronomical Society, vol. 527, issue 1, January 2024, p. 828

  43. arXiv:2311.12856  [pdf, other

    cond-mat.mtrl-sci cs.AI cs.LG

    Density of States Prediction of Crystalline Materials via Prompt-guided Multi-Modal Transformer

    Authors: Namkyeong Lee, Heewoong Noh, Sungwon Kim, Dongmin Hyun, Gyoung S. Na, Chanyoung Park

    Abstract: The density of states (DOS) is a spectral property of crystalline materials, which provides fundamental insights into various characteristics of the materials. While previous works mainly focus on obtaining high-quality representations of crystalline materials for DOS prediction, we focus on predicting the DOS from the obtained representations by reflecting the nature of DOS: DOS determines the ge… ▽ More

    Submitted 22 November, 2023; v1 submitted 24 October, 2023; originally announced November 2023.

    Comments: NeurIPS 2023. arXiv admin note: text overlap with arXiv:2303.07000

  44. arXiv:2311.05860  [pdf, other

    hep-ph physics.atom-ph

    $\mathcal{O}\left(mα^2 (Zα)^6\right)$ contribution to Lamb shift from radiative corrections to the Wichmann-Kroll potential

    Authors: Petr A. Krachkov, Roman N. Lee

    Abstract: We derive an analytical expression for the contribution of the order $mα^2 (Zα)^6$ to the hydrogen Lamb shift which comes from the diagrams for radiative corrections to the Wichmann-Kroll potential. We use modern methods of multiloop calculations, based on IBP reduction, DRA method and differential equations.

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: 9 pages

  45. ezBIDS: Guided standardization of neuroimaging data interoperable with major data archives and platforms

    Authors: Daniel Levitas, Soichi Hayashi, Sophia Vinci-Booher, Anibal Heinsfeld, Dheeraj Bhatia, Nicholas Lee, Anthony Galassi, Guiomar Niso, Franco Pestilli

    Abstract: Data standardization has become one of the leading methods neuroimaging researchers rely on for data sharing and reproducibility. Data standardization promotes a common framework through which researchers can utilize others' data. Yet, as of today, formatting datasets that adhere to community best practices requires technical expertise involving coding and considerable knowledge of file formats an… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

  46. arXiv:2311.03285  [pdf, other

    cs.LG cs.AI cs.DC

    S-LoRA: Serving Thousands of Concurrent LoRA Adapters

    Authors: Ying Sheng, Shiyi Cao, Dacheng Li, Coleman Hooper, Nicholas Lee, Shuo Yang, Christopher Chou, Banghua Zhu, Lianmin Zheng, Kurt Keutzer, Joseph E. Gonzalez, Ion Stoica

    Abstract: The "pretrain-then-finetune" paradigm is commonly adopted in the deployment of large language models. Low-Rank Adaptation (LoRA), a parameter-efficient fine-tuning method, is often employed to adapt a base model to a multitude of tasks, resulting in a substantial collection of LoRA adapters derived from one base model. We observe that this paradigm presents significant opportunities for batched in… ▽ More

    Submitted 5 June, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

  47. arXiv:2311.02236  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Robust Fine-Tuning of Vision-Language Models for Domain Generalization

    Authors: Kevin Vogt-Lowell, Noah Lee, Theodoros Tsiligkaridis, Marc Vaillant

    Abstract: Transfer learning enables the sharing of common knowledge among models for a variety of downstream tasks, but traditional methods suffer in limited training data settings and produce narrow models incapable of effectively generalizing under distribution shifts. Foundation models have recently demonstrated impressive zero-shot inference capabilities and robustness under distribution shifts. However… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: In proceedings of the 27th IEEE High Performance Extreme Computing Conference

  48. arXiv:2311.01817  [pdf, other

    cs.CL

    Mitigating Framing Bias with Polarity Minimization Loss

    Authors: Yejin Bang, Nayeon Lee, Pascale Fung

    Abstract: Framing bias plays a significant role in exacerbating political polarization by distorting the perception of actual events. Media outlets with divergent political stances often use polarized language in their reporting of the same event. We propose a new loss function that encourages the model to minimize the polarity difference between the polarized input articles to reduce framing bias. Specific… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: 11 pages, EMNLP2023

  49. arXiv:2310.07101  [pdf, other

    cs.IT eess.SP

    Hybrid Arrays: How Many RF Chains Are Required to Prevent Beam Squint?

    Authors: Heedong Do, Namyoon Lee, Robert W. Heath Jr, Angel Lozano

    Abstract: With increasing frequencies, bandwidths, and array apertures, the phenomenon of beam squint arises as a serious impairment to beamforming. Fully digital arrays with true time delay per antenna element are a potential solution, but they require downconversion at each element. This paper shows that hybrid arrays can perform essentially as well as digital arrays once the number of radio-frequency cha… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

  50. arXiv:2310.06271  [pdf, other

    cs.CL cs.AI

    Towards Mitigating Hallucination in Large Language Models via Self-Reflection

    Authors: Ziwei Ji, Tiezheng Yu, Yan Xu, Nayeon Lee, Etsuko Ishii, Pascale Fung

    Abstract: Large language models (LLMs) have shown promise for generative and knowledge-intensive tasks including question-answering (QA) tasks. However, the practical deployment still faces challenges, notably the issue of "hallucination", where models generate plausible-sounding but unfaithful or nonsensical information. This issue becomes particularly critical in the medical domain due to the uncommon pro… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: Accepted by the findings of EMNLP 2023