Showing 1–50 of 642 results for author: Han, D

  1. arXiv:2407.09360  [pdf, other]

    cs.LG math.OC

    Novel clustered federated learning based on local loss

    Authors: Endong Gu, Yongxin Chen, Hao Wen, Xingju Cai, Deren Han

    Abstract: This paper proposes LCFL, a novel clustering metric for evaluating clients' data distributions in federated learning. LCFL aligns with federated learning requirements, accurately assessing client-to-client variations in data distribution. It offers advantages over existing clustered federated learning methods, addressing privacy concerns, improving applicability to non-convex models, and providing…

    Submitted 12 July, 2024; originally announced July 2024.

  2. arXiv:2407.05045  [pdf, other]

    cs.CV

    Robust Skin Color Driven Privacy Preserving Face Recognition via Function Secret Sharing

    Authors: Dong Han, Yufan Jiang, Yong Li, Ricardo Mendes, Joachim Denzler

    Abstract: In this work, we leverage the pure skin color patch from the face image as the additional information to train an auxiliary skin color feature extractor and face recognition model in parallel to improve performance of state-of-the-art (SOTA) privacy-preserving face recognition (PPFR) systems. Our solution is robust against black-box attacking and well-established generative adversarial network (GA…

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: Accepted at ICIP2024

  3. arXiv:2406.17226  [pdf, other]

    math.OC

    Extended alternating structure-adapted proximal gradient algorithm for nonconvex nonsmooth problems

    Authors: Ying Gao, Chunfeng Cui, Wenxing Zhang, Deren Han

    Abstract: Alternating structure-adapted proximal (ASAP) gradient algorithm (M. Nikolova and P. Tan, SIAM J Optim, 29:2053-2078, 2019) has drawn much attention due to its efficiency in solving nonconvex nonsmooth optimization problems. However, the multiblock nonseparable structure confines the performance of ASAP to far-reaching practical problems, e.g., coupled tensor decomposition. In this paper, we propo…

    Submitted 24 June, 2024; originally announced June 2024.

  4. arXiv:2406.08777  [pdf, other]

    math.AP

    Finite Time Blowup of Integer- and Fractional-Order Time-Delayed Diffusion Equations

    Authors: Christopher N. Angstmann, Stuart-James M. Burney, Daniel S. Han, Bruce I. Henry, Boris Z. Huang, Zhuang Xu

    Abstract: In this work, exact solutions are derived for an integer- and fractional-order time-delayed diffusion equation with arbitrary initial conditions. The solutions are obtained using Fourier transform methods in conjunction with the known properties of delay functions. It is observed that the solutions do not exhibit infinite speed of propagation for smooth initial conditions that are bounded and posi…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 17 pages, 2 figures

    MSC Class: 35R25; 35C10; 34K06; 34K37; 33E20; 42A38

  5. arXiv:2406.01349  [pdf, other]

    cs.CV

    Unleashing Generalization of End-to-End Autonomous Driving with Controllable Long Video Generation

    Authors: Enhui Ma, Lijun Zhou, Tao Tang, Zhan Zhang, Dong Han, Junpeng Jiang, Kun Zhan, Peng Jia, Xianpeng Lang, Haiyang Sun, Di Lin, Kaicheng Yu

    Abstract: Using generative models to synthesize new data has become a de-facto standard in autonomous driving to address the data scarcity issue. Though existing approaches are able to boost perception models, we discover that these approaches fail to improve the performance of planning of end-to-end autonomous driving models as the generated videos are usually less than 8 frames and the spatial and tempora…

    Submitted 6 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: Project Page: https://westlake-autolab.github.io/delphi.github.io/, 8 figures

  6. arXiv:2406.00988  [pdf, other]

    cs.AR

    ADE-HGNN: Accelerating HGNNs through Attention Disparity Exploitation

    Authors: Dengke Han, Meng Wu, Runzhen Xue, Mingyu Yan, Xiaochun Ye, Dongrui Fan

    Abstract: Heterogeneous Graph Neural Networks (HGNNs) have recently demonstrated great power in handling heterogeneous graph data, rendering them widely applied in many critical real-world domains. Most HGNN models leverage attention mechanisms to significantly improve model accuracy, albeit at the cost of increased computational complexity and memory bandwidth requirements. Fortunately, the attention dispar…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 15 pages, 9 figures, accepted by Euro-PAR 2024

  7. arXiv:2406.00897  [pdf, other]

    math.AP

    Exact Solutions of a Time-Delay Advection Equation and a Fractional Time-Delay Advection Equation

    Authors: Christopher N. Angstmann, Stuart-James M. Burney, Daniel S. Han, Bruce I. Henry, Boris Z. Huang, Zhuang Xu

    Abstract: Exact solutions are derived for a time-delay advection equation and a fractional-order time-delay advection equation with a time-delay in the spatial derivative. Solutions are obtained, for arbitrary separable initial conditions, by incorporating recently introduced delay functions in a separation of variables approach. Examples are provided showing oscillatory and translatory behaviours fundament…

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: Letter

    MSC Class: 35C10; 35F10; 34K06; 42A38; 33E20

  8. arXiv:2405.19044  [pdf, ps, other]

    math.NA

    On adaptive stochastic extended iterative methods for solving least squares

    Authors: Yun Zeng, Deren Han, Yansheng Su, Jiaxin Xie

    Abstract: In this paper, we propose a novel adaptive stochastic extended iterative method, which can be viewed as an improved extension of the randomized extended Kaczmarz (REK) method, for finding the unique minimum Euclidean norm least-squares solution of a given linear system. In particular, we introduce three equivalent stochastic reformulations of the linear least-squares problem: stochastic unconstrai…

    Submitted 29 May, 2024; originally announced May 2024.

  9. arXiv:2405.16605  [pdf, other]

    cs.CV

    Demystify Mamba in Vision: A Linear Attention Perspective

    Authors: Dongchen Han, Ziyi Wang, Zhuofan Xia, Yizeng Han, Yifan Pu, Chunjiang Ge, Jun Song, Shiji Song, Bo Zheng, Gao Huang

    Abstract: Mamba is an effective state space model with linear computation complexity. It has recently shown impressive efficiency in dealing with high-resolution inputs across various vision tasks. In this paper, we reveal that the powerful Mamba model shares surprising similarities with linear attention Transformer, which typically underperforms conventional Transformer in practice. By exploring the similar…

    Submitted 26 May, 2024; originally announced May 2024.

  10. arXiv:2405.14745  [pdf, other]

    cs.LG

    AnyLoss: Transforming Classification Metrics into Loss Functions

    Authors: Doheon Han, Nuno Moniz, Nitesh V Chawla

    Abstract: Many evaluation metrics can be used to assess the performance of models in binary classification tasks. However, most of them are derived from a confusion matrix in a non-differentiable form, making it very difficult to generate a differentiable loss function that could directly optimize them. The lack of solutions to bridge this challenge not only hinders our ability to solve difficult tasks, suc…

    Submitted 23 May, 2024; originally announced May 2024.

  11. arXiv:2405.14362  [pdf, other]

    cs.NE

    Advancing Spiking Neural Networks for Sequential Modeling with Central Pattern Generators

    Authors: Changze Lv, Dongqi Han, Yansen Wang, Xiaoqing Zheng, Xuanjing Huang, Dongsheng Li

    Abstract: Spiking neural networks (SNNs) represent a promising approach to developing artificial neural networks that are both energy-efficient and biologically plausible. However, applying SNNs to sequential tasks, such as text classification and time-series forecasting, has been hindered by the challenge of creating an effective and hardware-friendly spike-form positional encoding (PE) strategy. Drawing i…

    Submitted 23 May, 2024; originally announced May 2024.

  12. arXiv:2405.13549  [pdf, other]

    eess.SP cs.IT

    Multi-Objective Optimization-Based Waveform Design for Multi-User and Multi-Target MIMO-ISAC Systems

    Authors: Peng Wang, Dongsheng Han, Yashuai Cao, Wanli Ni, Dusit Niyato

    Abstract: Integrated sensing and communication (ISAC) opens up new service possibilities for sixth-generation (6G) systems, where both communication and sensing (C&S) functionalities co-exist by sharing the same hardware platform and radio resource. In this paper, we investigate the waveform design problem in a downlink multi-user and multi-target ISAC system under different C&S performance preferences. The…

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 13 pages, submitted to IEEE TWC

  13. arXiv:2405.05947  [pdf, other]

    cs.HC

    A Survey on Visualization Approaches in Political Science for Social and Political Factors: Progress to Date and Future Opportunities

    Authors: Dongyun Han, Abdullah-Al-Raihan Nayeem, Jason Windett, Yaoyao Dai, Benjamin Radford, Isaac Cho

    Abstract: Politics is the set of activities related to strategic decision-making in groups. Political scientists study the strategic interactions between states, institutions, politicians, and citizens; they seek to understand the causes and consequences of those decisions and interactions. While some decisions might alleviate social problems, others might lead to disasters such as war and conflict. Data vi…

    Submitted 9 May, 2024; originally announced May 2024.

  14. arXiv:2405.04091  [pdf, ps, other]

    math.NA

    Randomized iterative methods for generalized absolute value equations: Solvability and error bounds

    Authors: Jiaxin Xie, Houduo Qi, Deren Han

    Abstract: Randomized iterative methods, such as the Kaczmarz method and its variants, have gained growing attention due to their simplicity and efficiency in solving large-scale linear systems. Meanwhile, absolute value equations (AVE) have attracted increasing interest due to their connection with the linear complementarity problem. In this paper, we investigate the application of randomized iterative meth…

    Submitted 8 May, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  15. arXiv:2405.02760  [pdf, other]

    cs.CE cs.SI

    GTFS2STN: Analyzing GTFS Transit Data by Generating Spatiotemporal Transit Network

    Authors: Diyi Liu, Jing Guo, Yangsong Gu, Meredith King, Lee D. Han, Candace Brakewood

    Abstract: GTFS, the General Transit Feed Specification, is an open standard format to record transit information used by thousands of transit agencies across the world. By converting a static GTFS transit network to a spatiotemporal network connecting bus stops over space and time, a preliminary tool named GTFS2STN is implemented to analyze the accessibility of the transit system. Furthermore, a simple app…

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: 8 pages, 8 figures

  16. arXiv:2405.01113  [pdf, other]

    cs.CV cs.AI eess.IV

    Domain-Transferred Synthetic Data Generation for Improving Monocular Depth Estimation

    Authors: Seungyeop Lee, Knut Peterson, Solmaz Arezoomandan, Bill Cai, Peihan Li, Lifeng Zhou, David Han

    Abstract: A major obstacle to the development of effective monocular depth estimation algorithms is the difficulty in obtaining high-quality depth data that corresponds to collected RGB images. Collecting this data is time-consuming and costly, and even data collected by modern sensors has limited range or resolution, and is subject to inconsistencies and noise. To combat this, we propose a method of data g…

    Submitted 2 May, 2024; originally announced May 2024.

  17. arXiv:2404.18560  [pdf, other]

    math.OC cs.RO

    Non-convex Pose Graph Optimization in SLAM via Proximal Linearized Riemannian ADMM

    Authors: Xin Chen, Chunfeng Cui, Deren Han, Liqun Qi

    Abstract: Pose graph optimization (PGO) is a well-known technique for solving the pose-based simultaneous localization and mapping (SLAM) problem. In this paper, we represent the rotation and translation by a unit quaternion and a three-dimensional vector, and propose a new PGO model based on the von Mises-Fisher distribution. The constraints derived from the unit quaternions are spherical manifolds, and th…

    Submitted 29 April, 2024; originally announced April 2024.

  18. arXiv:2404.17955  [pdf, other]

    cs.SE

    A Survey of Third-Party Library Security Research in Application Software

    Authors: Jia Zeng, Dan Han, Yaling Zhu, Yangzhong Wang, Fangchen Weng

    Abstract: In the current software development environment, third-party libraries play a crucial role. They provide developers with rich functionality and convenient solutions, speeding up the pace and efficiency of software development. However, with the widespread use of third-party libraries, associated security risks and potential vulnerabilities are increasingly apparent. Malicious attackers can exploit…

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: 21 pages, 3 figures, one table

  19. arXiv:2404.17507  [pdf, other]

    cs.CV

    HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts

    Authors: Wonjae Kim, Sanghyuk Chun, Taekyung Kim, Dongyoon Han, Sangdoo Yun

    Abstract: In an era where the volume of data drives the effectiveness of self-supervised learning, the specificity and clarity of data semantics play a crucial role in model training. Addressing this, we introduce HYPerbolic Entailment filtering (HYPE), a novel methodology designed to meticulously extract modality-wise meaningful and well-aligned data from extensive, noisy image-text pair datasets. Our appr…

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: 28 pages, 4.5MB

  20. arXiv:2404.17281  [pdf]

    physics.optics

    Topological polarization singularities induced by the non-Hermitian Dirac points

    Authors: Jun Wang, Jie Liu, Peng Hu, Qiao Jiang, Dezhuan Han

    Abstract: A Dirac point in the Hermitian photonic system will split into a pair of exceptional points (EPs) or even spawn a ring of EPs if non-Hermiticity is involved. Here, we present a new type of non-Hermitian Dirac point which is situated in the complex plane of eigenfrequency. When there is differential loss, the Dirac point exhibits a dual behavior: it not only splits into a pair of EPs with opposite…

    Submitted 26 April, 2024; originally announced April 2024.

  21. arXiv:2404.16659  [pdf, other]

    cs.CL cs.AI

    ProbGate at EHRSQL 2024: Enhancing SQL Query Generation Accuracy through Probabilistic Threshold Filtering and Error Handling

    Authors: Sangryul Kim, Donghee Han, Sehyun Kim

    Abstract: Recently, deep learning-based language models have significantly enhanced text-to-SQL tasks, with promising applications in retrieving patient records within the medical domain. One notable challenge in such applications is discerning unanswerable queries. Through fine-tuning model, we demonstrate the feasibility of converting medical record inquiries into SQL queries. Additionally, we introduce a…

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: The 6th Clinical Natural Language Processing Workshop at NAACL 2024. Code is available at https://github.com/venzino-han/probgate_ehrsql

  22. arXiv:2404.16425  [pdf, other]

    astro-ph.HE

    Soft X-ray prompt emission from a high-redshift gamma-ray burst EP240315a

    Authors: Y. Liu, H. Sun, D. Xu, D. S. Svinkin, J. Delaunay, N. R. Tanvir, H. Gao, C. Zhang, Y. Chen, X. -F. Wu, B. Zhang, W. Yuan, J. An, G. Bruni, D. D. Frederiks, G. Ghirlanda, J. -W. Hu, A. Li, C. -K. Li, J. -D. Li, D. B. Malesani, L. Piro, G. Raman, R. Ricci, E. Troja, et al. (170 additional authors not shown)

    Abstract: Long gamma-ray bursts (GRBs) are believed to originate from core collapse of massive stars. High-redshift GRBs can probe the star formation and reionization history of the early universe, but their detection remains rare. Here we report the detection of a GRB triggered in the 0.5--4 keV band by the Wide-field X-ray Telescope (WXT) on board the Einstein Probe (EP) mission, designated as EP240315a,…

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 41 pages, 8 figures, 7 tables

  23. arXiv:2404.16263  [pdf, other]

    astro-ph.HE

    New Timing Results of MSPs from NICER Observations

    Authors: Shijie Zheng, Dawei Han, Heng Xu, Kejia Lee, Jianping Yuan, Haoxi Wang, Mingyu Ge, Liang Zhang, Yongye Li, Yitao Yin, Xiang Ma, Yong Chen, Shuangnan Zhang

    Abstract: Millisecond pulsars (MSPs) are known for their long-term stability. Using six years of observations from the Neutron Star Interior Composition Explorer (NICER), we have conducted an in-depth analysis of the X-ray timing results for six MSPs: PSRs B1937+21, B1821$-$24, J0437$-$4715, J0030+0451, J0218+4232, and J2124$-$3358. The timing stability parameter $σ_z$ has been calculated, revealing remarka…

    Submitted 24 April, 2024; originally announced April 2024.

  24. arXiv:2404.14285  [pdf, other]

    cs.RO cs.AI

    LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots

    Authors: Dongge Han, Trevor McInroe, Adam Jelley, Stefano V. Albrecht, Peter Bell, Amos Storkey

    Abstract: Large language models (LLMs) have shown significant potential for robotics applications, particularly task planning, by harnessing their language comprehension and text generation capabilities. However, in applications such as household robotics, a critical gap remains in the personalization of these models to individual user preferences. We introduce LLM-Personalize, a novel framework with an opt…

    Submitted 22 April, 2024; originally announced April 2024.

  25. arXiv:2404.11822  [pdf, ps, other]

    math.NA

    A class of maximum-based iteration methods for the generalized absolute value equation

    Authors: Shiliang Wu, Deren Han, Cuixia Li

    Abstract: In this paper, by using $|x|=2\max\{0,x\}-x$, a class of maximum-based iteration methods is established to solve the generalized absolute value equation $Ax-B|x|=b$. Some convergence conditions of the proposed method are presented. By some numerical experiments, the effectiveness and feasibility of the proposed method are confirmed.

    Submitted 17 April, 2024; originally announced April 2024.

  26. arXiv:2404.09490  [pdf, other]

    cs.CV

    Leveraging Temporal Contextualization for Video Action Recognition

    Authors: Minji Kim, Dongyoon Han, Taekyung Kim, Bohyung Han

    Abstract: Pretrained vision-language models have shown effectiveness in video understanding. However, recent studies have not sufficiently leveraged essential temporal information from videos, simply averaging frame-wise representations or referencing consecutive frames. We introduce Temporally Contextualized CLIP (TC-CLIP), a pioneering framework for video understanding that effectively and efficiently lev…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 24 pages, 10 figures, 12 tables

  27. arXiv:2404.09460  [pdf, other]

    math.OC

    Optimal Real-time Bidding Strategy For EV Aggregators in Wholesale Electricity Markets

    Authors: Shihan Huang, Dongkun Han, John Zhen Fu Pang, Yue Chen

    Abstract: With the rapid growth of electric vehicles (EVs), EV aggregators have been playing an increasingly vital role in power systems by not merely providing charging management but also participating in wholesale electricity markets. This work studies the optimal real-time bidding strategy for an EV aggregator. Since the charging process of EVs is time-coupled, it is necessary for EV aggregators to consi…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 13 pages, 6 figures

  28. arXiv:2404.08317  [pdf, other]

    hep-ex physics.ins-det

    Technical Design Report of the Spin Physics Detector at NICA

    Authors: The SPD Collaboration, V. Abazov, V. Abramov, L. Afanasyev, R. Akhunzyanov, A. Akindinov, I. Alekseev, A. Aleshko, V. Alexakhin, G. Alexeev, L. Alimov, A. Allakhverdieva, A. Amoroso, V. Andreev, V. Andreev, E. Andronov, Yu. Anikin, S. Anischenko, A. Anisenkov, V. Anosov, E. Antokhin, A. Antonov, S. Antsupov, A. Anufriev, K. Asadova, et al. (392 additional authors not shown)

    Abstract: The Spin Physics Detector collaboration proposes to install a universal detector in the second interaction point of the NICA collider under construction (JINR, Dubna) to study the spin structure of the proton and deuteron and other spin-related phenomena using a unique possibility to operate with polarized proton and deuteron beams at a collision energy up to 27 GeV and a luminosity up to…

    Submitted 28 May, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

  29. arXiv:2404.08003  [pdf, other]

    cs.LG cs.DC cs.NI

    Asynchronous Federated Reinforcement Learning with Policy Gradient Updates: Algorithm Design and Convergence Analysis

    Authors: Guangchen Lan, Dong-Jun Han, Abolfazl Hashemi, Vaneet Aggarwal, Christopher G. Brinton

    Abstract: To improve the efficiency of reinforcement learning, we propose a novel asynchronous federated reinforcement learning framework termed AFedPG, which constructs a global model through collaboration among $N$ agents using policy gradient (PG) updates. To handle the challenge of lagged policies in asynchronous settings, we design delay-adaptive lookahead and normalized update techniques that can effe…

    Submitted 14 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    ACM Class: I.2.6; I.2.11

  30. GDR-HGNN: A Heterogeneous Graph Neural Networks Accelerator Frontend with Graph Decoupling and Recoupling

    Authors: Runzhen Xue, Mingyu Yan, Dengke Han, Yihan Teng, Zhimin Tang, Xiaochun Ye, Dongrui Fan

    Abstract: Heterogeneous Graph Neural Networks (HGNNs) have broadened the applicability of graph representation learning to heterogeneous graphs. However, the irregular memory access pattern of HGNNs leads to the buffer thrashing issue in HGNN accelerators. In this work, we identify an opportunity to address buffer thrashing in HGNN acceleration through an analysis of the topology of heterogeneous graphs. To…

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: 6 pages, 10 figures, accepted by DAC'61

  31. arXiv:2404.01954  [pdf, other]

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han, et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t…

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  32. arXiv:2404.01745  [pdf, other]

    cs.CV cs.AI

    Unleash the Potential of CLIP for Video Highlight Detection

    Authors: Donghoon Han, Seunghyeon Seo, Eunhwan Park, Seong-Uk Nam, Nojun Kwak

    Abstract: Multimodal and large language models (LLMs) have revolutionized the utilization of open-world knowledge, unlocking novel potentials across various tasks and applications. Among these domains, the video domain has notably benefited from their capabilities. In this paper, we present Highlight-CLIP (HL-CLIP), a method designed to excel in the video highlight detection task by leveraging the pre-train…

    Submitted 2 April, 2024; originally announced April 2024.

  33. arXiv:2404.01604  [pdf, other]

    cs.CV eess.IV

    WaveDH: Wavelet Sub-bands Guided ConvNet for Efficient Image Dehazing

    Authors: Seongmin Hwang, Daeyoung Han, Cheolkon Jung, Moongu Jeon

    Abstract: The surge in interest regarding image dehazing has led to notable advancements in deep learning-based single image dehazing approaches, exhibiting impressive performance in recent studies. Despite these strides, many existing methods fall short in meeting the efficiency demands of practical applications. In this paper, we introduce WaveDH, a novel and compact ConvNet designed to address this effic…

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: Submitted to TMM

    MSC Class: 68T07

    ACM Class: I.4.4; I.4.9

  34. arXiv:2403.19588  [pdf, other]

    cs.CV cs.LG cs.NE

    DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs

    Authors: Donghyun Kim, Byeongho Heo, Dongyoon Han

    Abstract: This paper revives Densely Connected Convolutional Networks (DenseNets) and reveals the underrated effectiveness over predominant ResNet-style architectures. We believe DenseNets' potential was overlooked due to untouched training methods and traditional design elements not fully revealing their capabilities. Our pilot study shows dense connections through concatenation are strong, demonstrating t…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: Code at https://github.com/naver-ai/rdnet

  35. arXiv:2403.19522  [pdf, other]

    cs.LG cs.CV

    Model Stock: All we need is just a few fine-tuned models

    Authors: Dong-Hwan Jang, Sangdoo Yun, Dongyoon Han

    Abstract: This paper introduces an efficient fine-tuning method for large pre-trained models, offering strong in-distribution (ID) and out-of-distribution (OOD) performance. Breaking away from traditional practices that need a multitude of fine-tuned models for averaging, our approach employs significantly fewer models to achieve final weights yet yield superior accuracy. Drawing from key insights in the we…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: Code at https://github.com/naver-ai/model-stock

  36. arXiv:2403.19218  [pdf, other]

    math.NA

    A piecewise neural network method for solving large interval solution to initial value problem of ordinary differential equations

    Authors: Dongpeng Han, Chaolu Temuer

    Abstract: Various traditional numerical methods for solving initial value problems of differential equations often produce local solutions near the initial value point, despite the problems having larger interval solutions. Even current popular neural network algorithms or deep learning methods cannot guarantee yielding large interval solutions for these problems. In this paper, we propose a piecewise neura…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: 26 pages, 13 figures

  37. arXiv:2403.15692  [pdf, other]

    cs.IT eess.SP

    Block Orthogonal Sparse Superposition Codes for $ \sf{L}^3 $ Communications: Low Error Rate, Low Latency, and Low Power Consumption

    Authors: Donghwa Han, Bowhyung Lee, Min Jang, Donghun Lee, Seho Myung, Namyoon Lee

    Abstract: Block orthogonal sparse superposition (BOSS) code is a class of joint coded modulation methods, which can closely achieve the finite-blocklength capacity with a low-complexity decoder at a few coding rates under Gaussian channels. However, for fading channels, the code performance degrades considerably because coded symbols experience different channel fading effects. In this paper, we put forth n…

    Submitted 22 March, 2024; originally announced March 2024.

  38. arXiv:2403.13977  [pdf, other]

    math-ph math.AP math.PR math.SP

    Spectral Analysis of Lattice Schrödinger-Type Operators Associated with the Nonstationary Anderson Model and Intermittency

    Authors: Dan Han, Stanislav Molchanov, Boris Vainberg

    Abstract: The research explores a high irregularity, commonly referred to as intermittency, of the solution to the non-stationary parabolic Anderson problem: \begin{equation*} \frac{\partial u}{\partial t} = \varkappa \mathcal{L}u(t,x) + ξ_{t}(x)u(t,x) \end{equation*} with the initial condition \(u(0,x) \equiv 1\), where \((t,x) \in [0,\infty)\times \mathbb{Z}^d\). Here, \(\varkappa \mathcal{L}\) denotes…

    Submitted 20 March, 2024; originally announced March 2024.

    MSC Class: 60H25; 60H15; 81Q10; 37H15; 35B40

  39. arXiv:2403.13298  [pdf, other]

    cs.CV cs.LG

    Rotary Position Embedding for Vision Transformer

    Authors: Byeongho Heo, Song Park, Dongyoon Han, Sangdoo Yun

    Abstract: Rotary Position Embedding (RoPE) performs remarkably on language models, especially for length extrapolation of Transformers. However, the impacts of RoPE on computer vision domains have been underexplored, even though RoPE appears capable of enhancing Vision Transformer (ViT) performance in a way similar to the language domain. This study provides a comprehensive analysis of RoPE when applied to…

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: 20 pages, 5 figures

  40. arXiv:2403.12404  [pdf, other]

    cs.LG cs.CV

    Understanding and Improving Training-free Loss-based Diffusion Guidance

    Authors: Yifei Shen, Xinyang Jiang, Yezhen Wang, Yifan Yang, Dongqi Han, Dongsheng Li

    Abstract: Adding additional control to pretrained diffusion models has become an increasingly popular research area, with extensive applications in computer vision, reinforcement learning, and AI for science. Recently, several studies have proposed training-free loss-based guidance by using off-the-shelf networks pretrained on clean images. This approach enables zero-shot conditional generation for universa…

    Submitted 29 May, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  41. Distributed Adaptive Gradient Algorithm with Gradient Tracking for Stochastic Non-Convex Optimization

    Authors: Dongyu Han, Kun Liu, Yeming Lin, Yuanqing Xia

    Abstract: This paper considers a distributed stochastic non-convex optimization problem, where the nodes in a network cooperatively minimize a sum of $L$-smooth local cost functions with sparse gradients. By adaptively adjusting the stepsizes according to the historical (possibly sparse) gradients, a distributed adaptive gradient algorithm is proposed, in which a gradient tracking estimator is used to handl…

    Submitted 29 March, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: 14 pages, 8 figures

    Journal ref: IEEE Transactions on Automatic Control (2024)

  42. arXiv:2403.11079  [pdf, other]

    cs.SE cs.LG

    Bridging Expert Knowledge with Deep Learning Techniques for Just-In-Time Defect Prediction

    Authors: Xin Zhou, DongGyun Han, David Lo

    Abstract: Just-In-Time (JIT) defect prediction aims to automatically predict whether a commit is defective or not, and has been widely studied in recent years. In general, most studies can be classified into two categories: 1) simple models using traditional machine learning classifiers with hand-crafted features, and 2) complex models using deep learning techniques to automatically extract features from co…

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: 48 pages

  43. arXiv:2403.09675  [pdf, other]

    cs.CV cs.GR

    Open-Universe Indoor Scene Generation using LLM Program Synthesis and Uncurated Object Databases

    Authors: Rio Aguina-Kang, Maxim Gumin, Do Heon Han, Stewart Morris, Seung Jean Yoo, Aditya Ganeshan, R. Kenny Jones, Qiuhong Anna Wei, Kailiang Fu, Daniel Ritchie

    Abstract: We present a system for generating indoor scenes in response to text prompts. The prompts are not limited to a fixed vocabulary of scene descriptions, and the objects in generated scenes are not restricted to a fixed set of object categories -- we call this setting open-universe indoor scene generation. Unlike most prior work on indoor scene generation, our system does not require a large training dataset of ex…

    Submitted 4 February, 2024; originally announced March 2024.

    Comments: See ancillary files for link to supplemental material

  44. arXiv:2402.15265  [pdf, other]

    cs.HC cs.CL

    CloChat: Understanding How People Customize, Interact, and Experience Personas in Large Language Models

    Authors: Juhye Ha, Hyeon Jeon, DaEun Han, Jinwook Seo, Changhoon Oh

    Abstract: Large language models (LLMs) have facilitated significant strides in generating conversational agents, enabling seamless, contextually relevant dialogues across diverse topics. However, the existing LLM-driven conversational agents have fixed personalities and functionalities, limiting their adaptability to individual user needs. Creating personalized agent personas with distinct expertise or trai…

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '24)

  45. arXiv:2402.15019  [pdf, other]

    cs.LG cs.AI stat.ML

    Consistency-Guided Temperature Scaling Using Style and Content Information for Out-of-Domain Calibration

    Authors: Wonjeong Choi, Jungwuk Park, Dong-Jun Han, Younghyun Park, Jaekyun Moon

    Abstract: Research interests in the robustness of deep neural networks against domain shifts have been rapidly increasing in recent years. Most existing works, however, focus on improving the accuracy of the model, not the calibration performance which is another important requirement for trustworthy AI systems. Temperature scaling (TS), an accuracy-preserving post-hoc calibration method, has been proven to…

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: Accepted at AAAI-24 (The 38th AAAI Conference on Artificial Intelligence, February 2024)

  46. arXiv:2402.13851  [pdf, other]

    cs.CV

    VL-Trojan: Multimodal Instruction Backdoor Attacks against Autoregressive Visual Language Models

    Authors: Jiawei Liang, Siyuan Liang, Man Luo, Aishan Liu, Dongchen Han, Ee-Chien Chang, Xiaochun Cao

    Abstract: Autoregressive Visual Language Models (VLMs) showcase impressive few-shot learning capabilities in a multimodal context. Recently, multimodal instruction tuning has been proposed to further enhance instruction-following abilities. However, we uncover the potential threat posed by backdoor attacks on autoregressive VLMs during instruction tuning. Adversaries can implant a backdoor by injecting pois…

    Submitted 21 February, 2024; originally announced February 2024.

  47. arXiv:2402.08963  [pdf, other]

    cs.LG cs.AI

    DUEL: Duplicate Elimination on Active Memory for Self-Supervised Class-Imbalanced Learning

    Authors: Won-Seok Choi, Hyundo Lee, Dong-Sig Han, Junseok Park, Heeyeon Koo, Byoung-Tak Zhang

    Abstract: Recent machine learning algorithms have been developed using well-curated datasets, which often require substantial cost and resources. On the other hand, the direct use of raw data often leads to overfitting towards frequently occurring class information. To address class imbalances cost-efficiently, we propose an active data filtering process during self-supervised pre-training in our novel fram…

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: Accepted as a full paper at AAAI 2024: The 38th Annual AAAI Conference on Artificial Intelligence (Main Tech Track). 7 pages (main paper), 2 pages (references), 11 pages (appendix) each

  48. arXiv:2402.04406  [pdf, other]

    math.OC

    Regularized MIP Model for Optimal Power Flow with Energy Storage Systems and its Applications

    Authors: Dahye Han, Nan Jiang, Santanu S. Dey, Weijun Xie

    Abstract: Incorporating energy storage systems (ESS) into power systems has been studied in many recent works, where binary variables are often introduced to model the complementary nature of battery charging and discharging. A conventional approach for these ESS optimization problems is to relax binary variables and convert the problem into a linear program. However, such linear programming relaxation mode…

    Submitted 6 February, 2024; originally announced February 2024.

  49. arXiv:2402.03448  [pdf, other]

    cs.LG cs.DC

    Decentralized Sporadic Federated Learning: A Unified Algorithmic Framework with Convergence Guarantees

    Authors: Shahryar Zehtabi, Dong-Jun Han, Rohit Parasnis, Seyyedali Hosseinalipour, Christopher G. Brinton

    Abstract: Decentralized federated learning (DFL) captures FL settings where both (i) model updates and (ii) model aggregations are exclusively carried out by the clients without a central server. Existing DFL works have mostly focused on settings where clients conduct a fixed number of local updates between local model exchanges, overlooking heterogeneity and dynamics in communication and computation capabi…

    Submitted 31 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  50. arXiv:2402.02442  [pdf, other]

    cs.LG eess.IV

    A Momentum Accelerated Algorithm for ReLU-based Nonlinear Matrix Decomposition

    Authors: Qingsong Wang, Chunfeng Cui, Deren Han

    Abstract: Recently, there has been a growing interest in the exploration of Nonlinear Matrix Decomposition (NMD) due to its close ties with neural networks. NMD aims to find a low-rank matrix from a sparse nonnegative matrix with a per-element nonlinear function. A typical choice is the Rectified Linear Unit (ReLU) activation function. To address over-fitting in the existing ReLU-based NMD model (ReLU-NMD),…

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: 5 pages, 7 figures