Skip to main content

Showing 1–8 of 8 results for author: Calhoun, J C

  1. arXiv:2404.02840  [pdf, ps, other

    cs.DC

    A Survey on Error-Bounded Lossy Compression for Scientific Datasets

    Authors: Sheng Di, Jinyang Liu, Kai Zhao, Xin Liang, Robert Underwood, Zhaorui Zhang, Milan Shah, Yafan Huang, Jiajun Huang, Xiaodong Yu, Congrong Ren, Hanqi Guo, Grant Wilkins, Dingwen Tao, Jiannan Tian, Sian Jin, Zizhe Jian, Daoce Wang, MD Hasanur Rahman, Boyuan Zhang, Jon C. Calhoun, Guanpeng Li, Kazutomo Yoshii, Khalid Ayed Alharthi, Franck Cappello

    Abstract: Error-bounded lossy compression has been effective in significantly reducing the data storage/transfer burden while preserving the reconstructed data fidelity very well. Many error-bounded lossy compressors have been developed for a wide range of parallel and distributed use cases for years. These lossy compressors are designed with distinct compression models and design principles, such that each… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: submitted to ACM Computing journal, requited to be 35 pages including references

  2. arXiv:2403.15953  [pdf, other

    cs.LG cs.AI

    Understanding The Effectiveness of Lossy Compression in Machine Learning Training Sets

    Authors: Robert Underwood, Jon C. Calhoun, Sheng Di, Franck Cappello

    Abstract: Learning and Artificial Intelligence (ML/AI) techniques have become increasingly prevalent in high performance computing (HPC). However, these methods depend on vast volumes of floating point data for training and validation which need methods to share the data on a wide area network (WAN) or to transfer it from edge devices to data centers. Data compression can be a solution to these problems, bu… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

    Comments: 12 pages, 4 figures

    ACM Class: I.2.6; E.2; C.4

  3. arXiv:2312.13461  [pdf, other

    cs.DC

    FedSZ: Leveraging Error-Bounded Lossy Compression for Federated Learning Communications

    Authors: Grant Wilkins, Sheng Di, Jon C. Calhoun, Zilinghan Li, Kibaek Kim, Robert Underwood, Richard Mortier, Franck Cappello

    Abstract: With the promise of federated learning (FL) to allow for geographically-distributed and highly personalized services, the efficient exchange of model updates between clients and servers becomes crucial. FL, though decentralized, often faces communication bottlenecks, especially in resource-constrained scenarios. Existing data compression techniques like gradient sparsification, quantization, and p… ▽ More

    Submitted 24 April, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: Appearing at 44th IEEE International Conference on Distributed Computing Systems (ICDCS)

  4. arXiv:2305.08801  [pdf, other

    cs.DC cs.IT

    Black-Box Statistical Prediction of Lossy Compression Ratios for Scientific Data

    Authors: Robert Underwood, Julie Bessac, David Krasowska, Jon C. Calhoun, Sheng Di, Franck Cappello

    Abstract: Lossy compressors are increasingly adopted in scientific research, tackling volumes of data from experiments or parallel numerical simulations and facilitating data storage and movement. In contrast with the notion of entropy in lossless compression, no theoretical or data-based quantification of lossy compressibility exists for scientific data. Users rely on trial and error to assess lossy compre… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: 16 pages, 10 figures

  5. arXiv:2112.02289  [pdf, other

    cs.DC

    Towards Aggregated Asynchronous Checkpointing

    Authors: Mikaila J. Gossman, Bogdan Nicolae, Jon C. Calhoun, Franck Cappello, Melissa C. Smith

    Abstract: High-Performance Computing (HPC) applications need to checkpoint massive amounts of data at scale. Multi-level asynchronous checkpoint runtimes like VELOC (Very Low Overhead Checkpoint Strategy) are gaining popularity among application scientists for their ability to leverage fast node-local storage and flush independently to stable, external storage (e.g., parallel file systems) in the background… ▽ More

    Submitted 4 December, 2021; originally announced December 2021.

    Comments: Accepted submission to the SuperCheck Workshop at the SuperComputing Conference held in St. Louis, MO. November 14-19, 2021(SC'21)

  6. arXiv:2111.02925  [pdf, other

    cs.DC

    SZ3: A Modular Framework for Composing Prediction-Based Error-Bounded Lossy Compressors

    Authors: Xin Liang, Kai Zhao, Sheng Di, Sihuan Li, Robert Underwood, Ali M. Gok, Jiannan Tian, Junjing Deng, Jon C. Calhoun, Dingwen Tao, Zizhong Chen, Franck Cappello

    Abstract: Today's scientific simulations require a significant reduction of data volume because of extremely large amounts of data they produce and the limited I/O bandwidth and storage space. Error-bounded lossy compressor has been considered one of the most effective solutions to the above problem. In practice, however, the best-fit compression method often needs to be customized/optimized in particular b… ▽ More

    Submitted 11 November, 2021; v1 submitted 4 November, 2021; originally announced November 2021.

    Comments: 13 pages

  7. arXiv:2002.03742  [pdf

    cs.CV

    Dynamic Error-bounded Lossy Compression (EBLC) to Reduce the Bandwidth Requirement for Real-time Vision-based Pedestrian Safety Applications

    Authors: Mizanur Rahman, Mhafuzul Islam, Jon C. Calhoun, Mashrur Chowdhury

    Abstract: As camera quality improves and their deployment moves to areas with limited bandwidth, communication bottlenecks can impair real-time constraints of an ITS application, such as video-based real-time pedestrian detection. Video compression reduces the bandwidth requirement to transmit the video but degrades the video quality. As the quality level of the video decreases, it results in the correspond… ▽ More

    Submitted 29 January, 2020; originally announced February 2020.

    Comments: 10 pages, 8 figures, 2 tables

  8. arXiv:2001.06139  [pdf, other

    cs.DC

    FRaZ: A Generic High-Fidelity Fixed-Ratio Lossy Compression Framework for Scientific Floating-point Data

    Authors: Robert Underwood, Sheng Di, Jon C. Calhoun, Franck Cappello

    Abstract: With ever-increasing volumes of scientific floating-point data being produced by high-performance computing applications, significantly reducing scientific floating-point data size is critical, and error-controlled lossy compressors have been developed for years. None of the existing scientific floating-point lossy data compressors, however, support effective fixed-ratio lossy compression. Yet fix… ▽ More

    Submitted 16 January, 2020; originally announced January 2020.

    Comments: 12 pages