Computer Science > Distributed, Parallel, and Cluster Computing

arXiv:2404.13470 (cs)

[Submitted on 20 Apr 2024]

Title:GWLZ: A Group-wise Learning-based Lossy Compression Framework for Scientific Data

Authors:Wenqi Jia, Sian Jin, Jinzhen Wang, Wei Niu, Dingwen Tao, Miao Yin

Abstract:The rapid expansion of computational capabilities and the ever-growing scale of modern HPC systems present formidable challenges in managing exascale scientific data. Faced with such vast datasets, traditional lossless compression techniques prove insufficient in reducing data size to a manageable level while preserving all information intact. In response, researchers have turned to error-bounded lossy compression methods, which offer a balance between data size reduction and information retention. However, despite their utility, these compressors employing conventional techniques struggle with limited reconstruction quality. To address this issue, we draw inspiration from recent advancements in deep learning and propose GWLZ, a novel group-wise learning-based lossy compression framework with multiple lightweight learnable enhancer models. Leveraging a group of neural networks, GWLZ significantly enhances the decompressed data reconstruction quality with negligible impact on the compression efficiency. Experimental results on different fields from the Nyx dataset demonstrate remarkable improvements by GWLZ, achieving up to 20% quality enhancements with negligible overhead as low as 0.0003x.

Subjects:	Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2404.13470 [cs.DC]
	(or arXiv:2404.13470v1 [cs.DC] for this version)
	https://doi.org/10.48550/arXiv.2404.13470

Submission history

From: Wenqi Jia [view email]
[v1] Sat, 20 Apr 2024 21:12:53 UTC (1,612 KB)

Computer Science > Distributed, Parallel, and Cluster Computing

Title:GWLZ: A Group-wise Learning-based Lossy Compression Framework for Scientific Data

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Distributed, Parallel, and Cluster Computing

Title:GWLZ: A Group-wise Learning-based Lossy Compression Framework for Scientific Data

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators