Publication

Use cases of lossy compression for floating-point data in scientific datasets

Authors

Cappello, Franck; Di, Sheng; Li, Sihuan; Liang, Xin; Gok, Ali Murat; Tao, Dingwen; Yoon, Chun Hong; Wu, Xin-Chuan; Alexeev, Yuri; Chong, Frederic

Abstract

Architectural and technological trends in systems used for scientific computing call for a significant reduction in the size of scientific data sets, which are composed mainly of floating-point data. This article surveys currently identified use cases of generic lossy compression that address the different limitations of scientific computing systems, and presents experimental results for them. Drawing on a collection of experiments run on parallel systems of a leadership facility, the article shows that lossy data compression not only can reduce the footprint of scientific data sets on storage but also can reduce I/O and checkpoint/restart times, accelerate computation, and even allow significantly larger problems to be run than would be possible without lossy compression. These results suggest that lossy compression will become an important technology in many aspects of high-performance scientific computing. Because the constraints of each use case are different and often conflicting, this collection of results also indicates the need for more specialization of the compression pipelines.