Skip to main content

Showing 1–16 of 16 results for author: Huebl, A

  1. arXiv:2407.04267  [pdf, other

    cs.DC

    A High-Quality Workflow for Multi-Resolution Scientific Data Reduction and Visualization

    Authors: Daoce Wang, Pascal Grosset, Jesus Pulido, Tushar M. Athawale, Jiannan Tian, Kai Zhao, Zarija Lukić, Axel Huebl, Zhe Wang, James Ahrens, Dingwen Tao

    Abstract: Multi-resolution methods such as Adaptive Mesh Refinement (AMR) can enhance storage efficiency for HPC applications generating vast volumes of data. However, their applicability is limited and cannot be universally deployed across all applications. Furthermore, integrating lossy compression with multi-resolution techniques to further boost storage efficiency encounters significant barriers. To thi… ▽ More

    Submitted 11 July, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

    Comments: accepted by SC '24

  2. arXiv:2403.12179  [pdf, other

    cs.DC

    AMReX and pyAMReX: Looking Beyond ECP

    Authors: Andrew Myers, Weiqun Zhang, Ann Almgren, Thierry Antoun, John Bell, Axel Huebl, Alexander Sinn

    Abstract: AMReX is a software framework for the development of block-structured mesh applications with adaptive mesh refinement (AMR). AMReX was initially developed and supported by the AMReX Co-Design Center as part of the U.S. DOE Exascale Computing Project, and is continuing to grow post-ECP. In addition to adding new functionality and performance improvements to the core AMReX framework, we have also de… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 12 pages, 1 figure, submitted to the International Journal of High Performance Computing Applications

  3. arXiv:2311.02010  [pdf, other

    cs.CY

    A cast of thousands: How the IDEAS Productivity project has advanced software productivity and sustainability

    Authors: Lois Curfman McInnes, Michael Heroux, David E. Bernholdt, Anshu Dubey, Elsa Gonsiorowski, Rinku Gupta, Osni Marques, J. David Moulton, Hai Ah Nam, Boyana Norris, Elaine M. Raybourn, Jim Willenbring, Ann Almgren, Ross Bartlett, Kita Cranfill, Stephen Fickas, Don Frederick, William Godoy, Patricia Grubel, Rebecca Hartman-Baker, Axel Huebl, Rose Lynch, Addi Malviya Thakur, Reed Milewicz, Mark C. Miller , et al. (9 additional authors not shown)

    Abstract: Computational and data-enabled science and engineering are revolutionizing advances throughout science and society, at all scales of computing. For example, teams in the U.S. DOE Exascale Computing Project have been tackling new frontiers in modeling, simulation, and analysis by exploiting unprecedented exascale computing capabilities-building an advanced software ecosystem that supports next-gene… ▽ More

    Submitted 16 February, 2024; v1 submitted 3 November, 2023; originally announced November 2023.

    Comments: 12 pages, 1 figure

  4. arXiv:2310.00469  [pdf, other

    cs.DC cs.HC

    State of In Situ Visualization in Simulations: We are fast. But are we inspiring?

    Authors: Axel Huebl, Arianna Formenti, Marco Garten, Jean-Luc Vay

    Abstract: Visualization of dynamic processes in scientific high-performance computing is an immensely data intensive endeavor. Application codes have recently demonstrated scaling to full-size Exascale machines, and generating high-quality data for visualization is consequently on the machine-scale, easily spanning 100s of TBytes of input to generate a single video frame. In situ visualization, the techniqu… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

    Comments: 2 pages + references, 1 figure, accepted lightning talk abstract for ISAV23 (in conjunction with SC23)

  5. arXiv:2304.10566  [pdf, other

    astro-ph.HE cs.CE

    Particle-in-Cell Simulations of Relativistic Magnetic Reconnection with Advanced Maxwell Solver Algorithms

    Authors: Hannah Klion, Revathi Jambunathan, Michael E. Rowan, Eloise Yang, Donald Willcox, Jean-Luc Vay, Remi Lehe, Andrew Myers, Axel Huebl, Weiqun Zhang

    Abstract: Relativistic magnetic reconnection is a non-ideal plasma process that is a source of non-thermal particle acceleration in many high-energy astrophysical systems. Particle-in-cell (PIC) methods are commonly used for simulating reconnection from first principles. While much progress has been made in understanding the physics of reconnection, especially in 2D, the adoption of advanced algorithms and… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: 19 pages, 10 figures. Submitted to ApJ

  6. arXiv:2303.12873  [pdf, other

    physics.acc-ph cs.SE physics.plasm-ph

    From Compact Plasma Particle Sources to Advanced Accelerators with Modeling at Exascale

    Authors: Axel Huebl, Remi Lehe, Edoardo Zoni, Olga Shapoval, Ryan T. Sandberg, Marco Garten, Arianna Formenti, Revathi Jambunathan, Prabhat Kumar, Kevin Gott, Andrew Myers, Weiqun Zhang, Ann Almgren, Chad E. Mitchell, Ji Qiang, David Grote, Alexander Sinn, Severin Diederichs, Maxence Thevenet, Luca Fedeli, Thomas Clark, Neil Zaim, Henri Vincenti, Jean-Luc Vay

    Abstract: Developing complex, reliable advanced accelerators requires a coordinated, extensible, and comprehensive approach in modeling, from source to the end of beam lifetime. We present highlights in Exascale Computing to scale accelerator modeling software to the requirements set for contemporary science drivers. In particular, we present the first laser-plasma modeling on an exaflop supercomputer using… ▽ More

    Submitted 18 April, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

    Comments: 4 pages, 3 figures, presented at the 20th Advanced Accelerator Concepts Workshop (AAC22)

  7. arXiv:2208.02382  [pdf, ps, other

    physics.acc-ph cs.DC

    Next Generation Computational Tools for the Modeling and Design of Particle Accelerators at Exascale

    Authors: Axel Huebl, Remi Lehe, Chad E. Mitchell, Ji Qiang, Robert D. Ryne, Ryan T. Sandberg, Jean-Luc Vay

    Abstract: Particle accelerators are among the largest, most complex devices. To meet the challenges of increasing energy, intensity, accuracy, compactness, complexity and efficiency, increasingly sophisticated computational tools are required for their design and optimization. It is key that contemporary software take advantage of the latest advances in computer hardware and scientific software engineering… ▽ More

    Submitted 9 August, 2022; v1 submitted 3 August, 2022; originally announced August 2022.

    Comments: 4 pages, 8 figures; NAPAC22, Invited Oral, TUYE2

    MSC Class: 78-10 ACM Class: I.6.0; D.2.12; D.2.13

    Journal ref: NAPAC22, 2022

  8. Improving I/O Performance for Exascale Applications through Online Data Layout Reorganization

    Authors: Lipeng Wan, Axel Huebl, Junmin Gu, Franz Poeschel, Ana Gainaru, Ruonan Wang, Jieyang Chen, Xin Liang, Dmitry Ganyushin, Todd Munson, Ian Foster, Jean-Luc Vay, Norbert Podhorszki, Kesheng Wu, Scott Klasky

    Abstract: The applications being developed within the U.S. Exascale Computing Project (ECP) to run on imminent Exascale computers will generate scientific results with unprecedented fidelity and record turn-around time. Many of these codes are based on particle-mesh methods and use advanced algorithms, especially dynamic load-balancing and mesh-refinement, to achieve high performance on Exascale machines. Y… ▽ More

    Submitted 15 July, 2021; originally announced July 2021.

    Comments: 12 pages, 15 figures, accepted by IEEE Transactions on Parallel and Distributed Systems

    Journal ref: IEEE Transactions on Parallel and Distributed Systems, 2021

  9. Transitioning from file-based HPC workflows to streaming data pipelines with openPMD and ADIOS2

    Authors: Franz Poeschel, Juncheng E, William F. Godoy, Norbert Podhorszki, Scott Klasky, Greg Eisenhauer, Philip E. Davis, Lipeng Wan, Ana Gainaru, Junmin Gu, Fabian Koller, René Widera, Michael Bussmann, Axel Huebl

    Abstract: This paper aims to create a transition path from file-based IO to streaming-based workflows for scientific applications in an HPC environment. By using the openPMP-api, traditional workflows limited by filesystem bottlenecks can be overcome and flexibly extended for in situ analysis. The openPMD-api is a library for the description of scientific data according to the Open Standard for Particle-Mes… ▽ More

    Submitted 19 January, 2022; v1 submitted 13 July, 2021; originally announced July 2021.

    Comments: 18 pages, 9 figures, SMC2021, supplementary material at https://zenodo.org/record/4906276

  10. arXiv:2104.11385  [pdf, other

    cs.DC physics.acc-ph physics.comp-ph physics.plasm-ph

    In-Situ Assessment of Device-Side Compute Work for Dynamic Load Balancing in a GPU-Accelerated PIC Code

    Authors: Michael E. Rowan, Axel Huebl, Kevin N. Gott, Jack Deslippe, Maxence Thévenet, Remi Lehe, Jean-Luc Vay

    Abstract: Maintaining computational load balance is important to the performant behavior of codes which operate under a distributed computing model. This is especially true for GPU architectures, which can suffer from memory oversubscription if improperly load balanced. We present enhancements to traditional load balancing approaches and explicitly target GPU architectures, exploring the resulting performan… ▽ More

    Submitted 22 April, 2021; originally announced April 2021.

    Comments: 11 pages, 8 figures. Paper accepted in the Platform for Advanced Scientific Computing Conference (PASC '21), July 5 to 9, 2021, Geneva, Switzerland

    Journal ref: PASC 2021: Proceedings of the Platform for Advanced Scientific Computing Conference

  11. arXiv:2101.12149  [pdf, other

    physics.comp-ph cs.DC physics.acc-ph

    Porting WarpX to GPU-accelerated platforms

    Authors: A. Myers, A. Almgren, L. D. Amorim, J. Bell, L. Fedeli, L. Ge, K. Gott, D. P. Grote, M. Hogan, A. Huebl, R. Jambunathan, R. Lehe, C. Ng, M. Rowan, O. Shapoval, M. Thévenet, J. -L. Vay, H. Vincenti, E. Yang, N. Zaïm, W. Zhang, Y. Zhao, E. Zoni

    Abstract: WarpX is a general purpose electromagnetic particle-in-cell code that was originally designed to run on many-core CPU architectures. We describe the strategy followed to allow WarpX to use the GPU-accelerated nodes on OLCF's Summit supercomputer, a strategy we believe will extend to the upcoming machines Frontier and Aurora. We summarize the challenges encountered, lessons learned, and give curren… ▽ More

    Submitted 2 September, 2021; v1 submitted 28 January, 2021; originally announced January 2021.

    Comments: 11 pages, 5 figures, accepted by Parallel Computing. Minor revisions, results unchanged

    Journal ref: Parallel Computing, Volume 108, 2021, 102833

  12. Tuning and optimization for a variety of many-core architectures without changing a single line of implementation code using the Alpaka library

    Authors: Alexander Matthes, René Widera, Erik Zenker, Benjamin Worpitz, Axel Huebl, Michael Bussmann

    Abstract: We present an analysis on optimizing performance of a single C++11 source code using the Alpaka hardware abstraction library. For this we use the general matrix multiplication (GEMM) algorithm in order to show that compilers can optimize Alpaka code effectively when tuning key parameters of the algorithm. We do not intend to rival existing, highly optimized DGEMM versions, but merely choose this e… ▽ More

    Submitted 30 June, 2017; originally announced June 2017.

    Comments: Accepted paper for the P\^{}3MA workshop at the ISC 2017 in Frankfurt

    Journal ref: J.M. Kunkel et al. (Eds.): ISC High Performance Workshops 2017, LNCS 10524, pp. 496-514, 2017

  13. arXiv:1706.00522  [pdf, other

    cs.PF physics.comp-ph

    On the Scalability of Data Reduction Techniques in Current and Upcoming HPC Systems from an Application Perspective

    Authors: Axel Huebl, Rene Widera, Felix Schmitt, Alexander Matthes, Norbert Podhorszki, Jong Youl Choi, Scott Klasky, Michael Bussmann

    Abstract: We implement and benchmark parallel I/O methods for the fully-manycore driven particle-in-cell code PIConGPU. Identifying throughput and overall I/O size as a major challenge for applications on today's and future HPC systems, we present a scaling law characterizing performance bottlenecks in state-of-the-art approaches for data reduction. Consequently, we propose, implement and verify multi-threa… ▽ More

    Submitted 1 June, 2017; originally announced June 2017.

    Comments: 15 pages, 5 figures, accepted for DRBSD-1 in conjunction with ISC'17

    ACM Class: D.4.8; B.4.3; I.6.6

    Journal ref: J.M. Kunkel et al. (Eds.): ISC High Performance Workshops 2017, LNCS 10524, pp. 15-29, 2017

  14. In situ, steerable, hardware-independent and data-structure agnostic visualization with ISAAC

    Authors: Alexander Matthes, Axel Huebl, René Widera, Sebastian Grottel, Stefan Gumhold, Michael Bussmann

    Abstract: The computation power of supercomputers grows faster than the bandwidth of their storage and network. Especially applications using hardware accelerators like Nvidia GPUs cannot save enough data to be analyzed in a later step. There is a high risk of loosing important scientific information. We introduce the in situ template library ISAAC which enables arbitrary applications like scientific simula… ▽ More

    Submitted 28 November, 2016; originally announced November 2016.

    Journal ref: Supercomputing Frontiers and Innovations, [S.l.], v. 3, n. 4, p. 30-48, oct. 2016

  15. Performance-Portable Many-Core Plasma Simulations: Porting PIConGPU to OpenPower and Beyond

    Authors: Erik Zenker, René Widera, Axel Huebl, Guido Juckeland, Andreas Knüpfer, Wolfgang E. Nagel, Michael Bussmann

    Abstract: With the appearance of the heterogeneous platform OpenPower,many-core accelerator devices have been coupled with Power host processors for the first time. Towards utilizing their full potential, it is worth investigating performance portable algorithms that allow to choose the best-fitting hardware for each domain-specific compute task. Suiting even the high level of parallelism on modern GPGPUs,… ▽ More

    Submitted 12 June, 2016; v1 submitted 9 June, 2016; originally announced June 2016.

    Comments: 9 pages, 3 figures, accepted on IWOPH 2016

    Journal ref: Lecture Notes in Computer Science, 9945, pp 293-301, 2016

  16. Alpaka - An Abstraction Library for Parallel Kernel Acceleration

    Authors: Erik Zenker, Benjamin Worpitz, René Widera, Axel Huebl, Guido Juckeland, Andreas Knüpfer, Wolfgang E. Nagel, Michael Bussmann

    Abstract: Porting applications to new hardware or programming models is a tedious and error prone process. Every help that eases these burdens is saving developer time that can then be invested into the advancement of the application itself instead of preserving the status-quo on a new platform. The Alpaka library defines and implements an abstract hierarchical redundant parallelism model. The model explo… ▽ More

    Submitted 26 February, 2016; originally announced February 2016.

    Comments: 10 pages, 10 figures