From 23b3f52c1f10ec359ce41eb331b26f3e596a0cc9 Mon Sep 17 00:00:00 2001 From: Dietz <4tq@mac133153.ornl.gov> Date: Fri, 24 Oct 2025 16:26:54 -0400 Subject: [PATCH 1/4] First pass at updating prose to match user guide --- software/workflows/rp.rst | 144 +++++++++++++++++--------------------- 1 file changed, 66 insertions(+), 78 deletions(-) diff --git a/software/workflows/rp.rst b/software/workflows/rp.rst index bea51c29..1bc75f22 100644 --- a/software/workflows/rp.rst +++ b/software/workflows/rp.rst @@ -10,29 +10,25 @@ Overview Scientific productivity can be enhanced through workflow management tools, relieving large High Performance Computing (HPC) system users from the tedious tasks of scheduling and designing the complex -computational execution of scientific applications. This report presents a study on the usage of ensemble -workflow tools to accelerate science using the Frontier supercomputing systems. This technical report -aims to connect science domain simulations using Oak Ridge Leadership Computing Facility (OLCF) supercomputing -platforms with ensemble workflow methods in order to accelerate HPC-enabled discovery -and boost scientific impact. We present the coupling, porting and installation of Radical-Cybertools on -two applications: Chroma and NAMD. https://www.osti.gov/biblio/2575304 +computational execution of scientific applications. This user documentation page presents several examples of +the usage of ensemble workflow tools to accelerate science using the Frontier supercomputing system. +This page presents the coupling, porting and installation of Radical-Cybertools on two applications: Chroma +and NAMD. The content of this page is adapted from a technical report where additional information and detail +can be found: https://www.osti.gov/biblio/2575304 Introduction ============= -This technical guide provides guidance for OLCF users implementing RP workflow tool on Frontier. 
As -the guidelines and software matures and evolves, our team will deliver biannual updates to the policies and -best practices. The document offers comprehensive technical and scientific guidelines for adopting and -configuring RP on the Frontier supercomputer, complementing RP's platform-specific documentation. We -include essential information on data management strategies and OLCF ensemble policies, while highlighting -our solutions and multi-track capabilities for installation and usability. +This guide provides summary guidance for OLCF users implementing the RADICAL-Pilot (RP) workflow tool on Frontier. +The source `technical report `__ offers additional comprehensive technical and scientific +guidelines for adopting and configuring RP on the Frontier supercomputer, complementing RP's platform-specific documentation. +The report includes essential information on data management strategies and OLCF ensemble policies, while highlighting +solutions and multi-track capabilities for installation and usability. RP is an ensemble tool that leverages Python-based scripts for efficient job launching, scheduling, error management, and resource allocation. Its application-agnostic design provides customizable workflows for domain-specific requirements. RP's multi-level metadata management system organizes execution data -in structured directories. While workflow tools often struggle to adapt to specific production systems and -facility policies this technical paper addresses platform heterogeneity by documenting our experience integrating, -porting, and running RP on Frontier. +in structured directories. RP demonstrates exceptional error reporting capabilities, enabling rapid job relaunch and preventing execution hangs during ensemble operations. Its efficient restart options maintain minimal overhead across @@ -40,47 +36,26 @@ our flagship applications detailed in this document. 
Previous publications on OL established portability as a versatile ensemble tool Titov et al. 2024; Titov et al. 2022; Merzky et al. 2021; Merzky, Turilli, and Jha 2022; Turilli et al. 2021. -INSTALLATION OF THE RADICAL-PILOT TOOL +Installation of the RADICAL-Pilot Tool ====================================== -Workflow management is a strategic approach that assists organizing and optimizing model runs on large -heterogeneous High Performance Computing (HPC) systems. At OLCF we cater to these workflow needs -and feature demands by providing complex workflow tools with state-of-the-art management capabilities. -RADICAL-Pilot has showcased the ability to simplify the computational runs on Frontier and is widely -used across platforms and scientific groups. The source materials from the developers reside here: -https://radicalpilot.readthedocs.io/en/stable/supported/frontier.html - -A user's guide is provided to encapsulate directions and practices on installing the RADICAL-Cybertools -stack (RCT) on Frontier with the pip install command. OLCF supports Python virtual environment usageincluded -with instructions for the execution environment- by creating a virtual environment with venv: +Frontier supports Python virtual environment usage: .. code-block:: console $ export PYTHONNOUSERSITE=True - $ module load cray−python/3.11.7 - $ python3 −m venv ve.rp + $ module load cray-python + $ python3 -m venv ve.rp $ source ve.rp/bin/activate -Subsequently, install RP in the activated corresponding virtual environment: +Subsequently, install RP in the newly created and activated virtual environment: .. code-block:: console $ pip install radical.pilot -An alternate way to install RP manually is the following user-based installation method for Frontier: - -.. code-block:: console - - $ module load cray−python/3.11.7 - $ python −m venv ve.rp - $ source ve.rp/bin/activate - $ pip install −U pip - -Use the pip install –user pip command if any errors appear. 
Passing the –user option to python --m pip install will install a package just for the current user, rather than for all users of the system. - The latest versions of RCT tools are within development branches, and include the latest fixes, updates and -new features. These versions are considered unstable and they are optional for users. +new features. These versions are considered unstable and they are optional for users, but can be installed if desired: .. code-block:: console @@ -88,15 +63,17 @@ new features. These versions are considered unstable and they are optional for u $ pip install git+https://github.com/radical-cybertools/radical.gtod.git@devel $ pip install git+https://github.com/radical-cybertools/radical.pilot.git@devel -Run the command ``radical-stack`` to verify the success of the installation. -RP application (i.e., Python application using RP as a pilot-based runtime system) can be launched as -a regular Python script: ``python rp_app.py`` (or ``./rp_app.py`` if it includes a corresponding shebang, -e.g., #!/usr/bin/env python). To keep it running in the background the following command is recommended. +Run the command ``radical-stack`` to verify the success of the installation. This should print the corresponding Python +and RP versions that have been installed. + +Running Overview +^^^^^^^^^^^^^^^^ + +An RP application (i.e., a Python application using RP as a pilot-based runtime system) can be launched as +a regular Python script: ``python rp_app.py``. To keep it running in the background, the following command is recommended. ``nohup python rp_app.py > OUTPUT 2>&1 Date: Tue, 28 Oct 2025 15:49:38 -0400 Subject: [PATCH 2/4] Update links to new repo --- software/workflows/rp.rst | 10 +++------- 1 file changed, 3 insertions(+), 7 deletions(-) diff --git a/software/workflows/rp.rst b/software/workflows/rp.rst index 1bc75f22..4326ed23 100644 --- a/software/workflows/rp.rst +++ b/software/workflows/rp.rst @@ -101,11 +101,11 @@ et al. 
2024 CHROMA ------ -The repository for this example can be reached here: https://github.com/henrymonge/chroma_rp. +The repository for this example can be reached here: https://github.com/olcf/workflow-examples .. code-block:: console - $ git clone https://github.com/henrymonge/chroma_rp.git + $ git clone https://github.com/olcf/workflow-examples.git $ cd chroma_rp $ ./compile_chroma/build_stack.sh @@ -276,13 +276,9 @@ logic: ``python rp_analysis_SMT1.py``. Collecting the results ^^^^^^^^^^^^^^^^^^^^^^ -The ``bash ProcessPair.sh`` should process all the pair ``.log`` files and print out the estimated free energy of +Running ``bash ProcessPair.sh`` should process all the pair ``.log`` files and print out the estimated free energy of charging. -As a further check of the results, in the directory `Reference` estimates are provided from -simulations that exclusively used the CPU (on a MacBook Pro laptop). Running the ``ProcessPair.sh`` script -inside that Reference directory should print out the results obtained on the Mac. - Expected result ^^^^^^^^^^^^^^^ From a4704bde036d772ef50cea031b657742781c9262 Mon Sep 17 00:00:00 2001 From: Dietz <4tq@mac133153.ornl.gov> Date: Tue, 28 Oct 2025 15:51:15 -0400 Subject: [PATCH 3/4] Remove hyphenated line wrap --- software/workflows/rp.rst | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/software/workflows/rp.rst b/software/workflows/rp.rst index 4326ed23..9d6732c6 100644 --- a/software/workflows/rp.rst +++ b/software/workflows/rp.rst @@ -224,8 +224,8 @@ NAMD ---- This example calculates the free energy of charging an ion for illustrating the use of NAMD -within Radical Pilot. The example finds the electrostatic contribution to the free energy of charging a Na+ ion in wa- -ter. It will use the SPC/E water model and the Na+ parameters from Hummer et al.Hummer, Pratt, and García +within Radical Pilot. The example finds the electrostatic contribution to the free energy of charging a Na+ ion in +water. 
It will use the SPC/E water model and the Na+ parameters from Hummer et al.Hummer, Pratt, and García 1996. The accompanying archive collects all the work necessary for running the ensemble calculation. Simulation task From 717679e8833cc220e0cebdc89d08be518902735e Mon Sep 17 00:00:00 2001 From: Dietz <4tq@mac133153.ornl.gov> Date: Mon, 3 Nov 2025 15:48:02 -0500 Subject: [PATCH 4/4] Clarify a couple more things --- software/workflows/rp.rst | 7 ++++--- 1 file changed, 4 insertions(+), 3 deletions(-) diff --git a/software/workflows/rp.rst b/software/workflows/rp.rst index 9d6732c6..c75bd772 100644 --- a/software/workflows/rp.rst +++ b/software/workflows/rp.rst @@ -106,10 +106,11 @@ The repository for this example can be reached here: https://github.com/olcf/wor .. code-block:: console $ git clone https://github.com/olcf/workflow-examples.git - $ cd chroma_rp - $ ./compile_chroma/build_stack.sh + $ cd workflow-examples + $ cd chroma_rp/compile_chroma/ + $ ./build_stack.sh -The user will be required to edit the ``setup_tasks_example.py`` and ``chroma_rp.py`` to your desired workflow +The user will need to edit ``env.sh`` to match the current environment, and ``setup_tasks_example.py`` and ``chroma_rp.py`` to match the desired workflow as necessary. Then run: