Software Productivity and Sustainability
a track presented at
Argonne Training Program on Extreme-Scale Computing
on 8:30 am - 9:30 pm CDT (UTC-5) Friday 4 August 2023
Presenters: Anshu Dubey (Argonne National Laboratory), David E. Bernholdt (Oak Ridge National Laboratory), Greg Becker (Lawrence Livermore National Laboratory), and Jared O'Neal (Argonne National Laboratory)
This page provides detailed information specific to the tutorial event above. Expect updates to this page up to, and perhaps shortly after, the date of the tutorial. Pages for other tutorial events can be accessed from the main page of this site.
Quick Links
- Presentation Slides (FigShare)
On this Page
- Description
- Agenda
- Presentation Slides
- How to Participate
- Stay in Touch
- Resources from Presentations
- Requested Citation
- Acknowledgements
Description
The BSSw tutorial focuses on issues of developer productivity, software sustainability, and reproducibility in scientific research software, particularly targeting high-performance computers.
Agenda
Time (CDT) | Title | Presenter |
---|---|---|
8:30 AM | Introduction | David E. Bernholdt (ORNL) |
8:35 AM | Motivation and Overview of Best Practices in HPC Software Development | David E. Bernholdt (ORNL) |
9:15 AM | Scientific Software Design | Anshu Dubey (ANL) |
10:00 AM | Break | |
10:30 AM | Spack: Package Management for HPC | Greg Becker (LLNL) |
11:30 AM | Spack Hands-On | Greg Becker (LLNL) |
12:30 PM | Lunch | |
1:30 PM | Software Testing and Verification | Anshu Dubey (ANL) |
2:30 PM | Refactoring Scientific Software | Anshu Dubey (ANL) |
3:00 PM | Break | |
3:30 PM | Lab Notebooks for Computational Mathematics, Sciences, & Engineering | Jared O'Neal (ANL) |
4:30 PM | Managing Computational Experiments | Anshu Dubey (ANL) |
4:45 PM | Improving Reproducibility Through Better Software Practices | David E. Bernholdt (ORNL) |
5:45 PM | Summary | David E. Bernholdt (ORNL) |
6:00 PM | Q&A, including your software experiences and challenges | All |
6:30 PM | Adjourn |
Presentation Slides
The latest version of the slides will always be available at https://doi.org/10.6084/m9.figshare.23823822.
Note that these files may include additional slides that will not be discussed during the tutorial, but questions are welcome.
How to Participate
-
We want to interact with you! We find these tutorials most interesting and informative (for everyone) if you ask questions and share experiences! We learn too!
-
Please raise your hand at any time to ask a question
Stay in Touch
-
After the tutorial please feel free to email questions or feedback to the BSSw tutorial team at bssw-tutorial@lists.mcs.anl.gov.
-
To find out about future events organized by the IDEAS Productivity Project, you can subscribe to our mailing list (usually ~2 messages/month).
-
For monthly updates on the Better Scientific Software site, subscribe to our monthly digest.
Resources from Presentations
Links from the tutorial presentations are listed here for convenience
- Module 1: Introduction
- Module 2: Motivation and Overview of Best Practices in HPC Software Development
- COVID-19 epidemiology saga
- https://doi.org/10.25561/77482
- https://www.nicholaslewis.org/imperial-college-uk-covid-19-numbers-dont-seem-to-add-up/
- https://www.nature.com/articles/d41586-020-01003-6
- https://www.foxnews.com/world/imperial-college-britain-coronavirus-lockdown-buggy-mess-unreliable
- https://www.telegraph.co.uk/technology/2020/05/16/coding-led-lockdown-totally-unreliable-buggy-mess-say-experts/
- https://github.com/mrc-ide/covid-sim/
- https://philbull.wordpress.com/2020/05/10/why-you-can-ignore-reviews-of-scientific-code-by-commercial-software-developers/amp/
- http://doi.org/10.5281/zenodo.3865491
- Best Practices for Scientific Computing
- Good Enough Practices in Scientific Computing
- Linux Foundation Core Infrastructure Initiative (CII) Best Practices Badging Program
- Rate Your Project Assesment Tool
- Progress Tracking Card (PTC) Examples
- Productivity and Sustainability Improvement Planning
- Better Scientific Software (BSSw)
- COVID-19 epidemiology saga
- Module 3: Scientific Software Design
- Module 4: Spack: Package Management for HPC
- Spack Website
- Spack Slack
- Spack Tutorial
- Spack Documentation
- Spack on GitHub
- Extreme Scale Scientific Software Stack (E4S)
- Spack v0.20.0 Release Notes
- Other package management tools
- Spack Twitter
- Module 5: Software Testing and Verification
- CI introduction
- Useful resources on testing (formerly linked to
ideas-productivity.org/resources/howtos/) - Lcov (formerly linked to
ltp.sourceforge.net/coverage/lcov.php) - Additional resources:
- Module 6: Refactoring Scientific Software
- Refactoring example code repository: https://github.com/bssw-tutorial/hello-numerical-world
- Module 7: Lab Notebooks for Computational Mathematics, Sciences, & Engineering
- European Southern Observatory
- HPC and the Lab Manager
- What All Codes Should Do (ATPESC 2019)
- Writing the Laboratory Notebook
- DIKW pyramid
- How to pick an electronic notebook
- Resources – Execution Environments
- BSSw article by Jean Shuler
- Popper by Ivo Jimenez
- Code Ocean
- Weight & Biases
- Multiphase Simulations
- FlashKit
- Computational Lab Environment Example
- Module 8: Managing Computational Experiments
- Dubey A, Calder AC, Daley C, et al. Pragmatic optimizations for better scientific utilization of large supercomputers. The International Journal of High Performance Computing Applications. 2013;27(3):360-373. doi:10.1177/1094342012464404
- Wilfred F. van Gunsteren, and Alan E. Mark. Validation of molecular dynamics simulation. J. Chem. Phys. 108(15), 6109-6116 (1998). doi:10.1063/1.476021
- Module 9: Improving Reproducibility Through Better Software Practices
- Toward a Compatible Reproducibility Taxonomy for Computational and Computing Sciences
- Reproducibility and Replicability in Science
- Many Psychology Findings Not As Strong As Claimed
- The War Over Supercooled Water
- Researchers find bug in Python Script may have affected hundreds of studies
- National Science Foundation Data Management Plan Requirements
- Findable, Accessible, Interoperable, Re-usable
- FAIR Data Principles US
- SC23 Reproducibility Initiative
- ACM Transactions on Mathematical Software (TOMS)
- ACM Artifact Review and Badging
- http://fursin.net/reproducibility.html
- National Information Standards Organization (NISO) on Reproducibility and Badging
- Helpful Tools
- Floating Point Analysis Tools
- Code Ocean (Cloud platforms - publish and reproduce research code and data)
- DOIs and hosting of data, code, documents:
- Other Resources:
- The FAIR Guiding Principles for Scientific Data Management and Stewardship. Mark D. Wilkinson, et al. 2016
- FAIR4RS (previously linked to
www.rd-alliance.org/groups/fair-research-software-fair4rs-wg) - Editorial: ACM TOMS Replicated Computational Results Initiative. Michael A. Heroux. 2015
- Enhancing Reproducibility for Computational Methods
- Simple experiments in reproducibility and technical trust by Mike Heroux and students (work in progress)
- What every scientist should know about floating-point arithmetic. David Goldberg.
- Module 10: Summary
Requested Citation
The requested citation the overall tutorial is:
Anshu Dubey, David E. Bernholdt, Greg Becker, and Jared O’Neal, Software Productivity and Sustainability track, in Argonne Training Program on Extreme-Scale Computing, St. Charles, Illinois, 2023. DOI: 10.6084/m9.figshare.23823822.
Individual modules may be cited as Speaker, Module Title, in Software Productivity and Sustainability track…
Acknowledgements
This tutorial is produced by the IDEAS Productivity project.
This work was supported by the U.S. Department of Energy Office of Science, Office of Advanced Scientific Computing Research (ASCR), and by the Exascale Computing Project (17-SC-20-SC), a collaborative effort of the U.S. Department of Energy Office of Science and the National Nuclear Security Administration.