Workshop Session

FTXS: Workshop on Fault-Tolerance for HPC at Extreme Scale

FTXS – Break

From Task Graphs to Asynchronous Distributed Checkpointing with Local Restart

A Generic Strategy for Node-Failure Resilience for Certain Iterative Linear Algebra Methods

Checkpointing OpenSHMEM Programs Using Compiler Analysis

SuperCompCloud: 3rd International Workshop on Interoperability of Supercomputing and Cloud Technologies

SuperCompCloud – Introduction: 3rd International Workshop on Interoperability of Supercomputing and Cloud Technologies

SuperCompCloud – Welcome

Exosphere - Bringing The Cloud Closer

SELVEDAS: A Data and Compute as a Service Workflow Demonstrator Targeting Supercomputing Ecosystems

Lightning Talk – Jetstream2: Accelerating Science and Engineering on Demand

SuperCompCloud – Break

Performance Characteristics of Virtualized GPUs for Deep Learning

FirecREST: a RESTful API to HPC Systems

Leveraging Hybrid Cloud HPC in Multitier Reactive Programming

SuperCompCloud – Break

The "Geddes" Composable Platform – An Evolution of Community Clusters for a Composable World

Archival Data Repository Services to Enable HPC and Cloud Workflows in a Federated Research e-Infrastructure

SuperCompCloud – Closing Remarks and ISC21 Invite

MCHPC’20: Workshop on Memory Centric High-Performance Computing

MCHPC’20 – Introduction: Workshop on Memory Centric High-Performance Computing

MCHPC’20 – Keynote: The 3rd Wall and the Need for Innovation in Architectures

MCHPC’20 – Break

Session I - Exploiting Heterogeneous Memory

Persistent Memory Object Storage and Indexing for Scientific Computing

Performance Potential of Mixed Data Management Modes for Heterogeneous Memory Systems

Leveraging a Heterogeneous Memory System for a Legacy Fortran Code: The Interplay of Storage Class Memory, DRAM, and OS

Architecting Heterogeneous Memory Systems with DRAM Technology Only: A Case Study on Relational Database

MCHPC’20 – Break

Session II - Cache Impacts and Optimizations

Hostile Cache Implications for Small, Dense Linear Solves

Cache Oblivious Strategies to Exploit Multi-Level Memory on Manycore Systems

Understanding the Impact of Memory Access Patterns in Intel Processors

MCHPC’20 – Closing Remarks

IA^3 2020: 10th Workshop on Irregular Applications: Architectures and Algorithms

IA^3 2020 – Introduction: 10th Workshop on Irregular Applications: Architectures and Algorithms

IA^3 2020 – Keynote: Research Challenges in Compiler Technology for Sparse Tensors

IA^3 2020 – Break

Accelerating Domain Propagation: an Efficient GPU-Parallel Algorithm over Sparse Matrices

Parallelizing Irregular Computations for Molecular Docking

Reducing Queuing Impact in Irregular Data Streaming Applications

Supporting Irregularity in Throughput-Oriented Computing by SIMT-SIMD Integration

IA^3 2020 – Paper Session – Q/A

IA^3 2020 – Break

IA^3 2020 – Panel

IA^3 2020 – Lunch Break

IA^3 2020 – Keynote: Memory Performance Optimization

IA^3 2020 – Break

DistDGL: Distributed Graph Neural Network Training for Billion-Scale Graphs

Labeled Triangle Indexing for Efficiency Gains in Distributed Interactive Subgraph Search

Distributed Memory Graph Coloring Algorithms for Multiple GPUs

Performance Evaluation of the Vectorizable Binary Search Algorithms on an FPGA Platform

IA^3 2020 – Paper Session: Q/A

IA^3 2020 – Thank You and Closing

WORKS20: 15th Workshop on Workflows in Support of Large-Scale Science

WORKS20 – Introduction: 15th Workshop on Workflows in Support of Large-Scale Science

WORKS20 – Introductory Remarks

WORKS20 – Keynote: In Situ Data Analytics for Next Generation Molecular Dynamics Workflows

WORKS20 – Break

Runtime vs Scheduler: Analyzing Dask's Overheads

Workflow Generation with wfGenes

WORKS20 – Break

Supercomputing with MPI Meets the Common Workflow Language Standards: An Experience Report

Applying Workflows to Scientific Projects Represented in File System Directory Tree

WORKS20 – Break

Adaptive Optimizations for Stream-Based Workflows

Enabling Discoverable Trusted Services for Highly Dynamic Decentralized Workflows

WORKS20 – Break

WorkflowHub: Community Framework for Enabling Scientific Workflow Research and Development

Characterizing Scientific Workflows on HPC Systems Using Logs

WORKS20 – Closing Remarks

First International Workshop on Quantum Computing Software

Introduction: First International Workshop on Quantum Computing Software

Quantum Algorithms for Quantum Software

LEAP: Scaling Numerical Optimization Based Synthesis Using an Incremental Approach

XACC: A Service-Oriented Software Architecture for Quantum Computing

First International Workshop on Quantum Computing Software – Break

Visualizing High-Level Quantum Programs

Logic Formulas as Program Abstractions for Quantum Circuits: A Case Study in Noisy Variational Algorithm Simulation

Large-Scale Parallel Tensor Network Quantum Simulator

QubiC - Qubits Control Systems at LBNL

First International Workshop on Quantum Computing Software – Lunch

JuQBox: A Quantum Optimal Control Toolbox In Julia

ArQTiC: A Full-Stack Software Package for Dynamic Simulations of Materials on Quantum Computers

A QUBO Formulation for Qubit Allocation

First International Workshop on Quantum Computing Software – Break

Tensor Network Quantum Virtual Machine for Exascale Computing

QASMBench: An OpenQASM Benchmark Suite for NISQ Evaluation and Simulation

Using pyGSTi for Quantum Processor Characterization and Benchmarking

Quantum Control Infrastructure Software for Lab and Cloud-Based Quantum Computers

First International Workshop on Quantum Computing Software – Concluding Remarks

HiPar20: Workshop on Hierarchical Parallelism for Exascale Computing

HiPar20 – Introduction: Workshop on Hierarchical Parallelism for Exascale Computing

HiPar20 – Keynote 1: Exploiting Hierarchical Algorithms on Ever More Hierarchical Architectures

HiPar20 – Break

A Case Study and Characterization of a Many-Socket, Multi-Tier NUMA HPC Platform

Introducing Multi-Level Parallelism, at Coarse, Fine, and Instruction Level to Enhance the Performance of Iterative Solvers for Large Sparse Linear Systems on Multi- and Many-Core Architecture

Using Hierarchical Parallelism to Accelerate the Solution of Many Small Partial Differential Equations

Flexible Runtime Reconfigurable Computing Overlay Architecture and Optimization for Dataflow Applications

HiPar20 – Keynote 2: Single-Level Programming on Hierarchical Hardware via Adaptive Runtime? Maybe

HiPar20 – Break

HiPar20 – Panel Session

HiPar20 – Break

HiPar20 – Invited Talk: A Portable Asynchronous Tasking Approach to Hierarchical Parallelism - Successes, Challenges, and Future Prospects

HiPar20 – Invited Talk: Glow: a Machine Learning Compiler and Execution Engine

HiPar20 – Closing Remarks

ESPM2 2020: Fifth International Workshop on Extreme Scale Programming Models and Middleware

Introduction: ESPM2 2020: Fifth International Workshop on Extreme Scale Programming Models and Middleware

Keynote: Extreme Scale Programming with Novel Programming Methods

Achieving Computation-Communication Overlap with Overdecomposition on GPU Systems

ESPM2 – Break

Invited Talk: Automating Massively Parallel Heterogeneous Computing for Python Programmers

Deploying a Task-Based Runtime System on Raspberry Pi Clusters

Benesh: a Programming Model for Coupled Scientific Workflows

ESPM2 – Lunch Break

Invited Talk: A Parallel Execution Model for Exascale non von Neumann Memory-Centric Architectures

Invited Talk: Programming Modern GPU Supercomputers

ESPM2 – Break

Panel Discussion: Moderated by Nectarios Koziris, National Technical University of Athens

Refining Fortran Failed Images

Compiler Abstractions and Runtime for Extreme-Scale SAR and CFD Workloads

The Template Task Graph (TTG) – an Emerging Practical Dataflow Programming Paradigm for Scientific Simulation at Extreme Scale

ESPM2 – Closing Remarks

Seventh SC Workshop on Best Practices for HPC Training and Education

Seventh SC Workshop on Best Practices for HPC Training and Education – Introduction

DeapSECURE Computational Training for Cybersecurity Students: Improvements, Mid-Stage Evaluation and Lessons Learned

High-Performance Computing Course Development for Cultivating the System-Level Comprehensive Capability

Training for Researcher-Facing Cyberinfrastructure Professionals: The Virtual Residency

Best Practices for HPC Training and Education – Break

Transitioning Education and Training to a Virtual World – Lessons Learned

Bringing GPU Accelerated Computing and Deep Learning to the Classroom

Pawsey Training Goes Remote: Experiences and Best Practices

Diversity, Community Building, and Virtual Events

HPC Internship Best Practices: The Summer Internships in Parallel Computational Sciences Program

Best Practices for HPC Training and Education – Lunch

XSEDE EMPOWER: Engaging Undergraduates in the Work of Advanced Digital Services and Resources

Exploring Remote Learning Methods for User Training in Research Computing

How the ECP Training Project is Helping the Entire HPC Community Prepare for Exascale Computing

Employing Directed Internship and Apprenticeship for Fostering HPC Training and Education

Inward and Outward Facing Best Practices for XSEDE's Extended Collaborative Support Service (ECSS)

A Collaborative Peer Review Process in Grading Coding Assignments for HPC

Best Practices for HPC Training and Education – Break

Best Practices for Virtual HPC Education and Training

Best Practices for HPC Training and Education – Open Discussion

The 5th Deep Learning on Supercomputers Workshop

DLS – Introduction: The 5th Deep Learning on Supercomputers Workshop

AI for Science: AI + HPC

DLS – Break

Online-Codistillation Meets LARS: Going beyond the Limit of Data Parallelism in Deep Learning

DLS – Lunch Break

Exploring the Limits of Concurrency in ML Training on Google TPUs

DLS – Break

Time-Based Roofline for Deep Learning Performance Analysis

Towards a Scalable and Distributed Infrastructure for Deep Learning Applications

DDLBench: Towards a Scalable Benchmarking Infrastructure for Distributed Deep Learning

DLS – Break

Vandermonde Wave Function Ansatz for Improved Variational Monte Carlo

TopiQAL: Topic-aware Question Answering Using Scalable Domain-Specific Supercomputers

DeepGalaxy: Deducing the Properties of Galaxy Mergers from Images Using Deep Neural Networks

Women in HPC: Diversifying the HPC Community and Engaging Male Allies

Women in HPC – Introduction: Diversifying the HPC Community and Engaging Male Allies

WHPC’20 – Keynote 1 - Afterward

WHPC’20 – Break

How to Get Your Idea Funded (or, Playing the Long Game)

Navigating Your Way to the Job or Promotion You Want

Always Celebrate your Achievements: A Strategic Approach to Advancing Yourself, Your Career, Your Colleagues and Your Organization

Panel – Making the Leap: Jumping into a Different Career Path

WHPC’20 – Lunch Break

WHPC’20 – Keynote 2 - Everything is a Growth Opportunity

WHPC’20 – Lightning Talks

Molecular Design Using GraphINVENT

STRUMPACK - High-Performance Scalable Software Library Based on Low-Rank Approximations

A Machine Learning Classifier of Damaging Earthquakes as a Microservice in the Urgent Computing Workflow

Smart-PGSim: Using Neural Network to Accelerate AC-OPF Power Grid Simulation

From Wet-Lab Scientist to Data-Driven Computation: Utilizing HPC to Tackle Disparities in Healthcare and a Call for HPC Education

Probabilistic Volcanic Hazard Assessment within the Framework of the ChEESE Center of Excellence

Toward Modular Supercomputing: Resource Disaggregation and Virtualization by Network-Attached Accelerators

Ensembles of Networks Produced from Neural Architecture Search

High-Performance Sparse Tensor Algebra Compiler

WHPC’20 – Break

Mentoring and Peer Evaluation

Raising Awareness about Inclusivity at Workplace

Navigating Change and Transition at Work and in Personal

WHPC’20 – Closing Remarks

Q&A with Lightning Talk Presenters

HUST-20: 7th International Workshop on HPC User Support Tools

HUST-20 – Introduction: 7th International Workshop on HPC User Support Tools

HPC Software Tracking Strategies for a Diverse Workload

Automation of NERSC Application Usage Report

HUST-20 – Break

HUST-20 – Panel

HUST-20 – Break

Integrating Science Gateways with Secure Cloud Computing Resources: An Examination of Two Deployment Patterns and Their Requirements

Demystifying Python Package Installation with Conda-Env-Mod

HUST-20 – Conclusion

Correctness 2020: 4th International Workshop on Software Correctness for HPC Applications

Correctness 2020 – Introduction: 4th International Workshop on Software Correctness for HPC Applications

Reproducible Scientific Computing: Progress and Challenges

Correctness-Preserving Compression of Datasets and Neural Network Models

Order Matters: A Case Study on Reducing Floating Point Error in Sums through Ordering and Grouping

Correctness 2020 – Break

Enhancing DataRaceBench for Evaluating Data Race Detection Tools

PARCOACH Extension for Static MPI Nonblocking and Persistent Communication Validation

Toward Compiler-Aided Correctness Checking of Adjoint MPI Applications

A Statistical Analysis of Error in MPI Reduction Operations

XLOOP 2020: 2nd Annual Workshop on Extreme-Scale Experiment-in-the-Loop-Computing

XLOOP – Introduction: 2nd Annual Workshop on Extreme-Scale Experiment-in-the-Loop-Computing

XLOOP – Keynote - Challenges and Opportunities for Composable AI-Integrated Applications at the Digital Continuum

Cross-Facility Science with the Superfacility Project at LBNL

Tomographic Reconstruction of Dynamic Features with Streaming Sliding Subsets

Toward an Automated HPC Pipeline for Processing Large Scale Electron Microscopy Data

XLOOP – Break

Managing Event-Oriented Workflows

Interactive Parallel Workflows for Synchrotron Tomography

Toward Online Steering of Flame Spray Pyrolysis Nanoparticle Synthesis

XLOOP – Panel Discussion

XLOOP – Concluding Comments

RSE-HPC 2020: Research Software Engineers in HPC – Creating Community, Building Careers, Addressing Challenges

RSE-HPC – Introduction: Research Software Engineers in HPC: Creating Community, Building Careers, Addressing Challenges

RSE-HPC – Break

RSE-HPC – Panel: Building RSE Teams and Groups

RSE-HPC – Break

RSE-HPC – Panel: Supporting RSE Careers

Fifth International Parallel Data Systems Workshop

Fifth International Parallel Data Systems Workshop – Introduction

PDSW 2020 - Keynote: "Sink or Swim: How Not to Drown in Colossal Streams of Data?"

Fifth International Parallel Data Systems Workshop – Break

Keeping It Real: Why HPC Data Services Don't Achieve I/O Microbenchmark Performance

Toward On-Demand I/O Forwarding in HPC Platforms

Fifth International Parallel Data Systems Workshop – Break

Gauge: An Interactive Data-Driven Visualization Tool for HPC Application I/O Performance Analysis

Fractional-Overlap Declustered Parity: Evaluating Reliability for Storage Systems

GPU Direct I/O with HDF5

Fifth International Parallel Data Systems Workshop – Break

Emulating I/O Behavior in Scientific Workflows on High Performance Computing Systems

Pangeo Benchmarking Analysis: Object Storage vs. POSIX File System

Fingerprinting the Checker Policies of Parallel File Systems

Fifth International Parallel Data Systems Workshop – Break

Deriving Storage Insights from the IO500

I/O Traces of HPC Applications

Scalable Communication and Data Persistence Layer for NVM-Based Storage Systems

Q&A with Work-in-Progress Speakers

Fifth International Parallel Data Systems Workshop – Closing Remarks

Machine Learning in HPC Environments

Introduction: Machine Learning in HPC Environments

Fairness, Accountability, Transparency, and Ethics in Computer Vision

Machine Learning in HPC Environments – Break

EventGraD: Event-Triggered Communication in Parallel Stochastic Gradient Descent

A Benders Decomposition Approach to Correlation Clustering

Accelerating GPU-Based Machine Learning in Python Using MPI Library: A Case Study with MVAPICH2-GDR

Machine Learning in HPC Environments – Lunch Break

Keynote: Michael Garland - Programming Systems of Data

Accelerate Distributed Stochastic Gradient Descent for Nonconvex Optimization with Momentum

Machine Learning in HPC Environments – Break

High-Bypass Learning: Automated Detection of Tumor Cells that Significantly Impact Drug Response

Deep Generative Models that Solve PDEs: Distributed Computing for Training Large Data-Free Models

Machine Learning in HPC Environments – Concluding Remarks

PMBS20: The 11th International Workshop on Performance Modeling, Benchmarking and Simulation of High-Performance Computer Systems

PMBS20 – Introduction: The 11th International Workshop on Performance Modeling, Benchmarking, and Simulation of High-Performance Computer Systems

Performance Modeling of Streaming Kernels and Sparse Matrix-Vector Multiplication on A64FX

The Performance and Energy Efficiency Potential of FPGAs in Scientific Computing

PMBS20 – Break

Benchmarking Julia’s Communication Performance: Is Julia HPC Ready or Full HPC?

Evaluating the Performance of NVIDIA's A100 Ampere GPU for Sparse and Batched Computations

Exploiting the Potentials of the Second Generation SX-Aurora TSUBASA

Lightweight Measurement and Analysis of HPC Performance Variability

PMBS20 – Lunch Break

Autotuning PolyBench Benchmarks with LLVM Clang/Polly Loop Optimization Pragmas Using Bayesian Optimization

Warwick Data Store: A Data Structure Abstraction Library

Accelerating High-Order Stencils on GPUs

PMBS20 – Break

Developing Models for the Runtime of Programs with Exponential Runtime Behavior

Performance Tradeoffs in GPU Communication: A Study of Host and Device-Initiated Approaches

Evaluation of the Communication Motif for a Distributed Eigensolver Using the SST Network Simulation Tool

PMBS20 – Wrapup

ISAV 2020: In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization

ISAV 2020 – Introduction: In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization

ISAV 2020 – Opening

ISAV 2020 – Best Paper Award

Personalized In Situ Steering for Analysis and Visualization

In Situ and Post-Processing Volume Rendering with Cinema

Chimbuko: A Workflow-Level Scalable Performance Trace Analysis Tool

ISAV 2020 – Break

An Architecture for Interactive In Situ Visualization and Its Transparent Implementation in OpenFPM

JIT’s Complicated: A Comprehensive System for Derived Field Generation

In Situ Temporal Caching

ISAV 2020 – Break

Keynote: Uses of In Situ/In Transit Methods in Large-Scale Modeling of Plasma-Based Particle Accelerators

The Challenges of In Situ Analysis for Multiple Simulations

Benchmarking In Situ Triggers Via Reconstruction Error

ISAV 2020 – Break

ISAV 2020 – Panel: In Situ – Experiences from the Trenches

ISAV 2020 – Closing Remarks

DRBSD-6: The 6th International Workshop on Data Analysis and Reduction for Big Scientific Data

DRBSD-6 – Introduction: The 6th International Workshop on Data Analysis and Reduction for Big Scientific Data

DRBSD-6 – Welcome and Introduction

Streaming Data – The Transformation of HPC Systems into Discovery Machines

The Square Kilometre Array and Exascale Challenges for Future Astronomy Facilities

Toward a Framework for Policy-Driven Adaptive In Situ Workflows

Combining Spatial and Temporal Properties for Improvements in Data Reduction

DRBSD-6 – Break

Invited Talk: Sanjay Ranka

Dynamic, Adaptive Resource Management for Scientific Workflows

DRBSD-6 – Break

AI for Science: Some Big Data Challenges

Intelligent Data Management for Extreme-Scales In-Situ Workflows

Data Compression with Deep Learning Based Generative Modeling

DRBSD-6 – Break

Data Analytics for Scientific Data Compression

Machine Learning for Science with a Deadline: A Focus on the Scientist

A Survey of Resource Constrained Scheduling for In Situ Analysis

DRBSD-6 – Closing Remarks

INDIS 2020: The 7th International Workshop on Innovating the Network for Data-Intensive Science

INDIS – Introduction: Innovating the Network for Data-Intensive Science

INDIS – Welcome Message

A Brief History of INDIS

INDIS – Keynote: "Grand" Challenges for a Science Mission Network

Introduction to SCinet

SCinet Architecture: Past, Present, Future

Panel Introduction: Experimental Networks (XNET)

XNET Lightning Talk: The BRIDGES Project – Building a Global Cyber-Infrastructure Canvas Supporting Networked Applications Experimentation and Evolution

XNET Lightning Talk: FABRIC/FAB Deep Dive

XNET Lightning Talk: SAGE: AI at the Edge for Software-Defined Wireless Sensors

XNET Lightning Talk: Quantum Networking

XNET Lightning Talk: Extending the Research Engineering Network to the Wireless Edge

Panel Discussion: Experimental Networks (XNET)

INDIS – Break

Network Research Exhibition: An Introduction

NRE Demo Talk I: P4 Experimental Networks for the Global Research Platform

NRE Demo Talk II: Advanced Data Algorithms and Architectures for Security Monitoring

Using P4 and RDMA to Collect Telemetry Data

Application Aware Software Defined Flows of Workflow Ensembles

INDIS – Break

Keynote: Enhancing Distributed Computing with Programmable and Open Optical Networks

A Trial Deployment of a Reliable Network-Multicast Application across Internet2

ROBIN (RuciO/BIgData Express/SENSE): A Next-Generation High-Performance Data Service Platform

The NetSage Measurement Framework: Design, Development, and Discoveries

INDIS – Break

AI for Networking: the Engineering Perspective

An Evaluation of Ethernet Performance for Scientific Workloads

Computing Bottleneck Structures at Scale for High-Precision Network Performance Analysis

INDIS – Closing Remarks

INDIS – End of Day

LLVM-HPC2020: The Sixth Workshop on the LLVM Compiler Infrastructure in HPC

LLVM-HPC2020 – Introduction: The Sixth Workshop on the LLVM Compiler Infrastructure in HPC

LLVM-HPC2020 – Welcome

LLVM-HPC2020 – Keynote

LLVM-HPC2020 – Break

Static Neural Compiler Optimization via Deep Reinforcement Learning

Autotuning Search Space for Loop Transformations

Deep Learning-Based Approximate Graph-Coloring Algorithm for Register Allocation

LLVM-HPC2020 – Lunch Break

Extending the LLVM/Clang Framework for OpenMP Metadirective Support

Toward Automated Kernel Fusion for the Optimization of Scientific Applications

Robust Practical Binary Optimization at Run-Time Using LLVM

LLVM-HPC2020 – Break

Really Embedding Domain-Specific Languages into C++

LLVM-HPC2020 – Panel

LLVM-HPC2020 – Closing Remarks

11th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems

ScalA – Introduction: 11th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems

ScalA – Keynote: Performance Evaluation of the Supercomputer "Fugaku" and A64FX Manycore Processor

An Integer Arithmetic-Based Sparse Linear Solver Using a GMRES Method and Iterative Refinement

ScalA – Break

ScalA – Keynote: High Performance Data Analytics and Some Applications

Two-Stage Asynchronous Iterative Solvers for Multi-GPU Clusters

Revisiting Exponential Integrator Methods for HPC with a Mini-Application

A Survey of Singular Value Decomposition Methods for Distributed Tall/Skinny Data

ScalA – Break

Keynote 3: ECP – Recent Experiences in Porting Complex Applications to Accelerator-Based Systems

Replacing Pivoting in Distributed Gaussian Elimination with Randomized Techniques

Recursive Basic Linear Algebra Operations on TensorCore GPU

High-Order Finite Element Method Using Standard and Device-Level Batch GEMM on GPUs

ScalA – Break

A Fast Scalable Iterative Implicit Solver with Green's Function-Based Neural Networks

Implementation and Numerical Techniques for One Eflop/s HPL-AI Benchmark on Fugaku

Performance Analysis of a Quantum Monte Carlo Application on Multiple Hardware Architectures Using the HPX Runtime

ScalA – Closing

CANOPIE-HPC: Containers and New Orchestration Paradigms for Isolated Environments in HPC

CANOPIE-HPC – Introduction: Containers and New Orchestration Paradigms for Isolated Environments in HPC

CANOPIE-HPC – Keynote

CANOPIE-HPC – Break

Extending the Control Plane of Container Orchestrators for I/O Virtualization

Enabling Seamless Execution of Computational and Data Science Workflows on HPC and Cloud with the Popper Container-Native Automation Engine

The Role of Containers in Reproducibility

CANOPIE-HPC – Session 1 Discussion

CANOPIE-HPC – Break

Containers for Massive Ensemble of I/O Bound Hierarchical Coupled Simulations

Design Considerations for Building and Running Containerized MPI Applications

archspec: A Library for Detecting, Labeling, and Reasoning About Microarchitectures

CANOPIE-HPC – Session 2 Discussion

CANOPIE-HPC – Break

CANOPIE-HPC – Lightning Talks

CANOPIE-HPC – Panel Discussion

CANOPIE-HPC – Final Remarks

ProTools 2020: Workshop on Programming and Performance Visualization Tools

ProTools – Introduction: Workshop on Programming and Performance Visualization Tools

OpenACC Profiling Support for Clang and LLVM Using Clacc and TAU

Usability and Performance Improvements in Hatchet

ProTools – Break

Exascale Potholes for HPC: Execution Performance and Variability Analysis of the Flagship Application Code HemeLB

Empirical Modeling of Spatially Diverging Performance

Simulation-Based Performance Prediction of HPC Applications: A Case Study of HPL

PAW-ATM 2020: The 3rd Annual Parallel Applications Workshop, Alternatives To MPI+X

PAW-ATM – Introduction: The 3rd Annual Parallel Applications Workshop, Alternatives To MPI+X

PAW-ATM – Keynote: Performance Portability in the Age of Extreme Heterogeneity

PAW-ATM – Break

Hedgehog: Understandable Scheduler-Free Heterogeneous Asynchronous Multithreaded Data-Flow Graphs

TaskTorrent: A Lightweight Distributed Task-Based Runtime System in C++

PAW-ATM – Break

Evaluation of Multiple HPC Parallelization Frameworks in a Shallow Water Mini Application with Multi-Rate Local Time Stepping

Task-Parallel In Situ Data Compression of Large-Scale Computational Fluid Dynamics Simulations

An Implicitly Parallel Meshfree Solver in Regent

PAW-ATM – Break

HOOVER: Leveraging OpenSHMEM for High Performance, Flexible, Streaming Graph Applications

Exploring Hybrid MPI+Kokkos Tasks Programming Model

HPCSYSPROS20

HPCSYSPROS20 – Introduction

HPCSYSPROS20 – Keynote: Riken

Site Report: NVIDIA

Site Report: MARCC

Site Report: NREL

Site Report: INL

Setup and Management of a Small National Computational Facility: What We’ve Learned the First 10 Years

Case Study of TCP/IP Tunings for High Performance Interconnects

NGC Container Environment Modules

HPCSYSPROS20 – Break

Application Performance in the Frontera Acceptance Process

Parallelized Data Replication of Multi-Petabyte Storage Systems

Log-Based Identification, Classification, and Behavior Prediction of HPC Applications

Modernizing the HPC System Software Stack

Cluster Management

Traxler Family Award for Community Service

HPCSYSPROS20 – Closing Remarks

Fourth International Workshop on Emerging Parallel and Distributed Runtime Systems and Middleware

IPDRM – Introduction: Fourth International Workshop on Emerging Parallel and Distributed Runtime Systems and Middleware

IPDRM – Keynote

Scheduling across Multiple Applications Using Task-Based Programming Models

MENPS: A Decentralized Distributed Shared Memory Exploiting RDMA

RaDD Runtimes: Radical and Different Distributed Runtimes with SmartNICs

IPDRM – Break

DEMAC – A Modular Platform for HW-SW Co-Design

CODIR: Toward an MLIR Codelet Model Dialect

MPI Meets Cloud: Case Study with Amazon EC2 and Microsoft Azure

IPDRM – Conclusions and Questions

WACCPD 2020: Seventh Workshop on Accelerator Programming Using Directives

WACCPD – Introduction: Seventh Workshop on Accelerator Programming Using Directives

WACCPD – Keynote: Achieving Performance Portability for Extreme Heterogeneity

ADELUS: A Performance-Portable Dense LU Solver for Distributed-Memory Hardware-Accelerated Systems

GPU Acceleration of the FINE/FR CFD Solver in a Heterogeneous Environment with OpenACC Directives

Performance and Portability of a Linear Solver Across Emerging Architectures

WACCPD – Invited Talk: Enabling Portable Directive-Based Programming at Exascale

WACCPD – Break

Evaluating Performance Portability of OpenMP for SNAP on NVIDIA, Intel, and AMD GPUs Using the Roofline Methodology

Performance Assessment of OpenMP Compilers Targeting NVIDIA V100 GPUs

WACCPD – Closing Remarks

AI4S: Workshop on Artificial Intelligence and Machine Learning for Scientific Applications

AI4S – Introduction: Workshop on Artificial Intelligence and Machine Learning for Scientific Applications

AI4S – Keynote

Automatic Particle Trajectory Classification in Plasma Simulations

Reinforcement Learning-Based Solution to Power Grid Planning and Operation Under Uncertainties

Predictions of Steady and Unsteady Flows Using Machine-Learned Surrogate Models

Deep Learning-Based Low-Dose Tomography Reconstruction with Hybrid-Dose Measurements

How Good Is Your Scientific Data Generative Model?

AI4S – Panel

ROSS 2020: 10th International Workshop on Runtime and Operating Systems for Supercomputers

ROSS – Introduction: 10th International Workshop on Runtime and Operating Systems for Supercomputers

The Challenges of Hybrid Computing

ROSS – Break

A Dataflow-Graph Partitioning Method for Training Large Deep Learning Models

Improving Job Launch Rates in the TaPaSCo FPGA Middleware by Hardware/Software-Co-Design

Pinpoint the Joules: Unifying Runtime-Support for Energy Measurements on Heterogeneous Systems

ROSS – Break

Data-Centric Resource Management for Complex Memory Fabrics

Ch’i: Scaling Microkernel Capabilities in Cache-Incoherent Systems

ROSS – Break

Towards Generalizable Models of I/O Throughput

Programming Model Developments Present Opportunities for Runtime and Operating Systems

Application-Driven Requirements for Node Resource Management in Next-Generation Systems

Locality-Aware Scheduling for Scalable Heterogeneous Environments

ROSS – Closing Remarks

ExaMPI: Workshop on Exascale MPI

ExaMPI – Introduction: Workshop on Exascale MPI

ExaMPI – Keynote: Jim Dinan

Challenges of GPU-Aware Communication in MPI

Scalable MPI Collectives Using SHARP: Large Scale Performance Evaluation on the TACC Frontera System

Implementing Flexible Threading Support in Open MPI

ExaMPI – Break

ExaMPI – Keynote: Streaming Messages – A Distributed Memory Programming Model for Reconfigurable Hardware

Design and Implementation Techniques for an MPI-Oriented AMT Runtime

Integrating Inter-Node Communication with a Resilient Asynchronous Many-Task Runtime System

Extending the MPI Stages Model of Fault Tolerance

ExaMPI – Closing

PyHPC 2020: 9th Workshop on Python for High-Performance and Scientific Computing

PyHPC – Introduction: 9th Workshop on Python for High-Performance and Scientific Computing

PyHPC – Keynote: Reprising the Zen of Python for HPC

PyHPC – Break

PyHPC – Session 1: Introduction

Experiences in Developing a Distributed Agent-Based Modeling Toolkit with Python

Data Engineering for HPC with Python

PyHPC – Break

PyHPC – Invited Keynote: MPI for Python

PyHPC – Break

PyHPC – Session 2: Introduction

Enabling System Wide Shared Memory for Performance Improvement in PyCOMPSs Applications

Python Workflows on HPC Systems

PyHPC – Break

PyHPC – Session 3: Introduction

Accelerating Microstructural Analytics with Dask for Volumetric X-Ray Imaging

Distributed Asynchronous Array Computing with the JetLag Environment

PyHPC – Break

PyHPC – Lightning Talks: Introduction

DaCe Python Frontend

Linear Algebraic Graphs Algorithms in Python

Fil: A Python Memory Profiler for Scientific Computing

Validating Oil Spill Dispersion Models Against Real-World Observations Using the GeoPandas Library

CAFCW20: Sixth Computational Approaches for Cancer Workshop

CAFCW20 – Introduction: Sixth Computational Approaches for Cancer Workshop

Keynote: Data Science Initiatives at the National Cancer Institute, Dr. Norman "Ned" Sharpless, National Cancer Institute Director with Introduction by Sean E. Hanlon, PhD, National Cancer Institute

CAFCW20 – Panel: HPC, Cancer, and COVID-19

Scalable Human Pharmacokinetics Property Prediction for Cancer Drug Discovery at ATOM

CAFCW20 – Break

Scaffold-Induced Molecular Subgraphs (SIMSG): Effective Graph Sampling Methods for High-Throughput Computational Drug Discovery

Causal Deconvolution of a Mechanistic Model of EGFR and ERK Signaling Explains Adaptive and Genetic Resistance in Melanoma

An Efficient, Data-Driven Approach To Model Specific Cancer Cell Lines

Deep Learning Based Prediction of the Temporal Behavior of RAS Protein Conformations on Simulated Cell Membrane Surfaces

CAFCW20 – Panel: Digital Twins for Cancer Care

CAFCW20 – Lunch Break

CAFCW20 – Afternoon Welcome

Machine Learning Driven Importance Sampling Approach for Multiscale Simulations

A Metapath Approach to Predicting Drug Response in Cancer Cell Lines

Toward a Data-Driven System for Personalized Cervical Cancer Screening

Integration of Domain Knowledge Using Medical Knowledge Graph Deep Learning for Cancer Phenotyping

CAFCW20 – Break

Deciphering Hallmarks of Resistance in Breast Cancer

Why I’m Not Answering: Understanding Determinants of Classification of an Abstaining Classifier for Cancer Pathology Reports

CAFCW20 – Panel: Translating Cancer Research Advances in Artificial Intelligence into Clinical Practice

CAFCW20 – Wrapup

H2RC 2020: Sixth International Workshop on Heterogeneous High-Performance Reconfigurable Computing

H2RC – Introduction: Sixth International Workshop on Heterogeneous High-Performance Reconfigurable Computing

Programming Reconfigurable Heterogeneous Computing Clusters Using MPI with Transpilation

Evaluating FPGA Accelerator Performance with a Parameterized OpenCL Adaptation of Selected Benchmarks of the HPC Challenge Benchmark Suite

Exploring the Acceleration of Nekbone on Reconfigurable Architectures

H2RC – Break

H2RC – Invited Talk: Fast, Scalable Quantized Neural Network Inference on FPGAs with FINN and LogicNets

H2RC – Keynote: Rapid System Level Design and Evaluation of Near Memory Fixed Function Units: A Reconfigurable Computing Application

H2RC – Lunch Break

FPGA Acceleration of Fluid-Flow Kernels

H2RC – Invited Talk: FPGA Fabric is Eating the World

FPGAs-as-a-Service Toolkit (FaaST)

H2RC – Break

H2RC – Invited Talk: FPGA programming and the oneAPI industry initiative

H2RC – Invited Talk: AIgean: An Open Framework for Machine Learning on a Heterogeneous Cluster

OpenCL-Enabled Parallel Raytracing for Astrophysical Application on Multiple FPGAs with Optical Links

H2RC – Closing Remarks

P3HPC: 3rd International Workshop on Performance Portability and Productivity

P3HPC – Introduction: 3rd International Workshop on Performance Portability and Productivity

P3HPC – Keynote: Andrew Siegel

Tracking Performance Portability on the Yellow Brick Road to Exascale

Performance Portability of Molecular Docking Miniapp on Leadership Computing Platforms

P3HPC – Break

P3HPC – Forum Recap

OpenMP State of The Standard by Tom Scogland

Interpreting and Visualizing Performance Portability Metrics

P3HPC – Break

Findings from the ECP Performance Portability Panel Series

Mini-Panel with ECP Representatives

Panel: Trends that Make Performance, Portability, and Productivity Either More Challenging or Tractable

P3HPC – Break

Cross-Platform Performance Portability of DNN Models Using SYCL

Evaluating the Performance and Portability of Contemporary SYCL Implementations

P3HPC – Wrapup

EduHPC20: Workshop on Education for High-Performance Computing

EduHPC – Introduction: Workshop on Education for High-Performance Computing

Toward Generic Parallel Programming in Computer Science Education with Kokkos

Extending FreeCompilerCamp.org as an Online Self-Learning Platform for Compiler Development

Integrating Machine Learning with HPC-Driven Simulations for Enhanced Student Learning

EduHPC – Break

EduHPC – Peachy Parallel Assignments

EduHPC – Keynote: Parallel and Distributed Computing Infusion in K-14

CDER Curriculum Update

EduHPC – Lunch Break

Trying to Do It All in a Single Course: A Surprisingly Good Idea

Applying Parallel and Distributed Computing Curriculum to Cyber Security Courses

Teaching Software Sustainability for High Performance Computing at ATPESC

EduHPC – Break

EduHPC – Lightning Talks

EduWRENCH: Simulation-Driven Pedagogic Modules

Evolving the Traditional Student Cluster Competition as Tomorrow’s “Peachy Assignments”

Broadening Participation via Computer Systems Genome Research Group

A Masters Degree Course in Computational Engineering at University of Warsaw

EduHPC – Discussion

EduHPC – Final Remarks

Urgent HPC: HPC for Urgent Decision Making

Urgent HPC – Introduction: HPC for Urgent Decision Making

Urgent HPC – Introduction and Welcome

Keynote: Urgent high performance computing for real-time pandemic response and decision making

Urgent HPC – Break

Rapid Processing of Astronomical Data for the Dark Energy Spectroscopic Instrument

A Bespoke Workflow Management System for Data-Driven Urgent HPC

Fast Tsunami Simulations for a Real-Time Emergency Response Flow

Benchmarking Micro-Core Architectures for Detecting Disasters at the Edge

Urgent HPC – Break

VESTEC - Visual Exploration and Sampling Toolkit for Extreme Computing

Studies of Leveraging HPC for Workloads with Real-Time Time Constraints

Integrated Micro-Scale Modeling and Urban Environment Service Cyberinfrastructure for Smart Cities

Urgent Data Analysis: Tracking Tornados in Real Time

Urgent Supercomputing of Earthquakes in the ChEESE Project

Earthquake Early Warning (EEW) System: A Case for Urgent Analytics across the Computing Continuum

SciStream: Architecture and Toolkit for Data Streaming between Federated Science Instruments

A Parallel Job Scheduling Method to Effectively Use Shared Heterogeneous Systems for Urgent Computations

Computer Aided Diagnostic Tools for COVID-19 Detection via X-Ray Imaging

The LEXIS Project: A Federated HPC, Cloud, and Big Data Infrastructure with Urgent and Real-Time Pilot Workflows

Globus Services for Data-Intensive Experimental Research Automation

Urgent HPC – Lightning Talk Q&A

Urgent HPC – Summary and Wrapup

Fourth Workshop on Interactive High-Performance Computing

InteractiveHPC – Introduction: Fourth Workshop on Interactive High-Performance Computing

Keynote: Rapid-response Data Analytics for COVID-19 Using GPUs

Toward an HPC Service-Oriented Hybrid Cloud Architecture Designed for Interactive Workflows

InteractiveHPC – Break

Accelerating Fusion Energy Experimental Workflows Using HPC Resources

Toward Interactive, Reproducible Analytics at Scale on HPC Systems

InteractiveHPC – Break

InteractiveHPC – Panel

InteractiveHPC – Wrapup

2nd Workshop on Machine Learning for Computing Systems

MLCS – Introduction: 2nd Workshop on Machine Learning for Computing Systems

MLCS – Keynote

A Year of Automated Anomaly Detection in a Datacenter

Early Prediction of High-Performance Computing Job Outcomes via Modeling System Text Logs

Explainable Machine Learning Frameworks for Managing HPC Systems

MLCS – Break

Scheduling in Data Centers Running on Renewable Energy with Deep Reinforcement Learning

Investigating the Efficacy of Unstructured Text Analysis for Node Failure Detection in Syslog

MLCS – Panel

SC Workshop Archives

FTXS: Workshop on Fault-Tolerance for HPC at Extreme Scale

SuperCompCloud: 3rd International Workshop on Interoperability of Supercomputing and Cloud Technologies

MCHPC’20: Workshop on Memory Centric High-Performance Computing

IA^3 2020: 10th Workshop on Irregular Applications: Architectures and Algorithms

WORKS20: 15th Workshop on Workflows in Support of Large-Scale Science

First International Workshop on Quantum Computing Software

HiPar20: Workshop on Hierarchical Parallelism for Exascale Computing

ESPM2 2020: Fifth International Workshop on Extreme Scale Programming Models and Middleware

Seventh SC Workshop on Best Practices for HPC Training and Education

The 5th Deep Learning on Supercomputers Workshop

Women in HPC: Diversifying the HPC Community and Engaging Male Allies

HUST-20: 7th International Workshop on HPC User Support Tools

Correctness 2020: 4th International Workshop on Software Correctness for HPC Applications

XLOOP 2020: 2nd Annual Workshop on Extreme-Scale Experiment-in-the-Loop-Computing

RSE-HPC 2020: Research Software Engineers in HPC – Creating Community, Building Careers, Addressing Challenges

Fifth International Parallel Data Systems Workshop

Machine Learning in HPC Environments

PMBS20: The 11th International Workshop on Performance Modeling, Benchmarking and Simulation of High-Performance Computer Systems

ISAV 2020: In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization

DRBSD-6: The 6th International Workshop on Data Analysis and Reduction for Big Scientific Data

INDIS 2020: The 7th International Workshop on Innovating the Network for Data-Intensive Science

LLVM-HPC2020: The Sixth Workshop on the LLVM Compiler Infrastructure in HPC

11th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems

CANOPIE-HPC: Containers and New Orchestration Paradigms for Isolated Environments in HPC

ProTools 2020: Workshop on Programming and Performance Visualization Tools

PAW-ATM 2020: The 3rd Annual Parallel Applications Workshop, Alternatives To MPI+X

HPCSYSPROS20

Fourth International Workshop on Emerging Parallel and Distributed Runtime Systems and Middleware

WACCPD 2020: Seventh Workshop on Accelerator Programming Using Directives

AI4S: Workshop on Artificial Intelligence and Machine Learning for Scientific Applications

ROSS 2020: 10th International Workshop on Runtime and Operating Systems for Supercomputers

ExaMPI: Workshop on Exascale MPI

PyHPC 2020: 9th Workshop on Python for High-Performance and Scientific Computing

CAFCW20: Sixth Computational Approaches for Cancer Workshop

H2RC 2020: Sixth International Workshop on Heterogeneous High-Performance Reconfigurable Computing

P3HPC: 3rd International Workshop on Performance Portability and Productivity

EduHPC20: Workshop on Education for High-Performance Computing

Urgent HPC: HPC for Urgent Decision Making

Fourth Workshop on Interactive High-Performance Computing

2nd Workshop on Machine Learning for Computing Systems