FTXS – Introduction: Workshop on Fault-Tolerance for HPC at Extreme Scale
Towards Distributed Software Resilience in Asynchronous Many-Task Programming Models
FTXS – Break
From Tasks Graphs to Asynchronous Distributed Checkpointing with Local Restart
A Generic Strategy for Node-Failure Resilience for Certain Iterative Linear Algebra Methods
Back to Workshop Archive Listing
SELVEDAS: A Data and Compute as a Service Workflow Demonstrator Targeting Supercomputing Ecosystems
Lightning Talk – Jetstream2: Accelerating Science and Engineering on Demand
SuperCompCloud – Break
Performance Characteristics of Virtualized GPUs for Deep Learning
Leveraging Hybrid Cloud HPC in Multitier Reactive Programming
SuperCompCloud – Break
The "Geddes" Composable Platform – An Evolution of Community Clusters for a Composable World
Back to Workshop Archive Listing
MCHPC’20 – Introduction: Workshop on Memory Centric High-Performance Computing
MCHPC’20 – Keynote: The 3rd Wall and the Need for Innovation in Architectures
MCHPC’20 – Break
Persistent Memory Object Storage and Indexing for Scientific Computing
Performance Potential of Mixed Data Management Modes for Heterogeneous Memory Systems
MCHPC'20 – Break
Cache Oblivious Strategies to Exploit Multi-Level Memory on Manycore Systems
Understanding the Impact of Memory Access Patterns in Intel Processors
Back to Workshop Archive Listing
IA^3 2020 – Introduction: 10th Workshop on Irregular Applications: Architectures and Algorithms
IA^3 2020 – Keynote: Research Challenges in Compiler Technology for Sparse Tensors
IA^3 2020 – Break
Accelerating Domain Propagation: an Efficient GPU-Parallel Algorithm over Sparse Matrices
Reducing Queuing Impact in Irregular Data Streaming Applications
Supporting Irregularity in Throughput-Oriented Computing by SIMT-SIMD Integration
IA^3 2020 – Break
IA^3 2020 – Lunch Break
IA^3 2020 – Break
DistDGL: Distributed Graph Neural Network Training for Billion-Scale Graphs
Labeled Triangle Indexing for Efficiency Gains in Distributed Interactive Subgraph Search
Distributed Memory Graph Coloring Algorithms for Multiple GPUs
Performance Evaluation of the Vectorizable Binary Search Algorithms on an FPGA Platform
Back to Workshop Archive Listing
WORKS20 – Introduction: 15th Workshop on Workflows in Support of Large-Scale Science
WORKS20 – Keynote: In Situ Data Analytics for Next Generation Molecular Dynamics Workflows
WORKS20 – Break
WORKS20 – Break
Supercomputing with MPI Meets the CommonWorkflow Language Standards: An Experience Report
Applying Workflows to Scientific Projects Represented in File System Directory Tree
WORKS20 – Break
Enabling Discoverable Trusted Services for Highly Dynamic Decentralized Workflows
WORKS20 – Break
WorkflowHub: Community Framework for Enabling Scientific Workflow Research and Development
Characterizing Scientific Workflows on HPC Systems Using Logs
Back to Workshop Archive Listing
Introduction: First International Workshop on Quantum Computing Software
LEAP: Scaling Numerical Optimization Based Synthesis Using an Incremental Approach
XACC: A Service-Oriented Software Architecture for Quantum Computing
First International Workshop on Quantum Computing Software – Break
First International Workshop on Quantum Computing Software – Lunch
ArQTiC: A Full-Stack Software Package for Dynamic Simulations of Materials on Quantum Computers
First International Workshop on Quantum Computing Software - Break
Tensor Network Quantum Virtual Machine for Exascale Computing
QASMBench: An OpenQASM Benchmark Suite for NISQ Evaluation and Simulation
Using Pygsti for Quantum Processor Characterization and Benchmarking
Quantum Control Infrastructure Software for Lab and Cloud-Based Quantum Computers
First International Workshop on Quantum Computing Software – Concluding Remarks
Back to Workshop Archive Listing
HiPar20 – Introduction: Workshop on Hierarchical Parallelism for Exascale Computing
HiPar20 – Keynote 1: Exploiting Hierarchical Algorithms on Ever More Hierarchical Architectures
HiPar20 – Break
A Case Study and Characterization of a Many-Socket, Multi-Tier NUMA HPC Platform
HiPar20: Keynote 2: Single-Level Programming on Hierarchical Hardware via Adaptive Runtime? Maybe
HiPar20 – Break
HiPar20 – Break
HiPar20 – Invited Talk: Glow: a Machine Learning Compiler and Execution Engine
Back to Workshop Archive Listing
Keynote: Extreme Scale Programming with Novel Programming Methods
Achieving Computation-Communication Overlap with Overdecomposition on GPU Systems
ESPM2 – Break
Invited Talk: Automating Massively Parallel Heterogeneous Computing for Python Programmers
Deploying a Task-Based Runtime System on Raspberry Pi Clusters
Benesh: a Programming Model for Coupled Scientific Workflows
ESPM2 – Lunch Break
Invited Talk: A Parallel Execution Model for Exascale non von Neumann Memory-Centric Architectures
ESPM2 – Break
Panel Discussion: Moderated by Nectarios Koziris, National Technical University of Athens
Compiler Abstractions and Runtime for Extreme-Scale SAR and CFD Workloads
Back to Workshop Archive Listing
Seventh SC Workshop on Best Practices for HPC Training and Education – Introduction
Training for Researcher-Facing Cyberinfrastructure Professionals: The Virtual Residency
Best Practices for HPC Training and Education – Break
Transitioning Education and Training to a Virtual World – Lessons Learned
Bringing GPU Accelerated Computing and Deep Learning to the Classroom
HPC Internship Best Practices: The Summer Internships in Parallel Computational Sciences Program
XSEDE EMPOWER: Engaging Undergraduates in the Work of Advanced Digital Services and Resources
Exploring Remote Learning Methods for User Training in Research Computing
How the ECP Training Project is Helping the Entire HPC Community Prepare for Exascale Computing
Employing Directed Internship and Apprenticeship for Fostering HPC Training and Education
Inward and Outward Facing Best Practices for XSEDE's Extended Collaborative Support Service (ECSS)
A Collaborative Peer Review Process in Grading Coding Assignments for HPC
Best Practices for HPC Training and Education – Break
Best Practices for HPC Training and Education – Open Discussion
Back to Workshop Archive Listing
DLS – Introduction: The 5th Deep Learning on Supercomputers Workshop
DLS – Break
Online-Codistillation Meets LARS: Going beyond the Limit of Data Parallelism in Deep Learning
DLS – Lunch Break
Exploring the Limits of Concurrency in ML Training on Google TPUs
DLS – Break
Towards a Scalable and Distributed Infrastructure for Deep Learning Applications
DDLBench: Towards a Scalable Benchmarking Infrastructure for Distributed Deep Learning
DLS – Break
Vandermonde Wave Function Ansatz for Improved Variational Monte Carlo
TopiQAL: Topic-aware Question Answering Using Scalable Domain-Specific Supercomputers
DeepGalaxy: Deducing the Properties of Galaxy Mergers from Images Using Deep Neural Networks
Back to Workshop Archive Listing
Women in HPC – Introduction: Diversifying the HPC Community and Engaging Male Allies
WHPC’20 – Break
Panel – Making the Leap: Jumping into a Different Career Path
WHPC’20 – Lunch Break
STRUMPACK - High-Performance Scalable Software Library Based on Low-Rank Approximations
Smart-PGSim: Using Neural Network to Accelerate AC-OPFPower Grid Simulation
Probabilistic Volcanic Hazard Assessment within the Framework of the ChEESE Center of Excellence
Ensembles of Networks Produced from Neural Architecture Search
WHPC’20 – Break
Back to Workshop Archive Listing
HUST-20 – Introduction: 7th International Workshop on HPC User Support Tools
HUST-20 – Break
HUST-20 – Break
Back to Workshop Archive Listing
Correctness-Preserving Compression of Datasets and Neural Network Models
Order Matters: A Case Study on Reducing Floating Point Error in Sums through Ordering and Grouping
Correctness 2020 – Break
Enhancing DataRaceBench for Evaluating Data Race Detection Tools
PARCOACH Extension for Static MPI Nonblocking and Persistent Communication Validation
Toward Compiler-Aided Correctness Checking of Adjoint MPI Applications
Back to Workshop Archive Listing
XLOOP – Introduction: 2nd Annual Workshop on Extreme-Scale Experiment-in-the-Loop-Computing
Cross-Facility Science with the Superfacility Project at LBNL
Tomographic Reconstruction of Dynamic Features with Streaming Sliding Subsets
Toward an Automated HPC Pipeline for Processing Large Scale Electron Microscopy Data
XLOOP – Break
Toward Online Steering of Flame Spray Pyrolysis Nanoparticle Synthesis
Back to Workshop Archive Listing
RSE-HPC – Break
RSE-HPC – Break
Back to Workshop Archive Listing
Fifth International Parallel Data Systems Workshop – Introduction
PDSW 2020 - Keynote: "Sink or Swim: How Not to Drown in Colossal Streams of Data?"
Fifth International Parallel Data Systems Workshop – Break
Keeping It Real: Why HPC Data Services Don't Achieve I/O Microbenchmark Performance
Fifth International Parallel Data Systems Workshop – Break
Gauge: An Interactive Data-Driven Visualization Tool for HPC Application I/O Performance Analysis
Fractional-Overlap Declustered Parity: Evaluating Reliability for Storage Systems
Fifth International Parallel Data Systems Workshop – Break
Emulating I/O Behavior in Scientific Workflows on High Performance Computing Systems
Pangeo Benchmarking Analysis: Object Storage vs. POSIX File System
Fingerprinting the Checker Policies of Parallel File Systems
Fifth International Parallel Data Systems Workshop – Break
Scalable Communication and Data Persistence Layer for NVM-Based Storage Systems
Fifth International Parallel Data Systems Workshop – Closing Remarks
Back to Workshop Archive Listing
Fairness, Accountability, Transparency, and Ethics in Computer Vision
Machine Learning in HPC Environments – Break
EventGraD: Event-Triggered Communication in Parallel Stochastic Gradient Descent
Accelerating GPU-Based Machine Learning in Python Using MPI Library: A Case Study with MVAPICH2-GDR
Machine Learning in HPC Environments – Lunch Break
Accelerate Distributed Stochastic Gradient Descent for Nonconvex Optimization with Momentum
Machine Learning in HPC Environments – Break
High-Bypass Learning: Automated Detection of Tumor Cells that Significantly Impact Drug Response
Deep Generative Models that Solve PDEs: Distributed Computing for Training Large Data-Free Models
Back to Workshop Archive Listing
Performance Modeling of Streaming Kernels and Sparse Matrix-Vector Multiplication on A64FX
The Performance and Energy Efficiency Potential of FPGAs in Scientific Computing
PMBS20 – Break
Benchmarking Julia’s Communication Performance: Is Julia HPC Ready or Full HPC?
Evaluating the Performance of NVIDIA's A100 Ampere GPU for Sparse and Batched Computations
Exploiting the Potentials of the Second Generation SX-Aurora TSUBASA
Lightweight Measurement and Analysis of HPC Performance Variability
PMBS20 – Lunch Break
PMBS20 – Break
Developing Models for the Runtime of Programs with Exponential Runtime Behavior
Performance Tradeoffs in GPU Communication: A Study of Host and Device-Initiated Approaches
Back to Workshop Archive Listing
Personalized In Situ steering for Analysis and Visualization
Chimbuko: A Workflow-Level Scalable Performance Trace Analysis Tool
ISAV 2020 – Break
An Architecture for Interactive In Situ Visualization and Its Transparent Implementation in OpenFPM
JIT’s Complicated: A Comprehensive System for Derived Field Generation
ISAV 2020 – Break
ISAV 2020 – Break
Back to Workshop Archive Listing
Streaming Data – The Transformation of HPC Systems into Discovery Machines
The Square Kilometre Array and Exascale Challenges for Future Astronomy Facilities
Toward a Framework for Policy-Driven Adaptive In Situ Workflows
Combining Spatial and Temporal Properties for Improvements in Data Reduction
DRBSD-6 – Break
Dynamic, Adaptive Resource Management for Scientific Workflows
DRBSD-6 – Break
Intelligent Data Management for Extreme-Scales In-Situ Workflows
Data Compression with Deep Learning Based Generative Modeling
DRBSD-6 – Break
Machine learning for science with a deadline: a focus on the scientist
A Survey of Resource Constrained Scheduling for In Situ Analysis
Back to Workshop Archive Listing
INDIS – Introduction: Innovating the Network for Data-Intensive Science
INDIS – Keynote: "Grand" Challenges for a Science Mission Network
XNET Lightning Talk: SAGE: AI at the Edge for Software-Defined Wireless Sensors
XNET Lightning Talk: Extending the Research Engineering Network to the Wireless Edge
INDIS – Break
NRE Demo Talk I: P4 Experimental Networks for the Global Research Platform
NRE Demo Talk II: Advanced Data Algorithms and Architectures for Security Monitoring
Application Aware Software Defined Flows of Workflow Ensembles
INDIS – Break
Keynote: Enhancing Distributed Computing with Programmable and Open Optical Networks
A Trial Deployment of a Reliable Network-Multicast Application across Internet2
ROBIN (RuciO/BIgData Express/SENSE): A Next-Generation High-Performance Data Service Platform
The NetSage Measurement Framework: Design, Development, and Discoveries
INDIS – Break
An Evaluation of Ethernet Performance for Scientific Workloads
Computing Bottleneck Structures at Scale for High-Precision Network Performance Analysis
Back to Workshop Archive Listing
LLVM-HPC2020 – Introduction: The Sixth Workshop on the LLVM Compiler Infrastructure in HPC
LLVM-HPC2020 – Break
Static Neural Compiler Optimization via Deep Reinforcement Learning
Deep Learning-Based Approximate Graph-Coloring Algorithm for Register Allocation
LLVM-HPC2020 – Lunch Break
Extending the LLVM/Clang Framework for OpenMP Metadirective Support
Toward Automated Kernel Fusion for the Optimization of Scientific Applications
LLVM-HPC2020 – Break
Back to Workshop Archive Listing
ScalA – Keynote: Performance Evaluation of the Supercomputer "Fugaku" and A64FX Manycore Processor
An Integer Arithmetic-Based Sparse Linear Solver Using a GMRES Method and Iterative Refinement
ScalA – Break
ScalA – Keynote: High Performance Data Analytics and Some Applications
Two-Stage Asynchronous Iterative Solvers for Multi-GPU Clusters
Revisiting Exponential Integrator Methods for HPC with a Mini-Application
A Survey of Singular Value Decomposition Methods for Distributed Tall/Skinny Data
ScalA – Break
Keynote 3: ECP – Recent Experiences in Porting Complex Applications to Accelerator-Based Systems
Replacing Pivoting in Distributed Gaussian Elimination with Randomized Techniques
High-Order Finite Element Method Using Standard and Device-Level Batch GEMM on GPUs
ScalA – Break
A Fast Scalable Iterative Implicit Solver with Green's Function-Based Neural Networks
Implementation and Numerical Techniques for One Eflop/s HPL-AI Benchmark on Fugaku
Back to Workshop Archive Listing
CANOPIE-HPC – Break
Extending the Control Plane of Container Orchestrators for I/O Virtualization
CANOPIE-HPC – Break
Containers for Massive Ensemble of I/O Bound Hierarchical Coupled Simulations
Design Considerations for Building and Running Containerized MPI Applications
archspec: A Library for Detecting, Labeling, and Reasoning About Microarchitectures
CANOPIE-HPC – Break
Back to Workshop Archive Listing
ProTools – Introduction: Workshop on Programming and Performance Visualization Tools
OpenACC Profiling Support for Clang and LLVM Using Clacc and TAU
ProTools – Break
Simulation-Based Performance Prediction of HPC Applications: A Case Study of HPL
Back to Workshop Archive Listing
PAW-ATM - Introduction: The 3rd Annual Parallel Applications Workshop, Alternatives To MPI+X
PAW-ATM – Keynote: Performance Portability in the Age of Extreme Heterogeneity
PAW-ATM – Break
Hedgehog: Understandable Scheduler-Free Heterogeneous Asynchronous Multithreaded Data-Flow Graphs
TaskTorrent: A Lightweight Distributed Task-Based Runtime System in C++
PAW-ATM – Break
Task-Parallel In Situ Data Compression of Large-Scale Computational Fluid Dynamics Simulations
PAW-ATM – Break
HOOVER: Leveraging OpenSHMEM for High Performance, Flexible, Streaming Graph Applications
Back to Workshop Archive Listing
Case Study of TCP/IP Tunings for High Performance Interconnects
HPCSysPros20 – Break
Parallelized Data Replication of Multi-Petabyte Storage Systems
Log-Based Identification, Classification, and Behavior Prediction of HPC Applications
Back to Workshop Archive Listing
Scheduling across Multiple Applications Using Task-Based Programming Models
MENPS: A Decentralized Distributed Shared Memory Exploiting RDMA
RaDD Runtimes: Radical and Different Distributed Runtimes with SmartNICs
IPDRM – Break
MPI Meets Cloud: Case Study with Amazon EC2 and Microsoft Azure
Back to Workshop Archive Listing
WACCPD – Introduction: Seventh Workshop on Accelerator Programming Using Directives
WACCPD – Keynote: Achieving Performance Portability for Extreme Heterogeneity
ADELUS: A Performance-Portable Dense LU Solver for Distributed-Memory Hardware-Accelerated Systems
GPU Acceleration of the FINE/FR CFD Solver in a Heterogeneous Environment with OpenACC Directives
Performance and Portability of a Linear Solver Across Emerging Architectures
WACCPD – Invited Talk: Enabling Portable Directive-Based Programming at Exascale
WACCPD – Break
Performance Assessment of OpenMP Compilers Targeting NVIDIA V100 GPUs
Back to Workshop Archive Listing
Automatic Particle Trajectory Classification in Plasma Simulations
Reinforcement Learning-Based Solution to Power Grid Planning and Operation Under Uncertainties
Predictions of Steady and Unsteady Flows Using Machine-Learned Surrogate Models
Deep Learning-Based Low-Dose Tomography Reconstruction with Hybrid-Dose Measurements
Back to Workshop Archive Listing
ROSS – Introduction: 10th International Workshop on Runtime and Operating Systems for Supercomputers
ROSS – Break
A Dataflow-Graph Partitioning Method for Training Large Deep Learning Models
Improving Job Launch Rates in the TaPaSCo FPGA Middleware by Hardware/Software-Co-Design
Pinpoint the Joules: Unifying Runtime-Support for Energy Measurements on Heterogeneous Systems
ROSS – Break
Ch’i: Scaling Microkernel Capabilities in Cache-Incoherent Systems
ROSS – Break
Programming Model Developments Present Opportunities for Runtime and Operating Systems
Application-Driven Requirements for Node Resource Management in Next-Generation Systems
Locality-Aware Scheduling for Scalable Heterogeneous Environments
Back to Workshop Archive Listing
Scalable MPI Collectives Using SHARP: Large Scale Performance Evaluation on the TACC Frontera System
ExaMPI – Break
Design and Implementation Techniques for an MPI-Oriented AMT Runtime
Integrating Inter-Node Communication with a Resilient Asynchronous Many-Task Runtime System
Back to Workshop Archive Listing
PyHPC – Introduction: 9th Workshop on Python for High-Performance and Scientific Computing
PyHPC – Break
Experiences in Developing a Distributed Agent-Based Modeling Toolkit with Python
PyHPC – Break
PyHPC – Break
Enabling System Wide Shared Memory for Performance Improvement in PyCOMPSs Applications
PyHPC – Break
Accelerating Microstructural Analytics with Dask for Volumetric X-Ray Imaging
Distributed Asynchronous Array Computing with the JetLag Environment
PyHPC – Break
Validating Oil Spill Dispersion Models Against Real-World Observations Using the GeoPandas Library
Back to Workshop Archive Listing
CAFCW20 – Introduction: Sixth Computational Approaches for Cancer Workshop
Scalable Human Pharmacokinetics Property Prediction for Cancer Drug Discovery at ATOM
CAFCW20 – Break
An Efficient, Data-Driven Approach To Model Specific Cancer Cell Lines
CAFCW20 – Lunch Break
Machine Learning Driven Importance Sampling Approach for Multiscale Simulations
A Metapath Approach to Predicting Drug Response in Cancer Cell Lines
Toward a Data-Driven System for Personalized Cervical Cancer Screening
Integration of Domain Knowledge Using Medical Knowledge Graph Deep Learning for Cancer Phenotyping
CAFCW20 – Break
Back to Workshop Archive Listing
Programming Reconfigurable Heterogeneous Computing Clusters Using MPI with Transpilation
Exploring the Acceleration of Nekbone on Reconfigurable Architectures
H2RC – Break
H2RC – Lunch Break
H2RC – Break
H2RC – Invited Talk: FPGA programming and the oneAPI industry initiative
H2RC – Invited Talk: AIgean: An Open Framework for Machine Learning on a Heterogeneous Cluster
Back to Workshop Archive Listing
P3HPC – Introduction: 3rd International Workshop on Performance Portability and Productivity
Tracking Performance Portability on the Yellow Brick Road to Exascale
Performance Portability of Molecular Docking Miniapp on Leadership Computing Platforms
P3HPC – Break
Interpreting and Visualizing Performance Portability Metrics
P3HPC – Break
P3HPC – Break
Cross-Platform Performance Portability of DNN Models Using SYCL
Evaluating the Performance and Portability of Contemporary SYCL Implementations
Back to Workshop Archive Listing
EduHPC – Introduction: Workshop on Education for High-Performance Computing
Toward Generic Parallel Programming in Computer Science Education with Kokkos
Extending FreeCompilerCamp.org as an Online Self-Learning Platform for Compiler Development
Integrating Machine Learning with HPC-Driven Simulations for Enhanced Student Learning
EduHPC – Break
EduHPC – Keynote: Parallel and Distributed Computing Infusion in K-14
EduHPC – Lunch Break
Trying to Do It All in a Single Course: A Surprisingly Good Idea.
Applying Parallel and Distributed Computing Curriculum to Cyber Security Courses
Teaching Software Sustainability for High Performance Computing at ATPESC
EduHPC – Break
Evolving the Traditional Student Cluster Competition as Tomorrow’s “Peachy Assignments”
Broadening Participation via Computer Systems Genome Research Group
A Masters Degree Course in Computational Engineering at University of Warsaw
Back to Workshop Archive Listing
Keynote: Urgent high performance computing for real-time pandemic response and decision making
Urgent HPC – Break
Rapid Processing of Astronomical Data for the Dark Energy Spectroscopic Instrument
A Bespoke Workflow Management System for Data-Driven Urgent HPC
Fast Tsunami Simulations for a Real-Time Emergency Response Flow
Benchmarking Micro-Core Architectures for Detecting Disasters at the Edge
Urgent HPC – Break
VESTEC - Visual Exploration and Sampling Toolkit for Extreme Computing
Studies of Leveraging HPC for Workloads with Real-Time Time Constraints
Integrated Micro-Scale Modeling and Urban Environment Service Cyberinfrastructure for Smart Cities
Earthquake Early Warning (EEW) System: A Case for Urgent Analytics across the Computing Continuum
SciStream: Architecture and Toolkit for Data Streaming between Federated Science Instruments
Computer Aided Diagnostic Tools for COVID-19 Detection via X-Ray Imaging
Globus Services for Data-Intensive Experimental Research Automation
Back to Workshop Archive Listing
InteractiveHPC – Introduction: Fourth Workshop on Interactive High-Performance Computing
Keynote: Rapid-response Data Analytics for COVID-19 Using GPUs
Toward an HPC Service-Oriented Hybrid Cloud Architecture Designed for Interactive Workflows
InteractiveHPC – Break
Accelerating Fusion Energy Experimental Workflows Using HPC Resources
Toward Interactive, Reproducible Analytics at Scale on HPC Systems
InteractiveHPC – Break
Back to Workshop Archive Listing
MLCS – Introduction: 2nd Workshop on Machine Learning for Computing Systems
Early Prediction of High-Performance Computing Job Outcomes via Modeling System Text Logs
Explainable Machine Learning Frameworks for Managing HPC Systems
MLCS – Break
Scheduling in Data Centers Running on Renewable Energy with Deep Reinforcement Learning
Investigating the Efficacy of Unstructured Text Analysis for Node Failure Detection in Syslog
Back to Workshop Archive Listing