BEGIN:VCALENDAR
VERSION:2.0
PRODID:Linklings LLC
BEGIN:VTIMEZONE
TZID:America/New_York
X-LIC-LOCATION:America/New_York
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:19700308T020000
RRULE:FREQ=YEARLY;BYMONTH=3;BYDAY=2SU
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:19701101T020000
RRULE:FREQ=YEARLY;BYMONTH=11;BYDAY=1SU
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTAMP:20210402T160555Z
LOCATION:Track 8
DTSTART;TZID=America/New_York:20201112T175000
DTEND;TZID=America/New_York:20201112T181500
UID:submissions.supercomputing.org_SC20_sess214_ws_lasalss106@linklings.co
 m
SUMMARY:Performance Analysis of a Quantum Monte Carlo Application on Multi
 ple Hardware Architectures Using the HPX Runtime
DESCRIPTION:Workshop\n\nPerformance Analysis of a Quantum Monte Carlo Appl
 ication on Multiple Hardware Architectures Using the HPX Runtime\n\nWei, C
 hatterjee, Huck, Hernandez, Kaiser\n\nThis paper describes how we successf
 ully used the HPX programming model to port the DCA++ application on multi
 ple architectures that include POWER9, x86, ARM v8, and NVIDIA GPUs. We de
 scribe the lessons we can learn from this experience as well as the benefi
 ts of enabling the HPX in the application to improve the CPU threading par
 t of the code, which led to an overall 21% improvement across architecture
 s. We also describe how we used HPX-APEX to raise the level of abstraction
  to understand performance issues and to identify tasking optimization opp
 ortunities in the code, and how these relate to CPU/GPU utilization counte
 rs, device memory allocation over time, and CPU kernel level context switc
 hes on a given architecture.\n\nTag: Algorithms, Extreme Scale Computing, 
 Performance/Productivity Measurement and Evaluation, Scalable Computing, S
 cientific Computing\n\nRegistration Category: Workshop Reg Pass
END:VEVENT
END:VCALENDAR

