OpenX { X = MP/ACC } Directives
.
├── AccelerationToMulitcore
│ ├── 1.CPUOpenmp
│ ├── 2.SIMD
│ └── 3.AffinityOpenMP
├── AccelerationToGU
│ ├── GPUoffOpenACC
│ └── GPUoffOpenMP
├── Benchmarks
│ ├── OpenACC_ganga_reduction_collapse
│ ├── OpenACC_kernels_data_optimized
│ ├── OpenAcc_reduction_collapse
│ ├── OpenMP_reduction_collapse
│ ├── OpenMP_reduction_collapse_teams_gang
│ └── openmpPerformance
├── benchmarkfig1.html
└── benchmarkfig1.png
- OpenMP Directives
- Single instruction multiple data (SIMD)
- OpenACC offloading
  - Data movement
  - Loop parallelism
- OpenMP offloading
  - Target construct
  - The map clause
  - Target teams construct
  - The nowait clause
  - Distribute construct
- Benchmark
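
As a rough illustration of the OpenMP offloading topics listed above (the target and teams constructs, the distribute construct, and the map clause), here is a minimal sketch in C. It is not taken from the repository; the function names `saxpy_offload` and `dot_offload` are hypothetical and chosen only for this example. It assumes a compiler with OpenMP 4.5 or later; when built without OpenMP support, or when no device is available, the loops simply run on the host.

```c
/* Hypothetical example, not part of the repository sources. */

/* y = a * x + y. The map clauses copy x to the device and
 * copy y to the device and back to the host afterwards. */
void saxpy_offload(int n, float a, const float *x, float *y)
{
    #pragma omp target teams distribute parallel for \
        map(to: x[0:n]) map(tofrom: y[0:n])
    for (int i = 0; i < n; ++i) {
        y[i] = a * x[i] + y[i];
    }
}

/* Dot product with a reduction across teams and threads.
 * sum is mapped tofrom so the reduced value returns to the host. */
double dot_offload(int n, const double *x, const double *y)
{
    double sum = 0.0;
    #pragma omp target teams distribute parallel for \
        map(to: x[0:n], y[0:n]) map(tofrom: sum) reduction(+: sum)
    for (int i = 0; i < n; ++i) {
        sum += x[i] * y[i];
    }
    return sum;
}
```

Both functions can be called like ordinary C functions from host code. The combined `target teams distribute parallel for` form is one common way to spread a loop over teams and the threads within each team; the individual constructs can also be nested separately.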
Thanks to:
- Tim Mattson (Intel)
- Tom Deakin (University of Bristol)
- Michael Klemm (AMD)
- Jeff Larkin (Nvidia)
- Manuel Arenaz (NERSC)
- Luca and Sergio (CINECA)
- and many others

OpenMP 5.0.1 specification and examples: https://www.openmp.org/resources/