Log In
Or create an account ->
Imperial Library
Home
About
News
Upload
Forum
Help
Login/SignUp
Index
High Performance Parallelism Pearls
Cover image
Title page
Table of Contents
Copyright
Contributors
Acknowledgments
Foreword
Preface
Chapter 1: Introduction
Chapter 2: From “Correct” to “Correct & Efficient”: A Hydro2D Case Study with Godunov’s Scheme
Chapter 3: Better Concurrency and SIMD on HBM
Chapter 4: Optimizing for Reacting Navier-Stokes Equations
Chapter 5: Plesiochronous Phasing Barriers
Chapter 6: Parallel Evaluation of Fault Tree Expressions
Chapter 7: Deep-Learning Numerical Optimization
Chapter 8: Optimizing Gather/Scatter Patterns
Chapter 9: A Many-Core Implementation of the Direct N-Body Problem
Chapter 10: N-Body Methods
Chapter 11: Dynamic Load Balancing Using OpenMP 4.0
Chapter 12: Concurrent Kernel Offloading
Chapter 13: Heterogeneous Computing with MPI
Chapter 14: Power Analysis on the Intel® Xeon Phi™ Coprocessor
Chapter 15: Integrating Intel Xeon Phi Coprocessors into a Cluster Environment
Chapter 16: Supporting Cluster File Systems on Intel® Xeon Phi™ Coprocessors
Chapter 17: NWChem: Quantum Chemistry Simulations at Scale
Chapter 18: Efficient Nested Parallelism on Large-Scale Systems
Chapter 19: Performance Optimization of Black-Scholes Pricing
Chapter 20: Data Transfer Using the Intel COI Library
Chapter 21: High-Performance Ray Tracing
Chapter 22: Portable Performance with OpenCL
Chapter 23: Characterization and Optimization Methodology Applied to Stencil Computations
Chapter 24: Profiling-Guided Optimization
Chapter 25: Heterogeneous MPI application optimization with ITAC
Chapter 26: Scalable Out-of-Core Solvers on a Cluster
Chapter 27: Sparse Matrix-Vector Multiplication: Parallelization and Vectorization
Chapter 28: Morton Order Improves Performance
Author Index
Subject Index
← Prev
Back
Next →
← Prev
Back
Next →