Programming Massively Parallel Processors: A Hands-on Approach
Autor Wen-Mei W. Hwu, David B. Kirk, Izzat El Hajjen Limba Engleză Paperback – 23 sep 2022
- Parallel Patterns Introduces new chapters on frequently used parallel patterns (stencil, reduction, sorting) and major improvements to previous chapters (convolution, histogram, sparse matrices, graph traversal, deep learning)
- Ampere Includes a new chapter focused on GPU architecture and draws examples from recent architecture generations, including Ampere
- Systematic Approach Incorporates major improvements to abstract discussions of problem decomposition strategies and performance considerations, with a new optimization checklist
Preț: 388.01 lei
Preț vechi: 485.01 lei
-20% Nou
Puncte Express: 582
Preț estimativ în valută:
74.26€ • 77.13$ • 61.68£
74.26€ • 77.13$ • 61.68£
Carte în stoc
Livrare din stoc 24 decembrie
Preluare comenzi: 021 569.72.76
Specificații
ISBN-13: 9780323912310
ISBN-10: 0323912311
Pagini: 580
Ilustrații: Approx. 340 illustrations
Dimensiuni: 191 x 235 x 23 mm
Greutate: 0.94 kg
Ediția:4
Editura: ELSEVIER SCIENCE
ISBN-10: 0323912311
Pagini: 580
Ilustrații: Approx. 340 illustrations
Dimensiuni: 191 x 235 x 23 mm
Greutate: 0.94 kg
Ediția:4
Editura: ELSEVIER SCIENCE
Cuprins
1 Introduction
Part I Fundamental Concepts
2 Heterogeneous data parallel computing
3 Multidimensional grids and data
4 Compute architecture and scheduling
5 Memory architecture and data locality
6 Performance considerations
Part II Parallel Patterns
7 Convolution: An introduction to constant memory and caching
8 Stencil
9 Parallel histogram
10 Reduction And minimizing divergence
11 Prefix sum (scan)
12 Merge: An introduction to dynamic input data identification
Part III Advanced patterns and applications
13 Sorting
14 Sparse matrix computation
15 Graph traversal
16 Deep learning
17 Iterative magnetic resonance imaging reconstruction
18 Electrostatic potential map
19 Parallel programming and computational thinking
Part IV Advanced Practices
20 Programming a heterogeneous computing cluster: An introduction to CUDA streams
21 CUDA dynamic parallelism
22 Advanced practices and future evolution
23 Conclusion and outlook
Appendix A: Numerical considerations
Part I Fundamental Concepts
2 Heterogeneous data parallel computing
3 Multidimensional grids and data
4 Compute architecture and scheduling
5 Memory architecture and data locality
6 Performance considerations
Part II Parallel Patterns
7 Convolution: An introduction to constant memory and caching
8 Stencil
9 Parallel histogram
10 Reduction And minimizing divergence
11 Prefix sum (scan)
12 Merge: An introduction to dynamic input data identification
Part III Advanced patterns and applications
13 Sorting
14 Sparse matrix computation
15 Graph traversal
16 Deep learning
17 Iterative magnetic resonance imaging reconstruction
18 Electrostatic potential map
19 Parallel programming and computational thinking
Part IV Advanced Practices
20 Programming a heterogeneous computing cluster: An introduction to CUDA streams
21 CUDA dynamic parallelism
22 Advanced practices and future evolution
23 Conclusion and outlook
Appendix A: Numerical considerations