Part I Linear Algebra
1 Accelerating Numerical Dense Linear Algebra Calculations
with GPUs .................................................... 3
Jack Dongarra, Mark Gates, Azzam Haidar, Jakub Kurzak,
Piotr Luszczek, Stanimire Tomov, and Ichitaro Yamazaki
2 A Guide for Implementing Tridiagonal Solvers on GPUs ........ 29
Li-Wen Chang and Wen-mei W. Hwu
3 Batch Matrix Exponentiation ................................. 45
M. Graham Lopez and Mitchel D. Horton
4 Efficient Batch LU and QR Decomposition on GPU .............. 69
William J. Brouwer and Pierre-Yves Taunay
5 A Flexible CUDA LU-Based Solver for Small, Batched
Linear Systems .............................................. 87
Antonino Tumeo, Nitin Gawande, and Oreste Villa
6 Sparse Matrix-Vector Product ............................... 103
Zbigniew Koza, Maciej Matyka, Łukasz Mirosław, and Jakub
Poła
Part II Differential Equations
7 Solving Ordinary Differential Equations on GPUs ............ 125
Karsten Ahnert, Denis Demidov, and Mario Mulansky
8 GPU-Based Parallel Integration of Large Numbers
of Independent ODE Systems ................................. 159
Kyle E. Niemeyer and Chih-Jen Sung
9 Finite and Spectral Element Methods on Unstructured
Grids for Flow and Wave Propagation Problems ............... 183
Dominik Göddeke, Dimitri Komatitsch, and Matthias Möller
10 A GPU Implementation for Solving the Convection
Diffusion Equation Using the Local Modified SOR Method ..... 207
Yiannis Cotronis, Elias Konstantinidis, and Nikolaos
M. Missirlis
11 Finite-Difference in Time-Domain Scalable Implementations
on CUDA and OpenCL ......................................... 223
Lidia Kuan, Pedro Tomás, and Leonel Sousa
Part III. Random Numbers and Monte Carlo Methods
12 Pseudorandom Numbers Generation for Monte Carlo
Simulations on GPUs: OpenCL Approach ....................... 245
Vadim Demchik
13 Monte Carlo Automatic Integration with Dynamic
Parallelism in CUDA ........................................ 273
Elise de Doncker, John Kapenga, and Rida Assaf
14 GPU: Accelerated Computation Routines for Quantum
Trajectories Method ........................................ 299
Joanna Wiśniewska and Marek Sawerwain
15 Monte Carlo Simulation of Dynamic Systems on GPU's ......... 319
Jonathan Rogers
Part IV. Fast Fourier Transform and Localized n-Body
Problems
16 Fast Fourier Transform (FFT) on GPUs ....................... 339
Yash Ukidave, Gunar Schirner, and David Kaeli
17 A Highly Efficient FFT Using Shared-Memory Multiplexing .... 363
Yi Yang and Huiyang Zhou
18 Increasing Parallelism and Reducing Thread Contentions
in Mapping Localized N-Body Simulations to GPUs ............ 379
Bharat Sukhwani and Martin C. Herbordt
|