Email the Author

You can use this page to email GitforGits | Asian Publishing House about Practical GPU Programming.

Please include an email address so the author can respond to your query

This message will be sent to GitforGits | Asian Publishing House

This site is protected by reCAPTCHA and the Google  Privacy Policy and  Terms of Service apply.

About the Book

If you're a Python pro looking to get the most out of your code with GPUs, then Practical GPU Programming is the right book for you. This book will walk you through the basics of GPU architectures, show you hands-on parallel programming techniques, and give you the know-how to confidently speed up real workloads in data processing, analytics, and engineering.

The first thing you'll do is set up the environment, install CUDA, and get a handle on using Python libraries like PyCUDA and CuPy. You'll then dive into memory management, kernel execution, and parallel patterns like reductions and histogram computations. Then, we'll dive into sorting and search techniques, but with a focus on how GPU acceleration transforms business data processing. We'll also put a strong emphasis on linear algebra to show you how to supercharge classic vector and matrix operations with cuBLAS and CuPy. Plus, with batched computations, efficient broadcasting, custom kernels, and mixed-library workflows, you can tackle both standard and advanced problems with ease.

Throughout, we evaluate numerical accuracy and performance side by side, so you can understand both the strengths and limitations of GPU-based solutions. The book covers nearly every essential skill and modern toolkit for practical GPU programming, but it's not going to turn you into a master overnight.

Key Learnings

Boost processing speed and efficiency for data-intensive tasks. Use CuPy and PyCUDA to write and execute custom CUDA kernels. Maximize GPU occupancy and throughput efficiency by using optimal thread block and grid configuration. Reduce global memory bottlenecks in kernels by using shared memory and coalesced access patterns. Perform dynamic kernel compilation to ensure tailored performance. Use CuPy to carry out custom, high-speed elementwise GPU operations and expressions. Implement bitonic and radix sort algorithms for large or batch integer datasets. Execute parallel linear search kernels to detect patterns rapidly. Scale matrix operations using Batched GEMM and high-level cuBLAS routines.

Table of Content

Introduction to GPU Fundamentals Setting up GPU Programming Environment Basic Data Transfers and Memory Types Simple Parallel Patterns Introduction to Kernel Optimization Working with PyCUDA and CuPy Features Practical Sorting and Search Linear Algebra Essentials on GPU


About the Author

GitforGits | Asian Publishing House’s avatar GitforGits | Asian Publishing House

@GitforGits

GitforGits is an Asian publishing house where knowledgeable experts and open-source contributors collaborate to disseminate new ideas and innovations. We plan to provide niche, original, and useful content; we are a self-funded, independent publisher. We have books spanning the fields of computer science, cybersecurity, cloud computing, devops, deep learning, hardware programming, networking, the Internet of Things, and any other area of technology to which we can satisfactorily contribute.

Logo white 96 67 2x

Publish Early, Publish Often

  • Path
  • There are many paths, but the one you're on right now on Leanpub is:
  • Pythongpuprogramming › Email Author › New
    • READERS
    • Newsletters
    • Weekly Sale
    • Monthly Sale
    • Store
    • Home
    • Redeem a Token
    • Search
    • Support
    • Leanpub FAQ
    • Leanpub Author FAQ
    • Search our Help Center
    • How to Contact Us
    • FRONTMATTER PODCAST
    • Featured Episode
    • Episode List
    • MEMBERSHIPS
    • Reader Memberships
    • Department Reader Memberships
    • Author Memberships
    • Your Membership
    • COMPANY
    • About
    • About Leanpub
    • Blog
    • Contact
    • Press
    • Essays
    • AI Services
    • Imagine a world...
    • Manifesto
    • More
    • Partner Program
    • Causes
    • Accessibility
    • AUTHORS
    • Write and Publish on Leanpub
    • Create a Book
    • Create a Bundle
    • Create a Course
    • Create a Track
    • Testimonials
    • Why Leanpub
    • Services
    • TranslateAI
    • TranslateWord
    • TranslateEPUB
    • PublishWord
    • Publish on Amazon
    • CourseAI
    • GlobalAuthor
    • Marketing Packages
    • IndexAI
    • Author Newsletter
    • The Leanpub Author Update
    • Author Support
    • Author Help Center
    • Leanpub Authors Forum
    • The Leanpub Manual
    • Supported Languages
    • The LFM Manual
    • Markua Manual
    • API Docs
    • Organizations
    • Learn More
    • Sign Up
    • LEGAL
    • Terms of Service
    • Copyright Policy
    • Privacy Policy
    • Refund Policy

*   *   *

Leanpub is copyright © 2010-2025 Ruboss Technology Corp.
All rights reserved.

This site is protected by reCAPTCHA
and the Google  Privacy Policy and  Terms of Service apply.

Leanpub requires cookies in order to provide you the best experience. Dismiss