|
|
|||
|
||||
OverviewYou've mastered the architecture-now it's time to own the performance. Every GPU developer hits the same wall: the profiler says you're close to peak, but you know there's still headroom. What's missing isn't another compiler flag-it's visibility into the hardware's final truth. That truth lives in SASS, the real machine code running on NVIDIA GPUs. Mastering PTX and SASS - Volume II takes you past theory into the territory where nanoseconds matter. Here you'll learn how to read, analyze, and tune instruction streams with surgical precision. You'll uncover how schedulers pair ops, how register pressure throttles throughput, and how to turn your kernels into clock-cycle-balanced engines of pure efficiency. This book is for engineers who refuse to settle for ""good enough."" It turns profiling, disassembly, and optimization into a repeatable process-one grounded in data, not superstition. From tensor cores to warp shuffles, from atomic operations to multi-GPU scaling, you'll learn how real experts bend hardware to their will. Volume I built the foundation; Volume II shows you how to weaponize it. If you're ready to squeeze every drop of performance from your GPU-and understand exactly how you did it-this is the manual you've been waiting for. Full Product DetailsAuthor: Gareth ThomasPublisher: Independently Published Imprint: Independently Published Dimensions: Width: 21.60cm , Height: 2.60cm , Length: 27.90cm Weight: 1.188kg ISBN: 9798270050214Pages: 518 Publication Date: 15 October 2025 Audience: General/trade , General Format: Paperback Publisher's Status: Active Availability: Available To Order We have confirmation that this item is in stock with the supplier. It will be ordered in for you and dispatched immediately. Table of ContentsReviewsAuthor InformationTab Content 6Author Website:Countries AvailableAll regions |
||||