GPU Programming - OpenCL vs CUDA vs ROCm Training Course

GPU programming uses the parallel processing capabilities of graphics processing units to accelerate compute-intensive applications such as artificial intelligence, gaming, computer graphics, and scientific simulation. Several frameworks support GPU programming, each with its own advantages and limitations. OpenCL is an open standard for programming CPUs, GPUs, and other devices from multiple vendors, whereas CUDA is specific to NVIDIA GPUs. ROCm is a platform for GPU programming on AMD hardware; it also supports OpenCL and, through its HIP layer, the porting of CUDA code.

This instructor-led, live training session (available online or onsite) is aimed at beginner to intermediate developers who wish to use different GPU programming frameworks and compare their features, performance, and compatibility.

Upon completion of this training, participants will be able to:

Set up a development environment comprising the OpenCL SDK, CUDA Toolkit, ROCm platform, Visual Studio Code, and a device that supports OpenCL, CUDA, or ROCm.
Develop a basic GPU program that performs vector addition in OpenCL, CUDA, and ROCm, comparing the syntax, structure, and execution of each framework.
Use each framework's API to query device information, allocate and deallocate device memory, transfer data between host and device, launch kernels, and synchronise threads.
Use each framework's kernel language to write kernels that execute on the device and manipulate data.
Use each framework's built-in functions, variables, and libraries to carry out common tasks and operations.
Use each framework's memory spaces, such as global, local, constant, and private, to optimise data transfers and memory accesses.
Use each framework's execution model to control the threads, blocks, and grids that define parallelism.
Debug and test GPU programs using tools such as CodeXL, CUDA-GDB, CUDA-MEMCHECK, and NVIDIA Nsight.
Optimise GPU programs using techniques such as coalescing, caching, prefetching, and profiling.
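
To make the vector-addition and execution-model outcomes above more concrete, here is a minimal CPU-only C++ sketch of how a grid of blocks and threads can map onto array elements. It is not course material and does not call the OpenCL, CUDA, or ROCm APIs. The names ThreadCoord, global_index, blocks_for, vector_add_kernel, and vector_add are hypothetical and introduced only for illustration; the fields of ThreadCoord stand in for CUDA's blockIdx.x, blockDim.x, and threadIdx.x, and the default block size of 256 is an arbitrary assumption.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for the per-thread coordinates that a GPU runtime
// provides (blockIdx.x, blockDim.x, threadIdx.x in CUDA terms).
struct ThreadCoord {
    std::size_t block_idx;   // which block this thread belongs to
    std::size_t block_dim;   // number of threads per block
    std::size_t thread_idx;  // position of the thread inside its block
};

// Flat element index: block_idx * block_dim + thread_idx.
inline std::size_t global_index(const ThreadCoord& t) {
    return t.block_idx * t.block_dim + t.thread_idx;
}

// Number of blocks needed to cover n elements (ceiling division).
inline std::size_t blocks_for(std::size_t n, std::size_t block_dim) {
    assert(block_dim > 0);
    return (n + block_dim - 1) / block_dim;
}

// "Kernel" body: each logical thread handles at most one element.
// The bounds check matters because the last block may be only partly used.
inline void vector_add_kernel(const ThreadCoord& t,
                              const std::vector<float>& a,
                              const std::vector<float>& b,
                              std::vector<float>& c) {
    const std::size_t i = global_index(t);
    if (i < c.size()) {
        c[i] = a[i] + b[i];
    }
}

// Serial emulation of a kernel launch: loop over every block and every
// thread in it. On a real GPU these iterations would run in parallel.
inline std::vector<float> vector_add(const std::vector<float>& a,
                                     const std::vector<float>& b,
                                     std::size_t block_dim = 256) {
    assert(a.size() == b.size());
    std::vector<float> c(a.size());
    const std::size_t grid_dim = blocks_for(a.size(), block_dim);
    for (std::size_t block = 0; block < grid_dim; ++block) {
        for (std::size_t thread = 0; thread < block_dim; ++thread) {
            vector_add_kernel(ThreadCoord{block, block_dim, thread}, a, b, c);
        }
    }
    return c;
}
```

Calling vector_add(a, b, 4) on two five-element vectors emulates a launch of two blocks of four threads, in which the last three threads of the second block fail the bounds check and do nothing. A real kernel in any of the three frameworks would additionally need device memory allocation, host-to-device transfers, and a launch through that framework's own API.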

Course Format

Interactive lectures and discussions.
Extensive exercises and practical application.
Hands-on implementation within a live-lab environment.

Course Customisation Options

To request bespoke training for this course, please contact us.

28 hours

200 Mary Street, Brisbane

10000 AUD (Online)

23720 AUD (Classroom)

Provisional upcoming courses require 5 or more participants.

Related Courses

Developing AI Applications with Huawei Ascend and CANN

Deploying AI Models with CANN and Ascend AI Processors

AI Inference and Deployment with CloudMatrix

GPU Programming on Biren AI Accelerators

Cambricon MLU Development with BANGPy and Neuware

Introduction to CANN for AI Framework Developers

CANN for Edge AI Deployment

Understanding Huawei’s AI Compute Stack: From CANN to MindSpore

Optimizing Neural Network Performance with CANN SDK

CANN SDK for Computer Vision and NLP Pipelines

Building Custom AI Operators with CANN TIK and TVM

Migrating CUDA Applications to Chinese GPU Architectures

Performance Optimization on Ascend, Biren, and Cambricon

Related Categories

GPU
