Get in Touch

Course Outline

Introduction to Programming Big Data with R (bpdR)

  • Configuring your environment for pbdR
  • Overview and capabilities of pbdR
  • Commonly used packages alongside pbdR for Big Data

Message Passing Interface (MPI)

  • Utilising pbdR MPI 5
  • Parallel processing
  • Point-to-point communication
  • Transmitting matrices
  • Summing matrices
  • Collective communication
  • Summing matrices using Reduce
  • Scatter / Gather operations
  • Additional MPI communication methods

Distributed Matrices

  • Creating a distributed diagonal matrix
  • Performing SVD on a distributed matrix
  • Constructing a distributed matrix in parallel

Statistical Applications

  • Monte Carlo integration
  • Reading datasets
  • Reading data across all processes
  • Broadcasting from a single process
  • Reading partitioned data
  • Distributed regression
  • Distributed bootstrap
 21 Hours

Number of participants


Price per participant

Testimonials (2)

Provisional Upcoming Courses (Require 5+ participants)

Related Categories