Course Outline

Introduction to Programming Big Data with R (pbdR)

  • Configuring your environment for pbdR
  • Overview and tools available in pbdR
  • Common packages used with Big Data in conjunction with pbdR

Message Passing Interface (MPI)

  • Utilizing pbdR MPI
  • Parallel processing techniques
  • Point-to-point communication
  • Sending matrices
  • Summing matrices
  • Collective communication
  • Summing matrices using Reduce
  • Scatter and Gather operations
  • Additional MPI communication methods

Distributed Matrices

  • Constructing a distributed diagonal matrix
  • Performing Singular Value Decomposition (SVD) on a distributed matrix
  • Building a distributed matrix in parallel

Statistics Applications

  • Monte Carlo Integration
  • Loading datasets
  • Reading data across all processes
  • Broadcasting data from a single process
  • Accessing partitioned data
  • Distributed Regression
  • Distributed Bootstrap

Duration: 21 Hours
