Thank you for sending your enquiry! One of our team member will contact you shortly.
Thank you for sending your booking! One of our team member will contact you shortly.
Course Outline
Introduction to Programming Big Data with R (bpdR)
- Setting up your environment to use pbdR
- Scope and tools available in pbdR
- Packages commonly used with Big Data alongside pbdR
Message Passing Interface (MPI)
- Using pbdR MPI 5
- Parallel processing
- Point-to-point communication
- Send Matrices
- Summing Matrices
- Collective communication
- Summing Matrices with Reduce
- Scatter / Gather
- Other MPI communications
Distributed Matrices
- Creating a distributed diagonal matrix
- SVD of a distributed matrix
- Building a distributed matrix in parallel
Statistics Applications
- Monte Carlo Integration
- Reading Datasets
- Reading on all processes
- Broadcasting from one process
- Reading partitioned data
- Distributed Regression
- Distributed Bootstrap
Testimonials
The subject matter and the pace were perfect.
Tim - Ottawa Research and Development Center, Science Technology Branch, Agriculture and Agri-Food Canada
The tutor, Mr. Michael Yan, interacted with the audience very well, the instruction was clear. The tutor also go extent to add more information based on the requests from the students during the training.
Ottawa Research and Development Center, Science Technology Branch, Agriculture and Agri-Food Canada
the introduction of new packages
Ottawa Research and Development Center, Science Technology Branch, Agriculture and Agri-Food Canada
Michael the trainer is very knowledgeable and skillful about the subject of Big Data and R. He is very flexible and quickly customize the training to meet clients' need. He is also very capable to solve technical and subject matter problems on the go. Fantastic and professional training!
Xiaoyuan Geng - Ottawa Research and Development Center, Science Technology Branch, Agriculture and Agri-Food Canada
Related Courses
Apache Airflow
21 hours
Apache Hama
14 hours
Apache Accumulo Fundamentals
21 hours
Apache Drill
21 hours
Apache Drill Query Optimization
7 hours