Introduction to MapReduce
Posted by EdmontonPM on May 19
Live Webinar – May 27th 2014, 12:00-1:00 PM EDT
Offered by ASPE (REP 2161) 1 Category A PDU – Free PDU
Note: Although ASPE is an REP, presentations may have to be recorded as a Category C PDU event. Contact Traci Lester, Marketing Specialist at ASPE, for more information.
MapReduce is a programming framework introduced by Google in the early 2000s. It targets problems that require processing huge datasets.
Rather than running one algorithm over the entire dataset at once, the MapReduce framework processes several chunks of the dataset in parallel during the map phase and combines the partial results during the reduce phase.
MapReduce can also take advantage of data locality, processing data on or near the storage assets that hold it so that less data has to be transferred.
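To make the map and reduce phases described above concrete, here is a minimal word-count sketch in plain Java. It does not use the Hadoop API and runs entirely in memory on one machine; the class name `WordCountSketch` and its methods `map`, `shuffle`, `reduce`, and `wordCount` are hypothetical names chosen for illustration, and the parallel stream only stands in for map tasks that a real cluster would spread across many nodes.

```java
import java.util.AbstractMap;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.stream.Collectors;

// Illustrative only: an in-memory imitation of the MapReduce flow, not Hadoop code.
class WordCountSketch {

    // Map phase: turn one chunk of text into (word, 1) pairs.
    static List<Map.Entry<String, Integer>> map(String chunk) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String word : chunk.toLowerCase().split("\\s+")) {
            if (!word.isEmpty()) {
                pairs.add(new AbstractMap.SimpleEntry<>(word, 1));
            }
        }
        return pairs;
    }

    // Shuffle and sort: group all values by key; TreeMap keeps the keys sorted.
    static Map<String, List<Integer>> shuffle(List<Map.Entry<String, Integer>> pairs) {
        Map<String, List<Integer>> grouped = new TreeMap<>();
        for (Map.Entry<String, Integer> pair : pairs) {
            grouped.computeIfAbsent(pair.getKey(), k -> new ArrayList<>()).add(pair.getValue());
        }
        return grouped;
    }

    // Reduce phase: combine all values for one key into a single result.
    static int reduce(String key, List<Integer> values) {
        int sum = 0;
        for (int value : values) {
            sum += value;
        }
        return sum;
    }

    // Driver: map each chunk in parallel, then shuffle, then reduce per key.
    static Map<String, Integer> wordCount(List<String> chunks) {
        List<Map.Entry<String, Integer>> pairs = chunks.parallelStream()
                .flatMap(chunk -> map(chunk).stream())
                .collect(Collectors.toList());

        Map<String, Integer> result = new TreeMap<>();
        for (Map.Entry<String, List<Integer>> group : shuffle(pairs).entrySet()) {
            result.put(group.getKey(), reduce(group.getKey(), group.getValue()));
        }
        return result;
    }

    public static void main(String[] args) {
        List<String> chunks = Arrays.asList("the quick brown fox", "the lazy dog", "the fox");
        // prints {brown=1, dog=1, fox=2, lazy=1, quick=1, the=3}
        System.out.println(wordCount(chunks));
    }
}
```

Each chunk is mapped independently of the others, which is what allows the map phase to run in parallel; only the grouped intermediate pairs need to be brought together for the reduce phase.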
In this session Sriram Mohan (LinkedIn profile) will lead you through the following:
- Introduction to the MapReduce paradigm in Hadoop
- Code-walkthrough in Java
- How does MapReduce work in Hadoop? This part explains the internal workings of MapReduce (shuffle and sort), job scheduling, and failure handling in classic MapReduce.
Here are some other great resources on MapReduce:
- Amazon Elastic MapReduce Developer Guide
- An Overview of MapReduce and Its Impact on Distributed Data Processing
- MapReduce Design Patterns: Building Effective Algorithms and Analytics for Hadoop and Other Systems
- Instant MapReduce Patterns: Hadoop Essentials How-to
PDU Category C (PMBOK 5) documentation details:
Process Groups: Planning
Knowledge Areas: 9 – Human Resources
- 9.3 Develop Project Team
As this is a Category C ‘Self Directed Learning Activity’, remember to document your learning experience and its relationship to project management for your ‘PDU Audit Trail Folder’.
Click to register for Introduction to MapReduce