This advanced course provides Java programmers a deep-dive into Hadoop application development. Students will learn how to design and develop efficient and effective MapReduce applications for Hadoop using the Hortonworks Data Platform, including how to implement combiners, partitioners, secondary sorts, custom input and output formats, joining large datasets, unit testing, and developing UDFs for Pig and Hive. Labs are run on a 7-node HDP 2.1 cluster running in a virtual machine that students can keep for use after the training.
Describe Hadoop 2 and the Hadoop Distributed File System
Describe the YARN framework
Develop and run a Java MapReduce application on YARN
Use combiners and in-map aggregation
Write a custom partitioner to avoid data skew on reducers
Perform a secondary sort
Recognize use cases for built-in input and output formats
Write a custom MapReduce input and output format
Optimize a MapReduce job
Configure MapReduce to optimize mappers and reducers
Develop a custom RawComparator class
Distribute files as LocalResources
Describe and perform join techniques in Hadoop
Perform unit tests using the UnitMR API
Describe the basic architecture of HBase
Write an HBase MapReduce application
List use cases for Pig and Hive
Write a simple Pig script to explore and transform big data
Write a Pig UDF (User-Defined Function) in Java
Write a Hive UDF in Java
Use JobControl class to create a MapReduce workflow
College Credit, CEUs, PDUs and CDUs When you take courses with the Babbage Simmel, be sure you get the credit you deserve. Curriculum offered by Babbage Simmel can earn you college credit, CEUs, PDUs or CDUs.
College Credit Select curriculum offered by Babbage Simmel can earn you college credit. For questions please E-Mail: firstname.lastname@example.org or call 614-481-4345.
Continuing Education Units (CEUs) Continuing Education Units (CEUs) are nationally recognized standard units of measurement earned for satisfactory completion of qualified programs of continuing education. If you need more information about CEUs, please E-Mail: email@example.com or call 614-481-4345.
Professional Development Units (PDUs) Professional Development Units (PDUs) can be issued by PMI® for formal learning activities related to project management. Project Management Professionals (PMPs®) are required to earn a minimum of 60 PDUs every 3 years to maintain certification. For more information about this program go to the PMI® web site or call 1-855 746 4849.
Continuing Development Units (CDUs) CDUs may be earned by attending professional development (e.g. courses, seminars) offered by organizations endorsed by IIBA® and designated as an EEP vendor. As an IIBA Endorsed Education Provider (EEP) Babbage Simmel's IIBA® endorsed courses qualify for CDU credit. For more information about CDUs go the IIBA® web site or call 1-647-426-3735.
Our babsimLIVE distance learning brings the classroom learning experience to you by seating you virtually into a real-life instructor-led classroom taught by award winning world-class instructors with other IT professionals like yourself. From the comfort of your home, workplace, or at the Babbage Simmel Columbus Campus, you acquire the training you need, when you want it, in the environment that is most comfortable for you to be successful.