This course is designed for developers who create applications and analyze Big Data in Apache Hadoop on Windows using Pig and Hive.
Topics include: Hadoop, YARN, the Hadoop Distributed File System (HDFS), MapReduce, Sqoop and the HiveODBC Driver.
◾Describe Hadoop and Hadoop and YARN
◾Describe the Hadoop ecosystem
◾List Components & deployment options for HDP on Windows
◾Describe the HDFS architecture
◾Use the Hadoop client to input data into HDFS
◾Transfer data between Hadoop and Microsoft SQL Server
◾Describe the MapReduce and YARN architecture
◾Run a MapReduce job on YARN
◾Write a Pig script
◾Define advanced Pig relations
◾Use Pig to apply structure to unstructured Big Data
◾Invoke a Pig User-Defined Function
◾Use Pig to organize and analyze Big Data
◾Describe how Hive tables are defined and implemented
◾Use Hive windowing functions
◾Define and use Hive file formats
◾Create Hive tables that use the ORC file format
◾Use Hive to run SQL-like queries to perform data analysis
◾Use Hive to join datasets
◾Create ngrams and context ngrams using Hive
◾Perform data analytics
◾Use HCatalog with Pig and Hive
◾Install and configure HiveODBC Driver for Windows
◾Import data from Hadoop into Microsoft Excel
◾Define a workflow using Oozie
◾Start HDP on Windows
◾Add/remove files and folders from HDFS
◾Transfer data between HDFS and Microsoft SQL Server
◾Run a MapReduce job
◾Using Pig to analyze data
◾Retrieve HCatalog schemas from within a Pig script
◾Using Hive tables and queries
◾Advanced Hive features like windowing, views and ORC files
◾Hive analytics functions using the Pig DataFu library
◾Use Hive to compute ngrams on Avro-formatted files
◾Connect Microsoft Excel to Hadoop with HiveODBC Driver
◾Run a YARN application
◾Define an Oozie workflow
Software developers who need to understand and develop applications for Hadoop 2.x on Windows.
Students should be familiar with programming principles and have experience in software development. SQL knowledge and familiarity with Microsoft Windows is also helpful. No prior Hadoop knowledge is required.
This course is designed for developers who need to create applications to analyz...View course details
This advanced course provides Java programmers a deep-dive into Hadoop applicati...View course details
This one-day course provides a technical overview of Apache Hadoop for decision ...View course details
College Credit, CEUs, PDUs and CDUs
When you take courses with the Babbage Simmel, be sure you get the credit you deserve. Curriculum offered by Babbage Simmel can earn you college credit, CEUs, PDUs or CDUs.
Select curriculum offered by Babbage Simmel can earn you college credit. For questions please E-Mail: email@example.com or call 614-481-4345.
Continuing Education Units (CEUs)
Continuing Education Units (CEUs) are nationally recognized standard units of measurement earned for satisfactory completion of qualified programs of continuing education. If you need more information about CEUs, please E-Mail: firstname.lastname@example.org or call 614-481-4345.
Professional Development Units (PDUs)
Professional Development Units (PDUs) can be issued by PMI® for formal learning activities related to project management. Project Management Professionals (PMPs®) are required to earn a minimum of 60 PDUs every 3 years to maintain certification. For more information about this program go to the PMI® web site or call 1-855 746 4849.
Continuing Development Units (CDUs)
CDUs may be earned by attending professional development (e.g. courses, seminars) offered by organizations endorsed by IIBA® and designated as an EEP vendor. As an IIBA Endorsed Education Provider (EEP) Babbage Simmel's IIBA® endorsed courses qualify for CDU credit. For more information about CDUs go the IIBA® web site or call 1-647-426-3735.
Our babsimLIVE distance learning brings the classroom learning experience to you by seating you virtually into a real-life instructor-led classroom taught by award winning world-class instructors with other IT professionals like yourself. From the comfort of your home, workplace, or at the Babbage Simmel Columbus Campus, you acquire the training you need, when you want it, in the environment that is most comfortable for you to be successful.