HDP Developer: Windows - Virtual

Displaying courses for Great Britain [Change]

Overview

This course is designed for developers who create applications and analyze Big Data in Apache Hadoop on Windows using Pig and Hive. Topics include: Hadoop, YARN, the Hadoop Distributed File System (HDFS), MapReduce, Sqoop and the HiveODBC Driver.

Audience

Software developers who need to understand and develop applications for Hadoop 2.x on Windows.

Prerequisites

  • Delegates should be familiar with programming principles and have experience in software development.
  • SQL knowledge and familiarity with Microsoft Windows is also helpful.
  • No prior Hadoop knowledge is required.

Syllabus

  • Describe Hadoop and Hadoop and YARN
  • Describe the Hadoop ecosystem
  • List Components & deployment options for HDP on Windows
  • Describe the HDFS architecture
  • Use the Hadoop client to input data into HDFS
  • Transfer data between Hadoop and Microsoft SQL Server
  • Describe the MapReduce and YARN architecture
  • Run a MapReduce job on YARN
  • Write a Pig script
  • Define advanced Pig relations
  • Use Pig to apply structure to unstructured Big Data
  • Invoke a Pig User-Defined Function
  • Use Pig to organize and analyze Big Data
  • Describe how Hive tables are defined and implemented
  • Use Hive windowing functions
  • Define and use Hive file formats
  • Create Hive tables that use the ORC file format
  • Use Hive to run SQL-like queries to perform data analysis
  • Use Hive to join datasets
  • Create ngrams and context ngrams using Hive
  • Perform data analytics
  • Use HCatalog with Pig and Hive
  • Install and configure HiveODBC Driver for Windows
  • Import data from Hadoop into Microsoft Excel
  • Define a workflow using Oozie

Hands-On Labs

  • Start HDP on Windows
  • Add/remove files and folders from HDFS
  • Transfer data between HDFS and Microsoft SQL Server
  • Run a MapReduce job
  • Using Pig to analyze data
  • Retrieve HCatalog schemas from within a Pig script
  • Using Hive tables and queries
  • Advanced Hive features like windowing, views and ORC files
  • Hive analytics functions using the Pig DataFu library
  • Compute quantiles
  • Use Hive to compute ngrams on Avro-formatted files
  • Connect Microsoft Excel to Hadoop with HiveODBC Driver
  • Run a YARN application
  • Define an Oozie workflow

Training provider

Teaching mode: Classroom - Instructor Led
Duration: 4 days
Gooroo has partnered with the global leaders in IT training to give you access to quality training, personalised to you, targeted at increasing your job opportunities and salary.

Our pricing

We do not display pricing as Gooroo members qualify for special discounts not available elsewhere. You must enquire through Gooroo to get this benefit.

New courses are happening all the time

Our partner's expert training consultant will provide you with the times and all the details you need. Enquire today.

Top skills covered in this course

Data analysis
Great Britain
This skill has an average salary of
£44,325
and is mentioned in
2.67%
of job ads in this area.
Analytics
Great Britain
This skill has an average salary of
£52,586
and is mentioned in
4.17%
of job ads in this area.
Apache Hadoop
Great Britain
This skill has an average salary of
£66,527
and is mentioned in
0.32%
of job ads in this area.
Microsoft SQL Server
Great Britain
This skill has an average salary of
£49,551
and is mentioned in
1.95%
of job ads in this area.