In this Hadoop Intermediate course, you’ll learn about some advanced features and applications used in Hadoop, including YARN, Tez and Spark, Flume and NiFi, MapReduce, HBase, and how to create an HBase App.
Hadoop has made some major changes over the years. One of the more significant changes is Yet Another Resource Negotiator (YARN). You’ll learn why YARN is an important technology upgrade and how it works within the Hadoop ecosystem. We’ll look at the services that comprise the YARN framework within Hadoop and how they work together to provide a cohesive environment for applications.
We’ll introduce you to newcomers Tez (which improves efficiency for certain kinds of jobs over MapReduce) and Spark, a tool that expands Hadoop development capabilities by adding a number of new languages such as Python. We will look at how Spark applications are generally easier to build than traditional MapReduce applications.
We’re going to show you how to install a Hortonworks Data Flow (HDF) library and get Nifi up and running in the Hadoop Sandbox. Once installed, your system will be ready to create sophisticated Nifi workflows to do ETL.
You will learn about the MapReduce Combiner, why you would use it and how you would use it. You will gain an understanding of how the combiner fits within a typical MapReduce application and how to implement it within Hadoop.
This course will end with an overview on HBase and how to create an HBase App.