Learn your way! Get started

Hadoop, Part 4: HBase and MapReduce

with expert Kevin McCarty


Watch trailer

 Hadoop, Part 4: HBase and MapReduce Trailer

Course at a glance

Included in these subscriptions:

  • Dev & IT Pro Video
  • Dev & IT Pro Power Pack
  • Power Pack Plus

Release date Release date 10/27/2016
Level Level Intermediate
Runtime Runtime 2h 19m
Platform Platform Major browsers on Windows Major browsers on Windows Major browsers on Mac OSX Major browsers on Mac OSX Mobile Devices Mobile Devices
Closed captioning Closed captioning N/A
Transcript Transcript N/A
eBooks / courseware eBooks / courseware N/A
Hands-on labs Hands-on labs N/A
Sample code Sample code Included
Exams Exams Included


Enterprise Solutions
Enterprise Solutions

Need reporting, custom learning tracks, or SCORM? Learn More



Course description

Big Data development is a growing field and understanding how technologies such as the MapReduce Combiner and HBase can make data analysis easier is important. Learn how to use the Combiner, look deeply at HBase, a NoSQL tool for managing the storage and retrieval of large and diverse data sets, and create a starter app to use HBase in Java.

Prerequisites

You should have some programming background and some familiarity with a Unix-based operating system. No specific experience with Java programming language or Hadoop is required. As with any such course, the more experience you bring to the course, the more you’ll get out of it. This course moves quickly through a broad range of topics, but it does not require any prior experience with Hadoop. The course does assume that you are well familiarized with how to use the version of Windows that you are running. For example, the course might say simply “Open PuTTY” without explaining how to do that. You should also be able to navigate the folder hierarchy using Windows Explorer.

Learning Paths

This course is part of the following LearnNowOnline SuccessPaths™:
Hadoop

Meet the expert

Kevin McCarty Kevin McCarty is a computer professional with over 30 years of experience in the industry as a programmer, project manager, database administrator, architect, and data scientist. He is a Microsoft Certified Trainer with over 25 individual certifications in programming and database technologies and serves as the chapter leader of the Boise SQL Server Users Group. A former Army officer and Eagle Scout, he holds a doctorate in Computer Science and a lifelong love of learning.


Course outline



MapReduce and HBase

MapReduce Combiner (07:22)
  • Introduction (00:41)
  • MapReduce Components (00:52)
  • Combiner (00:35)
  • Why Use a Combiner? (01:41)
  • How the Combiner Works (00:51)
  • Combiner, Reducer Differences (01:56)
  • Summary (00:44)
MapReduce Combiner Demo (19:17)
  • Introduction (00:30)
  • Demo: Mapper Stub (04:20)
  • Demo: Reducer & Combiner Stubs (01:59)
  • Demo: Main (01:14)
  • Demo: Fill in Methods (04:28)
  • Demo: Run WordCountCombiner (05:56)
  • Summary (00:47)
HBase Basics (12:37)
  • Introduction (00:42)
  • Relational vs. Non-Relational Databases (02:45)
  • What Is HBase? (01:29)
  • What Does HBase Do? (04:32)
  • Other HBase Features (01:00)
  • The HBase Shell (01:10)
  • Summary (00:55)
HBase Shell Intro (18:03)
  • Introduction (00:38)
  • Demo: The HBase Shell (03:09)
  • Demo: Ambari (03:18)
  • Demo: Build a Table (03:58)
  • Demo: Add Data (06:13)
  • Summary (00:43)
HBase Lifecycle Architecture (12:03)
  • Introduction (00:42)
  • HBase Architecture (01:12)
  • HBase Scalability (02:07)
  • HBase Durability (00:52)
  • HBase Consistency (00:44)
  • HBase Input/Output (00:55)
  • Working with Data (00:53)
  • Partitioning a Table (00:57)
  • Serving Regions (00:50)
  • Partitioning Advantage (00:44)
  • Region Server Failover/Load Balancing (01:24)
  • Summary (00:39)
HBase Lifecycle Services (18:06)
  • Introduction (00:46)
  • The HBase Write Path (00:23)
  • HBase MemStore (00:44)
  • HBase Write-Ahead Log (03:12)
  • HBase on a Single Server (00:40)
  • Scaling HBase (00:44)
  • The Flush Operation (02:15)
  • HBase Cache (01:31)
  • Read Optimization (00:36)
  • Minimizing Disk Seeks (01:29)
  • HBase Compaction (00:50)
  • HBase Load Balancing (00:48)
  • HBase Split Operation (00:46)
  • HBase Scalability (00:36)
  • HBase Limitations (01:42)
  • Summary (00:57)

Creating an HBase App

Creating an Hbase App (28:45)
  • Introduction (00:33)
  • HBase Archetypes (01:07)
  • What Is a Good HBase Archetype? (01:23)
  • Effective Design (00:31)
  • Effective Design of Queries (03:11)
  • Row-Key Design (01:52)
  • Effective Application Design (02:00)
  • Entities in HBase (00:56)
  • Working with Simple Entities (01:41)
  • Simple Entity Guidelines (01:33)
  • Working with Linked Entities (01:21)
  • Creating a Linked Entity (01:28)
  • Linked Entity Guidelines (04:31)
  • Compound Key (00:48)
  • Compound Time Series (00:52)
  • HBase Techniques for Historical Data (02:53)
  • Considerations for Timestamps (01:03)
  • Summary (00:54)
Creating an Hbase App Demo (23:20)
  • Introduction (00:44)
  • Demo: Setup (04:06)
  • Demo: HBase Methods (03:52)
  • Demo: Flesh Out Methods (05:19)
  • Demo: Main Method (01:05)
  • Demo: HBase Environment (03:33)
  • Demo: Run HBase App (04:05)
  • Summary (00:34)