HBase Introduction & Essentials - 2 Day Bootcamp

Apache HBase is a distributed, scalable, NoSQL database for big data built on Hadoop. HBase can store data in massive tables consisting of billions of rows and millions of columns, serve data to many users in near real time, and provide fast, random read/write access to applications.

Through instructor-led discussion and interactive, hands-on exercises, participants will navigate the Hadoop ecosystem, learning topics such as:

  • When to use HBase, Hadoop, or an RDBMS, and the typical use cases for each

  • Using the HBase shell to directly manipulate HBase tables

  • Designing optimal HBase schemas for efficient data storage and retrieval

  • Connecting to HBase using the Java API to insert and retrieve data in real time

  • Best practices for identifying and resolving performance bottlenecks

Outline:

1. Introduction to Hadoop and HBase; HBase Tables

2. Using the HBase Shell; Hands On: HBase Shell

3. HBase Architecture Fundamentals; Basic Data Access with the HBase API

4. Hands On: DDL & DML on HBase

5. Advanced HBase API Features; HBase on the Cluster; HBase Reads and Writes

6. HBase Performance Tuning

7. HBase Administration and Cluster Management; HBase Replication and Backup

8. Hands On: Admin and Backup of HBase

9. Use Cases: Hadoop, Hive, and Impala with HBase