Intro in Apache HBase
COURSE FEATURES
The scope of this course is to provide an understanding of Apache HBase architecture. At the end of this course the participants will be able to understand the components of Apache HBase (HDFS, Region Servers, HBase Master, Zookeeper) and how to model data for HBase.
This course is taught in Romanian, with course materials available in either English, or Romanian. The course can be customized - on request.
DAY
1
TOPICS
8
TOPICS
8
What is Hadoop, HDFS emergence and MapReduce evolution
Use cases of Hadoop and Hadoop IRL
Hadoop framework overview, detailed overview and applicability cases: HDFS & MapReduce essentials, YARN
Apache HDFS
NameNode, DataNode
Write & read path
NameNode Resiliency (quorum journal)
Known issues of HDFS
DAY
2
TOPICS
8
TOPICS
8
Concepts: map, reduce, shuffle, sort
Mappers, reducers, combiners, partitioners
Limitations of MapReduce architecture
Apache YARN:
Architecture before Hadoop 2.0 (MapReduce v1 – JobTracker, TaskTracker)
Hadoop 2.0 and MapReduce v2 – YARN introduction (Resource Manager, Node Manager, container, application master)
How YARN works in big data architectures
YARN capacity scheduler
DAY
3
TOPICS
7
TOPICS
7
Architecture review: Master (s), Region Server(s), HDFS, Zookeeper
What is an LSM tree: Write ahead log and Hfiles
How Hbase and HDFS work together
Write & Read path
Row key design aspects & data modeling
Regions Split
Tuning Hbase for performance: which are the configurations/parameters to look for
Contact Us
Feel free to leave us your thoughts so we can discover the solution together!
academy@esolutions.ro
Get in touch
0753.029.187
Our address
20 Constantin Budisteanu Street , 1 st. District, Bucharest
academy@esolutions.ro
Get in touch
0753.029.187
Our address
20 Constantin Budisteanu Street , 1 st. District, Bucharest