Ocs352-Iot Book-69-71
HBase is an example of a column-oriented data store which enables real-time read and
write access to very large tables distributed over the Hadoop Distributed File System
(HDFS). HBase is a database for Big Data. It supports random access to data and therefore
provides fast look-up in large tables with low access latency. HBase uses big hash tables.
HBase can be considered similar to Google’s BigTable.
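The look-up behaviour described above can be illustrated with a minimal sketch. It is a toy in-memory table built on Python dictionaries (hash tables), not the HBase API; the class name `ToyColumnStore`, the row keys and the `family:qualifier` column names are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict


class ToyColumnStore:
    """Illustrative in-memory column-family table (NOT the HBase API)."""

    def __init__(self):
        # row key -> {"family:qualifier": value}; both levels are hash tables
        self._rows = defaultdict(dict)

    def put(self, row_key, column, value):
        """Write one cell, addressed by row key and column name."""
        self._rows[row_key][column] = value

    def get(self, row_key, column=None):
        """Random-access read of one cell, or of the whole row if column is None."""
        row = self._rows.get(row_key, {})
        if column is None:
            return dict(row)
        return row.get(column)


# Hypothetical IoT sensor rows, used only for illustration
table = ToyColumnStore()
table.put("sensor-17", "reading:temp", 21.5)
table.put("sensor-17", "meta:location", "lab-2")
table.put("sensor-42", "reading:temp", 19.0)

print(table.get("sensor-17", "reading:temp"))
```

Because each read goes directly to the row key and then to the column, the cost of a look-up in this sketch does not grow with the number of rows, which is the property the text attributes to random access in large tables.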
Figure 5.4 shows an overview of a reference model for analytics architecture, with the
layers of the reference model on the right-hand side.
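[Figure 5.4: Overview of a reference model for analytics architecture. Labels recovered
from the figure: Organised Data Store Layer (Traditional DataStore/Data Warehouse, Event
Stream Processing, Complex Event Processing); Sources, Acquiring Data (IoT/M2M Data
Sources, Enterprise Data Sources, External Data Sources).]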
Analytical sandbox means the analytics tools and analytics environment for predictive
analytics on multistructured data. Mesos v0.9 is a resource management platform which
enables multiple frameworks to share a cluster of nodes and which is compatible with the
open analytics stack [data processing (Hive, Hadoop, HBase, Storm), data management
(HDFS)].
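The idea of several frameworks sharing one cluster of nodes can be sketched as follows. This is a simplified greedy first-fit assignment written for illustration only; it is not the Mesos API and not the allocation algorithm Mesos uses. The function name `share_cluster`, the node names and the CPU figures are assumptions.

```python
def share_cluster(nodes, requests):
    """Greedy first-fit placement of CPU requests on cluster nodes.

    nodes:    dict mapping node name -> free CPUs
    requests: list of (framework, cpus) tuples
    Returns a list of (framework, node, cpus) placements; a request that
    fits on no node is skipped.
    """
    free = dict(nodes)  # work on a copy so the caller's dict is untouched
    placements = []
    for framework, cpus in requests:
        for node, available in free.items():
            if available >= cpus:
                free[node] = available - cpus
                placements.append((framework, node, cpus))
                break
    return placements


# Hypothetical cluster and framework requests, for illustration only
nodes = {"node-a": 4, "node-b": 8}
requests = [("hadoop", 3), ("storm", 2), ("hive", 6)]

for framework, node, cpus in share_cluster(nodes, requests):
    print(f"{framework}: {cpus} CPUs on {node}")
```

In this sketch the three frameworks end up on the same two nodes instead of each needing a dedicated cluster, which is the benefit of a shared resource management layer.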
190 Internet of Things: Architecture and Design Principles
Berkeley Data Analytics Stack (BDAS) consists of data processing, data management
and resource management layers.
Applications such as AMP-Genomics and Carat run on BDAS. The data processing software
component provides in-memory processing, which processes the data efficiently across the
frameworks. AMP stands for Berkeley’s Algorithms, Machines and People Laboratory.
Data processing combines batch, streaming and interactive computations.
The resource management software component provides for sharing the infrastructure
across the frameworks.
Figure 5.5 shows an overview of the BDAS architecture, which is a reference model for
analytics architecture. On the right-hand side, Figure 5.5 also shows the file system, the
library of machine learning algorithms and the SQL-like scripting language software for
Big Data analytics in the Hadoop ecosystem.
[Figure 5.5: Overview of BDAS architecture as a reference model for analytics
architecture. Labels recovered from the figure: Business Intelligence Applications
(Services, Reporting, Data Visualisations, OLAP, Analytics and Advance Analytics
(Predictive/Prescriptive Analytics)); Organised Data Store Layer (Traditional
DataStore/Data Warehouse, Event Stream Processing, Complex Event Processing); Sources,
Acquiring Data (IoT/M2M Data Sources, Enterprise Data Sources, External Data Sources).
Right-hand side: Mahout (Distributed and Scalable Library of Machine Learning
Algorithms); HDFS (Hadoop File System) for Big Data.]