0% found this document useful (0 votes)

18 views3 pages

Ocs352-Iot Book-69-71

Ocs

Uploaded by

gopaldhanu608

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

18 views3 pages

Ocs352-Iot Book-69-71

Ocs

Uploaded by

gopaldhanu608

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

188 Internet of Things: Architecture and Design Principles

5.5.6 Big Data Analytics

Big data is multistructured data while RDMS maintain more structured data. The open
source software Hadoop and MapReduce are from Apache Software. They enable storage
and analyse the massive amounts of data. Hadoop File System (HDFS), Mahout, a library
of machine learning algorithms and HiveQ, a SQL like scripting language software are
used for Big data analytics in the Hadoop ecosystem. MapReduce is a programming model
and a core of Hadoop. Large data sets process onto a cluster of nodes using MapReduce.
Same node runs the algorithm using the data sets at HDFS and processing is at that node
itself.
Hadoop is an open-source framework. The framework stores and processes big data.
The clusters of computing nodes process that data using simple programming models.
Processing takes place in a distributed environment. The framework scales up from single
server to thousands of processing machines and servers, each offering environment of local
storage and processing. Hadoop accesses data in sequential manner and performs batch
processing. A new data set results from input data set that also processes sequentially.
Data Acquiring, Organising, Processing and Analytics 189

HBase is an example of columnar format data storage which enables read or write
access in real time for very large tables distributed in Hadoop File System (HDFS). HBase
is database for big data. Data access is random access. Therefore, it provides fast look-up
from large tables and access latency is small. HBase uses big hash tables. HBase can be
considered similar to Google’s BigTable.

5.5.7 Data Analytics Architecture and Stack

Analytics architecture consists of the following layers:
● Data sources layer

● Data storage and processing layer

● Data access and query processing layer

● Data services, reporting and advanced analytics layer

Figure 5.4 shows an overview of a reference model for analytics architecture. Figure 5.4
also shows on the right-hand side the layers in the reference model.

Services, Reporting, Data Visualisations, OLAP, Analytics

Advance Analytics (Predictive/Prescriptive Analytics) Applications

Data Access, SQL, Query Processing, OLTP, ETL, Analytics

R-Descriptive Statistics, In-Memory or On-Store Applications
Database Processing, MapReduce and Others Support
Applications Support Layer

Organised
Traditional DataStore/Data Warehouse
Data Store
Event Stream Processing
Layer
Complex Event Processing

Sources
IoT/M2M Data Sources Acquiring
Enterprise Data Sources Data
External Data Sources

Figure 5.4 Analytics Architecture Reference Model

Analytical sandbox means analytics tools and analytics environment for predictive
analytics on multistructured data. Mesos v0.9 is a resources management platform which
enable multiple frameworks sharing of cluster of nodes and which is compatible with
open analytics stack [data processing (Hive, Hadoop, HBase, Storm), data management
(HDFS)].
190 Internet of Things: Architecture and Design Principles

Berkeley Data Analytics Stack (BDAS) consists of data processing, data management
and resource management layers.
Applications, AMP-Genomics and Carat run at the BDAS. Data processing software
component provides in-memory processing which processes the data efficiently across the
frameworks. AMP stands for Berkeley’s Algorithms, Machines and Peoples Laboratory.
Data processing combines batch, streaming and interactive computations.
Resource management software component provides for sharing the infrastructure
across the frameworks.
Figure 5.5 shows an overview of BDAS architecture which is a reference model for
analytics architecture. Figure 5.5 also shows on right-hand side the file system, library of
machine learning algorithms and SQL like scripting language software for the Big data
analytics in Hadoop ecosystem.

Mahout
Business Distributed and
Services, Reporting, Data Visualisations, OLAP, Analytics and Scalable Library of
Advance Analytics (Predictive/Prescriptive Analytics) Intelligence Machine Learning
Applications Algorithms

Data Access, SQL, Query Processing, OLTP, ETL, Analytics HiveQL

R-Descriptive Statistics, In-Memory or On-Store Applications (SQL like
Database Processing, MapReduce and Others Support Scripting
Language)
Applications Support Layer

Organised
Traditional DataStore/Data Warehouse HDFS
Data Store
Event Stream Processing (Hadoop File
Layer System) for
Complex Event Processing
Sources Big Data
Acquiring Data
IoT/M2M Data Sources
Enterprise Data Sources
External Data Sources

Figure 5.5 Berkeley data analytics stack architecture

Reconfirm Your Understanding

● Organised data in database or data store is used for analytics, new facts and decision taking on
those facts. Analytics has three phases before deriving new facts and provide business intelligence—
descriptive, predictive and prescriptive analytics.

Big Data & Hadoop Training Material 0 1 PDF
50% (2)
Big Data & Hadoop Training Material 0 1 PDF
168 pages
Touchable Pro Manual
No ratings yet
Touchable Pro Manual
41 pages
big data unit 1
No ratings yet
big data unit 1
24 pages
Data Science
No ratings yet
Data Science
87 pages
Big Data Analytics
From Everand
Big Data Analytics
Nitin Kumar Yadav
No ratings yet
DC Unit V.docx
No ratings yet
DC Unit V.docx
26 pages
Big Data Analytics - Unit 2
No ratings yet
Big Data Analytics - Unit 2
10 pages
BAD601 Big Data Model Question Paper Solution Search Creators
No ratings yet
BAD601 Big Data Model Question Paper Solution Search Creators
50 pages
Apache Hive Handbook: Query, Analyze, and Optimize Big Data
From Everand
Apache Hive Handbook: Query, Analyze, and Optimize Big Data
Robert Johnson
No ratings yet
The Age OF: Every Minute
No ratings yet
The Age OF: Every Minute
47 pages
Ashish_Presentation_Stage1_modify_LR
No ratings yet
Ashish_Presentation_Stage1_modify_LR
24 pages
BDTools
No ratings yet
BDTools
15 pages
Big Data complete Notes
No ratings yet
Big Data complete Notes
33 pages
Bba13 Notes Bdf Unit 1
No ratings yet
Bba13 Notes Bdf Unit 1
3 pages
Learn Hadoop in 24 Hours
From Everand
Learn Hadoop in 24 Hours
Alex Nordeen
No ratings yet
Big Data Technology
No ratings yet
Big Data Technology
9 pages
Module 2.pptx
No ratings yet
Module 2.pptx
20 pages
Big Data Analytics
100% (3)
Big Data Analytics
79 pages
Big Data Analytics
No ratings yet
Big Data Analytics
31 pages
Big Data Course Agenda
No ratings yet
Big Data Course Agenda
3 pages
Hadoop
No ratings yet
Hadoop
21 pages
UNIT1 -BDH
No ratings yet
UNIT1 -BDH
77 pages
Analyzing Limitations and Solutions of Existing Data Analytics
No ratings yet
Analyzing Limitations and Solutions of Existing Data Analytics
21 pages
Open Source Technologies
No ratings yet
Open Source Technologies
19 pages
BDA Module-2 Notes PDF
100% (1)
BDA Module-2 Notes PDF
14 pages
Big Data: Introduction To Terms, Concepts and Tools
No ratings yet
Big Data: Introduction To Terms, Concepts and Tools
23 pages
Ite06 Big Data Analytics-Qbank
No ratings yet
Ite06 Big Data Analytics-Qbank
18 pages
CP 329_Lecture twoNew_2025_122059
No ratings yet
CP 329_Lecture twoNew_2025_122059
43 pages
Big Data Analytics
100% (1)
Big Data Analytics
14 pages
Big Data S All Units
No ratings yet
Big Data S All Units
122 pages
IOT and Comp.architecture
No ratings yet
IOT and Comp.architecture
17 pages
Types of Digital Data: Unit 1 Big Data KCS-061
No ratings yet
Types of Digital Data: Unit 1 Big Data KCS-061
12 pages
Chapter 6 - Big Data Architecture Part 1
No ratings yet
Chapter 6 - Big Data Architecture Part 1
41 pages
Mastering Big Data and Hadoop: From Basics to Expert Proficiency
From Everand
Mastering Big Data and Hadoop: From Basics to Expert Proficiency
William Smith
No ratings yet
Chapter - 2 Hadoop
No ratings yet
Chapter - 2 Hadoop
32 pages
BIG DATA AND ANALYTICS presentation
No ratings yet
BIG DATA AND ANALYTICS presentation
31 pages
Big Data Spark Lab Manual 2025-2026
No ratings yet
Big Data Spark Lab Manual 2025-2026
62 pages
Big Data Unit 1 Notes
No ratings yet
Big Data Unit 1 Notes
20 pages
Fillatre Big Data
No ratings yet
Fillatre Big Data
98 pages
Big Data Lab Manual
No ratings yet
Big Data Lab Manual
36 pages
yasir f29 ass1 bigdata
No ratings yet
yasir f29 ass1 bigdata
7 pages
Hadoop PPT
No ratings yet
Hadoop PPT
25 pages
Module 1.ppt
No ratings yet
Module 1.ppt
29 pages
Big data 2
No ratings yet
Big data 2
49 pages
It-222 Reviewer
No ratings yet
It-222 Reviewer
3 pages
Berkeley Data Analytics Stack: Prof. Harold Liu 15 December 2014
No ratings yet
Berkeley Data Analytics Stack: Prof. Harold Liu 15 December 2014
48 pages
Big Data technologies UNIT 1
No ratings yet
Big Data technologies UNIT 1
5 pages
Big Data Lec4
No ratings yet
Big Data Lec4
38 pages
A Guide For Beginners: Big Data Glossary
No ratings yet
A Guide For Beginners: Big Data Glossary
1 page
Big Data Architecture
No ratings yet
Big Data Architecture
9 pages
Lec1 Special
No ratings yet
Lec1 Special
21 pages
Comprehensive Guide to Hive Architecture and Query Language: Definitive Reference for Developers and Engineers
From Everand
Comprehensive Guide to Hive Architecture and Query Language: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
BDMA Part 2
No ratings yet
BDMA Part 2
16 pages
BDA 02 - Fundamentals
No ratings yet
BDA 02 - Fundamentals
64 pages
BD by maaz
No ratings yet
BD by maaz
19 pages
Last Min Preparation -Big Data
No ratings yet
Last Min Preparation -Big Data
5 pages
Big Data Analytics 1-5
100% (1)
Big Data Analytics 1-5
63 pages
Notes Big Data
No ratings yet
Notes Big Data
106 pages
Big Data Analytics On Large Scale Shared Storage System: First Seminar
No ratings yet
Big Data Analytics On Large Scale Shared Storage System: First Seminar
22 pages
Big Data Analytics Presentation (1)
No ratings yet
Big Data Analytics Presentation (1)
7 pages
Lecture 2 - Big Data
No ratings yet
Lecture 2 - Big Data
8 pages
Study Guide - DISEC - CarMUN 2023
No ratings yet
Study Guide - DISEC - CarMUN 2023
28 pages
OULMS User Guide - Ver 1.0
No ratings yet
OULMS User Guide - Ver 1.0
7 pages
Install OpenProject With DEB - RPM Packages
No ratings yet
Install OpenProject With DEB - RPM Packages
26 pages
ISTQB Certified Tester - Foundation Level Syllabus v4.0-pg2
No ratings yet
ISTQB Certified Tester - Foundation Level Syllabus v4.0-pg2
1 page
Solution Manual To Chapter 10
100% (2)
Solution Manual To Chapter 10
6 pages
The 29 Annual Intelligent Ground Vehicle Competition (IGVC) Self-Drive
No ratings yet
The 29 Annual Intelligent Ground Vehicle Competition (IGVC) Self-Drive
64 pages
Abclx Driver Help
No ratings yet
Abclx Driver Help
56 pages
Piramal Agastya Case Study
No ratings yet
Piramal Agastya Case Study
3 pages
Github Ex 5 NIVESH R URK23CS1262
No ratings yet
Github Ex 5 NIVESH R URK23CS1262
4 pages
ELDEN RING NIGHTREIGN Closed Network Test Registrations Are Now Live Official Site
No ratings yet
ELDEN RING NIGHTREIGN Closed Network Test Registrations Are Now Live Official Site
1 page
Indigo BSP Form of Payment: Advisory Number: Effective Date: High Level Description
No ratings yet
Indigo BSP Form of Payment: Advisory Number: Effective Date: High Level Description
8 pages
Jar 16-20296
100% (4)
Jar 16-20296
13 pages
AQ-LCS-100-G Manual Rev. 2
No ratings yet
AQ-LCS-100-G Manual Rev. 2
21 pages
Barracuda NextGen Firewall F WAN Track-Student Guide Rev1
No ratings yet
Barracuda NextGen Firewall F WAN Track-Student Guide Rev1
92 pages
Program No. 1 AIM: Write A Program To Implement Array Traversal
No ratings yet
Program No. 1 AIM: Write A Program To Implement Array Traversal
37 pages
User Interface Design Process & Basic Design Issues
No ratings yet
User Interface Design Process & Basic Design Issues
14 pages
2010 Pepperdine University Information Technology Annual Report
No ratings yet
2010 Pepperdine University Information Technology Annual Report
15 pages
Atlantic Computer Case
100% (1)
Atlantic Computer Case
7 pages
Laboratory Report Cover Sheet
No ratings yet
Laboratory Report Cover Sheet
6 pages
Klemsan - KIO - EN - Low PDF
No ratings yet
Klemsan - KIO - EN - Low PDF
68 pages
Assignment - Knowledge Assessment: Written Answer Question Guidance
No ratings yet
Assignment - Knowledge Assessment: Written Answer Question Guidance
8 pages
AZ 104T00A ENU ChangeLog
No ratings yet
AZ 104T00A ENU ChangeLog
10 pages
Tally Prime
No ratings yet
Tally Prime
2 pages
AVEVA Hull Detailed
No ratings yet
AVEVA Hull Detailed
4 pages
MCE - Prep - Slide Guide - V1 - REDUCE
No ratings yet
MCE - Prep - Slide Guide - V1 - REDUCE
81 pages
Experiment 1: A.1 Aim: Finalize The Mini Project Problem Statement, With The Scope and Purpose of The Mini Project
No ratings yet
Experiment 1: A.1 Aim: Finalize The Mini Project Problem Statement, With The Scope and Purpose of The Mini Project
5 pages
S7basecomm Xsend Doku v10 e
No ratings yet
S7basecomm Xsend Doku v10 e
29 pages
Visual Workshop 3B User Guide
No ratings yet
Visual Workshop 3B User Guide
129 pages
CS412 Assignment 1 Ref Solution
50% (2)
CS412 Assignment 1 Ref Solution
8 pages

Ocs352-Iot Book-69-71

Uploaded by

Ocs352-Iot Book-69-71

Uploaded by

188 Internet of Things: Architecture and Design Principles

5.5.6 Big Data Analytics

5.5.7 Data Analytics Architecture and Stack

● Data storage and processing layer

● Data access and query processing layer

● Data services, reporting and advanced analytics layer

Services, Reporting, Data Visualisations, OLAP, Analytics

Data Access, SQL, Query Processing, OLTP, ETL, Analytics

Figure 5.4 Analytics Architecture Reference Model

Data Access, SQL, Query Processing, OLTP, ETL, Analytics HiveQL

Figure 5.5 Berkeley data analytics stack architecture

Reconfirm Your Understanding

You might also like