0% found this document useful (0 votes)
68 views5 pages

6.BDA Question Bank

1. The document provides information about the CE and IT departments at Ahmedabad Institute of Technology, including their visions and missions. 2. The CE department aims to produce technically and ethically responsible computer engineers through quality education. The IT department aims to provide quality education to shape students technically and ethically. 3. Both departments seek to promote lifelong learning and overall student development through curricular, co-curricular and extra-curricular activities.

Uploaded by

shalini s
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
68 views5 pages

6.BDA Question Bank

1. The document provides information about the CE and IT departments at Ahmedabad Institute of Technology, including their visions and missions. 2. The CE department aims to produce technically and ethically responsible computer engineers through quality education. The IT department aims to provide quality education to shape students technically and ethically. 3. Both departments seek to promote lifelong learning and overall student development through curricular, co-curricular and extra-curricular activities.

Uploaded by

shalini s
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 5

Ahmedabad Institute of Technology

CE & IT Department
Big Data Analytics (2180710)(Dept Elec - II)
Question Bank
Year: 2020-21

Prepared By: - Prof. Neha Prajapati


CE Department Vision:
To produce technically sound and ethically responsible Computer Engineers to the society by
providing Quality Education.

CE Department Mission:

1) To provide healthy Learning Environment based on current and future Industrial demands.

2) To promote curricular, co-curricular and extra-curricular activities for overall personality


development of the students.

3) To groom technically powerful and ethically dominant engineers having real life problem
solving capabilities.

4) To provide platform for Effective Teaching Learning.

IT Department Vision:
To provide quality education and assistance to the students through innovative teaching learning
methodology for shaping young mind technically sound and ethically strong.

IT Department Mission:

1) To serve society by producing technically and ethically sound engineers.

2) To generate groomed and efficient problem solvers as per Industrial needs by adopting
innovative teaching learning methods.

3) To emphasis on overall development of the students through various curricular, co-curricular


and extra-curricular activities.
Topic Marks

TOPIC:1. Introduction Big Data [CO-2]


[Distributed file system–Big Data and its importance, Four Vs, Drivers for Big
data, Big data analytics, Big data applications. Algorithms using map reduce]

1 What is Big Data ? Explain Characteristicis of Big Data.(Nov-2016- 7


IT)(May_2018_IT)(May_2019_CE)(Dec_2019_IT)
2 Draw HDFS Architecture. Explain any two commands of HDFS from the following commands 7
with syntax atleast one example of each. CopyFromLocal,setrep,checksum.(Nov 2016-IT)
3 What is big data analytics? Explain four ‘V’s of Big data. Briefly discuss applications of big 7
data.(May 2017)(May_2019_CE)(Dec_2019_IT)
4 What is Map Reduce? Explain working of various phases of Map Reduce with appropriate 7
example and diagram(May 2017)(May_2019_CE)
5 Discuss Big Data in Healthcare,Trasportation & Medicine.(May-2017) 7

6 What are the advantages of Hadoop? Explain Hadoop Architecture and its Components with
7
proper diagram.(Nov_2017_IT)
7 Write Map Reduce steps for counting occurrences of specific numbers in the input text file(s). 7
Also write the commands to compile and run the code.(April_2018_IT)(Dec_2019_IT)
8 Discuss Hadoop YARN in detail with failures in classic MapReduce.(May-2017)
7

9 What are the benefits of Big Data? Discuss challenges under Big Data. How Big Data Analytics 7
can be useful in the development of smart cities.(April-2018)
10 Explain Job Scheduling in Map Reduce. How it is done in case of (i) The Fair Scheduler 7
(ii) The Capacity Scheduler(Nov_2016_IT)
11 What is Big data? Discuss it in terms of four dimensions, volume, velocity(May -2017) 4

12 List various application of big data. How it can be used to improve business for a
4
superstore.(April_2018_IT)(Winter_2019_IT)(Dec_2019_IT)
13 Explain Map-reduce framework in brief.(Nov-2017-IT) 4

14 Explain the difference between structure and unstructured data.(May_2019_CE) 3

15 Explain “Map Phase” and “Combiner Phase” in MapReduce.(April_2018_IT)(Dec_2019_IT)


3

16 What is Big Data? Explain how big data processing differs from distributed
3
processing.(April_2018_IT)
17 Explain Avro data serialization technique in MapReduce.(April_2018_IT)
3

18 Explain following commands with syntax and at least one example of each. (1) copyFromLocal
3
(2) showing the content of outputfile (3) setrep (4) checksum.(April_2018_IT)(Dec_2019_IT)
19 Explain advantages and disadvantages of big data analytics.(April_2018_IT) 3
TOPIC:2 Introduction to Hadoop and Hadoop Architecture[Co-1]
[Big Data – Apache Hadoop & Hadoop EcoSystem, Moving Data in and out of Hadoop –
Understanding inputs and outputs of MapReduce -, Data Serialization]
1 What are the advantages of Hadoop? Explain Hadoop Architecture and its Components with proper 7
diagram.(Nov-2016-IT)(May_2018_IT)
2 What is Hadoop Ecosystem? Discuss various components of Hadoop Ecosystem.(May 7
2017)(April_2018_IT)
3 Explain core architecture of Hadoop with suitable block diagram. Discuss role of each component in 7
detail.(May 2017)
4 What is data serialization? With proper examples discuss and differentiate structured, unstructured and 7
semi-structured data. Make a note on how type of data affects data serialization.(May
2017)(April_2018_IT)(May_2019_CE)
5 List various configuration files used in Hadoop Installation. What is use of mapred- 3
site.xml?(April_2018_IT)
6 What is Name node & Data node in Hadoop Architecture.(May_2019_CE)
3

TOPIC:3 Hdfs,Hive and HiveQL,Hbase [CO-1]

1 Explain working of Hive with proper steps and 7


diagram.(Nov_2016_IT)(April_2018_IT)(May_2019_CE)(Dec_2019_IT)
2 (i) What is Zookeeper? List the benefits of it. (ii) Differentiate: Apache pig Vs Map 7
Reduce.(May_2017_IT)(May_2017_CE)(Nov_2017_IT)(April_2018_IT)(May_2018_IT)(May_20
19_CE)(Dec_2019_IT)
3 Define HDFS. Discuss the HDFS Architecture and HDFS Commands in 7
brief.(April_2018_IT)(May_2018_IT)(Dec_2019_IT)
4 What do you mean by HiveQL Data Definition Language? Explain any three HiveQL DDL command 7
with its syntax and example.(Nov_2016_IT)(May_2017_CE)(Nov_2017_IT)(May_2019_CE)
5 (i) Explain Metastore in Hive. (ii) Explain Storage mechanism in 7
HBase.(Nov_2016_IT)(May_2018_IT)(May_2019_CE)(Dec_2019_IT)
6 With suitable block diagram explain architecture of HDFS. Discuss role of Data node and Name node 7
in HDFS. Give commands with appropriate arguments to perform data transfer between local file
system and HDFS.(Nov_2017_IT)(April_2018_IT)(May_2019_CE)
7 Explain the HiveQL-Select-Order By with suitable example.(Nov_2017_CE)(May_2019_CE) 7

8 Define join and explain types of join(Nov_2017_CE)(Dec_2019_IT) 7


9 Explain the concept of Blocks and Heartbeat Message in HDFS Architecture. What are the benefits of 7
block transfer?(Nov_2017_IT)
10 Explain Hive Data types(Nov_2017_CE) 4

11 Explain HBase architecture.(Nov_2017_CE)(April_2018_IT) 4


12 Discuss role of Data node and Name node in HDFS.(April_2018_IT) 4
13 Write a short note on Apache Pig. Enlist applications of Apache 4
Pig.(Nov_2017_IT)(April_2018_IT)(May_2018_IT)(Dec_2019_IT)
14 Difference between Hive and RDBMS(Nov_2017_CE) 3
15 Difference between HDFS and Hbase.(Nov_2017_CE)(May_2019_CE) 3
16 Compare Raw oriented and Column Oriented database structures.(Nov_2017_IT)(May_2018_IT) 3
17 Explain the 5 P’s of Data science in brief.(April_2018_IT) 3

TOPIC:4 Spark[CO-1,2,5]
1 Explain Spark components in detail. Also list the features of 7
spark.(Nov_2017_IT)(Nov_2016_IT)(Dec_2019_IT)
2 What are the problems related to Map Reduce data storage? How Apache Spark solves it using 7
Resilient Distributed Dataset? Explain RDDs in
detail.(Nov_2017_IT)(May_2019_CE)(Dec_2019_IT)
3 What is Apache Spark? What are the advantages of using Apache Spark over Hadoop? 7
Explain in brief four major libraries of Apache
Spark.(May_2017_CE)(May_2018_IT)(May_2019_CE)
4 What is transformation and actions in Apache Spark? Discuss various commands available for 7
this activities in Apache Spark?(May_2017_CE)
5 What is Resilient Distributed Dataset in Apache Spark? Explain in detail. Make a note on why 7
RDD is better than Map Reduce data
storage?(May_2017_CE)(April_2018_IT)(May_2019_CE)
6 What is Apache Spark(May_2018_CE)(May_2019_CE) 3

TOPIC:5 NoSQL[CO-4]
[Types of NoSQL databases, Why NoSQL?, Advantages of NoSQL, Use of
NoSQL in Industry, SQL vs NoSQL, NewSQL]

1 What is NoSQL database? List the differences between NoSQL and relational databases. 7
Explain in brief various types of NoSQL databases in practice.(Nov_2016_IT)(Dec_2019_IT)
2 Use of NoSQL in industry.(Nov_2017_CE) 7
3 Define NewSQL and explain benefits and limitation of NewSQL.(Nov_2017_IT) 7
4 Define NoSQL and where is it used? (b) i) Document Oriented Database ii) Graph based 7
Database.(Nov_2017_CE)(May_2019_CE)
5 Write differences between NoSQL and SQL.(Nov_2017_IT)(May_2019_CE) 4

6 Explain NoSQL.(Nov_2017_IT)(May_2019_CE) 3

TOPIC:6 Data Base for the Modern Web[CO-3]


[MongoDB key features, Core Server tools, MongoDB through the JavaScript’s
Shell, Creating and Querying through Indexes, Document-Oriented, principles of
schema design, Constructing queries on Databases, collections and Documents ,
MongoDB Query Language]

1 Explain following in brief with repect to mongo DB : (1) Collections and documents, (2) 7
Indexing and retrieval (3) Data aggregation(May_2018_IT)(Nov_2016_IT)(Dec_2019_IT)
2 Explain scaling in MangoDB(April-2018)(May_2019_CE) 7

3 Explain CRUD operations in MongoDB.(April- 7


2018)(Nov_2016_IT)(May_2019_CE)(Dec_2019_IT)
4 Requirement specification of blog application in social networking is as follows. 7
Every post has a unique title, description and url.
Every post can have one or more tags.
Every post has the name of its publisher and total number of likes.
Every post has comments given by users along with their name, message, data-time and likes.
On each post, there can be zero or more comments.
For this set of requirements design a Mongo DB schema.(May_2017_CE)
5 What is Mongo DB? Explain in brief key features of Mongo DB. Show basic CRUD operations 7
in Mongo DB with proper example.(May_2017_CE)(May_2019_CE)
6 Requirement specification for a meeting dashboard application in an organization is as 7
follows:
Any member in an organization can host a meeting and send an invitations to other members
within an organization.
Invitees can accept or reject the meeting with proper reason.
Every meeting has the title, timestamp and place/location associated.
Every meeting has predefined agendas and documents associated.
Meeting discussion concludes with identifying tasks to accomplish.
Every task has title, priority, deadline and note associated with it. Task can be assigned to
any attendee of meeting.
For this set of requirements design a Mongo DB schema.(May_2017_CE)
7 Explain MongoDB-Create database and Drop-Database(Nov_2017_CE)(May_2019_CE) 7

8 Explain principles of schema design in MongoDB.(May_2017_CE)(May_2019_CE) 4

9 Write difference between MangoDB and Hadoop.(May_2017_CE) 4

10 Explain MongoDB shell and how to run the shell(May_2019_CE) 4

11 Explain Key Features of MongoDB in brief.(May_2018_IT) 3

You might also like