6.BDA Question Bank
6.BDA Question Bank
CE & IT Department
Big Data Analytics (2180710)(Dept Elec - II)
Question Bank
Year: 2020-21
CE Department Mission:
1) To provide healthy Learning Environment based on current and future Industrial demands.
3) To groom technically powerful and ethically dominant engineers having real life problem
solving capabilities.
IT Department Vision:
To provide quality education and assistance to the students through innovative teaching learning
methodology for shaping young mind technically sound and ethically strong.
IT Department Mission:
2) To generate groomed and efficient problem solvers as per Industrial needs by adopting
innovative teaching learning methods.
6 What are the advantages of Hadoop? Explain Hadoop Architecture and its Components with
7
proper diagram.(Nov_2017_IT)
7 Write Map Reduce steps for counting occurrences of specific numbers in the input text file(s). 7
Also write the commands to compile and run the code.(April_2018_IT)(Dec_2019_IT)
8 Discuss Hadoop YARN in detail with failures in classic MapReduce.(May-2017)
7
9 What are the benefits of Big Data? Discuss challenges under Big Data. How Big Data Analytics 7
can be useful in the development of smart cities.(April-2018)
10 Explain Job Scheduling in Map Reduce. How it is done in case of (i) The Fair Scheduler 7
(ii) The Capacity Scheduler(Nov_2016_IT)
11 What is Big data? Discuss it in terms of four dimensions, volume, velocity(May -2017) 4
12 List various application of big data. How it can be used to improve business for a
4
superstore.(April_2018_IT)(Winter_2019_IT)(Dec_2019_IT)
13 Explain Map-reduce framework in brief.(Nov-2017-IT) 4
16 What is Big Data? Explain how big data processing differs from distributed
3
processing.(April_2018_IT)
17 Explain Avro data serialization technique in MapReduce.(April_2018_IT)
3
18 Explain following commands with syntax and at least one example of each. (1) copyFromLocal
3
(2) showing the content of outputfile (3) setrep (4) checksum.(April_2018_IT)(Dec_2019_IT)
19 Explain advantages and disadvantages of big data analytics.(April_2018_IT) 3
TOPIC:2 Introduction to Hadoop and Hadoop Architecture[Co-1]
[Big Data – Apache Hadoop & Hadoop EcoSystem, Moving Data in and out of Hadoop –
Understanding inputs and outputs of MapReduce -, Data Serialization]
1 What are the advantages of Hadoop? Explain Hadoop Architecture and its Components with proper 7
diagram.(Nov-2016-IT)(May_2018_IT)
2 What is Hadoop Ecosystem? Discuss various components of Hadoop Ecosystem.(May 7
2017)(April_2018_IT)
3 Explain core architecture of Hadoop with suitable block diagram. Discuss role of each component in 7
detail.(May 2017)
4 What is data serialization? With proper examples discuss and differentiate structured, unstructured and 7
semi-structured data. Make a note on how type of data affects data serialization.(May
2017)(April_2018_IT)(May_2019_CE)
5 List various configuration files used in Hadoop Installation. What is use of mapred- 3
site.xml?(April_2018_IT)
6 What is Name node & Data node in Hadoop Architecture.(May_2019_CE)
3
TOPIC:4 Spark[CO-1,2,5]
1 Explain Spark components in detail. Also list the features of 7
spark.(Nov_2017_IT)(Nov_2016_IT)(Dec_2019_IT)
2 What are the problems related to Map Reduce data storage? How Apache Spark solves it using 7
Resilient Distributed Dataset? Explain RDDs in
detail.(Nov_2017_IT)(May_2019_CE)(Dec_2019_IT)
3 What is Apache Spark? What are the advantages of using Apache Spark over Hadoop? 7
Explain in brief four major libraries of Apache
Spark.(May_2017_CE)(May_2018_IT)(May_2019_CE)
4 What is transformation and actions in Apache Spark? Discuss various commands available for 7
this activities in Apache Spark?(May_2017_CE)
5 What is Resilient Distributed Dataset in Apache Spark? Explain in detail. Make a note on why 7
RDD is better than Map Reduce data
storage?(May_2017_CE)(April_2018_IT)(May_2019_CE)
6 What is Apache Spark(May_2018_CE)(May_2019_CE) 3
TOPIC:5 NoSQL[CO-4]
[Types of NoSQL databases, Why NoSQL?, Advantages of NoSQL, Use of
NoSQL in Industry, SQL vs NoSQL, NewSQL]
1 What is NoSQL database? List the differences between NoSQL and relational databases. 7
Explain in brief various types of NoSQL databases in practice.(Nov_2016_IT)(Dec_2019_IT)
2 Use of NoSQL in industry.(Nov_2017_CE) 7
3 Define NewSQL and explain benefits and limitation of NewSQL.(Nov_2017_IT) 7
4 Define NoSQL and where is it used? (b) i) Document Oriented Database ii) Graph based 7
Database.(Nov_2017_CE)(May_2019_CE)
5 Write differences between NoSQL and SQL.(Nov_2017_IT)(May_2019_CE) 4
6 Explain NoSQL.(Nov_2017_IT)(May_2019_CE) 3
1 Explain following in brief with repect to mongo DB : (1) Collections and documents, (2) 7
Indexing and retrieval (3) Data aggregation(May_2018_IT)(Nov_2016_IT)(Dec_2019_IT)
2 Explain scaling in MangoDB(April-2018)(May_2019_CE) 7