Bsd1313 Chapter 1
BSD1313
DR. MOHD KHAIRUL BAZLI BIN MOHD AZIZ
CENTRE FOR MATHEMATICAL SCIENCES, UNIVERSITI MALAYSIA PAHANG
CHAPTER 1: INTRODUCTION TO DATA SCIENCE
The term “big data” refers to data that is so large, fast or complex that
it’s difficult or impossible to process using traditional methods. The act
of accessing and storing large amounts of information for analytics has
been around for a long time. But the concept of big data gained momentum
in the early 2000s when industry analyst Doug Laney articulated the
now-mainstream definition of big data as the three V’s:
Volume
Velocity
Variety
Meet Katie Bouman, the woman
behind the first-ever image of a
black hole
She led the development of a computer
program which eventually put all the pieces
together.
The 10 most in-demand jobs in 2020, as reported in the World Economic Forum (WEF, 2018) by Guthrie Jensen Global
Training Consultants.
WHY DATA SCIENCE?
2022 Job Skills
Amazon's recommendation
engines suggest items for you
to buy, determined by their
algorithms
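As a rough illustration of how a recommendation engine can suggest items, the sketch below ranks products by how often they were bought together with a given item. It is a toy co-occurrence count, not Amazon's actual algorithm; the purchase data, item names and the function `recommend` are all hypothetical.

```python
from collections import Counter

# Hypothetical purchase histories; item names are invented for illustration.
baskets = [
    {"laptop", "mouse", "laptop bag"},
    {"laptop", "mouse"},
    {"laptop", "keyboard"},
    {"phone", "phone case"},
    {"mouse", "keyboard"},
]

def recommend(item, baskets, top_n=2):
    """Rank other items by how often they were bought together with `item`."""
    counts = Counter()
    for basket in baskets:
        if item in basket:
            # Count every other item that shares a basket with `item`.
            counts.update(basket - {item})
    # Sort by count (descending), then by name, so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda pair: (-pair[1], pair[0]))
    return [name for name, _ in ranked[:top_n]]

suggestions = recommend("laptop", baskets)
print(suggestions)
```

Real recommendation systems use far richer signals (ratings, browsing history, item attributes), but the idea of learning suggestions from past customer behaviour is the same.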
DATA SCIENCE
DISCOVERY OF DATA
INSIGHT
Amazon is a global e-commerce and cloud computing company.
It hires data scientists on a large scale.
The data scientists explore the customer mindset and enhance the
geographical reach of both the e-commerce and cloud domains.
DATA SCIENCE
VISA
According to Alison Doyle (2019), a data scientist is a multi-skilled person who combines analytical skills, mathematics,
programming, open-mindedness and communication.
Analytical Skills: Big Data, Data Analysis, Data Analytics, Data Science, Predictive Modelling, Data Mining, Data Visualization
Open-Mindedness: Adaptability, Decision Making, Critical Thinking, Logical Thinking, Problem Solving
Communication: Assertiveness, Collaboration, Consulting, Consensus, Facilitating, Leadership, Professionals, Verbal/Written Communications
Mathematics: Statistics, Construct Algorithms, Linear Algebra, Machine Learning, Multivariable Calculus
Programming and Technical Proficiencies: Microsoft Excel, Python/R/C++/Java, MATLAB, SQL, NoSQL, Tableau
WHAT DO DATA SCIENTISTS DO?
A data scientist's job is to analyze data for actionable insights by
doing the following tasks:
Identifying and asking questions to be solved.
Devising and applying models and algorithms for mining big data
from structured and unstructured forms.
Cleaning and validating data to ensure accuracy, completeness
and uniformity.
Analyzing the data to identify patterns and trends.
Communicating findings to stakeholders using visualization and
other means.
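The cleaning, validating and analyzing tasks listed above can be sketched in a few lines of Python. This is a minimal illustration using only the standard library; the sales records are invented, and the helper names `clean` and `trend_slope` are hypothetical, not part of any course material or library.

```python
from statistics import mean

# Hypothetical raw records; all values are invented for illustration.
raw_records = [
    {"month": 1, "sales": "100"},
    {"month": 2, "sales": "120"},
    {"month": 2, "sales": "120"},   # duplicate row
    {"month": 3, "sales": None},    # missing value
    {"month": 4, "sales": "150"},
    {"month": 5, "sales": "abc"},   # invalid (non-numeric) value
    {"month": 6, "sales": "180"},
]

def clean(records):
    """Cleaning and validating: drop missing, non-numeric and duplicate rows."""
    seen = set()
    cleaned = []
    for rec in records:
        value = rec["sales"]
        if value is None:
            continue                      # completeness check
        try:
            sales = float(value)          # uniformity: store sales as numbers
        except ValueError:
            continue                      # accuracy check: reject bad input
        key = (rec["month"], sales)
        if key in seen:
            continue                      # remove exact duplicates
        seen.add(key)
        cleaned.append({"month": rec["month"], "sales": sales})
    return cleaned

def trend_slope(records):
    """Analyzing: least-squares slope of sales against month."""
    xs = [r["month"] for r in records]
    ys = [r["sales"] for r in records]
    x_bar, y_bar = mean(xs), mean(ys)
    num = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    den = sum((x - x_bar) ** 2 for x in xs)
    return num / den

cleaned = clean(raw_records)
slope = trend_slope(cleaned)
# Communicating: report the finding in plain language.
print(f"{len(cleaned)} valid records; sales change by about {slope:.1f} per month")
```

A positive slope indicates an upward trend in the cleaned data. In practice, a data scientist would typically use libraries such as pandas for cleaning and a plotting library for visualization; the plain-Python version here only makes each step explicit.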
FUTURE DATA
SCIENTIST IS BORN
IN UMP