
06_Banerjee and Banerjee_Business Analytics_Ch06

Chapter 6 discusses various analytical methods for both parametric and non-parametric data in business research, emphasizing the significance of sampling, confidence intervals, hypothesis testing, and correlation. It covers techniques such as cross-tabulation, factor analysis, regression models, and forecasting, while addressing issues like multicollinearity and heteroscedasticity in time series analysis. The chapter aims to equip researchers with the necessary tools to analyze data effectively and make informed decisions based on statistical findings.

Chapter 6: Analytical Methods for Parametric and Non-parametric Data
Contents

1. Significance of sampling in business research.
2. Confidence interval and hypothesis testing.
3. Cross-tabulation.
4. Correlation.
5. Factor analysis.
6. Regression (OLS) models.
7. Multicollinearity.
8. Forecasting and time series analysis.
9. Heteroscedasticity in time series models.
Significance of Sampling in Business Research

Opening example:
• Every year, about 2 lakh (200,000) students in India appear for the CAT to secure
admission to postgraduate programmes at reputed business schools.
• This is a computer-based examination that tests the students’ verbal ability (VA),
quantitative ability (QA), data interpretation (DI), logical reasoning (LR) and reading
comprehension (RC).
• Students receive their score as a percentile, which ranks them vis-à-vis the
performance of all those who appeared for the examination.
• But does a high score in the CAT guarantee a seat in a big business school?
• The business schools’ admission teams consider several variables and metrics
together when finalizing the list of admitted students.
Significance of Sampling in Business Research

• Population is the entire collection of items under study.
• The characteristics of the individuals (variables/attributes) who form the
population help define the target group.
• A small group that represents the population under study and has all the
characteristics of the population is called a sample.
Types of Sampling

• There are primarily two types of sampling techniques: probability and non-
probability sampling.
• In probability sampling, every item in the population has a known, non-zero
chance of being selected (in simple random sampling, an equal chance).
• In non-probability sampling, selection relies on interventions such as likelihood
of reach, human judgement and grouping methods.
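A minimal sketch of the contrast, using only Python's standard library; the population of customer IDs and the sample size are hypothetical:

```python
import random

population = list(range(1, 1001))   # hypothetical population: 1,000 customer IDs

# Probability sampling: simple random sampling, every ID has an equal chance
random.seed(42)                     # fixed seed so the draw is reproducible
probability_sample = random.sample(population, k=50)

# Non-probability sampling: convenience sampling, whoever is easiest to reach
convenience_sample = population[:50]   # just the first 50 IDs, not representative
```

The convenience sample systematically excludes most of the population, which is exactly the representativeness risk that probability sampling avoids.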
Types of Sampling
Confidence Interval and Hypothesis Testing

• Analysts test several hypotheses based on the objectives of a research problem or
opportunity. In most cases, the investment of time, money and resources is made
because the analyst intends to test the possibility of accepting an alternate
hypothesis.
• As you have studied in your statistics and business research methods (BRM)
courses, the confidence interval is used to reject (or fail to reject) the null
hypothesis based on statistical tests conducted on the sample data.
• For a 95 per cent confidence level, the null hypothesis gets rejected if the p-value
in the outcome is less than 0.05, indicating that the value lies in the 5 per cent
rejection region.
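As an illustration, a one-sample t-test is one common such statistical test; a sketch with scipy, where the sample values and the hypothesized mean of 100 are made up for the example:

```python
from scipy import stats

sample = [104, 98, 101, 107, 96, 103, 99, 110, 95, 102]   # hypothetical data
t_stat, p_value = stats.ttest_1samp(sample, popmean=100)  # H0: population mean = 100

if p_value < 0.05:   # 95 per cent confidence level
    print(f"p = {p_value:.3f}: reject the null hypothesis")
else:
    print(f"p = {p_value:.3f}: fail to reject the null hypothesis")
```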
Bell-Shaped Curve
Cross-Tabulation

• Cross-tabulations, or contingency tables, are ways of grouping variables to
analyse the relationships between them.
• They are used for categorical data, that is, data grouped into mutually exclusive
categories.
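A sketch of a contingency table in pandas, followed by a chi-square test of independence; the gender/product-category data are hypothetical:

```python
import pandas as pd
from scipy.stats import chi2_contingency

# Hypothetical survey data: gender versus preferred product category
df = pd.DataFrame({
    "gender":   ["M", "F", "F", "M", "F", "M", "F", "M"],
    "category": ["A", "B", "B", "A", "A", "B", "B", "A"],
})

table = pd.crosstab(df["gender"], df["category"])   # the cross-tabulation
chi2, p, dof, expected = chi2_contingency(table)    # test for an association
print(table)
print(f"chi2 = {chi2:.2f}, p = {p:.3f}")
```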
Correlation

• When the variables are numerical, the degree of strength of the relationship
between them is expressed by correlation.
• The variables need to be numerical and continuous in nature, for example, age,
height, weight, sales volume and number of units sold per day.
• Correlation is expressed by r, which ranges from −1 to +1. Positive values indicate
positive correlation (as one variable increases, so does the other); negative values
indicate the opposite.
Negative Correlation

[Scatter plot: January temperature versus latitude, with fitted line
f(x) = −1.758x + 104.982 and R² = 0.723. Temperature falls as latitude rises,
illustrating a negative correlation.]
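A sketch of computing Pearson's r with scipy; the latitude/temperature pairs below are illustrative, not the chart's underlying data:

```python
from scipy.stats import pearsonr

latitude    = [20, 25, 30, 35, 40, 45, 50, 55]   # hypothetical latitudes
temperature = [70, 62, 55, 44, 35, 28, 18, 10]   # hypothetical January temperatures

r, p_value = pearsonr(latitude, temperature)
print(f"r = {r:.3f}")   # close to -1: a strong negative correlation
```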
Factor Analysis

• A correlation matrix is a useful input into conducting various types of factor
analysis.
• A commonly used variant of factor analysis is PCA, also referred to as exploratory
factor analysis.
• In PCA, all associations among the variables of interest are identified (in numeric
terms, through a correlation analysis).
• Factor analysis (PCA) provides a platform to reduce data without a commensurate
reduction in the information content of the data. Using correlation as the basis of
commonality across variables, it groups variables with similar ‘information’ and
bunches them together (so that one variable can represent the others in its group).
• In applications, this is a useful way to summarize the information into tighter
dimensions, helpful for understanding and interpreting the information.
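A minimal PCA sketch with scikit-learn; the 100-by-6 data matrix is randomly generated for illustration, and standardizing the variables first is a common (not mandatory) choice:

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 6))              # hypothetical: 100 respondents, 6 variables

X_std = StandardScaler().fit_transform(X)  # put all variables on a common scale
pca = PCA(n_components=2)                  # reduce 6 variables to 2 components
scores = pca.fit_transform(X_std)
print(pca.explained_variance_ratio_)       # share of information each component keeps
```

The explained-variance ratios show how much of the original information survives the reduction, which is the trade-off described above.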
Regression (OLS) Models

• OLS regression models, another type of multivariate analysis, are widely used as
prediction models.
• The basic requirement for developing these models is an outcome variable
measured on a continuous scale (interval or ratio), along with some relevant
predictor (explanatory) variables, usually also measured on a continuous scale.
• This modelling technique is, in effect, another form of correlation analysis across
multiple variables.
• The theme of these models is the association of one target (outcome) variable with
other explanatory (predictor) variables, which may also be termed ‘antecedent’
variables. Antecedence is established only by the domain in which the model is
being built, not by the correlation among the variables.
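A minimal OLS sketch with statsmodels; the variable names (ad_spend, sales) and the data are hypothetical:

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(1)
ad_spend = rng.uniform(10, 100, size=50)                 # hypothetical predictor
sales = 5 + 0.8 * ad_spend + rng.normal(0, 5, size=50)   # hypothetical outcome

X = sm.add_constant(ad_spend)    # add the intercept term to the design matrix
model = sm.OLS(sales, X).fit()
print(model.summary())           # coefficients, std. errors, t-values, R-square
```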
Regression (OLS) Models

• The closer the fit is to the actual data (the higher the R-square), the better the
chance that the equation will be able to predict outcomes based on values of the
input (explanatory) variables.
• However, there is no guarantee that the models will continue to predict well across
other data samples, unless the nature of the data remains largely the same.
• The standardized coefficient (the magnitude) signifies the importance of the
variable in determining the value of the outcome. The sign (+ve or −ve) determines
the nature of the relationship between the outcome and the variable.
Regression (OLS) Models

• The (un)standardized coefficients indicate the relationship between each of the
explanatory variables and the outcome variable (referred to as the dependent
variable).
• The ‘std. error’ is a measure of the ‘fuzziness’ of the relationship (the higher the
number, the larger the fuzziness about the existence of the relationship).
• The unstandardized coefficient and the ‘std. error’ together determine the strength
of the relationship, which is indicated by the ‘t’ value.
• The higher the absolute ‘t’ value, the stronger the relationship between the specific
predictor variable and the outcome variable.
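To make the relationship concrete, the t-value reported for each coefficient is simply the unstandardized coefficient divided by its standard error; the numbers below are hypothetical:

```python
coef, std_err = 0.8, 0.05   # hypothetical coefficient and its std. error
t_value = coef / std_err    # t = 16.0: a strong, precisely estimated relationship
print(t_value)
```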
Multicollinearity

• Multicollinearity is a common problem in a diagnostic regression model meant to
identify the strength of the relationships of predictor variables with the outcome
variable.
• If the true strength of the relationship is ‘hidden’ due to the correlation of an
explanatory variable with another, the objective of identifying relationships among
variables (predictor and outcome variables) becomes difficult.
• Usually, when two explanatory variables are correlated, the regression model is
unable to separate out the independent associations of each of the explanatory
variables with the outcome variable, since the former have an association among
themselves as well.
• In reality, even when correlations are moderate, the effect of multicollinearity is
noticed as higher ‘fuzziness’ (larger standard errors) in the coefficients, weakening
the apparent association with the outcome variable.
• Manoeuvring around a multicollinearity problem that complicates diagnosis
requires some creativity and prior experience of handling such problems; there is
usually no standard solution.
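One common diagnostic (standard practice, though not named above) is the variance inflation factor; a sketch with statsmodels on hypothetical data where x2 is nearly a copy of x1:

```python
import numpy as np
import pandas as pd
import statsmodels.api as sm
from statsmodels.stats.outliers_influence import variance_inflation_factor

rng = np.random.default_rng(2)
x1 = rng.normal(size=200)
x2 = x1 + rng.normal(scale=0.1, size=200)   # nearly collinear with x1
x3 = rng.normal(size=200)                   # an independent predictor
X = sm.add_constant(pd.DataFrame({"x1": x1, "x2": x2, "x3": x3}))

for i, col in enumerate(X.columns):
    if col != "const":
        print(col, variance_inflation_factor(X.values, i))  # VIF >> 10 flags trouble
```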
Forecasting and Time Series Analysis

• There are many applications where the primary role of the model is to find a
relationship between the explanatory variables and the outcome, in order to
predict/forecast outcomes in the future. Such models are termed forecasting
models.
• The smaller the band of uncertainty, the higher the confidence in the estimated
outcome (or forecast).
• The R-square of these models is usually expected to be very high (although a high
R-square is just one of many necessary conditions).
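A sketch of such an uncertainty band around a forecast, using an OLS model in statsmodels; the history and the two forecast points are hypothetical:

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(5)
x = rng.uniform(0, 10, size=60)
y = 2 + 1.5 * x + rng.normal(size=60)        # hypothetical historical data

model = sm.OLS(y, sm.add_constant(x)).fit()
new_x = sm.add_constant(np.array([4.0, 8.0]), has_constant="add")
pred = model.get_prediction(new_x)
print(pred.summary_frame(alpha=0.05))        # point forecasts with 95% bands
```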
Forecasting and Time Series Analysis

• Time series analysis is a form of forecasting model estimated from historical
(time-indexed) data on outcomes, which is used to project the expected outcome
for the future.
• The time series model represents a situation where the forecasted value of the
future is assumed to be significantly driven by past outcomes (although there may
be a part that is determined by explanatory variables).
• In forecasting models using time series data, the objective is not so much to
explain the reasons for a certain expected outcome as to predict the future
accurately.
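A sketch of a simple autoregressive forecast with statsmodels; the series is simulated, and the AR(1) structure is chosen only for illustration:

```python
import numpy as np
from statsmodels.tsa.ar_model import AutoReg

rng = np.random.default_rng(3)
y = [50.0]
for _ in range(99):                   # simulate a hypothetical AR(1) series:
    y.append(15 + 0.7 * y[-1] + rng.normal(scale=2))   # today depends on yesterday

model = AutoReg(np.array(y), lags=1).fit()
forecast = model.predict(start=len(y), end=len(y) + 4)  # project the next 5 periods
print(forecast)
```

The fitted lag coefficient captures exactly the idea above: the forecast is driven mainly by past outcomes.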
Heteroscedasticity in Time Series Models

• In time series analysis, it is worthwhile to discuss the properties of the
unexplained part (εt+1) of the model.
• This is normally assumed to be random (normally distributed) and uncorrelated
across successive observations, although in most practitioner settings, analysts
rarely check the actual distribution of the errors to validate the model (not that we
would like to ratify such practice).
• When a positive error is likely to be followed by another positive error, and a
negative error by another negative one, the errors are serially correlated (termed
heteroscedastic here), which is a violation of the assumptions of the regression
model.
• Technically, this distortion needs to be addressed to build a technically sound
model. The statistical properties of the coefficient estimates in an OLS model are
invalid when the errors are correlated.
• A Durbin–Watson test is employed to detect serial correlation in the errors and to
ascertain whether it has been resolved after correction.
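A sketch of the test on the residuals of a fitted OLS model; the data are simulated with independent errors, so the statistic should come out near 2 (values near 0 or 4 would indicate positive or negative serial correlation):

```python
import numpy as np
import statsmodels.api as sm
from statsmodels.stats.stattools import durbin_watson

rng = np.random.default_rng(4)
x = np.arange(100, dtype=float)
y = 3 + 0.5 * x + rng.normal(size=100)   # hypothetical series, independent errors

model = sm.OLS(y, sm.add_constant(x)).fit()
print(durbin_watson(model.resid))        # near 2.0 here: little serial correlation
```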
Attempt the review questions and case studies at
the end of the chapter
****************
