Regression and Correlation

- Regression and correlation analyze relationships between variables: correlation determines the strength of linear relationships, while regression determines the form or nature of relationships.
- Scatter plots graphically show the relationship between two variables. Correlation coefficients measure the strength of linear relationships on a scale from -1 to 1. Simple linear regression fits a line to data to predict the dependent variable from the independent variable.
- Statistical tests assess whether correlation or regression coefficients differ significantly from hypothesized values. Confidence intervals provide ranges within which the true coefficients are likely to lie.

Regression and Correlation

- Relationships between variables. For example:

• How do the sales of a product depend on the price charged?
• How does the strength of a material depend on temperature?
• To what extent is metal pitting related to pollution?
• How strong is the link between inflation and employment rates?
• How can we use the amount of fertilizer applied to predict crop yields?

There are essentially two types of problem:
• CORRELATION problems, which involve measuring the strength of a relationship.
• REGRESSION problems, which are concerned with the form or nature of a relationship.

SCATTER PLOT – a graphical presentation of two variables, obtained by plotting the paired observations as points in the XY-plane.
- Gives an idea of the strength and form of the relationship between the two variables.
CORRELATION ANALYSIS

Objective: to determine the degree or strength of the linear association between the values of two variables, X and Y.

The analysis does not distinguish between the dependent and the independent variable.

A correlation coefficient measures how weak or strong the linear relationship is.

Pearson's Correlation Coefficient, ρ
• Most commonly used measure of linear association between two (interval or ratio) variables, X and Y
• Denoted by ρ = σXY / (σX σY)
• Estimated by the sample correlation coefficient,

r = SPXY / √(SSX · SSY)

where:
SPXY = ΣXY − (ΣX)(ΣY)/n
SSX = ΣX² − (ΣX)²/n
SSY = ΣY² − (ΣY)²/n
• Range of values: −1 ≤ ρ ≤ 1 and −1 ≤ r ≤ 1
• Qualitative interpretation of ρ and r:

Absolute Value of          Strength of Linear Relationship
Correlation Coefficient    Between X and Y
0.01 – 0.20                Very weak
0.21 – 0.40                Weak
0.41 – 0.60                Moderate
0.61 – 0.80                Strong
0.81 – 0.99                Very strong
Example: It is suspected that there is some relationship between relative humidity and the tensile strength of a certain material. The following measurements are obtained.

Relative Humidity (%)    Tensile Strength
45                       80
55                       67
65                       58
80                       55
95                       30

Estimate the strength/degree of the relationship between relative humidity (%) and tensile strength.
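As a check on the hand computation, the sample correlation coefficient for these five observations can be computed directly from the SPXY/SSX/SSY formulas above (a minimal pure-Python sketch; variable names are illustrative):

```python
import math

# Paired observations from the example
x = [45, 55, 65, 80, 95]   # relative humidity (%)
y = [80, 67, 58, 55, 30]   # tensile strength
n = len(x)

# Corrected sums of squares and cross-products
sp_xy = sum(a * b for a, b in zip(x, y)) - sum(x) * sum(y) / n
ss_x = sum(a * a for a in x) - sum(x) ** 2 / n
ss_y = sum(b * b for b in y) - sum(y) ** 2 / n

# Sample correlation coefficient
r = sp_xy / math.sqrt(ss_x * ss_y)
print(round(r, 3))  # → -0.966: a very strong negative linear relationship
```

Since |r| ≈ 0.97 falls in the 0.81 – 0.99 band of the table above, the linear relationship is classified as very strong (and negative: tensile strength decreases as humidity rises).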
Test of Hypothesis on ρ

Ho: There is no linear relationship between X and Y (ρ = 0).

Ha: i) ρ ≠ 0   ii) ρ > 0   iii) ρ < 0

Test statistic: tc = r√(n − 2) / √(1 − r²)  ~  t(n − 2)

Decision rule: Reject Ho if
i) |tc| > tα/2(n − 2)
ii) tc > tα(n − 2)
iii) tc < −tα(n − 2).
Else, fail to reject Ho.
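Applying this test to the humidity/tensile-strength example (r ≈ −0.966, n = 5) gives a sketch like the following; the critical value t0.025(3) = 3.182 is taken from a standard t table:

```python
import math

x = [45, 55, 65, 80, 95]
y = [80, 67, 58, 55, 30]
n = len(x)

sp_xy = sum(a * b for a, b in zip(x, y)) - sum(x) * sum(y) / n
ss_x = sum(a * a for a in x) - sum(x) ** 2 / n
ss_y = sum(b * b for b in y) - sum(y) ** 2 / n
r = sp_xy / math.sqrt(ss_x * ss_y)

# Test statistic for Ho: rho = 0 against Ha: rho != 0
t_c = r * math.sqrt(n - 2) / math.sqrt(1 - r ** 2)

t_crit = 3.182  # t_{0.025}(3) from a t table, alpha = 0.05, two-sided
reject = abs(t_c) > t_crit
print(round(t_c, 2), reject)  # t_c ≈ -6.47 and |t_c| > 3.182, so Ho is rejected
```

At α = 0.05 we conclude there is a statistically significant linear relationship between relative humidity and tensile strength.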
REGRESSION ANALYSIS

OBJECTIVE: To determine the probable form of the relationship between X and Y, where X and Y are paired variables.

The relationship between the variables X and Y is represented by a statistical model of the form:

Y = f(X) + ε

where Y – the response or dependent variable
X – the explanatory or independent variable (attempts to explain the outcomes)
ε – the random error component
SIMPLE LINEAR REGRESSION MODEL:

Yi = β0 + β1·Xi + εi

where
β0 – the regression constant; true Y-intercept
β1 – the regression coefficient; measure of the true change in Y per unit change in X
εi – the random error associated with Yi for a given Xi
Yi, Xi – the ith observed values of Y and X, respectively
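The model and its error term can be illustrated by simulating data from it (a hypothetical sketch; the parameter values β0 = 2, β1 = 0.5 and the error standard deviation are made up purely for illustration):

```python
import random

random.seed(0)  # fixed seed so the simulation is reproducible

beta0, beta1 = 2.0, 0.5  # hypothetical true intercept and slope
sigma = 1.0              # standard deviation of the random error term

x = [float(i) for i in range(1, 21)]
# Each Y_i is the value on the true line plus a normally distributed random error
y = [beta0 + beta1 * xi + random.gauss(0.0, sigma) for xi in x]

print(len(y))  # 20 simulated (X, Y) pairs
```

Regression analysis works in the opposite direction: given only the simulated pairs, it tries to recover β0 and β1.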
ASSUMPTIONS UNDERLYING THE SLR MODEL

• The values of the independent variable X may either be fixed or random.
• The X's are measured without error.
• The Y's are statistically independent.
• For each value of X, there is a sub-population of Y-values that is normally distributed.
• The variances of the sub-populations are all equal.
• The means of the sub-populations of Y all lie on the same straight line.
Based on a SRS of size n, an estimate of the model is:

Ŷi = b0 + b1·Xi

where b0 – the estimated regression constant
b1 – the estimated regression coefficient

The estimators b0 and b1 are obtained by minimizing the sum of squares of errors (LEAST SQUARES ESTIMATION PROCEDURE – LSE). That is,

min Σ(i=1 to n) ei² = Σ(i=1 to n) [Yi − (β0 + β1·Xi)]²

The results of the minimization are as follows:

b1 = SPXY / SSX
and
b0 = Ȳ − b1·X̄
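For the humidity/tensile-strength data, these formulas give the fitted line directly (pure-Python sketch):

```python
x = [45, 55, 65, 80, 95]   # relative humidity (%)
y = [80, 67, 58, 55, 30]   # tensile strength
n = len(x)

sp_xy = sum(a * b for a, b in zip(x, y)) - sum(x) * sum(y) / n
ss_x = sum(a * a for a in x) - sum(x) ** 2 / n

# Least squares estimates
b1 = sp_xy / ss_x                    # slope
b0 = sum(y) / n - b1 * sum(x) / n    # intercept: Ybar - b1 * Xbar

print(round(b0, 2), round(b1, 4))  # fitted line: Yhat = 118.9 - 0.8956 X
```

The negative slope matches the sign of r from the correlation analysis: each additional percentage point of humidity reduces predicted tensile strength by about 0.9 units.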
Adequacy of the Predicting Equation
• An overall measure of adequacy of the equation is provided by the coefficient of determination, R².
• R² gives the proportion of the total variation in Y that is accounted for by the independent variable X.
• R² ranges from 0 to 1, or 0% to 100%. The nearer it is to 1 (100%), the better the fit of the model.

R² = (b1·SPXY / SSY) × 100%
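For the example data, R² can be computed from the fitted slope; in simple linear regression it equals the square of the sample correlation coefficient, which the sketch below verifies:

```python
import math

x = [45, 55, 65, 80, 95]
y = [80, 67, 58, 55, 30]
n = len(x)

sp_xy = sum(a * b for a, b in zip(x, y)) - sum(x) * sum(y) / n
ss_x = sum(a * a for a in x) - sum(x) ** 2 / n
ss_y = sum(b * b for b in y) - sum(y) ** 2 / n

b1 = sp_xy / ss_x
r2 = b1 * sp_xy / ss_y   # coefficient of determination

r = sp_xy / math.sqrt(ss_x * ss_y)
assert abs(r2 - r ** 2) < 1e-12   # in SLR, R^2 = r^2

print(round(100 * r2, 1))  # ≈ 93.3% of the variation in Y is explained by X
```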
Test of Hypothesis
A test of hypothesis about the regression coefficient β1 can be performed at a certain level of significance α.

I. Using the t-test
Ho: β1 = β1* vs i) Ha: β1 ≠ β1* or
ii) Ha: β1 > β1* or
iii) Ha: β1 < β1*

Test statistic: tc = (b1 − β1*) / s.e.(b1)

where:
s.e.(b1) = √[ (SSY − b1·SPXY) / (SSX (n − 2)) ]

Decision rule: Reject Ho if
i) |tc| > tα/2(n − 2)
ii) tc > tα(n − 2)
iii) tc < −tα(n − 2).
Else, fail to reject Ho.

II. Using the F-test in the ANOVA
Ho: β1 = 0 vs Ha: β1 ≠ 0
Test statistic: Fc = MSR / MSE
Decision rule: Reject Ho if Fc > Fα(1, n − 2). Else, fail to reject Ho.

Analysis of Variance Table:

Sources of
Variation     df       SS                  Mean Square
Regression    1        b1·SPXY             MSR
Error         n − 2    SSY − b1·SPXY       MSE
TOTAL         n − 1    SSY
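For the example data and Ho: β1 = 0, the ANOVA quantities work out as in the sketch below; the critical value F0.05(1, 3) = 10.13 is taken from a standard F table, and Fc equals the square of the t statistic for the slope:

```python
x = [45, 55, 65, 80, 95]
y = [80, 67, 58, 55, 30]
n = len(x)

sp_xy = sum(a * b for a, b in zip(x, y)) - sum(x) * sum(y) / n
ss_x = sum(a * a for a in x) - sum(x) ** 2 / n
ss_y = sum(b * b for b in y) - sum(y) ** 2 / n
b1 = sp_xy / ss_x

ss_reg = b1 * sp_xy          # regression sum of squares, df = 1
ss_err = ss_y - ss_reg       # error sum of squares, df = n - 2
msr = ss_reg / 1
mse = ss_err / (n - 2)
f_c = msr / mse

f_crit = 10.13  # F_{0.05}(1, 3) from an F table
print(round(f_c, 1), f_c > f_crit)  # Fc ≈ 41.9 > 10.13: reject Ho
```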
Using the t-test for the regression constant:
Ho: β0 = β0* vs i) Ha: β0 ≠ β0* or
ii) Ha: β0 > β0* or
iii) Ha: β0 < β0*

Test statistic: tc = (b0 − β0*) / s.e.(b0)

where:
s.e.(b0) = √[ (SSY − b1·SPXY)/(n − 2) · ΣX² / (n·SSX) ]
Interval Estimation of β0 and β1:

A (1 − α) × 100% confidence interval for β0:

b0 ± tα/2(n − 2) · se(b0)

where
se(b0) = √( MSE·ΣX² / (n·SSX) )

A (1 − α) × 100% confidence interval for β1:

b1 ± tα/2(n − 2) · se(b1)

where
se(b1) = √( MSE / SSX )
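For the example data, 95% confidence intervals for β0 and β1 can be sketched as follows (t0.025(3) = 3.182 from a standard t table):

```python
import math

x = [45, 55, 65, 80, 95]
y = [80, 67, 58, 55, 30]
n = len(x)

sp_xy = sum(a * b for a, b in zip(x, y)) - sum(x) * sum(y) / n
ss_x = sum(a * a for a in x) - sum(x) ** 2 / n
ss_y = sum(b * b for b in y) - sum(y) ** 2 / n
b1 = sp_xy / ss_x
b0 = sum(y) / n - b1 * sum(x) / n
mse = (ss_y - b1 * sp_xy) / (n - 2)

t_crit = 3.182  # t_{0.025}(3)
se_b1 = math.sqrt(mse / ss_x)
se_b0 = math.sqrt(mse * sum(a * a for a in x) / (n * ss_x))

ci_b1 = (b1 - t_crit * se_b1, b1 + t_crit * se_b1)
ci_b0 = (b0 - t_crit * se_b0, b0 + t_crit * se_b0)
print([round(v, 2) for v in ci_b1])  # slope CI ≈ [-1.34, -0.46]
```

Since the interval for β1 excludes 0, it agrees with the t- and F-tests above: the slope is significantly different from zero at α = 0.05.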
