Regression Analysis
Regression Analysis
Regression
Models
Univariate Multivariable
One Explanatory Variable 2+ Explanatory Variables
Non- Non-
Linear Linear
Linear Linear
GRAPHS BEFORE FITTING A MODEL
• Model - should be based on theoretical background
or the hypothesis to be tested.
• Data may be used to suggest the model.
• Graphs may be used before fitting a model.
Four possible groups of graphs are:
1. One dimensional graphs
2. Two dimensional graphs
3. Rotating plots, and
4. Dynamic graphs
One dimensional Graphs
• Histogram
• Stem and leaf display
• Dot Plot
• Box Plot
Two Dimensional Graphs
Y = b0 + b1 x1 + e
Dependent or Independent or Expl.
Resp. Variable Variable
(e.g., income) (e.g., education)
• To fit the regression equation to the given data is
nothing but to estimate unknown constants
which are involved in regression line.
• Regression Equation of
• Regression Equation of :
• Regression Equation of X on Y:
Interpretation of regression coefficients
Where - intercept
Slope of the line
Dependent variable
Independent variable
Regression line
• Regression line of on : gives the best estimate
for the value of for any specific given values of
XWhere - intercept
Slope of the line
Dependent variable
Independent variable
• is also called the regression coefficient of on . It is also
denoted by . The above equation is called regression
equation on , it means we can write in term of .
Height father in cm Height son in cm
(Y) (X) y^2 X^2 YX
65 68 4225 4624 4420 a -3.37687
63 66 3969 4356 4158 b 1.036403
67 68 4489 4624 4556 X=-3.38+1.036Y
64 65 4096 4225 4160
68 69 4624 4761 4692
62 66 3844 4356 4092
70 68 4900 4624 4760
66 65 4356 4225 4290
68 71 4624 5041 4828
67 67 4489 4489 4489
69 68 4761 4624 4692
71 70 5041 4900 4970
Sum= 800 Sum= 811 53418 54849 54107
Mean= 66.667 Mean = 67.583
n=12
Linear Regression :Model
Y
? (the actual value of Yi)
Yi Y = b 0 + b1 X
ei
Xi X
Regression Line
How to draw a line through these points? and how
to determine the best fit line?
Y
60
40
20
0 X
0 20 40 60
Regression Line
How to draw a line through these points? and how
to determine the best fit line?
Y
60
40
20
0 X
0 20 40 60
Regression Line
How to draw a line through these points? and how
to determine the best fit line?
Y
60
40
20
0 X
0 20 40 60
Regression Line
How to draw a line through these points? and how
to determine the best fit line?
Y
60
40
20
0 X
0 20 40 60
Regression Line
How to draw a line through these points? and how
to determine the best fit line?
Y
60
40
20
0 X
0 20 40 60
Regression Line
How to draw a line through these points? and how
to determine the best fit line?
Y
60
40
20
0 X
0 20 40 60
Regression Line
How to draw a line through these points? and how
to determine the best fit line?
Y
60
40
20
0 X
0 20 40 60
Regression Lines
(A) Regression line of X on Y
X X bxy (Y Y )
orX X bxyY bxyY
bY
Where
Y
dx
X=
a + x SD( X )
bxy r r
dx y SD(Y )
dx N XY X Y
N Y 2 ( Y ) 2
dx
Re g . coeff. of X on Y
r Corr. Coeff. X,Y
X
Gives better estimate of X Contd..
Regression Lines Contd..