0% found this document useful (0 votes)

47 views13 pages

Linear Regression in Python.

The document provides an overview of linear regression, explaining its role in predicting dependent variables based on independent variables. It details the simple linear regression model, the regression line, and the process of estimating errors. Additionally, it includes a practical example of implementing linear regression in Python using the Boston House Prices Dataset.

Uploaded by

govindarajulac

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

47 views13 pages

Linear Regression in Python.

Uploaded by

govindarajulac

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 13

Linear Regression

Ankita Bhanushali
Pursuing M.Sc. Statistics
Consultant, Kantar
INSAID , March 2020 GCD
Cohort.
Linear Regression

Regression analysis is one of the most widely used methods for prediction. It is
applied whenever we have a causal relationship between variables.

Regression Analysis
We will use our typical step-by-step approach. We’ll start with the simple linear

regression model.

What is a Linear Regression

Let’s start with some dry theory. A linear regression is a linear approximation of a causal

relationship between two or more variables.

Ankita Bhanushali | Kantar Page 1

Linear Regression

Regression models are highly valuable, as they are one of the most common ways to

make inferences and predictions.

There is a dependent variable, labeled Y, being predicted, and independent variables,

labeled x1, x2, and so forth. These are the predictors. Y is a function of the X variables,

and the regression model is a linear approximation of this function.

Ankita Bhanushali | Kantar Page 2

Linear Regression

The Simple Linear Regression

The easiest regression model is the simple linear regression:

Y = β 0 + β 1 * x 1 + ε.

Let’s see what these values mean. Y is the variable we are trying to predict and is called

the dependent variable. X is an independent variable.

Ankita Bhanushali | Kantar Page 3

Linear Regression

The Regression Line

You may have heard about the regression line, too. When we plot the data points on

an x-y plane, the regression line is the best-fitting line through the data points. You can

take a look at a plot with some data points in the picture above. We plot the line based

on the regression equation. The grey points that are scattered are the observed

values. B 0 , as we said earlier, is a constant and is the intercept of the regression line with

the y-axis.B 1 is the slope of the regression line. It shows how much y changes for each

unit change of x.

Ankita Bhanushali | Kantar Page 4

Linear Regression

The Estimator of the Error

The distance between the observed values and the regression line is the estimator of the

error term epsilon. Its point estimate is called residual. Now, suppose we draw a

perpendicular from an observed point to the regression line. The intercept between that

perpendicular and the regression line will be a point with a y value equal to . As we said

earlier, given an x, is the value predicted by the regression line.

Ankita Bhanushali | Kantar Page 5

Linear Regression

Linear Regression in Python Example

We believe it is high time that we actually got down to it and wrote some code! So, let’s

get our hands dirty with our first linear regression example in Python.

Understanding the Dataset

Before we get started with the Python linear regression hands-on, let us explore the dataset. We
will be using the Boston House Prices Dataset, with 506 rows and 13 attributes with a target
column. Let’s take a quick look at the dataset.

Let’s take a quick look at the dataset.

In this Python Linear Regression example, we will train two models to predict the price.

Ankita Bhanushali | Kantar Page 6

Linear Regression

Model Building
Now that we are familiar with the dataset, let us build the Python linear regression models.

Simple Linear Regression in Python

Consider ‘lstat’ as independent and ‘medv’ as dependent variables

Step 1: Load the Boston dataset

Step 2: Have a glance at the shape

Ankita Bhanushali | Kantar Page 7

Linear Regression

Step 3: Have a glance at the dependent and independent variables

Step 4: Visualize the change in the variables

Ankita Bhanushali | Kantar Page 8

Linear Regression

Step 5: Divide the data into independent and dependent variables

Step 6: Split the data into train and test sets

Step 7: Shape of the train and test sets

Step 8: Train the algorithm

Ankita Bhanushali | Kantar Page 9

Linear Regression

Step 9: Retrieve the intercept

Step 10: Retrieve the slope

Step 11: Predicted value

Ankita Bhanushali | Kantar Page 10

Linear Regression

Step 12: Actual value

Step 13: Evaluate the algorithm

Ankita Bhanushali | Kantar Page 11

Linear Regression

What Did We Learn?

We embarked on it by first learning about what a linear regression is. Then, we went over the
process of creating one. We also went over a linear regression example. Afterwards, we talked
about the simple linear regression where we introduced the linear regression equation. By
then, we were done with the theory and got our hands on the keyboard and explored
another linear regression example in Python! We imported the relevant libraries and loaded the
data. We cleared up when exactly we need to create regressions and started creating our own.
The process consisted of several steps which, now, you should be able to per form with ease.

Ankita Bhanushali | Kantar Page 12

Statistical Physics of Particles PDF
No ratings yet
Statistical Physics of Particles PDF
10 pages
Assignment 1
No ratings yet
Assignment 1
12 pages
Linear Regression - Everything You Need To Know About Linear Regression
No ratings yet
Linear Regression - Everything You Need To Know About Linear Regression
17 pages
Linear Regression
No ratings yet
Linear Regression
11 pages
ch12 0
No ratings yet
ch12 0
43 pages
DS Unit-Iv
No ratings yet
DS Unit-Iv
34 pages
Practical 5
No ratings yet
Practical 5
8 pages
Lecture 3 - Linear Regression Imran 20022025 092939am
No ratings yet
Lecture 3 - Linear Regression Imran 20022025 092939am
46 pages
Regression Coeffient
No ratings yet
Regression Coeffient
52 pages
Linear Regression
No ratings yet
Linear Regression
7 pages
UNIT II Regration
No ratings yet
UNIT II Regration
62 pages
Linear Regression in Machine Learning
No ratings yet
Linear Regression in Machine Learning
23 pages
ML - Module 2
No ratings yet
ML - Module 2
16 pages
Linear & Polynomial Regression Guide
No ratings yet
Linear & Polynomial Regression Guide
56 pages
18-Linear Regression
No ratings yet
18-Linear Regression
29 pages
Supervised Machine Learning - Regression
No ratings yet
Supervised Machine Learning - Regression
34 pages
Unit Iii
No ratings yet
Unit Iii
27 pages
Chapter4 Regression
No ratings yet
Chapter4 Regression
15 pages
AI Lec23
No ratings yet
AI Lec23
36 pages
Unit-4 DS Student
No ratings yet
Unit-4 DS Student
43 pages
ML Unit
No ratings yet
ML Unit
23 pages
ch12 0
No ratings yet
ch12 0
82 pages
Progression Linaire
No ratings yet
Progression Linaire
187 pages
Linear Regression
No ratings yet
Linear Regression
24 pages
Linear Regression
No ratings yet
Linear Regression
18 pages
Linear Regression for Analysts
No ratings yet
Linear Regression for Analysts
6 pages
Everything You Need To Know About Linear Regression
No ratings yet
Everything You Need To Know About Linear Regression
19 pages
Linear Regression
No ratings yet
Linear Regression
36 pages
MachineLearning Unit-II
No ratings yet
MachineLearning Unit-II
45 pages
Applying Machine Learning Algorithms With Scikit-Learn (Sklearn) - Notes
No ratings yet
Applying Machine Learning Algorithms With Scikit-Learn (Sklearn) - Notes
19 pages
AAI Lecture 10 SP 25
No ratings yet
AAI Lecture 10 SP 25
37 pages
Lect 10 Regression
No ratings yet
Lect 10 Regression
7 pages
Linear Regression Explained
No ratings yet
Linear Regression Explained
8 pages
Lecture 3
No ratings yet
Lecture 3
42 pages
Simple Linear and Logistic Regression
No ratings yet
Simple Linear and Logistic Regression
81 pages
OE-ML Unit - 3
No ratings yet
OE-ML Unit - 3
29 pages
Linear Regression
No ratings yet
Linear Regression
18 pages
MachineLearning Unit II
No ratings yet
MachineLearning Unit II
45 pages
Python Data Analysis Guide
No ratings yet
Python Data Analysis Guide
171 pages
Lecture 9-10
No ratings yet
Lecture 9-10
28 pages
Linear-Regression ML
No ratings yet
Linear-Regression ML
36 pages
Linear Regression
No ratings yet
Linear Regression
35 pages
Machine Learning and Linear Regression
100% (1)
Machine Learning and Linear Regression
55 pages
Linear Regression - Module 3
No ratings yet
Linear Regression - Module 3
16 pages
Linear Regression
No ratings yet
Linear Regression
16 pages
StatLearning2r PDF
No ratings yet
StatLearning2r PDF
267 pages
Lecture Note #8 - PEC-CS701E
No ratings yet
Lecture Note #8 - PEC-CS701E
20 pages
Unit 2
No ratings yet
Unit 2
26 pages
Isn't Linear Regression From Statistics?
No ratings yet
Isn't Linear Regression From Statistics?
4 pages
Linear Regression
No ratings yet
Linear Regression
15 pages
Lecture3 221109 035214
No ratings yet
Lecture3 221109 035214
87 pages
What Are Linear Models in Machine Learning (1) .Docx (Unit3 ML)
No ratings yet
What Are Linear Models in Machine Learning (1) .Docx (Unit3 ML)
60 pages
Module III (Part II) (Regression and Time Series)
No ratings yet
Module III (Part II) (Regression and Time Series)
118 pages
3CP10 Final MJJ Linear Regression
No ratings yet
3CP10 Final MJJ Linear Regression
68 pages
Linear Regression: What Is Regression Analysis?
100% (1)
Linear Regression: What Is Regression Analysis?
21 pages
Mod3 Eda
No ratings yet
Mod3 Eda
16 pages
RRB - Unit 2 Regresion
No ratings yet
RRB - Unit 2 Regresion
53 pages
Linear Regression - 1st Draft
No ratings yet
Linear Regression - 1st Draft
5 pages
DA Notes 3
No ratings yet
DA Notes 3
12 pages
Paired Sample T-Test Analysis
No ratings yet
Paired Sample T-Test Analysis
4 pages
Extra Proves
No ratings yet
Extra Proves
1 page
Maths and Stat Research Projects Supervision 2024 25
No ratings yet
Maths and Stat Research Projects Supervision 2024 25
4 pages
DiD Regression
No ratings yet
DiD Regression
18 pages
Capstone Notes-Model
No ratings yet
Capstone Notes-Model
20 pages
K-Nearest Neighbour - Jupyter Notebook
No ratings yet
K-Nearest Neighbour - Jupyter Notebook
2 pages
Advanced Python for Data Scientists
No ratings yet
Advanced Python for Data Scientists
19 pages
Cronbach
No ratings yet
Cronbach
7 pages
Assumptions in Linear Regression
No ratings yet
Assumptions in Linear Regression
3 pages
Thuchanh
No ratings yet
Thuchanh
1 page
Confidence Intervals in Statistics
No ratings yet
Confidence Intervals in Statistics
4 pages
Tugas Panel Ainul
No ratings yet
Tugas Panel Ainul
8 pages
Sutherland, 2019 MMD Preprint
No ratings yet
Sutherland, 2019 MMD Preprint
11 pages
Tugas Skill Lab Ebm Dr. Muhammad Fikri Aulia
No ratings yet
Tugas Skill Lab Ebm Dr. Muhammad Fikri Aulia
26 pages
FEU Diliman - Forecasting Techniques
No ratings yet
FEU Diliman - Forecasting Techniques
5 pages
Econometrics Chapter Three
No ratings yet
Econometrics Chapter Three
35 pages
Introduction To Simple Linear Regression: - K.Tejashree (23H51A66F8)
No ratings yet
Introduction To Simple Linear Regression: - K.Tejashree (23H51A66F8)
10 pages
CH 05 Wooldridge 5e
No ratings yet
CH 05 Wooldridge 5e
8 pages
Regression Cheat Sheet
No ratings yet
Regression Cheat Sheet
6 pages
Pre-Test & Post-Test Energi Terbarukan
No ratings yet
Pre-Test & Post-Test Energi Terbarukan
1 page
Econometric Si Syl Lab Us
No ratings yet
Econometric Si Syl Lab Us
5 pages
Econometrics: CLRM Assumptions Guide
No ratings yet
Econometrics: CLRM Assumptions Guide
13 pages
Econometrics for Undergrads
No ratings yet
Econometrics for Undergrads
3 pages
Ardl Analysis Appendix
No ratings yet
Ardl Analysis Appendix
9 pages
BM2 Chapter 5 Forecasting
No ratings yet
BM2 Chapter 5 Forecasting
21 pages
Eberhardt 2012 Estimating Panel Time Series Models With Heterogeneous Slopes
No ratings yet
Eberhardt 2012 Estimating Panel Time Series Models With Heterogeneous Slopes
11 pages
Sta 221 Assignment
No ratings yet
Sta 221 Assignment
2 pages
Tugas 1-Forecasting SP 2023
No ratings yet
Tugas 1-Forecasting SP 2023
3 pages

Linear Regression in Python.

Uploaded by

Linear Regression in Python.

Uploaded by

Linear Regression

What is a Linear Regression

relationship between two or more variables.

Ankita Bhanushali | Kantar Page 1

make inferences and predictions.

There is a dependent variable, labeled Y, being predicted, and independent variables,

and the regression model is a linear approximation of this function.

Ankita Bhanushali | Kantar Page 2

The Simple Linear Regression

the dependent variable. X is an independent variable.

Ankita Bhanushali | Kantar Page 3

The Regression Line

Ankita Bhanushali | Kantar Page 4

The Estimator of the Error

earlier, given an x, is the value predicted by the regression line.

Ankita Bhanushali | Kantar Page 5

Linear Regression in Python Example

Understanding the Dataset

Let’s take a quick look at the dataset.

Ankita Bhanushali | Kantar Page 6

Simple Linear Regression in Python

Step 1: Load the Boston dataset

Step 2: Have a glance at the shape

Ankita Bhanushali | Kantar Page 7

Step 3: Have a glance at the dependent and independent variables

Step 4: Visualize the change in the variables

Ankita Bhanushali | Kantar Page 8

Step 5: Divide the data into independent and dependent variables

Step 6: Split the data into train and test sets

Step 7: Shape of the train and test sets

Step 8: Train the algorithm

Ankita Bhanushali | Kantar Page 9

Step 9: Retrieve the intercept

Step 10: Retrieve the slope

Step 11: Predicted value

Ankita Bhanushali | Kantar Page 10

Step 12: Actual value

Step 13: Evaluate the algorithm

Ankita Bhanushali | Kantar Page 11

What Did We Learn?

Ankita Bhanushali | Kantar Page 12

You might also like