Presentation Day 3 - Lasso-Ridge Regression, Logistic Regression, SVM

Ridge Regression and

LASSO Regression
Tubagus Dhafin Rukmanda
PACMANN AI Researcher
Email: [email protected]
+62 89620615729
Bias-Variance Trade-off Review

[Figure: error on training data vs. validation data]
How to deal with Overfitting?

- Subset selection
- Regularization
- Using a less complex model
Regularized Linear Regression
(Shrinkage Methods)
What is Regularization?

IDEA

Add a penalty term to the cost function so that the parameters are shrunk
towards zero.

Regularized OLS Cost Function = RSS + Regularizer


How can we penalize the cost function?

We can use the L2 norm to perform a regularization called Ridge Regression,

or,

we can use the L1 norm to perform a regularization called LASSO Regression.
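For reference, the two penalty terms can be written as follows (a reconstruction in standard notation, since the slides' formulas did not survive extraction; λ is the shrinkage parameter introduced on the next slide):

```latex
\text{Ridge (L2) penalty:} \quad \lambda \sum_{j=1}^{p} \beta_j^2 \;=\; \lambda \lVert \beta \rVert_2^2
\text{LASSO (L1) penalty:} \quad \lambda \sum_{j=1}^{p} \lvert \beta_j \rvert \;=\; \lambda \lVert \beta \rVert_1
```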


Ridge Regression: Shrinkage Parameter

Note: we can choose the shrinkage parameter λ by cross-validation, picking the value with the lowest cross-validation error.
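A minimal sketch of choosing λ by cross-validation with scikit-learn, where the shrinkage parameter is called alpha (the data here is synthetic and purely illustrative):

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.linear_model import RidgeCV

# Synthetic regression data, purely illustrative
X, y = make_regression(n_samples=100, n_features=20, noise=10.0, random_state=0)

# Try a grid of shrinkage values; RidgeCV keeps the one with the
# lowest cross-validation error
alphas = np.logspace(-3, 3, 50)
model = RidgeCV(alphas=alphas, cv=5).fit(X, y)

print("Best shrinkage parameter (lambda):", model.alpha_)
```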
Ridge Regression: Visualization

[Figure: ISL]
Ridge Regression Improves on Linear Regression

- Least squares: high variance but no bias (when the relationship between the response and the predictors is close to linear)
- Ridge: reduces variance at the cost of a slight increase in bias

[Figure (ISL): black = squared bias, green = variance, purple = test error]
LASSO Regression: Cost Function

Linear Regression:

\mathrm{RSS} = \sum_{i=1}^{n} \Bigl( y_i - \beta_0 - \sum_{j=1}^{p} \beta_j x_{ij} \Bigr)^2

Ridge Regression:

\mathrm{RSS} + \lambda \sum_{j=1}^{p} \beta_j^2

LASSO Regression:

\mathrm{RSS} + \lambda \sum_{j=1}^{p} \lvert \beta_j \rvert

where λ ≥ 0 is the shrinkage (tuning) parameter.
LASSO Regression

[Figures: ISL]
[Figure (ISL): black = squared bias, green = variance, purple = test error]

- LASSO: reduces variance at the cost of a slight increase in bias
- The variance of ridge is slightly lower than the variance of LASSO
Ridge vs. LASSO Regression
LASSO:
● Produces simpler and more interpretable models
(it performs variable selection)
● Performs better when a relatively small number of
predictors have substantial coefficients

Ridge:
● Has slightly lower variance
● Performs better when many predictors have
coefficients of roughly equal size
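A small sketch contrasting the two behaviours (illustrative scikit-learn code; the sparse ground truth is an assumption chosen to make the variable-selection effect visible):

```python
import numpy as np
from sklearn.linear_model import Lasso, Ridge

rng = np.random.default_rng(0)

# 100 samples, 10 predictors, but only 2 truly matter (sparse ground truth)
X = rng.normal(size=(100, 10))
y = 3.0 * X[:, 0] - 2.0 * X[:, 1] + rng.normal(scale=0.5, size=100)

ridge = Ridge(alpha=1.0).fit(X, y)
lasso = Lasso(alpha=0.1).fit(X, y)

# Ridge shrinks all coefficients but keeps them non-zero;
# LASSO sets most irrelevant coefficients exactly to zero (variable selection)
print("ridge non-zero coefs:", np.sum(ridge.coef_ != 0))  # typically 10
print("lasso non-zero coefs:", np.sum(lasso.coef_ != 0))  # typically ~2
```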
Summary

1. Subset selection, ridge, and LASSO decrease the
complexity of a model, decreasing its variance,
increasing its bias (more slowly), and increasing its
interpretability.
2. Ridge doesn't perform variable selection, but LASSO
does.
3. LASSO can handle p > n easily when it has a proper
penalty term.
Logistic Regression
Tubagus Dhafin Rukmanda
PACMANN AI Researcher
Email: [email protected]
+62 89620615729
The Logistic Regression
The Logistic Model
• Using linear regression with a
very large (or small) balance, we
would get values of the default probability
bigger than 1 (or smaller than 0)

Picture from: Introduction to Statistical Learning


The Logistic Model
Instead of a straight-line
relationship, we need a
"squashed" result, that is:
- Upper-bounded by 1
- Lower-bounded by 0

Picture from: Introduction to Statistical Learning


The Logistic Model
We take our old linear regression function
and squash it with the sigmoid function.

Picture from: Introduction to Statistical Learning
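Written out, this gives the logistic function (reconstructed in standard ISL notation, since the slide's equation did not survive extraction):

```latex
p(X) = \frac{e^{\beta_0 + \beta_1 X}}{1 + e^{\beta_0 + \beta_1 X}}
     = \frac{1}{1 + e^{-(\beta_0 + \beta_1 X)}}
```

whose output always lies between 0 and 1.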


Support Vector Machine
Tubagus Dhafin Rukmanda
PACMANN AI Researcher
Email: [email protected]
+62 89620615729
Support Vector Machine (SVM)
IDEA: Separating Data with a Hyperplane
Data with 1 dimension/variable

Hyperplane: a point

Data with 2 dimensions/variables

Hyperplane: a line

Data with 3 dimensions/variables

Hyperplane: a plane

[Image: stackoverflow.com]

Data with n dimensions/variables, n > 3?

Can't be visualized
Separating Hyperplane

We can create infinitely many
separating hyperplanes.

Which hyperplane should we
choose?
Maximum Margin Classifier

[Figure: the maximum margin hyperplane, its margin, and the support vectors]
What if the Data cannot be
Separated by a Hyperplane?
We use a Soft Margin Classifier
(Support Vector Classifier)

Soft = the margin can be violated by some observations
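The slide's optimization problem did not survive extraction; as a reference, ISL formulates the support vector classifier as follows (notation assumed from ISL: M is the margin width, ε_i are slack variables, and C is the violation budget, i.e. the "cost" parameter mentioned below):

```latex
\max_{\beta_0, \ldots, \beta_p,\ \varepsilon_1, \ldots, \varepsilon_n,\ M} \; M
\quad \text{subject to} \quad \sum_{j=1}^{p} \beta_j^2 = 1,
y_i \bigl( \beta_0 + \beta_1 x_{i1} + \cdots + \beta_p x_{ip} \bigr) \ge M (1 - \varepsilon_i),
\varepsilon_i \ge 0, \qquad \sum_{i=1}^{n} \varepsilon_i \le C.
```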


Support Vector Classifier
Max Margin vs. SVC

Maximum Margin Classifier: every observation must lie on the correct side of the margin.

Support Vector Classifier: some observations may violate the margin; the total violation is budgeted by the “cost” parameter.
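A quick sketch of how the cost parameter behaves in practice, using scikit-learn's SVC (synthetic data, purely illustrative; note that scikit-learn's C is roughly the inverse of ISL's budget, so a large C tolerates fewer violations):

```python
from sklearn.datasets import make_blobs
from sklearn.svm import SVC

# Two overlapping blobs, so no perfect separating hyperplane exists
X, y = make_blobs(n_samples=200, centers=2, cluster_std=2.5, random_state=0)

for C in (0.01, 1.0, 100.0):
    clf = SVC(kernel="linear", C=C).fit(X, y)
    # A small C allows many margin violations (wide, soft margin);
    # a large C tolerates few violations (narrow, harder margin)
    print(f"C={C:>6}: support vectors = {clf.n_support_.sum()}")
```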
Support Vector Classifier

[Figures: decision boundaries with some mis-classified observations]
What if the data cannot be separated by linear boundaries?
Support Vector Machine and
Kernel
Kernel

Just imagine that using a kernel is similar to adding new variables.

[Figure: a mis-classified boundary]
Kernel Function

[Image: Aleksei Tiulpin]
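The kernel formulas themselves were lost in extraction; commonly listed choices, in standard notation (not taken verbatim from the slide), are:

```latex
\text{Linear:} \quad K(x_i, x_{i'}) = \sum_{j=1}^{p} x_{ij} x_{i'j}
\text{Polynomial:} \quad K(x_i, x_{i'}) = \Bigl( 1 + \sum_{j=1}^{p} x_{ij} x_{i'j} \Bigr)^{d}
\text{Radial (RBF):} \quad K(x_i, x_{i'}) = \exp\Bigl( -\gamma \sum_{j=1}^{p} (x_{ij} - x_{i'j})^2 \Bigr)
```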
Kernel

[Figures: non-linear decision boundaries obtained with different kernels. Images: Shuangyin Liu, stackoverflow.com, Mahmoud Elmezain, Xiaochuan Li]
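A minimal sketch of a kernelized SVM on data that no linear boundary can separate (scikit-learn; the dataset and parameters are illustrative assumptions):

```python
from sklearn.datasets import make_circles
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

# Concentric circles: no linear boundary can separate the two classes
X, y = make_circles(n_samples=300, noise=0.1, factor=0.4, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

linear = SVC(kernel="linear").fit(X_train, y_train)
rbf = SVC(kernel="rbf", gamma=2.0).fit(X_train, y_train)

# The RBF kernel implicitly maps the data to a higher-dimensional space
# where a linear separator exists, so it should score much higher here
print("linear kernel accuracy:", linear.score(X_test, y_test))
print("RBF kernel accuracy:   ", rbf.score(X_test, y_test))
```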
Thank You. Questions?
Tubagus Dhafin Rukmanda
PACMANN AI Researcher
Email: [email protected]
+62 89620615729
