Boosting
Eric Emer
It is easy to come up with rules of thumb that correctly classify the training data at better than chance. A weak learner is one that, given polynomially many examples and polynomial time, can find a classifier with generalization error better than random guessing: error $< \frac{1}{2}$, also written as an edge $\gamma > 0$ for generalization error at most $\frac{1}{2} - \gamma$.

Weak Learning Assumption: the learning algorithm (the Weak Learner) can consistently find weak classifiers, i.e. rules of thumb which classify the data correctly at better than 50%. Given this assumption, we can use boosting to generate a single weighted classifier which correctly classifies our training data at 99%-100%.
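As a concrete illustration of such a rule of thumb, the sketch below fits a decision stump (a threshold on a single feature) to a weighted sample. The function name, interface, and brute-force search are illustrative assumptions, not something prescribed by these notes.

```python
import numpy as np

def train_stump(X, y, w):
    """Illustrative weak learner: pick the single-feature threshold rule
    with the smallest weighted error on (X, y) under example weights w.
    Labels y are assumed to be in {-1, +1}."""
    best_err, best_rule = np.inf, None
    for j in range(X.shape[1]):                 # candidate feature
        for thresh in np.unique(X[:, j]):       # candidate threshold
            for polarity in (+1, -1):           # which side votes +1
                pred = polarity * np.where(X[:, j] > thresh, 1, -1)
                err = np.sum(w * (pred != y))   # weighted training error
                if err < best_err:
                    best_err, best_rule = err, (j, thresh, polarity)
    # A useful weak classifier has weighted error below 1/2 (an edge gamma > 0).
    return best_rule, best_err
```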
AdaBoost Specifics
How does AdaBoost weight training examples optimally? It focuses on the difficult data points: the points that have been misclassified most by the previous weak classifiers. How does AdaBoost combine these weak classifiers into a comprehensive prediction? It uses an optimally weighted majority vote of the weak classifiers, as sketched below.
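A minimal sketch of that combination step, assuming each weak classifier maps inputs to $\{-1, +1\}$ and `alphas` holds the vote weights (the names here are hypothetical):

```python
import numpy as np

def weighted_majority_vote(weak_classifiers, alphas, X):
    """H(x) = sign(sum_t alpha_t * h_t(x)): each weak classifier casts a
    +/-1 vote, weighted by how well it did on its round's distribution."""
    total = sum(a * h(X) for a, h in zip(alphas, weak_classifiers))
    return np.sign(total)
```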
Constructing $D_t$
$$D_1(i) = \frac{1}{m}$$

and, given $D_t$ and $h_t$:

$$D_{t+1}(i) = \frac{D_t(i)}{Z_t} \times \begin{cases} e^{-\alpha_t} & \text{if } y_i = h_t(x_i) \\ e^{\alpha_t} & \text{if } y_i \neq h_t(x_i) \end{cases} = \frac{D_t(i)\, e^{-\alpha_t y_i h_t(x_i)}}{Z_t}$$

where $Z_t$ is the normalization constant that makes $D_{t+1}$ a distribution, $\epsilon_t$ is the weighted error of $h_t$ under $D_t$, and

$$\alpha_t = \frac{1}{2} \ln\left(\frac{1-\epsilon_t}{\epsilon_t}\right) > 0$$

The final classifier is the weighted majority vote

$$H_{\text{final}}(x) = \text{sign}\left(\sum_t \alpha_t h_t(x)\right)$$
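Putting the update, the choice of $\alpha_t$, and the final vote together, here is a minimal sketch of the training loop. It assumes a `weak_learner(X, y, D)` callable that returns a classifier mapping inputs to $\{-1, +1\}$; that interface, and the names used, are assumptions for illustration rather than anything fixed by the notes.

```python
import numpy as np

def adaboost(X, y, weak_learner, T):
    """Minimal AdaBoost sketch: maintain a distribution D_t over the m
    training examples, reweight after each round, and return the
    weighted-vote ensemble. Labels y are assumed to be in {-1, +1}."""
    y = np.asarray(y)
    m = len(y)
    D = np.full(m, 1.0 / m)                     # D_1(i) = 1/m
    classifiers, alphas = [], []
    for t in range(T):
        h = weak_learner(X, y, D)               # weak classifier h_t trained on D_t
        pred = h(X)                             # predictions in {-1, +1}
        eps = np.sum(D * (pred != y))           # weighted error epsilon_t (assumed 0 < eps < 1/2)
        alpha = 0.5 * np.log((1 - eps) / eps)   # alpha_t > 0 whenever eps < 1/2
        D = D * np.exp(-alpha * y * pred)       # down-weight hits, up-weight mistakes
        D = D / D.sum()                         # dividing by Z_t keeps D_{t+1} a distribution
        classifiers.append(h)
        alphas.append(alpha)

    def H_final(X_new):
        """H_final(x) = sign(sum_t alpha_t * h_t(x))."""
        return np.sign(sum(a * h(X_new) for a, h in zip(alphas, classifiers)))

    return H_final
```

Note that normalizing by `D.sum()` is exactly the division by $Z_t$ in the case-by-case update above.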
Mini-Problem
Claim: write $\epsilon_t = \frac{1}{2} - \gamma_t$; then

$$\text{training error}(H_{\text{final}}) \le \prod_t \left[2\sqrt{\epsilon_t(1-\epsilon_t)}\right] = \prod_t \sqrt{1-4\gamma_t^2} \le \exp\left(-2\sum_t \gamma_t^2\right)$$

so if each weak classifier is slightly better than random ($\gamma_t \ge \gamma > 0$), the training error drops exponentially fast in the number of rounds $T$.
Proof
Step 1: unwrapping the recurrence
With $f(x) = \sum_t \alpha_t h_t(x)$, repeatedly applying the update gives

$$D_{T+1}(i) = \frac{1}{m} \cdot \frac{\exp\left(-y_i \sum_t \alpha_t h_t(x_i)\right)}{\prod_t Z_t} = \frac{e^{-y_i f(x_i)}}{m \prod_t Z_t}$$
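Only Step 1 survives in these notes; for completeness, here is a sketch of how the standard argument usually finishes (the original proof may have been presented differently). A misclassified point has $y_i f(x_i) \le 0$, so $e^{-y_i f(x_i)} \ge 1$, and therefore

$$\text{training error}(H_{\text{final}}) = \frac{1}{m}\sum_i \mathbf{1}\{y_i \ne H_{\text{final}}(x_i)\} \le \frac{1}{m}\sum_i e^{-y_i f(x_i)} = \sum_i D_{T+1}(i) \prod_t Z_t = \prod_t Z_t$$

With the chosen $\alpha_t$,

$$Z_t = \sum_i D_t(i)\, e^{-\alpha_t y_i h_t(x_i)} = (1-\epsilon_t)e^{-\alpha_t} + \epsilon_t e^{\alpha_t} = 2\sqrt{\epsilon_t(1-\epsilon_t)} = \sqrt{1-4\gamma_t^2}$$

and $\prod_t \sqrt{1-4\gamma_t^2} \le \exp\left(-2\sum_t \gamma_t^2\right)$ by $1+x \le e^x$, which gives the claim.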
Empirically, the test error does not increase even after 1000 rounds of boosting, and it continues to drop even after the training error reaches zero.
Pros/Cons of AdaBoost
Pros
- Fast
- Simple and easy to program
- No parameters to tune (except T)
- No prior knowledge needed about the weak learner
- Provably effective given the Weak Learning Assumption
- Versatile
Cons

- Weak classifiers that are too complex lead to overfitting.
- Weak classifiers that are too weak can lead to low margins, which can also lead to overfitting.
- From empirical evidence, AdaBoost is particularly vulnerable to uniform noise.