
Multiple Regression

Linear regression using multiple inputs, often referred to as multiple linear regression, is a statistical method used to model the relationship between a dependent variable and two or more independent variables. It is an extension of simple linear regression, which deals with only one independent variable.
**Data Collection: Gather data for the dependent variable and multiple independent variables, ensuring the data is suitable for regression (e.g., no severe multicollinearity).
**Model Fitting: Use the least squares method to fit the regression model, minimizing the difference between observed and predicted values.
**Coefficient Estimation: Calculate coefficients that show how much the dependent variable changes with a one-unit change in each independent variable, assuming the others remain constant.
**Statistical Significance: Test the significance of each coefficient with t-tests and the overall model with an F-test to determine whether the relationships are statistically meaningful.
**Model Evaluation: 1. R-squared: measures the proportion of variance in the dependent variable explained by the independent variables. 2. Residual Analysis: check whether the errors are randomly distributed. 3. Multicollinearity: ensure the independent variables aren't too highly correlated.
**Prediction: Use the validated model to predict the dependent variable for new data (a minimal fitting sketch follows below).
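As a rough illustration of this workflow, the sketch below fits a multiple linear regression with scikit-learn (assuming it is installed); the feature matrix and target values are invented for illustration.

import numpy as np
from sklearn.linear_model import LinearRegression

# Invented data: two independent variables (columns) and one dependent variable.
X = np.array([[1.0, 2.0], [2.0, 1.0], [3.0, 4.0], [4.0, 3.0], [5.0, 5.0]])
y = np.array([3.1, 3.9, 7.2, 8.1, 10.0])

model = LinearRegression().fit(X, y)   # least squares fit
print(model.intercept_, model.coef_)   # beta_0 and the coefficients beta_1, beta_2
print(model.score(X, y))               # R-squared on the training data
print(model.predict([[6.0, 4.0]]))     # prediction for a new observation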
Linear regression with multiple outputs is a variation of traditional
linear regression that is used when you want to predict multiple
dependent variables simultaneously, rather than just a single
dependent variable. This technique is also known as multivariate
linear regression. In standard linear regression, you have a single dependent variable (Y) and one or more independent variables (X), and the aim is to find the linear relationship that best describes how they are related. The general equation for simple linear regression is: Y = β0 + β1X + ε
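For the multiple-output case, here is a minimal sketch, assuming scikit-learn; LinearRegression accepts a two-dimensional target and fits one set of coefficients per output column (the data are made up).

import numpy as np
from sklearn.linear_model import LinearRegression

X = np.array([[1.0], [2.0], [3.0], [4.0]])   # one input feature
Y = np.array([[2.1, 0.9],                    # two dependent variables per row
              [4.2, 2.1],
              [5.8, 2.9],
              [8.1, 4.2]])

model = LinearRegression().fit(X, Y)
print(model.coef_)             # one coefficient row per output
print(model.predict([[5.0]]))  # predicts both outputs at once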
Qualitative vs. Quantitative Attributes
1. Qualitative Attributes:#Definition: Qualitative attributes (also called
categorical attributes) are non-numeric and describe qualities or
characteristics of an entity.//Examples:*Color of a car (Red,
Blue)*Gender (Male, Female).//Usage: Used for classification tasks;
converted to numeric form using encoding techniques like One-Hot or
Label Encoding.2. Quantitative Attributes:#Definition: Quantitative
attributes are numeric and describe measurable quantities or
amounts.//Examples:*Height (170 cm)*Price ($19.99).//Usage: Used
directly in calculations and regression tasks; may require normalization.
When to Use:#Qualitative: When dealing with categories (e.g.,
predicting customer preferences).#Quantitative: When measuring
quantities (e.g., predicting prices based on numeric features).
Example Scenario: Analyzing retail customer data:*Qualitative: Gender,
Payment Method for understanding categories.*Quantitative: Age,
Purchase Amount for predicting spending patterns.
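As a small sketch of handling both attribute types, assuming pandas, the hypothetical columns below mirror the retail example: the qualitative columns are one-hot encoded while the quantitative columns pass through as numbers.

import pandas as pd

df = pd.DataFrame({
    "Gender": ["Male", "Female", "Female", "Male"],      # qualitative
    "PaymentMethod": ["Card", "Cash", "Card", "Card"],   # qualitative
    "Age": [25, 34, 29, 41],                             # quantitative
    "PurchaseAmount": [19.99, 54.10, 23.50, 75.00],      # quantitative
})

# One-hot encode the qualitative columns; quantitative columns are left unchanged.
encoded = pd.get_dummies(df, columns=["Gender", "PaymentMethod"])
print(encoded.head())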
Method of Least Squares
1. Purpose://The method of least squares is used to find the best-
fitting line (or model) by minimizing the residual sum of squares
(RSS), which represents the difference between observed and
predicted values. 2. Residual Sum of Squares (RSS)://RSS Formula:
RSS = ∑(yᵢ − ŷᵢ)², where the yᵢ are the observed values and the ŷᵢ are the predicted values. 3. Choosing Coefficients (β)://The coefficients β0, β1, …, βn are chosen by minimizing the RSS.//Minimization Process:**Calculate the partial derivatives of the RSS with respect to each β.**Set these derivatives to zero and solve the resulting equations to find the values of β that minimize the RSS. 4. Outcome:*The resulting coefficients β provide the line or model that best fits the data by minimizing the overall prediction error.
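The minimization described above has a closed form. A small numpy sketch (with invented data) solves the normal equations XᵀXβ = Xᵀy, which is what setting the partial derivatives of the RSS to zero produces, and then reports the RSS.

import numpy as np

# Invented data: a column of ones for the intercept plus one predictor.
X = np.column_stack([np.ones(5), np.array([1.0, 2.0, 3.0, 4.0, 5.0])])
y = np.array([2.0, 4.1, 5.9, 8.2, 9.9])

# Solve the normal equations X^T X beta = X^T y.
beta = np.linalg.solve(X.T @ X, X.T @ y)

residuals = y - X @ beta
rss = np.sum(residuals ** 2)   # residual sum of squares at the minimizing beta
print(beta, rss)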
Linear Discriminant Analysis (LDA) is a statistical and machine learning
technique used for dimensionality reduction and classification. It is primarily
employed in the field of pattern recognition and machine learning for tasks like
face recognition, image classification, and data compression. LDA differs from
other dimensionality reduction techniques, such as Principal Component
Analysis (PCA), because it takes into account the class labels of the data points.
● Data Preprocessing: LDA begins with a labeled dataset, where each data
point is associated with a class label. This dataset is used for both
dimensionality reduction and classification tasks. ● Calculate Class Means: For
each class in the dataset, LDA calculates the mean (average) of the feature
vectors belonging to that class. This results in as many class means as there are
classes in the data. ● Calculate Scatter Matrices: LDA then computes two
scatter matrices: ● Within-class scatter matrix (Sw): This measures the variance
of data points within each class. It is calculated by summing up the covariance
matrices of individual classes. ● Between-class scatter matrix (Sb): This
measures the variance between class means. It is calculated by finding the
covariance between the class means and then scaling it by the number of data
points in each class. ● Eigenvalue Decomposition: The next step involves
calculating the eigenvectors and eigenvalues of the matrix Sw^-1 * Sb. These
eigenvectors represent the directions in the feature space along which the
classes are best separated. ● Selecting Discriminant Vectors: The eigenvectors
with the highest eigenvalues are selected as the discriminant vectors. These
vectors capture the most important information for class discrimination. ●
Projecting Data: To reduce the dimensionality of the data, you can project the
original data onto the discriminant vectors. The number of discriminant vectors
chosen typically depends on the desired dimensionality reduction. ●
Classification: LDA can also be used for classification tasks. After reducing the
dimensionality of the data using the discriminant vectors, you can apply a
classifier (e.g., linear discriminant analysis, logistic regression) to classify new
data points. Usage in Predictions:*Training: Fit the LDA model on labeled training data, learning the means, covariance matrix, and prior probabilities of each class.*Prediction: For new data points, LDA calculates the probability of belonging to each class and assigns the point to the class with the highest probability.
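A minimal usage sketch, assuming scikit-learn and an invented two-class toy dataset; LinearDiscriminantAnalysis computes the scatter matrices internally and can both project (reduce dimensionality) and classify.

import numpy as np
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

# Invented two-class data with three features.
X = np.array([[1.0, 2.0, 0.5], [1.2, 1.9, 0.6], [0.9, 2.2, 0.4],
              [3.0, 0.5, 2.0], [3.2, 0.4, 2.1], [2.9, 0.6, 1.9]])
y = np.array([0, 0, 0, 1, 1, 1])

lda = LinearDiscriminantAnalysis(n_components=1)
X_reduced = lda.fit_transform(X, y)          # project onto the discriminant direction
print(X_reduced.ravel())
print(lda.predict([[1.1, 2.0, 0.5]]))        # class with the highest probability
print(lda.predict_proba([[1.1, 2.0, 0.5]]))  # per-class probabilities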
Perceptron is a basic linear classification method suitable for simple
tasks with linearly separable data, while logistic regression offers
more flexibility and sophistication, making it a common choice for
various classification problems. More advanced neural networks are
typically preferred for handling complex, nonlinear classification
tasks. Perceptron learning algorithm is a linear classification method,
and it's important to understand it in the context of other linear
classification methods. Linear classification methods are used for
separating data points into different classes using linear decision
boundaries, such as lines or hyperplanes.

This step function, or activation function, is vital in ensuring that the output is mapped to (0,1) or (-1,1). Take note that the weight of an input indicates that node's strength; similarly, an input's bias value gives the ability to shift the activation function curve.
Context in Predictive Algorithms:**The Perceptron algorithm is used in situations where the data is linearly separable. It is a foundation for more complex neural networks and a precursor to other linear classifiers such as Support Vector Machines (SVM).**In predictive modeling, it serves as a simple, interpretable model for binary classification but is limited to linearly separable data (a minimal update-rule sketch follows below).
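A bare-bones sketch of the perceptron update rule on invented, linearly separable data; misclassified points nudge the weights toward the correct side of the boundary.

import numpy as np

# Invented, linearly separable data with labels in {-1, +1}.
X = np.array([[2.0, 1.0], [1.5, 2.0], [3.0, 3.0],
              [-1.0, -2.0], [-2.0, -1.5], [-3.0, -1.0]])
y = np.array([1, 1, 1, -1, -1, -1])

w = np.zeros(X.shape[1])   # weights
b = 0.0                    # bias
lr = 0.1                   # learning rate

for _ in range(20):                           # a few passes over the data
    for xi, yi in zip(X, y):
        if yi * (np.dot(w, xi) + b) <= 0:     # misclassified (or on the boundary)
            w += lr * yi * xi                 # move the boundary toward the point's side
            b += lr * yi

print(w, b)
print(np.sign(X @ w + b))   # should match y once the data are separated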
Ridge regression shrinks the regression coefficients by imposing a penalty on their size; the ridge coefficients minimize a penalized residual sum of squares. Ridge regression extends linear regression by adding a regularization term to the OLS cost function. **Purpose: Addresses multicollinearity and stabilizes coefficient estimates by adding a penalty term to the regression model.**Penalty Term: Adds the square of the magnitude of the coefficients (L2 regularization).**Formula: Minimize(∑(y − ŷ)² + λ∑βᵢ²).**Effect: Shrinks coefficients towards zero but does not set any coefficient exactly to zero, keeping all features in the model.**Example: Predicting house prices with features such as size, location, and number of bedrooms. Ridge regression will reduce the influence of correlated features like size and location while keeping all features in the model, mitigating the effects of multicollinearity (a small sketch follows below).
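Here is a small sketch with scikit-learn's Ridge on invented, deliberately correlated features; alpha plays the role of λ, and the fitted coefficients shrink but none become exactly zero.

import numpy as np
from sklearn.linear_model import Ridge

rng = np.random.default_rng(0)
size = rng.normal(100, 20, 50)
location_score = 0.9 * size + rng.normal(0, 2, 50)   # correlated with size on purpose
bedrooms = rng.integers(1, 5, 50)
price = 2.0 * size + 1.5 * location_score + 10 * bedrooms + rng.normal(0, 5, 50)

X = np.column_stack([size, location_score, bedrooms])
ridge = Ridge(alpha=10.0).fit(X, price)   # alpha is the L2 penalty weight (lambda)
print(ridge.coef_)                        # shrunken, but none exactly zero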
Lasso regression, short for "Least Absolute Shrinkage and Selection Operator" regression, is a linear regression technique used for feature selection and regularization in statistical modeling and machine learning.**Purpose: Performs both regularization and variable selection by adding a penalty term that can shrink some coefficients to zero.**Penalty Term: Adds the absolute value of the magnitude of the coefficients (L1 regularization).**Formula: Minimize(∑(y − ŷ)² + λ∑|βᵢ|).**Effect: Shrinks some coefficients to exactly
zero, effectively excluding less important features from the model.**
Example: Predicting house prices with many features, including less
relevant ones like the number of fireplaces. Lasso regression might set the
coefficients of less important features (e.g., number of fireplaces) to zero,
simplifying the model and improving interpretability.
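The analogous sketch with scikit-learn's Lasso, adding a barely relevant feature (fireplaces) to invented housing data; with a sufficiently large alpha its coefficient is driven to exactly zero (the data and alpha here are purely illustrative).

import numpy as np
from sklearn.linear_model import Lasso

rng = np.random.default_rng(1)
size = rng.normal(100, 20, 50)
bedrooms = rng.integers(1, 5, 50)
fireplaces = rng.integers(0, 3, 50)      # irrelevant feature: not used to build price
price = 2.0 * size + 10 * bedrooms + rng.normal(0, 5, 50)

X = np.column_stack([size, bedrooms, fireplaces])
lasso = Lasso(alpha=1.0).fit(X, price)   # alpha is the L1 penalty weight (lambda)
print(lasso.coef_)                       # the fireplaces coefficient is typically shrunk to 0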
Lasso regression is particularly useful when dealing with datasets that have a large
number of features, as it helps prevent overfitting and simplifies the
model by automatically selecting a subset of the most important features.
A Generalised Additive Model (GAM) is an extension of the multiple linear model, which, recall, is
Y = β0 + β1x1 + β2x2 + ⋯ + βpxp + ε.
In order to allow for non-linear effects, a GAM replaces each linear component βjxj with a smooth non-linear function fj(xj), giving
Y = β0 + f1(x1) + f2(x2) + ⋯ + fp(xp) + ε.
This is called an additive model because we estimate each fj(xj) for j = 1, 2, …, p and then add together all of these individual contributions.
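A brief fitting sketch, assuming the third-party pygam package is available; s(i) declares a smooth term for feature column i, and the non-linear relationship is invented for illustration.

import numpy as np
from pygam import LinearGAM, s   # assumes pygam is installed

rng = np.random.default_rng(2)
X = rng.uniform(0, 10, size=(200, 2))
# Invented non-linear relationship: a sine effect plus a quadratic effect.
y = np.sin(X[:, 0]) + 0.1 * X[:, 1] ** 2 + rng.normal(0, 0.2, 200)

gam = LinearGAM(s(0) + s(1)).fit(X, y)   # one smooth function per predictor, added together
gam.summary()                            # prints smoothing details for each term
print(gam.predict(X[:5]))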
1. Generalized Additive Models (GAMs):**Description: GAMs extend generalized linear models by allowing non-linear relationships between predictors and the response variable using smooth functions.**Form: g(E(Y)) = β0 + f1(X1) + f2(X2) + ⋯ + fn(Xn), where g is a link function, β0 is the intercept, and the fi are smooth functions of the predictors.**Use Case: Useful when the relationship between predictors and the response is not strictly linear but can be modeled with smooth functions.
2. Additive Models (AMs):**Description: AMs are a subset of GAMs where the predictors are added linearly, but each predictor can be transformed non-linearly.**Form: Y = β0 + f1(X1) + f2(X2) + ⋯ + fn(Xn) + ε, where the fi are non-linear functions of the predictors and ε is the error term.**Use Case: Useful for modeling complex, non-linear relationships without assuming a specific functional form for the relationship.
3. Smooth Additive Models:**Description: A type of additive model where smooth functions are used to model relationships between predictors and the response.**Form: Similar to GAMs, with smooth functions applied to predictors but focusing specifically on smoothness in the modeling process.**Use Case: Suitable for capturing smooth, non-linear effects in the data.
Regression Tree://Purpose: Predicts a continuous target variable by splitting data into subsets based on feature values.//Process:**Splitting: At each node, the tree splits the data based on a feature that best separates the target variable into homogeneous groups.**Leaf Nodes: Each terminal leaf node represents a predicted value, calculated as the mean of the target variable in that subset.
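A short sketch with scikit-learn's DecisionTreeRegressor on invented step-shaped data; its default splitting criterion minimizes within-node squared error, i.e. the variance-reduction idea discussed next.

import numpy as np
from sklearn.tree import DecisionTreeRegressor

rng = np.random.default_rng(3)
X = rng.uniform(0, 10, size=(100, 1))
y = np.where(X[:, 0] < 5, 2.0, 8.0) + rng.normal(0, 0.5, 100)   # step-shaped target

tree = DecisionTreeRegressor(max_depth=2).fit(X, y)
print(tree.predict([[3.0], [7.0]]))   # leaf predictions are the subset means (near 2 and 8)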
Gini Index and Split Criteria:**Gini Index: Primarily used in classification trees,
it measures the impurity of a node. For regression trees, the focus is on
variance reduction rather than the Gini Index.**Split Criteria for Regression Trees: Variance Reduction: Chooses splits that minimize the variance of the
target variable within the resulting subsets. The goal is to reduce the variance
in the target variable as much as possible with each split, improving predictive
accuracy.
Gradient definition: The gradient is a vector that indicates the direction and
rate of the steepest increase of a function. It is used in optimization to find the
minimum or maximum of a function.
Components:*For a function f(x) with multiple variables x1, x2, …, xn, the gradient is the vector of partial derivatives:
∇f = (∂f/∂x1, ∂f/∂x2, …, ∂f/∂xn)
Interpretation://Positive Gradient:**Indicates that the function is increasing in the direction of the gradient.**If the gradient is positive for a specific variable,
increasing that variable will increase the function value. This suggests moving
in the direction of the gradient to increase the function's value.//Negative
Gradient:**Indicates that the function is decreasing in the direction of the
gradient.**If the gradient is negative for a specific variable, increasing that
variable will decrease the function value. This suggests moving in the opposite
direction of the gradient to decrease the function's value.
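To make the "move opposite the gradient" idea concrete, here is a tiny gradient-descent sketch on the invented quadratic f(x1, x2) = x1² + 3x2², whose analytic gradient is (2x1, 6x2).

import numpy as np

def grad(x):
    # Gradient of f(x1, x2) = x1**2 + 3 * x2**2
    return np.array([2 * x[0], 6 * x[1]])

x = np.array([4.0, -2.0])   # arbitrary starting point
lr = 0.1                    # step size

for _ in range(100):
    x = x - lr * grad(x)    # step opposite the gradient to decrease f

print(x)                    # approaches the minimum at (0, 0)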
Gaining Insight into the Exponential Loss Function
1. Exponential Loss Function://Definition: The exponential loss function, used in boosting algorithms, is defined as L(y, ŷ) = exp(−y · ŷ), where y is the true label and ŷ is the predicted value.
2. Insights from Properties://Sensitivity to Misclassification: The exponential loss penalizes misclassifications heavily, especially when the prediction is far from the true label. This makes it sensitive to outliers.//Boosting Effect: In boosting, this loss function drives the model to focus more on hard-to-classify instances by increasing their weights, improving overall model performance.//Gradient Behavior: The gradient of the exponential loss function increases with the magnitude of the prediction error, guiding the optimization process to correct large errors effectively.
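A quick numeric sketch of the exponential loss for labels in {-1, +1}; note how the loss and the magnitude of its gradient grow rapidly for confident wrong predictions, which is what pushes boosting to re-weight hard examples.

import numpy as np

def exp_loss(y_true, y_pred):
    # L(y, y_hat) = exp(-y * y_hat), with y in {-1, +1} and y_pred a real-valued score
    return np.exp(-y_true * y_pred)

y_true = np.array([1, 1, -1, -1])
y_pred = np.array([2.0, -0.5, -2.0, 1.5])   # two correct and two incorrect predictions

print(exp_loss(y_true, y_pred))             # confident mistakes get by far the largest loss
# Gradient with respect to the prediction: dL/dy_hat = -y * exp(-y * y_hat)
print(-y_true * np.exp(-y_true * y_pred))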
