Random Motors Briefing
Random Motors Briefing
Decision Sciences
Shukla
Agenda
Topic Time (mins.)
Introduction to Random Motors Project- Case Briefing 10
Expectations and Question discussion (Q:1,2,3) 10
Practice: Hypothesis 20
Expectations and Question discussion (Q:4,5,6,7,8) 5
Practice: Regression 20
Common mistakes to avoid 5
Quiz :https://forms.gle/geXfYnWkvGdANYNb6 5
Q&A 15
Total 90
https://www.caradvice.com.au/955136/mitsubishi-loses-supreme-court-appeal-on-misleading-and
deceptive-fuel-consumption-claim/
Introduction to Random Motors Project- Case Briefing
• You are the Lead Business Analyst at Random Motors, an automobile conglomerate that manufactures
cars across a large range of price points
• For the price-conscious working-class consumers, Random Motors manufactures cars under the brand
Rocinante. The various models of Rocinante fall in the range of ₹5–10 lakh. The brand is known for
cars that are value for money and very attractive mileage.
• For the consumers willing to purchase a luxury car, Random Motors operates under the brand
Marengo. Marengo models fall in the range of ₹30–60 lakh. They are considered sports cars and have
high horsepower and top speed.
• Every time Random Motors launches a new model, it advertises the model’s specifications, such as
price, mileage, and top speed, because consumers usually take these factors into account before
purchasing cars.
• Suppose your chief engineer has recently come up with two new models: Rocinante36 and
Marengo32. He promises that they will outperform the existing competitors in the market.
Introduction to Random Motors Project- Case Briefing
• The chief engineer has built 20 prototypes of each of the two models to check if the cars are performing
as per the desired specifications.
• The test drive results for the 20 prototypes of each of the two models are provided in the file titled Data
Exhibit 1. During the test, the mileage and the top speed is recorded for each of the prototypes. The
data will be used for performing hypothesis testing for the claims made on the specifications.
• Currently, the company has enough cash reserves to invest in only one model. Therefore, as the Lead
Business Analyst, you need to carefully choose the right model and decide the number of cars that you
want to produce. Since you believe in making data-based decisions, you ask your team to provide you
with the sales data of all the past models released under the brands Rocinante and Marengo. The sales
data is provided in the file titled Data Exhibit 2.
You need to upload the three files (2 Excel files and One Presentation saved as PDF) in a single ZIP
file. The ZIP file should be named as "Your Name_RandomMotors“
Item Submission Assessment Reference
Data Exhibit 1 Hypothesis_Testing_(Your Name) Q1,2,3
Data Exhibit 2 Regression_Analysis_(Your Name) Q4,5,6,7,8
PPT Your Name_RandomMotors (pdf) Presentation of findings for evaluation
✓Q1. 50 (25,25)
✓Q2. 100 (50 –p values, 50 right conclusion)
✓Q3. 150 (50- identifying, 100- reason)
✓Q4. 300(50-reg coeff, 50- reg eq, 50-sales)
✓Q5. 150 (75,75)
✓Q6. 50 (identify model)
✓Q7. 100 (50-identify, 50- logic)
✓Q8. 100 (50- Adjusted r value calculation, 50 – Reason for
change) ✓Total =1000 marks
Q-1a) Formulate the null hypotheses to check whether the new models are
performing as per the desired design specifications. 50 Marks (25+25)
Mileage HO :
Top speed HO :
null hypotheses
Q-1b) Formulate the alternate hypotheses to check whether the new models are
performing as per the desired design specifications.
Top speed H1:
For Rocinante36:
Mileage H1:
Price:
Mileage:
Top speed:
Equation:
Price:
Mileage:
Top speed:
Equation:
Rocinante36 Model:
Profit
Marengo32 Model:
Q-6) As a CEO, you wish to invest only in the model which is predicted to be more
profitable. Which model among Rocinante36 and Marengo32 will you invest in?
(50-identify model)
Hypothesis testing
Steps
1. Frame null and alternate hypothesis (one tail vs two
tail) 2. Decide test to be used (z/t )
3. Decide level of significance(or Confidence interval)
4. Run the test to get value of test statistics (Exhibit
1)
5. Calculate p value
(http://courses.atlas.illinois.edu/spring2016/STAT/STAT200/pt.html
) 6. Give result based on p value (reject or fail to reject null) 7.
Frame type of errors, decide which is more costly
Calculation
Method 1 Method 2
Roccinate Mileage p-value http://courses.atlas.illinois.edu/sprin
g2 016/STAT/STAT200/pt.html)
Sample size
1. Use data analysis tool pak
Sample standard de
= 2. Create dummy variable with value 0 3. Use test for
unequal variances to run test at 95% confidence level
Hypothesized mean
Sample mean =average
Std error of mean =standard dev/sqrt of
sample size Rule for decision making
Calculation
Steps for regression
▪ Profit (Q5)
▪ Use Sales calculated above to calculate profit
▪ Profit= Sales*(price-cost)
• https://forms.gle/geXfYnWkvGdANYNb6
Regression
• Regression is a statistical method used to determine the
relationship between variables in the form of an equation.
• In regression terminology, the variable that you predict is
called the ‘dependent’ or the ‘response’ variable. It is
usually denoted by ‘Y’.
• The variable that is used to predict this dependent variable
is called the ‘independent’ or ‘explanatory’ variable. It is
usually denoted by ‘X’.
• In simple linear regression, a regression model is in the
form of a straight line: Y=β0+β1X
• Y = a+bX
Summary
• Regression is the determination of a statistical relationship between
two or more variables.
• In simple regression, we have only two variables, one variable
(defined as independent) is the cause of the behaviour of another
one (defined as dependent variable).
EatingWeight
Gain
Independent Variable Dependent Variable
• Null Hypothesis
• Null hypothesis: Temperature does not significantly influence air conditioner sales
• Null hypothesis : β1=0 , where β1 is the coefficient of X in the equation Y=β0+β1X
• Alternative Hypothesis
• Alternative hypothesis : Temperature has a significant influence on the sales
Alternative hypothesis: β1≠0, where β1 is the coefficient of X in the equation
Y=β0+β1X
R square
• R-square, or the coefficient of determination, is a metric used to
quantify the goodness of fit for a regression model.
• R-square explains how close the actual data points are to the fitted
regression line. In other words, you can say that it measures the
percentage of variability in the dependent variable that is explained by
the regression line.
• R-square is equal to the explained variation in the dependent variable
divided by Total variation. R-square values lie between 0 and 1.
R square
• R square=0.8859
• Regression model is able to explain
88.59% of the variability of the
dependent variable
• 60-70% R square shows good
model • Its not adequate and
full proof test. As number of
variables increases, the R
square goes up.
• So, check for standard error.
Smaller the standard error, more
accurate the predictions of model
The standard error of the estimate
• When there are two or more than two independent variables, the
analysis concerning relationship is known as multiple correlation and
the equation describing such relationship as the multiple regression
equation.
► T test
• Overall Significance Test
• F test
F test- test for over all significance
• Check P value in ANOVA
table
• P<.01, Null hypothesis
rejected at 99%
confidence level
• At least one IDV has
significant impact on DV
Individual significance test : t test
• Check individual p value
• P<.01 reject Null hypothesis for Television advertising and News paper
advertising at 99% CL (i.e. accept Alternate and use Beta values) • P>.01, Fail
to reject Null hypothesis for online advertising at 99% CL (Beta values not
important)
Regression Equation
R square and Adjusted R square
• R Square is a basic matrix which tells you about that how much variance is been
explained by the model.
• What happens in a multivariate linear regression is that if you keep on adding new
variables, the R square value will always increase irrespective of the variable significance.
• What adjusted R square do is calculate R square from only those variables whose addition
in the model which are significant.
• So always while doing a multivariate linear regression we should look at adjusted R
square instead of R square.
Multiple R 0.8996
R-Square 0.8093