100% found this document useful (1 vote)
765 views

Itae006 Exam

The document contains 28 multiple choice questions about predictive modeling concepts. The questions cover topics like descriptive modeling algorithms, bagging ensembles, normal distributions, missing data, model evaluation metrics, and more.

Uploaded by

Nageshwar Singh
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
100% found this document useful (1 vote)
765 views

Itae006 Exam

The document contains 28 multiple choice questions about predictive modeling concepts. The questions cover topics like descriptive modeling algorithms, bagging ensembles, normal distributions, missing data, model evaluation metrics, and more.

Uploaded by

Nageshwar Singh
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 9

Question 1

Which of the following statements is incorrect?

O Descriptive modeling algorithms are also called as unsupervised learning methods.

O Descriptive modeling algorithms try to find relationships between inputs

O Descriptive modeling algorithms discover the best way to segment the data

O Descriptive modeling algorithms try to find relationships that associate inputs to one or more target
variables

Question 2

is an important requirement for building good bagged ensembles?

O Underfiting the model

O Overtitting the model

O Exact fiting the model

ONone of these

Question 3

Which of the following is a property of normal distribution?

O Approximately 95% of the data will fall between the mean and +/—1 standard deviation from the
mean.

O Approximately 95% of the data will fall between the mean and +/—2 standard deviations from the
mean

O Approximately 95% of the data will fall between the mean and +/-3 standard deviations from the
mean

O Approximately 95% of the data will fall between the mean and +/—4 standard deviations from the
mean

Question 4

This study source was downloaded by 100000824045226 from CourseHero.com on 01-03-2023 00:37:18 GMT -06:00

https://www.coursehero.com/file/61581571/BATC601-Predictive-Modelling-Test-2pdf/
MCAR stands for

O missing completely at random

O missing conditional at random

O missing convolute at random

O None of these

Question 5

built classifier always have percent correct classification (PCC) in the numeric range of.

O well, 50 to 100

O badly, 50 to 100

O badly, 0 to 10

O well, 0 to 10

Question 6

What sampling technique do statisticians typically use to assess model stability?

O Cross Validation

O Curse of dimensionality

O Rule of Thumb

O Temporal Sequencing

Question 7

Decision trees are not distance-based algorithms and therefore are _________ by __________. and
skewed distributions.

This study source was downloaded by 100000824045226 from CourseHero.com on 01-03-2023 00:37:18 GMT -06:00

https://www.coursehero.com/file/61581571/BATC601-Predictive-Modelling-Test-2pdf/
O unaffected, outliers

O affected, normalized

O unaffected, normalized

Question 8

Computational methods to discover and report influential patterns in data are known as

O Data Mining

O Data Discovery

O Data Analytics

OAlll of the Above

Question 9

Which of the following is a standard data mining methodology?

O CRISP-DM

O SPSS

O Clementine

O Mineset

Question 10

What are the different approaches to handle outliers in data?

O Remove the outliers from the modeling data

O Separate the outliers and create separate models just for outliers

O Transform the outliers so that they are no longer outliers

O All of these

This study source was downloaded by 100000824045226 from CourseHero.com on 01-03-2023 00:37:18 GMT -06:00

https://www.coursehero.com/file/61581571/BATC601-Predictive-Modelling-Test-2pdf/
Question 11

What is true about a distribution measured by kurtosis?

O Kurtosis is always negative

O Normal distribution will have a Kurtosis value of 2

O A leptokurtic distribution is one in which Kurtosis values is more than 4

O A platykaurtic distribution is one in which Kurtosis values is greater than 3

Question 12

The choice of the model assessment metric should be tied to ______ rather than ______

O operational considerations, algorithmic expedience

O algorithmic expedience, operational considerations

O algorithmic considerations, operational expedience

O None of these

Question 13

Predictive analytics is the process of

O just cleaning data

O just compressing data

O guessing about present output with out any data

O information retrieval to make useful predictions about future outcomes

Question 14

What is the format in which data must be available for predictive modelling?

O One dimension

O Two dimension

This study source was downloaded by 100000824045226 from CourseHero.com on 01-03-2023 00:37:18 GMT -06:00

https://www.coursehero.com/file/61581571/BATC601-Predictive-Modelling-Test-2pdf/
O Three Dimension

O n Dimension

Question 15

Inputs are analysed and grouped / clustered based on the proximity of input values to one another is

O Supervised Learning

O Unsupervised Learning

O Descripti ‘

O Both 2 and 3

Question 16

What is the value of skew in a normal distribution?

O Less than 1

O 00

O1

O Greater than 1

Question 17

As a thump rule or guiding principle, ANOVA method works when there are

O worst, small no. of

O best, small no. of

O best, large no. of

O worst, large no. of

Question 18

This study source was downloaded by 100000824045226 from CourseHero.com on 01-03-2023 00:37:18 GMT -06:00

https://www.coursehero.com/file/61581571/BATC601-Predictive-Modelling-Test-2pdf/
Most frequent metrics to assess model accuracy in classification problems is.

O PCC

O ROC

O aUC

O None of the above

Question 19

If data is applied to clustering algorithm then it is difficult to understand summaries after clustering.

O without normalization

O with normalization

O with compression

O None of these

Question 20

What is the phenomenon called when a trend is seen in individual variables, but is reversed when
variables are combined’

O Simpson's paradox

O Redskin Rule

O Anscombe’s Quartet

O Platykurtic

Question 21

What is the correct two-way combinations /interactions possible, if the number of variables is 5?

O The number of possible two-way interactions is 2.

O The number of possible two-way interactions is 5.

O The number of possible two-way interactions is 10

This study source was downloaded by 100000824045226 from CourseHero.com on 01-03-2023 00:37:18 GMT -06:00

https://www.coursehero.com/file/61581571/BATC601-Predictive-Modelling-Test-2pdf/
O The number of possible two-way interactions is 20

Question 22

What are the challenges in using Predictive Analytics?

O Predictive models require data in the form of two-dimensional data (rows and columns).

O Often, deployment of predictive models require shift in resources for an organization.

O The models become too complex because of overfiting.

O All of these

Question 23

If the value of a variable can range from negative infinity to positive infinity, what is the type of this
variable?

O Categorical variables

O Continous variables

O Numeric Variables

Question 24

Which of the following is NOT one of the distance metric used in building the K-Means clustering model?

O Mahalanobis distance metric

O Milwaukee distance metric

© Manhattan distance metric

O Minkowski distance metric

Question 25

Which of the following statements is true?

This study source was downloaded by 100000824045226 from CourseHero.com on 01-03-2023 00:37:18 GMT -06:00

https://www.coursehero.com/file/61581571/BATC601-Predictive-Modelling-Test-2pdf/
O The median would not change much when there is a single large outlier

O The mean would not change much when there is a single large outlier

O The mean is defined as the value that is exactly 50 percent of the way from the minimum to maximum
value of the variable

O The calculation of mean requires the data to be first sorted

Question 26

Ensembles are the methods which not only increases model accuracy but also

O they increase only model sensitivity

O they reduce risk on deploying poor model

Question 27

What is true about predictive modeling algorithms, assuming there are two customer records in the
data who are actually brother and sister?

O Predictive algorithms treat the two customers records as dependent

O Predictive algorithms must know they are related

O Predictive algorithms treat these two no differently than any other two people with similar patterns
or behavior

O None of these

Question 28

Which of these is False about Correlations between two variables

O measurer the numerical relationship of one variable to another's

O one variable meaning is related to another's

O Both of these

This study source was downloaded by 100000824045226 from CourseHero.com on 01-03-2023 00:37:18 GMT -06:00

https://www.coursehero.com/file/61581571/BATC601-Predictive-Modelling-Test-2pdf/
Onone of the above

Question 29

Which of the following is NOT a Single-Variable Selection Technique?

O Chi-square Test

O Simpson's Paradox

O ANOVA

O Linear regression forward selection (1 step)

Question 30

In K MEANS, what is the number of clusters in the data?

O Algorithm will determine the same dynamically

O lt must be pre-specified

O lt is always 2

O lt is always 3

This study source was downloaded by 100000824045226 from CourseHero.com on 01-03-2023 00:37:18 GMT -06:00

https://www.coursehero.com/file/61581571/BATC601-Predictive-Modelling-Test-2pdf/
Powered by TCPDF (www.tcpdf.org)

You might also like