Esa - 2022 - Ue21cs241a

The document contains a sample question paper for an exam. It includes multiple choice and numerical questions related to statistics, probability, hypothesis testing, and linear regression. The questions cover topics such as sampling distributions, confidence intervals, hypothesis testing, probability distributions, correlation, and least squares regression.

9/21/23, 4:21 PM ESA - DEC - 2022 - UE21CS241A (set- 1)

1.a. A certain process for manufacturing integrated circuits has been in use for a
period of time, and it is known that 12% of the circuits it produces are defective.
A new process that is supposed to reduce the proportion of defectives is being
tested. In a simple random sample of 100 circuits produced by the new process,
12 were defective.

i. One of the engineers suggests that the test proves that the new process is no
better than the old process, since the proportion of defectives in the sample is
the same. Is this conclusion justified? Explain.
ii. Assume that there had been only 11 defective circuits in the sample of 100.
Would this have proven that the new process is better? Explain.
iii. Which outcome represents stronger evidence that the new process is better:
finding 11 defective circuits in the sample, or finding 2 defective circuits in the
sample?
iv. True or false: If the proportion of defectives in the sample is less than
12%, it is reasonable to conclude that the new process is better.
v. True or false: If the proportion of defectives in the sample is only slightly
less than 12%, the difference could well be due entirely to sampling variation,
and it is not reasonable to conclude that the new process is better.

Note: Each sub question carries 1 mark
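Parts ii and iii of 1.a turn on how likely each count would be if the new process were no better than the old one. The following is a minimal sketch, assuming the number of defectives in the sample is Binomial(100, 0.12) under that hypothesis; the helper name `binom_cdf` is illustrative and not part of the paper.

```python
from math import comb

def binom_cdf(k: int, n: int, p: float) -> float:
    """Return P(X <= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k + 1))

n, p_old = 100, 0.12  # sample size and old defect rate, from the question

# Chance of seeing a count this low or lower if the defect rate is still 12%.
p_at_most_11 = binom_cdf(11, n, p_old)
p_at_most_2 = binom_cdf(2, n, p_old)

print(f"P(X <= 11) = {p_at_most_11:.4f}")
print(f"P(X <= 2)  = {p_at_most_2:.6f}")
```

Under this assumption a count of 11 is quite likely even with no improvement, while a count of 2 would be very unlikely, which is why the second outcome is the stronger evidence.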

1.b. (i) List any three steps involved in understanding, cleaning and preparing
your data. (3 marks)
(ii) List any two uses of Web Scraping. (2 marks)

1.c. A sample of 100 adult women was taken, and each was asked how many
children she had. The results were as follows:

i. Find the sample mean number of children.
ii. Find the sample standard deviation of the number of children.
iii. Find the sample median of the number of children.
iv. What is the first quartile of the number of children?
v. What proportion of the women had more than the mean number of children?

Note: Each sub question carries 1 mark

1.d. Following are summary statistics for two data sets, A and B.

i. Compute the interquartile ranges for both A and B. (1 + 1 = 2 marks)
ii. Do the summary statistics for A provide enough information to construct a
boxplot? If so, construct the boxplot. If not, explain why.
(1 + 2 = 3 marks)

2.a. A survey of SUVs on a certain stretch of highway during morning commute
hours showed that 70% had only one occupant, 15% had 2, 10% had 3, 3% had 4,
and 2% had 5. Let X represent the number of occupants in a randomly chosen
SUV.

i. Find the probability mass function of X.
ii. Find P(X ≤ 2).
iii. Find P(X > 3).
iv. Find μ_X.
v. Find σ_X.

Note: Each sub question carries 1 mark
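The quantities asked for in 2.a follow directly from the stated percentages. A short sketch of the arithmetic, using only the figures given in the question (the variable names are illustrative):

```python
from math import sqrt

# Probability mass function of X, read off the survey percentages.
pmf = {1: 0.70, 2: 0.15, 3: 0.10, 4: 0.03, 5: 0.02}

p_le_2 = sum(p for x, p in pmf.items() if x <= 2)   # P(X <= 2)
p_gt_3 = sum(p for x, p in pmf.items() if x > 3)    # P(X > 3)

mean = sum(x * p for x, p in pmf.items())                # mu_X
var = sum((x - mean) ** 2 * p for x, p in pmf.items())   # sigma_X squared
sd = sqrt(var)                                           # sigma_X

print(f"P(X<=2)={p_le_2:.2f}  P(X>3)={p_gt_3:.2f}  mean={mean:.2f}  sd={sd:.4f}")
```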

2.b. A light fixture holds two lightbulbs. Bulb A is of a type whose lifetime is
normally distributed with mean 800 hours and standard deviation 100 hours.
Bulb B has a lifetime that is normally distributed with mean 900 hours and
standard deviation 150 hours. Assume that the lifetimes of the bulbs
are independent. Another light fixture holds only one bulb. A bulb of type A is
installed, and when it burns out, a bulb of type B is installed. What is the
probability that the total lifetime of the two bulbs is more than 2000 hours?
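For 2.b, the total lifetime is the sum of two independent normal lifetimes, so it is itself normal with mean 800 + 900 and variance 100² + 150². A minimal sketch of that calculation using the standard library:

```python
from math import sqrt
from statistics import NormalDist

# Sum of independent normals: means add, variances add.
total = NormalDist(mu=800 + 900, sigma=sqrt(100**2 + 150**2))

p_over_2000 = 1 - total.cdf(2000)
print(f"P(total lifetime > 2000 h) = {p_over_2000:.4f}")
```

The result is a little under 5%.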

2.c. Suppose 1,000 applicants show up for a job interview, but there are only 70
positions available. To select the best 70 people amongst the 1,000 applicants,
the employer gives an aptitude test to judge their abilities. The mean score on
the test is 60, with a standard deviation of 6. If an applicant scores an 84, can
they assume they are getting a job?
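The question in 2.c does not state the shape of the score distribution, so one possible approach is Chebyshev's inequality, which holds for any distribution. The sketch below assumes that the 70 positions go to the 70 highest scores; that selection rule is an assumption, not something the paper states.

```python
mean, sd, score = 60, 6, 84
n_applicants, n_positions = 1000, 70

# Number of standard deviations the score lies above the mean.
k = (score - mean) / sd

# Chebyshev: at most 1/k^2 of all scores lie k or more SDs from the mean.
cheb_fraction = 1 / k**2
max_count = cheb_fraction * n_applicants

# If no more than 70 applicants can be this far out, the score is in the top 70.
in_top = max_count <= n_positions
print(k, cheb_fraction, max_count, in_top)
```

With k = 4 the bound allows at most 62.5 applicants that far from the mean in either direction, which is fewer than the 70 positions.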

2.d. Write the general acceptance-rejection algorithm to generate random
variates.
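As an illustration of 2.d, the acceptance-rejection steps can be sketched as follows. The function name `accept_reject`, the target density 6x(1 − x) on [0, 1], the uniform proposal, and the constant c = 1.5 are all choices made for this example, not part of the paper.

```python
import random

def accept_reject(target_pdf, proposal_sample, proposal_pdf, c, rng):
    """Draw one variate from target_pdf, assuming f(x) <= c * g(x) for all x."""
    while True:
        y = proposal_sample(rng)   # Step 1: generate a candidate Y from g
        u = rng.random()           # Step 2: generate U ~ Uniform(0, 1)
        # Step 3: accept Y if U <= f(Y) / (c * g(Y)); otherwise repeat.
        if u <= target_pdf(y) / (c * proposal_pdf(y)):
            return y

# Example target: f(x) = 6x(1 - x) on [0, 1], whose maximum is 1.5 at x = 0.5,
# so a Uniform(0, 1) proposal with c = 1.5 satisfies f <= c * g.
rng = random.Random(0)
samples = [
    accept_reject(lambda x: 6 * x * (1 - x), lambda r: r.random(),
                  lambda x: 1.0, 1.5, rng)
    for _ in range(10_000)
]
sample_mean = sum(samples) / len(samples)
print(f"sample mean = {sample_mean:.3f}")
```

The target density is symmetric about 0.5, so the sample mean should land close to 0.5. On average about c candidates are needed per accepted variate, which is why a small c is preferred.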

3.a. Let X1, ..., Xn be a random sample from a N(μ, σ²) population. Find the
MLEs of μ and of σ. (5 + 5 = 10 marks)
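For 3.a, the standard closed-form result is that the MLE of μ is the sample mean and the MLE of σ is the square root of the average squared deviation (divisor n, not n − 1). The sketch below checks this numerically on a small made-up data set; the data values are hypothetical and serve only to show that the log-likelihood drops when either estimate is nudged.

```python
from math import log, pi, sqrt

def log_likelihood(data, mu, sigma):
    """Log-likelihood of an i.i.d. N(mu, sigma^2) sample."""
    n = len(data)
    sq = sum((x - mu) ** 2 for x in data)
    return -n * log(sigma) - 0.5 * n * log(2 * pi) - sq / (2 * sigma**2)

data = [4.1, 5.3, 3.8, 6.0, 5.1, 4.7]  # hypothetical sample, for illustration
n = len(data)

mu_hat = sum(data) / n                                      # sample mean
sigma_hat = sqrt(sum((x - mu_hat) ** 2 for x in data) / n)  # divisor n

best = log_likelihood(data, mu_hat, sigma_hat)
nearby = [
    log_likelihood(data, mu_hat + dm, sigma_hat + ds)
    for dm in (-0.1, 0.0, 0.1)
    for ds in (-0.1, 0.0, 0.1)
    if (dm, ds) != (0.0, 0.0)
]
print(best, max(nearby))
```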

3.b. Let X1 and X2 be independent, each with unknown mean μ and known
variance σ² = 1. Let [estimator definition missing from this copy]. Find the bias,
variance, and mean squared error of [estimator missing from this copy].

(Note: Marks split-up: 1 + 1 + 2 = 4 marks)

3.c. Estimate the confidence intervals for the following:

i. A group of 78 people enrolled in a weight-loss program that involved adhering
to a special diet and to a daily exercise program. After six months, their mean
weight loss was 25 pounds, with a sample standard deviation of 9 pounds. A
second group of 43 people went on the diet but didn’t exercise. After six months,
their mean weight loss was 14 pounds, with a sample standard deviation of 7
pounds. Find a 95% confidence interval for the mean difference between the
weight losses.

ii. In a random sample of 150 customers of a high-speed internet provider, 63
said that their service had been interrupted one or more times in the past
month. Find a 95% confidence interval for the proportion of customers whose
service was interrupted one or more times in the past month.

(Note: Marks split-up: 3 + 3 = 6 marks)
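Both intervals in 3.c can be sketched with large-sample normal approximations. The code below assumes a z-interval for the difference of means (both samples are large) and the simple Wald interval for the proportion; other interval constructions for a proportion exist and give slightly different endpoints.

```python
from math import sqrt
from statistics import NormalDist

z = NormalDist().inv_cdf(0.975)  # two-sided 95% critical value, about 1.96

# i. Difference in mean weight loss (diet plus exercise minus diet only).
diff = 25 - 14
se_diff = sqrt(9**2 / 78 + 7**2 / 43)
ci_diff = (diff - z * se_diff, diff + z * se_diff)

# ii. Proportion of customers with interrupted service (Wald interval).
p_hat = 63 / 150
se_p = sqrt(p_hat * (1 - p_hat) / 150)
ci_p = (p_hat - z * se_p, p_hat + z * se_p)

print(f"difference: ({ci_diff[0]:.2f}, {ci_diff[1]:.2f}) pounds")
print(f"proportion: ({ci_p[0]:.3f}, {ci_p[1]:.3f})")
```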

4.a. An advertising company claims that it receives a 10% response rate from its
mailings. To test this claim, a random sample of 500 recipients was surveyed,
yielding 40 responses. Test the claim at the α = 0.05 significance level.
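One way to carry out the test in 4.a is a one-sample z-test for a proportion. The sketch below assumes a two-sided alternative (H0: p = 0.10 versus H1: p ≠ 0.10), since the question does not say which direction is of interest.

```python
from math import sqrt
from statistics import NormalDist

p0, n, responses, alpha = 0.10, 500, 40, 0.05

p_hat = responses / n
# Standard error uses the null value p0, as is usual for this test.
z = (p_hat - p0) / sqrt(p0 * (1 - p0) / n)

p_value = 2 * NormalDist().cdf(-abs(z))  # two-sided P-value
reject = p_value < alpha

print(f"z = {z:.3f}, P-value = {p_value:.3f}, reject H0: {reject}")
```

Under these assumptions the P-value exceeds 0.05, so the sample does not contradict the 10% claim.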

4.b. The thicknesses of eight pads designed for use in aircraft engine mounts are
measured. The results, in mm, are 41.83, 41.01, 42.68, 41.37, 41.83, 40.50, 41.70,
and 41.42. Assume that the thicknesses are a sample from an approximately
symmetric distribution. Can you conclude that the mean thickness is greater
than 41 mm? Compute the appropriate test statistic and find the P-value.
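Because 4.b assumes only an approximately symmetric distribution, one plausible reading is that a Wilcoxon signed-rank test is intended; that choice is an assumption of this sketch, and a t test would be the alternative if normality were assumed. The code computes the signed-rank statistic S+ with average ranks for ties and an exact one-sided P-value by enumerating all sign patterns.

```python
from itertools import product

data = [41.83, 41.01, 42.68, 41.37, 41.83, 40.50, 41.70, 41.42]
mu0 = 41.0
diffs = [x - mu0 for x in data]

# Rank the absolute differences; tied values share the average of their ranks.
order = sorted(range(len(diffs)), key=lambda i: abs(diffs[i]))
ranks = [0.0] * len(diffs)
i = 0
while i < len(order):
    j = i
    while j + 1 < len(order) and abs(diffs[order[j + 1]]) == abs(diffs[order[i]]):
        j += 1
    avg_rank = (i + j) / 2 + 1
    for k in range(i, j + 1):
        ranks[order[k]] = avg_rank
    i = j + 1

# S+ is the sum of the ranks belonging to positive differences.
s_plus = sum(r for r, d in zip(ranks, diffs) if d > 0)

# Under H0 each rank is positive or negative with probability 1/2,
# so the exact null distribution comes from all 2^n sign patterns.
count = sum(
    1
    for signs in product((0, 1), repeat=len(ranks))
    if sum(r for r, s in zip(ranks, signs) if s) >= s_plus
)
p_value = count / 2 ** len(ranks)

print(f"S+ = {s_plus}, one-sided P-value = {p_value:.4f}")
```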

4.c. For the given table of observed values,

i. Construct the corresponding table of expected values.
ii. If appropriate, perform the chi-square test for the null hypothesis that the row
and column outcomes are independent. If not appropriate, explain why.

(Note: Marks split-up: 3 + 3 = 6 marks)

4.d. A vendor claims that no more than 15% of the parts she supplies are
defective. Let p denote the actual proportion of parts that are defective. A test is
made of the hypotheses H0 : p ≤ 0.15 versus H1 : p > 0.15. For each of the
following situations, determine whether the decision was correct, a type I error
occurred, or a type II error occurred.

i. The claim is true, and H0 is rejected.
ii. The claim is false, and H0 is rejected.
iii. The claim is true, and H0 is not rejected.
iv. The claim is false, and H0 is not rejected.

Note: Each sub question carries 1 mark

5.a. Find the power of the 5% level test of H0: μ ≤ 85 versus H1: μ > 85 for the
mean yield of the new process under the alternative μ = 87, assuming n = 50 and
σ = 5.
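The power calculation in 5.a has two steps: find the rejection cutoff for the sample mean under H0, then find the probability of exceeding that cutoff when μ = 87. A minimal sketch, assuming the sample mean is normally distributed with standard deviation σ/√n:

```python
from math import sqrt
from statistics import NormalDist

mu0, mu_alt, sigma, n, alpha = 85, 87, 5, 50, 0.05

se = sigma / sqrt(n)  # standard deviation of the sample mean

# Step 1: reject H0 when the sample mean exceeds this critical value.
crit = mu0 + NormalDist().inv_cdf(1 - alpha) * se

# Step 2: power is the chance of exceeding the cutoff when mu = mu_alt.
power = 1 - NormalDist(mu_alt, se).cdf(crit)

print(f"critical value = {crit:.3f}, power = {power:.4f}")
```

The power comes out at roughly 0.88.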

5.b. The processing of raw coal involves “washing,” in which coal ash (nonorganic,
incombustible material) is removed. The article “Quantifying Sampling Precision
for Coal Ash Using Gy’s Discrete Model of the Fundamental Error” (Journal of
Coal Quality, 1989:33–39) provides data relating the percentage of ash to the
density of a coal particle. The average percentage ash for five densities of coal
particles was measured. The data are presented in the following table:

i. Construct a scatterplot of percent ash (y) versus density (x). Verify that a linear
model is appropriate. (2 marks)
ii. Compute the least-squares line for predicting percent ash from density. (3
marks)
iii. If two coal particles differed in density by 0.1 g/cm³, by how much would you
predict their percent ash to differ? (1 mark)
iv. Predict the percent ash for particles with density 1.40 g/cm³. (1 mark)
v. Compute the fitted values. (1 mark)
vi. Compute the residuals. Which point has the residual with the largest
magnitude? (1 mark)
vii. Compute the correlation between density and percent ash. (2 marks)
viii. Compute the regression sum of squares, the error sum of squares, and the
total sum of squares. (1 + 1 + 1 = 3 marks)
ix. Divide the regression sum of squares by the total sum of squares. What is the
relationship between this quantity and the correlation coefficient? (1 mark)
