Esa - 2022 - Ue21cs241a
1.a. A certain process for manufacturing integrated circuits has been in use for a
period of time, and it is known that 12% of the circuits it produces are defective.
A new process that is supposed to reduce the proportion of defectives is being
tested. In a simple random sample of 100 circuits produced by the new process,
12 were defective.
i. One of the engineers suggests that the test proves that the new process is no
better than the old process, since the proportion of defectives in the sample is
the same. Is this conclusion justified? Explain.
ii. Assume that there had been only 11 defective circuits in the sample of 100.
Would this have proven that the new process is better? Explain.
iii. Which outcome represents stronger evidence that the new process is better:
finding 11 defective circuits in the sample, or finding 2 defective circuits in the
sample?
iv. True or false: If the proportion of defectives in the sample is less than
12%, it is reasonable to conclude that the new process is better.
v. True or false: If the proportion of defectives in the sample is only slightly
less than 12%, the difference could well be due entirely to sampling variation,
and it is not reasonable to conclude that the new process is better.
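One way to reason about parts ii and iii is to ask how likely each outcome would be if the new process were no better than the old one. The sketch below assumes the number of defectives in the sample follows a Binomial(100, 0.12) distribution under that "no improvement" scenario; the helper name `binom_cdf` is illustrative and not part of the question.

```python
from math import comb


def binom_cdf(k: int, n: int, p: float) -> float:
    """Return P(X <= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k + 1))


n = 100        # sample size from the question
p_old = 0.12   # defective rate of the old process

# Probability of seeing this few defectives if the new process is no better.
p_at_most_11 = binom_cdf(11, n, p_old)
p_at_most_2 = binom_cdf(2, n, p_old)

print(f"P(X <= 11 | p = 0.12) = {p_at_most_11:.4f}")
print(f"P(X <=  2 | p = 0.12) = {p_at_most_2:.6f}")
```

A result that is common under the old defective rate (such as 11 defectives) is weak evidence of improvement, while a result that is very rare under it (such as 2 defectives) is much stronger evidence.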
1.b. (i) List any three steps involved in understanding, cleaning and preparing
your data. (3 marks)
(ii) List any two uses of Web Scraping. (2 marks)
9/21/23, 4:21 PM ESA - DEC - 2022 - UE21CS241A (set- 1)
1.c. A sample of 100 adult women was taken, and each was asked how many
children she had. The results were as follows:
1.d. Following are summary statistics for two data sets, A and B.
2.b. A light fixture holds two lightbulbs. Bulb A is of a type whose lifetime is
normally distributed with mean 800 hours and standard deviation 100 hours.
Bulb B has a lifetime that is normally distributed with mean 900 hours and
standard deviation 150 hours. Assume that the lifetimes of the bulbs
are independent. Another light fixture holds only one bulb. A bulb of type A is
installed, and when it burns out, a bulb of type B is installed. What is the
probability that the total lifetime of the two bulbs is more than 2000 hours?
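A possible approach to 2.b uses the fact that the sum of two independent normal random variables is normal, with the means and the variances added. The sketch below relies only on the values stated in the question; the variable names are illustrative.

```python
from math import sqrt
from statistics import NormalDist

# Lifetimes in hours, as given in the question.
mean_a, sd_a = 800.0, 100.0
mean_b, sd_b = 900.0, 150.0

# Total lifetime A + B of independent normals: means add, variances add.
total = NormalDist(mu=mean_a + mean_b, sigma=sqrt(sd_a**2 + sd_b**2))

p_total_over_2000 = 1.0 - total.cdf(2000.0)
print(f"Total ~ N({total.mean:.0f}, sd = {total.stdev:.2f})")
print(f"P(total > 2000) = {p_total_over_2000:.4f}")
```

Under these assumptions the total has mean 1700 hours and standard deviation of about 180.3 hours, so 2000 hours is roughly 1.66 standard deviations above the mean and the probability comes out near 0.048.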
2.c. Suppose 1,000 applicants show up for a job interview, but there are only 70
positions available. To select the best 70 people amongst the 1,000 applicants,
the employer gives an aptitude test to judge their abilities. The mean score on
the test is 60, with a standard deviation of 6. If an applicant scores an 84, can
they assume they are getting a job?
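A rough check for 2.c is to convert the score to a z-score and compare it with the score needed to be in the top 70 of 1,000. The question does not state the shape of the score distribution, so the cutoff below assumes approximately normal scores; the Chebyshev bound is included because it does not need that assumption. Variable names are illustrative.

```python
from statistics import NormalDist

applicants, positions = 1000, 70
mean_score, sd_score = 60.0, 6.0
score = 84.0

# How many standard deviations above the mean the applicant scored.
z = (score - mean_score) / sd_score

# Chebyshev: at most 1/z^2 of all scores lie z or more SDs from the mean,
# so at most this many applicants can have scored 84 or higher.
chebyshev_max_count = applicants / z**2

# ASSUMPTION: scores are roughly normal. Score needed for the top 7%.
cutoff = NormalDist(mean_score, sd_score).inv_cdf(1 - positions / applicants)

print(f"z = {z:.1f}, Chebyshev bound = {chebyshev_max_count:.1f} applicants")
print(f"Approximate top-70 cutoff under normality = {cutoff:.2f}")
```

A score of 84 is 4 standard deviations above the mean, well above the approximate cutoff of about 69 under normality, and the Chebyshev bound of about 62 applicants is already below the 70 available positions.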
3.a. Let X1, ..., Xn be a random sample from a N(μ, σ²) population. Find the
MLEs of μ and of σ. (5 + 5 =10 marks)
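For reference, the standard derivation for 3.a can be outlined as follows. It is a sketch of the usual textbook argument (maximize the log-likelihood in μ and σ), not necessarily the marking scheme for this paper.

```latex
\ell(\mu,\sigma) = -\frac{n}{2}\ln(2\pi) - n\ln\sigma
                   - \frac{1}{2\sigma^{2}}\sum_{i=1}^{n}(x_i-\mu)^{2}

\frac{\partial \ell}{\partial \mu}
  = \frac{1}{\sigma^{2}}\sum_{i=1}^{n}(x_i-\mu) = 0
  \quad\Longrightarrow\quad \hat{\mu} = \bar{x}

\frac{\partial \ell}{\partial \sigma}
  = -\frac{n}{\sigma} + \frac{1}{\sigma^{3}}\sum_{i=1}^{n}(x_i-\mu)^{2} = 0
  \quad\Longrightarrow\quad
  \hat{\sigma} = \sqrt{\frac{1}{n}\sum_{i=1}^{n}(x_i-\bar{x})^{2}}
```

Note that the MLE of σ divides by n rather than n − 1, so it differs from the usual sample standard deviation.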
3.b. Let X1 and X2 be independent, each with unknown mean μ and known
variance
4.a. An advertisement company claims that it receives 10% responses from its
mailing. To test this claim, a random sample of 500 were surveyed with 40
responses. Test at the α = 0.05 significance level.
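One way to carry out the test in 4.a is a large-sample z-test for a proportion. The sketch below assumes a two-sided alternative (H0: p = 0.10 versus H1: p ≠ 0.10), which the question does not state explicitly; a one-sided reading would halve the P-value.

```python
from math import sqrt
from statistics import NormalDist

n, responses = 500, 40   # sample size and number of responses
p0, alpha = 0.10, 0.05   # claimed response rate and significance level

p_hat = responses / n
se0 = sqrt(p0 * (1 - p0) / n)   # standard error computed under H0
z = (p_hat - p0) / se0

# ASSUMPTION: two-sided alternative, H1: p != 0.10.
p_value = 2 * NormalDist().cdf(-abs(z))
reject_h0 = p_value < alpha

print(f"p_hat = {p_hat:.3f}, z = {z:.4f}, P-value = {p_value:.4f}")
print("Reject H0" if reject_h0 else "Do not reject H0")
```

With these numbers z is about −1.49 and the two-sided P-value is about 0.136, so the claim is not rejected at the 5% level.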
4.b. The thicknesses of eight pads designed for use in aircraft engine mounts are
measured. The results, in mm, are 41.83, 41.01, 42.68, 41.37, 41.83, 40.50, 41.70,
and 41.42. Assume that the thicknesses are a sample from an approximately
symmetric distribution. Can you conclude that the mean thickness is greater
than 41 mm? Compute the appropriate test statistic and find the P-value.
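A sketch of the computation for 4.b, assuming a one-sample Student's t test of H0: μ ≤ 41 versus H1: μ > 41 is the intended method (this requires the population to be approximately normal). The standard library has no t distribution, so the upper-tail probability is approximated by Simpson's rule on the t density; `t_pdf` and `t_upper_tail` are illustrative helpers, not library functions.

```python
from math import gamma, pi, sqrt
from statistics import mean, stdev

thickness_mm = [41.83, 41.01, 42.68, 41.37, 41.83, 40.50, 41.70, 41.42]
mu0 = 41.0

n = len(thickness_mm)
df = n - 1
# stdev() uses the n - 1 divisor, as the t statistic requires.
t_stat = (mean(thickness_mm) - mu0) / (stdev(thickness_mm) / sqrt(n))


def t_pdf(x: float, nu: int) -> float:
    """Density of Student's t distribution with nu degrees of freedom."""
    c = gamma((nu + 1) / 2) / (sqrt(nu * pi) * gamma(nu / 2))
    return c * (1 + x * x / nu) ** (-(nu + 1) / 2)


def t_upper_tail(t: float, nu: int, steps: int = 2000) -> float:
    """Approximate P(T > t) for t >= 0 with Simpson's rule on [0, t]."""
    h = t / steps  # steps must be even for Simpson's rule
    area = t_pdf(0.0, nu) + t_pdf(t, nu)
    for i in range(1, steps):
        area += (4 if i % 2 else 2) * t_pdf(i * h, nu)
    return 0.5 - area * h / 3


p_value = t_upper_tail(t_stat, df)
print(f"t = {t_stat:.4f} with {df} df, one-sided P-value = {p_value:.4f}")
```

The statistic is about 2.38 on 7 degrees of freedom, which places the one-sided P-value between 0.01 and 0.025, consistent with what a t table gives.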
4.d. A vendor claims that no more than 15% of the parts she supplies are
defective. Let p denote the actual proportion of parts that are defective. A test is
made of the hypotheses H0 : p ≤ 0.15 versus H1 : p > 0.15. For each of the
following situations, determine whether the decision was correct, a type I error
occurred, or a type II error occurred.
5.a. Find the power of the 5% level test of H0: μ ≤ 85 versus H1: μ > 85 for the
mean yield of the new process under the alternative μ = 87, assuming n = 50 and
σ = 5.
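The power in 5.a can be computed in two steps: find the rejection cutoff for the sample mean under H0 at the boundary μ = 85, then find the probability of exceeding that cutoff when μ = 87. The sketch below assumes a one-sided z-test with known σ, as the question's wording suggests; variable names are illustrative.

```python
from math import sqrt
from statistics import NormalDist

mu0, mu_alt = 85.0, 87.0
sigma, n, alpha = 5.0, 50, 0.05

se = sigma / sqrt(n)   # standard deviation of the sample mean

# Reject H0 when the sample mean exceeds this cutoff.
critical_mean = mu0 + NormalDist().inv_cdf(1 - alpha) * se

# Power: probability of rejecting H0 when the true mean is mu_alt.
power = 1 - NormalDist(mu_alt, se).cdf(critical_mean)

print(f"Reject H0 if sample mean > {critical_mean:.3f}")
print(f"Power at mu = {mu_alt:.0f}: {power:.4f}")
```

Under these assumptions the cutoff is about 86.16 and the power is about 0.88.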
5.b. The processing of raw coal involves “washing,” in which coal ash (nonorganic,
incombustible material) is removed. The article “Quantifying Sampling Precision
for Coal Ash Using Gy’s Discrete Model of the Fundamental Error” (Journal of
Coal Quality, 1989:33–39) provides data relating the percentage of ash to the
density of a coal particle. The average percentage ash for five densities of coal
particles was measured. The data are presented in the following table:
i. Construct a scatterplot of percent ash (y) versus density (x). Verify that a linear
model is appropriate. (2 marks)
ii. Compute the least-squares line for predicting percent ash from density. (3
marks)
iii. If two coal particles differed in density by 0.1 g/cm³, by how much would you
predict their percent ash to differ? (1 mark)
iv. Predict the percent ash for particles with density 1.40 g/cm³. (1 mark)
v. Compute the fitted values. (1 mark)
vi. Compute the residuals. Which point has the residual with the largest
magnitude? (1 mark)
vii. Compute the correlation between density and percent ash. (2 marks)
viii. Compute the regression sum of squares, the error sum of squares, and the
total sum of squares. (1 + 1 + 1 = 3 marks)
ix. Divide the regression sum of squares by the total sum of squares. What is the
relationship between this quantity and the correlation coefficient? (1 mark)