11 The Normal Distribution

image

 

image  PROPERTIES

image  USING TABLES

image  A MODEL FOR MEASUREMENT

image  COMMONLY USED PROBABILITIES AND Z-SCORES

image  FINDING MEANS AND STANDARD DEVIATIONS

image  NORMAL APPROXIMATION TO THE BINOMIAL

image  CHECKING NORMALITY

 

Some of the most useful probability distributions are symmetric and bell-shaped:

image

In this topic we study one such distribution called the normal distribution (not all symmetric, unimodal distributions are normal). The normal distribution is valuable in describing various natural phenomena, especially those involving growth or decay. However, the real importance of the normal distribution in statistics is that it can be used to describe the results of sampling procedures.

The normal distribution can be viewed as a limiting case of the binomial. More specifically, we start with any fixed probability of success p and consider what happens as n becomes arbitrarily large. To obtain a visual representation of the limit, we draw histograms using an increasingly large n for each fixed p.

image

image

For greater clarity, different scales have been used for the histograms associated with different values of n. From the diagrams above it is reasonable to accept that as n becomes larger without bound, the resulting histograms approach a smooth bell-shaped curve. This is the curve associated with the normal distribution.

PROPERTIES OF THE NORMAL DISTRIBUTION

The normal distribution curve is bell-shaped and symmetric and has an infinite base. Long, flat-looking tails cover many values but only a small proportion of the area. The flat appearance of the tails is deceptive. Actually, far out in the tails, the curve drops proportionately at an ever-increasing rate. In other words, when two intervals of equal length are compared, the one closer to the center may experience a greater numerical drop, but the one further out in the tail experiences a greater drop when measured as a proportion of the height at the beginning of the interval.

The mean here is the same as the median and is located at the center. We want a unit of measurement that applies equally well to any normal distribution, and we choose a unit that arises naturally out of the curve’s shape. There is a point on each side where the slope is steepest. These two points are called points of inflection, and the distance from the mean to either point is precisely equal to one standard deviation. Thus, it is convenient to measure distances under the normal curve in terms of z-scores (recall from Topic 2 that z-scores are fractions or multiples of standard deviations from the mean).

image

image

TIP

The notation N(µ,image) stands for a normal model with mean µ and standard deviation image.

Mathematically, the formula for the normal curve turns out to be y = e z2/2, where y is the relative height above z-score. (By relative height we mean proportion of the height above the mean.) However, our interest is not so much in relative heights under the normal curve as it is in proportionate areas. The probability associated with any interval under the normal curve is equal to the proportionate area found under the curve and above the interval.

A useful property of the normal curve is that approximately 68% of the area (and thus 68% of the observations) falls within one standard deviation of the mean, approximately 95% of the area falls within two standard deviations of the mean, and approximately 99.7% of the area falls within three standard deviations of the mean.

image

USING TABLES OF THE NORMAL DISTRIBUTION

Table A in the Appendix shows areas under the normal curve. Specifically, this table shows the area to the left of a given point. It shows, for example, that to the left of a z-score of 1.2 there is 0.8849 of the area:

image

Because the total area is 1, the area to the right of a point can be calculated by calculating 1 minus the area to the left of the point. For example, the area to the right of a z-score of – 2.13 is 1 – 0.0166 = 0.9834. (Alternatively, by symmetry, we could have said that the area to the right of –2.13 is the same as that to the left of +2.13 and then read the answer directly from Table A.)

image

The area between two points can be found by subtraction. For example, the area between the z-scores of 1.23 and 2.71 is equal to 0.9966 – 0.8907 = 0.1059.

image

Given a probability, we can use Table A in reverse to find the appropriate z-score. For example, to find the z-score that has an area of 0.8982 to the left of it, we search for the probability closest to 0.8982 in the bulk of the table and read off the corresponding z-score. In this case we find 0.8980 with the z-score 1.27.

image

To find the z-score with an area of 0.72 to the right of it, we look for 1– 0.72 = 0.28 in the bulk of the table. The probability 0.2810 corresponds to the z-score –0.58.

USING A CALCULATOR WITH AREAS UNDER A NORMAL CURVE

On the TI-84, normalcdf(lowerbound, upperbound) gives the area (probability) between two z-scores, while invNorm(area) gives the z-score with the given area to its left. So we could have calculated above:

  normalcdf(–100, 1.2) = 0.8849

  normalcdf(–2.13, 100) = 0.9834

  normalcdf(1.23, 2.71) = 0.1060

  invNorm(.8982) = 1.271

  invNorm(.28) = –0.5828

The TI-84 also has the capability of working directly with raw scores instead of z-scores. In this case, the mean and standard deviation must be given:

  normalcdf(lowerbound, upperbound, mean, SD)

  invNorm(area, mean, SD)

image

THE NORMAL DISTRIBUTION AS A MODEL FOR MEASUREMENT

In solving a problem using the normal distribution as a model, drawing a picture of a normal curve with horizontal lines showing raw scores and z-scores is usually helpful.


imageEXAMPLE 11.1

The life expectancy of a particular brand of lightbulb is normally distributed with a mean of 1500 hours and a standard deviation of 75 hours.

TIP

Don’t use the normal model if the distribution isn’t symmetric and unimodal.

image

a.  What is the probability that a lightbulb will last less than 1410 hours?

TIP

Draw a picture!

Answer: The z-score of 1410 is image = –1.2, and from
Table A the probability to the left of –1.2 is 0.1151. [On the TI-84, normalcdf(0, 1410, 1500, 75) = 0.1151 and normalcdf(–10, –1.2) = 0.1151.]

image

b.  What is the probability that a lightbulb will last between 1563 and 1648 hours?

Answer: The z-score of 1563 is image and the z-score of 1648 is image In Table A, 0.84 and 1.97 give probabilities of 0.7995 and 0.9756, respectively. Thus between 1563 and 1648 there is a probability of 0.9756 – 0.7995 = 0.1761. [normalcdf(1563, 1648, 1500, 75) = 0.1762 and normalcdf(.84, 1.97) = 0.1760.]

image

c.  What is the probability that a lightbulb will last between 1416 and 1677 hours?

Answer: The z-score of 1416 is image and the z-score of 1677 is image In Table A, –1.12 gives a probability of 0.1314, and 2.36 gives a probability of 0.9909. Thus between 1416 and 1677 there is a probability of 0.9909 – 0.1314 = 0.8595. [normalcdf(1416, 1677, 1500, 75) = 0.8595 and normalcdf(–1.12, 2.36) = 0.8595.]

image


imageEXAMPLE 11.2

A packing machine is set to fill a cardboard box with a mean of 16.1 ounces of cereal. Suppose the amounts per box form a normal distribution with a standard deviation equal to 0.04 ounce.

image

a.  What percentage of the boxes will end up with at least 1 pound of cereal?

Answer: The z-score of 16 is image and –2.5 in Table A gives a probability of 0.0062. Thus the probability of having more than 1 pound in a box is 1 – 0.0062 = 0.9938 or 99.38%. [normalcdf(16, 1000, 16.10, .04) = 0.9938 and normalcdf(–2.5, 10) = 0.9938.]

image

b.  Ten percent of the boxes will contain less than what number of ounces?

Answer: In Table A, we note that .1 area (actually 0.1003) is found to the left of a –1.28 z-score. [On the TI-84, invNorm(.1) = –1.2816.] Converting the z-score of –1.28 into a raw score yields image ounces. [Or directly on the TI-84, invNorm(.1, 16.10, .04) = 16.049.]

image

c.  Eighty percent of the boxes will contain more than what number of ounces?

Answer: In Table A, we note that 1 – 0.8 = 0.2 area (actually 0.2005) is found to the left of a z-score of –0.84, and so to the right of a –0.84 z-score must be 80% of the area. [Or invNorm(.2) = 0.8416.] Converting the z-score of –0.84 into a raw score yields 16.1 – 0.84(0.04) = 16.066 ounces. [Or directly on the TI-84, invNorm(.2, 16.10, .04) = 16.066.]

image

d.  The middle 90% of the boxes will be between what two weights?

Answer: Ninety percent in the middle leaves five percent in each tail. In Table A, we note that .0505 area is found to the left of a z-score of –1.64, while 0.495 area is found to the left of a z-score of –1.65, and so we use the z-score of –1.645. Then 90% of the area is between z-scores of –1.645 and 1.645. [Or invNorm(.05) = –1.6449 and invNorm(.95) = 1.6449.] Converting z-scores to raw scores yields 16.1 – 1.645(0.04) = 16.034 ounces and 16.1 + 1.645(0.04) = 16.166 ounces, respectively, for the two weights between which we will find the middle 90% of the boxes. [invNorm(.05, 16.10, .04) = 16.034 and invNorm(.95, 16.10, .04) = 16.166.]

image


COMMONLY USED PROBABILITIES AND Z-SCORES

As can be seen from Example 11.2d, there is often an interest in the limits enclosing some specified middle percentage of the data. For future reference, the limits most frequently asked for are noted below in terms of z-scores.

image

Ninety percent of the values are between z-scores of –1.645 and +1.645, 95% of the values are between z-scores of –1.96 and +1.96, and 99% of the values are between z-scores of –2.576 and +2.576.

Sometimes the interest is in values with particular percentile rankings. For example,

image

Ninety percent of the values are below a z-score of 1.282, 95% of the values are below a z-score of 1.645, and 99% of the values are below a z-score of 2.326.

There are corresponding conclusions for negative z-scores:

image

Ninety percent of the values are above a z-score of –1.282, 95% of the values are above a z-score of –1.645, and 99% of the values are above a z-score of –2.326.

It is also useful to note the percentages corresponding to values falling between integer z-scores. For example,

image

Note that 68.26% of the values are between z-scores of –1 and +1, 95.44% of the values are between z-scores of –2 and +2, and 99.74% of the values are between z-scores of –3 and +3.


imageEXAMPLE 11.3

Suppose that the average height of adult males in a particular locality is 70 inches with a standard deviation of 2.5 inches.

a.   If the distribution is normal, the middle 95% of males are between what two heights?

Answer: As noted above, the critical z-scores in this case are ±1.96, and so the two limiting heights are 1.96 standard deviations from the mean. Therefore, 70 ± 1.96(2.5) = 70 ± 4.9, or from 65.1 to 74.9 inches.

b.  Ninety percent of the heights are below what value?

Answer: The critical z-score is 1.282, and so the value in question is 70 + 1.282(2.5) = 73.205 inches.

c.  Ninety-nine percent of the heights are above what value?

Answer: The critical z-score is –2.326, and so the value in question is 70 – 2.326(2.5) = 64.185 inches.

d.  What percentage of the heights are between z-scores of ±1? Of ±2? Of ±3?

Answer: 68.26%, 95.44%, and 99.74%, respectively.


FINDING MEANS AND STANDARD DEVIATIONS

If we know that a distribution is normal, we can calculate the mean µ and the standard deviation image using percentage information from the population.


imageEXAMPLE 11.4

Given a normal distribution with a mean of 25, what is the standard deviation if 18% of the values are above 29?

Answer: Looking for a 0.82 probability in Table A, we note that the corresponding z-score is 0.92. Thus 29 – 25 = 4 is equal to a standard deviation of 0.92, that is, 0.92image = 4, and image

image


 


imageEXAMPLE 11.5

Given a normal distribution with a standard deviation of 10, what is the mean if 21% of the values are below 50?

Answer: Looking for a 0.21 probability in Table A leads us to a z-score of –0.81. Thus 50 is –0.81 standard deviation from the mean, and so µ = 50 + 0.81(10) = 58.1.

image


 


imageEXAMPLE 11.6

Given a normal distribution with 80% of the values above 125 and 90% of the values above 110, what are the mean and standard deviation?

Answer: Table A gives critical z-scores of –0.84 and –1.28. Thus we have image Solving the system {125 – µ = –0.84image, 110 – µ = –1.28image} simultaneously gives µ = 153.64 and image = 34.09.

image


NORMAL APPROXIMATION TO THE BINOMIAL

Many practical applications of the binomial involve examples in which n is large. However, for large n, binomial probabilities can be quite tedious to calculate. Since the normal can be viewed as a limiting case of the binomial, it is natural to use the normal to approximate the binomial in appropriate situations.

TIP

When n is small, you cannot use the normal model to approximate the binomial model.

The binomial takes values only at integers, while the normal is continuous with probabilities corresponding to areas over intervals. Therefore, we establish a technique for converting from one distribution to the other. For approximation purposes we do as follows. Each binomial probability corresponds to the normal probability over a unit interval centered at the desired value. Thus, for example, to approximate the binomial probability of eight successes we determine the normal probability of being between 7.5 and 8.5.


imageEXAMPLE 11.7

Suppose that 15% of the cars coming out of an assembly plant have some defect. In a delivery of 40 cars what is the probability that exactly 5 cars have defects?

Answer: The actual answer is image but clearly this involves a nontrivial calculation. [If one has a calculator such as the TI-84, then binompdf(40, 0.15, 5) = 0.1692.] To approximate the answer using the normal, we first calculate the mean µ and the standard deviation image as follows:

image

We then calculate the appropriate z-scores: image

Looking up the corresponding probabilities in Table A, we obtain a final answer of 0.4129 – 0.2546 = 0.1583. (The actual answer is 0.1692.)

image


Even more useful are approximations relating to probabilities over intervals.

image


imageEXAMPLE 11.8

If 60% of the population support massive federal budget cuts, what is the probability that in a survey of 250 people at most 155 people support such cuts?

Answer: The actual answer is the sum of 156 binomial expressions:

image

However, a good approximation can be obtained quickly and easily by using the normal. We calculate µ and image:

image

The binomial of at most 155 successes corresponds to the normal probability of ≤155.5.

The z-score of 155.5 is image Using Table A leads us to a final answer of 0.7611. [Or binomcdf(250, .6, 155) = 0.7605.]

image


Is the normal a good approximation? The answer, of course, depends on the error tolerances in particular situations. A general rule of thumb is that the normal is a good approximation to the binomial whenever both np and n(1 – p) are greater than 10. (Some authors use 5 instead of 10 here.)

CHECKING NORMALITY

In the examples in this Topic we have assumed the population has a normal distribution. When you collect your own data, before you can apply the procedures developed above, you must decide whether it is reasonable to assume the data come from a normal population. In later Topics we will have to check for normality before applying certain inference procedures.

The initial check should be to draw a picture. Dotplots, stemplots, boxplots, and histograms are all useful graphical displays to show that data is unimodal and roughly symmetric.


imageEXAMPLE 11.9

The ages at inauguration of U.S. presidents from Washington to G. W. Bush were: {57, 61, 57, 57, 58, 57, 61, 54, 68, 51, 49, 64, 50, 48, 65, 52, 56, 46, 54, 49, 51, 47, 55, 55, 54, 42, 51, 56, 55, 51, 54, 51, 60, 61, 43, 55, 56, 61, 52, 69, 64, 46, 54}. Can we conclude that the distribution is roughly normal?

Answer: Entering the 43 data points in a graphing calculator gives the following histogram:

image

Alternatively, we could have used a dotplot, stemplot, or boxplot:

image

All the graphical displays indicate a distribution that is roughly unimodal and symmetric.


A more specialized graphical display to check normality is the normal probability plot. When the data distribution is roughly normal, the plot is roughly a diagonal straight line. While this plot more clearly shows deviations from normality, it is not as easy to understand as a histogram. The normal probability plot is difficult to calculate by hand; however, technology such as the TI-84 readily plots the graph. (On the TI-84, in STATPLOT the sixth choice in TYPE gives the normal probability plot.)

The normal probability plot for the data in Example 11.9 is:

image

A graph this close to a diagonal straight line indicates that the data have a distribution very close to normal.

Note that alternatively, one could have plotted the data (in this case, inaugural ages) on the x-axis and the normal scores on the y-axis.

SUMMARY

image  The normal distribution is one particular symmetric, unimodal distribution.

image  Drawing pictures of normal curves with horizontal lines showing raw scores and z-scores is usually helpful.

image  Areas (probabilities) under a normal curve can be found using the given table or calculator functions, such as normalcdf on TI calculators.

image  Critical values corresponding to given probabilities can be found using the given table or calculator functions, such as invNorm on TI calculators.

image  The normal probability plot is very useful in gauging whether a given distribution of data is approximately normal: If the plot is nearly straight, the data is nearly normal.

QUESTIONS ON TOPIC ELEVEN: THE NORMAL DISTRIBUTION

Multiple-Choice Questions

Directions: The questions or incomplete statements that follow are each followed by five suggested answers or completions. Choose the response that best answers the question or completes the statement.

1.  Which of the following is a true statement?

(A)  The area under a normal curve is always equal to 1, no matter what the mean and standard deviation are.

(B)  All bell-shaped curves are normal distributions for some choice of µ and image.

(C)  The smaller the standard deviation of a normal curve, the lower and more spread out the graph.

(D)  Depending upon the value of the standard deviation, normal curves with different means may be centered around the same number.

(E)  Depending upon the value of the standard deviation, the mean and median of a particular normal distribution may be different.

2.  Which of the following is a true statement?

(A)  The area under the standard normal curve between 0 and 2 is twice the area between 0 and 1.

(B)  The area under the standard normal curve between 0 and 2 is half the area between –2 and 2.

(C)  For the standard normal curve, the interquartile range is approximately 3.

(D)  For the standard normal curve, the range is 6.

(E)  For the standard normal curve, the area to the left of 0.1 is the same as the area to the right of 0.9.

3.  Populations P1 and P2 are normally distributed and have identical means. However, the standard deviation of P1 is twice the standard deviation of P2. What can be said about the percentage of observations falling within two standard deviations of the mean for each population?

(A)  The percentage for P1 is twice the percentage for P2.

(B)  The percentage for P1 is greater, but not twice as great, as the percentage for P2.

(C)  The percentage for P2 is twice the percentage for P1.

(D)  The percentage for P2 is greater, but not twice as great, as the percentage for P1.

(E)  The percentages are identical.

4.  Consider the following two normal curves:

image

Which has the larger mean and which has the larger standard deviation?

(A)  Larger mean, a; larger standard deviation, a

(B)  Larger mean, a; larger standard deviation, b

(C)  Larger mean, b; larger standard deviation, a

(D)  Larger mean, b; larger standard deviation, b

(E)  Larger mean, b; same standard deviation

In Questions 5–15, assume the given distributions are normal.

5.  A trucking firm determines that its fleet of trucks averages a mean of 12.4 miles per gallon with a standard deviation of 1.2 miles per gallon on cross-country hauls. What is the probability that one of the trucks averages fewer than 10 miles per gallon?

(A)  0.0082

(B)  0.0228

(C)  0.4772

(D)  0.5228

(E)  0.9772

6.  A factory dumps an average of 2.43 tons of pollutants into a river every week. If the standard deviation is 0.88 tons, what is the probability that in a week more than 3 tons are dumped?

(A)  0.2578

(B)  0.2843

(C)  0.6500

(D)  0.7157

(E)  0.7422

7.  An electronic product takes an average of 3.4 hours to move through an assembly line. If the standard deviation is 0.5 hour, what is the probability that an item will take between 3 and 4 hours?

(A)  0.2119

(B)  0.2295

(C)  0.3270

(D)  0.3811

(E)  0.6730

8.  The mean score on a college placement exam is 500 with a standard deviation of 100. Ninety-five percent of the test takers score above what?

(A)  260

(B)  336

(C)  405

(D)  414

(E)  664

9.  The average noise level in a restaurant is 30 decibels with a standard deviation of 4 decibels. Ninety-nine percent of the time it is below what value?

(A)  20.7

(B)  32.0

(C)  33.4

(D)  37.8

(E)  39.3

10.  The mean income per household in a certain state is $9500 with a standard deviation of $1750. The middle 95% of incomes are between what two values?

(A)  $5422 and $13,578

(B)  $6070 and $12,930

(C)  $6621 and $12,379

(D)  $7260 and $11,740

(E)  $8049 and $10,951

11.  One company produces movie trailers with mean 150 seconds and standard deviation 40 seconds, while a second company produces trailers with mean 120 seconds and standard deviation 30 seconds. What is the probability that two randomly selected trailers, one produced by each company, will combine to less than three minutes?

(A)  0.000

(B)  0.036

(C)  0.099

(D)  0.180

(E)  0.405

12.  Jay Olshansky from the University of Chicago was quoted in Chance News as arguing that for the average life expectancy to reach 100, 18% of people would have to live to 120. What standard deviation is he assuming for this statement to make sense?

(A)  21.7

(B)  24.4

(C)  25.2

(D)  35.0

(E)  111.1

13.  Cucumbers grown on a certain farm have weights with a standard deviation of 2 ounces. What is the mean weight if 85% of the cucumbers weigh less than 16 ounces?

(A)  13.92

(B)  14.30

(C)  14.40

(D)  14.88

(E)  15.70

14.  If 75% of all families spend more than $75 weekly for food, while 15% spend more than $150, what is the mean weekly expenditure and what is the standard deviation?

(A)  µ = 83.33, image = 12.44

(B)  µ = 56.26, image = 11.85

(C)  µ = 118.52, image = 56.26

(D)  µ = 104.39, image = 43.86

(E)  µ = 139.45, image = 83.33

15.  A coffee machine can be adjusted to deliver any fixed number of ounces of coffee. If the machine has a standard deviation in delivery equal to 0.4 ounce, what should be the mean setting so that an 8-ounce cup will overflow only 0.5% of the time?

(A)  6.97 ounces

(B)  7.22 ounces

(C)  7.34 ounces

(D)  7.80 ounces

(E)  9.03 ounces

16.  Assume that a baseball team has an average pitcher, that is, one whose probability of winning any decision is 0.5. If this pitcher has 30 decisions in a season, what is the probability that he will win at least 20 games?

(A)  0.0505

(B)  0.2514

(C)  0.2743

(D)  0.3333

(E)  0.4300

17.  Given that 10% of the nails made using a certain manufacturing process have a length less than 2.48 inches, while 5% have a length greater than 2.54 inches, what are the mean and standard deviation of the lengths of the nails? Assume that the lengths have a normal distribution.

(A)  µ = 2.506, image = 0.0205

(B)  µ = 2.506, image = 0.0410

(C)  µ = 2.516, image = 0.0825

(D)  µ = 2.516, image = 0.1653

(E)  The mean and standard deviation cannot be computed from the information given.

Free-Response Questions

Directions: You must show all work and indicate the methods you use. You will be graded on the correctness of your methods and on the accuracy of your final answers.

TWO OPEN-ENDED QUESTIONS

1.  The time it takes Steve to walk to school follows a normal distribution with mean 30 minutes and standard deviation 5 minutes, while the time it takes Jan to walk to school follows a normal distribution with mean 25 minutes and standard deviation 4 minutes. Assume their walking times are independent of each other.

(a)  If they leave at the same time, what is the probability that Steve arrives before Jan?

(b)  How much earlier than Jan should Steve leave so that he has a 90% chance of arriving before Jan?

2.  (a)  Two components are in series so the failure of either will cause the system to fail. The time to failure for a new component is normally distributed with a mean of 3000 hours and a standard deviation of 400 hours. One of the components has already run for 2500 hours, while the other has run for 2800 hours. Assuming component failures are independent, what is the probability the system survives for 10 more days (240 hours)?

(b)  If the components are put in parallel, the system will fail only if both components fail. If the two components in (a) are put in parallel, what is the probability that the system survives for 10 more days?

TWO INVESTIGATIVE TASKS

1.  When Michael Jordan came up to try for a sixth free throw after having made five straight free throws, the announcer commented that the law of averages would be working against Jordan.

(a)  What did the announcer mean? Was this a correct interpretation of probability? Explain.

(b)  Jordan makes 90% of his free throws. What is the probability that he will make six straight free throws? That he will make five straight free throws and then miss the next? That he will make the next free throw given that he has made the last five in a row?

(c)  How could you set up a simulation using a random number table to analyze the situation the announcer was commenting on?

(d)  If Jordan shoots six times from the free throw line, what are the mean and standard deviation for the number of shots he is expected to make?

(e)  If Jordan makes only 35 out of 40 free throw tries during the playoffs, is this sufficient evidence that the probability of his making a free throw is really below .90? Explain.

2.  Many people have lactose intolerance leading to cramps and diarrhea when they eat dairy products. Substantial relief can be obtained by taking dietary supplements of lactase, an enzyme. One product consists of caplets containing a mean of 9000 FCC lactase units with a standard deviation of 590 units and a normal distribution of lactase units. People with severe lactose intolerance may need to take two such tablets whenever they eat dairy products. Tablets with under 8500 FCC lactase units can produce noticeably less relief.

(a)  Determine the probability that a caplet has less than 8500 FCC lactase units. Round off to the nearest tenth.

(b)  Using the random number table below, run five simulations for the number of tablets a lab technician samples before finding two with less than 8500 units each.

    84177   06757   17613   15582   51506   81435   41050   92031   06449   05059

    59884   31180   53115   84469   94868   57967   05811   84514   75011   13006

    63395   55041   15866   06589   13119   71020   85940   91932   06488   74987

    54355   52704   90359   02649   47496   71567   94268   08844   26294   64759

    08989   57024   97284   00637   89283   03514   59195   07635   03309   72605

    29357   23737   67881   03668   33876   35841   52869   23114   15864   38942

(c)  The results of two 100-trial simulations, one looking for two tablets each less than 8500 units and one looking for two tablets each greater than 9000 units, are shown below. Which distribution is which? Explain your answer.

image

(d)  Using the correct barplot above, estimate the expected number of caplets a laboratory will sample before finding two with less than 8500 units each.