STATISTICS FOR PEOPLE WHO ARE SURE THEY CANT LEARN STATISTICS
T he following is aimed at the liberal arts graduate who has not taken a math course since high school and knows nothing whatsoever about statistics but wants to understand the statistical terms in the text.
DISTRIBUTIONS AND STANDARD DEVIATIONS
Why Do We Need Standard Deviation?
Every day, formally or informally, people make comparisonsamong people, among apples and oranges, among dairy cows or egg-laying hens, among the screws being coughed out by a screw machine. The standard deviation is a measure of how spread out the things being compared are. This egg is a lot bigger than average, a chicken farmer might say. The standard deviation gives him a way of saying precisely what he means by a lot.
What Is a Frequency Distribution?
To get a clear idea of what a frequency distribution is, imagine yourself back in your high school gym, with all the boys in the senior class assembled before you (including both sexes would complicate matters, and the main point of this discussion is to keep things simple). Line up these boys from left to right in order of height.
Now you have a long line going from shortest to tallest. As you look along the line you will see that only a few boys are conspicuously short and tall. Most are in the middle, and a lot of them seem identical in height. Is there any way to get a better idea of how this pattern looks?
Tape a series of cards to the floor in a straight line from left to right, with 60 inches and shorter written on the one at the far left, 80 inches and taller on the card at the far right, and cards in one-inch increments in between. Tell everyone to stand behind the card that corresponds to his height.
Someone loops a rope over the rafters and pulls you up in the air so you can look straight down on the tops of the heads of your classmates standing in their single files behind the height labels. The figure below shows what you see: a frequency distribution.
The raw material of a frequency distribution
![This a frequency distribution What good is it Looking at your high school - photo 2](/uploads/posts/book/113267/graph482.jpg)
This a frequency distribution. What good is it? Looking at your high school classmates standing around in a mob, you can tell very little about their height. Looking at those same classmates arranged into a frequency distribution, you can tell a lot, quickly and memorably.
How Is the Distribution Related to the Standard Deviation?
We still lack a convenient way of expressing where people are in that distribution. What does it mean to say that two different students are, say, 6 inches different in height? How big is a 6-inch difference? That brings us back to the standard deviation.
When it comes to high school students, you have a good idea of how big a 6-inch difference is. But what does a 6-inch difference mean if you are talking about the height of elephants? About the height of cats? It depends. And the things it depends on are the average height and how much height varies among the things you are measuring. A standard deviation gives you a way of taking both the average and that variability into account, so that 6 inches can be expressed in a way that means the same thing for high school students relative to other high school students, elephants relative to other elephants, and cats relative to other cats.
How Do You Compute a Standard Deviation?
Suppose that your high school class consisted of just two people who were 66 inches and 70 inches. Obviously, the average is 68 inches. Just as obviously, one person is 2 inches shorter than average, one person is 2 inches taller than average. The standard deviation is a kind of average of the differences from the mean2 inches, in this example. Suppose you add two more people to the class, one who is 64 inches and the other who is 72 inches. The mean hasnt changed (the two new people balance each other off exactly). But the newcomers are each 4 inches different from the average height of 68 inches. So the standard deviation, which measures the spread, has gotten bigger as well. Now two people are 4 inches different from the average and two people are 2 inches different from the average. That adds up to a total of 12 inches, divided among four persons. The simple average of these differences from the mean is three inches (124), which is almost (but not quite) what the standard deviation is. To be precise, the standard deviation is calculated by squaring the deviations from the mean, then summing them, then finding their average, then taking the square root of the result. In this example, two people are 4 inches from the mean and two are 2 inches from the mean. The sum of the squared deviations is 40 (i.e., 16+16+4+4). Their average is 10 (404). The square root of 10 is 3.16, which is the standard deviation for this example. The technical reasons for using the standard deviation instead of the simple average of the deviations from the mean are not necessary to go into, except that, in normal distributions, the standard deviation has wonderfully convenient properties. If you are looking for a short, easy way to think of a standard deviation, view it as the average difference from the mean.
As an example of how a standard deviation can be used to compare apples and oranges, suppose we are looking at the Olympic womens gymnastics team and mens NBA basketball teams. You notice a woman who is 5 feet 6 inches and a man who is 7 feet. You know from watching gymnastics on television that 5 feet 6 inches is tall for a woman gymnast, and 7 feet is tall even for a basketball player. But you want to do better than a general impression. Just how unusual is the woman, compared to the average gymnast on the U. S. womens team, and how unusual is the man, compared to the average basketball player on the U. S. mens team?
We gather data on height among all the women gymnasts, and determine that the mean is 5 feet 1 inch with a standard deviation (SD) of 2 inches. For the men basketball players, we find that the mean is 6 feet 6 inches and the SD is 4 inches. Thus the woman who is 5 feet 6 inches is 2.5 standard deviations taller than the average; the seven-foot man is only 1.5 standard deviations taller than the average. These numbers2.5 for the woman and 1.5 for the manare called standard scores in statistical jargon. Now we have an explicit numerical way to compare how different the two people are from their respective averages, and we have a basis for concluding that the woman who is 5 feet 6 inches is a lot taller relative to other female Olympic gymnasts than a 7-foot man is relative to other NBA basketball players.