Example: In a family with four children, the probabilities of 0, 1, 2, 3, or 4 girls will be binomially distributed if the probability of a girl on each birth is the same, regardless of the number and gender of previous births.
If p denotes the probability of the event occurring on any particular trial, and n denotes the number of trials, then the number of trials on which the event occurs has:
mean = np
standard deviation = sqrt(np(1-p))
This is a symmetrical, bell-shaped distribution, with two parameters, the mean, mu, and the standard deviation, sigma, whose values are 0 and 1 respectively in the case called a standard normal distribution.
"Normal" in this context is a technical term, with a technical meaning different from its everyday usage. You should not assume that this shape of distribution is "normal" in the everyday sense of that word, and that other shapes are "abnormal". Indeed, for this reason some authors have abandoned the term, Normal Distribution, and instead call it the Gaussian Distribution. These are two names for the same distribution.
The Central Limit Theorem states conditions under which a sequence of distributions of partial sums converges to a normal distribution.
Remark: Feller (1957, pp. 238-241), who along with Lindeberg was a leading probabilist examining necessary and sufficient conditions for convergence to normality, has a section on "variable distributions", in which he points out that the various terms of the sum actually do not need to be either independent or identically distributed, although those conditions do make the task of proving theorems more manageable. In applying the central limit theorem to biometric measurements, such as height, Feller remarks, "It is true that not all of the (terms) are mutually independent. However, the central limit theorem holds also for large classes of dependent variables, and, besides, it is plausible that the great majority of the (terms) can be treated as independent".
Example: Employees' current salaries are the salaries they had a decade ago, modified by successive pay raises. An egalitarian employer might consider awarding raises in similar dollar amounts, regardless of the employees' previous salaries. In some years $800 raises might be about average, with some getting more, some less; in other years $1200 raises might be about average, with some getting more, some less; etc. Thus after two such raises an employee who began at $40,000 would have a new salary in the vicinity of (40,000 + 800 + 1200), and similarly for subsequent years.
Example: Employees' current salaries are the salaries they had a decade ago, modified by successive pay raises. Rather than giving similar dollar amounts, most employers would give similar percentage raises. In some years 5% raises might be about average, with some getting more, some less; in other years 3% raises might be about average, with some getting more, some less; etc. Thus after two such raises an employee who began at $40,000 would have a new salary in the vicinity of ($40,000)(1.05)(1.03), and similarly for subsequent years.
If some variable X is lognormally distributed, then log(X) is normally distributed. This helps explain why researchers often transform variables, for example to analyze log(income) rather than income itself.
A lognormal distribution has two parameters, usually expressed in terms of the mean and standard deviation of the corresponding normal distribution.
Example: In calculating the distribution of the number of girls
in a two-child family, we would ordinarily assume p=1/2
throughout, for each family and regardless of the sex of any
previous children in that family. This would yield:
P(0 girls) = 1/4, P(1 girl) = 1/2, P(2 girls) = 1/4
which has a single mode, at 1 girl.
Suppose, instead, that there are two types of couples,
boy-prone and girl-prone, with P(girl|boy-prone) = .1,
P(boy-prone) = .5, P(girl|girl-prone) = .9, P(girl-prone) = .5.
Then applying the total probability rule we could calculate
P(girl) = .5, the same as before. However, the distribution in two-child
families would be:
P(0 girls) = .41, P(1 girl) = .18, P(2 girls) = .41.
Note that this is not only different, but also bimodal.
Empirically, there may be some heterogeneity of this
sort, with some couples slightly more boy-prone and others
slightly more girl-prone, but it is nowhere near as extreme as in
this numerical example.
Ordinarily the investigator will only select a single sample, not a large number of replications, as suggested in the imagery of sampling distributions. However, the investigator has no control over which of the outcomes in the sampling distribution he or she happens to get. Thus the strategy is to arrange the sampling distribution, which can be controlled, in such a manner that the vast majority of the possible samples would, if they happened to be the one actually selected, yield suitably accurate inferences about the population being sampled.
Specifically, if a particular statistic in the sample is to be used as an estimate of a parameter in the population, one would like:
Both frequentists and Bayesians use interval estimates, but they use somewhat different ways of describing them.
Example: Find a confidence interval for the proportion of all voters favoring a particular measure, based on the proportion of respondents in a sample favoring it. The sample proportion is an unbiased estimate of the population proportion, and it has a standard error of sqrt[p(1-p)/n]. In a sample of n=400, if p has a value near .8, this would work out to about .02, and 1.96 times that would be about .04, yielding an interval of .8-.04 to .8+.04, or .76 to .84. Thus instead of using .8 as a point estimate of the population proportion, one would use .76 to .84 as a 95% confidence interval.
Katz, Jack. 1982. "A Theory of Qualitative Methodology: The Social System of Analytic Fieldwork." Pages 197-218 in: Poor People's Lawyers in Transition. New Brunswick, NJ: Rutgers University Press. Reprinted, pages 127-148 in: Robert M. Emerson, ed. 1988. Contemporary Field Research: A Collection of Readings. Prospect Heights, IL: Waveland Press.
Seltzer, Judith A. 1991 "Legal Custody Arrangements and Children's Economic Welfare." American Journal of Sociology 96 (#4, January): 895-929.