“Tolerance interval”的意思、由来-开放百科全书

A tolerance interval is a statistical interval within which, with some confidence level, a specified proportion of a sampled population falls. "More specifically, a 100×p%/100×(1−α) tolerance interval provides limits within which at least a certain proportion (p) of the population falls with a given level of confidence (1−α)."^[1] "A (p, 1−α) tolerance interval (TI) based on a sample is constructed so that it would include at least a proportion p of the sampled population with confidence 1−α; such a TI is usually referred to as p-content − (1−α) coverage TI."^[2] "A (p, 1−α) upper tolerance limit (TL) is simply a 1−α upper confidence limit for the 100 p percentile of the population."^[2]

A tolerance interval can be seen as a statistical version of a probability interval. "In the parameters-known case, a 95% tolerance interval and a 95% prediction interval are the same."^[3] If we knew a population's exact parameters, we would be able to compute a range within which a certain proportion of the population falls. For example, if we know a population is normally distributed with mean

and standard deviation

, then the interval

includes 95% of the population (1.96 is the z-score for 95% coverage of a normally distributed population).

However, if we have only a sample from the population, we know only the sample mean

and sample standard deviation

, which are only estimates of the true parameters. In that case,

will not necessarily include 95% of the population, due to variance in these estimates. A tolerance interval bounds this variance by introducing a confidence level

, which is the confidence with which this interval actually includes the specified proportion of the population. For a normally distributed population, a z-score can be transformed into a "k factor" or tolerance factor^[4] for a given

via lookup tables or several approximation formulas.^[5] "As the degrees of freedom approach infinity, the prediction and tolerance intervals become equal."^[6]

Formulas

Normal case

Relation to other intervals

The tolerance interval is less widely known than the confidence interval and prediction interval, a situation some educators have lamented, as it can lead to misuse of the other intervals where a tolerance interval is more appropriate.^[7]^[8]

The tolerance interval differs from a confidence interval in that the confidence interval bounds a single-valued population parameter (the mean or the variance, for example) with some confidence, while the tolerance interval bounds the range of data values that includes a specific proportion of the population. Whereas a confidence interval's size is entirely due to sampling error, and will approach a zero-width interval at the true population parameter as sample size increases, a tolerance interval's size is due partly to sampling error and partly to actual variance in the population, and will approach the population's probability interval as sample size increases.^[7]^[8]

The tolerance interval is related to a prediction interval in that both put bounds on variation in future samples. The prediction interval only bounds a single future sample, however, whereas a tolerance interval bounds the entire population (equivalently, an arbitrary sequence of future samples). In other words, a prediction interval covers a specified proportion of a population on average, whereas a tolerance interval covers it with a certain confidence level, making the tolerance interval more appropriate if a single interval is intended to bound multiple future samples.^[8]^[9]

Examples

Calculation

One-sided normal tolerance intervals have an exact solution in terms of the sample mean and sample variance based on the noncentral t-distribution.^[16]

Two-sided normal tolerance intervals can be obtained based on the noncentral chi-squared distribution.^[10]

See also

References

1. ^D. S. Young (2010), Book Reviews: "Statistical Tolerance Regions: Theory, Applications, and Computation", TECHNOMETRICS, FEBRUARY 2010, VOL. 52, NO. 1, pp.143-144.
2. ^¹Krishnamoorthy, K. and Lian, Xiaodong(2011) 'Closed-form approximate tolerance intervals for some general linear models and comparison studies', Journal of Statistical Computation and Simulation,, First published on: 13 June 2011 {{DOI|10.1080/00949655.2010.545061}}
3. ^{{cite book|author=Thomas P. Ryan|title=Modern Engineering Statistics|url=https://books.google.com/books?id=aZn7XNphKcgC&pg=PA222|accessdate=22 February 2013|date=22 June 2007|publisher=John Wiley & Sons|isbn=978-0-470-12843-5|pages=222–}}
4. ^{{cite web |title= Statistical interpretation of data — Part 6: Determination of statistical tolerance intervals |publisher= ISO 16269-6|year= 2005 |page= 64}}
5. ^{{cite book | chapter = Tolerance intervals for a normal distribution | url = http://www.itl.nist.gov/div898/handbook/prc/section2/prc263.htm | title = Engineering Statistics Handbook | publisher = NIST/Sematech | year = 2010 | accessdate = 2011-08-26}}
6. ^{{Cite journal | last1 = De Gryze | first1 = S. | last2 = Langhans | first2 = I. | last3 = Vandebroek | first3 = M. | doi = 10.1016/j.chemolab.2007.03.002 | title = Using the correct intervals for prediction: A tutorial on tolerance intervals for ordinary least-squares regression | journal = Chemometrics and Intelligent Laboratory Systems | volume = 87 | issue = 2 | pages = 147 | year = 2007 | pmid = | pmc = }}
7. ^¹²{{cite journal | author = Stephen B. Vardeman | title = What about the Other Intervals? | journal = The American Statistician | volume = 46 | issue = 3 | year = 1992 | pages = 193–197 | jstor = 2685212 | doi=10.2307/2685212}}
8. ^¹²{{cite web | author = Mark J. Nelson | title = You might want a tolerance interval | url = http://www.kmjn.org/notes/tolerance_intervals.html | date = 2011-08-14 | accessdate = 2011-08-26}}
9. ^¹{{cite book | author = K. Krishnamoorthy| title = Statistical Tolerance Regions: Theory, Applications, and Computation | publisher = John Wiley and Sons | year = 2009 | isbn = 0-470-38026-8 | pages = 1–6}}
10. ^¹{{cite journal| author=Derek S. Young| title=tolerance: An R Package for Estimating Tolerance Intervals| journal=Journal of Statistical Software|date=August 2010| volume=36| number=5| pages=1–39| issn=1548-7660| url=http://www.jstatsoft.org/v36/i05| accessdate=19 February 2013}}, p.23