请输入您要查询的百科知识:

 

词条 Seven-number summary
释义

  1. (Parametric) Seven-number summary

  2. Bowley’s seven-figure summary

  3. See also

  4. References

In descriptive statistics, the seven-number summary is a collection of seven summary statistics, and is an extension of the five-number summary. There are two similar, common forms.

As with the five-number summary, it can be represented by a modified box plot, adding hatch-marks on the "whiskers" for two of the additional numbers.

(Parametric) Seven-number summary

The following percentiles are evenly spaced under a normally distributed variable:

  1. the 2nd percentile
  2. the 9th percentile
  3. the 25th percentile or lower quartile or first quartile
  4. the 50th percentile or median (middle value, or second quartile)
  5. the 75th percentile or upper quartile or third quartile
  6. the 91st percentile
  7. the 98th percentile

The middle three values – the lower quartile, median, and upper quartile – are the usual statistics from the five-number summary and are the standard values for the box in a box plot.

The two unusual percentiles at either end are used because the locations of all seven values will be approximately equally spaced if the data is normally distributed (four equally spaced percentiles with three digits of precision are 2.15, 8.87, 25.0, and 50.0). Some statistical tests require normally distributed data, so the plotted values provide a convenient visual check for validity of later tests, simply by scanning to see if the marks for those seven percentiles appear to be equal distances apart on the graph.

Notice that whereas the five-number summary makes no assumptions about the distribution of the data, the (parametric) seven-number summary is based on the normal distribution, and is not especially appropriate when normal data is not expected. However, the non-parametric seven number summary, discussed below, makes no assumptions.

The values can be represented using a modified box plot. The 2nd and 98th percentiles are represented by the ends of the whiskers, and hatch-marks across the whiskers mark the 9th and 91st percentiles.

Bowley’s seven-figure summary

Arthur Bowley used a set of non-parametric statistics, called a "seven-figure summary", including the extremes, deciles, and quartiles, along with the median.[1]

Thus the numbers are:

  1. the sample minimum
  2. the 10th percentile (first decile)
  3. the 25th percentile or lower quartile or first quartile
  4. the 50th percentile or median (middle value, or second quartile)
  5. the 75th percentile or upper quartile or third quartile
  6. the 90th percentile (last decile)
  7. the sample maximum

Note that the middle five of the seven numbers are very nearly the same as for the parametric seven number summary, above.

The addition of the deciles allow one to compute the interdecile range, which for a normal distribution can be scaled to give a reasonably efficient estimate of standard deviation, and the 10% midsummary, which when compared to the median gives an idea of the skewness in the tails.

See also

  • Three-point estimation
  • Stanine

References

1. ^{{cite book |last=Bowley |first=Arthur |author-link=Arthur Bowley |year=1920 |title=Elementary Manual of Statistics |edition=3rd |page=62 |quote=the seven positions are the maximum and minimum, median, quartiles, and two deciles}}

1 : Summary statistics

随便看

 

开放百科全书收录14589846条英语、德语、日语等多语种百科知识,基本涵盖了大多数领域的百科知识,是一部内容自由、开放的电子版国际百科全书。

 

Copyright © 2023 OENC.NET All Rights Reserved
京ICP备2021023879号 更新时间:2024/11/17 11:17:03