词条 | Grouped Dirichlet distribution | |||||||||||||||||||
释义 |
In statistics, the grouped Dirichlet distribution (GDD) is a multivariate generalization of the Dirichlet distribution It was first described by Ng et al 2008.[1] The Grouped Dirichlet distribution arises in the analysis of categorical data where some observations could fall into any of a set of other 'crisp' category. For example, one may have a data set consisting of cases and controls under two different conditions. With complete data, the cross-classification of disease status forms a 2(case/control)-x-(condition/no-condition) table with cell probabilities
If, however, the data includes, say, non-respondents which are known to be controls or cases, then the cross-classification of disease status forms a 2-x-3 table. The probability of the last column is the sum of the probabilities of the first two columns in each row, e.g.
The GDD allows the full estimation of the cell probabilities under such aggregation conditions.[1] Probability DistributionConsider the closed simplex set and . Writing for the first elements of a member of , the distribution of for two partitions has a density function given by where is the multivariate beta function. Ng et al[1] went on to define an m partition grouped Dirichlet distribution with density of given by where is a vector of integers with . The normalizing constant given by The authors went on to use these distributions in the context of three different applications in medical science. References1. ^1 2 {{cite journal| last=Ng| first=Kai Wang| title=Grouped Dirichlet distribution: A new tool for incomplete categorical data analysis| journal=Journal of Multivariate Analysis| year=2008| volume=99| pages=490–509}} 4 : Multivariate continuous distributions|Conjugate prior distributions|Exponential family distributions|Continuous distributions |
|||||||||||||||||||
随便看 |
|
开放百科全书收录14589846条英语、德语、日语等多语种百科知识,基本涵盖了大多数领域的百科知识,是一部内容自由、开放的电子版国际百科全书。