Generalized estimating equation

  1. Formulation

  2. Computation

  3. References

  4. Further reading

In statistics, a generalized estimating equation (GEE) is used to estimate the parameters of a generalized linear model when the correlation between outcomes may be unknown.[1][2]

Under mild regularity conditions, parameter estimates from the GEE are consistent even when the covariance structure is misspecified, provided the mean model is correctly specified. The focus of the GEE is on estimating the average response over the population ("population-averaged" effects) rather than the regression parameters that would enable prediction of the effect of changing one or more covariates on a given individual. GEEs are usually used in conjunction with Huber–White standard error estimates, also known as "robust standard error" or "sandwich variance" estimates. In the case of a linear model with a working independence variance structure, these are known as "heteroscedasticity-consistent standard error" estimators. Indeed, the GEE unified several independent formulations of these standard error estimators in a general framework.
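For the linear-model case, the sandwich estimate can be sketched in a few lines. The following is a minimal illustration only, assuming one covariate plus an intercept and the uncorrected HC0 form (no small-sample adjustment). The function name `ols_hc0` and the toy data are hypothetical and are not taken from any of the packages discussed in this article.

```python
import math

def ols_hc0(x, y):
    """Fit y = b0 + b1*x by least squares.

    Returns ((b0, b1), (se_b0, se_b1)) where the standard errors are
    HC0 "sandwich" estimates: bread * meat * bread.
    """
    n = len(x)
    sx, sxx = sum(x), sum(v * v for v in x)
    sy, sxy = sum(y), sum(a * b for a, b in zip(x, y))
    det = n * sxx - sx * sx

    # "Bread": (X'X)^{-1} for the design matrix with columns (1, x).
    bread = [[sxx / det, -sx / det], [-sx / det, n / det]]
    b0 = bread[0][0] * sy + bread[0][1] * sxy
    b1 = bread[1][0] * sy + bread[1][1] * sxy

    # "Meat": sum over observations of e_i^2 * x_i x_i'.
    resid = [yi - b0 - b1 * xi for xi, yi in zip(x, y)]
    m00 = sum(e * e for e in resid)
    m01 = sum(e * e * xi for e, xi in zip(resid, x))
    m11 = sum(e * e * xi * xi for e, xi in zip(resid, x))
    meat = [[m00, m01], [m01, m11]]

    def matmul2(a, b):
        """Multiply two 2x2 matrices stored as nested lists."""
        return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
                for i in range(2)]

    cov = matmul2(matmul2(bread, meat), bread)
    # Guard against tiny negative values caused by rounding.
    se = tuple(math.sqrt(max(cov[k][k], 0.0)) for k in range(2))
    return (b0, b1), se

# Toy data (hypothetical): three observations, one covariate.
coef, se = ols_hc0([0, 1, 2], [0, 1, 4])
print("coefficients:", coef)
print("HC0 standard errors:", se)
```

Here each observation acts as its own cluster, so the "meat" is built from individual squared residuals. With clustered data the same bread-meat-bread pattern applies, with the meat summed over clusters instead.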

GEEs belong to a class of regression techniques that are referred to as semiparametric because they rely on specification of only the first two moments. They are a popular alternative to the likelihood-based generalized linear mixed model, which is more sensitive to variance structure specification.[3] They are commonly used in large epidemiological studies, especially multi-site cohort studies, because they can handle many types of unmeasured dependence between outcomes.

Formulation

Given a mean model $\mu_{ij}$ for subject $i$ and time $j$ that depends upon regression parameters $\beta_k$, and variance structure $V_i$, the estimating equation is formed via:[4]

$$U(\beta) = \sum_{i=1}^{N} \left(\frac{\partial \mu_i}{\partial \beta}\right)^{\top} V_i^{-1} \left\{ Y_i - \mu_i(\beta) \right\}$$

The parameters $\beta_k$ are estimated by solving $U(\beta) = 0$ and are typically obtained via the Newton–Raphson algorithm. The variance structure is chosen to improve the efficiency of the parameter estimates. The Hessian of the solution to the GEEs in the parameter space can be used to calculate robust standard error estimates.

The term "variance structure" refers to the algebraic form of the covariance matrix between outcomes, $Y$, in the sample. Examples of variance structure specifications include independence, exchangeable, autoregressive, stationary m-dependent, and unstructured.

The most popular form of inference on GEE regression parameters is the Wald test using naive or robust standard errors, though the score test is also valid and preferable when it is difficult to obtain estimates of information under the alternative hypothesis. The likelihood ratio test is not valid in this setting because the estimating equations are not necessarily likelihood equations. Model selection can be performed with the GEE equivalent of the Akaike Information Criterion (AIC), the Quasi-AIC (QIC).[5]
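The solve-then-sandwich procedure described above can be sketched in plain Python. This is an illustrative sketch under stated assumptions, not the algorithm of any particular package. It assumes a logistic mean model with an intercept and one covariate, and an exchangeable working correlation whose parameter `alpha` is held fixed by the user (real software usually estimates it from residuals). It uses Fisher scoring, a Newton-type iteration that replaces the exact derivative of the estimating function with its expected value. With $D_i = \partial \mu_i / \partial \beta$, the estimating function is the sum over clusters of $D_i^{\top} V_i^{-1} (Y_i - \mu_i)$. All function names and the toy data are hypothetical.

```python
import math

def exch_inv_apply(z, alpha):
    """Return R^{-1} z for an exchangeable correlation matrix R
    (ones on the diagonal, alpha elsewhere), via its closed-form inverse."""
    n = len(z)
    c = alpha / (1.0 + (n - 1) * alpha)
    total = sum(z)
    return [(v - c * total) / (1.0 - alpha) for v in z]

def inv2(m):
    """Invert a 2x2 matrix stored as nested lists."""
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    return [[m[1][1] / det, -m[0][1] / det],
            [-m[1][0] / det, m[0][0] / det]]

def matmul2(a, b):
    """Multiply two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def gee_terms(clusters, beta, alpha):
    """Return (score, bread, meat) at beta.

    score = sum_i D_i' V_i^{-1} (y_i - mu_i)
    bread = sum_i D_i' V_i^{-1} D_i
    meat  = sum_i u_i u_i', with u_i the per-cluster score contribution
    """
    score = [0.0, 0.0]
    bread = [[0.0, 0.0], [0.0, 0.0]]
    meat = [[0.0, 0.0], [0.0, 0.0]]
    for xs, ys in clusters:
        mu = [1.0 / (1.0 + math.exp(-(beta[0] + beta[1] * x))) for x in xs]
        sd = [math.sqrt(m * (1.0 - m)) for m in mu]
        # Columns of A^{1/2} X, where X has columns (1, x) and A = diag(mu(1-mu)).
        cols = [sd, [s * x for s, x in zip(sd, xs)]]
        # Standardised residuals, premultiplied by R^{-1}.
        w = exch_inv_apply([(y - m) / s for y, m, s in zip(ys, mu, sd)], alpha)
        u = [sum(c * wj for c, wj in zip(col, w)) for col in cols]
        rcols = [exch_inv_apply(col, alpha) for col in cols]
        for k in range(2):
            score[k] += u[k]
            for l in range(2):
                bread[k][l] += sum(a * b for a, b in zip(cols[k], rcols[l]))
                meat[k][l] += u[k] * u[l]
    return score, bread, meat

def gee_logit(clusters, alpha=0.0, tol=1e-10, max_iter=50):
    """Solve U(beta) = 0 by Fisher scoring; return (beta, robust covariance)."""
    beta = [0.0, 0.0]
    for _ in range(max_iter):
        score, bread, _ = gee_terms(clusters, beta, alpha)
        b_inv = inv2(bread)
        step = [b_inv[k][0] * score[0] + b_inv[k][1] * score[1] for k in range(2)]
        beta = [beta[0] + step[0], beta[1] + step[1]]
        if max(abs(s) for s in step) < tol:
            break
    _, bread, meat = gee_terms(clusters, beta, alpha)
    b_inv = inv2(bread)
    return beta, matmul2(matmul2(b_inv, meat), b_inv)

# Toy data (hypothetical): four clusters of three binary outcomes each,
# given as (covariate values, outcomes).
clusters = [([0, 0, 0], [0, 0, 0]), ([0, 0, 0], [1, 1, 0]),
            ([1, 1, 1], [1, 1, 1]), ([1, 1, 1], [0, 0, 1])]
beta, cov = gee_logit(clusters, alpha=0.3)
wald_z = beta[1] / math.sqrt(cov[1][1])  # Wald statistic with robust SE
print("beta:", beta)
print("robust SE of slope:", math.sqrt(cov[1][1]))
print("Wald z for slope:", wald_z)
```

In this toy data set the covariate is constant within each cluster and all clusters have the same size, so the working correlation rescales every cluster's contribution by the same factor, and the estimates do not depend on `alpha`. With within-cluster covariates or unequal cluster sizes, the choice of working structure generally changes the estimates and their efficiency.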

Computation

Software for solving generalized estimating equations is available in MATLAB,[6] SAS (PROC GENMOD[7]), SPSS (the GENLIN procedure[8]), Stata (the xtgee command[9]) and R (packages gee,[10] geepack[11] and multgee[12]).

Comparisons among software packages for the analysis of correlated binary data[13][14] and correlated ordinal data[15] via GEE are available.

References

1. Liang, Kung-Yee; Zeger, Scott (1986). "Longitudinal data analysis using generalized linear models". Biometrika. 73 (1): 13–22. doi:10.1093/biomet/73.1.13.
2. Hardin, James; Hilbe, Joseph (2003). Generalized Estimating Equations. London: Chapman and Hall/CRC. ISBN 978-1-58488-307-4.
3. Fong, Y; Rue, H; Wakefield, J (2010). "Bayesian inference for generalized linear mixed models". Biostatistics. 11 (3): 397–412. doi:10.1093/biostatistics/kxp053. PMC 2883299. PMID 19966070.
4. Diggle, Peter J.; Heagerty, Patrick; Liang, Kung-Yee; Zeger, Scott L. (2002). Analysis of Longitudinal Data. Oxford Statistical Science Series. ISBN 978-0-19-852484-7.
5. Pan, W. (2001). "Akaike's information criterion in generalized estimating equations". Biometrics. 57: 120–125. doi:10.1111/j.0006-341X.2001.00120.x.
6. Ratcliffe, Sarah J.; Shults, Justine (2008). "GEEQBOX: A MATLAB Toolbox for Generalized Estimating Equations and Quasi-Least Squares". Journal of Statistical Software. 25 (14): 1–14. http://www.jstatsoft.org/v25/i14
7. "The GENMOD Procedure". The SAS Institute. http://support.sas.com/documentation/cdl/en/statug/63033/HTML/default/genmod_toc.htm
8. "IBM SPSS Advanced Statistics". IBM SPSS website. http://www.spss.com/software/statistics/advanced-statistics/
9. "Stata's implementation of GEE". Stata website. https://www.stata.com/manuals13/xtxtgee.pdf
10. "gee: Generalized Estimation Equation solver". CRAN. https://cran.r-project.org/web/packages/gee/index.html
11. "geepack: Generalized Estimating Equation Package". CRAN. https://cran.r-project.org/web/packages/geepack/index.html
12. "multgee: GEE solver for correlated nominal or ordinal multinomial responses using a local odds ratios parameterization". CRAN. https://cran.r-project.org/web/packages/multgee/index.html
13. Ziegler, Andreas; Grömping, Ulrike (1998). "The generalised estimating equations: a comparison of procedures available in commercial statistical software packages". Biometrical Journal. 40 (3): 245–260. doi:10.1002/(sici)1521-4036(199807)40:3<245::aid-bimj245>3.0.co;2-n.
14. Horton, Nicholas J.; Lipsitz, Stuart R. (1999). "Review of software to fit generalized estimating equation regression models". The American Statistician. 53 (2): 160–169. doi:10.1080/00031305.1999.10474451. CiteSeerX 10.1.1.22.9325.
15. Nooraee, Nazanin; Molenberghs, Geert; van den Heuvel, Edwin R. (2014). "GEE for longitudinal ordinal data: Comparing R-geepack, R-multgee, R-repolr, SAS-GENMOD, SPSS-GENLIN". Computational Statistics & Data Analysis. 77: 70–83. doi:10.1016/j.csda.2014.03.009. https://pure.rug.nl/ws/files/17588929/Title_and_contents_.pdf

Further reading

  • Hardin, James; Hilbe, Joseph (2003). Generalized Estimating Equations. London: Chapman and Hall/CRC. ISBN 978-1-58488-307-4.
  • Ziegler, A. (2011). Generalized Estimating Equations. Springer. ISBN 978-1-4614-0498-9.
