Listwise deletion

  1. Example

  2. Problems with listwise deletion

  3. Compared to other methods

  4. References

In statistics, listwise deletion is a method for handling missing data. In this method, an entire record is excluded from analysis if any single value is missing.[1] (p. 6)

Example

For example, consider the following questionnaire, as answered by 10 subjects:

Subject  Age          Gender       Income
1        29           M            $40,000
2        45           M            $36,000
3        81           M            --missing--
4        22           --missing--  $16,000
5        41           M            $98,000
6        33           F            $60,000
7        22           F            $24,000
8        --missing--  F            $81,000
9        33           F            $55,000
10       45           F            $80,000

A researcher wants to model income (the dependent variable) based on age and gender (the independent variables). Using listwise deletion, the researcher would remove subjects 3, 4, and 8 from the sample before performing any further analysis, leaving seven complete records.
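The deletion step in this example can be sketched in a few lines of Python. This is a minimal illustration, assuming the questionnaire is held as a list of dictionaries with None standing in for "--missing--"; the names records and listwise_delete are hypothetical and chosen only for this sketch.

```python
# Questionnaire responses from the table above; None marks a missing value.
records = [
    {"subject": 1, "age": 29, "gender": "M", "income": 40000},
    {"subject": 2, "age": 45, "gender": "M", "income": 36000},
    {"subject": 3, "age": 81, "gender": "M", "income": None},
    {"subject": 4, "age": 22, "gender": None, "income": 16000},
    {"subject": 5, "age": 41, "gender": "M", "income": 98000},
    {"subject": 6, "age": 33, "gender": "F", "income": 60000},
    {"subject": 7, "age": 22, "gender": "F", "income": 24000},
    {"subject": 8, "age": None, "gender": "F", "income": 81000},
    {"subject": 9, "age": 33, "gender": "F", "income": 55000},
    {"subject": 10, "age": 45, "gender": "F", "income": 80000},
]


def listwise_delete(rows):
    """Keep only rows in which every field is observed (complete cases)."""
    return [row for row in rows if all(value is not None for value in row.values())]


complete = listwise_delete(records)
dropped = [row["subject"] for row in records if row not in complete]

print(dropped)        # [3, 4, 8]
print(len(complete))  # 7
```

In pandas, calling DataFrame.dropna() with its default arguments performs the same complete-case filtering on a data frame.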

Problems with listwise deletion

Listwise deletion reduces the statistical power of the tests conducted.[2][3] Statistical power depends in part on a large sample size. Because listwise deletion excludes every record with a missing value, it shrinks the sample available for analysis.
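The link between sample size and power can be made concrete with a rough calculation. The sketch below approximates the power of a two-sided one-sample z-test before and after the sample shrinks from 10 to 7 records, as in the example. The test choice, the effect size of 0.8 standard deviations, and the function name z_test_power are assumptions made purely for illustration; they are not taken from the cited sources.

```python
from math import sqrt
from statistics import NormalDist


def z_test_power(effect_size, n, alpha=0.05):
    """Approximate power of a two-sided one-sample z-test.

    effect_size is the true mean shift in standard-deviation units
    (an illustrative value, not taken from the article).
    """
    std_normal = NormalDist()
    z_crit = std_normal.inv_cdf(1 - alpha / 2)
    shift = effect_size * sqrt(n)
    # Probability that the test statistic lands in either rejection region.
    return (1 - std_normal.cdf(z_crit - shift)) + std_normal.cdf(-z_crit - shift)


power_full = z_test_power(0.8, n=10)     # all 10 subjects
power_complete = z_test_power(0.8, n=7)  # 7 complete cases after listwise deletion

print(f"power with n=10: {power_full:.2f}")
print(f"power with n=7:  {power_complete:.2f}")
```

For any fixed nonzero effect size, the power computed for the smaller sample is lower, which is the loss the paragraph above describes.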

Listwise deletion is also problematic when data are not missing at random, for example when a questionnaire asks for sensitive information.[3] In that case the excluded records differ systematically from the retained ones, which biases the findings. For instance, a questionnaire may include questions about respondents' drug use history, current earnings, or sexual orientation. Many subjects may decline to answer these items because they are intrusive, yet answer all the others. Listwise deletion excludes these respondents from the analysis entirely. This can create bias, because participants who divulge this information may have different characteristics from those who do not. Multiple imputation is an alternative technique for dealing with missing data that attempts to eliminate this bias.
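A small simulation can show how non-random missingness biases a complete-case estimate. The sketch below invents a population of incomes and assumes that higher earners are less likely to report their income. The distribution, the response probabilities, and the helper name responds are all hypothetical and serve only to illustrate the mechanism.

```python
import random
from statistics import mean

random.seed(0)  # fixed seed so the run is repeatable

# Hypothetical population: incomes drawn from a normal distribution.
incomes = [random.gauss(50_000, 15_000) for _ in range(10_000)]


def responds(income):
    """Assumed non-random mechanism: higher earners answer less often."""
    p_answer = 0.5 if income > 50_000 else 0.9
    return random.random() < p_answer


# Listwise deletion keeps only the respondents who reported an income.
observed = [x for x in incomes if responds(x)]

true_mean = mean(incomes)
complete_case_mean = mean(observed)

print(f"mean of all incomes:          {true_mean:,.0f}")
print(f"mean after listwise deletion: {complete_case_mean:,.0f}")
```

Because high incomes are under-represented among the retained records, the complete-case mean falls below the mean of the full population. If the response probability did not depend on income, the two means would differ only by sampling noise.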

Compared to other methods

While listwise deletion has its problems, it is preferable to many other methods for handling missing data.[1] (p. 7) In some cases, it may even be the least problematic method.[1] (p. 6) The following table compares listwise deletion with other methods:

Method                             Comparison
Pairwise deletion[1] (p. 9)
Dummy variable adjustment          Produces biased estimates of coefficients.[4]
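To make the contrast with pairwise deletion concrete: pairwise deletion uses, for each pair of variables, every record in which both variables are observed, whereas listwise deletion uses one shared set of fully observed records. The sketch below counts the records each approach retains for the questionnaire from the Example section. The variable names rows, listwise_n, and pairwise_n are hypothetical, and the sketch only counts cases; it does not compute any statistics.

```python
from itertools import combinations

# (age, gender, income) for subjects 1-10 from the Example section; None = missing.
rows = [
    (29, "M", 40000), (45, "M", 36000), (81, "M", None), (22, None, 16000),
    (41, "M", 98000), (33, "F", 60000), (22, "F", 24000), (None, "F", 81000),
    (33, "F", 55000), (45, "F", 80000),
]
names = ("age", "gender", "income")

# Listwise deletion: one shared sample of fully observed records.
listwise_n = sum(all(v is not None for v in row) for row in rows)

# Pairwise deletion: a separate sample for each pair of variables.
pairwise_n = {
    (names[i], names[j]): sum(r[i] is not None and r[j] is not None for r in rows)
    for i, j in combinations(range(len(names)), 2)
}

print(listwise_n)  # 7
print(pairwise_n)  # each pair of variables keeps 8 records
```

Each pair retains one more record than the listwise sample here, but the three pairs are computed on three different subsets of subjects.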

References

1. Allison, P. D. (2001). Missing Data. Sage University Papers Series on Quantitative Applications in the Social Sciences, 07-136. Thousand Oaks, CA: Sage.
2. Roth, P. L. (1994). "Missing data: A conceptual review for applied psychologists". Personnel Psychology. 47 (3): 537–559. doi:10.1111/j.1744-6570.1994.tb01736.x
3. Olinsky, A.; Chen, S.; Harlow, L. (2003). "The comparative efficacy of imputations methods for missing data in structural equation modeling". European Journal of Operational Research. 151 (1): 53–79. doi:10.1016/S0377-2217(02)00578-7
4. Jones, M. P. (1996). "Indicator and stratification methods for missing explanatory variables in multiple linear regression". J. Amer. Statist. Assoc. 91 (433): 222–230. doi:10.1080/01621459.1996.10476680. As cited by Allison (2001), p. 10.