请输入您要查询的百科知识:

 

词条 Data editing
释义

  1. Editing methods

     Interactive editing  Selective editing  Macro editing  Aggregation method  Distribution method  Automatic editing 

  2. See also

  3. Notes

  4. References

Data editing is defined as the process involving the review and adjustment of collected survey data. The purpose is to control the quality of the collected data.[1] Data editing can be performed manually, with the assistance of a computer or a combination of both.[2]

Editing methods

Interactive editing

The term interactive editing is commonly used for modern computer-assisted manual editing. Most interactive data editing tools applied at National Statistical Institutes (NSIs) allow one to check the specified edits during or after data entry, and if necessary to correct erroneous data immediately. Several approaches can be followed to correct erroneous data:

  • Recontact the respondent
  • Compare the respondent's data to his data from previous year
  • Compare the respondent's data to data from similar respondents
  • Use the subject matter knowledge of the human editor

Interactive editing is a standard way to edit data. It can be used to edit both categorical and continuous data.[3] Interactive editing reduces the time frame needed to complete the cyclical process of review and adjustment.[4]

Selective editing

Selective editing is an umbrella term for several methods to identify the influential errors, [5] and outliers.[6] Selective editing techniques aim to apply interactive editing to a well-chosen subset of the records, such that the limited time and resources available for interactive editing are allocated to those records where it has the most effect on the quality of the final estimates of publication figures. In selective editing, data is split into two streams:

  • The critical stream
  • The non-critical stream

The critical stream consists of records that are more likely to contain influential errors. These critical records are edited in a traditional interactive manner. The records in the non-critical stream which are unlikely to contain influential errors are not edited in a computer assisted manner.[7]

Macro editing

There are two methods of macro editing:[8]

Aggregation method

This method is followed in almost every statistical agency before publication: verifying whether figures to be published seem plausible. This is accomplished by comparing quantities in publication tables with same quantities in previous publications. If an unusual value is observed, a micro-editing procedure is applied to the individual records and fields contributing to the suspicious quantity.[9]

Distribution method

Data available is used to characterize the distribution of the variables. Then all individual values are compared with the distribution. Records containing values that could be considered uncommon (given the distribution) are candidates for further inspection and possibly for editing.[10]

Automatic editing

In automatic editing records are edited by a computer without human intervention[11]. Prior knowledge on the values of a single variable or a combination of variables can be formulated as a set of edit rules which specify or constrain the admissible values

See also

  • Data cleansing
  • Data pre-processing
  • Data wrangling

Notes

1. ^UNECE
2. ^http://www.statcan.gc.ca/edu/power-pouvoir/ch3/editing-edition/5214781-eng.htm
3. ^Waal, Ton de et al. "Handbook of Statistical Data Editing and Imputation". Wiley publication, 2011,p.15.
4. ^http://www.unece.org/fileadmin/DAM/stats/publications/editing/SDE1chA.pdf
5. ^the errors that have substantial impact on the publication figures
6. ^values that do not fit a model of data well
7. ^Waal, Ton de et al. "Handbook of Statistical Data Editing and Imputation". Wiley publication, 2011,p.16.
8. ^Waal, Ton de et al. "Handbook of Statistical Data Editing and Imputation". Wiley publication, 2011,p.16.
9. ^http://www.unece.org/fileadmin/DAM/stats/publications/editing/SDE1chB.pdf
10. ^Bethlehem,J. "Applied Survey Methods A Statistical Perspective ". Wiley publication, 2009,p.205.
11. ^Waal, Ton de et al. "Handbook of Statistical Data Editing and Imputation". Wiley publication

References

{{reflist}}{{data}}

2 : Survey methodology|Quantitative research

随便看

 

开放百科全书收录14589846条英语、德语、日语等多语种百科知识,基本涵盖了大多数领域的百科知识,是一部内容自由、开放的电子版国际百科全书。

 

Copyright © 2023 OENC.NET All Rights Reserved
京ICP备2021023879号 更新时间:2024/11/17 20:16:36