Mountain View
biomedical and healthCAre Data Discovery Index Ecosystem
help Advanced Search
Title: Replication data for: What To Do about Missing Data in Time-Series Cross-Sectional Data      
dateReleased:
08-07-2014
downloadURL: http://hdl.handle.net/1902.1/14316
ID:
hdl:1902.1/14316
description:
Applications of modern methods for analyzing data with missing values, based primarily on multiple imputation, have in the last half-decade become common in American politics and political behavior. Scholars in these fields have thus increasingly avoided the biases and inefficiencies caused by ad hoc methods like listwise deletion and best guess imputation. However, researchers in much of comparative politics and international relations, and others with similar data, have been unable to do the same because the best available imputation methods work poorly with the time-series cross-section data structures common in these fields. We attempt to rectify this situation. First, we build a multiple i mputation model that allows smooth time trends, shifts across cross-sectional units, and correlations over time and space, resulting in far more accurate imputations. Second, we build nonignorable missingness models by enabling analysts to incorporate knowledge from area studies experts via priors on individual missing cell values, rather than on difficult-to-interpret model parameters. Third, since these tasks could not be accomplished within existing imputation algorithms, in that they cannot handle as many variables as needed even in the simpler cross-sectional data for which they were designed, we also develop a new algorithm that substantially expands the range of computationally feasible data types and sizes for which multiple imputation can be used. These developments also made it possible to implement the methods introduced here in freely available open source software that is considerably more reliable than existing strategies. These developments also made it possible to implement the methods introduced here in freely available open source software, Amelia II: A Program for Missing Data, that is considerably more reliable than existing strategies. See also: Missing Data
description:
Honaker, James; King, Gary, 2010, "Replication data for: What To Do about Missing Data in Time-Series Cross-Sectional Data", http://hdl.handle.net/1902.1/14316, Harvard Dataverse, V5
name:
Honaker, James
King, Gary
homePage: http://www.harvard.edu/
name:
Harvard University
ID:
SCR:011273
abbreviation:
DataVerse
homePage: http://thedata.org/
name:
Dataverse Network Project
ID:
SCR:001997