Estimation of Conditional Prevalence from Group Testing Data with Missing Covariates |
Authors: |
Aurore Delaigle, Wei Huang, and Shaoke Lei |
Source: |
Journal of the American Statistical Association , Published online; DOI: 10.1080/01621459.2019.1566071 |
Topic(s): |
Data quality Data use
|
Country: |
More than one region
Multiple Regions
|
Published: |
APR 2019 |
Abstract: |
We consider estimating the conditional prevalence of a disease from data pooled according to the group testing mechanism. Consistent estimators have been proposed in the literature, but they rely on the data being available for all individuals. In infectious disease studies where group testing is frequently applied, the covariate is often missing for some individuals. There, unless the missing mechanism occurs completely at random, applying the existing techniques to the complete cases without adjusting for missingness does not generally provide consistent estimators, and finding appropriate modifications is challenging. We develop a consistent spline estimator, derive its theoretical properties, and show how to adapt local polynomial and likelihood estimators to the missing data problem. We illustrate the numerical performance of our methods on simulated and real examples. Supplementary materials for this article are available online.
Keywords: Bandwidth, Kernel, Likelihood, Local polynomial, Pooled data, Spline |
|