Impact of the non-probability sample on the accuracy of an estimator of a total when integrating non-probability and probability samples
Conference
65th ISI World Statistics Congress 2025
Format: IPS Abstract - WSC 2025
Keywords: composite estimator, multivariate normal distribution, poisson pseudo-sampling design, propensity score, variance
Session: IPS 700 - Non-probability and Probability Sample Integrated Estimators for the Population Parameters
Monday 6 October 2 p.m. - 3:40 p.m. (Europe/Amsterdam)
Abstract
Data from the non-probability and probability samples are combined to estimate finite population total. Assuming the values of the study variable are available in both samples, under independence of the pseudo-inclusion indicators to the non-probability sample, the integration of the non-probability sample and probability sample through the composite estimator of the population total is studied. The integration is composed of the linear combination of the inverse probability weighted (IPW) estimator and a traditional design-based estimator. Evaluating the variance of the former estimator, the randomness of the underlying non-probability sample is taken into account through the distribution of the estimated propensity scores. Non-sampling errors are not taken into account. This approach is compared with an estimator obtained by other authors and with a bootstrap variance estimator. The proposed linear combination is robust to the miss-specification of the model for the propensity scores due to incorporated estimator of bias of the IPW estimator. The number of Lithuanian companies possessing the websites is estimated in the simulation study. By combining the sample survey data and the big voluntary sample data, the properties of the introduced estimators are demonstrated.