65th ISI World Statistics Congress 2025

IPS 806 - Advances in Handling Missing Data for EHR and Causal Inference

Category: IPS
Monday 6 October 2 p.m. - 3:40 p.m. (Europe/Amsterdam) Room - Europe 2

Participants

Qixuan Chen (Organiser)
Liangyuan Hu (Chair)
Rebecca Anthopolos (Presenter/Speaker)
Qi Long (Presenter/Speaker)
Rohit Bhattacharya (Presenter/Speaker)
Vincent Tan (Presenter/Speaker)

Missing data are prevalent, affecting both randomized controlled trials and observational studies. They pose a significant challenge in the analysis of electronic health records (EHR). EHRs, not originally collected for research, present unique challenges for missing data handling, including data recorded at irregular intervals and at frequencies that vary across measures. Similarly, causal inference is hindered by missing data because most existing methods are designed for complete datasets. This gap underscores the urgency of developing causal inference methods that accommodate incomplete data.

In this invited session, five distinguished speakers will showcase their latest research on addressing missing data, with applications in EHR analysis and causal inference. Professor Qi Long, from the University of Pennsylvania, will share his recent research on biased, incomplete EHR data, including a more accurate assessment of the harmful impact of incomplete EHR data on algorithmic fairness, the challenges of mitigating such bias, and potential strategies. Professor Rebecca Anthopolos, from New York University, will present a Bayesian nonparametric joint model of longitudinal BMI and time-to-diabetes diagnosis that uses longitudinal EHR data to evaluate the effectiveness of various static BMI cutoffs versus patient BMI trajectories for diabetes screening in Asians. To account for an informative visit process, in which the timing of a patient’s visits may be associated with underlying health status, the model includes a recurrent event submodel for the gap times between a patient’s clinic visits. To address missing depression screening data recorded in EHRs during routine clinical screenings, Professor Qixuan Chen, from Columbia University, will present an ordinal logistic Bayesian Additive Regression Trees model within a pattern-mixture framework. This model specifically aims to impute multiple missing scores in patient health questionnaires. Professor Rohit Bhattacharya, from Williams College, will consider missingness in the context of causal inference when the outcome of interest may be missing. He will present a test to verify identification assumptions that are sufficient to correct for both self-censoring and confounding bias using the shadow variable method. Finally, Dr. Vincent Tan, from Vertex Pharmaceuticals, will present his research on causal inference that accounts for selection bias due to censoring by death, using a multiple imputation approach to generate counterfactual predictive distributions of principal strata and estimate survivor average causal effects.

