Biostatistics Weekly Seminar

A Statistical Perspective on Challenges and Opportunities in Research Using Real World Data

Rebecca Hubbard, PhD
University of Pennsylvania, Perelman School of Medicine

Using data generated as a by-product of electronic transactions to improve health and healthcare is a priority for the US healthcare system in the 21st century. Informaticians have played a leading role in the process of extracting “real world data” from electronic systems, with statisticians playing a more peripheral part. However, statistical insights on sampling and inference are key to drawing valid conclusions based on these messy and incomplete data sources. In this talk, I will use my research on electronic health records (EHR)-based phenotyping to motivate a discussion of the roles of informatics, statistics, and data science in the process of learning from healthcare data. EHR-based phenotyping is hampered by complex missing data patterns and heterogeneity across patients and healthcare systems, features which have been largely ignored by existing phenotyping methods. As a result, not only are EHR-derived phenotypes expected to be imperfect, but they often feature exposure-dependent differential misclassification, which can bias analyses towards or away from the null. I will review novel and existing approaches to EHR-based phenotyping, highlighting the impact of missing data on phenotype estimation. I will then review some results on the implications of using EHR-derived phenotypes with differential misclassification for bias and type I error of subsequent association studies using these phenotypes as outcomes. Finally, I will present an approach to correcting for phenotyping error that does not require knowledge of sensitivity and specificity of the phenotype. The overall goal of this presentation is to use the example of phenotyping to illustrate the unique contribution of statistics to the process of generating evidence from real world data.

MRBIII, Room 1220
29 May 2019

Speaker Itinerary
Topic attachments
I Attachment Action Size Date Who Comment
1154_0.jpgjpg 1154_0.jpg manage 37.5 K 18 Jan 2019 - 13:25 SrKrueger Auto-attached by ImagePlugin
Topic revision: r4 - 05 Apr 2019, SrKrueger

This site is powered by FoswikiCopyright © 2013-2017 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding Vanderbilt Biostatistics Wiki? Send feedback