Using Data Integration and Predictive Analytics to Improve Diagnosis-Based Performance Measures

NIH RePORTER · VA · I01 · — · view on reporter.nih.gov ↗

Abstract

Background: VA performance monitoring makes extensive use of diagnosis-based quality measures that track delivery of care only among patients who have qualifying ICD-9 diagnosis codes. Diagnosis-based measures can be calculated using existing VA data, allowing for low-cost, near real-time performance monitoring. However, diagnosis-based measures can have critical validity problems if the targeted condition is under- or over-diagnosed to differing degrees across facilities. When variation is diagnosing and coding occurs, facility rankings on measured performance can be misleading: High performing facilities can score poorly, low performing facilities can score well, and facilities with the same real performance can fall at opposite ends of the facility rank distribution. Use of diagnosis-based process measures can therefore undermine one of the primary purposes of quality measurement: The comparison of facilities and systems. In addition, diagnosis- based measures cannot be used to detect gaps in access to care for patients who have a targeted condition but no qualifying diagnosis code. Finally, when diagnosis rates vary across patient subgroups, diagnosis-based measures cannot be used to detect and act on healthcare disparities. Problems with diagnosis-based measures could be remedied if true prevalence data were available: Comparisons of performance based on diagnosis- versus prevalence-based measures would detect facilities with anomalous diagnosis rates and distinguish variation in true performance from variation in case-finding. However, for many conditions, the electronic health record (EHR) does not contain data on true prevalence. Objectives: The goal of the proposed project is to develop a general method for improving diagnosis-based measures when valid prevalence data are not readily available. We propose to build a model for predicting prevalence using multiple sources of existing data and to validate it through a one-time collection of gold standard outcome data (survey-based SUD prevalence). Leveraging existing data with targeted collection of model development and validation data is a cost-effective strategy to improve diagnosis-based measures without requiring ongoing, expensive disease surveillance. Focusing on substance use disorder (SUD) care as an example, the objectives of this study are to: (a) assess the degree of SUD under- or over-diagnosis by comparing the proportion of patients with coded SUD diagnoses in the VA administrative data to SUD prevalence estimates obtained using a validated measure in a patient survey conducted at 30 VA healthcare systems; (b) refine and validate a model for predicting SUD prevalence among VA patients using multiple existing data sources; and (c) assess disparities in SUD diagnosis by comparing diagnosis rates to survey- based SUD prevalence estimates across patient age, sex, and racial/ethnic groups. Methods: We will collect data on DSM-IV and DSM-5-concordant SUD among VA patients using a va...

Key facts

NIH application ID: 10051319
Project number: 5I01HX002128-04
Recipient: VA GREATER LOS ANGELES HEALTHCARE SYSTEM
Principal Investigator: Katherine JoAnn Hoggatt
Activity code: I01
Funding institute: VA
Fiscal year: 2021
Award amount: —
Award type: 5
Project period: 2017-01-01 → 2021-03-31