Bundles and CDM
Space shortcuts
Space Tools
Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

Version 1 Current »


Algorithm Details


Implements a loyalty cohort algorithm with the same general design defined in "External Validation of an Algorithm to Identify Patients with High Data-Completeness in Electronic Health Records for Comparative Effectiveness Research" by Lin et al.

  • Written primarily by Darren Henderson with contributions from: Jeff Klann, PhD; Andrew Cagan; Barbara Benoit
  • Calculates 20 variables over the baseline period and computes an overall score, the highest scoring individuals are an approximation of those most likely present for future follow-up
  • This script accepts an index_date and looks back n years previous to that date (baseline period).


Outline of algorithm

The program accepts a cohort definition consisting of: patient ids with per-patient index_dates (the date at which patient loyalty is to be evaluated), the number of years to look backwards from the index date to evaluate each binary variable, and a number of control variables to alter the behavior of the summary table output (such as number of lookback years and whether demographic data are stored in the observation_fact table). Both the model coefficients and the ontology elements mapped to each variable are stored in database tables and can be customized at each site. Specifically, the tool performs the following steps:

  1. Remove patients from the cohort if they have no visits or data (other than demographics) during the measure period. 
  2. Patients that are 18 and under at the beginning of the measure period (and those with an unknown age) are excluded from the analysis, because a pediatric population would require different proxies and measurements of utilization that are likely not generalizable to adults and was not considered in the original model. 
  3. Patients that are deceased at any time from the beginning of the measure period onward, are excluded.
  4. Compute the 20 binary variables by determining which patients have facts of the required types during the measure period, using a site-configurable table of ontology paths (i.e., folders). 
  5. Compute and save (in the database) the final loyalty scores using the published regression equation.
  6. Produce a summary output that implementers can use to validate their loyalty cohorts’ characteristics are as expected. This includes flag prevalence in the cohort, age and sex breakdowns, and average Charlson scores.

The loyalty score script also computes the Charlson Comorbidity Index for each patient.

  1. Compute and save Charlson comorbidities by examining diagnosis codes. At the index date for each patient in the cohort, a 1-year lookback is performed evaluating the diagnosis codes present in the ACT data model. Diagnosis codes associated with each Charlson Comorbidity disease group are retrieved from a data dictionary, and a patient is assigned the appropriate Charlson weights for each group present in the healthcare record. The index is calculated based on the patient's age group and the summation of these weights from each category. From this, the 10-year probability of survival can be calculated.

From supplementary data in Klann, Jeffrey G., Darren W. Henderson, Michele Morris, Hossein Estiri, Griffin M. Weber, Shyam Visweswaran, and Shawn N. Murphy. 2023. “A Broadly Applicable Approach to Enrich Electronic-Health-Record Cohorts by Identifying Patients with Complete Data: A Multisite Evaluation.” Journal of the American Medical Informatics Association: JAMIA, August. https://doi.org/10.1093/jamia/ocad166.

Variables and their coefficients

Variable

Coding System

Label

Any diagnosis code

I

Exactly 1 Diagnosis

Any ED visit

n/a

ED Visit

Any inpatient or outpatient encounter

n/a

In/Out-patient Visit

Any medication

R

Exactly 1 Medication

Any two diagnosis codes

I

2+ Diagnoses

Any two outpatient encounters

n/a

2+ Outpatient Visits

Any two visits with the same provider

C

2+ Visits with Same MD

Any three visits with the same provider

C

3+ Visits with Same MD

At least two medications

R

2+ Medications

At least two routine care fact types (bold-faced)

N/A

2+ Routine Care Facts

Body Mass Index measurement

I

BMI

Colonoscopy

C,H,I

Colonoscopy

Fecal occult blood test

C,H

Fecal Occult Test

General medical examination

I

Medical Exam

Having A1C ordered or value recorded

C,L

A1C

Influenza vaccine

C,H,I

Flu Shot

Mammography

C,I

Mammography

Pap smear

C,H

Pap test

Pneumococcal vaccine

C,H,I

Pneumococcal Vaccine

PSA Test

C,H,I,L

PSA Test

 

Table S1. The 20 variables used to compute a loyalty score, the terminologies within ACT to which we mapped the variables, and the label used in figures. Adapted from Table 2 in [29].

Terminology key: I = ICD-9 and ICD-10; R = RxNorm; C = CPT-4; L = LOINC; H = HCPCS


From supplementary data in Klann, Jeffrey G., Darren W. Henderson, Michele Morris, Hossein Estiri, Griffin M. Weber, Shyam Visweswaran, and Shawn N. Murphy. 2023. “A Broadly Applicable Approach to Enrich Electronic-Health-Record Cohorts by Identifying Patients with Complete Data: A Multisite Evaluation.” Journal of the American Medical Informatics Association: JAMIA, August. https://doi.org/10.1093/jamia/ocad166.


VariableCoefficient
Intercept-0.010
Having seen the same provider twice0.049
Having seen the same provider >=3 times0.087
Having general medical exam*0.078
Mammography*0.075
Pap smear*0.009
PSA Test*0.103
Colonoscopy*0.064
Fecal occult blood test*0.034
Influenza vaccine*0.102
Pneumococcal vaccine*0.031
Having BMI recorded*0.017
Having 2 of the above routine care facts**0.049
With any one medication use record0.002
With at least 2 medication use records0.074
Having A1C ordered or value recorded*0.018
Having at least one inpatient or outpatient encounter0.091
Having at least two outpatient encounters0.050
With 1 diagnosis recorded in the EHR-0.026
With at least 2 diagnoses recorded in the EHR0.037
Having any ED visit in the EHR0.078
** having 2 of the facts followed by *, EHR=electronic health records, PSA= prostate specific antigen

Table S1 from Lin KJ, Rosenthal GE, Murphy SN, Mandl KD, Jin Y, Glynn RJ, et al. External Validation of an Algorithm to Identify Patients with High Data-Completeness in Electronic Health Records for Comparative Effectiveness Research. Clin Epidemiol. 2020 Feb 4;12:133–41.

  • No labels