© 2001 by the American Diabetes Association, Inc.
Developing a Prediction Rule From Automated Clinical Databases to Identify High-Risk Patients in a Large Population With DiabetesDivision of Research, Kaiser Permanente of Northern California, Oakland, California
OBJECTIVETo develop and validate a prediction rule for identifying diabetic patients at high short-term risk of complications using automated data in a large managed care organization.
RESEARCH DESIGN AND METHODSRetrospective cohort analyses were performed in 57,722 diabetic members of Kaiser Permanente, Northern California, aged
RESULTSHistory of prior complications or related outpatient diagnoses were the strongest predictors in each complications set. For patients without previous events, treatment with insulin alone, serum creatinine CONCLUSIONSSimple prediction rules based on automated clinical data are useful in planning care management for populations with diabetes.
Abbreviations: AUC, area under the curve HMO, health maintenance organization OR, odds ratio ROC, receiver operating characteristics
Approximately 4% of most managed care populations have diabetes (1,2), but these patients account for nearly 12% of total health care expenditures (2). These costs also reflect catastrophic events in the lives of patients, because a large fraction of total costs result from hospitalization for disease complications (2). Recognizing the burden of this illness, many managed care organizations have developed intensive diabetes care programs, featuring multidisciplinary clinics or nurse case management (3,4,5) to improve pharmacotherapy, preventive screening, and support for self-care. However, inclusion of all diabetic patients in intensive disease management programs, including those at low risk for complications, would diminish the cost-effectiveness of these programs. Clinical prediction rules (6) are tools created by combining information from clinical data, usually using multivariate analyses, to estimate the probability of an outcome for individual patients. When applied to an entire population of members with diabetes, a prediction rule could be used to identify and rank members by their level of risk for complications. Despite the frequent availability of rich automated clinical data in health plan systems (7), prediction rules have not been widely used for diabetes. Instead, many programs focus solely on poor control of HbA1c levels to identify those in need of more intensive intervention. This study seeks to develop and test a prediction rule using automated clinical data that can be applied at the population level to improve this strategy. Several approaches are compared to identify the simplest rule that efficiently identifies high-risk patients.
This report is based on a retrospective cohort analysis conducted in the Northern California Kaiser Permanente Diabetes Registry. Kaiser Permanente, which is a group model health maintenance organization (HMO), had 2.5 million enrollees during the study period. The registry (2) is an ongoing epidemiological cohort of all HMO members with diabetes identified from four automated databases: pharmacy prescriptions for diabetes medications, abnormal HbA1c values ( 6.7%) in laboratory files, primary hospital discharge diagnoses of diabetes, and emergency department records of diabetes as the reason for visit. During the study period, this registry had a sensitivity of 90% when matched against >1,500 self-reported diabetic patients who responded to two large mailed surveys. The registry missed some diet-controlled subjects and the small proportion of members who never use the HMOs services. The registry also has been found to contain 2.5% false positives or members who do not truly have diabetes (J.V.S., unpublished data).
For these analyses, data gathered from electronic databases and from a mailed survey during the 2-year baseline period (19941995) were used to predict three sets of complications of diabetes occurring during 1996. The study sample consisted of the 57,722 registry members who were aged
Study outcomes
Candidate predictors Other candidate predictors included laboratory results (HbA1c, serum creatinine, and lipoprotein levels), pharmacy prescriptions (for hypoglycemic, lipid-lowering, and antihypertensive agents), outpatient visit counts by type, and responses to a 19941996 mailed survey (with telephone follow-up of nonrespondents) completed by 83% of the study sample. Survey items included demographics, self-reported behaviors, and information used to classify diabetes (age and obesity status at onset and patterns of insulin use). Both clinical and survey databases have relatively high rates of missing data for potential predictors. Approximately 2540% of cohort members had missing values for one or more key predictors, such as baseline HbA1c, serum creatinine, cholesterol, smoking status, and BMI. Because we wished to develop a tool applicable to an entire population, it was important that these subjects were included in the models. We therefore used "missing" categories for several key variables. Continuous predictors were converted to ordered categorical variables for this purpose.
Data analyses After examining bivariate associations of predictors and outcomes, separate stepwise logistic regression models were conducted in the derivation data set to build the best model for each complications set. On parameter estimates, P < 0.01 was required to include a predictor in each best model. Once each model was derived, coefficients for significant predictors were applied to predictor values of the validation data set members. Risk scores for each member were calculated by summing coefficients across all predictors, and the ability of these scores to predict complications in a new population was examined. Based on preliminary analyses, four simpler approaches to identifying and targeting high-risk patients were identified and compared with the best model. At an early stage in our analyses, we noted that events or related outpatient diagnoses during the baseline period were strong predictors of each complications set. Therefore, the first alternative was to use a "prior events" strategy that simply targeted patients with either of these predictors. Preliminary analyses also revealed that risk scores based only on the first three variables entering each model were nearly as sensitive as scores from the best models. Therefore, we evaluated "reduced models" that included only these first three variables.
The third comparison approach tested a simplified numerical risk score derived by replacing significant model coefficients with integer values as follows: a value of 1.0 for a (significant) multivariate odds ratio (OR) between 1.1 and 1.49, 2.0 for an OR between 1.50 and 1.99, and 3.0 for an OR of The fourth strategy was to simply rank patients on the basis of their average HbA1c level during 19941995 and to select patients in descending order of these values. We used percentiles rather than absolute values for cut points because HbA1c distributions vary across populations and over time.
Initial comparisons of these five approaches focused on sensitivity and positive predictive values in the validation data set. Continuous risk scores, which identified the 30% of patients with the highest predicted risk (or the highest HbA1c levels), were compared at the cut point. This cut point was chosen to be consistent with our health plans current policy of planning more intensive interventions for
For patients without prior inpatient events or related outpatient diagnoses, we re-examined the utility of the four remaining approaches. In this subgroup, the number of macro- and microvascular complications, infectious complications, and metabolic complications was greatly reduced, leaving just 723 subjects who experienced at least one event in 1996 (561 with a macro- and microvascular event, 453 with an infectious event, and 95 with a metabolic event). Because all complications are important from a disease management perspective, and in light of the overlap of many important predictors for two or all three sets of complications, we combined these end points and modeled risk for any event. Age, sex, and race were not included in this model, despite associations with one or more outcomes in the models described above, because these characteristics present no options for risk reduction. By removing them from the models, many of the associated, mutable risk factors should contribute more strongly to risk scores. We further excluded the 3.5% of remaining patients with serum creatinine levels
Number of subjects, demographic characteristics, and frequency of each set of complications were similar in the derivation and validation data sets (Table 1). Macro- and microvascular events occurred nearly three times as frequently as infectious events and >10 times as frequently as metabolic complications.
Descriptions of the best models For each complication, predictors are shown in the order of entry into stepwise models (Table 2), with ORs for each level of the predictor, and numerical scores assigned to levels that differed significantly from the referent group. Prior hospitalizations (during 19941995) for similar events were the strongest predictors of both infectious and metabolic complications and the second strongest predictor of macro- and microvascular complications. Related outpatient diagnoses were the strongest predictor of macro- and microvascular events and were also strongly predictive for infectious complications. There were no outpatient diagnoses for metabolic complications. Increasing age was the third predictor to enter macro- and microvascular and infectious complication models; age was inversely related to metabolic complications.
Several clinical predictors were common to two or all three complications sets. Use of insulin alone (i.e., without records of oral hypoglycemic agents) was associated with increased risk for all three complications sets. Hyperglycemia (average HbA1c level >10.0%), not having HbA1c measured during the baseline period, and elevation of total or LDL cholesterol were all associated with both macro- and microvascular and metabolic complications. Elevated serum creatinine level predicted both macro- and microvascular and infectious disease complications. Outpatient macro- and microvascular disease diagnoses were also a strong predictor of infectious disease events. Use of two or more different antihypertensive medications during the baseline period was a strong predictor of macro- and microvascular events. Interestingly, not having had an albuminuria/microalbuminuria screening, as well as the presence of microalbuminuria or albuminuria, predicted macro- and microvascular events.
Comparisons of the best model with simpler approaches
Simpler numerical scores performed nearly as well as risk scores calculated directly from coefficients of the best models for each complications set. ROC curve comparisons did not reveal any significant differences in AUCs between these two scores (for each complication, P > 0.05). All ROC curve comparisons are available upon request (J.V.S.). An approach based on selecting subjects solely on the basis of elevated HbA1c levels was far less efficient for each complication, whether evaluated at the upper 30% cut point or across the entire range using ROC curve comparisons.
Utility of risk scores in subjects without prior events
Cumulative sensitivity for 1996 events across the full range of each risk score is shown in Fig. 1. Model sensitivities were not as high in this patient subgroup as in the full sample because of the absence of the two strongest predictors (prior events and related diagnoses). Nevertheless, all three model-based approaches improved substantially over targeting based on HbA1c alone. The numerical score is shown as a black line because its seven observed scores do not fall at decile cut points. There was essentially no difference in performance between the best model and the numerical score as judged by comparison of ROC curves (P = 0.24). The AUC for the full model was slightly greater than the AUC for the three-variable model (64 vs. 61%, P = 0.03). Identifying patients simply on the basis of their previous HbA1c levels did little better than chance in identifying those at high short-term risk.
It is frequently observed that very small proportions of a population consume a large fraction of total health costs. In this diabetic population, 20% of the members accounted for 79% of the excess costs of care in 1995 (J.V.S., unpublished data), much of which was a result of hospitalization for complications (2). We aimed to develop a tool that could help to identify those members of a population at greatest risk for complications. A relatively short-term (1 year) follow-up period was used in these analyses, because decision makers who fund expensive disease management programs are highly sensitive to short-term financial considerations (11). Pronk et al. (12) have shown that elevated risk factor levels translate to increased costs for diabetic patients in the short term, and two recent trials of intensive interventions for diabetes (5,13) have shown that hospitalization rates and costs of care can be reduced within 12 months. However, the major predictors in our models (hypertension, hyperglycemia, elevated serum creatinine, use of insulin only, albuminuria, and dyslipidemia) are highly consistent with previous epidemiological (14,15,16) and intervention studies (17,18,19,20,21,22,23) that used a long-term perspective.
Several aspects of the findings should be highlighted. The importance of secondary prevention is demonstrated by the very strong predictive power of prior complications and related outpatient diagnoses. Patients with one or both of these markers accounted for well over half of the complications in 1996 and should clearly be among the first targeted by population diseasemanagement programs. More complex prediction scores, such as those developed here, would be most helpful for targeting primary prevention in the remaining 6070% of the diabetic population. HbA1c levels predicted increased risk for each set of complications, but model-based targeting improved substantially on selection that was based on elevated HbA1c levels. The simple numerical score, which proved to be as accurate as the score calculated directly from the best-model coefficients, would be the most convenient approach to applying our findings in other populations. In our data, a score Our analyses also indicate that sufficient information for predicting complications is captured in a very small number of commonly available variables. Even among patients with no prior events or related diagnoses, models containing just three variables were nearly as efficient as much more complex models in predicting short-term risk. Nearly all key variables come from data sources (hospital discharge files, outpatient visit claims, laboratory results, and pharmacy records) that are commonly available in health care systems. Several limitations of these analyses should be kept in mind. First, these risk scores were developed for use by programs that aim to support rather than replace clinical judgment. Although the models confirm the importance of several known clinical risk factors, the model scores derived from automated data are neither sufficiently accurate nor sufficiently complete to supplant decision- making by physicians who treat individual patients in clinical settings. Other information available to the clinician, such as comorbidities or known noncompliance, could easily overrule score-based decisions. The high levels of missing predictor information in our clinically derived data would be considered a serious limitation in epidemiological analyses. However, our aim was to produce a disease-management tool applicable to all members of a population rather than a biological or epidemiological model of complications. By including "missing" as a value for several predictors, we also learned that "missingness" itself can sometimes signal increased risk. We repeated the best models from Table 2, excluding patients with any missing values. Although sample size dropped by as much as 75%, results were essentially identical for macro- and microvascular and infectious models. The metabolic-events model was not interpretable, because the number of end points dropped to 27. Although age was a strong predictor and sex was a weak but significant predictor in at least one of the best models (Table 2), we included neither variable in the final model, because it would make little sense to target only the oldest patients or only one sex for disease management activities. In conclusion, automated data available in many HMOs could be used to more efficiently identify diabetic patients at high risk for complications. As databases derived directly from electronic medical records replace current systems, the precision and completeness of many predictors will improve, which will further add to the accuracy of predictive models.
This research was supported in part by a grant from Pfizer Pharmaceuticals. The authors wish to thank Lyn Wender for her editorial assistance.
Address correspondence and reprint requests to Joe V. Selby, Division of Research, Kaiser Permanente, 3505 Broadway, Oakland, CA 94611. E-mail: jvs{at}dor.kaiser.org. Received for publication 9 January 2001 and accepted in revised form 12 April 2001. A table elsewhere in this issue shows conventional and Système International (SI) units and conversion factors for many substances.
This article has been cited by other articles:
|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||