Diabetes Care 30:2619-2625, 2007 DOI: 10.2337/dc06-2479 © 2007 by the American Diabetes Association
Challenges in Design of Multicenter TrialsEnd points assessed longitudinally for change and monotonicity
1 Peripheral Neuropathy Research Laboratories, Mayo Clinic College of Medicine, Rochester, Minnesota Address correspondence and reprint requests to Dr. Peter J. Dyck, Mayo Clinic, 200 First St., SW, Rochester, MN 55905. E-mail: dyck.peter{at}mayo.edu
OBJECTIVE—Assessing clinimetric performance of diabetic sensorimotor polyneuropathy (DSPN) end points in single and multicenter trials. RESEARCH DESIGN AND METHODS—Assessed were placebo-treated patients with DSPN in the Viatris and Eli Lilly trials and an epidemiologic cohort.
RESULTS—Test reproducibility in clinical trial cohorts (rI CONCLUSIONS—The main reason why it is difficult to demonstrate monotonic worsening of neuropathic end points appears to be a very slow worsening of DSPN, a placebo effect for symptoms and signs, and measurement noise. Demonstrating disease progression in controlled trials of DSPN is more likely when 1) patients with developing rather than established DSPN are selected, 2) type 1 diabetic patients are preferentially recruited, 3) patients are selected who cannot or will not achieve ideal glycemic control, 4) end points chosen are known to show monotonic worsening, and 5) a restricted number of centers and expert examiners (trained, certified, using standard approaches, and reference values and interactive surveillance of tests) are used.
Abbreviations: DCCT, Diabetes Control and Complications Trial DSPN, diabetic sensorimotor polyneuropathy NC, nerve conduction NIS(LL), neuropathy impairment score of lower limbs NSC, neuropathy symptom and change score QST, quantitative sensation test
Rigorous control of glycemia retards or ameliorates diabetic sensorimotor polyneuropathy (DSPN), but long and expensive trials are needed (1–4). No adjuvant treatment, in addition to glycemic control, has achieved sufficient efficacy, considering side effects, to obtain regulatory approval (e.g., by the Food and Drug Administration) (5–13). This contrasts to interventions ameliorating sensory symptoms of pain or autonomic symptoms (gastric atony, diarrhea, and sexual dysfunction) receiving approval (14–20). Therefore, we ask, are controlled clinical trials of ancillary treatments doable, which end points should be used, and how long and rigorous do the trials need to be? Assuming that adjuvant efficacious treatments exist, why has it been so difficult to demonstrate efficacy? Possibilities are 1) interventions are not efficacious; 2) present diabetes care inhibits development of complications; 3) other diabetic complications (hypertension, hyperlipidemia, renal disease, and other) with possible adverse effects on DSPN (21) are now managed better; 4) the wrong kind, stage, or duration of DSPN is studied; 5) end points chosen are insufficiently sensitive, specific, monotonic (measuring a consistent trend of worsening or improvement with time), or generalizable (for study at many medical centers); 6) excessive recruitment of type 2 diabetic patients showing little change with time and excessive variability of measured end points; 7) both placebo and treated patients receive better than usual medical treatment (e.g., of hyperglycemia) while in the trial; and 8) studies are insufficiently powered. The present studies focus on the performance of neuropathy end points in a longitudinal epidemiology survey in Rochester, Minnesota (22), and in placebo arm patients of two pharmaceutical trials, the Viatris trial and the Eli Lilly trial, of mild severity DSPN in type 1 and type 2 diabetes.
Patient cohorts studied The three cohorts studied are dissimilar in important ways but are alike in having mild or moderate severity DSPN (stage 2B and 3 [23] are excluded), and severity of neuropathy was evaluated by the same approaches and criteria. Minimal criteria for the diagnosis of DSPN were the same: i.e., 5 NC nds (5 attributes of nerve conduction expressed as normal deviates) 95th percentile, neuropathy impairment score of lower limbs [NIS(LL)] 2 points, or Diabetes Control and Complications Trial (DCCT) criteria (two approaches). The epidemiologic study patients are consenting individuals with diabetes and distal symmetric DSPN from Rochester, and later Olmsted County, Minnesota (Rochester Diabetic Neuropathy Study) (22).
The Viatris trial patients (Nathan 1, Viatris) are placebo-treated individuals from a prospective double-blind multicenter trial of oral
The Lilly trial patients are placebo-treated individuals of two prospective double-blind multicenter trials (B7A-MC-MBCW and MBBP) of LY 333531 (a specific protein kinase C-ß inhibitor) for the symptoms of DSPN during a 1-year period. The criteria for study entry were as follows: diagnosis of DSPN, neuropathic symptoms (a symptom score >6 points), vibration detection threshold of the great toe using CASE IV
Personnel and performance of tests In the epidemiologic cohort, one of us (P.J.D.) performed all clinical evaluations. In the Viatris and Lilly trials, trained and certified (for the study) neurologists or diabetologists did the examinations. Nerve conductions were performed by neurologists or electromyography technicians under the supervision of physicians. Quantitative sensation testing was done by technicians using CASE IV (a computer-based system using standard and calibrated stimuli, predetermined and validated algorithms of testing, and null stimuli and reference values [percentiles and normal deviates] from the Rochester Diabetic Neuropathy Study cohort of healthy subjects) (24,25). All neurological evaluation sheets (Clinical Neuropathy Assessment, see below) were sent to a central reading and quality assurance center to interactively recognize omissions and errors, to electronically calculate scores, and to enter data into an electronic database (26).
Use of standard and independent tests
Analyses For reproducibility, intraclass correlations were used. For concordance between tests, Spearman rank-order correlations were used. For change with time, neuropathic end points were regressed on time and the derived slope was tested for statistical significance.
Study limitation
Demographic and disease characteristics In patients from the untreated (the epidemiologic study) and placebo arm (the Viatris and Lilly trials), glycemic control (A1C 8–9%) was not ideal (Table 1). All patients studied had mild-severity DSPN. However, by the criteria of the median values of NIS(LL), 5 NC nds, 3 QST tests nds, and heart rate with deep breathing nds, the Viatris trial cohort was most severe, followed by the Lilly trial and then by the epidemiologic cohort. By the criteria of symptoms (NSC severity) and DCCT criteria (summated values giving equal weights for decreased ankle reflexes, clinical vibration of toes, and neuropathic symptoms [from NSC]), the order of most to least severe was the Lilly trial, the Viatris trial, and the epidemiologic cohorts.
Reproducibility of nerve tests Reproducibility of neuropathy end points could only be assessed for the Viatris trial and the Lilly trial. Generally, reproducibility was in the range of rI = 0.7–0.85 (Table 2), somewhat lower than previously published values mainly from the epidemiologic cohort.
Concordance (r2 as percentages) between pairs of neuropathy end points There is considerable variability of associations between pairs of end points and among cohorts (Table 3). For many pairs of end points, the percent variability explained by the data are <10%; but for others, it was in the range of 15–35%. For the epidemiologic cohort, the Viatris trial, and the Lilly trial, neurological signs (NIS[LL]) were correlated with symptoms to a substantial degree, i.e., 6, 12, and 15% at the first and 15, 24, and 16% at the last examination, respectively. By comparison, 5 NC and NIS(LL) had values of 11, 17, and 0% and 14, 19, and 1%, respectively—a low correlation in the Lilly trial at both examinations. The mean values of the percent correlations were similar between the epidemiologic cohort and the Viatris trial at first and at last evaluations. It was lower for the Lilly trial cohort. In all cases, mean correlations were higher at the last examination than at the first examination.
Consistency of end point worsening over time (monotonicity) Estimates of monotonicity varied somewhat depending on criteria for diagnosis of DSPN (Table 4). Therefore, it was assessed using three criteria for the diagnosis of DSPN, i.e., NIS[LL] 2 points, 5 NC nds 95th, and DCCT with two or more of three criteria (two varieties). By these criteria, sural sensory nerve amplitude performed best, significantly worsening eight of a possible nine times, with the ninth time not showing a significant change. Peroneal nerve motor conduction velocity showed significant worsening in two of nine cases. Heart rate change with deep breathing showed neither worsening nor improvement. Improvement was found for ankle reflexes (significantly four of nine times); NSC(lower limbs) severity also showed significant improvement (three of nine times). Summated DCCT criteria showed significant improvement in three of nine cases. Tibial motor distal latency showed significant improvement in three cases.
These results were confirmed and strengthened by assessment of longitudinal data in patients in the epidemiologic cohort with DSPN for the full duration of study (median 9.5 years) (Table 5, available from the authors upon request). The end points showing consistent monotonic worsening without improvement were as follows: sural sensory amplitude, clinical vibration of toe (significant two of four times), and 3 QST nds (significant four of four times).
Testing for the effect of restricting entry into study by a percentile value of an end point and then following course using the same but unrestricted end point
This study does not directly address all of the possible reasons why it has been difficult to demonstrate efficacy from adjuvant treatments of DSPN but deals with issues of reproducibility, concordance between pairs of tests, monotonicity, and the rate of worsening over prolonged times in mild DSPN. Our results show that the main reason is that DSPN worsens very slowly and that many end points are not sufficiently sensitive, accurate, representative, reproducible, and especially monotonic to reliably recognize the small worsening occurring over a period of 4 years. The magnitude of worsening measured over a period of 4 years was very small and was not accompanied by increased neurological signs or symptoms, both showing improvement (albeit small) rather than worsening. One possible explanation for why only a slight deterioration was observed might be our use of the commonly used practice in controlled clinical trials to "carry last observations forward" with the consequence of underestimating actual deterioration. Here, it was not the reason for the low rate of worsening, since results were essentially unchanged with deletion of "carried forward" data. Because this slow rate of worsening was found for all three cohorts and had also been observed previously in DCCT studies (1,2), we attribute it to very slow worsening of DSPN.
Assuming that DSPN worsens very slowly and that adjuvant treatments are unlikely to improve nerve function to a greater degree than from euglycemia, end points needed to detect a difference between treatment and placebo must have excellent clinimetric characteristics. If, on the other hand, an adjuvant treatment causes a greater improvement than that from institution of euglycemia, one assumes that present monitoring approaches or analyses would be able to recognize and track improvement. Many of the end points studied were reasonably reproducible and concordant. By the criteria of monotonicity, some of the end points performed poorly. This was especially true for symptoms and neurological signs, which typically showed significant improvement rather than worsening. The failure of these clinical measures to perform well raises a serious question about the adequacy of the recently published research case definition of polyneuropathy (27). The end points that performed best were sural nerve amplitude and to a lesser degree peroneal motor conduction velocity, both showing monotonic worsening. This worsening is all the more meaningful, as we had corrected for the influence of change in age and weight. Considering the epidemiologic cohort patients with DSPN followed for the full duration of study (median 9.5 years), sural nerve amplitude continued to show significant monotonic worsening three of four times. By contrast, peroneal motor conduction velocity did not show monotonic worsening. The composite score ( What inferences about the conduct of future therapeutic trials of DSPN can be drawn from the present analysis, and are multicenter trials doable? Several important insights about how to improve future trials can be inferred. First, since near euglycemia prevents or ameliorates DSPN, success is more likely if patients who cannot, or will not, achieve good chronic glycemic control are included. Second, end points should be chosen differing on whether patients without or with DSPN are to be recruited for study—composite scores of NC and heart rate with deep breathing emphasized for the former and the two attributes of NC and composite score of QSTs identified here for the latter. Third, studies will need to be done for long times to show a treatment effect. Perhaps the best stage of DSPN for conduct of trials is patients on the verge of developing DSPN. These patients showed monotonic worsening of composite scores of nerve conduction and, to a lesser degree, of heart rate variability with deep breathing (28). Type 1 diabetic patients are preferable to type 2 diabetic patients because there is less variability of normal test results due to their younger age and because polyneuropathy worsens to a greater degree in type 1 diabetes (25,29). However, we acknowledge that selection of other patients, other stages of complication, or other methods of analyses may be shown to improve study performance unlike that shown here. The monotonic improvement of clinical signs and symptoms in placebo arm trial patients of controlled clinical trials demonstrated here is of concern, especially in light of their historical use in DSPN (30,31) and their importance in the recently published case definition of polyneuropathy (27). We judge the observed improvement we observed in the placebo arm of trials is mainly due to a placebo effect. Measures of neurological signs and symptoms, although good enough to demonstrate a large therapeutic response (e.g., in immune neuropathies [32,33]), are not as able to do so in DSPN with low rates of worsening. Finally, we note that restricting entry into study by use of a percentile abnormality of an end point and then using the same end point to measure change may result in a spurious estimate of improvement. Therefore, it is advisable to either incorporate a run-in phase and subsequent baseline measurements for analyses or use different criteria for entry into study than used for purposes of follow-up.
The work was supported in part from grants received from the National Institute of Neurologic Disorders and Stroke (NS36797). The authors thank patients and investigators of the Viatris and Eli Lilly trials, patients in the Rochester Diabetic Neuropathy Study, and Vicki Clark and Mary Lou Hunziker for their help. Additional contributing authors are Vladimir Skljarevski, MD, Eli Lilly and Co., Indianapolis, Indiana; and Phillip A. Low, MD, P. James B. Dyck, MD, Christopher J. Klein, MD (Peripheral Neuropathy Research Laboratories), and L. Joseph Melton, III, MD (Division of Epidemiology), Mayo Clinic College of Medicine, Rochester, Minnesota.
Published ahead of print at http://care.diabetesjournals.org on 18 May 2007. DOI: 10.2337/dc06-2479.
* See the ACKNOWLEDGMENTS for additional contributing authors. A table elsewhere in this issue shows conventional and Système International (SI) units and conversion factors for many substances. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. Received for publication January 12, 2007. Accepted for publication May 9, 2007.
This article has been cited by other articles:
|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||