The choice of outcome success thresholds may influence clinical management, pay-for-performance, and assessment of value-based care.
ObjectiveTo evaluate outcomes success thresholds in older adults using two different methods: 1) Minimal clinically important differences (MCIDs) of the Quick-DASH and 2) Dichotomization of the Quick-DASH based on low disability rating at discharge
DesignAn observational design (retrospective database study).
SettingDataset of 1109 patients with shoulder disorders.
Participants297 older adults patients who were diagnosed with rotator cuff related shoulder disorders and were managed through physical therapy treatment.
Main outcome measuresWe categorized and calculated how many patients met 8.0 and 16.0 point changes on the Quick-DASH. To evaluate outcomes success thresholds using dichotomization, patients who discharge score of ≤20 on the Quick-DASH were considered positive responders with successful outcomes.
ResultsThe percentage of positive responders who met the MCID thresholds for the Quick-DASH were 63.3% using MCID of 8.0 points, 39.7% using the MCID of 16.0 points, and 46.12% who met discharge score of ≤ 20 on the Quick-DASH. 39.0% met both MCID of 8.0 points and discharge score of ≤ 20 on the Quick-DASH. Only 28% met both MCID of 16.0 points and discharge score of = 20 on the Quick-DASH.
ConclusionThree different success threshold derivations classified patients into three very different assessments of success. Quick-DASH scores of ≤ 20 represent low levels of self-report disability at discharge and can be a stable clinical option for a measure of success to capture whether a treatment results in meaningful improvement.
Shoulder pain is among the most common musculoskeletal disorders, with a lifetime prevalence ranging from 6.7 to 67% in the general population.1 Shoulder pain is reported to result in limitations in social, work, and essential daily activities.2,3 Rotator cuff disorders are the most common cause of shoulder pain3,4 and frequently include degeneration of the rotator cuff musculature.4,5 Previous studies have indicated that rotator cuff disorders have been found to increase in prevalence with age6,7 and to persist in older people.3
Healthcare providers routinely use assessment results regarding patients status changes to monitor their prognosis and to make treatment decisions.8 Health status scales and questionnaires are unique because they provide a self-report of patients’ perspectives about their own health status.9 One of the most often used instruments for analyzing pain and disability of shoulder disorders is The Disabilities of the Arm, Shoulder and Hand questionnaire (DASH) outcome measure and its shortened version (Quick-DASH).10,11 A recent systematic review12 outlined these questionnaires as the most studied for patients with rotator cuff disorders. The questions address daily and social activities, which patients generally find difficult to perform because of pain.13,14 Both instruments are reliable, valid and responsive.4,11 Beyond this, in order for these instruments to be clinically useful, they must demonstrate an ability to detect patient's perspective of change (interpretability), otherwise known as minimal clinically important difference (MCID).15 The interpretation of change scores are essential to compare the current and previous assessment results of an outcome measure.8 Thus, by definition, changes in DASH scores exceeding the MCID are clinically relevant.9
Although readily adopted into clinical practice, the MCID score has been questioned for its stability and utility.15 At present, the MCID is calculated by at least nine methods using two general methodologies-distribution-based and anchor-based.9,12,15,16 Anchor-based methods are the preferred methodology (there are four methods this is used) as it is based on the patient's perception as the anchor for comparison.9,17 Further, calculations of an MCID are mediated by various factors, such as the health condition, descriptive factors (i.e. age and level of education), the outcome being measured, its baseline score, the time between baseline and follow-up and the intervention.16–18 Thus, healthcare providers must be cautious in accepting the transferability of an MCID score across all populations with varying levels of severity.11,15,16 Regarding the MCID for the Quick-DASH, previous studies indicated that there is a wide variability in this measure ranging from 8.0 to 16.0 points.11,12,19,20
Previous studies exploring shoulder disorders have focused on investigating the MCID in young patients (<50 years).11,19,20 Further investigation of the MCID for Quick-DASH is needed in older adults (≥65)21 since rotator cuff disorders are prevalent in this population.6,7 It is also important to explore if patients who meet the MCID also self-report low disability ratings at discharge. A study such as this has merit since some countries’22 healthcare providers are incentivized through compensation (i.e. pay-for-performance program) to provide better-quality care based on outcome and/or process measures and MCID threshold values.23,24 One of the obstacles to the success of pay-for-performance is the lack of valid and reliable information on care outcomes.24 Consequently, we aimed to evaluate outcomes success thresholds in older adults using two different mechanisms: (1) reported MCIDs of Quick-DASH and (2) dichotomization based on low disability rating at discharge. We hypothesized that our different measures of success will lead to the capture of distinctly dissimilar populations.
MethodsReporting guidelinesThis study used the REporting of studies Conducted using Observational Routinely collected Data (RECORD) to guide the reporting.25
Study designThe study was an observational design (retrospective database study).
Participants and settingParticipants included patients of both sexes, age ≥65 years, who were diagnosed with shoulder disorders (i.e. rotator cuff disorders) and were treated in a physical therapy clinic. These patients were extracted from a dataset of 1109 individuals with shoulder related diagnoses, who were treated from 2016 to 2017. We targeted the International Classification of Disease (ICD-9) codes consistent with rotator cuff related shoulder pain, which encompasses a spectrum of shoulder conditions including: subacromial pain (impingement) syndrome, rotator cuff tendinopathy, and symptomatic partial and full thickness rotator cuff tears.26 We targeted this subgroup of older adults (≥65) in an attempt to homogenize the shoulder diagnostic properties of the population.6,7 All patients signed a written consent form.
This study was approved by the Institutional Review Board of Duke University, Durham, NC, USA (Pro0092671).
TreatmentThe study evaluated outcomes from patients who were treated by physical therapists from multiple clinical practices. Treatment was pragmatic and was based on each patient's impairments and presentation. Interventions were coded by Common Procedural Treatments (CPT) and the majority of individuals received exercise-only based codes (57.31% (SD=14.60)) or manual therapy (20.42% (SD=9.82)). Over twenty percent of individuals received passive care options (22.27% SD=12.31)).
Descriptive variablesPatient characteristics were captured at baseline including age, sex, body mass index (BMI), diagnosis, total visits, total days of care, total number of comorbidities, household income, educational status, baseline disability (measured by Quick-DASH questionnaire) and baseline health-related quality of life (physical and mental components score, measured by 12-item Veteran's RAND Health Survey).27 The VR-12 is a reliable health questionnaire developed from a modified version of Short Form Survey (SF-36).28 Higher scores of the VR-12 indicate better health status.27
Outcome measuresAll patients completed the Quick-DASH questionnaire at baseline and at the end of a tailored physical therapy program. The Quick-DASH is a short version of the original DASH, developed by Beaton et al.13 in 2005, and contains 11-items, beyond an optional work and sport module.13,14 Each item has 5 response options (no difficult, mild difficult, moderate difficult, severe difficult and unable). The overall score ranges from 0 (no disability) to 100 points (most severe disability).29,30 Because of the shorter nature of the Quick-DASH, it may allow for expedited completion in the clinic setting.29,31
Data sources/measurementAll data were extracted from the ATI Patient Outcomes Registry. The ATI Patient Outcomes Registry collects observational, epidemiologic, financial and clinical data that supports innovative approaches to physical therapy and provides broader awareness of patient outcomes.32
The study was registered in ClinicalTrials.gov (NCT02285868) and the U.S. Department of Health and Human Services’ Agency for Healthcare Research and Quality in the Registry of Patient Registries (2608). The de-identified data were not re-coded or manipulated and represented the raw form of findings from clinical practice. Missing values were only present in 0.71 percent of variables. Nearly 85% of cases had complete variables.
Statistical methodsAll statistical analyses were performed using Statistical Package for the Social Sciences (SPSS) version 25.0 (Chicago, USA).33 To evaluate outcomes success thresholds using MCID we categorized and calculated how many patients did and did not meet a minimum of 8.0 point changes and a minimum of 16.0 point changes in Quick-DASH.11 The choice of these MCIDs was based on previous studies11,19 which investigated MCIDs for Quick-DASH using the point on the ROC curve19 and a triangulation of distribution and anchor-based approaches.11 We selected low and high MCIDs for Quick-DASH to better explore the limits of the MCID measures. These previous studies also calculated the standard error of measurement/SEM (which links the reliability to the standard deviation of the sample)11 and the minimum detectable change/MCD, (which is considered the smallest change in score, based on the SEM).11 The Standard error of measurement (SEM) calculated from 8.0-point changes in Quick-DASH is 4.8, which corresponds with minimum detectable change (MDC) values of 11.2 percentage points.19 The SEM and MDC calculated from 16.0-point changes in Quick-DASH are 5.51 and 12.85, respectively.11
To evaluate outcomes success thresholds using dichotomization based on self-rated disability at discharge, patients who scored ≤20 on the Quick-DASH were considered positive responders to treatment with low disability. The choice of this threshold was based on previous studies, which investigated the MCID in patients with other musculoskeletal disorders.34,35 Descriptive statistics were performed to describe all variables using univariate analyses of variance (ANOVA) with a Bonferroni Hochberg correction36 and Chi square analyses. Statistical significance was initially defined as p-value <0.05. We also compared the numbers of subjects who met success in both categories using a 2×2 contingency table.
ResultsFrom a dataset of 1109 individuals, 297 older adults with shoulder disorders who received physical therapy treatment were included (Fig. 1). Baseline characteristics of the patients are described in Table 1. Most of the study participants were women, overweight, with a mean age of 73 years (SD=6.82). The most frequent rotator cuff disorders diagnoses identified was bursa and tendon disorders. Total comorbidities ranged from 0 to 19 (mean=3.56; SD=2.45); total visits ranged from 3 to 36 (mean=14; SD=6) and total days of care ranged from 0 to 5491 (mean=113.68; 603.76). The baseline mean for the Quick-DASH and VR-12 scores demonstrated moderate levels of disability and health quality, respectively.
Patient demographic and clinical characteristics (n=297).
Variables | Mean (SD)/frequency | Range or percentage |
---|---|---|
Age | 73 (6.82) | 65–101 |
Sex | ||
Female | 172 | 57.9 |
Male | 125.0 | 42.1 |
Body mass index (BMI) | 29 (6.58) | 17.54–66.56 |
Diagnosis (ICD-9) | ||
Bursa and tendon disorders (726.10) | 140 | 47.1 |
Calcifying tendinitis of shoulder (726.11) | 1 | 0.3 |
Unspecified shoulder lesions (726.19) | 9 | 3.0 |
Complete rupture of rotator cuff (727.61) | 55 | 18.5 |
Rotator cuff (capsule) sprain (840.40) | 92 | 31.0 |
Total visits | 14 (6.00) | 3–36 |
Total days in care | 113.68 (603.76) | 0–5491 |
Total number of comorbidities | 3.56 (2.45) | 0–19 |
Household income (USD) | 64,052.34 (12,611.76) | 37,460.00–98,704.00 |
Education statusa | 10.75 (3.85) | 3.76–19.07 |
Baseline MCS of VR-12 | 42.55 (5.85) | 25.76–58.26 |
Baseline PCS of VR-12 | 38 (5.81) | 22.60–54.04 |
Initial Quick-DASH | 39.51 (16.81) | 16.00–88.00 |
ICD, International Diagnosis Code; USD, American dollar; MCS, mental component score; PCS, physical component score; VR, veteran's rand; DASH, The Disabilities of the Arm, Shoulder and Hand questionnaire. Higher scores for DASH indicates more disability. Higher scores for VR-12 indicates higher level of health.
Tables 2 and 3 display the results of demographic characteristics across measures of success (MCID of 8.0 and 16.0 and discharge score of ≤20 on the Quick-DASH, respectively). We found significant differences in initial Quick-DASH scores and total of comorbidities (p<0.05) among patients who met and did not meet the MCID point changes (8.0 and 16.0) on the Quick-DASH. Patients who met the MCID of 8.0- and 16.0-point change on the Quick-DASH had higher initial Quick-DASH score and had lower number of comorbidities.
Demographic characteristics across measures of success (MCID of 8.0 points and low disability at discharge).
Variables | Met 8.0 point change in Quick-DASHn=188 | Did not meet 8.0 point change in Quick-DASHn=109 | p value | Discharge score of ≤20 on the Quick-DASHn=137 | Discharge score of >20 on the Quick-DASHn=160 | p value |
---|---|---|---|---|---|---|
Age | 72.73 (6.48) | 73.45 (6.80) | 0.38 | 72.59 (6.45) | 73.35 (7.13) | 0.33 |
Sex | 0.31 | 0.04 | ||||
Female | 113 | 59 | 71 | 101 | ||
Male | 75 | 50 | 66 | 59 | ||
Body mass index (BMI) | 29.05 (7.07) | 28.88 (5.67) | 0.83 | 28.83 (6.47) | 29.12 (6.76) | 0.71 |
Diagnosis (ICD) | 0.86 | 0.55 | ||||
Bursa and tendon disorders (726.10) | 87 | 53 | 63 | 77 | ||
Calcifying tendinitis (726.11) | 1 | 0 | 1 | 0 | ||
Unspecified shoulder lesions (727.19) | 5 | 4 | 4 | 5 | ||
Complete rupture of rotator cuff (272.61) | 34 | 21 | 22 | 33 | ||
Rotator cuff (capsule) sprain (840.40) | 61 | 31 | 47 | 45 | ||
Baseline MCS of VR-12 | 42.58 (5.92) | 42.50 (5.74) | 0.90 | 43.42 (4.94) | 41.81 (6.45) | <0.01 |
Baseline PCS of VR-12 | 37.82 (5.72) | 38.28 (5.98) | 0.50 | 38.64 (5.28) | 37.43 (6.20) | 0.07 |
Initial Quick-DASH | 41.36 (16.63) | 36.31 (16.70) | <0.01 | 31.12 (12.63) | 46.69 (16.64) | <0.01 |
Total visits | 14.09 (5.70) | 13.95 (6.51) | 0.84 | 13.59 (5.54) | 14.42 (6.37) | 0.23 |
Total days in care | 107.35 (603.24) | 124.67 (607.71) | 0.82 | 86.70 (507.67) | 136.45 (675.28) | 0.50 |
Total number of comorbidities | 3.34 (2.42) | 3.93 (2.47) | 0.04 | 3.02 (1.94) | 4.01 (2.74) | <0.01 |
Household income (USD) | 63,782.78 (12,913.92) | 64,517.46 (12,121.17) | 0.64 | 64,342.82 (13.745,99) | 63,825.17 (11,688.74) | 0.73 |
Education status | 10.80 (3.81) | 10.68 (3.93) | 0.80 | 10.83 (3.93) | 10.70 (3.80) | 0.78 |
Categorical variables are expressed as number and continuous variables are expressed as mean (SD). ICD, International Diagnosis Code; USD, American dollar; MCS, mental component score; PCS, physical component score; VR, veteran's rand; DASH, The Disabilities of the Arm, Shoulder and Hand questionnaire. Higher scores for DASH indicates more disability. Higher scores for VR-12 indicates higher level of health.
Demographic characteristics across measures of success (MCID of 16.0 points and low disability at discharge).
Variables | Met 16.0 point change in Quick-DASHn=118 | Did not meet 16.0 point change in Quick-DASHn=179 | p value | Discharge score of ≤20 on the Quick-DASHn=137 | Discharge score of >20 on the Quick-DASHn=160 | p value |
---|---|---|---|---|---|---|
Age | 72.82 (7.72) | 73.11 (6.42) | 0.73 | 72.59 (6.45) | 73.35 (7.13) | 0.33 |
Sex | 0.11 | 0.04 | ||||
Female | 75 | 97 | 71 | 101 | ||
Male | 43 | 82 | 66 | 59 | ||
Body mass index (BMI) | 28.82 (6.71) | 29.10 (6.50) | 0.72 | 28.83 (6.47) | 29.12 (6.76) | 0.71 |
Diagnosis (ICD) | 0.17 | 0.55 | ||||
Bursa and tendon disorders (726.10) | 52 | 88 | 63 | 77 | ||
Calcifying tendinitis (726.11) | 1 | 1 | 1 | 0 | ||
Unspecified shoulder lesions (727.19) | 1 | 8 | 4 | 5 | ||
Complete rupture of rotator cuff (272.61) | 26 | 47 | 22 | 33 | ||
Rotator cuff (capsule) sprain (840.40) | 38 | 69 | 47 | 45 | ||
Baseline MCS of VR-12 | 42.09 (6.02) | 42.32 (5.74) | 0.40 | 43.42 (4.94) | 41.81 (6.45) | <0.01 |
Baseline PCS of VR-12 | 37.38 (5.36) | 38.38 (6.07) | 0.15 | 38.64 (5.28) | 37.43 (6.20) | 0.07 |
Initial Quick-DASH | 44.81 (16.60) | 36.01 (16.05) | <0.01 | 31.12 (12.63) | 46.69 (16.64) | <0.01 |
Total visits | 13.59 (5.56) | 14.34 (6.28) | 0.29 | 13.59 (5.54) | 14.42 (6.37) | 0.23 |
Total days in care | 138.56 (748.17) | 96.55 (481.88) | 0.58 | 86.70 (507.67) | 136.45 (675.28) | 0.50 |
Total number of comorbidities | 3.05 (2.06) | 3.89 (2.63) | <0.01 | 3.02 (1.94) | 4.01 (2.74) | <0.01 |
Household income (USD) | 64,324.00 (13,041.00) | 63,876.00 (12,362.00) | 0.77 | 64,342.82 (13.745,99) | 63,825.17 (11,688.74) | 0.73 |
Education status | 10.85 (3.77) | 10.69 (3.91) | 0.73 | 10.83 (3.93) | 10.70 (3.80) | 0.78 |
Categorical variables are expressed as number and continuous variables are expressed as mean (SD). ICD, International Diagnosis Code; USD, American dollar; MCS, mental component score; PCS, physical component score; VR, veteran's rand; DASH, The Disabilities of the Arm, Shoulder and Hand questionnaire. Higher scores for DASH indicates more disability. Higher scores for VR-12 indicates higher level of health.
Four variables (sex, baseline MCS of VR-12, initial Quick-DASH and total of comorbidities) revealed significant difference (p<0.05) between with self-reported low (≤20) versus high (>20) disability scores on the Quick-DASH. Patients who met the discharge score of ≤20 were predominantly women, had higher MCS of VR-12 (better mental health), lower initial Quick-DASH scores (lower disability) and lower number of comorbidities.
Table 4 reflect the overlap in patients meeting both definitions of success (low and high MCIDs versus low disability at discharge). From 297 patients, 116 patients (39.0%) met both MCID of 8.0 points and discharge score of ≤20 in Quick-DASH. Only 83 patients (28%) met both MCID of 16.0-point changes and discharge score of ≤20 in Quick-DASH.
Two by two table reflecting overlap in patients meeting both definitions of success.
A. MCID of 8.0 point change in Quick-DASH and discharge score of ≤ 20 on the Quick-DASH | Discharge score of ≤ 20 on theQuick-DASH | Discharge score of > 20 on theQuick-DASH |
---|---|---|
Met 8.0 point change in Quick-DASH | 116a(39%) | 72 (24%) |
Did not meet 8.0 point change in Quick-DASH | 21 (7%) | 88b(30%) |
B. MCID of 16.0 point change in Quick-DASH and discharge score of ≤ 20 on the Quick-DASH | Discharge score of ≤ 20 on theQuick-DASH | Discharge score of >> 20 on theQuick-DASH |
---|---|---|
Met 16.0 point change in Quick-DASH | 83a(28%) | 35 (12%) |
Did not meet 16.0 point change in Quick-DASH | 54 (18%) | 125b(42%) |
MCID, minimal clinically important difference.
This study sought to evaluate outcome success thresholds in older adults by using MCID and dichotomization based on self-reported disability scores at discharge. We hypothesized that our various measures of success would capture distinctly dissimilar populations, and indeed that was evident. Previously published Quick-DASH thresholds for MCID were used in this study.11,19 To better investigate the influence of a wider range in MCID values, we selected two different MCIDs (8.0 and 16.0 points for Quick-DASH). This study has importance since the way “success” is defined in a study can influence responders analyses,37 and could potentially influence reimbursement in pay for performance environments.
In our study, the percentage of positive responders who met the defined MCID thresholds were 63.3% using MCID of 8.0 points, 39.7% using the MCID of 16.0 points for the Quick-DASH, and 46.12% who met the discharge score of ≤20 on the Quick-DASH. As hypothesized, the definitions of “success” often did not overlap when comparing two different definitions. Indeed, different thresholds of success reflect different patient types and qualities. Notable significant differences in descriptive variables were present across definition comparisons. Further, when we evaluated whether the “success” definitions captured the same individuals, only 39.0%, and 27.9% were the same across our two comparisons (Table 4).
We feel that the notable differences across comparisons are even more compelling based on our method of patient selection. Rotator cuff related shoulder pain is more common in the older people.11,19,20 Patient age may increase the chance of having concomitant factors (e.g. degeneration, decreased vascularity, full thickness rotator cuff tears) that increase the likelihood of undergoing surgical repair and presenting re-tears following rotator cuff repair.38–42 We targeted individuals age 65 and older with rotator cuff specific disorders. We did so for several reasons, one being that we were keen to homogenize the patient population to a “like type” category. We feel that this subgroup from our original dataset of 1109 individuals should increase the likelihood of success measures identifying a similar patient phenotype. Nevertheless, we observed that patient-related findings dictated success grouping. We also observed that patients who have higher initial Quick-DASH scores (higher disability) and lower number of comorbidities are more likely to meet both MCID thresholds. In contrast, patients who present a better health quality, lower initial Quick-DASH scores and lower number of comorbidities were more likely to present with low disability at discharge. Indeed, previous studies have indicated that patient demographics and patient baseline status can significantly influence the MCID score.15,43
Our results are particularly important for the healthcare providers, policy makers, and insurance providers in light of today's environment associated with financial incentive programs (i.e. pay-for-performance). Pay-for-performance has become an increasingly common central strategy in the drive to improve health care.22 This program provides financial rewards or penalties to individual/group healthcare providers or institutions according to their performance on measures of quality.44 Reimbursing based on the MCID or threshold score may mean that clinicians will cherry pick selected patients (or potentially, avoid treating them) based on baseline criteria. Selected population such as those with chronic pain/disorders often fail to improve with conventional care and are especially vulnerable to pay-for-performance caveats.23
Individual healthcare providers, group providers and institutions should be aware that determining whether or not a patient successfully responded to treatment15 based solely on a defined MCID threshold may not truly represent a patient's recovery at discharge. Further, the MCID may be less or greater (as expected) than MDC given that is calculated based on patient response-anchored method, whereas the MCD is calculated as a statistical threshold based on SEM.15,45 Clinically, MDC alone does not provide information regarding the clinical significance of minimal amount of change that is free of random variation in measurement.45 One should especially be concerned of situations where the MCID is less than the MDC; the change associated with the MCID is not substantial enough to account for the error in the instrument. Policy makers and insurance providers should also consider and incorporate this information into healthcare pay-for-performance programs. We feel and others have reported that a discharge Quick-DASH score of ≤20 is holds face validity and represents those with low disability,34 but to our knowledge, it is not commonly used in clinical practice. Our results suggest that this measure, at a minimum, reasonably captures whether a treatment results in meaningful improvement.
To our knowledge, this is the first study to examine three measures of success to capture disability improvement in older adults with a diagnosis of rotator cuff disorder. Our sample size of 297 patients allows for better generalizations since it homogenized subjects into similar diagnostic categories. Our results should be viewed within the limitations of our study. Slightly over thirty individuals had baseline Quick-DASH scores that were less than 20 points, suggesting low initial disability. However, all subjects had Quick-DASH scores of 16 or greater, allowing all to theoretically meet the proposed MCID scores. Retrospective, observational registry studies can only study associations and not causality.46 In addition, we relied on multiple clinicians and researchers, for the accuracy of conduct and reporting at the time of documentation. Care varied among participants as did number of visits and follow up (3–36 visits). Pain intensity and some other psychosocial variables (e.g. anxiety, depression, catastrophizing, fear of movement) data and long-term follow up data were not available.
ConclusionDetermining whether a patient successfully responded to treatment based solely on a defined MCID threshold may not truly represent a patient's disability report at discharge. Three different measures of “success” scores identified three very different populations in older adults. Scores of ≤20 on the Quick-DASH represent low levels of self-report disability at discharge and can be a stable clinical measure of success to capture whether a treatment results in meaningful improvement.
Ethical approvalThis study was approved by the Institutional Review Board of Duke University, Durham, NC, USA (Pro0092671).
FundingThis research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.
Conflicts of interestThe authors declare no conflicts of interest.