Skip to content
Publicly Available Published by De Gruyter October 1, 2016

Rasch analysis resulted in an improved Norwegian version of the Pain Attitudes and Beliefs Scale(PABS)

  • Nicolaas D. Eland EMAIL logo , Alice Kvåle , Raymond W.J.G. Ostelo and Liv Inger Strand

Abstract

Background and aim

There is evidence that clinicians’ pain attitudes and beliefs are associated with the pain beliefs and illness perceptions of their patients and furthermore influence their recommendations for activity and work to patients with back pain. The Pain Attitudes and Beliefs Scale (PABS) is a questionnaire designed to differentiate between biomedical and biopsychosocial pain attitudes among health care providers regarding common low back pain. The original version had 36 items, and several shorter versions have been developed. Concern has been raised over the PABS’ internal construct validity because of low internal consistency and low explained variance. The aim of this study was to examine and improve the scale’s measurement properties and item performance.

Methods

A convenience sample of 667 Norwegian physiotherapists provided data for Rasch analysis. The biomedical and biopsychosocial subscales of the PABS were examined for unidimensionality, local response independency, invariance, response category function and targeting of persons and items. Reliability was measured with the person separation index (PSI). Items originally excluded by the developers of the scale because of skewness were re-introduced in a second analysis.

Results

Our analysis suggested that both subscales required removal of several psychometrically redundant and misfitting items to satisfy the requirements of the Rasch measurement model. Most biopsychosocial items needed revision of their scoring structure. Furthermore, we identified two items originally excluded because of skewness that improved the reliability of the subscales after reintroduction. The ultimate result was two strictly unidimensional subscales, each consisting of seven items, with invariant item ordering and free from any form of misfit. The unidimensionality implies that summation of items to valid total scores is justified. Transformation tables are provided to convert raw ordinal scores to unbiased interval-level scores. Both subscales were adequately targeted at the ability level of our physiotherapist population. Reliability of the biomedical subscale as measured with the PSI was 0.69. A low PSI of 0.64 for the biopsychosocial subscale indicated limitations with regard to its discriminative ability.

Conclusions

Rasch analysis produced an improved Norwegian version of the PABS which represents true (fundamental) measurement of clinicians’ biomedical and biopsychosocial treatment orientation. However, researchers should be aware of the low discriminative ability of the biopsychosocial subscale when analyzing differences and effect changes.

Implications

The study presents a revised PABS that provides interval-level measurement of clinicians’ pain beliefs. The revision allows for confident use of parametric statistical analysis. Further examination of discriminative validity is required.

1 Introduction

Low back pain (LBP) is a frequent reason to seek help from healthcare professionals, including physiotherapists [1]. Although clinical practice guidelines for LBP recommend a biopychosocial approach to care [2], a dominant biomedical approach has been shown to persist among physiotherapists [3, 4, 5, 6, 7]. Physiotherapists’ beliefs and attitudes have been found to correlate with the advice and treatment provided to patients [8, 9, 10, 11, 12, 13, 14] and appear to be associated with patients’ outcome [15]. To evaluate attitudes and beliefs among physiotherapists and to measure the effect of interventions aiming to change these attitudes, several questionnaires have been developed [8, 9, 16, 17, 18]. The Pain Attitudes and Beliefs Scale (PABS) [17] is one of the most widely used and thoroughly tested instruments. An amended 19-items version of the scale was developed from 36 items [19] and has been used in a number of cross-sectional and interventional studies [13, 20, 21, 22, 23, 24]. Although originally developed for physiotherapists, the instrument has also been used to assess medical doctors’ conceptions of LBP [25, 26, 27, 28]. The scale has further been adapted for beliefs regarding neck pain [29, 30].

A recent review concluded that the measurement properties of the PABS, although promising, could still be improved [31]. Previous studies consistently found a two-factor solution with Cronbach’s alpha values ranging from 0.72 to 0.84 for the biomedical factor and from 0.54 to 0.68 for the biopsychosocial factor [17, 19, 32, 33]. However, based on low percentages of explained variance of the two factors and the low internal consistency of the biopsychosocial subscale, internal construct validity of the PABS seems problematic. Furthermore, all studies that examined the factor structure of the PABS excluded eight to nineteen items from the original 36-item pool prior to factor analysis because of skewness or because the vast majority of therapists (>70%) showed extreme scores [19, 32, 33, 34]. Although consistently skewed items are undesirable and may bias the results of factor analysis, these items may capture important construct-relevant information [35]. In fact, such items may actually be the most important in a scale, providing valid scores at the extremes and thus extending the range of coverage on the construct [36].

Modern test theory with Rasch analysis is a sophisticated method for assessing whether a scale is unidimensional (which is considered an essential quality when summing individual items to obtain a valid total score) and whether items exhibit a consistent and invariant hierarchy of difficulty [37, 38, 39]. The Rasch model enables analysis of targeting of the items’ difficulty to the persons’ abilities by calibrating items and persons on a common scale with interval-level units (logits). This transformation from ordinal raw scores into interval-level measures provides greater accuracy when comparing scores between groups of persons and justifies the use of parametric statistics. Alternatively, Rasch analysis may assess whether items function the same way for different groups of persons (e.g. males and females) or not (invariance as determined by Differential Item Functioning), and whether the items’ response categories represent the intended logical, increasing level of the underlying trait [40, 41, 42, 43].

The aim of this study was to examine the scale- and item performance of the Norwegian version of the PABS and improve its psychometric properties, using Rasch analysis. To test whether reliability could be improved, we re-introduced items which had initially been discarded by the developers of the scale from the original 36-items pool. Furthermore, we wanted to test whether the PABS items were appropriately targeted for a physiotherapist population.

2 Methods

2.1 Design and participants

Data for our analysis of the PABS were collected from Norwegian physiotherapists responding to a cross-sectional web-based survey, as described in detail elsewhere [34]. Briefly, 3849 Norwegian physiotherapists were invited by e-mail to fill out the 36 items of the PABS, together with demographic and professional data. Written information was provided regarding the purpose of our study. Consent of responders was assumed if they completed the questionnaire. The study was accepted by the Norwegian Centre for Research Data (project nr. 28806). Responses from 921 therapists were obtained (response rate 24.8%). Therapists who had not been involved in back pain management for the last 6 months were excluded (n = 147). The remaining 774 participants filled out the PABS questionnaire, whereof 679 provided valid PABS scores.

The PABS was initially developed by adapting items from four questionnaires, as well as items developed by the researchers. The instrument aims to discriminate between biomedical and biopsychosocial treatment orientations of physiotherapists in LBP management [17]. Responders indicate on a six-point Likert scale (1 = totally disagree, 6 = totally agree) their endorsement on each statement. Treatment orientation is measured on two subscales, one labelled “biomedical” (10 items), and the other “biopsychosocial” (9 items). Subscale scores are calculated by a simple summation of the responses to the subscale items. Higher scores on a subscale indicate a stronger treatment orientation.

2.2 Data management and analysis

In order to assess whether discarded items from the original 36-item pool may contribute to measurement when added to the original subscales, all 36 PABS items were subjected to exploratory factor analysis (EFA) for assignment to either a biomedical or a biopsychosocial item set. The purpose of this EFA was not a validation of dimensionality, but rather to get an overview of item clustering and to identify a candidate set of items for Rasch analysis. Following principal component analysis with oblimin rotation of the total sample (n = 667), 20 items loaded onto the first component (ten items in addition to the original ten biomedical items), whereas 16 items loaded on the second component (seven items in addition to the original nine biopsychosocial items, see the supplementary material of this paper). Subsequently, four item sets were extracted corresponding to the original 10-items biomedical subscale (BMS-10), the original 9-items biopsychosocial subscale (BPSS-9), an extended 20-items biomedical item set (BMIS-20) and an extended 16-items biopsychosocial item set (BPSIS-16). The four item sets were separately entered into the RUMM2030 software package [44] and examined for item/scale performance, person measurement and scale-to-sample targeting.

The frequency of responses, including missing data, for each item was assessed. Gender and age were entered as person factors. Participants were categorized in five age groups: 20–30 years, 31–40 years, 41–50 years, 51–60 years and over 60 years of age. Significant likelihood ratio tests (p < 0.0001) suggested that the partial credit Rasch model was the most appropriate to use for analysis.

2.3 Rasch analysis

The Rasch measurement model presumes that any person should always have a greater probability of receiving a higher score on an “easier” item than on a more “difficult” item and that persons with a high level of the trait should have a higher probability of receiving a higher score on any items compared to persons with lower levels [45]. Essentially, Rasch analysis involves a series of tests to see if the data meet the presumptions of the Rasch model, which besides the above mentioned probabilistic relationship between items (stochastic ordering), include local independency and unidimensionality [37]. When Rasch requirements are fulfilled, the scale’s ordinal, raw scores may be transformed into linear, interval level measures and parametric calculations such as means and differences can be applied [36, 38, 41, 43, 46].

Overall fit of data to the model was checked with item-trait interaction statistics. A non-significant chi squared probability value (Bonferroni adjusted) indicates that the hierarchical ordering of the items is consistent across all levels of the trait and does not compromise the required property of invariance. The fit of individual items and persons was assessed by inspecting their fit residuals. This examines the difference between their observed and expected logit values. Potential misfit was considered if fit residuals were beyond standardized +±2.5 (99% CI), or if an item showed a significant chi squared probability value. High negative fit residuals are normally interpreted to indicate redundancy of an item, whereas high positive fit residuals indicate misfit. The mean of all fit residuals across all items of the scale and across all person estimates was examined and should be close to 0 with a standard deviation (SD) close to 1, preferably <1.40.

The assumption of local response independency is violated when responses of persons to an item not just depend on their trait level, but on their responses to other test items [47]. Local dependency artificially inflates reliability and results in spurious multidimensionality. Local dependent items were identified through a residual correlation matrix between all items. Two items were considered to be dependent if the residual correlation between them was more than 0.20 above the average residual correlation of all items [48].

Scores on individual items should only be summed if the scale measures one single latent construct. Violation of this assumption of unidimensionality is a potential source of misfit. To test the assumption of unidimensionality, independent t-tests were carried out on an individual basis. A t-test was done for each person, comparing the ability estimates derived from two subsets of the most diverting items [49].The number of significant t-tests in the sample determined the degree of unidimensionality of the scale. Significant multidimensionality was noted to be present if more than 5% of these t-tests were significant or if the lower confidence interval of the observed proportion fell below the 0.05 value in a binominal test of averages [50].

Differential Item Functioning (DIF) by gender and age was tested. DIF is identified when subgroups of respondents (e.g. males and females) with the same level of the trait, respond to items differently, thus violating the requirement of invariance.

The Person Separation Index (PSI) provides an indication of how reliably the (sub)scale is able to discriminate between person locations and is equivalent to Cronbach’s alpha for internal consistency. A PSI >0.70 was considered desirable for group-level comparisons and taken as evidence of sufficient discriminative ability [51].

Targeting of persons and items. Scale-to-sample targeting was made by comparing person locations from the sample with item locations of the scale. Their means should be close to 0 logits with a SD close to 1. A person-item threshold distribution histogram informs about the suitability of the sample for evaluating the scale and the suitability of the scale for measuring the sample.

Response Thresholds Ordering. An increase in response option in items, represented by their transition point between categories (thresholds), should reflect a logical progression of the underlying trait. If this does not occur, response thresholds are disordered. This may be the case when responders cannot reliably distinguish between the presented categories. Ordered response thresholds are a prerequisite to obtain reliable parameter estimates and necessary rescoring by collapsing categories was done before any further scale improvements were attempted [41, 52].

For scale refinement purposes, the Rasch analysis was progressed in two alternative ways [53]. In the original two subscales, resolution A attempted to account for the misfit that had been highlighted. Resolution A sought to maintain as many original scale items as possible by making the appropriate amendments to account for response dependency and DIF. Where amendments could not be made to account for the source of misfit, individual items were removed from the item set. For local dependency, the dependent items were grouped into “testlets”, meaning that the total raw score derived from the items did not change, but the dependent relationship between the items had been eliminated [54, 55].

In resolution B, misfitting items were removed iteratively to obtain a pure set of items which satisfied all fit parameters. Then, the removed items were individually reintroduced back to see whether or not the original source of misfit was still apparent. If the source of misfit was still present, then the item would again be removed. Resolution B sought to find a set of items, free from any form of significant individual or collective misfit, which act together to form a unidimensional scale [56].

As chi-squared statistics for almost all tests of model fit tend to become significant and will indicate misfit when sample size increases [57], we validated our results by creating two randomized split half samples and then repeat our analyses.

3 Results

The survey collected responses on PABS questionnaires from 679 physiotherapists. Of these, 12 were excluded because responders had not reported gender and/or age. The remaining 667 responders were included in analysis. Distributions of gender-, age- and professional background are shown in Table 1. One hundred and fifty-two responders (22.6%) had one or more missing items. The mean missing data on the PABS statements was 2.9% (range 0.4–5.5%). Missing data were taken into account by RUMM2030 and handled routinely.

Table 1

Characteristics of participants.

Total sample (n = 667) n (%)
Gender
 Male 251(37.6)
 Female 416(62.4)
Professional background
 Physiotherapists 216(32.4)
 Physiotherapy Specialists 51 (7.6)
 Manual Therapists 183 (27.4)
 Osteopaths 44(6.6)
 Psychomotor Physiotherapists 173(25.9)
Age
 20–30 years 73(10.9)
 31–40 years 188(28.2)
 41–50 years 169(25.3)
 51–60 years 168(25.2)
 >60 years 69(10.3)

3.1 Original biomedical subscale (BMS-10)

Initial analysis of the BMS-10 showed misfit to the Rasch model and slight response threshold disordering of item 24. Individual item fit revealed evidence of four problematic items displaying fit parameters outside the normally expected and accepted range.

Misfitting items were typically located at the extremes of the continuum. Summery fit statistics are presented in Table 2. Following rescoring of item 24, all items displayed ordered categories. At this stage, response dependency was apparent between item 10, 25, 20 and 22. There was no evidence of DIF by age or gender. Individual item locations on the logit scale, sources of item misfit at this stage and the rescored response code are summarized in Table 3. A hierarchy of item endorsement (ranging from most to least endorsed) across the sample was apparent. Items concerning the belief that tissue damage/structural deficits are important issues in LBP were located at the more “difficult” side of the item hierarchy, characterizing high levels of biomedical attitude when endorsed. Items concerning spinal vigilance and recommendations to adapt activity to pain seemed to cluster at the “easier” side of the hierarchy, characterizing lower levels of the trait (Table 3). The scale-to-sample targeting seemed adequate (Fig. 1). The mean person location (–0.66 logits, SD 0.71, range –3.15 to 1.80) indicated that the subscale was targeted at somewhat higher levels of biomedical treatment orientation possessed by responders in this sample. The item thresholds (range –2.63 to 3.69) spread over a broad range of the construct and exceeded the spread of person measures. A limited spread of the trait in the population could be read from the ten class interval locations ranging from –1.93 to +0.60 logits, thus covering less than three logits.

Table 2

Analysis and modification of the original biomedical subscale (BMS-10) and the 20-items extended biomedical item set (BMIS-20).

Items Reliability – PSI (alpha) Item-trait interaction χ2 probability Item fit residual Mean (SD) Person fit residual Mean (SD) Unidimensionality % significant t-tests (lower CI) Mean person location (SD)
Initial BMS-10 10 0.74 (0.76) 194.99 (80) p <0.00001 0.54 (2.23) −0.389 (1.40) 58 (622) 8.76% (0.07) −0.65 (0.67)
Rescore BMS-10 10 0.74 246.87 (90) p <0.00001 0.60 (2.38) −0.36 (1.35) 63 (662) 9.52% (0.08) −0.66 (0.71)
Resolution A BMS-10 6 0.67 133.47 (54) p <0.0001 0.87 (1.40) −0.32 (1.10) 40 (645) 6.20% (0.05) −0.61 (0.65)
Bi-factor solution 2 0.60 14.78 (18) p = 0.68[*] 0.15 (1.70) −0.60 (0.96) 16(606) 2.64% (0.009) −0.23 (0.45)
Resolution B BMS-10 6 0.66 76.17 (54) p = 0.03[**] 1.05 (1.35) −0.36 (1.19) 28 (662) 4.23% (0.03) −0.77 (0.81)
Initial BMS-20 20 0.81 (0.82) 525.16 (180) p <0.0001 0.87 (2.22) −0.22 (1.31) 96 (663)14.48% (0.13) −0.79 (0.59)
Rescore BMIS-20 20 0.80 5.43.13 (180) p <0.0001 0.61 (2.44) −0.21 (1.39) 107 (667) 16.04% (0.14) −0.88 (0.61)
Resolution B BMIS-20 7 0.69 (0.67) 82.85 (63) p = 0.05[***] 0.78 (1.27) −0.37 (1.23) 34 (663) 5.13% (0.03) −0.68 (0.76)
  1. A non-significant chi squared probability (larger than Bonferroni adjusted p = 0.05):

Fig. 1 
							Upper graph: Person-item threshold distribution of the 10-items biomedical subscale (BMS-10). The distribution of persons (upper plot) and items (lower plot) are compared on the same logit scale. Lower graph: Person-item threshold distribution of the refined, unidimensional 7-items biomedical solution. Item thresholds on the negative end of the continuum are more likely to be endorsed (easy items), whereas items at the positive end are less likely to be endorsed (difficult items).
Fig. 1

Upper graph: Person-item threshold distribution of the 10-items biomedical subscale (BMS-10). The distribution of persons (upper plot) and items (lower plot) are compared on the same logit scale. Lower graph: Person-item threshold distribution of the refined, unidimensional 7-items biomedical solution. Item thresholds on the negative end of the continuum are more likely to be endorsed (easy items), whereas items at the positive end are less likely to be endorsed (difficult items).

3.2 Scale refinement of the BMS-10

Despite accounting for response dependency by combining dependent items into testlets and removal of misfitting items, the item set continued to display a high degree of misfitting parameters and resolution A was not reached. Next, a bi-factor solution was sought [58, 46], since the BMS-10 was found to split into a negatively and a positively loaded subdomain in the residual correlation matrix; one subdomain referring to a belief that emphasizes tissue damage/structural deficits as an underlying cause of back pain, the other to a belief system that promotes spinal vigilance and restricting activity. This process rendered the subscale unidimensional with satisfactory fit to the model, but with a considerable drop in reliability (PSI = 0.60, Table 2) which made further analysis redundant. Resolution B was reached following the sequential removal of four items (items 20, 22, 25 and 35). The remaining set of six items was strictly unidimensional, invariant and free from any form of misfit, but with an insufficient reliability index (PSI = 0.66, Table 2). The reasons for removing items are listed in Table 4.

Table 3

Original biomedical subscale (BMS-10). Logit measures (locations) and fit statistics of individual items. Summary of individual sources of misfit following rescoring. Item order and mean location (SE) listed from “easy” (likely to be endorsed) to more “difficult” (less likely to be endorsed).

Original item number Biomedical subscale Logit measure Mean (SE) Fit residual <−2.5 or >+2.5 Chi-square probability Response threshold disordering/rescore coding Local response dependency Residual correlation (r)>0.20
22 If back pain increases in severity, I immediately adjust the intensity of my treatment accordingly −1.62(0.04) 2.91 (c) <0.0001[*] Item 14 (r =0.24)
35 In the long run, patients with back pain have a higher risk ofdeveloping spinal impairments −0.56(0.04) 4.57 (c) <0.0001[*]
24 Pain reduction is a precondition for the restoration of normal functioning −0.47 (0.07) −0.06 0.0002[*] 0-1-1-2-2-3
14 Patients with back pain should preferably practice only pain free movements −0.22(0.04) 2.25 0.197 Item 22 (r =0.24)
23 If therapy does not result in a reduction in back pain, there is a high risk of severe restrictions in the long term 0.21 (0.04) 1.93 0.716
10 Pain is a nociceptive stimulus, indicating tissue damage 0.29 (0.04) 0.73 0.410 Item 20 (r =0.30)
31 The severity of tissue damage determines the level of pain 0.44 (0.04) −0.33 0.039
30 If patients complain of pain during exercise, I worry that damage is being caused 0.47 (0.05) −0.10 0.02
25 Increased pain indicates new tissue damage or the spread of existing damage 0.54 (0.04) −3.69 (c) <0.0001[*] Item 20 (r =0.24)
20 Back pain indicates the presence of organic injury 0.90 (0.05) −1.24 0.006 Items 10 and 25 (r = 0.30 and 0.24)

Table 4

Resolution B of the four item sets. Items removed from the subscales and reasons for removal.

Misfit parameter Items removed

BMS-10 BPSS-9 BMIS-20 BPSIS-16
Underdiscrimination (fit residuals > +2.5) 22, 35 7, 27 2, 5, 28, 35 3, 7, 8, 16, 36
Overdiscrimination (fit residuals<-2.5) 25 25
Response dependency (residual correlation > 0.20) 20, 22, 25 6, 7, 27 1, 9, 13, 15, 20, 21, 22, 25 6, 16, 27, 32
DIF 2, 5, 9, 26 36
Significant chi square probability 22, 25, 35 7 2, 13, 15, 21, 22, 25, 28, 35 8, 16, 18, 36
  1. BMS-10: original 10 items biomedical subscaie. BPSS-9: original 9 items biopsychosocial subscaie. BM1S-20: extended biomedical 20-items set. BP1S-16: extended biopsychosocial 16-items set.

3.3 Inclusion of initially discarded items into the biomedical subscale

The original biomedical subscale (BMS-10) was supplemented with 10 items distracted from the original 36-items pool. The resulting item set (BMIS-20) failed to meet Rasch model expectations (Table 2) and eight items displayed disordered thresholds. Following rescoring all items displayed ordered thresholds. At this stage, extensive response dependency was apparent between ten items, whereas four items displayed DIF by gender. Sources of individual item misfit are summarized in the supplementary material of this paper. Scale-to-person targeting was comparable to the original BMS-10, as indicated by its mean person location (–0.88 logits, SD 0.61).

Misfitting and biased items were removed iteratively for scale refinement. The result was a resolution B corresponding to the biomedical 6-items core item set (items 10,14, 23, 24, 30, 31) supplemented with re-introduced item 4 (Reduction of daily physical exertion is a significant factor in treating back pain). This solution was strictly unidimensional and free from any form of misfit, whereas the PSI increased close to recommended values (PSI = 0.69). Reasons for removal of items are listed in Table 4.

3.4 Original biopsychosocial subscale (BPSS-9)

Initial analysis of the BPSS-9 revealed misfit between the data and the model (Table 5). Three items showed fit parameters outside the normally expected and accepted range. Seven of nine items had disordered thresholds, meaning that their response categories were not functioning as intended. Ten out of the 54 response categories (19%) had no or very few (>10) observations. These null response categories were located in the lowest categories of six items. Category threshold curves for items 6 and 30 are illustrated in Fig. 2.

Table 5

Analysis and modification of the original 9-items biopsychosocial subscaie (BPSS-9) and the 16-items extended biopsychosocial item set (BPS1S-16).

Items Reliability – PSI (alpha) Item-trait interaction χ2 probability Item fit residual Mean (SD) Person fit residual Mean (SD) Unidimensionality % significant t-tests (lowerCI) Mean person location (SD)
Initial BPSS-9 9 0.61 (0.59) 135.91 (81) p <0.0001 0.56(1.15) −0.32(1.10) 51 (660) 7.74% (0.06) 0.28 (0.59)
Rescore BPSS-9 9 0.60 116.68(72) p = 0.0006 0.24(1.25) −0.37 (1.29) 52 (660) 7.88% (0.06) 0.36 (0.73)
Resolution A BPSS-9 9 0.53 74.72(48) p = 0.008 0.30(1.45) −0.40(1.167) 44 (656) 6.71% (0.05) 0.38 (0.64)
Resolution B BPSS-9 6 0.61 (0.57) 63.78(48) p = 0.063[*] 0.22(1.00) −0.40(1.16) 43 (661) 6.51% (0.05) 0.69 (0.92)
Initial BPSIS-16 16 0.66 285.3(128) p <0.0001 0.71 (1.20) −0.29(1.26) 104 (663) 15.7% (0.14) 0.28 (0.46)
Rescore BPSIS-16 16 0.63 315.38(144) p <0.0001 0.35 (1.40) −0.32(1.45) 91 (663) 13.73% (0.12) 0.46 (0.57)
Resolution B BPSIS-16 7 0.64 (0.60) 88.52(63) p = 0.018[**] 0.23 (1.20) −0.40(1.21) 41 (661)6.20% (0.04) 0.57 (0.90)
  1. A non-significant chi squared probability (larger than Bonferroni adjusted p = 0.05).

Fig. 2 
							 Category probability curves. In item 30 (uppergraph), all response options are ordered. In item 6 (bottom graph), response options 1, 2 and 3 are disordered.
Fig. 2

Category probability curves. In item 30 (uppergraph), all response options are ordered. In item 6 (bottom graph), response options 1, 2 and 3 are disordered.

Following rescoring, overall fit slightly improved and all items displayed ordered categories. At this stage, response dependency was apparent between items 6, 7, 27 and 33. No evidence of any form for DIF was found. The summary fit statistics at this stage are presented in Table 5. Individual item locations on the logit scale, sources of item misfit and rescore codes are summarized in Table 6.

Table 6

Original biopsychosocial subscale (BPSS-9). Logit measures (locations) and fit statistics of individual items after rescoring. Summary of sources of misfit and category disordering with rescore coding. Item order and mean location (SE) from more to less likely to be endorsed.

Original item number Logit measure Mean (SE) Fit residual <-2.5 or <+2.5 Chi-squared probability[*] Response threshold disordering/rescore coding Local response dependency Residual correlation >0.20
33 Learning to cope with stress promotes recovery from back pain −0.93 (0.06) −0.54 0.019 0-0-0-1-2-3 With item 6 (r = 0.29)
6 Mental stress can cause back pain even in the absence of tissue damage −0.67 (0.07) 0.22 0.674 0-0-0-1-1-2 With item 33 (r =0.29)
11 A patient suffering from severe back pain will benefit from physical exercise −0.65 (0.05) 0.16 0.395 0-1-1-2-3-4
17 Therapy may have been successful even if pain remains −0.36 (0.04) −0.79 0.409
29 Even if the pain has worsened, the intensity of the next treatment can be increased −0.31 (0.05) −1.06 0.399
34 Exercises that may be back straining should not be avoided during the treatment −0.12 (0.07) −0.58 0.02 0-0-0-1-1-2
12 Functional limitations associated with back pain are the result of psychosocial factors 0.66 (0.06) 0.18 0.70 0-1-1-2-3-4
7 The cause of back pain is unknown 0.96 (0.05) 2.38 0.0005[*] 0-1-1-2-3-4 With item 27 (r = 0.29)
27 There is no effective treatment to eliminate back pain 1.42 (0.06) 2.24 0.035 0-1-1-2-2-3 With item 7 (r = 0.29)

The scale-to-sample targeting was adequate (Fig. 2). The mean person location (0.36 logits, SD 0.73, range –1.93 to 2.36) indicated that the subscale was targeted at slightly lower levels of biopsychosocial treatment orientation possessed by responders in this sample. The item thresholds covered a wide range of the underlying construct (range –2.84 to 3.43), exceeding the person measures. However, there were gaps between items near the mean person location, indicating a deficiency in measurement capacity. A limited spread of the trait in the population could be read from the nine class intervals which ranged from –0.75 to +2.01, covering less than 3 logits.

3.5 Scale refinement of the BPSS-9

A satisfactory resolution A could not be reached. Despite attempts to account for local dependency, the items set continued to display misfit parameters and weak fit to the model. Resolution B was reached following removal of three items (items 6, 7 and 27). The remaining set of six items was strictly unidimensional, invariant and free from any form of misfit. Reliability remained insufficient (PSI = 0.61). Summary fit statistics are presented in Table 5 and reasons for item removal in Table 4.

3.6 Inclusion of initially discarded items into the biopsychosocial subscale

The biopsychosocial subscale (BPSS-9) was supplemented with seven items distracted from the original 36-items pool. The resulting item set (BPSIS-16) failed to meet Rasch model expectations (Table 5). Twelve out of sixteen items displayed disordered thresholds. Following rescoring, all items displayed ordered thresholds. At this stage, sizeable response dependency was apparent between five items, whereas item 36 displayed DIF by gender. Five items showed fit parameters outside the accepted range. Details on sources of individual item misfit and rescore codes are summarized in the supplementary material of this paper.

Scale improvement resulted in a resolution B that was reached after successive removal of nine misfitting or biased items (Table 5). The remaining seven items (core item set 11, 12, 17, 29, 33, 34 in addition to re-introduced item 19 If ADL activities cause more back pain, this is not dangerous) were strictly unidimensional, conforming to the Rasch model and free from any form of misfit and with a PSI of 0.64.

All analyses were repeated using two randomized split half samples (n = 336 and n = 331) for validation reasons. Comparable fit statistics and resolutions were also found in these two smaller samples, suggesting that sample size did not influence fit to the model in this population.

3.7 Transformation of ordinal raw scores to interval scaling

After fit to the Rasch model was achieved for modifications of the two subscales, we were able to produce transformation tables (see Appendix) that can be used to convert raw ordinal-level scores to interval-level scores. These transformation tables can be used in parametric data analyses when there are no missing data and distributions are appropriate.

4 Discussion

We used Rasch analysis to evaluate the measurement properties of the Norwegian version of the PABS, a questionnaire which is still in a developmental stage. Our analysis suggested that the biomedical and the biopsychosocial subscales require removal of several psychometrically redundant items to satisfy the requirements of the Rasch measurement model. In addition, most biopsychosocial items needed revision of their scoring structure. Furthermore, we identified two candidate items from the original 36-items pool that improved reliability of the subscales when re-introduced. The ultimate result was two strictly unidimensional, invariant subscales, each consisting of seven items and free from any form of misfit. The unidimensionality implies that summation of items to valid total scores is justified. Transformation tables were provided to convert biased ordinal scores to unbiased interval-level scores, which is important when parametric statistical analysis is desired. Both item sets were adequately targeted at the ability level of our physiotherapist population. However, a low PSI (>0.70) indicated problems with reliability and discriminative ability of the biopsychosocial subscale.

4.1 Biomedical subscale

A set of seven items appeared to be the optimal solution for a unidimensional biomedical subscale of the Norwegian PABS version. Item 4 of the original 36-items version qualified for reintroduction. This item has previously been included in another study [34]. Multidimensionality of the original subscale was found to be related to the effects of local response dependency of individual misfitting items. Apparently, responses to these items were dependent on the responses to other items and not just on the persons’ trait level. This dependency may be explained by several items concerning tissue damage being too similar to each other. Local response dependency is known to be a factor which spuriously inflates reliability when not addressed [48] and this may explain why the internal consistency of our unidimensional 7-items solution (Cronbach’s alpha = 0.67) is lower than values reported in other studies, ranging from alpha = 0.72 to 0.84 [17, 19, 26, 32, 33, 59]. Reliability values for a bi-factor solution which retained all items, were even lower (PSI = 0.60).

4.2 Biopsychosocial subscale

Our analysis indicated that a set of seven items was the optimal solution for a unidimensional version of the Norwegian PABS. Item 19 of the original 36-items version was re-introduced. This item has previously been included in another study [32]. Removal of misfitting items 6, 7 and 27 was found to improve reliability and rendered a unidimensional biopsychosocial subscale. Apparently, the responses to these items were unrelated to the responses to the remaining items and the underlying trait. Items 7 and 27 seemed to represent an attitude challenging the traditional biomedical perspective and appeared hard to endorse, whereas the other items rather seem to validate a biopsychosocial approach, addressing the beneficial influence of activity despite pain, exercises and positive coping.

Most items of the biopsychosocial subscale were found to have incorrectly ordered response categories that needed to be modified [52]. Rescoring of items was necessary to obtain reliable parameter estimates [52], but had the follow-on effect of reducing the total scale score. With all response options in place, the biopsychosocial solution would be scored 7 to 42, whereas after rescoring the total scale score should be contracted to 7–32. Consequently, information values of any observation will be reduced and the precision of measurement decreased [45, 51]. We also found that response categories, mainly in the lower response options of the items, were not fully utilized by the responders. This was evident in our final 7-items solution, where only 15.6% of item responses (range 0.9% to 28.3%) were found in the lower three options of all seven items, indicating large agreement levels on biopsychosocial issues among physiotherapists.

Although our analysis improved reliability of the original 9-items biopsychosocial subscale, separation indexes were still below recommended values for two-group comparisons. The low internal consistency of the biopsychosocial subscale found in other studies has been related to the low number of items [26, 32]. However, in this study, the low PSI seems to be a function of the homogeneous group of physiotherapists to which the scale was administered, as shown by the limited distribution range of person locations along the scale (Fig. 2) and the limited variation on the levels of treatment orientation. The PSI depends on how well-targeted the scale is, but moreover on the “ability” distribution of the respondents, as it is harder to separate persons when they are close in ability [57]. Whether the high agreement levels among Norwegian physiotherapists on biopsychosocial issues also account to other countries or other health care professions is unknown and would be a subject for further research.

4.3 Distribution of scores

Although the person item threshold distributions (Figs. 1 and 3) suggested that both subscales were well-targeted, some item thresholds on the end range of the continuum were found to have no discriminative function, as there were no persons at that item locations [60]. Gaps between biopsychosocial items where the largest part of the sample was found indicate that persons cannot be sufficiently discriminated from each other. Apparently, items are needed at the middle/higher range of biopsychosocial orientation, as this is the range where researchers would be more apt to evaluate change over time.

Fig. 3 
							 Upper graph: Person-item threshold distribution graph of the rescored original 9-items biopsychosocial subscale (BPSS-9). Lower graph: Person-item threshold distribution graph of the refined, unidimensional 7-items biopsychosocial solution.
Fig. 3

Upper graph: Person-item threshold distribution graph of the rescored original 9-items biopsychosocial subscale (BPSS-9). Lower graph: Person-item threshold distribution graph of the refined, unidimensional 7-items biopsychosocial solution.

The transformation tables (see Appendix) allow for simple conversion to interval level scores. The use of PABS interval subscales holds important consequences for responsiveness and calculations of aspects such as minimal important change (MIC), since change here is linear throughout the range of the scale [43]. Conversely, small changes in the ordinal score will be more relevant at the margins than in the middle of the scale. As can be seen from the biopsychosocial transformation table, a raw score change of 3 points represents an interval-level change of at least 3.2 points at the upper or lower margins of the scale, but only 1.4 points in the middle range of the scale. Thus, the interval subscales appear more stable across the construct.

4.4 Strengths and limitations

Previous studies employed factor analysis to reduce the number of items and to determine the dimensionality of the PABS [17, 19, 21, 32, 33, 34]. The inconsistent number of items included by these studies might be due to limitations associated with using ordinal data in factor analysis [61, 62]. As Rasch analysis constructs linearity out of ordinality and provides item and person location on the variable, it seemed necessary to perform Rasch analysis of a large sample.

One other important reason for using Rasch modelling to evaluate item performance is the ability to examine whether response categories are correctly ordered, i.e. to evaluate whether the responders utilized the response categories as they were intended to (as logical increasing levels of treatment orientation) and that all response categories are utilized [57]. Conversely, classical test theory a priori assumes that response thresholds are ordered. Using a scale with disordered thresholds to detect the effect of interventions is problematic, since it will be difficult or even impossible to evaluate a change in categories [46]. Disordered response thresholds appeared to be a major problem in the biopsychosocial subscale and may have contributed to the limited distribution range of our population and the low reliability. Although all item disordering was resolved by collapsing the disordered thresholds, no substantial improvement in the fit to the model was seen. Hence, disordering could perhaps be explained by the presence of null response categories and not by the responders having problems discriminating between the categories [57].

The large sample size (n = 667) provided very robust estimation of the threshold parameters and consequently the response category disordering. However, our convenience sample was not representative for the source population of Norwegian physiotherapists.

An obvious selection bias was present with an overrepresentation of specialized physiotherapists [34]. Conversely, a wide variation in responses from a diversity of clinicians was required in order to avoid clustering of scores [60]. Our sample with different specialties seemed to provide this variability sufficiently.

5 Conclusion

In conclusion, our analysis offers new insights into the internal construct validity of the PABS, including response category functioning. We offer a refined Norwegian version that represents true (fundamental) measurement of biomedical and biopsychosocial treatment orientation. We have provided a transformation table to convert ordinal PABS scores into unbiased interval PABS scores. However, the scale has limitations: The separation index for the biopsychosocial subscale continued to be below recommended values for discriminating between two distinct groups of persons with different levels of the trait.

6 Implications

The revised PABS provides interval-level measurement and allows for confident use of parametric statistical analysis. However, researchers should be aware of the low discriminative ability of the biopsychosocial subscale when used to analyze differences and changes in treatment orientation. Our findings indicate a need for review of the number of response categories in the biopsychosocial subscale to accommodate them to the underlying latent construct. Further research on the scale’s discriminative validity is required.

Highlights

  • Clinicians’ pain beliefs are associated with their patients’ pain beliefs.

  • The PABS measures clinicians’ (mal)adaptive pain beliefs regarding LBP.

  • Rasch analysis improved the PABS’ psychometric properties.

  • The results enable confident use of parametric statistical analysis.

  • Discriminative ability of the biopsychosocial subscale is limited.


DOI of refers to article: http://dx.doi.org/10.1016/j.sjpain.2016.09.002.



Tel.: +47 909 83 795.

  1. Ethical issues: The study was accepted by the Norwegian Centre for Research Data (project nr. 28806). Consent of responders was assumed if they completed the questionnaire. Written information was provided to responders regarding the purpose of our study.

  2. Conflict of interest: Nicolaas Eland was supported by the Norwegian Fund for Post Graduate Training in Physiotherapy in writing the manuscript. The authors report no conflict of interests in relation to this paper.

References

[1] Werner EL, Ihlebaek C, Skouen JS, Laerum E. Beliefs about low back pain in the Norwegian general population: are they related to pain experiences and health professionals? Spine (Phila Pa 1976) 2005;30:1770–6.Search in Google Scholar

[2] Bekkering GE, Engers AJ, Wensing M, Hendriks HJ, van Tulder MW, Oostendorp RA, Bouter LM. Development of an implementation strategy for physiotherapy guidelines on low back pain. Aust J Physiother 2003;49:208–14.Search in Google Scholar

[3] Daykin AR, Richardson B. Physiotherapists’ pain beliefs and their influence on the management of patients with chronic low back pain. Spine (Phila Pa 1976) 2004;29:783–95.Search in Google Scholar

[4] Swinkels IC, van den Ende CH, van den Bosch W, Dekker J, Wimmers RH. Physiotherapy management of low back pain: does practice match the Dutch guidelines? Aust J Physiother 2005;51:35–41.Search in Google Scholar

[5] Pincus T, Vogel S, Breen A, Foster N, Underwood M. Persistent back pain—why do physical therapy clinicians continue treatment? A mixed methods study of chiropractors, osteopaths and physiotherapists. Eur J Pain 2006;10:67–76.Search in Google Scholar

[6] Pincus T, Foster NE, Vogel S, Santos R, Breen A, Underwood M. Attitudes to back pain amongst musculoskeletal practitioners: a comparison of professional groups and practice settings using the ABS-mp. Man Ther 2007;12:167–75.Search in Google Scholar

[7] Oostendorp RA, Elvers H, Mikolajewska E, Laekeman M, van TE, Samwel H, Duquet W. Manual physical therapists’ use of biopsychosocial history taking in the management of patients with back or neck pain in clinical practice. Scientific World Journal 2015;2015:170463.Search in Google Scholar

[8] Rainville J, Bagnall D, Phalen L. Health care providers’ attitudes and beliefs about functional impairments and chronic back pain. Clin J Pain 1995;11:287–95.Search in Google Scholar

[9] Linton SJ, Vlaeyen J, Ostelo R. The back pain beliefs of health care providers: are we fear-avoidant? J Occup Rehabil 2002;12:223–32.Search in Google Scholar

[10] Ostelo RW, Vlaeyen JW. Attitudes and beliefs of health care providers: extending the fear-avoidance model. Pain 2008;135:3–4.Search in Google Scholar

[11] Evans DW, Breen AC, Pincus T, Sim J, Underwood M, Vogel S, Foster NE. The effectiveness of a posted information package on the beliefs and behavior of musculoskeletal practitioners: the UK Chiropractors, Osteopaths, and Musculoskeletal Physiotherapists Low Back Pain ManagemENT (COMPLeMENT) randomized trial. Spine (Phila Pa 1976) 2010;35:858–66.Search in Google Scholar

[12] Domenech J, Sanchez-Zuriaga D, Segura-Orti E, Espejo-Tort B, Lison JF. Impact of biomedical and biopsychosocial training sessions on the attitudes, beliefs, and recommendations of health care providers about low back pain: a randomised clinical trial. Pain 2011;152:2557–63.Search in Google Scholar

[13] Simmonds MJ, Derghazarian T, Vlaeyen JW. Physiotherapists’ knowledge, attitudes, and intolerance of uncertainty influence decision making in low back pain. Clin J Pain 2012;28:467–74.Search in Google Scholar

[14] Darlow B, Fullen BM, Dean S, Hurley DA, Baxter GD, Dowell A. The association between health care professional attitudes and beliefs and the attitudes and beliefs, clinical management, and outcomes of patients with low back pain: a systematic review. Eur J Pain 2012;16:3–17.Search in Google Scholar

[15] Von Korff M, Barlow W, Cherkin D, Deyo RA. Effects of practice style in managing back pain. Ann Intern Med 1994;121:187–95.Search in Google Scholar

[16] Waddell G, Newton M, Henderson I, Somerville D, Main CJ. A Fear-Avoidance Beliefs Questionnaire (FABQ) and the role of fear-avoidance beliefs in chronic low back pain and disability. Pain 1993;52:157–68.Search in Google Scholar

[17] Ostelo RW, Stomp-van den Berg SG, Vlaeyen JW, Wolters PM, de Vet HC. Health care provider’s attitudes and beliefs towards chronic low back pain: the development of a questionnaire. Man Ther 2003;8:214–22.Search in Google Scholar

[18] Pincus T, Vogel S, Santos R, Breen A, Foster N, Underwood M. The attitudes to back pain scale in musculoskeletal practitioners (ABS-mp): the development and testing of a new questionnaire. Clin J Pain 2006;22:378–86.Search in Google Scholar

[19] Houben RM, Ostelo RW, Vlaeyen JW, Wolters PM, Peters M, Stomp-van den Berg SG. Health care providers’ orientations towards common low back pain predict perceived harmfulness of physical activities and recommendations regarding return to normal activity. Eur J Pain 2005;9:173–83.Search in Google Scholar

[20] Bishop A, Foster NE, Thomas E, Hay EM. How does the self-reported clinical management of patients with low back pain relate to the attitudes and beliefs of health care practitioners? A survey of UK general practitioners and physiotherapists. Pain 2008;135:187–95.Search in Google Scholar

[21] Bowey-Morris J, Purcell-Jones G, Watson PJ. Test–retest reliability of the pain attitudes and beliefs scale and sensitivity to change in a general practitioner population. Clin J Pain 2010;26:144–52.Search in Google Scholar

[22] Overmeer T, Boersma K, Denison E, Linton SJ. Does teaching physical therapists to deliver a biopsychosocial treatment program result in better patient outcomes? A randomized controlled trial. Phys Ther 2011;91:804–19.Search in Google Scholar

[23] Hendrick P, Mani R, Bishop A, Milosavljevic S, Schneiders AG. Therapist knowledge, adherence and use of low back pain guidelines to inform clinical decisions—a national survey of manipulative and sports physiotherapists in New Zealand. Man Ther 2013;18:136–42.Search in Google Scholar

[24] Beneciuk JM, George SZ. Pragmatic implementation of a stratified primary care model for low back pain management in outpatient physical therapy settings: two-phase, sequential pilot study. Phys Ther 2015.Search in Google Scholar

[25] Jellema P, van der Windt DA, van der Horst HE, Blankenstein AH, Bouter LM, Stalman WA. Why is a treatment aimed at psychosocial factors not effective in patients with (sub)acute low back pain? Pain 2005;118:350–9.Search in Google Scholar

[26] Watson PJ, Bowey J, Purcell-Jones G, Gales T. General practitioner sickness absence certification for low back pain is not directly associated with beliefs about back pain. Eur J Pain 2008;12:314–20.Search in Google Scholar

[27] Fullen BM, Baxter GD, Doody C, Daly LE, Hurley DA. General practitioners’ attitudes and beliefs regarding the management of chronic low back pain in Ireland: a cross-sectional national survey. Clin J Pain 2011;27:542–9.Search in Google Scholar

[28] Sit RW, Yip BH, Chan DC, Wong SY. Primary care physicians’ attitudes and beliefs towards chronic low back pain: an Asian study. PLoS One 2015;10:e0117521.Search in Google Scholar

[29] Vonk F, Pool JJ, Ostelo RW, Verhagen AP. Physiotherapists’ treatment approach towards neck pain and the influence of a behavioural graded activity training: an exploratory study. Man Ther 2009;14:131–7.Search in Google Scholar

[30] Mutsaers JH, Pool-Goudzwaard AL, Ostelo RW, Peters R, Koes BW, Verhagen AP. The psychometric properties of the PABS-PT in neck pain patients: a validation study. Man Ther 2014 Jan 18.Search in Google Scholar

[31] Mutsaers JH, Peters R, Pool-Goudzwaard AL, Koes BW, Verhagen AP. Psychometric properties of the Pain Attitudes and Beliefs Scale for Physiotherapists: a systematic review. Man Ther 2012.Search in Google Scholar

[32] Laekeman MA, Sitter H, Basler HD. The Pain Attitudes and Beliefs Scale for Physiotherapists: psychometric properties of the German version. Clin Rehabil 2008;22:564–75.Search in Google Scholar

[33] Dalkilinc M, Cirak Y, Yilmaz GD, Parlak DY. Validity and reliability of Turkish version of the Pain Attitudes and Beliefs Scale for Physiotherapists. Physiother Theory Pract 2014:1–8.Search in Google Scholar

[34] Eland ND, Kvale A, Ostelo RW, Strand LI. The Pain Attitudes and Beliefs Scale for Physiotherapists: dimensionality and internal consistency of the Norwegian version. Physiother Res Int 2016.Search in Google Scholar

[35] Clark LA, Watson D. Constructing validity: basic issues in objective scale development. Psychol Assess 1995;7:309–19.Search in Google Scholar

[36] Tennant A, McKenna SP, Hagell P. Application of Rasch analysis in the development and application of quality of life instruments. Value Health 2004;7(Suppl 1):S22–6.Search in Google Scholar

[37] Rasch G. Probabilistic models for some intelligence and attainment tests. Chicago: University of Chicago Press; 1980.Search in Google Scholar

[38] Bond TG, Fox CM. Applying the Rasch Model: fundamental measurement in the human sciences. 2nd ed. Routledge; 2007.Search in Google Scholar

[39] Christensen KB, Kreiner S, Mesbah M. Rasch models in health. Hoboken, NJ: ISTE Ltd/John Wiley and Sons Inc.; 2013.Search in Google Scholar

[40] Pesudovs K, Noble BA. Improving subjective scaling of pain using Rasch analysis. J Pain 2005;6:630–6.Search in Google Scholar

[41] Tennant A, Conaghan PG. The Rasch measurement model in rheumatology: what is it and why use it? When should it be applied, and what should one look for in a Rasch paper? Arthritis Rheum 2007;57:1358–62.Search in Google Scholar

[42] Belvedere SL, de Morton NA. Application of Rasch analysis in health care is increasing and is applied for variable reasons in mobility instruments. J Clin Epidemiol 2010;63:1287–97.Search in Google Scholar

[43] Grimby G, Tennant A, Tesio L. The use of raw scores from ordinal scales: time to end malpractice? J Rehabil Med 2012;44:97–8.Search in Google Scholar

[44] Andrich D, Sheridan B, Lou G. RUMM2030. Perth, Australia: RUMM Laboratory; 2009.Search in Google Scholar

[45] Kreiner S, Christensen KB. Person parameter estimation and measurement in Rasch models. In: Christensen KB, Kreiner S, Mesbah M, editors. Rasch models in health. Hoboken, N.J: ISTE Ltd/John Wiley and Sons Inc; 2013. p. 63.Search in Google Scholar

[46] Pallant JF, Tennant A. An introduction to the Rasch measurement model: an example using the Hospital Anxiety and Depression Scale (HADS). Br J Clin Psychol 2007;46:1–18.Search in Google Scholar

[47] Marais I, Andrich D. Formalizing dimension and response violations of local independence in the unidimensional Rasch model. J Appl Meas 2008;9:200–15.Search in Google Scholar

[48] Marais I. Local independence. In: Christensen KB, Kreiner S, Mesbah M, editors. Rasch models in health. Hoboken, NJ: ISTE Ltd/John Wiley and Sons Inc; 2013. p. 111.Search in Google Scholar

[49] Smith J. Understanding Rasch measurement: detecting and evaluating the impact of multidimenstionality using item fit statistics and principal component analysis of residuals. J Appl Meas 2002;3:205–31.Search in Google Scholar

[50] Tennant A, Pallant JF. Unidimensionality Matters! (A Tale Of Two Smiths?). Rasch Meas 2006;20:1048–51.Search in Google Scholar

[51] Streiner DL, Norman GR. Health Measurement Scales a practical guide to their development and use. 4th ed. New York: Oxford University Press; 2008.Search in Google Scholar

[52] Linacre JM. Optimizing rating scale category effectiveness. J Appl Meas 2002;3:85–106.Search in Google Scholar

[53] Horton M, Wright N, Dyer W, Wright-Hughes A, Farrin A, Mohammed Z, Smith J, Heyes T, Gilbody S, Tennant A. Assessing the risk of self-harm in an adult offender population: an incidence cohort study. Health Technol Assess 2014;18, 1–viii.Search in Google Scholar

[54] Wainer H, Kiely G. Item clusters and computerizedadaptive testing: a case for testlets. J Educ Meas 1987;24:185–201.Search in Google Scholar

[55] Lundgren NA, Tennant A. Past and present issues in Rasch analysis: the functional independence measure (FIM) revisited. J Rehabil Med 2011;43:884–91.Search in Google Scholar

[56] Lange R, Thalbourne MA, Houran J, Storm L. The revised transliminality scale: reliability and validity data from a Rasch top-down purification procedure. Conscious Cogn 2000;9:591–617.Search in Google Scholar

[57] Chen WH, Lenderking W, Jin Y, Wyrwich KW, Gelhorn H, Revicki DA. Is Rasch model analysis applicable in small sample size pilot studies for assessing item characteristics? An example using PROMIS pain behavior item bank data. Qual Life Res 2014;23:485–93.Search in Google Scholar

[58] Reise SP, Morizot J, Hays RD. The role of the bifactor model in resolving dimensionality issues in health outcomes measures. Qual Life Res 2007;16(Suppl 1):19–31.Search in Google Scholar

[59] Magalhaes MO, Costa LO, Ferreira ML, Machado LA. Clinimetric testing of two instruments that measure attitudes and beliefs of health care providers about chronic low back pain. Rev Bras Fisioter 2011;15:249–56.Search in Google Scholar

[60] de Vet HCW, Terwee CB, Mokkink LB, Knol DL. Measument in medicine. A practical guide. New York: Cambridge University Press; 2011.Search in Google Scholar

[61] de Vet HC, Ader HJ, Terwee CB, Pouwer F. Are factor analytical techniques used appropriately in the validation of health status questionnaires? A systematic review on the quality of factor analysis of the SF-36. Qual Life Res 2005;14:1203–18.Search in Google Scholar

[62] Wright BD. Comparing Rasch measurement and factor analysis. Struct Equ Modeling 1996;3:3–24.Search in Google Scholar

Appendix A. Conversion table of the modified 7-items Norwegian biomedical subscale

Raw score Rasch converted score range 7 to 40
7 7.00
8 10.03
9 12.20
10 13.76
11 14.99
12 16.02
13 16.90
14 17.67
15 18.35
16 18.96
17 19.52
18 20.03
19 20.52
20 20.98
21 21.43
22 21.87
23 22.30
24 22.75
25 23.20
26 23.67
27 24.15
28 24.67
29 25.23
30 25.83
31 26.48
32 27.21
33 28.02
34 28.94
35 30.01
36 31.27
37 32.75
38 34.53
39 36.89
40 40.00
  1. Raw data must be adjusted before using the conversion table. Response categories need to be collapsed for item 24. This is done in SPSS using the following recode commands: 1 = 1, 2 = 2,3 = 2, 4 = 3,5 = 3,6 = 4 (Rasch recode 0-1-1-2-2-3).

Appendix B. Conversion table of the modified 7-items Norwegian biopsychosocial subscale.

Raw score Rasch converted score, range 7 to 32
7 7.00
8 9.51
9 11.22
10 12.38
11 13.27
12 13.99
13 14.61
14 15.16
15 15.67
16 16.14
17 16.61
18 17.07
19 17.54
20 18.03
21 18.56
22 19.13
23 19.77
24 20.46
25 21.23
26 22.08
27 23.01
28 24.05
29 25.26
30 26.77
31 28.92
32 32.00
  1. Response categories need to be collapsed for five of the seven items using the following recode commands: Item 11:: 1 = 1, 2 = 1, 3 = 1, 4 = 2, 5 = 3, 6 = 4 (Rasch code 0-0-0-1-2-3). Item 12: 1 = 1, 2 = 2,3 = 2, 4 = 3, 5 = 4, 6 = 5 (Rasch code 0-1-1-2-3-4). Item 17 need no recoding. Item 19: 1 = 1, 2 = 2, 3 = 2, 4 = 3, 5 = 3, 6 = 4 (Rasch recode 0-1-1-2-2-3). Item 29 need no recoding. Item 33:1 = 1, 2 = 1, 3 = 1, 4 = 2, 5 = 3, 6 = 4 (Rasch recode 0-0-0-1-2-3). Item 34: 1 = 1, 2 = 1, 31, = 4 = 2, 5 = 2, 6 = 3 (Rasch recode 0-0-0-1-1-2).

Appendix C. Supplementary data

Supplementary data associated with this article can be found, in the online version, at http://dx.doi.org/10.1016/j.sjpain.2016.06.009.

Received: 2016-01-16
Revised: 2016-06-20
Accepted: 2016-06-27
Published Online: 2016-10-01
Published in Print: 2016-10-01

© 2016 Scandinavian Association for the Study of Pain

Downloaded on 17.5.2024 from https://www.degruyter.com/document/doi/10.1016/j.sjpain.2016.06.009/html
Scroll to top button