The modified Fresno test is a questionnaire developed to evaluate Evidence-Based Practice among physical therapists. A previous study has indicated that a shorter version with 9 items would be more appropriate for Brazilian physical therapists.
ObjectivesTo test the measurement properties of the modified Fresno test - Brazilian-Portuguese short version.
MethodsTo analyze the reliability of the modified Fresno test - Brazilian-Portuguese short version by two raters (intra and inter-rater) in a sample of 133 physical therapists. The intraclass correlation coefficient (ICC2,1) and standard error of measurement (SEM) was used to assess the test-retest reliability. Internal consistency was tested using the Cronbach's alpha coefficient. An expert committee analyzed content validity using the content validity index (CVI). Confirmatory factor analysis (CFA) using the principal components method was used to assess construct validity. Responsiveness was estimated using effect size, and ceiling and floor effects were also investigated.
ResultsInter-rater and intra-rater reliability were, respectively: ICC= 0.93 (95 % CI 0.91, 0.95); rater 1 - ICC= 0.95 (95 % CI 0.94, 0.97); rater 2 - ICC= 0.98 (95 % CI 0.98, 0.99). The agreement was very good (values ≤ 5 %). Internal consistency was good for most instrument items (≥ 0.80). The CVI showed agreement among the expert committee members (0.96). The Cronbach's alpha coefficients calculated for the corrected item total showed values greater than 0.40. In the CFA, the “model 2” showed acceptable indices (≥ 0.90). Responsiveness was classified as very small. No ceiling and floor effects.
ConclusionThe Fresno Modified Test - Brazilian-Portuguese short version has good to excellent reliability. CFA showed that the fit indices were adequate to be used in the population of interest.
Evidence-based practice (EBP) can be defined as the conscious and judicious use of the best scientific evidence to guide therapeutic decisions.1-4 The decision-making process must carefully consider three requirements:5,6 high-quality clinical research, professional knowledge, and patient preferences. The EBP movement has steadily gained ground among physical therapists worldwide over the past decade.3 It is directly related to the rise of global health problems7 and advances in physical therapy studies and practice.8 For some areas of physical therapy, the degree of scientific evidence available for clinical decision-making is very advanced,9 although, for others, it is still very scarce.10 However, EBP is of paramount importance for patients, physical therapists, and health services as it empowers physical therapists to accurately diagnose diseases, predict their course, and provide effective interventions. Consequently, we expect EBP to reduce healthcare costs while enhancing investments from public and private agencies.11,12
Several measurement tools have been developed to evaluate the use of EBP in clinical practice.13-15 According to Shaneyfelt et al.,16 the Fresno test is the only instrument that evaluates all stages of the EBP implementation process using more realistic clinical scenarios, enabling the assessment of EBP knowledge and skills in medical professionals.16-19 The Fresno Test has adequate measurement properties and has been adapted to be used in other languages8,16-17 and by different health professionals.2,20-21
The modified Fresno test for physical therapists is an adapted version of the Fresno test.2,17 Regarding its measurement properties, there are strengths and weaknesses. The modified Fresno test has shown good internal consistency, excellent reliability, and little variability in intra- and inter-rater agreement. which may support its reliability, including the Brazilian-Portuguese version.1,2,22 For exploratory and confirmatory factor analyses, the Brazilian-Portuguese version of the modified Fresno test showed good factor validity and internal consistency suitable for its use by physical therapist and students.1,22 However, these analyses also revealed low reliability and low factor loading for items 2, 9, 10, and 11 of the instruments. Based on these results, a committee of experts suggested the modified Fresno test – Brazilian-Portuguese short version consisting of 9 items. This short version with 9 items showed better reliability than the modified full version with 13 items.22 Hence, the modified Fresno test - short version may be more suitable to objectively assess all steps in the implementation of EBP among Brazilian physical therapists. Therefore, the objective of this study was to test the reliability (i.e., intra-rater end inter-rater reliability, measurement error, and internal consistency), validity (i.e., content validity and construct validity), responsiveness, and ceiling and floor effects of the modified Fresno test – Brazilian-Portuguese short version (i.e., 9 items) among Brazilian physical therapists.
MethodsStudy designThis is a measurement property study of the modified Fresno test – Brazilian-Portuguese short version.22 We followed the recommendations from the COnsensus-based Standards for the Selection of Health Measurement Instruments (COSMIN) initiative.23-25
Participants and recruitmentA convenience sample of physical therapists was invited to participate in the study. The sample size was estimated as proposed by the COSMIN initiative for measurement properties tests.23-25 Inclusion criteria were: (1) physical therapist professors from public and private institutions involved in clinical practice, and (2) physical therapist specialists in any clinical practice areas. Participants were excluded from the study if they refused to answer the test and if they started to answer but did not finish the online survey. The Universidade Cidade de São Paulo local research ethics committee (number #3.636.011/2019) approved the study. This measurement property study was conducted through an online survey using the SurveyMonkey platform. The study was advertised through direct e-mail and text messages to potential participants and posts on social networks (Instagram, Facebook, and Linked In).
ProcedureThe following sociodemographic variables were collected for each participant: age, sex, institution where they graduated (private or public funded), time (years) since graduation, and qualification. The SurveyMonkey platform recorded the time spent by participants to complete the test.
Modified Fresno test – Brazilian-Portuguese short versionData were collected using the modified Fresno Test - Brazilian-Portuguese short version with 9-items, proposed by Silva et al.22 The test features an entry-level text with completion instructions and two clinical scenarios. The participant must choose one of the clinical scenarios proposed by the test and have a maximum of 1 hour to answer all 9 items.2,17 The total score of the modified Fresno test – Brazilian-Portuguese short version is calculated by summing up the answers from each item. For example, item 2 asks the participant to name the research databases to answer the chosen clinical scenario, which has the following scoring criteria: (a) variety of sources, (b) convenience, (c) clinical relevance, and (d) validity.2,17 The answer to each item is scored on a 5-point Likert scale: (1) not evident, (2) limited, (3) minimal, (4) strong, and (5) excellent. The sum of the scores for each criterion results in a score per question that varies between 0 and 24 points: questions Q1 to Q6 allow a maximum score of 24 points for each question; for question Q7, the maximum score is 16 points, and for questions Q8 to Q9, a maximum score of 4 points per question. The total score is the sum of points for all questions and ranges from 0 to 168 points, with a higher score meaning a better understanding of EBP.22
Each participant answered the questionnaire twice (i.e., test-retest design), at the initial assessment (first assessment) and 7 days later (second assessment). At the first assessment, participants received instructions on how to answer the questionnaire and they completed the participant informed consent form. After agreeing to participate in the study and choosing one of the clinical scenarios presented, the participants had up to 1 hour to complete the questionnaire. After this first assessment, all participants were asked to answer the test again seven days later according to the instructions described in the questionnaire (second assessment). The instrument's total score was calculated by two independent assessors.22
Data analysisTwo assessors with experience in EBP analyzed the answers and calculated the total score of the questionnaire. They received a 3-hour training session about the scoring of the modified Fresno test – Brazilian-Portuguese short version. The first hour of the training was dedicated to explaining the questionnaire's scoring criteria. In the second hour of the training, a pilot test was conducted with each rater scoring one questionnaire. Finally, the third hour was dedicated to analysing and discussing the results of the pilot test. After the training, the assessors randomly scored each questionnaire blinded to the participants’ details. Each assessor received the questionnaires after the first and second assessments. Responsiveness of the instrument was analyzed including participants who answered the questionnaire again four months after the first assessment. Our hypothesis is that changes in effect size will be detected after 4 months following the administration of the questionnaire. The effect is related to the fact that when using the modified Fresno test – Portuguese-Brazilian short version, the participant will benefit from the training and access to the information contained therein, providing learning and the search for new knowledge for the use of EBP.22
Statistical analysisDescriptive statistics, including measures of central tendency (simple and relative frequencies, mean and median) and measures of dispersion (standard deviation), were used to describe the sociodemographic data, as well as the total scores and the score for each of the dimension (i.e., confirmatory factor analysis) of the modified Fresno test – Brazilian-Portuguese short version.22
For reliability and agreement we compared the score for each item and the total score of the test obtained in the first and second assessments for each assessor (intra-rater reliability) and between assessors for the score obtained in the first assessment (inter-rater reliability) with a 7-day interval. The reliability analyses followed the same method used in the analysis of the original instrument using intraclass correlation coefficient (ICC) of type 2,1 for continuous variables.2 The ICC was classified according to Fleiss (1986)26: poor (<0.4), good (0.4–0.75), and excellent reliability (>0.75). The Standard Error of Measurement (SEM) was used to assess agreement.13 The percentage of the SEM related to the total score of a questionnaire was interpreted according to Ostelo et al.27 as ≤ 5 %, very good; > 5 % and ≤ 10 %, good; > 10 % and ≤ 20 %, doubtful; and > 20 %, negative.27 Internal consistency analysis was performed by calculating the Cronbach's alpha coefficient for all items of the questionnaire and the corrected item-total correlations on data from assessor 1 collected at the first and second assessment.28 Alpha value of ≥ 0.7 was defined as “acceptable” and ≥ 0.80 as “good.” A value greater than 0.2 was considered for the corrected item-total correlations.29
Content validity was analyzed through the evaluation of a committee of 16 experts (physical therapists, statisticians, and epidemiologists), considering the content validity index. The content validity index measures the percentage of agreement among the members of the expert committee on the aspects addressed in the test items on items of the modified Fresno Test – Brazilian-Portuguese short version.22 The acceptable agreement index among the expert committee members was 0.80. Confirmatory factor analysis was performed using the principal components method for structural validity.30 The total score of the 266 responses in the test and retest was evaluated, totaling 1064 observations. The confirmatory factor analysis was used to investigate the adequacy of the modified Fresno test – a Brazilian-Portuguese short version, with 9 items, represented as “model 2”, with the model proposed by Silva et al.22 of 13 items, defined as “model 1”. The first model evaluated the questionnaire's structure composed of 3 (three) factors. The second model tested the hypothesis that the questionnaire is composed of 2 (two) factors. The maximum likelihood method was used for the analysis. The following indices were used for model fitting: chi-square index, where values with significance levels p < 0.05 are estimated. The Goodness of Fit Index indicates how much the proposed models explain variance and covariance. The comparative fit index, the normed fit index, and the non-normed fit index, representing the proportional improvement in model fit, were also analyzed. Values greater than 0.90 for these indices indicate an adequate model. The root mean square error of approximation considers the error of approximation in the population in a covariance matrix. Values of 0.08 or less represent a reasonable error. The expected cross-validation index, which indicates the best fit of the models and is appropriate for comparing ungrouped models, was also analyzed. There are no benchmark values to rank model fit, and it should be as low as possible.
The responsiveness of the modified Fresno test - Brazilian-Portuguese short version was analyzed by calculating the effect size.22 The effect size is the difference between the mean total score at the 4-month minus the mean total score at baseline (i.e., first assessment) divided by the standard deviation of the baseline scores. The effect size was interpreted as 0.01 as very small, 0.2 as small, 0.5 as medium, 0.8 as large, 1.2 as very large, and 2.0 as huge.31
To analyze the ceiling and floor effect, the assessor's percentage of minimum and maximum possible scores in the first and second assessments were calculated. Ceiling and floor effects were considered present when >15 % of respondents achieved the lowest (i.e., 0 points) or highest (i.e., 168 points) possible total score. The presence of ceiling effects may indicate that the items are not challenging enough or too challenging, respectively.23 The Statistical Package for the Social Sciences (SPSS IBM® - version 22.0) software was used for the statistical treatment of the data.
ResultsDescriptive and reliability analysisEligible physical therapists were recruited for this study between January 2020 and July 2021. A total of 133 physical therapists responded to the questionnaire at the first and second assessments and were included in the reliability analysis. Of these, 82 (62 %) participants answered the questionnaire again four months after the initial assessment and were included in the responsiveness analysis. Participants who were missing at four months did not complete the questionnaire due to lack of knowledge or disinterest in the study.
Table 1 presents the sociodemographic characteristics of the study participants. The sample consisted mainly of female participants (66.9 %), predominantly between 30 and 39 years old (46.6 %). Most of them have a maximum qualification of a master's degree (36.1 %), having graduated from a private educational institution (79.7 %) and completed the degree between 6 and 10 years ago (27.1 %).
Sociodemographic characteristics of the sample (N = 133).
The intra-rater reliability for assessor 1 showed ICCs for each item ranging from 0.63 to 1.0, and SEM ranged from 0.11 to 2.76. Intra-rater reliability was rated as excellent for 8 out of the 9 items. The only item that showed good intra-rater reliability was item 2. The ICCs for most of the items were higher for assessor 2 compared to the ones found for assessor 1. For assessor 2, ICCs for each item ranged from 0.88 to 0.99, and SEM ranged from 0.34 to 2.12 points. These results should be interpreted as excellent intra-rater reliability for each item rated by assessor 2.
For the total score of the modified Fresno test – short version, the intra-rater reliability was rated as excellent for both assessors. Assessor 1 and assessor 2 showed ICCs values of 0.95 (95 % CI: 0.94, 0.97) and 0.98 (95 % CI: 0.98, 0.99), respectively. Inter-rater reliability was rated as excellent, with ICC of 0.93 (95 % CI: 0.91, 0.95). The SEMs were 10.38 points for the inter-rater reliability, 8.72 points for the intra-rater reliability of assessor 1, and 4.52 points for the intra-rater reliability of assessor 2. The SEMs were classified as very good (values ≤ 5 % of the total score) (Table 2).
Intra- and inter-rater reliability of the modified Fresno test – Brazilian-Portuguese short version, per item, and the sum total of the items.
Abbreviations: CI, confidence interval; ICC, intraclass correlation coefficient; SEM, standard error of measurement.
The internal consistency obtained with Cronbach's alpha indicated a value of 0.93 for the nine items of the instrument. The α coefficients calculated for the corrected item-total correlations showed values greater than 0.40.
Content validityContent validity was assessed using the content validity index, which measures the proportion or percentage of agreement among the expert committee on the questionnaire items. A content validity index of 0.96 was found.
Construct validityThe confirmatory factor analysis showed the “Model 2”; with nine items and two factors provided a better fit compared to "Model 1″ with 13 items and three factors (Fig. 1).
The goodness of fit index and comparative fit index reached an acceptable level (≥ 0.90). “Model 2”, which tested a two-factor structure, provided a better fit to the data (Table 3). However, the fit indices of the second model also did not reach acceptable levels. Therefore, the second model, although better than the first model, did not adequately fit the data.
Fit indices for the two-factor models tested according to confirmatory factor analysis.
Models | X2 | df | GFI | CFI | RMSEA | SRMR | AIC | ECVI | NFI | NNFI |
---|---|---|---|---|---|---|---|---|---|---|
Model 1 | 254.58 | 62 | 0.85 | 0.82 | 0.11 | 0.10 | 312.58 | 1.38 | 0.78 | 0.73 |
Model 2 | 52.03 | 24 | 0.91 | 0.92 | 0.09 | 0.08 | 94.03 | 0.71 | 0.87 | 0.81 |
Abbreviations: AIC, akaike's information criterion; CFI, comparative fit index; ECVI, expected cross-validation index; GFI, goodness of fit index; NFI, normed fit index; NNFI, non-normed fit index; RMSEA, root mean square error of approximation; SRMR, standardized root mean residual.
The responsiveness analysis of the modified Fresno test - Brazilian-Portuguese short version is presented in Table 4, with the results of the effect sizes of the mean differences of the total scores in the first and second stages. The table analyzed the effect size appropriate for the paired t-test. For the total score, the effect size was 0.13 points and rated as “very small,” i.e., 4-month follow-up minus the baseline.
Effect sizes of the mean differences of the total scores of the modified Fresno test – Brazilian-Portuguese short version, according to the evaluation times (first and second stage).
Evaluation | N | ES (Glass's Delta) | 95 % CI | Interpretation1(Effect size) | ||
---|---|---|---|---|---|---|
Inf | Sup | |||||
Total Sum of Answers | 1st and 2nd STAGES | 133 | 0.13 | −0.11 | 0.37 | Very Small |
Abbreviations: CI, confidence interval; DP, standard deviation; ES, effect size.
No ceiling and floor effects were found among respondents.
DiscussionOur results showed excellent intra- and inter-rater reliability, very good intra- and inter-rater agreement, good internal consistency, and no ceiling or floor effects for the modified Fresno test – Brazilian-Portuguese short version. Content validity showed agreement among the expert committee members, and construct validity was adequate as a suitable alternative to assess EBP in physical therapists and students. The responsiveness analysis showed a very small effect size.
The scores on the individual questions and the total score of the instrument showed excellent intra- and inter-rater reliability results. These values were also equivalent to the original English version, which showed moderate to excellent intra-rater reliability for both raters for all questions and moderate to excellent inter-rater reliability for all questions. Internal consistency by Cronbach's alpha indicated satisfactory reliability for all instrument items. These values agree with the results obtained in studies developed by Silva et al.,1 Tilson,2 Ramos et al.17 and Argimon-Pallas et al.32 These studies state that the test is reliable in different languages and can be used to evaluate other professionals.
Confirmatory factor analysis (CFA) tested the proposed 2-factor model (model 2) against the short version proposed by Silva et al.22 Two of the nine fit indices (GFI and CFI) used by the maximum likelihood method reached acceptable values (≥ 0.90) in Model 2. The Normed Fit Index (NFI) and Non-Normed Fit Index (NNFI) indexes also showed values close to acceptable and represented the proportional improvement in the fit of model 2.
The responsiveness analysis evaluated the effect size for the total sum of responses, using Glass Delta values, presenting values of 0.13 points, classified as “very small” and” showing a small difference between the groups analyzed. This highlights that the modified Fresno test – Brazilian-Portuguese short version, was responsive in detecting group changes over time. Unresponsive instruments are unable to detect changes in a test over time.
The presence of ceiling and floor effects can influence the sensitivity and responsiveness of a measuring instrument. However, no ceiling and floor effects were observed for the respondents in the instrument's total score. This also highlights the sensitivity and ability of the instrument to detect changes in relation to knowledge about EBP over time. The ceiling and floor effects were not analyzed in the 13-item Modified Fresno Test.
The lack of knowledge about the adoption of EBP was the justification of 51 (38 %) of the participants who did not complete the instrument on the second occasion. These difficulties may be related to the biggest obstacle presented by students, physical therapists, and other health professionals in adopting EBP, which is the difficulty in searching, interpreting, and transposing evidence to clinical practice. This obstacle is directly related to the competencies and skills of each professional. Other barriers to the adoption of EBP are also pointed out in other studies, such as limited access to databases and full texts, language issues, and the time available for updating.
The sample used in this study may characterize a limitation. As in other studies of measurement properties, a convenience sample was used instead of determining a sample size to achieve statistically significant results. The 7-day time interval between test and retest applications may also be a study limitation and may bias the correlation coefficient calculation. More research should be done to confirm the instrument's measurement properties in other samples with different characteristics. It is suggested that studies be conducted comparing the test version presented here with other versions. Even so, it would be important to attest to the ability of the instrument to discriminate between theoretically different groups.
The strength of this study was to demonstrate that the modified Fresno test – Brazilian-Portuguese short version, maintained the quality of its measurement properties. This makes the instrument easier for respondents and more reliable for assessing EBP in online assessments.
ConclusionThe modified Fresno test – Brazilian-Portuguese short version showed excellent intra- and inter-rater reproducibility, very good agreement, and good internal consistency. The content validity test was in agreement among the experts and, through Confirmatory Factor Analysis, showed good fit indicesThese results confirm that the short version is a suitable alternative to comprehensively assess EBP in physical therapy professionals and students.
This study was financed in part by the Coordenação de Aperfeiçoamento Pessoal de Nível Superior – Brasil (CAPES) – Finance code 001.