Psychometric properties of the German version of Observer OPTION5
BMC Health Services Research volume 18, Article number: 74 (2018)
In order to conduct studies on shared decision-making (SDM) and to implement SDM in routine practice, psychometrically tested measures are needed. The development of the short 5-item version of the OPTION scale (Observer OPTION5) allows to assess SDM from an observer perspective. Observer OPTION5 is so far only available in English and Dutch. The aim of this study was to translate the Observer OPTION5 rating scale into German and to test its psychometric properties.
The German Observer OPTION5 was tested in a secondary data analysis of audio-recordings of patient-physician-consultations (N = 79) in German primary care practices. Demographic data were analysed using descriptive statistics. To assess inter- and intra-rater reliability, intraclass correlation coefficients (ICCs) were calculated. For assessing concurrent validity, a correlation (Spearman’s Rho) of the sum score of Observer OPTION5 and Observer OPTION12 was calculated.
The consultations dealt with decisions regarding type 2 diabetes (N = 31), chronic back pain (N = 23), depression (N = 20), and other diseases (N = 5). Analysis of inter-rater reliability yielded an ICC of 0.82 for the sum score; across the five single items ICCs ranged between 0.45 and 0.77. For the intra-rater reliability an ICC of 0.83 was observed for the total score; across the five single items ICCs ranged between 0.45 and 0.86. The Observer OPTION5 had a mean total score of 11.84 (SD = 11.92) and the Observer OPTION12 had a mean total score of 10.3 (SD = 7.9), both on a potential range of 0 to 100. The correlation between the total scores of Observer OPTION5 and Observer OPTION12 was r = 0.47 (p = 0.01).
The results regarding inter- and intra-rater reliability were excellent on the total score level. Observer OPTION5 showed moderate concurrent validity using Observer OPTON12. The results are generally comparable to the results of the original English version of Observer OPTION5. The German version of Observer OPTION5 can be used in research and evaluation of clinical practice. Nevertheless, further testing is adviced.
Over the last years, there has been a shift in physician-patient communication away from the paternalistic model of decision-making towards shared processes between physicians and patients [1, 2]. In the paternalistic model of decision-making, the physician is characterized as information keeper, who makes decisions for the patient in the intention to know what is best for the patient . Shared decision-making (SDM) is defined as a collaborative process that allows a patient and his/her provider(s) to make health care decisions together based on shared clinical and psychosocial information and the best available evidence .In the course of this process, the provider(s) support(s) the patient to engage in deliberation about the different diagnostic or treatment options in order to come to a shared and informed decision in concordance with the patient’s informed preferences .
To evaluate whether SDM has been implemented in health care, the physicians’ communicative skills for sharing information and for involving patients in the decision-making process have to be assessed. Therefore, the development and psychometric testing of observer rating scales that evaluate whether SDM took place is essential to allow standardised evaluation of physician-patient communication.
Although preferences for participation in decision-making differ between patients with different diagnoses, most patients want to be involved if more than one treatment option exists [4,5,6]. SDM is positively associated with patient outcomes (e.g., knowledge, satisfaction, decisional conflict, trust) . Despite patients’ preferences for SDM and its positive effects on patient outcomes, it is still not well implemented in routine practice [5, 8]. The discrepancy between patients seeking involvement and physicians obstructing this involvement can be analysed from a patient’s, a physician’s and an observer’s point of view [9, 10]. Observer rating can provide a general estimate of the involvement of both parties and permits an objective assessment of the SDM process in a consultation. Several observer rating scales exist in English, e.g. the Observer OPTION12,the Rochester Participatory Decision Making Scale, the Brief Decisison Support Analysis Tool, and the Decision Analysis System for Oncology .
So far the Observer OPTION12 (OPTION scale - observing patient involvement) is the only internationally widely used observer measure available in German language . Until now the Observer OPTION12 is a frequently applied observer measure to assess SDM. The Observer OPTION12 can be used by trained observers to assess SDM during a consultation, in communication trainings or in research using recorded consultations. In the development of new scales Observer OPTION12 has been often used as a comparator scale to assess validity [11, 12]. Despite its wide use, psychometric testing of Observer OPTION12 revealed a great variation in reliability across different studies , and the necessity for improvements concerning specific items. Several items of Observer OPTION12 (mainly focusing on the degree of exploration of the patient’s preferences and checking the patient’s understanding) were rarely observed (i.e. mainly rated 0) or not specific for SDM . Other items were revised or combined . This led to the development of Observer OPTION5 as a shorter and revised version of Observer OPTION12 . For the development of Observer OPTION5 published models were analysed to identify the core components of a conceptual framework of SDM. By using this framework, which includes data from an observational study of clinical practice in Canada and the existing experience of using Observer OPTION12, Observer OPTION5 was developed . Observer OPTION5 focuses on the core aspects of SDM and is shorter with only 5 items. Thus, the scale may be less time-consuming and easier to implement in clinical settings . Furthermore, the scale also assesses patient contribution to the decision-making process unlike the Observer OPTION12, which only assesses the physician’s contribution to the process. Both measures are described in more detail in the Methods section.
Psychometric testing of the English version of Observer OPTION5 showed adequate concurrent validity with Observer OPTION12 (r = 0.61), intra-rater reliability (r = 0.93) and inter-rater agreement (ICC = 0.67) . A Dutch version reached comparable results with good inter-rater agreement (k = 0.68) and a positive correlation with Observer OPTION12 (r = 0.71) . Based on results of these prior studies on the English and Dutch versions, we hypothesised that the German Observer OPTION5 version would reach comparable results [15, 16].
This study aimed to establish a German version of Observer OPTION5 and to test its psychometric characteristics.
The original English version of the Observer OPTION5 was translated into German to reach cross-cultural equivalence between these versions . In collaboration with the main developer (GE) of Observer OPTION5, we agreed on a translation process that consisted of a translation from the original English version to German by two independent bilingual translators (MK, WF (cp. Acknowledgements)), whose first language is German. In the next step, a third person (IS) suggested a third German version that combined the first two translations. Then the three translators reached consensus on one final version. This so-called ‘team translation approach’ does not include a backward translation , as a backward-translation does not necessarily reveal the major discrepancies of the original and targeted versions and provides no critical information regarding the underlying issue for the discrepancies [19, 20]. The final German version and the corresponding translated user manual were evaluated during the rater-training, which led to a subsequent revision of a few phrases in the manual. The German manual is available from the corresponding author upon request.
Psychometric testing and study design
This study used audio-recordings of patient-physician consultations to assess SDM using the Observer OPTION5. These data were collected in a different study on the psychometric testing of the 9-item Shared Decision Making Questionnaire (SDM-Q-9), funded by the German Ministry of Education and Research. In 2010 patient-physician consultations in primary care (i.e. in private practice non-hospital settings) were audio-recorded as part of this study. Additionally, demographic data of patients and physicians were collected via self-report questionnaires. Furthermore, the physicans provided information about the patients’ diagnosis and reason for the consulation. Inclusion critera for patients were 1) a diagnosis of type 2 diabetes, chronic back pain or depression, 2) above 18 years of age, 3) German-speaking and 4) facing a treatment decision in consideration of one of the three diagnoses named above. Patients with cognitive impairment were excluded. Few physicians with problems in including patients were instructed to include patients with other chronic diseases (e.g. hypertension). Most recorded consultations dealt with one specific decision, since this was the instruction for participating physicians .
Within the primary study, the recordings were evaluated with Observer OPTION12; these existing ratings of the Observer OPTION12 were re-used in this study.. A total of 79 audio-recordings were now additionally rated in this secondary data-analysis with the German version of Observer OPTION5. In the primary study a sample size was aimed that would allow the detection of correlations above 0.5 with a power of 80% to provide a solid basis for the psychometric analyses. With an estimated dropout of 20% of physicians and missing data (estimated 12.5% of consultations), a final sample size of N = 63 was definited to be adequate in the first study .
Rater training and rating process
The training of the two raters was undertaken by one of the authors (IS), who was trained for the rating of Observer OPTION12 and took part in a workshop on Observer OPTION5. Two reviewers (MKand JT), both familiar with the concept of SDM, were trained on how to use the Observer OPTION5 during a six-hour rater training. Five audio recordings and two video recordings were examined independently by all raters. The results were then discussed and consent was formed with help of the manual.
After the training the 79 records were evaluated separately by both raters in order to assess inter-rater reliability of the German version of Observer OPTION5. One of the raters (MK) rated them a second time within one month of the first rating to assess intra-rater reliability.
Observer OPTION12 and Observer OPTION5 scale
The Observer OPTION12 scale measures the degree of perceived SDM in a consultation. It focuses on the physician’s SDM behaviour and can be used in various medical situations . So far it has been translated into Chinese, Dutch, French, German, Italian, Spanish and Swedish . The scale consists of 12 items measuring aspects of SDM, which can be rated on a 5-point Likert scale (from 0 = the behaviour is not observed to 4 = the behaviour is observed and executed to a high standard) [21, 22]. Psychometric testing showed good inter-rater reliability (ICC = 0.77) . However, item independence requires further psychometric testing .
In Observer OPTION5 some of the Observer OPTION12 items were excluded or combined because the items were not specific enough for SDM or too idealistic to realise . Furthermore, the Observer OPTION5 allows to rate a physician’s reaction if a patient actively brings up a part of the SDM process. This focus on the dyadic process was added to the Observer OPTION5 rating scale, as it was a shortcoming of the Observer OPTION12, where it was only possible to rate actions of physicians.The items of Observer OPTION5 regarding SDM are observer rated and include the following: 1) informing the patient that a decision has to be made, 2) assuring that the patient will be supported and deliberate about options, 3) giving information on the options and mentioning pros and cons, 4) eliciting the patient’s preferences, and 5) how to integrate the patient’s preferences in the decision. These five items can be rated on a 5-point Likert scale, which is shown in Table 1.
In this study 79 audio-recordings were included. Descriptive statistics were calculated for demographic data. To test intra- and inter-rater reliability intra-class correlation coefficients (ICC) were calculated. This included an overall score and an item-by-item testing. Regarding the overall score, the results were rescaled to a total score of 0 to 100. For the ICC calculation the two-way-mixed model was used. An absolute agreement and a mean ICC were assessed. The comparison of Observer OPTION5 to the previous Observer OPTION12 scale was examined by testing concurrent validity. Since no normal distribution was found, a Spearman’s correlation was calculated. Spearman’s Rho is examined by an averaged cumulative value. For all measures a formative measure model was used and all data were analysed with SPSS Statistics 23 (SPSS Inc., Chicago, IL). Results of ICC between 0.75–1.0 were classified as excellent, 0.6–0.74 as good, 0.4–0.59 as moderate and 0–0.39 as poor .
The consultations dealt with decisions regarding type 2 diabetes in 31 consultations, chronic back pain in 23, depression in 20, hypertension in two and other diseases in three consultations. Two-thirds were female and one-third male patients and their mean age was 54.7 years. The majority of the patient sample had a low education level (52.6%) and were married (48%). Demographic and clinical characteristics are shown in Table 2.
The physician sample included eleven (45.5%) general practitioners, eight (33.3%) specialists for internal medicine, three (12.5%) orthopaedics and two (8.2%) psychiatrists. Physicians’ mean age was 49.4 with a mean of eleven years of professional experience. In Table 3 additional information on participating physicians are displayed.
The items were evaluated with a range on the total score of 0 to 47.5 after rescaling the scale to a total of 100. The average was 11.84 (SD 11.92) for the total score on the Observer OPTION5. Overall no item was rated with 4 = exemplary effort. The item frequencies are displayed seperately for each rater in Table 4 and Table 5. ICCs for the inter-rater reliability of single items ranged between 0.45 (item one) and 0.77 (item three). The overall inter-rater reliability observed an ICC of 0.82. For the intra-rater reliability the ICCs of single items lay between 0.45 (item two) and 0.86 (item one and three) and the total score reached an ICC of 0.83. Item two showed a deviating result with an ICC of 0.45. The results for inter- and intra-reliability and the mean item evaluations are displayed in Table 6.
A significant correlation (p = 0.01) between the Observer OPTION5 total score and Observer OPTION12 total was observed (r = 0.47). This shows a positive correlation. A scatterplot of the sum scores of both scales are shown in Fig. 1.
In this study a German version of the Observer OPTION5 scale was developed and psychometrically tested. As part of a secondary data analysis, audio recordings of primary care consultations were evaluated independently by two raters with the German Observer OPTION5. Comparable results to the English and Dutch version were hypothesised [15, 16]. The testing of the German version of Observer OPTION5 showed excellent inter- and intra-rater reliability on the total score levels (0.82 and 0.83). On the item level, the inter-rater and intra-rater reliabilities were moderate to excellent (0.45–0.86). No item was rated higher than three (=skilled effort), leading to a left-skewed distribution, which is comparable to the first psychometric testing of Observer OPTION5 . This result might be influenced by the physician sample, as none of the participating physicans had any particular training in SDM. A systematic review on studies using Observer OPTION12 found similarly low ratings in untrained healthcare providers .
The results regarding reliability are comparable to the first psychometric testing of the original English version of Observer OPTION5 (ICC = 0.67)  and the psychometric testing of the Dutch version (k = 0.68) . These high inter-rater reliability results (ICC = 0.82) in this study compared to inter-rater agreement (ICC = 0.67) in the first Observer OPTION5 testing of the English version  may be due to differences in the determination of the relevant decision. In the study at hand, mostly one main decision was dealt with in the consultations. In other studies, vague or many decisions within one consultation may cause lower inter-rater agreement, because raters might not focus on the same issue. The assessment of concurrent validity of the German Observer OPTION5 scale compared to Observer OPTION12 showed a moderately positive correlation. While the concurrent validity using a correlation to Observer OPTION12 (r = 0.47) is a bit lower than in the two other studies (r = 0.61; r = 0.71) [15, 16], we still found a significant moderate positive correlation , which is in line with our hypothesis. The comparatively smaller correlation might be influenced by the low variance in the Observer OPTION5 scores, which is known to deteriorate measures of association (also referred to as the ‘restriction of range’ problem).
These psychometric results indicate that the German version of Observer OPTION5 is a reliable and valid rating scale. It is the shortest available observer rating scale for SDM. This scale can be used to assess SDM in physician-patient-communication and to evaluate physicians’ communication skills. Furthermore, as suggested by Barr and colleagues , the Observer OPTION5 could possibly be used in communication trainings for physicians as a feedback tool to improve physicians’ SDM skills. However, further research the measure’s potential use as training tool is necessary.
A main strength of this study was the widespread assessment of psychometric properties including inter-rater, intra-rater and concurrent validity of the newly adapted German Observer OPTION5. Since testing showed positive agreement between the German Observer OPTION5 scale and the previous Observer OPTION12 scale, the German Observer OPTION5 was shown to be feasible for use as an observer rating scale in German speaking countries.
A limitation of this study is that the evaluated data showed low variance. The items were mostly rated with no effort (0) or minimal effort (1). Nevertheless, this study reached good psychometric results for inter-rater agreement, intra-rater agreement and concurrent validity. Furthermore, the psychometric properties of the German version of Observer OPTION5 were tested in an primary care setting with encounters focussing mainly on three chronic conditions. Generalizability beyond this setting is limited. Whenever a measure is used in a different setting, a different patient group or a different country psychometric properties should be re-established . Future studies should investigate other psychometric properties like responsiveness in order to establish a scale that can be used in intervention studies in the future. It would also be important to test Observer OPTION5 with a sample of physicians trained in SDM, in order to assess whether this leads to a higher variation of items distribution than in the present study.
This study shows that the developed German version of Observer OPTION5 has good inter-rater and intra-rater agreement. Furthermore, the results indicate moderate concurrent validity of Observer OPTION5. These results support the body of evidence regarding the validity and reliability of the tool. It can be used to evaluate decision-making processes in clinical practice settings and in health services research. Nevertheless, further testing is advised, especially before using the measure in other settings or with other patient groups.
- OPTION scale:
Observing patient involvement scale
Emanuel EJ, Emanuel LL. Four models of the physician-patient relationship. J Am Med Assoc. 1992;267(16):2221–6.
Charles C, Gafni A, Whelan T. Decision-making in the physician-patient encounter: revisiting the shared treatment decision-making model. Soc Sci Med. 1999;49(5):651–61.
Elwyn G, Frosch D, Thomson R, Joseph-Williams N, Lloyd A, Kinnersley P, Cording E, Tomson D, Dodd C, Rollnick S. Shared decision making: a model for clinical practice. J Gen Intern Med. 2012;27(10):1361–7.
Guadagnoli E, Ward P. Patient participation in decision-making. Soc Sci Med. 1998;47(3):329–39.
Dierckx K, Deveugele M, Roosen P, Devisch I. Implementation of shared decision making in physical therapy: observed level of involvement and patient preference. Phys Ther. 2013;93(10):1321–30.
Chewning B, Bylund CL, Shah B, Arora NK, Gueguen JA, Makoul G. Patient preferences for shared decisions: a systematic review. Patient Educ Couns. 2012;86(1):9–18.
Shay LA, Lafata JE. Where is the evidence? A systematic review of shared decision making and patient outcomes. Med Decis Mak. 2014;35(1):114–31.
Gravel K, Légaré F, Graham ID. Barriers and facilitators to implementing shared decision-making in clinical practice: a systematic review of health professionals’ perceptions. Implement Sci. 2006;1(1):16.
Sepucha KR, Scholl I. Measuring shared decision making: a review of constructs, measures, and opportunities for cardiovascular care. Circ Cardiovasc Qual Outcomes. 2014;7(4):620–6.
Scholl I, Koelewijn-van Loon M, Sepucha K, Elwyn G, Légaré F, Härter M, Dirmaier J. Measurement of shared decision making–a review of instruments. Z Evid Fortbild Qual Gesundhwes. 2011;105(4):313–24.
Scholl I, Kriston L, Dirmaier J, Härter M. Comparing the nine-item shared decision-making questionnaire to the OPTION scale–an attempt to establish convergent validity. Health Expect. 2015;18(1):137–50.
Scholl I, Nicolai J, Pahlke S, Kriston L, Krupat E, Härter M. The German version of the four habits coding scheme–association between physicians’ communication and shared decision making skills in the medical encounter. Patient Educ Couns. 2014;94(2):224–9.
Nicolai J, Moshagen M, Eich W, Bieber C. The OPTION scale for the assessment of shared decision making (SDM): methodological issues. Z Evid Fortbild Qual Gesundhwes. 2012;106(4):264–71.
Elwyn G, Tsulukidze M, Edwards A, Legare F, Newcombe R. Using a 'talk' model of shared decision making to propose an observation-based measure: observer OPTION 5 item. Patient Educ Couns. 2013;93(2):265–71.
Barr PJ, O’Malley AJ, Tsulukidze M, Gionfriddo MR, Montori V, Elwyn G. The psychometric properties of observer OPTION 5, an observer measure of shared decision making. Patient Educ Couns. 2015;98(8):970–6.
Stubenrouch FE, Pieterse AH, Falkenberg R, Santema TKB, Stiggelbout AM, van der Weijden T, Aarts JAW, Ubbink DT. OPTION 5 versus OPTION 12 instruments to appreciate the extent to which healthcare providers involve patients in decision-making. Patient Educ Couns. 2016;99(6):1062–8.
Beaton DE, Bombardier C, Guillemin F, Ferraz MB. Guidelines for the process of cross-cultural adaptation of self-report measures. Spine (Phila Pa 1976). 2000;25(24):3186–91.
Center SR: Guidelines for Best Practice in Cross-Cultural Surveys. In. MI: Survey Research Center, Institute for Social Research, University of Michigan; 2016: http://www.ccsg.isr.umich.edu/.
Johnson TP: Approaches to equivalence in cross-cultural and cross-national survey research. In.: Harkness, Janet (Ed.); Zentrum für Umfragen, Methoden und Analysen -ZUMA- (Ed.): Cross-cultural survey equivalence; 1998: 1–40.
Harkness J, Schoua-Glusberg A: Questionnaires in translation. In.: Harkness, Janet (Ed.); Zentrum für Umfragen, Methoden und Analysen -ZUMA- (Ed.): Cross-cultural survey equivalence; 1998: 87–126.
Elwyn G, Hutchings H, Edwards A, Rapport F, Wensing M, Cheung WY, Grol R. The OPTION scale: measuring the extent that clinicians involve patients in decision-making tasks. Health Expect. 2005;8(1):34–42.
Couët N, Desroches S, Robitaille H, Vaillancourt H, Leblanc A, Turcotte S, Elwyn G, Légaré F. Assessments of the extent to which health-care providers involve patients in decision making: a systematic review of studies using the OPTION instrument. Health Expect. 2015;18(4):542–61.
Cicchetti DV. Guidelines, criteria, and rules of thumb for evaluating normed and standardized assessment instruments in psychology. Psychol Assess. 1994;6(4):284.
Cohen J: Statistical power analysis for the behavioral sciences. Hilsdale. NJ: Lawrence Earlbaum Associates 1988, 2.
Streiner DL, Norman GR, Cairney J: Health measurement scales: a practical guide to their development and use: Oxford university press; 2008.
Elwyn G, Grande SW, Barr P: Observer OPTION 5 Manual. In. Darthmouth: The Dartmouth Institute for Health Policy and Clinical Practice; 2016: http://www.optioninstrument.org/raters-manual.html.
We would like to thank Wiebke Frerichs (WF) for her help in the translation process and Pola Hahlweg for the advice and support in the preparation of the manuscript.
The primary study that offered a context for the present study (secondary data analysis) was funded by the German Ministry of Education and Research (project number: 01GX0742).
Availability of data and materials
The data that support the findings of this study are available from the corresponding author (IS) upon reasonable request and after consultation with the Ethics Committee of the State Chamber of Physicians in Hamburg (Germany).
Ethics approval and consent to participate
The study was carried out in accordance with the Code of Ethics of the Declaration of Helsinki. Data for this study were collected within a previous study, which was approved by the Ethics Committee of the State Chamber of Physicians in Hamburg (Germany). Informed consent was obtained from all participants prior to the data collection. Since the study at hand was a secondary data analysis, no additional examination of the study by the ethics committee was necessary.
Consent for publication
MK and JT declare that they have no competing interests. MH declares that he is co-PI in a SDM research project funded by Mundipharma GmBH, a pharmaceutical company. IS conducted one physician training in shared-decision making within the research project funded by Mundipharma GmBH. The authors did not receive funding from Mundipharma GmBH for this paper, nor was the company involved in any steps of this study or publication process. GE declares that he has edited and published books that provide royalties on sales by the publishers: the books include Shared Decision Making (Oxford University Press) and Groups (Radcliffe Press). He has in the past provided consultancy for 1) Emmi Solutions LLC who develop patient decision support tools; 2) National Quality Forum on the certification of decision support tools; 3) Washington State Health Department on the certification of decision support tools; 4) SciMentum LLC, Amsterdam (workshops for shared decision making). He is currently director of &think LLC which owns the registered trademark for Option Grids TM patient decision aids. He provides consultancy in the domain of shared decision making and patient decision aids to: 1) Access Community Health Network, Chicago (Federally Qualified Medical Centers), and to 2) EBSCO Health Option Grids TM patient decision aids. GE initiated the Option Grid Collaborative, tools that are hosted on a website managed by Dartmouth College, on http://optiongrid.org/). Existing Option Grids hosted at this website are freely available until such time as the tools have expired. He owns copyright in measures of shared decision making and care integration, namely CollaboRATE, IntegRATE, and Observer OPTION. These measures are freely available for use.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Kölker, M., Topp, J., Elwyn, G. et al. Psychometric properties of the German version of Observer OPTION5. BMC Health Serv Res 18, 74 (2018). https://doi.org/10.1186/s12913-018-2891-6
- Shared decision-making
- Patient involvement