Skip to main content

Measuring and improving performance of clinicians: an application of patient-based records

Abstract

Backgound

Efforts to measure performance and identify its driving factors among clinicians are needed for building a high-quality clinician workforce. The availability of data is the most challenging thing. This paper presented a summary performance measure for clinicians and its application on examining factors that influence performance using routine patient-based records.

Methods

Perfomance indicators and difficulty score were extracted from electronic medical records (EMRs). Difficulty adjustment and standardized processing were used to obtain indicators which were comparable between specialties. Principal component analysis (PCA) was used to estimate the summary performance measure. The performance measure was then used to examine the influence of person-job fit and burnout through a mediator effect model and cluster analysis.

Results

A valid sample of 404 clinicians were included in this study, and 244 of them had valid response in the questionnaire. PCA explained 79.37% of the total variance presented by the four adjusted performance indicators. Non-performance attributes and performance driving factors help distinguish different clusters of clinicians. Burnout mediates the relationship between person-job fit and performance in a specific group of clinicians (β = 0.120, p = 0.008).

Conclusions

We demonstrated the analytical steps to estimate clinicians’ performance and its practical application using EMRs. Our findings provide insight into personnel classified management. Such practice can be applied in countries where electronic medical record systems are relatively less developed to continuously improve the application of performance management.

Peer Review reports

Introduction

Over ten years have passed since China’s comprehensive health system reform launched in 2009. China has made substantial progress in reducing cost and improving access and quality of healthcare [1,2,3,4,5,6]. More attention should be paid to the management of human resources, since it is one of three principal health system inputs [7]. It is important for human resources to ensure clinicians are able to maintain their superior status and centrality due to the exclusiveness and specificity of the profession's domain of knowledge [8, 9], and they drive a vast majority of treatment decisions and influence healthcare services [10]. It is estimated that up to 21% of the healthcare costs in the United States can be directly attributed to the services they provide [11]. Clinicians’ high workforce performance is critical to quality service delivery. Hence performance assessments are of great importance.

The most challenging thing in measuring performance is to ensure the results are fair and comparable. Relevant methods have been developed to adjust resource consumption and potential risk, such as Diagnosis Related Groups (DRGs) for hospital payment system and relative value units for medical service pricing and measuring individual labor value [12]. There are also other innovative approaches such as the Indexes of Complexity of Assistance for measuring nursing performance [13], and data envelopment analysis [14]. These methods are always based on a high-quality electronic medical records (EMRs) system and coding system [15]. However, for some countries, use of EMRs is just beginning and systems are not yet standardized. China has pushed the development of EMRs since 2006 [16] and launched a DRG system in 2009. Performance assessments based on these data systems have only been developed for regions, not individuals [17]. Taking into account the practical dilemma of performance assessments in these countries, more explicit and pragmatic methods are needed.

Performance assessments are always used for administrative, feedback and research goals [18], and their ultimate goal is to improve performance [19]. Many factors influence performance of clinicians. In addition to treating patients, clinicians have many other tasks, such as scientific research, teaching, and administrative responsibilities [20, 21]. Between 1998 and 2016, patient visits and inpatient admissions per clinician in China increased by 135% and 184%, respectively [22]. Many Chinese clinicians have also complained about the excessive pressure to publish research [23]. Studies have showed that multi-tasking may hinder performance when individuals’ ability to handle multiple tasks at once does not match the job requirement [24, 25]. These pressures are all factors which may lead to burnout [26, 27]. The overall prevalence of burnout symptoms among clinicians in China ranges from 66.5 to 87.8% [28]. Person-job fit (P-J fit) [29] and burnout [27, 28] both have significant effect on clinicians’ performance. The feeling of being depleted of one’s emotional and physical resources adversely affects the ability to function effectively [30], resulting in a mismatch between individual ability and job demand, which leads to a further decline in performance. Despite many studies examining the relationship between work attitude and subjective performance, little research has used objective performance [31]. A combination of influence factors and objective performance may improve the practical value of performance assessments.

Our study aims to demonstrat the analytical steps to measure clinicians’ performance and further apply the results to identify factors that influence performance. The clinicians in this study mainly refer to doctors who provide clinical services to patients, including internal and surgical medicine. By applying the performance measurement results to daily practice, this study can provide evidence and reference for continuous improvement in the application of performance management. Specifically, we examine the effects of person-job fit and burnout on clinicians’ performance. Four hypotheses are examined.

  • Hypothesis 1: High levels of P-J fit are associated with better performance.

  • Hypothesis 2: High levels of P-J fit are expected to be associated with low levels of burnout.

  • Hypothesis 3: High levels of burnout are linked with poor performance.

  • Hypothesis 4: Burnout mediates the relationship between P-J fit and performance.

Methods

Study setting and data source

The study sample was 426 clinicians who sought promotion in nine pilot clinical specialties undergoing promotion reform from 67 public hospitals in Shanghai, China, 2020.

Three data sources from Shanghai in 2019 were used: EMRs of inpatient record for performance indicators, personnel files from a human resource information system for basic demographic information, and a survey for attitude to work. The datasets were linked by a unique personal identification number assigned to every clinician.

We adopted some rules for data quality control. In the process of retrieving EMRs, those who had missing key variables or had outliers exceeding five standard deviations on performance indicators were excluded. 404 out of 426 clinicians met the criteria. For the survey, an encrypted link to access the questionnaire was sent to each clinician. A detailed explanation of the scope of the study was provided and informed consent was requested. Responses were uploaded directly to the questionnaire platform, without passing through the clinicians' supervisors. Clinicians who failed to submit questionnaires and whose answer time was too short to be valid were excluded. 254 of 426 clinicians met the criteria. Non-response bias was detected and confirmed to have no effect [32].

Measurements

Developing summary performance measure

In the database of EMRs, each patient can undergo multiple procedures, including surgeries and internal operations. These procedures result in an overall cost and time consumption, and each procedure is performed by one to three clinicians. So, the development of a summary performance score was mainly based on the procedures.

Constructing the difficulty score

First, we constructed a mathematical calculation to quantify the difficulty of clinicians’ medical procedure work. Each procedure has been given a grade (1 to 4) based on its technical difficulty, surgical complexity and risk by the health authority, and it is combined into the National Clinical Surgical Operation Classification Code system. For each procedure, a clinician be surgeon, first assistant or second assistant. Each grade and role have been assigned a technical weight by expert opinion. The difficulty score is derived from the grades of the operations a clinician performs and the role in those operations.

In the first sub-step, for a specific clinician \(m\) in specific procedure role \({R}_{i}\left(i=\mathrm{1,2},3\right)\), the number of procedures of grade \(j\) is \({N}_{mij}\left(j=\mathrm{1,2},\mathrm{3,4}\right)\), and the technical weight (See in Additional file 1: Table A2 and A3) of grade \(j\) is \({WG}_{j}\left(j=\mathrm{1,2},\mathrm{3,4}\right)\). Hence, the weighted average of procedure (WAP) for \({R}_{i}\) is given by Eq. (1):

$${WAP}_{mi}=\frac{{\sum }_{j=1}^{4}{N}_{mij}*{WG}_{j}}{{\sum }_{j=1}^{4}{N}_{mij}}$$
(1)

The second sub-step is to identify the percentage of each role the clinician m plays in the procedure. The number of procedures is \({N}_{mi}\). The weighted average percentage of procedure (WAPS) for \({R}_{i}\left(i=\mathrm{1,2},3\right)\) is given by Eq. (2):

$${WAPP}_{mi}=\frac{{N}_{mi}*{WAP}_{mi}}{{\sum }_{i=1}^{3}{N}_{mi}}$$
(2)

The third sub-step is to calculate the difficulty score. The technical weight (See in Additional file 1: Table A4) of role \({R}_{i}\left(i=\mathrm{1,2},3\right)\) is \({WR}_{i}\left(i=\mathrm{1,2},3\right)\). The difficulty score for clinician m is given by Eq. (3):

$${DS}_{m}={\sum }_{i=1}^{3}{WAPP}_{mi}*{WR}_{i}$$
(3)

According to this definition, the difficulty score is a continuous numerical variable ranging from 0 to 1. The higher the score, the more technically difficult the procedure, the more complex the procedure, and the higher the risk. The difficulty score makes it possible to compare between clinicians. Table 1 presents an example of a single clinician’s difficulty score.

Table 1 An example of difficulty score for specific clinician

Constructing the summary performance score

Indicator selection

We selected four indicators under the structure-process-outcome framework [33]. Structure is represented by the annual volume of surgery and operation (VSO). Process is represented by resource consumption, including the average LOS per patient (ALOS) and hospitalization expense per patient (AHE). Outcome is represented by the inpatient mortality rate (MR). We extracted these four indicators from EMRs for the whole population and used the difficulty score to make difficulty adjustment. VSO is multiplied by the difficulty score and the other three indicators are divided by difficulty score. To obtain indicators that are comparable between specialties, we used the deviation from the mean, with maximum difference normalization within specialties. These four adjusted indicators formed the input to principal component analysis (PCA) for constructing the summary performance score.

PCA is a data reduction technique that extracts the effective information from several inter-correlated indicators, and represents it as a set of orthogonal variables called principal components [34]. Mathematically, principal components were obtained from eigen-decomposition of positive semidefinite matrices and upon the singular value decomposition of rectangular matrices.

The principal components were combined into a single measure using the weight calculation method (entropy method in this study [35]). The resulting score was rescaled to a centesimal system by multiplying by 100.

Using the performance score to examine P-J fit and Burnout

P-J fit and burnout were measured using standard scales. The score for each scale is calculated by averaging the item score from each question. Both scales met the acceptable threshold (> 0.7) of Cronbach’s α and Kaiser–Meyer–Olkin (KMO) coefficient, with good internal consistency and construct validity [36, 37].

P-J fit

P-J fit is attained when an individual’s compatibility with a particular job exists and when an individual possesses the knowledge, skills and attitude which match the job requirements [38]. P-J fit was measured using a 6-item scale developed by Cable and DeRue (2002) (e.g., “The attributes that I look for in a job are fulfilled very well by my present job.”). The responses are anchored on a 5-point Likert scale, ranging from 1 = strongly disagree to 5 = strongly agree.

Burnout

Burnout is a psychological stress caused when there is a perceived imbalance between resources and demands, which reduces the motivation and effectiveness of individuals [39] and lead to impaired functioning on the job [40]. We measured burnout with the Chinese version of the 15-item Maslach Burnout Service Inventory (MBI-GS) scale [41] (e.g., “I feel emotionally drained from my work.”). The responses are anchored on a 7-point Likert scale reporting how often a feeling occurs, ranging from 0 = never to 6 = every day. Negative items were scored in reverse.

Other variables

We obtained information on clinicians’ gender, education, specialty, management position (including head of department, deputy department director and vice president of hospital) and duration in the current position. For education level, bachelor’s degree served as the reference category. Nine clinical specialties were merged into surgery, internal, gynecology and pediatric, with surgery as the reference category. Duration in the current position was measured in years and treated as a continuous variable. Promotion appraisal results were obtained for verifying the assessment effect. See Additional file 1: Table A1 for variable classification and coding.

Analytical approach

Statistical description and hypothesis testing

Descriptive statistics were calculated for all variables, including demographic characteristics, P-J fit, job burnout and the summary performance score. Bivariate analysis was conducted through one-way non-parametric test and Spearman rank correlation to test the overall differences between groups or variables.

Cluster analysis

Unsupervised clustering is a method for discovering groups and identifying new patterns in the data when predefined class labels have not been assigned [42]. Clustering involves partitioning data objects into subsets (clusters) based on similarity or dissimilarity. Samples within a cluster are more similar to each other than to samples in another cluster. We used the K-means algorithm [43] in this study.

Mediation analysis

Mediation effect analysis is used to analyze the process and mechanism through which an independent variable influences the dependent variable; that is, how independent variable X affects dependent variable Y through a mediation variable M. The three-step regression procedure introduced by Baron and Kenny was adapted to analyze the mediation effect in order to examine the study hypotheses [44, 45]. The bootstrapping procedure (500 iterations, bias-corrected, 95% Confidence Intervals (95%CI)) was applied.

Database management and data analysis software

To handle multiple sources of data, DBeaver 4.3.0 database management tool was used to access the database, and filter and link records. SPSS (SPSS for windows 7, version 21.0, SPSS Inc, Chicago, III) and AMOS 24.0 were used for statistical analysis. Visualizations of the results were developed using Medcalc 17.6. The threshold of (two tailed) statistical significance was set at 0.05.

Result

Constructing the summary performance score through PCA

Three components (F1, F2, F3) were extracted from the PCA, and these explained 79.37% of the total variance presented by the four adjusted indicators. The summary performance score was the weighted sum of these three components rescaled to 0 to 100 (Eq. 4). Figure 1 shows that the summary performance score followed a normal distribution.

Fig. 1
figure 1

Frequency distribution histogram (a) and normal Q-Q plot (b) of SPS in total population (n = 404)

$$SPS=\left(0.584*F1+0.177*F2+0.240*F3\right)*100$$
(4)

Taking appraisal results as a grouping variable, there was no difference between promoted and non-promoted clinicians on the four original performance indicators (p > 0.05). After adjustment, clinicians who were promoted scored higher than those who were not (Z = -2.257, p = 0.024) (Table 2).

Table 2 Descriptive statistical of original performance indicators and SPS (n = 244)

Demographic characteristics

This part of the analysis was conducted in the sample of clinicians who had valid response in both the questionnaire and the summary performance score (n = 244). P-J fit, burnout and SPS were significantly different for some of the demographic characteristics (Table 3).

Table 3 Summary statistics of variables of valid samples (n = 244)

Cluster analysis to categorize clinicians

Using K-means clustering, clinicians were partitioned into three groups based on non-performance attributes (See in Additional file 1: Figure A1). Disparities among clusters were examined for naming the three groups. We used age, whether the clinician holds a management position, professional title and degree to describe the room for growth, and we used degree and duration in the current position to describe the growth rate. Based on these two dimensions, we named the groups as Stars, Mainstays and Veterans (Fig. 2). For more detailed comparison, see Additional file 1: Figure A2, A3 and A4.

Fig. 2
figure 2

Clusters of the study sample

The role of burnout between P-J fit and SPS

P-J fit was positively associated with performance (β = 0.175, p = 0.006). Burnout was negatively associated with both P-J fit (β = -0.560, p < 0.001) and performance (β = -0.145, p = 0.024). Thus, hypotheses H1-H3 were supported (Table 4).

Table 4 Means, standard deviations, and correlations of the variables (n = 244)

After examining the direct association, we examined the mediator role of burnout between the P-J fit and performance. However, the mediation effect was not supported in all samples. We conducted the analysis in three groups separately and found a significant mediation effect which supported H4 in the group of Stars (Table 5). In the Veteran and Mainstay groups, only the effect between P-J fit and burnout was negatively significant, which supported H1.

Table 5 Direct and indirect effects and 95% confidence intervals for mediating models

In the absence of mediating variables, the direct effect of the models (the standard regression coefficient between independent and dependent variables) was 0.295 (p = 0.025). After introducing burnout, the direct effect of the model decreased to 0.175 (t = 1.291, p = 0.162). Thus, burnout completely mediated the relationship between P-J fit and performance. The standard regression coefficients between P-J fit and burnout, and between burnout and performance were -0.383 (t = -3.015, p = 0.009) and -0.313 (t = -2.314, p = 0.026). The effect size of the mediating effect was 0.120 (p = 0.008).

Discussion

Performance measure

From four indicators that reflect clinicians’ routine work and could be extracted from EMRs, we developed a difficulty-adjusted performance score. We found no significant difference between promoted and non-promoted clinicians on the original performance indicators, but clinicians who received promotion scored higher on our adjusted score. Many studies aimed at measuring performance or productivity of clinicians focus only on the individual level, which ignores the differences in work content and difficulty [12, 46]. Another study supported that the aggregation of multiple reliable indicators into a composite measure is a useful way to increase the reliability of clinicians’ performance scores [47, 48].

Our performance score uses the same data source for the difficulty score and the performance indicators, and the calculation depends only on the clinicians. Such practices reduce the workload of data collection and avoids involving other data systems like DRGs, so that it was more flexible for countries in similar situations.

The association of performance measure with burnout and P-J fit

Although composite measures of performance allowed horizontal comparison between clinicians, such practice was of little use on performance improvement. So, we further explored two driving factors of performance: P-J fit and burnout. We grouped clinicians based on non-performance attributes to look for meaningful combinations of clinician features.

Non-performance attributes distinguished Stars and Mainstay from Veterans. We found that Stars and Mainstay had higher burnout and were younger than Veterans. This finding is consistent with a systematic review which found that young clinicians are more at risk of burnout [28]. It explained that young clinicians always served as trainees or junior posts, so they were more likely to be overloaded, work longer hours, and be less rewarded. Furthermore, young clinicians had to follow society's script to complete many of their responsibilities [49], such as marrying and settling down, establishing a circle of friends, taking care of parents and raising children in the context of Chinese traditional culture.

The association of performance score with burnout and P-J fit distinguished Stars from Mainstay and Veteran. Stars had the same level of P-J fit and performance score as the other two groups. Burnout only showed a significant negative effect between P-J fit and performance in Stars. Burnout did hinder Stars’ skills translating to performance. In other words, they should have the potential to achieve better performance if their burnout were alleviated.

Although Mainstays had similar burnout scores as Stars, burnout did not prevent them from using their acquired skills to perform daily tasks. This may be explained by the fact that they were better in resilience. So they could adapt well in face of adversity and stress [50].

Veterans had a lower level of burnout than the other two groups. This may be attributed to the fact that they tend to have senior positions in the organization and have more work experience, which allowed them to better manage their workloads.

Implications and limitations

It is vital for healthcare human resource managers to understand the distinctions of different clinician segmentation. Our findings can be used to improve performance of clinicians.

Our findings demonstrate that by giving proper attention to the burnout of clinicians like Stars can improve their performance. Such people grow fast and have potential to become outstanding clinicians. However, their emotional processing ability does not match the rapid development of their work, which results in burnout and further hinders performance. Efforts to strengthen resilience and address the pressure from external environments are both effective in reducing burnout [51]. Five options for alleviating burnout have been identified, including staff development, job structure, management development, organizational problem-solving process and agency goals [39]. Hospital administrators could choose and develop appropriate strategies according to the resources and needs of the hospital.

Our findings also recommend differentiating the emphasis of the other two groups. Clinicians like Mainstays are more likely to hold junior positions at this stage. Thus, they may play a limited role in leading clinical decision making. Hospital administrators may benefit from hiring or promoting those employees who can avoid adverse effects of burnout by self-regulation [52]. Proper appointment and issuing clinical privileges may be useful for them to alleviate burnout and enhance self-efficacy [53].

For clinicians like Veterans, although their growth rates are relatively slow, they have become senior employees of the hospital. Their most valuable asset to the hospital is their rich clinical experience. Hospital administrators may take appropriate measures to encourage them to put their practical experience into producing scientific research and improving the quality of medical students.

There are several limitations to this study. First, our inclusion of only four main performance indicators is limited by the availability of data, which may result in the omission of some important but difficult to obtain performance indicators. For example, we were not able to obtain information on patient case mix, which could lead to evaluation bias. Second, since the data source is inpatient records, this can lead to the omission of some important work content of clinicians, such as outpatient work and clinical teaching. Admittedly, the performance indicators we selected in this study could not adapted to all specialists (e.g., most internal medicine physicians). We believe that there is no one-size-fits-all performance measure for all the clinicians, and that it is necessary to adapt the performance evaluation according to the heterogeneity of clinicians' work content. ​Other researchers can select appropriate indicators based on real-world situations in combination with the analysis procedure presented in this study for a comprehensive evaluation, and apply them to practice for performance improvement. In this study, the difficulty scores and performance indicators are derived from the same data source which is in-patient records. For other clinician groups like most internal specialties whose work content is recorded in other data sources, such as out-patient record, indicators used for performance or difficulty measurement need to be further investigated. And corresponding evaluation tools can be continuously designed, implemented, and evaluated in other clinician groups in the further according to the analytical steps and the application presented in this study.

Conclusion

Our study demonstrates the analytical steps to measure clinicians’ performance and its practical application based on patient-based records. We applied difficulty adjustment to ensure the rationality of the evaluation results. Using composite scores, we further examine the driving factors of performance. The resulting clusters provide insight into personnel management. Such practice can be relatively applied in countries where electronic medical record systems are relatively less developed since similar data sources are available in most areas.

Availability of data and materials

The datasets generated and/or analyzed during the current study are not publicly available due to personal privacy and interests but are available from the corresponding author on reasonable request.

Abbreviations

EMRs:

Electric medical records

P-J fit:

Person-environment fit

PCA:

Principle component analysis

DRGs:

Diagnosis related groups

WAP:

Weighted average of procedure

WAPP:

Weighted average percentage of procedure

VSO:

Annual volume of surgery and operation

ALOS:

Average length of stay per patient

AHE:

Hospitalization expense per patient

MR:

Inpatient mortality rate

KMO:

Kaiser-Meyer-Olkin

SPS:

Summary performance score

CI:

Confidence intervals

References

  1. Xiong X, Zhang Z, Ren J, et al. Impact of universal medical insurance system on the accessibility of medical service supply and affordability of patients in China. PLoS One. 2018;13(3):e0193273.

    Article  PubMed  PubMed Central  Google Scholar 

  2. Leilei D, Pengpeng Y, Haagsma JA, et al. The burden of injury in China, 1990–2017: findings from the global burden of disease study 2017. Lancet Public Health. 2019;4(9):e449–61.

    Article  PubMed  PubMed Central  Google Scholar 

  3. Dachuan Y. Does basic medical insurance for urban residents raise medical service utilization? An empirical evaluation of system running effect. Financ Econ Res. 2015;30:117–28.

  4. Liu X, Wong H, Liu K. Outcome-based health equity across different social health insurance schemes for the elderly in China. BMC Health Serv Res. 2015;16(1):1–12.

    Article  Google Scholar 

  5. Yip WC-M, Powell-Jackson T, Chen W, et al. Capitation combined with pay-for-performance improves antibiotic prescribing practices in rural China. In: Health Care Policy in East Asia: A World Scientific Reference: Volume 1: Health Care System Reform and Policy Research in China. World Scientific; 2020. p. 163–193.

  6. Zhang Y, Ma Q, Chen Y, Gao H. Effects of public hospital reform on inpatient expenditures in rural China. Health Econ. 2017;26(4):421–30.

    Article  PubMed  Google Scholar 

  7. Reinhardt UE, Cheng T. The world health report 2000-Health systems: improving performance. B World Health Organ, 2000;78:1064.

  8. Suddaby R, Viale T. Professionals and field-level change: Institutional work and the professional project. Curr Sociol. 2011;59(4):423–42.

    Article  Google Scholar 

  9. Bucher SV, Chreim S, Langley A, Reay T. Contestation about collaboration: discursive boundary work among professions. Organ Stud. 2016;37(4):497–522.

    Article  Google Scholar 

  10. Lesser CS, Fineberg HV, Cassel CK. Physician payment reform: principles that should shape it. Health Aff. 2010;29(5):948–52.

    Article  Google Scholar 

  11. Hartman M, Martin A, McDonnell P, Catlin A. National health spending in 2007: slower drug spending contributes to lowest rate of overall growth since 1998. Health Aff. 2009;28(1):246–61.

    Article  Google Scholar 

  12. Foster K, Penninti P, Shang J, Kekre S, Hegde GG, Venkat A. Leveraging big data to balance new key performance indicators in emergency physician management networks. Prod Oper Manag. 2018;27(10):1795–815.

    Article  Google Scholar 

  13. Schenone D, Rasero L, Cavaliere B. Fee payment system for nursing performance: an operational proposal through the use of ICA (Indexes of Complexity of Assistance) methodology. Acta Bio Med. 2018;89(Suppl 6):87.

    Google Scholar 

  14. Johannessen KA, Kittelsen SA, Hagen TP. Assessing physician productivity following Norwegian hospital reform: a panel and data envelopment analysis. Soc Sci Med. 2017;175:117–26.

    Article  PubMed  Google Scholar 

  15. Jacobs JP, Lahey SJ, Nichols FC, et al. How is physician work valued? Ann Thorac Surg. 2017;103(2):373–80.

    Article  PubMed  Google Scholar 

  16. Lei J, Sockolow P, Guan P, Meng Q, Zhang J. A comparison of electronic health records at two major Peking University Hospitals in China to United States meaningful use objectives. BMC Med Inf Decis Making. 2013;13(1):1–9.

    Google Scholar 

  17. Feng L, Tian Y, He M, et al. Impact of DRGs-based inpatient service management on the performance of regional inpatient services in Shanghai, China: an interrupted time series study, 2013–2019. BMC Health Serv Res. 2020;20(1):1–9.

    Article  Google Scholar 

  18. Breaugh JA. Applied Psychology in Personnel Management. Acad Manage Rev. 1978;3(4):929–31.

  19. Behn RD. Why measure performance? Different purposes require different measures. Publ Admin Rev. 2003;63(5):586–606.

    Article  Google Scholar 

  20. Nassar AK, Waheed A, Tuma F. Academic clinicians’ workload challenges and burnout analysis. Cureus. 2019;11(11):e6108.

    PubMed  PubMed Central  Google Scholar 

  21. Lambert VA, Lambert CE. Literature review of role stress/strain on nurses: an international perspective. Nurs Health Sci. 2001;3(3):161–72.

    Article  CAS  PubMed  Google Scholar 

  22. Fu Y, Schwebel DC, Hu G. Physicians’ workloads in China: 1998–2016. Int J Env Res Public Health. 2018;15(8):1649.

    Article  Google Scholar 

  23. Hong J, Xu J, Sun X. Young doctors and the pressure of publication. Lancet. 2013;381(9876):e10-11.

    Article  PubMed  Google Scholar 

  24. Bluedorn S. Organizational behavior implications of the congruence between preferred polychronicity and experienced work-unit polychronicity. J Organ Behav. 1999;20(1):75–99.

    Article  Google Scholar 

  25. Madjar N, Oldham GR. Task Rotation and Polychronicity: Effects on Individuals’ Creativity. Hum Perform. 2006;19(2):117–31.

    Article  Google Scholar 

  26. Shanafelt TD, Noseworthy JH. Executive leadership and physician well-being: nine organizational strategies to promote engagement and reduce burnout. Paper presented at: Mayo Clinic Proceedings. Elsevier. 2017;92(1):129–46. https://www.mayoclinicproceedings.org/article/S0025-6196(16)30625-5/fulltext.

  27. Wallace JE, Lemaire JB, Ghali WA. Physician wellness: a missing quality indicator. Lancet. 2009;374(9702):1714–21.

    Article  PubMed  Google Scholar 

  28. Lo D, Wu F, Chan M, Chu R, Li D. A systematic review of burnout among doctors in China: a cultural perspective. Asia Pac Fam Med. 2018;17(1):1–13.

    Article  Google Scholar 

  29. Bhat ZH, Rainayee RA. Examining the mediating role of person–job fit in the relationship between training and performance: A civil servant perspective. Glob Bus Rev. 2019;20(2):529–48.

    Article  Google Scholar 

  30. Bakker AB, Demerouti E, Verbeke W. Using the job demands-resources model to predict burnout and performance. Hum Resour Manag. 2004;43(1):83–104.

    Article  Google Scholar 

  31. Kristof-Brown AL, Zimmerman RD, Johnson EC. Consequences of individuals’s fit at work: A meta-analysis of person–job, person–organization, person–group, and person–supervisor fit. Pers Psychol. 2005;58(2):281–342.

    Article  Google Scholar 

  32. Minye D, Yuyin X, Chenshu S, et al. Which Contributes to Clinical Performance: Academic Output or Person–Environment Fit?. Front Public Health. 2022;10:801917.

  33. Donabedian A. Evaluating the quality of medical care. Milbank Q. 2005;83(4):691.

    Article  PubMed  PubMed Central  Google Scholar 

  34. Abdi H, Williams LJ. Principal component analysis. Wiley Interdisc Rev Comput Stats. 2010;2(4):433–59.

    Article  Google Scholar 

  35. Guo X. Entropy method and its application in comprehensive evaluation. Finance Trade Res. 1994;5(6):56–60.

    Google Scholar 

  36. Hair Jr JF, Hult GTM, Ringle C, Sarstedt M. A primer on partial least squares structural equation modeling (PLS-SEM). Sage publications; 2016. https://www.researchgate.net/publication/236032728_A_Primer_on_Partial_Least_Squares_Structural_Equation_Modeling#fullTextFileContent.

  37. Kaiser HF. An index of factorial simplicity. Psychometrika. 1974;39(1):31–6.

    Article  Google Scholar 

  38. Edwards JR. Person-job fit: A conceptual integration, literature review, and methodological critique. In C. L. Cooper & I. T. Robertson. Eds. International Review of Industrial and Organizational Psychology. John Wiley & Sons; 1991;6(6):283–357. https://www.psycnet.apa.org/record/1991-98004-008.

  39. Cherniss C, Cherniss C. Staff burnout: Job stress in the human services. 1980.

  40. Taris TW. Is there a relationship between burnout and objective performance? A critical review of 16 studies. Work Stress. 2006;20(4):316–34.

    Article  Google Scholar 

  41. Wu S, Zhu W, Wang Z, Wang M, Lan Y. Relationship between burnout and occupational stress among nurses in China. J Adv Nursing. 2007;59(3):233–9.

    Article  Google Scholar 

  42. Frades I, Matthiesen R. Overview on techniques in cluster analysis. Bioinform Methods Clin Res. 2010;593:81–107.

    Article  CAS  Google Scholar 

  43. Yadav J, Sharma M. A Review of K-mean Algorithm. Int J Eng Trends Technol. 2013;4(7):2972–6.

    Google Scholar 

  44. Baron RM, Kenny DA. The moderator–mediator variable distinction in social psychological research: conceptual, strategic, and statistical considerations. JPSP. 1986;51(6):1173.

    CAS  Google Scholar 

  45. Judd CM, Kenny DA. Process analysis: Estimating mediation in treatment evaluations. Eval Rev. 1981;5(5):602–19.

    Article  Google Scholar 

  46. Hasan MZ, Dinsa GD, Berman P. A Practical Measure of Health Facility Efficiency: an Innovation in the Application of Routine Health Information to Determine Health Worker Productivity in Ethiopia. Hum Resour Health. 2021;19:96.

    Article  PubMed  PubMed Central  Google Scholar 

  47. Caldis T. Composite health plan quality scales. Health Care Financ Rev. 2007;28(3):95.

    PubMed  PubMed Central  Google Scholar 

  48. Kaplan SH, Griffith JL, Price LL, Pawlson LG, Greenfield S. Improving the reliability of physician performance assessment: identifying the “physician effect” on quality and creating composite measures. Med Care. 2009;47(4):378–87.

    Article  PubMed  Google Scholar 

  49. Schlarb C. The human odyssey: Navigating the twelve stages of life. J Trans Psychol. 2008;40(2):251.

  50. Connor KM, Davidson JR. Development of a new resilience scale: the Connor-Davidson resilience scale (CD-RISC). Depress Anxiety. 2003;18(2):76–82.

    Article  PubMed  Google Scholar 

  51. West CP, Dyrbye LN, Sinsky C, et al. Resilience and burnout among physicians and the general US working population. JAMA Netw Open. 2020;3(7):e209385–e209385.

    Article  PubMed  PubMed Central  Google Scholar 

  52. Rajper ZA, Ghumro IA, Mangi RA. Linking person job fit to employee job performance amid employees of services sector: the role of burnout as mediator. J Soc Admin Sci. 2019;6(4):188–99.

    Google Scholar 

  53. Vogel R, Hattke F. How is the use of performance information related to performance of public sector professionals? Evidence from the field of academic research. Public Perform Manag Rev. 2018;41(2):390–414.

    Article  Google Scholar 

Download references

Acknowledgements

We are very grateful for all the research project team and colleagues for their help in implementing survey, gathering information and sharing their views with us. The authors also acknowledge all the clinicians for their precious time participating our survey. And we also thank the valuable comments and corrections from reviewers and editors, which made our manuscript more rigorous. The burnout scale was used with permission from the copyright holder (John Wiley & Sons Ltd., Chichester, West Sussex, UK).

Funding

This research project was funded by a grant from the National Natural Science Foundation of China (Grant number 72074147,72293585).

Author information

Authors and Affiliations

Authors

Contributions

MD and YX drafted the manuscript and conducted data analysis. MD contributed to organization and coordination. YX administered the electronic version of the survey. CS provided methodological guidance. GL was responsible for the research design, and in charge of manuscript revision and project guidance. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Guohong Li.

Ethics declarations

Ethics approval and consent to participate

All methods were carried out in accordance with declaration of the National Statement on Ethical Conduct in Human Research. Ethical approval was obtained on 20 February 2020 an Institutional Review Board (the institutional review board of School of Public Health, Shanghai Jiao Tong University School of Medicine, IRB SJUPN-202008). The informed consent was obtained from all subjects. All subjects were clinicians, they were all informed of the study and gave consent to participate by clicking the button “Acknowledge” on the encrypted access link.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1:

Table A1. Definitions of clinician level variables. Table A2. Part of fact table of procedure level. Table A3. Fact table of proceduregrading. Table A4. Fact table of procedure role. Figure A1. Dendrogram ofClustering. Figure A2. Distribution of continuous demographic characteristic by clusters (n=244). Fig. A3. Distribution of classified demographic characteristic by clusters (n=244). Fig. A4. Distribution of SPS by clusters (n =244).

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Dong, M., Xiao, Y., Shi, C. et al. Measuring and improving performance of clinicians: an application of patient-based records. BMC Health Serv Res 23, 775 (2023). https://doi.org/10.1186/s12913-023-09772-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12913-023-09772-2

Keywords