Skip to main content


A comparison of comorbidities obtained from hospital administrative data and medical charts in older patients with pneumonia



The use of comorbidities in risk adjustment for health outcomes research is frequently necessary to explain some of the observed variations. Medical charts reviews to obtain information on comorbidities is laborious. Increasingly, electronic health care databases have provided an alternative for health services researchers to obtain comorbidity information. However, the rates obtained from databases may be either over- or under-reported. This study aims to (a) quantify the agreement between administrative data and medical charts review across a set of comorbidities; and (b) examine the factors associated with under- or over-reporting of comorbidities by administrative data.


This is a retrospective cross-sectional study of patients aged 55 years and above, hospitalized for pneumonia at 3 acute care hospitals. Information on comorbidities were obtained from an electronic administrative database and compared with information from medical charts review. Logistic regression was performed to identify factors that were associated with under- or over-reporting of comorbidities by administrative data.


The prevalence of almost all comorbidities obtained from administrative data was lower than that obtained from medical charts review. Agreement between comorbidities obtained from medical charts and administrative data ranged from poor to very strong (kappa 0.01 to 0.78). Factors associated with over-reporting of comorbidities were increased length of hospital stay, disease severity, and death in hospital. In contrast, those associated with under-reporting were number of comorbidities, age, and hospital admission in the previous 90 days.


The validity of using secondary diagnoses from administrative data as an alternative to medical charts for identification of comorbidities varies with the specific condition in question, and is influenced by factors such as age, number of comorbidities, hospital admission in the previous 90 days, severity of illness, length of hospitalization, and whether inhospital death occurred. These factors need to be taken into account when relying on administrative data for comorbidity information.

Peer Review reports


Pre-existing conditions or comorbidities have been used in risk adjustment for health outcomes research [1, 2]. The number and type of comorbidities can have a significant impact on patient outcomes and may explain some of the observed variations [39]. Traditionally, medical charts were used to obtain information on comorbidities. This is a very laborious process. With the advent of electronic health care databases that capture financial data for the purpose of claims, such administrative data have provided an alternative for health services researchers to obtain comorbidity information for outcomes research [1013].

While information obtained from administrative databases has been used for risk adjustment [1416], the evidence on accuracy of secondary diagnoses from administrative databases as a substitute for comorbidities has been mixed [1730]. Previous studies have found that administrative databases tend to under-report some comorbidities but overestimate the prevalence of others when compared with information from medical charts [1721, 24, 2831]. Comorbidities that showed better agreement between both data sources were solid tumor [17], diabetes mellitus [25], connective tissue disease [17, 19], and chronic pulmonary disease [25]. The conditions with poor agreement were renal disease [17], dementia [17], hypertension [19], diabetes with complications [17, 25] and peripheral vascular disease [20].

Reasons for the discordance of comorbidity assignment from both sources of information have been offered. Romano et al [24] and Powell et al [21] found that conditions which were asymptomatic tended to be under-reported in administrative data. Iezzoni et al [18] suggested that some acute medical conditions or complications were deemed by coders to be more important than others, thereby creating coding bias.

Humphries et al [25] found that although the agreement between comorbdities from two different sources as measured by kappa statistics was only fair, there was no significant difference in the predictive value for all-cause mortality in a group of patients who have undergone percutaneous coronary intervention. Newschaffer et al [29] found similar results in a population of patients with breast cancer. Van Doorn [32] showed the same findings in a population of older adults. Susser et al [33] found that the Charlson Comorbidity Index (CCI) [16] constructed with comorbidities obtained from either administrative data or self-report had similar predictive validity for functional decline and health services utilization.

To date, there has not been any study that identified patient characteristics associated with the likelihood of over- or under-reporting of comorbidities in administrative data, particularly in older populations where the prevalence of comorbidities is higher. This study aimed to (a) quantify the agreement between administrative data and medical charts review across a set of comorbidities in older hospitalized persons; and (b) examine the factors associated with under- or over-reporting of comorbidities by the administrative data. We hypothesized that patients with high number of comorbidities were more likely to have under-reporting of comorbidities in the administrative data, while those who had longer lengths of hospital stay were more likely to have over-reporting.


Study population

The study population comprised patients aged 55 years and above who were hospitalized for pneumonia between 1 January and 31 December 2007 at 3 acute care hospitals in Singapore. They were identified from hospital administrative data of the National Healthcare Group (NHG) Operations Data Store (ODS) through the coding classification of the Australian National Diagnosis Related Groups (AN-DRG) version 3.1. Those assigned to DRG170 (Respiratory infections/inflammations age > 54 with complications) and DRG171 (Respiratory infections/inflammations age > 54 without complications or age < 55 with complications) were included. In addition, patients with DRG003 (Tracheostomy except for mouth, larynx or pharynx disorder age >15) were checked against their respective International Classification of Diseases, 9th Revision, Clinical Modification (ICD-9CM) codes for pneumonia. Those with primary ICD-9CM codes of 481, 482, 485, and 486 were also included. In Singapore, primary diagnosis refers to the reason for admission. Patients admitted for pneumonia were selected for this study as it is an acute medical condition, and not a comorbidity.

Data collection

Information was obtained from two sources, namely medical charts and routine hospital administrative data. For medical charts, data was extracted from the emergency department notes, inpatient notes, and specialist consultation letters by a trained research nurse. Ten percent of the medical charts were reviewed by the author to check for consistency. A data collection form was used to record information on demographic characteristics (age, gender, and ethnic group), hospitalization (length of stay, comorbidities, and hospital admissions in the previous 90 days), physical examination (altered mental status, respiratory rate, and blood pressure) and selected laboratory data (serum urea level). For comorbidities, the set of 30 conditions listed by Elixhauser et al [34] was used, with the exception of Human Immunodeficiency Virus (HIV) infection because the medical charts of patients with HIV were not available for review. Only those documented by the attending physicians during the first 24 hours of admission were included to ensure that complications occurring during the course of the hospital stay were excluded. The research nurse who performed data extraction from medical charts was blinded to information from the hospital administrative data.

For administrative data, only secondary codes for each index admissions selected were used without 'looking back' at previous admissions. Secondary or additional ICD-9CM codes were extracted from the administrative databases and mapped to the 29 comorbidities. These ICD-9CM codes were entered into the hospital administrative databases by trained clinical coders after patients were discharged. There was no limit to the number of codes in the secondary diagnoses field. The clinical coders in the hospitals were a mix of non-practicing physicians as well as professionally trained coders with clinical background (nursing or allied health). All clinical coders were trained by expert coders and the coding practice adheres to the Singapore Coding Directive, a national coding standard. The coding accuracy is monitored through periodic and stringent audits by the Ministry of Health of Singapore for the purpose of funding/reimbursement that is DRG-based and hence dependent on comorbities reported. All comorbidities reflected by secondary ICD-9CM codes in the routine administrative data were included.

A total of 29 comorbidities were included. Comorbidities derived from medical charts were considered the reference ("gold standard"). Each comorbidity derived from administrative database was re-coded to specify if it was under-reported or over-reported compared to the "gold standard". The total number of comorbidity comparisons was the number of hospital episodes multiplied by the 29 comorbidities.

Mortality is an important outcome and an indicator of quality of care for patients with pneumonia. Prediction tools are used to predict and stratify patients' mortality risk and a means for deciding on the course of clinical management. For this study, selected clinical data to construct the CURB (Confusion, Urea > 7 mmol/L, Respiratory rate > 30/min, Blood pressure with low systolic <90 or diastolic <60 mmHg) score to stratify pneumonia severity by risk of death [35] were also collected. A score of 0-1 predicts lowest risk of mortality and a score of 4 predicts the highest risk.

Data analysis

The unit of analysis was hospital episode. Agreement between the two sources of data was quantified using Cohen's kappa coefficient, κ. Kappa value above 0.75 indicates an excellent level of agreement beyond chance, 0.40 through 0.75 represent fair to good agreement, and kappa value less than 0.4 indicates poor agreement [36]. Sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV) were calculated for each comorbid condition. Sensitivity was defined as the proportion of medical charts with documentation of the comorbidity that was also coded with that comorbidity in the hospital administrative data. Hospital episodes where comorbidities that were documented in the medical charts but were not coded in the hospital administrative data were assigned as under-reporting. Specificity was defined as the proportion of medical charts without documentation of the comorbidity which was not coded with that condition in the hospital administrative data too. Hospital episodes where comorbidities that were not documented in the medical charts but were coded in the hospital administrative data were assigned as over-reporting.

Multinomial regression was performed to identify factors that were associated with over-reporting of comorbidities for all comparisons, followed by for each of the 29 comorbidities in turn. This process was repeated for under-reporting of comorbidities. The independent variables included age, gender, disease severity, number of comorbidities, previous hospital admissions within the last 90days, length of hospital stay, and inhospital death. Odds ratios for these factors were obtained for both over- and under-reporting with the reference being same-coding. Hierarchical multinomial regression modeling was performed for these analyses using the STATA program for generalized linear latent and mixed models (GLLAMM) [37]. This is to account for the effect of clustering due to multiple hospital episodes for the same patient during the study period.

All statistical tests were carried out using the STATA version 9.2 (Stata Corp, College Station, Texas). Statistical significance was taken at p-values of less than 0.05.

The study was approved by the National Healthcare Group's Institutional Review Board.


A total of 3517 hospital admissions for pneumonia that satisfied the criteria for the study population were identified. Of these, 46 admissions (1.3%) were excluded from the study as their medical charts were not available during the period of review (Figure 1). The characteristics of the remaining 3471 admissions are summarized in Table 1.

Figure 1

Study flow diagram.

Table 1 Patient characteristics (n = 3471)

Table 2 shows the prevalence of the comorbidities obtained from medical charts and administrative data, and the agreement between both data sources. The prevalence of comorbidities obtained from medical charts review and administrative data ranged from 0.1 to 59.6 percent and 0.3 to 34.3 percent, respectively. The prevalence rates of all comorbidities derived from medical charts were higher than that obtained from routine administrative databases except for diabetes with complications, coagulopathy, deficiency anemias, weight loss, blood loss anemia, and paralysis. Drug abuse and obesity were not coded in the administrative data for any of the cases reviewed. The number of secondary diagnoses coded in the administrative database for each case range from 0 to 21, with a median of 6 diagnoses.

Table 2 Prevalence of comorbidities and agreement between data derived from medical charts and administrative data (n = 3471)

The overall agreement between both data sources using kappa statistics ranged widely from 0.01 (poor agreement) to 0.78 (excellent agreement). Diabetes mellitus (uncomplicated and complicated), metastatic cancer, chronic pulmonary disease, lymphoma and alcohol abuse reported the highest kappa statistic values of 0.53 to 0.78.

Table 3 shows the sensitivity, specificity, positive predictive value and negative predictive values. There were 6 comorbidities with sensitivity of more than 50 percent. Those that were most under-reported were peptic ulcer disease, renal failure, depression, paralysis and psychoses, with sensitivity of less than 9 percent. The specificity of the comorbidities was good, with only 6 comorbidities having specificity of less than 96%. These conditions were deficiency anemias, fluid and electrolyte disorders, coagulopathy, hypertension, chronic pulmonary disease, and uncomplicated diabetes, and represent conditions most over-reported in the administrative data. The positive predictive values (PPV) for the comorbidities ranged from 0.8 to 94.4 percent. The negative predictive values (NPV) for all the comorbidities were more than 80 percent, except for hypertension (58.6%).

Table 3 Sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV) of comorbidities (n = 3471)

There were 100,659 comorbidity comparisons available for analyses. In the multinomial regression analysis, factors that were significantly associated with over-reporting were increased length of stay, disease severity and inhospital death. Factors that were significant for under-reporting were number of comorbidities, age and hospital admission in the previous 90 days (Table 4).

Table 4 Multinomial regression to determine factors that are associated with over-reporting and under-reporting of comorbidity (n = 100659)

Increasing number of concomitant comorbidities and inhospital death increased the likelihood of over-reporting for several out of 10 individual comorbidities among the Elixhauser list (Table 5). Higher number of concurrent comorbidities consistently increased the likelihood of under-reporting across all the selected comorbidities, while increasing age and hospitalization in the previous 90 days did so for several of them (Table 6).

Table 5 Factors associated with over-reporting of 10 selected comorbidities
Table 6 Factors associated with under-reporting of 10 selected comorbidities

Hierarchical modeling obtained very similar odds ratios and their 95% confidence intervals for factors explaining over- and under-reporting of comorbidities by the administrative data.


To the best of our knowledge, this is the first study that has evaluated the reliability of secondary diagnoses in the administrative data as a surrogate for comorbidities in Singapore. The results were consistent with other studies where the prevalence of comorbidities obtained from administrative data was lower than that obtained from medical charts for most conditions [17, 1921, 24, 3840]. This is despite the fact that both the research nurse who abstracted information from medical charts and the clinical coders responsible for assigning the ICD-9CM codes in the administrative database had obtained their information from the same source. The discordance can be explained by examining the reasons for which the information was collected. Pre-existing conditions were documented in medical charts to assist clinicians in clinical care decisions. Secondary diagnoses codes from administrative data do not differentiate between pre-existing conditions or complications that occurred during the hospitalization [41]. There is no flag in the administrative data indicating that the conditions existed on or after admission. Therefore, conditions such as fluid and electrolyte disorders, deficiency anemias, blood loss anemia, diabetes with complications, and coagulopathy reported higher prevalence rates in administrative data than in medical charts because it is highly likely that these conditions arose during hospitalization as a result of worsening medical condition, complications of treatment, or confirmation of new medical conditions through laboratory tests. Clinical coders extract information for reimbursement purposes, and were more likely to include any conditions that have an impact on the utilization of resources during the episode of hospitalization. Hence, these conditions would be coded as secondary diagnoses in the administrative data, whilst the research nurse abstracting the information from the medical charts would have excluded them. If these conditions were used in risk adjustment to compare outcomes, these could potentially cause an "over-adjustment" of risk [4248]. On the other hand, chronic conditions such as depression, psychoses, peptic ulcer disease, paralysis, or renal failure [49] were under-reported. These conditions may not have been active problems during hospitalization for pneumonia and therefore, did not contribute to increased resource utilization. It is most likely that the patients with history of drug abuse as documented in the medical charts, were no longer receiving treatment for it, and the condition would not be coded by the clinical coders. Similarly, obesity was not coded as a secondary diagnosis as the patients were not likely to have received treatment for obesity during their hospitalization for pneumonia.

The kappa statistics for diabetes (uncomplicated and complicated), metastatic cancer and COPD showed substantial agreement, as was found in other studies [17, 21, 25]. Similarly, Quan and colleagues [40] reported that comorbidities obtained from ICD-9-CM data had sensitivity ranging from 9.3% to 83.1%. The wide range for PPV was a reflection of the different prevalence rates for individual comorbidities. Paralysis, blood loss anemia and weight loss with the lowest PPV had the lowest prevalence in the medical charts as well.

Factors associated with over-reporting were length of hospitalization, severity of illness, and inhospital death. This is not unexpected as these factors were likely to be associated with an increase in the number of investigations and interventions during the hospital episode, resulting in identification of additional concurrent conditions and complications. These conditions were more likely to be coded as secondary diagnoses in the administrative data because they were related to increased resource utilization. Other researchers have previously found that some medical conditions or complications of treatment were judged more important than other chronic conditions when patients were critically ill or when they died [18, 21, 23, 50].

Age, previous hospital admission and number of comorbidities were associated with under-reporting of comorbidities in the administrative database., Although there is no limit to the number of secondary diagnoses that can be coded, having a higher number of concurrent comorbidities may result in less important comorbidities being disregarded. Advancing age [8] and recent hospitalization are themselves associated with increased number of comorbidities. The association of these two factors with under-reporting may be a reflection of residual confounding of number of comorbidities that is not accounted for due to the specification of our regression models.

For the individual comorbidities, the same factors associated with over- or under-reporting for the whole study population were represented for most of the important ones we selected for detailed analyses. This was most clearly seen with number of concurrent comorbidities being associated with under-reporting for all 10 conditions. For length of hospitalization, severity of illness, inhospital death, age, and hospital admission in the previous 90 days, smaller sample sizes or lack of true effect may have accounted for the absence of association observed for some comorbidities. Nevertheless, we argue that the overall picture supports the results for the whole study population.

The main strength of this study is its large sample size. In addition, as there is no limit to the number of secondary diagnoses that can be coded in the administrative data, it is unlikely that the lower prevalence of comorbidities in the administrative data were due to restrictions imposed by the prevailing health information system.

There are several limitations of the study. Firstly, the findings of this study may not be generalized to older patients hospitalized for other acute illnesses because we only studied those with pneumonia. The use of DRG codes for pneumonia to identify cases could have included cases that did not meet the BTS criteria for pneumonia as DRG codes were assigned to reflect resource utilization. However, as the main objectives of this study were to compare the comorbidities obtained from two sources, and to identify the possible factors that may be associated with under or over-reporting of the comorbidities, and not on the outcomes for patients with pneumonia, the inclusion of such was not likely to have affected the findings. Secondly, documentation of comorbidities in the medical charts within the first 24 hours of admission may be incomplete in a busy ward environment with many patients waiting to be reviewed and therefore, may not be the ideal "gold standard" where comorbidities are concerned. To address this, we ensured that the research nurse was familiar with documentation in the medical charts and was trained to abstract very specific conditions. Attending physicians also had access to an electronic medical records system that contained information on known comorbidities. Therefore, we believe that the likelihood of documentation of important comorbidities being omitted is low. Thirdly, this study involved documentation of comorbidities in a single health care system and may not necessarily mirror the situation in other systems. Further research on factors associated with under- and over-reporting of comorbidities by administrative data in other health systems is needed to confirm our findings. Fourthly, we acknowledge that there may be other factors that could be associated with over- or under-reporting of the comorbidities but were not measured in our study. Although there were little prior research on this subject to guide us, we have included plausible factors on the basis of clinical opinion.


Our study confirmed that the prevalence of almost all comorbidities obtained from administrative data was lower than that obtained from medical chart review. The validity of secondary diagnoses from administrative data varies with the specific comorbidity in question, and is influenced by factors such as age, number of comorbidities, hospital admission in the preceding 90 days, severity of illness, length of hospitalization, and whether inhospital death occurred. While some comorbidities were reported as secondary diagnoses with a reasonable level of accuracy, there were several that may not be used interchangeably. Researchers should be cautious and take these findings into account when using this source of information as an alternative to medical chart reviews for the purpose of measuring comorbidity burden. This may also affect policy decisions by hospital administrators as it may underestimate the true burden of illnesses. Further research is needed to confirm our findings in other patient populations.


  1. 1.

    Incalzi RA, Capparella O, Gemma A, Landi F, Bruno E, Di Meo F, Carbonin P: The interaction between age and comorbidity contributes to predicting the mortality of geriatric patients in the acute-care hospital. J Intern Med. 1997, 242: 291-298. 10.1046/j.1365-2796.1997.00132.x.

  2. 2.

    Desai MM, Bogardus ST, Williams CS, Vitagliano G, Inouye SK: Development and validation of a risk-adjustment index for older patients: the high-risk diagnoses for the elderly scale. J Am Geriatr Soc. 2002, 50: 474-481. 10.1046/j.1532-5415.2002.50113.x.

  3. 3.

    Shwartz M, Iezzoni LI, Moskowitz MA, Ash AS, Sawitz E: The importance of comorbidities in explaining differences in patient costs. Med Care. 1996, 34: 767-782. 10.1097/00005650-199608000-00005.

  4. 4.

    Greenfield S, Aronow HU, Elashoff RM, Watanabe D: Flaws in mortality data. The hazards of ignoring comorbid disease. Jama. 1988, 260: 2253-2255. 10.1001/jama.260.15.2253.

  5. 5.

    Cesari M, Onder G, Russo A, Zamboni V, Barillaro C, Ferrucci L, Pahor M, Bernabei R, Landi F: Comorbidity and physical function: results from the aging and longevity study in the Sirente geographic area (ilSIRENTE study). Gerontology. 2006, 52: 24-32. 10.1159/000089822.

  6. 6.

    Battafarano RJ, Piccirillo JF, Meyers BF, Hsu HS, Guthrie TJ, Cooper JD, Patterson GA: Impact of comorbidity on survival after surgical resection in patients with stage I non-small cell lung cancer. J Thorac Cardiovasc Surg. 2002, 123: 280-287. 10.1067/mtc.2002.119338.

  7. 7.

    Hall SF, Groome PA, Rothwell D: The impact of comorbidity on the survival of patients with squamous cell carcinoma of the head and neck. Head Neck. 2000, 22: 317-322. 10.1002/1097-0347(200007)22:4<317::AID-HED1>3.0.CO;2-0.

  8. 8.

    Gijsen R, Hoeymans N, Schellevis FG, Ruwaard D, Satariano WA, van den Bos GA: Causes and consequences of comorbidity: a review. J Clin Epidemiol. 2001, 54: 661-674. 10.1016/S0895-4356(00)00363-2.

  9. 9.

    Majeed A, Bindman AB, Weiner JP: Use of risk adjustment in setting budgets and measuring performance in primary care II: advantages, disadvantages, and practicalities. Bmj. 2001, 323: 607-610. 10.1136/bmj.323.7313.607.

  10. 10.

    Roos LL, Sharp SM, Cohen MM, Wajda A: Risk adjustment in claims-based research: the search for efficient approaches. J Clin Epidemiol. 1989, 42: 1193-1206. 10.1016/0895-4356(89)90118-2.

  11. 11.

    Melfi C, Holleman E, Arthur D, Katz B: Selecting a patient characteristics index for the prediction of medical outcomes using administrative claims data. J Clin Epidemiol. 1995, 48: 917-926. 10.1016/0895-4356(94)00202-2.

  12. 12.

    Coleman AL, Morgenstern H: Use of insurance claims databases to evaluate the outcomes of ophthalmic surgery. Surv Ophthalmol. 1997, 42: 271-278. 10.1016/S0039-6257(97)00095-7.

  13. 13.

    Xuan J, Duong PT, Russo PA, Lacey MJ, Wong B: The economic burden of congestive heart failure in a managed care population. Am J Manag Care. 2000, 6: 693-700.

  14. 14.

    D'Hoore W, Sicotte C, Tilquin C: Risk adjustment in outcome assessment: the Charlson comorbidity index. Methods Inf Med. 1993, 32: 382-387.

  15. 15.

    Romano PS, Roos LL, Jollis JG: Adapting a clinical comorbidity index for use with ICD-9-CM administrative data: differing perspectives. J Clin Epidemiol. 1993, 46: 1075-1079. 10.1016/0895-4356(93)90103-8. discussion 1081-1090

  16. 16.

    Charlson ME, Pompei P, Ales KL, MacKenzie CR: A new method of classifying prognostic comorbidity in longitudinal studies: development and validation. J Chronic Dis. 1987, 40: 373-383. 10.1016/0021-9681(87)90171-8.

  17. 17.

    Malenka DJ, McLerran D, Roos N, Fisher ES, Wennberg JE: Using administrative data to describe casemix: a comparison with the medical record. J Clin Epidemiol. 1994, 47: 1027-1032. 10.1016/0895-4356(94)90118-X.

  18. 18.

    Iezzoni LI, Foley SM, Daley J, Hughes J, Fisher ES, Heeren T: Comorbidities, complications, and coding bias. Does the number of diagnosis codes matter in predicting in-hospital mortality?. Jama. 1992, 267: 2197-2203. 10.1001/jama.267.16.2197.

  19. 19.

    Hawker GA, Coyte PC, Wright JG, Paul JE, Bombardier C: Accuracy of administrative data for assessing outcomes after knee replacement surgery. J Clin Epidemiol. 1997, 50: 265-273. 10.1016/S0895-4356(96)00368-X.

  20. 20.

    Kieszak SM, Flanders WD, Kosinski AS, Shipp CC, Karp H: A comparison of the Charlson comorbidity index derived from medical record data and administrative billing data. J Clin Epidemiol. 1999, 52: 137-142. 10.1016/S0895-4356(98)00154-1.

  21. 21.

    Powell H, Lim LL, Heller RF: Accuracy of administrative data to assess comorbidity in patients with heart disease. an Australian perspective. J Clin Epidemiol. 2001, 54: 687-693. 10.1016/S0895-4356(00)00364-4.

  22. 22.

    Waite K, Oddone E, Weinberger M, Samsa G, Foy M, Henderson W: Lack of association between patients' measured burden of disease and risk for hospital readmission. J Clin Epidemiol. 1994, 47: 1229-1236. 10.1016/0895-4356(94)90127-9.

  23. 23.

    Normand SL, Morris CN, Fung KS, McNeil BJ, Epstein AM: Development and validation of a claims based index for adjusting for risk of mortality: the case of acute myocardial infarction. J Clin Epidemiol. 1995, 48: 229-243. 10.1016/0895-4356(94)00126-B.

  24. 24.

    Romano PS, Roos LL, Luft HS, Jollis JG, Doliszny K: A comparison of administrative versus clinical data: coronary artery bypass surgery as an example. Ischemic Heart Disease Patient Outcomes Research Team. J Clin Epidemiol. 1994, 47: 249-260. 10.1016/0895-4356(94)90006-X.

  25. 25.

    Humphries KH, Rankin JM, Carere RG, Buller CE, Kiely FM, Spinelli JJ: Co-morbidity data in outcomes research: are clinical data derived from administrative databases a reliable alternative to chart review?. J Clin Epidemiol. 2000, 53: 343-349. 10.1016/S0895-4356(99)00188-2.

  26. 26.

    Blumenthal D: Quality of health care. Part 4: The origins of the quality-of-care debate. N Engl J Med. 1996, 335: 1146-1149. 10.1056/NEJM199610103351511.

  27. 27.

    Sarfati D, Hill S, Purdie G, Dennett E, Blakely T: How well does routine hospitalisation data capture information on comorbidity in New Zealand?. N Z Med J. 123: 50-61.

  28. 28.

    Preen DB, Holman CD, Lawrence DM, Baynham NJ, Semmens JB: Hospital chart review provided more accurate comorbidity information than data from a general practitioner survey or an administrative database. J Clin Epidemiol. 2004, 57: 1295-1304. 10.1016/j.jclinepi.2004.03.016.

  29. 29.

    Newschaffer CJ, Bush TL, Penberthy LT: Comorbidity measurement in elderly female breast cancer patients with administrative and medical records data. J Clin Epidemiol. 1997, 50: 725-733. 10.1016/S0895-4356(97)00050-4.

  30. 30.

    McCarthy EP, Iezzoni LI, Davis RB, Palmer RH, Cahalane M, Hamel MB, Mukamal K, Phillips RS, Davies DT: Does clinical evidence support ICD-9-CM diagnosis coding of complications?. Med Care. 2000, 38: 868-876. 10.1097/00005650-200008000-00010.

  31. 31.

    Wilchesky M, Tamblyn RM, Huang A: Validation of diagnostic codes within medical services claims. J Clin Epidemiol. 2004, 57: 131-141. 10.1016/S0895-4356(03)00246-4.

  32. 32.

    van Doorn C, Bogardus ST, Williams CS, Concato J, Towle VR, Inouye SK: Risk adjustment for older hospitalized persons: a comparison of two methods of data collection for the Charlson index. J Clin Epidemiol. 2001, 54: 694-701. 10.1016/S0895-4356(00)00367-X.

  33. 33.

    Susser SR, McCusker J, Belzile E: Comorbidity information in older patients at an emergency visit: self-report vs. administrative data had poor agreement but similar predictive validity. J Clin Epidemiol. 2008, 61: 511-515. 10.1016/j.jclinepi.2007.07.009.

  34. 34.

    Elixhauser A, Steiner C, Harris DR, Coffey RM: Comorbidity measures for use with administrative data. Med Care. 1998, 36: 8-27. 10.1097/00005650-199801000-00004.

  35. 35.

    BTS Guidelines for the Management of Community Acquired Pneumonia in Adults. Thorax. 2001, 56 (Suppl 4): IV1-64.

  36. 36.

    Fleiss JL: The measurement of interrater agreement. Statistical methods for rates and proportions. Edited by: Fleiss JL. 1981, New York, NY: John Wiley and Sons, Inc, 212-236. 2

  37. 37.

    Rabe-Hesketh S, Skrondal A: Multilevel and longitudinal modeling using Stata. 2008, College Station, TX: Stata Press, Second

  38. 38.

    Virnig BA, McBean M: Administrative data for public health surveillance and planning. Annu Rev Public Health. 2001, 22: 213-230. 10.1146/annurev.publhealth.22.1.213.

  39. 39.

    Quan H, Parsons GA, Ghali WA: Validity of information on comorbidity derived rom ICD-9-CCM administrative data. Med Care. 2002, 40: 675-685. 10.1097/00005650-200208000-00007.

  40. 40.

    Quan H, Li B, Saunders LD, Parsons GA, Nilsson CI, Alibhai A, Ghali WA: Assessing validity of ICD-9-CM and ICD-10 administrative data in recording clinical conditions in a unique dually coded database. Health Serv Res. 2008, 43: 1424-1441. 10.1111/j.1475-6773.2007.00822.x.

  41. 41.

    Pine M, Norusis M, Jones B, Rosenthal GE: Predictions of hospital mortality rates: a comparison of data sources. Ann Intern Med. 1997, 126: 347-354.

  42. 42.

    Best WR, Khuri SF, Phelan M, Hur K, Henderson WG, Demakis JG, Daley J: Identifying patient preoperative risk factors and postoperative adverse events in administrative databases: results from the Department of Veterans Affairs National Surgical Quality Improvement Program. J Am Coll Surg. 2002, 194: 257-266. 10.1016/S1072-7515(01)01183-8.

  43. 43.

    Gordon HS, Johnson ML, Wray NP, Petersen NJ, Henderson WG, Khuri SF, Geraci JM: Mortality after noncardiac surgery: prediction from administrative versus clinical data. Med Care. 2005, 43: 159-167. 10.1097/00005650-200502000-00009.

  44. 44.

    Hannan EL, Racz MJ, Jollis JG, Peterson ED: Using Medicare claims data to assess provider quality for CABG surgery: does it work well enough?. Health Serv Res. 1997, 31: 659-678.

  45. 45.

    Weintraub WS, Deaton C, Shaw L, Mahoney E, Morris DC, Saunders C, Canup D, Connolly S, Culler S, Becker ER, Kosinski A, Boccuzzi SJ: Can cardiovascular clinical characteristics be identified and outcome models be developed from an in-patient claims database?. Am J Cardiol. 1999, 84: 166-169. 10.1016/S0002-9149(99)00228-3.

  46. 46.

    Parker JP, Li Z, Damberg CL, Danielsen B, Carlisle DM: Administrative versus clinical data for coronary artery bypass graft surgery report cards: the view from California. Med Care. 2006, 44: 687-695. 10.1097/01.mlr.0000215815.70506.b6.

  47. 47.

    Wray NP, Hollingsworth JC, Peterson NJ, Ashton CM: Case-mix adjustment using administrative databases: a paradigm to guide future research. Med Care Res Rev. 1997, 54: 326-356. 10.1177/107755879705400306.

  48. 48.

    Salem-Schatz S, Moore G, Rucker M, Pearson SD: The case for case-mix adjustment in practice profiling. When good apples look bad. Jama. 1994, 272: 871-874. 10.1001/jama.272.11.871.

  49. 49.

    Kern EF, Maney M, Miller DR, Tseng CL, Tiwari A, Rajan M, Aron D, Pogach L: Failure of ICD-9-CM codes to identify patients with comorbid chronic kidney disease in diabetes. Health Serv Res. 2006, 41: 564-580. 10.1111/j.1475-6773.2005.00482.x.

  50. 50.

    Movig KL, Leufkens HG, Lenderink AW, Egberts AC: Validity of hospital discharge International Classification of Diseases (ICD) codes for identifying patients with hyponatremia. J Clin Epidemiol. 2003, 56: 530-535. 10.1016/S0895-4356(03)00006-4.

Pre-publication history

  1. The pre-publication history for this paper can be accessed here:

Download references


The authors would like to thank S.M. Mohd Khalid for research assistance, and Chen Yi Loong for her explanation of the coding practices in Singapore. The National Medical Research Council of Singapore provided funding for this study (NMRC/1122/2007).

Author information

Correspondence to Wai Fung Chong.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

WFC carried out the study and drafted the manuscript. YYD conceived the study, participated in the study design and performed the statistical analysis. BHH participated in the design of the study and helped to draft the manuscript. All authors read and approved the final manuscript.

Wai Fung Chong, Yew Yoong Ding and Bee Hoon Heng contributed equally to this work.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Cite this article

Chong, W.F., Ding, Y.Y. & Heng, B.H. A comparison of comorbidities obtained from hospital administrative data and medical charts in older patients with pneumonia. BMC Health Serv Res 11, 105 (2011).

Download citation


  • Positive Predictive Value
  • Administrative Data
  • Medical Chart
  • Administrative Database
  • Secondary Diagnosis