- Open Access
High-cost high-need patients in Medicaid: segmenting the population eligible for a national complex case management program
BMC Health Services Research volume 21, Article number: 1143 (2021)
High-cost high-need patients are typically defined by risk or cost thresholds which aggregate clinically diverse subgroups into a single ‘high-need high-cost’ designation. Programs have had limited success in reducing utilization or improving quality of care for high-cost high-need Medicaid patients, which may be due to the underlying clinical heterogeneity of patients meeting high-cost high-need designations.
Our objective was to segment a population of high-cost high-need Medicaid patients (N = 676,161) eligible for a national complex case management program between January 2012 and May 2015 to disaggregate clinically diverse subgroups. Patients were eligible if they were in the top 5 % of annual spending among UnitedHealthcare Medicaid beneficiaries. We used k-means cluster analysis, identified clusters using an information-theoretic approach, and named clusters using the patients’ pattern of acute and chronic conditions. We assessed one-year overall and preventable hospitalizations, overall and preventable emergency department (ED) visits, and cluster stability.
Six clusters were identified which varied by utilization and stability. The characteristic condition patterns were: 1) pregnancy complications, 2) behavioral health, 3) relatively few conditions, 4) cardio-metabolic disease, and complex illness with relatively 5) low or 6) high resource use. The patients varied by cluster by average ED visits (2.3–11.3), hospitalizations (0.3–2.0), and cluster stability (32–91%).
We concluded that disaggregating subgroups of high-cost high-need patients in a large multi-state Medicaid sample identified clinically distinct clusters of patients who may have unique clinical needs. Segmenting previously identified high-cost high-need populations thus may be a necessary strategy to improve the effectiveness of complex case management programs in Medicaid.
Rising costs in Medicaid are a key threat to balanced budget requirements in US states, enhancing pressure on Medicaid administrators to control costs, especially during periods of economic recession . The majority of expenditure in any insured group concentrates in a few patients, with 5% of individuals accounting for approximately 50% of expenditure year-to-year . The stability of this cost distribution has led to an actuarily important designation of patients as high-cost high-need, although defining groups by cost or risk thresholds aggregates clinically distinct subgroups and definitions for high-cost high-need vary . There is also substantial regression to the mean of the cost distribution among individuals in this grouping of outliers . High-cost high-need patients have been disaggregated into clinically distinct subgroups or ‘segmented’ at various cost thresholds in many risk-bearing practice settings  and integrated health systems [6, 7], as well as at the population level by insurance risk pool including Medicare  and Medicare Advantage , with the results being incorporated into a national model of high-cost high-need patients  and informing value-based payment models in the Medicare program .
Preliminary descriptions of high-cost high-need patients in Medicaid have shown aggregation by cost thresholds has led to clinically heterogeneous high-cost high-need populations—some with multiple co-morbidities , substance use disorder, or serious mental illness , and many with unmet health-related social needs possibly contributing to their use of health care . The paucity of multi-state or national Medicaid high-cost high-need segmentation analyses may not just be overlooking opportunities to test value-based payment models, but also may be exacerbating disparities when using uniform complex case management programs. Complex case management programs, frequently phone-based care coordination programs led by RN case managers , have been shown to be effective in Medicare  but have been difficult to effectively implement in Medicaid, leading to notable high profile programs with null findings , and some small positive demonstrations .
The objective of our study is to disaggregate or segment a multi-state sample of high-cost high-need Medicaid patients who were eligible for a national complex case management program led by UnitedHealthcare to address medical, behavioral, and unmet social needs through a phone-based RN-led complex case management intervention, which was evaluated previously by our research team and found to be ineffective at reducing hospitalizations or emergency room visits in most Medicaid beneficiaries .
Our beginning sample consisted of 676,161 Medicaid beneficiaries age 21 and above who were identified as eligible for a complex case management program implemented by UnitedHealthcare (UHC) in 15 states between Jan 1, 2012 and May 1, 2015. Individuals with one full year of observation before enrollment in the program were included in our analytic sample (N = 484,328) and our sample included only the year of pre-enrollment observation time. This study is not evaluating the complex case management program for which they were eligible. UHC’s criteria used to identify eligible individuals were: 1) patients in the top 5% of spending the prior year, and 2) UHC’s proprietary risk score identifying patients likely to persist in the top 5% of spending in the following year. This risk score has been internally validated by UHC and is used across their enterprise. UCLA authors are blinded to all score components as well as the derivation and validation of risk score.
Data available for analysis included demographic information, medical, pharmacy, and laboratory claims, and unmet social need and barriers to care data for a subset of patients. We include demographic information as well as all medical (including behavioral health) claims. Acute and chronic conditions were assessed from ICD-9-CM codes for all claims in the observation period and grouped according to the Agency for Healthcare Research and Quality (AHRQ) multi-level Clinical Classification Software (CCS) for ICD-9-CM . Utilization was not used to define clusters despite demonstrated variation in high-cost high-need patients’ utilization trajectories  to allow for unbiased comparisons between clusters. Preventable hospitalizations were assessed as conditions considered possibly preventable in the AHRQ prevention quality indicators  and preventable ED visits were assessed using the NYU/Billings algorithm for assessing possibly preventable ED utilization .
Choice of partitional clustering and k-means
Statistical learning can be divided into supervised and unsupervised methods, where supervised methods such as regression and classification have a set of features and a response measured on these features is predicted. Unsupervised learning methods such as principal component analysis and cluster analysis are a set of statistical tools intended for when there is no response, and the goal is to observe patterns in the set of features, such as the clinical subgroups aggregated within a high-cost high-need population, not prediction . Cluster analyses are a broad set of techniques to find subgroups or clusters in a data set, and are defined as a special case of a normal mixture model which assumes that the mixture components (i.e., clusters) have spherical covariance matrices (i.e., equal variance) and equal sampling probabilities . Partitional clustering techniques (like k-means) assign all observations (n) to one of a number of pre-specified subgroups (k). All observations must be assigned, and the subgroups are non-overlapping. The k-means algorithm begins by randomly assigning each observation to one of ‘k’ clusters in n-dimensional space, and then calculates the centroid (i.e., mean position) of the cluster. It iterates by reassigning the closest observations to the previously calculated centroid, then recalculating the centroid. This proceeds until the cluster assignment is stable. We chose k-means partitional clustering as a robust and computationally efficient technique, which was important given the large sample size.
Number of clusters and mixed data
There is no obvious way to determine the optimal number of subgroups in partitional cluster analysis, and techniques to accomplish this are an active area of investigation . In our approach we used an information-theoretic or ‘jump’ method to identify the number of clusters where the improvement in explanatory power (or reduction in error) of the clusters ‘jumped’ the most when we increased the number of clusters from k to k + 1 . This is measured by the change in the test statistic of interest (partial F-test statistic) which decreases monotonically when adding an additional cluster. To determine the optimal number of clusters we pre-specified clusters (k) between 2 and 20 and plotted the partial F statistic by cluster number (k). Modifications to cluster analyses allowing for mixed datasets are another ongoing area of investigation to allow for clusters to be developed using both continuous and categorical data features . As the clustering proceeds in n-dimensional space, it is not intuitive how categorical variables would be arrayed in space. To account for categorical data features, we transformed all demographic and condition categorical variables to polar coordinates, as this was a high performing approach compared to other transformations and was also computationally efficient .
Cluster analyses are unsupervised methods and as such variation in the data features determines cluster assignment. Partitional clustering forces all observations to one of a prespecified number of clusters. Given all observations are forced into a cluster, there is the need to verify that observations are reliably assigned to similar clusters when sampling from similar datasets. This ‘cluster stability’ is an important robustness check. We resampled with replacement (bootstrapped) a large number of replications (n = 500) and identified cluster assignments for each iteration. A limitation to the k-means protocol in SAS is the ordering of the clusters is random (i.e. cluster ‘1’ in the original analysis may be labeled cluster ‘3’ in one of the bootstrapped iterations). To overcome this, we reassigned cluster numbers based on the highest interrater reliability (kappa) between the original cluster and the bootstrapped clusters.
There are many indexes of cluster stability, and we chose two commonly used measures. Each index compares one set of observations with another set (i.e. our ‘original’ sample and the 500 bootstrapped iterations) and compares dissimilarity. The Rand Index (or % observed agreement) represents the percentage of time the original cluster assignment agreed with the bootstrapped assignment for all possible comparisons. The Jaccard coefficient (or % overlap) measures only the times when the same assignment was made as a proportion of the total times the cluster was assigned in either the original or bootstrap sample. The observed agreement (Rand) is thus an easily identifiable measure of cluster similarity, and the percent overlap (Jaccard) as a more sensitive measure of cluster similarity . As these are uncommon measures, we explain the cluster stability indexes more in supplementary materials (Additional file 2).
We used demographic information (age, gender, Medicaid program of eligibility, race/ethnicity, language) and diagnostic groupings using the AHRQ CCS multi-level categories (at the 2nd level) as inputs to the k-means partitional clustering algorithm. To create a clinically meaningful label for each cluster we examined the descriptive statistics for our cluster inputs (e.g. demographics and condition categories) and identified the features of the cluster that that maximally deviated from the overall mean. Through discussion among the investigators, we identified cluster labels that were both representative of the pattern of outliers and reasonable to clinicians.
Demographics and diagnostic categories
We determined the optimal number of clusters to be six, and partitioning of observations was non-overlapping and varied (13–21%) between clusters (Table 1). Demographics, including average age and gender, and program of Medicaid eligibility varied by cluster (Table 1). Acute and chronic conditions varied widely by cluster, and these were the primary features we used for assigning a descriptive label to each cluster, and these we displayed as a heat map (Fig. 1). Cluster 1 had a pattern of relatively fewer diagnoses, so we labeled it ‘Relatively Healthy’. Nearly all (98%) individuals in the second cluster were diagnosed with a complication of pregnancy so we labeled this cluster ‘Pregnancy Complications’. The highest percent of individuals with mental illness (90%) and second highest rate of neurologic disorders (86%) were seen in the 3rd cluster, so we labeled it ‘Behavioral Health’. The 4th cluster, despite having a relatively low prevalence of most diagnoses, had the second highest rates of endocrine (96%) and cardiovascular (93%) disorders and so we labeled it labeled ‘Cardiometabolic’. The remaining groups had similar frequencies of diagnoses across a large number of diagnostic categories but very different utilization patterns and so we labeled them ‘Complex Illness Lower Resource Use’ and ‘Complex Illness Higher Resource Use’ to reflect these differences in utilization but shared medical complexity.
Utilization and preventable utilization
One-year mean utilization varied by cluster, including hospital admissions (0.3–2.0 admissions), days in hospital (2.1–16.0 days), and ED visits (2.3–11.3 visits). Preventable utilization varied by cluster for mean preventable hospitalizations (0.01–0.33 admissions) and mean preventable ED visits (1.1–6.3 visits). Mean hospital admissions in the major complex illness cluster were six times higher than in any other cluster (Table 2). Mean ED visits were highest in the complex illness higher resource use cluster and second highest in the behavioral health cluster.
Cluster stability was assessed through repeated resampling with replacement and stability determined by percent agreement with the original cluster assignment (Table 3). There was a gradient of cluster stability as defined by the Jaccard coefficient from less stable to very stable (32–91%). For each cluster we report the mean of the Jaccard and Rand coefficients, respectively. Two of the six clusters, the cluster of relatively healthy patients (84, 96%) and of women experiencing complications of pregnancy (91, 99%) were very stable. The clusters of patients experiencing utilization due to behavioral health needs (48, 88%) and complex illness with higher resource use (64, 94%) were fairly stable, and the remaining two clusters, one of patients with cardiometabolic disease (32, 81%) and complex illness with lower resource use (42, 86%) were less stable.
High-cost high-need patients as defined by a risk or cost threshold aggregate clinically diverse patient subgroups, and complex case management programs may be improved by first disaggregating or segmenting the underlying high-cost high-need population. We found in a large, multi-state sample of high-cost high-need Medicaid patients six clusters which varied by demographics, utilization, and preventable utilization. Consistent with our hypothesis clinically distinct clusters such as women experiencing complications of pregnancy or patients with utilization related to behavioral health concerns emerged from the data and may be appropriate targets for specialized complex case management programs in Medicaid. Interestingly, these clinically distinct clusters were also some of the most stable, reinforcing their utility as possible targets for complex case management programs.
Cluster stability measures are important for considering how small changes in the underlying population may affect the emergent subgroups. Cluster assignment in our analysis was primarily driven by the pattern of acute and chronic conditions, and thus, the large number of high-cost high-need patients with multiple chronic conditions were more challenging to separate from others and we observed these clusters to be the least stable. This does not necessarily mean complex case management programs designed to manage patients with multiple chronic conditions will be ineffective—to the contrary, most current programs target this population. The stability measures are more important for identifying unique and stable subpopulations previously unobservable in the aggregated high-cost high-need population. It is intuitive that women experiencing complications of pregnancy and patients seeking care for behavioral health needs have different care coordination needs than patients whose care is defined by complex illness. We also found a less stable cluster of patients defined by cardiometabolic disease, which is not to say patients with these conditions are not likely to be found in high-cost high-need Medicaid populations, only that in this population patients with cardiometabolic disease frequently had other conditions which caused them to be challenging to distinguish from patients with complex illness. Finally, the relatively stable group of high-cost high need patients who were using relatively lower levels of care warrant further investigation. Perhaps they are patients whose need for medical care has regressed to the mean. This cluster may yet have their own needs for complex case management as yet unidentified by our study.
Our study has several limitations. Although our sample consists of a large multi-state Medicaid population from a major health plan, small states or individual health systems may have unique clusters that don’t persist in a large, multi-state sample. Also, because our team had data only for those eligible for the complex case management program, these individuals may differ systematically from the overall population in ways we didn’t anticipate.
Moving forward, we hope this work fosters continued innovation in population health management. It provides a framework that can be used by state Medicaid agencies and Medicaid managed care organizations to segment their high-cost high-need populations to determine how best to tailor their population health strategy. Once administrators disaggregate stable subgroups, the next step is to tailor complex case management to the unique clinical needs of the subgroup. There has been substantial work done in this area in Medicaid, including outpatient extensivist teams for patients with behavioral health conditions , qualitative work addressing the needs of pregnant women utilizing unscheduled care , disease-focused complex case management , and more intensive home-based interventions utilizing community health workers [18, 32]. Finally, the increasing availability of national systematic samples may facilitate developing a generalizable model of high-cost high-need patient subgroups in Medicaid which would speed the development of tools to identify patients appropriate for complex case management or other intervention strategies.
We found that using an unsupervised approach (cluster analysis) we were able to reliably disaggregate a high-cost high-need population of Medicaid patients into clinically distinct subgroups. These subgroups may benefit from targeted complex case management programs. We encourage state Medicaid administrators as well as managed Medicaid health plan leaders to consider segmentation as a strategy to identify distinct subgroups within a heterogeneous high-cost high-need population.
Availability of data and materials
The datasets analyzed were raw medical and eligibility claims from UnitedHealthcare and are not publicly available, and are not available due to the requirements of the data use agreement between UCLA researchers and United Healthgroup, but deidentified variables that are a byproduct of the analysis are available from the corresponding author and UnitedHealthcare on reasonable request.
Yocom CL. Medicaid: A Small Share of Enrollees Consistently Accounted for a Large Share of Expenditures. 2015. http://www.gao.gov/assets/680/670112.pdf.
Cohen S, Yu W, Machlin S, Chevan J. The concentration and persistence in the level of health expenditures over time: Estimates for the US population, 2008–2009. Stat Br. 2011;(January):2008–9 http://www.ahrq.gov/legacy/about/cfact/cfactbib55.htm.
Wammes JJG, Tanke M, Jonkers W, Westert GP, Van Der Wees P, Jeurissen PPT. Characteristics and healthcare utilisation patterns of high-cost beneficiaries in the Netherlands: a cross-sectional claims database study. BMJ Open. 2017;7(11):1–11. https://doi.org/10.1136/bmjopen-2017-017775.
Rinehart DJ, Durfee J, Melinkovich P, et al. For many patients who use large amounts of health care services, the need is intense yet temporary. Health Aff. 2015;34(8):1312–9. https://doi.org/10.1377/hlthaff.2014.1186.
O’Malley AS, Rich EC, Sarwar R, et al. How accountable care organizations use population segmentation to Care for High-Need, high-Cost patients. Issue Brief (Commonw Fund). 2019;2019(January):1–17 http://www.ncbi.nlm.nih.gov/pubmed/30645057.
Davis AC, Shen E, Shah NR, Glenn BA, Ponce N, Telesca D, et al. Segmentation of high-Cost adults in an integrated healthcare system based on empirical clustering of acute and chronic conditions. J Gen Intern Med. 2018;33(12):2171–9. https://doi.org/10.1007/s11606-018-4626-0.
de Oliveira C, Cheng J, Kurdyak P. Determining preventable acute care spending among high-cost patients in a single-payer public health care system. Eur J Health Econ. 2019;20(6):869–78. https://doi.org/10.1007/s10198-019-01051-4.
Joynt KE, Figueroa JF, Beaulieu N, Wild RC, Orav EJ, Jha AK. Segmenting high-cost Medicare patients into potentially actionable cohorts. Healthcare. 2017;5(1–2):62–7. https://doi.org/10.1016/j.hjdsi.2016.11.002.
Powers BW, Yan J, Zhu J, Linn KA, Jain SH, Kowalski JL, et al. Subgroups of high-Cost Medicare advantage patients: an observational study. J Gen Intern Med. 2019;34(2):218–25. https://doi.org/10.1007/s11606-018-4759-1.
Long P, Abrams M, Milstein A, et al. Effective Care for High Needs Patients: opportunities for improving outcomes, value and health. Natl Acad Med. 2017;162 https://lccn.loc.gov/2017041343.
Newsroom C. HHS To Transform Care Delivery for Patients with Chronic Kidney Disease. https://www.cms.gov/newsroom/press-releases/hhs-transform-care-delivery-patients-chronickidney-disease.
Kronick RG, Bella M, Gilmer TP. The Faces of Medicaid: Refining the Portrait of people with Multiple Chronic Conditions. Cent Heal Care Strateg Inc. 2009;1(October):78.
Buck JA, Teich JL, Miller K. Use of mental health and substance abuse services among high-cost medicaid enrollees. Admin Pol Ment Health. 2003;31(1):3–14. https://doi.org/10.1023/A:1026089422101.
Chisolm DJ, Brook DL, Applegate MS, Kelleher KJ. Social determinants of health priorities of state Medicaid programs. BMC Health Serv Res. 2019;19(1):1–7. https://doi.org/10.1186/s12913-019-3977-5.
Powers BW, Chaguturu SK, Ferris TG. Optimizing high-risk care management. JAMA. 2015;313(8):795–6. https://doi.org/10.1001/jama.2014.18171.
Peikes D, Chen A, Schore J, Brown R. Effects of care coordination on hospitalization, quality of care, and health care expenditures among medicare beneficiaries 15 randomized trials. JAMA - J Am Med Assoc. 2009;301(6):603–18. https://doi.org/10.1001/jama.2009.126.
Finkelstein A, Zhou A, Taubman S, Doyle J. Health care hotspotting ’ a randomized, controlled trial. N Engl J Med. 2020;382(2):152–62. https://doi.org/10.1056/NEJMsa1906848.
Powers BW, Modarai F, Palakodeti S, et al. Impact of complex care management on spending and utilization for high-need, high-cost Medicaid patients. Am J Manag Care. 2020;26(2):E57–63. https://doi.org/10.37765/ajmc.2020.42402.
Duru OK, Harwood J, Moin T, Jackson NJ, Ettner SL, Vasilyev A, et al. Evaluation of a National Care Coordination Program to reduce utilization among high-cost, high-need Medicaid beneficiaries with diabetes. Med Care. 2020;58(6):S14–21. https://doi.org/10.1097/mlr.0000000000001315.
Cost H. HCUP Clinical Classifications Software (CCS) for ICD-10. Healthcare Cost and Utilization Project (HCUP). 2009; (Accessed 27 Dec 2017). http://www.hcup-us.ahrq.gov/toolssoftware/icd_10/ccs_icd_10.jsp.
Wong ES, Yoon J, Piegari RI, Rosland AMM, Fihn SD, Chang ET. Identifying latent subgroups of high-risk patients using risk score trajectories. J Gen Intern Med. 2018;33(12):2120–6. https://doi.org/10.1007/s11606-018-4653-x.
McClellan M. AHRQ Guide to Prevention Quality Indicators. 2001. doi:https://doi.org/10.1136/bmj.4.5941.418-a, 4, 5941, 418
Ballard DW, Price M, Fung V, Brand R, Reed ME, Fireman B, et al. Validation of an algorithm for categorizing the severity of hospital emergency department visits. Med Care. 2010;48(1):58–63. https://doi.org/10.1097/MLR.0b013e3181bd49ad.
James G, Witten D, Trevor Hastie RT. An Introduction to Statistical Learning : With Applications in R; 2013.
Morissette L, Chartier S. The k-means clustering technique: General considerations and implementation in Mathematica. Tutor Quant Methods Psychol. 2013;9(1):15–24. https://doi.org/10.20982/tqmp.09.1.p015.
Sugar CA, James GM. Finding the number of clusters in a dataset: an information-theoretic approach. J Am Stat Assoc. 2003;98(463):750–63. https://doi.org/10.1198/016214503000000666.
Ahmad A, Khan SS. Survey of state-of-the-art mixed data clustering algorithms. IEEE Access. 2019;7(i):31883–902. https://doi.org/10.1109/ACCESS.2019.2903568.
Jose-Luis FB-R, Diez JL. Geometrical codification for clustering mixed categorical and numerical databases. J Intell Informait Syst. 2012;39(1):167–85. https://doi.org/10.1007/s10844-011-0187-y.
Rand WM. Objective criteria for the evaluation of clustering methods. J Am Stat Assoc. 1971;66(336):846–50. https://doi.org/10.2307/2284239.
Komaromy M, Bartlett J, Gonzales-van Horn SR, et al. A novel intervention for high-need, high-Cost Medicaid patients: a study of ECHO Care. J Gen Intern Med. 2020;35(1):21–7. https://doi.org/10.1007/s11606-019-05206-0.
Mehta PK, Carter T, Vinoya C, Kangovi S, Srinivas SK. Understanding high utilization of unscheduled Care in Pregnant Women of low socioeconomic status. Women’s Heal issues Off Publ Jacobs Inst Women’s Heal. 2017;27(4):441–8. https://doi.org/10.1016/j.whi.2017.01.007.
Kangovi S, Mitra N, Norton L, Harte R, Zhao X, Carter T, et al. Effect of community health worker support on clinical outcomes of low-income patients across primary care facilities: a randomized clinical trial. JAMA Intern Med. 2018;178(12):1635–43. https://doi.org/10.1001/jamainternmed.2018.4630.
This work was supported by funding from the Centers for Disease Control and Prevention, #DP006128.
Ethics approval and consent to participate
This study was approved by UCLA IRB #16–000276, and as this was a secondary analysis de-identified demographic data, medical and pharmacy claims, and health plan and practice level variables from a national sample of Medicaid plans from a major health insurer, with a master research agreement in place with the health insurer since 2011 with a project specific data use agreement agreed both by our institution and the insurer, informed consent was not required by institutional IRB for this secondary analysis of de-identified data. All methods were carried out in accordance with relevant guidelines and regulations.
Consent for publication
All authors have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Additional file 1: Appendix 1.
Annotated code to run k-means clustering protocol. Code for SAS 9.4.
Additional file 2: Appendix 2.
Cluster Stability Indexes.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Quinton, J.K., Duru, O.K., Jackson, N. et al. High-cost high-need patients in Medicaid: segmenting the population eligible for a national complex case management program. BMC Health Serv Res 21, 1143 (2021). https://doi.org/10.1186/s12913-021-07116-6
- Complex case management
- Care management
- High-cost high-need
- Cluster analysis