Skip to main content

Making sense of the French public hospital system: a network-based approach to hospital clustering using unsupervised learning methods



Hospitals in the public and private sectors tend to join larger organizations to form hospital groups. This increasingly frequent mode of functioning raises the question of how countries should organize their health system, according to the interactions already present between their hospitals. The objective of this study was to identify distinctive profiles of French hospitals according to their characteristics and their role in the French hospital network.


Data were extracted from the national hospital database for year 2016. The database was restricted to public hospitals that practiced medicine, surgery or obstetrics. Hospitals profiles were determined using the k-means method. The variables entered in the clustering algorithm were: the number of stays, the effective diversity of hospital activity, and a network-based mobility indicator (proportion of stays followed by another stay in a different hospital of the same Regional Hospital Group within 90 days).


Three hospital groups were identified by the clustering algorithm. The first group was constituted of 34 large hospitals (median 82,100 annual stays, interquartile range 69,004 – 117,774) with a very diverse activity. The second group contained medium-sized hospitals (with a median of 258 beds, interquartile range 164 - 377). The third group featured less diversity regarding the type of stay (with a mean of 8 effective activity domains, standard deviation 2.73), a smaller size and a higher proportion of patients that subsequently visited other hospitals (11%). The most frequent type of patient mobility occurred from the hospitals in group 2 to the hospitals in group 1 (29%). The reverse direction was less frequent (19%).


The French hospital network is organized around three categories of public hospitals, with an unbalanced and disassortative patient flow. This type of organization has implications for hospital planning and infectious diseases control.

Peer Review reports


Coordination of hospital activity across multiple sites has been increasingly frequent in the United States and European countries since 1970 [1]. In various industries, the coordination of processes over multiple sites is seen as a way to improve performance. However, the extent to which this principle applies to healthcare has been debated [2, 3]. Integrated healthcare is increasingly seen as an attempt to improve healthcare quality [4]. Hospitals can assemble in informal networks or health systems having a common leadership [3]. These networks should offer accessible care, in a coordinated effort with a common information and quality insurance system with financial incentives [5]. Horizontal integration refers to organizations that acquire or integrate with other organizations that provide similar services, whereas vertical integration happens when the parties in question offer different level of care, services or functions [6]. Drawing on the example of the loss of market share of the Kaiser Permanenente HMO (Health Maintenance Organization), some authors have made the case for flexible organizations, with partners acting as equal team members rather than leaders and followers [2]. However, a strong hierarchical structure with clear targets has been shown to be efficient in helping regions to recover from poor hospital performance [7]. Overall, vertical integration of health systems is on the increase [8]. For example, the reduction in use of inpatient care and financial incentives in the Affordable Care Act have transformed the United States healthcare system from a variety of independent hospitals to a few locally integrated health care systems. The treatment of severe disease requires specialized infrastructure that is only found in large university hospitals, sometimes called tertiary care hospitals. Regionalization allows patients to get adequate care in these high-volume centers [9, 10], although the relationship between volume and quality has been debated [11]. These hospitals naturally assume the role of leaders in hospital systems. Following the trend towards increasing hospital system integration, nationwide reforms have taken place in several countries to restructure and coordinate hospital systems. In Denmark in 2007, a nationwide healthcare regionalization reform decreased the number of acute hospitals from 40 to 21, with encouraging results for productivity and quality of care, although the number of medical professionals also increased during this period [12]. Hospital networks have been introduced to Belgium in 2020, with the aim to strengthen local hospital collaborations and concentrate complex procedures in a limited number of reference centers [1].

In 2016, the French Public Health law no 2016-41 for the improvement of the healthcare system made it compulsory for hospitals to join a Regional Hospital Group (RHG). This law was followed by the creation of 136 collaborative hospital networks, each one having a designated leader. Before this law, the regional health agencies lacked access to information regarding transfers and patient flow between hospitals. Policymakers need to be able to determine if new laws and regulations have the desired effect, and they will attempt to make amendments to the text if they have not achieved what was aimed for. However, this process presupposes some degree of knowledge about the system for which the regulation is intended. Any attempt to reform public hospital systems needs to be based on an analysis of hospital data, be it local or foreign, as the experience of other countries may provide useful models for future regulation. The analysis of hospital data may often start as a description of the hospitals that are part of the system. Hospitals can be considered as individual entities or as parts of broader networks. Previous work has shown that the number of patients shared between physicians can predict relationships between medical providers [13]. Likewise, Social Network Analysis conducted on administrative hospital data can allow the identification of relations between hospitals [14]. These relationships can be inferred from the observation of frequent patient transfers, which denote a certain degree of trust between hospitals and are a valid proxy for the ability to collaborate for the benefit of the patient [15, 16]. Direct transfers are frequent: 1 million in France in 2014 [17]. They represent 6.4% of patient admitted to Intensive Care Units in the United States [18]. They tend to have longer lengths of stay and cost more than twice as much as those not transferred, which makes them of particular interest to hospital managers [19]. Understanding the patterns of patient mobility between hospitals is a first step towards the optimization of interactions between hospitals. Although network-level variables have not been consistently associated with quality of care, an increasing number of publications suggest such a relationship [14, 16]. Integration (distributed goals) and differentiation (coordination of specialization) are frequent in efficient organizations [20]. However, these properties have seldom been studied in real-world healthcare systems [21, 22], despite recent work calling for such studies [23]. The characteristics of hospital transfers need to be identified to ascertain how these transfers occur in relation to integrated hospital networks. The objective of this study was to use unsupervised learning methods and social network analysis to identify clusters of French public hospitals according to their characteristics and their role in the national hospital network, by taking into account patient transfers within Regional Hospital Groups in the clustering algorithm. A secondary objective was to describe patterns of patient mobility between these hospitals.


Data source

We used data from the French administrative hospital database. French public and private hospitals are financed through a Diagnosis-Related Group (DRG)-based prospective payment system [24]. The source database includes all hospital discharges (public and private) on the entire territory of France including overseas. The database contains information on the patients’ diagnoses during their hospital stay, encoded using the International Classification of Diseases, 10th Edition (ICD-10). Hospital stays are classified in diagnosis groups (Catégories Majeures Diagnostiques, CMD) and then in DRGs according to ICD-10 principal and secondary diagnosis codes, surgical and non-surgical procedures (Classification Commune des Actes Medicaux, CCAM) [25] and the patients’ age. As the database is used for billing hospital stays, it is subjected to numerous local and national quality controls including regular inspections by officially appointed medical information specialists. This ensures that the data stay comprehensive and accurate.

In this database, hospital stays are classified using DRG codes. However, there are more than 600 DRGs with differing severity subclasses, which makes them unpractical for the realization of summary statistics. To solve this problem, we used a higher-level classification. The Activity Domains classification assigns Activity Domains to subsets of DRGs. It is curated by the French national agency in charge of medical information [26].

Hospitals with less than 500 stays per year in Medicine, Surgery and Obstetrics were excluded. Some Regional Hospital Groups with particular geographic situations (e.g. overseas) or specializing in psychiatry (without patients in Medicine, Surgery and Obstetrics) were also excluded (list in Appendix 1). The hospitals situated inside Paris are grouped under a single legal entity responsible for over 916,000 stays (“Assistance Publique - Hôpitaux de Paris”). As they were exempt from participating in a Regional Hospital Group during the study period, they were also excluded. Patient transfers with a length of stay < 48 h in the receiving hospital were excluded. Invalid stays (due to errors in Diagnosis Related Groups or invalid patient anonymization number) and stays for iterative treatments were excluded.

Ethical and regulatory considerations

The study was declared to the French registry of studies using healthcare data (N° F20210106180024). As the study was retrospective, based on anonymized data and purely observational, it was exempt from Institutional Review Board approval according to the French Public Health Code (L1121-1, law number 2012-300, 5 march 2012).

Statistical methods

French public hospitals were described using two complementary methods. First, a cluster analysis was conducted to identify distinctive hospital profiles; then a Social Network Analysis was conducted to understand how these profiles interacted in the hospital network.

Clusters of hospitals were determined using the k-means method. This algorithm has the advantage of being applicable to numerical data, performs well when data are of good quality, and is easy to understand [27]. The principle of this algorithm is to choose a number K of centroids that corresponds to the number of clusters. The initial centroids are chosen randomly. Each point is then assigned to the cluster of the nearest centroid. After this step, new centroids are recalculated as the center of gravity of all points that belong to each cluster. This process is repeated until convergence is achieved. The clustering algorithm was applied to public hospitals with an activity in Medicine, Surgery and Obstetrics (MSO) in metropolitan France (excluding oversea territories) for year 2016.

The active variables included in the algorithm were:

  • The number of stays in each hospital during the study period;

  • The effective number [28] of activity domains;

  • A mobility index.

The variables to be included in the algorithm were chosen following a preliminary exploratory analysis of the relationships between hospital-level variables. The number of stays in each hospital was included in the clustering algorithm because it was highly associated with numerous other hospital-level characteristics such as the number of beds and the number of medical and non-medical staff members. This variable can be used as a proxy for the size of the hospital, while also conveying additional information on its activity. The number of activity domains further describes the activity performed in each hospital, by characterizing the diversity of hospital stays. Finally, the mobility index was used as a network-level variable that summarizes the position of the hospital in relation to other hospitals of the same group. Together, these variables were chosen because they captured different aspects of the functioning of a hospital at the hospital level and at the network level, and because they could be expressed as numerical variables, which is a prerequisite for inclusion in the clustering algorithm. The number of variables was limited to three to improve clarity and interpretability, and because this allows a data visualization in two dimensions which makes it possible to quickly understand the key characteristics of the system.

The effective number of activity domains was calculated by using a diversity index derived from Simpson [28, 29] applied to the Activity Domains of stays produced by the hospital during the year. This method was chosen because using the absolute number of Activity Domains would result in an overestimation of the diversity of activities performed by hospitals on a regular basis. For instance, it would be misleading to say that a hypothetical hospital having 500 stays for activity “A”, 3 stays for activity “B”, and 2 stays for activity “C” performs three distinct activities on a regular basis.

The effective diversity derived from Simpson’s index is defined as:

$$ \mathbf{D}=\frac{\mathbf{1}}{\sum_{\mathbf{i}=\mathbf{1}}^{\mathbf{S}}{{\mathbf{p}}_{\mathbf{i}}}^{\mathbf{2}}} $$

With pi corresponding to the proportion p of hospital stays of the type i. In this study, the types of hospital stays corresponded to the activity domains of stays produced at the hospital during the study period, encoded using the Activity Domain classification of the French national agency in charge of medical information. Simpson’s diversity index belongs to a subclass of indices that aim to allow the quantification of diversity. Simpson’s diversity index can be interpreted as the probability that two stays taken at random in the hospital’s activity during the year are of the same type. It has the property of favoring frequent activity domains, which means that adding a single stay with a new activity domain to a hospital will not have much impact on the value of the index.

The “effective number” of different hospital stays is derived from Simpson’s diversity index by taking its inverse [28]. It has desirable mathematical properties to quantify diversity: in a hospital practicing all types of stays in equal proportions, if the absolute number of different hospital stays doubles, the effective number of hospital stays will also double.

The mobility index used in this study was the percentage of stays that were followed by a transfer or readmission in another hospital of the same Regional Hospital Group within 90 days. These stays were called « Regional Hospital Group classifying stays ». Mobility includes transfers, which can be identified from electronic health records by comparing the entry and discharge dates of hospital stays: a transfer occurs when a patient enters a new hospital on the same day that he/she was discharged from the previous one. It is theoretically possible for a patient to be hospitalized in two different hospitals without any collaboration process taking place. As these cases occur randomly, they are not expected to be a major source of bias. Extending the delay between stays to 90 days allows us to account for the cases where hospitalization in the second hospital are programmed at a later date.

All active variables were standardized by subtracting the mean and dividing by the standard deviation to be on a comparable scale before entering the classification algorithm. The variables introduced in the clustering algorithm were not weighted.

Finding the optimal number of clusters

There are several methods to determine the optimal number of clusters, each with advantages and drawbacks. In this study, the number of clusters was determined by using a selection of the various methods available and applying a majority rule. Thirty methods included in the NbClust R package were used, and the number of clusters most frequently designed as optimal among these methods was chosen as our number of clusters. The number of clusters was then verified by using the elbow method.

Description of hospital clusters and their activity domains

The types of stays that constituted the activity of the hospital clusters identified by the classification algorithm were described using the Activity Domains classification [26]. The number of stays was calculated for each activity domain in each cluster. To show the differences between the distributions of activity in each cluster, the percentage of activity for each domain was calculated. The denominator for the calculations of these percentages was the sum of all stays in each cluster.

Social network analysis of patient mobility

A Social Network Analysis (SNA) was performed to visualize the relationships between hospitals identified by studying patient transfers. Patient mobility was represented by the edges of a directed graph. The hospitals constituted the vertices of the graph. To facilitate the interpretation and readability of the graph, relations with less than 100 patients for year 2016 were excluded as not significant. Social network metrics were calculated using the igraph package available on the Comprehensive R Archive Network (CRAN). The assortativity degree measures the tendency of graph elements to associate with elements with a similar degree [30]. Transitivity is the tendency of elements that are connected together to share mutual connections. Edge density is the ratio of the number of existing edges by all possible edges. Data analysis was performed using R software version 4.0.2 (


A total of 427 hospitals representing 8,790,293 stays were included in the clustering algorithm. The flow chart of the study is presented in Fig. 1. Due to their specific situations, 39 public hospitals were excluded (6.2% of all public hospitals registered in 2016). There were 414,254 cases of patient mobility in the hospitals considered for this study. The proportion of mobility that occurred within 0, 1, 7, 30, and 60 days of the index stay was 43.6, 46.0, 54.1, 73.0 and 88.7% respectively. Overall, only 45.5% of the mobility occurred within Regional Hospital Groups (53.6% if only hospital relationships with ≥100 patients were considered).

Fig. 1
figure 1

Flow chart of the study

Number of hospital activity domains

The relationship between absolute and relative number of activity domains is shown in Fig. 2. As predicted, the effective number of activity domains appeared more adequate than the absolute number of activity domains for differentiating the diversity of hospital activity. There was a ceiling effect for the absolute number of activity domains, with several hospitals attaining the highest values. Ceiling effects occur when a high proportion of participants in an experiment attain the maximum score, making them seem similar despite the existence of differences inside the group. This supports our choice of preferring to use the effective rather than absolute number of activity domains, because being spared by the ceiling effect the effective number of activity domains is able to further distinguish how varied these hospitals are in terms of diversity of the performed activities. An exploratory analysis showed that omitting the effective number of activity domains variable in the clustering algorithm changed the assigned cluster of 16.0% of the hospitals.

Fig. 2
figure 2

Absolute and effective numbers of activity domains per hospital

Number of clusters

Seven out of 30 clustering separation metrics recommended three as the number of clusters that conveyed most information. This is consistent with the aspect of the elbow plot, which shows that the additional contribution to the percentage of variance explained was smaller after three clusters (Fig. 3). Following the elbow plot and the majority rule, the number of clusters was set to three.

Fig. 3
figure 3

Cluster separation according to the numbers of groups in the clustering algorithm. SS: Sum of Squares

Hospital profiles

The hospital profiles identified by the k-means algorithm are presented in Table 1.

Table 1 Hospital profiles identified by the k-means algorithm

Patients’ mobility towards other hospitals of the Regional Hospital Group was inversely proportional to the total number of stays per hospital (Fig. 4). The hospitals included in each cluster had different activity profiles (Table 2). Cardiovascular catheterization, the treatment of cancer, neurological disease and organ transplantation were preferentially performed in Group 1 hospitals. Conversely, digestive and pneumology activity domains represented a higher proportion of stays in Group 3 and 2 hospitals.

Fig. 4
figure 4

Clustering algorithm results displayed in the active variable space. The patient mobility index indicates the mobility of patients within the same Regional Hospital Group. A high mobility index means that patients often move to other hospitals of the same Regional Hospital Group

Table 2 Activity domains of the hospital profiles

Social network analysis

The Social Network Analysis was conducted in two stages: Stage 1 kept only the strongest links between hospitals (at least 100 patients in either direction). Stage 2 considered all links indiscriminately.

When restricted to strong edges (> 100 patients per directed edge), the network had an assortativity degree of − 0.263. The unrestricted network had a more neutral assortativity degree of − 0.043. Overall, this is consistent with the fact that the mobility of patients was preferentially directed towards hospitals with different profiles. The larger hospitals acted as reference care providers and performed treatments for patients that would have been difficult to manage in small hospitals (Fig. 5). The transitivity of the strong edges (> 100 patients) network was 18.0% (39.0% for the overall network considering all edges), which is also consistent with a network made of multiple loosely connected centralized islets. The overall edge density on the study territory was 10.6% (0.6% when restricting to edges > 100 patients).

Fig. 5
figure 5

Social Network Analysis (SNA) of public hospitals in France centered on the Auvergne-Rhône-Alpes region. The hospitals are mapped to vertices colored by their class. The edges represent patient flow. Edges colored in purple are between hospitals of the same Regional Hospital Group. Edges are limited to relations with > 100 patients

Mobility patterns in hospital clusters are presented in Table 3. Group 1 mainly acted as a recipient of patients and had the highest incoming patients per hospital ratio (Table 3). However, due to the large number of hospitals included in Group 2, the raw number of patients received in Group 2 (193,938) was higher than in Group 1 (166,953). Group 3 hospitals referred to Group 2 hospitals before referring to Group 1 hospitals, perhaps because Group 1 hospitals were less frequent.

Table 3 Mobility types between clusters for year 2016


Using unsupervised learning methods, we identified three hospital classes with different degrees of network centrality in French public hospital data for year 2016.

The hospitals assigned to Group 1 deliver most of the country’s experimental (quaternary) and specialized (tertiary) care. They do not need to transfer patients to other hospitals of the same Regional Hospital Group because they themselves host a sufficient array of healthcare services. Patient-sharing is easier within these teaching hospitals, corridor distance being only one of the factors involved [31]. These hospitals have the potential to act as information brokers and set quality of care standards [32]. However, their size can make it difficult for them to be located in the most central urban areas of their cities [33]. On the other hand, Group 2 and 3 hospitals often need to transfer patients with complex needs to these centers. The situation is similar in the United States, where between one third and one half of myocardial infarction patients will be transferred from a non-revascularization center to a center equipped for the procedure [34]. In the future, smaller hospitals are expected to develop the activities for which they used to transfer their patients, and larger hospitals are expected to develop innovative activities not available elsewhere [35].

Despite studies suggesting that hospital networks occur endogenously out of previous relationships [15], our data shows that hospital collaboration networks are often different from what would be expected based on administrative boundaries such as region limits (only 45.5% of the mobility occurring within a Regional Hospital Group). In his 1994 study, Bolland [36] had observed that referrals and planning represent two dimensions that are largely non-overlapping in a variety of American counties. Instead, the observed mobility relationships between hospitals often involve the referral of patients to reference hospitals regardless of their belonging (belonging to the same regional group or not). This is a problem, because it suggests that the hospitals that exchange patients and would ideally work together as a unified infrastructure (“basic systems and services that are reliable, standardized, and widely accessible”) could lack the common management and unity of purpose (no formal embedding) [37] to conduct patient transfers efficiently [34]. It is well known that the negotiation process for a transfer can be difficult [38], and this difficulty is expected to be greater if there are administrative barriers between hospitals [5]. Likewise, these transfers “across borders” could result in difficulties of communication [39, 40].

On the other hand, networks that are open to outside influence are closer to the unconstrained network structure that shows the most favorable profile for the diffusion of innovation [41]. Our observations are consistent with data obtained by Le Meur [42] and Rankin [43], where hospital networks have a high degree of centralization. At a lower level, this could be linked with data indicating that well-connected doctors have a positive impact on patient experience [4447]. This centralized type of organization is consistent with good quality of care. Less centralized networks are likely the result of capacity overload rather than the need to transfer to more adequate facilities [48]. In our study, the most central hospitals had a higher proportion of cancer patients, a finding consistent with data from Orange County (California, USA) [49]. They are more likely to have neurology wards and trauma centers, to perform organ transplantation, and to be affiliated to medical schools, as was also noted by a nationwide American study focusing on critical care patients [50]. These hospitals are expected to have lower readmission rates (as shown in an Italian study) [16], and lower mortality and length of stay especially for the more severe cases [51]. Therefore, referral to these hospitals can be life-saving [52] if improving the quality of care in smaller hospitals is not possible. This shows that the structure of hospital networks could be similar across very different countries. The tendency to send patients to these hospitals could also be exacerbated by rivalry between smaller hospitals, due to niche overlap [53]. Lomi et al. have contended that hospitals that meet in a restricted spatial setting (which ordinarily generates rivalry) face new opportunities for collaboration due to the institutional nature of hospitals and to mutual forbearance, a tendency to spontaneously specialize [54]. However in the case of the French network this rarely occurs, possibly due to the rural setting of directly competing hospitals and the similarity of their case mixes [15, 55]. Thus, contrary to technology firms, hospitals that share the same case mix are less likely to form bonds [56]. This could explain the findings of a study suggesting that competition does not always favor productivity in hospitals [57]. The temptation to treat all patients in Group 1 hospitals would conduct to a situation experienced in the United States during the 1980s: a decrease of commitment of smaller hospitals, with an excessive tendency to transfer patients, which led to saturation of the larger centers [58]. Nonetheless, based on our results, policymakers could suggest a revision of the contracts binding public hospitals groups, with an emphasis on linking smaller hospitals to larger referral centers, in order to make the composition of these groups consistent with observed collaboration patterns, which could improve the overall coordination of care. Drawing on the aforementioned past experience in the USA, this should be done in a manner that preserves capability for urgent care in smaller centers.

Social Network Analysis has rarely been applied in the hospital setting at the national level, yet this method could be useful in guiding data-based interventions [21, 59]. For instance, it could help in predicting the spread of hospital-acquired infections [6062]. Our study has some limitations: although we included the vast majority of French public hospitals in this nationwide study, some hospitals were excluded due to their specific geographic situation (e.g. Hospitals of Paris, overseas hospitals). Although these centers represented less than 10% of public hospitals, additional studies are needed to evaluate their patterns of patient mobility before any change of policy is implemented.

In further studies, characteristics associated with network strength could be studied by using gravity models, which account for the distance between hospitals in addition to their size [63, 64]. Future research also needs to ascertain if the national hospital network displays integration and differentiation [23] through the elaboration of formal healthcare pathways, or if patient mobility is due to other reasons like differences in quality of care. Further studies could explore the specific diseases and types of activities involved in patient mobility or transfers.

Availability of data and materials

The data that support the findings of this study are available from the French national agency in charge of medical information, but restrictions apply to the availability of these data, which were used under the MR005 legal framework, and so are not publicly available. Aggregated and anonymized data are however available from the authors upon reasonable request with permission by the French national agency for medical information.


  1. De Regge M, De Pourcq K, Van de Voorde C, Van den Heede K, Gemmel P, Eeckloo K. The introduction of hospital networks in Belgium: the path from policy statements to the 2019 legislation. Health Policy. 2019;123:601–5.

    Article  PubMed  Google Scholar 

  2. Goldsmith JC. The illusive logic of integration. Healthc Forum J. 1994;37:26–31.

    CAS  PubMed  Google Scholar 

  3. De Pourcq K, De Regge M, Van den Heede K, Van de Voorde C, Paul G, Eeckloo K. The role of governance in different types of interhospital collaborations: a systematic review. Health Policy. 2019;123:472–9.

    Article  PubMed  Google Scholar 

  4. Enthoven AC. Integrated delivery systems: the cure for fragmentation. Am J Manag Care. 2009;15(10 Suppl):S284–90.

    PubMed  Google Scholar 

  5. Pourat N, Davis AC, Salce E, Hilberman D, Roby DH, Kominski GF. In ten California counties, notable progress in system integration within the safety net, although challenges remain. Health Aff (Millwood). 2012;31:1717–27.

    Article  Google Scholar 

  6. Heeringa J, Mutti A, Furukawa MF, Lechner A, Maurer KA, Rich E. Horizontal and Vertical Integration of Health Care Providers: A Framework for Understanding Various Provider Organizational Structures. Int J Integr Care. 20.

  7. Nuti S, Vola F, Bonini A, Vainieri M. Making governance work in the health care sector: evidence from a ‘natural experiment’ in Italy. Health Econ Policy Law. 2016;11:17–38.

    Article  PubMed  Google Scholar 

  8. Nikpay SS, Richards MR, Penson D. Hospital-Physician Consolidation Accelerated In The Past Decade In Cardiology, Oncology. Health Aff (Millwood). 2018;37:1123–7.

    Article  Google Scholar 

  9. Lumpkin S, Stitzenberg K. Regionalization and its alternatives. Surg Oncol Clin N Am. 2018;27:685–704.

    Article  PubMed  Google Scholar 

  10. Kahn JM, Asch RJ, Iwashyna TJ, Rubenfeld GD, Angus DC, Asch DA. Perceived barriers to the regionalization of adult critical care in the United States: a qualitative preliminary study. BMC Health Serv Res. 2008;8:239.

    Article  PubMed  PubMed Central  Google Scholar 

  11. Payet C, Polazzi S, Lifante J-C, Cotte E, Grinberg D, Carty MJ, et al. Influence of trends in hospital volume over time on patient outcomes for high-risk surgery. BMC Health Serv Res. 2020;20:274.

    Article  PubMed  PubMed Central  Google Scholar 

  12. Christiansen T, Vrangbæk K. Hospital centralization and performance in Denmark-ten years on. Health Policy. 2018;122:321–8.

    Article  PubMed  Google Scholar 

  13. Barnett ML, Landon BE, O’Malley AJ, Keating NL, Christakis NA. Mapping physician networks with self-reported and administrative data: mapping physician networks. Health Serv Res. 2011;46:1592–609.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Moen EL, Austin AM, Bynum JP, Skinner JS, O’Malley AJ. An analysis of patient-sharing physician networks and implantable cardioverter defibrillator therapy. Health Serv Outcome Res Methodol. 2016;16:132–53.

    Article  Google Scholar 

  15. Di Vincenzo F. Exploring the networking behaviors of hospital organizations. BMC Health Serv Res. 2018;18:334.

    Article  PubMed  PubMed Central  Google Scholar 

  16. Lomi A, Mascia D, Vu DQ, Pallotti F, Conaldi G, Iwashyna TJ. Quality of care and interhospital collaboration: a study of patient transfers in Italy. Med Care. 2014;52:407–14.

    Article  PubMed  PubMed Central  Google Scholar 

  17. Nekkab N, Crépey P, Astagneau P, Opatowski L, Temime L. Assessing the role of inter-facility patient transfer in the spread of carbapenemase-producing Enterobacteriaceae : the case of France between 2012 and 2015. Sci Rep. 2020;10:14910.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. Iwashyna TJ, Christie JD, Kahn JM, Asch DA. Uncharted paths. Chest. 2009;135:827–33.

    Article  PubMed  Google Scholar 

  19. Reimer AP, Schiltz N, Koroukian SM, Madigan EA. NATIONAL INCIDENCE OF MEDICAL TRANSFER: Patient characteristics and regional variation. J Health Hum Serv Adm. 2016;38:509–28.

    PubMed  Google Scholar 

  20. Lawrence PR, Lorsch JW. Differentiation and integration in complex organizations. Adm Sci Q. 1967;12:1.

    Article  Google Scholar 

  21. Chambers D, Wilson P, Thompson C, Harden M. Social network analysis in healthcare settings: a systematic scoping review. PLoS One. 2012;7:e41911.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  22. Armitage GD, Suter E, Oelke ND, Adair CE. Health systems integration: state of the evidence. Int J Integr Care. 2009;9 Accessed 3 May 2021.

  23. van der Ham A, van Merode F, Ruwaard D, van Raak A. Identifying integration and differentiation in a Hospital’s logistical system: a social network analysis of a case study. BMC Health Serv Res. 2020;20:857.

    Article  PubMed  PubMed Central  Google Scholar 

  24. Or Z, Bellanger M. France: implementing homogeneous patient groups in a mixed market. In: Busse RGA, Quentin W, Wiley M, editors. Diagnosis-related groups in Europe. New York: Open University press; 2011. p. 221–41.

    Google Scholar 

  25. Trombert-Paviot B, Rector A, Baud R, Zanstra P, Martin C, van der Haring E, et al. The development of CCAM: the new French coding system of clinical procedures. HIM J. 2003;31(1):1–11.

    Google Scholar 

  26. Agence Technique de l’Information sur l’Hospitalisation. Regroupements des GHM en V2016. Accessed 27 Sep 2021.

  27. Jid AA, Almahri F, Bell D, Arzoky M. Personas Design for Conversational Systems in Education. Informatics. 2019;6:46.

    Article  Google Scholar 

  28. Jost L. Entropy and diversity. Oikos. 2006;113:363–75.

    Article  Google Scholar 

  29. Simpson EH. Measurement of diversity. Nature. 1949;163:688.

    Article  Google Scholar 

  30. Newman MEJ. Assortative mixing in networks. Phys Rev Lett. 2002;89:208701.

    Article  CAS  PubMed  Google Scholar 

  31. Linde S. The formation of physician patient sharing networks in medicare: exploring the effect of hospital affiliation. Health Econ. 2019;28:1435–48.

    Article  PubMed  PubMed Central  Google Scholar 

  32. Blanchet K, James P. How to do (or not to do) … a social network analysis in health systems research. Health Policy Plan. 2012;27:438–46.

    Article  PubMed  Google Scholar 

  33. Ni J, Qian T, Xi C, Rui Y, Wang J. Spatial distribution characteristics of healthcare facilities in Nanjing: network point pattern analysis and correlation analysis. Int J Environ Res Public Health. 2016;13:833.

    Article  PubMed Central  Google Scholar 

  34. Iwashyna TJ. The incomplete infrastructure for interhospital patient transfer. Crit Care Med. 2012;40:2470–8.

    Article  PubMed  Google Scholar 

  35. Stadtfeld C, Mascia D, Pallotti F, Lomi A. Assimilation and differentiation: a multilevel perspective on organizational and network change. Soc Networks. 2016;44:363–74.

    Article  Google Scholar 

  36. Bolland JM, Wilson JV. Three faces of integrative coordination: a model of interorganizational relations in community-based health and human services. Health Serv Res. 1994;29:341–66.

    CAS  PubMed  PubMed Central  Google Scholar 

  37. Kitts JA, Lomi A, Mascia D, Pallotti F, Quintane E. Investigating the temporal dynamics of Interorganizational exchange: patient transfers among Italian hospitals. Am J Sociol. 2017;123:850–910.

    Article  Google Scholar 

  38. Bosk EA, Veinot T, Iwashyna TJ. Which patients and where: a qualitative study of patient transfers from community hospitals. Med Care. 2011;49:592–8.

    Article  PubMed  PubMed Central  Google Scholar 

  39. Giuliani E. Network dynamics in regional clusters: evidence from Chile. Res Policy. 2013;42:1406–19.

    Article  Google Scholar 

  40. Coleman EA, Parry C, Chalmers S, Min S-J. The care transitions intervention: results of a randomized controlled trial. Arch Intern Med. 2006;166:1822–8.

    Article  PubMed  Google Scholar 

  41. Gibbons DE. Interorganizational network structures and diffusion of information through a health system. Am J Public Health. 2007;97:1684–92.

    Article  PubMed  PubMed Central  Google Scholar 

  42. Le Meur N, Ferrat L, Gao F, Quidu F, Louazel M. Maillage territorial des établissements de santé : apport des modèles issus de la théorie des graphes. J Gestion Écon Méd. 2017;35:197.

    Article  Google Scholar 

  43. Rankin DA, Matthews SD. Social network analysis of patient movement across health care entities in Orange County, Florida. Public Health Rep. 2020;135:452–60.

    Article  PubMed  PubMed Central  Google Scholar 

  44. Bari V, Hirsch JS, Narvaez J, Sardinia R, Bock KR, Oppenheim MI, et al. An approach to predicting patient experience through machine learning and social network analysis. J Am Med Inform Assoc. 2020;27:1834–43.

    Article  PubMed  PubMed Central  Google Scholar 

  45. Kuo Y-F, Agrawal P, Chou L-N, Jupiter D, Raji MA. Assessing association between team structure and health outcome and cost by social network analysis. J Am Geriatr Soc. 2020;69:946–54.

    Article  Google Scholar 

  46. Yeh C-M, Chou Y-J, Lin S-K, Liu C-J, Huang N. Patient-sharing relationship between Chinese medicine doctors and other physicians: costs and outcomes of breast cancer survivorship care. J Cancer Surviv. 2021;15:922–32.

    Article  PubMed  Google Scholar 

  47. Durojaiye AB, Levin S, Toerper M, Kharrazi H, Lehmann HP, Gurses AP. Evaluation of multidisciplinary collaboration in pediatric trauma care using EHR data. J Am Med Inform Assoc. 2019;26:506–15.

    Article  PubMed  PubMed Central  Google Scholar 

  48. Mascia D, Angeli F, Di Vincenzo F. Effect of hospital referral networks on patient readmissions. Soc Sci Med. 2015;132:113–21.

    Article  PubMed  Google Scholar 

  49. Lee BY, McGlone SM, Song Y, Avery TR, Eubank S, Chang C-C, et al. Social network analysis of patient sharing among hospitals in Orange County, California. Am J Public Health. 2011;101:707–13.

    Article  PubMed  PubMed Central  Google Scholar 

  50. Iwashyna TJ, Christie JD, Moody J, Kahn JM, Asch DA. The structure of critical care transfer networks. Med Care. 2009;47:787–93.

    Article  PubMed  PubMed Central  Google Scholar 

  51. Nathens AB, Jurkovich GJ, Maier RV, Grossman DC, MacKenzie EJ, Moore M, et al. Relationship between trauma center volume and outcomes. JAMA. 2001;285:1164–71.

    Article  CAS  PubMed  Google Scholar 

  52. Dudley RA, Johansen KL, Brand R, Rennie DJ, Milstein A. Selective referral to high-volume hospitals: estimating potentially avoidable deaths. JAMA. 2000;283:1159–66.

    Article  CAS  PubMed  Google Scholar 

  53. Mascia D, Pallotti F, Angeli F. Don’t stand so close to me: competitive pressures, proximity and inter-organizational collaboration. Reg Stud. 2017;51:1348–61.

    Article  Google Scholar 

  54. Lomi A, Pallotti F. Relational collaboration among spatial multipoint competitors. Soc Networks. 2012;34:101–11.

    Article  Google Scholar 

  55. Mascia D, Di Vincenzo F, Cicchetti A. Dynamic analysis of interhospital collaboration and competition: empirical evidence from an Italian regional health system. Health Policy. 2012;105:273–81.

    Article  PubMed  Google Scholar 

  56. Madhavan R, Gnyawali DR, He J. Two’s company, Three’s a crowd? Triads in Cooperative-Competitive Networks. AMJ. 2004;47:918–27.

    Article  Google Scholar 

  57. Mascia D, Di Vincenzo F. Understanding hospital performance: the role of network ties and patterns of competition. Health Care Manag Rev. 2011;36:327–37.

    Article  Google Scholar 

  58. Rokos IC, Sanddal ND, Pancioli AM, Wolff C, Gaieski DF. Inter-hospital communications and transport: turning one-way funnels into two-way networks. Acad Emerg Med. 2010;17:1279–85.

    Article  PubMed  Google Scholar 

  59. Kohler K, Ercole A. Can network science reveal structure in a complex healthcare system? A network analysis using data from emergency surgical services. BMJ Open. 2020;10:e034265.

    Article  PubMed  PubMed Central  Google Scholar 

  60. Nekkab N, Astagneau P, Temime L, Crépey P. Spread of hospital-acquired infections: a comparison of healthcare networks. PLoS Comput Biol. 2017;13:e1005666.

    Article  PubMed  PubMed Central  Google Scholar 

  61. So MKP, Tiwari A, Chu AMY, Tsang JTY, Chan JNL. Visualizing COVID-19 pandemic risk through network connectedness. Int J Infect Dis. 2020;96:558–61.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  62. Churak A, Poolkhet C, Tamura Y, Sato T, Fukuda A, Thongratsakul S. Evaluation of nosocomial infections through contact patterns in a small animal hospital using social network analysis and genotyping techniques. Sci Rep. 2021;11:1647.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  63. Teow KL, Tan KB, Phua HP, Zhu Z. Applying gravity model to predict demand of public hospital beds. Oper Res Health Care. 2018;17:65–70.

    Article  Google Scholar 

  64. Shahriar S, Qian L, Kea S, Abdullahi NM. The gravity model of trade: a theoretical perspective. RIC. 2019;5:21–42.

    Article  Google Scholar 

Download references


The authors would like to thank the staff of the Agence Technique de l’Information sur l’Hospitalisation for their technical assistance.

Informed consent statement

Not applicable.

Conflict of interest

The authors declare that they have no conflict of interest regarding this study.


This research received no external funding.

Author information

Authors and Affiliations



SSa, MCC and JC designed the study. SSt, MCC, SSa, DL, ALG and JC curated the database. The statistical analysis was conducted by JC. The methodology was developed by: JC, SSa and MCC. JC wrote the original manuscript. JC, ALG, DL, SSt, MCC and SSa edited the manuscript for critical intellectual content. MCC, SSa, DL and SSt supervised the project. All authors have read and approved the final manuscript.

Corresponding author

Correspondence to Jan Chrusciel.

Ethics declarations

Ethics approval and consent to participate

The study was declared to the French registry of studies using healthcare data (N° F20210106180024). As the study was retrospective, based on anonymized data and purely observational, it was exempt from Institutional Review Board approval according to the French Public Health Code (L1121-1, law number 2012-300, 5 march 2012).

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chrusciel, J., Le Guillou, A., Daoud, E. et al. Making sense of the French public hospital system: a network-based approach to hospital clustering using unsupervised learning methods. BMC Health Serv Res 21, 1244 (2021).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: