- Research article
- Open Access
Assessing social preferences in reimbursement negotiations for new Pharmaceuticals in Oncology: an experimental design to analyse willingness to pay and willingness to accept
BMC Health Services Research volume 21, Article number: 234 (2021)
Price negotiations for specialty pharmaceuticals take place in a complex market setting. The determination of the added value of new treatments and the related societal willingness to pay are of increasing importance in policy reform debates. From a behavioural economics perspective, potential cognitive biases and other-regarding concerns affecting outcomes of reimbursement negotiations are of interest. An experimental setting to investigate social preferences in reimbursement negotiations for novel, oncology pharmaceuticals was used. Of interest were differences in social preferences caused by incremental changes of the patient outcome.
An online experiment was conducted in two separate runs (n = 202, n = 404) on the Amazon Mechanical Turk (MTurk) platform. Populations were split into two (run one) and four (run two) equally sized treatment groups for hypothetical reimbursement decisions. Participants were randomly assigned to the role of a public price regulator for pharmaceuticals (buyer) or a representative of a pharmaceutical company (seller). In run two, role groups were further split into two different price magnitude framings (“real world” vs unconverted “real payoff” prices). Decisions had real monetary effects on other participants (in the role of premium payers or investors) and via charitable donations to a patient organisation (patient benefit).
56 (run one) and 59 (run two) percent of participants stated strictly monotone preferences for incremental patient benefit. The mean incremental cost-effectiveness ratio (ICER) against standard of care (SoC) was higher than the initial ICER of the SoC against no care. Regulators stated lower reservation prices in the “real world” prices group compared to their colleagues in the unconverted payoff group. No price group showed any reluctance to trade. Overall, regulators rated the relevance of the patient for their decision higher and the relevance of their own role lower compared to sellers.
The price magnitude of current oncology treatments affects stated preferences for incremental survival, and assigned responsibilities lead to different opinions on the relevance of affected stakeholders. The design is useful to further assess effects of reimbursement negotiations on societal outcomes like affordability (cost) or availability (access) of new pharmaceuticals and test behavioural policy interventions.
Price negotiations for specialty pharmaceuticalsFootnote 1 are characterized by specific features: They predominantly take place in bilateral oligopolistic or bilateral monopolistic market settings (patent-protection on the supply side and few to single payers on the demand side) [5, 9,10,11,12]. Further, the demand side is in most health systems divided in a “tripartite structure” of decider (provider), consumer (patient) and payer (insurance) [9, 11, 13, 14]. On top of that, the evaluation of the product characteristics (effectiveness, quality and safety) is inherently and increasingly complex [9, 14]. The availability of product information and thus the complexity of price negotiations has further increased with the introduction of cost-effectiveness models [5, 15, 16]. And it will further increase due to the demand to include also budget impact, burden of disease, socio-economic impact etc. in economic valuations for price determination besides cost-effectiveness [17,18,19,20]. To make a new treatment available to patients, sellers (pharmaceutical companies) and buyers (governmental regulators, health insurances or budget holders) need to agree on a reimbursable price. This holds true even in health systems with regulated pricing rules, since the market authorization holder can usually decide not to supply the respective market with the product. The determination of the added value of a new pharmaceutical and the society’s willingness to pay for an additional, quality-adjusted life year (QALY) plays an increasing role in policy reform debates [5, 16, 21,22,23,24,25]. Although the reform efforts are generally focused on the rules of value assessment and price determination, the process of reimbursement negotiations is itself subject to demands for reform [5, 7, 26,27,28].
The aim of our study is to broaden the debate from a behavioural perspective. The overarching ambition of our research is to bridge the gap between established health economic behavioural research on the one hand and the current discourse in research and policy on reforming pricing policies for new specialty pharmaceuticals on the other. Over the past decades, behavioural economic studies have provided insights and evidence on how individuals deviate from the standard assumption of neoclassical models [29,30,31,32,33]. In price negotiations in general, and reimbursement negotiations in particular, deviations due to anchoring effects, trade uncertainties, regret avoidance or concerns for others might affect negotiation outcomes [28, 31, 33, 34]. The following sections 2.3 to 2.5 summarize the relevant literature.
We used an experimental setting to investigate social preferences in reimbursement negotiations for novel, specialty pharmaceuticals. The aspired design shall be useful to assess negotiation situations in a controlled laboratory environment. Our main interest lies on differences in stated social preferences caused by incremental changes of the patient outcome. Of further interest are differences in those preferences between different treatment groups, especially in response to the assigned roles (valuation gaps). Preferences are measured by stated reservation prices complemented by statements on the relevance of affected stakeholders. In principle, the design might be applicable to any new therapy that has a defined benefit for the patient compared to an existing alternative, measured in terms of life expectancy and quality of life. However, the study is primarily intended to contribute to the current debate on how public pricing policies for “new high-cost innovative medicines” (European Commission) can be reformed to improve “value for money” [5, 22, 33]. Since healthcare payers and regulators in OECD are faced with increasing numbers of these high-priced pharmaceuticals, which are often (predominantly) categorized as “specialty pharmaceuticals” [5,6,7, 15]. Global spending for specialty pharmaceuticals is projected to make up 40% in 2024 . Especially new oncology treatments account for a high and increasing proportion of pharmaceutical expenditures in developed countries [7, 35]. This is likely to be accentuated as pharmaceutical companies’ oncology pipelines have grown by almost 80% in the last decade and now make up 30% of late-stage pipelines [36, 37]. Not surprisingly, regulators and payers are concerned with the “growth of pharmaceutical expenditures due to new high-cost innovative medicines” and their added value [5, 7]. For this reason, the implemented decision situation was geared towards the evaluation of specialty therapies with life-prolonging effects as in oncology.
The analysis of bargaining behaviour with offer statements and assessment of related societal effects is subject to a subsequent study , building on the findings below. We consider the division into two studies as necessary to clearly separate the analysis of stated preferences from the analysis of the interactive behaviour in a negotiation. It is crucial to understand whether societal effects of the reimbursement negotiation result from either contextual framing effects (of the assigned role and the setting) or from the negotiation interaction (bargaining). Our design is based on findings in the following fields of research:
Valuation gaps: willingness to pay vs. willingness to accept
Potential endowment effects, status-quo biases, exchange asymmetries, reluctances to trade or valuation gaps in decisions have been studied in behavioural economics since the seventies (see Zeiler  and Korobkin  for an overview). A valuation gap (VG) “exists when the most a person is willing to pay for an item (WTP) is less than the least amount that same person is willing to accept (WTA) to give up the same item […]” . This contradicts the neoclassic assumption that choices along the indifference curves are reversible [34, 38,39,40,41]. There have been different theories proposed to explain the phenomenon with no clear leader so farFootnote 2 . Although no “endowed” item is exchanged in our setting, role-related expectations, a different focus of the seller/buyer role [43, 49,50,51,52,53] or in general valuation and trade uncertainties [44, 45], regret avoidance (“bad-deal aversion”) [46,47,48] or moral commitments [54, 55] might cause a valuation gap . Our research interest is not to contribute to the explanation efforts in general but to evaluate possible VGs for new pharmaceuticals in a plausible and relevant laboratory setting.
Preferences for a quality adjusted life year (QALY)
There is an increasing interest in literature for preferences for a QALY, since cost-effectiveness thresholds are used in different countries to assess the value of new therapies [22, 23, 56, 57]. Several studies have already tried to assess WTP for a QALY in direct stated preference studies, mostly using some format of contingent valuation method [23, 58]. They did this either in general for a QALY of the overall population, or in specific disease areas and patient populations (e.g. diabetes) [59,60,61]. Our interest does not lie in nominal WTP values, but rather in differences induced the by role, the decision situation and the social effects of the decision.
Social preferences in health economic laboratory experiments
Experimental games like the ultimatum or the dictator game have been widely used to investigate how much individuals deviate from a self-interested utility maximizing behaviour if social norms and consequences are introduced [62,63,64]. Findings are of special use to understand individual contributions to public goods . However, there is still a surprisingly small number of experimental literature on redistribution, other-regarding or social preferences in choices of heath care provision [65,66,67,68,69,70,71]. Even a fewer number of these include, beside the treating physician and the patient, a third-party payer [67, 72]. The laboratory experiments on health insurance choices on the other side mostly focus on individual risk preferences (e.g. ), moral hazard (e.g. ) or willingness to pool (e.g. ), but not on social preferences for incremental (insurable) QALYs. No experimental evidence on pricing negotiations for pharmaceuticals was found in our systematic literature review .
Our experimental design integrates the three research interests to assess exchange asymmetries (WTP vs. WTA), QALY preferences (WTP for health) and social preferences (WTA or WTP reflecting distributional effects) . This is necessary to assess preferences in reimbursement negotiations for new health interventions, particularly for specialty pharmaceuticals.
Participants were randomly assigned into two (run one) and four (run two) treatment groups of equal size. Populations were split to play either in the role of a health minister (regulator) or as a representative of a pharmaceutical company (seller). In run two the role groups were further split into two different price magnitude framings. One with fictive “real world” prices (100 k$ group) vs one with prices at the “real payoff” level (1$ group). While final payoffs were equal in both groups, prices in the “fictive price” group were converted for the game to 100,000$ = 1USD (in the following, prices for the 100 k$ group are used for simplification). Participants were informed that their decisions would have real monetary consequences on other participants (payoffs for their passive role as premium payers or investors) and in form of charitable donations to a patient association (proxy for patient benefit).Footnote 3 See Table 1 below for an overview of the runs and treatment groups.
Overview setting (contextual framing)
The reimbursement situation involved a hypothetical country with seven citizens, represented by five different types of stakeholders (Fig. 1). A single patient was chosen for comparability with existing experiments on physician treatment situations (e.g. [66,67,68,69,70]) while the funders were represented by a group of two each. A small patient number is also a plausible framing considering the increasing proportion of small and orphan indications in specialty treatment areas in general and oncology particular, for which high-priced pharmaceuticals are offered [5, 6, 25, 78, 79]. Regarding the number of affected funders the setting builds on the design of Kesternich et al.  as well as on Schumacher et al. who found that deciders in a distributional decision situation “attach the same weights to small and large groups“ of payers .
The role of each stakeholder was explained as follows (scenario displayed in Fig. 1):
The patient is suffering from a deadly blood cancer and is under treatment with an existing therapy (current standard of care, SoC), a pharmaceutical product with a known benefit to the patient.
The regulator is in charge of regulating prices for pharmaceuticals, eligible for payment by the public health insurance.
The seller represents an international pharmaceutical company that developed a new product to treat the patient. He is in charge of negotiating an officially reimbursed price for the product.
The premium payers finance the public health insurance. The collected premiums are the only financial source to pay for the patient’s treatment. Only treatments approved by the government for reimbursement are covered. If health expenditures are lower than the actual premiums, payers can accumulate savings. If the expenditures are higher, they will have to eat up their savings.
The investors have invested their savings in the past into the pharmaceutical company. They expect a return on their investment, which compensates them for the additional risk they took, compared to a “risk-free” investment in a government bond for example.
Overview decision situation
The reimbursement process was explained as follows:
The seller offers a new treatment at a proposed price. If the regulator considers the price to be too high, he will refuse approval of the product. Vice versa, if the seller receives a (simultaneous) counterproposal considered too low, he will not introduce the product in this market. If both parties agree, the patient will get access to the new treatment and the related benefits (life expectancy in months m, quality of life in percent q). The two investors will in consequence receive a revenue (price divided by two) and the two payers will pay the cost (leaving them an equal share of the difference between premiums and price). The patient’s quality of life is equal to his work ability. The income for a healthy person (q = 100%) is 10,000$ per month or 120,000$ per year. This gives the patient a benefit (total income) of q × 10,000$ × m. See Table 2 below for an overview.
The regulator and the seller are both employed and receive a fixed salary. An agreement is possible with a reservation price (xS) of the seller below or equal to the reservation price (xR) of the regulator. The price decision is constrained to a range between 50,000$ (which is the price of the current SoC) and 500,000$.
Price parameters were chosen to meet the following conditions:
The fictive yearly incomes had to be in a plausible range for the framed responsibility (average compensation of middle managers in public services in OECD ).
Fictive price and salary numbers, as well as real payoffs, had to be easy to use in explanations and calculations, since the topic is complex for an experiment.
Conversion between fictive and real payoff amounts had to be simple and comparable while fulfilling at the same the time minimum wage requirement of the MTurk platform.
A “robust finding” from past laboratory experiments in economics is that “individuals take into account the welfare of all parties and have a preference for efficient outcomes” [33, 62, 63, 80, 83, 84] and that “non-selfish preferences are the rule rather than the exception” . Building on these research results, the model used in this study was based on a simple CESFootnote 4-function, where a rational decider (regulator, seller) should maximize his or her social utility considering the utility of the other involved stakeholders besides his or her own. For details on the underlying model, see Additional file 2.
Null hypothesis 1: reservation prices (x) are randomly distributed between 50,000$ and 500,000$ in each round (r) and do not differ between rounds with incremental patient benefit (E[Xr] = 275,000$).
The first hypothesis tests whether the mean participant responds to the incremental patient benefit between rounds and, if not, whether the mean price is equal to the expected value of a random distribution. The null hypothesis assumes a rational decider with no other-regarding preferences.
Null hypothesis 2: reservation prices equal Xr = 120,000$ in each round.
The second hypothesis tests whether the mean participant responds to the incremental patient benefit and, if not, whether the mean price leads to an equal distribution of payoffs between the two passive funders. The null hypothesis is based on the assumption that a rational decider does not care about incremental patient benefit, but is averse to inequality between funders.Footnote 5
Null hypothesis 3: the incremental cost-effectiveness ratio (ICER) compared to the given standard of care (SoC) equals 20,000$ for every round.
The third hypothesis tests whether the mean participant applies a simplifying heuristic to reduce the complexity of the decision situation instead of reflecting on the reservation price. To do this, the player simply divides the costs of the given standard treatment of 50,000$ by its effect of 5 months survival and increases the price for a new therapy option by 10,000$ for every additional month of survival. Reflecting quality of life this corresponds to an ICER of 20,000$ per quality adjusted life month (QALM, see Additional file 2). The null hypothesis assumes a bounded-rational participant with efficiency concernsFootnote 6
Null hypothesis 4: reservation prices converted to payoff-magnitude do not differ between price groups for any round.
The fourth hypothesis tests whether the conversion (framing) of the price range for the 100 k$ group to the magnitude of real world list prices for pharmaceuticals in oncology has an impact on the stated reservation prices (converted back to the level of the final payoffs). The null hypothesis assumes a rational decider who is only interested in the actual payoffs to affected stakeholders at the end of the experiment.
Null hypothesis 5: reservation prices do not differ between role groups for any round.
The fifth hypothesis tests for potential valuation differences (WTP ≠ WTA) with special interest in valuation gaps (WTP < WTA, representing a reluctance to trade). The null hypothesis assumes a rational decider for whom the reservation price is independent of the assigned role.Footnote 7
Null hypothesis 6: ranking of the stakeholders’ relevance is identical for all groups.
The sixth hypothesis tests for possible differences in the weighting of the stakeholders involved, which is determined after the pricing decisions have been made. The null hypothesis assumes a rational decider for whom the weighting of affected stakeholders is independent of the assigned role and who is only interested in the actual payoffs to affected stakeholders at the end of the experiment.
An online experiment was conducted in two separate runs (n = 202, n = 404) on the Amazon Mechanical Turk (MTurk) platform.Footnote 8 The platform allows to conduct an experiment at a narrow timeframe, easily process monetary payoffs and has known reliability (see 3.5). Players had to use a slider to submit their prices.Footnote 9 Regulators were instructed to state the “absolute maximum price”, which they would “still consider reasonable and fair for the new product”, while sellers were instructed to state their “absolute minimum price”. Both roles received an identical introduction, apart from their role framing. Participants played the same five rounds to state their reservation prices in both runs. In run one, participants played the reservation price game twice, switching to the opposite role for the second game. Participants had to rank all involved stakeholders after each game in terms of their relevance for the decision. An introductory training was voluntary in run one and mandatory in run two. To control for understanding of the setting, participants had to answer four comprehension questions in run two, three of them forcing correct answers, one without feedback (effect of incremental survival on patient benefit). To test for insufficient attention, an instructional manipulation check was implemented in run two [87, 88].
We used the strategy method  to trigger the financial consequencesFootnote 10: Participants were informed at the beginning of the experiment that all games would be potentially relevant for the social payoffs. After the experiment, in run one, one out of ten rounds was randomly selected for each participant and implemented for all four passive stakeholders randomly assigned. For run two, all rounds of game two were implemented. Participants were given full information on how any potential price would affect all stakeholders in each round, provided with a table that dynamically displayed all social payoffs for each slider position (see Additional file 1). Further they were shown their decisions from the previous rounds for comparison. In run two a message was displayed, if the entered price was equal or lower than in the previous round. However, participants were allowed to ignore this and submit any price in the given range.
For details on the displayed screens and the payoffs at the end of the experiment, see Additional file 1. The full experimental instructions used in this and the subsequent study  are available in Additional file 3 (run one) and Additional file 4 (run two).
Preference elicitation method
While single binary-choice surveys are currently the preferred format for public goods preference elicitations, alternative formats of contingent valuation can be necessary and incentive-compatible under certain conditions [91, 92]. The implementation of a dichotomous choice format or a bidding procedure would neither have been appropriate nor implementable in the present setting.Footnote 11 The contingent valuation method was used in a direct matching format instead, in line with comparable laboratory experiments on social preferences in health care [66,67,68,69,70]. The decision situation, while not dichotomous, was discrete in the sense that participants were forced to choose their reservation price from a defined price range with a finite number of increments.Footnote 12 For each of these price increments all monetary consequences for every stakeholder involved were displayed dynamically in the decision table (see Figure 12 in Additional file 1). The table further displayed the change for all stakeholders for any potential price decision versus the status quo (baseline).Footnote 13 In order to ensure incentive compatibility, three known sources of bias had to be addressed: bias due to strategic behaviour (i), bias due to lack of understanding of the monetary consequences (ii), bias due to a lack of engagement in the game or indifference to the social payments (iii) [91, 97, 98].
As per the model presented, a rational player had no monetary incentive to over- or understate preference statements. Since this would have led to a suboptimal distribution of payoffs between payers and investors after the experiment, without benefiting the patient or the negotiators themselves.
To avoid bias due to over- or underestimation of cost and benefit of the decision, participants were provided with the above mentioned decision table consisting of all relevant monetary consequences for any potential price (multi-attribute alternatives). In addition, performance in the introductory training was surveyed to control the results for proper understanding (see 3.3.).
In online experiments in general, it is important to distinguish participants who understand and follow the instructions from those who focus on completing the survey as quickly as possible to minimise the time spent [87, 88]. In order to do this, the populations were divided after completion of the experiment into those with strictly monotone preferences, as per introduction and training, and those without. The grouping was tested for differences in time spent on the experiment, performance on the training and performance on the attention screener (see 3.3.). Results were reported separately for the two groups.
No expert population was deliberately selected for the experiment. The expert surveys conducted in the recent past with competent authorities indicate how small the available pool might be (see e.g. [8, 26, 99, 100]). More importantly, the individual experience of professionals due to country specifics, focus in therapy area etc. would not be controllable. Also, the connection to the behavioural economic evidence from laboratory experiments (valuation gaps, social preferences) introduced above would hardly be plausible, nor would the implementation of monetary incentives. Instead the MTurk population with studied characteristics was used . In past years, convincing evidence has been generated to support the reliability of MTurk results compared to laboratory and field experiments in general and the usefulness for assessing social preferences specifically [87, 102,103,104,105,106,107,108,109]. To avoid bias due to health system-related differences, participants had to be US resident (the vast majority of MTurk workers logs in from the US with over 70% ), at least 18 years of age. The target population was deliberately not focused on the health sector, but relevant professional experience was surveyed (health service providers, public authorities, pharmaceutical companies, health insurance companies, etc.). Additional demographic information, risk behaviour and health experience were surveyed at the end of the experiment. The respective variables were used to control the results.
We used Chi-square, Cramer’s V, Fisher’s exact test, independent and paired sample t-tests, independent Mann-Whitney U and Wilcoxon matched-pairs signed-rank tests, as well as hierarchical linear regressions (two-level random effects model).
Prices in the 100 k$ groups were converted in the following to the real payoff magnitude (at the end of the experiment) for comparison with the 1$ groups.
Average prices increased overall with each consecutive round and incremental patient benefit in all three reservation price games (p < 0.01, except first increment run one p < 0.05). All mean prices were different from the expected value of a random distribution (null hypothesis 1 rejected at p < 0.01, except first round in run one p < 0.05). Furthermore, for all rounds they were higher than 1.2$, which would have distributed assets evenly between payers and investors (null hypothesis 2 rejected at p < 0.01).
Whereas mean reservation prices suggested strictly monotone preferences for incremental patient benefit overall, this was only true for 56 and 57 (run one) respectively 59% (run two) of the respondents. While the majority of the participants submitted four consecutive times a higher price, 14 respectively 17% stated once or twice non-strict preferences for incremental patient benefit in run two. The group of participants with strictly monotone preferences had significantly lower mean reservation prices compared to the other participants (p < 0.01, except run one two rounds p < 0.05, one round not significant). For the latter group, the difference to the expected value of a random distribution was still significant (p < 0.01) for run one and the first three rounds in run two (p < 0.05). However, the significantly higher variances in this preference group (p < 0.01) raise the question whether these respondents focused on finishing the experiment as fast as possible to collect the fixed salary, rather than on deciding on their reservation price as per instruction. These participants spent significantly less time on the experiment in both runs (p < 0.01). They also did significantly worse in answering comprehension question four and detecting the attention screening question (p < 0.01) with 81% missing the screener.
The reservation prices per round were not different between the two runs for participants with monotone preferences, while prices for participants with non-monotone preferences were not comparable (p < 0.01, first round p < 0.05). Unless stated otherwise the following results will focus on the monotone preference group in run two.
Cost-effectiveness as simplifying heuristic
The cost-effective prices were the most frequent answers looking at the modal value (except for one round in run one with two equal modes). However, the mean ICER per QALM versus SoC was significantly higher than 0.2$ overall (null hypothesis 3 rejected at p < 0.01, Fig. 2) and for all treatment groups (p < 0.01, except group three p < 0.05 for the first three rounds, round four not significant). The consecutive ICERs between rounds decreased in the monotone group from round one to three, increasing again in round five (p < 0.01, in round four not significant). One explanation could be that the mean participant did not adjust to the higher incremental survival in round four (+ 3 m) compared to the rounds two, three and five (+ 2 m).
Valuation differences due to the price magnitude
Mean reservation prices between the two price groups did not differ overall (p > 0.05). They did, however, looking at WTP and WTA separately (Fig. 3). Regulators stated lower prices in the 100 k$ group compared to their colleagues in the 1$ group in round two to five (p < 0.05). While sellers of the 100 k$ group had higher reservation prices, but only in the first round (p < 0.05). If we filter results further for participants who detected the attention screener, the effect becomes significant for all rounds in the regulator group (null hypothesis 4 rejected at p < 0.01) and no round in the seller group.
We tested two potential valuation gaps, one between subjects (null hypothesis 5a) and one within subjects (null hypothesis 5b, run one). The paired t-test did not differ for the same round between games for none of the rounds in run one, hence we cannot assume valuation gaps within subjects (null hypothesis 5b not rejected with p > 0.05).
Looking at the mean reservation prices in run two (Fig. 3), we found a potential valuation gap between regulators and sellers in the 100 k$ group since WTP < WTA for all rounds. This would represent a reluctance to trade. However, this gap was not significant overall (null hypothesis 5a not rejected with p > 0.05). In the 1$ price group, we found a negative gap (“preference range”) with WTP > WTA for all rounds. This gap was significant for round one and two (null hypothesis 5a rejected at p < 0.05). The Mann-Whitney test confirmed no role-related differences for the 100 k$ group, while distributions were different for two rounds in the 1$ group (p < 0.05). The later effect vanished if controlled with the attention screener.
Ranking of stakeholders’ relevance
In the first game, the patient was for the majority of participants the most important stakeholder (both runs), followed by the own role (run two p < 0.01). After the role switch in run one, the own role caught up to the patient, rated together more often first, compared to the other stakeholders (p < 0.01). The ranking of the most relevant stakeholder differed between preference groups in both runs and all games (p < 0.01, p < 0.05 for the second game in run one) with a higher rating of the patient’s relevance by participants with strict monotone preferences.
Looking at the full stakeholder ranking (not only number ones), participants with monotone preferences in run two ranked their negotiation partner third, after the patient (first p < 0.01) and the own role (second p < 0.01), before investors and premium payers (p < 0.05, Wilcoxon test).
Interestingly, participants in run two, who rated the patient as number one, submitted lower prices in all rounds compared to those who rated themselves first (p < 0.01). Controlling for preferences, the effect is still significant in all rounds for participants with strict monotone preferences (p < 0.05), but only for the first round in the non-monotone group (p < 0.05).
The ranking was further significantly different between the two roles in the group with monotone preferences in both runs (null hypothesis 6a rejected at p < 0.01, p < 0.5 for game one, run one) with regulators rating the patient more often as most important. The ranking of the own role as most important on the other side was more frequent in sellers. The ranking did not differ between the two price groups (null hypothesis 6b not rejected at p > 0.01).
Full regression model
A linear regression (two-level random effects model) over all 1190 reservation prices of the monotone group in run two was performed (Table 3). The results confirm the findings above in general, particularly increasing preferences for incremental patient benefit (p < 0.01). The regression confirms the finding regarding price magnitude (4.3 above) with lower price statements for regulators in the 100 k$ group compared to their colleagues in the 1$ group (p < 0.01 with model 2, p < 0.05 with model 1 and 3). In addition to the above findings, prices increased less in the $100 k group (p < 0.05), accompanied by a higher average price across all rounds (p < 0.01). The regression further confirms the finding regarding role-related differences (4.4 above); sellers had lower prices than regulators in the 1$ group (p < 0.05), while prices did not differ between roles in the 100 k group (p > 0.05). In contrast to the results above, controlling for sufficient attention (screener) did not neutralize the role-related difference in the $1 group. The regression finally confirms the correlation between patient orientation and reservation price (4.5 above). With the refinement that players who rated the patient as the most important stakeholder did not generally have lower reservation prices than players who prioritised their own role (p > 0.05). Rather, the prices of the former rose less than those of the latter (p < 0.05). However, the effect is weak and limited to the 100 k group. The extended model 3 also shows that the effect seems to be driven by self-oriented players and differs between roles, with higher price sensitivity for sellers who prioritise their own role over the patient (p < 0.05). The attention screener explained price differences overall (p < 0.01) and in the 1$ prices group (p < 0.05), while the comprehension question had no additional impact (p > 0.05).
The objective of the present study was to assess differences in stated social preferences caused by incremental changes of the patient outcome in a controlled experiment. Of special interest were differences between assigned roles (valuation gaps) and the price magnitudes. The design proposed led to meaningful results with a majority of participants stating strictly monotone preferences for incremental survival of the patient. No systematic reluctance to trade (valuation gaps) was found which would prevent an agreement (patient access) between mean negotiators in a subsequent trade interaction. However, the found impact of the assigned role on the stated relevance of the stakeholders involved could have a potential influence in a subsequent price negotiation.Footnote 14 From a methodological point of view, the valuation differences found in the $1 group require further investigation and potential improvement of the design. The preference range could be due to the complexity of the decision situation in general, which should, however, affect both price groups equally.
The price magnitude of current oncology treatments seems to affect stated preferences for incremental survival. Differences found between price groups call for further investigation of different price framings. It cannot be completely ruled out that the given (limited) price range had an influence on the stated preferences. Such an anchoring effect should be the same across price groups, since the real payoffs were also the same. However, the mean differences found could vary just as well with a larger or smaller given price range. This might be important, especially since anchoring in real reimbursement negotiations, driven by the first offer (but also by the standard of care), is likely to play a relevant role for negotiation outcomes .
The finding that participants did not apply a stable cost-effectiveness rule as a simplifying heuristic could be a promising starting point for further research. To address the need mentioned above to understand the influence of the complexity of the decision situation, as well as the impact of nominal price anchors. The incremental cost-effectiveness ratio (ICER) reflects financial consequences in relation to the resulting patient benefit. The introduction of an ICER-calculator in the pricing decision could help to reduce potential behavioural biases. The available evidence from laboratory experiments shows that participants do show efficiency concern and use or are susceptible to simplifying heuristics [67, 73, 83, 86]. At the same time, investigating the use of cost-effectiveness in reimbursement negotiations would be a plausible bridge to the real world setting and related reform debates. The key figure has been found to be the most important predictor for NICE decisions for example [27, 112]. While other countries such as France, Germany, Sweden or Italy also require or allow elements of economic evaluation (like cost-effectiveness or cost-benefit) to directly or indirectly influence their reimbursement decisions, the evidence for the effectiveness of such measures is generally still scarce [18, 76, 113, 114]. The design presented could be enhanced by introducing an ICER tool to test its potential impact on reservation prices and value-based price negotiations.
Our subsequent study on the design presented will complete the negotiation setting with binding price offers to an assigned negotiation partner . This is important, since the effective bargaining behaviour is expected to have an additional impact on prices offered, compared to the players’ private reservation price. The analysis of the related negotiation outcome will further allow us to analyse and discuss societal effects regarding cost and availability of new pharmaceuticals. A third experimental study will finally test, whether behavioural (policy) interventions have a positive or negative impact on these societal effects.
The heuristics and implications of behavioural economics have been integrated into policy analysis and implementations over the past decades, in health care as well as in other policy areas [32, 33, 115,116,117,118,119,120]. Yet, external validity of results from laboratory experiments, especially from social-preference games, depend greatly on the relevant context of the experiment [33, 64, 119, 121,122,123,124,125,126]. A concern towards the study presented could be the transferability of observed behaviour from the mean participant to a professional negotiator in a real life reimbursement situation . While the difference in experience or sophistication should not be neglected, we still deem the results relevant due to two main reasons. On one side empirical research comparing inexperienced with experienced and professional traders in financial markets have shown, “that experience reduces behavioural biases, but biases remain relevant even for experienced traders” [33, 127,128,129,130,131,132,133,134,135]. This has recently been confirmed for bounded rationality and social preferences in experiments between physician and student populations [69, 71, 72]. On the other side certain biases are much more likely to be linked to “trade uncertainties” and related anticipated regrets which cannot be fully reduced by market experience [44,45,46,47,48]. At least not in markets where “the subjects of the negotiation as well as the relevant rules of the game” are relatively complex and less stable compared to simple, repetitive good exchanges . On a continuum of trade complexity pharmaceutical reimbursement negotiations might be closer to financial market trades than to repetitive trades of exchangeable commodities. However, the weakness of the presented study is certainly the fact that it does not reflect the expressed preferences of professional individuals, which have a direct influence on the reimbursement of new specialty pharmaceuticals in real-world. Moreover, it focuses on a hypothetical situation that had social consequences (on patients and fellow players), but did not take into account long-term consequences due to dependence on an organisation as an employed or mandated negotiator. As mentioned at the beginning, we see our study as bridging the gap between established behavioural research in health economics on the one hand and the current discourse in research and politics on reforming pricing policies for new specialty pharmaceuticals on the other. Further research along these lines could seek to connect with the policy-oriented research with observational data of expert surveys on reimbursement decisions (see , as well as [8, 26, 100]). Laboratory experiments can serve as complements to non-experimental methods  and have the potential to be a “‘wind tunnel’ before implementing large-scale studies or institutional changes of the healthcare market” [120, 136]. In our setting of interest their potential in this regard might even be higher, since reliable real-world data on pharmaceutical pricing negotiations is hardly available, not least because of confidential agreements in many European countries [5, 8, 25, 26, 33, 100].
Price negotiations for specialty pharmaceuticals take place in a complex market setting. The determination of the added value of a novel treatment and the related societal willingness to pay are of increasing importance in policy reform debates [5, 16, 21,22,23,24,25]. Not only the pricing rules but also the process of reimbursement negotiation itself is subject to demands for reform [5, 7, 26,27,28]. From a behavioural economics perspective potential cognitive biases affecting negotiations outcomes are of interest [28, 31, 33, 34]. Laboratory experiments could provide a useful test environment for behavioural policy interventions. Assuming that bounded rationality and other-regarding concerns may differ between inexperienced and experienced traders, but remain relevant in both. As is the case in other markets with complex transactions. Our findings show, that the price magnitude in a reimbursement negotiation for pharmaceuticals in oncology has an impact on stated preferences for incremental survival. Further, the assigned role in the negotiation has an impact on the stated relevance of affected stakeholders. Both could have an undesirable influence on reimbursement negotiations. The design was found useful to further assess the effects of the negotiation setting on societal outcomes like cost and availability of new specialty pharmaceuticals in an experimental setting and test appropriate behavioural policy interventions.
Availability of data and materials
Experimental instructions used in this and the subsequent study  are available in Additional file 3 and Additional file 4. Datasets (fully anonymized) analysed during the current and the subsequent study  are available in the zenodo repository https://doi.org/10.5281/zenodo.3575971 upon request.
We use the term “pharmaceutical” in this study synonymous to the terms “medicine”, “medicinal product” or “drug product”. Following the definition of the World Health Organization (WHO) and the European Medicines Agency (EMA) this refers to any “substance or combination of substances that is intended to treat, prevent or diagnose a disease, or to restore, correct or modify physiological functions by exerting a pharmacological, immunological or metabolic action” [1,2,3]. This definition is similar to the one used in the United States by the Food and Drug Administration (FDA) . Our study focuses on the reimbursement of “new high-cost innovative medicines” (European Commission ) which are often (predominantly) categorized as “specialty pharmaceuticals” . There is no standard definition of a speciality pharmaceutical, let alone a scientific definition of an “innovative” therapy [6, 7]. Following the frequently referenced definition of IQVIA, we understand it to mean pharmaceutical products that treat chronic, complex or rare diseases (i.e. often more serious medical conditions such as cancers, autoimmune diseases, hepatitis C, etc.) [6, 8]. They are initiated or managed by a specialist and are typically offered at a high list price, so patients need financial support to pay for them .
The earlier explanations suggested that individuals experience disutility from losing an endowed good (loss aversion) which triggers reluctance to trade [34, 40]. Later attempts explain the phenomenon e.g. by extra utility experienced from owning a good, respectively a different valuation depending on ownership or role [34, 42, 43]. Newer evidence suggested to explain valuation gaps by choice or trade uncertainty or by regret-avoidance [44,45,46,47,48]. None of these attempts has been able to predict all published empirical findings [34, 38].
Donated to the Leukemia & Lymphoma Society (LLS). Participants were further informed, that LLS “provides financial support for patients with blood cancer (https://www.lls.org/support/financial-support)”.
Constant elasticity of substitution
For considerations on efficiency concerns see for example .
The incremental cost-effectiveness ratio of the given standard of care versus no treatment can be derived by dividing the incremental costs (50,000$-0$) by the incremental, quality adjusted survival (5×0.5–0×0.5). For details on efficiency and effectiveness measures, see Additional file 2. For considerations on simplifying heuristics, or on efficiency concerns from a behavioural economics perspective, see for example [67, 73, 83, 86].
The experiment was implemented as Qualtrics survey and linked on MTurk as Human Intelligence Task (HIT). The Decision Science Laboratory of ETH Zurich (DeSciL) executed the experimental runs and delivered anonymized data to the researchers. Bonus distributions were performed by the DeSciL to ensure participants remain anonymous to the researchers. The first run of the experiment was conducted on 18 February 2019, run two on 2 May 2019. We ran a technical pre-test of the survey on 22 December 2018 (n = 31) to check for basic understanding by MTurk users, the indicated estimation of time needed in the informed consent and potential errors in the survey flow.
The price range started at 50,000$ (representing the price of the available standard of care SoC) and ended at 500,000$ for the 100 k$ group, respectively at 0.5$ and 5$ for the 1$ group. The slider allowed the participant to “discretely” compare choices in the price range by increments of 2000$ (100 k$ group) respectively 0.02 (1$ group). Default position of the slider was SoC at the very left.
Meaning that participants did not know which of the games they played would be relevant for payoff. This has been used in similar laboratory experiments, based on the findings of Brandts and Charness (2011) who found no difference between the strategy method, where players make “conditional decisions for each possible information set” versus the direct-response method [67, 80, 85, 89, 90].
To obtain actual WTP and WTA values for the negotiation setting, a single binary or dichotomous choice would not have been sufficient. This was however crucial for the extended setting of the follow-up study with price offers paired with an actual negotiation partner . Alternatively, the decisions would have had to be repeated in an iterative process, which would have meant an inflationary increase in the duration per round and the experiment overall. In addition, the statements in these procedures can also be distorted by anchor effects and interpretation errors due to the increased complexity of the decision . Due to the complexity of the decision situation the implementation of a single bid mechanism like the Becker-DeGroot-Marschak (BDM)  or the Random Price Voting Mechanism (RPVM)  had to be abandoned as well, in light of the known challenges with the BDM mechanism [34, 95, 96].
A closed-ended format was chosen since an unlimited price range would have required players to be provided with unlimited starting capital to avoid a negative payoff for payers. According to standard laboratory policies, participation in an experiment may not result in any payment obligation for the players in real world. Otherwise, a price increase by the players above a certain level would have no or a distorted distribution effect.
Consequently, in the broadest sense, participants had to choose from a (multinomial) discrete ‘choice set’, where the set was however not observable in a single view with all properties per choice.
- 100 k$:
Incremental cost-effectiveness ratio
Amazon Mechanical Turk
Organisation for Economic Co-operation and Development
Quality adjusted life month
Quality-adjusted life year
Quality of life
Standard of care
Willingness to accept
Willingness to pay
European Medicines Agency (EMA): Glossary. 2020. https://www.ema.europa.eu/en/glossary/medicinal-product. Accessed 31 Dec 2020.
World Health Organization: Essential Medicines and Health Products: Prequalification of medicines - Glossary. 2020. https://extranet.who.int/pqweb/content/glossary. Accessed 31 Dec 2020.
Vogler S, Zimmermann N. Glossary of pharmaceutical terms. In: WHO collaborating Centre for Pharmaceutical Pricing and Reimbursement Policies (ed.). Vol. update July 2016. Vienna: WHO Collaborating Centre for Pharmaceutical Pricing and Reimbursement Policies; 2016.
U.S. Food and Drug Administration (FDA): Drugs@FDA Glossary of Terms. 2017. https://www.fda.gov/drugs/drug-approvals-and-databases/drugsfda-glossary-terms. Accessed 31 Dec 2020.
Expert Panel on effective ways of investing in Health (EXPH). Opinion on Innovative payment models for high-cost innovative medicines. Luxembourg: Publications Office of the European Union; 2018.
Kleinrock M, Muñoz E. Global medicine spending and usage trends, outlook to 2024. In: IQVIA Institute for Human Data Science; 2020.
OECD: Pharmaceutical innovation and access to medicines. OECD Health Policy Studies, (2018).
Morgan SG, Vogler S, Wagner AK. Payers’ experiences with confidential pharmaceutical price discounts: A survey of public and statutory health systems in North America, Europe, and Australasia. Health Policy. 2017;121(4):354–62. https://doi.org/10.1016/j.healthpol.2017.02.002.
Hajen L, Paetow H, Schumacher H. Gesundheitsökonomie: Strukturen - Methoden - Praxisbeispiele, 8th ed. Stuttgart: Kohlhammer; 2017.
Mankiw NG. Principles of economics. 8 ed. Cengage Learning; 2017.
Schoonveld E. The price of global health: drug pricing strategies to balance patient access and the funding of innovation. 2nd ed. London: Routledge; 2016.
Grepperud S, Pedersen PA. Positioning and negotiations: the case of pharmaceutical pricing. Eur J Pol Econ. 2020;62:101853.
Mossialos E, Dixon A, Figueras J, Kutzin J. Funding health care: options for Europe. Buckingham: Open University Press; 2002.
Mankiw NG. The economics of healthcare; 2017.
Vogler S, Paris V, Ferrario A, Wirtz VJ, de Joncheere K, Schneider P, Pedersen HB, Dedet G, Babar ZU. How can pricing and reimbursement policies improve affordable access to medicines? Lessons Learned from European Countries. Appl Health Econ Health Policy. 2017;15(3):307–21. https://doi.org/10.1007/s40258-016-0300-z.
Belloni A, Morgan D, Paris, V. Pharmaceutical expenditure and policies: past trends and future challenges. In: OECD Health Working Papers. vol. 87. Paris: OECD Publishing; 2016. https://doi.org/10.1787/5jm0q1f4cdq7-en.
Angelis A, Kanavos P. Value-based assessment of new medical technologies: towards a robust methodological framework for the application of multiple criteria decision analysis in the context of health technology assessment. Pharmacoeconomics. 2016;34(5):435–46. https://doi.org/10.1007/s40273-015-0370-z.
Angelis A, Lange A, Kanavos P. Using health technology assessment to assess the value of new medicines: results of a systematic review and expert consultation across eight European countries. Eur J Health Econ. 2018;19(1):123–52. https://doi.org/10.1007/s10198-017-0871-0.
Danzon PM. Affordability challenges to value-based pricing: mass diseases, orphan diseases, and cures. Value Health. 2018;21(3):252–7. https://doi.org/10.1016/j.jval.2017.12.018.
Lakdawalla DN, Doshi JA, Garrison LP Jr, Phelps CE, Basu A, Danzon PM. Defining elements of value in health care-A health economics approach: an ISPOR special task force report . Value Health. 2018;21(2):131–9. https://doi.org/10.1016/j.jval.2017.12.007.
Pani L, Montilla S, Nemeth G, Russo P, Viceconte G, Vogler S. Balancing access to medicines and sustainability in Europe: an analysis from the network of competent authorities on pricing and reimbursement (CAPR). Pharmacol Res. 2016;111:247–50. https://doi.org/10.1016/j.phrs.2016.05.022.
Paris V, Belloni A. Value in pharmaceutical pricing. Paris: OECD Publishing; 2013.
Cameron D, Ubels J, Norstrom F. On what basis are medical cost-effectiveness thresholds set? Clashing opinions and an absence of data: a systematic review. Glob Health Action. 2018;11(1):1447828. https://doi.org/10.1080/16549716.2018.1447828.
Thokala P, Ochalek J, Leech AA, Tong T. Cost-effectiveness thresholds: the past, the present and the future. Pharmacoeconomics. 2018;36(5):509–22. https://doi.org/10.1007/s40273-017-0606-1.
Godman B, Bucsics A, Vella Bonanno P, Oortwijn W, Rothe CC, Ferrario A, Bosselli S, Hill A, Martin AP, Simoens S, Kurdi A, Gad M, Gulbinovic J, Timoney A, Bochenek T, Salem A, Hoxha I, Sauermann R, Massele A, Guerra AA Jr, Petrova G, Mitkova Z, Achniotou G, Laius O, Sermet C, Selke G, Kourafalos V, Yfantopoulos J, Magnusson E, Joppi R, Oluka M, Kwon HY, Jakupi A, Kalemeera F, Fadare JO, Melien O, Pomorski M, Wladysiuk M, Markovic-Pekovic V, Mardare I, Meshkov D, Novakovic T, Furst J, Tomek D, Zara C, Diogene E, Meyer JC, Malmstrom R, Wettermark B, Matsebula Z, Campbell S, Haycox A. Barriers for Access to New Medicines: Searching for the Balance Between Rising Costs and Limited Budgets. Front Public Health. 2018;6:328. https://doi.org/10.3389/fpubh.2018.00328.
Vogler S. Fair prices for medicines? Exploring competent authorities’ and public payers’ preferences on pharmaceutical policies. Empirica. 2019;46(3):443–69. https://doi.org/10.1007/s10663-019-09446-5.
Walton MJ, O'Connor J, Carroll C, Claxton L, Hodgson R. A review of issues affecting the efficiency of decision making in the NICE single technology appraisal process. Pharmacoecon Open. 2019;3(3):403–10. https://doi.org/10.1007/s41669-018-0113-0.
Oliver A. Lowering the bucks for the bang: viewing pharmaceutical price negotiations through a behavioural lens. Behav Public Policy. 2019:1–12. https://doi.org/10.1017/bpp.2019.15.
Camerer C, Loewenstein G. Chapter 1: Behavioural economics - past, present & future. In: Camerer C, Loewenstein G, Rabin M, editors. Advances in behavioral economics, vol. roundtable series in behavioral economics. Princeton: Princeton University Press; 2004. p. 1–61.
DellaVigna S. Psychology and economics: evidence from the field. J Econ Lit. 2009;47(2):315–72.
Mathis K, Steffen AD. From Rational Choice to Behavioural Economics. In: European Perspectives on Behavioural Law and Economics, vol. 2. European Perspectives on Behavioural Law and Economics. Economic Analysis of Law in European Legal Scholarship. Cham: Springer; 2015. p. 31–48.
Chetty R. Behavioral economics and public policy: A pragmatic perspective. Am Econ Rev. 2015;105(5):1–33.
Wettstein DJ, Boes S. The impact of reimbursement negotiations on cost and availability of new pharmaceuticals: evidence from an online experiment. Health Econ Rev. 2020;10(1):13. https://doi.org/10.1186/s13561-020-00267-y.
Zeiler K. What explains observed reluctance to trade? A comprehensive literature review. In: Zeiler JTaK, editor. Research Handbook on Behavioral Law and Economics. Research Handbooks in Law and Economics series; 2018. p. 347–430.
Aitken M, Kleinrock M. Global oncology trend report - A review of 2015 and outlook to 2020. In: IMS Institute for Healthcare Informatics; 2016.
Aitken M, Kleinrock M, Simorellis A, Nass D. Global oncology trends 2018, Innovation, Expansion and Disruption. In: IQVIA Institute for Human Data Science; 2018.
Aitken M, Kleinrock M, Nass D, Simorellis A. Global oncology trends 2019, therapeutics, clinical development and health system implications. In: IQVIA Institute for Human Data Science; 2019.
Korobkin R. Wrestling with the Endowment Effect, or How to do Law and Economics without the Coase Theorem. In: Zamir E, Teichman D, editors. The Oxford Handbook of Behavioral Economics and the Law, vol. 300; 2014. p. 323–6.
Knetsch JL. The endowment effect and evidence of nonreversible indifference curves. Am Econ Rev. 1989;79(5):1277–84.
Kahneman D, Knetsch JL, Thaler RH. Anomalies: the endowment effect, loss aversion, and status quo bias. J Econ Perspect. 1991;5(1):193–206.
Kahneman D, Knetsch JL, Thaler RH. Experimental tests of the endowment effect and the Coase theorem. J Polit Econ. 1990;98(6):1325–48.
Loewenstein G, Adler D. A bias in the prediction of tastes. Econ J. 1995;105(431):929–37.
Carmon Z, Ariely D. Focusing on the forgone: how value can appear so different to buyers and sellers. J Consum Res. 2000;27(3):360–70. https://doi.org/10.1086/317590.
Engelmann D, Hollard G. Reconsidering the effect of market experience on the “endowment effect”. Econometrica. 2010;78(6):2005–19.
Ratan A. Anticipated regret or endowment effect? A reconsideration of exchange asymmetry in laboratory experiments. BE J Econ Anal Policy. 2013;14(1):277–98.
Isoni A. The willingness-to-accept/willingness-to-pay disparity in repeated markets: loss aversion or ‘bad-deal’ aversion? Theor Decis. 2011;71(3):409–30. https://doi.org/10.1007/s11238-010-9207-6.
Weaver R, Frederick S. A reference price theory of the endowment effect. J Mark Res. 2012;49(5):696–707.
Arlen J, Tontrup S. Does the endowment effect justify legal intervention? The debiasing effect of institutions. J Legal Stud. 2015;44(1):143–82.
Nayakankuppam D, Mishra H. The endowment effect: rose-tinted and dark-tinted glasses. J Consum Res. 2005;32(3):390–5.
Okada EM. Uncertainty, risk aversion, and WTA vs. WTP Mark Sci. 2010;29(1):75–84.
Johnson EJ, Haubl G, Keinan A. Aspects of endowment: a query theory of value construction. J. Exp Psychol Learn Mem Cogn. 2007;33(3):461–74. https://doi.org/10.1037/0278-7322.214.171.1241.
Ashby NJ, Dickert S, Glöckner A. Focusing on what you own: biased information uptake due to ownership. Judgm Decis Mak. 2012;7(3):254.
Pachur T, Scheibehenne B. Constructing preference from experience: the endowment effect reflected in external information search. J Exp Psychol Learn Mem Cogn. 2012;38(4):1108–16. https://doi.org/10.1037/a0027637.
Boyce RR, Brown TC, McClelland GH, Peterson GL, Schulze WD. An experimental examination of intrinsic values as a source of the WTA-WTP disparity. Am Econ Rev. 1992;82(5):1366–73.
Walker ME, Morera OF, Vining J, Orland B. Disparate WTA–WTP disparities: the influence of human versus natural causes. J Behav Decis Mak. 1999;12(3):219–32.
Neumann PJ, Thorat T, Zhong Y, Anderson J, Farquhar M, Salem M, Sandberg E, Saret CJ, Wilkinson C, Cohen JT. A systematic review of cost-effectiveness studies reporting cost-per-DALY averted. Plos One. 2016;11(12):e0168512. https://doi.org/10.1371/journal.pone.0168512.
Woods B, Revill P, Sculpher M, Claxton K. Country-level cost-effectiveness thresholds: initial estimates and the need for further research. Value Health. 2016;19(8):929–35. https://doi.org/10.1016/j.jval.2016.02.017.
Nimdet K, Chaiyakunapruk N, Vichansavakul K, Ngorsuraches S. A systematic review of studies eliciting willingness-to-pay per quality-adjusted life year: does it justify CE threshold? Plos One. 2015;10(4):e0122760. https://doi.org/10.1371/journal.pone.0122760.
Feher MD, Brazier J, Schaper N, Vega-Hernandez G, Nikolajsen A, Bogelund M. Patients’ with type 2 diabetes willingness to pay for insulin therapy and clinical outcomes. BMJ Open Diabetes Res Care. 2016;4(1):e000192. https://doi.org/10.1136/bmjdrc-2016-000192.
Rowen D, Stevens K, Labeit A, Elliott J, Mulhern B, Carlton J, Basarir H, Ratcliffe J, Brazier J. Using a discrete-choice experiment involving cost to value a classification system measuring the quality-of-life impact of self-Management for Diabetes. Value Health. 2018;21(1):69–77. https://doi.org/10.1016/j.jval.2017.06.016.
Ryen L, Svensson M. The willingness to pay for a quality adjusted life year: A review of the empirical literature. Health Econ. 2015;24(10):1289–301. https://doi.org/10.1002/hec.3085.
Andreoni J, Miller J. Giving according to GARP: an experimental test of the consistency of preferences for altruism. Econometrica. 2002;70(2):737–53.
Charness G, Rabin M. Understanding social preferences with simple tests. Q J Econ. 2002;117(3):817–69.
Levitt SD, List JA. What do laboratory experiments measuring social preferences reveal about the real world? J Econ Perspect. 2007;21(2):153–74.
Godager G, Wiesen D. Profit or patients’ health benefit? Exploring the heterogeneity in physician altruism. J Health Econ. 2013;32(6):1105–16. https://doi.org/10.1016/j.jhealeco.2013.08.008.
Hennig-Schmidt H, Wiesen D. Other-regarding behavior and motivation in health care provision: an experiment with medical and non-medical students. Soc Sci Med. 2014;108:156–65. https://doi.org/10.1016/j.socscimed.2014.03.001.
Kesternich I, Schumacher H, Winter J. Professional norms and physician behavior: Homo oeconomicus or homo hippocraticus? J Public Econ. 2015;131:1–11. https://doi.org/10.1016/j.jpubeco.2015.08.009.
Hennig-Schmidt H, Selten R, Wiesen D. How payment systems affect physicians’ provision behaviour--an experimental investigation. J Health Econ. 2011;30(4):637–46. https://doi.org/10.1016/j.jhealeco.2011.05.001.
Brosig-Koch J, Hennig-Schmidt H, Kairies-Schwarz N, Wiesen D. Using artefactual field and lab experiments to investigate how fee-for-service and capitation affect medical service provision. J Econ Behav Organ. 2016;131:17–23.
Brosig-Koch J, Hennig-Schmidt H, Kairies-Schwarz N, Wiesen D. The effects of introducing mixed payment Systems for Physicians: experimental evidence. Health Econ. 2017;26(2):243–62. https://doi.org/10.1002/hec.3292.
Wang J, Iversen T, Hennig-Schmidt H, Godager G. Are patient-regarding preferences stable? Evidence from a laboratory experiment with physicians and medical students from different countries. Eur Econ Rev. 2020;125:103411.
Reif S, Hafner L, Seebauer M. Physician behavior under prospective payment schemes-evidence from Artefactual field and lab experiments. Int J Environ Res Public Health. 2020;17(15):5540. https://doi.org/10.3390/ijerph17155540.
Kairies-Schwarz N, Kokot J, Vomhof M, Weßling J. Health insurance choice and risk preferences under cumulative prospect theory–an experiment. J Econ Behav Organ. 2017;137:374–97.
Huck S, Lünser G, Spitzer F, Tyran J-R. Medical insurance and free choice of physician shape patient overtreatment: A laboratory experiment. J Econ Behav Organ. 2016;131:78–105.
Mimra W, Nemitz J, Waibel C. Voluntary pooling of genetic risk: A health insurance experiment. J Econ Behav Organ. 2019. https://doi.org/10.1016/j.jebo.2019.04.001.
Wettstein DJ, Boes S. Effectiveness of National Pricing Policies for patent-protected pharmaceuticals in the OECD: A systematic literature review. Appl Health Econ Health Policy. 2019;17(2):143–62. https://doi.org/10.1007/s40258-018-0437-z.
Alekseev A, Charness G, Gneezy U. Experimental methods: when and why contextual instructions are important. J Econ Behav Organ. 2017;134:48–59. https://doi.org/10.1016/j.jebo.2016.12.005.
IQVIA: EFPIA Patient W.A.I.T. Indicator 2018 survey. In: EFPIA, editor. European Federation of Pharmaceutical Industries and Associations (EFPIA),, online; 2019.
Vella Bonanno P, Bucsics A, Simoens S, Martin AP, Oortwijn W, Gulbinovic J, Rothe C, Timoney A, Ferrario A, Gad M, Salem A, Hoxha I, Sauermann R, Kamusheva M, Dimitrova M, Petrova G, Laius O, Selke G, Kourafalos V, Yfantopoulos J, Magnusson E, Joppi R, Jakupi A, Bochenek T, Wladysiuk M, Furtado C, Markovic-Pekovic V, Mardare I, Meshkov D, Furst J, Tomek D, Cortadellas MO, Zara C, Haycox A, Campbell S, Godman B. Proposal for a regulation on health technology assessment in Europe - opinions of policy makers, payers and academics from the field of HTA. Expert Rev Pharmacoecon Outcomes Res. 2019;19(3):251–61. https://doi.org/10.1080/14737167.2019.1575730.
Schumacher H, Kesternich I, Kosfeld M, Winter J. One, two, many—insensitivity to group size in games with concentrated benefits and dispersed costs. Rev Econ Stud. 2017;84(3):1346–77.
Kos M. Medicine Prices in European Countries. In: Vogler S, editor. Medicine Price Surveys, Analyses and Comparisons. London: Elsevier; 2019. p. 11–29.
OECD. Government at a glance 2017: Government at a Glance; 2017.
Engelmann D, Strobel M. Inequality aversion, efficiency, and maximin preferences in simple distribution experiments. Am Econ Rev. 2004;94(4):857–69.
Fisman R, Kariv S, Markovits D. Individual preferences for giving. Am Econ Rev. 2007;97(5):1858–76.
Bruhin A, Fehr E, Schunk D. The many faces of human sociality: uncovering the distribution and stability of social preferences. J Eur Econ Assoc. 2018;17(4):1025–69.
Tversky A, Kahneman D. Judgment under uncertainty: heuristics and biases. Science. 1974;185(4157):1124–31. https://doi.org/10.1126/science.185.4157.1124.
Thomas KA, Clifford S. Validity and mechanical Turk: an assessment of exclusion methods and interactive experiments. Comput Human Behav. 2017;77:184–97.
Berinsky AJ, Margolis MF, Sances MW. Separating the shirkers from the workers? Making sure respondents pay attention on self-administered surveys. Am J Pol Sci. 2014;58(3):739–53.
Brandts J, Charness G. The strategy versus the direct-response method: a first survey of experimental comparisons. Exp Econ. 2011;14(3):375–98. https://doi.org/10.1007/s10683-011-9272-x.
Hergueux J, Jacquemet N. Social preferences in the online laboratory: a randomized experiment. Exp Econ. 2015;18(2):251–83.
Johnston RJ, Boyle KJ, Adamowicz W, Bennett J, Brouwer R, Cameron TA, Hanemann WM, Hanley N, Ryan M, Scarpa R. Contemporary guidance for stated preference studies. J Assoc Environ Resour Econ. 2017;4(2):319–405.
Vossler CA, Holladay JS. Alternative value elicitation formats in contingent valuation: mechanism design and convergent validity. J Public Econ. 2018;165:133–45. https://doi.org/10.1016/j.jpubeco.2018.07.004.
Becker GM, DeGroot MH, Marschak J. Measuring utility by a single-response sequential method. Behav Sci. 1964;9(3):226–32. https://doi.org/10.1002/bs.3830090304.
Messer KD, Poe GL, Rondeau D, Schulze WD, Vossler CA. Social preferences and voting: an exploration using a novel preference revealing mechanism. J Public Econ. 2010;94(3–4):308–17. https://doi.org/10.1016/j.jpubeco.2009.12.004.
Cason TN, Plott CR. Misconceptions and game form recognition: challenges to theories of revealed preference and framing. J Polit Econ. 2014;122(6):1235–70.
Dave C, Eckel CC, Johnson CA, Rojas C. Eliciting risk preferences: when is simple better? J Risk Uncertain. 2010;41(3):219–43.
Carson RT, Groves T. Incentive and informational properties of preference questions. Environ Resour Econ. 2007;37(1):181–210.
Lunander A. Inducing incentives to understate and to overstate willingness to pay within the open-ended and the dichotomous-choice elicitation formats: an experimental study. J Environ Econ Manage. 1998;35(1):88–102. https://doi.org/10.1006/jeem.1997.1016.
Vogler S. Medicine Price surveys, Analyses and Comparisons: Evidence and Methodology Guidance. London: Elsevier; 2018.
Mardetko N, Kos M, Vogler S. Review of studies reporting actual prices for medicines. Expert Rev Pharmacoecon Outcomes Res. 2019;19(2):159–79. https://doi.org/10.1080/14737167.2019.1552137.
Difallah D, Filatova E, Ipeirotis P. Demographics and dynamics of mechanical Turk workers. In: Proceedings of the eleventh ACM international conference on web search and data mining; 2018. p. 135–43.
Buhrmester M, Kwang T, Gosling SD. Amazon's mechanical Turk: A new source of inexpensive, yet high-quality, data? Perspect Psychol Sci. 2011;6(1):3–5. https://doi.org/10.1177/1745691610393980.
Clifford S, Jewell RM, Waggoner PD. Are samples drawn from mechanical Turk valid for research on political ideology? Res Pol. 2015;2(4):2053168015622072. https://doi.org/10.1177/2053168015622072.
Höglinger M, Wehrli S. Measuring social preferences on Amazon mechanical Turk; 2017.
Coppock A. Generalizing from survey experiments conducted on mechanical Turk: A replication approach. Polit Sci Res Methods. 2018;7(3):613–28. https://doi.org/10.1017/psrm.2018.10.
Johnson D, Ryan J. Amazon mechanical turk workers can provide consistent and economically meaningful data; 2018.
Arechar AA, Kraft-Todd G, Rand DG. Turking overtime: how participant characteristics and behavior vary over time and day on Amazon mechanical Turk. J Econ Sci Assoc. 2017;3(1):1–11. https://doi.org/10.1007/s40881-017-0035-0.
Goodman JK, Cryder CE, Cheema A. Data collection in a flat world: the strengths and weaknesses of mechanical Turk samples. J Behav Decis Mak. 2013;26(3):213–24.
Arechar AA, Gachter S, Molleman L. Conducting interactive experiments online. Exp Econ. 2018;21(1):99–131. https://doi.org/10.1007/s10683-017-9527-2.
du Plessis D, Sake JK, Halling K, Morgan J, Georgieva A, Bertelsen N. Patient centricity and pharmaceutical companies: is it feasible? Ther Innov Regul Sci. 2017;51(4):460–7. https://doi.org/10.1177/2168479017696268.
Katsanis, L.P., Pitta, D., Morinville, A.: Patient centricity: lip service or genuine commitment? A qualitative examination of the pharmaceutical industry. International Journal of Pharmaceutical and Healthcare Marketing ahead-of-print(ahead-of-print) (2020). doi: https://doi.org/10.1108/ijphm-02-2020-0010.
Dakin H, Devlin N, Feng Y, Rice N, O'Neill P, Parkin D. The influence of cost-effectiveness and other factors on Nice decisions. Health Econ. 2015;24(10):1256–71. https://doi.org/10.1002/hec.3086.
Jommi C, Armeni P, Costa F, Bertolani A, Otto M. Implementation of value-based pricing for medicines. Clin Ther. 2020;42(1):15–24. https://doi.org/10.1016/j.clinthera.2019.11.006.
Panteli D, Arickx F, Cleemput I, Dedet G, Eckhardt H, Fogarty E, Gerkens S, Henschke C, Hislop J, Jommi C, Kaitelidou D, Kawalec P, Keskimaki I, Kroneman M, Lopez Bastida J, Pita Barros P, Ramsberg J, Schneider P, Spillane S, Vogler S, Vuorenkoski L, Wallach Kildemoes H, Wouters O, Busse R. Pharmaceutical regulation in 15 European countries review. Health Syst Transit. 2016;18(5):1–122.
Lunn P. Regulatory policy and Behavioural economics. Paris: OECD Publishing; 2014.
Oliver A. Behavioural public policy. Cambridge: Cambridge University Press; 2013.
Geiger N. Behavioural economics and economic policy: A comparative study of recent trends. OEconomia. 2016;6-1(6–1):81–113. https://doi.org/10.4000/oeconomia.2230.
Oliver A. The origins of Behavioural public policy. Cambridge: Cambridge University Press; 2017.
Galizzi MM, Wiesen D. Behavioral experiments in health economics. In: Hamilton JH, Dixit A, Edwards S, Judd K editors. Oxford Research Encyclopedia of Economics and Finance. Oxford: Oxford University Press; 2018.
Cox JC, Green EP, Hennig-Schmidt H. Experimental and behavioral economics of healthcare. J Econ Behav Organ. 2016;131:A1–4. https://doi.org/10.1016/j.jebo.2016.10.011.
Levitt SD, List JA. On the generalizability of lab behaviour to the field. Can J Econ. 2007;40(2):347–70.
Falk A, Heckman JJ. Lab experiments are a major source of knowledge in the social sciences. Science. 2009;326(5952):535–8. https://doi.org/10.1126/science.1168244.
Camerer C. The promise and success of lab-field generalizability in experimental economics: A critical reply to Levitt and List. Available at SSRN 1977749; 2011.
Riedl A, Smeets P. Why do Investors hold socially responsible mutual funds? J Financ. 2017;72(6):2505–50. https://doi.org/10.1111/jofi.12547.
Galizzi MM, Navarro-Martínez D. On the external validity of social preference games: a systematic lab-field study. Manag Sci. 2019;65(3):976–1002.
Kessler JB, Vesterlund L. The external validity of laboratory experiments: the misleading emphasis on quantitative effects. In: R., G., Fréchette, A., editor. Handbook of experimental economic methodology, vol. 18. UK: Oxford University Press Oxford; 2015. p. 391–406.
Shogren J. Behavioural economics and environmental incentives; 2012. https://doi.org/10.1787/5k8zwbhqs1xn-en.
Feng L, Seasholes MS. Do investor sophistication and trading experience eliminate behavioral biases in financial markets? Rev Finance. 2005;9(3):305–51. https://doi.org/10.1007/s10679-005-2262-0.
Chen G, Kim KA, Nofsinger JR, Rui OM. Trading performance, disposition effect, overconfidence, representativeness bias, and experience of emerging market investors. J Behav Decis Mak. 2007;20(4):425–51.
Kourtidis D, Ševi Ž, Chatzoglou P. Investors’ trading activity, a behavioural perspective: professionals vs. individuals. Int J Behav Account Finance. 2011;2(3–4):346–66.
Kourtidis D, Šević Ž, Chatzoglou P. Investors’ trading activity: A behavioural perspective and empirical results. J Socio-Econ. 2011;40(5):548–57. https://doi.org/10.1016/j.socec.2011.04.008.
Chang TY, Solomon DH, Westerfield MM. Looking for someone to blame: delegation, cognitive dissonance, and the disposition effect. J Financ. 2016;71(1):267–302.
Chiang Y-M, Hirshleifer D, Qian Y, Sherman AE. Do investors learn from experience? Evidence from frequent IPO investors. Rev Financ Stud. 2011;24(5):1560–89.
Zahera SA, Bansal R. A study of prominence for disposition effect: a systematic review. Qual Res Financ Markets. 2019;11(1):2–21. https://doi.org/10.1108/qrfm-07-2018-0081.
Forman J, Horton J. Overconfidence, position size, and the link to performance. J Empir Financ. 2019;53:291–309.
Kagel JH, Roth AE. The handbook of experimental economics, Volume 2. Princeton: Princeton University Press; 2020.
The authors thank Dr. Stefan Wehrli and Lea Imhof from the ETH Decision Science Laboratory (DeSciL) for their support to implement the experiment on Amazon MTurk.
No funding was received from any private or public institution.
Ethics approval and consent to participate
The research design was submitted to the national database BASEC (Business Administration System for Ethics Committees) of swissethics (Swiss Ethics Committees on research involving humans) before the implementation of the experiment. The local ethics commission of the Canton of Zurich reviewed the application and confirmed that the project does not fall within the scope of the Swiss Human Research Act (HRA) and therefore no authorization was required. Participants had to state an informed consent (electronically) prior to the experiment.
Consent for publication
All authors provided consent for their names to be included for publication.
The first author (DWE) is an external PhD student at the University of Lucerne and was employed at the pharmaceutical company Takeda Pharma AG Switzerland and at the health insurance company Helsana AG while performing this research. Neither the former nor the later employer of the first author nor any other private or public entity supported or influenced this research in any relevant way. The second author (SB) has no conflicts of interest to declare. All of the authors submitted a signed Conflict of Interest disclosure form.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Wettstein, D.J., Boes, S. Assessing social preferences in reimbursement negotiations for new Pharmaceuticals in Oncology: an experimental design to analyse willingness to pay and willingness to accept. BMC Health Serv Res 21, 234 (2021). https://doi.org/10.1186/s12913-021-06231-8
- Willingness to pay
- Willingness to accept
- Medicines regulation
- Pharmaceutical policies
- Value-based pricing
- Behavioural economics