Technical Report: Coopetition in Heterogeneous Cross-Silo Federated Learning

Chao Huang    Justin Dachille    and Xin Liu
Abstract

In cross-silo federated learning (FL), companies collaboratively train a shared global model without sharing heterogeneous data. Prior related work focused on algorithm development to tackle data heterogeneity. However, the dual problem of coopetition, i.e., FL collaboration and market competition, remains under-explored. This paper studies the FL coopetition using a dynamic two-period game model. In period 1, an incumbent company trains a local model and provides model-based services at a chosen price to users. In period 2, an entrant company enters, and both companies decide whether to engage in FL collaboration and then compete in selling model-based services at different prices to users. Analyzing the two-period game is challenging due to data heterogeneity, and that the incumbent’s period one pricing has a temporal impact on coopetition in period 2, resulting in a non-concave problem. To address this issue, we decompose the problem into several concave sub-problems and develop an algorithm that achieves a global optimum. Numerical results on three public datasets show two interesting insights. First, FL training brings model performance gain as well as competition loss, and collaboration occurs only when the performance gain outweighs the loss. Second, data heterogeneity can incentivize the incumbent to limit market penetration in period 1 and promote price competition in period 2.

\paperid

123

1 Introduction

Cross-silo federated learning (FL) is a distributed machine learning paradigm where multiple companies or organizations train a shared model collaboratively without directly exchanging local data [11]. Typically, the process involves each participant training a local model on their dataset and then sharing model updates with a coordinating server that aggregates these updates to improve a global model. This local training and aggregation iteration continues until the global model converges [10]. Potential application scenarios for cross-silo FL are abundant [7]. For example, in healthcare, hospitals can collaborate on medical research (e.g., disease diagnosis) without sharing sensitive patient data. In finance, different banks can use FL for improved fraud detection without exposing customer data. In smart manufacturing, companies can enhance predictive maintenance without revealing proprietary operational data.

While a significant volume of recent research has focused on improving the model performance in cross-silo FL (e.g., [42, 19, 21]), the critical aspect of market competition remains less explored. In practice, companies can utilize the shared global model developed through FL to offer model-based services, hence competing for the same pool of potential users [40]. This competition can manifest across various industries. In healthcare, hospitals may compete to attract patients by offering diagnostic services or personalized treatment plans. In finance, banks might leverage an enhanced fraud detection system to attract customers by offering greater security and reduced risk of fraud. In smart manufacturing, companies could use the shared global model to optimize operations and attract clients/partnerships via more efficient production services. The dual focus of FL collaboration and market competition is termed as FL coopetition, which is the focus of this paper.

Refer to caption
Figure 1: The two-period system model.

While a few recent studies looked at FL coopetition [9, 34, 38, 40], they focused on static competition and overlooked the important aspect of dynamic market entry. Market entrant usually serves as a catalyst for technology innovation (e.g., model and algorithm development in cross-silo FL) and helps provide insights into the interactions between incumbents and entrants [2]. This also impacts the socioeconomic landscape by promoting coopetition and increasing the quality of services (via FL) available to users in the market.

In this paper, we study FL coopetition using a dynamic market entry model that spans two time periods involving three entities: users, an incumbent company, and an entrant company (see Fig. 1):

  • Period 1: Only an incumbent company exists in the market, providing users with the model-based service. For example, consider a healthcare provider that has developed a new AI-based tool for early diagnosis of a specific disease and is the only service provider in the market. The company trains a local model on its dataset and offers services to patients at a chosen price.

  • Period 2: An entrant company enters the market and coexists with the incumbent in providing user services. The incumbent and the entrant need to decide strategically on whether to engage in FL collaboration. If they collaborate, they will use a jointly developed global model to enhance and offer their diagnostic services. If they opt against collaboration, each will continue using individually trained models. In addition, since users can now choose between different service providers, there is a price competition between the incumbent and the entrant.

Given this dynamic model of FL coopetition, our first question is:

Question 1.

Will the incumbent collaborate with the entrant via FL in the presence of competition?

To answer Question 1, we model the FL collaboration and market competition as two (intertwined) games between the incumbent and the entrant. Each company aims to maximize its profit (from selling model-based services to users). We are interested in solving the game-theoretical equilibrium, which is highly non-trivial due to two reasons. First, companies in practice are heterogeneous regarding their data distributions and quantities [28]. Second, users may exhibit heterogeneous preferences towards companies’ services (even if the services have similar qualities). This can happen when some users find that a particular company’s service matches more closely with their individual characteristics, e.g., demographic profiles and personal attributes [25]. It is challenging to analyze (or even model) the game. To address this issue, we resort to the renowned Hotelling model in economics [3] and generalize it to model various types of heterogeneity. We characterized the equilibrium with arbitrary data distributions/quantities of companies and under minor assumptions about users’ preferences.

Among aforementioned heterogeneity types, data heterogeneity receives the most research attention from the FL community and continues to be the major bottleneck [41]. There are many excellent studies on algorithm development to mitigate the client drift issue caused by data heterogeneity, e.g., [15, 16, 31, 17]. We will show that our model and analysis are orthogonal to any FL algorithms. Since we focus on FL coopetition, our second key question is:

Question 2.

How does data heterogeneity affect FL collaboration and market competition?

To answer Question 2, we solve the equilibrium of the two-period game and conduct numerical experiments on three public datasets under various levels of data heterogeneity. Notice that the decision-making of the incumbent’s pricing in period 1 presents challenges due to its temporal impacts. That is, it not only influences user decisions in period 1, but also affects the decisions of both companies regarding FL collaboration (with heterogeneous data) and price competition, as well as user decisions in period 2. We will show that this interdependence makes the equilibrium analysis a challenging non-concave problem.

Our key contributions are summarized as follows:

  • To our best knowledge, this is the first work to study FL coopetition from a temporal market entry perspective. We formulate a generic two-period model that accommodates heterogeneity including companies’ data distributions and quantities as well as users’ service preferences. Our model is also orthogonal to any FL algorithms.

  • We provide game-theoretical solutions to the two-period model. The analysis involves solving a non-concave problem. To tackle this challenge, we decompose the problem into several manageable concave sub-problems and further develop an algorithm that achieves the global optimum.

  • We conduct numerical experiments on three public datasets and show two interesting results. First, even under highly heterogeneous data, FL training improves model performance (compared to local learning). However, companies may avoid collaborating in the presence of competition, as FL collaboration also benefits the competitor. Second, data heterogeneity can incentivize the incumbent to limit market penetration in period 1 and promote price competition in period 2.

1.1 Related Work

Heterogeneous FL. Data heterogeneity, also commonly referred to as non-IID data,111Non-IID means not identically and independently distributed. is known to cause significant performance loss in FL due to the client drift issue. Many studies have focused on algorithm development to tackle client drift. One commonly used technique is regularization by adding proximal terms to restrain local updates with respect to the global model, e.g., FedProx [16], SCAFFOLD [12], MOON [15], FedUV [31]. Other works have approached the problem using personalization (e.g., [24, 27]) and local learning generality (e.g., [23]).

This paper focuses on how data heterogeneity affects FL coopetition. Our model and analysis are orthogonal to any FL algorithms mentioned above and beyond.

Incentives for cross-silo FL. Our work is related to game theoretical and, in particular, incentive studies for cross-silo FL, e.g., [35, 43, 6, 18, 7]. For example, Tang and Wong [35] proposed an auction-like mechanism to encourage organizations’ FL training participation. Zhang et al. [43] studied how organizations participate in long-term collaboration. However, these studies overlooked the important aspect of market competition.

Only until very recently, a few papers have looked at market competition in cross-silo FL [9, 34, 40, 38]. Huang et al. [9] studied an oligopoly market. Wu and Yu [40] focused on a fully competitive market. Tan [34] developed an algorithm to find stable collaboration structures among companies. Tsoy and Konstantinov [38] studied both the price and quantity competitions in cross-silo FL. However, these studies did not study how data heterogeneity affects competition. More importantly, they looked at a static competition model while we study a dynamic model where the the incumbent and entrant have temporal interactions. The dynamic model renders the problem a challenging non-concave one.

Market competition in economics. Market competition is an extensively studied topic in economics, e.g., see [36, 33] for comprehensive discussions on theories of competition. Most pertaining to our work is the renowned Hotelling model (e.g., [25, 4]), where companies at different spatial locations compete for users. Our model and analysis differ from the conventional Hotelling due to the unique features of data heterogeneity and collaborative training in FL. That is, prior studies considered that each company can independently decide their service qualities to attract users. In cross-silo FL, however, the qualities of companies’ model-based services are dependent on their heterogeneous data distributions and the FL collaboration strategies. This makes our analysis more challenging than and conclusions different from prior literature.

In summary, to our best knowledge, this paper is the first to study FL coopetition using a dynamic market entry model and also considering data heterogeneity.

The rest of the paper is organized as follows. Sec. 2 introduces the system model, and Sec. 3 provides both analytical and algorithmic solutions. Sec. 4 presents the numerical results. We discuss the extensions of our model in Sec. 5 and conclude in Sec. 6.

2 Model

Sec. 2.1 introduces the companies and users. Sec. 2.2 models their two-period game-theoretical interactions.

2.1 Companies and Users

Companies. We consider a two-period interaction with time index t𝒯={1,2}𝑡𝒯12t\in\mathcal{T}=\{1,2\}italic_t ∈ caligraphic_T = { 1 , 2 }. In period t=1𝑡1t=1italic_t = 1, there is an incumbent company I𝐼Iitalic_I who holds a private dataset 𝒟Isubscript𝒟𝐼\mathcal{D}_{I}caligraphic_D start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT with size DI=|𝒟I|subscript𝐷𝐼subscript𝒟𝐼D_{I}=|\mathcal{D}_{I}|italic_D start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT = | caligraphic_D start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT |. It trains a machine learning model using 𝒟Isubscript𝒟𝐼\mathcal{D}_{I}caligraphic_D start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT and provides model-based services (e.g., disease diagnosis) to users at a price pI,1subscript𝑝𝐼1p_{I,1}italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT. In period t=2𝑡2t=2italic_t = 2, an entrant company E𝐸Eitalic_E who has a local private dataset 𝒟Esubscript𝒟𝐸\mathcal{D}_{E}caligraphic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT with size DE=|𝒟E|subscript𝐷𝐸subscript𝒟𝐸D_{E}=|\mathcal{D}_{E}|italic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT = | caligraphic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT | enters the market.222See Sec. 5 for discussions on how to extend our model and analysis 1) with more than two companies, and 2) across t3𝑡3t\geq 3italic_t ≥ 3 periods. Since both companies I𝐼Iitalic_I and E𝐸Eitalic_E are in the market, they can choose to collaborate via FL to improve the performance of their models and the quality of model-based services. We use ri{1,0},i{I,E}formulae-sequencesubscript𝑟𝑖10𝑖𝐼𝐸r_{i}\in\{1,0\},i\in\{I,E\}italic_r start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ { 1 , 0 } , italic_i ∈ { italic_I , italic_E } to denote the collaboration decision, where ri=1subscript𝑟𝑖1r_{i}=1italic_r start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 1 means FL collaboration and ri=0subscript𝑟𝑖0r_{i}=0italic_r start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 0 means no collaboration. More specifically,

  • If rIrE=1subscript𝑟𝐼subscript𝑟𝐸1r_{I}\cdot r_{E}=1italic_r start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT ⋅ italic_r start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT = 1, both companies train a shared global model using 𝒟Isubscript𝒟𝐼\mathcal{D}_{I}caligraphic_D start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT and 𝒟Esubscript𝒟𝐸\mathcal{D}_{E}caligraphic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT till convergence. They will use the converged model to generate model-based services.

  • If rIrE=0subscript𝑟𝐼subscript𝑟𝐸0r_{I}\cdot r_{E}=0italic_r start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT ⋅ italic_r start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT = 0, each company uses its own data to train a local model, based on which services are generated.

We use qi,tsubscript𝑞𝑖𝑡q_{i,t}italic_q start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT to denote company i𝑖iitalic_i’s quality of service in period t𝑡titalic_t. In particular, qi,2subscript𝑞𝑖2q_{i,2}italic_q start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT is a function of 𝒓={rI,rE}𝒓subscript𝑟𝐼subscript𝑟𝐸\boldsymbol{r}=\{r_{I},r_{E}\}bold_italic_r = { italic_r start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT , italic_r start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT } that depends on both companies’ decisions on FL collaboration (and data heterogeneity), which we refer to as qi,2(𝒓)subscript𝑞𝑖2𝒓q_{i,2}(\boldsymbol{r})italic_q start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT ( bold_italic_r ) hereafter.

Besides the potential FL collaboration, both companies compete for the same users at prices pi,2subscript𝑝𝑖2p_{i,2}italic_p start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT. For simplicity, let 𝒑={pI,1,pI,2,pE,2}.𝒑subscript𝑝𝐼1subscript𝑝𝐼2subscript𝑝𝐸2\boldsymbol{p}=\{p_{I,1},p_{I,2},p_{E,2}\}.bold_italic_p = { italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT , italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT , italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT } . Next, we introduce how users respond to companies’ model-based services and prices.

Users. The market consists of a large number of users. To model this, we consider a continuum of users and normalize the total mass to be one. Each user decides whether to buy the service, and if so, from which company. Similar to [20], we assume that each user buys at most once, i.e., if a user buys from company I𝐼Iitalic_I in period 1, it will not buy from company I𝐼Iitalic_I or E𝐸Eitalic_E in period 2 due to them offering substitutable services. Let dn,t{,i}subscript𝑑𝑛𝑡𝑖d_{n,t}\in\{\emptyset,i\}italic_d start_POSTSUBSCRIPT italic_n , italic_t end_POSTSUBSCRIPT ∈ { ∅ , italic_i } denote user n𝑛nitalic_n’s purchasing decision in period t𝑡titalic_t, where dn,t=subscript𝑑𝑛𝑡d_{n,t}=\emptysetitalic_d start_POSTSUBSCRIPT italic_n , italic_t end_POSTSUBSCRIPT = ∅ means no purchasing and dn,t=isubscript𝑑𝑛𝑡𝑖d_{n,t}=iitalic_d start_POSTSUBSCRIPT italic_n , italic_t end_POSTSUBSCRIPT = italic_i means purchasing from company i𝑖iitalic_i.

A user’s payoff consists of three parts discussed below.

  • First, a user obtains a utility from enjoying the model-based service, where the utility increases in the quality of service qi,tsubscript𝑞𝑖𝑡q_{i,t}italic_q start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT. This paper considers a linear function (e.g., [20]). It is easy to extend the model to an arbitrary non-decreasing function (similar to [9]).

  • Second, users have heterogeneous preferences towards companies’ services. This can arise due to the traveling cost or brand loyalty over a particular company [25]. In the context of cross-silo FL, some users may find that a company’s service matches more closely with their individual characteristics (e.g., demographic profiles and personal attributes). To model users’ heterogeneous preferences, we use the renowned Hotelling model [3, 36], in which users are located on a line [0,1]01[0,1][ 0 , 1 ] and companies I𝐼Iitalic_I and E𝐸Eitalic_E are at the ending points ϕ=0italic-ϕ0\phi=0italic_ϕ = 0 and ϕ=1italic-ϕ1\phi=1italic_ϕ = 1, respectively. The users’ locations ϕnsubscriptitalic-ϕ𝑛\phi_{n}italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT follow a distribution with a PDF h(ϕ)italic-ϕh(\phi)italic_h ( italic_ϕ ) and a CDF H(ϕ)𝐻italic-ϕH(\phi)italic_H ( italic_ϕ ), which are known to both companies due to market research [20, 9]. Here, we model a user’s preference misalignment as the distance to the company where the service is purchased.333The Hotelling model and its variants (e.g., [29]) have been widely used to model user preferences in the economics literature. One can understand such preference misalignment as the dissimilarity between users’ characteristics and the companies’ data distributions.

  • Third, a user needs to pay a price for the service.

Next, we define a user n𝑛nitalic_n’s payoff as follows:

un,t(dn,t;𝒓,𝒑)subscript𝑢𝑛𝑡subscript𝑑𝑛𝑡𝒓𝒑\displaystyle u_{n,t}(d_{n,t};\boldsymbol{r},\boldsymbol{p})italic_u start_POSTSUBSCRIPT italic_n , italic_t end_POSTSUBSCRIPT ( italic_d start_POSTSUBSCRIPT italic_n , italic_t end_POSTSUBSCRIPT ; bold_italic_r , bold_italic_p ) (1)
={0,if dn,t=,wqqI,t(𝒓)wϕϕnwppI,t,if dn,t=I,wqqE,t(𝒓)utilitywϕ(1ϕn)preferencemisalignmentwppE,tprice,if dn,t=E,absentcases0if subscript𝑑𝑛𝑡limit-fromsubscript𝑤𝑞subscript𝑞𝐼𝑡𝒓subscript𝑤italic-ϕsubscriptitalic-ϕ𝑛subscript𝑤𝑝subscript𝑝𝐼𝑡if subscript𝑑𝑛𝑡𝐼subscriptsubscript𝑤𝑞subscript𝑞𝐸𝑡𝒓utilitysubscriptsubscript𝑤italic-ϕ1subscriptitalic-ϕ𝑛preferencemisalignmentsubscriptsubscript𝑤𝑝subscript𝑝𝐸𝑡priceif subscript𝑑𝑛𝑡𝐸\displaystyle=\begin{cases}0,&\text{if }d_{n,t}=\emptyset,\\ w_{q}q_{I,t}(\boldsymbol{r})-\hskip 28.45274ptw_{\phi}\phi_{n}\hskip 25.60747% pt-w_{p}p_{I,t},&\text{if }d_{n,t}=I,\\ \underbrace{w_{q}q_{E,t}(\boldsymbol{r})}_{\rm utility}-\underbrace{w_{\phi}(1% -\phi_{n})}_{\rm preference~{}misalignment}-\underbrace{w_{p}p_{E,t}}_{\rm price% },&\text{if }d_{n,t}=E,\\ \end{cases}= { start_ROW start_CELL 0 , end_CELL start_CELL if italic_d start_POSTSUBSCRIPT italic_n , italic_t end_POSTSUBSCRIPT = ∅ , end_CELL end_ROW start_ROW start_CELL italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_I , italic_t end_POSTSUBSCRIPT ( bold_italic_r ) - italic_w start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT - italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_I , italic_t end_POSTSUBSCRIPT , end_CELL start_CELL if italic_d start_POSTSUBSCRIPT italic_n , italic_t end_POSTSUBSCRIPT = italic_I , end_CELL end_ROW start_ROW start_CELL under⏟ start_ARG italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_E , italic_t end_POSTSUBSCRIPT ( bold_italic_r ) end_ARG start_POSTSUBSCRIPT roman_utility end_POSTSUBSCRIPT - under⏟ start_ARG italic_w start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( 1 - italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_ARG start_POSTSUBSCRIPT roman_preference roman_misalignment end_POSTSUBSCRIPT - under⏟ start_ARG italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_E , italic_t end_POSTSUBSCRIPT end_ARG start_POSTSUBSCRIPT roman_price end_POSTSUBSCRIPT , end_CELL start_CELL if italic_d start_POSTSUBSCRIPT italic_n , italic_t end_POSTSUBSCRIPT = italic_E , end_CELL end_ROW

where wq,wϕ,wpsubscript𝑤𝑞subscript𝑤italic-ϕsubscript𝑤𝑝w_{q},w_{\phi},w_{p}italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT are positive constants. Without loss of generality, we normalize wϕsubscript𝑤italic-ϕw_{\phi}italic_w start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT to 1111. Note that in period 1, only company I𝐼Iitalic_I is in the market, so a user can only choose to buy from I𝐼Iitalic_I or not buy.

2.2 The Two-Period Game

We model the interactions among the two companies and users as a two-period game (see Fig. 2). At the beginning of period 1, company I𝐼Iitalic_I provides service using locally trained model with quality qI,1subscript𝑞𝐼1q_{I,1}italic_q start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT at price pI,1subscript𝑝𝐼1p_{I,1}italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT. Then, users make purchasing decisions. In period 2, company E𝐸Eitalic_E enters. Both companies decide whether to collaborate via FL 𝒓𝒓\boldsymbol{r}bold_italic_r, based on which they generate model-based services with quality qi,2(𝒓)subscript𝑞𝑖2𝒓q_{i,2}(\boldsymbol{r})italic_q start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT ( bold_italic_r ). Then, they initiate price competition by selling services at prices pi,2subscript𝑝𝑖2p_{i,2}italic_p start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT.

Users’ decision problem. We consider that users are myopic [13, 20]. That is, in period 1, if a user finds purchasing from company I𝐼Iitalic_I is better off than not purchasing, it will purchase without anticipating the potential entry of company E𝐸Eitalic_E and its service in period 2. Hence, each user’s decision problem is as follows.

P1:maxdn,tun,t(dn,t;𝒓,𝒑),t.\displaystyle\textbf{P1}:\quad\quad\max_{d_{n,t}}\;\;\;u_{n,t}(d_{n,t};% \boldsymbol{r},\boldsymbol{p}),\quad\quad\forall t.P1 : roman_max start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n , italic_t end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_n , italic_t end_POSTSUBSCRIPT ( italic_d start_POSTSUBSCRIPT italic_n , italic_t end_POSTSUBSCRIPT ; bold_italic_r , bold_italic_p ) , ∀ italic_t . (2)
Refer to caption
Figure 2: The two-period game.

Companies’ decision problem. Both companies aim to maximize their own profit. For company E𝐸Eitalic_E, its goal is to maximize the profit obtained in period 2, i.e.,

WE(𝒓,𝒑)=01(pE,2cE)𝟙dn,2=E(𝒓,𝒑)h(ϕ)𝑑ϕ,subscript𝑊𝐸𝒓𝒑superscriptsubscript01subscript𝑝𝐸2subscript𝑐𝐸subscript1subscript𝑑𝑛2𝐸𝒓𝒑italic-ϕdifferential-ditalic-ϕW_{E}(\boldsymbol{r},\boldsymbol{p})=\int_{0}^{1}(p_{E,2}-c_{E})\cdot\mathbbm{% 1}_{d_{n,2}=E}(\boldsymbol{r},\boldsymbol{p})\cdot h(\phi)d\phi,italic_W start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT ( bold_italic_r , bold_italic_p ) = ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT - italic_c start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT ) ⋅ blackboard_1 start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n , 2 end_POSTSUBSCRIPT = italic_E end_POSTSUBSCRIPT ( bold_italic_r , bold_italic_p ) ⋅ italic_h ( italic_ϕ ) italic_d italic_ϕ , (3)

where 𝟙1\mathbbm{1}blackboard_1 is an indicator function, meaning 𝟙dn,2=E=1subscript1subscript𝑑𝑛2𝐸1\mathbbm{1}_{d_{n,2}=E}=1blackboard_1 start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n , 2 end_POSTSUBSCRIPT = italic_E end_POSTSUBSCRIPT = 1 if and only if dn,2=Esubscript𝑑𝑛2𝐸d_{n,2}=Eitalic_d start_POSTSUBSCRIPT italic_n , 2 end_POSTSUBSCRIPT = italic_E. Here, cEsubscript𝑐𝐸c_{E}italic_c start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT is the marginal service cost.444Similar to [7], this paper normalizes the FL collaboration cost, in terms of computation and communication, to be zero. Our analysis and conclusions will not change even if we consider a non-zero cost. Hence, company E𝐸Eitalic_E’s problem is as follows.

P2:maxrE,pE,2WE(𝒓,𝒑).\textbf{P2}:\quad\quad\max_{r_{E},p_{E,2}}\;\;\;W_{E}(\boldsymbol{r},% \boldsymbol{p}).P2 : roman_max start_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT , italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_W start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT ( bold_italic_r , bold_italic_p ) . (4)

Unlike users who are myopic, company I𝐼Iitalic_I is assumed to be forward-looking [32].555This is reasonable due to companies usually having more resources and information than individual users when making decisions. That is, it will strategically set the price in period 1, anticipating the reactions of users and company E𝐸Eitalic_E in period 2. Here, we consider that company I𝐼Iitalic_I knows the information about qi,2(𝒓),𝒓subscript𝑞𝑖2𝒓for-all𝒓q_{i,2}(\boldsymbol{r}),\forall\boldsymbol{r}italic_q start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT ( bold_italic_r ) , ∀ bold_italic_r, which can be acquired via market research [20, 9]. In particular, company I𝐼Iitalic_I’s aims to maximize its two-period profits:

WI,1(pI,1)=01(pI,1cI)𝟙dn,1=I(pI,1)h(ϕ)𝑑ϕ,subscript𝑊𝐼1subscript𝑝𝐼1superscriptsubscript01subscript𝑝𝐼1subscript𝑐𝐼subscript1subscript𝑑𝑛1𝐼subscript𝑝𝐼1italic-ϕdifferential-ditalic-ϕW_{I,1}(p_{I,1})=\int_{0}^{1}(p_{I,1}-c_{I})\cdot\mathbbm{1}_{d_{n,1}=I}(p_{I,% 1})\cdot h(\phi)d\phi,italic_W start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ) = ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT - italic_c start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT ) ⋅ blackboard_1 start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n , 1 end_POSTSUBSCRIPT = italic_I end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ) ⋅ italic_h ( italic_ϕ ) italic_d italic_ϕ , (5)
WI,2(𝒓,𝒑)=01(pI,2cI)𝟙dn,2=I(𝒓,𝒑)h(ϕ)𝑑ϕ,subscript𝑊𝐼2𝒓𝒑superscriptsubscript01subscript𝑝𝐼2subscript𝑐𝐼subscript1subscript𝑑𝑛2𝐼𝒓𝒑italic-ϕdifferential-ditalic-ϕW_{I,2}(\boldsymbol{r},\boldsymbol{p})=\int_{0}^{1}(p_{I,2}-c_{I})\cdot% \mathbbm{1}_{d_{n,2}=I}(\boldsymbol{r},\boldsymbol{p})\cdot h(\phi)d\phi,italic_W start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT ( bold_italic_r , bold_italic_p ) = ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT - italic_c start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT ) ⋅ blackboard_1 start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n , 2 end_POSTSUBSCRIPT = italic_I end_POSTSUBSCRIPT ( bold_italic_r , bold_italic_p ) ⋅ italic_h ( italic_ϕ ) italic_d italic_ϕ , (6)

where cIsubscript𝑐𝐼c_{I}italic_c start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT is the marginal service cost. Hence, we formulate company I𝐼Iitalic_I’s problem below.

P3:maxrI,pI,1,pI,2WI(𝒓,𝒑)=WI,1(pI,1)+WI,2(𝒓,𝒑).\textbf{P3}:\quad\quad\max_{r_{I},p_{I,1},p_{I,2}}\;\;\;W_{I}(\boldsymbol{r},% \boldsymbol{p})=W_{I,1}(p_{I,1})+W_{I,2}(\boldsymbol{r},\boldsymbol{p}).P3 : roman_max start_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT , italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT , italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_W start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT ( bold_italic_r , bold_italic_p ) = italic_W start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ) + italic_W start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT ( bold_italic_r , bold_italic_p ) . (7)

Next, we solve the two-period game.

3 Solving the Two-Period Game

We first solve the user decision in Sec. 3.1. Then, we solve the companies’ period 2 decisions in Sec. 3.2. Sec. 3.3 presents an algorithm to solve company I𝐼Iitalic_I’s pricing in period 1. The proofs for all technical results are left to Appendices at the end of this paper.

3.1 Users’ Optimal Purchasing in Two Periods

Since users are myopic, we can solve users’ optimal purchasing in the two periods separately.

Lemma 1.

Each user n𝑛nitalic_n’s optimal purchasing in period 1 is

dn,1(pI,1)={I,ifϕnwqqI,1wppI,1,,else.subscriptsuperscript𝑑𝑛1subscript𝑝𝐼1cases𝐼ifsubscriptitalic-ϕ𝑛subscript𝑤𝑞subscript𝑞𝐼1subscript𝑤𝑝subscript𝑝𝐼1otherwiseelseotherwise\displaystyle d^{*}_{n,1}(p_{I,1})=\begin{cases}I,\;{\rm if}\;\phi_{n}\leq w_{% q}q_{I,1}-w_{p}p_{I,1},\\ \emptyset,\;{\rm else}.\end{cases}italic_d start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n , 1 end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ) = { start_ROW start_CELL italic_I , roman_if italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ≤ italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT - italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT , end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL ∅ , roman_else . end_CELL start_CELL end_CELL end_ROW (8)

Lemma 1 shows that a user is more likely to purchase from company I𝐼Iitalic_I if the model-based service has a better quality qI,1subscript𝑞𝐼1q_{I,1}italic_q start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT, or the price pI,1subscript𝑝𝐼1p_{I,1}italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT is lower, or the preference misalignment ϕnsubscriptitalic-ϕ𝑛\phi_{n}italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is smaller.

dn,2(𝒓,𝒑)subscriptsuperscript𝑑𝑛2𝒓𝒑\displaystyle d^{*}_{n,2}(\boldsymbol{r},\boldsymbol{p})italic_d start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n , 2 end_POSTSUBSCRIPT ( bold_italic_r , bold_italic_p ) (9)
={I,ifdn,1(pI,1)=,ϕn[0,max(min(wqqI,2(𝒓)wppI,2,1+wq(qI,2(𝒓)qE,2(𝒓))wp(pI,2pE,2)2,1),0)],E,ifdn,1(pI,1)=,ϕn[min(max(1wqqE,2(𝒓)+wppE,2,1+wq(qI,2(𝒓)qE,2(𝒓)wp(pI,2pE,2)2,0),1),1],,else.\displaystyle=\begin{cases}I,&{\rm if}\;d^{*}_{n,1}(p_{I,1})=\emptyset,\phi_{n% }\in\left[0,\max\left(\min\left(w_{q}q_{I,2}(\boldsymbol{r})-w_{p}p_{I,2},% \frac{1+w_{q}\left(q_{I,2}(\boldsymbol{r})-q_{E,2}(\boldsymbol{r})\right)-w_{p% }(p_{I,2}-p_{E,2})}{2},1\right),0\right)\right],\\ E,&{\rm if}\;d^{*}_{n,1}(p_{I,1})=\emptyset,\phi_{n}\in\left[\min\left(\max% \left(1-w_{q}q_{E,2}(\boldsymbol{r})+w_{p}p_{E,2},\frac{1+w_{q}\left(q_{I,2}(% \boldsymbol{r})-q_{E,2}(\boldsymbol{r}\right)-w_{p}(p_{I,2}-p_{E,2})}{2},0% \right),1\right),1\right],\\ \emptyset,\ &{\rm else}.\end{cases}= { start_ROW start_CELL italic_I , end_CELL start_CELL roman_if italic_d start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n , 1 end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ) = ∅ , italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ∈ [ 0 , roman_max ( roman_min ( italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT ( bold_italic_r ) - italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT , divide start_ARG 1 + italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT ( italic_q start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT ( bold_italic_r ) - italic_q start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT ( bold_italic_r ) ) - italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT - italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT ) end_ARG start_ARG 2 end_ARG , 1 ) , 0 ) ] , end_CELL end_ROW start_ROW start_CELL italic_E , end_CELL start_CELL roman_if italic_d start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n , 1 end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ) = ∅ , italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ∈ [ roman_min ( roman_max ( 1 - italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT ( bold_italic_r ) + italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT , divide start_ARG 1 + italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT ( italic_q start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT ( bold_italic_r ) - italic_q start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT ( bold_italic_r ) - italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT - italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT ) end_ARG start_ARG 2 end_ARG , 0 ) , 1 ) , 1 ] , end_CELL end_ROW start_ROW start_CELL ∅ , end_CELL start_CELL roman_else . end_CELL end_ROW
Lemma 2.

Assume wqqI,2(𝐫)wppI,2wqqE,2(𝐫)wppE,2subscript𝑤𝑞subscript𝑞𝐼2𝐫subscript𝑤𝑝subscript𝑝𝐼2subscript𝑤𝑞subscript𝑞𝐸2𝐫subscript𝑤𝑝subscript𝑝𝐸2w_{q}q_{I,2}(\boldsymbol{r})-w_{p}p_{I,2}\geq w_{q}q_{E,2}(\boldsymbol{r})-w_{% p}p_{E,2}italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT ( bold_italic_r ) - italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT ≥ italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT ( bold_italic_r ) - italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT. Each user n𝑛nitalic_n’s optimal purchasing in period 2 is given in (9).

The assumption in Lemma 2 is without loss of generality. One can similarly analyze the case where wqqI,2(𝒓)wppI,2<wqqE,2(𝒓)wppE,2subscript𝑤𝑞subscript𝑞𝐼2𝒓subscript𝑤𝑝subscript𝑝𝐼2subscript𝑤𝑞subscript𝑞𝐸2𝒓subscript𝑤𝑝subscript𝑝𝐸2w_{q}q_{I,2}(\boldsymbol{r})-w_{p}p_{I,2}<w_{q}q_{E,2}(\boldsymbol{r})-w_{p}p_% {E,2}italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT ( bold_italic_r ) - italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT < italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT ( bold_italic_r ) - italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT. Importantly, Lemma 2 shows that users’ purchasing in period 2 depends on companies’ FL collaboration 𝒓𝒓\boldsymbol{r}bold_italic_r and their prices 𝒑𝒑\boldsymbol{p}bold_italic_p, including company I𝐼Iitalic_I’s period 1 price pI,1subscript𝑝𝐼1p_{I,1}italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT. As will be shown, such temporal dependence will render company I𝐼Iitalic_I’s decision problem challenging.

3.2 Companies’ Decisions in Period 2

In period 2, given pI,1subscript𝑝𝐼1p_{I,1}italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT in period 1, both companies decide the FL collaboration 𝒓𝒓\boldsymbol{r}bold_italic_r and prices {pI,2,pE,2}subscript𝑝𝐼2subscript𝑝𝐸2\{p_{I,2},p_{E,2}\}{ italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT , italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT } to maximize profits. We solve the pricing in Sec. 3.2.1 and FL collaboration in Sec. 3.2.2.

3.2.1 Companies’ Pricing in Period 2

We first solve companies’ optimal pricing in period 2, given pI,1subscript𝑝𝐼1p_{I,1}italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT and 𝒓𝒓\boldsymbol{r}bold_italic_r. Since both companies’ prices will affect users’ purchasing decisions and hence the company profits, the companies are playing a price competition game, which we model as follows.

Game 1.

(Price Competition in Period 2) The price competition game in period 2 is defined as a tuple {I,E},𝒫=pi,2,𝒲=Wi,2delimited-⟨⟩formulae-sequence𝐼𝐸𝒫productsubscript𝑝𝑖2𝒲productsubscript𝑊𝑖2\langle\{I,E\},\mathcal{P}=\prod p_{i,2},\mathcal{W}=\prod W_{i,2}\rangle⟨ { italic_I , italic_E } , caligraphic_P = ∏ italic_p start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT , caligraphic_W = ∏ italic_W start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT ⟩, where each company i𝑖iitalic_i in {I,E}𝐼𝐸\{I,E\}{ italic_I , italic_E } decides the pricing pi,2subscript𝑝𝑖2p_{i,2}italic_p start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT to maximize its own profit Wi,2subscript𝑊𝑖2W_{i,2}italic_W start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT in (3) and (6).

We aim to find the Nash equilibrium (NE) of Game 1.

Definition 1.

A profile 𝐩2(𝐫,pI,1)=(𝐩i,2(𝐫,pI,1),𝐩j,2(𝐫,pI,1))superscriptsubscript𝐩2𝐫subscript𝑝𝐼1superscriptsubscript𝐩𝑖2𝐫subscript𝑝𝐼1superscriptsubscript𝐩𝑗2𝐫subscript𝑝𝐼1\boldsymbol{p}_{2}^{*}(\boldsymbol{r},p_{I,1})=(\boldsymbol{p}_{i,2}^{*}(% \boldsymbol{r},p_{I,1}),\boldsymbol{p}_{j,2}^{*}(\boldsymbol{r},p_{I,1}))bold_italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( bold_italic_r , italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ) = ( bold_italic_p start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( bold_italic_r , italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ) , bold_italic_p start_POSTSUBSCRIPT italic_j , 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( bold_italic_r , italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ) ) is an NE of Game 1 if for all i{I,E}𝑖𝐼𝐸i\in\{I,E\}italic_i ∈ { italic_I , italic_E }, pi,2(𝐫,pI,1)pi,2(𝐫,pI,1)superscriptsubscript𝑝𝑖2𝐫subscript𝑝𝐼1superscriptsubscript𝑝𝑖2𝐫subscript𝑝𝐼1p_{i,2}^{\prime}(\boldsymbol{r},p_{I,1})\neq p_{i,2}^{*}(\boldsymbol{r},p_{I,1})italic_p start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( bold_italic_r , italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ) ≠ italic_p start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( bold_italic_r , italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ),

Wi,2(pi,2(𝒓,pI,1),pj,2(𝒓,pI,1))Wi,2(pi,2(𝒓,pI,1),pj,2(𝒓,pI,1)),subscript𝑊𝑖2superscriptsubscript𝑝𝑖2𝒓subscript𝑝𝐼1superscriptsubscript𝑝𝑗2𝒓subscript𝑝𝐼1subscript𝑊𝑖2superscriptsubscript𝑝𝑖2𝒓subscript𝑝𝐼1superscriptsubscript𝑝𝑗2𝒓subscript𝑝𝐼1W_{i,2}(p_{i,2}^{*}(\boldsymbol{r},p_{I,1}),p_{j,2}^{*}(\boldsymbol{r},p_{I,1}% ))\geq W_{i,2}(p_{i,2}^{\prime}(\boldsymbol{r},p_{I,1}),p_{j,2}^{*}(% \boldsymbol{r},p_{I,1})),italic_W start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( bold_italic_r , italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ) , italic_p start_POSTSUBSCRIPT italic_j , 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( bold_italic_r , italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ) ) ≥ italic_W start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( bold_italic_r , italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ) , italic_p start_POSTSUBSCRIPT italic_j , 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( bold_italic_r , italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ) ) , (10)

where ji𝑗𝑖j\neq iitalic_j ≠ italic_i and j{I,E}𝑗𝐼𝐸j\in\{I,E\}italic_j ∈ { italic_I , italic_E }.

NE is considered as a stable strategy profile as no company can achieve a higher profit via unilaterally changing its pricing. Notice that an arbitrary choice of H()𝐻H(\cdot)italic_H ( ⋅ ) can easily render the equilibrium analysis intractable. To facilitate theoretical analysis, we make some minor assumptions on H()𝐻H(\cdot)italic_H ( ⋅ ) (and h()h(\cdot)italic_h ( ⋅ )).

Assumption 1.

Users’ preference distribution satisfies: (i) h(θ)>0𝜃0h(\theta)>0italic_h ( italic_θ ) > 0 and is continuous. (ii) h(θ)/[1H(θ)]𝜃delimited-[]1𝐻𝜃h(\theta)/[1-H(\theta)]italic_h ( italic_θ ) / [ 1 - italic_H ( italic_θ ) ] is increasing in θ𝜃\thetaitalic_θ.

Assumption 1 holds for many widely used distributions, e.g., Gaussian, uniform, and gamma distributions. Now, we discuss the equilibrium existence below.

Proposition 1.

Under Assumption 1, Game 1’s NE exists.

Proposition 1 can be proved by showing that the price competition game is a concave game, and in particular, each organization’s profit is quasi-concave in its price. Refer to the supplementary material where we have developed a best response algorithm to compute the price equilibrium.

Next, we provide a result that establishes the relationship between the optimal pricing and companies’ quality of service.

Corollary 1.

Each company i𝑖iitalic_i’s optimal pricing in period 2 is a non-decreasing function in qi,2(𝐫)subscript𝑞𝑖2𝐫q_{i,2}(\boldsymbol{r})italic_q start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT ( bold_italic_r ) but a non-increasing function in qj,2(𝐫)subscript𝑞𝑗2𝐫q_{j,2}(\boldsymbol{r})italic_q start_POSTSUBSCRIPT italic_j , 2 end_POSTSUBSCRIPT ( bold_italic_r ), where i,j{I,E},ijformulae-sequence𝑖𝑗𝐼𝐸𝑖𝑗i,j\in\{I,E\},i\neq jitalic_i , italic_j ∈ { italic_I , italic_E } , italic_i ≠ italic_j.

Corollary 1 can be proved via contradiction. It means that each company will set a higher price if it has a better quality of service, while it will lower the price if its competitor has a better quality.

3.2.2 Companies’ FL Collaboration in Period 2

Now, we solve companies’ optimal FL collaboration in period 2, given pI,1subscript𝑝𝐼1p_{I,1}italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT in period 1. The companies decide collaboration strategy 𝒓𝒓\boldsymbol{r}bold_italic_r for FL, anticipating the price equilibrium of Game 1. We model the two companies’ interactions as a collaboration game.

Game 2.

(FL Collaboration Game in Period 2) The FL collaboration game is a tuple {I,E},=ri,𝒲=Wi,2delimited-⟨⟩formulae-sequence𝐼𝐸productsubscript𝑟𝑖𝒲productsubscript𝑊𝑖2\langle\{I,E\},\mathcal{R}=\prod r_{i},\mathcal{W}=\prod W_{i,2}\rangle⟨ { italic_I , italic_E } , caligraphic_R = ∏ italic_r start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , caligraphic_W = ∏ italic_W start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT ⟩, where each company i𝑖iitalic_i decides its FL collaboration risubscript𝑟𝑖r_{i}italic_r start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT to maximize Wi,2(𝐫,𝐩2(𝐫,pI,1))subscript𝑊𝑖2𝐫superscriptsubscript𝐩2𝐫subscript𝑝𝐼1W_{i,2}(\boldsymbol{r},\boldsymbol{p}_{2}^{*}(\boldsymbol{r},p_{I,1}))italic_W start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT ( bold_italic_r , bold_italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( bold_italic_r , italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ) ), where 𝐩2(𝐫,pI,1)superscriptsubscript𝐩2𝐫subscript𝑝𝐼1\boldsymbol{p}_{2}^{*}(\boldsymbol{r},p_{I,1})bold_italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( bold_italic_r , italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ) is the NE of Game 1.

We aim to solve Game 2’s NE defined below.

Definition 2.

A strategy profile 𝐫=(ri,rj)superscript𝐫superscriptsubscript𝑟𝑖superscriptsubscript𝑟𝑗\boldsymbol{r}^{*}=(r_{i}^{*},r_{j}^{*})bold_italic_r start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = ( italic_r start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_r start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) is an NE of Game 2 if i{I,E}for-all𝑖𝐼𝐸\forall i\in\{I,E\}∀ italic_i ∈ { italic_I , italic_E }, ririfor-allsuperscriptsubscript𝑟𝑖superscriptsubscript𝑟𝑖\forall r_{i}^{\prime}\neq r_{i}^{*}∀ italic_r start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ≠ italic_r start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT,

Wi,2(𝒓,𝒑2(𝒓,pI,1))Wi,2((ri,rj),𝒑2((ri,rj),pI,1),W_{i,2}(\boldsymbol{r}^{*},\boldsymbol{p}_{2}^{*}(\boldsymbol{r}^{*},p_{I,1}))% \geq W_{i,2}((r_{i}^{\prime},r_{j}^{*}),\boldsymbol{p}_{2}^{*}((r_{i}^{\prime}% ,r_{j}^{*}),p_{I,1}),italic_W start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT ( bold_italic_r start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( bold_italic_r start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ) ) ≥ italic_W start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT ( ( italic_r start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_r start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) , bold_italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( ( italic_r start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_r start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) , italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ) , (11)

where ji𝑗𝑖j\neq iitalic_j ≠ italic_i and j{I,E}𝑗𝐼𝐸j\in\{I,E\}italic_j ∈ { italic_I , italic_E }.

Next, we characterize the NE in Proposition2.

Proposition 2.

The profile (1,1)11(1,1)( 1 , 1 ) is the NE of Game 2 if and only if i{I,E}for-all𝑖𝐼𝐸\forall i\in\left\{I,E\right\}∀ italic_i ∈ { italic_I , italic_E }, Wi,2((1,1),𝐩2((1,1),pI,1))Wi,2((0,0),𝐩2((0,0),pI,1)).subscript𝑊𝑖211superscriptsubscript𝐩211subscript𝑝𝐼1subscript𝑊𝑖200superscriptsubscript𝐩200subscript𝑝𝐼1W_{i,2}((1,1),\boldsymbol{p}_{2}^{*}((1,1),p_{I,1}))\geq W_{i,2}((0,0),% \boldsymbol{p}_{2}^{*}((0,0),p_{I,1})).italic_W start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT ( ( 1 , 1 ) , bold_italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( ( 1 , 1 ) , italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ) ) ≥ italic_W start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT ( ( 0 , 0 ) , bold_italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( ( 0 , 0 ) , italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ) ) . Otherwise, (0,0)00(0,0)( 0 , 0 ) is the NE.

Proposition 2 means that the two companies will collaborate in FL training if and only if both achieves a higher profit than no collaboration. Next, we characterize a somewhat counter-intuitive result.

Theorem 1.

Under Assumption 1, there exists a pI,1subscript𝑝𝐼1p_{I,1}italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT such that (0,0)00(0,0)( 0 , 0 ) is the NE of Game 2 even if qi,2((1,1))>qi,2((0,0)),isubscript𝑞𝑖211subscript𝑞𝑖200for-all𝑖q_{i,2}((1,1))>q_{i,2}((0,0)),\forall iitalic_q start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT ( ( 1 , 1 ) ) > italic_q start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT ( ( 0 , 0 ) ) , ∀ italic_i.

Theorem 1 shows that even if FL collaboration improves both companies’ quality of service, it may not be the NE. This is because FL brings model performance gain and competition loss at the same time. That is, the shared FL model also improves the competitor’s quality of service. According to Corollary 1, a company needs to set a lower price to attract users, which can lead to a lower profit. We will show in Sec. 4 that FL is the NE only when the model performance gain is significant and outweighs the competition loss.

3.3 Company I𝐼Iitalic_I’s Pricing in Period 1

We solve company I𝐼Iitalic_I’s optimal pricing in period 1. In particular, company I𝐼Iitalic_I decides pI,1subscript𝑝𝐼1p_{I,1}italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT to maximize its total profit in (7), anticipating the equilibria of the price competition game and FL collaboration game.

Note that solving this problem is challenging due to a few reasons. First, company I𝐼Iitalic_I’s period-1-price pI,1subscript𝑝𝐼1p_{I,1}italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT affects users’ decisions in both periods 1 and 2 (see (8)-(9)). This in turn will affect companies’ price competition and FL collaboration in period 2, leading to a highly coupled analysis. Second, even if we established equilibrium existence in Propositions 1-2, we still lack closed-form equilibrium solutions, making the characterization of the profit function challenging. Third, it is easy to showcase that company I𝐼Iitalic_I’s total profit is non-concave in pI,1subscript𝑝𝐼1p_{I,1}italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT if we use a Gaussian distribution for ϕnsubscriptitalic-ϕ𝑛\phi_{n}italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT.

To address this challenge, we develop an algorithm to compute company I𝐼Iitalic_I’s optimal pricing in period 1 in Algorithm 1. For ease of presentation, we use NEp(pI,1,l,h)subscriptNE𝑝subscript𝑝𝐼1𝑙\text{NE}_{p}(p_{I,1},l,h)NE start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT , italic_l , italic_h ) and NEr(pI,1,l,h)subscriptNE𝑟subscript𝑝𝐼1𝑙\text{NE}_{r}(p_{I,1},l,h)NE start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT , italic_l , italic_h ) to denote the equilibrium calculated on price pI,1[l,h]subscript𝑝𝐼1𝑙p_{I,1}\in[l,h]italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ∈ [ italic_l , italic_h ] of Game 1 and Game 2, respectively. The calculation of NEpsubscriptNE𝑝\text{NE}_{p}NE start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT is based on the a best response algorithm (details in appendix). The calculation of NErsubscriptNE𝑟\text{NE}_{r}NE start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT is based on Proposition 2. Note that even if the problem is non-concave, the proposed algorithm can return the global optimal solution.

Theorem 2.

Under Assumption 1, Algorithm 1 converges to company I𝐼Iitalic_I’s optimal pricing in period 1.

The key rationale is that even if the problem is non-concave, we can decompose the problem into several concave sub-problems (i.e., sub-problem a𝑎aitalic_a in lines 2-6, sub-problem b𝑏bitalic_b in lines 7-11, and sub-problem c𝑐citalic_c in lines 12-16). Then it suffices to locate the optimal solutions for each sub-problem and then compare those to achieve the global optimum.

Table 1: Impact of data heterogeneity β𝛽\betaitalic_β on model accuracy in %percent\%%.

CIFAR10 β=𝛽\beta=\inftyitalic_β = ∞ β=1.0𝛽1.0\beta=1.0italic_β = 1.0 β=0.5𝛽0.5\beta=0.5italic_β = 0.5 β=0.1𝛽0.1\beta=0.1italic_β = 0.1 β=0.01𝛽0.01\beta=0.01italic_β = 0.01 I𝐼Iitalic_I local 60.81 ± 4.32 55.34 ± 1.82 53.42 ± 5.37 39.23 ± 6.84 43.27 ± 4.03 E𝐸Eitalic_E local 57.56 ± 3.75 54.50 ± 3.58 49.30 ± 7.41 48.17 ± 5.93 33.51 ± 4.95 FedAvg 69.81 ± 2.26 69.01 ± 2.22 66.96 ± 2.77 57.40 ± 9.02 51.89 ± 6.49 CIFAR100 β=𝛽\beta=\inftyitalic_β = ∞ β=1.0𝛽1.0\beta=1.0italic_β = 1.0 β=0.5𝛽0.5\beta=0.5italic_β = 0.5 β=0.1𝛽0.1\beta=0.1italic_β = 0.1 β=0.01𝛽0.01\beta=0.01italic_β = 0.01 I𝐼Iitalic_I local 19.00 ± 4.28 18.50 ± 2.79 17.37 ± 2.11 17.17 ± 3.06 16.94 ± 1.89 E𝐸Eitalic_E local 17.93 ± 2.78 17.46 ± 2.39 17.12 ± 2.66 18.09 ± 4.49 14.26 ± 1.79 FedAvg 32.80 ± 3.64 31.16 ± 3.46 29.92 ± 3.45 26.58 ± 3.84 21.03 ± 1.63

Table 2: Impact of data quantity DEsubscript𝐷𝐸D_{E}italic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT on model accuracy in %percent\%%.

CIFAR10 DE=3ksubscript𝐷𝐸3𝑘D_{E}=3kitalic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT = 3 italic_k DE=5ksubscript𝐷𝐸5𝑘D_{E}=5kitalic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT = 5 italic_k DE=8ksubscript𝐷𝐸8𝑘D_{E}=8kitalic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT = 8 italic_k DE=10ksubscript𝐷𝐸10𝑘D_{E}=10kitalic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT = 10 italic_k I𝐼Iitalic_I local 38.47 ± 9.91 38.63 ± 8.10 39.80 ± 8.08 40.56 ± 8.25 E𝐸Eitalic_E local 48.07 ± 8.99 49.27 ± 2.84 51.84 ± 0.66 53.23 ± 0.77 FedAvg 59.69 ± 9.55 60.66 ± 1.62 62.12 ± 3.63 62.88 ± 5.50 CIFAR100 DE=3ksubscript𝐷𝐸3𝑘D_{E}=3kitalic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT = 3 italic_k DE=5ksubscript𝐷𝐸5𝑘D_{E}=5kitalic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT = 5 italic_k DE=8ksubscript𝐷𝐸8𝑘D_{E}=8kitalic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT = 8 italic_k DE=10ksubscript𝐷𝐸10𝑘D_{E}=10kitalic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT = 10 italic_k I𝐼Iitalic_I local 17.44 ± 2.35 17.20 ± 1.61 18.64 ± 3.03 19.85 ± 4.60 E𝐸Eitalic_E local 16.66 ± 2.17 18.30 ± 1.16 21.50 ± 3.84 22.92 ± 4.87 FedAvg 25.40 ± 2.95 26.14 ± 0.41 29.10 ± 1.15 32.08 ± 1.45 HAM10000 DE=2ksubscript𝐷𝐸2𝑘D_{E}=2kitalic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT = 2 italic_k DE=3ksubscript𝐷𝐸3𝑘D_{E}=3kitalic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT = 3 italic_k DE=4ksubscript𝐷𝐸4𝑘D_{E}=4kitalic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT = 4 italic_k DE=5ksubscript𝐷𝐸5𝑘D_{E}=5kitalic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT = 5 italic_k I𝐼Iitalic_I local 76.37 ± 1.65 76.28 ± 0.52 75.61 ± 0.87 75.68 ± 1.24 E𝐸Eitalic_E local 73.67 ± 1.87 76.88 ± 1.44 77.99 ± 1.24 78.11 ± 1.29 FedAvg 79.90 ± 0.91 80.50 ± 0.72 80.89 ± 0.99 81.09 ± 1.48

Algorithm 1 Optimization of Period 1 Pricing
1:  initialize oI,oE,wq,wp,wϕsubscript𝑜𝐼subscript𝑜𝐸subscript𝑤𝑞subscript𝑤𝑝subscript𝑤italic-ϕo_{I},o_{E},w_{q},w_{p},w_{\phi}italic_o start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT , italic_o start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT. Let pI,1,a=pI,1,b=pI,1,c=0subscript𝑝𝐼1𝑎subscript𝑝𝐼1𝑏subscript𝑝𝐼1𝑐0p_{I,1,a}=p_{I,1,b}=p_{I,1,c}=0italic_p start_POSTSUBSCRIPT italic_I , 1 , italic_a end_POSTSUBSCRIPT = italic_p start_POSTSUBSCRIPT italic_I , 1 , italic_b end_POSTSUBSCRIPT = italic_p start_POSTSUBSCRIPT italic_I , 1 , italic_c end_POSTSUBSCRIPT = 0.
2:  while not convergent do
3:     Compute NEp(pI,1,a,0,(wqqI,11)/wp)subscriptNE𝑝subscript𝑝𝐼1𝑎0subscript𝑤𝑞subscript𝑞𝐼11subscript𝑤𝑝\text{NE}_{p}\left(p_{I,1,a},0,(w_{q}q_{I,1}-1)/{w_{p}}\right)NE start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 1 , italic_a end_POSTSUBSCRIPT , 0 , ( italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT - 1 ) / italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) and NEr(pI,1,a,0,(wqqI,11)/wp)subscriptNE𝑟subscript𝑝𝐼1𝑎0subscript𝑤𝑞subscript𝑞𝐼11subscript𝑤𝑝\text{NE}_{r}\left(p_{I,1,a},0,(w_{q}q_{I,1}-1)/{w_{p}}\right)NE start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 1 , italic_a end_POSTSUBSCRIPT , 0 , ( italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT - 1 ) / italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT )
4:     Update pI,1,asubscript𝑝𝐼1𝑎p_{I,1,a}italic_p start_POSTSUBSCRIPT italic_I , 1 , italic_a end_POSTSUBSCRIPT using gradient ascent
5:  end while
6:  Return converged solution as pI,1,asubscriptsuperscript𝑝𝐼1𝑎p^{*}_{I,1,a}italic_p start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_I , 1 , italic_a end_POSTSUBSCRIPT
7:  while not convergent do
8:     Compute NEp(pI,1,b,(wqqI,11)/wp,wqqI,1/wp)subscriptNE𝑝subscript𝑝𝐼1𝑏subscript𝑤𝑞subscript𝑞𝐼11subscript𝑤𝑝subscript𝑤𝑞subscript𝑞𝐼1subscript𝑤𝑝\text{NE}_{p}\left(p_{I,1,b},(w_{q}q_{I,1}-1)/{w_{p}},w_{q}q_{I,1}/w_{p}\right)NE start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 1 , italic_b end_POSTSUBSCRIPT , ( italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT - 1 ) / italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT / italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) and NEr(pI,1,b,(wqqI,11)/wp,wqqI,1/wp)subscriptNE𝑟subscript𝑝𝐼1𝑏subscript𝑤𝑞subscript𝑞𝐼11subscript𝑤𝑝subscript𝑤𝑞subscript𝑞𝐼1subscript𝑤𝑝\text{NE}_{r}\left(p_{I,1,b},(w_{q}q_{I,1}-1)/{w_{p}},w_{q}q_{I,1}/w_{p}\right)NE start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 1 , italic_b end_POSTSUBSCRIPT , ( italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT - 1 ) / italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT / italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT )
9:     Update pI,1,bsubscript𝑝𝐼1𝑏p_{I,1,b}italic_p start_POSTSUBSCRIPT italic_I , 1 , italic_b end_POSTSUBSCRIPT using gradient ascent
10:  end while
11:  Return converged solution as pI,1,bsubscriptsuperscript𝑝𝐼1𝑏p^{*}_{I,1,b}italic_p start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_I , 1 , italic_b end_POSTSUBSCRIPT
12:  while not convergent do
13:     Compute NEp(pI,1,c,wqqI,1/wp,)subscriptNE𝑝subscript𝑝𝐼1𝑐subscript𝑤𝑞subscript𝑞𝐼1subscript𝑤𝑝\text{NE}_{p}\left(p_{I,1,c},w_{q}q_{I,1}/w_{p},\infty\right)NE start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 1 , italic_c end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT / italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT , ∞ ) and NEr(pI,1,c,wqqI,1/wp,)subscriptNE𝑟subscript𝑝𝐼1𝑐subscript𝑤𝑞subscript𝑞𝐼1subscript𝑤𝑝\text{NE}_{r}\left(p_{I,1,c},w_{q}q_{I,1}/w_{p},\infty\right)NE start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 1 , italic_c end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT / italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT , ∞ )
14:     Update pI,1,csubscript𝑝𝐼1𝑐p_{I,1,c}italic_p start_POSTSUBSCRIPT italic_I , 1 , italic_c end_POSTSUBSCRIPT using gradient ascent
15:  end while
16:  Return converged solution as pI,1,csubscriptsuperscript𝑝𝐼1𝑐p^{*}_{I,1,c}italic_p start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_I , 1 , italic_c end_POSTSUBSCRIPT
17:  pI,1argmax{WI(pI,1,a),WI(pI,1,b),WI(pI,1,c)}subscriptsuperscript𝑝𝐼1subscript𝑊𝐼subscriptsuperscript𝑝𝐼1𝑎subscript𝑊𝐼subscriptsuperscript𝑝𝐼1𝑏subscript𝑊𝐼subscriptsuperscript𝑝𝐼1𝑐p^{*}_{I,1}\leftarrow\arg\max\left\{W_{I}(p^{*}_{I,1,a}),W_{I}(p^{*}_{I,1,b}),% W_{I}(p^{*}_{I,1,c})\right\}italic_p start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ← roman_arg roman_max { italic_W start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT ( italic_p start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_I , 1 , italic_a end_POSTSUBSCRIPT ) , italic_W start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT ( italic_p start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_I , 1 , italic_b end_POSTSUBSCRIPT ) , italic_W start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT ( italic_p start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_I , 1 , italic_c end_POSTSUBSCRIPT ) }.

4 Numerical Results

We conduct numerical experiments to gain more useful insights. In Sec. 4.1, we discuss the simulation setup. In Sec. 4.2, we train FL models on three different datasets and report the test accuracy. In Sec. 4.3, we use the accuracy results to compute the solutions to the two-period game.

4.1 Simulation Setup

We train ResNet-18 on CIFAR-10 and HAM10000, as well as train ResNet-50 on CIFAR-100 using FedAvg [22].666Our model is compatible with any FL algorithms, and this paper focuses on FedAvg, which till today, remains one of the state-of-the-art FL algorithms. CIFAR-10 (100) is a balanced dataset on 10(100)1010010(100)10 ( 100 ) classes with 50k50𝑘50k50 italic_k training and 10k10𝑘10k10 italic_k test data [14]. HAM10000 is an imbalanced medical dataset on 7 classes with 10,0151001510,01510 , 015 images from four domains, where each domain represents an institution who collected dermoscopic (skin disease) images [37]. We consider that the two companies have heterogeneous data, which are sampled using the Dirichlet distribution with a controlling parameter β>0𝛽0\beta>0italic_β > 0, where a smaller β𝛽\betaitalic_β implies higher data heterogeneity [5]. We study two scenarios:

  • Impact of data heterogeneity: On CIFAR-10 and CIFAR-100, we use β{,1,0.5,0.1,0.01}𝛽10.50.10.01\beta\in\{\infty,1,0.5,0.1,0.01\}italic_β ∈ { ∞ , 1 , 0.5 , 0.1 , 0.01 }. For each β𝛽\betaitalic_β, we sample 5k5𝑘5k5 italic_k data for company I𝐼Iitalic_I and 5k5𝑘5k5 italic_k data for company E𝐸Eitalic_E. We do not explore the impact of data heterogeneity on HAM10000 as the dataset itself is highly imbalanced.

  • Impact of data quantity: On CIFAR-10 and CIFAR-100, we fix β=0.1𝛽0.1\beta=0.1italic_β = 0.1, assign company I𝐼Iitalic_I 5k5𝑘5k5 italic_k data, and change company E𝐸Eitalic_E’s data volume DE{3k,5k,8k,10k}subscript𝐷𝐸3𝑘5𝑘8𝑘10𝑘D_{E}\in\{3k,5k,8k,10k\}italic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT ∈ { 3 italic_k , 5 italic_k , 8 italic_k , 10 italic_k }. On HAM10000, we randomly assign company I𝐼Iitalic_I 2k2𝑘2k2 italic_k data and company E𝐸Eitalic_E data of size DE{2k,3k,4k,5k}subscript𝐷𝐸2𝑘3𝑘4𝑘5𝑘D_{E}\in\{2k,3k,4k,5k\}italic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT ∈ { 2 italic_k , 3 italic_k , 4 italic_k , 5 italic_k }.

The key hyper-parameters are summarized as follows. We use 100100100100 rounds for FL training, where each round consists of 5555 local epochs. We use SGD as the optimizer, and choose learning rate lr=0.001subscript𝑙𝑟0.001l_{r}=0.001italic_l start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT = 0.001 and batch size B=64𝐵64B=64italic_B = 64. We further report the results when each company only uses its own data to train a local model. Each experiment is repeated over 3 runs using different random seeds.

4.2 Training Results

Table 1 reports the training results under different levels of data heterogeneity, where I𝐼Iitalic_I (E) local” means that company I𝐼Iitalic_I (E𝐸Eitalic_E) trains a model using its local data without FL. From this table, we make two observations.

First, on both datasets, FedAvg consistently outperforms local training models across all values of β𝛽\betaitalic_β, including the highly heterogeneous case β=0.01𝛽0.01\beta=0.01italic_β = 0.01. This suggests the benefit of FL collaboration in improving local model performances. Second, as β𝛽\betaitalic_β decreases, there is a general trend of performance decline for FedAvg. This is due to the notorious client drift issue in FL, which has been observed in many distributed learning literature (e.g., [39, 16]). Despite that data heterogeneity hurts FL performance, we will show a counter-intuitive result in Sec. 4.3 that data heterogeneity promotes price competition.

Table 2 reports the accuracy results under different data quantities. We observe that FedAvg again outperforms local training across different data quantities on the three datasets. Also, increasing data volume DEsubscript𝐷𝐸D_{E}italic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT consistently improves model accuracy across all datasets, underscoring the importance of data quantity in model training.

4.3 Equilibrium Results

Now, we use Tables 1-2 to calculate the equilibrium of the two period game. We consider that companies’ quality of service is an increasing function in the model accuracy. In particular, we use qi,t(A)=Asubscript𝑞𝑖𝑡𝐴𝐴q_{i,t}(A)=Aitalic_q start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT ( italic_A ) = italic_A, where A𝐴Aitalic_A is the mean accuracy value from the tables. We further consider that users’ type (preference) ϕnsubscriptitalic-ϕ𝑛\phi_{n}italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is heterogeneous, which follows a uniform distribution on support [0,1]01[0,1][ 0 , 1 ].777We have also tested Gaussian distributions and results carry over. We choose coefficients wp=wϕ=wp=1subscript𝑤𝑝subscript𝑤italic-ϕsubscript𝑤𝑝1w_{p}=w_{\phi}=w_{p}=1italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT = italic_w start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT = italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT = 1 and normalize the service costs cIsubscript𝑐𝐼c_{I}italic_c start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT and cEsubscript𝑐𝐸c_{E}italic_c start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT to 00. We use Proposition 2 to calculate the equilibrium FL collaboration, in which we use the best response algorithm [26] to compute the equilibrium of Game 1. We further use Algorithm 1 to compute the optimal period 1 pricing of company I𝐼Iitalic_I.

Table 3: FL collaboration under different β𝛽\betaitalic_β.
β=𝛽\beta=\inftyitalic_β = ∞ β=1𝛽1\beta=1italic_β = 1 β=0.5𝛽0.5\beta=0.5italic_β = 0.5 β=0.1𝛽0.1\beta=0.1italic_β = 0.1 β=0.01𝛽0.01\beta=0.01italic_β = 0.01
CIFAR-10 \checkmark \checkmark \checkmark \checkmark \checkmark
CIFAR-100 \checkmark \checkmark \checkmark \checkmark \checkmark
Table 4: FL collaboration under different DEsubscript𝐷𝐸D_{E}italic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT.

DE=3ksubscript𝐷𝐸3𝑘D_{E}=3kitalic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT = 3 italic_k DE=5ksubscript𝐷𝐸5𝑘D_{E}=5kitalic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT = 5 italic_k DE=8ksubscript𝐷𝐸8𝑘D_{E}=8kitalic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT = 8 italic_k DE=10ksubscript𝐷𝐸10𝑘D_{E}=10kitalic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT = 10 italic_k CIFAR-10 \checkmark \checkmark \checkmark \checkmark CIFAR-100 \checkmark \checkmark \checkmark \checkmark DE=2ksubscript𝐷𝐸2𝑘D_{E}=2kitalic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT = 2 italic_k DE=3ksubscript𝐷𝐸3𝑘D_{E}=3kitalic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT = 3 italic_k DE=4ksubscript𝐷𝐸4𝑘D_{E}=4kitalic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT = 4 italic_k DE=5ksubscript𝐷𝐸5𝑘D_{E}=5kitalic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT = 5 italic_k HAM10000 ×\times× ×\times× ×\times× ×\times×

Refer to caption
(a) Impact of β𝛽\betaitalic_β on CIFAR-10.
Refer to caption
(b) Impact of β𝛽\betaitalic_β on CIFAR-100.
Refer to caption
(c) Impact of DEsubscript𝐷𝐸D_{E}italic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT on CIFAR-100.
Refer to caption
(d) Impact of DEsubscript𝐷𝐸D_{E}italic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT on HAM10000.
Figure 3: Optimal two-period pricing on different datasets.

Optimal FL collaboration. We first investigate the equilibrium FL collaboration and report the results in Tables 3-4, where \checkmark means collaboration and ×\times× means no collaboration. We observe that while FL enhances model performance across the three datasets (see Table 1), intriguingly, collaboration is observed only on CIFAR datasets but not on HAM10000. This may seem counter-intuitive at first glance because the improved performance through FL suggests a natural incentive to collaborative behavior. However, it is crucial to note that improved performance alone does not necessitate collaboration in a competitive environment. Recall from Corollary 1 that a company’s price increases in its own model performance but at the same time decreases in its competitor’s model performance. Hence, collaboration is a beneficial strategy only when the performance gains are substantial enough to outweigh the loss from price competition. For instance, CIFAR datasets exhibit significant performance enhancements (e.g., β=0.1𝛽0.1\beta=0.1italic_β = 0.1 on CIFAR-10 in Table 1), which motivates collaboration, while the less significant improvements in HAM10000 (e.g., DE=3ksubscript𝐷𝐸3𝑘D_{E}=3kitalic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT = 3 italic_k in Table 2) are insufficient to motivate the same collaborative effort. We summarize the above observation below.

Observation 1.

FL collaboration is the NE only when the model performance improvement is substantial.

Optimal Pricing. We now investigate how the optimal pricing changes with data heterogeneity β𝛽\betaitalic_β and data quantity DEsubscript𝐷𝐸D_{E}italic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT. The results are reported in Fig. 3. Note that in period 2 the two companies’ optimal pricing are the same due to 1) both companies having the same (global) model, and 2) users’ preference ϕnsubscriptitalic-ϕ𝑛\phi_{n}italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT follows a uniform distribution, which is symmetric for both companies.

From Fig. 3(a)-(b), it is a bit surprising to observe that both companies optimal pricing can increase in data heterogeneity (i.e., a smaller β𝛽\betaitalic_β). While the expectation is that higher heterogeneity leads to worse performance and hence users are willingly to pay, leading to smaller prices. However, this is not always the case. The reason is that when data is highly heterogeneous, even if the model performance declines, the companies still obtain a big performance improvement from FL compared to local learning (e.g., see β=0.1𝛽0.1\beta=0.1italic_β = 0.1 in Table 1). As a result, company I𝐼Iitalic_I sets a high price in period 1 so that no users buy low-quality service in this period. For example, in Fig. 3(a) at β=0.1𝛽0.1\beta=0.1italic_β = 0.1, I𝐼Iitalic_I sets pI,1=1subscriptsuperscript𝑝𝐼11p^{*}_{I,1}=1italic_p start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT = 1 and based on Lemma 1, none of the users will purchase from I𝐼Iitalic_I. Instead, both companies attract users to buy high-quality services at higher prices in period 2, which results in higher profits. This observation has an interesting implication, i.e., data heterogeneity does not always prevent but can promote price competition. In particular, the incumbent company I𝐼Iitalic_I does not harvest most users in the starting period but prefers to compete with the entrant company using better model-based services. We summarize the above observations as follows.

Observation 2.

Data heterogeneity can promote price competition between incumbent and entrant companies.

From Fig. 3(c)-(d), we observe different trends of pricing as a function of DEsubscript𝐷𝐸D_{E}italic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT on the two datasets. The reason is that the optimal FL collaboration decisions are different (refer to Table 4). On CIFAR-100, the companies collaborate via FL and the performance improvement tends to be larger as DEsubscript𝐷𝐸D_{E}italic_D start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT increases. Hence, company I𝐼Iitalic_I uses a very large price in period 1 so that no users buy in period 1, and instead users will buy higher-quality services in period 2 at a larger price. On HAM10000, however, the companies do not collaborate in FL. As a result, company I𝐼Iitalic_I is inclined to set a smaller price in period 1 so that most users subscribe to I𝐼Iitalic_I in the first period. Then in period 2, the companies participate in more fierce price competition due to a smaller user pool, leading to a smaller price.

5 Discussions

We have provided a first study on FL coopetition using a dynamic two-period model with two companies. The analysis is highly non-trivial with this stylized model involving a non-concave problem. Next, we discuss how to extend our model with more than two companies across multiple time periods.

Extension to multiple companies. Our model can be extended to scenarios with two or more companies. In particular, consider N3𝑁3N\geq 3italic_N ≥ 3 companies where N1𝑁1N-1italic_N - 1 of them coexist in period 1, and an entrant enters in period 2. This can model real-world market conditions where multiple companies often coexist and new entrants periodically disrupt existing equilibriums. In this case, we can similarly analyze the user behavior by comparing the payoffs obtained when purchasing from different companies. For companies’ FL collaboration, we can use the coalitional game theory to find stable FL collaboration structures, e.g., [1, 34]. For companies’ price competitions, prior studies on oligopoly competition [36] can offer theoretical and algorithmic solutions. However, how to optimally set prices in period 1 (which affects period 2 outcomes) remains a challenging but interesting problem to explore in future work.

Extension to multiple time periods. It is possible to extend our model to where there are t3𝑡3t\geq 3italic_t ≥ 3 periods, and in each period a new entrant company enters the market. For user decisions, it suffices to restrict attention in each period and compute the highest payoff achieved among existing companies. However, we note that solving companies’ FL collaboration and price competition over t3𝑡3t\geq 3italic_t ≥ 3 periods in closed-form is generally analytically intractable. One particular challenge is to estimate future entrants and their properties (e.g., data distributions). One possible remedy is to resort to reinforcement learning for sequential decision making, and in particular, multi-armed bandit [30, 8]. More specifically, one could model the agents as companies, the arms as FL collaboration and pricing, and the reward signals as profits achieved from users, which takes into account competitions and model improvement from FL training.

6 Conclusion

This work studied the under-explored problem of FL coopetition (FL collaboration and market competition) using a dynamic two-period model. One challenge pertains to multi-dimensional heterogeneity in terms of companies’ data distributions and quantities, as well as users’ service preferences. Another challenges is associated with solving a non-concave optimization problem. We decomposed the problem into multiple concave sub-problems and managed to characterize the solutions to the two-period game. Theoretical analysis and numerical experiments on three datasets have lead to two implications. First, FL brings both model performance gain and competition loss, and collaboration occurs only when the performance gain outweighs the loss. Second, data heterogeneity can incentivize the incumbent to limit market penetration in period 1 and promote price competition in period 2.

For future work, it is interesting to study how data quality (e.g., label correctness and label sparsity) affects FL coopetition and further develop robust algorithms to enhance performance. It is also interesting to incorporate privacy enhancing techniques, which can be of particular interest to companies, into the coopetition analysis.

7 Appendices

We organize the supplementary materials as follows.

  • In Appendix 8, we prove Lemma 1.

  • In Appendix 9, we prove Lemma 2.

  • In Appendix 10, we prove Proposition 1.

  • In Appendix 11, we prove Corollary 1.

  • In Appendix 12, we prove Proposition 2.

  • In Appendix 13, we prove Theorem 1.

  • In Appendix 14, we prove Theorem 2.

  • In Appendix 15, we present a best response algorithm to calculate the NE of Game 1.

8 Proof of Lemma 1

Proof.

Consider the payoff function for user n𝑛nitalic_n in period t=1𝑡1t=1italic_t = 1 given by:

un,t(dn,t;𝒓,𝒑)={0,if dn,t=,wqqI,t(𝒓)wϕϕnwppI,t,if dn,t=I.subscript𝑢𝑛𝑡subscript𝑑𝑛𝑡𝒓𝒑cases0if subscript𝑑𝑛𝑡subscript𝑤𝑞subscript𝑞𝐼𝑡𝒓subscript𝑤italic-ϕsubscriptitalic-ϕ𝑛subscript𝑤𝑝subscript𝑝𝐼𝑡if subscript𝑑𝑛𝑡𝐼u_{n,t}(d_{n,t};\boldsymbol{r},\boldsymbol{p})=\begin{cases}0,&\text{if }d_{n,% t}=\emptyset,\\ w_{q}q_{I,t}(\boldsymbol{r})-w_{\phi}\phi_{n}-w_{p}p_{I,t},&\text{if }d_{n,t}=% I.\end{cases}italic_u start_POSTSUBSCRIPT italic_n , italic_t end_POSTSUBSCRIPT ( italic_d start_POSTSUBSCRIPT italic_n , italic_t end_POSTSUBSCRIPT ; bold_italic_r , bold_italic_p ) = { start_ROW start_CELL 0 , end_CELL start_CELL if italic_d start_POSTSUBSCRIPT italic_n , italic_t end_POSTSUBSCRIPT = ∅ , end_CELL end_ROW start_ROW start_CELL italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_I , italic_t end_POSTSUBSCRIPT ( bold_italic_r ) - italic_w start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT - italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_I , italic_t end_POSTSUBSCRIPT , end_CELL start_CELL if italic_d start_POSTSUBSCRIPT italic_n , italic_t end_POSTSUBSCRIPT = italic_I . end_CELL end_ROW

At t=1𝑡1t=1italic_t = 1, the user can either choose to purchase from I𝐼Iitalic_I or not purchase.

For purchasing from I𝐼Iitalic_I, the utility is:

un,1(I;𝒓,𝒑)=wqqI,1(𝒓)wϕϕnwppI,1.subscript𝑢𝑛1𝐼𝒓𝒑subscript𝑤𝑞subscript𝑞𝐼1𝒓subscript𝑤italic-ϕsubscriptitalic-ϕ𝑛subscript𝑤𝑝subscript𝑝𝐼1u_{n,1}(I;\boldsymbol{r},\boldsymbol{p})=w_{q}q_{I,1}(\boldsymbol{r})-w_{\phi}% \phi_{n}-w_{p}p_{I,1}.italic_u start_POSTSUBSCRIPT italic_n , 1 end_POSTSUBSCRIPT ( italic_I ; bold_italic_r , bold_italic_p ) = italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ( bold_italic_r ) - italic_w start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT - italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT .

For the user to prefer purchasing over not purchasing, we require:

wqqI,1(𝒓)wϕϕnwppI,10.subscript𝑤𝑞subscript𝑞𝐼1𝒓subscript𝑤italic-ϕsubscriptitalic-ϕ𝑛subscript𝑤𝑝subscript𝑝𝐼10w_{q}q_{I,1}(\boldsymbol{r})-w_{\phi}\phi_{n}-w_{p}p_{I,1}\geq 0.italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ( bold_italic_r ) - italic_w start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT - italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ≥ 0 .

Thus, the condition for user n𝑛nitalic_n to choose I𝐼Iitalic_I over \emptyset at t=1𝑡1t=1italic_t = 1 is (when we normalize wϕ=1subscript𝑤italic-ϕ1w_{\phi}=1italic_w start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT = 1):

ϕnwqqI,1wppI,1.subscriptitalic-ϕ𝑛subscript𝑤𝑞subscript𝑞𝐼1subscript𝑤𝑝subscript𝑝𝐼1\phi_{n}\leq w_{q}q_{I,1}-w_{p}p_{I,1}.italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ≤ italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT - italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT .

This, we finish the proof for Lemma 1. ∎

9 Proof of Lemma 2

Proof.

Given the assumption wqqI,2(𝒓)wppI,2wqqE,2(𝒓)wppE,2subscript𝑤𝑞subscript𝑞𝐼2𝒓subscript𝑤𝑝subscript𝑝𝐼2subscript𝑤𝑞subscript𝑞𝐸2𝒓subscript𝑤𝑝subscript𝑝𝐸2w_{q}q_{I,2}(\boldsymbol{r})-w_{p}p_{I,2}\geq w_{q}q_{E,2}(\boldsymbol{r})-w_{% p}p_{E,2}italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT ( bold_italic_r ) - italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT ≥ italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT ( bold_italic_r ) - italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT, purchasing from I𝐼Iitalic_I is more efficient from a “cost-benefit” perspective. Moreover, a user will buy in period 2 if it does not buy in period 1, i.e., dn,1=.subscript𝑑𝑛1d_{n,1}=\emptyset.italic_d start_POSTSUBSCRIPT italic_n , 1 end_POSTSUBSCRIPT = ∅ .

For user n𝑛nitalic_n to choose I𝐼Iitalic_I in period 2, the payoff should be larger than that from not buying or buying from E𝐸Eitalic_E. That is

wqqI,t(𝒓)wϕϕnwppI,tmax{0,wqqE,t(𝒓)wϕ(1ϕn)wppE,t}.subscript𝑤𝑞subscript𝑞𝐼𝑡𝒓subscript𝑤italic-ϕsubscriptitalic-ϕ𝑛subscript𝑤𝑝subscript𝑝𝐼𝑡0subscript𝑤𝑞subscript𝑞𝐸𝑡𝒓subscript𝑤italic-ϕ1subscriptitalic-ϕ𝑛subscript𝑤𝑝subscript𝑝𝐸𝑡w_{q}q_{I,t}(\boldsymbol{r})-w_{\phi}\phi_{n}-w_{p}p_{I,t}\geq\max\left\{0,w_{% q}q_{E,t}(\boldsymbol{r})-w_{\phi}(1-\phi_{n})-w_{p}p_{E,t}\right\}.italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_I , italic_t end_POSTSUBSCRIPT ( bold_italic_r ) - italic_w start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT - italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_I , italic_t end_POSTSUBSCRIPT ≥ roman_max { 0 , italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_E , italic_t end_POSTSUBSCRIPT ( bold_italic_r ) - italic_w start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( 1 - italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) - italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_E , italic_t end_POSTSUBSCRIPT } .

Rearranging this, ϕnsubscriptitalic-ϕ𝑛\phi_{n}italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT should be within:

ϕnmax(min(wqqI,2(𝒓)wppI,2,1+wq(qI,2(𝒓)qE,2(𝒓))wp(pI,2pE,2)2),1).subscriptitalic-ϕ𝑛subscript𝑤𝑞subscript𝑞𝐼2𝒓subscript𝑤𝑝subscript𝑝𝐼21subscript𝑤𝑞subscript𝑞𝐼2𝒓subscript𝑞𝐸2𝒓subscript𝑤𝑝subscript𝑝𝐼2subscript𝑝𝐸221\phi_{n}\leq\max\left(\min\left(w_{q}q_{I,2}(\boldsymbol{r})-w_{p}p_{I,2},% \frac{1+w_{q}(q_{I,2}(\boldsymbol{r})-q_{E,2}(\boldsymbol{r}))-w_{p}(p_{I,2}-p% _{E,2})}{2}\right),1\right).italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ≤ roman_max ( roman_min ( italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT ( bold_italic_r ) - italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT , divide start_ARG 1 + italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT ( italic_q start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT ( bold_italic_r ) - italic_q start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT ( bold_italic_r ) ) - italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT - italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT ) end_ARG start_ARG 2 end_ARG ) , 1 ) .

To choose E𝐸Eitalic_E, ϕnsubscriptitalic-ϕ𝑛\phi_{n}italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT should be high, reflecting a strong preference misalignment favoring E𝐸Eitalic_E despite its lower net utility. Similarly, we obtain:

ϕnmin(max(1wqqE,2(𝒓)+wppE,2,1+wq(qI,2(𝒓)qE,2(𝒓))wp(pI,2pE,2)2),0).subscriptitalic-ϕ𝑛1subscript𝑤𝑞subscript𝑞𝐸2𝒓subscript𝑤𝑝subscript𝑝𝐸21subscript𝑤𝑞subscript𝑞𝐼2𝒓subscript𝑞𝐸2𝒓subscript𝑤𝑝subscript𝑝𝐼2subscript𝑝𝐸220\phi_{n}\geq\min\left(\max\left(1-w_{q}q_{E,2}(\boldsymbol{r})+w_{p}p_{E,2},% \frac{1+w_{q}(q_{I,2}(\boldsymbol{r})-q_{E,2}(\boldsymbol{r}))-w_{p}(p_{I,2}-p% _{E,2})}{2}\right),0\right).italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ≥ roman_min ( roman_max ( 1 - italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT ( bold_italic_r ) + italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT , divide start_ARG 1 + italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT ( italic_q start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT ( bold_italic_r ) - italic_q start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT ( bold_italic_r ) ) - italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT - italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT ) end_ARG start_ARG 2 end_ARG ) , 0 ) .

If neither condition for I𝐼Iitalic_I nor E𝐸Eitalic_E is satisfied, user n𝑛nitalic_n will not purchase, thus choosing \emptyset.

Thus, we finish the proof. ∎

10 Proof of Proposition 1

Proof.

We need a useful mathematical result to prove Lemma 2. We state it as a lemma below.

Lemma 3.

There exists a Nash equilibrium if

  1. 1.

    The company set is finite.

  2. 2.

    The strategy space is closed, bounded, and convex.

  3. 3.

    The company profit functions are continuous and quasi-concave in their strategies.

One can easily check that Game 1 has a finite company set 𝒩={I,E}𝒩𝐼𝐸\mathcal{N}=\{I,E\}caligraphic_N = { italic_I , italic_E }. Note that even if each company’s strategy pi,t0subscript𝑝𝑖𝑡0p_{i,t}\geq 0italic_p start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT ≥ 0 is unbounded, we can narrow down the strategy space to [0,wqqi,t]0subscript𝑤𝑞subscript𝑞𝑖𝑡[0,w_{q}q_{i,t}][ 0 , italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT ], as any wqqi,tsubscript𝑤𝑞subscript𝑞𝑖𝑡w_{q}q_{i,t}italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT leads to a zero profit. In this case, the reduced strategy space for each company is closed, bounded, and convex. It remains to show that each company’s profit function is quasi-concave in pi,tsubscript𝑝𝑖𝑡p_{i,t}italic_p start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT. We begin by deriving the decision threshold ϕsuperscriptitalic-ϕ\phi^{*}italic_ϕ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT for a user to choose organization I over E. Given the user utility functions and assuming:

wqqI,2wϕϕwppI,2wqqE,2wϕ(1ϕ)wppE,2subscript𝑤𝑞subscript𝑞𝐼2subscript𝑤italic-ϕitalic-ϕsubscript𝑤𝑝subscript𝑝𝐼2subscript𝑤𝑞subscript𝑞𝐸2subscript𝑤italic-ϕ1italic-ϕsubscript𝑤𝑝subscript𝑝𝐸2w_{q}q_{I,2}-w_{\phi}\phi-w_{p}p_{I,2}\geq w_{q}q_{E,2}-w_{\phi}(1-\phi)-w_{p}% p_{E,2}italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT - italic_w start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT italic_ϕ - italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT ≥ italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT - italic_w start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( 1 - italic_ϕ ) - italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT

Rearranging the terms gives:

wϕ(12ϕ)wq(qI,2qE,2)wp(pI,2pE,2)subscript𝑤italic-ϕ12italic-ϕsubscript𝑤𝑞subscript𝑞𝐼2subscript𝑞𝐸2subscript𝑤𝑝subscript𝑝𝐼2subscript𝑝𝐸2w_{\phi}(1-2\phi)\leq w_{q}(q_{I,2}-q_{E,2})-w_{p}(p_{I,2}-p_{E,2})italic_w start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( 1 - 2 italic_ϕ ) ≤ italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT ( italic_q start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT - italic_q start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT ) - italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT - italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT )

Solving for ϕitalic-ϕ\phiitalic_ϕ, we find an indifference point:

ϕ=12(1+wq(qI,2qE,2)wp(pI,2pE,2)wϕ)superscriptitalic-ϕ121subscript𝑤𝑞subscript𝑞𝐼2subscript𝑞𝐸2subscript𝑤𝑝subscript𝑝𝐼2subscript𝑝𝐸2subscript𝑤italic-ϕ\phi^{*}=\frac{1}{2}\left(1+\frac{w_{q}(q_{I,2}-q_{E,2})-w_{p}(p_{I,2}-p_{E,2}% )}{w_{\phi}}\right)italic_ϕ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( 1 + divide start_ARG italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT ( italic_q start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT - italic_q start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT ) - italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT - italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_w start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT end_ARG )

The profit function WI,2subscript𝑊𝐼2W_{I,2}italic_W start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT can be written as:

WI,2=(pI,2cI)H(ϕ)subscript𝑊𝐼2subscript𝑝𝐼2subscript𝑐𝐼𝐻superscriptitalic-ϕW_{I,2}=(p_{I,2}-c_{I})H(\phi^{*})italic_W start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT = ( italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT - italic_c start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT ) italic_H ( italic_ϕ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT )

Taking the first derivative with respect to pI,2subscript𝑝𝐼2p_{I,2}italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT, we get:

WI,2pI,2=H(ϕ)+(pI,2cI)h(ϕ)ϕpI,2.subscript𝑊𝐼2subscript𝑝𝐼2𝐻superscriptitalic-ϕsubscript𝑝𝐼2subscript𝑐𝐼superscriptitalic-ϕsuperscriptitalic-ϕsubscript𝑝𝐼2\frac{\partial W_{I,2}}{\partial p_{I,2}}=H(\phi^{*})+(p_{I,2}-c_{I})h(\phi^{*% })\frac{\partial\phi^{*}}{\partial p_{I,2}}.divide start_ARG ∂ italic_W start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT end_ARG start_ARG ∂ italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT end_ARG = italic_H ( italic_ϕ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) + ( italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT - italic_c start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT ) italic_h ( italic_ϕ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) divide start_ARG ∂ italic_ϕ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG ∂ italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT end_ARG .

Now, taking the second derivative, we find:

2WI,2pI,22=(pI,2cI)(h(ϕ)(ϕpI,2)2+h(ϕ)2ϕpI,22)superscript2subscript𝑊𝐼2superscriptsubscript𝑝𝐼22subscript𝑝𝐼2subscript𝑐𝐼superscriptsuperscriptitalic-ϕsuperscriptsuperscriptitalic-ϕsubscript𝑝𝐼22superscriptitalic-ϕsuperscript2superscriptitalic-ϕsuperscriptsubscript𝑝𝐼22\frac{\partial^{2}W_{I,2}}{\partial p_{I,2}^{2}}=(p_{I,2}-c_{I})\left(h^{% \prime}(\phi^{*})\left(\frac{\partial\phi^{*}}{\partial p_{I,2}}\right)^{2}+h(% \phi^{*})\frac{\partial^{2}\phi^{*}}{\partial p_{I,2}^{2}}\right)divide start_ARG ∂ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_W start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT end_ARG start_ARG ∂ italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG = ( italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT - italic_c start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT ) ( italic_h start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_ϕ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ( divide start_ARG ∂ italic_ϕ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG ∂ italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_h ( italic_ϕ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) divide start_ARG ∂ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_ϕ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG ∂ italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG )

Given that ϕpI,20superscriptitalic-ϕsubscript𝑝𝐼20\frac{\partial\phi^{*}}{\partial p_{I,2}}\leq 0divide start_ARG ∂ italic_ϕ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG ∂ italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT end_ARG ≤ 0, 2ϕpI,22=0superscript2superscriptitalic-ϕsubscriptsuperscript𝑝2𝐼20\frac{\partial^{2}\phi^{*}}{\partial p^{2}_{I,2}}=0divide start_ARG ∂ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_ϕ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG ∂ italic_p start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT end_ARG = 0, h(ϕ)0superscriptsuperscriptitalic-ϕ0h^{\prime}(\phi^{*})\leq 0italic_h start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_ϕ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ≤ 0, we can see 2WI,2pI,220superscript2subscript𝑊𝐼2superscriptsubscript𝑝𝐼220\frac{\partial^{2}W_{I,2}}{\partial p_{I,2}^{2}}\leq 0divide start_ARG ∂ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_W start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT end_ARG start_ARG ∂ italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ≤ 0. Thus, WI,2subscript𝑊𝐼2W_{I,2}italic_W start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT is concave in pI,2subscript𝑝𝐼2p_{I,2}italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT.

A similar process can be applied for WE,2subscript𝑊𝐸2W_{E,2}italic_W start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT. Assuming the utility functions dictate user decisions based on:

wqqI,2wϕϕwppI,2wqqE,2wϕ(1ϕ)wppE,2,subscript𝑤𝑞subscript𝑞𝐼2subscript𝑤italic-ϕitalic-ϕsubscript𝑤𝑝subscript𝑝𝐼2subscript𝑤𝑞subscript𝑞𝐸2subscript𝑤italic-ϕ1italic-ϕsubscript𝑤𝑝subscript𝑝𝐸2w_{q}q_{I,2}-w_{\phi}\phi-w_{p}p_{I,2}\leq w_{q}q_{E,2}-w_{\phi}(1-\phi)-w_{p}% p_{E,2},italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT - italic_w start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT italic_ϕ - italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT ≤ italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT - italic_w start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( 1 - italic_ϕ ) - italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT ,

we can rearrange this inequality to find the cutoff value ϕsuperscriptitalic-ϕ\phi^{*}italic_ϕ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT where users switch preference from I to E:

wϕ(12ϕ)wq(qE,2qI,2)wp(pE,2pI,2),subscript𝑤italic-ϕ12italic-ϕsubscript𝑤𝑞subscript𝑞𝐸2subscript𝑞𝐼2subscript𝑤𝑝subscript𝑝𝐸2subscript𝑝𝐼2w_{\phi}(1-2\phi)\geq w_{q}(q_{E,2}-q_{I,2})-w_{p}(p_{E,2}-p_{I,2}),italic_w start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( 1 - 2 italic_ϕ ) ≥ italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT ( italic_q start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT - italic_q start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT ) - italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT - italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT ) ,
ϕ=12(1+wq(qE,2qI,2)wp(pE,2pI,2)wϕ).superscriptitalic-ϕ121subscript𝑤𝑞subscript𝑞𝐸2subscript𝑞𝐼2subscript𝑤𝑝subscript𝑝𝐸2subscript𝑝𝐼2subscript𝑤italic-ϕ\phi^{*}=\frac{1}{2}\left(1+\frac{w_{q}(q_{E,2}-q_{I,2})-w_{p}(p_{E,2}-p_{I,2}% )}{w_{\phi}}\right).italic_ϕ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( 1 + divide start_ARG italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT ( italic_q start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT - italic_q start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT ) - italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT - italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_w start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT end_ARG ) .

The profit function WE,2subscript𝑊𝐸2W_{E,2}italic_W start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT can then be expressed as:

WE,2=(pE,2cE)ϕ1h(ϕ)𝑑ϕ=(pE,2cE)(1H(ϕ)).subscript𝑊𝐸2subscript𝑝𝐸2subscript𝑐𝐸superscriptsubscriptsuperscriptitalic-ϕ1italic-ϕdifferential-ditalic-ϕsubscript𝑝𝐸2subscript𝑐𝐸1𝐻superscriptitalic-ϕW_{E,2}=(p_{E,2}-c_{E})\int_{\phi^{*}}^{1}h(\phi)d\phi=(p_{E,2}-c_{E})(1-H(% \phi^{*})).italic_W start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT = ( italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT - italic_c start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT ) ∫ start_POSTSUBSCRIPT italic_ϕ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT italic_h ( italic_ϕ ) italic_d italic_ϕ = ( italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT - italic_c start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT ) ( 1 - italic_H ( italic_ϕ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ) .

First, compute the first derivative:

WE,2pE,2=1H(ϕ)+(pE,2cE)h(ϕ)ϕpE,2.subscript𝑊𝐸2subscript𝑝𝐸21𝐻superscriptitalic-ϕsubscript𝑝𝐸2subscript𝑐𝐸superscriptitalic-ϕsuperscriptitalic-ϕsubscript𝑝𝐸2\frac{\partial W_{E,2}}{\partial p_{E,2}}=1-H(\phi^{*})+(p_{E,2}-c_{E})h(\phi^% {*})\frac{\partial\phi^{*}}{\partial p_{E,2}}.divide start_ARG ∂ italic_W start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT end_ARG start_ARG ∂ italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT end_ARG = 1 - italic_H ( italic_ϕ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) + ( italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT - italic_c start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT ) italic_h ( italic_ϕ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) divide start_ARG ∂ italic_ϕ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG ∂ italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT end_ARG .

indicating that as pE,2subscript𝑝𝐸2p_{E,2}italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT increases, ϕsuperscriptitalic-ϕ\phi^{*}italic_ϕ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT increases, reducing the interval over which E𝐸Eitalic_E is chosen.

Now, compute the second derivative:

2WE,2pE,22=(pE,2cE)(h(ϕ)(ϕpE,2)2+h(ϕ)2ϕpE,22),superscript2subscript𝑊𝐸2superscriptsubscript𝑝𝐸22subscript𝑝𝐸2subscript𝑐𝐸superscriptsuperscriptitalic-ϕsuperscriptsuperscriptitalic-ϕsubscript𝑝𝐸22superscriptitalic-ϕsuperscript2superscriptitalic-ϕsuperscriptsubscript𝑝𝐸22\frac{\partial^{2}W_{E,2}}{\partial p_{E,2}^{2}}=(p_{E,2}-c_{E})\left(h^{% \prime}(\phi^{*})\left(\frac{\partial\phi^{*}}{\partial p_{E,2}}\right)^{2}+h(% \phi^{*})\frac{\partial^{2}\phi^{*}}{\partial p_{E,2}^{2}}\right),divide start_ARG ∂ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_W start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT end_ARG start_ARG ∂ italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG = ( italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT - italic_c start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT ) ( italic_h start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_ϕ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ( divide start_ARG ∂ italic_ϕ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG ∂ italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_h ( italic_ϕ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) divide start_ARG ∂ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_ϕ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG ∂ italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ) ,

Note that h(ϕ)0superscriptsuperscriptitalic-ϕ0h^{\prime}(\phi^{*})\leq 0italic_h start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_ϕ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ≤ 0 and h(ϕ)0superscriptitalic-ϕ0h(\phi^{*})\geq 0italic_h ( italic_ϕ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ≥ 0, the terms involving derivatives of hhitalic_h indicate a negative value for the second derivative when the interval [ϕ,1]superscriptitalic-ϕ1[\phi^{*},1][ italic_ϕ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , 1 ] decreases as pE,2subscript𝑝𝐸2p_{E,2}italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT increases. Thus, WE,2subscript𝑊𝐸2W_{E,2}italic_W start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT is concave in pE,2subscript𝑝𝐸2p_{E,2}italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT.

Hence we finish the proof for Proposition 1. ∎

11 Proof of Corollary 1

Proof.

Part 1: Increasing Function of Its Own Quality qi,tsubscript𝑞𝑖𝑡q_{i,t}italic_q start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT

Assume for contradiction that as qi,tsubscript𝑞𝑖𝑡q_{i,t}italic_q start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT increases, pi,tsubscript𝑝𝑖𝑡p_{i,t}italic_p start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT decreases. If qi,tsubscript𝑞𝑖𝑡q_{i,t}italic_q start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT increases, this leads to increased user valuation for company i𝑖iitalic_i’s product, which increases the demand at any given price level. However, if pi,tsubscript𝑝𝑖𝑡p_{i,t}italic_p start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT decreases while qi,tsubscript𝑞𝑖𝑡q_{i,t}italic_q start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT increases, the company misses the opportunity to maximize its profit by leveraging the higher perceived value of a higher quality product. This contradicts profit maximization, thus our assumption must be false. Hence, pi,tsubscript𝑝𝑖𝑡p_{i,t}italic_p start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT must be an increasing function of qi,tsubscript𝑞𝑖𝑡q_{i,t}italic_q start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT.

Part 2: Decreasing Function of Competitor’s Quality qj,tsubscript𝑞𝑗𝑡q_{j,t}italic_q start_POSTSUBSCRIPT italic_j , italic_t end_POSTSUBSCRIPT

Assume for contradiction that as qj,tsubscript𝑞𝑗𝑡q_{j,t}italic_q start_POSTSUBSCRIPT italic_j , italic_t end_POSTSUBSCRIPT increases, pi,tsubscript𝑝𝑖𝑡p_{i,t}italic_p start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT increases. An increase in qj,tsubscript𝑞𝑗𝑡q_{j,t}italic_q start_POSTSUBSCRIPT italic_j , italic_t end_POSTSUBSCRIPT enhances the competitor j𝑗jitalic_j’s product attractiveness, likely diverting users away from i𝑖iitalic_i’s product, especially if pi,tsubscript𝑝𝑖𝑡p_{i,t}italic_p start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT also increases. This would lead to a decrease in the demand for i𝑖iitalic_i’s product and thus a reduction in its profits. To counterbalance the increase in qj,tsubscript𝑞𝑗𝑡q_{j,t}italic_q start_POSTSUBSCRIPT italic_j , italic_t end_POSTSUBSCRIPT and maintain competitiveness, rational behavior would dictate that i𝑖iitalic_i should decrease pi,tsubscript𝑝𝑖𝑡p_{i,t}italic_p start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT to retain market share and profit margins. Therefore, our assumption leads to a contradiction with the profit-maximizing behavior expected in competitive markets. Hence, pi,tsubscript𝑝𝑖𝑡p_{i,t}italic_p start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT must decrease as qj,tsubscript𝑞𝑗𝑡q_{j,t}italic_q start_POSTSUBSCRIPT italic_j , italic_t end_POSTSUBSCRIPT increases.

Thus, we conclude that the equilibrium pricing pi,tsubscript𝑝𝑖𝑡p_{i,t}italic_p start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT is an increasing function of its own quality qi,tsubscript𝑞𝑖𝑡q_{i,t}italic_q start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT and a decreasing function of the competitor’s quality qj,tsubscript𝑞𝑗𝑡q_{j,t}italic_q start_POSTSUBSCRIPT italic_j , italic_t end_POSTSUBSCRIPT. ∎

12 Proof of Proposition 2

Proof.

The proof of Proposition 2 is immediate given that in a two-company case, if any one of the company does not participate in FL, then FL ceases to occur. Hence, based on Definition 2, we know that the profile (1,1)11(1,1)( 1 , 1 ) is the NE of Game 2 if and only if i{I,E}for-all𝑖𝐼𝐸\forall i\in\left\{I,E\right\}∀ italic_i ∈ { italic_I , italic_E }, Wi,2((1,1),𝒑2((1,1),pI,1))Wi,2((0,0),𝒑2((0,0),pI,1)).subscript𝑊𝑖211superscriptsubscript𝒑211subscript𝑝𝐼1subscript𝑊𝑖200superscriptsubscript𝒑200subscript𝑝𝐼1W_{i,2}((1,1),\boldsymbol{p}_{2}^{*}((1,1),p_{I,1}))\geq W_{i,2}((0,0),% \boldsymbol{p}_{2}^{*}((0,0),p_{I,1})).italic_W start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT ( ( 1 , 1 ) , bold_italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( ( 1 , 1 ) , italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ) ) ≥ italic_W start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT ( ( 0 , 0 ) , bold_italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( ( 0 , 0 ) , italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ) ) .

13 Proof of Theorem 1

Proof.

We prove Theorem 1 by studying a concrete scenario using a uniform distribution of users’ preferences. It consists of three steps.

  1. 1.

    Uniform distribution satisfy the assumptions.

  2. 2.

    Use a particular value of pI,1subscript𝑝𝐼1p_{I,1}italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT and derive the users’ purchasing in period 1.

  3. 3.

    Construct accuracy scenarios that are consistent with Theorem 1.

1. For the uniform distribution, we know that h(θ)>0𝜃0h(\theta)>0italic_h ( italic_θ ) > 0 and is continuous:

  • h(ϕn)=1>0subscriptitalic-ϕ𝑛10h(\phi_{n})=1>0italic_h ( italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) = 1 > 0

  • The function h(ϕn)subscriptitalic-ϕ𝑛h(\phi_{n})italic_h ( italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) is constant over its domain, hence it is continuous.

Further, to show whether h(ϕn)1H(ϕn)subscriptitalic-ϕ𝑛1𝐻subscriptitalic-ϕ𝑛\frac{h(\phi_{n})}{1-H(\phi_{n})}divide start_ARG italic_h ( italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_ARG start_ARG 1 - italic_H ( italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_ARG is increasing in ϕnsubscriptitalic-ϕ𝑛\phi_{n}italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, we calculate

h(ϕn)1H(ϕn)=11ϕn,subscriptitalic-ϕ𝑛1𝐻subscriptitalic-ϕ𝑛11subscriptitalic-ϕ𝑛\frac{h(\phi_{n})}{1-H(\phi_{n})}=\frac{1}{1-\phi_{n}},divide start_ARG italic_h ( italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_ARG start_ARG 1 - italic_H ( italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_ARG = divide start_ARG 1 end_ARG start_ARG 1 - italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG ,

which increases in ϕnsubscriptitalic-ϕ𝑛\phi_{n}italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT. Hence, the uniform distribution satisfies Assumption 1.

2. We consider that pI,1wqqI,1/wpsubscript𝑝𝐼1subscript𝑤𝑞subscript𝑞𝐼1subscript𝑤𝑝p_{I,1}\geq w_{q}q_{I,1}/w_{p}italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ≥ italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT / italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT so that no users will buy in period 1.

3. Now we construct a scenario where local learning leads to qI,2(0,0)=0.72,qE,2(0,0)=0.73formulae-sequencesubscript𝑞𝐼2000.72subscript𝑞𝐸2000.73q_{I,2}(0,0)=0.72,q_{E,2}(0,0)=0.73italic_q start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT ( 0 , 0 ) = 0.72 , italic_q start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT ( 0 , 0 ) = 0.73 , while FL training leads to qI,2(1,1)=0.75,qE,2(1,1)=0.75formulae-sequencesubscript𝑞𝐼2110.75subscript𝑞𝐸2110.75q_{I,2}(1,1)=0.75,q_{E,2}(1,1)=0.75italic_q start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT ( 1 , 1 ) = 0.75 , italic_q start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT ( 1 , 1 ) = 0.75. This implies that FL improves both companies’ qualities. Now, consider wq=wp=wϕ=1,cI=CE=0formulae-sequencesubscript𝑤𝑞subscript𝑤𝑝subscript𝑤italic-ϕ1subscript𝑐𝐼subscript𝐶𝐸0w_{q}=w_{p}=w_{\phi}=1,c_{I}=C_{E}=0italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT = italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT = italic_w start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT = 1 , italic_c start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT = italic_C start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT = 0, and report the equilibrium profit (calculated using best response in Section 8 which according to [25] converges to NE.)

- rI=1subscript𝑟𝐼1r_{I}=1italic_r start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT = 1 rE=0subscript𝑟𝐸0r_{E}=0italic_r start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT = 0
rI=1subscript𝑟𝐼1r_{I}=1italic_r start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT = 1 WI,2=0.061,WE,2=0.0675formulae-sequencesubscript𝑊𝐼20.061subscript𝑊𝐸20.0675W_{I,2}=0.061,W_{E,2}=0.0675italic_W start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT = 0.061 , italic_W start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT = 0.0675 -
rE=0subscript𝑟𝐸0r_{E}=0italic_r start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT = 0 - WI,2=0.0624,WE,2=0.0624formulae-sequencesubscript𝑊𝐼20.0624subscript𝑊𝐸20.0624W_{I,2}=0.0624,W_{E,2}=0.0624italic_W start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT = 0.0624 , italic_W start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT = 0.0624

From this table, we see that even if participating in FL leads to higher accuracy for both qualities, the profit of both company is lower, resulting in non-participation as equilibrium.

Hence, we complete the proof for Theorem 1. ∎

14 Proof of Theorem 2

We aim to prove the concavity in three sub-problems (corresponding to different regions).

  1. 1.

    Case a: pI,1[0,(wqqI,11)/wp]subscript𝑝𝐼10subscript𝑤𝑞subscript𝑞𝐼11subscript𝑤𝑝p_{I,1}\in[0,(w_{q}q_{I,1}-1)/{w_{p}}]italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ∈ [ 0 , ( italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT - 1 ) / italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ]

  2. 2.

    Case b: pI,1[(wqqI,11)/wp,wqqI,1/wp]subscript𝑝𝐼1subscript𝑤𝑞subscript𝑞𝐼11subscript𝑤𝑝subscript𝑤𝑞subscript𝑞𝐼1subscript𝑤𝑝p_{I,1}\in[(w_{q}q_{I,1}-1)/{w_{p}},w_{q}q_{I,1}/{w_{p}}]italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ∈ [ ( italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT - 1 ) / italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT / italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ]

  3. 3.

    Case c: pI,1[wqqI,1/wp,]subscript𝑝𝐼1subscript𝑤𝑞subscript𝑞𝐼1subscript𝑤𝑝p_{I,1}\in[w_{q}q_{I,1}/{w_{p}},\infty]italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ∈ [ italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT / italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT , ∞ ]

1. We first prove Case a. When pI,1[0,(wqqI,11)/wp]subscript𝑝𝐼10subscript𝑤𝑞subscript𝑞𝐼11subscript𝑤𝑝p_{I,1}\in[0,(w_{q}q_{I,1}-1)/{w_{p}}]italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ∈ [ 0 , ( italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT - 1 ) / italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ], it is easy to show that all users will buy from company I𝐼Iitalic_I. Hence,

WI=WI,1(pI,1)=01(pI,1cI)𝟙dn,1=I(pI,1)h(ϕ)𝑑ϕ=pI,1cI,subscript𝑊𝐼subscript𝑊𝐼1subscript𝑝𝐼1superscriptsubscript01subscript𝑝𝐼1subscript𝑐𝐼subscript1subscript𝑑𝑛1𝐼subscript𝑝𝐼1italic-ϕdifferential-ditalic-ϕsubscript𝑝𝐼1subscript𝑐𝐼W_{I}=W_{I,1}(p_{I,1})=\int_{0}^{1}(p_{I,1}-c_{I})\cdot\mathbbm{1}_{d_{n,1}=I}% (p_{I,1})\cdot h(\phi)d\phi=p_{I,1}-c_{I},italic_W start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT = italic_W start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ) = ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT - italic_c start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT ) ⋅ blackboard_1 start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n , 1 end_POSTSUBSCRIPT = italic_I end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ) ⋅ italic_h ( italic_ϕ ) italic_d italic_ϕ = italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT - italic_c start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT ,

which is clearly concave in pI,1subscript𝑝𝐼1p_{I,1}italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT.

2. Now we prove case b. e analyze the concavity of WIsubscript𝑊𝐼W_{I}italic_W start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT by examining its components WI,1(pI,1)subscript𝑊𝐼1subscript𝑝𝐼1W_{I,1}(p_{I,1})italic_W start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ) and WI,2(𝒓,𝒑)subscript𝑊𝐼2𝒓𝒑W_{I,2}(\boldsymbol{r},\boldsymbol{p})italic_W start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT ( bold_italic_r , bold_italic_p ).

Analysis of WI,1(pI,1)subscript𝑊𝐼1subscript𝑝𝐼1W_{I,1}(p_{I,1})italic_W start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ): WI,1(pI,1)subscript𝑊𝐼1subscript𝑝𝐼1W_{I,1}(p_{I,1})italic_W start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ) is given by:

WI,1(pI,1)=01(pI,1cI)𝟙dn,1=I(pI,1)h(ϕ)𝑑ϕ,subscript𝑊𝐼1subscript𝑝𝐼1superscriptsubscript01subscript𝑝𝐼1subscript𝑐𝐼subscript1subscript𝑑𝑛1𝐼subscript𝑝𝐼1italic-ϕdifferential-ditalic-ϕW_{I,1}(p_{I,1})=\int_{0}^{1}(p_{I,1}-c_{I})\cdot\mathbbm{1}_{d_{n,1}=I}(p_{I,% 1})\cdot h(\phi)d\phi,italic_W start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ) = ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT - italic_c start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT ) ⋅ blackboard_1 start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n , 1 end_POSTSUBSCRIPT = italic_I end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ) ⋅ italic_h ( italic_ϕ ) italic_d italic_ϕ ,

where 𝟙dn,1=I(pI,1)=1subscript1subscript𝑑𝑛1𝐼subscript𝑝𝐼11\mathbbm{1}_{d_{n,1}=I}(p_{I,1})=1blackboard_1 start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n , 1 end_POSTSUBSCRIPT = italic_I end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ) = 1 if wppI,1wqqI,1subscript𝑤𝑝subscript𝑝𝐼1subscript𝑤𝑞subscript𝑞𝐼1w_{p}p_{I,1}\leq w_{q}q_{I,1}italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ≤ italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT, and 0 otherwise. This corresponds to a concave function.

Analysis of WI,2(𝒓,𝒑)subscript𝑊𝐼2𝒓𝒑W_{I,2}(\boldsymbol{r},\boldsymbol{p})italic_W start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT ( bold_italic_r , bold_italic_p ): WI,2(𝒓,𝒑)subscript𝑊𝐼2𝒓𝒑W_{I,2}(\boldsymbol{r},\boldsymbol{p})italic_W start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT ( bold_italic_r , bold_italic_p ) depends on pI,1subscript𝑝𝐼1p_{I,1}italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT linearly as pI,1subscript𝑝𝐼1p_{I,1}italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT only affects user range in period 2.

Combining the Functions: Since both components are concave and WI,2subscript𝑊𝐼2W_{I,2}italic_W start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT does not depend on pI,1subscript𝑝𝐼1p_{I,1}italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT, the sum WI(𝒓,𝒑)=WI,1(pI,1)+WI,2(𝒓,𝒑)subscript𝑊𝐼𝒓𝒑subscript𝑊𝐼1subscript𝑝𝐼1subscript𝑊𝐼2𝒓𝒑W_{I}(\boldsymbol{r},\boldsymbol{p})=W_{I,1}(p_{I,1})+W_{I,2}(\boldsymbol{r},% \boldsymbol{p})italic_W start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT ( bold_italic_r , bold_italic_p ) = italic_W start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ) + italic_W start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT ( bold_italic_r , bold_italic_p ) retains concavity with respect to pI,1subscript𝑝𝐼1p_{I,1}italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT, as the addition of a constant (or concave independent term) to a concave function does not affect its concavity. Hence, WIsubscript𝑊𝐼W_{I}italic_W start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT is concave in pI,1subscript𝑝𝐼1p_{I,1}italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT.

3. Now we prove case c. When pI,1wqqI,1/wpsubscript𝑝𝐼1subscript𝑤𝑞subscript𝑞𝐼1subscript𝑤𝑝p_{I,1}\geq w_{q}q_{I,1/w_{p}}italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT ≥ italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_I , 1 / italic_w start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT, no one buys in period 1, i.e., WI,1=0subscript𝑊𝐼10W_{I,1}=0italic_W start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT = 0. WI,2subscript𝑊𝐼2W_{I,2}italic_W start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT trivially does not depend on pI,1subscript𝑝𝐼1p_{I,1}italic_p start_POSTSUBSCRIPT italic_I , 1 end_POSTSUBSCRIPT. Hence, WIsubscript𝑊𝐼W_{I}italic_W start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT is also concave in this range.

Thus, we finish the proof of Theorem 2.

15 Best Response Algorithm

We provide a best response algorithm below to calculate the equilibrium of Game 1.

Algorithm 2 Best Response Algorithm for Price Competition Game
1:  Initialize pI,2(0)superscriptsubscript𝑝𝐼20p_{I,2}^{(0)}italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT and pE,2(0)superscriptsubscript𝑝𝐸20p_{E,2}^{(0)}italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT arbitrarily
2:  Set convergence tolerance ϵ>0italic-ϵ0\epsilon>0italic_ϵ > 0
3:  Set k0𝑘0k\leftarrow 0italic_k ← 0
4:  repeat
5:     kk+1𝑘𝑘1k\leftarrow k+1italic_k ← italic_k + 1
6:     Compute pI,2(k)superscriptsubscript𝑝𝐼2𝑘p_{I,2}^{(k)}italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT as:
pI,2(k)=argmaxpI,201(pI,2cI)𝟙dn,2=I(𝒓,𝒑)h(ϕ)𝑑ϕsuperscriptsubscript𝑝𝐼2𝑘subscriptsubscript𝑝𝐼2superscriptsubscript01subscript𝑝𝐼2subscript𝑐𝐼subscript1subscript𝑑𝑛2𝐼𝒓𝒑italic-ϕdifferential-ditalic-ϕp_{I,2}^{(k)}=\arg\max_{p_{I,2}}\int_{0}^{1}(p_{I,2}-c_{I})\cdot\mathbbm{1}_{d% _{n,2}=I}(\boldsymbol{r},\boldsymbol{p})\cdot h(\phi)d\phiitalic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT = roman_arg roman_max start_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT - italic_c start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT ) ⋅ blackboard_1 start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n , 2 end_POSTSUBSCRIPT = italic_I end_POSTSUBSCRIPT ( bold_italic_r , bold_italic_p ) ⋅ italic_h ( italic_ϕ ) italic_d italic_ϕ
7:     Compute pE,2(k)superscriptsubscript𝑝𝐸2𝑘p_{E,2}^{(k)}italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT as:
pE,2(k)=argmaxpE,201(pE,2cE)𝟙dn,2=E(𝒓,𝒑)h(ϕ)𝑑ϕsuperscriptsubscript𝑝𝐸2𝑘subscriptsubscript𝑝𝐸2superscriptsubscript01subscript𝑝𝐸2subscript𝑐𝐸subscript1subscript𝑑𝑛2𝐸𝒓𝒑italic-ϕdifferential-ditalic-ϕp_{E,2}^{(k)}=\arg\max_{p_{E,2}}\int_{0}^{1}(p_{E,2}-c_{E})\cdot\mathbbm{1}_{d% _{n,2}=E}(\boldsymbol{r},\boldsymbol{p})\cdot h(\phi)d\phiitalic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT = roman_arg roman_max start_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT - italic_c start_POSTSUBSCRIPT italic_E end_POSTSUBSCRIPT ) ⋅ blackboard_1 start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n , 2 end_POSTSUBSCRIPT = italic_E end_POSTSUBSCRIPT ( bold_italic_r , bold_italic_p ) ⋅ italic_h ( italic_ϕ ) italic_d italic_ϕ
8:     Check for convergence:
if |pI,2(k)pI,2(k1)|<ϵ and |pE,2(k)pE,2(k1)|<ϵ then stopif superscriptsubscript𝑝𝐼2𝑘superscriptsubscript𝑝𝐼2𝑘1italic-ϵ and superscriptsubscript𝑝𝐸2𝑘superscriptsubscript𝑝𝐸2𝑘1italic-ϵ then stop\text{if }|p_{I,2}^{(k)}-p_{I,2}^{(k-1)}|<\epsilon\text{ and }|p_{E,2}^{(k)}-p% _{E,2}^{(k-1)}|<\epsilon\text{ then stop}if | italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT - italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k - 1 ) end_POSTSUPERSCRIPT | < italic_ϵ and | italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT - italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k - 1 ) end_POSTSUPERSCRIPT | < italic_ϵ then stop
9:  until Convergence
10:  return pI,2(k),pE,2(k)superscriptsubscript𝑝𝐼2𝑘superscriptsubscript𝑝𝐸2𝑘p_{I,2}^{(k)},p_{E,2}^{(k)}italic_p start_POSTSUBSCRIPT italic_I , 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT , italic_p start_POSTSUBSCRIPT italic_E , 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT

References

  • Donahue and Kleinberg [2021] K. Donahue and J. Kleinberg. Optimality and stability in federated learning: A game-theoretic approach. Advances in Neural Information Processing Systems, 34:1287–1298, 2021.
  • Duffy and Hopkins [2005] J. Duffy and E. Hopkins. Learning, information, and sorting in market entry games: theory and evidence. Games and Economic behavior, 51(1):31–62, 2005.
  • Hotbllino [1929] H. Hotbllino. Stability in competition. The economic journal, 39(153):41–57, 1929.
  • Hotelling [1929] H. Hotelling. Stability in competition. The Economic Journal, 39(153):41–57, 1929. ISSN 00130133, 14680297. URL https://www.jstor.org/stable/2224214.
  • Hsu et al. [2019] T.-M. H. Hsu, H. Qi, and M. Brown. Measuring the effects of non-identical data distribution for federated visual classification. arXiv preprint arXiv:1909.06335, 2019.
  • Huang et al. [2022] C. Huang, S. Ke, C. Kamhoua, P. Mohapatra, and X. Liu. Incentivizing data contribution in cross-silo federated learning. arXiv preprint arXiv:2203.03885, 2022.
  • Huang et al. [2023a] C. Huang, M. Tang, Q. Ma, J. Huang, and X. Liu. Promoting collaborations in cross-silo federated learning: Challenges and opportunities. IEEE Communications Magazine, 2023a.
  • Huang et al. [2023b] C. Huang, H. Yu, J. Huang, and R. Berry. An online inference-aided incentive framework for information elicitation without verification. IEEE Journal on Selected Areas in Communications, 41(4):1167–1185, 2023b.
  • Huang et al. [2024] C. Huang, S. Ke, and X. Liu. Duopoly business competition in cross-silo federated learning. IEEE Transactions on Network Science and Engineering, 11(1):340–351, 2024.
  • Huang et al. [2021] Y. Huang, L. Chu, Z. Zhou, L. Wang, J. Liu, J. Pei, and Y. Zhang. Personalized cross-silo federated learning on non-iid data. In Proceedings of the AAAI conference on artificial intelligence, volume 35, pages 7865–7873, 2021.
  • Kairouz et al. [2021] P. Kairouz, H. B. McMahan, B. Avent, A. Bellet, M. Bennis, A. N. Bhagoji, K. Bonawitz, Z. Charles, G. Cormode, R. Cummings, et al. Advances and open problems in federated learning. Foundations and trends® in machine learning, 14(1–2):1–210, 2021.
  • Karimireddy et al. [2020] S. P. Karimireddy, S. Kale, M. Mohri, S. Reddi, S. Stich, and A. T. Suresh. Scaffold: Stochastic controlled averaging for federated learning. In International conference on machine learning, pages 5132–5143. PMLR, 2020.
  • Kremer et al. [2017] M. Kremer, B. Mantin, and A. Ovchinnikov. Dynamic pricing in the presence of myopic and strategic consumers: Theory and experiment. Production and Operations Management, 26(1):116–133, 2017.
  • Krizhevsky et al. [2009] A. Krizhevsky, G. Hinton, et al. Learning multiple layers of features from tiny images. 2009.
  • Li et al. [2021a] Q. Li, B. He, and D. Song. Model-contrastive federated learning. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 10713–10722, 2021a.
  • Li et al. [2020] T. Li, A. K. Sahu, M. Zaheer, M. Sanjabi, A. Talwalkar, and V. Smith. Federated optimization in heterogeneous networks. Proceedings of Machine learning and systems, 2:429–450, 2020.
  • Li et al. [2021b] T. Li, S. Hu, A. Beirami, and V. Smith. Ditto: Fair and robust federated learning through personalization. In International conference on machine learning, pages 6357–6368. PMLR, 2021b.
  • Li et al. [2023] Y. Li, X. Wang, R. Zeng, M. Yang, K. Li, M. Huang, and S. Dustdar. Varf: An incentive mechanism of cross-silo federated learning in mec. IEEE Internet of Things Journal, 2023.
  • Liu et al. [2022] K. Liu, S. Hu, S. Z. Wu, and V. Smith. On privacy and personalization in cross-silo federated learning. Advances in neural information processing systems, 35:5925–5940, 2022.
  • Ma et al. [2018] Q. Ma, B. Shou, J. Huang, and T. Başar. Dynamic pricing in the presence of participation-dependent social learning. In Proceedings of the Eighteenth ACM International Symposium on Mobile Ad Hoc Networking and Computing, pages 101–110, 2018.
  • Marfoq et al. [2020] O. Marfoq, C. Xu, G. Neglia, and R. Vidal. Throughput-optimal topology design for cross-silo federated learning. Advances in Neural Information Processing Systems, 33:19478–19487, 2020.
  • McMahan et al. [2017] B. McMahan, E. Moore, D. Ramage, S. Hampson, and B. A. y Arcas. Communication-efficient learning of deep networks from decentralized data. In Artificial intelligence and statistics, pages 1273–1282. PMLR, 2017.
  • Mendieta et al. [2022] M. Mendieta, T. Yang, P. Wang, M. Lee, Z. Ding, and C. Chen. Local learning matters: Rethinking data heterogeneity in federated learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 8397–8406, 2022.
  • Oh et al. [2022] J. Oh, S. Kim, and S.-Y. Yun. Fedbabu: Towards enhanced representation for federated image classification. Proc. ICLR, 2022.
  • Osborne and Pitchik [1987] M. J. Osborne and C. Pitchik. Equilibrium in hotelling’s model of spatial competition. Econometrica: Journal of the Econometric Society, pages 911–922, 1987.
  • Osborne and Rubinstein [1994] M. J. Osborne and A. Rubinstein. A course in game theory. MIT press, 1994.
  • Pillutla et al. [2022] K. Pillutla, K. Malik, A.-R. Mohamed, M. Rabbat, M. Sanjabi, and L. Xiao. Federated learning with partial model personalization. In International Conference on Machine Learning, pages 17716–17758. PMLR, 2022.
  • Qin et al. [2023] Z. Qin, S. Deng, M. Zhao, and X. Yan. Fedapen: Personalized cross-silo federated learning with adaptability to statistical heterogeneity. In Proc. ACM SIGKDD, pages 1954–1964, 2023.
  • Salop [1979] S. C. Salop. Monopolistic competition with outside goods. The Bell Journal of Economics, pages 141–156, 1979.
  • Slivkins et al. [2019] A. Slivkins et al. Introduction to multi-armed bandits. Foundations and Trends® in Machine Learning, 12(1-2):1–286, 2019.
  • Son et al. [2024] H. M. Son, M. H. Kim, T.-M. Chung, C. Huang, and X. Liu. Feduv: Uniformity and variance for heterogeneous federated learning. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2024.
  • Spann et al. [2015] M. Spann, M. Fischer, and G. J. Tellis. Skimming or penetration? strategic dynamic pricing for new products. Marketing Science, 34(2):235–249, 2015.
  • Stigler [1964] G. J. Stigler. A theory of oligopoly. Journal of political Economy, 72(1):44–61, 1964.
  • Tan et al. [2024] S. Tan, H. Cheng, X. Wu, H. Yu, T. He, Y. S. Ong, C. Wang, and X. Tao. Fedcompetitors: Harmonious collaboration in federated learning with competing participants. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 38, pages 15231–15239, 2024.
  • Tang and Wong [2021] M. Tang and V. W. Wong. An incentive mechanism for cross-silo federated learning: A public goods perspective. In IEEE INFOCOM 2021-IEEE Conference on Computer Communications, pages 1–10. IEEE, 2021.
  • Tirole [1988] J. Tirole. The theory of industrial organization. MIT press, 1988.
  • Tschandl et al. [2018] P. Tschandl, C. Rosendahl, and H. Kittler. The ham10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions. Scientific data, 5(1):1–9, 2018.
  • Tsoy and Konstantinov [2024] N. Tsoy and N. Konstantinov. Strategic data sharing between competitors. Advances in Neural Information Processing Systems, 36, 2024.
  • Woodworth et al. [2020] B. E. Woodworth, K. K. Patel, and N. Srebro. Minibatch vs local sgd for heterogeneous distributed learning. Advances in Neural Information Processing Systems, 33:6281–6292, 2020.
  • Wu and Yu [2022] X. Wu and H. Yu. Mars-fl: Enabling competitors to collaborate in federated learning. IEEE Transactions on Big Data, 2022.
  • Ye et al. [2023] M. Ye, X. Fang, B. Du, P. C. Yuen, and D. Tao. Heterogeneous federated learning: State-of-the-art and research challenges. ACM Computing Surveys, 56(3):1–44, 2023.
  • Zhang et al. [2020] C. Zhang, S. Li, J. Xia, W. Wang, F. Yan, and Y. Liu. {{\{{BatchCrypt}}\}}: Efficient homomorphic encryption for {{\{{Cross-Silo}}\}} federated learning. In USENIX annual technical conference, pages 493–506, 2020.
  • Zhang et al. [2022] N. Zhang, Q. Ma, and X. Chen. Enabling long-term cooperation in cross-silo federated learning: A repeated game perspective. IEEE Transactions on Mobile Computing, 2022.