Unveiling User Satisfaction and Creator Productivity Trade-Offs in Recommendation Platforms

\nameFan Yao1 \email[email protected] \AND\nameYiming Liao2 \email[email protected] \AND\nameJingzhou Liu2 \email[email protected] \AND\nameShaoliang Nie2 \email[email protected] \AND\nameQifan Wang2 \email[email protected] \AND\nameHaifeng Xu3 \email[email protected] \AND\nameHongning Wang1 \email[email protected]



\addr1Department of Computer Science, University of Virginia, USA
\addr2Meta, USA
\addr3Department of Computer Science, University of Chicago, USA
Abstract

On User-Generated Content (UGC) platforms, recommendation algorithms significantly impact creators’ motivation to produce content as they compete for algorithmically allocated user traffic. This phenomenon subtly shapes the volume and diversity of the content pool, which is crucial for the platform’s sustainability. In this work, we demonstrate, both theoretically and empirically, that a purely relevance-driven policy with low exploration strength boosts short-term user satisfaction but undermines the long-term richness of the content pool. In contrast, a more aggressive exploration policy may slightly compromise user satisfaction but promote higher content creation volume. Our findings reveal a fundamental trade-off between immediate user satisfaction and overall content production on UGC platforms. Building on this finding, we propose an efficient optimization method to identify the optimal exploration strength, balancing user and creator engagement. Our model can serve as a pre-deployment audit tool for recommendation algorithms on UGC platforms, helping to align their immediate objectives with sustainable, long-term goals.

1 Introduction

User-generated content (UGC) platforms have become an indispensable component of our daily lives (Bobadilla et al., 2013; Santos, 2022). Those platforms, including various social media (e.g., Facebook, Instagram), streaming services (e.g., YouTube, TikTok) and many more, count on algorithmic recommendation algorithms (Koren et al., 2009; Bobadilla et al., 2013) to help content consumers (i.e., users) navigate the vast ocean of content generated by creators. Unlike other content recommendation platforms such as Netflix and Spotify, user experience on UGC platforms critically relies on the active participation of creators (Zhuang et al., 2023), as the goal of enhancing user engagement is inherently linked to the abundancy and diversity of the content pool.

Recent studies have begun to explore how a platform’s algorithmic decisions, such as their employed recommendation algorithms and revenue sharing agreements, might influence the behavior of content creators and subsequently affect user welfare (Yao et al., 2023, 2024b, 2024c; Jagadeesan et al., 2024; Hron et al., 2022; Immorlica et al., 2024; Hu et al., 2023; Prasad et al., 2023). A common technique used in these works is to model the competition among creators who strive to establish their brand and comparative advantages by selecting topics that maximize the traffic or rewards from the platform. While these competition models offer valuable insights into how platforms’ interventions could lead to suboptimal outcomes in terms of content diversity and issues related to popularity bias, they ignore the dimension of content creation volume — an equally critical aspect of such competition dynamics. In fact, there are evidence suggesting that traffic received directly influences creator productivity (Hu et al., 2024; Zeng et al., 2024). Hu et al. (2024) found that creators whose content received boosted exposure significantly increased their video production without compromising quality on a leading video recommendation platform, and Zeng et al. (2024)’s field experiment on a large-scale video sharing social network showed that while popularity-based recommendation strategies boost content consumption, they can reduce content production. Similar effects has also been documented on Instagram. For instance, when the recommender system allocates more traffic to targeted groups of creators, their production frequency increases, as indicated by metrics such as the number of daily active creators and the daily average creation volume per group. However, disproportionately boosting traffic to certain creator groups can negatively impact others; for instance, directing more traffic to popular, or “head”, creators often diminishes engagement from less prominent, or “tail”, creators. And overall, a statistically significant positive correlation exists between content viewership and the corresponding creator’s productivity111Further details regarding the Instagram findings are protected under a non-disclosure agreement (NDA) and may be disclosed in future versions of this work..

Motivated by real-world evidence and the research gap in modeling production frequency within content creation competition, we introduce a new game-theoretical model, named Cournot Content Creation Competition (C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT), aiming to study the impact of a platform’s recommendation strategy on creators’ production willingness. Our C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT framework builds upon the Content Creator Competition (C3superscript𝐶3C^{3}italic_C start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT) framework introduced in Yao et al. (2023, 2024b, 2024c), assuming that creators compete for platform-allocated user traffic (Glotfelter, 2019; Hodgson, 2021). However, unlike C3superscript𝐶3C^{3}italic_C start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT and similar previous models, our approach models the competition where creators are aware of their expertise and consistently produce within their niche, but strategically adjust their production frequency to balance gain and cost, which is often observed on mature platforms such as YouTube and Instagram. Our proposed competition model resonates with the well-established Cournot competition (Cournot, 1838) in economics, where firms compete for revenue by strategically setting their production quantities. Hence, it inherits its name.

Our C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT framework offers a powerful tool for analyzing the competition dynamics among creators, as it always yields a unique Pure Nash equilibrium (PNE) that enables a precise prediction of the total content creation volume under any specific recommendation strategy. Furthermore, our in-depth analysis of C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT’s equilibrium reveals a critical and interesting insight: while increased recommendation accuracy boosts immediate user satisfaction, it simultaneously reduces creators’ motivation to produce content, potentially compromising long-term user engagement. This finding, supported by both theoretical analyses and simulations, suggests the necessity to balance user and creator engagement through a careful control over the recommendation algorithm’s exploration strength at a per-user basis. We formulated this mechanism design challenge as a bi-level optimization problem and tackled it using a projected gradient descent approach with an efficient gradient approximation scheme, providing an effective method to achieve the optimal trade-off between user satisfaction and creator productivity.

In summary, our contributions are threefold: (a) modeling-wise, we introduce a new game-theoretical framework, C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT, to investigate how recommendation algorithms affect content creation frequency among creators; (b) conceptually, we reveal a new insight that managing the exploration strength of the recommendation algorithm can balance between short-term user satisfaction and long-term creator engagement at equilibrium; and (c) technique-wise, we reformulate the mechanism design problem of identifying the optimal engagement trade-off at the equilibrium into a solvable offline optimization problem, tackled using approximated gradient descent. Our C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT framework and its derived solution serve as a pre-deployment audit tool for platforms, assessing the effects of algorithmic choices on creator and user engagement.

2 Related Work

The study of online content creation economy has captured the attention of machine learning community recently, leading to a diverse collection of models addressing the dynamics of content creator competition (Ghosh and McAfee, 2011; Ghosh and Hummel, 2013; Ben-Porat and Tennenholtz, 2018; Yao et al., 2024b, 2023; Zhu et al., 2023; Hu et al., 2023; Jagadeesan et al., 2024; Hron et al., 2022; Dean et al., 2024). In these models, creators strategically select the type (Xu et al., 2024), topic (Hron et al., 2022; Jagadeesan et al., 2024; Yao et al., 2023), or quality (Ghosh and McAfee, 2011; Hu et al., 2023) of their content, competing for resources such as traffic (Hron et al., 2022; Ben-Porat and Tennenholtz, 2017; Ghosh and Hummel, 2013), user engagement (Yao et al., 2023), or platform-provided incentives (Zhu et al., 2023; Yao et al., 2024b). Some models aim to explore the properties of creator-side equilibrium, investigating how creators specialize at equilibrium (Jagadeesan et al., 2024), the impact of creators’ strategic behaviors on social welfare (Yao et al., 2023), and the design of optimization methods for long-term welfare considering these behaviors (Ben-Porat and Tennenholtz, 2017, 2018; Yao et al., 2024b; Zhu et al., 2023; Hu et al., 2023; Immorlica et al., 2024; Mladenov et al., 2020). Unlike these works, which generally assume equally paced creation frequency among creators, our proposed C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT games consider scenarios where creators strategically control their creation quantities, allowing us to analyze how recommendation algorithms influence overall content creation volume at the equilibrium.

Our model of the platform’s recommendation algorithm draws inspiration from the proportional allocation concept in game theory, applicable to resource distribution (Caragiannis and Voudouris, 2016; Nguyen and Vojnovic, 2011) and contest design (Tullock, 1980). We assume that each user contributes a unit of traffic, which is allocated to creators based on both of their merit and effort (content creation frequency). This modeling is closely related to the Tullock contest (Tullock, 1980), also known as the lottery contest, where the probability of winning a fixed prize is proportionate to the effort expended relative to the total effort by all contestants. While the Nash equilibrium of the one-dimensional Tullock contest with homogeneous costs is well-understood (Ewerhart, 2015, 2017), our work extends this framework to include heterogeneous convex costs. A recent study (Yao et al., 2024a) also explored competition between human and Generative AI creators within a similar setup, examining its impact on total content creation volume. However, they did not address the influence of different recommendation algorithms, which we investigate in our work.

In a broader sense, our work contributes to a line of research evaluating the impact of recommender systems on individuals, specifically exploring how deployed algorithms shape user and content creator behavior (Dean et al., 2024; Cen et al., 2023, 2024; Lin et al., 2024; Dean and Morgenstern, 2022; Kalimeris et al., 2021; Eilat and Rosenfeld, 2023) and how we can design new algorithms to address these effects (Carroll et al., 2021; Yao et al., 2022a; Brantley et al., 2024; Yao et al., 2022b; Biyik et al., 2023; Prasad et al., 2023; Agarwal and Brown, 2023). Our study introduces a key insight: recommender algorithms optimized solely for user satisfaction can unintentionally reduce content creators’ willingness to engage, thereby impacting long-term user engagement. We address this challenge by proposing a solution that balances creator engagement and user satisfaction through imposing exploration strengths tailored to individual users.

3 The Cournot Content Creation Competition

In this section, we introduce the formulation of Cournot Content Creation Competition (C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT), which models the competition among creators for traffic on UGC platforms. This model considers the potential impact of the platform’s traffic reallocation mechanisms, such as their deployed recommendation algorithms, where creators strategically choose their production frequencies to optimize their allocated traffic. Each C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT game instance 𝒢𝒢\mathcal{G}caligraphic_G is characterized by a tuple (n,m,M,{ci}i=1n,{βi}i=1m)𝑛𝑚𝑀superscriptsubscriptsubscript𝑐𝑖𝑖1𝑛superscriptsubscriptsubscript𝛽𝑖𝑖1𝑚(n,m,M,\{c_{i}\}_{i=1}^{n},\{\beta_{i}\}_{i=1}^{m})( italic_n , italic_m , italic_M , { italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , { italic_β start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ). We detail each component of this tuple as follows:

  1. 1.

    Basic setups. There is a set of content creators denoted by [n]={1,,n}delimited-[]𝑛1𝑛[n]=\{1,\cdots,n\}[ italic_n ] = { 1 , ⋯ , italic_n }, and a set of users denoted by {uj}j=1msuperscriptsubscriptsubscript𝑢𝑗𝑗1𝑚\{u_{j}\}_{j=1}^{m}{ italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT. We assume each user j𝑗jitalic_j has a stable preference over creators and such relationship is captured by an n𝑛nitalic_n-by-m𝑚mitalic_m matrix M𝑀Mitalic_M with its (i,j)𝑖𝑗(i,j)( italic_i , italic_j )-th entry wij[0,1]subscript𝑤𝑖𝑗01w_{ij}\in[0,1]italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ∈ [ 0 , 1 ] denoting the strength of user j𝑗jitalic_j’s preference over creator i𝑖iitalic_i’s content. Each creator determines a production frequency xi0subscript𝑥𝑖subscriptabsent0x_{i}\in\mathbb{R}_{\geq 0}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT in a unit amount of time (e.g., one week/one month). For the purpose of our analysis, xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT can be interpreted interchangeably as either the production frequency or volume, provided there is no ambiguity222For the elegance of our theoretical analysis, we treat xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT as a continuous variable, although the key messages and insights of this paper are preserved if xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is discrete.. Follow the terminology of game theory literature, xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is referred to as the action or pure strategy of creator i𝑖iitalic_i. Each creator i𝑖iitalic_i is associated with a cost function cisubscript𝑐𝑖c_{i}italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, which characterizes the cost for creating content at frequency xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. We have two assumptions about cisubscript𝑐𝑖c_{i}italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT: 1. cisubscript𝑐𝑖c_{i}italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is increasing in xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and ci(xi)+subscript𝑐𝑖subscript𝑥𝑖c_{i}(x_{i})\rightarrow+\inftyitalic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) → + ∞ as xi+subscript𝑥𝑖x_{i}\rightarrow+\inftyitalic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT → + ∞, which reflects that content creation is always not free. 2. cisubscript𝑐𝑖c_{i}italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is convex in xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, indicating a non-decreasing marginal cost of improving production frequency.

  2. 2.

    Platform intervention. We assume that each user ujsubscript𝑢𝑗u_{j}italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT contributes a unit amount of traffic, and the platform redistributes the total user traffic based on a relevance-based recommendation algorithm that adheres to certain probabilistic principle. Specifically, the recommendation algorithm matches ujsubscript𝑢𝑗u_{j}italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT to each piece of content produced by creator i𝑖iitalic_i with a probability proportional to exp(βjσij)subscript𝛽𝑗subscript𝜎𝑖𝑗\exp(\beta_{j}\sigma_{ij})roman_exp ( italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ), where σij=wij+ϵijsubscript𝜎𝑖𝑗subscript𝑤𝑖𝑗subscriptitalic-ϵ𝑖𝑗\sigma_{ij}=w_{ij}+\epsilon_{ij}italic_σ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT + italic_ϵ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT represents the algorithm’s estimated relevance score. This score combines the true preference score wijsubscript𝑤𝑖𝑗w_{ij}italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT with independent Gaussian noise ϵij𝒩(0,σ2)similar-tosubscriptitalic-ϵ𝑖𝑗𝒩0superscript𝜎2\epsilon_{ij}\sim\mathcal{N}(0,\sigma^{2})italic_ϵ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ∼ caligraphic_N ( 0 , italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ). The parameter βj0subscript𝛽𝑗0\beta_{j}\geq 0italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≥ 0 governs the exploration of the matching for user ujsubscript𝑢𝑗u_{j}italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT: a higher βjsubscript𝛽𝑗\beta_{j}italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT results in a more precise matching, while a lower βjsubscript𝛽𝑗\beta_{j}italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT introduces more exploration into the matching results 333The randomness in matching results may stem from either the imperfect estimation of the preference score wijsubscript𝑤𝑖𝑗w_{ij}italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT or intentionally injected exploration strength based on the intervention mechanism. In this paper, we focus on the latter source of randomness and analyze how such exploration strength might affect the outcomes.. In this work, we analyze intervention mechanisms that operate under this Personalized Probabilistic Matching (PPM) principle, parameterized by 𝜷={βi}i=1m𝜷superscriptsubscriptsubscript𝛽𝑖𝑖1𝑚\bm{\beta}=\{\beta_{i}\}_{i=1}^{m}bold_italic_β = { italic_β start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT, and refer to it as PPM(𝜷𝜷\bm{\beta}bold_italic_β) for ease of notations.

  3. 3.

    Creator utility. Creators are reward-seeking individuals who try to maximize the expected traffic for their created content while carefully balancing the costs. Under PPM(𝜷𝜷\bm{\beta}bold_italic_β) and when each creator i𝑖iitalic_i produces xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT copies of content, the platform will allocate to i𝑖iitalic_i the amount of traffic from ujsubscript𝑢𝑗u_{j}italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT proportional to 𝔼ϵij𝒩(0,σ2)[xieβj(wij+ϵij)]=xieβjwijeσ2βj22subscript𝔼similar-tosubscriptitalic-ϵ𝑖𝑗𝒩0superscript𝜎2delimited-[]subscript𝑥𝑖superscript𝑒subscript𝛽𝑗subscript𝑤𝑖𝑗subscriptitalic-ϵ𝑖𝑗subscript𝑥𝑖superscript𝑒subscript𝛽𝑗subscript𝑤𝑖𝑗superscript𝑒superscript𝜎2superscriptsubscript𝛽𝑗22\mathbb{E}_{\epsilon_{ij}\sim\mathcal{N}(0,\sigma^{2})}\left[x_{i}e^{\beta_{j}% (w_{ij}+\epsilon_{ij})}\right]=x_{i}e^{\beta_{j}w_{ij}}\cdot e^{\frac{\sigma^{% 2}\beta_{j}^{2}}{2}}blackboard_E start_POSTSUBSCRIPT italic_ϵ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ∼ caligraphic_N ( 0 , italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) end_POSTSUBSCRIPT [ italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT + italic_ϵ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ] = italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ⋅ italic_e start_POSTSUPERSCRIPT divide start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT. Here we should note that xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT does not suggest the creator would create xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT pieces of identical content, but amount of content following his/her expertise. Therefore, we formulate creator i𝑖iitalic_i’s utility function as the following:

    ui(xi,𝒙i;𝜷)subscript𝑢𝑖subscript𝑥𝑖subscript𝒙𝑖𝜷\displaystyle u_{i}(x_{i},\bm{x}_{-i};\bm{\beta})italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , bold_italic_x start_POSTSUBSCRIPT - italic_i end_POSTSUBSCRIPT ; bold_italic_β ) =j=1m(𝔼[xieβjσij]k=1n𝔼[xkeβjσkj])ci(xi)absentsuperscriptsubscript𝑗1𝑚𝔼delimited-[]subscript𝑥𝑖superscript𝑒subscript𝛽𝑗subscript𝜎𝑖𝑗superscriptsubscript𝑘1𝑛𝔼delimited-[]subscript𝑥𝑘superscript𝑒subscript𝛽𝑗subscript𝜎𝑘𝑗subscript𝑐𝑖subscript𝑥𝑖\displaystyle=\sum_{j=1}^{m}\left(\frac{\mathbb{E}[x_{i}e^{\beta_{j}\sigma_{ij% }}]}{\sum_{k=1}^{n}\mathbb{E}[x_{k}e^{\beta_{j}\sigma_{kj}}]}\right)-c_{i}(x_{% i})= ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ( divide start_ARG blackboard_E [ italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ] end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT blackboard_E [ italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT italic_k italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ] end_ARG ) - italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT )
    =j=1m(xieβjwijk=1nxkeβjwkj)ci(xi),absentsuperscriptsubscript𝑗1𝑚subscript𝑥𝑖superscript𝑒subscript𝛽𝑗subscript𝑤𝑖𝑗superscriptsubscript𝑘1𝑛subscript𝑥𝑘superscript𝑒subscript𝛽𝑗subscript𝑤𝑘𝑗subscript𝑐𝑖subscript𝑥𝑖\displaystyle=\sum_{j=1}^{m}\left(\frac{x_{i}e^{\beta_{j}w_{ij}}}{\sum_{k=1}^{% n}x_{k}e^{\beta_{j}w_{kj}}}\right)-c_{i}(x_{i}),= ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ( divide start_ARG italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_k italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG ) - italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) , (1)

    where 𝒙i0n1subscript𝒙𝑖superscriptsubscriptabsent0𝑛1\bm{x}_{-i}\in\mathbb{R}_{\geq 0}^{n-1}bold_italic_x start_POSTSUBSCRIPT - italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT denotes the strategy profile of all creators except i𝑖iitalic_i.

The C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT game models the scenario where creators are aware of their expertise (i.e., what topic to create), and compete purely on creation quantity. This concept is akin to the extensively studied Cournot competition (Cournot, 1838) model in economics, where firms independently determine the output of homogeneous products at different costs. However, our model diverges from the classic Cournot competition in several key aspects. First of all, they have different revenue functions: in Cournot competition, the revenue for each firm is calculated as the product of price and production quantity, and the price only depends on all firms’ joint strategy. In contrast, in the C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT game, the gain of a creator not only relies on other creators’ decision but also hinges on the platform’s traffic allocation algorithm. In addition, while Cournot competition typically incorporates only linear cost functions, the C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT game accommodates general convex cost functions, offering a more nuanced reflection of the real cost faced by creators. These distinctions highlight the unique aspects of our model, while maintaining a conceptual link to traditional economic theories of competition.

Research questions:

Under the C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT framework, an important and natural research question is how we can predict creators’ strategic choices in the competition. This is a fundamental question in game theory and we employ the concept of Pure Nash Equilibrium (PNE) (Nash Jr, 1950) to characterize the outcome of C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT games. The definition of PNE is given by the following:

Definition 1

A joint strategy profile of all creators 𝐱=(x1,,xn)superscript𝐱subscriptsuperscript𝑥1subscriptsuperscript𝑥𝑛\bm{x}^{*}=(x^{*}_{1},\cdots,x^{*}_{n})bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = ( italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) forms a pure Nash equilibrium (PNE), if for every creator i𝑖iitalic_i, xisubscriptsuperscript𝑥𝑖x^{*}_{i}italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is a best response strategy that maximizes uisubscript𝑢𝑖u_{i}italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT given other creators’ strategy 𝐱isubscript𝐱𝑖\bm{x}_{-i}bold_italic_x start_POSTSUBSCRIPT - italic_i end_POSTSUBSCRIPT; formally,

ui(xi,𝒙i)ui(xi,𝒙i) for every xi0,i[n].formulae-sequencesubscript𝑢𝑖subscriptsuperscript𝑥𝑖subscriptsuperscript𝒙𝑖subscript𝑢𝑖subscript𝑥𝑖subscriptsuperscript𝒙𝑖 for every subscript𝑥𝑖subscriptabsent0for-all𝑖delimited-[]𝑛u_{i}(x^{*}_{i},\bm{x}^{*}_{-i})\geq u_{i}(x_{i},\bm{x}^{*}_{-i})\,\,\text{ % for every }x_{i}\in\mathbb{R}_{\geq 0},\forall i\in[n].italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - italic_i end_POSTSUBSCRIPT ) ≥ italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - italic_i end_POSTSUBSCRIPT ) for every italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT , ∀ italic_i ∈ [ italic_n ] . (2)

In other word, a PNE represents a stable state when everyone is satisfied with their strategies and does not want to deviate. As we will demonstrate in the subsequent section, a PNE always exists in C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT under any PPM(𝜷𝜷\bm{\beta}bold_italic_β) and can be computed efficiently. This finding forms the basis for our further theoretical analysis and empirical simulations.

In addition to the predictability and stability of creators’ production strategies, it is equally crucial for platform designers to develop metrics that encourage the prosperity of the content ecosystem. They must also devise algorithmic solutions to optimize these metrics, balancing the trade-off between engagement of users and creators, using the available “knob” PPM(𝜷𝜷\bm{\beta}bold_italic_β). We will explore these issues in the upcoming technical discussions.

4 The PNEs of C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT Games and Their Properties

As widely known, the PNE does not always exist (Debreu, 1952; Fan, 1952; Glicksberg, 1952). However, our first main result establishes that, under mild assumptions, C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT always admits a unique PNE.

Theorem 1

For any C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT instance 𝒢𝒢\mathcal{G}caligraphic_G(n,m,M,{ci}i=1n,𝛃)𝑛𝑚𝑀superscriptsubscriptsubscript𝑐𝑖𝑖1𝑛𝛃(n,m,M,\{c_{i}\}_{i=1}^{n},\bm{\beta})( italic_n , italic_m , italic_M , { italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , bold_italic_β ). If each cisubscript𝑐𝑖c_{i}italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is convex in xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, 𝒢𝒢\mathcal{G}caligraphic_G admits a unique PNE.

Note that the primary challenge in proving Theorem 1 is to show 𝒢𝒢\mathcal{G}caligraphic_G is a strictly monotone game, whereas the existence and uniqueness of PNE in such games is a classic result from Rosen (1965). Theorem 1 is interesting from multiple perspectives. First, it strictly generalizes previous equilibrium existence results in classic Tullock contest (Pérez-Castrillo and Verdier, 1992; Cornes and Hartley, 2005), which corresponds to the special case when m=1𝑚1m=1italic_m = 1 and w11==wn1subscript𝑤11subscript𝑤𝑛1w_{11}=\cdots=w_{n1}italic_w start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT = ⋯ = italic_w start_POSTSUBSCRIPT italic_n 1 end_POSTSUBSCRIPT. Second, the fact that C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT games are monotone is significant because it is well-known that the PNE of strictly monotone games can be found efficiently. For example, many natural multi-agent online learning dynamics such as mirror descent (Bravo et al., 2018), accelerated optimistic gradient (Cai and Zheng, 2023), and payoff-based learning (Tatarenko and Kamgarpour, 2020) guarantee the last-iterate convergence to the unique PNE in strictly monotone games, even when players have mere zeroth order feedback about their utility functions. These results suggest that the PNE of 𝒢𝒢\mathcal{G}caligraphic_G is achievable if all creators use a reasonable update rule in their strategies. This observation not only makes this equilibrium a plausible prediction of real-world competition but also paves the way to our simulation-based studies in our experiments, where we use multi-agent mirror descent with perfect gradient to numerically solve the PNE of C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT.

In addition to the existence and uniqueness properties, the following corollary characterizes the first-order characterization of 𝒢𝒢\mathcal{G}caligraphic_G’s PNE.

Corollary 1

The unique PNE 𝐱(𝛃)superscript𝐱𝛃\bm{x}^{*}(\bm{\beta})bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( bold_italic_β ) of any 𝒢𝒢\mathcal{G}caligraphic_G(n,m,M,{ci}i=1n,𝛃)𝑛𝑚𝑀superscriptsubscriptsubscript𝑐𝑖𝑖1𝑛𝛃(n,m,M,\{c_{i}\}_{i=1}^{n},\bm{\beta})( italic_n , italic_m , italic_M , { italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , bold_italic_β ) satisfies the following first-order condition:

uixi|𝒙=𝒙(β)=0,1in.formulae-sequenceevaluated-atsubscript𝑢𝑖subscript𝑥𝑖𝒙superscript𝒙𝛽01𝑖𝑛\frac{\partial u_{i}}{\partial x_{i}}\bigg{|}_{\bm{x}=\bm{x}^{*}(\beta)}=0,% \quad 1\leq i\leq n.divide start_ARG ∂ italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG ∂ italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG | start_POSTSUBSCRIPT bold_italic_x = bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( italic_β ) end_POSTSUBSCRIPT = 0 , 1 ≤ italic_i ≤ italic_n . (3)

Since we have already shown the existence and uniqueness of 𝒢𝒢\mathcal{G}caligraphic_G’s PNE, Corollary 1 follows immediately according to Definition 1. Corollary 1 is useful for establishing further properties of C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT games in Section 5.

5 The Trade-Off Between User and Creator Engagement

We have established that a unique PNE exists in any C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT game under any PPM(𝜷𝜷\bm{\beta}bold_italic_β) and can be naturally achieved by competing content creators. This raises a crucial question for platform designers: how should the quality of the PNE be evaluated? For any mature UGC platform, it is essential to balance user satisfaction, which is key to short-term prosperity, with creator engagement, which is crucial for long-term sustainability. Within our C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT framework, this requires the platform designer to generate matching results that not only guarantee high user satisfaction (by improving the average matching quality at PNE) but also stimulate substantial content creation volume (by encouraging creators to increase their production frequency xsuperscript𝑥x^{*}italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT at PNE). We define these two objectives as follows:

Definition 2

For any C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT instance 𝒢𝒢\mathcal{G}caligraphic_G(n,m,M,{ci}i=1n,𝛃)𝑛𝑚𝑀superscriptsubscriptsubscript𝑐𝑖𝑖1𝑛𝛃(n,m,M,\{c_{i}\}_{i=1}^{n},\bm{\beta})( italic_n , italic_m , italic_M , { italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , bold_italic_β ), let 𝐱(𝛃)=(x1,,xn)superscript𝐱𝛃superscriptsubscript𝑥1superscriptsubscript𝑥𝑛\bm{x}^{*}(\bm{\beta})=(x_{1}^{*},\cdots,x_{n}^{*})bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( bold_italic_β ) = ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , ⋯ , italic_x start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) be the unique PNE under PPM(𝛃𝛃\bm{\beta}bold_italic_β). Then the (short-term) total user satisfaction is defined as

U(𝒙(𝜷);𝜷)=j=1mi=1n(wijxieβjwijk=1nxkeβjwkj),𝑈superscript𝒙𝜷𝜷superscriptsubscript𝑗1𝑚superscriptsubscript𝑖1𝑛subscript𝑤𝑖𝑗subscriptsuperscript𝑥𝑖superscript𝑒subscript𝛽𝑗subscript𝑤𝑖𝑗superscriptsubscript𝑘1𝑛subscriptsuperscript𝑥𝑘superscript𝑒subscript𝛽𝑗subscript𝑤𝑘𝑗U(\bm{x}^{*}(\bm{\beta});\bm{\beta})=\sum_{j=1}^{m}\sum_{i=1}^{n}\left(\frac{w% _{ij}x^{*}_{i}e^{\beta_{j}w_{ij}}}{\sum_{k=1}^{n}x^{*}_{k}e^{\beta_{j}w_{kj}}}% \right),italic_U ( bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( bold_italic_β ) ; bold_italic_β ) = ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( divide start_ARG italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_k italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG ) , (4)

and the (long-term) total content creation volume is defined as

V(𝒙(𝜷))=i=1nxi.𝑉superscript𝒙𝜷superscriptsubscript𝑖1𝑛superscriptsubscript𝑥𝑖V(\bm{x}^{*}(\bm{\beta}))=\sum_{i=1}^{n}x_{i}^{*}.italic_V ( bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( bold_italic_β ) ) = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT . (5)

The social welfare of the whole system is measured by a linear combination of U𝑈Uitalic_U and V𝑉Vitalic_V, defined as

Wλ(𝒙(𝜷);𝜷)=U(𝒙(𝜷);𝜷)+λV(𝒙(𝜷);𝜷).subscript𝑊𝜆superscript𝒙𝜷𝜷𝑈superscript𝒙𝜷𝜷𝜆𝑉superscript𝒙𝜷𝜷W_{\lambda}(\bm{x}^{*}(\bm{\beta});\bm{\beta})=U(\bm{x}^{*}(\bm{\beta});\bm{% \beta})+\lambda V(\bm{x}^{*}(\bm{\beta});\bm{\beta}).italic_W start_POSTSUBSCRIPT italic_λ end_POSTSUBSCRIPT ( bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( bold_italic_β ) ; bold_italic_β ) = italic_U ( bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( bold_italic_β ) ; bold_italic_β ) + italic_λ italic_V ( bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( bold_italic_β ) ; bold_italic_β ) . (6)

If we denote πj(𝒙(𝜷),𝜷)=i=1n(wijxieβjwijk=1nxkeβjwkj)subscriptsuperscript𝜋𝑗superscript𝒙𝜷𝜷superscriptsubscript𝑖1𝑛subscript𝑤𝑖𝑗subscriptsuperscript𝑥𝑖superscript𝑒subscript𝛽𝑗subscript𝑤𝑖𝑗superscriptsubscript𝑘1𝑛subscriptsuperscript𝑥𝑘superscript𝑒subscript𝛽𝑗subscript𝑤𝑘𝑗\pi^{*}_{j}(\bm{x}^{*}(\bm{\beta}),\bm{\beta})=\sum_{i=1}^{n}\left(\frac{w_{ij% }x^{*}_{i}e^{\beta_{j}w_{ij}}}{\sum_{k=1}^{n}x^{*}_{k}e^{\beta_{j}w_{kj}}}\right)italic_π start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( bold_italic_β ) , bold_italic_β ) = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( divide start_ARG italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_k italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG ) as the indicator of an individual user’s satisfaction or utility, which is the expected matching scores of user j𝑗jitalic_j at the PNE under PPM(𝜷𝜷\bm{\beta}bold_italic_β). And the total user satisfaction measure U=j=1mπj𝑈superscriptsubscript𝑗1𝑚subscriptsuperscript𝜋𝑗U=\sum_{j=1}^{m}\pi^{*}_{j}italic_U = ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_π start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT is the accumulated user utility. We argue that U𝑈Uitalic_U primarily serves as a metric for short-term welfare evaluation, since it focuses solely on user satisfaction at a specific instance of matching outcomes but does not capture the dynamics of user engagement over time. This overlooks the crucial fact that sustained user engagement on a platform requires a continuous supply of relevant content, as users can hardly be satisfied by their previously consumed material. This limitation is also evident in U𝑈Uitalic_U’s mathematical formulation: its value remains unchanged with a rescaling of {xi}superscriptsubscript𝑥𝑖\{x_{i}^{*}\}{ italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT }, indicating that it fails to reflect changes in content volume or frequency that might affect long-term user engagement. On the other hand, the long-term prosperity of a UGC platform is fundamentally linked to the engagement of content creators. Therefore, we introduce the total content creation volume V𝑉Vitalic_V as an indicator of the long-term welfare of the platform.

For the platform designers, it is essential to develop metrics that balance both short-term and long-term considerations. Thus, we propose a hybrid social welfare metric, W𝑊Witalic_W, which combines U𝑈Uitalic_U and V𝑉Vitalic_V to reflect both user satisfaction and content supply sustainability. However, understanding the mechanisms to optimize U𝑈Uitalic_U and V𝑉Vitalic_V independently is critical. In the following sections, we will explore the optimal matching mechanisms tailored to the exclusive objectives of U𝑈Uitalic_U (short-term) and V𝑉Vitalic_V (long-term), and then present an efficient algorithm designed to optimize W𝑊Witalic_W.

Interestingly, our findings suggest that both U𝑈Uitalic_U and V𝑉Vitalic_V exhibit monotonicity with respect to the parameter 𝜷𝜷\bm{\beta}bold_italic_β, even in scenarios involving a homogeneous user population. This uniform behavior of U𝑈Uitalic_U and V𝑉Vitalic_V offers valuable insights into how the adjustment of exploration strength could potentially impact platform performance. Our forthcoming theorem formally characterizes these observations.

Theorem 2

Consider any C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT game with m=1𝑚1m=1italic_m = 1. If the elements of M=[w1,,wn]𝑀superscriptsubscript𝑤1subscript𝑤𝑛topM=[w_{1},\cdots,w_{n}]^{\top}italic_M = [ italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_w start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT are not identical, it holds that:

  1. 1.

    U(β)𝑈𝛽U(\beta)italic_U ( italic_β ) defined in Eq. (4) is strictly increasing in β𝛽\betaitalic_β.

  2. 2.

    V(β)𝑉𝛽V(\beta)italic_V ( italic_β ) defined in Eq. (5) is strictly decreasing in β[β0,+)𝛽subscript𝛽0\beta\in[\beta_{0},+\infty)italic_β ∈ [ italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , + ∞ ) for some β0>0subscript𝛽00\beta_{0}>0italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT > 0.

Theorem 2 conveys two significant insights. The first one, though perhaps unsurprising, reveals that improving matching accuracy corresponds to an increase in expected user satisfaction. Despite its intuitiveness, this is a strong observation because it holds without relying on any specific structural assumptions about creator cost functions. This means that regardless of the potential complexity in equilibrium structures due to creator costs, and even when the order of xisubscriptsuperscript𝑥𝑖x^{*}_{i}italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT does not align with a creator’s capability wisubscript𝑤𝑖w_{i}italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, the metric U𝑈Uitalic_U is still monotonically increasing with respect to β𝛽\betaitalic_β. The second insight may surprise some readers: it suggests that while keep increasing the matching accuracy motivates some creators to produce more content, it demotivates others, resulting in a net decrease in the overall volume of content creation. This finding illustrates an intrinsic trade-off between short-term matching accuracy and long-term content supply: strategies that enhance short-term user satisfaction can inadvertently reduce content creation frequency across creators. To the best of our knowledge, this result is novel and has not been discussed in similar studies.

Here is an intuitive explanation for why a large β𝛽\betaitalic_β diminishes creators’ willingness to produce content. As the traffic allocation becomes more deterministic, the marginal gain from increasing production frequency diminishes because the amount of traffic accrued is largely determined by the relevance score, rather than volume. In the extreme case where β+𝛽\beta\rightarrow+\inftyitalic_β → + ∞, only the most relevant creator captures all the user traffic, regardless of her production volume. Consequently, due to the presence of production costs, this creator, and others, will only sustain the minimum viable productivity. Conversely, in the other extreme scenario where β=0𝛽0\beta=0italic_β = 0, i.e., user traffic is distributed uniformly among creators irrespective of relevance, the gain for each creator depends solely on their production frequency, prompting a productivity arms race. Clearly, both extremes are suboptimal, but they effectively illustrate the rationale behind our theoretical findings.

Although in Theorem 2 we consider the game instance 𝒢𝒢\mathcal{G}caligraphic_G with m=1𝑚1m=1italic_m = 1 as a representative snapshot of how creators compete for a single unit of user traffic (i.e., homogeneous user population), extending the time frame to encompass a sequence of heterogeneous users suggests that the observed trade-off between U𝑈Uitalic_U and V𝑉Vitalic_V remains consistent. In our experiments, we will demonstrate this trade-off in broader settings through simulations, e.g. when m>1𝑚1m>1italic_m > 1 and with various complex user distributions.

The proof of Theorem 2, while delivering a clear message, is far from trivial. Since the dependencies of U𝑈Uitalic_U and V𝑉Vitalic_V on β𝛽\betaitalic_β are indirectly linked through xsuperscript𝑥x^{*}italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, which lacks a closed form, the derivation of their derivatives with respect to β𝛽\betaitalic_β necessitates the use of the implicit function theorem (Krantz and Parks, 2002) to articulate the derivative of xsuperscript𝑥x^{*}italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT with respect to β𝛽\betaitalic_β. This involves a complex matrix inverse, which we simplify using the Sherman–Morrison formula (Sherman, 1949) due to its structure being a diagonal matrix with a rank-one update. This proof technique not only supports our theorem but also inspires a novel first-order optimization approach to address the hybrid social welfare optimization discussed in Section 7. Detailed proofs are provided in Appendix A.2.

6 Finding the Optimal Trade-off through Optimization

Our theory thus far indicates that optimizing both user satisfaction U𝑈Uitalic_U and creator engagement V𝑉Vitalic_V is non-trivial, even when the user population is homogeneous, as achieving the optimal of U𝑈Uitalic_U and V𝑉Vitalic_V simultaneously is impossible. Consequently, an essential and intriguing question arises within any specific competitive environment C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT: how can we identify the optimal trade-off between these two factors by optimizing any given welfare metric Wλsubscript𝑊𝜆W_{\lambda}italic_W start_POSTSUBSCRIPT italic_λ end_POSTSUBSCRIPT?

Generally, the welfare metric W𝑊Witalic_W is influenced by three factors: the platform’s algorithmic recommendation policy PPM(𝜷)𝜷(\bm{\beta})( bold_italic_β ), the resulting content creation profile 𝒙superscript𝒙\bm{x}^{*}bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT at the PNE induced by 𝜷𝜷\bm{\beta}bold_italic_β, and the relevance matrix M𝑀Mitalic_M. Thus, we can formulate the resulting optimization problem (OP) as follows:

Findargmax𝜷0mWλ(𝒙(𝜷),𝜷)Findsubscript𝑚𝑎𝑥𝜷superscriptsubscriptabsent0𝑚subscript𝑊𝜆superscript𝒙𝜷𝜷\displaystyle\text{Find}\quad\arg\mathop{max}_{\bm{\beta}\in\mathbb{R}_{\geq 0% }^{m}}W_{\lambda}(\bm{x}^{*}(\bm{\beta}),\bm{\beta})Find roman_arg start_BIGOP italic_m italic_a italic_x end_BIGOP start_POSTSUBSCRIPT bold_italic_β ∈ blackboard_R start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_W start_POSTSUBSCRIPT italic_λ end_POSTSUBSCRIPT ( bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( bold_italic_β ) , bold_italic_β ) (7)
s.t.𝒙(𝜷)is the PNE of𝒢.\displaystyle s.t.\quad~{}~{}\bm{x}^{*}(\bm{\beta})~{}\text{is the PNE of}~{}% \text{{$\mathcal{G}$}{}}.italic_s . italic_t . bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( bold_italic_β ) is the PNE of caligraphic_G .

In general, OP (7) presents a formidable challenge, as solving for a PNE of a game is known to be difficult (Daskalakis et al., 2009). Fortunately, the nice structure of C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT allows us to utilize the implicit characterization of the PNE detailed in Corollary 1 to tackle OP (7) effectively. In the subsequent section, we demonstrate that the gradient of Wλsubscript𝑊𝜆W_{\lambda}italic_W start_POSTSUBSCRIPT italic_λ end_POSTSUBSCRIPT w.r.t. 𝜷𝜷\bm{\beta}bold_italic_β can be explicitly computed.

6.1 The Derivation of Exact Gradient

According to the chain rule, the first-order gradient of Wλsubscript𝑊𝜆W_{\lambda}italic_W start_POSTSUBSCRIPT italic_λ end_POSTSUBSCRIPT w.r.t. 𝜷𝜷\bm{\beta}bold_italic_β can be expressed as

dWλd𝜷=dU(𝒙(𝜷),𝜷)d𝜷+λdV(𝒙(𝜷))d𝜷=(U𝒙+λV𝒙)d𝒙d𝜷+U𝜷.𝑑subscript𝑊𝜆𝑑𝜷𝑑𝑈superscript𝒙𝜷𝜷𝑑𝜷𝜆𝑑𝑉superscript𝒙𝜷𝑑𝜷𝑈superscript𝒙𝜆𝑉superscript𝒙𝑑superscript𝒙𝑑𝜷𝑈𝜷\displaystyle\frac{dW_{\lambda}}{d\bm{\beta}}=\frac{dU(\bm{x}^{*}(\bm{\beta}),% \bm{\beta})}{d\bm{\beta}}+\lambda\frac{dV(\bm{x}^{*}(\bm{\beta}))}{d\bm{\beta}% }=\left(\frac{\partial U}{\partial\bm{x}^{*}}+\lambda\frac{\partial V}{% \partial\bm{x}^{*}}\right)\cdot\frac{d\bm{x}^{*}}{d\bm{\beta}}+\frac{\partial U% }{\partial\bm{\beta}}.divide start_ARG italic_d italic_W start_POSTSUBSCRIPT italic_λ end_POSTSUBSCRIPT end_ARG start_ARG italic_d bold_italic_β end_ARG = divide start_ARG italic_d italic_U ( bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( bold_italic_β ) , bold_italic_β ) end_ARG start_ARG italic_d bold_italic_β end_ARG + italic_λ divide start_ARG italic_d italic_V ( bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( bold_italic_β ) ) end_ARG start_ARG italic_d bold_italic_β end_ARG = ( divide start_ARG ∂ italic_U end_ARG start_ARG ∂ bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG + italic_λ divide start_ARG ∂ italic_V end_ARG start_ARG ∂ bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG ) ⋅ divide start_ARG italic_d bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG italic_d bold_italic_β end_ARG + divide start_ARG ∂ italic_U end_ARG start_ARG ∂ bold_italic_β end_ARG . (8)

The evaluation of the gradient of Wλsubscript𝑊𝜆W_{\lambda}italic_W start_POSTSUBSCRIPT italic_λ end_POSTSUBSCRIPT relies on the calculation of three vectors, U𝒙,V𝒙1×n𝑈superscript𝒙𝑉superscript𝒙superscript1𝑛\frac{\partial U}{\partial\bm{x}^{*}},\frac{\partial V}{\partial\bm{x}^{*}}\in% \mathbb{R}^{1\times n}divide start_ARG ∂ italic_U end_ARG start_ARG ∂ bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG , divide start_ARG ∂ italic_V end_ARG start_ARG ∂ bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG ∈ blackboard_R start_POSTSUPERSCRIPT 1 × italic_n end_POSTSUPERSCRIPT, and U𝜷1×m𝑈𝜷superscript1𝑚\frac{\partial U}{\partial\bm{\beta}}\in\mathbb{R}^{1\times m}divide start_ARG ∂ italic_U end_ARG start_ARG ∂ bold_italic_β end_ARG ∈ blackboard_R start_POSTSUPERSCRIPT 1 × italic_m end_POSTSUPERSCRIPT, as well as a Jacobian matrix d𝒙d𝜷n×m𝑑superscript𝒙𝑑𝜷superscript𝑛𝑚\frac{d\bm{x}^{*}}{d\bm{\beta}}\in\mathbb{R}^{n\times m}divide start_ARG italic_d bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG italic_d bold_italic_β end_ARG ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × italic_m end_POSTSUPERSCRIPT. The computations of U𝒙𝑈superscript𝒙\frac{\partial U}{\partial\bm{x}^{*}}divide start_ARG ∂ italic_U end_ARG start_ARG ∂ bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG, V𝒙𝑉superscript𝒙\frac{\partial V}{\partial\bm{x}^{*}}divide start_ARG ∂ italic_V end_ARG start_ARG ∂ bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG and U𝜷𝑈𝜷\frac{\partial U}{\partial\bm{\beta}}divide start_ARG ∂ italic_U end_ARG start_ARG ∂ bold_italic_β end_ARG are straightforward and computationally light, which position the main challenge as the computation of d𝒙d𝜷𝑑superscript𝒙𝑑𝜷\frac{d\bm{x}^{*}}{d\bm{\beta}}divide start_ARG italic_d bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG italic_d bold_italic_β end_ARG. Fortunately, the first-order characterization of 𝒙superscript𝒙\bm{x}^{*}bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT by Corollary 1 enables us to express the gradient of 𝒙superscript𝒙\bm{x}^{*}bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT w.r.t. 𝜷𝜷\bm{\beta}bold_italic_β using implicit function derivation (Krantz and Parks, 2002) as d𝒙d𝜷=(F𝒙)1F𝜷𝑑superscript𝒙𝑑𝜷superscript𝐹superscript𝒙1𝐹𝜷\frac{d\bm{x}^{*}}{d\bm{\beta}}=-\left(\frac{\partial F}{\partial\bm{x}^{*}}% \right)^{-1}\cdot\frac{\partial F}{\partial\bm{\beta}}divide start_ARG italic_d bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG italic_d bold_italic_β end_ARG = - ( divide start_ARG ∂ italic_F end_ARG start_ARG ∂ bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ⋅ divide start_ARG ∂ italic_F end_ARG start_ARG ∂ bold_italic_β end_ARG, where F(𝒙,𝜷)=(uixi)i=1n𝐹𝒙𝜷superscriptsubscriptsubscript𝑢𝑖subscript𝑥𝑖𝑖1𝑛F(\bm{x},\bm{\beta})=\left(\frac{\partial u_{i}}{\partial x_{i}}\right)_{i=1}^% {n}italic_F ( bold_italic_x , bold_italic_β ) = ( divide start_ARG ∂ italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG ∂ italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ) start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT is an n𝑛nitalic_n-valued function, and both F𝒙𝐹superscript𝒙\frac{\partial F}{\partial\bm{x}^{*}}divide start_ARG ∂ italic_F end_ARG start_ARG ∂ bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG and F𝜷𝐹𝜷\frac{\partial F}{\partial\bm{\beta}}divide start_ARG ∂ italic_F end_ARG start_ARG ∂ bold_italic_β end_ARG are matrices of dimensions n×n𝑛𝑛n\times nitalic_n × italic_n and n×m𝑛𝑚n\times mitalic_n × italic_m, respectively. The following proposition provides the exact formula for the gradient. The calculation is straightforward and we omit the detailed derivation.

Proposition 1

Let 𝐱=(x1,,xn)superscript𝐱superscriptsubscript𝑥1superscriptsubscript𝑥𝑛\bm{x}^{*}=(x_{1}^{*},\cdots,x_{n}^{*})bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , ⋯ , italic_x start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) be the PNE of 𝒢(n,m,M,{ci}i=1n,𝛃)𝒢𝑛𝑚𝑀superscriptsubscriptsubscript𝑐𝑖𝑖1𝑛𝛃\mathcal{G}(n,m,M,\{c_{i}\}_{i=1}^{n},\bm{\beta})caligraphic_G ( italic_n , italic_m , italic_M , { italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , bold_italic_β ). Then, the Jacobian matrix of 𝐱superscript𝐱\bm{x}^{*}bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT as a function of 𝛃𝛃\bm{\beta}bold_italic_β is

d𝒙d𝜷=(D+YZ)1Bn×m,𝑑superscript𝒙𝑑𝜷superscript𝐷𝑌superscript𝑍top1𝐵superscript𝑛𝑚\frac{d\bm{x}^{*}}{d\bm{\beta}}=\left(D+YZ^{\top}\right)^{-1}B\in\mathbb{R}^{n% \times m},divide start_ARG italic_d bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG italic_d bold_italic_β end_ARG = ( italic_D + italic_Y italic_Z start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_B ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × italic_m end_POSTSUPERSCRIPT , (9)

where D𝐷Ditalic_D is an n×n𝑛𝑛n\times nitalic_n × italic_n diagonal matrix given by

D=diag(c1′′+j=1mP1j2x12,,cn′′+j=1mPnj2xn2),𝐷diagsubscriptsuperscript𝑐′′1superscriptsubscript𝑗1𝑚subscriptsuperscript𝑃21𝑗subscriptsuperscript𝑥absent21subscriptsuperscript𝑐′′𝑛superscriptsubscript𝑗1𝑚subscriptsuperscript𝑃2𝑛𝑗subscriptsuperscript𝑥absent2𝑛D=\text{diag}\left(c^{\prime\prime}_{1}+\sum_{j=1}^{m}\frac{P^{2}_{1j}}{x^{*2}% _{1}},\cdots,c^{\prime\prime}_{n}+\sum_{j=1}^{m}\frac{P^{2}_{nj}}{x^{*2}_{n}}% \right),italic_D = diag ( italic_c start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_P start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_x start_POSTSUPERSCRIPT ∗ 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG , ⋯ , italic_c start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_P start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_x start_POSTSUPERSCRIPT ∗ 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG ) , (10)

and B,Y,Z𝐵𝑌𝑍B,Y,Zitalic_B , italic_Y , italic_Z are n×msuperscript𝑛𝑚\mathbb{R}^{n\times m}blackboard_R start_POSTSUPERSCRIPT italic_n × italic_m end_POSTSUPERSCRIPT matrices calculated as follows (1in,1jmformulae-sequence1𝑖𝑛1𝑗𝑚1\leq i\leq n,1\leq j\leq m1 ≤ italic_i ≤ italic_n , 1 ≤ italic_j ≤ italic_m):

Y=[Pij(12Pij)xi]ij,Z=[Pijxi]ij,B=[Pij(12Pij)xi(wijk=1nwkjPkj)]ij,formulae-sequence𝑌subscriptdelimited-[]subscript𝑃𝑖𝑗12subscript𝑃𝑖𝑗subscriptsuperscript𝑥𝑖𝑖𝑗formulae-sequence𝑍subscriptdelimited-[]subscript𝑃𝑖𝑗subscriptsuperscript𝑥𝑖𝑖𝑗𝐵subscriptdelimited-[]subscript𝑃𝑖𝑗12subscript𝑃𝑖𝑗subscriptsuperscript𝑥𝑖subscript𝑤𝑖𝑗superscriptsubscript𝑘1𝑛subscript𝑤𝑘𝑗subscript𝑃𝑘𝑗𝑖𝑗Y=\left[\frac{P_{ij}(1-2P_{ij})}{x^{*}_{i}}\right]_{ij},\quad Z=\left[\frac{P_% {ij}}{x^{*}_{i}}\right]_{ij},\quad B=\left[\frac{P_{ij}(1-2P_{ij})}{x^{*}_{i}}% \cdot\left(w_{ij}-\sum_{k=1}^{n}w_{kj}P_{kj}\right)\right]_{ij},italic_Y = [ divide start_ARG italic_P start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ) end_ARG start_ARG italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ] start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT , italic_Z = [ divide start_ARG italic_P start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ] start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT , italic_B = [ divide start_ARG italic_P start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ) end_ARG start_ARG italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ⋅ ( italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT - ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_k italic_j end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_k italic_j end_POSTSUBSCRIPT ) ] start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT , (11)

where ci′′subscriptsuperscript𝑐′′𝑖c^{\prime\prime}_{i}italic_c start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is the second-order derivative of creator i𝑖iitalic_i’s cost function, and Pij=xiexp(βjwij)k=1nxkexp(βjwkj)subscript𝑃𝑖𝑗subscriptsuperscript𝑥𝑖subscript𝛽𝑗subscript𝑤𝑖𝑗superscriptsubscript𝑘1𝑛subscriptsuperscript𝑥𝑘subscript𝛽𝑗subscript𝑤𝑘𝑗P_{ij}=\frac{x^{*}_{i}\exp(\beta_{j}w_{ij})}{\sum_{k=1}^{n}x^{*}_{k}\exp(\beta% _{j}w_{kj})}italic_P start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = divide start_ARG italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_exp ( italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT roman_exp ( italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_k italic_j end_POSTSUBSCRIPT ) end_ARG is the probability that creator i𝑖iitalic_i is matched with user j𝑗jitalic_j at the PNE under PPM(𝛃)𝛃(\bm{\beta})( bold_italic_β ).

6.2 Optimization with Approximated Gradients

Proposition 1 together with Eq. (8) offers us a possibility to directly apply gradient-based approaches for solving OP (7). However, the gradient computation requires the inversion of an n×n𝑛𝑛n\times nitalic_n × italic_n matrix, whose time complexity is O(n3)𝑂superscript𝑛3O(n^{3})italic_O ( italic_n start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) and thus too cumbersome. To reduce the computational burden, we propose to approximately compute the gradient using the Sherman–Morrison-Woodbury formula (Sherman, 1949) to approximate the matrix inverse, inspired by the specific structure of the RHS of Eq. (9). According to Sherman–Morrison-Woodbury formula, it holds that

(D+YZ)1=D1D1Y(I+ZD1Y)1ZD1,superscript𝐷𝑌superscript𝑍top1superscript𝐷1superscript𝐷1𝑌superscript𝐼superscript𝑍topsuperscript𝐷1𝑌1superscript𝑍topsuperscript𝐷1(D+YZ^{\top})^{-1}=D^{-1}-D^{-1}Y\left(I+Z^{\top}D^{-1}Y\right)^{-1}Z^{\top}D^% {-1},( italic_D + italic_Y italic_Z start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT = italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT - italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_Y ( italic_I + italic_Z start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_Y ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_Z start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT , (12)

and the computation of the RHS of Eq. (12) now requires a time complexity of O(n2m+nm2+m3)𝑂superscript𝑛2𝑚𝑛superscript𝑚2superscript𝑚3O(n^{2}m+nm^{2}+m^{3})italic_O ( italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_m + italic_n italic_m start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_m start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ). However, the size of the user population m𝑚mitalic_m in practical scenarios is often even larger than n𝑛nitalic_n. To efficiently compute the RHS of Eq. (12), we propose a method to “sketch” the matrices Y𝑌Yitalic_Y and Z𝑍Zitalic_Z by sampling a subset of users. Initially, each column of Y𝑌Yitalic_Y and Z𝑍Zitalic_Z corresponds to a user index j𝑗jitalic_j. We begin by sampling a sub-population of 𝒳𝒳\mathcal{X}caligraphic_X, indexed by \mathcal{I}caligraphic_I, with ||=m~=[δm]~𝑚delimited-[]𝛿𝑚|\mathcal{I}|=\tilde{m}=[\delta m]| caligraphic_I | = over~ start_ARG italic_m end_ARG = [ italic_δ italic_m ], where δ(0,1]𝛿01\delta\in(0,1]italic_δ ∈ ( 0 , 1 ] denotes the sampling rate. With this sampled index set \mathcal{I}caligraphic_I, we construct matrices Y~,Z~n×m~𝑌~𝑍superscript𝑛𝑚\tilde{Y},\tilde{Z}\in\mathbb{R}^{n\times m}over~ start_ARG italic_Y end_ARG , over~ start_ARG italic_Z end_ARG ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × italic_m end_POSTSUPERSCRIPT, where the (i,j)𝑖𝑗(i,j)( italic_i , italic_j )-th entries are defined as follows:

Y~ij=Pij(12Pij)xi,Z~ij=Pijxi,formulae-sequencesubscript~𝑌𝑖𝑗subscript𝑃𝑖superscript𝑗12subscript𝑃𝑖superscript𝑗subscriptsuperscript𝑥𝑖subscript~𝑍𝑖𝑗subscript𝑃𝑖superscript𝑗subscriptsuperscript𝑥𝑖\tilde{Y}_{ij}=\frac{P_{ij^{\prime}}(1-2P_{ij^{\prime}})}{x^{*}_{i}},\quad% \tilde{Z}_{ij}=\frac{P_{ij^{\prime}}}{x^{*}_{i}},over~ start_ARG italic_Y end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = divide start_ARG italic_P start_POSTSUBSCRIPT italic_i italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) end_ARG start_ARG italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG , over~ start_ARG italic_Z end_ARG start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = divide start_ARG italic_P start_POSTSUBSCRIPT italic_i italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG , (13)

with jsuperscript𝑗j^{\prime}italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT being uniformly sampled from \mathcal{I}caligraphic_I. Given that Y~,Z~~𝑌~𝑍\tilde{Y},\tilde{Z}over~ start_ARG italic_Y end_ARG , over~ start_ARG italic_Z end_ARG now possess reduced ranks of [δm]delimited-[]𝛿𝑚[\delta m][ italic_δ italic_m ], the computational complexity of evaluating (D+Y~Z~)1superscript𝐷~𝑌superscript~𝑍top1(D+\tilde{Y}\tilde{Z}^{\top})^{-1}( italic_D + over~ start_ARG italic_Y end_ARG over~ start_ARG italic_Z end_ARG start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT is significantly lowered to O(n2m~+nm~2+m~3)𝑂superscript𝑛2~𝑚𝑛superscript~𝑚2superscript~𝑚3O(n^{2}\tilde{m}+n\tilde{m}^{2}+\tilde{m}^{3})italic_O ( italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT over~ start_ARG italic_m end_ARG + italic_n over~ start_ARG italic_m end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + over~ start_ARG italic_m end_ARG start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ). Algorithm 1 describes the steps for addressing OP (7).

Algorithm 1 Approximated Gradient Descent for Solving OP (7).
  Input: The environment specified by 𝒢𝒢\mathcal{G}caligraphic_G, maximum iteration number T𝑇Titalic_T, sample rate δ𝛿\deltaitalic_δ, learning rate η𝜂\etaitalic_η, initial mechanism PPM(𝜷𝜷\bm{\beta}bold_italic_β).
  for t[T]𝑡delimited-[]𝑇t\in[T]italic_t ∈ [ italic_T ] do
     Find the PNE 𝒙superscript𝒙\bm{x}^{*}bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT of 𝒢𝒢\mathcal{G}caligraphic_G under PPM(𝜷𝜷\bm{\beta}bold_italic_β) using Algorithm 2,
     Uniformly sample [δm]delimited-[]𝛿𝑚[\delta m][ italic_δ italic_m ] users from 𝒳𝒳\mathcal{X}caligraphic_X and use them to compute matrices Y~,Z~~𝑌~𝑍\tilde{Y},\tilde{Z}over~ start_ARG italic_Y end_ARG , over~ start_ARG italic_Z end_ARG in Eq. (13),
     Compute the approximated gradient dWλd𝜷𝑑subscript𝑊𝜆𝑑𝜷\frac{dW_{\lambda}}{d\bm{\beta}}divide start_ARG italic_d italic_W start_POSTSUBSCRIPT italic_λ end_POSTSUBSCRIPT end_ARG start_ARG italic_d bold_italic_β end_ARG using (8),(9),(12) with sketched matrices Y~,Z~~𝑌~𝑍\tilde{Y},\tilde{Z}over~ start_ARG italic_Y end_ARG , over~ start_ARG italic_Z end_ARG,
     Update 𝜷=𝜷+ηdWλd𝜷.𝜷𝜷𝜂𝑑subscript𝑊𝜆𝑑𝜷\bm{\beta}=\bm{\beta}+\eta\frac{dW_{\lambda}}{d\bm{\beta}}.bold_italic_β = bold_italic_β + italic_η divide start_ARG italic_d italic_W start_POSTSUBSCRIPT italic_λ end_POSTSUBSCRIPT end_ARG start_ARG italic_d bold_italic_β end_ARG .

Algorithm 1 requires solving for the PNE of 𝒢𝒢\mathcal{G}caligraphic_G each time when 𝜷𝜷\bm{\beta}bold_italic_β is updated. To accomplish this, we employ the multi-agent mirror descent method, as proposed in Bravo et al. (2018) and detailed in Algorithm 2 in Appendix, to serve as a subroutine444In Bravo et al. (2018), the algorithm is guaranteed to converge to PNE under zeroth order feedback. Here we use the perfect gradient as input and thus the convergence is also guaranteed.. To accelerate the convergence of Algorithm 1, the PNE strategy 𝒙superscript𝒙\bm{x}^{*}bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT obtained under the previous 𝜷𝜷\bm{\beta}bold_italic_β is used as the initial strategy for computing the new PNE after updating 𝜷𝜷\bm{\beta}bold_italic_β. Further implementation details are provided in the experiment section.

7 Experiments

To validate our theoretical findings and demonstrate the performance of Algorithm 1, we conduct simulations on instances of 𝒢𝒢\mathcal{G}caligraphic_G constructed from both synthetic data and the MovieLens-1m dataset (Harper and Konstan, 2015). In our experiments, Algorithm 2 is employed to solve the PNE for each instance of 𝒢𝒢\mathcal{G}caligraphic_G. Below, we first outline the specifications of these two simulation environments and then present our results.

Synthetic environment For the synthetic environment, we construct the user population 𝒳𝒳\mathcal{X}caligraphic_X by setting an embedding dimension d=32𝑑32d=32italic_d = 32 and independently sampling 50505050 cluster centers, denoted as {𝐜1,,𝐜50}subscript𝐜1subscript𝐜50\{\mathbf{c}_{1},\dots,\mathbf{c}_{50}\}{ bold_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , bold_c start_POSTSUBSCRIPT 50 end_POSTSUBSCRIPT }, from the unit sphere 𝕊d1superscript𝕊𝑑1\mathbb{S}^{d-1}blackboard_S start_POSTSUPERSCRIPT italic_d - 1 end_POSTSUPERSCRIPT. For each center 𝐜isubscript𝐜𝑖\mathbf{c}_{i}bold_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, users belonging to cluster-i𝑖iitalic_i are generated by sampling independently from a Gaussian distribution 𝒩(𝐜i,0.52𝐈d)𝒩subscript𝐜𝑖superscript0.52subscript𝐈𝑑\mathcal{N}(\mathbf{c}_{i},0.5^{2}\mathbf{I}_{d})caligraphic_N ( bold_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , 0.5 start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_I start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ). The sizes of the 50505050 user clusters are determined uniformly at random, ensuring the total size of 𝒳𝒳\mathcal{X}caligraphic_X is m=1000𝑚1000m=1000italic_m = 1000. Similarly, n=200𝑛200n=200italic_n = 200 creators are generated, and the relevance matrix Mn×m𝑀superscript𝑛𝑚M\in\mathbb{R}^{n\times m}italic_M ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × italic_m end_POSTSUPERSCRIPT is defined by the dot product between each user-creator pair, which are then normalized to the range [0,1]01[0,1][ 0 , 1 ]. This synthetic dataset encapsulates a class of clustered user and creator preference distributions. On the creators’ side, their cost functions are set to ci(x)=cixρsubscript𝑐𝑖𝑥subscript𝑐𝑖superscript𝑥𝜌c_{i}(x)=c_{i}x^{\rho}italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x ) = italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT italic_ρ end_POSTSUPERSCRIPT, with the default ρ=1.5𝜌1.5\rho=1.5italic_ρ = 1.5. The marginal costs {ci}i=1nsuperscriptsubscriptsubscript𝑐𝑖𝑖1𝑛\{c_{i}\}_{i=1}^{n}{ italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT are randomly sampled from a uniform distribution 𝒰[0.1,0.5]𝒰0.10.5\mathcal{U}[0.1,0.5]caligraphic_U [ 0.1 , 0.5 ].

Environment constructed from MovieLens-1m dataset We use deep matrix factorization (Fan and Cheng, 2018) to train user and movie embeddings (with dimension set to 32323232) by fitting the observed ratings in the range of 1 to 5. To ensure the quality of the trained embeddings, we performed a 5-fold cross-validation and obtained an averaged RMSE===0.739 on the test sets. With the same hyper-parameter, we train the user/item embeddings with the complete dataset. We randomly select m=1000𝑚1000m=1000italic_m = 1000 user embeddings to construct the population 𝒳𝒳\mathcal{X}caligraphic_X and n=200𝑛200n=200italic_n = 200 movie embeddings as the creator profiles. Similarly, The relevance matrix Mn×m𝑀superscript𝑛𝑚M\in\mathbb{R}^{n\times m}italic_M ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × italic_m end_POSTSUPERSCRIPT is given by the dot product between each user-creator pair normalized to [0,1]01[0,1][ 0 , 1 ] and creators’ cost functions are the same as we specified in the synthetic environment.

7.1 The Empirical Trade-Offs Between U𝑈Uitalic_U and V𝑉Vitalic_V

Figure 1 illustrates the content creation frequency xisuperscriptsubscript𝑥𝑖x_{i}^{*}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, user utility πjsuperscriptsubscript𝜋𝑗\pi_{j}^{*}italic_π start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, and their corresponding aggregated values U=jπj𝑈subscript𝑗superscriptsubscript𝜋𝑗U=\sum_{j}\pi_{j}^{*}italic_U = ∑ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_π start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, V=ixi𝑉subscript𝑖superscriptsubscript𝑥𝑖V=\sum_{i}x_{i}^{*}italic_V = ∑ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT under the PNE induced by different homogeneous β𝛽\betaitalic_β (i.e., all users share the same β𝛽\betaitalic_β). The result in the right panel shows that a larger β𝛽\betaitalic_β enhances overall user satisfaction U𝑈Uitalic_U but undermines total content creation V𝑉Vitalic_V. As β𝛽\betaitalic_β increases, the drop in V𝑉Vitalic_V becomes more significant. This empirical finding supports Theorem 2 and suggests it holds under broader settings without the assumptions on creator cost function and user population structure. The left and middle plots illustrate each creator i𝑖iitalic_i’s creation frequency xisuperscriptsubscript𝑥𝑖x_{i}^{*}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT and each user j𝑗jitalic_j’s utility πjsubscriptsuperscript𝜋𝑗\pi^{*}_{j}italic_π start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT at the PNE, such that both xisuperscriptsubscript𝑥𝑖x_{i}^{*}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT and πjsubscriptsuperscript𝜋𝑗\pi^{*}_{j}italic_π start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT are rearranged in descending order. They show that when β𝛽\betaitalic_β is shared across all users, its change affects xisubscriptsuperscript𝑥𝑖x^{*}_{i}italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and πjsubscriptsuperscript𝜋𝑗\pi^{*}_{j}italic_π start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT in the same direction.

Refer to caption
Refer to caption
Refer to caption
Figure 1: The left and the middle panel: the empirical distributions of content creation frequency xisuperscriptsubscript𝑥𝑖x_{i}^{*}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT and each user’s individual utility πjsuperscriptsubscript𝜋𝑗\pi_{j}^{*}italic_π start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT. Different colors represent results for PNEs induced by different β𝛽\betaitalic_β. Right: the total content creation V𝑉Vitalic_V and total user satisfaction U𝑈Uitalic_U obtained under different β𝛽\betaitalic_β. Error bars obtained from 10 independently generated environments.

7.2 The Optimal PPM(β)𝛽(\beta)( italic_β ) Found by Algorithm 1

Next, we use Algorithm 1 to find the optimal PPM(𝜷𝜷\bm{\beta}bold_italic_β) and investigate the properties of the optimal 𝜷𝜷\bm{\beta}bold_italic_β. We set λ=0.5𝜆0.5\lambda=0.5italic_λ = 0.5 and aim to maximize the objective Wλ=U+0.5Vsubscript𝑊𝜆𝑈0.5𝑉W_{\lambda}=U+0.5Vitalic_W start_POSTSUBSCRIPT italic_λ end_POSTSUBSCRIPT = italic_U + 0.5 italic_V, more results under different choices of λ𝜆\lambdaitalic_λ can be found in Appendix B. The initial 𝜷𝜷\bm{\beta}bold_italic_β is set to (100,,100)100100(100,\cdots,100)( 100 , ⋯ , 100 ), representing a nearly deterministic matching for every user. Algorithm 1 is then run to update 𝜷𝜷\bm{\beta}bold_italic_β. The sample rate and learning rate are set to δ=0.1,η=200formulae-sequence𝛿0.1𝜂200\delta=0.1,\eta=200italic_δ = 0.1 , italic_η = 200. In addition to searching for personalized βjsubscript𝛽𝑗\beta_{j}italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT for each user j𝑗jitalic_j, we also attempt to find a homogeneous β𝛽\betaitalic_β (i.e., a fixed βjsubscript𝛽𝑗\beta_{j}italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT for each j𝑗jitalic_j) using Algorithm 1555The gradient of Wλsubscript𝑊𝜆W_{\lambda}italic_W start_POSTSUBSCRIPT italic_λ end_POSTSUBSCRIPT with respect to a homogeneous β𝛽\betaitalic_β can be readily obtained by summing all the partial derivatives of Wλsubscript𝑊𝜆W_{\lambda}italic_W start_POSTSUBSCRIPT italic_λ end_POSTSUBSCRIPT with respect to βjsubscript𝛽𝑗\beta_{j}italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT..

The first and third panels in Figure 2 show the evolution of Wλsubscript𝑊𝜆W_{\lambda}italic_W start_POSTSUBSCRIPT italic_λ end_POSTSUBSCRIPT during the optimization process in both synthetic and MovieLens environments. As illustrated, Algorithm 1 successfully finds a better PPM(𝜷𝜷\bm{\beta}bold_italic_β) compared to the baseline of exact matching for all users, with a significant gain of over 20% in the welfare metric. Furthermore, in both environments, personalized β𝛽\betaitalic_β leads to a slightly better outcome compared to homogeneous β𝛽\betaitalic_β.

The second and fourth panels depict the optimal βjsubscript𝛽𝑗\beta_{j}italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT for each user j𝑗jitalic_j, arranged in descending order. These panels provide insights into how such a mechanism achieves better trade-offs. For each user index j𝑗jitalic_j on the x𝑥xitalic_x-axis, we also plot the average and the standard deviation of the relevance scores {wij}i=1nsuperscriptsubscriptsubscript𝑤𝑖𝑗𝑖1𝑛\{w_{ij}\}_{i=1}^{n}{ italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT associated with each user j𝑗jitalic_j over all creators, shown as orange and green lines. Based on the definition, users with smaller average scores and higher standard deviations are considered more “picky” or selective, indicating high relevance scores with a small group of creators and low scores with many others. Conversely, users with higher average scores and smaller standard deviations are less selective and more open to exploration. The results show that the optimal 𝜷𝜷\bm{\beta}bold_italic_β tends to increase the exploration strengths (by deploying smaller βjsubscript𝛽𝑗\beta_{j}italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT) for less selective users. This approach is intuitive, as it safely increases exploration while minimizing losses in user engagement.

Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 2: Panel 1,2: social welfare improving curve under Algorithm 1, and the distribution of the obtained optimal βjsubscript𝛽𝑗\beta_{j}italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT in the synthetic environment. Panel 3,4: the same plots in the MovieLens environment. λ=0.5𝜆0.5\lambda=0.5italic_λ = 0.5.

8 Conclusion

In this work, we introduced a new game-theoretical model C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT (Cournot Content Creation Competition) to explore how creators strategically determine their creation frequency under a UGC platform’s recommendation algorithm. Our investigations reveal a critical balance between user satisfaction and creator engagement, mediated by the exploration strength of the recommendation. The existence and uniqueness of the PNE of C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT games provide a predictive framework for assessing the effects of algorithmic choices on content diversity and volume. Through both theoretical analysis and empirical simulations, we demonstrated how varying the exploration strength can either enhance user engagement at the cost of reduced content diversity or encourage richer content creation at the expense of immediate user satisfaction. These findings disclose the delicate trade-offs platform designers face and highlight the utility of our model as a pre-deployment audit tool for optimizing recommendation algorithms to balance platforms’ long-term and short-term objectives.

While our C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT model offers insights into strategic differentiation among creators regarding production quantity, it relies on a simplified assumption that creators maintain a fixed niche, consistently producing content on the same topic with similar quality. This assumption, though useful for modeling purposes, may be restrictive in real-world scenarios where creators dynamically adjust topics, vary content quality, and scale production quantity. Exploring the dynamics where creators compete across heterogeneous dimensions—such as topic variety, content quality, and production quantity—would be a valuable direction for future research. We leave this intriguing problem for future work.

Acknowledgment.

This work is supported in part by the NSF Award IIS-2128019, NSF Award CCF-2303372, AI2050 program at Schmidt Sciences (Grant G-24-66104) and Army Research Office Award W911NF-23-1-0030.


References

  • Agarwal and Brown (2023) Arpit Agarwal and William Brown. Online recommendations for agents with discounted adaptive preferences. arXiv preprint arXiv:2302.06014, 2023.
  • Ben-Porat and Tennenholtz (2017) Omer Ben-Porat and Moshe Tennenholtz. Shapley facility location games. In International Conference on Web and Internet Economics, pages 58–73. Springer, 2017.
  • Ben-Porat and Tennenholtz (2018) Omer Ben-Porat and Moshe Tennenholtz. A game-theoretic approach to recommendation systems with strategic content providers. Advances in Neural Information Processing Systems, 31, 2018.
  • Biyik et al. (2023) Erdem Biyik, Fan Yao, Yinlam Chow, Alex Haig, Chih-wei Hsu, Mohammad Ghavamzadeh, and Craig Boutilier. Preference elicitation with soft attributes in interactive recommendation. arXiv preprint arXiv:2311.02085, 2023.
  • Bobadilla et al. (2013) Jesús Bobadilla, Fernando Ortega, Antonio Hernando, and Abraham Gutiérrez. Recommender systems survey. Knowledge-based systems, 46:109–132, 2013.
  • Brantley et al. (2024) Kianté Brantley, Zhichong Fang, Sarah Dean, and Thorsten Joachims. Ranking with long-term constraints. In Proceedings of the 17th ACM International Conference on Web Search and Data Mining, pages 47–56, 2024.
  • Bravo et al. (2018) Mario Bravo, David Leslie, and Panayotis Mertikopoulos. Bandit learning in concave n-person games. Advances in Neural Information Processing Systems, 31, 2018.
  • Cai and Zheng (2023) Yang Cai and Weiqiang Zheng. Doubly optimal no-regret learning in monotone games. In International Conference on Machine Learning, pages 3507–3524. PMLR, 2023.
  • Caragiannis and Voudouris (2016) Ioannis Caragiannis and Alexandros A Voudouris. Welfare guarantees for proportional allocations. Theory of Computing Systems, 59:581–599, 2016.
  • Carroll et al. (2021) Micah Carroll, Dylan Hadfield-Menell, Stuart Russell, and Anca Dragan. Estimating and penalizing preference shift in recommender systems. In Proceedings of the 15th ACM Conference on Recommender Systems, pages 661–667, 2021.
  • Cen et al. (2023) Sarah H Cen, Andrew Ilyas, and Aleksander Madry. User strategization and trustworthy algorithms. arXiv preprint arXiv:2312.17666, 2023.
  • Cen et al. (2024) Sarah H Cen, Andrew Ilyas, Jennifer Allen, Hannah Li, and Aleksander Madry. Measuring strategization in recommendation: Users adapt their behavior to shape future content. arXiv preprint arXiv:2405.05596, 2024.
  • Cornes and Hartley (2005) Richard Cornes and Roger Hartley. Asymmetric contests with general technologies. Economic theory, 26:923–946, 2005.
  • Cournot (1838) Antoine Augustin Cournot. Recherches sur les principes mathématiques de la théorie des richesses, volume 48. L. Hachette, 1838.
  • Daskalakis et al. (2009) Constantinos Daskalakis, Paul W Goldberg, and Christos H Papadimitriou. The complexity of computing a nash equilibrium. SIAM Journal on Computing, 39(1):195–259, 2009.
  • Dean and Morgenstern (2022) Sarah Dean and Jamie Morgenstern. Preference dynamics under personalized recommendations. In Proceedings of the 23rd ACM Conference on Economics and Computation, pages 795–816, 2022.
  • Dean et al. (2024) Sarah Dean, Evan Dong, Meena Jagadeesan, and Liu Leqi. Recommender systems as dynamical systems: Interactions with viewers and creators. In Workshop on Recommendation Ecosystems: Modeling, Optimization and Incentive Design, 2024.
  • Debreu (1952) Gerard Debreu. A social equilibrium existence theorem. Proceedings of the National Academy of Sciences, 38(10):886–893, 1952.
  • Eilat and Rosenfeld (2023) Itay Eilat and Nir Rosenfeld. Performative recommendation: diversifying content via strategic incentives. In International Conference on Machine Learning, pages 9082–9103. PMLR, 2023.
  • Ewerhart (2015) Christian Ewerhart. Mixed equilibria in tullock contests. Economic Theory, 60:59–71, 2015.
  • Ewerhart (2017) Christian Ewerhart. The lottery contest is a best-response potential game. Economics Letters, 155:168–171, 2017. ISSN 0165-1765. doi: https://doi.org/10.1016/j.econlet.2017.03.030. URL https://www.sciencedirect.com/science/article/pii/S0165176517301325.
  • Fan and Cheng (2018) Jicong Fan and Jieyu Cheng. Matrix completion by deep matrix factorization. Neural Networks, 98:34–41, 2018.
  • Fan (1952) Ky Fan. Fixed-point and minimax theorems in locally convex topological linear spaces. Proceedings of the National Academy of Sciences, 38(2):121–126, 1952.
  • Ghosh and Hummel (2013) Arpita Ghosh and Patrick Hummel. Learning and incentives in user-generated content: Multi-armed bandits with endogenous arms. In Proceedings of the 4th conference on Innovations in Theoretical Computer Science, pages 233–246, 2013.
  • Ghosh and McAfee (2011) Arpita Ghosh and Preston McAfee. Incentivizing high-quality user-generated content. In Proceedings of the 20th international conference on World wide web, pages 137–146, 2011.
  • Glicksberg (1952) Irving L Glicksberg. A further generalization of the kakutani fixed theorem, with application to nash equilibrium points. Proceedings of the American Mathematical Society, 3(1):170–174, 1952.
  • Glotfelter (2019) Angela Glotfelter. Algorithmic circulation: how content creators navigate the effects of algorithms on their work. Computers and composition, 54:102521, 2019.
  • Harper and Konstan (2015) F Maxwell Harper and Joseph A Konstan. The movielens datasets: History and context. Acm transactions on interactive intelligent systems (tiis), 5(4):1–19, 2015.
  • Hodgson (2021) Thomas Hodgson. Spotify and the democratisation of music. Popular Music, 40(1):1–17, 2021.
  • Hron et al. (2022) Jiri Hron, Karl Krauth, Michael I Jordan, Niki Kilbertus, and Sarah Dean. Modeling content creator incentives on algorithm-curated platforms. arXiv preprint arXiv:2206.13102, 2022.
  • Hu et al. (2024) Qinlu Hu, Ni Huang, and Renyu Philip Zhang. Viewer traffic allocation for small creator development: Experimental evidence from short-video platforms. Available at SSRN 4888995, 2024.
  • Hu et al. (2023) Xinyan Hu, Meena Jagadeesan, Michael I Jordan, and Jacob Steinhard. Incentivizing high-quality content in online recommender systems. arXiv preprint arXiv:2306.07479, 2023.
  • Immorlica et al. (2024) Nicole Immorlica, Meena Jagadeesan, and Brendan Lucier. Clickbait vs. quality: How engagement-based optimization shapes the content landscape in online platforms. In Proceedings of the ACM on Web Conference 2024, pages 36–45, 2024.
  • Jagadeesan et al. (2024) Meena Jagadeesan, Nikhil Garg, and Jacob Steinhardt. Supply-side equilibria in recommender systems. Advances in Neural Information Processing Systems, 36, 2024.
  • Kalimeris et al. (2021) Dimitris Kalimeris, Smriti Bhagat, Shankar Kalyanaraman, and Udi Weinsberg. Preference amplification in recommender systems. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, pages 805–815, 2021.
  • Koren et al. (2009) Yehuda Koren, Robert Bell, and Chris Volinsky. Matrix factorization techniques for recommender systems. Computer, 42(8):30–37, 2009.
  • Krantz and Parks (2002) Steven George Krantz and Harold R Parks. The implicit function theorem: history, theory, and applications. Springer Science & Business Media, 2002.
  • Lin et al. (2024) Tao Lin, Kun Jin, Andrew Estornell, Xiaoying Zhang, Yiling Chen, and Yang Liu. User-creator feature dynamics in recommender systems with dual influence. arXiv preprint arXiv:2407.14094, 2024.
  • Mladenov et al. (2020) Martin Mladenov, Elliot Creager, Omer Ben-Porat, Kevin Swersky, Richard Zemel, and Craig Boutilier. Optimizing long-term social welfare in recommender systems: A constrained matching approach. In International Conference on Machine Learning, pages 6987–6998. PMLR, 2020.
  • Nash Jr (1950) John F Nash Jr. Equilibrium points in n-person games. Proceedings of the national academy of sciences, 36(1):48–49, 1950.
  • Nguyen and Vojnovic (2011) Thành Nguyen and Milan Vojnovic. Weighted proportional allocation. ACM SIGMETRICS Performance Evaluation Review, 39(1):133–144, 2011.
  • Pérez-Castrillo and Verdier (1992) J David Pérez-Castrillo and Thierry Verdier. A general analysis of rent-seeking games. Public choice, 73(3):335–350, 1992.
  • Prasad et al. (2023) Siddharth Prasad, Martin Mladenov, and Craig Boutilier. Content prompting: Modeling content provider dynamics to improve user welfare in recommender ecosystems. arXiv preprint arXiv:2309.00940, 2023.
  • Rosen (1965) J Ben Rosen. Existence and uniqueness of equilibrium points for concave n-person games. Econometrica: Journal of the Econometric Society, pages 520–534, 1965.
  • Santos (2022) Marcelo Luis Barbosa dos Santos. The “so-called” ugc: an updated definition of user-generated content in the age of social media. Online Information Review, 46(1):95–113, 2022.
  • Sherman (1949) Jack Sherman. Adjustment of an inverse matrix corresponding to changes in the elements of a given column or row of the original matrix. Annu. Math. Statist., 20:621, 1949.
  • Tatarenko and Kamgarpour (2020) Tatiana Tatarenko and Maryam Kamgarpour. Bandit learning in convex non-strictly monotone games. arXiv preprint arXiv:2009.04258, 2020.
  • Tullock (1980) Gordon Tullock. Efficient rent seeking. In Toward a theory of the rent-seeking society. College Station, TX: Texas A&M University Press, 1980.
  • Xu et al. (2024) Renzhe Xu, Haotian Wang, Xingxuan Zhang, Bo Li, and Peng Cui. Ppa-game: Characterizing and learning competitive dynamics among online content creators. arXiv preprint arXiv:2403.15524, 2024.
  • Yao et al. (2022a) Fan Yao, Chuanhao Li, Denis Nekipelov, Hongning Wang, and Haifeng Xu. Learning from a learning user for optimal recommendations. In International Conference on Machine Learning, pages 25382–25406. PMLR, 2022a.
  • Yao et al. (2022b) Fan Yao, Chuanhao Li, Denis Nekipelov, Hongning Wang, and Haifeng Xu. Learning the optimal recommendation from explorative users. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 36, pages 9457–9465, 2022b.
  • Yao et al. (2023) Fan Yao, Chuanhao Li, Denis Nekipelov, Hongning Wang, and Haifeng Xu. How bad is top-k𝑘kitalic_k recommendation under competing content creators? In International Conference on Machine Learning. PMLR, 2023.
  • Yao et al. (2024a) Fan Yao, Chuanhao Li, Denis Nekipelov, Hongning Wang, and Haifeng Xu. Human vs. generative ai in content creation competition: Symbiosis or conflict? In International Conference on Machine Learning. PMLR, 2024a.
  • Yao et al. (2024b) Fan Yao, Chuanhao Li, Karthik Abinav Sankararaman, Yiming Liao, Yan Zhu, Qifan Wang, Hongning Wang, and Haifeng Xu. Rethinking incentives in recommender systems: Are monotone rewards always beneficial? Advances in Neural Information Processing Systems, 36, 2024b.
  • Yao et al. (2024c) Fan Yao, Yiming Liao, Mingzhe Wu, Chuanhao Li, Yan Zhu, James Yang, Jingzhou Liu, Qifan Wang, Haifeng Xu, and Hongning Wang. User welfare optimization in recommender systems with competing content creators. In Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pages 3874–3885, 2024c.
  • Zeng et al. (2024) Zhiyu Zeng, Zhiqi Zhang, Dennis Zhang, and Tat Chan. The impact of recommender systems on content consumption and production: Evidence from field experiments and structural modeling. Available at SSRN, 2024.
  • Zhu et al. (2023) Banghua Zhu, Sai Praneeth Karimireddy, Jiantao Jiao, and Michael I Jordan. Online learning in a creator economy. arXiv preprint arXiv:2305.11381, 2023.
  • Zhuang et al. (2023) Wei Zhuang, Qingfeng Zeng, Yu Zhang, Chunmei Liu, and Weiguo Fan. What makes user-generated content more helpful on social media platforms? insights from creator interactivity perspective. Information processing & management, 60(2):103201, 2023.

Appendix to Unveiling User Satisfaction and Creator Productivity Trade-Offs in Recommendation Platforms

A Omitted Proofs

A.1 Proof of Theorem 1

Proof  First of all, we argue that given any 𝒢𝒢\mathcal{G}caligraphic_G(n,m,M,{ci},𝜷)𝑛𝑚𝑀subscript𝑐𝑖𝜷(n,m,M,\{c_{i}\},\bm{\beta})( italic_n , italic_m , italic_M , { italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } , bold_italic_β ), for any creator i𝑖iitalic_i, there exists an δi>0subscript𝛿𝑖0\delta_{i}>0italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > 0 such that any xi[0,δi]subscript𝑥𝑖0subscript𝛿𝑖x_{i}\in[0,\delta_{i}]italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ [ 0 , italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] cannot be an equilibrium strategy. This is because given any 𝒙i0n1subscript𝒙𝑖subscriptsuperscript𝑛1absent0\bm{x}_{-i}\in\mathbb{R}^{n-1}_{\geq 0}bold_italic_x start_POSTSUBSCRIPT - italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT, uisubscript𝑢𝑖u_{i}italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT as a function of xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT has a continuous and strictly positive gradient at xi=0subscript𝑥𝑖0x_{i}=0italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 0, meaning that there exists a δi>0subscript𝛿𝑖0\delta_{i}>0italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > 0 such that uixi|xi=t>0,t[0,δi]formulae-sequenceevaluated-atsubscript𝑢𝑖subscript𝑥𝑖subscript𝑥𝑖𝑡0for-all𝑡0subscript𝛿𝑖\frac{\partial u_{i}}{\partial x_{i}}\bigg{|}_{x_{i}=t}>0,\forall t\in[0,% \delta_{i}]divide start_ARG ∂ italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG ∂ italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG | start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_t end_POSTSUBSCRIPT > 0 , ∀ italic_t ∈ [ 0 , italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] regardless of what other creators’ strategies are. In other word, for any xiδisubscript𝑥𝑖subscript𝛿𝑖x_{i}\leq\delta_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, creator i𝑖iitalic_i can always increase her strategy to strictly improve her utility. As a result, any potential PNE 𝒙superscript𝒙\bm{x}^{*}bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT must satisfy that xiδisuperscriptsubscript𝑥𝑖subscript𝛿𝑖x_{i}^{*}\geq\delta_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ≥ italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT.

On the other hand, since ci(xi)+subscript𝑐𝑖subscript𝑥𝑖c_{i}(x_{i})\rightarrow+\inftyitalic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) → + ∞ when xi+subscript𝑥𝑖x_{i}\rightarrow+\inftyitalic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT → + ∞ but the traffic gain for each creator is at most m𝑚mitalic_m, we have ui(xi,𝒙i),𝒙i0n1formulae-sequencesubscript𝑢𝑖subscript𝑥𝑖subscript𝒙𝑖for-allsubscript𝒙𝑖subscriptsuperscript𝑛1absent0u_{i}(x_{i},\bm{x}_{-i})\rightarrow-\infty,\forall\bm{x}_{-i}\in\mathbb{R}^{n-% 1}_{\geq 0}italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , bold_italic_x start_POSTSUBSCRIPT - italic_i end_POSTSUBSCRIPT ) → - ∞ , ∀ bold_italic_x start_POSTSUBSCRIPT - italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT when xi+subscript𝑥𝑖x_{i}\rightarrow+\inftyitalic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT → + ∞. As a result, any equilibrium strategy must also be upper bounded by a uniform constant Δ>0Δ0\Delta>0roman_Δ > 0.

To argue the existence and uniqueness of PNE of 𝒢𝒢\mathcal{G}caligraphic_G, in the following we may with out loss of generality restrict each creator i𝑖iitalic_i’s strategy set to a convex set [δi,Δ]subscript𝛿𝑖Δ[\delta_{i},\Delta][ italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , roman_Δ ].

For any fixed 𝜷=(β1,,βm)𝜷subscript𝛽1subscript𝛽𝑚\bm{\beta}=(\beta_{1},\cdots,\beta_{m})bold_italic_β = ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_β start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ), let aij=exp(βjwij)subscript𝑎𝑖𝑗subscript𝛽𝑗subscript𝑤𝑖𝑗a_{ij}=\exp(\beta_{j}w_{ij})italic_a start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = roman_exp ( italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ) and Eq. (1) can be simplified to

ui(xi,𝒙i;𝜷)=j=1m(xiaijk=1nxkakj)ci(xi),subscript𝑢𝑖subscript𝑥𝑖subscript𝒙𝑖𝜷superscriptsubscript𝑗1𝑚subscript𝑥𝑖subscript𝑎𝑖𝑗superscriptsubscript𝑘1𝑛subscript𝑥𝑘subscript𝑎𝑘𝑗subscript𝑐𝑖subscript𝑥𝑖\displaystyle u_{i}(x_{i},\bm{x}_{-i};\bm{\beta})=\sum_{j=1}^{m}\left(\frac{x_% {i}a_{ij}}{\sum_{k=1}^{n}x_{k}a_{kj}}\right)-c_{i}(x_{i}),italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , bold_italic_x start_POSTSUBSCRIPT - italic_i end_POSTSUBSCRIPT ; bold_italic_β ) = ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ( divide start_ARG italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k italic_j end_POSTSUBSCRIPT end_ARG ) - italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) , (14)

For simplicity we denote gj(𝒙)=k=1nxkakjsubscript𝑔𝑗𝒙superscriptsubscript𝑘1𝑛subscript𝑥𝑘subscript𝑎𝑘𝑗g_{j}(\bm{x})=\sum_{k=1}^{n}x_{k}a_{kj}italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( bold_italic_x ) = ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k italic_j end_POSTSUBSCRIPT and uisubscript𝑢𝑖u_{i}italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT can be expressed as ui(xi,𝒙i)=j=1mxiaijgj(𝒙)ci(xi)subscript𝑢𝑖subscript𝑥𝑖subscript𝒙𝑖superscriptsubscript𝑗1𝑚subscript𝑥𝑖subscript𝑎𝑖𝑗subscript𝑔𝑗𝒙subscript𝑐𝑖subscript𝑥𝑖u_{i}(x_{i},\bm{x}_{-i})=\sum_{j=1}^{m}\frac{x_{i}a_{ij}}{g_{j}(\bm{x})}-c_{i}% (x_{i})italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , bold_italic_x start_POSTSUBSCRIPT - italic_i end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( bold_italic_x ) end_ARG - italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ). Our proof starts from a sufficient condition from Rosen (1965) for a game to be monotone. A game is said to satisfy the diagonal strict concavity (DSC) condition if (1) each player has a concave utility function in his own strategy in a convex strategy space; and (2) there exists some non-zero parameter λ=(λ1,,λn)𝜆subscript𝜆1subscript𝜆𝑛\lambda=(\lambda_{1},\cdots,\lambda_{n})italic_λ = ( italic_λ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_λ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) such that the Hessian matrix given by

Hkl(𝒙;λ)λk22uk(𝒙)xkxl+λl22ul(𝒙)xlxksubscript𝐻𝑘𝑙𝒙𝜆subscript𝜆𝑘2superscript2subscript𝑢𝑘𝒙subscript𝑥𝑘subscript𝑥𝑙subscript𝜆𝑙2superscript2subscript𝑢𝑙𝒙subscript𝑥𝑙subscript𝑥𝑘H_{kl}(\bm{x};\lambda)\triangleq\frac{\lambda_{k}}{2}\frac{\partial^{2}u_{k}(% \bm{x})}{\partial x_{k}\partial x_{l}}+\frac{\lambda_{l}}{2}\frac{\partial^{2}% u_{l}(\bm{x})}{\partial x_{l}\partial x_{k}}italic_H start_POSTSUBSCRIPT italic_k italic_l end_POSTSUBSCRIPT ( bold_italic_x ; italic_λ ) ≜ divide start_ARG italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG divide start_ARG ∂ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_italic_x ) end_ARG start_ARG ∂ italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∂ italic_x start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_ARG + divide start_ARG italic_λ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG divide start_ARG ∂ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ( bold_italic_x ) end_ARG start_ARG ∂ italic_x start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ∂ italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG (15)

is strictly negative-definite. In Rosen (1965), it is shown that any game satisfying λ𝜆\lambdaitalic_λ-DSC condition has a unique pure Nash equilibrium (PNE); such a game is often referred to as monotone games.

First of all, we already argued that each creator i𝑖iitalic_i’s strategy set is [δi,Δ]subscript𝛿𝑖Δ[\delta_{i},\Delta][ italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , roman_Δ ], which is a convex set. Core to our proof is to show that game 𝒢𝒢\mathcal{G}caligraphic_G is 𝟏1\bm{1}bold_1-DSC under the theorem conditions. Direct calculation shows that for any 1kln1𝑘𝑙𝑛1\leq k\leq l\leq n1 ≤ italic_k ≤ italic_l ≤ italic_n,

2uk(𝒙)xkxl=akjaljgj3(gj+2akjxk),superscript2subscript𝑢𝑘𝒙subscript𝑥𝑘subscript𝑥𝑙subscript𝑎𝑘𝑗subscript𝑎𝑙𝑗superscriptsubscript𝑔𝑗3subscript𝑔𝑗2subscript𝑎𝑘𝑗subscript𝑥𝑘\displaystyle\frac{\partial^{2}u_{k}(\bm{x})}{\partial x_{k}\partial x_{l}}=% \frac{a_{kj}a_{lj}}{g_{j}^{3}}\cdot(-g_{j}+2a_{kj}x_{k}),divide start_ARG ∂ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_italic_x ) end_ARG start_ARG ∂ italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∂ italic_x start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_ARG = divide start_ARG italic_a start_POSTSUBSCRIPT italic_k italic_j end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_l italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG ⋅ ( - italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + 2 italic_a start_POSTSUBSCRIPT italic_k italic_j end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ,
2ul(𝒙)xlxk=akjaljgj3(gj+2aljxl),superscript2subscript𝑢𝑙𝒙subscript𝑥𝑙subscript𝑥𝑘subscript𝑎𝑘𝑗subscript𝑎𝑙𝑗superscriptsubscript𝑔𝑗3subscript𝑔𝑗2subscript𝑎𝑙𝑗subscript𝑥𝑙\displaystyle\frac{\partial^{2}u_{l}(\bm{x})}{\partial x_{l}\partial x_{k}}=% \frac{a_{kj}a_{lj}}{g_{j}^{3}}\cdot(-g_{j}+2a_{lj}x_{l}),divide start_ARG ∂ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ( bold_italic_x ) end_ARG start_ARG ∂ italic_x start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ∂ italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG = divide start_ARG italic_a start_POSTSUBSCRIPT italic_k italic_j end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_l italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG ⋅ ( - italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + 2 italic_a start_POSTSUBSCRIPT italic_l italic_j end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ) ,

and therefore the Hessian matrix of 𝒢𝒢\mathcal{G}caligraphic_G specified by the RHS of Eq. (15) is equal to

[H(𝒙)]delimited-[]𝐻𝒙\displaystyle-[H(\bm{x})]- [ italic_H ( bold_italic_x ) ] =j=1mgj3[a1ja2j][2i1xiaiji{1,2}xiaiji{1,2}xiaij2i2xiaij][a1j,a2j,]+[2c1x12002c2x22]absentsuperscriptsubscript𝑗1𝑚superscriptsubscript𝑔𝑗3matrixsubscript𝑎1𝑗subscript𝑎2𝑗matrix2subscript𝑖1subscript𝑥𝑖subscript𝑎𝑖𝑗subscript𝑖12subscript𝑥𝑖subscript𝑎𝑖𝑗subscript𝑖12subscript𝑥𝑖subscript𝑎𝑖𝑗2subscript𝑖2subscript𝑥𝑖subscript𝑎𝑖𝑗matrixsubscript𝑎1𝑗subscript𝑎2𝑗matrixsuperscript2subscript𝑐1superscriptsubscript𝑥1200superscript2subscript𝑐2superscriptsubscript𝑥22\displaystyle=\sum_{j=1}^{m}g_{j}^{-3}\begin{bmatrix}a_{1j}\\ a_{2j}\\ \vdots\end{bmatrix}\begin{bmatrix}2\sum_{i\neq 1}x_{i}a_{ij}&\sum_{i\notin\{1,% 2\}}x_{i}a_{ij}&\ldots\\ \sum_{i\notin\{1,2\}}x_{i}a_{ij}&2\sum_{i\neq 2}x_{i}a_{ij}&\ldots\\ \vdots&\vdots&\ddots\end{bmatrix}\begin{bmatrix}a_{1j},a_{2j},\ldots\\ \end{bmatrix}+\begin{bmatrix}\frac{\partial^{2}c_{1}}{\partial x_{1}^{2}}&0&% \ldots\\ 0&\frac{\partial^{2}c_{2}}{\partial x_{2}^{2}}&\ldots\\ \vdots&\vdots&\ddots\end{bmatrix}= ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT [ start_ARG start_ROW start_CELL italic_a start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL italic_a start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL end_ROW end_ARG ] [ start_ARG start_ROW start_CELL 2 ∑ start_POSTSUBSCRIPT italic_i ≠ 1 end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_CELL start_CELL ∑ start_POSTSUBSCRIPT italic_i ∉ { 1 , 2 } end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_CELL start_CELL … end_CELL end_ROW start_ROW start_CELL ∑ start_POSTSUBSCRIPT italic_i ∉ { 1 , 2 } end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_CELL start_CELL 2 ∑ start_POSTSUBSCRIPT italic_i ≠ 2 end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_CELL start_CELL … end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋱ end_CELL end_ROW end_ARG ] [ start_ARG start_ROW start_CELL italic_a start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT , … end_CELL end_ROW end_ARG ] + [ start_ARG start_ROW start_CELL divide start_ARG ∂ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG ∂ italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG end_CELL start_CELL 0 end_CELL start_CELL … end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL divide start_ARG ∂ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_c start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG ∂ italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG end_CELL start_CELL … end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋱ end_CELL end_ROW end_ARG ]
j=1mgj3𝒂jHj𝒂j+H0,absentsuperscriptsubscript𝑗1𝑚superscriptsubscript𝑔𝑗3subscript𝒂𝑗subscript𝐻𝑗superscriptsubscript𝒂𝑗topsubscript𝐻0\displaystyle\triangleq\sum_{j=1}^{m}g_{j}^{-3}\bm{a}_{j}H_{j}\bm{a}_{j}^{\top% }+H_{0},≜ ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT bold_italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT bold_italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT + italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , (16)

where gjsubscript𝑔𝑗g_{j}italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT in the above expressions denotes gj(𝒙)subscript𝑔𝑗𝒙g_{j}(\bm{x})italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( bold_italic_x ), vector 𝒂j=(a1j,,anj)subscript𝒂𝑗superscriptsubscript𝑎1𝑗subscript𝑎𝑛𝑗top\bm{a}_{j}=(a_{1j},\cdots,a_{nj})^{\top}bold_italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = ( italic_a start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , ⋯ , italic_a start_POSTSUBSCRIPT italic_n italic_j end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT. We can see that if all the cost functions are strictly convex, the second diagonal matrix H0subscript𝐻0H_{0}italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT in the RHS of Eq. (16) is strictly positive-definite (PD). Therefore, it suffices to show that (1) for all j[m]𝑗delimited-[]𝑚j\in[m]italic_j ∈ [ italic_m ], Hjsubscript𝐻𝑗H_{j}italic_H start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT is PD. To see this, let zi=xiaijsubscript𝑧𝑖subscript𝑥𝑖subscript𝑎𝑖𝑗z_{i}=x_{i}a_{ij}italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT and we show that for any 𝒚=(y1,,yn)n𝒚subscript𝑦1subscript𝑦𝑛superscript𝑛\bm{y}=(y_{1},\cdots,y_{n})\in\mathbb{R}^{n}bold_italic_y = ( italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_y start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, 𝒚Hj𝒚0superscript𝒚topsubscript𝐻𝑗𝒚0\bm{y}^{\top}H_{j}\bm{y}\geq 0bold_italic_y start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_H start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT bold_italic_y ≥ 0, and the equality holds if and only if 𝒚=𝟎𝒚0\bm{y}=\bm{0}bold_italic_y = bold_0. In fact, note that

𝒚Hj𝒚superscript𝒚topsubscript𝐻𝑗𝒚\displaystyle\bm{y}^{\top}H_{j}\bm{y}bold_italic_y start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_H start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT bold_italic_y =2i=1nyi2(jizj)+2i<jyiyj(k{i,j}zk)absent2superscriptsubscript𝑖1𝑛superscriptsubscript𝑦𝑖2subscript𝑗𝑖subscript𝑧𝑗2subscript𝑖𝑗subscript𝑦𝑖subscript𝑦𝑗subscript𝑘𝑖𝑗subscript𝑧𝑘\displaystyle=2\sum_{i=1}^{n}y_{i}^{2}\left(\sum_{j\neq i}z_{j}\right)+2\sum_{% i<j}y_{i}y_{j}\left(\sum_{k\notin\{i,j\}}z_{k}\right)= 2 ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( ∑ start_POSTSUBSCRIPT italic_j ≠ italic_i end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) + 2 ∑ start_POSTSUBSCRIPT italic_i < italic_j end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_k ∉ { italic_i , italic_j } end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT )
=i=1nyi2(jizj)+[i=1nyi2(jizj)+2i<jyiyj(k{i,j}zk)]absentsuperscriptsubscript𝑖1𝑛superscriptsubscript𝑦𝑖2subscript𝑗𝑖subscript𝑧𝑗delimited-[]superscriptsubscript𝑖1𝑛superscriptsubscript𝑦𝑖2subscript𝑗𝑖subscript𝑧𝑗2subscript𝑖𝑗subscript𝑦𝑖subscript𝑦𝑗subscript𝑘𝑖𝑗subscript𝑧𝑘\displaystyle=\sum_{i=1}^{n}y_{i}^{2}\left(\sum_{j\neq i}z_{j}\right)+\left[% \sum_{i=1}^{n}y_{i}^{2}\left(\sum_{j\neq i}z_{j}\right)+2\sum_{i<j}y_{i}y_{j}% \left(\sum_{k\notin\{i,j\}}z_{k}\right)\right]= ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( ∑ start_POSTSUBSCRIPT italic_j ≠ italic_i end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) + [ ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( ∑ start_POSTSUBSCRIPT italic_j ≠ italic_i end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) + 2 ∑ start_POSTSUBSCRIPT italic_i < italic_j end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_k ∉ { italic_i , italic_j } end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ]
=i=1nyi2(jizj)+k=1nzk[jkyj2+2i<j,ik,jkyiyj]absentsuperscriptsubscript𝑖1𝑛superscriptsubscript𝑦𝑖2subscript𝑗𝑖subscript𝑧𝑗superscriptsubscript𝑘1𝑛subscript𝑧𝑘delimited-[]subscript𝑗𝑘superscriptsubscript𝑦𝑗22subscriptformulae-sequence𝑖𝑗formulae-sequence𝑖𝑘𝑗𝑘subscript𝑦𝑖subscript𝑦𝑗\displaystyle=\sum_{i=1}^{n}y_{i}^{2}\left(\sum_{j\neq i}z_{j}\right)+\sum_{k=% 1}^{n}z_{k}\left[\sum_{j\neq k}y_{j}^{2}+2\sum_{i<j,i\neq k,j\neq k}y_{i}y_{j}\right]= ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( ∑ start_POSTSUBSCRIPT italic_j ≠ italic_i end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_z start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT [ ∑ start_POSTSUBSCRIPT italic_j ≠ italic_k end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + 2 ∑ start_POSTSUBSCRIPT italic_i < italic_j , italic_i ≠ italic_k , italic_j ≠ italic_k end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ]
=i=1nyi2(jizj)+i=1nzi(jiyj)20.absentsuperscriptsubscript𝑖1𝑛superscriptsubscript𝑦𝑖2subscript𝑗𝑖subscript𝑧𝑗superscriptsubscript𝑖1𝑛subscript𝑧𝑖superscriptsubscript𝑗𝑖subscript𝑦𝑗20\displaystyle=\sum_{i=1}^{n}y_{i}^{2}\left(\sum_{j\neq i}z_{j}\right)+\sum_{i=% 1}^{n}z_{i}\left(\sum_{j\neq i}y_{j}\right)^{2}\geq 0.= ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( ∑ start_POSTSUBSCRIPT italic_j ≠ italic_i end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_j ≠ italic_i end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≥ 0 . (17)

Because xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and aisubscript𝑎𝑖a_{i}italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT are all strictly positive, each zisubscript𝑧𝑖z_{i}italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT must also be strictly positive. Hence, Eq. (17) can take value zero if and only if yi=0,i[n]formulae-sequencesubscript𝑦𝑖0for-all𝑖delimited-[]𝑛y_{i}=0,\forall i\in[n]italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 0 , ∀ italic_i ∈ [ italic_n ]. Therefore, Hjsubscript𝐻𝑗H_{j}italic_H start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT is PD for any j[m]𝑗delimited-[]𝑚j\in[m]italic_j ∈ [ italic_m ], which completes the proof.

 

A.2 Proof of Theorem 2

First let’s recall the definition of U,V𝑈𝑉U,Vitalic_U , italic_V when m=1𝑚1m=1italic_m = 1:

U(𝒙(β);β)=i=1n(wixieβwik=1nxkeβwk),𝑈superscript𝒙𝛽𝛽superscriptsubscript𝑖1𝑛subscript𝑤𝑖subscriptsuperscript𝑥𝑖superscript𝑒𝛽subscript𝑤𝑖superscriptsubscript𝑘1𝑛subscriptsuperscript𝑥𝑘superscript𝑒𝛽subscript𝑤𝑘U(\bm{x}^{*}(\beta);\beta)=\sum_{i=1}^{n}\left(\frac{w_{i}x^{*}_{i}e^{\beta w_% {i}}}{\sum_{k=1}^{n}x^{*}_{k}e^{\beta w_{k}}}\right),italic_U ( bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( italic_β ) ; italic_β ) = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( divide start_ARG italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_β italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_β italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG ) , (18)
V(𝒙)=i=1nxi.𝑉superscript𝒙superscriptsubscript𝑖1𝑛superscriptsubscript𝑥𝑖V(\bm{x}^{*})=\sum_{i=1}^{n}x_{i}^{*}.italic_V ( bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT . (19)

In the following, we prove the monotonicity of U(β)𝑈𝛽U(\beta)italic_U ( italic_β ) and V(β)𝑉𝛽V(\beta)italic_V ( italic_β ) by showing dlnUdβ>0𝑑𝑈𝑑𝛽0\frac{d\ln U}{d\beta}>0divide start_ARG italic_d roman_ln italic_U end_ARG start_ARG italic_d italic_β end_ARG > 0 and dVdβ<0𝑑𝑉𝑑𝛽0\frac{dV}{d\beta}<0divide start_ARG italic_d italic_V end_ARG start_ARG italic_d italic_β end_ARG < 0, respectively. Before presenting the detailed proof, we first derive some relevant definitions and their properties that will be used in the proof.

For simplicity we omit the superscript in 𝒙superscript𝒙\bm{x}^{*}bold_italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT and simply use 𝒙𝒙\bm{x}bold_italic_x to refer to the PNE of 𝒢𝒢\mathcal{G}caligraphic_G. When m=1𝑚1m=1italic_m = 1, the creator utility function writes

ui(xi,xi)=xieβwik=1nxkeβwkci(xi),i[n].formulae-sequencesubscript𝑢𝑖subscript𝑥𝑖subscript𝑥𝑖subscript𝑥𝑖superscript𝑒𝛽subscript𝑤𝑖superscriptsubscript𝑘1𝑛subscript𝑥𝑘superscript𝑒𝛽subscript𝑤𝑘subscript𝑐𝑖subscript𝑥𝑖𝑖delimited-[]𝑛u_{i}(x_{i},x_{-i})=\frac{x_{i}e^{\beta w_{i}}}{\sum_{k=1}^{n}x_{k}e^{\beta w_% {k}}}-c_{i}(x_{i}),i\in[n].italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT - italic_i end_POSTSUBSCRIPT ) = divide start_ARG italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_β italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_β italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG - italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) , italic_i ∈ [ italic_n ] . (20)

Let Pi=xiaik=1nxkaksubscript𝑃𝑖subscript𝑥𝑖subscript𝑎𝑖superscriptsubscript𝑘1𝑛subscript𝑥𝑘subscript𝑎𝑘P_{i}=\frac{x_{i}a_{i}}{\sum_{k=1}^{n}x_{k}a_{k}}italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = divide start_ARG italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG, where ai=eβwisubscript𝑎𝑖superscript𝑒𝛽subscript𝑤𝑖a_{i}=e^{\beta w_{i}}italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_e start_POSTSUPERSCRIPT italic_β italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT. First of all, we claim that it is without loss of generality to consider the regime where Pi13subscript𝑃𝑖13P_{i}\leq\frac{1}{3}italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ divide start_ARG 1 end_ARG start_ARG 3 end_ARG. To see this, consider the following two C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT instances:

𝒢1(n,m=1,𝒘=(w1,,wn),𝒄=(c1,,cn),𝜷),\displaystyle\mathcal{G}_{1}(n,m=1,\bm{w}=(w_{1},\cdots,w_{n}),\bm{c}=(c_{1},% \cdots,c_{n}),\bm{\beta}),caligraphic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_n , italic_m = 1 , bold_italic_w = ( italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_w start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) , bold_italic_c = ( italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_c start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) , bold_italic_β ) ,
𝒢2(3n,m=1,[𝒘,𝒘,𝒘],[𝒄/3,𝒄/3,𝒄/3],[𝜷,𝜷,𝜷]).\displaystyle\mathcal{G}_{2}(3n,m=1,[\bm{w},\bm{w},\bm{w}],[\bm{c}/3,\bm{c}/3,% \bm{c}/3],[\bm{\beta},\bm{\beta},\bm{\beta}]).caligraphic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 3 italic_n , italic_m = 1 , [ bold_italic_w , bold_italic_w , bold_italic_w ] , [ bold_italic_c / 3 , bold_italic_c / 3 , bold_italic_c / 3 ] , [ bold_italic_β , bold_italic_β , bold_italic_β ] ) .

Clearly, both games 𝒢1,𝒢2subscript𝒢1subscript𝒢2\mathcal{G}_{1},\mathcal{G}_{2}caligraphic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , caligraphic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT have unique PNE. Let the PNE of 𝒢1subscript𝒢1\mathcal{G}_{1}caligraphic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT be denoted by 𝒙1superscriptsubscript𝒙1\bm{x}_{1}^{*}bold_italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT. In 𝒢2subscript𝒢2\mathcal{G}_{2}caligraphic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, since its 3n3𝑛3n3 italic_n players are divided into three identical groups, its PNE can be represented as (𝒙2,𝒙2,𝒙2)superscriptsubscript𝒙2superscriptsubscript𝒙2superscriptsubscript𝒙2(\bm{x}_{2}^{*},\bm{x}_{2}^{*},\bm{x}_{2}^{*})( bold_italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ), where each group of n𝑛nitalic_n players follow the same strategy. Moreover, for any 1in1𝑖𝑛1\leq i\leq n1 ≤ italic_i ≤ italic_n, it is straightforward to observe that the i𝑖iitalic_i-th player’s utility functions in 𝒢1subscript𝒢1\mathcal{G}_{1}caligraphic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and 𝒢2subscript𝒢2\mathcal{G}_{2}caligraphic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT differ only by a multiplicative constant of 3333. Consequently, we have 𝒙1=𝒙2superscriptsubscript𝒙1superscriptsubscript𝒙2\bm{x}_{1}^{*}=\bm{x}_{2}^{*}bold_italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = bold_italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT. Therefore, for any C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT instance 𝒢1subscript𝒢1\mathcal{G}_{1}caligraphic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT with m=1𝑚1m=1italic_m = 1, we can always construct an equivalent instance 𝒢2subscript𝒢2\mathcal{G}_{2}caligraphic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT that shares the same PNE structure, while ensuring Pi13subscript𝑃𝑖13P_{i}\leq\frac{1}{3}italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ divide start_ARG 1 end_ARG start_ARG 3 end_ARG. This justifies the assumptions that, without loss of generality, we can take Pi13subscript𝑃𝑖13P_{i}\leq\frac{1}{3}italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ divide start_ARG 1 end_ARG start_ARG 3 end_ARG.

Another property we need is that the PNE strategy xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT of any player i𝑖iitalic_i in 𝒢1subscript𝒢1\mathcal{G}_{1}caligraphic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT is bounded in a compact region [0,L]0𝐿[0,L][ 0 , italic_L ] for some constant L𝐿Litalic_L, regardless of the values of 𝜷𝜷\bm{\beta}bold_italic_β. To see this, note that ci(xi)subscript𝑐𝑖subscript𝑥𝑖c_{i}(x_{i})italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) is increasing in xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and goes to infinity as xi+subscript𝑥𝑖x_{i}\rightarrow+\inftyitalic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT → + ∞ while Pisubscript𝑃𝑖P_{i}italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is upper bounded by 1111. As a result, for any xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT such that ui(xi,𝒙i)1ci(xi)<ci(0)=ui(0,𝒙i)subscript𝑢𝑖subscript𝑥𝑖subscript𝒙𝑖1subscript𝑐𝑖subscript𝑥𝑖subscript𝑐𝑖0subscript𝑢𝑖0subscript𝒙𝑖u_{i}(x_{i},\bm{x}_{-i})\leq 1-c_{i}(x_{i})<-c_{i}(0)=u_{i}(0,\bm{x}_{-i})italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , bold_italic_x start_POSTSUBSCRIPT - italic_i end_POSTSUBSCRIPT ) ≤ 1 - italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) < - italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 0 ) = italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 0 , bold_italic_x start_POSTSUBSCRIPT - italic_i end_POSTSUBSCRIPT ), xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT cannot be a PNE strategy as switching to 00 increases player i𝑖iitalic_i’s utility. Therefore, if we take

L=maxi[n]{infx{x0:1ci(xi)<ci(0)}},𝐿subscript𝑚𝑎𝑥𝑖delimited-[]𝑛subscriptinfimum𝑥conditional-set𝑥01subscript𝑐𝑖subscript𝑥𝑖subscript𝑐𝑖0L=\mathop{max}_{i\in[n]}\{\inf_{x}\{x\geq 0:1-c_{i}(x_{i})<-c_{i}(0)\}\},italic_L = start_BIGOP italic_m italic_a italic_x end_BIGOP start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT { roman_inf start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT { italic_x ≥ 0 : 1 - italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) < - italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 0 ) } } ,

it holds that xiL,i[n]formulae-sequencesubscript𝑥𝑖𝐿for-all𝑖delimited-[]𝑛x_{i}\leq L,\forall i\in[n]italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ italic_L , ∀ italic_i ∈ [ italic_n ].

Let F(x,β)=(uixi)i=1n𝐹𝑥𝛽superscriptsubscriptsubscript𝑢𝑖subscript𝑥𝑖𝑖1𝑛F(x,\beta)=\left(\frac{\partial u_{i}}{\partial x_{i}}\right)_{i=1}^{n}italic_F ( italic_x , italic_β ) = ( divide start_ARG ∂ italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG ∂ italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ) start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT be an n𝑛nitalic_n-value function. From Corollary 1 we know 𝒙,β𝒙𝛽\bm{x},\betabold_italic_x , italic_β satisfy F(𝒙,β)=0𝐹𝒙𝛽0F(\bm{x},\beta)=0italic_F ( bold_italic_x , italic_β ) = 0. And Theorem 1 guarantees that for any β0𝛽0\beta\geq 0italic_β ≥ 0, the 𝒙𝒙\bm{x}bold_italic_x implicitly determined by F(𝒙,β)=0𝐹𝒙𝛽0F(\bm{x},\beta)=0italic_F ( bold_italic_x , italic_β ) = 0 exists and is unique. Therefore, by the implicit function theorem (Krantz and Parks, 2002), the derivative of 𝒙𝒙\bm{x}bold_italic_x w.r.t. β𝛽\betaitalic_β can be written as

d𝒙dβ=(F𝒙)1Fβ,𝑑𝒙𝑑𝛽superscript𝐹𝒙1𝐹𝛽\frac{d\bm{x}}{d\beta}=-\left(\frac{\partial F}{\partial\bm{x}}\right)^{-1}% \cdot\frac{\partial F}{\partial\beta},divide start_ARG italic_d bold_italic_x end_ARG start_ARG italic_d italic_β end_ARG = - ( divide start_ARG ∂ italic_F end_ARG start_ARG ∂ bold_italic_x end_ARG ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ⋅ divide start_ARG ∂ italic_F end_ARG start_ARG ∂ italic_β end_ARG ,

where [F𝒙]n×nsubscriptdelimited-[]𝐹𝒙𝑛𝑛\left[\frac{\partial F}{\partial\bm{x}}\right]_{n\times n}[ divide start_ARG ∂ italic_F end_ARG start_ARG ∂ bold_italic_x end_ARG ] start_POSTSUBSCRIPT italic_n × italic_n end_POSTSUBSCRIPT is the Jacobian matrix (which is also the Hessian of 𝒢𝒢\mathcal{G}caligraphic_G when λ=𝟏𝜆1\lambda=\bm{1}italic_λ = bold_1, see Eq. (15)) and Fβn×1𝐹𝛽superscript𝑛1\frac{\partial F}{\partial\beta}\in\mathbb{R}^{n\times 1}divide start_ARG ∂ italic_F end_ARG start_ARG ∂ italic_β end_ARG ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × 1 end_POSTSUPERSCRIPT is the partial derivative of F𝐹Fitalic_F w.r.t. β𝛽\betaitalic_β.

The first-order derivative of uisubscript𝑢𝑖u_{i}italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT can be calculated as

uixi=eβwik=1nxkeβwkxi(eβwik=1nxkeβwk)2ci(xi),subscript𝑢𝑖subscript𝑥𝑖superscript𝑒𝛽subscript𝑤𝑖superscriptsubscript𝑘1𝑛subscript𝑥𝑘superscript𝑒𝛽subscript𝑤𝑘subscript𝑥𝑖superscriptsuperscript𝑒𝛽subscript𝑤𝑖superscriptsubscript𝑘1𝑛subscript𝑥𝑘superscript𝑒𝛽subscript𝑤𝑘2subscriptsuperscript𝑐𝑖subscript𝑥𝑖\frac{\partial u_{i}}{\partial x_{i}}=\frac{e^{\beta w_{i}}}{\sum_{k=1}^{n}x_{% k}e^{\beta w_{k}}}-x_{i}\left(\frac{e^{\beta w_{i}}}{\sum_{k=1}^{n}x_{k}e^{% \beta w_{k}}}\right)^{2}-c^{\prime}_{i}(x_{i}),divide start_ARG ∂ italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG ∂ italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG = divide start_ARG italic_e start_POSTSUPERSCRIPT italic_β italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_β italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG - italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( divide start_ARG italic_e start_POSTSUPERSCRIPT italic_β italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_β italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) , (21)

and we can use it to further obtain the following explicit expressions in terms of the derivatives of F𝐹Fitalic_F:

(Fixi)=1xi2Pi2(12Pi)1xi2Pi2ci′′(xi),subscript𝐹𝑖subscript𝑥𝑖1superscriptsubscript𝑥𝑖2superscriptsubscript𝑃𝑖212subscript𝑃𝑖1superscriptsubscript𝑥𝑖2superscriptsubscript𝑃𝑖2superscriptsubscript𝑐𝑖′′subscript𝑥𝑖\displaystyle\left(\frac{\partial F_{i}}{\partial x_{i}}\right)=-\frac{1}{x_{i% }^{2}}P_{i}^{2}\left(1-2P_{i}\right)-\frac{1}{x_{i}^{2}}P_{i}^{2}-c_{i}^{% \prime\prime}(x_{i}),( divide start_ARG ∂ italic_F start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG ∂ italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ) = - divide start_ARG 1 end_ARG start_ARG italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) - divide start_ARG 1 end_ARG start_ARG italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ,
(Fixj)=1xixjPiPj(12Pi),ji,formulae-sequencesubscript𝐹𝑖subscript𝑥𝑗1subscript𝑥𝑖subscript𝑥𝑗subscript𝑃𝑖subscript𝑃𝑗12subscript𝑃𝑖𝑗𝑖\displaystyle\left(\frac{\partial F_{i}}{\partial x_{j}}\right)=-\frac{1}{x_{i% }x_{j}}P_{i}P_{j}\left(1-2P_{i}\right),j\neq i,( divide start_ARG ∂ italic_F start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG ∂ italic_x start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG ) = - divide start_ARG 1 end_ARG start_ARG italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) , italic_j ≠ italic_i ,
(Fiβ)=Pixi(12Pi)(wik=1nPkwk).subscript𝐹𝑖𝛽subscript𝑃𝑖subscript𝑥𝑖12subscript𝑃𝑖subscript𝑤𝑖superscriptsubscript𝑘1𝑛subscript𝑃𝑘subscript𝑤𝑘\displaystyle\left(\frac{\partial F_{i}}{\partial\beta}\right)=\frac{P_{i}}{x_% {i}}\cdot(1-2P_{i})\cdot\left(w_{i}-\sum_{k=1}^{n}P_{k}w_{k}\right).( divide start_ARG ∂ italic_F start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG ∂ italic_β end_ARG ) = divide start_ARG italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ⋅ ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ⋅ ( italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) . (22)

Let’s define a positive definite diagonal matrix

D=diag(P12x12+c1′′,,Pn2xn2+cn′′).𝐷diagsuperscriptsubscript𝑃12superscriptsubscript𝑥12subscriptsuperscript𝑐′′1superscriptsubscript𝑃𝑛2superscriptsubscript𝑥𝑛2subscriptsuperscript𝑐′′𝑛D=\text{diag}\left(\frac{P_{1}^{2}}{x_{1}^{2}}+c^{\prime\prime}_{1},\cdots,% \frac{P_{n}^{2}}{x_{n}^{2}}+c^{\prime\prime}_{n}\right).italic_D = diag ( divide start_ARG italic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG + italic_c start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , divide start_ARG italic_P start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_x start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG + italic_c start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) .

Since ci′′>0subscriptsuperscript𝑐′′𝑖0c^{\prime\prime}_{i}>0italic_c start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > 0, we can introduce variables

δi(0,1) such that Pi2xi2+ci′′=Pi2δixi2.subscript𝛿𝑖01 such that superscriptsubscript𝑃𝑖2superscriptsubscript𝑥𝑖2subscriptsuperscript𝑐′′𝑖superscriptsubscript𝑃𝑖2subscript𝛿𝑖superscriptsubscript𝑥𝑖2\delta_{i}\in(0,1)\text{~{}such that~{}}\frac{P_{i}^{2}}{x_{i}^{2}}+c^{\prime% \prime}_{i}=\frac{P_{i}^{2}}{\delta_{i}x_{i}^{2}}.italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ ( 0 , 1 ) such that divide start_ARG italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG + italic_c start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = divide start_ARG italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG . (23)

In addition, let’s also define

𝒚=(P1(12P1)x1,,Pn(12Pn)xn),𝒛=(P1x1,,Pnxn),formulae-sequence𝒚superscriptsubscript𝑃112subscript𝑃1subscript𝑥1subscript𝑃𝑛12subscript𝑃𝑛subscript𝑥𝑛top𝒛superscriptsubscript𝑃1subscript𝑥1subscript𝑃𝑛subscript𝑥𝑛top\bm{y}=\left(\frac{P_{1}(1-2P_{1})}{x_{1}},\cdots,\frac{P_{n}(1-2P_{n})}{x_{n}% }\right)^{\top},\bm{z}=\left(\frac{P_{1}}{x_{1}},\cdots,\frac{P_{n}}{x_{n}}% \right)^{\top},bold_italic_y = ( divide start_ARG italic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG , ⋯ , divide start_ARG italic_P start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_ARG start_ARG italic_x start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , bold_italic_z = ( divide start_ARG italic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG , ⋯ , divide start_ARG italic_P start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG start_ARG italic_x start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , (24)

then we have F𝒙=D+𝒚𝒛𝐹𝒙𝐷𝒚superscript𝒛top\frac{\partial F}{\partial\bm{x}}=D+\bm{y}\bm{z}^{\top}divide start_ARG ∂ italic_F end_ARG start_ARG ∂ bold_italic_x end_ARG = italic_D + bold_italic_y bold_italic_z start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT and from Sherman–Morrison formula (Sherman, 1949), it holds that

(F𝒙)1superscript𝐹𝒙1\displaystyle-\left(\frac{\partial F}{\partial\bm{x}}\right)^{-1}- ( divide start_ARG ∂ italic_F end_ARG start_ARG ∂ bold_italic_x end_ARG ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT =(D+𝒚𝒛)1absentsuperscript𝐷𝒚superscript𝒛top1\displaystyle=(D+\bm{y}\bm{z}^{\top})^{-1}= ( italic_D + bold_italic_y bold_italic_z start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT
=D1D1𝒚𝒛D11+𝒛D1𝒚,absentsuperscript𝐷1superscript𝐷1𝒚superscript𝒛topsuperscript𝐷11superscript𝒛topsuperscript𝐷1𝒚\displaystyle=D^{-1}-\frac{D^{-1}\bm{y}\bm{z}^{\top}D^{-1}}{1+\bm{z}^{\top}D^{% -1}\bm{y}},= italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT - divide start_ARG italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT bold_italic_y bold_italic_z start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_ARG start_ARG 1 + bold_italic_z start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT bold_italic_y end_ARG , (25)

where

D1=diag(δ1x12P12,,δnxn2Pn2),superscript𝐷1diagsubscript𝛿1superscriptsubscript𝑥12superscriptsubscript𝑃12subscript𝛿𝑛superscriptsubscript𝑥𝑛2superscriptsubscript𝑃𝑛2\displaystyle D^{-1}=\text{diag}\left(\frac{\delta_{1}x_{1}^{2}}{P_{1}^{2}},% \cdots,\frac{\delta_{n}x_{n}^{2}}{P_{n}^{2}}\right),italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT = diag ( divide start_ARG italic_δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG , ⋯ , divide start_ARG italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_P start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ) ,
D1𝒚=(δ1x1(12P1)P1,,δnxn(12Pn)Pn),superscript𝐷1𝒚superscriptsubscript𝛿1subscript𝑥112subscript𝑃1subscript𝑃1subscript𝛿𝑛subscript𝑥𝑛12subscript𝑃𝑛subscript𝑃𝑛top\displaystyle D^{-1}\bm{y}=\left(\frac{\delta_{1}x_{1}(1-2P_{1})}{P_{1}},% \cdots,\frac{\delta_{n}x_{n}(1-2P_{n})}{P_{n}}\right)^{\top},italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT bold_italic_y = ( divide start_ARG italic_δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG , ⋯ , divide start_ARG italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_ARG start_ARG italic_P start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ,
𝒛D1=(δ1x1P1,,δnxnPn).superscript𝒛topsuperscript𝐷1subscript𝛿1subscript𝑥1subscript𝑃1subscript𝛿𝑛subscript𝑥𝑛subscript𝑃𝑛\displaystyle\bm{z}^{\top}D^{-1}=\left(\frac{\delta_{1}x_{1}}{P_{1}},\cdots,% \frac{\delta_{n}x_{n}}{P_{n}}\right).bold_italic_z start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT = ( divide start_ARG italic_δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG , ⋯ , divide start_ARG italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG start_ARG italic_P start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG ) .

Since Pi13subscript𝑃𝑖13P_{i}\leq\frac{1}{3}italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ divide start_ARG 1 end_ARG start_ARG 3 end_ARG, we have 12Pi13>012subscript𝑃𝑖1301-2P_{i}\geq\frac{1}{3}>01 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≥ divide start_ARG 1 end_ARG start_ARG 3 end_ARG > 0. With all the notations introduced so far we are now ready to give the formal proof of Theorem 2.

Proof 

We prove the monotonicity of U(β)𝑈𝛽U(\beta)italic_U ( italic_β ) and V(β)𝑉𝛽V(\beta)italic_V ( italic_β ) by showing dlnUdβ>0𝑑𝑈𝑑𝛽0\frac{d\ln U}{d\beta}>0divide start_ARG italic_d roman_ln italic_U end_ARG start_ARG italic_d italic_β end_ARG > 0 and dVdβ<0𝑑𝑉𝑑𝛽0\frac{dV}{d\beta}<0divide start_ARG italic_d italic_V end_ARG start_ARG italic_d italic_β end_ARG < 0.

The monotonicity of U(β)𝑈𝛽U(\beta)italic_U ( italic_β ): The first-order derivative of lnU𝑈\ln Uroman_ln italic_U w.r.t. β𝛽\betaitalic_β is given by

dlnUdβ𝑑𝑈𝑑𝛽\displaystyle\frac{d\ln U}{d\beta}divide start_ARG italic_d roman_ln italic_U end_ARG start_ARG italic_d italic_β end_ARG =1U(U𝒙d𝒙dβ+Uβ)absent1𝑈𝑈𝒙𝑑𝒙𝑑𝛽𝑈𝛽\displaystyle=\frac{1}{U}\cdot\left(\frac{\partial U}{\partial\bm{x}}\cdot% \frac{d\bm{x}}{d\beta}+\frac{\partial U}{\partial\beta}\right)= divide start_ARG 1 end_ARG start_ARG italic_U end_ARG ⋅ ( divide start_ARG ∂ italic_U end_ARG start_ARG ∂ bold_italic_x end_ARG ⋅ divide start_ARG italic_d bold_italic_x end_ARG start_ARG italic_d italic_β end_ARG + divide start_ARG ∂ italic_U end_ARG start_ARG ∂ italic_β end_ARG )
=1UU𝒙(F𝒙)1Fβ+1UUβ.absent1𝑈𝑈𝒙superscript𝐹𝒙1𝐹𝛽1𝑈𝑈𝛽\displaystyle=-\frac{1}{U}\cdot\frac{\partial U}{\partial\bm{x}}\cdot\left(% \frac{\partial F}{\partial\bm{x}}\right)^{-1}\cdot\frac{\partial F}{\partial% \beta}+\frac{1}{U}\cdot\frac{\partial U}{\partial\beta}.= - divide start_ARG 1 end_ARG start_ARG italic_U end_ARG ⋅ divide start_ARG ∂ italic_U end_ARG start_ARG ∂ bold_italic_x end_ARG ⋅ ( divide start_ARG ∂ italic_F end_ARG start_ARG ∂ bold_italic_x end_ARG ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ⋅ divide start_ARG ∂ italic_F end_ARG start_ARG ∂ italic_β end_ARG + divide start_ARG 1 end_ARG start_ARG italic_U end_ARG ⋅ divide start_ARG ∂ italic_U end_ARG start_ARG ∂ italic_β end_ARG . (26)

where U𝒙1×n𝑈𝒙superscript1𝑛\frac{\partial U}{\partial\bm{x}}\in\mathbb{R}^{1\times n}divide start_ARG ∂ italic_U end_ARG start_ARG ∂ bold_italic_x end_ARG ∈ blackboard_R start_POSTSUPERSCRIPT 1 × italic_n end_POSTSUPERSCRIPT is the partial derivative of U𝑈Uitalic_U w.r.t. 𝒙𝒙\bm{x}bold_italic_x. Let ai=eβwisubscript𝑎𝑖superscript𝑒𝛽subscript𝑤𝑖a_{i}=e^{\beta w_{i}}italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_e start_POSTSUPERSCRIPT italic_β italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, we will first show 1UUβ01𝑈𝑈𝛽0\frac{1}{U}\cdot\frac{\partial U}{\partial\beta}\geq 0divide start_ARG 1 end_ARG start_ARG italic_U end_ARG ⋅ divide start_ARG ∂ italic_U end_ARG start_ARG ∂ italic_β end_ARG ≥ 0. In fact, calculation shows

1UUβ1𝑈𝑈𝛽\displaystyle\frac{1}{U}\cdot\frac{\partial U}{\partial\beta}divide start_ARG 1 end_ARG start_ARG italic_U end_ARG ⋅ divide start_ARG ∂ italic_U end_ARG start_ARG ∂ italic_β end_ARG =k=1nxkakk=1nwkxkak((k=1nwk2xkak)(k=1nxkak)(k=1nwkxkak)2(k=1nxkak)2)absentsuperscriptsubscript𝑘1𝑛subscript𝑥𝑘subscript𝑎𝑘superscriptsubscript𝑘1𝑛subscript𝑤𝑘subscript𝑥𝑘subscript𝑎𝑘superscriptsubscript𝑘1𝑛superscriptsubscript𝑤𝑘2subscript𝑥𝑘subscript𝑎𝑘superscriptsubscript𝑘1𝑛subscript𝑥𝑘subscript𝑎𝑘superscriptsuperscriptsubscript𝑘1𝑛subscript𝑤𝑘subscript𝑥𝑘subscript𝑎𝑘2superscriptsuperscriptsubscript𝑘1𝑛subscript𝑥𝑘subscript𝑎𝑘2\displaystyle=\frac{\sum_{k=1}^{n}x_{k}a_{k}}{\sum_{k=1}^{n}w_{k}x_{k}a_{k}}% \cdot\left(\frac{(\sum_{k=1}^{n}w_{k}^{2}x_{k}a_{k})(\sum_{k=1}^{n}x_{k}a_{k})% -(\sum_{k=1}^{n}w_{k}x_{k}a_{k})^{2}}{(\sum_{k=1}^{n}x_{k}a_{k})^{2}}\right)= divide start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG ⋅ ( divide start_ARG ( ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ( ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) - ( ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ( ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG )
=k=1nwk2xkakk=1nwkxkakk=1nwkxkakk=1nxkak.absentsuperscriptsubscript𝑘1𝑛superscriptsubscript𝑤𝑘2subscript𝑥𝑘subscript𝑎𝑘superscriptsubscript𝑘1𝑛subscript𝑤𝑘subscript𝑥𝑘subscript𝑎𝑘superscriptsubscript𝑘1𝑛subscript𝑤𝑘subscript𝑥𝑘subscript𝑎𝑘superscriptsubscript𝑘1𝑛subscript𝑥𝑘subscript𝑎𝑘\displaystyle=\frac{\sum_{k=1}^{n}w_{k}^{2}x_{k}a_{k}}{\sum_{k=1}^{n}w_{k}x_{k% }a_{k}}-\frac{\sum_{k=1}^{n}w_{k}x_{k}a_{k}}{\sum_{k=1}^{n}x_{k}a_{k}}.= divide start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG - divide start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG . (27)

From Cauchy–Schwarz inequality, it holds that

k=1nwk2xkakk=1nxkak(k=1nwk2xkakxkak)2=(k=1nwkxkak)2.superscriptsubscript𝑘1𝑛superscriptsubscript𝑤𝑘2subscript𝑥𝑘subscript𝑎𝑘superscriptsubscript𝑘1𝑛subscript𝑥𝑘subscript𝑎𝑘superscriptsuperscriptsubscript𝑘1𝑛superscriptsubscript𝑤𝑘2subscript𝑥𝑘subscript𝑎𝑘subscript𝑥𝑘subscript𝑎𝑘2superscriptsuperscriptsubscript𝑘1𝑛subscript𝑤𝑘subscript𝑥𝑘subscript𝑎𝑘2\sum_{k=1}^{n}w_{k}^{2}x_{k}a_{k}\cdot\sum_{k=1}^{n}x_{k}a_{k}\geq\left(\sum_{% k=1}^{n}\sqrt{w_{k}^{2}x_{k}a_{k}\cdot x_{k}a_{k}}\right)^{2}=\left(\sum_{k=1}% ^{n}w_{k}x_{k}a_{k}\right)^{2}.∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ⋅ ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≥ ( ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT square-root start_ARG italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ⋅ italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = ( ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT .

Therefore, the RHS of Eq. (27) is greater than or equal to 00. Hence, it suffices to show

1UU𝒙(F𝒙)1Fβ>0.1𝑈𝑈𝒙superscript𝐹𝒙1𝐹𝛽0-\frac{1}{U}\cdot\frac{\partial U}{\partial\bm{x}}\cdot\left(\frac{\partial F}% {\partial\bm{x}}\right)^{-1}\cdot\frac{\partial F}{\partial\beta}>0.- divide start_ARG 1 end_ARG start_ARG italic_U end_ARG ⋅ divide start_ARG ∂ italic_U end_ARG start_ARG ∂ bold_italic_x end_ARG ⋅ ( divide start_ARG ∂ italic_F end_ARG start_ARG ∂ bold_italic_x end_ARG ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ⋅ divide start_ARG ∂ italic_F end_ARG start_ARG ∂ italic_β end_ARG > 0 . (28)

Also note that

1UUxi1𝑈𝑈subscript𝑥𝑖\displaystyle\frac{1}{U}\cdot\frac{\partial U}{\partial x_{i}}divide start_ARG 1 end_ARG start_ARG italic_U end_ARG ⋅ divide start_ARG ∂ italic_U end_ARG start_ARG ∂ italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG =k=1nxkakk=1nwkxkak(wiai(k=1nxkak)aik=1nwkxkak(k=1nxkak)2)absentsuperscriptsubscript𝑘1𝑛subscript𝑥𝑘subscript𝑎𝑘superscriptsubscript𝑘1𝑛subscript𝑤𝑘subscript𝑥𝑘subscript𝑎𝑘subscript𝑤𝑖subscript𝑎𝑖superscriptsubscript𝑘1𝑛subscript𝑥𝑘subscript𝑎𝑘subscript𝑎𝑖superscriptsubscript𝑘1𝑛subscript𝑤𝑘subscript𝑥𝑘subscript𝑎𝑘superscriptsuperscriptsubscript𝑘1𝑛subscript𝑥𝑘subscript𝑎𝑘2\displaystyle=\frac{\sum_{k=1}^{n}x_{k}a_{k}}{\sum_{k=1}^{n}w_{k}x_{k}a_{k}}% \cdot\left(\frac{w_{i}a_{i}(\sum_{k=1}^{n}x_{k}a_{k})-a_{i}\sum_{k=1}^{n}w_{k}% x_{k}a_{k}}{(\sum_{k=1}^{n}x_{k}a_{k})^{2}}\right)= divide start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG ⋅ ( divide start_ARG italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) - italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG ( ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG )
=wiaik=1nwkxkakaik=1nxkakabsentsubscript𝑤𝑖subscript𝑎𝑖superscriptsubscript𝑘1𝑛subscript𝑤𝑘subscript𝑥𝑘subscript𝑎𝑘subscript𝑎𝑖superscriptsubscript𝑘1𝑛subscript𝑥𝑘subscript𝑎𝑘\displaystyle=\frac{w_{i}a_{i}}{\sum_{k=1}^{n}w_{k}x_{k}a_{k}}-\frac{a_{i}}{% \sum_{k=1}^{n}x_{k}a_{k}}= divide start_ARG italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG - divide start_ARG italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG
=wiaik=1nwkxkakPixi,absentsubscript𝑤𝑖subscript𝑎𝑖superscriptsubscript𝑘1𝑛subscript𝑤𝑘subscript𝑥𝑘subscript𝑎𝑘subscript𝑃𝑖subscript𝑥𝑖\displaystyle=\frac{w_{i}a_{i}}{\sum_{k=1}^{n}w_{k}x_{k}a_{k}}-\frac{P_{i}}{x_% {i}},= divide start_ARG italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG - divide start_ARG italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG , (29)

and substitute Eq. (22), (25), and (29) into the LHS of Eq. (28), we obtain

1UU𝒙(F𝒙)1Fβ1𝑈𝑈𝒙superscript𝐹𝒙1𝐹𝛽\displaystyle-\frac{1}{U}\cdot\frac{\partial U}{\partial\bm{x}}\cdot\left(% \frac{\partial F}{\partial\bm{x}}\right)^{-1}\cdot\frac{\partial F}{\partial\beta}- divide start_ARG 1 end_ARG start_ARG italic_U end_ARG ⋅ divide start_ARG ∂ italic_U end_ARG start_ARG ∂ bold_italic_x end_ARG ⋅ ( divide start_ARG ∂ italic_F end_ARG start_ARG ∂ bold_italic_x end_ARG ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ⋅ divide start_ARG ∂ italic_F end_ARG start_ARG ∂ italic_β end_ARG
=\displaystyle== [wiaik=1nwkxkakPixi]i[n][D1D1𝒚𝒛D11+𝒛D1𝒚][Pixi(12Pi)(wik=1nPkwk)]i[n]subscriptsuperscriptdelimited-[]subscript𝑤𝑖subscript𝑎𝑖superscriptsubscript𝑘1𝑛subscript𝑤𝑘subscript𝑥𝑘subscript𝑎𝑘subscript𝑃𝑖subscript𝑥𝑖top𝑖delimited-[]𝑛delimited-[]superscript𝐷1superscript𝐷1𝒚superscript𝒛topsuperscript𝐷11superscript𝒛topsuperscript𝐷1𝒚subscriptdelimited-[]subscript𝑃𝑖subscript𝑥𝑖12subscript𝑃𝑖subscript𝑤𝑖superscriptsubscript𝑘1𝑛subscript𝑃𝑘subscript𝑤𝑘𝑖delimited-[]𝑛\displaystyle\left[\frac{w_{i}a_{i}}{\sum_{k=1}^{n}w_{k}x_{k}a_{k}}-\frac{P_{i% }}{x_{i}}\right]^{\top}_{i\in[n]}\cdot\left[D^{-1}-\frac{D^{-1}\bm{y}\bm{z}^{% \top}D^{-1}}{1+\bm{z}^{\top}D^{-1}\bm{y}}\right]\cdot\left[\frac{P_{i}}{x_{i}}% \cdot(1-2P_{i})\cdot\left(w_{i}-\sum_{k=1}^{n}P_{k}w_{k}\right)\right]_{i\in[n]}[ divide start_ARG italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG - divide start_ARG italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT ⋅ [ italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT - divide start_ARG italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT bold_italic_y bold_italic_z start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_ARG start_ARG 1 + bold_italic_z start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT bold_italic_y end_ARG ] ⋅ [ divide start_ARG italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ⋅ ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ⋅ ( italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ] start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT
=\displaystyle== 1Ti=1nδi(12Pi)(wiT)2(i=1nδi(12Pi)(wiT))2T(1+i=1nδi(12Pi)),1𝑇superscriptsubscript𝑖1𝑛subscript𝛿𝑖12subscript𝑃𝑖superscriptsubscript𝑤𝑖𝑇2superscriptsuperscriptsubscript𝑖1𝑛subscript𝛿𝑖12subscript𝑃𝑖subscript𝑤𝑖𝑇2𝑇1superscriptsubscript𝑖1𝑛subscript𝛿𝑖12subscript𝑃𝑖\displaystyle\frac{1}{T}\sum_{i=1}^{n}\delta_{i}(1-2P_{i})(w_{i}-T)^{2}-\frac{% \left(\sum_{i=1}^{n}\delta_{i}(1-2P_{i})(w_{i}-T)\right)^{2}}{T(1+\sum_{i=1}^{% n}\delta_{i}(1-2P_{i}))},divide start_ARG 1 end_ARG start_ARG italic_T end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ( italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_T ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - divide start_ARG ( ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ( italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_T ) ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_T ( 1 + ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) end_ARG , (30)

where T=k=1nwkxkakk=1nxkak=i=1nPiwi𝑇superscriptsubscript𝑘1𝑛subscript𝑤𝑘subscript𝑥𝑘subscript𝑎𝑘superscriptsubscript𝑘1𝑛subscript𝑥𝑘subscript𝑎𝑘superscriptsubscript𝑖1𝑛subscript𝑃𝑖subscript𝑤𝑖T=\frac{\sum_{k=1}^{n}w_{k}x_{k}a_{k}}{\sum_{k=1}^{n}x_{k}a_{k}}=\sum_{i=1}^{n% }P_{i}w_{i}italic_T = divide start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Therefore, it suffices to prove

i=1nδi(12Pi)(wiT)2(i=1nδi(12Pi)(wiT))21+i=1nδi(12Pi)>0.superscriptsubscript𝑖1𝑛subscript𝛿𝑖12subscript𝑃𝑖superscriptsubscript𝑤𝑖𝑇2superscriptsuperscriptsubscript𝑖1𝑛subscript𝛿𝑖12subscript𝑃𝑖subscript𝑤𝑖𝑇21superscriptsubscript𝑖1𝑛subscript𝛿𝑖12subscript𝑃𝑖0\sum_{i=1}^{n}\delta_{i}(1-2P_{i})(w_{i}-T)^{2}-\frac{\left(\sum_{i=1}^{n}% \delta_{i}(1-2P_{i})(w_{i}-T)\right)^{2}}{1+\sum_{i=1}^{n}\delta_{i}(1-2P_{i})% }>0.∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ( italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_T ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - divide start_ARG ( ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ( italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_T ) ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 1 + ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG > 0 . (31)

Since 12Pi>012subscript𝑃𝑖01-2P_{i}>01 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > 0, from Cauchy–Schwarz inequality it holds that

i=1nδi(12Pi)(wiT)2(1+i=1nδi(12Pi))superscriptsubscript𝑖1𝑛subscript𝛿𝑖12subscript𝑃𝑖superscriptsubscript𝑤𝑖𝑇21superscriptsubscript𝑖1𝑛subscript𝛿𝑖12subscript𝑃𝑖\displaystyle\sum_{i=1}^{n}\delta_{i}(1-2P_{i})(w_{i}-T)^{2}\cdot\left(1+\sum_% {i=1}^{n}\delta_{i}(1-2P_{i})\right)∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ( italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_T ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ⋅ ( 1 + ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) )
=\displaystyle== i=1nδi(12Pi)(wiT)2i=1nδi(12Pi)+i=1nδi(12Pi)(wiT)2superscriptsubscript𝑖1𝑛subscript𝛿𝑖12subscript𝑃𝑖superscriptsubscript𝑤𝑖𝑇2superscriptsubscript𝑖1𝑛subscript𝛿𝑖12subscript𝑃𝑖superscriptsubscript𝑖1𝑛subscript𝛿𝑖12subscript𝑃𝑖superscriptsubscript𝑤𝑖𝑇2\displaystyle\sum_{i=1}^{n}\delta_{i}(1-2P_{i})(w_{i}-T)^{2}\cdot\sum_{i=1}^{n% }\delta_{i}(1-2P_{i})+\sum_{i=1}^{n}\delta_{i}(1-2P_{i})(w_{i}-T)^{2}∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ( italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_T ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ⋅ ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ( italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_T ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT
\displaystyle\geq (i=1nδi(12Pi)(wiT))2+i=1nδi(12Pi)(wiT)2superscriptsuperscriptsubscript𝑖1𝑛subscript𝛿𝑖12subscript𝑃𝑖subscript𝑤𝑖𝑇2superscriptsubscript𝑖1𝑛subscript𝛿𝑖12subscript𝑃𝑖superscriptsubscript𝑤𝑖𝑇2\displaystyle\left(\sum_{i=1}^{n}\delta_{i}(1-2P_{i})(w_{i}-T)\right)^{2}+\sum% _{i=1}^{n}\delta_{i}(1-2P_{i})(w_{i}-T)^{2}( ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ( italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_T ) ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ( italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_T ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT
>\displaystyle>> (i=1nδi(12Pi)(wiT))2,superscriptsuperscriptsubscript𝑖1𝑛subscript𝛿𝑖12subscript𝑃𝑖subscript𝑤𝑖𝑇2\displaystyle\left(\sum_{i=1}^{n}\delta_{i}(1-2P_{i})(w_{i}-T)\right)^{2},( ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ( italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_T ) ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ,

where the last inequality holds because {𝒘i}subscript𝒘𝑖\{\bm{w}_{i}\}{ bold_italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } are not identical so there exists at least one j[n]𝑗delimited-[]𝑛j\in[n]italic_j ∈ [ italic_n ] such that δj(12Pj)(wjT)2>0subscript𝛿𝑗12subscript𝑃𝑗superscriptsubscript𝑤𝑗𝑇20\delta_{j}(1-2P_{j})(w_{j}-T)^{2}>0italic_δ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) ( italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_T ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT > 0. Therefore, Eq. (31) holds and we have dlnUdβ>0𝑑𝑈𝑑𝛽0\frac{d\ln U}{d\beta}>0divide start_ARG italic_d roman_ln italic_U end_ARG start_ARG italic_d italic_β end_ARG > 0.

The monotonicity of V(β)𝑉𝛽V(\beta)italic_V ( italic_β ): Next we show dVdβ<0𝑑𝑉𝑑𝛽0\frac{dV}{d\beta}<0divide start_ARG italic_d italic_V end_ARG start_ARG italic_d italic_β end_ARG < 0. Since Vβ=0𝑉𝛽0\frac{\partial V}{\partial\beta}=0divide start_ARG ∂ italic_V end_ARG start_ARG ∂ italic_β end_ARG = 0, this is equivalent to show

V𝒙(F𝒙)1Fβ>0.𝑉𝒙superscript𝐹𝒙1𝐹𝛽0\frac{\partial V}{\partial\bm{x}}\cdot\left(\frac{\partial F}{\partial\bm{x}}% \right)^{-1}\cdot\frac{\partial F}{\partial\beta}>0.divide start_ARG ∂ italic_V end_ARG start_ARG ∂ bold_italic_x end_ARG ⋅ ( divide start_ARG ∂ italic_F end_ARG start_ARG ∂ bold_italic_x end_ARG ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ⋅ divide start_ARG ∂ italic_F end_ARG start_ARG ∂ italic_β end_ARG > 0 . (32)

Note that dVdxi=1𝑑𝑉𝑑subscript𝑥𝑖1\frac{dV}{dx_{i}}=1divide start_ARG italic_d italic_V end_ARG start_ARG italic_d italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG = 1, we have

V𝒙(F𝒙)1Fβ𝑉𝒙superscript𝐹𝒙1𝐹𝛽\displaystyle\frac{\partial V}{\partial\bm{x}}\cdot\left(\frac{\partial F}{% \partial\bm{x}}\right)^{-1}\cdot\frac{\partial F}{\partial\beta}divide start_ARG ∂ italic_V end_ARG start_ARG ∂ bold_italic_x end_ARG ⋅ ( divide start_ARG ∂ italic_F end_ARG start_ARG ∂ bold_italic_x end_ARG ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ⋅ divide start_ARG ∂ italic_F end_ARG start_ARG ∂ italic_β end_ARG
=\displaystyle== [1,1,,1][D1D1𝒚𝒛D11+𝒛D1𝒚][Pixi(12Pi)(wik=1nPkwk)]i[n]111delimited-[]superscript𝐷1superscript𝐷1𝒚superscript𝒛topsuperscript𝐷11superscript𝒛topsuperscript𝐷1𝒚subscriptdelimited-[]subscript𝑃𝑖subscript𝑥𝑖12subscript𝑃𝑖subscript𝑤𝑖superscriptsubscript𝑘1𝑛subscript𝑃𝑘subscript𝑤𝑘𝑖delimited-[]𝑛\displaystyle-\left[1,1,\cdots,1\right]\cdot\left[D^{-1}-\frac{D^{-1}\bm{y}\bm% {z}^{\top}D^{-1}}{1+\bm{z}^{\top}D^{-1}\bm{y}}\right]\cdot\left[\frac{P_{i}}{x% _{i}}\cdot(1-2P_{i})\cdot\left(w_{i}-\sum_{k=1}^{n}P_{k}w_{k}\right)\right]_{i% \in[n]}- [ 1 , 1 , ⋯ , 1 ] ⋅ [ italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT - divide start_ARG italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT bold_italic_y bold_italic_z start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_ARG start_ARG 1 + bold_italic_z start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT bold_italic_y end_ARG ] ⋅ [ divide start_ARG italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ⋅ ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ⋅ ( italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ] start_POSTSUBSCRIPT italic_i ∈ [ italic_n ] end_POSTSUBSCRIPT
=\displaystyle== i=1nδixi(12Pi)(wiT)Pi+11+i=1nδi(12Pi)i=1nδixi(12Pi)Pii=1nδi(12Pi)(wiT).superscriptsubscript𝑖1𝑛subscript𝛿𝑖subscript𝑥𝑖12subscript𝑃𝑖subscript𝑤𝑖𝑇subscript𝑃𝑖11superscriptsubscript𝑖1𝑛subscript𝛿𝑖12subscript𝑃𝑖superscriptsubscript𝑖1𝑛subscript𝛿𝑖subscript𝑥𝑖12subscript𝑃𝑖subscript𝑃𝑖superscriptsubscript𝑖1𝑛subscript𝛿𝑖12subscript𝑃𝑖subscript𝑤𝑖𝑇\displaystyle-\sum_{i=1}^{n}\frac{\delta_{i}x_{i}(1-2P_{i})(w_{i}-T)}{P_{i}}+% \frac{1}{1+\sum_{i=1}^{n}\delta_{i}(1-2P_{i})}\sum_{i=1}^{n}\frac{\delta_{i}x_% {i}(1-2P_{i})}{P_{i}}\sum_{i=1}^{n}\delta_{i}(1-2P_{i})(w_{i}-T).- ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ( italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_T ) end_ARG start_ARG italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG + divide start_ARG 1 end_ARG start_ARG 1 + ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG start_ARG italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ( italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_T ) . (33)

To show the RHS of Eq. (33) is positive, it suffices to show

(1+i=1nδi(12Pi))i=1nδixi(12Pi)(wiT)Pi<i=1nδixi(12Pi)Pii=1nδi(12Pi)(wiT).1superscriptsubscript𝑖1𝑛subscript𝛿𝑖12subscript𝑃𝑖superscriptsubscript𝑖1𝑛subscript𝛿𝑖subscript𝑥𝑖12subscript𝑃𝑖subscript𝑤𝑖𝑇subscript𝑃𝑖superscriptsubscript𝑖1𝑛subscript𝛿𝑖subscript𝑥𝑖12subscript𝑃𝑖subscript𝑃𝑖superscriptsubscript𝑖1𝑛subscript𝛿𝑖12subscript𝑃𝑖subscript𝑤𝑖𝑇\left(1+\sum_{i=1}^{n}\delta_{i}(1-2P_{i})\right)\sum_{i=1}^{n}\frac{\delta_{i% }x_{i}(1-2P_{i})(w_{i}-T)}{P_{i}}<\sum_{i=1}^{n}\frac{\delta_{i}x_{i}(1-2P_{i}% )}{P_{i}}\sum_{i=1}^{n}\delta_{i}(1-2P_{i})(w_{i}-T).( 1 + ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ( italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_T ) end_ARG start_ARG italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG < ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG start_ARG italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ( italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_T ) . (34)

Plugin T=i=1nPiwi𝑇superscriptsubscript𝑖1𝑛subscript𝑃𝑖subscript𝑤𝑖T=\sum_{i=1}^{n}P_{i}w_{i}italic_T = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT into Eq. (34), it is equivalent to show that

(1+i=1nδi(12Pi))i=1nδixi(12Pi)wiPii=1nPiwii=1nδixi(12Pi)Pi1superscriptsubscript𝑖1𝑛subscript𝛿𝑖12subscript𝑃𝑖superscriptsubscript𝑖1𝑛subscript𝛿𝑖subscript𝑥𝑖12subscript𝑃𝑖subscript𝑤𝑖subscript𝑃𝑖superscriptsubscript𝑖1𝑛subscript𝑃𝑖subscript𝑤𝑖superscriptsubscript𝑖1𝑛subscript𝛿𝑖subscript𝑥𝑖12subscript𝑃𝑖subscript𝑃𝑖\displaystyle\left(1+\sum_{i=1}^{n}\delta_{i}(1-2P_{i})\right)\sum_{i=1}^{n}% \frac{\delta_{i}x_{i}(1-2P_{i})w_{i}}{P_{i}}-\sum_{i=1}^{n}P_{i}w_{i}\sum_{i=1% }^{n}\frac{\delta_{i}x_{i}(1-2P_{i})}{P_{i}}( 1 + ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG - ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG start_ARG italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG
<\displaystyle<< i=1nδixi(12Pi)Pii=1nδi(12Pi)wi,superscriptsubscript𝑖1𝑛subscript𝛿𝑖subscript𝑥𝑖12subscript𝑃𝑖subscript𝑃𝑖superscriptsubscript𝑖1𝑛subscript𝛿𝑖12subscript𝑃𝑖subscript𝑤𝑖\displaystyle\sum_{i=1}^{n}\frac{\delta_{i}x_{i}(1-2P_{i})}{P_{i}}\sum_{i=1}^{% n}\delta_{i}(1-2P_{i})w_{i},∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG start_ARG italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ,

and note that 1=i=1nPi1superscriptsubscript𝑖1𝑛subscript𝑃𝑖1=\sum_{i=1}^{n}P_{i}1 = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, it is equivalent to show

i=1n[δi(12Pi)+Pi]i=1nδixi(12Pi)wiPi<i=1nδixi(12Pi)Pii=1n[δi(12Pi)+Pi]wi.superscriptsubscript𝑖1𝑛delimited-[]subscript𝛿𝑖12subscript𝑃𝑖subscript𝑃𝑖superscriptsubscript𝑖1𝑛subscript𝛿𝑖subscript𝑥𝑖12subscript𝑃𝑖subscript𝑤𝑖subscript𝑃𝑖superscriptsubscript𝑖1𝑛subscript𝛿𝑖subscript𝑥𝑖12subscript𝑃𝑖subscript𝑃𝑖superscriptsubscript𝑖1𝑛delimited-[]subscript𝛿𝑖12subscript𝑃𝑖subscript𝑃𝑖subscript𝑤𝑖\sum_{i=1}^{n}[\delta_{i}(1-2P_{i})+P_{i}]\sum_{i=1}^{n}\frac{\delta_{i}x_{i}(% 1-2P_{i})w_{i}}{P_{i}}<\sum_{i=1}^{n}\frac{\delta_{i}x_{i}(1-2P_{i})}{P_{i}}% \sum_{i=1}^{n}[\delta_{i}(1-2P_{i})+P_{i}]w_{i}.∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT [ italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) + italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG < ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG start_ARG italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT [ italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) + italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT . (35)

Let ai=δixi(12Pi)Pi,bi=δi(12Pi)+Piformulae-sequencesubscript𝑎𝑖subscript𝛿𝑖subscript𝑥𝑖12subscript𝑃𝑖subscript𝑃𝑖subscript𝑏𝑖subscript𝛿𝑖12subscript𝑃𝑖subscript𝑃𝑖a_{i}=\frac{\delta_{i}x_{i}(1-2P_{i})}{P_{i}},b_{i}=\delta_{i}(1-2P_{i})+P_{i}italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = divide start_ARG italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG start_ARG italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG , italic_b start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) + italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, then Eq. (35) is equivalent to

i=1nbii=1naiwi<i=1naii=1nbiwii>j(wiwj)(biaibjaj)>0.superscriptsubscript𝑖1𝑛subscript𝑏𝑖superscriptsubscript𝑖1𝑛subscript𝑎𝑖subscript𝑤𝑖superscriptsubscript𝑖1𝑛subscript𝑎𝑖superscriptsubscript𝑖1𝑛subscript𝑏𝑖subscript𝑤𝑖subscript𝑖𝑗subscript𝑤𝑖subscript𝑤𝑗subscript𝑏𝑖subscript𝑎𝑖subscript𝑏𝑗subscript𝑎𝑗0\sum_{i=1}^{n}b_{i}\cdot\sum_{i=1}^{n}a_{i}w_{i}<\sum_{i=1}^{n}a_{i}\cdot\sum_% {i=1}^{n}b_{i}w_{i}\Longleftrightarrow\sum_{i>j}(w_{i}-w_{j})\left(\frac{b_{i}% }{a_{i}}-\frac{b_{j}}{a_{j}}\right)>0.∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_b start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⋅ ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT < ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⋅ ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_b start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⟺ ∑ start_POSTSUBSCRIPT italic_i > italic_j end_POSTSUBSCRIPT ( italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) ( divide start_ARG italic_b start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG - divide start_ARG italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG ) > 0 .

Next, without loss of generality we show that for any 1i<jn1𝑖𝑗𝑛1\leq i<j\leq n1 ≤ italic_i < italic_j ≤ italic_n, if wi>wjsubscript𝑤𝑖subscript𝑤𝑗w_{i}>w_{j}italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT then it also holds that biai>bjajsubscript𝑏𝑖subscript𝑎𝑖subscript𝑏𝑗subscript𝑎𝑗\frac{b_{i}}{a_{i}}>\frac{b_{j}}{a_{j}}divide start_ARG italic_b start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG > divide start_ARG italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG for sufficiently large β𝛽\betaitalic_β. In fact, from Pi=xieβwik=1nxkeβwksubscript𝑃𝑖subscript𝑥𝑖superscript𝑒𝛽subscript𝑤𝑖superscriptsubscript𝑘1𝑛subscript𝑥𝑘superscript𝑒𝛽subscript𝑤𝑘P_{i}=\frac{x_{i}e^{\beta w_{i}}}{\sum_{k=1}^{n}x_{k}e^{\beta w_{k}}}italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = divide start_ARG italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_β italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_β italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG and Eq. (23) we obtain

biaiajbj=subscript𝑏𝑖subscript𝑎𝑖subscript𝑎𝑗subscript𝑏𝑗absent\displaystyle\frac{b_{i}}{a_{i}}\cdot\frac{a_{j}}{b_{j}}=divide start_ARG italic_b start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ⋅ divide start_ARG italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG = δj(δi(12Pi)+Pi)(12Pj)δi(δj(12Pj)+Pj)(12Pi)eβ(wiwj)subscript𝛿𝑗subscript𝛿𝑖12subscript𝑃𝑖subscript𝑃𝑖12subscript𝑃𝑗subscript𝛿𝑖subscript𝛿𝑗12subscript𝑃𝑗subscript𝑃𝑗12subscript𝑃𝑖superscript𝑒𝛽subscript𝑤𝑖subscript𝑤𝑗\displaystyle\frac{\delta_{j}(\delta_{i}(1-2P_{i})+P_{i})(1-2P_{j})}{\delta_{i% }(\delta_{j}(1-2P_{j})+P_{j})(1-2P_{i})}\cdot e^{\beta(w_{i}-w_{j})}divide start_ARG italic_δ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) + italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_ARG start_ARG italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_δ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) + italic_P start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG ⋅ italic_e start_POSTSUPERSCRIPT italic_β ( italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT
=\displaystyle== 1+Piδi/(12Pi)1+Pjδj/(12Pj)eβ(wiwj)1subscript𝑃𝑖subscript𝛿𝑖12subscript𝑃𝑖1subscript𝑃𝑗subscript𝛿𝑗12subscript𝑃𝑗superscript𝑒𝛽subscript𝑤𝑖subscript𝑤𝑗\displaystyle\frac{1+P_{i}\delta_{i}/(1-2P_{i})}{1+P_{j}\delta_{j}/(1-2P_{j})}% \cdot e^{\beta(w_{i}-w_{j})}divide start_ARG 1 + italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT / ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG start_ARG 1 + italic_P start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_δ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT / ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_ARG ⋅ italic_e start_POSTSUPERSCRIPT italic_β ( italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT
=\displaystyle== 1+Pi3(12Pi)(Pi2+xi2ci′′(xi))1+Pj3(12Pj)(Pj2+xj2cj′′(xj))eβ(wiwj).1superscriptsubscript𝑃𝑖312subscript𝑃𝑖superscriptsubscript𝑃𝑖2superscriptsubscript𝑥𝑖2superscriptsubscript𝑐𝑖′′subscript𝑥𝑖1superscriptsubscript𝑃𝑗312subscript𝑃𝑗superscriptsubscript𝑃𝑗2superscriptsubscript𝑥𝑗2superscriptsubscript𝑐𝑗′′subscript𝑥𝑗superscript𝑒𝛽subscript𝑤𝑖subscript𝑤𝑗\displaystyle\frac{1+\frac{P_{i}^{3}}{(1-2P_{i})(P_{i}^{2}+x_{i}^{2}c_{i}^{% \prime\prime}(x_{i}))}}{1+\frac{P_{j}^{3}}{(1-2P_{j})(P_{j}^{2}+x_{j}^{2}c_{j}% ^{\prime\prime}(x_{j}))}}\cdot e^{\beta(w_{i}-w_{j})}.divide start_ARG 1 + divide start_ARG italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG start_ARG ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ( italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) end_ARG end_ARG start_ARG 1 + divide start_ARG italic_P start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG start_ARG ( 1 - 2 italic_P start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) ( italic_P start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_x start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_c start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ( italic_x start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) ) end_ARG end_ARG ⋅ italic_e start_POSTSUPERSCRIPT italic_β ( italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT . (36)

On the one hand, because 12Pi,12Pj[13,1]12subscript𝑃𝑖12subscript𝑃𝑗1311-2P_{i},1-2P_{j}\in[\frac{1}{3},1]1 - 2 italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , 1 - 2 italic_P start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ [ divide start_ARG 1 end_ARG start_ARG 3 end_ARG , 1 ], xi,xj[0,L]subscript𝑥𝑖subscript𝑥𝑗0𝐿x_{i},x_{j}\in[0,L]italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ [ 0 , italic_L ], and ci′′(xi),cj′′(xj)>0superscriptsubscript𝑐𝑖′′subscript𝑥𝑖superscriptsubscript𝑐𝑗′′subscript𝑥𝑗0c_{i}^{\prime\prime}(x_{i}),c_{j}^{\prime\prime}(x_{j})>0italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) , italic_c start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ( italic_x start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) > 0, the first terms of the LHS of Eq. (36) is a positive number lower bounded away from zero. On the other hand, wi>wjsubscript𝑤𝑖subscript𝑤𝑗w_{i}>w_{j}italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ensures that the second term eβ(wiwj)superscript𝑒𝛽subscript𝑤𝑖subscript𝑤𝑗e^{\beta(w_{i}-w_{j})}italic_e start_POSTSUPERSCRIPT italic_β ( italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT can be arbitrarily large as long as β𝛽\betaitalic_β is sufficiently large. Therefore, there must exist a β0>0subscript𝛽00\beta_{0}>0italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT > 0 such that for any β>β0𝛽subscript𝛽0\beta>\beta_{0}italic_β > italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, biaiajbj>1subscript𝑏𝑖subscript𝑎𝑖subscript𝑎𝑗subscript𝑏𝑗1\frac{b_{i}}{a_{i}}\cdot\frac{a_{j}}{b_{j}}>1divide start_ARG italic_b start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ⋅ divide start_ARG italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG > 1 holds for any i>j𝑖𝑗i>jitalic_i > italic_j. As a result, (wiwj)(biaibjaj)>0subscript𝑤𝑖subscript𝑤𝑗subscript𝑏𝑖subscript𝑎𝑖subscript𝑏𝑗subscript𝑎𝑗0(w_{i}-w_{j})\left(\frac{b_{i}}{a_{i}}-\frac{b_{j}}{a_{j}}\right)>0( italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) ( divide start_ARG italic_b start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG - divide start_ARG italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG ) > 0 holds, which completes the proof.

 

B Additional Experiments

We use the following Multi-agent Mirror Descent (MMD) algorithm as the PNE solver of C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT, whose convergence is guaranteed by Bravo et al. (2018). Since each creator’s strategy set 𝒳i=[0,+)subscript𝒳𝑖0\mathcal{X}_{i}=[0,+\infty)caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = [ 0 , + ∞ ), we can simply choose a projection mapping Proj𝒳i(𝒙)=(max(xi,0))i=1nsubscriptProjsubscript𝒳𝑖𝒙superscriptsubscript𝑚𝑎𝑥subscript𝑥𝑖0𝑖1𝑛\text{Proj}_{\mathcal{X}_{i}}(\bm{x})=(\mathop{max}(x_{i},0))_{i=1}^{n}Proj start_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_x ) = ( start_BIGOP italic_m italic_a italic_x end_BIGOP ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , 0 ) ) start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT. The gradients of utility functions can be implemented directly since they have closed forms. Through our experiment, the default T=10000,η=0.1,ϵ=1e2,xi(0)=1.0formulae-sequence𝑇10000formulae-sequence𝜂0.1formulae-sequenceitalic-ϵ1𝑒2superscriptsubscript𝑥𝑖01.0T=10000,\eta=0.1,\epsilon=1e-2,x_{i}^{(0)}=1.0italic_T = 10000 , italic_η = 0.1 , italic_ϵ = 1 italic_e - 2 , italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT = 1.0. Algorithm 2 is a simplified version of Algorithm 1 in Bravo et al. (2018) where we replace the gradient estimation to the exact gradient. According to Theorem 5.1 in Bravo et al. (2018), Algorithm 2 converges to the unique PNE of any C4superscript𝐶4C^{4}italic_C start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT game with probability 1.

Algorithm 2 Multi-agent Mirror Descent (MMD) with perfect gradient
  Input: Maximum iteration number T𝑇Titalic_T, step size η𝜂\etaitalic_η, each player i𝑖iitalic_i’s utility function uisubscript𝑢𝑖u_{i}italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, error tolerance ϵitalic-ϵ\epsilonitalic_ϵ, initial strategy 𝒙i=𝒙i(0)subscript𝒙𝑖subscriptsuperscript𝒙0𝑖\bm{x}_{i}=\bm{x}^{(0)}_{i}bold_italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = bold_italic_x start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT.
  repeat
     Compute the exact gradient 𝒈i=iui(𝒙i,𝒙i),i[n]formulae-sequencesubscript𝒈𝑖subscript𝑖subscript𝑢𝑖subscript𝒙𝑖subscript𝒙𝑖for-all𝑖delimited-[]𝑛\bm{g}_{i}=\nabla_{i}u_{i}(\bm{x}_{i},\bm{x}_{-i}),\forall i\in[n]bold_italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = ∇ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( bold_italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , bold_italic_x start_POSTSUBSCRIPT - italic_i end_POSTSUBSCRIPT ) , ∀ italic_i ∈ [ italic_n ],
     Update 𝒙iProj𝒳i(𝒙i+η𝒈i),i[n]formulae-sequencesubscript𝒙𝑖subscriptProjsubscript𝒳𝑖subscript𝒙𝑖𝜂subscript𝒈𝑖for-all𝑖delimited-[]𝑛\bm{x}_{i}\leftarrow\text{Proj}_{\mathcal{X}_{i}}(\bm{x}_{i}+\eta\bm{g}_{i}),% \forall i\in[n]bold_italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ← Proj start_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + italic_η bold_italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) , ∀ italic_i ∈ [ italic_n ].
  until Maximum iteration number is reached or (𝒈1,,𝒈n)2<ϵ.subscriptnormsubscript𝒈1subscript𝒈𝑛2italic-ϵ\|(\bm{g}_{1},\cdots,\bm{g}_{n})\|_{2}<\epsilon.∥ ( bold_italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , bold_italic_g start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT < italic_ϵ .
  Output: (x1,,xn)subscript𝑥1subscript𝑥𝑛(\bm{x}_{1},\cdots,\bm{x}_{n})( bold_italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , bold_italic_x start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ).

Figure 3 illustrates the trade-off between U𝑈Uitalic_U and V𝑉Vitalic_V in the MovieLens environment, and Figure 4 plots the same information as shown in Figure 2 but with a different value of λ=0.1𝜆0.1\lambda=0.1italic_λ = 0.1. As we can see, the optimal PPM(𝜷𝜷\bm{\beta}bold_italic_β) found by Algorithm 1 conveys a consistent message: it prioritizes the recommendation accuracy for users with more determined preferences while increasing the exploration strength for less selective users.

Refer to caption
Refer to caption
Refer to caption
Figure 3: The left and the middle panel: the empirical distributions of content creation frequency xisuperscriptsubscript𝑥𝑖x_{i}^{*}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT and each user’s individual utility πjsuperscriptsubscript𝜋𝑗\pi_{j}^{*}italic_π start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT. Different colors represent results for PNEs induced by different β𝛽\betaitalic_β. Right: the total content creation V𝑉Vitalic_V and total user satisfaction U𝑈Uitalic_U obtained under different β𝛽\betaitalic_β. Error bars obtained from 10 independently generated environments.
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 4: Panel 1,2: social welfare improving curve under Algorithm 1, and the distribution of the obtained optimal βjsubscript𝛽𝑗\beta_{j}italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT in the synthetic environment. Panel 3,4: the same plots in the MovieLens environment. λ=0.1𝜆0.1\lambda=0.1italic_λ = 0.1.