Article
Open access
Published: 24 June 2024

Evolutionary dynamics of any multiplayer game on regular graphs

Nature Communications volume 15, Article number: 5349 (2024) Cite this article

584 Accesses
4 Altmetric
Metrics details

Subjects

Abstract

Multiplayer games on graphs are at the heart of theoretical descriptions of key evolutionary processes that govern vital social and natural systems. However, a comprehensive theoretical framework for solving multiplayer games with an arbitrary number of strategies on graphs is still missing. Here, we solve this by drawing an analogy with the Balls-and-Boxes problem, based on which we show that the local configuration of multiplayer games on graphs is equivalent to distributing k identical co-players among n distinct strategies. We use this to derive the replicator equation for any n-strategy multiplayer game under weak selection, which can be solved in polynomial time. As an example, we revisit the second-order free-riding problem, where costly punishment cannot truly resolve social dilemmas in a well-mixed population. Yet, in structured populations, we derive an accurate threshold for the punishment strength, beyond which punishment can either lead to the extinction of defection or transform the system into a rock-paper-scissors-like cycle. The analytical solution also qualitatively agrees with the phase diagrams that were previously obtained for non-marginal selection strengths. Our framework thus allows an exploration of any multi-strategy multiplayer game on regular graphs.

Reconstructing higher-order interactions in coupled dynamical systems

Article Open access 18 June 2024

A benchmarking study of quantum algorithms for combinatorial optimization

Article Open access 22 June 2024

Assembly theory explains and quantifies selection and evolution

Article Open access 04 October 2023

Introduction

Multi-strategy evolutionary dynamics in nature often lead to diverse and complex phenomena, such as cyclic dominance that is captured by the well-known rock-paper-scissors game¹. Experimental evidence from diverse contexts, ranging from the three-morph mating system of the side-blotched lizard² and Escherichia coli populations³, to human economic behaviors⁴, demonstrates the occurrence of the rock-paper-scissors cycle in various real-world scenarios. Theoretical models of the rock-paper-scissors cycle have been explored in both two-player⁵ and multiplayer game frameworks⁶, contributing to an understanding of its underlying properties—as a consequence of strategy diversity, the intransitive interaction may emerge spontaneously. This phenomenon can be illustrated when we extend the basic two-strategy model of the evolution of cooperation by adding additional strategies that punish defectors^7,8 or reward cooperators^9,10. The additional strategies are necessary when considering more realistic models, which underlines the importance of a multi-strategy approach.

Previous research in evolutionary dynamics primarily focused on two-strategy systems, where the unconditional cooperator and defector strategies represent the fundamental conflict of individual and collective interests¹¹. While cooperation can maximize mutual benefits, defection, despite offering higher personal payoff, reduces overall benefits to others. Consequently, defection often appears as the dominant strategy. An escape route from this dilemma could be a spatially structured population^12,13, where individuals interact with fixed neighbors but still adopt the strategies of those with higher payoffs. This setting allows cooperation to form clusters, utilizing the advantage of collective payoffs thus resisting the invasion of defection, a concept known as spatial reciprocity¹⁴. It is recognized that no simple closed-form solution exists for general evolutionary dynamics in structured populations, unless by chance P = NP, Polynomial time equals to Nondeterministic Polynomial time¹⁵. However, in the weak selection limit, where the influence of the game on strategy updates is marginal, analytical solutions have been obtained from infinite¹⁶ to finite populations¹⁷, and from regular^18,19 to arbitrary graphs^20,21,22,23. This line of research has led to the development of evolutionary graph theory¹⁴.

In evolutionary graph theory, a widely used mathematical technique is the pair approximation^{24,25,26,27,28}. This method applies to infinite populations on regular graphs and has revealed the well-known ‘b/c > k’ rule, which states that evolution favors cooperation when the benefit-to-cost ratio exceeds the number of neighbors²⁹. Pair approximation is also capable of analyzing more complex models, including unequal interaction and dispersal graphs³⁰, asymmetric networks³¹, and stochastic games³², predicting simulation outcomes with high accuracy. Notably, pair approximation has been applied to multi-strategy two-player games³³, leading to the replicator equations for arbitrary n-strategy two-player games on a regular graph, as an important extension of the traditional replicator equations used in well-mixed populations³⁴.

Unlike two-player games, multiplayer games exhibit much greater complexity, primarily due to their potentially nonlinear payoff functions³⁵. In a structured population, multiplayer games require each individual to organize a game within their neighbors and themselves, which implies that individuals participate in games organized by both themselves and their neighbors, thereby interacting with second-order neighbors. Such interactions lead to higher-order interactions^36,37, which cannot be simply reduced to a superposition of pairwise interactions. The complexity of multiplayer games can also be illustrated from the perspective of structure coefficients on graphs: a two-strategy two-player game needs only one structure coefficient³⁸, a multi-strategy two-player game requires three^39,40, but a two-strategy (k + 1)-player game needs as many as k structure coefficients^41,42. The number of potential equilibrium points in general multiplayer games also indicates their complexity^43,44. Even so, in the absence of triangle motifs, two-strategy multiplayer games can still be theoretically analyzed using pair approximation^45,46, whose results are consistent with predictions obtained by other more precise methods^47,48.

With two-strategy two-player²⁹, multi-strategy two-player³³, and two-strategy multiplayer games⁴⁶ all thoroughly studied, the analytical solution for multi-strategy multiplayer games on graphs remains unexplored. The range of potential models for multiplayer games with more than two strategies is vast, drawing from co-evolutionary strategies such as punishment^8,49,50,51, reward^9,52,53, and the loner strategy^4,54. Multi-strategy systems in multiplayer games have unique characteristics that multistrategy two-player games do not capture: the payoff function can be nonlinear. For example, in pool punishment, the payoff structure depends solely on whether there is at least one punishing player among the k + 1 players. This uniqueness reinforces the significance of studying multistrategy multiplayer games.

However, previous research on these games on graphs has largely been limited to numerical simulations, which do not allow for the exploration of the complete parameter space. In the absence of mathematical tools for evolutionary graph theory in multi-strategy multiplayer games, recent studies have attempted to bypass this challenge by incorporating the third strategy within the existing two strategies. For instance, punishing or rewarding behaviors have been added to the existing cooperation strategy in the traditional two-strategy system^55,56. This approach allows for the examination of additional mechanisms like punishment and reward within the two-strategy system’s framework. Yet, these alternative attempts still could not capture further rich dynamics, such as cyclic dominance, which is only possible in systems with at least three strategies.

In this work, we provide an analytical framework that addresses the gaps in multi-strategy multiplayer games in the realm of evolutionary graph theory. Inspired by the Balls-and-Boxes problem, we demonstrate that for a given multi-strategy multiplayer game, counting the co-player configurations of a focal individual is equivalent to distributing k identical co-players into n distinct strategies (Fig. 1). On this basis, we develop a bottom-up approach for calculating the group-based payoff of individuals on regular graphs, deriving replicator equations on regular graphs in the weak selection limit and the absence of triangle motifs. Our results include two commonly used update rules, namely pairwise comparison (PC)⁵⁷ and death-birth (DB)²⁹, applicable to arbitrary multiplayer games with any multi-strategy space in structured populations, where each individual has the same number of neighbors. Using the punishment mechanism^7,58,59 in the context of the tragedy of the commons⁶⁰ as an example, we explore the well-known second-order free-riding problem analytically, obtaining an accurate threshold of punishment strength necessary to resolve the social dilemma in structured populations. Additionally, our theoretical solutions can qualitatively reproduce the phase diagrams observed in previous numerical simulation studies under non-marginal selection strength.

**Fig. 1: Generalized payoff matrix for multi-strategy multiplayer games, reducible to two-strategy or two-player formats.**

Results

Model overview

We consider an infinite population on a regular graph, where each individual has k neighbors. An individual can adopt one of n strategies, labeled by the numbers 1, 2, …, n. On a regular graph, the number of co-players in every multiplayer game is equivalent to the constant number k of neighbors. For a given individual, suppose that there are k₁ co-players employing strategy 1, k₂ co-players employing strategy 2, and so on, up to k_n co-players employing strategy n. In this context, the co-player configuration of an individual can be represented by k = (k₁, k₂, …, k_n), which satisfies the condition $\sum_{l=1}^{n}{k}_{l}=k$. As illustrated in Fig. 2a, counting the number of possible configurations of k is analogous to the classic Balls-and-Boxes problem, distributing k identical balls (i.e., co-players) into n distinct boxes (i.e., strategies), allowing for the possibility of empty boxes (e.g., k₁ = 0). Hence, there are ${\mathbb{C}}(k+n-1,\, k)=(k+n-1)!/[(n-1)!k!]$ possible configurations of co-player strategy configurations k.

**Fig. 2: The Balls-and-Boxes problem, payoff calculation, and strategy updates.**

Interaction occurs between an individual and its k co-players. In a multiplayer game involving the focal individual and k co-players, the payoff is uniquely determined by the strategy of the focal individual and the strategy configuration of the co-players. For a focal individual employing strategy i with the co-player configuration k, its payoff is denoted by a_i∣k. It can be observed that the ‘generalized payoff matrix’ comprises $n\times {\mathbb{C}}(k+n-1,\, k)$ elements represented by a_i∣k through all possible focal strategies i = 1, 2, …, n and co-player strategy configurations k. For two-strategy two-player games (n = 2, k = 1), the number of elements in the payoff matrix reduces to $2\times {\mathbb{C}}(2,1)=4$; for multi-strategy two-player games (k = 1), it reduces to $n\times {\mathbb{C}}(n,1)={n}^{2}$; for two-strategy multiplayer games (n = 2), it reduces to $2\times {\mathbb{C}}(k+1,\, k)=2(k+1)$ (Fig. 1).

The accumulated payoff of a focal individual is collected from the 1 + k games organized by itself and its neighbors, as depicted in Fig. 2b. Upon obtaining the accumulated payoffs π, we convert them into fitness, denoted as $F=\exp (\delta \pi )$^21,48,61,62. Strategies that yield higher fitness are more likely to reproduce. Here, δ → 0⁺ represents a weak selection limit. The rationale behind weak selection is that, in reality, many factors other than the investigated game influence the probability of reproduction²⁹.

There are various commonly used strategy update rules. For simplicity, we focus on the pairwise comparison (PC) rule⁵⁷ in the main text (another well-known rule, the death-birth, is discussed in Supplementary Information). During each elementary step, an individual A and one of its neighbors B are randomly selected from the population. Their payoffs are computed as π_A and π_B and then transformed into fitness values F_A and F_B. Individual A adopts the strategy of individual B with a probability proportional to their fitness in the pair,

$$W=\frac{{F}_{B}}{{F}_{A}+{F}_{B}}=\frac{1}{1+\exp [-\delta ({\pi }_{B}-{\pi }_{A})]}.$$

(1)

Or, individual A keeps its own strategy with the remaining probability F_A/(F_A + F_B). Eq. (1) indicates that individual A has a marginal tendency to either maintain its own strategy or adopt the one of individual B, depending on who has higher fitness. The evolution of strategies under the PC update process is illustrated in Fig. 2c.

Group-based payoff with any number of strategies

To formally analyze the evolutionary dynamics, we construct the system as described in Supplementary Note 1.1. According to pair approximation²⁹, there are two key concepts, the frequency of i-players (i.e., individuals employing strategy i), denoted as x_i, where ${\sum }_{i=1}^{n}{x}_{i}=1$, and the probability of an i-player being adjacent to a j-player, denoted by q_i∣ j, with ${\sum }_{j=1}^{n}{q}_{j| i}=1$. By separating different time scales, we find that q_i∣ j = x_i(k − 2)/(k − 1) + θ_ij/(k − 1), where θ_ij = 1 if i = j and θ_ij = 0 otherwise (Supplementary Note 5.1). In other words, the value of q_i∣ j can be determined by the value of x_i.

To express necessary computations, we introduce a variation of k, denoted as k_+l = (k₁, k₂, …, k_l + 1, …, k_n), where ${\sum }_{l=1}^{n}{k}_{l}=k-1$. This represents a co-player configuration in which there is at least one l-player. Among the remaining k − 1 co-players, the numbers of players adopting strategies 1, 2, …, n are k₁, k₂, …, k_n, respectively.

We label the payoff that an individual obtains in a multiplayer game as the single-game payoff. ${\langle {a}_{X| {{{{{{{\bf{k}}}}}}}}}\rangle }_{Y}$ is used to denote the expected single-game payoff for an X-player over the possible co-player configurations k, where the k members in k are neighbors of a Y-player, as defined by Eq. (12) in the Methods. Similarly, the notation ${\langle {a}_{X| {{{{{{{{\bf{k}}}}}}}}}_{+l}}\rangle }_{Y}$ differs in that it is over k − 1 unknown members in the possible co-player configurations k_+l, with one known l-player.

Furthermore, we use the notation $\langle {\pi }_{X}^{{{{{{{{\bf{k}}}}}}}}}\rangle$ to represent the expected accumulated payoff of an X-player obtained in the 1 + k games organized by the player and its neighbors, across all possible neighbor configurations k of the X-player, defined by Eq. (10) in the Methods. Similarly, $\langle {\pi }_{X}^{{{{{{{{{\bf{k}}}}}}}}}_{+i}}\rangle$ denotes the expected accumulated payoff over the configurations where the remaining k − 1 neighbors are unknown besides a known i-player, as defined by Eq. (11) in the Methods.

Through bottom-up calculations from the microscopic level (Methods), we establish the following relationship between the expected accumulated and single-game payoffs. For i-players, the relation is given by

$$\left\langle {\pi }_{i}^{{{{{{{{\bf{k}}}}}}}}}\right\rangle={\langle {a}_{i| {{{{{{{\bf{k}}}}}}}}}\rangle }_{i}+k{\sum }_{l=1}^{n}{q}_{l| i}{\langle {a}_{i| {{{{{{{{\bf{k}}}}}}}}}_{+l}^{{\prime} }}\rangle }_{l}.$$

(2)

Intuitively, the expected accumulated payoff of i-players, $\langle {\pi }_{i}^{{{{{{{{\bf{k}}}}}}}}}\rangle$, is composed by the expected single-game payoff from the game they organize, ${\langle {a}_{i| {{{{{{{\bf{k}}}}}}}}}\rangle }_{i}$, and the games organized by their k neighbors, $k\sum_{l=1}^{n}{q}_{l| i}{\langle {a}_{i| {{{{{{{{\bf{k}}}}}}}}}_{+l}^{{\prime} }}\rangle }_{l}$. Here, the different notation ${{{{{{{{\bf{k}}}}}}}}}^{{\prime} }=({k}_{1}^{{\prime} },\, {k}_{2}^{{\prime} },\ldots,\, {k}_{n}^{{\prime} })$ from k is an independent configuration to clarify the priority in the summation.

A further concept is the expected accumulated payoff of a j-player who has at least one i-player as a neighbor, which is related to the expected single-game payoff as follows:

$$\left\langle {\pi }_{j}^{{{{{{{{{\bf{k}}}}}}}}}_{+i}}\right\rangle={\langle {a}_{j| {{{{{{{{\bf{k}}}}}}}}}_{+i}}\rangle }_{j}+{\langle {a}_{j| {{{{{{{{\bf{k}}}}}}}}}_{+i}^{{\prime} }}\rangle }_{i}+(k-1){\sum }_{l=1}^{n}{q}_{l| \, j}{\langle {a}_{j| {{{{{{{{\bf{k}}}}}}}}}_{+l}^{{\prime} }}\rangle }_{l}.$$

(3)

Here, ${\langle {a}_{j| {{{{{{{{\bf{k}}}}}}}}}_{+i}}\rangle }_{j}$, ${\langle {a}_{j| {{{{{{{{\bf{k}}}}}}}}}_{+i}^{{\prime} }}\rangle }_{i}$, and $(k-1)\sum_{l=1}^{n}{q}_{l| \, j}{\langle {a}_{j| {{{{{{{{\bf{k}}}}}}}}}_{+l}^{{\prime} }}\rangle }_{l}$ are the expected single-game payoff from the game organized by the j-player itself, the game organized by the fixed i-player neighbor, and the games organized by the remaining k − 1 neighbors of the j-player.

General replicator equations

The evolution of frequencies x₁, x₂, …, x_n can be deduced through the microscopic strategy update process. Specifically, in an infinite population, i.e., N → ∞, a single unit of time comprises N elementary steps, ensuring that each individual has an opportunity to update their strategy. During each elementary step, the frequency of i-players increases by 1/N when a focal j-player (where j ≠ i) is chosen to update its strategy and is replaced by an i-player. Similarly, the frequency of i-players decreases by 1/N when a focal i-player is selected to update its strategy and the player who takes the position is not an i-player. Based on this perception, we derive a simple form of the replicator equations for i = 1, 2, …, n in the weak selection limit (Supplementary Note 2):

$${\dot{x}}_{i}=\frac{\delta }{2}{x}_{i}\left(\left\langle {\pi }_{i}^{{{{{{{{\bf{k}}}}}}}}}\right\rangle -\mathop{\sum }_{j=1}^{n}{q}_{j| i}\left\langle {\pi }_{j}^{{{{{{{{{\bf{k}}}}}}}}}_{+i}}\right\rangle \right).$$

(4)

We find that Eq. (4) offers an intuitive understanding, if we introduce the following two concepts: (1) ${\pi }_{i}^{(0)}=\langle {\pi }_{i}^{{{{{{{{\bf{k}}}}}}}}}\rangle$, the expected accumulated payoff of the i-player (zero steps away on the graph), and (2) ${\pi }_{i}^{(1)}=\sum_{j=1}^{n}{q}_{j| i}\langle {\pi }_{j}^{{{{{{{{{\bf{k}}}}}}}}}_{+i}}\rangle$, the expected accumulated payoff of the i-player’s neighbors (one step away on the graph). These concepts suggest that ${\dot{x}}_{i}\propto {x}_{i}({\pi }_{i}^{(0)}-{\pi }_{i}^{(1)})$. Under pairwise comparison, the reproduction rate of i-players is dependent on how their accumulated payoff exceeds that of their neighbors. In essence, the evolution of x_i is the competition between an individual and its first-order neighbors, which aligns with the results obtained by a different theoretical framework in two-strategy systems^18,20,32. We further extend it to n-strategy systems in the framework of pair approximation. We also verify that the death-birth rule is essentially the competition between an individual and its second-order neighbors for n-strategy systems (Supplementary Information).

Applying Eqs. (2) and (3) to Eq. (4), we can transform the expected accumulated payoff in the replicator equations into the expected single-game payoff, as shown in Eq. (13) in the Methods, which keeps the simplest irreducible computational complexity given the payoff structure a_i∣k. In particular, we only need to calculate two types of quantities, ${\langle {a}_{i| {{{{{{{{\bf{k}}}}}}}}}_{+j}}\rangle }_{i}$ and ${\langle {a}_{i| {{{{{{{{\bf{k}}}}}}}}}_{+j}}\rangle }_{j}$ for i, j = 1, 2, …, n, based on the given payoff structure a_i∣k. The diagonal elements of these quantities coincide, as demonstrated in Eqs. (14) and (15) in the Methods. Therefore, for any given payoff structure a_i∣k, there are at most (2n − 1)n distinct quantities to calculate manually when determining the replicator equations. The computational complexity is thus O(n²), square of the number of strategies, which can be solved within polynomial time. We also find that the computational complexity under the death-birth rule is O(n³), cubic of the number of strategies, which can also be solved within polynomial time (Supplementary Information).

For specific payoff structures, the computational complexity can be further reduced. A common example is linear systems. In such systems, the payoff function includes at most linear terms in k₁, k₂, …, k_n. This allows us to express the general payoff function as ${a}_{i| {{{{{{{\bf{k}}}}}}}}}={\sum }_{j=1}^{n}{b}_{ij}{k}_{j}+{c}_{i}$, where b_ij represents the coefficient of the linear term and c_i is the constant term for i, j = 1, 2, …, n. Applying this special payoff structure to Eq. (13) in the Methods, we can obtain a simplified form of the replicator equation for linear systems,

$${\dot{x}}_{i}=\frac{\delta (k-2)}{2(k-1)}{x}_{i}\left((k+1)({\bar{\pi }}_{i}-\bar{\pi })+3\mathop{\sum }_{j=1}^{n}{x}_{j}({b}_{ii}-{b}_{ij}-{b}_{ji}-{b}_{jj})+6\mathop{\sum }_{j=1}^{n}\mathop{\sum }_{l=1}^{n}{x}_{j}{x}_{l}{b}_{jl}\right).$$

(5)

Here, ${\bar{\pi }}_{i}$ and $\bar{\pi }$ denote the mean payoff of i-players and all players in a well-mixed population, which can be directly calculated using the traditional replicator dynamics approach (Methods).

As a frequently studied example, the public goods game involves n = 2 strategies within a linear payoff structure. Strategy 1, cooperation (C), pays a cost c which is multiplied by a synergy factor r and distributed among all k + 1 players, while strategy 2, defection (D), pays nothing. The payoff structure can be expressed as b₁₁ = b₂₁ = rc/(k + 1), b₁₂ = b₂₂ = 0, c₁ = rc/(k + 1) − c, c₂ = 0. Consequently, ${\dot{x}}_{i}\propto {x}_{i}({\bar{\pi }}_{i}-\bar{\pi })$, indicating that evolution favors cooperation when r > k + 1 (Supplementary Note 3.1). Coincidentally, the public goods game exhibits an equivalence between well-mixed and structured populations under pairwise comparison⁶³, a phenomenon not necessarily observed under other update rules⁴⁵. This equivalence provides a unique opportunity: when introducing additional strategies into the public goods game, the distinct effects of these new strategies in structured populations can be isolated without interference from the existing two strategies. For a general condition when pairwise comparison equates well-mixed and structured populations, we refer to the Supplementary Note 3.1.3.

We apply the multi-strategy multiplayer framework to various additional mechanisms in public goods games, including punishment^8,51 (n = 3), reward⁶⁴ (n = 3), and multi-stage investment⁶⁵ (n = 4) (Supplementary Note 3). Here, we present the applications to two punishment types, peer and pool punishment, by which we revisit the well-known second-order free-rider problem in structured populations.

Peer punishment in public goods games

In public goods games with peer punishment^66,67, the payoff structure is linear (Supplementary Note 3.2), which allows us to utilize Eq. (5) directly.

There are n = 3 strategies: 1 = Cooperation (C), 2 = Defection (D), and 3 = Peer punishment (E). Besides the two strategies in the public goods game, the third strategy, peer punishment, pays a cost α for punishing a co-player who defects. A defector, when punished, incurs a fine β. Thus, given k₂ defective co-players, a punishing player has αk₂ paid, and given k₃ punishment co-players, a defector has βk₃ charged. Furthermore, it is assumed that punishing players also perform the cooperative behavior, investing c to the common pool. This makes the strategy C the second-order free-rider who exploits the effort in punishment of strategy E.

The first question is how the behaviors of peer punishment in structured populations, obtained by our framework (Supplementary Note 3.2), differ from the ones in a well-mixed population. We find that peer punishment introduces a bi-stable space of the system state, as seen in Fig. 3a, b. Even when r < k + 1, the system can either evolve to a final state where strategies E and C coexist, or to a state dominated by strategy D, depending on the initial conditions. As the punishing fine β increases, the basin of attraction for strategy D diminishes. In a well-mixed population, strategy D maintains a basin of attraction regardless of the punishment strength (Fig. 3c). This aligns with previous findings that peer punishment does not truly resolve social dilemmas in well-mixed populations⁵⁹. However, in structured populations, we observe that the basin of attraction for strategy D can be entirely eliminated if the punishing fine β exceeds a critical threshold, β > β^⋆, where

$${\beta }^{\star }=\frac{k+1}{3}\left(-\frac{rc}{k+1}+c+k\alpha \right)-\alpha.$$

(6)

Consequently, in such scenarios, the system consistently converges to a coexistence of strategies E and C (Fig. 3d). The numerical observation from previous research suggests that peer punishment can effectively resolve social dilemmas in structured populations. Our analysis adds an analytical perspective to this conclusion.

**Fig. 3: Peer punishment can resolve the social dilemma of public goods game in structured populations.**

The distinct roles of peer punishment in well-mixed and structured populations can be attributed to the fraction of defectors, ${x}_{D}^{(DE)}$, in an unstable edge equilibrium, ${{{{{{{{\bf{x}}}}}}}}}^{(DE)}=(0,\, {x}_{D}^{(DE)},\, 1-{x}_{D}^{(DE)})$, as presented in Fig. 3e. When ${x}_{D}^{(DE)} \, > \, 1$, this unstable equilibrium disappears, rendering the D-vertex equilibrium unstable. In a well-mixed population, ${x}_{D}^{(DE)} \, < \, 1$ and ${x}_{D}^{(DE)}\to 1$ as β → ∞, indicating that the described scenario is unattainable. However, in structured populations, ${x}_{D}^{(DE)} \, > \, 1$ becomes feasible once β > β^⋆. Additionally, peer punishment acts as a double-edged sword. When ${x}_{D}^{(DE)} \, < \, 0$, the system invariably converges to the full defection state, signifying ineffective punishment. As the punishing fine β increases, peer punishment first becomes effective in well-mixed populations when $\beta \, > \, {\beta }_{0}^{{{{{{{{\rm{WM}}}}}}}}}$. Structured populations, in contrast, require a higher β₀ value for punishment to be effective. In particular, peer punishment is less advantageous in structured populations than in well-mixed populations when β < β₌ (Fig. 3a, b). However, structured populations gain an advantage when β > β₌, and can eventually lead to the extinction of defection at sufficient high β > β^⋆ values. The comparison between well-mixed and structured populations in relation to the punishing fine is illustrated in Fig. 3f, and the expressions for key β values are listed in Fig. 4.

**Fig. 4: Analytical results of public goods game with peer punishment in both well-mixed and structured populations.**

We also compare the analytical predictions by our framework to the results from previous work, which was only at a numerical level. As shown in Fig. 5, we find our analytical results align qualitatively with the α-β phase diagrams presented in previous research⁵¹. Although there are differences in detail between the results obtained from non-marginal selection through numerical simulations (Fig. 5a, c) and those derived under weak selection via analytical solutions (Fig. 5b, d), both approaches consistently predict unique behaviors in structured populations that are absent in well-mixed populations. For instance, both the nonmarginal and weak selection strengths indicate the existence of a (C + E)_V phase at low α and high β, where strategy D becomes extinct and strategies C and E coexist, equivalent to the Voter model^68,69. Moreover, at moderate levels of α and β, we anticipate a D ⇔ (C+E)_V phase under weak selection. In this phase, the system evolves towards either D or (C+E)_V based on the initial state, although strategy C may eventually become extinct due to the continuous introduction of a small number of defectors⁷⁰. A similar phase, named D_h(E), is detected under non-marginal selection. The term ‘h’ denotes ‘homoclinic instability’, implying that strategy E can overcome D through a nucleation mechanism, particularly if a small colony of E players survives after the extinction of cooperators. This likelihood increases with larger populations.

**Fig. 5: Phase diagrams of the system behavior with respect to punishing cost α and fine β are qualitatively similar under non-marginal and weak selection strength.**

Pool punishment in public goods games

Another example is pool punishment in public goods games^8,71. From the perspective of computational complexity, pool punishment differs from peer punishment in its nonlinear payoff structure, which requires utilizing Eq. (13).

Similarly, there are n = 3 strategies: 1 = Cooperation (C), 2 = Defection (D), and 3 = Pool punishment (O). Again, based on the 2-strategy public goods game, the third strategy, pool punishment, contributes a cost α to the public pool for punishment. A defector is punished with a fine β if the public pool for punishment has funds (i.e., there is at least one punisher among the co-players); if no punishers are present, the defector incurs no charge. Irrespective of the number of defecting co-players 0 ≤ k₂ ≤ k, a punishing player pays α. It is also assumed that those employing pool punishment engage in cooperative behavior, investing c to the common pool, making the strategy C a second-order free-rider.

Our analysis in structured populations (Supplementary Note 3.3) and the traditional analysis for well-mixed populations reveal that pool punishment does not change the fact that the system cannot converge to a defection-free state when r < k + 1, as demonstrated in Fig. 6a, b. However, along the DO edge (i.e., without the presence of strategy C), the system can evolve to a final state of either full D or full O, depending on the initial conditions. As the punishing fine β increases, the attraction basin for strategy D shrinks. In well-mixed populations, strategy D retains an attraction basin regardless of the punishment strength (Fig. 6c). This is consistent with previous findings that pool punishment does not effectively resolve social dilemmas in well-mixed populations⁵⁹. However, in structured populations, the attraction basin for strategy D can be completely eliminated if the punishing fine β exceeds a critical threshold, β > β^⋆, where

$${\beta }^{\star }=\frac{k+1}{2}\left(-\frac{rc}{k+1}+c+\alpha \right).$$

(7)

Given that O and C are still unstable, the system consequently enters a cyclic dominance pattern among D, O, and C in such scenarios (Fig. 6d). The cyclic dominance follows the sequence D → O → C → D → ⋯ . Numerical observations from previous studies suggest that pool punishment can resolve social dilemmas in structured populations by inducing a cycle of defection⁸. Our theoretical approach provides accurate insight into this phenomenon.

**Fig. 6: Pool punishment can resolve the social dilemma of public goods game in structured populations.**

Similarly, the distinct impacts of pool punishment in well-mixed and structured populations can be identified by the fraction of defectors, ${x}_{D}^{(DO)}$, in an unstable edge equilibrium, ${{{{{{{{\bf{x}}}}}}}}}^{(DO)}=(0,\, {x}_{D}^{(DO)},\, 1-{x}_{D}^{(DO)})$, as shown in Fig. 6e. When ${x}_{D}^{(DO)} \, > \, 1$, this unstable equilibrium vanishes, leading to instability of the D-vertex equilibrium. In well-mixed populations, ${x}_{D}^{(DO)} \, < \, 1$ and ${x}_{D}^{(DO)}\to 1$ as β → ∞, suggesting that the described scenario is unfeasible. Conversely, in structured populations, ${x}_{D}^{(DO)} \, > \, 1$ becomes true once β > β^⋆. Pool punishment also presents a paradoxical effect: when ${x}_{D}^{(DO)} \, < \, 0$, the system consistently converges to the full defection state, even along the DO edge, indicating ineffective punishment. As the punishing fine β increases, pool punishment first becomes effective in well-mixed populations at $\beta \, > \, {\beta }_{0}^{{{{{{{{\rm{WM}}}}}}}}}$. Structured populations, in contrast, require a bit higher β₀ threshold for effective punishment. Pool punishment is less advantageous in structured populations than in well-mixed populations when β < β₌ (Fig. 6a, b). Nevertheless, structured populations gain an advantage when β > β₌, and can eventually prevent the fixation of defection by inducing cyclic dominance among the three strategies at sufficient high β > β^⋆. The comparison between well-mixed and structured populations in relation to the punishing fine is shown in Fig. 6f, with the expressions for key β values listed in Fig. 7.

**Fig. 7: Analytical results of public goods game with pool punishment in both well-mixed and structured populations.**

Again, we compare our analytical predictions to the results from previous numerical work. The analytical results are in qualitative agreement with the α-β phase diagrams from previous research⁸, as shown in Fig. 8. Again, while there are detailed differences between outcomes derived from non-marginal selection through numerical simulation (Fig. 8a, c) and those obtained under weak selection with analytical methods (Fig. 8b, d), both approaches indicate distinct behavioral patterns in structured populations that are not observed in well-mixed populations. For example, both non-marginal and weak selection indicate the existence of a cyclic dominance phase, (D + C + O)_C, at low α and high β, where strategy D invades C, strategy C invades O, and strategy O invades D. Moreover, at moderate levels of α and β, we predict a D_O⇔D phase under weak selection. In this phase, the system consistently evolves towards full D in the three-strategy space; however, in the absence of strategy C, the system instead evolves towards either full O or full D based on the initial state. A comparable phase, named F_O⇔D, is detected under non-marginal selection. The term ‘F’ denotes ‘fixation’, which means that system evolves towards either full O or full D.

**Fig. 8: Phase diagrams of the system behavior with pool punishment are qualitatively similar under non-marginal and weak selection strength.**

Discussion

Spatial evolutionary dynamics under weak selection can be considered as the incorporation of a marginal game effect (δ → 0⁺) on the Voter model^68,69 (δ = 0). In structured populations, identical strategies naturally become adjacent to each other, forming clusters through neutral drift, a process independent of the game, as described by the first-order Taylor expansion in edge dynamics. This inherent tendency for the same strategies to cluster together leads to what is known as spatial reciprocity, a phenomenon captured by the second-order Taylor expansion. Simply put, under weak selection, clusters of the same strategy, caused by spatial structures, unilaterally affect the emergence of cooperation. Conversely, the evolution of cooperation does not influence the spatial pattern of these clusters. This character under weak selection reduces computational complexity, making the closed solution for various evolutionary dynamics such as multi-strategy systems on graphs possible.

In the family of evolutionary graph theory with weak selection and pair approximation, which covers two-strategy two-player, multi-strategy two-player, and two-strategy multiplayer games, we fill in the last piece of the puzzle: the multi-strategy multiplayer games. For a focal individual, we illustrate every possible configuration in which k identical co-players are distributed among n distinct strategies. On this basis, we calculate the group-based payoff for any number of strategies via a bottom-up approach. While we identify each co-player by pair approximation, the (k + 1)-player game is treated as a whole and the smallest indivisible unit in our statistical analysis. In this way, the payoff computation for the focal individual is not merely a sum of pairwise interactions, but rather an n-element function of the configuration k = (k₁, k₂, …, k_n), determined by all co-players simultaneously. The nonlinearity of payoff functions cannot be derived from the superposition pairwise interactions, which reflects the higher-order properties of multi-strategy multiplayer games that are different from multi-strategy two-player games³⁵.

Building on the group-based payoff calculation, we develop strategy update dynamics on regular graphs using the standard pair approximation method^29,33 under two common update rules: pairwise comparison and death-birth. Interestingly, our general findings are in line with those previously obtained through a different theoretical approach for two-strategy systems¹⁸. In particular, our n-strategy replicator equations imply that pairwise comparison equates to competition among all n strategies between first-order neighbors, while death-birth is equivalent to competition among second-order neighbors. While this is consistent with the previous conclusions for two-strategy systems¹⁸, our results further extend them to the generalized n-strategy space.

It is worth mentioning that by contrasting a profile of our results with the other approach in two-strategy public goods games^47,48, we can see the limitations of pair approximation: unlike their approach, which can account for triangle motifs, our pair approximation cannot. According to previous works on pair approximation^45,46, we see that under the death-birth rule (i.e., second-order neighbor competition), pair approximation results align with the other approach only in the absence of triangle motifs. Under the pairwise comparison rule (i.e., first-order neighbor competition), however, triangle motifs appear to have no effect on multiplayer games⁴⁸, where the results of pair approximation always match those of the other approach, which considers triangle motifs. This is one reason why pairwise comparison is the primary focus of this paper. For rigor, applying our framework to a specific network structure is best followed by our basic assumption: the absence of triangle motifs. We look forward to a new theory in the future that will cancel this assumption.

For any multi-strategy multiplayer game in our framework, we need only input the payoff function a_i∣k for each strategy i across all (k + n − 1)!/[(n − 1)!k!] co-player strategy configurations k. Then, we can apply the general formula provided in this work to obtain the replicator equations on a regular graph. For general payoff functions, we have decomposed the general replicator equation into sums of expected single-game payoffs, as shown in Eq. (13) (for PC updates) and Supplementary Eq. (S184) (for DB updates). From there, it simplifies the problem to calculating the single games under different strategy configurations and then summing them up. The computation is feasible in polynomial time, which is related to the number of strategies n. We find the computational complexity is O(n²) for pairwise comparison and O(n³) for death-birth. For certain specific payoff functions, the general formula may be further simplified, depending on whether the expected payoff across different strategy configurations has a simple primitive functional form. As an example, we provide a simple general formula for linear payoff functions in both pairwise comparison and death-birth updates, as shown in Eq. (5) and Supplementary Eq. (S187).

As an application of our theoretical framework, we revisit the second-order free-riding problem. In a simple three-strategy system of cooperation, defection, and cooperative punishment, the defection strategy is a free-rider from cooperation, while the original cooperation is also a free-rider from cooperative punishment. Prior research has shown that costly punishment in well-mixed populations cannot truly resolve social dilemmas⁵⁹, although in structured populations it can^8,51. We further interpret the conclusion within our analytical framework, revealing an accurate threshold for punishment strength β^⋆ in both linear peer punishment and nonlinear pool punishment systems. When the punishment strength β > β^⋆, costly punishment can resolve the social dilemma in structured populations. In peer punishment, a sufficiently strong punishment eliminates the attraction basin of full defection in the bi-stable state space. In pool punishment, a strong enough punishment leads the system to a rock-paper-scissors-like cyclic dominance. The results obtained under weak selection also qualitatively reproduce the phase diagrams found in earlier numerical studies under non-marginal selection^8,51, identifying unique phases observable only in structured populations.

In addition, our general n-strategy dynamics framework can reduce to classic two-strategy multiplayer game dynamics at n = 2. First, although some prior work has explored specific models under pairwise comparison^55,72, to our knowledge, no work has provided a general replicator equation and discussion for two-strategy multiplayer games under pairwise comparison. As a complement to this, we discuss the general replicator equation when n = 2 for two-strategy multiplayer games under pairwise comparison in Supplementary Note 2.5. Second, the general replicator equations for two-strategy multiplayer games under death-birth have been discussed by Li et al.⁴⁶. We show that our results obtained under death-birth are identical to theirs at n = 2 (Supplementary Note 4.5).

Our theoretical framework is widely applicable, yielding analytical solutions for numerous multistrategy multiplayer game models previously proposed. Besides the two punishment mechanisms investigated in the main text, we also explore the reward mechanism^9,52,53,64 (a mirror mechanism to punishment) and multi-stage public goods game⁶⁵ (a four-strategy system) in Supplementary Information. Classic three-strategy games remaining unexplored include tax-based reward and punishment systems¹⁰, the loner strategy^4,54, and so on. Moreover, a pair of additional strategies can be introduced together to create four-strategy systems, such as the competition between peer and pool punishment⁵¹. In fact, provided coevolutionary factors expressed as payoff functions of co-player configurations, any multi-strategy multiplayer game system can be analyzed within our framework.

Methods

Bottom-up statistical quantities

Here, we provide the microscopic details behind the expected payoffs. There are several necessary variations of k for expressing the details. One is k_+l, which still contains k co-players but describe a configuration with at least one l-player. The variables satisfy $\sum_{\ell=1}^{n}{k}_{\ell }=k-1$, with the number of l-players (when ℓ = l) written as k_l + 1. Similarly, k_−i,+j can describe a configuration where the variables satisfy ${\sum }_{\ell=1}^{n}{k}_{\ell }=k$, with the number of i-players as k_i − 1 and j-players as k_j + 1. Also, note that ${{{{{{{{\bf{k}}}}}}}}}^{{\prime} }$ and k^″ are different variables and have no relation with k. The primes are only to distinguish the sequence of summations: we do the computation on k^″, ${{{{{{{{\bf{k}}}}}}}}}^{{\prime} }$, and finally k.

We start from the level where k is given. Given neighbor configuration k for a focal j-player, its accumulated payoff can be expressed as

$${\pi }_{j}^{{{{{{{{\bf{k}}}}}}}}}={a}_{j| {{{{{{{\bf{k}}}}}}}}}+\mathop{\sum}_{l=1}^{n}{k}_{l}\mathop{\sum}_{{\sum}_{\ell=1}^{n}{k}_{\ell }^{{\prime} }=k-1}\frac{(k-1)!}{{\prod}_{\ell=1}^{n}{k}_{\ell }^{{\prime} }!}\left({\prod }_{\ell=1}^{n}{{q}_{\ell | l}}^{{k}_{\ell }^{{\prime} }}\right){a}_{j| {{{{{{{{\bf{k}}}}}}}}}_{+l}^{{\prime} }}.$$

(8)

The j-player accumulates payoff from the games organized by itself and its $\sum_{l=1}^{n}{k}_{l}=k$ neighbors. The visualization of Eq. (8) is shown in Fig. 9a and b. Similarly, the accumulated payoff of an i-player neighboring a j-player, given the j-player’s neighbor configuration k, can be expressed and calculated by

$$\pi_{i| \, j}^{{{{{{\mathbf{k}}}}}}} =a_{i|{{{{{{\mathbf{k}}}}}}}_{-i,+j}}+{\sum}_{{\sum}_{l=1}^n k^{\prime}_l=k-1}\frac{(k-1)!}{{\prod}_{l=1}^n k^{\prime}_l!} \left(\mathop{\prod}_{l=1}^n {q_{l|i}}^{k^{\prime\prime}_l}\right) \\ \left(a_{i|{{{{{{\mathbf{k}}}}}}}^{\prime}_{+j}}+{\sum}_{l=1}^n k^{\prime}_l {\sum}_{{\sum}_{\ell=1}^n k^{\prime\prime}_\ell=k-1}\frac{(k-1)!}{{\prod}_{\ell=1}^n k^{\prime}_\ell!} \left({\prod}_{\ell=1}^n {q_{\ell|l}}^{k^{\prime\prime}_\ell}\right) a_{i|{{{{{{\mathbf{k}}}}}}}^{\prime\prime}_{+l}} \right).$$

(9)

The i-player accumulates payoff from the games organized by the j-player (Fig. 9c), itself (Fig. 9d), and its remaining $\sum_{l=1}^{n}{k}_{l}^{{\prime} }=k-1$ neighbors (Fig. 9e). Further explanations of Eqs. (8) and (9) can be found in Supplementary Information.

**Fig. 9: Visualization of the bottom-up statistical payoff calculation when every individual has k = 4 neighbors.**

Based on the microscopic quantities given specific k, we can further express expected values over all possible k. The expected payoff of a focal X-player over all possible k can be statistically computed as

$$\left\langle {\pi }_{X}^{{{{{{{{\bf{k}}}}}}}}}\right\rangle={\sum}_{{\sum}_{{i}^{{\prime} }=1}^{n}{k}_{{i}^{{\prime} }}=k}\frac{k!}{{\prod}_{{i}^{{\prime} }=1}^{n}{k}_{{i}^{{\prime} }}!}\left({\prod }_{{i}^{{\prime} }=1}^{n}{{q}_{{i}^{{\prime} }| X}}^{{k}_{{i}^{{\prime} }}}\right){\pi }_{X}^{{{{{{{{\bf{k}}}}}}}}}.$$

(10)

The possible neighbor configurations satisfying $\sum_{{i}^{{\prime} }=1}^{n}{k}_{{i}^{{\prime} }}=k$ are found around the X-player as identified by ${q}_{{i}^{{\prime} }| X}$. Here, ${i}^{{\prime} }$ is an independent count, which has no relation with i.

Similarly, the expected payoff of an i-player neighboring an X-player over all possible neighbor configuration k_+i of the X-player can be expressed as

$$\left\langle {\pi }_{i| X}^{{{{{{{{{\bf{k}}}}}}}}}_{+i}}\right\rangle={\sum}_{\sum_{{i}^{{\prime} }=1}^{n}{k}_{{i}^{{\prime} }}=k-1}\frac{(k-1)!}{{\prod}_{{i}^{{\prime} }=1}^{n}{k}_{{i}^{{\prime} }}!}\left({\prod }_{{i}^{{\prime} }=1}^{n}{{q}_{{i}^{{\prime} }| X}}^{{k}_{{i}^{{\prime} }}}\right){\pi }_{i| X}^{{{{{{{{{\bf{k}}}}}}}}}_{+i}}.$$

(11)

Here, k_+i is because we have a specific i-player in the neighbor configuration of the X-player. The remaining k − 1 neighbors $\sum_{{i}^{{\prime} }=1}^{n}{k}_{{i}^{{\prime} }}=k-1$ of the X-player are found around the X-player, as identified by ${q}_{{i}^{{\prime} }| X}$.

Eqs. (9) and (11) may seem redundant because they do not appear directly in the final results. However, they are crucial in the process of deductions for both pairwise comparison and death-birth rules.

We also have the similar notation for expected single-game payoff. To specify, the expected payoff of an i-player in a single game over all co-player configuration k, where k is found neighboring an X-player, is expressed as

$${\langle {a}_{i| {{{{{{{\bf{k}}}}}}}}}\rangle }_{X}={\sum}_{\sum_{{i}^{{\prime} }=1}^{n}{k}_{{i}^{{\prime} }}=k}\frac{k!}{{\prod}_{{i}^{{\prime} }=1}^{n}{k}_{{i}^{{\prime} }}!}\left({\prod }_{{i}^{{\prime} }=1}^{n}{{q}_{{i}^{{\prime} }| X}}^{{k}_{{i}^{{\prime} }}}\right){a}_{i| {{{{{{{\bf{k}}}}}}}}}.$$

(12)

The concepts in Eqs. (8)–(12) are sufficient to identify the difference between this work and the previous literature^29,33,46. In particular, they emphasize that the minimal unit to refer is the co-player configuration k, based on which the payoff of the multiplayer game is computed. We do not try to decompose the multi-body interaction identified by k into multiple pairwise interactions.

Combined with the pair approximation method³³ and detailed calculations in the strategy evolution dynamics, we can then obtain the master equation (4) in the main text (see Supplementary Note 2.4.3 for the approach of pair approximation deduction).

The decomposition to single games

The following form of the master equation is important, which holds for any n-strategy multiplayer game and allows us to obtain the replicator dynamics by summing the expected payoff calculations in a series of single games:

$${\dot{x}}_{i}= \frac{\delta (k-2)}{2(k-1)}{x}_{i}{\sum }_{j=1}^{n}{x}_{j}\Bigg({\langle {a}_{i| {{{{{{{{\bf{k}}}}}}}}}_{+j}}\rangle }_{i}+(k-1){\langle {a}_{i| {{{{{{{{\bf{k}}}}}}}}}_{+j}}\rangle }_{j}+{\langle {a}_{i| {{{{{{{{\bf{k}}}}}}}}}_{+i}}\rangle }_{i} -{\langle {a}_{j| {{{{{{{{\bf{k}}}}}}}}}_{+i}}\rangle }_{j}\\ -{\langle {a}_{j| {{{{{{{{\bf{k}}}}}}}}}_{+i}}\rangle }_{i}-(k-2)\mathop{\sum }_{j=1}^{n}{x}_{l}{\langle {a}_{j| {{{{{{{{\bf{k}}}}}}}}}_{+l}}\rangle }_{l}-{\langle {a}_{j| {{{{{{{{\bf{k}}}}}}}}}_{+j}}\rangle }_{j}\Bigg).$$

(13)

In application, given the payoff functions a_i∣k where i = 1, 2, …, n, we can compute all 〈 ⋅ 〉 terms and then ensemble them to obtain the replicator equations. The result of each 〈 ⋅ 〉 should be a function of x₁, x₂, …, x_n (transformed from q_j∣i manually), degree k, and game parameters.

The advantage of Eq. (13) is that we have attributed everything about 〈 ⋅ 〉 into two types, the ‘${\langle {a}_{i| {{{{{{{{\bf{k}}}}}}}}}_{+j}}\rangle }_{i}$ type’ and the ‘${\langle {a}_{i| {{{{{{{{\bf{k}}}}}}}}}_{+j}}\rangle }_{j}$ type’. They can be expressed by matrices through i and j:

The ${\langle {a}_{i| {{{{{{{{\bf{k}}}}}}}}}_{+j}}\rangle }_{i}$ type:

$${\left[{\langle {a}_{i| {{{{{{{{\bf{k}}}}}}}}}_{+j}}\rangle }_{i}\right]}_{ij}=\left(\begin{array}{cccc}{\langle {a}_{1| {{{{{{{{\bf{k}}}}}}}}}_{+1}}\rangle }_{1}&{\langle {a}_{1| {{{{{{{{\bf{k}}}}}}}}}_{+2}}\rangle }_{1}&\cdots \,&{\langle {a}_{1| {{{{{{{{\bf{k}}}}}}}}}_{+n}}\rangle }_{1}\\ {\langle {a}_{2| {{{{{{{{\bf{k}}}}}}}}}_{+1}}\rangle }_{2}&{\langle {a}_{2| {{{{{{{{\bf{k}}}}}}}}}_{+2}}\rangle }_{2}&\cdots \,&{\langle {a}_{2| {{{{{{{{\bf{k}}}}}}}}}_{+n}}\rangle }_{2}\\ \vdots &\vdots &\ddots &\vdots \\ {\langle {a}_{n| {{{{{{{{\bf{k}}}}}}}}}_{+1}}\rangle }_{n}&{\langle {a}_{n| {{{{{{{{\bf{k}}}}}}}}}_{+2}}\rangle }_{n}&\cdots \,&{\langle {a}_{n| {{{{{{{{\bf{k}}}}}}}}}_{+n}}\rangle }_{n}\end{array}\right).$$

(14)

The ${\langle {a}_{i| {{{{{{{{\bf{k}}}}}}}}}_{+j}}\rangle }_{j}$ type:

$${\left[{\langle {a}_{i| {{{{{{{{\bf{k}}}}}}}}}_{+j}}\rangle }_{j}\right]}_{ij}=\left(\begin{array}{cccc}{\langle {a}_{1| {{{{{{{{\bf{k}}}}}}}}}_{+1}}\rangle }_{1}&{\langle {a}_{1| {{{{{{{{\bf{k}}}}}}}}}_{+2}}\rangle }_{2}&\cdots \,&{\langle {a}_{1| {{{{{{{{\bf{k}}}}}}}}}_{+n}}\rangle }_{n}\\ {\langle {a}_{2| {{{{{{{{\bf{k}}}}}}}}}_{+1}}\rangle }_{1}&{\langle {a}_{2| {{{{{{{{\bf{k}}}}}}}}}_{+2}}\rangle }_{2}&\cdots \,&{\langle {a}_{2| {{{{{{{{\bf{k}}}}}}}}}_{+n}}\rangle }_{n}\\ \vdots &\vdots &\ddots &\vdots \\ {\langle {a}_{n| {{{{{{{{\bf{k}}}}}}}}}_{+1}}\rangle }_{1}&{\langle {a}_{n| {{{{{{{{\bf{k}}}}}}}}}_{+2}}\rangle }_{2}&\cdots \,&{\langle {a}_{n| {{{{{{{{\bf{k}}}}}}}}}_{+n}}\rangle }_{n}\end{array}\right).$$

(15)

There are n² elements in each matrix. Their diagonals are equal, meaning we can compute n fewer elements. Therefore, the total amount of computation is n² + n² − n = (2n − 1)n elements. The computational complexity is O(n²), which can be accomplished in polynomial time (we also see that the death-birth rule’s computational complexity is O(n³), as specified in Supplementary Information).

Special linear system

Although Eq. (13) allows general calculations of any multiplayer game, we do not have to employ it directly every time. For some special payoff structures, we can deduce simplified general forms in advance. Here, we present the general results of a commonly studied subclass, the linear multiplayer games.

The linear multiplayer games in this work are defined as those whose payoff structure can be expressed as linear functions of co-player ${{{\mathbf{k}}}}=(k_1,k_2,\, \ldots ,k_n)$. That is, ${a}_{i| {{{{{{{\bf{k}}}}}}}}}={\sum }_{j=1}^{n}{b}_{ij}{k}_{j}+{c}_{i}$, where

$${{{{{{{\bf{b}}}}}}}}=\left(\begin{array}{cccc}{b}_{11}&{b}_{12}&\cdots \,&{b}_{1n}\\ {b}_{21}&{b}_{22}&\cdots \,&{b}_{2n}\\ \vdots &\vdots &\ddots &\vdots \\ {b}_{n1}&{b}_{n2}&\cdots \,&{b}_{nn}\end{array}\right),\, {{{{{{{\bf{c}}}}}}}}=\left(\begin{array}{c}{c}_{1}\\ {c}_{2}\\ \vdots \\ {c}_{n}\end{array}\right).$$

(16)

For linear multiplayer games, the payoff structure is completely determined by the matrix b and c.

Once we apply ${a}_{i| {{{{{{{\bf{k}}}}}}}}}={\sum }_{j=1}^{n}{b}_{ij}{k}_{j}+{c}_{i}$ to compute the ‘${\langle {a}_{i| {{{{{{{{\bf{k}}}}}}}}}_{+j}}\rangle }_{i}$ type’ and the ‘${\langle {a}_{i| {{{{{{{{\bf{k}}}}}}}}}_{+j}}\rangle }_{j}$ type’ as shown in Eqs. (14) and (15) and transform all q_j∣i quantities to x_j quantities, we can obtain ${\langle {a}_{i| {{{{{{{{\bf{k}}}}}}}}}_{+j}}\rangle }_{i}=(k-2){\sum }_{l=1}^{n}{b}_{il}{x}_{l}+{b}_{ii}+{b}_{ij}+{c}_{i}$, ${\langle {a}_{i| {{{{{{{{\bf{k}}}}}}}}}_{+j}}\rangle }_{j}=(k-2){\sum }_{l=1}^{n}{b}_{il}{x}_{l}+2{b}_{ij}+{c}_{i}$. Then, substituting the results into Eq. (13) leads to Eq. (5) in the main text.

As we mentioned in the main text, in Eq. (5), ${\bar{\pi }}_{i}$ and $\bar{\pi }$ are mean payoffs of i-players and all players in a well-mixed population. They can be calculated by the traditional replicator dynamics, but to specify, using the matrix b and c, they can also be written as ${\bar{\pi }}_{i}=k{\sum }_{l=1}^{n}{x}_{l}{b}_{il}+{c}_{i}$, $\bar{\pi }={\sum }_{i=1}^{n}{x}_{i}{\bar{\pi }}_{i}=k{\sum }_{i=1}^{n}{\sum }_{l=1}^{n}{x}_{i}{x}_{l}{b}_{il}+{\sum }_{i=1}^{n}{x}_{i}{c}_{i}$. Logically, this is how we replace the corresponding terms in Eq. (5) by ${\bar{\pi }}_{i}$ and $\bar{\pi }$.

Data availability

All data generated or analysed during this study are included within the paper and its supplementary information files.

References

Szolnoki, A. et al. Cyclic dominance in evolutionary games: a review. J. R. Soc. Interface 11, 20140735 (2014).
Article PubMed PubMed Central Google Scholar
Sinervo, B. & Lively, C. M. The rock-paper-scissors game and the evolution of alternative male strategies. Nature 380, 240–243 (1996).
Article ADS CAS Google Scholar
Kerr, B., Riley, M. A., Feldman, M. W. & Bohannan, B. J. Local dispersal promotes biodiversity in a real-life game of rock-paper-scissors. Nature 418, 171–174 (2002).
Article ADS CAS PubMed Google Scholar
Hauert, C., De Monte, S., Hofbauer, J. & Sigmund, K. Volunteering as red queen mechanism for cooperation in public goods games. Science 296, 1129–1132 (2002).
Article ADS CAS PubMed Google Scholar
Hofbauer, J. & Sigmund, K. Evolutionary Games and Population Dynamics (Cambridge University Press, 1998).
Semmann, D., Krambeck, H.-J. & Milinski, M. Volunteering leads to rock-paper-scissors dynamics in a public goods game. Nature 425, 390–393 (2003).
Article ADS CAS PubMed Google Scholar
Fehr, E. & Gächter, S. Altruistic punishment in humans. Nature 415, 137–140 (2002).
Article ADS CAS PubMed Google Scholar
Szolnoki, A., Szabó, G. & Perc, M. Phase diagrams for the spatial public goods game with pool punishment. Phys. Rev. E 83, 036101 (2011).
Article ADS Google Scholar
Sigmund, K., Hauert, C. & Nowak, M. A. Reward and punishment. Proc. Natl Acad. Sci. 98, 10757–10762 (2001).
Article ADS CAS PubMed PubMed Central Google Scholar
Wang, S., Liu, L. & Chen, X. Tax-based pure punishment and reward in the public goods game. Phys. Lett. A 386, 126965 (2021).
Article MathSciNet CAS Google Scholar
Sigmund, K. The Calculus of Selfishness (Princeton University Press, 2010).
Nowak, M. A. & May, R. M. Evolutionary games and spatial chaos. Nature 359, 826–829 (1992).
Article ADS Google Scholar
Nowak, M. A. Five rules for the evolution of cooperation. Science 314, 1560–1563 (2006).
Article ADS CAS PubMed PubMed Central Google Scholar
Nowak, M. A., Tarnita, C. E. & Antal, T. Evolutionary dynamics in structured populations. Philos. Trans. R. Soc. B: Biol. Sci. 365, 19–30 (2010).
Article Google Scholar
Ibsen-Jensen, R., Chatterjee, K. & Nowak, M. A. Computational complexity of ecological and evolutionary spatial dynamics. Proc. Natl Acad. Sci. 112, 15636–15641 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Lieberman, E., Hauert, C. & Nowak, M. A. Evolutionary dynamics on graphs. Nature 433, 312–316 (2005).
Article ADS CAS PubMed Google Scholar
Taylor, P. D., Day, T. & Wild, G. Evolution of cooperation in a finite homogeneous graph. Nature 447, 469–472 (2007).
Article ADS CAS PubMed Google Scholar
Allen, B. & Nowak, M. A. Games on graphs. EMS Surv. Math. Sci. 1, 113–151 (2014).
Article MathSciNet Google Scholar
Débarre, F., Hauert, C. & Doebeli, M. Social evolution in structured populations. Nat. Commun. 5, 3409 (2014).
Article ADS PubMed Google Scholar
Allen, B. et al. Evolutionary dynamics on any population structure. Nature 544, 227–230 (2017).
Article ADS CAS PubMed Google Scholar
McAvoy, A., Allen, B. & Nowak, M. A. Social goods dilemmas in heterogeneous societies. Nat. Hum. Behav. 4, 819–831 (2020).
Article PubMed Google Scholar
Su, Q., McAvoy, A., Mori, Y. & Plotkin, J. B. Evolution of prosocial behaviours in multilayer populations. Nat. Hum. Behav. 6, 338–348 (2022).
Article PubMed Google Scholar
Su, Q., McAvoy, A. & Plotkin, J. B. Strategy evolution on dynamic networks. Nat. Comput. Sci. 3, 763–776 (2023).
Article PubMed Google Scholar
Gutowitz, H. A., Victor, J. D. & Knight, B. W. Local structure theory for cellular automata. Phys. D. 28, 18–48 (1987).
Article MathSciNet Google Scholar
Matsuda, H., Tamachi, N., Sasaki, A. & Ogita, N. in Mathematical Topics in Population Biology, Morphogenesis and Neurosciences. Lecture Notes in Biomathematics 154–161 (Springer, 1987).
Szabó, G., Szolnoki, A. & Bodócs, L. Correlations induced by transport in one-dimensional lattice gas. Phys. Rev. A 44, 6375 (1991).
Article ADS PubMed Google Scholar
Matsuda, H., Ogita, N., Sasaki, A. & Satō, K. Statistical mechanics of population: the lattice Lotka-Volterra model. Prog. Theor. Phys. 88, 1035–1049 (1992).
Article ADS Google Scholar
Szabó, G. & Fath, G. Evolutionary games on graphs. Phys. Rep. 446, 97–216 (2007).
Article ADS MathSciNet Google Scholar
Ohtsuki, H., Hauert, C., Lieberman, E. & Nowak, M. A. A simple rule for the evolution of cooperation on graphs and social networks. Nature 441, 502–505 (2006).
Article ADS CAS PubMed PubMed Central Google Scholar
Ohtsuki, H., Nowak, M. A. & Pacheco, J. M. Breaking the symmetry between interaction and replacement in evolutionary dynamics on graphs. Phys. Rev. Lett. 98, 108106 (2007).
Article ADS PubMed PubMed Central Google Scholar
Su, Q., Allen, B. & Plotkin, J. B. Evolution of cooperation with asymmetric social interactions. Proc. Natl Acad. Sci. 119, e2113468118 (2022).
Article CAS PubMed Google Scholar
Su, Q., McAvoy, A., Wang, L. & Nowak, M. A. Evolutionary dynamics with game transitions. Proc. Natl Acad. Sci. 116, 25398–25404 (2019).
Article ADS MathSciNet CAS PubMed PubMed Central Google Scholar
Ohtsuki, H. & Nowak, M. A. The replicator equation on graphs. J. Theor. Biol. 243, 86–97 (2006).
Article ADS MathSciNet PubMed PubMed Central Google Scholar
Taylor, P. D. & Jonker, L. B. Evolutionary stable strategies and game dynamics. Math. Biosci. 40, 145–156 (1978).
Article MathSciNet Google Scholar
Perc, M., Gómez-Gardenes, J., Szolnoki, A., Floría, L. M. & Moreno, Y. Evolutionary dynamics of group interactions on structured populations: a review. J. R. Soc. Interface 10, 20120997 (2013).
Article PubMed PubMed Central Google Scholar
Alvarez-Rodriguez, U. et al. Evolutionary dynamics of higher-order interactions in social networks. Nat. Hum. Behav. 5, 586–595 (2021).
Article PubMed Google Scholar
Battiston, F. et al. The physics of higher-order interactions in complex systems. Nat. Phys. 17, 1093–1098 (2021).
Article CAS Google Scholar
Tarnita, C. E., Ohtsuki, H., Antal, T., Fu, F. & Nowak, M. A. Strategy selection in structured populations. J. Theor. Biol. 259, 570–581 (2009).
Article ADS MathSciNet PubMed Google Scholar
Tarnita, C. E., Wage, N. & Nowak, M. A. Multiple strategies in structured populations. Proc. Natl Acad. Sci. 108, 2334–2337 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
McAvoy, A. & Wakeley, J. Evaluating the structure-coefficient theorem of evolutionary game theory. Proc. Natl Acad. Sci. 119, e2119656119 (2022).
Article MathSciNet CAS PubMed PubMed Central Google Scholar
Wu, B., Traulsen, A. & Gokhale, C. S. Dynamic properties of evolutionary multi-player games in finite populations. Games 4, 182–199 (2013).
Article MathSciNet Google Scholar
McAvoy, A. & Hauert, C. Structure coefficients and strategy selection in multiplayer games. J. Math. Biol. 72, 203–238 (2016).
Article MathSciNet PubMed Google Scholar
Duong, M. H. & Han, T. A. On the expected number of equilibria in a multi-player multi-strategy evolutionary game. Dyn. Games Appl. 6, 324–346 (2016).
Article MathSciNet Google Scholar
Duong, M. H. & Han, T. A. Analysis of the expected density of internal equilibria in random evolutionary multi-player multi-strategy games. J. Math. Biol. 73, 1727–1760 (2016).
Article MathSciNet PubMed Google Scholar
Li, A., Wu, B. & Wang, L. Cooperation with both synergistic and local interactions can be worse than each alone. Sci. Rep. 4, 5536 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Li, A., Broom, M., Du, J. & Wang, L. Evolutionary dynamics of general group interactions in structured populations. Phys. Rev. E 93, 022407 (2016).
Article ADS MathSciNet PubMed Google Scholar
Su, Q., Li, A., Wang, L. & Eugene Stanley, H. Spatial reciprocity in the evolution of cooperation. Proc. R. Soc. B 286, 20190041 (2019).
Article PubMed PubMed Central Google Scholar
Wang, C. & Szolnoki, A. Inertia in spatial public goods games under weak selection. Appl. Math. Comput. 449, 127941 (2023).
MathSciNet Google Scholar
Sigmund, K. Punish or perish? Retaliation and collaboration among humans. Trends Ecol. Evol. 22, 593–600 (2007).
Article PubMed Google Scholar
Helbing, D., Szolnoki, A., Perc, M. & Szabó, G. Punish, but not too hard: how costly punishment spreads in the spatial public goods game. N. J. Phys. 12, 083005 (2010).
Article Google Scholar
Szolnoki, A., Szabó, G. & Czakó, L. Competition of individual and institutional punishments in spatial public goods games. Phys. Rev. E 84, 046106 (2011).
Article ADS Google Scholar
Rand, D. G., Dreber, A., Ellingsen, T., Fudenberg, D. & Nowak, M. A. Positive interactions promote public cooperation. Science 325, 1272–1275 (2009).
Article ADS MathSciNet CAS PubMed PubMed Central Google Scholar
Hilbe, C. & Sigmund, K. Incentives and opportunism: from the carrot to the stick. Proc. R. Soc. B: Biol. Sci. 277, 2427–2433 (2010).
Article Google Scholar
Szabó, G. & Hauert, C. Phase transitions and volunteering in spatial public goods games. Phys. Rev. Lett. 89, 118101 (2002).
Article ADS PubMed Google Scholar
Wang, S., Chen, X., Xiao, Z. & Szolnoki, A. Decentralized incentives for general well-being in networked public goods game. Appl. Math. Comput. 431, 127308 (2022).
MathSciNet Google Scholar
Sun, Z., Chen, X. & Szolnoki, A. State-dependent optimal incentive allocation protocols for cooperation in public goods games on regular networks. IEEE Trans. Netw. Sci. Eng. 10, 3975–3988 (2023).
MathSciNet Google Scholar
Szabó, G. & Tőke, C. Evolutionary prisoner’s dilemma game on a square lattice. Phys. Rev. E 58, 69 (1998).
Article ADS Google Scholar
Ohtsuki, H., Iwasa, Y. & Nowak, M. A. Indirect reciprocity provides only a narrow margin of efficiency for costly punishment. Nature 457, 79–82 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Sigmund, K., De Silva, H., Traulsen, A. & Hauert, C. Social learning promotes institutions for governing the commons. Nature 466, 861–863 (2010).
Article ADS CAS PubMed Google Scholar
Hardin, G. The tragedy of the commons. Science 162, 1243–1248 (1968).
Article ADS CAS PubMed Google Scholar
Wang, C., Zhu, W. & Szolnoki, A. The conflict between self-interaction and updating passivity in the evolution of cooperation. Chaos. Solit. Fractals 173, 113667 (2023).
Article MathSciNet Google Scholar
Wang, C., Zhu, W. & Szolnoki, A. When greediness and self-confidence meet in a social dilemma. Phys. A 625, 129033 (2023).
Article MathSciNet Google Scholar
Zhang, W. & Brandes, U. Is cooperation sustained under increased mixing in evolutionary public goods games on networks? Appl. Math. Comput. 438, 127604 (2023).
MathSciNet Google Scholar
Szolnoki, A. & Perc, M. Reward and cooperation in the spatial public goods game. Europhys. Lett. 92, 38003 (2010).
Article ADS Google Scholar
Szolnoki, A. & Chen, X. Tactical cooperation of defectors in a multi-stage public goods game. Chaos Solit. Fractals 155, 111696 (2022).
Article Google Scholar
Brandt, H., Hauert, C. & Sigmund, K. Punishment and reputation in spatial public goods games. Proc. R. Soc. B 270, 1099–1104 (2003).
Article PubMed PubMed Central Google Scholar
Helbing, D., Szolnoki, A., Perc, M. & Szabó, G. Evolutionary establishment of moral and double moral standards through spatial interactions. PLoS Comput. Biol. 6, e1000758 (2010).
Article ADS MathSciNet PubMed PubMed Central Google Scholar
Clifford, P. & Sudbury, A. A model for spatial conflict. Biometrika 60, 581–588 (1973).
Article MathSciNet Google Scholar
Liggett, T. M. Interacting Particle Systems (Springer, 1985).
Helbing, D., Szolnoki, A., Perc, M. & Szabó, G. Defector-accelerated cooperativeness and punishment in public goods games with mutations. Phys. Rev. E 81, 057104 (2010).
Article ADS Google Scholar
Sasaki, T., Uchida, S. & Chen, X. Voluntary rewards mediate the evolution of pool punishment for maintaining public goods in large populations. Sci. Rep. 5, 8917 (2015).
Article ADS PubMed PubMed Central Google Scholar
Luo, Q., Liu, L. & Chen, X. Evolutionary dynamics of cooperation in the N-person stag hunt game. Phys. D. 424, 132943 (2021).
Article MathSciNet Google Scholar

Download references

Acknowledgements

M.P. was supported by the Slovenian Research and Innovation Agency (Javna agencija za znanstvenoraziskovalno in inovacijsko dejavnost Republike Slovenije) (Grant Nos. P1-0403 and N1-0232). A.S. was supported by the National Research, Development and Innovation Office (NKFIH) under Grant No. K142948.

Author information

Authors and Affiliations

Department of Computational and Data Sciences, George Mason University, Fairfax, VA, 22030, USA
Chaoqian Wang
Faculty of Natural Sciences and Mathematics, University of Maribor, Koroška cesta 160, 2000, Maribor, Slovenia
Matjaž Perc
Community Healthcare Center Dr. Adolf Drolc Maribor, Vošnjakova ulica 2, 2000, Maribor, Slovenia
Matjaž Perc
Complexity Science Hub Vienna, Josefstädterstraße 39, 1080, Vienna, Austria
Matjaž Perc
Department of Physics, Kyung Hee University, 26 Kyungheedae-ro, Dongdaemun-gu, Seoul, Republic of Korea
Matjaž Perc
Institute of Technical Physics and Materials Science, Centre for Energy Research, P.O. Box 49, H-1525, Budapest, Hungary
Attila Szolnoki

Authors

Chaoqian Wang
View author publications
You can also search for this author in PubMed Google Scholar
Matjaž Perc
View author publications
You can also search for this author in PubMed Google Scholar
Attila Szolnoki
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C.W. conceived and designed the research with contributions from M.P. and A.S.; C.W. performed the calculations; C.W. and A.S. analyzed the results; C.W., M.P., and A.S. wrote the paper and approved the submission.

Corresponding author

Correspondence to Chaoqian Wang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks The Anh Han and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wang, C., Perc, M. & Szolnoki, A. Evolutionary dynamics of any multiplayer game on regular graphs. Nat Commun 15, 5349 (2024). https://doi.org/10.1038/s41467-024-49505-5

Download citation

Received: 23 December 2023
Accepted: 05 June 2024
Published: 24 June 2024
DOI: https://doi.org/10.1038/s41467-024-49505-5

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Evolutionary dynamics of any multiplayer game on regular graphs

Subjects

Abstract

Similar content being viewed by others

Reconstructing higher-order interactions in coupled dynamical systems

A benchmarking study of quantum algorithms for combinatorial optimization

Assembly theory explains and quantifies selection and evolution

Introduction

Results

Model overview

Group-based payoff with any number of strategies

General replicator equations

Peer punishment in public goods games

Pool punishment in public goods games

Discussion

Methods

Bottom-up statistical quantities

The decomposition to single games

Special linear system

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Supplementary Information

Peer Review File

Rights and permissions

About this article

Cite this article

Comments

Search

Quick links

Subjects

Abstract

Similar content being viewed by others

Reconstructing higher-order interactions in coupled dynamical systems

A benchmarking study of quantum algorithms for combinatorial optimization

Assembly theory explains and quantifies selection and evolution

Introduction

Results

Model overview

Group-based payoff with any number of strategies

General replicator equations

Peer punishment in public goods games

Pool punishment in public goods games

Discussion

Methods

Bottom-up statistical quantities

The decomposition to single games

Special linear system

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Supplementary Information

Peer Review File

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links