Strategyproofness




In game theory, an asymmetric game where players have private information is said to be strategyproof (SP) if it is a weakly-dominant strategy for every player to reveal his/her private information,[1]:244 i.e. you fare best or at least not worse by being truthful, regardless of what the others do.


SP is also called truthful or dominant-strategy-incentive-compatible (DSIC),[1]:415 to distinguish it from other kinds of incentive compatibility.


An SP game is not always immune to collusion, but its robust variants are; with group strategyproofness no group of people can collude to misreport their preferences in a way that makes every member better off, and with strong group strategyproofness no group of people can collude to misreport their preferences in a way that makes at least one member of the group better off without making any of the remaining members worse off.[2]




Contents






  • 1 Examples


  • 2 Notation


  • 3 Characterization


    • 3.1 Outcome-function characterization




  • 4 Truthful mechanisms in single-parameter domains


  • 5 Truthfulness with-high-probability


  • 6 False-name-proofness


  • 7 See also


  • 8 Further reading


  • 9 References





Examples


Typical examples of SP mechanisms are majority voting between two alternatives, second-price auction and any VCG mechanism.


Typical examples of a mechanisms that are not SP are plurality voting between three or more alternatives, and first-price auction.


SP is also applicable in network routing. Consider a network as a graph where each edge (i.e. link) has an associated cost of transmission, privately known to the owner of the link. The owner of a link wishes to be compensated for relaying messages. As the sender of a message on the network, one wants to find the least cost path. There are efficient methods for doing so, even in large networks. However, there is one problem: the costs for each link are unknown. A naive approach would be to ask the owner of each link the cost, use these declared costs to find the least cost path, and pay all links on the path their declared costs. However, it can be shown that this payment scheme is not SP, that is, the owners of some links can benefit by lying about the cost. We may end up paying far more than the actual cost. It can be shown that given certain assumptions about the network and the players (owners of links), a variant of the VCG mechanism is SP.



Notation


There is a set X{displaystyle X}X of possible outcomes.


There are n{displaystyle n}n agents which have different valuations for each outcome. The valuation of agent i{displaystyle i}i is represented as a function:


vi:X⟶R+{displaystyle v_{i}:Xlongrightarrow R_{+}}{displaystyle v_{i}:Xlongrightarrow R_{+}}

which expresses the value it has for each alternative, in monetary terms.


It is assumed that the agents have Quasilinear utility functions; this means that, if the outcome is x{displaystyle x}x and in addition the agent receives a payment pi{displaystyle p_{i}}p_{i} (positive or negative), then the total utility of agent i{displaystyle i}i is:


ui:=vi(x)+pi{displaystyle u_{i}:=v_{i}(x)+p_{i}}{displaystyle u_{i}:=v_{i}(x)+p_{i}}

The vector of all value-functions is denoted by v{displaystyle v}v.


For every agent i{displaystyle i}i, the vector of all value-functions of the other agents is denoted by v−i{displaystyle v_{-i}}{displaystyle v_{-i}}. So v≡(vi,v−i){displaystyle vequiv (v_{i},v_{-i})}{displaystyle vequiv (v_{i},v_{-i})}.


A mechanism is a pair of functions:



  • An Outcome{displaystyle Outcome}{displaystyle Outcome} function, that takes as input the value-vector v{displaystyle v}v and returns an outcome x∈X{displaystyle xin X}xin X (it is also called a social choice function);

  • A Payment{displaystyle Payment}{displaystyle Payment} function, that takes as input the value-vector v{displaystyle v}v and returns a vector of payments, (p1,…,pn){displaystyle (p_{1},dots ,p_{n})}{displaystyle (p_{1},dots ,p_{n})}, determining how much each player should receive (a negative payment means that the player should pay a positive amount).


A mechanism is called strategyproof if, for every player i{displaystyle i}i and for every value-vector of the other players v−i{displaystyle v_{-i}}{displaystyle v_{-i}}:


vi(Outcome(vi,v−i))+Paymenti(vi,v−i)≥vi(Outcome(vi′,v−i))+Paymenti(vi′,v−i){displaystyle v_{i}(Outcome(v_{i},v_{-i}))+Payment_{i}(v_{i},v_{-i})geq v_{i}(Outcome(v_{i}',v_{-i}))+Payment_{i}(v_{i}',v_{-i})}{displaystyle v_{i}(Outcome(v_{i},v_{-i}))+Payment_{i}(v_{i},v_{-i})geq v_{i}(Outcome(v_{i}',v_{-i}))+Payment_{i}(v_{i}',v_{-i})}


Characterization


It is helpful to have simple conditions for checking whether a given mechanism is SP or not. This subsection shows two simple conditions that are both necessary and sufficient.


If a mechanism is SP, then it must satisfy the following two conditions, for every agent i{displaystyle i}i:[1]:226


1. The payment to agent i{displaystyle i}i is a function of the chosen outcome and of the valuations of the other agents v−i{displaystyle v_{-i}}{displaystyle v_{-i}} - but not a direct function of the agent's own valuation vi{displaystyle v_{i}}v_{i}. Formally, there exists a price function Pricei{displaystyle Price_{i}}{displaystyle Price_{i}}, that takes as input an outcome x∈X{displaystyle xin X}xin X and a valuation vector for the other agents v−i{displaystyle v_{-i}}{displaystyle v_{-i}}, and returns the payment for agent i{displaystyle i}i, such that for every vi,vi′,v−i{displaystyle v_{i},v_{i}',v_{-i}}{displaystyle v_{i},v_{i}',v_{-i}}, if:


Outcome(vi,v−i)=Outcome(vi′,v−i){displaystyle Outcome(v_{i},v_{-i})=Outcome(v_{i}',v_{-i})}{displaystyle Outcome(v_{i},v_{-i})=Outcome(v_{i}',v_{-i})}

then:


Paymenti(vi,v−i)=Paymenti(vi′,v−i){displaystyle Payment_{i}(v_{i},v_{-i})=Payment_{i}(v_{i}',v_{-i})}{displaystyle Payment_{i}(v_{i},v_{-i})=Payment_{i}(v_{i}',v_{-i})}

PROOF: If Paymenti(vi,v−i)>Paymenti(vi′,v−i){displaystyle Payment_{i}(v_{i},v_{-i})>Payment_{i}(v_{i}',v_{-i})}{displaystyle Payment_{i}(v_{i},v_{-i})>Payment_{i}(v_{i}',v_{-i})} then an agent with valuation vi′{displaystyle v_{i}'}{displaystyle v_{i}'} prefers to report vi{displaystyle v_{i}}v_{i}, since it gives him the same outcome and a larger payment; similarly, if Paymenti(vi,v−i)<Paymenti(vi′,v−i){displaystyle Payment_{i}(v_{i},v_{-i})<Payment_{i}(v_{i}',v_{-i})}{displaystyle Payment_{i}(v_{i},v_{-i})<Payment_{i}(v_{i}',v_{-i})} then an agent with valuation vi{displaystyle v_{i}}v_{i} prefers to report vi′{displaystyle v_{i}'}{displaystyle v_{i}'}.


As a corollary, there exists a "price-tag" function, Pricei{displaystyle Price_{i}}{displaystyle Price_{i}}, that takes as input an outcome x∈X{displaystyle xin X}xin X and a valuation vector for the other agents v−i{displaystyle v_{-i}}{displaystyle v_{-i}}, and returns the payment for agent i{displaystyle i}i For every vi,v−i{displaystyle v_{i},v_{-i}}{displaystyle v_{i},v_{-i}}, if:


Outcome(vi,v−i)=x{displaystyle Outcome(v_{i},v_{-i})=x}{displaystyle Outcome(v_{i},v_{-i})=x}

then:


Paymenti(vi,v−i)=Pricei(x,v−i){displaystyle Payment_{i}(v_{i},v_{-i})=Price_{i}(x,v_{-i})}{displaystyle Payment_{i}(v_{i},v_{-i})=Price_{i}(x,v_{-i})}

2. The selected outcome is optimal for agent i{displaystyle i}i, given the other agents' valuations. Formally:


Outcome(vi,v−i)∈arg⁡maxx[vi(x)+Pricei(x,v−i)]{displaystyle Outcome(v_{i},v_{-i})in arg max _{x}[v_{i}(x)+Price_{i}(x,v_{-i})]}{displaystyle Outcome(v_{i},v_{-i})in arg max _{x}[v_{i}(x)+Price_{i}(x,v_{-i})]}

where the maximization is over all outcomes in the range of Outcome(⋅,v−i){displaystyle Outcome(cdot ,v_{-i})}{displaystyle Outcome(cdot ,v_{-i})}.


PROOF: If there is another outcome x′=Outcome(vi′,v−i){displaystyle x'=Outcome(v_{i}',v_{-i})}{displaystyle x'=Outcome(v_{i}',v_{-i})} such that vi(x′)+Pricei(x′,v−i)>vi(x)+Pricei(x,v−i){displaystyle v_{i}(x')+Price_{i}(x',v_{-i})>v_{i}(x)+Price_{i}(x,v_{-i})}{displaystyle v_{i}(x')+Price_{i}(x',v_{-i})>v_{i}(x)+Price_{i}(x,v_{-i})}, then an agent with valuation vi{displaystyle v_{i}}v_{i} prefers to report vi′{displaystyle v_{i}'}{displaystyle v_{i}'}, since it gives him a larger total utility.


Conditions 1 and 2 are not only necessary but also sufficient: any mechanism that satisfies conditions 1 and 2 is SP.


PROOF: Fix an agent i{displaystyle i}i and valuations vi,vi′,v−i{displaystyle v_{i},v_{i}',v_{-i}}{displaystyle v_{i},v_{i}',v_{-i}}. Denote:




x:=Outcome(vi,v−i){displaystyle x:=Outcome(v_{i},v_{-i})}{displaystyle x:=Outcome(v_{i},v_{-i})} - the outcome when the agent acts truthfully.


x′:=Outcome(vi′,v−i){displaystyle x':=Outcome(v_{i}',v_{-i})}{displaystyle x':=Outcome(v_{i}',v_{-i})} - the outcome when the agent acts untruthfully.


By property 1, the utility of the agent when playing truthfully is:


ui(vi)=vi(x)+Pricei(x,v−i){displaystyle u_{i}(v_{i})=v_{i}(x)+Price_{i}(x,v_{-i})}{displaystyle u_{i}(v_{i})=v_{i}(x)+Price_{i}(x,v_{-i})}

and the utility of the agent when playing untruthfully is:


ui(vi′)=vi(x′)+Pricei(x′,v−i){displaystyle u_{i}(v_{i}')=v_{i}(x')+Price_{i}(x',v_{-i})}{displaystyle u_{i}(v_{i}')=v_{i}(x')+Price_{i}(x',v_{-i})}

By property 2:


ui(vi)≥ui(vi′){displaystyle u_{i}(v_{i})geq u_{i}(v_{i}')}{displaystyle u_{i}(v_{i})geq u_{i}(v_{i}')}

so it is a dominant strategy for the agent to act truthfully.



Outcome-function characterization


The actual goal of a mechanism is its Outcome{displaystyle Outcome}{displaystyle Outcome} function; the payment function is just a tool to induce the players to be truthful. Hence, it is useful to know, given a certain outcome function, whether it can be implemented using a SP mechanism or not (this property is also called implementability). The Monotonicity (mechanism design) property is necessary, and often also sufficient.



Truthful mechanisms in single-parameter domains


A single-parameter domain is a game in which each player i gets a certain positive value vi for "winning" and a value 0 for "losing". A simple example is a single-item auction, in which vi is the value that player i assigns to the item.


For this setting, it is easy to characterize truthful mechanisms. Begin with some definitions.


A mechanism is called normalized if every losing bid pays 0.


A mechanism is called monotone if, when a player raises his bid, his chances of winning (weakly) increase.


For a monotone mechanism, for every player i and every combination of bids of the other players, there is a critical value in which the player switches from losing to winning.


A normalized mechanism on a single-parameter domain is truthful iff the following two conditions hold:[1]:229-230



  1. The assignment function is monotone in each of the bids, and:

  2. Every winning bid pays the critical value.



Truthfulness with-high-probability


For every constant ϵ>0{displaystyle epsilon >0}epsilon >0, a randomized mechanism is called truthful with probability 1−ϵ{displaystyle 1-epsilon }1-epsilon if for every agent and for every vector of bids, the probability that the agent benefits by bidding non-truthfully is at most ϵ{displaystyle epsilon }epsilon , where the probability is taken over the randomness of the mechanism.[1]:349


If the constant ϵ{displaystyle epsilon }epsilon goes to 0 when the number of bidders grows, then the mechanism is called truthful with high probability. This notion is weaker than full truthfulness, but it is still useful in some cases; see e.g. consensus estimate.



False-name-proofness


A new type of fraud that has become common with the abundance of internet-based auctions is false-name bids – bids submitted by a single bidder using multiple identifiers such as multiple e-mail addresses.


False-name-proofness means that there is no incentive for any of the players to issue false-name-bids. This is a stronger notion than strategyproofness. In particular, the Vickrey–Clarke–Groves (VCG) auction is not false-name-proof.[3]


False-name-proofness is importantly different from group strategyproofness because it assumes that an individual alone can simulate certain behaviours that would normally require the collusive coordination of multiple individuals.



See also



  • Incentive compatibility


  • Individual rationality - means that a player cannot lose by playing the game (i.e. a player has no incentive to avoid playing the game).



Further reading



  • Parkes, David C. (2004), On Learnable Mechanism Design, in: Tumer, Kagan and David Wolpert (Eds.): Collectives and the Design of Complex Systems, New York u.a.O., pp. 107–133.


  • On Asymptotic Strategy-Proofness of Classical Social Choice Rules An article by Arkadii Slinko about strategy-proofness in voting systems.



References





  1. ^ abcde Vazirani, Vijay V.; Nisan, Noam; Roughgarden, Tim; Tardos, Éva (2007). Algorithmic Game Theory (PDF). Cambridge, UK: Cambridge University Press. ISBN 0-521-87282-0..mw-parser-output cite.citation{font-style:inherit}.mw-parser-output .citation q{quotes:"""""""'""'"}.mw-parser-output .citation .cs1-lock-free a{background:url("//upload.wikimedia.org/wikipedia/commons/thumb/6/65/Lock-green.svg/9px-Lock-green.svg.png")no-repeat;background-position:right .1em center}.mw-parser-output .citation .cs1-lock-limited a,.mw-parser-output .citation .cs1-lock-registration a{background:url("//upload.wikimedia.org/wikipedia/commons/thumb/d/d6/Lock-gray-alt-2.svg/9px-Lock-gray-alt-2.svg.png")no-repeat;background-position:right .1em center}.mw-parser-output .citation .cs1-lock-subscription a{background:url("//upload.wikimedia.org/wikipedia/commons/thumb/a/aa/Lock-red-alt-2.svg/9px-Lock-red-alt-2.svg.png")no-repeat;background-position:right .1em center}.mw-parser-output .cs1-subscription,.mw-parser-output .cs1-registration{color:#555}.mw-parser-output .cs1-subscription span,.mw-parser-output .cs1-registration span{border-bottom:1px dotted;cursor:help}.mw-parser-output .cs1-ws-icon a{background:url("//upload.wikimedia.org/wikipedia/commons/thumb/4/4c/Wikisource-logo.svg/12px-Wikisource-logo.svg.png")no-repeat;background-position:right .1em center}.mw-parser-output code.cs1-code{color:inherit;background:inherit;border:inherit;padding:inherit}.mw-parser-output .cs1-hidden-error{display:none;font-size:100%}.mw-parser-output .cs1-visible-error{font-size:100%}.mw-parser-output .cs1-maint{display:none;color:#33aa33;margin-left:0.3em}.mw-parser-output .cs1-subscription,.mw-parser-output .cs1-registration,.mw-parser-output .cs1-format{font-size:95%}.mw-parser-output .cs1-kern-left,.mw-parser-output .cs1-kern-wl-left{padding-left:0.2em}.mw-parser-output .cs1-kern-right,.mw-parser-output .cs1-kern-wl-right{padding-right:0.2em}


  2. ^ "Group Strategy-proofness And Social Choice Between Two Alternatives" (PDF).


  3. ^ Yokoo, M.; Sakurai, Y.; Matsubara, S. (2004). "The effect of false-name bids in combinatorial auctions: New fraud in internet auctions". Games and Economic Behavior. 46: 174. doi:10.1016/S0899-8256(03)00045-9.










Popular posts from this blog

澳門輕軌系統

水泉澳邨

Indian Forest Service