We're programmed to do two things and we do them all the time: seek reward and avoid punishment. That's why we generally don't do things like put our hands in a blender or run up massive amounts of debt that we have no intention of paying back. I say generally because I know some of you reading this will come up with some whacky counter-example, but before you do please think on this. Just because you don't perceive something as a reward or a punishment it does not mean that others have the same opinion. I don't like pain too much but masochists do. What I see as punishment they see as reward. So, in this sense human beings are rational. They always seek to maximize their own utility in everything they do. In the abstract, utility can be thought of as a number or a value. If it is greater than some threshold, say zero, were happy, if it is less, we're not.
Humans can also make trade-offs. There is a cost associated with doing work. We spend our time doing something that, let's face it, we would rather not be doing. But if that work brings a bigger reward then we're quite happy to do it. So utility, in this sense, is reward minus cost.
Now let's consider the situation where two people have to work together in order to get some reward. While the reward may be the same for each of them (a simplifying assumption) the cost is clearly dependent on the division of labour. If one person does all the work then the other gets greater utility. Such situations often challenge our sense of fairness.
Game Theory
OK, so we have an assignment to complete and we have to work together on it. Given that we'll get the same mark once it's handed in it might be fair to say that our assessment of the reward \(R\) is equal. We know what we need to do in order to get that mark and have assessed the total amount of work as \(W\). Let's assume that \(W>R>\frac{W}{2}\). If the work is shared equally we then each of us will do \(\frac{W}{2}\) work. But we each have two strategies we could adopt: we can choose to work on the assignment or we could goof off. In the case where the work is shared equally our utility \(U\) can be calculated as \(U=R-\frac{W}{2}\). Where I do all the work and you do nothing my utility is \(U=R-W\) which by our definition earlier we know to be less than zero. Your utility, if I do all the work, is \(U=R\). Of course if neither of us does any work then there is no reward at all. This allows us to construct the following table.| The assignment problem payoff matrix |
In the diagram the utility I get for the different combinations is in blue and the utility you get is in red. Let's say \(W>R>\frac{W}{2}\). I can maximize my utility by not doing any work and you could do the same. So, the outcome we get is that we both goof off and the work does not get done. That is because \(0>R-W\). Or to put it another way we both think that the reward is not worth the effort if we were to do it on our own. We can each maximise our utility if we expect the other to do the work for us, so, nothing gets done. What might surprise you is that this is an equilibrium as long as \(W>R>\frac{W}{2}\). Specifically it's called a Nash equilibrium after John Nash the mathematician who's story is told in "A Beautiful Mind"
The really cool thing about this very simple idea is how it applies to many things that we do or see. For example, game theory can be used to explain the arms race during the cold war. If my country can overwhelm yours with our weapons then I have an advantage (reward). Your country's strategy in response is to build more weapons than my country has to negate my country's advantage and put yours in an advantageous position. So my country's response to that is to make even more weapons and so on...
Game theory can also explain responses to terrorism. I could go on the offense, attacking terrorist bases but I'd make myself more of a target in doing so. Alternatively, I could increase security and defensive measures making it harder to be attacked. If we're both facing the same terrorist threat then we both have the same options. However I might be more willing to take a defensive stance if you take an offensive stance and vice-versa. What decisions are actually made would depend on how we each perceive the rewards and costs.
It also explains competition amongst rivals in the animal kingdom, strategies in poker, pricing strategies in a competitive environment, bidding at auctions etc etc
In fact, the auction is a nice little diversion. Did you know, for example, that when you bid your true value of a product on eBay, that strategy is weakly dominant over any other eBay bidding strategy (including sniping)? It's true. You can't say you've won an auction if you pay more than you believe the item is worth. Why would you do that? If you bid below your true valuation of the item and lose - well you've lost. However, if you bid exactly what you think it is worth you will only lose the auction if the price exceeds that value - so it's not exactly losing is it? If the bids are less that this amount - even if one or more bidders use a sniping strategy, you'll get the item and be guaranteed not to pay more than you think it's worth.
So, next time you're thinking that someone is acting irrationally it might be worth considering what rewards they are pursuing or what punishments they are avoiding and how these might relate to the decisions they are making in an environment where the decision is made in response to or in anticipation of decisions made by others which might affect them.








