An Introduction to decision theory by Martin Peterson

What it is about

As the title suggests, this is a high level introduction to and overview of the main concepts of decision theory. It covers the basic ideas from philosophical as well as technical/mathematical perspective. It does include some maths and at least some knowledge of probability theory and high school algebra is definitely needed to understand the key concepts. The book also covers Bayesian reasoning and Game theory both of which are very relevant for anyone making decisions under risk and/or uncertainty/ignorance.


Related titles

  • The Decision Book: Fifty Models for Strategic Thinking by Mikael Krogerus 

  • Rational Choice in an Uncertain World by Reid Hastie

  • Thinking, Fast and Slow by Daniel Kahneman

Categories

decision-making | rationality



Key ideas & Notes

  • Chapter 1: Introduction

    • Descriptive Decision Theory — How people actually make decisions. How people actually behave is likely to change over time and across cultures

    • Normative Decision Theory —How people ought to make (rational) decisions. Should be agnostic of culture and time

    • A decision can be right without being rational and can be rational without being right

    • Decision is RIGHT if and only if its actual outcome is at least as good as that of every other possible outcomes.

    • Decision is RATIONAL if and only if the decision maker chooses to do what she has the most reason to do at that point in time

    • INSTRUMENTAL rationality

      • Assumes a decision maker has some aim

      • Aims in themselves cannot be irrational and are external to the decision theory

      • To be instrumentally rational means to do what one has the most reason to expect will fulfil ones aim.

    • Decisions under RISK

      • The probability of the possible outcomes IS known by the decision maker

      • Most widely used approach is the principle of maximising the expected value

      • The total value of an act equals the sum of the values of its possible outcomes weighted by the probability for each outcome

    • Decision under UNCERTAINTY / IGNORANCE

      • The probability of the possible outcomes IS NOT known by the decision maker

    • SOCIAL CHOICE theory — Aims to establish principles for how decisions involving more than one decision maker out to be made.

  • Chapter 2: The Decision Matrix

    • DECISION PROBLEM

      • States — Parts of the world that are not and outcome or an act. Devices needed for applying acts

      • Outcomes — What ultimately matters

      • Acts — Instruments to reach the outcomes.

    • SCALES

      • ORDINAL scales — merely express preference ordering. No information about by how much one preference is better/worse than another preference. Qualitative comparison of objects

      • CARDINAL scales

        • INTERVAL scales — Accurately reflect the difference between the objects being measured (but not the ratio). Example: Comparing Celsius and Fahrenheit scales. Quantitative comparison of objects.

        • RATIO scales — Reflect ratios. A pair of ratio scales are equivalent if and only if each can be transformed into the other by multiplying all values by some positive constant. Quantitative comparison of objects. Example: Height, Weight

  • Chapter 3: Decisions under ignorance

    • The decision maker knows what the alternatives and what outcomes they may result it, but is unable to assign any probabilities to the states corresponding to the outcomes

    • Dominance — A1 dominates A2 if A1 always leads to better outcomes regardless no matter what states of the world happen to be true

    • Maximin

      • Focus on the worst possible outcome

      • One should maximise the minimal value obtainable with each act

      • Ai ≥ Aj if and only if min(Ai) ≥ min(Aj) for all i and j

      • Allows value to be measure on Cardinal scale

    • Leximin — Lexical Maximin

      • Addresses a scenario where worst outcomes are identical

      • If worst outcomes are equal, choose the alternative with the best second worst outcome (if equal, choose third, fourth etc)

      • Allows value to be measure on Cardinal scale

    • Maximax — Maximise the maximum value obtainable with an act. Allows value to be measure on Cardinal scale

    • Optimism-Pesimism rule

      • Generalisation of the maximin and maximax rules

      • Requires value to be measure on an Interval scale

      • Consider best and worst possible outcomes of each alternative and then choose an altertive according to the decision makers level of optimism/pesimism

    • Minimax Regret — The best alternative is the one that minimises the maximum amount of regret

    • Principle of insufficient reason

      • If one has no reason to think that one state of the world is more likely than another, then all states should be assigned equal probability

      • Enables transforming decisions under ignorance into decisions under risk

  • Chapter 4: Decisions under risk

    • The decision maker knows the probabilities of the outcomes

    • Principle of maximising the expected value favoured by most experts

    • Expected Value

      • EV = p1 · v1 + p2 · v2 + ... + pn · vn

      • Value is evaluated from the decisions maker's point of view

    • Expected Utility

      • Utility of an outcome depends on how valuable the outcomes is from the decision maker's perspective

      • EU = p1 · u1 + p2 · u2 + ... + pn · un

      • Based on the law of large numbers, in the long run it is best to maximise expected utility

      • Assumes that probability of each outcome remains stable over time.

  • Chapter 6: The Mathematics of probability

    • Kolmogorov (1933) — Every probability is a real number between 0 and 1: 1 ≥ p(A) ≥ 0

    • The probability of the entire sample space is 1: p(S) = 1

    • If two events are mutually exclusive, then the probability that one of them will occur equals the probability of he first + the probability of the second

      • if A ∩ B = 0, then p(A U B) = p(A) + p(B)

      • if A and B are mutually exclusive then p(A ∨ B) = p(A) + p(B)

      • p(A) + p(negA) = 1

      • If A and B are logically equivalent, then p(A) = p(B)

    • Conditional probability

      • p(A|B) = p(A ∧ B) / p(B)

      • A is independent of B if and only if p(A) = p(A|B)

      • If A is independent of B then p(A ∧ B) = p(A) · p(B)

    • Bayes Theorem

      • p(B|A) = p(B) · p(A|B) / p(A)

      • p(B) is prior probability

      • Unknown priors is a problem, but can be reduced by applying Bayes theorem over and over again.

      • The more times the Bayes theorem is applied, the closer to the truth (posterior probability value) we will get (regardless of the initial prior value!)

  • Chapter 7: The Philosophy of probability

    • Objectivists — Believe that statements about probability refer to the facts in the real world

    • Subjectivists — Deny objectivists' claims. Statements about probability refer to the degree to which the speaker believes in something. Probabilities are entities that humans somehow create in their own minds. When two decision makers hold different subjective probabilities they just happen to believe something to different degrees.

    • Classical interpretation

      • Probability of an event seen as a fraction of the total number of possible ways in which the event can occur

      • Presupposes that all possible outcomes are equally likely

    • Frequency interpretation

      • The probability of an event is the ratio between the number of time the event occurred and the total number of observed cases.

      • Always defined relative to some reference class

      • Key challenge - how to determine which reference class is suitable and why?

    • Propensity Interpretation

      • Developed by Karl Popper in 1950s

      • Probability is related to certain features of the real world based on their tendency to give rise to a certain effect

      • Propensity can't be directly observed --> We need to rely on indirect evidence

      • Propensities have a temporal direction: if A has a propensity to give rise to B, then A cannot occur after B (like causality). In contrast, probabilities do NOT have a temporal direction.

    • Logical interpretation

      • Developed by Keynes and Carnap

      • Often call epistemic probability

      • Probability is a logical relation between a hypothesis and the evidence supporting it.

  • Chapter 8: Why should we accept preference axioms?

    • RISK AVERSION

      • Against actuarial risks — Risk averter is defined as one who, starting from a position of certainty, is unwilling to take a bet which is actuarially fair.. Prefers (smaller) certainty even over lotteries that may bring more significant gain.

      • Against utility risks —The risk averter should not only maximise utility but also, at least sometimes, apply some decision rule that puts emphasis on avoiding bad outcomes. E.g. maximin rule

      • Against epistemic risks — Example: The maximin criterion for expected utilities (MMEU) - the alternative with the largest minimal utility ought to be selected. Encourages decision makers to expect worst

    • Utility function — Is able to represent those preferences if it is possible to assign a real number to each alternative, in such a way that alternative a is assigned a number greater than alternative b if, and only if, the individual prefers alternative a to alternative b. A decision maker with a concave utility function will always prefer a smaller prize for certain over actuarially fair lottery over larger and smaller prizes. The flatter the utility function is in a given interval the more actuarially risk averse the decision maker is in that interval

  • Chapter 9: Causal vs. Evidential decision theory

    • Newcomb's problem — Shows that Dominance principle and maximising expected utility theory yield conflicting recommendations

    • Causal decision theory — Decision maker should keep all her beliefs about causal processes fixed during the decision making process and always choose an alternative that is optimal according to these beliefs. Causal structure is forward looking and completely insensitive to the past. Recommends to do what is likely going to bring the best possible result, while holding fixed all views about the likely causal structure of the world.

    • Evidential Decision Theory — Instead of asking "If I do X will Y happen" we should ask "What is the probability that if I were to do X, then Y would be the case, given that I do X?"

  • Chapter 10: Bayesian vs. non-Bayesian decision theory

    • Bayesianism

      • Epistemic component — What rational agents ought to believe and which combinations of beliefs and desires are rationally permisable.

      • Deliberative component — Tells us what action is rational for the agent to perform given his or her present state of mind.

      • Ordering axiom — Most fundamental axiom of Beysianism. For any two uncertain prospects, the decision maker must be able to state a clear and unambiguous preference and all such preferences must be asymmetric and transitive.

    • Non-Bayesian theories

      • Externalist — An act is rational not merely by virtue of what the decision maker believes and desires. Rejects Humean belief-desire account which is considered too narrow

      • Internalist — Decision maker's beliefs and desires is all that matters when adjudicating whether an act is rational or not. In contrast to Bayesianism, preferences over risky acts are not just tools to measure degrees of belief and desires but constitute reasons to preferring one risky act over another

  • Chapter 11: Game Theory

    • Studies decisions where outcomes depend partly on what other people/decision makers do

    • Prisoner's dilemma

      • What is optimal for each individual need NOT coincide with what is optimal for the group.

      • Individual rationality sometimes comes into conflict with group rationality.

      • Arise whenever a game is symmetrical i.e. everyone is facing the same strategies and outcomes and the ranking of outcomes is 2,2 4,1, 1,4 3,3

    • Non-cooperative, simultaneous-strategy, symmetric, nonzero-sum and finite game

    • Taxonomy

      • zero sum — you win as much as your opponent(s) lose (casino games, chess). The total amount of money/units of utility is fixed no matter what happens.

      • Non-cooperative — Players are not able to form binding agreements (but players CAN cooperate).

      • Simultaneous vs sequential

        • Simultaneous move — Players decide on their strategies without knowing what the other player(s) will do.

        • Sequential move — players have some (or full) information about the strategies played by other players in earlier rounds

      • Symmetric — All players face the same strategies and outcomes. Identity of players does not matter

      • Equilibrium

        • A pair of strategies is in equilibrium if and only if it holds that once this pair of strategies is chosen none of the players can reach a better outcome by UNILATERALLY switching to another strategy

        • To find equilibrium strategies look for strategies that fulfil the minimax condition. The outcome is determined by the minimal value of the row and the maximal value of the column

      • Finitely vs infinitely iterated

        • Finitely iterated — Games iterated finite number of times. In finitely iterated game rational players will behave in exactly the same way as in the one-shot version of the game

        • Infinitely iterated — Players do not know in advance whether they are about to play the last round of the game. The games does NOT actually need to be iterated ad infinitum. Each player may adjust his next move to what the opponent did in the previous round. Tit-for-tat is a best strategy: always cooperate in the first round and thereafter adjust your behaviour to whatever your opponent did in the previous round.

  • Chapter 12: Non zero-sum and cooperative games

    • Nash equilibrium

      • Rational players will do whatever they can to ensure that they do not feel unnecessarily unhappy about their decision.

      • An equilibrium point is a set of strategies such that each player's strategy maximises his pay-off if the strategies of the others are held fixed. Thus each player's strategy is optimal against those of the others. [Nash 1950]

      • Many non zero-sum games have more than one Nash equilibrium which means such games can't be "solved" in a sense that we can't exactly figure out what rational players would do by just applying Nash equilibrium concept.

    • Pareto efficient state —A state is Pareto efficient if and only if no one's utility level can be increased unless the utility level for someone else is decreased.

  • Chapter 13: Social Choice Theory

    • Analyses collective decision problems and how a set of individual preference orderings G can be aggregated in a systematic manner into a Social preference ordering S.

    • Social choice problem — Any decision problem faced by a group in which each individual is wiling to state at least ordinal preferences over outcomes. How to translate INDIVIDUAL preference ordering into SOCIAL preference ordering.

    • Social state — The state of the world that includes everything that individuals care about.

    • Social Welfare Function (SWF) — any decision rule that aggregates a set of individual preference orderings over social states into a social preference ordering over those states. (e.g. Majority rule). Any normatively reasonable SWF should be non-dictatorial i.e. S must not always coincide with the preference ordering of a particular individual. For every possible combination of individual preference orderings, an SWF must produce a social preference ordering that is complete, asymmetric and transitive. (i.e. Majority rule does not meet this condition)

  • Chapter 14: Descriptive decision theory

    • People often act in ways that are irrational

    • Certainty effect — People value certain gain more that equally large expected gain. People reason incorrectly about small probabilities

    • Loss aversion is often stronger than gain preference

    • Prospect Theory — Descriptive theory of choice under risk. Expected utility principle should be modified by introducing two weighing functions: one for value and one for probability. The value of Loss or Gain is NOT linear as prescribed by expected utility theory.


Summary

A somewhat technical book which some may find a bit heavy going in places, but it provides a good and fairly comprehensive introduction to main aspects, models and methods of decision theory. It is not a ‘practical’ book in a sense that it does not provide a set of specific real-world examples or situations and how to apply decision models and approaches. I would recommend this book to anyone who is interested in broadening their knowledge of decision theory and rational decision making. It also works really well as a ‘springboard’ for further study of specific approaches and methods and their practical application in critical decision making in life and in business. PS: The mindmap below contains additional details, concepts and ideas not covered in the overview above.


Mindmap