Multi-attribute utility

From Infogalactic: the planetary knowledge core
Jump to: navigation, search

In decision theory, a multi-attribute utility function is used to represent the preferences of an agent over bundles of goods under uncertainty.

Preliminaries

A person has to decide between two or more options. The decision is based on the attributes of the options.

The simplest case is when there is only one attribute, e.g.: money. It is usually assumed that all people prefer more money to less money; hence, the problem in this case is trivial: select the option that gives you more money.

In reality, there are two or more attributes. For example, a person has to select between two employment options: option A gives him $12K per month and 20 days of vacation, while option B gives him $15K per month and only 10 days of vacation. The person has to decide between (12K,20) and (15K,10). Different people may have different preferences. Under certain conditions, a person's preferences can be represented by a numeric function. The article ordinal utility describes some properties of such functions and some ways by which they can be calculated.

Another consideration that might complicate the decision problem is uncertainty. This complication exists even when there is a single attribute, e.g.: money. For example, option A might be a lottery with 50% chance to win $2, while option B is to win $1 for sure. The person has to decide between the lottery <2:0> and the lottery <1:1>. Again, different people may have different preferences. Again, under certain conditions the preferences can be represented by a numeric function. Such functions are called cardinal utility functions. The article Von Neumann–Morgenstern utility theorem describes some ways by which they can be calculated.

The most general situation is that there are both multiple attributes and uncertainty. For example, option A may be a lottery with a 50% chance to win two apples and two bananas, while option B is to win two bananas for sure. The decision is between <(2,2):(0,0)> and <(2,0):(2,0)>. The preferences here can be represented by cardinal utility functions which take several variables (the attributes).[1]:26–27 Such functions are the focus of the current article.

The goal is to calculate a utility function u(x_1,...,x_n) which represents the person's preferences on lotteries of bundles. I.e, lottery A is preferred over lottery B if and only if the expectation of the function u is higher under A than under B:

E_A[u(x_1,...,x_n)] > E_B[u(x_1,...,x_n)]

Assessing a multi-attribute cardinal utility function

If the number of possible bundles is finite, u can be constructed directly as explained by von Neumann and Morgenstern (VNM): order the bundles from least preferred to most preferred, assign utility 0 to the former and utility 1 to the latter, and assign to each bundle in between a utility equal to the probability of an equivalent lottery.[1]:222–223

If the number of bundles is infinite, one option is to start by ignorning the randomness, and assess an ordinal utility function v(x_1,...,x_n) which represents the person's utility on sure bundles. I.e, a bundle x is preferred over a bundle y if and only if the function v is higher for x than for y:

v(x_1,...,x_n) > v(y_1,...,y_n)

This function, in effect, converts the multi-attribute problem to a single-attribute problem: the attribute is v. Then, VNM can be used to construct the function u.[1]:219–220

Note that u must be a positive monotone transformation of v. This means that there is a monotonically increasing function r: \mathbb{R}\to \mathbb{R}, such that:

u(x_1,...,x_n) = r(v(x_1,...,x_n))

The problem with this approach is that it is not easy to assess the function r. When assessing a single-attribute cardinal utility function using VNM, we ask questions such as: "What probability to win $2 is equivalent to $1?". So to assess the function r, we have to ask a question such as: "What probability to win 2 units of value is equivalent to 1 value?". The latter question is much harder to answer than the former, since it involves "value", which is an abstract quantity.

A possible solution is to calculate n one-dimensional cardinal utility functions - one for each attribute. For example, suppose there are two attributes: apples (x_1) and bananas (x_2), both range between 0 and 99. Using VNM, we can calculate the following 1-dimensional utility functions:

  • u(x_1,0) - a cardinal utility on apples when there are no bananas (the southern boundary of the domain);
  • u(99,x_2) - a cardinal utility on bananas when apples are at their maximum (the eastern boundary of the domain).

Using linear transformations, scale the functions such that they have the same value on (99,0).

Then, for every bundle (x_1',x_2'), find an equivalent bundle (a bundle with the same v) which is either of the form (x_1,0) or of the form (99,x_2), and set its utility to the same number.[1]:221–222

Often, certain independence properties between attributes can be used to make the construction of a utility function easier.

Additive independence

The strongest independence property is called additive independence. Two attributes, 1 and 2, are called additive independent, if the preference between two lotteries (defined as joint probability distributions on the two attributes) depends only on their marginal probability distributions (the marginal PD on attribute 1 and the marginal PD on attribute 2).

This means, for example, that the following two lotteries are equivalent:

  • L: An equal-chance lottery between (x_1,x_2) and (y_1,y_2);
  • M: An equal-chance lottery between (x_1,y_2) and (y_1,x_2).

In both these lotteries, the marginal PD on attribute 1 is 50% for x_1 and 50% for y_1. Similarly, the marginal PD on attribute 2 is 50% for x_2 and 50% for y_2. Hence, if an agent has additive-independent utilities, he must be indifferent between these two lotteries.[1]:229–232

A fundamental result in utility theory is that, two attributes are additive-independent, if and only if their two-attribute utility function is additive and has the form:

u(x_1,x_2)=u_1(x_1) + u_2(x_2)

PROOF:

\longrightarrow

If the attributes are additive-independent, then the lotteries L and M, defined above, are equivalent. This means that their expected utility is the same, i.e.: E_L[u]=E_M[u]. Multiplying by 2 gives:

u(x_1,x_2)+u(y_1,y_2)=u(x_1,y_2)+u(y_1,x_2)

This is true for any selection of the x_i and y_i. Assume now that y_1 and y_2 are fixed. Arbitrarily set u(y_1,y_2)=0. Write: u_1(x_1) = u(x_1,y_2) and u_2(x_2) = u(y_1,x_2). The above equation becomes:

u(x_1,x_2) = u_1(x_1)+u_2(x_2)

\longleftarrow

If the function u is additive, then by the rules of expectation, for every lottery L:

E_L[u(x_1,x_2)] = E_L[u_1(x_1)] + E_L[u_2(x_2)]

This expression depends only on the marginal probability distributions of L on the two attributes.

This result generalizes to any number of attributes: iff preferences over lotteries on attributes 1,...,n depend only on their marginal probability distributions, then the n-attribute utility function is additive:[1]:295

u(x_1,\dots,x_n) = \sum_{i=1}^n{k_i u_i(x_i)}

where u and the u_i are normalized to the range [0,1], and the k_i are normalization constants.

Much of the work in additive utility theory has been done by Peter C. Fishburn.

Utility independence

A slightly weaker independence property is utility independence. Attribute 1 is utility-independent of attribute 2, if the conditional preferences on lotteries on attribute 1 given a constant value of attribute 2, do not depend on that constant value.

This means, for example, that the preference between a lottery <(x_1,x_2):(y_1,x_2)> and a lottery <(x'_1,x_2):(y'_1,x_2)> is the same, regardless of the value of x_2.

Note that utility independence (in contrast to additive independence) is not symmetric: it is possible that attribute 1 is utility-independent of attribute 2 and not vice versa.[1]:224–229

If attribute 1 is utility-independent of attribute 2, then the utility function for every value of attribute 2 is a linear transformation of the utility function for every other value of attribute 2. Hence it can be written as:

u(x_1,x_2)=c_1(x_2)+c_2(x_2)\cdot u(x_1,x_2^0)

when x_2^0 is a constant value for attribute 2. Similarly, If attribute 2 is utility-independent of attribute 1:

u(x_1,x_2)=d_1(x_1)+d_2(x_1)\cdot u(x_1^0,x_2)

If the attributes are mutually utility independent, then the utility function u has the following multi-linear form:[1]:233–235

u(x_1,x_2)=u_1(x_1)+u_2(x_2)+k\cdot u_1(x_1)\cdot u_2(x_2)

Where k is a constant which can be positive, negative or 0.

  • When k=0, the function u is additive and the attributes are additive-independent.
  • When k\neq 0, the utility function is multiplicative, since it can be written as:
[k u(x_1,x_2)+1]=[k u_1(x_1)+1] \cdot [k u_2(x_2)+1]
where each term is a linear transformation k\cdot+1 of a utility function.

These results can be generalized to any number of attributes. Given attributes 1,...,n, if any subset of the attributes is utility-independent of its complement, then the n-attribute utility function is multi-linear and has one of the following forms:

1 + k u(x_1,\dots,x_n) = \prod_{i=1}^n{1+k k_i u_i(x_i)}

where:

  • The u and the u_i are normalized to the range [0,1];
  • The k_i are constants in [0,1];
  • k is a constant which is either in (-1,0) or in (0,\infty) (note that the limit when k\to 0 is the additive form).

Comparison of independence concepts

It is useful to compare three different concepts related to independence of attributes: Additive-independence (AI), Utility-independence (UI) and Preference-independence (PI).[1]:344

AI and UI both concern preferences on lotteries and are explained above. PI concerns preferences on sure outcomes and is explained in the article on ordinal utility.

Their implication order is as follows:

AI ⇒ UI ⇒ PI

AI is a symmetric relation (if attribute 1 is AI of attribute 2 then attribute 2 is AI of attribute 1), while UI and PI are not.

AI implies mutual UI. The opposite is, in general, not true; it is true only if k=0 in the multi-linear formula for UI attributes. But if, in addition to mutual UI, there exist x_1,x_2,y_1,y_2 for which the two lotteries L and M, defined above, are equivalent - then k must be 0, and this means that the preference relation must be AI.[1]:238–239

UI implies PI. The opposite is, in general, not true. But if:

  • there are at least 3 essential attributes, and:
  • all pairs of attributes {1,i} are PI of their complement, and:
  • attribute 1 is UI of its complement,

then all attributes are mutually UI. Moreover, in that case there is a simple relation between the cardinal utility function u representing the preferences on lotteries, and the ordinal utility function v representing the preferences on sure bundles. The function u must have one of the following forms:[1]:330–332[2]

  • Additive: u(x_1,...,x_n) = v(x_1,...,x_n)
  • Multiplicative: u(x_1,...,x_n) = [exp(R\cdot v(x_1,...,x_n))-1]/[exp(R)-1]

where R\neq 0.

PROOF: It is sufficient to prove that u has constant absolute risk aversion with respect to the value v.

  • The PI assumption with n\geq 3 imply that the value function is additive, i.e.:
v(x_1,\dots,x_n)=\sum_{i=1}^{n}{\lambda_i v_i(x_i)}
  • Let x_1, z_1 be two different values for attribute 1. Let y_1 be the certainty-equivalent of the lottery <x_1:z_1>. The UI assumption implies that, for every combination (w_2,\dots,w_n) of values of the other attributes, the following equivalence holds:
(y_1,w)\sim<(x_1,w):(z_1,w)>
  • The two previous statements imply that for every w, the following equivalence holds in the value space:
\lambda_1 v_1(y_1) + \sum_{i=2}^n{\lambda_i v_i(w_i)} \sim <\lambda_1 v_1(x_1) + \sum_{i=2}^n{\lambda_i v_i(w_i)} : \lambda_1 v_1(z_1) + \sum_{i=2}^n{\lambda_i v_i(w_i)}>
  • This implies that, adding any quantity to both sides of a lottery (through the term \sum_{i=2}^n{\lambda_i v_i(w_i)}), increases the certainty-equivalent of the lottery by the same quantity.
  • The latter fact implies constant risk aversion.

References

  1. 1.00 1.01 1.02 1.03 1.04 1.05 1.06 1.07 1.08 1.09 1.10 1.11 Lua error in package.lua at line 80: module 'strict' not found.
  2. This idea is attributed to Richard F. Meyer and John W. Pratt.