Hedging against deep and moral uncertainty

post by MichaelStJules · 2020-09-12T23:44:02.379Z · score: 28 (13 votes) · EA · GW · 6 comments

Contents

  Introduction
  An illustrative example
  Dependence
  Lexicality and deontological constraints
  Other potential examples
  Questions and possible implications
None
6 comments

Summary: Like for quantified risk, we can sometimes hedge against deep uncertainty and moral uncertainty: we can sometimes choose a portfolio of interventions which looks good in expectation to all (or more) worldviews - empirical and ethical beliefs - we find plausible, even if each component intervention is plausibly harmful or not particularly good in expectation according to some plausible worldview. We can sometimes do better than nothing in expectation when this wasn't possible by choosing a single intervention, and we can often improve the minimum expected value. I think doing so can therefore sometimes reduce complex cluelessness.

My recommendations are the following:

  1. We should, when possible, avoid portfolios (and interventions) which are robustly dominated by any other in expectation -  those worse in expectation than another under all plausible worldviews (or those ruled out by the maximality rule; EA Forum post on the paper [EA · GW]). I think this is rationally required under consequentialism, assuming standard rationality axioms under uncertainty.
  2. I further endorse choosing portfolios among those that are robustly positive in expectation - better in expectation than doing nothing under all worldviews we find plausible - if any are available. However, this is more a personal preference than a (conditional) requirement like 1, although I think something that's often implicitly assumed in EA. I think this would lead us to allocate more to work for nonhuman animals and s-risks.
  3. EAs should account for interactions between causes and conflicts in judgements about the sign of the expected value of different interventions according to different worldviews. I think this is being somewhat neglected as many EA organizations or divisions in EA organizations are cause-specific.
  4. Approaches for moral uncertainty and deep uncertainty are better applied to portfolios of interventions than to each intervention in isolation, since portfolios can promote win-wins.
  5. We should not commit to priors arbitrarily. If you don't feel justified in choosing one prior over all others (see the reference class problem), this is what sensitivity analysis and other approaches to decision making under deep uncertainty are for, and sometimes hedging can help, as I hope to illustrate in this post.

 

Introduction

EAs have often argued against diversification and for funding only the most cost-effective intervention, at least for individual donors where the marginal returns on donations are roughly constant. However, this assumes away a lot of uncertainty we could have; we might not believe any specific intervention is the most cost-effective. Due to deep uncertainty, we might not be willing to commit to a specific single joint probability distribution for the effects of our interventions, since we can't justify any choice over all others. Due to moral uncertainty, we might not be confident in how to ethically value different outcomes or actions. This can result in complex cluelessness, according to which we just don't know whether we should believe a given intervention is better or worse than another in expectation; it could go either way.

Sometimes, using a portfolio of interventions can be robustly better in expectation than doing nothing, while none of the best individual interventions according to some worldview are, since they're each plausibly harmful in expectation (whether or not we're committed to the claim that they definitely are harmful in expectation, since we may have deep or moral uncertainty about that). For example, cost-effective work in one cause might plausibly harm another cause more in expectation, and we don't know how to trade off between the two causes.

We might expect to find such robustly positive portfolios in practice where the individual interventions are not robust, because the interventions most robustly cost-effective in one domain, effect or worldview will not systematically be the most harmful in others, and if they aren't so harmful that they can't be cost-effectively compensated for with interventions optimized for cost-effectiveness in those other domains, effects or worldviews. The aim of this post is to make a more formal and EA-relevant illustration of the following reason for hedging:

We can sometimes choose portfolios which look good to all (or more) worldviews we find plausible, even if each component intervention is plausibly harmful or not particularly good in expectation according to some plausible worldview.

Diversification is of course not new in the EA community; it's an approach taken by Open Phil, and this post builds upon their "Strong uncertainty" factor, although most organizations tend not to consider the effects of interventions on non-target causes/worldviews, where hedging becomes useful.

 

An illustrative example

I will assume, for simplicity, constant marginal cost-effectiveness across each domain/effect/worldview, and that the effects of the different interventions are independent of one another. Decreasing marginal cost-effectiveness is also a separate reason for diversification, so by assuming a constant rate (which I expect is also approximately true for small donors), we can consider the uncertainty argument independently. (Thanks to Michael_Wiebe [EA · GW] for pointing this out.)

Suppose you have deep or moral uncertainty about the effects of a given global health and poverty intervention on nonhuman animals, farmed or wild, enough uncertainty that your expected value for the intervention ranges across positive and negative values, where the negative comes from effects on nonhuman animals, due to moral uncertainty about how to weigh the experiences of nonhuman animals and welfare in the wild, the meat eater problem (it may increase animal product consumption) and deep uncertainty about the effects on wild animals.

You could represent the expected cost-effectiveness across these components as a set of vectors with ranges. Assuming independent effects on each component, you might write this as the following set, a box in 3 dimensions:

Here, the first component, , is the range of expected cost-effectiveness for the humans (living in poverty), the second, , for farmed animals, and the third, , for wild animals. These aren't necessarily in the same comparable utility units across these three components. The point is that two of the components are plausibly negative in expectation, while the first is only positive in expectation, and it's plausible that the intervention does more harm than good in expectation or does more good than harm in expectation. (Depending on your kind of uncertainty, you might be able to just add each of the components instead, e.g. as , but I will continue to illustrate with separate components, since that's more general and can capture deeper uncertainty and worse moral uncertainty.)

You might also have an intervention targeting farmed animals and deep or moral uncertainty about its effects on wild animals. Suppose you represent the expected effectiveness as follows, with the effects on humans first, then on farmed animals and then on wild animals, as before:

You represent the expected effectiveness for a wild animal intervention as follows, with the effects on humans first, then on farmed animals and then on wild animals:

And finally, you have a default "do nothing" or "business as usual" option, e.g. spending selfishly:

I model  as all 0s, since I'm considering the differences in value with doing nothing, not the expected value in the universe.

Now, based on this example, we aren't confident that any of these interventions are better in expectation than , doing nothing, and generally, none of them definitely beat any other in expectation, so on this basis, we might say all of them are permissible according to the maximality rule. However, there are portfolios of these interventions that are better than doing nothing. Assuming a budget of 10 units, one such portfolio (better than ) is the following:

That is, you spend 4 times as much on  as , and 5 times as much on  as . We can divide by 10 to normalize. Notice that each components is strictly positive, so that this portfolio is good in expectation - better than  (and ) -  for humans, farmed animals and wild animals simultaneously.

According to recommendation 1, , doing nothing, is now ruled out, impermissible. This does not depend on the fact that I took differences with doing nothing, since we can shift them all the same way.

According to recommendation 2, which I only weakly endorse, each individual intervention is now ruled out, since each was plausibly negative (compared to doing nothing) in expectation, and we must choose among the portfolios that are robustly positive in expectation. Similarly, portfolios with only two interventions are also ruled out, since at least one of their components will have negative values in its range.

I constructed this example by thinking about how I could offset each intervention's harms with another's. The objection that offsetting is suboptimal [EA · GW] doesn't apply, since, by construction, I can't decide which of the interventions is best in expectation, although I know it's not doing nothing.

Note also that the cost-effectiveness values do not depend on when the effects occur. Similarly, we can hedge over time: the plausible negative effects of one intervention can be made up for with positive effects from another that occur far earlier or later in time.

 

Dependence

Now, we assumed the interventions' components were independent of one another and of the other interventions' components. With dependence, all the portfolios that were robustly at least as good as doing nothing will still be robustly as good as doing nothing, since the lower bounds under the independent case are lower bounds for the dependent case, but we could have more such portfolios. On the other hand, different portfolios could become dominated by others when modelling dependence that weren't under the assumption of independence.

 

Lexicality and deontological constraints

Under some deontological ethical theories, rule violations (you commit) can't be compensated for, no matter how small. You could represent rule violations as  - or multiples of it, without multiplying through, or using vectors for individual components to capture lexicality. Portfolios that include interventions that violate some rule will generally also violate that rule. However, we should be careful to not force cardinalization on theories that are only meant to be ordinal and do not order risky lotteries according to standard rationality axioms; see some quotes from MacAskill's thesis here on this [EA(p) · GW(p)], and this section from MichaelA's post [EA · GW].

 

Other potential examples

  1. Pairing human life-saving interventions with family planning interventions can potentially minimize externalities due to human population sizes, which we may have deep uncertainty about (although this requires taking a close look at the population effects of each, and it may not work out). These interventions could even target different regions based on particular characteristics, e.g. average quality of life, meat consumption. Counterfactually reducing populations where average welfare is worse (or meat consumption is higher) and increasing it the same amount where it's better (or meat consumption is lower) increases average and total human welfare (or total farmed animal welfare, assuming net negative lives) without affecting human population size. Of course, this is a careful balancing act, especially under deep uncertainty. Furthermore, there may remain other important externalities.
  2. We might find it plausible that incremental animal welfare reform contributes to complacency and moral licensing and have deep uncertainty about whether this is actually the case in expectation, but we might find more direct advocacy interventions that can compensate for this potential harm so that their combination is robustly positive.
  3. Extinction risk interacts with animal welfare in many ways: extinction would end factory farming, could wipe out all wild animals if complete, could prevent us from addressing wild animal suffering if only humans go extinct, and could allow us to spread animal suffering to other planets if we don't go extinct. There are other interactions and correlations with s-risks, too, since things that risk extinction could also lead to far worse outcomes (e.g. AI risk, conflict), or could prevent s-risks.
  4. Animal advocacy seems good for s-risks due to moral circle expansion [EA · GW], but there are also plausible effects going in the opposite direction, including correlations with environmentalism or "wrong" population ethics, near-misses and strategic threats [LW · GW].
  5. In the wild animal welfare space, I've been told about pairing interventions that reduce painful causes of death with population control methods to get around uncertainty about the net welfare in the wild. In principle, with a portfolio approach, it may not be necessary to pair these interventions on the same population to ensure a positive outcome in expectation, although applying them to the same population may prevent ecological risks and reduce uncertainty further.
  6. Substitution effects between animal products. We might have moral uncertainty about the sign of the expected value of an intervention raising the price of fish, in case it leads consumers to eat more chicken, and similarly for an intervention raising the price of chicken, in case it leads consumers to eat more fish. Combining both interventions can reduce both chicken and fish consumption. As before, these interventions do not have to even target the same region, as long as the increase in fish consumption in the one region is smaller than the increase in the other (assuming similar welfare, amount of product per animal, etc. or taking these into account), and the same for chicken consumption.

 

Questions and possible implications

6 comments

Comments sorted by top scores.

comment by MichaelPlant · 2020-09-14T10:44:59.731Z · score: 4 (2 votes) · EA(p) · GW(p)

I thought these ideas were interesting, but it would be useful to have a less technical and/or more intuitive explanation.

comment by MichaelStJules · 2020-09-14T16:43:35.939Z · score: 2 (1 votes) · EA(p) · GW(p)

Does the "Other potential examples" section help? Maybe I should have that section before the technical example?

comment by Michael_Wiebe · 2020-09-13T17:09:41.561Z · score: 4 (3 votes) · EA(p) · GW(p)

I think this would be easier to explain with a two-sector model: ie, just  and . Also, would it be easier to just work with algebra? Ie,  .

Assuming a budget of 6 units

How does this fit with ? That's 10 units, no?

I will assume, for simplicity, constant marginal cost-effectiveness across each domain/effect/worldview

It's worth emphasizing that this assumption rules out the diminishing returns case for diversifying; this is a feature, since we want to isolate the uncertainty-case for diversifying.

comment by MichaelStJules · 2020-09-13T17:42:50.536Z · score: 3 (2 votes) · EA(p) · GW(p)

I think this would be easier to explain with a two-sector model

I think it would get part of it across slightly more easily, although I don't think the burden is large. I think a 2-sector model might give the false impression that you should try to pair interventions so that each makes up for the negatives of the other, whereas with a good enough example for 3, people might more intuitively grasp that you have far more flexibility.

Also, would it be easier to just work with algebra? Ie,  .

I'd have to write down a bunch of inequalities to get a portfolio that's better than  and ensure that one even exists, which I think would be much harder to follow (and do, for me). I expect people would get that the general problem is a system of linear inequalities, although it's not central to the point I'm making.

How does this fit with ? That's 10 units, no?

 Ha. Thanks for pointing this out! I'll fix this.

It's worth emphasizing that this assumption rules out the diminishing returns case for diversifying; this is a feature, since we want to isolate the uncertainty-case for diversifying.

Ya, this is a good point. I'll mention this in the text.

comment by Michael_Wiebe · 2020-09-14T18:47:13.800Z · score: 1 (1 votes) · EA(p) · GW(p)

Re algebra, are you defending the numbers you gave as reasonable? Otherwise, if we're just making up numbers, might as well do the general case.

comment by MichaelStJules · 2020-09-14T19:29:46.520Z · score: 2 (1 votes) · EA(p) · GW(p)

I think the general case (with the independence and constant marginal cost-effectiveness assumptions) will be harder to follow for some readers (and not easier to follow for anyone), much more work for me (I'm not sure how I would approach it yet), and not general enough to be very useful. Even more generally, it's a multi-objective linear program, which we would solve algorithmically, not symbolically for a closed form solution.