Linch's Shortform

post by Linch · 2019-09-19T00:28:40.280Z · score: 8 (2 votes) · EA · GW · 41 comments

Comments sorted by top scores.

comment by Linch · 2020-09-24T08:48:56.001Z · score: 42 (16 votes) · EA(p) · GW(p)

Here are some things I've learned from spending the better part of the last 6 months either forecasting or thinking about forecasting, with an eye towards beliefs that I expect to be fairly generalizable to other endeavors.

Note that I assume that anybody reading this already has familiarity with Philip Tetlock's work on (super)forecasting, particularly Tetlock's 10 commandments for aspiring superforecasters.

1. Forming (good) outside views is often hard but not impossible. I think there is a common belief/framing in EA and rationalist circles that coming up with outside views is easy, and that the real difficulties are a) originality in inside views, and b) the debate over how much to trust outside views vs. inside views.

I think this is directionally true (original thought is harder than synthesizing existing views) but it hides a lot of the details. It's often quite difficult to come up with and balance good outside views that are applicable to a situation. See Manheim [LW · GW] and Muehlhauser [LW · GW] for some discussions of this.

2. For novel out-of-distribution situations, "normal" people often trust centralized data/ontologies more than is warranted. See here for a discussion. I believe something similar is true for trust of domain experts, though this is more debatable.

3. The EA community overrates the predictive validity and epistemic superiority of forecasters/forecasting.

(Note that I think this is an improvement over the status quo in the broader society, where by default approximately nobody trusts generalist forecasters at all)

I've had several conversations where EAs will ask me to make a prediction, I'll think about it a bit and say something like "I dunno, 10%?" and people will treat it as a fully informed prediction to base decisions on, rather than just another source of information among many.

I think this is clearly wrong. In any situation where you are a reasonable person and you've spent 10x (sometimes 100x or more!) as much time thinking about a question as I have, you should just trust your own judgments much more than mine on that question.

To a first approximation, good forecasters have three things: 1) They're fairly smart. 2) They're willing to actually do the homework. 3) They have an intuitive sense of probability.

This is not nothing, but it's also pretty far from everything you want in an epistemic source.

4. The EA community overrates Superforecasters and Superforecasting techniques. I think the types of questions and responses Good Judgment .* is interested in are a particular way [EA(p) · GW(p)] to look at the world. I don't think it is always applicable (easy EA-relevant example: your Brier score is basically the same if you give 0% for 1% probabilities, and vice versa), and it's bad epistemics to collapse all of "figuring out the future in a quantifiable manner" into a single paradigm.
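
(As a concrete illustration of the 0%-vs-1% point: this is just the standard Brier score definition worked through on a toy example, not anything from GJP or Metaculus. A minimal sketch:)

```python
# Expected Brier score for a binary event whose true probability is 1%,
# comparing a forecast of 0% against a forecast of 1%.
def expected_brier(forecast, true_prob):
    # Brier score for a binary outcome is (forecast - outcome)^2;
    # take the expectation over the outcome distribution.
    return true_prob * (forecast - 1) ** 2 + (1 - true_prob) * forecast ** 2

print(expected_brier(0.00, 0.01))  # ~0.0100
print(expected_brier(0.01, 0.01))  # ~0.0099
# The gap is ~0.0001: the scoring rule barely distinguishes 0% from 1%,
# even though those two forecasts imply very different decisions about rare risks.
```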

Likewise, I don't think there's a clear dividing line between good forecasters and GJP-certified Superforecasters, so many of the issues I mentioned in #3 are just as applicable here.

I'm not sure how to collapse all the things I've learned on this topic into a few short paragraphs, but the tl;dr is that I trusted superforecasters much more than I trusted other EAs before I started forecasting stuff, and now I consider their opinions and forecasts "just" an important component of my overall thinking, rather than those of a clear epistemic superior to defer to.

5. Good intuitions are really important. I think there's a Straw Vulcan approach to rationality where people think "good" rationality is about suppressing your System 1 in favor of clear thinking and logical propositions from your System 2. I think there's plenty of evidence that this is wrong*. For example, the cognitive reflection test was originally supposed to be a test of how well people suppress their "intuitive" answers to instead think through the question and provide the right "unintuitive" answers. However, we've since learned (from one fairly good psych study; it may not replicate, but it accords with my intuitions and recent experiences) that more "cognitively reflective" people also had more accurate initial answers when they didn't have time to think through the question.

On a more practical level, I think a fair amount of good thinking is using your System 2 to train your intuitions, so you have better and better first impressions and taste for how to improve your understanding of the world in the future.

*I think my claim so far is fairly uncontroversial; for example, I expect CFAR to agree with a lot of what I say.

6. Relatedly, most of my forecasting mistakes are due to emotional rather than technical reasons. Here's a Twitter thread from May exploring why; I think I mostly still stand by this.

comment by Aaron Gertler (aarongertler) · 2020-09-28T09:09:26.142Z · score: 10 (9 votes) · EA(p) · GW(p)

Consider making this a top-level post! That way, I can give it the "Forecasting" tag so that people will find it more often later, which would make me happy, because I like this post.

comment by Linch · 2020-10-03T19:46:54.574Z · score: 6 (3 votes) · EA(p) · GW(p)

Thanks! Posted [EA · GW].

comment by Linch · 2020-09-30T00:55:48.428Z · score: 2 (1 votes) · EA(p) · GW(p)

Thanks for the encouragement and suggestion! Do you have recommendations for a really good title?

comment by Aaron Gertler (aarongertler) · 2020-09-30T08:19:01.741Z · score: 2 (1 votes) · EA(p) · GW(p)

Titles aren't my forte. I'd keep it simple. "Lessons learned from six months of forecasting" or "What I learned after X hours of forecasting" (where "X" is an estimate of how much time you spent over six months).

comment by NunoSempere · 2020-09-30T16:09:06.710Z · score: 1 (1 votes) · EA(p) · GW(p)

I second this.

comment by Linch · 2019-09-19T00:28:40.458Z · score: 30 (14 votes) · EA(p) · GW(p)

cross-posted from Facebook.

Sometimes I hear people counseling humility say something like "this question has stumped the best philosophers for centuries/millennia. How could you possibly hope to make any progress on it?". While I concur that humility is frequently warranted and that in many specific cases that injunction is reasonable [1], I think the framing is broadly wrong.


In particular, using geologic time rather than anthropological time hides the fact that there probably weren't that many people actively thinking about these issues, especially carefully, in a sustained way, and making sure to build on the work of the past. For background, 7% of all humans who have ever lived are alive today, and living people compose 15% of total human experience [2] so far!!!


It will not surprise me if there are about as many living philosophers today as there were dead philosophers in all of written history.


For some specific questions that particularly interest me (eg. population ethics, moral uncertainty), the total research work done on these questions is generously less than five philosopher-lifetimes. Even for classical age-old philosophical dilemmas/"grand projects" (like the hard problem of consciousness), total work spent on them is probably less than 500 philosopher-lifetimes, and quite possibly less than 100.


There are also solid outside-view reasons to believe that the best philosophers today are just much more competent [3] than the best philosophers in history, and have access to much more resources[4].


Finally, philosophy can build on progress in natural and social sciences (eg, computers, game theory).


Speculating further, it would not surprise me if, say, a particularly thorny and deeply important philosophical problem could effectively be solved in 100 more philosopher-lifetimes. Assuming 40 years of work and $200,000/year per philosopher, including overhead, this is ~$800 million, or in the same ballpark as the cost of developing a single drug[5].
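
(For concreteness, here's the back-of-the-envelope arithmetic behind that ~$800 million figure, using the same made-up numbers:)

```python
# Ballpark cost of 100 philosopher-lifetimes, using the made-up numbers above:
# 40 working years per lifetime at $200,000/year including overhead.
lifetimes = 100
years_per_lifetime = 40
cost_per_year = 200_000  # USD, including overhead
print(f"${lifetimes * years_per_lifetime * cost_per_year:,}")  # $800,000,000
```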


Is this worth it? Hard to say (especially with such made-up numbers), but the feasibility of solving seemingly intractable problems no longer seems crazy to me.


[1] For example, intro philosophy classes will often ask students to take a strong position on questions like deontology vs. consequentialism, or determinism vs. compatibilism. Basic epistemic humility says it's unlikely that college undergrads can get those questions right in such a short time.


[2] https://eukaryotewritesblog.com/2018/10/09/the-funnel-of-human-experience/


[3] Flynn effect, education, and education of women, among others. Also, just https://en.wikipedia.org/wiki/Athenian_democracy#Size_and_make-up_of_the_Athenian_population. (Roughly as many educated people in all of Athens at any given time as at a fairly large state university.) Modern people (or at least peak performers) being more competent than past ones is blatantly obvious in other fields where priority is less important (eg, marathon runners, chess).


[4] Eg, internet, cheap books, widespread literacy, and the current intellectual world is practically monolingual.


[5] https://en.wikipedia.org/wiki/Cost_of_drug_development

comment by MathiasKirkBonde · 2019-09-19T22:41:31.709Z · score: 4 (3 votes) · EA(p) · GW(p)

If a problem is very famous and unsolved, don't those who have tried solving it include many of the much more competent philosophers alive today? The fact that the problem has not been solved by any of them either would suggest to me that it's a hard problem.

comment by saulius · 2019-09-20T20:37:52.218Z · score: 2 (3 votes) · EA(p) · GW(p)

Honest question: are there examples of philosophical problems that were solved in the last 50 years? And I mean solved by doing philosophy, not by doing mostly unrelated experiments (like this one). I imagine that even if some philosophers felt they had answered a question, others would dispute it. More importantly, the solution would likely be difficult to understand and hence of limited value. I'm not sure I'm right here.

comment by saulius · 2019-09-20T20:46:10.063Z · score: 2 (1 votes) · EA(p) · GW(p)

After a bit more googling I found this, which maybe shows that there have been philosophical problems solved recently. I haven't read about that specific problem though. It's difficult to imagine a short paper solving the hard problem of consciousness, though.

comment by Linch · 2020-10-14T19:01:20.093Z · score: 9 (5 votes) · EA(p) · GW(p)

I enjoyed this list of philosophy's successes, but none of them happened in the last 50 years.

comment by Linch · 2020-10-14T19:00:44.376Z · score: 2 (1 votes) · EA(p) · GW(p)

I'd be interested in having someone with a history-of-philosophy background weigh in on the Gettier question specifically. I thought Gettier problems were really interesting when I first heard about them, but I've also heard that "knowledge as justified true belief" wasn't actually all that dominant a position before Gettier came along.

comment by Linch · 2020-02-24T21:45:08.719Z · score: 26 (9 votes) · EA(p) · GW(p)

cross-posted from Facebook.

Catalyst (a biosecurity conference funded by the Long-Term Future Fund) was incredibly educational and fun.

Random scattered takeaways:

1. I knew going in that everybody there would be much more knowledgeable about bio than I was. I was right. (Maybe more than half the people there had PhDs?)

2. Nonetheless, I felt like most conversations were very approachable and informative for me, from Chris Bakerlee explaining the very basics of genetics to me, to asking Anders Sandberg about some research he did that was relevant to my interests, to Tara Kirk Sell detailing recent advances in technological solutions in biosecurity, to random workshops where novel ideas were proposed...

3. There was a strong sense of energy and excitement from everybody at the conference, much more than at other conferences I've been to (including EA Global).

4. From casual conversations in EA-land, I had gotten the general sense that work in biosecurity is fraught with landmines and information hazards, so it was oddly refreshing to hear so many people talk openly about exciting new possibilities to de-risk biological threats and promote a healthier future, while still being fully cognizant of the scary challenges ahead. I guess I didn't imagine there were so many interesting and "safe" topics in biosecurity!

5. I got a lot more personally worried about coronavirus than I was before the conference, to the point where I think it makes sense to start making some initial preparations and anticipate lifestyle changes.

6. There was a lot more DIY/Community Bio representation at the conference than I would have expected. I suspect this had to do with the organizers' backgrounds; I imagine that if most other people were to organize biosecurity conferences, it'd skew a lot more academic.

7. I didn't meet many (any?) people with a public health or epidemiology background.

8. The Stanford representation was really high, including many people who have never been to the local Stanford EA club.

9. A reasonable number of people at the conference were a) reasonably interested in effective altruism b) live in the general SF area and c) excited to meet/network with EAs in the area. This made me slightly more optimistic (from a high prior) about the value of doing good community building work in EA SF.

10. Man, the organizers of Catalyst are really competent. I'm jealous.

11. I gave significant amounts of money to the Long-Term Future Fund (which funded Catalyst), so I'm glad Catalyst turned out well. It's really hard to forecast the counterfactual success of long-reach plans like this one, but naively this seems like the right approach to help build out the pipeline for biosecurity.

12. Wow, evolution is really cool.

13. Talking to Anders Sandberg made me slightly more optimistic about the value of a few weird ideas in philosophy I had recently, and that maybe I can make progress on them (since they seem unusually neglected).

14. Catalyst had this cool thing where they had public "long conversations": instead of a panel discussion, they'd have two people on stage at a time, and after a few minutes one of the two people gets rotated out. I'm personally not totally sold on the format, but I'd be excited to see more experiments like that.

15. Usually, conferences or other conversational groups I'm in have one of two failure modes: 1) there's an obvious hierarchy (based on credentials, social signaling, or just that a few people have way more domain knowledge than others) or 2) people are overly egalitarian and let useless digressions/opinions clog up the conversational space. Surprisingly, neither happened much here, despite an incredibly heterogeneous group (from college sophomores to lead PIs of academic biology labs to biotech CEOs to DIY enthusiasts to health security experts to randos like me).

16. Man, it seems really good to have more conferences like this, where there's a shared interest but everybody comes from different fields, so there's less obvious hierarchy/status-jockeying.

17. I should probably attend more conferences/network more in general.

18. Being the "dumbest person in the room" gave me a lot more affordance to ask silly questions and understand new stuff from experts. I actually don't think I was that annoying, surprisingly enough (people seemed happy enough to chat with me).

19. Partially because of the energy in the conference, the few times I had to present EA, I mostly focused on the "hinge of history/weird futuristic ideas are important and we're a group of people who take ideas seriously and try our best despite a lot of confusion" angle of EA, rather than the "serious people who do the important, neglected and obviously good things" angle that I usually go for. I think it went well with my audience today, though I still don't have a solid policy for navigating this in general.

20. Man, I need something more impressive on my bio than "unusually good at memes."

comment by Linch · 2020-02-25T09:14:32.333Z · score: 11 (7 votes) · EA(p) · GW(p)

Publication bias alert: Not everybody liked the conference as much as I did. Someone I know and respect thought some of the talks weren't very good (I agreed with them about the specific examples, but didn't think it mattered because really good ideas/conversations/networking at an event + gestalt feel is much more important for whether an event is worthwhile to me than a few duds).

That said, on a meta level, you might expect that people who really liked (or hated, I suppose) a conference/event/book are more likely to write detailed notes about it than people who were lukewarm about it.

comment by Habryka · 2020-02-25T04:40:36.160Z · score: 6 (3 votes) · EA(p) · GW(p)
11. I gave significant amounts of money to the Long-Term Future Fund (which funded Catalyst), so I'm glad Catalyst turned out well. It's really hard to forecast the counterfactual success of long-reach plans like this one, but naively this seems like the right approach to help build out the pipeline for biosecurity.

I am glad to hear that! I sadly didn't end up having the time to go, but I've been excited about the project for a while.

comment by mike_mclaren · 2020-03-01T15:29:31.732Z · score: 3 (2 votes) · EA(p) · GW(p)

Thanks for your report! I was interested but couldn't manage the cross-country trip, and I'm definitely curious to hear what it was like.

comment by tessa · 2020-03-04T02:56:26.737Z · score: 2 (2 votes) · EA(p) · GW(p)

I'd really appreciate ideas for how to try to convey some of what it was like to people who couldn't make it. We recorded some of the talks and intend to edit + upload them, we're writing a "how to organize a conference" postmortem / report, and one attendee is planning to write a magazine article, but I'm not sure what else would be useful. Would another post like this be helpful?

comment by mike_mclaren · 2020-03-05T11:31:01.082Z · score: 2 (2 votes) · EA(p) · GW(p)

We recorded some of the talks and intend to edit + upload them, we're writing a "how to organize a conference" postmortem / report, and one attendee is planning to write a magazine article

That all sounds useful and interesting to me!

Would another post like this be helpful?

I think multiple posts following an event, covering the personal experiences of multiple people (organizers and attendees), can be useful simply for the diversity of their perspectives. Regarding Catalyst in particular, I'm curious about the variety of backgrounds of the attendees and how those backgrounds shaped their goals and experiences during the meeting.

comment by Linch · 2020-02-26T03:25:01.393Z · score: 23 (15 votes) · EA(p) · GW(p)

Over a year ago, someone asked the EA community whether it’s valuable to become world-class at an unspecified non-EA niche or field. Our Forum’s own Aaron Gertler [EA · GW] responded in a post [EA · GW], saying basically that there’s a bunch of intangible advantages for our community to have many world-class people, even if it’s in fields/niches that are extremely unlikely to be directly EA-relevant.

Since then, Aaron became (entirely in his spare time, while working 1.5 jobs) a world-class Magic: The Gathering player, recently winning the DreamHack MtGA tournament and getting $30,000 in prize money, half of which he donated to GiveWell.

I didn't find his arguments overwhelmingly persuasive at the time, and I still don't. But it's exciting to see other EAs come up with unusual theories of change, actually execute on them, and then be wildly successful.

comment by Linch · 2020-01-30T01:58:18.660Z · score: 22 (8 votes) · EA(p) · GW(p)

cross-posted from Facebook.

Reading Bryan Caplan and Zach Weinersmith's new book has made me somewhat more skeptical about Open Borders (from a high prior belief in its value).

Before reading the book, I was already aware of the core arguments (eg, Michael Huemer's right to immigrate, basic cosmopolitanism, some vague economic stuff about doubling GDP).

I was hoping the book would have more arguments, or stronger versions of the arguments I'm familiar with.

It mostly did not.

The book did convince me that the prima facie case for open borders was stronger than I thought. In particular, the section where he argued that a bunch of different normative ethical theories should, all else equal, lead to open borders was moderately convincing. I think it would have updated me towards open borders if I believed more strongly in "weight all mainstream ethical theories equally" moral uncertainty, or if I had previously had a strong belief in a moral theory that I believed was against open borders.

However, I already fairly strongly subscribe to cosmopolitan utilitarianism and see no problem with aggregating utility across borders. Most of my concerns with open borders are related to Chesterton's fence, and Caplan's counterarguments were in three forms:

1. Doubling GDP is so massive that it should override any conservatism prior.
2. The US historically had Open Borders (pre-1900) and it did fine.
3. On the margin, increasing immigration in all the American data Caplan looked at didn't seem to have catastrophic cultural/institutional effects that naysayers claim.

I find this insufficiently persuasive.
___
Let me outline the strongest case I'm aware of against open borders:
Countries are mostly not rich and stable because of the physical resources, or because of the arbitrary nature of national boundaries. They're rich because of institutions and good governance. (I think this is a fairly mainstream belief among political economists). These institutions are, again, evolved and living things. You can't just copy the US constitution and expect to get a good government (IIRC, quite a few Latin American countries literally tried and failed).

We don't actually understand what makes institutions good. Open Borders means the US population will ~double fairly quickly, and this is so "out of distribution" that we should be suspicious of the generalizability of studies that look at small marginal changes.
____
I think Caplan's case is insufficiently persuasive because 1) it's not hard for me to imagine situations bad enough to be worse than doubling GDP is good, 2) pre-1900 US was a very different country/world, and 3) this "out of distribution" thing is significant.

I would find Caplan's book more persuasive if he had used non-US datasets more, especially data from places where immigration is much higher than in the US (maybe within the EU or ASEAN?).

___

I'm still strongly in favor of much greater labor mobility on the margin for both high-skill and low-skill workers. Only 14.4% of the American population are immigrants right now, and I suspect the institutions are strong enough that changing the number to 30-35% is net positive. [EDIT: Note that this is intuition rather than something backed by empirical data or explicit models]

I'm also personally in favor (even if it's negative expected value for the individual country) of a single country (or a few) trying out open borders for a few decades and for the rest of us to learn from their successes and failures. But that's because of an experimentalist social scientist mindset where I'm perfectly comfortable with "burning" a few countries for the greater good (countries aren't real, people are), and I suspect the government of most countries aren't thrilled about this.

___

Overall, 4/5 stars. Would highly recommend to EAs, especially people who haven't thought much about the economics and ethics of immigration.

comment by Aaron Gertler (aarongertler) · 2020-01-30T13:48:53.569Z · score: 2 (1 votes) · EA(p) · GW(p)

If you email this to him, maybe adding a bit more polish, I'd give ~40% odds he'll reply on his blog, given how much he loves to respond to critics who take his work seriously.

It's not hard for me to imagine situations bad enough to be worse than doubling GDP is good

I actually find this very difficult without envisioning extreme scenarios (e.g. a dark-Hansonian world of productive-but-dissatisfied ems). Almost any situation with enough disutility to counter GDP doubling seems like it would, paradoxically, involve conditions that would reduce GDP (war, large-scale civil unrest, huge tax increases to support a bigger welfare state).

Could you give an example or two of situations that would fit your statement here?

comment by Linch · 2020-02-04T02:23:17.497Z · score: 1 (1 votes) · EA(p) · GW(p)
Almost any situation with enough disutility to counter GDP doubling seems like it would, paradoxically, involve conditions that would reduce GDP (war, large-scale civil unrest, huge tax increases to support a bigger welfare state).

I think there was substantial ambiguity in my original phrasing, thanks for catching that!

I think there are at least four ways to interpret the statement.

It's not hard for me to imagine situations bad enough to be worse than doubling GDP is good

1. Interpreting it literally: I am physically capable (without much difficulty) of imagining situations that are bad to a degree worse than doubling GDP is good.

2. Caplan gives some argument for the doubling of GDP that seems persuasive, and claims this is enough to override a conservatism prior. But I'm not confident that the argument is true/robust, and I think it's reasonable to believe that there are possible bad consequences bad enough that, even if I give the argument >50% probability (or >80%), this is not automatically enough to override a conservatism prior, at least not without thinking about it a lot more.

3. Assume by construction that world GDP will double in the short term. I still think there's a significant chance that the world will be worse off.

4. Assume by construction that world GDP will double, and stay 2x baseline until the end of time. I still think there's a significant chance that the world will be worse off.

__

To be clear, when writing the phrasing, I meant it in terms of #2. I strongly endorse #1 and tentatively endorse #3, but I agree that if you interpreted what I meant as #4, what I said was a really strong claim and I need to back it up more carefully.

comment by Aaron Gertler (aarongertler) · 2020-02-04T06:03:02.574Z · score: 2 (1 votes) · EA(p) · GW(p)

Makes sense, thanks! The use of "doubling GDP is so massive that..." made me think that you were taking that as given in this example, but worrying that bad things could result from GDP-doubling that justified conservatism. That was certainly only one of a few possible interpretations; I jumped too easily to conclusions.

comment by Linch · 2020-02-04T08:17:46.126Z · score: 1 (1 votes) · EA(p) · GW(p)

That was not my intent, and it was not the way I parsed Caplan's argument.

comment by Linch · 2020-09-29T01:59:06.300Z · score: 20 (9 votes) · EA(p) · GW(p)

Do people have advice on how to be more emotionally resilient in the face of disaster?

I spent some time this year thinking about things that are likely to be personally bad in the near future (most salient to me right now is the possibility of a contested election + riots, but this is also applicable to the ongoing Bay Area fires/smoke and, to a lesser extent, the ongoing pandemic, as well as future events like climate disasters and wars). My guess is that, after a modicum of precaution, the direct objective risk isn't very high, but it'll *feel* like a really big deal all the time.

In other words, being perfectly honest about my own personality/emotional capacity, there's a high chance that if the street outside my house is rioting, I just won't be productive at all (even if I did the calculations and the objective risk is relatively low).

So I'm interested in anticipating this phenomenon and building emotional resilience ahead of time so such issues won't affect me as much.

I'm most interested in advice for building emotional resilience for disaster/macro-level setbacks. I think it'd also be useful to build resilience for more personal setbacks (eg career/relationship/impact), but I naively suspect that this is less tractable.

Thoughts?

comment by gavintaylor · 2020-10-09T00:10:44.062Z · score: 5 (3 votes) · EA(p) · GW(p)

The last newsletter from Spencer Greenberg/Clearer Thinking might be helpful:

https://www.clearerthinking.org/post/2020/10/06/how-resetting-your-psychological-baseline-can-make-your-life-better

comment by Linch · 2020-10-10T03:08:39.347Z · score: 7 (4 votes) · EA(p) · GW(p)

Wow, reading this was actually surprisingly helpful for some other things I'm going through. Thanks for the link!

comment by Misha_Yagudin · 2020-09-29T10:50:35.580Z · score: 2 (2 votes) · EA(p) · GW(p)

I think it is useful to separately deal with the parts of a disturbing event over which you have an internal or external locus of control. Let's take a look at riots:

  • An external part is them happening in your country. External locus of control means that you need to accept the situation. Consider looking into Stoic literature and exercises (say, negative visualizations) to come to peace with that possibility.
  • An internal part is being exposed to dangers associated with them. Internal locus of control means that you can take action to mitigate the risks. Consider having a plan to temporarily move to a likely peaceful area within your country or to another country.

comment by Linch · 2020-09-12T09:27:18.675Z · score: 13 (5 votes) · EA(p) · GW(p)

I'm worried about a potential future dynamic where an emphasis on forecasting/quantification in EA (especially if it has significant social or career implications) will have the adverse effect of biasing people towards silence/vagueness in areas where they don't feel ready to commit to a probability forecast.

I think it's good that we appear to be moving in the direction of greater quantification and being accountable for probability estimates, but I think there's a very real risk that people see this and then become scared of putting their loose thoughts/intuitive probability estimates on record. This may result in overall worse group epistemics, because people hedge too much and are unwilling to commit to public probabilities.

See analogy to Jeff Kaufman's arguments on responsible transparency consumption:

https://www.jefftk.com/p/responsible-transparency-consumption

comment by Linch · 2020-10-13T18:50:43.435Z · score: 12 (4 votes) · EA(p) · GW(p)

Malaria kills a lot more people >age 5 than I would have guessed (Still more deaths <=5 than >5, but a much smaller ratio than I intuitively believed). See C70-C72 of GiveWell's cost-effectiveness estimates for AMF, which itself comes from the Global Burden of Disease Study.

I've previously cached the thought that malaria primarily kills people who are very young, but this is wrong.

I think the intuition slip here is that malaria is indeed a lot more fatal for young children; however, there are many more people over age 5 than under, so the totals end up closer than the fatality rates alone suggest.
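
(To make the intuition slip concrete, here's a minimal sketch with purely made-up illustrative numbers; these are not the actual GBD figures:)

```python
# Made-up illustrative numbers: a 10x higher fatality rate in a smaller group
# can still yield a deaths ratio far smaller than the fatality-rate ratio,
# because many more people are in the older group.
under5_cases, under5_fatality = 1_000_000, 0.010  # 1.0% fatality (illustrative)
over5_cases, over5_fatality = 5_000_000, 0.001    # 0.1% fatality (illustrative)

under5_deaths = under5_cases * under5_fatality    # 10,000
over5_deaths = over5_cases * over5_fatality       # 5,000
print(under5_deaths / over5_deaths)               # 2.0, not the 10x the rates alone suggest
```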

comment by Linch · 2020-10-15T07:13:13.525Z · score: 11 (3 votes) · EA(p) · GW(p)

What would a company/organization with a really important secondary mandate to focus on the general career development of its employees actually look like? How would trainings be structured, what would growth trajectories look like, etc.?

When I was at Google, I got the distinct impression that while "career development" and "growth" were common buzzwords, most of the actual programs on offer were more focused on employee satisfaction/retention than growth. (For example, I've essentially never gotten any feedback on my selection of training courses or books that I bought with company money, which at the time I thought was awesome flexibility, but in retrospect was not a great sign of caring about growth on the part of the company).

Edit: Upon a reread I should mention that there are other ways for employees to grow within the company, eg by having some degree of autonomy over what projects they want to work on.

I think there are theoretical reasons for employee career growth being underinvested in by default. Namely, the costs of career growth are borne approximately equally by the employer and the employee (obviously this varies from case to case), while the benefits of career growth mostly accrue to the employee and their future employers.

This view will predict that companies will mostly only invest in general career development/growth of employees if one of a number of conditions are met:

  • The investment is so valuable that it pays for itself over the expected length of the employee's tenure
  • The "investment" has benefits to the company (and hopefully the employee as well) other than via building the employee's skillsets. Eg, it makes the employee more likely to stay at the company.
  • Relatedly, the investment only grows (or disproportionately grows) the employee's skillset in ways that are relevant to the work of the company, and not that of other workplaces (even close competitors)
  • There are principal-agent problems within the company, such that managers are not always acting in the company's best interest when they promote career development for their underlings.

I suppose that in contrast to companies, academia is at least incentivized to focus on general career development (since professors are judged at least somewhat on the quality of their graduate students' outputs/career trajectories). I don't know in practice how much better academia is than industry however. (It is at least suggestive that people often take very large pay cuts to do graduate school).

I think the question of how to do employee career development well is particularly interesting/relevant to EA organizations, since there's a sense in which developing better employees is a net benefit to "team EA" even if your own org doesn't benefit, or might die in a year or three. A (simplified) formal view of this is that effective altruism captures the value of career development over the expected span of someone continuing to do EA activities [EA · GW].*

*eg, doing EA-relevant research or policy work, donating, working at an EA org, etc.

comment by Ozzie Gooen (oagr) · 2020-10-15T17:44:02.895Z · score: 4 (2 votes) · EA(p) · GW(p)

Definitely agreed. That said, I think some of this should probably be looked at through the lens of "Should EA as a whole help people with personal/career development, rather than specific organizations, since the benefits will accrue to the larger community (especially if people only stay at orgs for a few years)?"

I'm personally in favor of expensive resources being granted to help people early in their careers. You can also see some of this in what OpenPhil/FHI funds; there's a big focus on helping people get useful PhDs (though this helps only a small minority of the entire EA movement).

comment by Linch · 2020-01-17T13:24:11.084Z · score: 11 (9 votes) · EA(p) · GW(p)

I find the unilateralist’s curse a particularly valuable concept to think about. However, I now worry that “unilateralist” is an easy label to tack on, and whether a particular action is unilateralist or not is susceptible to small changes in framing.

Consider the following hypothetical situations:

  1. Company policy vs. team discretion
    1. Alice is a researcher in a team of scientists at a large biomedical company. While working on the development of an HIV vaccine, the team accidentally created an air-transmissible variant of HIV. The scientists must decide whether to publish their discovery with the rest of the company, knowing that leaks may exist, and the knowledge may be used to create a devastating biological weapon, but also that it could help those who hope to develop defenses against such weapons, including other teams within the same company. Most of the team thinks they should keep it quiet, but company policy is strict that such information must be shared with the rest of the company to maintain the culture of open collaboration.
    2. Alice thinks the rest of the team should either share this information or quit. Eventually, she tells her vice president her concerns, who relayed it to the rest of the company in a company-open document.
    3. Alice does not know if this information ever leaked past the company.
  2. Stan and the bomb
    1. Stan is an officer in charge of overseeing a new early warning system intended to detect (nuclear) intercontinental ballistic missiles from an enemy country. The warning system appeared to have detected five missiles heading towards his homeland, with the alert quickly passing through 30 layers of verification. Stan suspected this was a false alarm, but was not sure. Military instructions were clear that such warnings must immediately be relayed upwards.
    2. Stan decided not to relay the message to his superiors, on the grounds that it was probably a false alarm and he didn’t want his superiors to mistakenly assume otherwise and therefore start a catastrophic global nuclear war.
  3. Listen to the UN, or other countries with similar abilities?
    1. Elbonia, a newly founded Republic, has an unusually good climate engineering program. Elbonian scientists and engineers are able to develop a comprehensive geo-engineering solution that they believe can reverse the climate crisis at minimal risk. Further, the United Nations’ General Assembly recently passed a resolution that stated in no uncertain terms that any nation in possession of such geo-engineering technology must immediately a) share the plans with the rest of the world and b) start the process of lowering the world’s temperature by 2 °C.
    2. However, there’s one catch: Elbonian intelligence knows (or suspects) that five other countries have developed similar geo-engineering plans, but have resolutely refused to release or act on them. Furthermore, four of the five countries have openly argued that geo-engineering is dangerous and has potentially catastrophic consequences, but refused to share explicit analysis why (Elbonia’s own risk assessment finds little evidence of such dangers).
    3. Reasoning that he should be cooperative with the rest of the world, the prime minister of Elbonia made the executive decision to obey the General Assembly’s resolution and start lowering the world’s temperature.
  4. Cooperation with future/past selves, or other people?
    1. Ishmael’s crew has a white elephant holiday tradition, where individuals come up with weird and quirky gifts for the rest of the crew secretly, and do not reveal what the gifts are until Christmas. Ishmael comes up with a brilliant gift idea and hides it.
    2. While drunk one day with other crew members, Ishmael accidentally lets slip that he was particularly proud of his idea. The other members egg him on to reveal more. After a while, Ishmael finally relents when some other crew members reveal their ideas, reasoning that he shouldn’t be a holdout. Ishmael suspects that he will regret his past self’s decision when he becomes more sober.

Putting aside whether the above actions were correct or not, in each of the above cases, have the protagonists acted unilaterally?

I think this is a hard question to answer. My personal answer is “yes,” but I think another reasonable person can easily believe that the above protagonists were fully cooperative. Further, I don’t think the hypothetical scenarios above were particularly convoluted edge cases. I suspect that in real life, figuring out whether the unilateralist’s curse applies to your actions will hinge on subtle choices of reference classes. I don’t have a good solution to this.

comment by JP Addison (jpaddison) · 2020-01-19T02:25:22.498Z · score: 3 (2 votes) · EA(p) · GW(p)

I really like this (I think you could make it top-level if you wanted). I think these are cases of multiple levels of cooperation. If you're part of an organization that wants to be uncooperative (and you can't leave cooperatively), then you're going to be uncooperative with one of those levels.

comment by Linch · 2020-01-19T04:15:12.399Z · score: 1 (1 votes) · EA(p) · GW(p)

Good point. Now that you bring this up, I vaguely remember a Reddit AMA where an evolutionary biologist made the (obvious in hindsight, but never occurred to me at the time) claim that with multilevel selection, altruism on one level often means defecting on a higher (or lower) level. Which probably unconsciously inspired this post!

As for making it top level, I originally wanted to include a bunch of thoughts on the unilateralist's curse as a post, but then I realized that I'm a one-trick pony in this domain... hard to think of novel/useful things that Bostrom et al. haven't already covered!

comment by Linch · 2020-10-13T20:54:00.699Z · score: 5 (3 votes) · EA(p) · GW(p)

I'm now pretty confused about whether normative claims can be used as evidence in empirical disputes. I generally believed no, with the caveat that, for humans, moral beliefs are built on a scaffolding of facts, and sometimes it's easier to respond to an absurd empirical claim with a moral claim that captures the gestalt of those background empirical beliefs, if there isn't an immediately accessible empirical counter-claim.

I talked to a philosopher who disagreed, and roughly believed that strong normative claims can be used as evidence against more confused/less certain empirical claims, and I got a sense from the conversation that his view is much more common in academic philosophy than mine.

Would like to investigate further.

comment by MichaelDickens · 2020-10-15T03:27:43.611Z · score: 8 (2 votes) · EA(p) · GW(p)

I haven't really thought about it, but it seems to me that if an empirical claim implies an implausible normative claim, that lowers my subjective probability of the empirical claim.

comment by Linch · 2020-03-11T21:00:58.105Z · score: 4 (2 votes) · EA(p) · GW(p)

Updated version on https://docs.google.com/document/d/1BDm_fcxzmdwuGK4NQw0L3fzYLGGJH19ksUZrRloOzt8/edit?usp=sharing

Cute theoretical argument for #flattenthecurve at any point in the distribution

  1. What is #flattenthecurve?
    1. The primary theory behind #flattenthecurve is: assuming that everybody who will get COVID-19 will eventually get it anyway, is there anything else you can do?
    2. It turns out it’s very valuable to
      1. Delay the spread so that a) the peak of the epidemic spread is lower (#flattenthecurve)
      2. Also to give public health professionals, healthcare systems, etc more time to respond (see diagram below)
      3. A tertiary benefit is that ~unconstrained disease incidence (until it gets to herd immunity levels) is not guaranteed, with enough time to respond, aggressive public health measures (like done in Wuhan, Japan, South Korea etc) can arrest the disease at well below herd immunity levels
  2. Why should you implement #flattenthecurve
    1. If you haven’t been living under a rock, you’ll know that COVID-19 is a big deal
    2. We have nowhere near the number of respirators, ICU beds, etc, for the peak of uncontrolled transmission (Wuhan ran out of ICU beds, and they literally built a dozen hospitals in a week, a feat Western governments may have trouble doing)
    3. https://www.flattenthecurve.com/ has more detailed arguments
  3. What are good #flattenthecurve policies?
    1. The standard stuff like being extremely aggressive about sanitation and social distancing
    2. https://www.flattenthecurve.com/ has more details
  4. When should you implement #flattenthecurve policies?
    1. A lot of people are waiting for specific “fire alarms” (eg, public health authorities sounding the bell, the WHO calling it a pandemic, X cases in a city) before they start taking measures.
    2. I think this is wrong.
    3. The core (cute) theoretical argument I have is that if you think #flattenthecurve is at all worth doing at any time, then as long as you're confident you are on the growth side of the exponential growth curve, stretching the doubling time from X days (say) to 2X days is good from a #flattenthecurve and public health perspective no matter where you are on the curve.
  5. Wait, what?
    1. Okay, let’s consider a few stricter versions of the problem
    2. Exponential growth guaranteed + all of society
      1. One way to imagine this is if #society all implemented your policy (because of some Kantian or timeless decision theory sense, say)
      2. Suppose you are only willing to take measures for Y weeks, and for whatever reason the measures are only strong enough to slow down the virus's spread rather than reverse the curve.
      3. If the doubling time was previously 3 days and everybody doing this can stretch it to 8 days (or shrink it to 2 days), then it's roughly equally good (bad) no matter when on the curve you do those measures (see the sketch after this list).
    3. Exponential growth NOT guaranteed + all of society
      1. Next, relax the assumption of exponential growth being guaranteed and assume that measures are strong enough to reverse the curve of exponential growth (as happened in China, South Korea, Japan)
      2. I think you get the same effect where the cost of X weeks of your measures should be the same no matter where you are on the curve, plus now you've gotten rid of the disease (with the added benefit that if you initiate your measures early, fewer people die/get sick directly and it's easier to track new cases)
      3. A downside is that a successful containment strategy means you get less moral credit/people will accuse you of fearmongering, etc.
    4. NOT all of society
      1. Of course, as a private actor you can’t affect all of society. Realistically (if you push hard), your actions will be correlated with only a small percentage of society. So relax the assumption that everybody does it, and assume only a few % of people will do the same actions as you.
      2. But I think for #flattenthecurve purposes, the same arguments still roughly hold.
      3. Now you’re just (eg) slowing the growth rate from 3 days to 3.05 days instead of 3 days to 8 days.
      4. But the costs are ~ linear to the number of people who implement #flattenthecurve policies, and the benefits are still invariant to timing.
  6. Practical considerations
    1. How do we know that we are on the growth side of the exponential/S curve?
      1. Testing seems to lag actual cases a lot.
      2. My claim is that approximately if your city has at least one confirmed or strongly suspected case of community transmission, you’re almost certainly on the exponential trajectory
    2. Aren’t most other people’s actions different depending on where you are on the curve?
      1. Sure, so maybe some mitigation actions are more effective depending on other people’s actions (eg, refusing to do handshakes may be more effective when not everybody has hand sanitizer than when everybody regularly uses hand sanitizer, for example)
      2. I think the general argument is still the same however
    3. Over the course of an epidemic, wouldn't the different actions result in different R0 and doubling times, so you're then doing distancing or whatever from a different base?
      1. Okay, I think this is the best theoretical argument against the clean exponential curve stuff.
      2. I still think it's not obvious that you should do more #flattenthecurve policies later on; if anything, this pushes you towards doing it earlier
  7. Conclusion
    1. If you think #flattenthecurve is worthwhile to do at all (which I did not argue for much here, but is extensively argued elsewhere), it’s at least as good to do it now as it is to do it later, and plausibly better to do soon rather than later.
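
(A minimal sketch of the timing-invariance claim in section 5, under the toy assumption of pure exponential growth with no herd-immunity or behavioral feedback; all numbers are illustrative:)

```python
# Toy model: pure exponential growth, so growth factors multiply. A fixed
# 3-week window of measures that stretch the doubling time from 3 days to
# 8 days yields the same final case count whether it comes early or late.
def final_cases(initial, schedule):
    """schedule: list of (days, doubling_time_in_days) segments."""
    cases = initial
    for days, doubling_time in schedule:
        cases *= 2 ** (days / doubling_time)
    return cases

normal, slowed = 3.0, 8.0                        # doubling times in days
measures_early = [(21, slowed), (39, normal)]    # act during weeks 1-3
measures_late  = [(39, normal), (21, slowed)]    # act during the last 3 weeks

print(final_cases(100, measures_early))  # identical...
print(final_cases(100, measures_late))   # ...because multiplication commutes
```
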
comment by Linch · 2019-12-28T03:35:47.270Z · score: 3 (2 votes) · EA(p) · GW(p)

Economic benefits of mediocre local human preference modeling.

Epistemic status: Half-baked, probably dumb.

Note: writing is mediocre because it's half-baked.

Some vague brainstorming of economic benefits from mediocre human preference models.

Many AI Safety proposals include understanding human preferences as one of their subcomponents [1]. While this is not obviously good[2], human modeling seems at least plausibly relevant and good.

Short-term economic benefits often spur additional funding and research interest [citation not given]. So a possible question to ask is whether we can get large economic benefits from a system with the following properties (each assumption can later be relaxed):

1. Can run on a smartphone in my pocket

2. Can approximate simple preference elicitations at many times a second

3. Low fidelity, has both high false-positive and false-negative rates

4. Does better on preferences with lots of training data ("in-distribution")

5. Initially works better on simple preferences (preference elicitations where it takes me ~15 seconds to think of an answer, say), but has continuous economic benefits from better and better models.

An *okay* answer to this question is recommender systems (ads, entertainment). But I assume those are optimized to heck already so it's hard for an MVP to win.

I think a plausibly better answer to this is market-creation/bidding. The canonical example is ridesharing like Uber/Lyft, which sells a heterogeneous good to both drivers and riders. Right now they have a centralized system that tries to estimate market-clearing prices, but imagine instead if riders and drivers bid on how much they're willing to pay/take for a ride from X to Y with Z other riders?

Right now, this is absurd because human preference elicitations take up time/attention for humans. If a driver has to scroll through 100 possible rides in her vicinity, the experience will be strictly worse.

But if a bot could report your preferences for you? I think this could make markets a lot more efficient, and also gives a way to price in increasingly heterogeneous preferences. Some examples:

1. I care approximately zero about cleanliness or make of a car, but I'm fairly sensitive to tobacco or marijuana smell. If you had toggles for all of these things in the app, it'd be really annoying.

2. A lot of my friends don't like/find it stressful to make small talk on a trip, but I've talked to drivers who chose this job primarily because they want to talk on the job. It'd be nice if both preferences are priced in.

3. Some riders like drivers who speak their native language, and vice versa.

A huge advantage of these markets is that "mistakes" are pricey but not incredibly so. Ie, I'd rather not overbid for a trip that isn't worth it, but the consumer/driver surplus from pricing in heterogeneous preferences at all can easily make up for the occasional (or even frequent) mispricing.

There's probably a continuous extension of this idea to matching markets with increasingly sparse data (eg, hiring, dating).

One question you can ask is why is it advantageous to have this run on a client machine at all, instead of aggregative human preference modeling that lots of large companies (including Uber) already do?

The honest high-level answer is that I guess this is a solution in search of a problem, which is rarely a good sign...

A potential advantage of running it on your smartphone (imagine a plug-in app that runs "Linch's Preferences" with an API other people can connect to) is that it legally makes the "Marketplace" idea for Uber and companies like Uber more plausible? Like right now a lot of them claim to have a marketplace except they look a lot like command-and-control economies; if you have a personalized bot on your client machine bidding on prices, then I think the case would be easier to sell.
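
To make the "plug-in app with an API" idea slightly more concrete, here's a minimal hypothetical sketch. Every name, field, and the linear scoring rule below is invented for illustration; this is not any real Uber or marketplace API, and a real version would presumably use a learned preference model rather than hand-set weights.

```python
# Hypothetical sketch of a client-side "preference bot" that turns a rider's
# preferences into automatic bids on heterogeneous ride offers.
from dataclasses import dataclass

@dataclass
class RideOffer:
    eta_minutes: float
    smells_of_smoke: bool
    chatty_driver: bool

class PreferenceBot:
    """Bids on offers on the rider's behalf; the weights are invented stand-ins
    for what would really be a learned model of the rider's preferences."""

    def __init__(self, base_value, smoke_penalty, chat_bonus, wait_cost_per_minute):
        self.base_value = base_value
        self.smoke_penalty = smoke_penalty
        self.chat_bonus = chat_bonus
        self.wait_cost_per_minute = wait_cost_per_minute

    def bid(self, offer: RideOffer) -> float:
        # Dollar value of this particular ride to this particular rider.
        value = self.base_value - self.wait_cost_per_minute * offer.eta_minutes
        if offer.smells_of_smoke:
            value -= self.smoke_penalty
        if offer.chatty_driver:
            value += self.chat_bonus
        return max(value, 0.0)  # submitted automatically, no toggles or scrolling

# A rider who is sensitive to smoke and mildly enjoys small talk:
bot = PreferenceBot(base_value=20.0, smoke_penalty=8.0, chat_bonus=1.5,
                    wait_cost_per_minute=0.25)
offer = RideOffer(eta_minutes=6, smells_of_smoke=False, chatty_driver=True)
print(bot.bid(offer))  # 20.0 - 1.5 + 1.5 = 20.0
```

The point is just that the bid can price in heterogeneous preferences without taking any extra attention from the rider; mispricing is possible but bounded.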


[1] https://openai.com/blog/deep-reinforcement-learning-from-human-preferences/

[2] https://intelligence.org/2019/02/22/thoughts-on-human-models/

comment by Linch · 2020-07-18T23:27:20.955Z · score: 2 (1 votes) · EA(p) · GW(p)

Cross-posted from Facebook

On the meta-level, I want to think hard about the level of rigor I want to have in research or research-adjacent projects.

I want to say that the target level of rigor I should have is substantially higher than for typical FB or Twitter posts, and way lower than research papers.

But there's a very wide gulf! I'm not sure exactly what I want to do, but here are some gestures at the thing:

- More rigor/thought/data collection should be put into it than 5-10 minutes typical of a FB/Twitter post, but much less than a hundred or a few hundred hours on papers.
- I feel like there are a lot of things that are worth somebody looking into it for a few hours (more rarely, a few dozen), but nowhere near the level of a typical academic paper?
- Examples that I think are reflective of what I'm interested in are some of Georgia Ray and Nuno Sempere's lighter posts, as well as Brian Bi's older Quora answers on physics. (back when Quora was good)
- "research" has the connotation of pushing the boundaries of human knowledge, but by default I'm more interested in pushing the boundaries of my own knowledge? Or at least the boundaries of my "group's" knowledge.
- If the search for truthful things shakes out to have some minor implications on something no other human currently knows, that's great, but by default I feel like aiming for novelty is too constraining for my purposes.
- Forecasting (the type done on Metaculus or Good Judgment Open) feels like a fair bit of this. Rarely do forecasters (even/especially really good ones) discover something nobody already knows; rather, the difficulty comes almost entirely from finding evidence that's already "out there" somewhere in the world and then weighing the evidence and probabilities accordingly.
- I do think more forecasting should be done. But forecasting itself provides very few bits of information (just the final probability distribution on a well-specified problem). Often, people are also interested in your implicit model, the most salient bits of evidence you discovered, etc. This seems like a good thing to communicate.
- It's not clear what the path to impact here is. Probably what I'm interested in is what Stefan Schubert calls "improving elite epistemics," but I'm really confused on whether/why that's valuable.
- Not everything I or anybody does has to be valuable, but I think I'd be less excited to do medium rigor stuff if there's no or minimal impact on the world?
- It's also not clear to me how much I should trust my own judgement (especially in out-of-distribution questions, or questions much harder to numerically specify than forecasting).
- How do I get feedback? The obvious answer is from other EAs, but I take seriously worries that our community is overly insular.
- Academia, in contrast, has a very natural expert feedback mechanism in peer review. But as mentioned earlier, peer review pre-supposes a very initially high level of rigor that I'm usually not excited about achieving for almost all of my ideas.
- Also on a more practical level, it might just be very hard for me to achieve paper-worthy novelty and rigor in all but a handful of ideas?
- In the few times in the past I reached out to experts (outside the EA community) for feedback, I managed to get fairly useful stuff, but I strongly suspect this is easier for precise well-targeted questions than some of the other things I'm interested in?
- Also varies from field to field; for example, a while back I was able to get some feedback on questions like water rights, but I couldn't find public contact information for climate modeling scientists after a modest search (presumably because the latter field is much more politicized these days)
- If not for pre-existing connections and connections-of-connections, I also suspect it'd be basically impossible to get ahold of infectious disease or biosecurity people to talk to in 2020.
- In terms of format, "blog posts" seems the most natural. But I think blog posts could mean anything from "Twitter post with slightly more characters" to "stuff Gwern writes 10,000 words on." So doesn't really answer the question of what to do about the time/rigor tradeoff.

Another question that is downstream of what I want to do is branding. Eg, some people have said that I should call myself an "independent researcher," but this feels kinda pretentious to me? Like when I think "independent research" I think "work of a level of rigor and detail that could be publishable if the authors wanted to conform to publication standards," but mostly what I'm interested in is lower quality than that? Examples of what I think of as "independent research" are stuff that Elizabeth van Nostrand, Dan Luu, Gwern, and Alexey Guzey sometimes work on (examples below).

____

Stefan Schubert on elite epistemics: https://twitter.com/StefanFSchubert/status/1248930229755707394

Negative examples (too little rigor):

- Pretty much all my FB posts?

Negative examples (too much rigor/effort):

- almost all academic papers

- many of Gwern's posts

- eg https://www.gwern.net/Melatonin

- https://davidroodman.com/david/The%20risk%20of%20geomagnetic%20storms%205%20dr.pdf

- https://danluu.com/input-lag/

- https://guzey.com/books/why-we-sleep/

(To be clear, by "negative examples" I don't mean to associate them with negative valence. I think a lot of that work is extremely valuable to have; it's just that I don't think most of the things I want to do are sufficiently interesting/important to spend as much time on. Also, on a practical level, I'm not yet strong enough to replicate most work at that level.)

Positive examples:

https://brianbi.ca/physics

https://nunosempere.github.io/ea/PastPandemics

https://eukaryotewritesblog.com/2019/05/19/naked-mole-rats-a-case-study-in-biological-weirdness/