Announcing the Harvard AI Safety Team 2022-06-30T18:34:04.032Z


Comment by Alexander Davies on Without specific countermeasures, the easiest path to transformative AI likely leads to AI takeover · 2022-07-22T10:44:27.069Z · EA · GW

I'm pretty unconvinced that your "suggests a significant number of fundamental breakthroughs remain to achieve PASTA" is strong enough to justify the odds being "approximately 0," especially when the evidence is mostly just expecting  tasks to stay hard as we scale (something which seems hard to predict, and easy to get wrong). Though it does seem that innovation in certain domains may lead to long episode lengths and inaccurate human evaluation, it also seems like innovation in certain fields (e.g., math) could easily not have this problem (i.e., in cases where verifying is much easier than solving).

Comment by Alexander Davies on Next week I'm interviewing Will MacAskill — what should I ask? · 2022-04-10T17:51:22.053Z · EA · GW

I'd like his thoughts on focusing on "Long-Termism" vs "Existential Risk," especially in the context of community building (i.e. responding "Long-Termism" vs. "Existential Risk" and Simplify EA Pitches to "Holy Shit, X-Risk"). 

Comment by Alexander Davies on [deleted post] 2022-04-08T02:02:18.496Z

Worth noting that (1) the AST is for people already planning to go into alignment after graduating (and isn't an intro program), and (2) I usually have backups prepared in case people have already read the thing (I don't think showing up 30 minutes in would be great!).

Comment by Alexander Davies on [deleted post] 2022-04-07T15:09:51.180Z

Thanks for the Harvard AI Safety Team shout-out! I do think in person reading is great, because it (1) creates a super low barrier to showing up, and (2) feels good/productive to be in a room with everyone silently reading. Two points on this:

  1. We usually read for much more than 30 minutes. Our meetings are 2 hours (5:30-7:30), and often over half is silently reading (usually alternating with discussion/lecture).
  2. Many people (myself included) prefer reading physical paper (shame!). I usually print out the readings (and I've given out binders). I think there are some people who learn better reading on paper, but wouldn't be bothered to actually print things out.