Untrusted smart models and trusted dumb models — LessWrong

Untrusted smart models and trusted dumb models — LessWrong
[Ryan Greenblatt originally made this point to me a while ago, and we then developed a bunch of these ideas together. Thanks to Paul Christiano and a…
Crossposted from the AI Alignment Forum . May contain more technical jargon than usual. Read More



Related Stories

See All