Discussion about this post

User's avatar
Inside The Black Box's avatar

The most important part of this report might be footnote 2. METR acknowledges companies could exit the process silently at any time, that they applied a "relatively high bar" before pushing back on redactions, and that they refrained from unflattering claims to preserve working relationships. METR is doing important work here, but they're also describing the structural limits of voluntary evaluation. The evaluator needs the relationship more than the company needs the evaluation.

No posts

Ready for more?