METR

METR

Home
Archive
About

Sitemap - 2024 - METR

Evaluating frontier AI R&D capabilities of language model agents against human experts

The Rogue Replication Threat Model

Response to Bureau of Industry and Security’s proposed AI reporting requirements

New Support Through The Audacious Project

Details about METR’s preliminary evaluation of o1-preview

Response to U.S. AISI Draft “Managing Misuse Risk for Dual-Use Foundation Models”

Common Elements of Frontier AI Safety Policies

Details about METR’s preliminary evaluation of GPT-4o

An update on our general capability evaluations

Response to NIST Draft Generative AI Profile

ML Engineers Needed for New AI R&D Evals Project

Emma Abele is METR’s new Executive Director

Autonomy Evaluation Resources

Example autonomy evaluation protocol

Guidelines for capability elicitation

Measuring the impact of post-training enhancements

Portable Evaluation Tasks via the METR Task Standard

2023 Year In Review

© 2026 METR · Privacy ∙ Terms ∙ Collection notice
Start your SubstackGet the app
Substack is the home for great culture