
benchmarks safetyNew
METR
Independent frontier-model dangerous-capability evaluator
weight 0.0Flux 62▴FreeLaunched 2026-06
💸 No earnings reported yet
What it is
METR (Model Evaluation and Threat Research) is a nonprofit that runs autonomous-capability and dangerous-capability evaluations of frontier models, including pre-deployment testing for major labs and AI Safety Institutes.
How AI plugs in
Designs and runs empirical evaluations measuring AI systems' autonomous and dangerous capabilities.
★ Reviews
No reviews yet — be the first.Your rating
