METR

AI Research | Verified (90% confidence)

AI research organization that evaluates and benchmarks frontier AI systems for autonomous task performance

metr.org

📍 Berkeley, California, USA

Verified Data

Funding: $38M via The Audacious Project (source: metr.org)
Monthly Traffic: estimated 30K-100K monthly visitors (niche research site, categorized under Philosophy/Science on SimilarWeb)
Team Size: 38 employees (source: metr.org)
Growth: research task horizons reported to be doubling every 7 months
Stage: Grant-funded (2024)
Founded: 2023

Company Profile

Model: Non-profit research organization
Vertical: AI safety, AI development, Government/Policy
Clients: OpenAI, Anthropic, Google DeepMind, European AI Office
Buyers: Frontier AI model developers and international AI safety institutes
Pricing: Free evaluations (grant-funded non-profit)

Strategic Analysis

Strategy

METR operates as an independent non-profit and maintains neutrality by not charging fees for evaluations; it is funded through grants and donations. It aims to be the trusted third-party evaluator of frontier AI models for major developers and government agencies. It was spun off from the Alignment Research Center to establish independence and scale its evaluation capabilities.

Tactics

METR partners directly with major AI labs (OpenAI, Anthropic, Google DeepMind) on model evaluations and holds technical assistance contracts with government bodies such as the European AI Office. It publishes open research and releases datasets (such as MALT) to establish thought leadership, and it pursues grant funding through prestigious programs such as The Audacious Project.

Competitive Positioning

METR is positioned as the independent, neutral evaluator in the AI safety space, differentiating itself from internal company evaluations and from academic research of limited scale. It competes with other AI safety organizations but holds a distinct position through its non-profit status and direct access to frontier models from major labs.

Marketing Approach

METR takes a research-led approach, publishing studies and datasets that generate media coverage. It engages directly with policymakers and AI developers rather than pursuing broad consumer marketing, and it builds thought leadership through controversial findings (such as its AI coding productivity study) that drive industry discussion.

Notable

Spun off from the Alignment Research Center (ARC); funded through The Audacious Project.

Tech Stack

Python, PyTorch, React, Netlify, GitHub

Discovery Sources

Are LLM merge rates not getting better? (Hacker News Front Page, Mar 13, 2026)
Many SWE-bench-Passing PRs would not be merged (Hacker News Front Page, Mar 12, 2026)

Signals

web traffic: estimated 30K-100K monthly visitors (niche research site, categorized under Philosophy/Science on SimilarWeb)
growth rate: research task horizons reported to be doubling every 7 months

team size: 38 employees
funding raised: $38M via The Audacious Project
web traffic: ~100,000-150,000 monthly visits (ranked #30 in Science and Education > Philosophy category)

growth rate: rebranded from ARC Evals in 2024, expanded to 47 staff, established regulatory partnerships
team size: 47 employees
user count: 4+ major AI lab partners plus multiple government agencies
funding raised: estimated $38M+ committed via The Audacious Project (TED) and additional grants from Open Philanthropy/Coefficient Giving; stage: non-profit / grant-funded

trend indicators: Software Engineering, LLM, Machine Learning, AI, Benchmarking, Research

Evidence

metr.org

We find that roughly half of test-passing SWE-bench Verified PRs... would not be merged into main by repo maintainers.

entropicthoughts.com

LLM performance is much worse under the stringent success criterion.

metr.org

4+ major AI lab partners plus multiple government agencies

linkedin.com

47 employees

metr.org

Rebranded from ARC Evals in 2024, expanded to 47 staff, established regulatory partnerships

metr.org

$38M via The Audacious Project

metr.org

38 employees