A robot is sprinting towards you. Do you want it running on Claude or Grok?

AI & Machine Learning, Software Development (openrouter.ai)
AI, machine learning, language models, battle royale, alignment, cost per win, cost per kill, Elo system, multiplayer game, OpenRouter, RoyaleBench

Author: Usu

Date: 6/17/2026

Article Summary:
The author, Jacky Liang, ran an experiment in which 11 large language models (LLMs) were dropped into a 2D battle royale game. The model that won the most games was not the one with the highest scores on standard benchmarks, but one that was less aligned with traditional notions of "good" behavior.