Scientists Measure AI Capabilities by Task Duration
May 3, 2025
Researchers propose a new metric to evaluate AI systems based on how long they can maintain performance on complex tasks compared to humans.