Booking flights remains the primary benchmark use case for AI agents, showcasing practical utility that users expect from intelligent systems.