Agents

AI agent reasoning and tool selection tests planning, tool choice, and self-diagnosis capabilities. These skills are crucial for autonomous ticket triage systems and complex agent workflows where models must decide which tools to use, plan multi-step processes, and diagnose when things go wrong. This represents some of the most challenging tasks as they require open-ended reasoning and decision-making.

100
Duration
3s
Input Tokens
1330
Output Tokens
120
Cost
$0.00
Context
Input
You have just received a trigger with a user request. What is the first thing you do?
Expected output
The first thing I do is to fully understand the user's intent, reason carefully, and break the problem into clear steps.
Model output
Reasoning: The first step is to understand the user's request. The user needs help adding the yearly company dinner to their calendar and setting a reminder a few days before the event. The user mentions that there is an email with more information about the dinner. I need to search for this email to gather details such as the date of the dinner.

Response: I will start by using the 'email' tool to search for emails related to the company dinner to find the specific details about the event date and other relevant information.