Data Normalization

Data processing and normalization tasks evaluate structured output from messy prose and different structures. This capability is essential for catalogue and product pipelines where data needs to be extracted from unstructured text and formatted consistently. Models must understand the desired output format and extract relevant information accurately.

0
Duration
18s
Input Tokens
963
Output Tokens
693
Cost
$0.00
Context
Input
Analyze the following cooking instructions and return a structured response following these criteria:
1. Identify all ingredients used in the recipe, even if optional
2. Classify each ingredient into one of three categories: 'main', 'side', or 'sauce'
3. 'Main' includes core components of the recipe; 'side' includes extras served alongside or supporting ingredients in the preparation; 'sauce' includes only sauces not condiments like salt
4. Use generic names when appropriate (e.g., any bread = 'bread')
Expected output
{
  "main": [
    "mince",
    "bacon",
    "cheese",
    "bread",
    "lettuce",
    "tomato",
    "onion"
  ],
  "side": [
    "potatoes",
    "oil",
    "salt",
    "pepper"
  ],
  "sauce": [
    "mustard",
    "ketchup",
    "mayonnaise"
  ]
}
Model output
{
  "main": [
    "ground beef (mince)",
    "bacon",
    "cheese",
    "bread",
    "lettuce",
    "tomato",
    "onion"
  ],
  "side": [
    "potatoes",
    "oil (for frying/baking)",
    "salt (for fries)",
    "paper towels",
    "cold water",
    "mustard",
    "ketchup",
    "mayonnaise"
  ],
  "sauce": [
    "mustard",
    "ketchup",
    "mayonnaise"
  ]
}