Fix tool_call_accuracy evaluator sample format causing "Tool definition not found" error #41620

Copilot · 2025-06-17T17:29:00Z

The sample code for ToolCallAccuracyEvaluator in evaluation_samples_evaluate.py was using incorrect parameter formats that caused a "Tool definition not found" error when users tried to run it.

Issue

The sample had two format problems:

tool_calls format: Used nested dict structure instead of the expected flat list format
tool_definitions format: Used single dict instead of list, and missing required "type" field

Before (broken sample):

tool_calls={
    "type": "tool_call",
    "tool_call": {
        "id": "call_eYtq7fMyHxDWIgeG2s26h0lJ",
        "type": "function",
        "function": {
            "name": "fetch_weather",
            "arguments": {"location": "New York"}
        }
    }
},
tool_definitions={
    "id": "fetch_weather",
    "name": "fetch_weather",
    "description": "Fetches the weather information for the specified location.",
    "parameters": {...}
}

After (working sample):

tool_calls=[
    {
        "type": "tool_call",
        "tool_call_id": "call_eYtq7fMyHxDWIgeG2s26h0lJ", 
        "name": "fetch_weather",
        "arguments": {"location": "New York"}
    }
],
tool_definitions=[
    {
        "name": "fetch_weather",
        "type": "function",
        "description": "Fetches the weather information for the specified location.",
        "parameters": {...}
    }
]

Validation

✅ Fixed sample now parses correctly without errors
✅ Existing unit test format continues to work (no regressions)
✅ Original problematic format still fails as expected (good validation)

The sample now matches the format expected by the evaluator implementation and demonstrated in the unit tests.

Fixes #41543.

💬 Share your feedback on Copilot coding agent for the chance to win a $200 gift card! Click here to start the survey.

Co-authored-by: singankit <[email protected]>

Initial plan for issue

477926a

Copilot AI assigned Copilot and singankit Jun 17, 2025

Copilot started work on behalf of singankit June 17, 2025 17:29 View session

Fix tool_call_accuracy evaluator sample format issues

ac03349

Co-authored-by: singankit <[email protected]>

Copilot AI changed the title ~~[WIP] [evaluation] sample for tool_call_accuracy raises exception~~ Fix tool_call_accuracy evaluator sample format causing "Tool definition not found" error Jun 17, 2025

Copilot AI requested a review from singankit June 17, 2025 17:38

Copilot finished work on behalf of singankit June 17, 2025 17:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix tool_call_accuracy evaluator sample format causing "Tool definition not found" error #41620

Fix tool_call_accuracy evaluator sample format causing "Tool definition not found" error #41620

Uh oh!

Copilot AI commented Jun 17, 2025 •

edited

Loading

Uh oh!

Uh oh!

Fix tool_call_accuracy evaluator sample format causing "Tool definition not found" error #41620

Are you sure you want to change the base?

Fix tool_call_accuracy evaluator sample format causing "Tool definition not found" error #41620

Uh oh!

Conversation

Copilot AI commented Jun 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Issue

Before (broken sample):

After (working sample):

Validation

Uh oh!

Uh oh!

Copilot AI commented Jun 17, 2025 •

edited

Loading