Implement Input Validation for Agentic Evaluators by m7md7sien · Pull Request #44618 · Azure/azure-sdk-for-python

m7md7sien · 2026-01-12T18:36:33Z

Description

Please add an informative description that covers that changes made by the pull request and link all relevant issues.

If an SDK is being regenerated based on a new API spec, a link to the pull request containing these API spec changes should be included above.

All SDK Contribution checklist:

The pull request does not introduce [breaking changes]
CHANGELOG is updated for new features, bug fixes or other significant changes.
I have read the contribution guidelines.

General Guidelines and Best Practices

Title of the pull request is clear and informative.
There are a small number of commits, each of which have an informative message. This means that previously merged commits do not appear in the history of the PR. For more information on cleaning up the commits in your PR, see this page.

Testing Guidelines

Pull request includes test coverage for the included changes.

Copilot

Pull request overview

This PR implements input validation infrastructure for agentic evaluators in the Azure AI Evaluation SDK. The changes introduce a comprehensive validator system to validate inputs before processing in various evaluator classes.

Changes:

Added new validator infrastructure with abstract interface and concrete implementations
Integrated validators into 13 evaluator classes to validate inputs before processing
Created validation constants for message roles and content types

Reviewed changes

Copilot reviewed 21 out of 21 changed files in this pull request and generated 14 comments.

Show a summary per file

File	Description
`_validator_interface.py`	Abstract base class defining validation interface
`_validation_constants.py`	Enums for message roles and content types
`conversation_validator.py`	Validates conversation-style query/response inputs
`_tool_definitions_validator.py`	Validates tool definitions alongside conversations
`_tool_calls_validator.py`	Validates tool calls alongside tool definitions
`_task_navigation_efficiency_validator.py`	Validates task navigation inputs
`__init__.py`	Exports validator classes
`_tool_selection.py`	Integrated ToolCallsValidator
`_tool_output_utilization.py`	Integrated ToolDefinitionsValidator
`_tool_input_accuracy.py`	Integrated ToolDefinitionsValidator
`_tool_call_success.py`	Integrated ToolDefinitionsValidator
`_tool_call_accuracy.py`	Integrated ToolCallsValidator
`_task_navigation_efficiency.py`	Integrated TaskNavigationEfficiencyValidator
`_task_completion.py`	Integrated ToolDefinitionsValidator
`_task_adherence.py`	Integrated ToolDefinitionsValidator
`_intent_resolution.py`	Integrated ToolDefinitionsValidator
`_groundedness.py`	Integrated ConversationValidator with dual validator setup
`_fluency.py`	Integrated ConversationValidator
`_relevance.py`	Integrated ConversationValidator
`_coherence.py`	Integrated ConversationValidator

...-ai-evaluation/azure/ai/evaluation/_evaluators/_common/_validators/conversation_validator.py

sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluators/_fluency/_fluency.py

sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluators/_coherence/_coherence.py

...e-ai-evaluation/azure/ai/evaluation/_evaluators/_common/_validators/_tool_calls_validator.py

...azure-ai-evaluation/azure/ai/evaluation/_evaluators/_intent_resolution/_intent_resolution.py

...ure-ai-evaluation/azure/ai/evaluation/_evaluators/_tool_call_accuracy/_tool_call_accuracy.py

...ion/azure-ai-evaluation/azure/ai/evaluation/_evaluators/_task_completion/_task_completion.py

...ation/azure-ai-evaluation/azure/ai/evaluation/_evaluators/_task_adherence/_task_adherence.py

sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluators/_relevance/_relevance.py

...valuation/azure-ai-evaluation/azure/ai/evaluation/_evaluators/_groundedness/_groundedness.py

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

...e-ai-evaluation/azure/ai/evaluation/_evaluators/_common/_validators/_tool_calls_validator.py

Implement Input Validation for Agentic Evaluators

94429f4

Copilot AI review requested due to automatic review settings January 12, 2026 18:36

m7md7sien requested a review from a team as a code owner January 12, 2026 18:36

github-actions bot added the Evaluation Issues related to the client library for Azure AI Evaluation label Jan 12, 2026

Copilot started reviewing on behalf of m7md7sien January 12, 2026 18:37 View session

Copilot AI reviewed Jan 12, 2026

View reviewed changes

m7md7sien and others added 4 commits January 12, 2026 21:09

Apply suggestions from code review

c873930

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Fix build errors

6c49b3c

run black

93cfff8

Fix tests

d524521

m7md7sien force-pushed the mohessie/evaluators_validators branch from ebdf9d0 to d524521 Compare January 14, 2026 16:16

m7md7sien added 2 commits January 14, 2026 18:59

Merge branch 'main' into mohessie/evaluators_validators

0df80ce

Merge branch 'main' into mohessie/evaluators_validators

209b6dc

m7md7sien enabled auto-merge (squash) January 18, 2026 14:41

Merge branch 'main' into mohessie/evaluators_validators

efa53e1

ashaabansoliman approved these changes Feb 10, 2026

View reviewed changes

...e-ai-evaluation/azure/ai/evaluation/_evaluators/_common/_validators/_tool_calls_validator.py Show resolved Hide resolved

m7md7sien merged commit 65612ca into main Feb 10, 2026
20 checks passed

m7md7sien deleted the mohessie/evaluators_validators branch February 10, 2026 21:36

m7md7sien mentioned this pull request Feb 11, 2026

Inline Validators in Evaluator Files Pending SDK Release Azure/azureml-assets#4771

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement Input Validation for Agentic Evaluators#44618

Implement Input Validation for Agentic Evaluators#44618
m7md7sien merged 8 commits intomainfrom
mohessie/evaluators_validators

m7md7sien commented Jan 12, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

m7md7sien commented Jan 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

All SDK Contribution checklist:

General Guidelines and Best Practices

Testing Guidelines

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

m7md7sien commented Jan 12, 2026 •

edited

Loading