Skip to content

Conversation

shyamnamboodiripad
Copy link
Contributor

@shyamnamboodiripad shyamnamboodiripad commented Apr 1, 2025

The new evaluators will ship in a new Microsoft.Extensions.AI.Evaluation.Safety package.

Also includes the following public API changes:

  • Add a Metadata dictionary on EvaluationMetric.
  • Make EvaluationMetric.Diagnostics nullable.
  • Convert instance functions on some (fully mutable) result types to extension methods in the same namespace.

Some more work remains before this can be merged:

  • Add a couple more evaluators that are currently missing
  • Add more documentation comments for the new evaluators and metrics
  • Display metadata for individual metrics in the report
  • Some miscellaneous clean up
  • Add few more tests

Fixes #5937

Microsoft Reviewers: Open in CodeFlow

@shyamnamboodiripad shyamnamboodiripad requested review from a team as code owners April 1, 2025 11:03
@github-actions github-actions bot added the area-ai-eval Microsoft.Extensions.AI.Evaluation and related label Apr 1, 2025
Copy link
Member

@peterwald peterwald left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

:shipit:

@shyamnamboodiripad shyamnamboodiripad force-pushed the rai branch 2 times, most recently from 058b183 to 1fde648 Compare April 1, 2025 20:54
@shyamnamboodiripad shyamnamboodiripad force-pushed the rai branch 5 times, most recently from 40fb1c6 to 00644d3 Compare April 3, 2025 14:23
Includes the following API changes:
- Add a Metadata dictionary on EvaluationMetric
- Make EvaluationMetric.Diagnostics nullable
- Convert instance functions on some (fully mutable) result types to extension methods in the same namespace
@shyamnamboodiripad shyamnamboodiripad enabled auto-merge (squash) April 4, 2025 11:10
@shyamnamboodiripad shyamnamboodiripad merged commit 68d1661 into dotnet:main Apr 4, 2025
6 checks passed
@shyamnamboodiripad shyamnamboodiripad deleted the rai branch April 4, 2025 11:43
@github-actions github-actions bot locked and limited conversation to collaborators May 5, 2025
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
area-ai-eval Microsoft.Extensions.AI.Evaluation and related
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[AI Evaluation] Introduce RAI evaluators
3 participants