AI validation workflows
Established an AI validation framework using PromptFoo and Azure AI Foundry.
Defined evaluation suites covering prompt regressions, tool-use correctness, hallucination rates, and overall end-to-end (e2e) success rate. Integrated the suites into CI pipelines and collected evaluation telemetry over time.
Used Azure AI Foundry for model hosting, dataset management, and tracing, and PromptFoo for assertions and grading.
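As a rough illustration of how per-suite results could feed a CI gate, the sketch below aggregates graded test cases into pass rates and flags suites that fall below a minimum. It is a minimal sketch in plain Python, not the actual framework: the `EvalResult` record shape, the suite names, the threshold values, and the sample data are all hypothetical, and no PromptFoo or Azure AI Foundry API is used.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EvalResult:
    """One graded test case (hypothetical record shape, not a real tool's schema)."""
    suite: str    # e.g. "prompt_regression", "tool_use", "hallucination", "e2e"
    passed: bool  # True if the grader accepted the model output


def pass_rates(results):
    """Return {suite: fraction of cases passed} for every suite present."""
    totals, passes = {}, {}
    for r in results:
        totals[r.suite] = totals.get(r.suite, 0) + 1
        passes[r.suite] = passes.get(r.suite, 0) + (1 if r.passed else 0)
    return {suite: passes[suite] / totals[suite] for suite in totals}


def gate(rates, thresholds):
    """Return the sorted suites whose pass rate is below the required minimum.

    A suite that has a threshold but no results is treated as failing.
    """
    return sorted(
        suite for suite, minimum in thresholds.items()
        if rates.get(suite, 0.0) < minimum
    )


# Made-up sample data and thresholds, for illustration only.
# For the "hallucination" suite, passed=True means no hallucination was
# detected, so the hallucination rate is 1 minus the pass rate.
results = [
    EvalResult("tool_use", True),
    EvalResult("tool_use", False),
    EvalResult("hallucination", True),
    EvalResult("e2e", True),
]
thresholds = {"tool_use": 0.9, "hallucination": 0.95, "e2e": 0.8}

failing = gate(pass_rates(results), thresholds)
print(failing)  # ['tool_use']
```

In a CI job, a non-empty `failing` list would typically map to a non-zero exit code so the pipeline step fails, while the per-suite rates could be logged on each run to build up the trend data over time.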