hidai25/eval-view

Regression testing framework for AI agents. Save golden baselines, detect behavioral drift, and block regressions in CI. Works with LangGraph, CrewAI, OpenAI, Claude, and any HTTP API.

Category
Developer Tools
Language
Python
License
Apache-2.0
Stars
80
Source
https://github.com/hidai25/eval-view

Related MCP Servers

Compare