AI Design Evaluation System

Configure different codebase branches and analyse the results

Run Evaluation
Run automated evaluation tests against the production Cloud Run endpoint. Results will be tagged with the selected branch for comparison.

Note:

• All evaluations run against the production Cloud Run endpoint

• Results are tagged with the selected branch for comparison

• View completed results in the Results Dashboard tab

• Compare branches in the Compare Branches tab
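
As a rough illustration of how branch-tagged results could be compared, here is a minimal Python sketch. It is not this system's actual implementation: the `EvalResult` record, its field names, the `mean_score_by_branch` helper, and the sample scores are all hypothetical, invented only to show the idea of grouping results by branch tag and averaging their scores.

```python
from collections import defaultdict
from dataclasses import dataclass
from statistics import mean


@dataclass
class EvalResult:
    # All fields are hypothetical; the real result schema is not given in the source.
    branch: str   # branch tag attached to the evaluation run
    test_id: str  # identifier of the evaluation test
    score: float  # numeric score for that test


def mean_score_by_branch(results):
    """Group results by their branch tag and return the mean score per branch."""
    grouped = defaultdict(list)
    for result in results:
        grouped[result.branch].append(result.score)
    return {branch: mean(scores) for branch, scores in grouped.items()}


# Made-up sample data, for illustration only.
results = [
    EvalResult("main", "t1", 0.5),
    EvalResult("main", "t2", 1.0),
    EvalResult("feature-x", "t1", 1.0),
    EvalResult("feature-x", "t2", 1.0),
]

summary = mean_score_by_branch(results)
for branch, avg in summary.items():
    print(f"{branch}: {avg:.2f}")
```

Because every run targets the same production endpoint, the branch tag is the only thing that separates one set of results from another, which is why the sketch groups on that field alone.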

Recent Results
Past evaluation runs and their scores