AI Design Evaluation System

Configure different codebase branches and analyse the results

Run Evaluation

Run automated evaluation tests against the production Cloud Run endpoint. Results will be tagged with the selected branch for comparison.

Select Branch to Evaluate

Test Limit

Note: All evaluations run against the production Cloud Run endpoint

• Results are tagged with the selected branch for comparison

• View completed results in the Results Dashboard tab

• Compare branches in the Compare Branches tab