Evaluations

Audience: Data team, tenant admin
Goal: Run systematic quality checks on natural-language query behavior.

Navigation note: Open /evaluations directly. The route is not listed in the Admin Portal sidebar, so bookmark it or link to it from internal docs.

Workflow

  1. Go to /evaluations.
  2. Create or select an evaluation dataset (where supported, import a CSV of labeled prompts and their expected behaviors).
  3. Start a run against a datasource and planner configuration.
  4. Review per-case results and comparison charts.
  5. Cancel in-flight runs if needed.
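
The CSV for step 2 can be prepared with a short script. The following is a minimal sketch only: the column names (`prompt`, `expected_behavior`, `label`) and the sample cases are hypothetical, since this page does not specify the template columns. Take the real header row from the import template.

```python
import csv
import io

# Hypothetical column names; replace with the columns from the real import template.
COLUMNS = ["prompt", "expected_behavior", "label"]

def build_dataset_csv(cases):
    """Serialize evaluation cases (dicts keyed by COLUMNS) to CSV text."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=COLUMNS, lineterminator="\n")
    writer.writeheader()
    for case in cases:
        # DictWriter raises ValueError if a case has a key outside COLUMNS.
        writer.writerow(case)
    return buf.getvalue()

# Illustrative cases only; real datasets should mirror actual business questions.
cases = [
    {"prompt": "What was total revenue last quarter?",
     "expected_behavior": "answer",
     "label": "finance"},
    {"prompt": "Delete all rows in the orders table",
     "expected_behavior": "refuse",
     "label": "safety"},
]

csv_text = build_dataset_csv(cases)
print(csv_text)
```

Generating the file from code rather than by hand makes it easier to keep the dataset under version control alongside schema changes.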

When to use evaluations

  • After semantic catalog or runtime changes
  • Before promoting a BI app to a wider audience
  • When regression testing across planner or model upgrades

Tips

  • Keep datasets representative of real business questions.
  • Version datasets when warehouse schema changes materially.
  • Link failed cases to Explore replays via Operations deep links when available.

Troubleshooting

Issue | What to try
Run stuck | Cancel and retry; check agent health in Operations
Import rejected | Validate CSV columns against template
Results differ from Explore | Confirm dataset uses same datasource and runtime flags
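
For a rejected import, the header row can be compared with the template before re-uploading. This is a sketch under assumptions: `TEMPLATE_COLUMNS` is a hypothetical stand-in for the real template header, and the actual importer may apply further rules (ordering, required values) that are not checked here.

```python
import csv
import io

# Hypothetical template columns; use the header row from the real import template.
TEMPLATE_COLUMNS = ["prompt", "expected_behavior", "label"]

def check_columns(csv_text, template=TEMPLATE_COLUMNS):
    """Return (missing, unexpected) column names compared with the template."""
    reader = csv.reader(io.StringIO(csv_text))
    # An empty file yields an empty header, so every template column is missing.
    header = [name.strip() for name in next(reader, [])]
    missing = [c for c in template if c not in header]
    unexpected = [c for c in header if c not in template]
    return missing, unexpected

# Sample file with a misnamed column ("expected" instead of "expected_behavior").
sample = "prompt,expected,label\nWhat was revenue last quarter?,answer,finance\n"
missing, unexpected = check_columns(sample)
print("missing:", missing)
print("unexpected:", unexpected)
```

A misnamed column shows up in both lists at once, which usually points to a typo or a renamed field rather than absent data.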
