QLANKR Test

AI Agent Evaluation Platform · test.qlankr.com

What QLANKR Test does

QLANKR Test evaluates AI systems across multiple quality dimensions using independent AI judges. Users submit agent output (chat transcripts, RAG Q&A pairs, tool call traces, classification results, generated content) and receive a QI score from 0 to 100 with per-dimension breakdowns, identified strengths, and specific improvement recommendations. Results are presented as shareable report cards.

Who it is for

How it works

  1. Select an assessment template (e.g., Support Agent, RAG Accuracy, Tool-Use Correctness)
  2. Submit your agent's output data
  3. Independent AI judges evaluate across multiple quality dimensions
  4. Receive a QI score (0-100) with per-dimension breakdowns and actionable recommendations

Assessment types

10 assessment templates are available:

QI scoring

QI (QLANKR Intelligence) is a composite score from 0 to 100. It is the average of dimension scores, each independently evaluated by an AI judge. Pro users get 3-judge consensus scoring with agreement metrics. Scores map to bands:

Key differentiators

Pricing

Links

Contact

QLANKR Test · Stockholm, Sweden · test.qlankr.com