NewEvaluate AI judgment, not prompts

Zepai Assess

Measure AI judgment
across your team
and your candidates.

Send a test to your candidates or team members. Each one designs an AI system for their role. You see the judgment behind every decision.

3 free assessmentsNo credit cardResults in minutes

How it works

Three steps. One clear criterion.

Choose the role and send

Select the role to assess, enter candidate emails and set the time limit. Each one receives a unique link.

The candidate designs

Without creating an account, the candidate completes an AI system design form for their role. There are no single correct answers — judgment is what's being evaluated.

You see the results

Global score, per-stage breakdown with feedback, and exactly what the candidate designed. All in your evaluator dashboard.

What you see as evaluator

Not just a number. The judgment behind it.

Global score

A 0–100 index reflecting overall design quality, comparable across candidates in the same role.

Per-stage breakdown

Specific feedback for each of the 6 design stages: objective, capabilities, criteria, restrictions, autonomy, and oversight.

Full design

You see exactly what the candidate designed — not just whether they got it right, but how they thought through the system.

Available roles

Six roles at launch.

Each test is designed for the real context of the role — the problem, the tools, and the decisions that person faces.

Sales Rep

AI for Sales · L1

Data Analyst

AI for Analysts · L1

Operations Manager

AI for Operations · L1

Customer Success

AI for CS · L1

Marketing Manager

AI for Marketing · L1

Project Manager

AI for Projects · L1

Why it's different

Not multiple choice.
Real judgment.

TestGorilla and similar tools evaluate whether someone knows concepts. Zepai Assess evaluates how they decide when there's no single right answer — exactly what matters when working with AI.

Traditional tests

—What is an LLM?

—What is a prompt used for?

—Multiple choice, single answer

Zepai Assess

What should this system never do?

When does it need human oversight?

Open design, judgment evaluation

Pricing

Start free. Scale when you need to.

Free

when you sign up

3 assessments included

All available roles

Full results with breakdown

Pack 10

$79

10 assessments

All roles

Full results with breakdown

Pack 25

$149

25 assessments

All roles

Full results with breakdown

Priority support

Pack 50

$249

50 assessments

All roles

Full results with breakdown

Priority support

Start today

Three assessments.
Free.

No credit card. No time limit. Start evaluating your team's AI judgment today.

Measure AI judgmentacross your teamand your candidates.

Three steps. One clear criterion.

Choose the role and send

The candidate designs

You see the results

Not just a number. The judgment behind it.

Global score

Per-stage breakdown

Full design

Six roles at launch.

Not multiple choice.Real judgment.

Start free. Scale when you need to.

Three assessments.Free.

Measure AI judgment
across your team
and your candidates.

Not multiple choice.
Real judgment.

Three assessments.
Free.