Zepai
New · Evaluate AI judgment, not prompts

Zepai Assess

Measure AI judgment
across your team
and your candidates.

Send a test to your candidates or team members. Each one designs an AI system for their role. You see the judgment behind every decision.

3 free assessments · No credit card · Results in minutes

How it works

Three steps. One clear criterion.

01

Choose the role and send

Select the role to assess, enter candidate emails, and set the time limit. Each candidate receives a unique link.

02

The candidate designs

Without creating an account, the candidate completes an AI system design form for their role. There is no single correct answer; judgment is what's being evaluated.

03

You see the results

Global score, per-stage breakdown with feedback, and exactly what the candidate designed. All in your evaluator dashboard.

What you see as an evaluator

Not just a number. The judgment behind it.

Global score

A 0–100 index reflecting overall design quality, comparable across candidates in the same role.

Per-stage breakdown

Specific feedback for each of the 6 design stages: objective, capabilities, criteria, restrictions, autonomy, and oversight.

Full design

You see exactly what the candidate designed — not just whether they got it right, but how they thought through the system.

Available roles

Six roles at launch.

Each test is designed for the real context of the role — the problem, the tools, and the decisions that person faces.

Sales Rep
AI for Sales · L1
Data Analyst
AI for Analysts · L1
Operations Manager
AI for Operations · L1
Customer Success
AI for CS · L1
Marketing Manager
AI for Marketing · L1
Project Manager
AI for Projects · L1

Why it's different

Not multiple choice.
Real judgment.

TestGorilla and similar tools evaluate whether someone knows concepts. Zepai Assess evaluates how they decide when there's no single right answer — exactly what matters when working with AI.

Traditional tests
What is an LLM?
What is a prompt used for?
Multiple choice, single answer

Zepai Assess
What should this system never do?
When does it need human oversight?
Open design, judgment evaluation

Pricing

Start free. Scale when you need to.

Free
$0
when you sign up
3 assessments included
All available roles
Full results with breakdown

Pack 10
$79
10 assessments
All roles
Full results with breakdown

Pack 25
$149
25 assessments
All roles
Full results with breakdown
Priority support

Pack 50
$249
50 assessments
All roles
Full results with breakdown
Priority support

Start today

Three assessments.
Free.

No credit card. No time limit. Start evaluating your team's AI judgment today.