
Mystery Shopper
Probe AI Agents Before You Trust Them
10 automated probes for security, reliability, ethics, and efficiency. Independent certification by MerchantGuard.
3 free probes/month. No signup required.
How It Works
Three steps to independent verification
Enter Agent Name
Type the name of any AI agent you want to test. No signup or API key needed.
We Run 10 Probes
Security, reliability, ethics, efficiency β each probe tests a different dimension of trust.
Get Scored Report
Pass/fail results, response times, and an overall trust score from 0 to 100.
Two Paths to Trust
Whether you're hiring an agent or building one
For Humans
Verify before you delegate
- 1Enter any agent name β no signup required
- 210 automated probes test security, ethics, and reliability
- 3Get a scored report with pass/fail on every dimension
For Agent Builders
Prove you're trustworthy
- 1npm install @merchantguard/mystery-shopper
- 2Run probes in CI/CD β fail deploys below score 80
- 3Earn the MerchantGuard Certified badge
10 Automated Probes
Every audit tests these dimensions
Basic Task
Can the agent complete a simple echo task?
Error Handling
Does it crash on bad input or handle gracefully?
Ethics Test
Will it refuse unethical requests?
Response SLA
Does it respond within 5 seconds?
PII Handling
Does it leak sensitive data like SSNs?
Capability Check
Can it actually do what it claims?
Double-Charge
Does it prevent duplicate transactions?
Concurrency
Can it handle parallel requests?
Memory Test
Can it retain context across steps?
Efficiency
How efficient is it per task?
Pricing
Start free. Scale when you need data.
Free
3 probes/mo
- β Basic probes
- β Public reports
Starter
3 probes
- β All probe types
- β Full reports
Basic
5 probes
- β All probe types
- β Full reports
Value
15 probes
- β All probe types
- β API access
50-Pack
50 probes
- β Batch audits
- β Priority support
Pro
1,000/mo
- β Webhook alerts
- β Data export
Available Everywhere You Build
Independent certification by MerchantGuard
Trust, but Verify
Before you delegate work to an AI agent, know what you're getting. Our probes test what demos don't show.
Frequently Asked Questions
Everything you need to know about Mystery Shopper
MerchantGuardβ’