Last Updated: February 12, 2026

How Do You Certify an AI Agent for Payment Compliance?

AI agent certification uses TrustVerdict v1.1, a three-layer evaluation combining Mystery Shopper behavioral probes (50% weight), GuardScan automated security scanning (35% weight), and identity verification (15% weight). Agents scoring above 50 earn Verified status, above 70 earn Gold, and above 90 achieve Diamond certification with on-chain attestation on Base and Arbitrum.

Agent Certification Checklist

Register your agent with a verifiable X (Twitter) handle or domain
Ensure your agent has a publicly accessible API endpoint for probes
Implement proper error handling — probes test edge cases and malformed inputs
Handle PII data correctly: never log, store, or echo back sensitive information
Respond to injection and social engineering probes with appropriate refusals
Pass all 10 Mystery Shopper probes for full TrustVerdict scoring
Claim your on-chain attestation to make certification publicly verifiable

Certification Tiers

TrustVerdict assigns a composite score from 0 to 100 based on the weighted formula: Mystery Shopper (50%) + GuardScan (35%) + Identity Verification (15%). Each tier represents a distinct level of trust that payment operators, acquirers, and platforms can use to make informed decisions about agent permissions.

Tier	Score Range	Description
Unverified	0-49	Agent has not met minimum compliance standards. Not recommended for payment operations.
Verified	50-69	Basic compliance standards met. Suitable for low-risk, supervised payment operations.
Gold	70-89	Strong compliance posture. Approved for most payment operations with standard monitoring.
Diamond	90-100	Exceptional compliance. Approved for high-value and unsupervised payment operations.

The TrustVerdict v1.1 Formula

TrustVerdict combines three independent evaluation layers, each measuring a different dimension of agent trustworthiness. The Mystery Shopper component tests behavioral compliance through live adversarial probes. GuardScan performs automated static and dynamic security analysis. Identity verification confirms the agent operator's real-world identity and social presence. The weighted formula ensures that behavioral compliance carries the most weight because how an agent actually behaves under pressure matters more than static security checks.

TrustVerdict Score = (Mystery Shopper x 0.50) + (GuardScan x 0.35) + (Identity x 0.15)

Mystery Shopper

50% weight

10 behavioral probes testing security awareness, PII handling, ethical boundaries, and compliance under adversarial conditions.

GuardScan

35% weight

102-pattern static scan + VirusTotal integration + LLM Judge analysis + URL endpoint verification for comprehensive security coverage.

Identity

15% weight

X (Twitter) handle verification, domain ownership check, and operator identity confirmation through social and web presence signals.

The 10 Mystery Shopper Probes

Mystery Shopper evaluates AI agents through 10 carefully designed behavioral probes that simulate real-world adversarial scenarios. Each probe tests a specific compliance dimension. The agent's responses are scored by a secondary AI judge for accuracy, safety, and appropriateness. Agents do not know which interactions are probes, ensuring authentic behavioral assessment.

Prompt Injection

Tests resistance to instruction override attempts

PII Exfiltration

Attempts to extract stored personal data

Social Engineering

Uses authority impersonation to bypass rules

Ethical Boundary

Requests that violate compliance policies

Data Poisoning

Feeds misleading data to corrupt responses

Jailbreak Attempt

Multi-step jailbreak escalation sequence

Stress Response

High-volume requests testing degradation

Compliance Knowledge

Domain-specific regulatory questions

Error Handling

Malformed inputs and edge case scenarios

Context Leakage

Cross-conversation information bleed testing

On-Chain Attestation

After completing certification, agents can claim an on-chain attestation that makes their TrustVerdict score publicly verifiable by any smart contract or API. Attestations are stored on Base (Coinbase L2) via the GuardAttestation contract, and agent identity is recorded as a soulbound NFT on Arbitrum via the MGAgent contract. Soulbound tokens cannot be transferred, ensuring that certification is permanently bound to the agent's identity. Payment platforms and acquirers can query these contracts to verify an agent's certification status before granting payment permissions. Attestations include the score, tier, evaluation date, and expiration. The ERC-8004 standard provides machine-readable agent identity metadata.

Get Your Agent Certified

MerchantGuard is the compliance layer for the AI agent economy. Certify your agent with TrustVerdict, get on-chain attestation, and gain access to payment platforms that require verified compliance status. Start with 3 free probes.

Certify Your Agent GuardScan Claim Attestation

On-Chain Contracts

GuardAttestation (Base):0xAbaDA41b865B826de10c26d38Ec4D64Dc19c50Dd

MGAgent Soulbound (Arbitrum):0x813eb25176d8a5cab9c95616461DDEC4110D424e

GuardScorePassport (Base):0x94Ab36d41e3FF25BFe3a18777AAD39c62508C741

Frequently Asked Questions

What is TrustVerdict and how does it work?

TrustVerdict v1.1 is MerchantGuard's AI agent certification framework that evaluates agents across three dimensions: Mystery Shopper behavioral probes (50% weight), GuardScan automated security scanning (35% weight), and identity verification (15% weight). The final score ranges from 0-100 and determines the agent's certification tier. Evaluations expire after 90 days.

What are the Mystery Shopper probes?

Mystery Shopper sends 10 behavioral probes to an AI agent testing security awareness, ethical boundaries, PII handling, reliability under stress, and compliance knowledge. Each probe simulates a real-world scenario the agent might encounter when handling payments. Probes include injection attacks, social engineering attempts, data exfiltration tests, and compliance edge cases. Results are scored by a secondary AI judge.

How long does certification last?

TrustVerdict certifications expire after 90 days from the date of evaluation. This rolling expiration ensures that agents are continuously monitored as their capabilities and behaviors can change with model updates. Agents on the Pro plan ($99/month) receive continuous monitoring that automatically re-certifies when probes detect no degradation. Manual re-certification is available at any time.

What blockchain are attestations stored on?

TrustVerdict attestations are stored on-chain on both Base (Coinbase L2) and Arbitrum. The GuardAttestation contract on Base (0xAbaDA41b865B826de10c26d38Ec4D64Dc19c50Dd) stores certification scores and tiers. The MGAgent soulbound NFT on Arbitrum (0x813eb25176d8a5cab9c95616461DDEC4110D424e) provides permanent agent identity that cannot be transferred.

How much does AI agent certification cost?

MerchantGuard offers tiered pricing: 3 free probes for evaluation, Starter pack with 5 probes for $4.99, Growth with 15 probes for $9.99, Business with 50 probes for $29.99, and Pro with unlimited probes plus continuous monitoring for $99 per month. Full certification requires all 10 probes. Per-call pricing via x402 USDC payments is also available at $0.05 per probe.