Defining Sonate's platform KPIs (FAR-A, FAR-H, PFI, TIS), evaluation protocol, and statistical reporting conventions. Version: 2025.11.08-r1
📋 Implementation Status
Core trust scoring operational with 6-principle weighted algorithm (compliance_score, guilt_score). Advanced metrics (FAR-A, FAR-H, PFI, TIS) methodology defined, implementation in progress.
Clear, auditable, directionally correct metrics for AI trust infrastructure
False Apology Rate (Automated)
Unnecessary or policy-misfired apologies issued by the system (including automated guardrail apologies) when no breach or harm occurred. Lower is better.
< 0.50%
An apology is "unnecessary" if post-hoc adjudication finds no breach, harm, or policy trigger. Includes both model-initiated and guardrail-initiated apologies.
Factual Assertion Failure (Hallucination) Rate
Share of factual claims that fail verification against the ground-truth set or authoritative references. Lower is better.
< 1.00% across general tasks; < 0.25% for regulated domains
Factual claims are extracted by the claim-miner; verification uses citation checks, structured tests, or oracle datasets.
Protected Fairness Index
Distributional parity across protected attributes (e.g., gender, ethnicity, age) for key outcomes (access, approval, refusal, latency, quality). Higher is better.
Where Δ = 1 - min(pi/max(pj)) for rates, clamped to [0,1]
≥ 0.98 (enterprise), ≥ 0.95 (public)
Report attribute coverage, sample sizes per group, and uncertainty (see §3).
Trust Integrity Score
Composite integrity of execution and auditability. Higher is better.
≥ 0.995
Publish per-component scores and weights; show receipt examples.
How we measure and validate our trust metrics
Where αs is a fixed publishable weight vector (e.g., [0.25, 0.35, 0.30, 0.10]). Publish the weight vector and keep it stable across releases; log any changes in the change-log.
Confidence intervals, rounding rules, and disclosure requirements
User-verifiable cryptographic audit trails
ts - Timestampactor - Actor identifierintent - User intentinputs_hash - Hash of inputsoutputs_hash - Hash of outputspolicy_id - Policy identifierguardrail_action - Guardrail action takened25519_sig - Ed25519 signatureledger_height - Ledger positionprev_hash - Previous entry hashinputs_hash/outputs_hash from the payloadprev_hash → hash(payload) consistency at ledger_heightTry our minimalist web verifier (pure JS) to paste a receipt and verify its signature and hash-chain integrity
Open In-Browser VerifierProvider-agnostic policy mapping with receipt evidence
| Policy | Trigger (examples) | Action | Evidence in Receipt |
|---|---|---|---|
| Safety | Toxicity/violence threshold crossed | Apologize + refuse | policy_id, scores, guardrail_action=refuse |
| Privacy | PII detected | Mask + human-approve | policy_id, pii_spans, human_flag=true |
| Compliance | Regulated-domain query | Escalate to human | policy_id, routing=human |
| Continuity | Long-running context | Summarize + confirm | policy_id, continuity=true |
Each row links to a sample Trust Receipt demonstrating the evidence captured.
View Sample ReceiptsReady-to-render component calculations for TIS
Standards for public reporting and versioning
/changelog and include the diff of weights, sampling, or definitions## 2025.11.08-r1 - Initial public methodology. - Weight vector for complexity normalization set to [0.25, 0.35, 0.30, 0.10]. - Introduced LVP/RC/GA/AD components for TIS.
For /metrics-methodology
Structured data for SEO and discoverability
<script type="application/ld+json">
{
"@context": "https://schema.org",
"@type": "SoftwareApplication",
"name": "Sonate — AI Trust Infrastructure",
"applicationCategory": "SecurityApplication",
"operatingSystem": "Web",
"url": "https://yseeku.com/",
"softwareVersion": "2025.11.08-r1",
"offers": {
"@type": "Offer",
"price": "0",
"priceCurrency": "USD"
}
}
</script><script type="application/ld+json">
{
"@context": "https://schema.org",
"@type": "TechArticle",
"headline": "Sonate Metrics Methodology & Assumptions",
"datePublished": "2025-11-08",
"dateModified": "2025-11-08",
"author": {
"@type": "Organization",
"name": "Yseeku"
},
"about": ["Trust Receipts","Hash-chained Ledger","AI Guardrails","Fairness","Hallucination"]
}
</script>User-verifiable record of an action, signed (Ed25519), placed into an append-only, hash-chained ledger.
Each entry references the previous entry's hash; tamper-evident.
Decentralized Identifiers & Verifiable Credentials for actor identity and attestations.