Skip to content

AI Slop Prevention Score

The AI Slop Prevention Score is a composite metric that quantifies the quality and trustworthiness of an AI response. It combines 6 weighted components into a single 0.0-1.0 score.

Components

Component Weight Range Description
Grounding Coverage 0.25 0=grounded, 1=ungrounded Whether output is backed by evidence
Schema Compliance 0.15 0=valid, 1=invalid Whether output matches expected schema
Unsupported Claim Ratio 0.20 0-1 Ratio of claims not found in context
Genericity Score 0.15 0=specific, 1=generic Density of filler phrases
Policy Risk Score 0.15 0-1 Policy violation severity
Action Risk Score 0.10 0-1 Risk level of proposed actions

Formula

slop_score = 0.25 * grounding_coverage
           + 0.15 * schema_compliance
           + 0.20 * unsupported_claim_ratio
           + 0.15 * genericity_score
           + 0.15 * policy_risk_score
           + 0.10 * action_risk_score

Decision Thresholds

Score Range Decision Meaning
0.0 - 0.3 pass Output is clean
0.3 - 0.7 repair Output needs improvement
0.7 - 1.0 reject Output is unacceptable

Thresholds are configurable per tenant via SLOP_SCORE_THRESHOLD_PASS and SLOP_SCORE_THRESHOLD_REPAIR.

Source

src/agentguard/slop_score/