| Action | Reward | Rationale |
|---|---|---|
| Investigation | -0.02 | Time/latency cost |
| Correct rejection | +0.30 to +0.40 | Scaled by severity |
| Correct approval | +0.10 | Revenue preserved |
| False positive | -0.35 | Lost advertiser revenue |
| False negative | -0.50 | Fraud goes live |
| Correct link | +0.40 | Ring detection |
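
As a minimal sketch, the table above could be encoded as a lookup. The outcome keys and function names here are hypothetical, and the table only says the correct-rejection reward is "scaled by severity", so the linear interpolation over a severity in [0, 1] is an assumption rather than the documented rule.

```python
# Reward values copied from the table; the keys are hypothetical names.
REWARDS = {
    "investigation": -0.02,     # time/latency cost
    "correct_approval": 0.10,   # revenue preserved
    "false_positive": -0.35,    # lost advertiser revenue
    "false_negative": -0.50,    # fraud goes live
    "correct_link": 0.40,       # ring detection
}

# Correct rejections are rewarded between +0.30 and +0.40.
CORRECT_REJECTION_RANGE = (0.30, 0.40)


def correct_rejection_reward(severity: float) -> float:
    """Assumption: severity lies in [0, 1] and scales the reward linearly."""
    low, high = CORRECT_REJECTION_RANGE
    clamped = min(max(severity, 0.0), 1.0)
    return low + (high - low) * clamped


def action_reward(outcome: str, severity: float = 0.0) -> float:
    """Return the reward for one outcome; severity only matters for rejections."""
    if outcome == "correct_rejection":
        return correct_rejection_reward(severity)
    return REWARDS[outcome]
```

With these values, one false negative (-0.50) costs more than a maximally severe correct rejection earns (+0.40).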

Evasion reward: the sum of severity × plausibility over fraud ads that were not rejected, minus a penalty for each rejected ad. Higher plausibility means more reward for evasion.
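
The evasion formula can be sketched as follows. This is an illustration only: `AdOutcome`, its field names, and `evasion_reward` are hypothetical, and the penalty size is not given in the text, so it is a required argument. The sketch also assumes the penalty applies to every rejected ad in the list.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass
class AdOutcome:
    """Hypothetical record of one ad after review."""
    is_fraud: bool
    severity: float
    plausibility: float
    rejected: bool


def evasion_reward(ads: Iterable[AdOutcome], rejection_penalty: float) -> float:
    """Sum severity * plausibility over unrejected fraud ads,
    then subtract a fixed penalty for each rejected ad."""
    ads = list(ads)
    evaded = sum(
        ad.severity * ad.plausibility
        for ad in ads
        if ad.is_fraud and not ad.rejected
    )
    rejected_count = sum(1 for ad in ads if ad.rejected)
    return evaded - rejection_penalty * rejected_count
```

For example, one evading fraud ad with severity 0.8 and plausibility 0.5 contributes 0.4. A second ad that is rejected contributes nothing and subtracts the penalty.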