The False Positive Problem: Why Pure ML Redaction Fails Legal and Healthcare Teams (And What to Do About It)

A benchmark analysis with cost calculations.

The Challenge

A benchmark study found Presidio generated 13,536 false positive name detections across 4,434 samples — flagging pronouns ("I"), vessel names ("ASL Scorpio"), organizations ("Deloitte & Touche"), and even countries ("Argentina," "Singapore") as person names. In production legal and healthcare environments, every false positive requires human review, which costs $200-800/hour in attorney or specialist time. At scale, a 22.7% precision rate makes automated redaction economically impractical without a hybrid approach.

By the Numbers

  • 7% of all API calls from developer tools contain PII (Palo Alto Networks, 2025)
  • Microsoft Presidio shows 22.7% precision on person-name detection in benchmark testing (Alvaro et al., 2024)
  • 536 CVEs disclosed in major ML frameworks in 2024
  • Developer-toolchain PII leaks cost $200-$800 per incident in remediation

Real-World Scenario

A large law firm's e-discovery team processes 50,000 documents per litigation matter. Their ML-only redaction tool produces 35% false positive rate, requiring attorney review for each flagged item. At $400/hour and 10 false positives per document, the manual review cost exceeds the automation savings. anonym.legal's hybrid approach with configurable thresholds reduces the false positive rate to under 5%, making automation economically viable.
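The break-even arithmetic in this scenario is easy to reproduce. The sketch below uses the figures from the scenario plus one assumption not stated above: that each flagged item takes about two minutes of attorney time to review.

```python
def review_cost(num_docs, fps_per_doc, minutes_per_fp, hourly_rate):
    """Total cost to manually clear false positives across a matter."""
    hours = num_docs * fps_per_doc * minutes_per_fp / 60
    return hours * hourly_rate

# ML-only tool: 35% false positive rate, ~10 false positives per document
ml_only = review_cost(50_000, 10, 2, 400)

# Hybrid at under 5% FP rate: proportionally ~10 * (5 / 35) ≈ 1.4 FPs/doc
hybrid = review_cost(50_000, 10 * 5 / 35, 2, 400)

print(f"ML-only review cost: ${ml_only:,.0f}")   # roughly $6.7M per matter
print(f"Hybrid review cost:  ${hybrid:,.0f}")    # roughly $950K per matter
```

Even with conservative per-item review times, the false positive rate dominates the total cost, which is why precision matters more than recall headlines in these deployments.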

Technical Approach

Three-tier hybrid: regex handles structured data with 100% reproducibility; spaCy NLP handles contextual name/org/location detection; XLM-RoBERTa handles cross-lingual ambiguity. Confidence thresholds are configurable per entity type — a legal team can set names to 90% confidence while keeping phone numbers at regex-certainty.
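The tiering and per-entity thresholds can be sketched in a few lines. This is an illustrative mock-up, not anonym.legal's implementation: the regex patterns, the stubbed NER tier, and the threshold values are all assumptions chosen to show the dispatch logic. A real pipeline would call spaCy and XLM-RoBERTa models in place of the stub.

```python
import re
from dataclasses import dataclass

@dataclass
class Detection:
    entity_type: str
    text: str
    confidence: float

# Tier 1: regex for structured data -- deterministic, so confidence is 1.0.
PATTERNS = {
    "PHONE": re.compile(r"\b\d{3}[-.]\d{3}[-.]\d{4}\b"),
    "SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}

def regex_tier(text):
    return [Detection(etype, m.group(), 1.0)
            for etype, pat in PATTERNS.items()
            for m in pat.finditer(text)]

def ner_tier(text):
    # Stub for tiers 2-3 (spaCy, XLM-RoBERTa). Returns a hypothetical
    # low-confidence hit of the kind the benchmark flagged: a vessel
    # name misread as a person.
    return [Detection("PERSON", "ASL Scorpio", 0.62)]

# Per-entity thresholds, configurable by the team: names require 90%
# model confidence, structured entities stay at regex certainty.
THRESHOLDS = {"PERSON": 0.90, "PHONE": 1.0, "SSN": 1.0}

def redact_candidates(text):
    hits = regex_tier(text) + ner_tier(text)
    return [d for d in hits if d.confidence >= THRESHOLDS.get(d.entity_type, 0.85)]

sample = "Call 555-123-4567 about the ASL Scorpio filing."
print(redact_candidates(sample))
```

With these settings, the 0.62-confidence "ASL Scorpio" hit is filtered out by the 0.90 person threshold, while the phone number passes at regex certainty: exactly the trade-off that lets a legal team tune precision per entity type.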
