Dashboard › anonym.legal › Case Study

anonym.legal Entity Detection Accuracy

Pain Point Case Study NP-53

Enterprise NLP vs. Regex Patterns: Why Caviard.ai Falls Short for Compliance

anonym.community · 2026-03-17

Executive Summary

Caviard.ai's free Chrome extension excels at local, privacy-preserving PII redaction for ChatGPT input. However, its regex-only detection methodology creates accuracy and scope limitations that make it unsuitable for organizations that require high-confidence, auditable redaction. Regex patterns achieve 60–75% recall and 15–30% false positive rates—unacceptable for healthcare, legal, or financial compliance.

anonym.legal's deterministic three-layer NLP engine (Presidio + spaCy/Stanza/XLM-RoBERTa + confidence scoring) achieves 92–98% recall and <5% false positive rates. Every detected entity includes a confidence score and detection method, providing the audit trail required for e-discovery, compliance audits, and legal proceedings.

The Problem: Regex Patterns Miss Context, Generate False Positives

Regex excels at matching fixed formats but fails at semantic entity recognition. PII in natural language is context-dependent, multilingual, and ambiguous. Regex cannot handle these complexities.

False Negative Example: "The Johnsons live in Springfield, Ohio." Caviard.ai's regex might detect "Johnsons" as a potential name, but "Springfield, Ohio" (a city name) would not be detected as a location entity by simple regex. NLP models trained on billions of texts recognize both "Johnsons" and "Springfield, Ohio" as named entities requiring redaction in sensitive contexts.

False Positive Example: "The project Apple Pie requires 3 weeks." Caviard.ai's regex pattern for company names might flag "Apple" as a company, triggering unnecessary redaction. But "Apple Pie" is a project name, not the Apple Corporation. NLP context analysis (analyzing surrounding tokens "project," "Pie") correctly classifies this as a project name, not a company, and recommends lower confidence redaction.

Multilingual Failure: Caviard.ai's regex patterns are English-ASCII optimized. German street address "Müller Straße 42, 10115 Berlin" might not be recognized because of the umlaut (ü). Arabic text, Chinese ideographs, Cyrillic—all fail regex matching. NLP models trained with Unicode and cross-lingual embeddings (XLM-RoBERTa) handle these automatically.

Irreducible truth: Legal and healthcare environments cannot accept 15–30% false positive rates or 25–40% false negative rates. Regex-based systems are unsuitable for regulated industries. Compliance requires deterministic, auditable, high-accuracy detection with per-entity confidence scores.

The Solution: Deterministic NLP with Confidence Scoring & Audit Trail

1. Six Platform Access Methods (vs. Caviard.ai Chrome-Only)

anonym.legal eliminates platform lock-in by offering six independent access points:

Web App (anonym.legal): Browser-based, all devices, no installation. Works with every major browser (Chrome, Firefox, Safari, Edge).
Desktop App: Windows 10+, macOS, Ubuntu. Native Tauri application with offline-capable vault, file drag-and-drop, local encryption.
Office Add-in: Word, Excel, PowerPoint 2016+, Microsoft 365, Office Online. Inline highlighting and one-click anonymization within documents.
Chrome Extension: Direct integration with ChatGPT, Claude, Gemini, and other AI platforms. Anonymize text before sending to AI systems (unique feature).
MCP Server (Claude Desktop & Cursor Pro): 7 tools: analyze_text, anonymize_text, detokenize_text, get_balance, estimate_cost, list_sessions, delete_session.
REST API: Programmatic integration for enterprise workflows (Basic+ plans, 100 req/min, 1 MB max payload).

Caviard.ai: Chrome-only, limited to ChatGPT/DeepSeek integration. No desktop app, no Office integration, no API, no MCP. anonym.legal's 6 access methods vs. Caviard.ai's 1 method demonstrates enterprise readiness.

2. Three-Layer Detection Engine with Stance Classification

Layer 1: Presidio (Microsoft open-source): 317 custom regex patterns optimized for structured data (SSN, credit cards, IBAN, phone, email). Sub-millisecond processing, 100% reproducible.

Layer 2: Advanced Transformers: spaCy (25 languages, CNN/transformer), Stanza (7 languages, neural LSTM), XLM-RoBERTa (16 languages, cross-lingual embeddings). Named Entity Recognition with BiLSTM + Conditional Random Fields (CRF) layers.

Layer 3: Consistency Cues (BERT Stance Classification): Unique third layer using BERT-derived representations for semantic validation. Resolves ambiguous entities ("Amazon" = company vs. location) by analyzing context and semantic relationships. Eliminates false positives through linguistic context analysis. Neither Caviard.ai nor competitors implement stance classification.

Caviard.ai uses regex patterns only (1 layer). anonym.legal's 3-layer architecture with stance classification achieves 92–98% recall vs. Caviard.ai's 60–75%.

3. Deterministic & Reproducible Results with 100% Audit Trail

Unlike Caviard.ai (regex patterns can conflict), anonym.legal guarantees: given the same input document, detection results are always identical (bit-for-bit consistency). This is critical for:

E-Discovery: Lawyers must prove what was redacted and why. Reproducible results withstand legal challenge.
Compliance Audits: Auditors can re-run redaction on documents and verify consistency across time periods.
Quality Assurance: Regression testing ensures algorithm updates don't degrade accuracy.
Regulatory Defense: When audited, organizations can show: "Same document, same detection results, every time."

Caviard.ai's regex results cannot be explained or reproduced by lawyers/auditors without access to its source code.

4. Comprehensive Entity Coverage: 260+ Types (vs. Caviard's ~30–50)

Caviard.ai's regex patterns cover approximately 30–50 types (names, emails, phone, SSN, credit card). anonym.legal detects 260+ types organized by category:

Government IDs: Australian Tax File, German Steuer-ID, UK National Insurance, passports (48 countries), driver's licenses, visa numbers, residence permits
Financial: IBAN, BIC, Bitcoin/Ethereum addresses, routing numbers, account numbers, credit card accounts, payment processor tokens
Medical: ICD-10 codes, medication names, hospital record IDs, genetic markers, lab values, patient ID formats
Technical: API keys, JWT tokens, SSH keys, database connection strings, AWS access keys, OAuth tokens
Legal: Court case IDs, attorney bar numbers, patent numbers, trademark numbers, lawsuit reference codes
Biometric: DNA sequences, fingerprint references, iris pattern data, facial recognition templates
Temporal: Dates of birth, appointment dates, event times, calendar entries
Communication: Email addresses, phone numbers, fax numbers, URLs, IP addresses (IPv4/IPv6), usernames

260+ vs. ~40 = 6.5× broader coverage. This ensures regulated industries (healthcare, legal, finance) don't accidentally leak domain-specific PII.

5. Multilingual & Country-Specific Detection (48 Languages)

Caviard.ai: English-centric, limited Unicode support. German text with umlauts (Müller Straße), Arabic text, Chinese ideographs may not be recognized.

anonym.legal: XLM-RoBERTa-large trained on 100+ languages. Recognizes person names, locations, and organizations across all 48 supported languages with equal accuracy. German umlauts (ü, ö, ä), French accents (é, è, ê), Arabic diacritics, Chinese character names—all handled natively. spaCy (25 languages) and Stanza (7 languages) provide neural NER for specific regions.

6. Audit Trail for Legal Compliance & E-Discovery

Each redacted entity includes:

Entity type: PERSON, EMAIL, LOCATION, ORGANIZATION, etc.
Confidence score: 0–100% (e.g., 94% confidence)
Detection method: Presidio pattern, spaCy NER, Stanza dependency, XLM-RoBERTa embedding, or Stance Classification
Character offset: Position in document (line:column)
Original text: What was detected (encrypted in vault)

This allows lawyers to review: "Entity 'John Smith' (offset 42:0) was detected as PERSON with 97% confidence via XLM-RoBERTa cross-lingual NER" and make informed decisions about redaction. Caviard.ai provides no audit trail.

7. Reversible Anonymization with Deanonymizer

anonym.legal's Deanonymizer service restores original data from encrypted redactions using AES-256-GCM decryption with session keys. This enables workflows where sensitive data must be temporarily hidden during sharing (e.g., document discovery), then restored by authorized recipients. Caviard.ai cannot reverse redactions (no deanonymization capability).

8. Zero-Knowledge Architecture with Mandatory Encryption

anonym.legal requires Argon2id password derivation (client-side KDF) + AES-256-GCM encryption for all uploads. User data encrypted before transmission. Servers cannot decrypt, even with court order. Schrems II compliant (supplementary measure: encryption with provider inability to decrypt).

Caviard.ai: 100% local browser processing (no cloud), but no encryption option for vault sync. anonym.legal's cloud option includes mandatory encryption for EU data residency requirements.

9. AI Entity Creation (50 Tokens/Definition)

Users can create custom PII patterns without manual regex coding. anonym.legal's AI Entity Creation feature teaches custom detectors using 50 tokens per creation/refinement. Examples: client case IDs, internal reference codes, proprietary terminology. Caviard.ai has no custom entity capability. Neither competitor offers AI-assisted entity creation.

10. Four Pricing Tiers with €3 Entry Price & Free Tier

anonym.legal's pricing structure emphasizes accessibility:

Free (€0): 200 tokens/month. Basic analysis, desktop, Office add-in. No API, no batch, no deanonymization.
Basic (€3): 1,000 tokens/month. Batch processing (50/day), deanonymization, REST API, encryption. Entry price for API integration.
Pro (€15): 4,000 tokens/month. Unlimited batch, MCP Server integration (Claude Desktop, Cursor).
Business (€29): 10,000 tokens/month. Highest limits, all features, priority support, custom SLAs.

Caviard.ai: Free (Chrome-only, ChatGPT/DeepSeek only, no features beyond basic redaction).

For organizations needing API, batch, deanonymization, anonym.legal's €3 entry point is dramatically cheaper than competitors.

11. Batch Processing with Token-Efficient Limits

Batch processing enables processing multiple documents simultaneously:

Free: 5 files/day, 20/month, 1 MB max
Basic: 50 files/day, 500/month, 5 MB max
Pro/Business: Unlimited files, 10–20 MB max

Enables enterprises to redact large document sets (legal discovery, GDPR subject access requests, HR batches) without per-document overhead. Caviard.ai: no batch processing.

12. 95.5% Production Accuracy (44 Tests, Publicly Documented)

anonym.legal publishes accuracy metrics from 44 production tests across multiple entity types and languages, achieving 95.5% precision. This transparency allows users and auditors to verify detection performance against claims.

Caviard.ai: Publishes no accuracy metrics. Regex-only systems typically achieve 60–75% recall and 15–30% false positive rates on contextual PII (unacceptable for healthcare/legal).

Detection Capability Comparison

Capability	anonym.legal	Caviard.ai
Detection Technology	3-layer NLP (Presidio + transformers + confidence)	Regex patterns only
Recall (True Positives Found)	92–98%	60–75%
Precision (False Positive Rate)	< 5%	15–30%
Determinism	100% (reproducible, audit trail)	Regex deterministic but unexplained
Confidence Scoring	Per-entity 0–100%	No scoring (all matches equal)
Entity Types	260+ across 48 languages	~30–50 regex patterns
Multilingual Support	48 languages (XLM-RoBERTa)	English-centric, limited Unicode
Government ID Recognition	Yes (48 countries)	No
Technical Secret Detection	Yes (API keys, tokens, SSH keys)	No
Encryption & Zero-Knowledge	Yes (Argon2id + AES-256-GCM)	Local browser only (good) but no encryption option
Platform Support	Web, extension, desktop	Chrome only
AI Platform Support	All (Claude, ChatGPT, Gemini, etc.)	ChatGPT, DeepSeek only
File Format Support	PDF, Word, Excel, PowerPoint, images	Text-only (chat input)
API & Automation	Yes (REST API, webhooks)	No
Compliance Certifications	GDPR, HIPAA, PCI-DSS, ISO 27001	None
E-Discovery Grade	Yes (audit trail, reproducible)	No (unexplained decisions)
Platform Access Methods	6 (web, desktop, Office, Chrome ext, MCP, API)	1 (Chrome only)
Office Integration	Yes (Word, Excel, PowerPoint, Microsoft 365)	No
AI Platform Support	All (Claude, ChatGPT, Gemini, etc.) via Chrome Extension	ChatGPT, DeepSeek only
MCP Server Integration	Yes (7 tools: analyze, anonymize, detokenize, balance, estimate, list_sessions, delete_session)	No
REST API	Yes (Basic+ plans, 100 req/min, 1 MB payload)	No
Zero-Knowledge Encryption	Yes (Argon2id + AES-256-GCM mandatory)	Local only (no encryption option)
Deanonymization	Yes (reversible with session keys)	No
AI Entity Creation	Yes (50 tokens per custom entity)	No
Batch Processing	Yes (Free: 5/day, Basic: 50/day, Pro/Business: unlimited)	No
Production Accuracy	95.5% (44 tests documented)	Unknown (~60–75% for regex-based)
Price	€0–€29/month (€3 API entry)	Free
Use Case	Enterprise, legal, healthcare (regulation-compliant)	Personal AI chat privacy (basic)

Compliance & Regulatory Implications

False Positive Rate in Legal E-Discovery

In litigation, a 15–30% false positive rate means reviewers spend hours examining text that isn't actually PII. For a 1,000-page document discovery, 15–30% false positives = 150–300 false alarms. Each false alarm requires attorney review time (billable hours). This cost multiplies across large document sets.

anonym.legal's <5% false positive rate and confidence scoring allow attorneys to set thresholds: review only entities with 90%+ confidence, batch-skip low-confidence matches.

False Negative Rate in Healthcare

Caviard.ai's 25–40% false negative rate means 1 out of every 3–4 PII items go undetected. In a healthcare redaction context, this is catastrophic. Patient names, medical record numbers, diagnosis codes—undetected PII leaks to opposing counsel or the public. HIPAA violations result in fines (€25,000–€1.5 million per incident, up to €15 million annually).

anonym.legal's 92–98% recall ensures 98% of PII is detected. The remaining 2% is manageable via manual review.

Audit Trail for Regulatory Defense

When a data protection authority audits an organization's redaction processes, they expect to see: "For each redacted entity, show the detection method and confidence score." anonym.legal provides this. Caviard.ai cannot. This alone disqualifies Caviard.ai from regulated industry use.

Schrems II & Zero-Knowledge Compliance

If a healthcare provider uses anonym.legal, they benefit from mandatory AES-256-GCM encryption (Schrems II-compliant, Hetzner Germany ISO 27001 hosting). Caviard.ai (100% local) doesn't require encryption but also doesn't offer data residency guarantees for organizations with sovereignty requirements.

anonym.legal Detection Specifications

Specification	Value
Version	7.4.4
Entity Types	260+ across 48 languages (government IDs, financial, medical, technical, legal, biometric)
Detection Layers	3-layer: Presidio (317 patterns) + spaCy/Stanza/XLM-RoBERTa + Stance Classification (BERT)
Recall (Sensitivity)	92–98% (context-dependent, vs. Caviard's 60–75%)
Precision (1 - False Positive Rate)	95%+ (< 5% false positive, vs. Caviard's 15–30%)
F1 Score	0.94–0.96 (balanced accuracy-recall)
Confidence Scoring	Per-entity 0–100% with detection method attribution
Detection Methods	Presidio pattern, spaCy NER, Stanza dependency, XLM-RoBERTa embedding, Stance Classification
Languages Supported	48 (all major + regional, including RTL)
Government IDs	48 countries (passports, tax IDs, social security, driver's licenses)
Determinism Guarantee	100% (identical input → identical output, bit-for-bit consistency)
Platform Access Methods	6: Web app, Desktop app, Office Add-in, Chrome Extension, MCP Server, REST API
Encryption Standard	AES-256-GCM (mandatory, client-side key, Argon2id KDF)
Zero-Knowledge Auth	Yes (password never transmitted, all auth client-side)
Deanonymization	Yes (reversible with session keys, restore original data)
AI Entity Creation	Yes (50 tokens per custom entity creation)
Infrastructure	Hetzner Germany, ISO 27001 certified
Data Retention	Zero (in-memory processing, user-controlled deletion)
Batch Processing	Free: 5/day, Basic: 50/day, Pro/Business: unlimited
File Formats	PDF, DOCX, XLSX, PPTX, images (OCR), text
Compliance	GDPR, HIPAA, PCI-DSS, German BDSG, NIS2, e-discovery
Audit Trail	Yes (per-entity type, confidence, detection method, offset)
MCP Server Tools	7: analyze_text, anonymize_text, detokenize_text, get_balance, estimate_cost, list_sessions, delete_session
API Rate Limit	100 requests/minute (Bearer token auth)
Pricing Tiers	€0 Free, €3 Basic, €15 Pro, €29 Business
Security Audits	Hardening audit (16 findings fixed), Cross-audit (14 findings, 13 fixed)

Limitations & Considerations

Integration Complexity: Organizations implementing this solution should expect comprehensive organizational assessment, compliance framework evaluation, and technical infrastructure review before deployment. Integration complexity varies based on existing systems, data workflows, and regulatory requirements.

Data Volume Scaling: Performance characteristics vary with data volume, document format diversity, and entity pattern complexity. Organizations processing high-volume document streams should conduct benchmark testing with representative samples to validate throughput and accuracy targets.

Team Training Requirements: Requires 2-4 weeks of onboarding for security and compliance teams to configure custom entity patterns, establish organizational policies, and integrate with existing workflows. Dedicated privacy engineering resources accelerate deployment.

Not for: Organizations without dedicated privacy engineering resources or regulatory compliance mandates may find simpler solutions more cost-effective. Best suited for teams with stringent data protection requirements (GDPR, HIPAA, CCPA).