The Data Protection detector identifies, classifies, and manages the exposure of Personally Identifiable Information (PII) in LLM inputs and outputs. It is designed to prevent data leakage and ensure compliance with privacy regulations (GDPR, LGPD, CCPA) by detecting sensitive entities like names, documents, and contact details.Documentation Index
Fetch the complete documentation index at: https://docs.guardion.ai/llms.txt
Use this file to discover all available pages before exploring further.
Capabilities
The current modelpii-v0 is optimized for multilingual inputs and supports the following entity categories:
| Category | Label | Coverage Examples |
|---|---|---|
| Contact | CONTACT | Email addresses, phone numbers (mobile/landline), social media handles. |
| Document | DOCUMENT | National IDs (CPF, CNPJ, SSN), Passports, Driver’s Licenses (CNH/RG), Tax IDs. |
| Location | LOCATION | Street addresses, cities, states, zip/postal codes. |
| Personal | NAME | Full names, first names, and family names. |
Threshold Configuration
You can adjust the sensitivity of the detector using threshold levels. A lower threshold increases recall (catches more, potential false positives), while a higher threshold increases precision.| Level | Threshold | Confidence |
|---|---|---|
| L1 | 0.9 | Confident (Recommended for automation) |
| L2 | 0.8 | Very Likely |
| L3 | 0.7 | Likely |
| L4 | 0.6 | Less Likely |
Usage
1. Define a Policy
First, configure a guard policy to specific the behavior. You can choose to simply monitor or block/redact.2. Evaluate Content
Send the content to the Guard API. If PII is detected, the response will contain both a diagnosticbreakdown and a correction object containing the redacted text.
Redaction Format:
Detected entities are replaced with a vaulted token format: [CATEGORY_HASH].
- Example:
john@example.com→[CONTACT_A1B2C3D4]
API Example
In this scenario, a user submits a prompt containing a mix of contact information and government documents. The API detects these entities and returns a safe, redacted version.Request Payload
Response
The response flags the content as unsafe (flagged: true) and provides the sanitized text in the correction object.