ops0ops0

Sensitive Data Detection

ops0 integrates with Microsoft Presidio to automatically detect personally identifiable information (PII) and other sensitive data in your infrastructure configurations. This helps prevent accidental exposure of sensitive data in IaC code, configuration files, and discovered resources.


What Gets Scanned

SourceDescription
IaC CodeTerraform files, variable values, and outputs
Configuration FilesAnsible playbooks, Kubernetes manifests
Discovered ResourcesTags, metadata, and configuration data from cloud scans
Environment VariablesValues passed to deployments

Detected Data Types

Presidio can identify a wide range of sensitive data patterns:

CategoryExamples
Personal InformationNames, email addresses, phone numbers
FinancialCredit card numbers, bank account numbers, IBAN
Government IDsSocial security numbers, passport numbers, driver's license
HealthcareMedical record numbers
NetworkIP addresses, URLs with credentials
AuthenticationAPI keys, passwords, tokens

How It Works

Automatic Scanning

When you create or modify IaC code, ops0 sends the content to the Presidio analyzer for PII detection.

Detection Results

Presidio returns any detected sensitive data entities with confidence scores and entity types.

Alerts

If sensitive data is detected, ops0 flags the affected files and shows a warning before deployment.

Remediation

You can replace hardcoded sensitive values with variable references, vault lookups, or environment variables.


Confidence Scores

ScoreMeaning
High (0.8-1.0)Strong pattern match, very likely sensitive data
Medium (0.5-0.79)Possible sensitive data, review recommended
Low (0.0-0.49)Weak match, likely a false positive

Integration with Policies

Sensitive data detection works alongside ops0's policy engine. You can create policies that block deployments containing detected PII, or set them to warn-only mode for less critical environments.


Troubleshooting

False Positives
Some infrastructure values (like CIDR blocks or resource IDs) may trigger false positives. Review the confidence score and entity type to determine if the detection is accurate.
Detection Not Running
The Presidio analyzer service must be running and healthy. Check the service status in your deployment dashboard.