DataPrivix
Menu
How it works

Data anonymization for logs, PDFs, and datasets

DataPrivix ingests logs, datasets, and PDFs, applies exclude rules up front, then masks identifiers and performs true PDF redaction before exporting safe artifacts — ready for sharing and analytics (PII / GDPR / RGPD / DCP).

DataPrivix anonymization workflowInput files move into the DataPrivix engine. Some are excluded and bounce back. Accepted files are masked and appear as cleaned output.Data Files(logs, datasets)Custom PDFsUnstructured PII dataDataPrivix EngineFiltering • Masking • PDF redactionRules: Email mask • IP mask • Token redactionExclude patternse.g. *.zip, secrets/*ONMask + redact PIIemail → *** , ip → ***.*.*.*ONSafe FilesReadyuser_id: ***ip: 192.168.*.*Redacted PDFsSent to Analytics DWExcludedRAW DATA SOURCESCLEANED OUTPUTACTIVE PROCESSING
1) Ingest
File-based inputs

Logs, datasets, and custom PDFs in one workflow.

2) Protect
Rules + redaction

Exclude patterns, mask PII, redact PDFs, keep structure.

3) Export
Safe artifacts

Cleaned output you can share internally or send to analytics.

Data anonymization software • Free / Pro / Enterprise
PIIGDPRRGPDDCP

Securely anonymize logs, PDFs, and datasets — fast and offline-first.

DataPrivix helps teams anonymize sensitive data before sharing it. Remove identifiers and secrets from logs and exports, and apply true PDF redaction for native-text PDFs — while preserving structure and readability.

Inputs
files • folders • archives
Output
.tar.gz (structure preserved)
Deploy
wheel • Docker • offline
Redaction preview
Pro
Input
user=bob email=bob@acme.com ip=10.42.12.9
Authorization: Bearer eyJhbGciOi...
Output
user=<REDACTED> email=<REDACTED> ip=<REDACTED>
Authorization: Bearer <REDACTED>
Rules
JSON (v1/v2)
Mode
Offline-first
Designed for
Support workflows

Preserve structure, keep artifacts usable.

Deployment
Wheel + Docker

Fast installs, reproducible builds.

Trust
Offline-first

Keep sensitive data inside your network.

Benefits

Reduce risk, accelerate collaboration

Anonymize before sharing. Keep your data useful. Move faster with confidence.

Share safely

Remove identifiers and secrets from logs and exports while keeping context.

Scale to large bundles

Parallel processing and fast I/O patterns help tackle large support bundles (Pro).

Stay in control

Configure rules, exclude patterns, logging, and deployment mode to fit your environment.

Highlights

Everything you need for file-based anonymization

A practical toolbox: from basic redaction to enterprise-ready deployment patterns.

Built for files, not demos

Anonymize folders, single files, and archives while preserving structure for support workflows.

Rules you control

Start with baseline redaction patterns, then extend with your own rules for your domain.

Filename anonymization

Optionally anonymize output file and folder names to avoid leaking hostnames or IDs (Pro).

Preview before you ship

Validate outputs and iterate faster with an interactive preview mode (Pro).

Parallel processing

Speed up large bundles with configurable concurrency (Pro).

Profiling & suggestions

Find likely sensitive fields and generate rule suggestions to reduce blind spots (Pro).

Offline-first

Designed for on-prem and restricted environments. Keep data in your network.

Major upgrade

Secure PDF Redaction

Remove sensitive data from documents with true, irreversible redaction.

True redaction (not just visual masking)

Remove underlying PDF text so copy/paste and extraction can’t recover the original values.

Secure PDF anonymization for sharing

Redact identifiers in invoices, reports, and ticket attachments before vendor or cross-team sharing.

Maintain structure and readability

Preserve layout for reviewers so the document remains usable in audits and incident workflows.

Rules-driven consistency

Apply the same policy-driven rules you use for logs and exports, now extended to native-text PDFs.

Learn how DataPrivix approaches true redaction (and why masking can fail) on the PDF redaction tool page.

Secure PDF Redaction

Text is permanently removed — not just hidden

Pro
Document preview
searchable-text PDF
Invoice:ACME-2026-031Customer:REDACTED
Email:REDACTEDToken:REDACTED
What stays intact
  • • Page layout
  • • Tables & headings
  • • Readability
  • • Usability for support

Note: True redaction removes underlying text content so it can’t be recovered by copy/paste or text extraction.

Credibility note

Designed for secure data handling: offline-first workflows, deterministic rule evaluation, and exportable artifacts for review.

Demo

See DataPrivix in action

A quick walkthrough of Pro workflows: preview mode, advanced rules, and large-bundle readiness.

What you’ll learn in 90 seconds
  • • How anonymization preserves structure while removing identifiers
  • • What Pro unlocks: filename anonymization, preview mode, parallel processing, advanced actions
  • • Why offline-first delivery matters for security reviews
Pro edition walkthrough
Pro

Preview anonymization, use Pro rule actions, and scale to large bundles.

Use cases

Logs are just the beginning

DataPrivix supports many real-world artifacts teams share every day.

Support bundles

Anonymize and archive diagnostic trees for vendor support.

Text exports

Mask identifiers in CSV-like exports and plain text dumps.

Application logs

Redact secrets and identifiers without losing traceability.

Diagnostics

Sanitize config snippets, traces, and command outputs.

Incident reviews

Prepare data for cross-team reviews and post-mortems.

Pre-production

Share sanitized samples to reproduce issues faster.

Solutions

Start from your workflow

Dedicated pages for common buyer intents: PDF redaction, log anonymization, and data anonymization software.

PDF Redaction Tool

True redaction for native-text PDFs—built for secure sharing, audits, and vendor workflows.

Log Anonymization

Anonymize logs while preserving debugging value—PII, secrets, identifiers, and joinability.

Data Anonymization Software

Build a repeatable pipeline for file-based anonymization across exports, diagnostics, and datasets.

Editions

Start Free. Unlock Pro when you’re ready.

Free covers the basics. Pro accelerates large workflows and adds premium capabilities.

Free
€0

For evaluation and light workloads with basic anonymization.

  • Basic anonymization (logs, exports, diagnostics)
  • Legacy replacement rules (fixed string replace)
  • Sequential processing
  • Workload limits (10 MB file / 50 MB archive; 200 MB extracted; 200 files)
  • CLI + DataPrivix Console
Pro
Contact

For operational datasets and support bundles with speed, preview, and advanced actions.

Includes PDF Redaction
  • Enterprise-grade document anonymization
  • True PDF redaction (native text PDFs)
  • Rules v2 + built-in actions
  • Masking, redaction, secure hash, bucketing
  • Anonymize file & folder names
  • Parallel processing
  • Batch processing (multiple inputs per run)
  • No Free-edition workload limits
  • Preview mode
  • Profiling mode
  • Safe cancel + partial preservation
  • Commercial license (signed key)
Enterprise
Custom

For regulated environments, large deployments, and integrations.

Includes PDF Redaction
  • All Pro features
  • Spark engine connect
  • Onboarding + support
  • Custom integrations
  • Private deployment options
FAQ

Answers to common questions

If you need a security review or a custom license, contact sales.

Ready to see it on your data?

Try the demo flow or talk to us about Pro and Enterprise licensing.