Self-hosted PII redaction

Redact PII and PHI. Keep your data yours.

Open source, self-hosted PII and PHI redaction software that runs entirely on your desktop or in your cloud, built for healthcare, finance, legal, and government workloads. Did we mention it's all open source?

Open Source Redaction Software → Consulting services

Redacting PII and PHI since 2017.

Trusted in

Healthcare
Finance
Legal
AI & Machine Learning
Government
Contact Centers
Insurance
AI Training Data
Education

Philter

Redact PII and PHI on the desktop or at pipeline scale

Redact sensitive documents on your computer, or redact text at scale inside your own pipelines.

On your desktop

Philter Desktop

PC-based redaction for small law firms, solo practitioners, and clinics. Redact .txt, .docx, and .pdf files on a single Windows computer, with no server to stand up and nothing sent to a cloud service. Everything runs on your computer.

Download Learn more →

In your pipelines

Philter

The turnkey, self-hosted REST API redacts PII and PHI at scale inside your own cloud or data center, built for healthcare, finance, legal, and government text workloads. Call it from your services and it never sends data to a third party.

$ curl http://localhost:8080/api/filter \
    --data "His SSN was 123-45-6789." \
    -H "Content-type: text/plain"

His SSN was ***********.

Launch → Learn more →

Consulting

Your trusted PII redaction partner

Bring in the team that built the software. We assess what you need and deliver an implementation plan, then either build it inside your own cloud and validate it on your data, or hand the plan to your engineers to build. Either way you own the result, and you work directly with the people who wrote the code, not a vendor you renew every year.

See how we work → Book a 30-min call

Setup and Handoff

We stand up PII redaction in your own cloud, configure and validate it on your data, then hand you a running system you own. No in-house engineering team required. If you would rather build from the Discovery plan yourself, that works too.

Privacy Architecture

We design end-to-end PII protection for your data and AI workloads: data flows, redaction layers, audit trails, and the guardrails that keep generative-AI features aligned with HIPAA, GDPR, and CCPA.

Custom Detection Models

Off-the-shelf models miss the identifiers that matter most in your domain. We train specialized PII and PHI detectors on your data, measured against precision and recall you can put in front of an auditor.

Recent blog posts

Practical posts on PII redaction, AI privacy, and self-hosted compliance. View all posts →

July 30, 2026 · Philter, Redaction

Why Your LLM Provider's Built-In Guardrails Are Not Enough

OpenAI, Anthropic, Google, and AWS offer PII filtering, but it all runs after your data leaves your network. Why filtering out is not the same as never sending.

Read post →

July 30, 2026 · Philter, Redaction

Redact Both Sides of the LLM: Inputs and Outputs

Prompt redaction protects the provider boundary. Output scanning protects everything downstream. An LLM needs PII redaction on both sides, not one.

Read post →

July 23, 2026 · Redaction, Philter

Can an LLM Leak Its Training Data, and Why You Cannot Un-Train PII

Research shows LLMs memorize and leak training data, and unlearning it afterward is unreliable. The dependable control is to redact before training.

Read post →