Talk to the Team

Tell us about your stack and the privacy problems you're trying to solve. We typically respond within one business day.

Prefer email? support@philterd.ai

Prefer to skip the form? Pick a time on our calendar →
or send a message

Please do not enter PII or PHI in this form. If you need to share an example, use a sanitized one.

Start here

Six posts, from "what is PII" to protecting it in production. New here? Start at the top. Evaluating? The middle ones do the heavy lifting.

  1. Start here | Introducing PhiSQL: The Query Language for PII Operations

    What PII actually is (and isn't): the operational definition engineers and compliance teams can both work from.

  2. The architectural case | Why API-Based Redaction is a Security Antipattern

    Why sending sensitive data through a managed redaction API is a deeper mistake than it looks, and what to do instead.

  3. Evaluation | The TCO of "Free" Cloud PII Redaction: AWS Comprehend, Google DLP, vs Self-Hosted at Scale

    A worked-example TCO comparison of AWS Comprehend, Google DLP, and self-hosted Philter at production volumes. The break-even sits closer than most teams expect.

  4. Evaluation | Beyond Regex: Why General LLMs Fail at PII Discovery

    The technical case for purpose-built NLP lenses over generic LLMs: what accuracy looks like when your models were trained for the job.

  5. Implementation | Automating HIPAA Safe Harbor: A Blueprint for Healthcare Data Pipelines

    A concrete blueprint for automating HIPAA Safe Harbor de-identification across healthcare data pipelines: all 18 protected identifiers, end to end.

  6. Implementation | Building a Privacy-Aware RAG System

    How to keep PII out of vector stores, retrieval contexts, and the LLM's response: the architecture pattern for production RAG on regulated data.