Setup and Handoff
We stand up PII redaction in your own cloud, configure and validate it on your data, then hand you a running system you own. No in-house engineering team required.
Consulting
Bring in the team that built the software. We design, deploy, and validate PII redaction inside your own cloud, then hand you a system you own outright. No in-house engineering team required.
Engagements are led by Jeff Zemerick, the creator of Philter and the Philterd open source toolkit. You get the person who wrote the code, not a sales engineer reading from a runbook. We have deployed redaction pipelines processing millions of records daily across healthcare, financial services, and government, including PHI redaction projects under HIPAA.
Three ways we plug in, from a full build to training models on your data.
We stand up PII redaction in your own cloud, configure and validate it on your data, then hand you a running system you own. No in-house engineering team required.
We design end-to-end PII protection for your data and AI workloads: data flows, redaction layers, audit trails, and the guardrails that keep generative-AI features aligned with HIPAA, GDPR, and CCPA.
Off-the-shelf models miss the identifiers that matter most in your domain. We train specialized PII and PHI detectors on your data, measured against precision and recall you can put in front of an auditor.
Every engagement runs the same three phases, and ends with you owning the result.
We learn how your firm works, your compliance requirements, and where sensitive data lives. No slides, no pitch. 30 minutes.
We design the redaction layer and deploy it inside your cloud. You see every step; nothing is a black box.
You own the system outright: the open source software, the infrastructure, and the operational knowledge. No ongoing license, no vendor lock-in. We stay available whenever you want us back.
Want to see a specific engagement end to end? AI training-data de-identification walks through the phases, the deliverables you keep, how your data is handled, and how we scope it.
We work across regulated industries where a PII leak carries real consequences.
HIPAA Safe Harbor de-identification, clinical NLP, PHI redaction for research and analytics pipelines.
PCI scope reduction, GLBA compliance, PII redaction for banking and fintech data flows.
Court filing redaction, e-discovery, FRBP 9037 compliance for law firms and legal tech.
FOIA processing, FedRAMP-ready deployments, GovCloud and air-gapped environments.
LLM prompt guardrails, RAG pipeline redaction, training data de-identification.
Claims processing, underwriting pipelines, GLBA and NAIC compliance.
A few engagements we have delivered.
A law firm with no in-house engineers needed PII removed from federal bankruptcy filings under Rule 9037. We designed the redaction, stood up the AWS deployment, and handed back a system that redacts documents automatically as staff save them.
Read the case study →Embedded real-time PII redaction into a bilingual (English/French) patient chatbot so sensitive information is stripped before messages reach human agents or analytics storage.
Read the case study →Deployed Philter inside an AWS data pipeline to de-identify clinical narrative text flowing from an EHR into an analytics database, enabling research access without HIPAA restrictions.
Read the case study →Three principles shape everything we build: your data never leaves your perimeter, the engine is open source and auditable, and the models are purpose-built for PII and PHI.
Philter and the rest of the Philterd toolkit run inside your cloud. Your data never leaves your perimeter, never reaches a third-party API, and never lands in someone else's logs.
Transparency is the only way to verify privacy software. Our core engine is Apache 2.0 licensed, so your engineers can read every line, audit every decision, and extend the stack on their own terms.
Generic LLMs make poor privacy filters. We train and ship specialized NLP and deep-learning models built specifically for PII and PHI detection. They are accurate, tunable, and operationally affordable at scale.
Describe the sensitive data your firm handles and we'll show you how we'd set up redaction in your environment. We'll get back to you within one business day.