Philter
Turnkey, self-hosted PII redaction with a clean API. Drops into any pipeline that needs sensitive data removed from text — and runs entirely inside your cloud.
Community
Philterd is open source software for PII and PHI redaction. The toolkit, the issues, the conversations — all happen in public. Here’s how to participate.
10 Apache 2.0 projects under github.com/philterd — read the code, file issues, contribute pull requests.
Turnkey, self-hosted PII redaction with a clean API. Drops into any pipeline that needs sensitive data removed from text — and runs entirely inside your cloud.
The core redaction, anonymization, masking, and replacement library underneath Philter. Available in Java, Python, .NET, and Go.
The trained AI and NLP models that find PII and PHI in text, plus the service that hosts them. Designed to plug directly into Phileas and Philter.
High-speed discovery scanner that crawls files and storage to map where sensitive information actually lives across your environment.
Intelligent monitoring that tracks PII flow across the organization and alerts on suspicious activity or unusual trends.
Drop-in proxy that redacts PII and PHI before prompts reach LLM providers like OpenAI and Anthropic Claude.
Standalone audit tool that scores redaction policies on precision and recall, so policy changes can be measured rather than guessed at.
Human-in-the-loop PII redaction. Search, review, and override automated detection decisions with structured exemption codes — built for AI training-data prep and regulated everyday workflows.
Privacy-first analytics that applies differential privacy to PII counts, preserving statistical utility without exposing individuals.
Web console that lets non-technical users build and deploy redaction rules through a visual, no-code interface.
Ask questions, share patterns, propose features. Discussions live alongside each repo’s issues — start at github.com/orgs/philterd/discussions.
Found a bug? File it in the relevant repo’s issue tracker — we triage weekly. For security-sensitive reports, see SECURITY.md in any repo.
Need an SLA, a custom evaluation on your data, or hands-on engineering? Get in touch — we work directly with healthcare, finance, legal, and government teams.
Pull requests welcome — bug fixes, new entity detectors, language support, docs improvements. Each repo has its own CONTRIBUTING.md.
Issues labeled good first issue across the philterd org.
No "good first issue" tickets are open right now — nice problem to have. Browse the repos directly for other ways to contribute, or open an issue with your idea.
Each project ships independently. Per-repo release feeds:
Infrequent updates of technical posts and community highlights.
Philterd is built and maintained by Jeff Zemerick — PMC Chair of Apache OpenNLP and primary developer behind the entire toolkit. The contact form below reaches Jeff directly.