Common deployments
1. Healthcare chatbots. Patient-facing or provider-facing chatbots that call hosted LLMs. The user message can contain anything (“my mom Linda is 72 with hypertension and her MRN is…”). Philter AI Proxy with the medical chatbot policy strips identifiers before the prompt reaches the LLM, preserves clinical context for the model to actually be useful. Architecture.
2. Enterprise RAG systems. A company-internal RAG system trained on contracts, support tickets, customer correspondence, or product documentation that contains PII. Standard RAG architectures index unredacted documents, which means anyone with query access can retrieve PII. Privacy-aware RAG architecture inserts the redaction at ingestion, before chunking and embedding.
3. Fine-tuning corpora. Teams fine-tuning open-source models on real customer data (support tickets, transcripts, internal communications). The training-data prep policy handles the aggressive redaction needed for training data — once PII is in the model weights it’s extractable, sometimes years later.
4. Outbound LLM gateways. Some enterprises proxy ALL outbound LLM traffic through a central gateway for governance, cost control, and PII guardrails. Philter AI Proxy is the PII layer in that gateway. Single point of policy enforcement across every team using LLMs in the org.
What teams need to be careful about
- Post-hoc filtering doesn’t fix training data. If you trained on un-redacted PII, no inference-time filter recovers from that. Carlini et al.’s extraction research demonstrated extractability years after training. Pre-training redaction is the only reliable defense.
- Embedding inversion is real. Published research shows you can reconstruct text chunks from their vector representations, especially with smaller embedding models. The “it’s just numbers in the vector store” argument doesn’t hold for sensitive content.
- Defense in depth. Redacting prompts via Philter AI Proxy doesn’t eliminate the BAA requirement (for HIPAA workloads) or the data-processing agreement (for GDPR). It’s a layer; the legal agreements are a separate layer.
- Tokens matter for downstream models. If your downstream LLM is fine-tuned to expect specific tokens (
<patient>,[NAME]), align the redaction format with what the model expects. Mismatch hurts accuracy.