Talk to the Team

Tell us about your stack and the privacy problems you're trying to solve. We typically respond within one business day.

Prefer email? support@philterd.ai

Prefer to skip the form? Pick a time on our calendar →
or send a message

Please do not enter PII or PHI in this form. If you need to share an example, use a sanitized one.

← All lenses

PII Lens

English Names (Medium)

Mid-size English person-name detector, fine-tuned from GLiNER medium on NVIDIA Nemotron-PII. The recommended default in the ph-eye-pii-en family; Phileas handles structured identifiers via its pattern-based layer.

  • Status available
  • License CC-BY-4.0
  • Version 1.0.0
  • Updated 2026-06-17
  • PhEye compatibility >=1.0.0
  • Languages en
  • Model size 745 MB
  • Author Philterd

Entities detected

  • PERSON

When to load this lens

Load this lens for English person-name detection when you want the best accuracy/latency balance. It is the recommended default for most workloads, and a focused PERSON detector; emails, phone numbers, IDs, and other structured PII are handled by Phileas's pattern-based detection, not this model.

Pairs well with

  • English Names (Small): Low-latency English person-name detector, fine-tuned from GLiNER small on NVIDIA Nemotron-PII. The on-device size in the ph-eye-pii-en family; Phileas handles structured identifiers via its pattern-based layer.
  • English Names (Large): Highest-capacity English person-name detector, fine-tuned from GLiNER large on NVIDIA Nemotron-PII. The server-side size in the ph-eye-pii-en family; Phileas handles structured identifiers via its pattern-based layer.

What this lens detects

  • PERSON: people’s names as they appear in English text.

This is a name-only lens. Emails, phone numbers, SSNs, credit cards, IP addresses, and other structured PII follow regular patterns and are detected by Phileas’s pattern-based (regex, checksum, and dictionary) layer, not by this model. Compose this lens with that layer for full coverage.

Why this lens

This is the mid-size, recommended-default member of the ph-eye-pii-en family, fine-tuned from urchade/gliner_medium-v2.1 (DeBERTa-v3-base) on the synthetic nvidia/Nemotron-PII dataset. It is the balance point of the three sizes: stronger generalization than the small lens, a smaller footprint than the large one. Like its siblings it is recall-leaning by design, since in redaction a missed name is a leak while an extra span is only over-redaction. A confidence threshold around 0.70 is a sensible starting operating point; lower it to push recall higher.

When to use this

  • The default English name detector for most documents and workloads.
  • Server-side or GPU-accelerated pipelines that want accuracy without the largest model’s footprint.
  • As the English name detector composed with Phileas’s pattern-based detection for structured PII.

Known limitations

  • Names only. This lens detects PERSON. Other PII is handled by Phileas’s pattern-based detection; compose accordingly.
  • English only. For other languages, load the corresponding language lens when available.
  • Trained on synthetic data. Reported accuracy is in-distribution on Nemotron-PII and is a ceiling, not a production guarantee; validate precision and recall on your own text. The model is recall-leaning, so expect some over-redaction and tune the threshold to your precision/recall balance.
  • The underlying model is licensed CC-BY-4.0; the Nemotron-PII training data requires attribution to NVIDIA.

Use this lens with PhEye, Phileas, or Philter

PhEye loads this lens at configuration time and exposes it to Phileas and Philter automatically. Have questions about a specific deployment? Talk to the team.

About PhEye →