PII Lens

English Names (Medium)

Mid-size English person-name detector, fine-tuned from GLiNER medium on NVIDIA Nemotron-PII. The recommended default in the ph-eye-pii-en family; Phileas handles structured identifiers via its pattern-based layer.

Status available
License CC-BY-4.0
Version 1.0.0
Updated 2026-06-17
PhEye compatibility >=1.0.0
Languages en
Model size 745 MB
Author Philterd

Entities detected

PERSON

When to load this lens

Load this lens for English person-name detection when you want the best accuracy/latency balance. It is the recommended default for most workloads, and a focused PERSON detector; emails, phone numbers, IDs, and other structured PII are handled by Phileas's pattern-based detection, not this model.

Pairs well with

English Names (Small): Low-latency English person-name detector, fine-tuned from GLiNER small on NVIDIA Nemotron-PII. The on-device size in the ph-eye-pii-en family; Phileas handles structured identifiers via its pattern-based layer.
English Names (Large): Highest-capacity English person-name detector, fine-tuned from GLiNER large on NVIDIA Nemotron-PII. The server-side size in the ph-eye-pii-en family; Phileas handles structured identifiers via its pattern-based layer.

What this lens detects

PERSON: people’s names as they appear in English text.

This is a name-only lens. Emails, phone numbers, SSNs, credit cards, IP addresses, and other structured PII follow regular patterns and are detected by Phileas’s pattern-based (regex, checksum, and dictionary) layer, not by this model. Compose this lens with that layer for full coverage.

Why this lens

This is the mid-size, recommended-default member of the ph-eye-pii-en family, fine-tuned from urchade/gliner_medium-v2.1 (DeBERTa-v3-base) on the synthetic nvidia/Nemotron-PII dataset. It is the balance point of the three sizes: stronger generalization than the small lens, a smaller footprint than the large one. Like its siblings it is recall-leaning by design, since in redaction a missed name is a leak while an extra span is only over-redaction. A confidence threshold around 0.70 is a sensible starting operating point; lower it to push recall higher.

When to use this

The default English name detector for most documents and workloads.
Server-side or GPU-accelerated pipelines that want accuracy without the largest model’s footprint.
As the English name detector composed with Phileas’s pattern-based detection for structured PII.

Known limitations

Names only. This lens detects PERSON. Other PII is handled by Phileas’s pattern-based detection; compose accordingly.
English only. For other languages, load the corresponding language lens when available.
Trained on synthetic data. Reported accuracy is in-distribution on Nemotron-PII and is a ceiling, not a production guarantee; validate precision and recall on your own text. The model is recall-leaning, so expect some over-redaction and tune the threshold to your precision/recall balance.
The underlying model is licensed CC-BY-4.0; the Nemotron-PII training data requires attribution to NVIDIA.

Use this lens with PhEye, Phileas, or Philter

PhEye loads this lens at configuration time and exposes it to Phileas and Philter automatically. Have questions about a specific deployment? Talk to the team.

About PhEye →