Purpose-built models
Not a generic LLM. PhEye serves NLP models trained specifically for PII and PHI entity recognition: higher precision, faster inference, and a tiny fraction of the compute cost of an LLM at the same task.
Not a generic LLM. PhEye serves NLP models trained specifically for PII and PHI entity recognition: higher precision, faster inference, and a tiny fraction of the compute cost of an LLM at the same task.
Swap models per workload: general-purpose, healthcare, multilingual, and more. Browse the lens catalog →
Deploy PhEye alongside the data. Sensitive text never leaves your infrastructure: no third-party API, no model-provider account, no outbound dependency.
Standard Docker images run inference on CPU with no special hardware required. GPU-accelerated images (built on PyTorch with CUDA) are available for workloads that need faster throughput or handle high request volume.
PhEye is the default model server for both Phileas and Philter; wire it in via configuration. Or call its HTTP API directly from anything that speaks JSON.
Every detection comes with a numeric confidence score between 0 and 100. Tune precision and recall by filtering at a threshold: accept everything above 75, drop everything below 50, decide policy by entity type.
PII lenses are swappable AI / NLP models that plug into PhEye at configuration time. Philter (or Phileas, or any HTTP client) calls PhEye's /find endpoint with text; PhEye runs the loaded lenses, merges their detections, and returns entities with confidence scores. The calling code never has to know which lenses are loaded.
Each PhEye Docker image ships with one model baked in at build time. Select the model that matches your language and entity type, or run multiple containers in parallel for broader coverage. Custom models can be developed on request; contact us to discuss your requirements.
| Model | Language | Entities detected | Notes |
|---|---|---|---|
| pii_base | English | Person, location, org, phone, email, date, URL, and more | General-purpose PII detection. Broad coverage across common entity types — the right starting point for most English-language workloads. Powered by philterd/ph-eye-pii-base (GLiNER). |
| hospitals | English | Hospital names, room numbers, clinical providers | Specialized for healthcare facility identifiers. Detects hospital names, room and ward numbers, and clinical provider references in clinical notes and administrative text. Powered by knowledgator/gliner-pii-base-v1.0 (GLiNER). |
| medical_conditions | English | Disease, disorder | Identifies medical conditions and disease/disorder mentions in clinical or biomedical text. Powered by blaze999/Medical-NER (Transformers token-classification). |
| french_persons | French | Person | Person-name detection in French-language text, using a multilingual news-trained model. Powered by EmergentMethods/gliner_medium_news-v2.1 (GLiNER). |
| french_medical | French | Disease (Maladie) | Medical condition detection in French-language clinical text. Powered by almanach/camembert-bio-gliner-v0.1 (GLiNER, CamemBERT backbone). |
Need a model that isn't listed? Talk to the team about custom model support.
Three ways to get going: deploy the open source yourself, spin it up from a cloud marketplace, or work with our team directly. Pick the path that fits.