Talk to an Expert

Tell us about your stack and the privacy problems you're trying to solve. We typically respond within one business day.

Prefer to skip the form? Pick a time on our calendar →
or send a message

Please do not enter PII or PHI in this form. If you need to share an example, use a sanitized one.

← All lenses

PII Lens

COVID-19

Pandemic-era documents have a vocabulary that pre-2020 healthcare models don't fully cover. Use this lens alongside Healthcare for clinical text from 2020-onward.

  • Status available
  • License Apache-2.0
  • Version 1.0.0
  • Updated 2026-05-22
  • PhEye compatibility >=1.0.0
  • Languages en
  • Model size 75 MB
  • Author Philterd

Entities detected

  • TEST_RESULT
  • VACCINE_BATCH
  • VARIANT
  • PANDEMIC_CLINICAL_TERM

When to load this lens

Add this lens when the corpus contains COVID-era clinical text — test results, vaccination records, variant mentions, pandemic-specific clinical references.

Pairs well with

  • Healthcare — Clinical-text lens trained for entities that matter in EHR exports, clinical notes, discharge summaries, and medical-chatbot transcripts — higher recall than general NER on the healthcare-specific surface.
  • Hospital Identifiers — Narrower healthcare-adjacent lens for environments where hospital and room identifiers are the binding constraint — bed-management systems, patient-flow analytics, discharge planning tools.

What this lens detects

Four entity classes specific to pandemic-era clinical text:

  • Test result terminologySARS-CoV-2 positive, antigen-negative, PCR confirmed, rapid test negative, etc. The vocabulary was barely-existent in pre-2020 clinical corpora and a Healthcare-lens-trained-on-2018-data misses most of it.
  • Vaccine batch identifiers — lot numbers, batch codes, manufacturer-specific identifiers from the Pfizer / Moderna / J&J / AstraZeneca campaigns.
  • Variant names — Alpha, Beta, Gamma, Delta, Omicron, BA.5, JN.1, etc., as named entities. (Useful where variant identification is part of the clinical context.)
  • Pandemic-specific clinical termsmonoclonal antibody, convalescent plasma, Paxlovid, terms that entered routine clinical vocabulary during the pandemic.

This is a supplemental lens. It doesn’t try to detect generic clinical entities — load it alongside the Healthcare lens, which handles the broader clinical vocabulary.

When to use this

  • Clinical text from 2020-onward that touches respiratory illness, vaccination, or infectious-disease workflow.
  • Vaccination-record processing for compliance, employer attestation, or research.
  • Post-COVID research corpora where the variant and treatment vocabulary needs to be preserved structurally (and redacted, if appropriate).

Known limitations

  • Vocabulary drift. The COVID clinical vocabulary continues to evolve (new variants, new treatments, new test types). The lens is versioned; future versions will track the evolution.
  • Distinguishes COVID-era PHI but not COVID-era diagnoses. This is a PII lens, not a clinical-coding tool. It finds entities to redact; it doesn’t categorize them as protected vs unprotected — that’s the policy engine’s job.

Use this lens with PhEye, Phileas, or Philter

PhEye loads this lens at configuration time and exposes it to Phileas and Philter automatically. Have questions about a specific deployment? Talk to the team.

About PhEye →