Presidio - Data Protection and De-identification
An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.
I don't have an immediate use for this now, but there's been times I sure wish I knew about or had this.