Company
Founded in Switzerland.
Artificial Intelligence Suisse SA, PO 280, Delemont, Switzerland.
Open-source and enterprise PII masking datasets for training privacy-preserving NLP models, ranging from foundational research data to production-grade multilingual European collections.
The largest collection: 2M+ synthetic PII examples across 32 European locales and 98 entity types, spanning 5 industry verticals.
Rows: 2M+
Locales: 32
Entities: 98
License: CC-BY-4.0 + Enterprise
1M+ examples across 8 languages and 11 regions: an open core (580K) plus 5 enterprise industry datasets, with accompanying models and a live demo.
Rows: 1M+
Languages: 8
Entities: 40+
License: CC-BY-4.0 + Enterprise
The original foundational datasets: 4 releases ranging from 65K to 400K examples, supporting privacy-masking research since 2024.
Rows: 400K+
Languages: 6
Entities: 17-54
License: Variable
We can generate bespoke PII datasets tailored to your domain, locale requirements, and entity types. Reach out to discuss your needs.