Anonymisation of personal data

Protecting personal data such as name, address, and date of birth, as well as other identifiers, play an important role in every company, especially with regards to the EU General Data Protection Regulation (GDPR). If documents have to be processed outside the company for procedures such as external audits, archiving or the like the personal information in data has to be anonymized beforehand.

Simple keyword-based methods aren’t able to carry out automatic anonymization, especially if the quality of the OCR-documents is fluctuating. The Glanos DataSphere combines semantic and linguistic methods with extensive dictionaries, which makes it possible to identify and anonymize sensitive information such as name, address and date of birth with just a small amount of training data anonymized by hand, even in documents with bad OCR quality.

Advantages of our solutions:

  • Customers get access to the Glanos DataSphere locally or online to control the automatic automatization their own.
  • Reliable anonymization with the help of linguistic and AI-based methods from the first data set.
  • Improvement of results after providing only a few documents as an example.
  • Flexible pseudonymization and k-anonymization of sensitive data.
  • Continuous improvement of quality thanks to self-learning algorithms integrated into the DataSphere.
Search