Skip to main content Link Search Menu Expand Document (external link)

Data confidentiality

Data confidentiality in machine translation


Data confidentiality terms or policies govern how machine translation providers use client data.

Some machine translation providers use client data to train and improve their translation systems. Confidential data for machine translation include:

  • Client consumer user data
  • Client business transaction data
  • Client business data

Data confidentiality terms or policies ensure that machine translation providers comply with:

  • Contracts
  • Laws, regulation, and standards, like HIPAA or GDPR
  • Security reviews

Data types

Approaches

  • Client-specific - The data is not used to train models for any other clients.
  • No-trace - The request data is never stored, but only in memory for the lifetime of the request.
  • Encryption - The training and request data are not human-readable.
  • Auto-delete - The request data is cached, and deleted after a set period, for example after 72 hours.
  • Delete on request - The training data or request data is deleted upon the client’s request.
  • On-premise deployment (“on-prem”)
  • On-device models

Risk types

Confidential data can be exposed when:

  • Training data is accessed.
  • Request data is accessed.
  • Training data is included as part of translation output.

Edit this article →

Machine Translate is created and edited by contributors like you!

Learn more about contributing →

Licensed under CC-BY-SA-4.0.

Cite this article →