Security

Translation Data Retention Questions to Ask Before Uploading Files

June 2, 2026 広報スタッフ

Translation Data Retention Questions to Ask Before Uploading Files

You need to translate a contract, a financial report, or an employee document. You find an online translation tool, upload the file, and get the translation. What happens to that file after you close the browser tab?

Most people never ask. But the answer matters, especially when the documents contain sensitive business information, personal data, or content subject to regulatory requirements. This article lists the data retention questions you should ask before uploading files to any translation service.

Why Data Retention Matters for Translation

When you upload a document to a translation service, you are sending your content to a third-party server. That content might include:

  • Client names, addresses, and contact information
  • Financial figures and pricing data
  • Employee personal information
  • Proprietary processes or trade secrets
  • Contract terms and legal obligations
  • Product roadmaps and strategic plans

If the translation service stores your documents indefinitely, that data remains on someone else's infrastructure subject to their security practices, their breach risk, and their terms of service. If the service uses your content to train AI models, your confidential information may influence the output other users receive.

The FTC provides guidance on vendor security that is directly relevant to translation services. Understanding how your data is handled after translation is a core part of vendor evaluation.

Source: https://www.ftc.gov/business-guidance/small-businesses/cybersecurity/vendor-security

Essential Questions to Ask

Question 1: How Long Are Uploaded Documents Stored?

Some translation services process your document and delete it immediately. Others store it for hours, days, or indefinitely. Ask for a specific retention period.

What to look for:

  • Immediate deletion after processing: The document exists on the server only during active translation.
  • Short-term retention: Documents are stored for a defined period (24 hours, 7 days) and then automatically deleted.
  • Indefinite retention: Documents remain on the server until you manually delete them — or may never be deleted.
  • Retention tied to account status: Documents are deleted when you close your account.

The shorter the retention period, the lower the risk. If the vendor does not specify a retention period, ask directly. If they cannot give you a clear answer, that is a red flag.

Question 2: Is My Content Used to Train AI Models?

Some AI translation services use uploaded content to improve their models. This means your documents may influence the translations other users receive. For business documents containing proprietary information, this is a significant concern.

What to look for:

  • Explicit opt-out: The service allows you to disable model training on your content.
  • Enterprise accounts excluded: Business and enterprise plans do not use content for training, even if free plans do.
  • No training on any content: The service does not use any uploaded content for model training.

Check the terms of service, not just the marketing page. The marketing page may say "your data is safe" while the terms grant the company broad rights to use your content.

Question 3: Can I Delete My Data?

Even if the vendor stores your documents temporarily, you should have the ability to delete them. Ask:

  • Is there a delete function in the user interface?
  • Does deleting a document remove it from all backups, or just from active storage?
  • Can you request bulk deletion of all your data?
  • Is deletion confirmed in writing or through a verifiable process?

The right to delete your data is a baseline requirement. If the vendor cannot delete your documents on request, you have no control over your data after upload.

Question 4: Where Are the Servers Located?

The physical location of the servers that process your documents matters for several reasons:

  • Data sovereignty laws: Some jurisdictions require that certain types of data remain within specific geographic boundaries.
  • Regulatory compliance: Industry-specific regulations may restrict where data can be processed.
  • Legal jurisdiction: If a data breach occurs, the laws that apply depend on where the data was stored.

Ask where the translation servers are located and whether you can choose a specific region for processing.

Question 5: What Security Measures Protect My Documents?

Data retention risk is not just about how long documents are stored but how well they are protected while stored. Ask about:

  • Encryption in transit: Is your document encrypted while being uploaded and downloaded? (TLS/SSL is the standard.)
  • Encryption at rest: Are stored documents encrypted on the server? If so, what encryption standard is used?
  • Access controls: Who at the vendor can access your documents? Is access logged and audited?
  • Infrastructure security: Does the vendor use reputable cloud infrastructure (AWS, Azure, Google Cloud) with their security certifications, or proprietary infrastructure?

The CISA provides cybersecurity guidance for small businesses that includes evaluating third-party vendor security.

Source: https://www.cisa.gov/cyber-guidance-small-businesses

Question 6: What Happens If There Is a Data Breach?

No system is perfectly secure. Ask what happens if the vendor experiences a breach:

  • Will you be notified? How quickly?
  • What remediation does the vendor offer?
  • Is there liability protection or insurance?
  • Does the vendor have an incident response plan?

The FTC's data breach response guide for businesses outlines what responsible vendors should have in place.

Source: https://www.ftc.gov/business-guidance/resources/data-breach-response-guide-business

Question 7: Is There a Data Processing Agreement Available?

For businesses subject to GDPR, CCPA, or other data protection regulations, a Data Processing Agreement (DPA) may be legally required. Even if you are not subject to these regulations, a DPA signals that the vendor takes data handling seriously.

A DPA typically specifies:

  • What data is processed and for what purpose
  • How long data is retained
  • Security measures in place
  • Sub-processor details (if the vendor uses other services)
  • Data breach notification procedures
  • Your rights regarding your data

If the vendor offers a DPA, read it. If they do not, ask why.

Question 8: Are Translations Cached or Logged?

Even if the source document is deleted, the translated output might be stored. Ask:

  • Are translations stored separately from source documents?
  • Is there a translation history that persists after document deletion?
  • Are API logs retained that contain document content or translations?

Some services keep a translation history for convenience (so you can retrieve past translations without re-uploading). This is useful but means your content persists even after you delete the source file.

Red Flags

Watch for these warning signs when evaluating a translation service's data practices:

  • No privacy policy or terms of service published: If you cannot read the terms before signing up, do not sign up.
  • Vague language about data handling: "We take your privacy seriously" without specifics is not a data retention policy.
  • No way to contact anyone about data concerns: If there is no email, chat, or phone option for data privacy questions, the vendor is not prepared to handle them.
  • Free services with no clear business model: If the service is free and you cannot tell how the company makes money, your data may be the product.
  • Terms that grant broad content licenses: Language like "you grant us a worldwide license to use, reproduce, and distribute your content" in the terms of service is a warning sign.

Green Flags

Positive indicators that a translation service takes data handling seriously:

  • Published retention periods: The vendor clearly states how long documents are stored and provides mechanisms for deletion.
  • Opt-out of model training: Business and enterprise accounts are excluded from AI model training by default, with a clear explanation of how to verify this.
  • Transparent security practices: The vendor publishes information about encryption, infrastructure, and access controls without requiring you to ask.
  • Responsive to questions: When you contact the vendor about data practices, they provide clear, specific answers rather than vague reassurances.
  • Compliance certifications: The vendor holds relevant certifications (such as SOC 2, ISO 27001) and can provide evidence upon request.

Practical Steps Before Uploading

Before uploading sensitive documents to any translation service:

  1. Read the privacy policy and terms of service. Look for the specific data retention, model training, and deletion sections.
  2. Contact the vendor with questions. If the published information is unclear, ask directly. How they respond tells you as much as the answer itself.
  3. Test with non-sensitive documents first. Upload a test document that does not contain confidential information. Verify the workflow, check the output quality, and confirm that the delete function works as described.
  4. Remove sensitive content when possible. If the document contains data that is not needed for translation (financial figures in a contract, personal information in a policy document), consider removing or redacting it before upload.
  5. Use a business or enterprise plan when available. These plans often have stronger data protections than free or consumer plans.

A Note on Different Document Types

Not all documents carry the same data risk. Consider the sensitivity level before choosing a translation approach:

  • Public content (published articles, marketing materials): Lower risk. Most translation services are appropriate.
  • Internal content (meeting notes, project updates): Moderate risk. Choose a service with clear retention policies and deletion capability.
  • Confidential content (contracts, financial reports, strategic documents): Higher risk. Requires strong data protections, short retention, and ideally a DPA.
  • Regulated content (health information, financial filings, personal data): Highest risk. May require specialized services with compliance certifications. Consult with your legal or compliance team before using any AI translation tool for these documents.

For legal, medical, or regulatory content, AI translation can produce a draft, but appropriate subject-matter review is essential. The data handling concerns are additional to — not a substitute for — the accuracy concerns.

Building Data Retention Into Your Translation Policy

If your team translates documents regularly, create a simple policy that answers:

  • Which translation services are approved for which sensitivity levels?
  • Who is responsible for deleting documents after translation?
  • What types of documents require additional review before upload?
  • How often do we re-evaluate our translation vendor's data practices?

A written policy prevents inconsistent decisions and ensures that everyone on the team understands the data handling expectations. Review the policy annually and update it when you change vendors, when your vendor updates their terms, or when your organization's data classification requirements change. A policy that is written but never revisited quickly becomes outdated and may provide a false sense of security.

Taking Action

If you have not evaluated your current translation tool's data retention practices, do it this week. Read the privacy policy. Find the answers to the eight questions above. If any answer is unclear, contact the vendor. If any answer is unsatisfactory, consider whether the risk is acceptable for the types of documents you translate.

For teams that translate sensitive business documents, choosing a translation service with clear, transparent data practices is as important as choosing one that produces accurate translations. Quality matters, but so does controlling what happens to your content after you click "translate."

Jitan Translate provides AI-assisted document translation across PDF, DOCX, PPTX, and XLSX formats. Review the data handling practices as part of your vendor evaluation process.

Source: https://jitantranslate.com/en/blog/pdf/translate-pdf-without-losing-formatting/

How JITAN helps in this scenario

JITAN provides high-quality AI translation at a low cost, preserving document layout while accounting for context.

Try JITAN