Why GDPR and LLMs are an underestimated combination
When an employee enters a customer name, a contract detail or salary information into a language model, it is not a trivial matter under data protection law. It constitutes processing of personal data by a third party — with all the consequences that GDPR attaches to that. Yet surveys from 2025 and 2026 consistently reveal the same pattern: employees use AI tools in their daily work, IT tolerates it, and the legal team is the last to know.
The problem is not the technology — language models are legitimate and useful in many scenarios. The problem is the missing foundation: no data processing agreement (DPA), no usage policy, no documentation in the Record of Processing Activities (RoPA). For organisations operating under GDPR, and in some cases under sector-specific regulations, this is a structural compliance risk — not merely a theoretical one.
The three core obligations: legal basis, DPA and documentation
Any organisation using external language models in a business context must be able to answer three questions in a legally defensible way. These are not optional — they are at the centre of every GDPR audit by a supervisory authority.
1. What is the legal basis? In practice, three legal bases come into consideration for processing personal data via an external model: legitimate interest (Art. 6(1)(f) GDPR), consent (Art. 6(1)(a)) or contract performance (Art. 6(1)(b)). Legitimate interest is the most commonly applicable basis, but it requires a documented balancing test (a legitimate interest assessment) — particularly where customer data is involved.
2. Is a DPA in place? Where an LLM provider processes data on behalf of the organisation, a data processing agreement under Art. 28 GDPR is mandatory. OpenAI provides a DPA for enterprise customers; Microsoft concludes one under the Microsoft Customer Agreement. Organisations using the consumer version of ChatGPT without a separate contract have no DPA — and therefore no lawful processor relationship.
3. Is the processing recorded in the RoPA? The Record of Processing Activities under Art. 30 GDPR must capture every processing activity — including purpose, categories of data subjects, recipients and retention periods. LLM usage is a distinct activity that must be documented separately, including which provider acts as processor and where data is processed.
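To make the documentation duty tangible, here is a minimal sketch of what an Art. 30 entry for LLM usage might look like when captured as a structured record, expressed in Python. All field names and values are illustrative assumptions; a real entry follows whatever RoPA template the organisation already maintains.

```python
from dataclasses import dataclass

# Illustrative sketch of an Art. 30 RoPA entry for LLM usage.
# Field names and values are hypothetical; adapt them to your
# organisation's existing RoPA template.
@dataclass
class RopaEntry:
    activity: str             # name of the processing activity
    purpose: str              # why the data is processed
    legal_basis: str          # Art. 6(1) basis relied upon
    data_subjects: list[str]  # categories of data subjects
    data_categories: list[str]
    processor: str            # LLM provider acting as processor
    dpa_reference: str        # where the Art. 28 contract is filed
    processing_location: str  # data residency commitment
    retention: str            # retention period

llm_entry = RopaEntry(
    activity="Text drafting via external LLM",
    purpose="Drafting and summarising business correspondence",
    legal_basis="Art. 6(1)(f) GDPR - legitimate interest (LIA on file)",
    data_subjects=["employees", "customers"],
    data_categories=["contact data", "contract details"],
    processor="Example LLM provider (enterprise tier)",   # hypothetical
    dpa_reference="Contract register ref. DPA-2025-014",  # hypothetical
    processing_location="EU/EEA per provider commitment",
    retention="Prompts deleted after 30 days per provider terms",
)
```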
Data residency: where do your prompts actually go?
Data residency refers to the physical and legal location where data is stored and processed. With LLMs, this is not straightforward: a prompt entered by an employee in the office can be processed by a data centre in the United States, stored in a backup copy in Ireland, and retained for security audit purposes for up to 30 days — depending on the provider and product version. Third-country transfers under Arts. 44 ff. GDPR require a valid transfer mechanism: an adequacy decision (e.g. for US providers certified under the EU–US Data Privacy Framework), Standard Contractual Clauses (SCCs) or Binding Corporate Rules (BCRs).
Microsoft Copilot for Microsoft 365 processes data under the EU Data Boundary commitment — meaning data is generally processed and stored within the EU or EEA. This is a material data protection difference compared to OpenAI's consumer products, where data is processed in US data centres by default. Organisations that are unaware of this distinction are effectively deploying technically identical technology under entirely different legal frameworks.
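To illustrate where that difference shows up in practice, the sketch below uses the official openai Python client against a hypothetical Azure OpenAI resource provisioned in an EU region. The resource name, deployment name and region are placeholder assumptions, not a recommendation for any specific product tier.

```python
import os
from openai import AzureOpenAI  # official openai-python client, v1.x

# Minimal sketch: route calls through an Azure OpenAI resource created
# in an EU region. The resource and deployment names are placeholders;
# the region is fixed when the Azure resource is provisioned, not per
# request, so this endpoint only exists if the resource is EU-hosted.
client = AzureOpenAI(
    azure_endpoint="https://my-eu-resource.openai.azure.com",  # hypothetical
    api_key=os.environ["AZURE_OPENAI_API_KEY"],
    api_version="2024-02-01",
)

response = client.chat.completions.create(
    model="gpt-4o-eu-deployment",  # hypothetical deployment name
    messages=[{"role": "user", "content": "Summarise this meeting note."}],
)
print(response.choices[0].message.content)
```

Note that the calling code is essentially identical to a call against a US consumer endpoint: the residency decision lives in the provisioning and the contract, not in the code.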
The pragmatic rule is therefore: before any LLM tool is approved for enterprise use, the Data Protection Officer (or external DPO) must have confirmed the provider's data residency commitments in writing — and these commitments must be reflected in the RoPA.
Usage policy: the most important immediately actionable instrument
Even when all contractual foundations are in place, the greatest residual risk is unintentional misuse of LLMs by employees. An internal usage policy clearly defines which categories of data must never be entered into an external language model. Typical prohibited categories include: personal data of customers or employees without an explicit process decision, confidential business information (strategic plans, M&A information), health and financial data, and credentials and passwords.
The policy does not need to be long — two pages with concrete examples and a clear escalation path are more effective than a 40-page document nobody reads. What matters is that it is communicated, signed and embedded in onboarding documentation.
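A usage policy can also be backed by a lightweight technical guardrail. The following is a minimal sketch, assuming a simple regex-based pre-screen run before a prompt leaves the organisation. The patterns and category names are illustrative assumptions; no set of regular expressions replaces either the policy or a proper data loss prevention tool.

```python
import re

# Illustrative guardrail sketch: screen prompts for obviously prohibited
# content before they are sent to an external model. The patterns are
# deliberately simple examples, not a complete DLP solution.
PROHIBITED_PATTERNS = {
    "email address": re.compile(r"[\w.+-]+@[\w-]+\.[\w.]+"),
    "IBAN": re.compile(r"\b[A-Z]{2}\d{2}[A-Z0-9]{11,30}\b"),
    "credential keyword": re.compile(r"(?i)\b(password|api[_ ]?key|secret)\b"),
}

def screen_prompt(prompt: str) -> list[str]:
    """Return the names of prohibited categories detected in the prompt."""
    return [name for name, pattern in PROHIBITED_PATTERNS.items()
            if pattern.search(prompt)]

findings = screen_prompt("The password for the customer portal is hunter2")
if findings:
    print(f"Blocked: prompt appears to contain {', '.join(findings)}")
else:
    print("Prompt passed basic screening")
```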
Organisations that address these three layers — legal basis, contract framework and internal policy — in a structured way are not only GDPR-compliant. They also lay the foundation for deploying LLMs productively over the long term, without being caught off guard by supervisory authority inquiries. GDPR-compliant AI deployment is not a brake — it is the basis for lasting trust with customers, employees and partners.