Skip to content

Does AI-Generated Email Hurt Your Deliverability?

 

I want to start with what I actually know, from my own work, rather than what the studies say — because the studies and the lived experience do not always land in the same place.

Across the audits I have done and the conversations I have had with other deliverability specialists, a pattern keeps showing up: programmes that have shifted heavily to AI-generated email copy are seeing engagement drop and inbox placement get worse. Not always dramatically. Not always immediately. But consistently enough, across enough accounts, that it is not a coincidence.

That is my starting point. Now let me explain what is driving it, what the research confirms, and what else is happening in the inbox — because AI is affecting deliverability in ways beyond just the copy question, and some of those ways are more significant than most people realise.

 

Are spam filters targeting AI-generated content? Not quite. 

The question I get asked most often is whether Gmail, Outlook, or Yahoo are detecting AI-generated copy and sending it to spam. The short answer is no, not in the way people imagine. Spam filters are not running your email through an AI detector. There is no switch that says "ChatGPT wrote this, spam folder."

What spam filters are doing, and have always done, is measuring engagement. Who opens, who clicks, who replies, who moves the email out of spam, who marks it as spam, how quickly people interact after receipt. Your sender reputation is built from those signals, accumulated over time, across your whole sending programme. It is a trust score, and it reflects how much your subscribers actually want to hear from you.

AI-generated email does not trigger spam filters. But what it tends to produce does.

When you use AI to write email copy without doing the work of making it specific, relevant, and distinctly yours — which is how most teams are using it, because that is the path of least resistance — you end up with copy that sounds like everyone else's AI-generated email. Generic. Competent but forgettable. Written in a register that could belong to any brand in your sector. Your subscribers, who have a finely tuned sense for what is and is not worth their time, respond accordingly. Open rates may hold (for reasons we will come to shortly). Clicks decline. Replies drop. The engagement signals that inbox providers use to make placement decisions get worse.

And worse engagement signals mean worse deliverability. Not immediately. Not from one campaign. But sustained over months of lower-engagement sends, the inbox providers learn that your emails are not the ones your subscribers are interested in, and they begin treating them accordingly.

 

The data:

Validity's 2026 Deliverability Benchmark Report states this directly: AI has made it easier for spammers to flood inboxes, and mailbox providers have responded by making their filters more sophisticated and harder for all senders to navigate. Litmus confirmed the same in their State of Email 2026 research: there is a downstream deliverability impact from AI use across the industry, partly driven by inbox providers adding their own AI features, and partly because AI has eroded inbox trust by enabling bad actors at scale.

 

→ Litmus: The Dangers of Generative AI in Email Marketing (litmus.com/blog/the-dangers-of-generative-ai-in-email-marketing)
→ Validity 2026 Deliverability Benchmark Report (validity.com)

 

The volume problem is making this worse for everyone

There is a second mechanism worth understanding alongside the engagement one. Over 51% of spam sent in 2026 is AI-generated. Phishing emails that used to take skilled human effort to craft can now be produced in seconds, in bulk, in convincing language, in any tone. Because of this, inbox providers have had to tighten their filters across the board — not just for obvious spam, but for borderline email from legitimate senders too.

You are navigating a harder inbox environment than you were two years ago, and the reason is not anything you did. It is the collateral consequence of AI lowering the floor for bad actors. Every legitimate sender is paying a small tax on bad actors' AI use, and the brands generating the lowest engagement signals are paying the highest rate.

This matters because it changes the risk calculation on AI-generated copy. It was never zero risk. It is even lower margin now.

 

What to do:

The practical test: pull the click rate and complaint rate from your last six sends of AI-generated copy and compare them to your pre-AI equivalent. Not your open rate — that number is now less reliable than ever, for reasons covered below. Click rate and complaint rate. If click rate has drifted down and complaint rate has drifted up since you started using AI heavily for copy, you have found your culprit.

 

 

The homogenisation problem — why AI copy erodes your sender association

There is a subtler version of this problem that does not show up directly in deliverability data but feeds into it over time.

When multiple brands in the same sector are all using the same AI tools, prompted in similar ways, drawing from similar training data, the copy they produce trends towards a shared register. Same sentence rhythms. Same vocabulary. Same opening gambits. Same call-to-action structures. Anyone reading email from two or three brands in the same space using AI heavily will start to notice — not necessarily consciously — that the emails feel interchangeable.

For deliverability, this matters because of word association and sender familiarity. The associations your subscribers build with your sender name, what you talk about, how you sound, what they can expect from you — are what determine whether your email gets a fair reading or a reflexive dismiss. When your copy no longer sounds distinctly like you, those associations weaken. The familiarity that earned you an engaged open last year becomes harder to maintain.

I covered this in more depth in the piece on the billboard effect and email as an awareness channel: every email you send is building or eroding an association in your subscriber's mind, whether they open it or not. AI copy that sounds like everyone else is eroding a distinctive association and replacing it with a generic one. That has commercial consequences well beyond any individual campaign.

Email health check

Free Email Marketing Health Check

Audit your entire email ecosystem in under 30 minutes. Get a personalised health score, traffic-light priorities, and clear actions you can start on today.


Gemini in Gmail, the bigger deliverability story that most people are not talking about yet

The AI-generated copy question is important. But there is a second way AI is changing deliverability that is, in my view, more significant and less understood: what is happening inside the inbox at the provider level.

In January 2026, Google announced that Gmail is "entering the Gemini era." Powered by Gemini 3, Google's most advanced AI model, Gmail rolled out a set of features that fundamentally change how email is processed, sorted, and surfaced before a subscriber ever sees it.

This is not a spam filter update. It is a structural change to how the inbox works.

 

What Gemini in Gmail actually does

The three features that matter for deliverability

AI Overviews

- Gmail now synthesises email threads into concise summaries. When a subscriber opens an email thread, Gemini generates a summary of the key points. They can also ask their inbox natural language questions — "what offers have I received from this brand?" or "did my order ship?" — and Gemini surfaces answers drawn from their email history without them opening any individual email. Google is explicit about the intent: AI Overviews exist so users "don't have to open and read through individual messages."

AI Inbox

- currently in trusted tester phase, broader rollout expected throughout 2026. This is a new inbox view that gives subscribers a personalised briefing: to-dos extracted from emails, topics to catch up on, urgent items surfaced, and everything else — which for most subscribers means marketing email — deprioritised as background noise. Gmail prioritises by relationship signals (people the subscriber emails frequently, contacts in their address book, inferred relationships from message content) and by actionability (bills, appointments, time-sensitive items).

Relevance-based Promotions tab

- already live for most users. Gmail moved the Promotions tab from chronological sorting to relevance-based sorting. An email that arrived an hour ago from a brand with strong engagement history can now sit above an email that arrived yesterday from a brand with weaker signals. Your position in the Promotions tab is no longer determined by when you sent. It is determined by how much your subscribers have historically wanted what you send.

Taken together, these three changes mean that "delivered to inbox" is no longer a binary outcome. There is now a gradient of visibility within the Gmail inbox, and where your email sits on that gradient is determined by the engagement quality of your entire programme — not any individual send.

 

The data:

According to Folderly's deliverability platform, which monitors millions of emails daily, up to 40% of emails that technically reach Gmail inboxes are being deprioritised by AI filtering. Delivered is not the same as seen. It never fully was, but the gap is now larger and more systematic than it has ever been.

 

→ Folderly: How Gmail's Gemini AI Changes Email Deliverability in 2026 (folderly.com/blog/gmail-gemini-ai-email-deliverability-2026)
→ Google: Gmail is entering the Gemini era (blog.google/products-and-platforms/products/gmail/gmail-is-entering-the-gemini-era/)
→ Emailexpert: Gmail Launches AI Inbox (emailexpert.com/gmail-launches-ai-inbox/)


What this means for how you think about engagement signals

Gemini is not assessing your email against a spam rulebook. It is making relevance predictions based on the history between your sender domain and each individual subscriber. High engagement history — genuine opens, clicks, replies, email moves — tells Gemini that this subscriber finds what you send worth their attention. That subscriber gets your email surfaced. Low engagement history tells Gemini the opposite.

This is why AI-generated copy that produces lower engagement is a deliverability problem at the Gmail level, not just a commercial performance problem. The engagement shortfall feeds directly into Gemini's relevance model, and over time, that model decides how visible your emails are. The mechanism runs from copy quality → engagement → Gemini relevance score → inbox visibility.

The expert community is putting it starkly. Manu Cinca of Stacked Marketer: "With AI Inbox, your email will be one of many that Gemini pulls into a summary, so it's about convincing Gemini to show your email and content to the user. Because if Gemini becomes the gatekeeper, you might no longer have a direct connection with a subscriber."

Marc Thomas of Positive Human frames it differently but consistently: "I think Gmail's AI Inbox will benefit good email marketing and continue to punish bad email marketing. It surfaces good information, which is what you should have been sending the whole time, and relegates purely promotional information to more of a potluck."

He is right. And the programmes that have been investing in relevance, in list quality, in intent-based sending rather than volume-based sending — those programmes will be rewarded by Gemini. The programmes using AI to send more generic email faster will be penalised by it.

→ Harro: Gmail's AI Inbox may redefine deliverability (harro.com/2026/05/08/gmails-ai-inbox-may-redefine-deliverability/)

 

The open rate inflation problem — why your data just got even less reliable

There is a specific measurement consequence of the Gemini rollout that needs naming directly, because it is already causing marketers to misread their programme performance.

Gmail auto-opens emails to generate AI summaries. Every time Gemini reads your email to create an overview or extract a task, that registers as an open in your ESP. Industry open rates have risen to 45.6% on average — a number that looks healthy but is significantly inflated by machine opens, not human ones. Meanwhile, click-through rates have dropped from 4.35% to 3.93%.

If you are using open rate as a primary performance metric — or worse, using opens to trigger re-engagement flows, segment your active list, or inform suppression decisions — you are now working from data that is even further from reality than it was after Apple Mail Privacy Protection inflated it from the Apple side. The two inflation sources compound each other.

This is not a new argument from me. I have been saying for years that open rate is the wrong metric to lead with. Gemini has made the case even more obvious. Click rate, reply rate, complaint rate, meaningful actions — these are the numbers that reflect what subscribers are actually doing.

→ Stop Reporting on Opens and Clicks. Here's What to Measure Instead (weareastral.co.uk/thevault/stop-reporting-on-opens-and-clicks.-heres-what-to-measure-instead)
→ Zepic: Gmail Update 2026 — What It Means for Email Marketing (zepic.com/article/gmail-update-2026-gemini-integration-what-it-means-for-email-marketing-in-2026)

 

UK/EEA note:

Gmail's Smart Features — including the Gemini AI features — are off by default in the UK, EEA, Japan, and Switzerland. Users in these regions need to actively opt in to enable them. This means the Gemini inbox effects described above are less immediate for UK-based subscriber lists than for US-based ones. That said, rollout is expanding and the direction of travel is clear. Build your programme for the world Gemini is creating, not the one that exists today.

 

 

Outlook and Copilot — the inbox environment that was already harder and is getting more so

Gmail gets most of the conversation because of its scale — over 3 billion users, accounting for roughly 25-32% of all email opens globally. But if you send to a B2B audience, Microsoft Outlook is probably your most important inbox environment, and the Outlook story is in some ways more concerning than the Gmail one.

Outlook has historically been the toughest major inbox provider on deliverability. In 2025 and into 2026, Outlook's inbox placement rate sits at around 77.4% — compared to Gmail at 89.8% and Yahoo at 87.3%. Spam rates at Outlook exceed 14%, the highest of any major provider. One in four marketing emails sent to an Outlook address is not reaching the inbox. That was already the case before AI got involved.

Microsoft has been adding AI-driven inbox prioritisation for years — the Focused Inbox, introduced with Acompli in 2014, was an early version of this. In April 2025 they added "Prioritise My Inbox," where Copilot classifies emails as High, Normal, or Low priority based on individual relationship and behaviour signals. In April 2026, they rolled out Copilot Agent Mode for Outlook in their Frontier program — agentic inbox triage, automatic rule-setting, draft follow-ups, and more — currently for enterprise users but with broader rollout expected.

The deliverability implication of Copilot prioritisation is the same as Gemini's but expressed differently. Copilot sorts by relationship signal and actionability. Emails from known, frequently-contacted people rise. Emails that look like tasks or time-sensitive items rise. Marketing emails from brands, even well-run ones, are structurally disadvantaged in a model built to serve the recipient's working life rather than the sender's commercial goals.

There is no fix for this in the conventional deliverability sense. Authentication, list hygiene, and complaint rate management all remain necessary — but they are not sufficient. The sufficient condition is that your subscribers actually want what you send, often enough, that the engagement history Copilot draws on puts you in a favourable position. Which brings the Outlook conversation back to the same place as the Gmail one: relevance is now a deliverability factor, not just a performance one.

 

The data:

Provider inbox placement rates in 2025: Gmail 89.8%. Yahoo 87.3%. Apple Mail 82.0%. Microsoft Outlook 77.4%. The gap between Gmail and Outlook is 12.4 percentage points — more than one in eight emails sent to an Outlook address that lands safely at Gmail will not reach the Outlook inbox. For B2B email programmes sending predominantly to Microsoft addresses, the deliverability environment is materially harder than the industry average suggests. (Source: Validity 2026 Deliverability Benchmark Report)

 

→ Microsoft Copilot in Outlook — April 2026 Agent Mode rollout (testingcatalog.com/microsoft-copilot-in-outlook-adds-ai-to-manage-inbox-and-calendar/)
→ Practical 365: Copilot in Outlook brings AI to email filtering (practical365.com/prioritize-my-inbox/)

 

One more thing: image-heavy emails are now a bigger problem than they were

This is a point that came out of the expert conversations around the Gemini launch and it is worth including separately because it affects a specific type of email programme.

Gemini summarises email content. To summarise content, it needs to be able to read it. An email that is primarily one large image, or a collection of images with minimal text, cannot be read, summarised, or assessed for relevance by Gemini. An email that Gemini cannot process is an email it cannot promote.

Email designer and deliverability expert Jen Blair put it plainly: "Gemini can summarise the text, but it cannot summarise the design. The age of the one-big-image email is officially dead. If your email is not readable by Gemini, your deliverability may suffer if Gmail decides to take a dim view of emails it cannot scan and summarise."

This has always been a deliverability risk, image-only emails trip filters because filters cannot assess the content, and they have always relied on alt text for accessibility. But Gemini adds a second layer: even if the image-heavy email passes spam filtering, it cannot earn a favourable relevance ranking because there is nothing for the AI to rank. Proper alt text, readable text content, and a clear structure are now both accessibility practice and deliverability practice.

 

What to do:

If your email template is more than 60% image with minimal live text, it is worth reviewing. Not to remove the design — email design matters and Gemini cannot replace it — but to make sure there is enough readable text structure for the AI layer to parse what your email is about and who it is relevant to. Think of it as writing for two audiences: the subscriber and the AI that decides whether to surface it.

 

→ Bloomreach: Gmail Changed the Game — What Google's AI-Powered Inbox Means for Email Marketers (bloomreach.com/en/blog/gemini-gmail)

 

Pulling it together — five ways AI is affecting deliverability right now

These are not hypothetical future risks. They are what is happening now, based on what I see in audits and what the data confirms.

1. AI copy → lower engagement → worse deliverability

The mechanism is engagement-based, not spam-detection-based

Spam filters are not detecting AI-generated content. What they are detecting is the consequence of it — generic, low-relevance email that subscribers do not click. Lower click signals feed into sender reputation models at every inbox provider, and sustained low engagement produces sustained inbox placement deterioration. The fix is not to stop using AI. It is to stop using AI as a shortcut that bypasses the thinking that makes email relevant.

2. AI spam volume → stricter filters for everyone

Bad actors have made the environment harder for legitimate senders

Over half of spam sent globally is now AI-generated. Inbox providers have responded by tightening their filters across the board, which means legitimate senders with marginal engagement signals are getting caught in nets that were built in response to AI-enabled spammers. The standard for "good enough to reach the inbox" has risen, and it will continue to rise.

3. Gemini in Gmail → semantic layer on top of spam filtering

Content quality and relevance are now inbox placement signals

Gmail has added a relevance assessment layer that operates above traditional spam filtering. Even emails that clear spam filters can be deprioritised if Gemini's model rates them as low relevance for that specific subscriber. The ranking signals are engagement depth, actionability, relationship strength, and content clarity. Promotional email that lacks specificity, personalisation, and genuine value is structurally disadvantaged in this model — regardless of authentication and list hygiene.

4. Gmail auto-opens → open rate data is even less reliable than it was

If you are using opens to make programme decisions, stop

Gemini opens emails to generate summaries. This registers as an open in your ESP. Industry open rates have risen to 45.6% on average — an inflation driven by machine opens, not human ones. Any programme decision based on open rate is now based on data that is further from reality than it has ever been. Click rate, reply rate, and complaint rate are the metrics worth tracking.

5. Outlook Copilot → email triage by relationship signal

B2B senders are operating in the hardest inbox environment of any major provider

Microsoft's AI-driven inbox prioritisation — Focused Inbox, Prioritise My Inbox, and now Copilot Agent Mode — all sort by relationship signal and actionability. Marketing email is structurally disadvantaged in a model built to serve the recipient's working life. Combined with Outlook's already-below-average inbox placement rate of 77.4%, B2B email programmes sending primarily to Microsoft addresses are navigating a materially harder environment than the industry averages suggest.

What all of this actually points to

The common thread running through every way AI is affecting deliverability is the same one that has always been true of email: the programmes that send relevant, specific, valuable content to people who actually want it will perform well. The programmes that use whatever is available — AI, volume, clever tactics — to shortcut that fundamental equation will perform worse.

What has changed is the speed and precision with which inbox providers can identify the difference. Gemini does not just sort email into spam or not-spam. It makes individual-level relevance predictions for each subscriber, based on their full engagement history with each sender, and uses those predictions to determine visibility. The more generic your email, the more it looks like everything else, the weaker those signals are, and the less visible you become over time.

AI-generated copy is not the enemy. AI used to produce more generic email faster than you could produce it manually — that is the enemy. The tool does not cause the problem. The way most teams are using it does.

The answer is not to stop using AI in email. It is to make sure AI is making your email more relevant and more distinctly yours, not less. And to build the kind of programme — clean data, engaged list, intent-based segmentation, honest measurement — that earns good inbox placement regardless of what the inbox providers decide to do next.


 

Like this blog? You'll love RE:markable

 RE:markable is the weekly email about emails. Dropping the latest email marketing news, updates, insights, free resources, upcoming masterclasses, webinars, and of course, a little inbox mischief. 

Email, CRM and HubSpot Support

I help marketers and businesses globally improve, design and fix their email, CRM, and HubSpot ecosystems, from strategy through to execution.

My services include:

  • Email marketing strategy, audits, training, workshops, and consultancy

  • CRM strategy and enablement

  • Full HubSpot implementations, optimisation and onboarding through my agency

If you’re looking for experienced external support (and lots of enjoyment along the way), this is where to start.