MiN8T Replaces Stripo · BeeFree · ZeroBounce · NeverBounce · Zapier · Teams · Litmus
All Tools Home Try MiN8T free

A/B Test Sample Size Calculator

Built by the MiN8T Engineering Team · Updated 28 May 2026

Plan email A/B tests properly. Enter your baseline rate (open or click), the lift you want to detect (your MDE), and your statistical power + confidence interval levels - the calculator returns the minimum sample size per variant. The math is the standard two-sided test for two proportions. Free, no signup, instant.

Key facts

  • Required sample size for 5% MDE (minimum detectable effect) at 95% confidence: ~3,800 sends per variant.
  • Email A/B test results stabilize after 24 hours of send time in most consumer-facing campaigns.
  • The standard email industry assumption: 20% open rate baseline, 2% click-through baseline.
  • Subject-line tests require larger samples than CTA tests because open rate variance is higher. Report results with a p-value, not just a winner.
Required sample per variant
- subscribers
-
Total list size
-
Days at your cadence
-
Detectable lift
Enter values to compute.
Sample size vs detectable lift at your power + confidence
Common scenarios

How the math works

The calculator uses the standard two-proportion z-test:

n = (z_alpha + z_beta)^2 * (p1*(1-p1) + p2*(1-p2)) / (p1 - p2)^2

Where:

  • p1 = baseline rate (e.g. 0.22 for 22% open rate)
  • p2 = lifted rate (e.g. 0.242 for +10% relative lift on 22%)
  • z_alpha = critical value at confidence level (1.96 for 95% two-sided, 1.645 for 95% one-sided)
  • z_beta = critical value at power (0.84 for 80%, 1.28 for 90%)
  • n = required sample per variant

Total list size = n × number of variants. Days needed = total / daily send capacity (if you set one).

FAQ

What does the calculator compute?

Minimum sample size per variant to detect a given lift in a binary outcome (open or click) with given statistical power and confidence. Standard z-test for two proportions.

What is statistical power?

Probability of correctly detecting a real lift when one exists. Standard is 80% - if your variant is actually better, an 80%-power test will detect it 80% of the time. Higher power reduces false negatives but needs a larger sample. Below 70% is generally too unreliable.

What is confidence level?

Probability of NOT incorrectly declaring a winner when no real difference exists. 95% (5% false-positive rate) is standard. 99% (1% false-positive) is stricter and needs more sample. For email, 95% is appropriate.

One-sided or two-sided test?

One-sided tests for a lift in a specific direction. Two-sided tests for any difference. One-sided needs fewer samples but only lets you conclude in the predicted direction. Use one-sided when you have a clear hypothesis.

Why do small lifts need huge samples?

Sample size scales as 1/(lift)^2. Detecting a 1% lift takes 4x as many subscribers as a 2% lift. The signal gets buried in random variation. With 10,000 subscribers, you may not be able to reliably detect anything smaller than a 5% absolute lift on a 22% baseline.

Absolute vs relative lift?

Absolute lift is in percentage points: 22% + 2pp = 24%. Relative lift is in percent: 22% × 1.10 = 24.2%. They diverge at low baselines. Click rates are usually framed in absolute; open rates in relative.

Can I peek at results early?

Peeking inflates false-positive rate. The math assumes you wait until the planned sample size, then compute once. For the ability to stop early, use sequential frameworks like AGILE or Bayesian methods - the math is different.

What happens after I have the sample size?

Send to a random subset with at least N per variant. Wait for the open/click window (48-72h for opens, longer for clicks). Compute actual rates. Run a z-test for two proportions. If p < (1 - confidence), you have a winner. If the test is on subject lines specifically, draft both variants through the Subject Line Analyzer first so each candidate passes the basic hygiene checks (length, spam triggers, CAPS, emoji) before you commit a sample-size budget to it.

Run real A/B tests in MiN8T

MiN8T's editor includes built-in subject-line A/B testing, send-time optimization, and content variants. Plan with this calculator, execute in the editor.

Peak into MiN8T Editor →

About this data

Sample size is computed via the standard two-proportion z-test: n = (z_α + z_β)² × (p₁(1-p₁) + p₂(1-p₂)) / (p₁ - p₂)². z_α and z_β are looked up from the standard normal distribution at the chosen confidence and power. Defaults match email-marketing convention: 95% confidence, 80% power, one-sided.

Last reviewed: 28 May 2026 · Source code: open-source patterns documented at min8t.com/api

More free tools for email marketers

Subject Line Analyzer

Score your subject line on length, spam triggers, sentiment.

Spam Score Checker

21-rule SpamAssassin-style score for any HTML email body.

Inbox Preview

Side-by-side preview of your HTML email in Gmail, Apple Mail, and Outlook.

Bulletproof Button Generator

Email-safe CTA buttons with VML for Outlook desktop.

UTM Builder

Build tracked links for your campaigns with one click.

Email Header Analyzer

Trace email hops, decode SPF/DKIM/DMARC verdicts.

MJML Converter

Compile MJML markup to email-safe HTML.

Ready to start?

Stay up to date - the latest on email design and deliverability.

Let's get you building. Start your free account today.

MiN8T

The email marketing operations platform.
Replace Stripo, ZeroBounce, BeeFree, Litmus, and 4 more tools.
Inbox guaranteed across 108+ ESPs.

Product

  • API
  • Pricing
  • Free Tools

Resources

  • Blog
  • Insights & Guides
  • Documentation

Legal

  • Privacy Policy
  • Terms of Service
© 2026 MiN8T. All rights reserved. Powered by ABLA.
108+ ESP Integrations SOC 2 Compliant GDPR Ready