Use this A/B test duration calculator to estimate the sample size needed for your experiment and how long your test may need to run before you can reach a reliable result. This can help you plan tests more effectively, avoid ending experiments too early, and build a more realistic CRO roadmap.
In A/B testing, one of the most common mistakes is launching a test without understanding whether you have enough traffic to reach significance in a reasonable timeframe. A test duration calculator helps solve that problem by giving you a clearer view of the visitor volume and timing required before you start.
What is an A/B test duration calculator?
An A/B test duration calculator is a tool that estimates how many visitors you need to detect a meaningful difference between two variations, and how long it may take to collect that sample based on your traffic levels.
It is useful because test results can be misleading when sample sizes are too small. A variation may appear to be winning early on, but without enough data, that result may simply be noise rather than a real performance difference.
How to use this calculator
To use the calculator, enter the required inputs based on your current website performance and the change you want to detect.
These usually include:
- Baseline conversion rate – your current conversion rate before the test
- Minimum detectable effect – the smallest uplift the test should be able to detect reliably
- Confidence level – the level of statistical confidence you want in the result
- Traffic volume – the number of users or sessions likely to be exposed to the experiment
- Number of variations – how many versions are being compared in the test
Once you have entered these values, the calculator estimates the sample size required and the likely duration of the experiment.
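For readers who want to see the underlying logic, here is a minimal sketch of how inputs like these can translate into a sample size and runtime. It uses the standard normal-approximation formula for comparing two proportions and assumes an even traffic split and 80% statistical power; the function name and defaults are illustrative, not this calculator's actual implementation.

```python
from math import ceil
from statistics import NormalDist

def estimate_test_duration(baseline_rate, relative_mde, daily_visitors,
                           variations=2, confidence=0.95, power=0.80):
    """Rough sample-size and runtime estimate for an A/B test.

    Uses the normal-approximation formula for comparing two proportions,
    assuming an even traffic split across variations and two-sided testing.
    """
    p1 = baseline_rate
    p2 = baseline_rate * (1 + relative_mde)  # conversion rate if the uplift is real

    z_alpha = NormalDist().inv_cdf(1 - (1 - confidence) / 2)  # e.g. 1.96 at 95%
    z_beta = NormalDist().inv_cdf(power)                      # e.g. 0.84 at 80% power

    # Visitors required in each variation to detect p1 -> p2
    n_per_variant = ((z_alpha + z_beta) ** 2
                     * (p1 * (1 - p1) + p2 * (1 - p2))
                     / (p2 - p1) ** 2)

    total_visitors = n_per_variant * variations
    days = ceil(total_visitors / daily_visitors)
    return ceil(n_per_variant), days

# Example: 4% baseline, 10% relative uplift, 5,000 daily visitors, 2 variations
print(estimate_test_duration(0.04, 0.10, 5_000))  # approximately (39473, 16)
```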
A/B Test Duration Calculator
Estimate the sample size and runtime needed to detect an expected conversion lift.
Results are calculated using a 95% confidence level by default.
What sample size means in A/B testing
Sample size is the number of visitors or observations needed before you can reasonably trust the result of your experiment. If your sample size is too small, your test may produce false positives, false negatives, or unstable results that change dramatically over time.
In practical terms, sample size helps answer the question: Do we have enough data to tell whether this change really made a difference?
The required sample size depends on several factors, including:
- your current conversion rate
- the size of the improvement you want to detect
- your chosen confidence level
- the amount of traffic included in the test
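For intuition, one widely used approximation for the visitors needed per variant is:

n ≈ (z_α + z_β)² × [p₁(1 − p₁) + p₂(1 − p₂)] / (p₂ − p₁)²

where p₁ is your baseline conversion rate, p₂ is the rate you would see if the uplift is real, and z_α and z_β are set by your confidence level and statistical power. With a 4% baseline, a 10% relative uplift (4% → 4.4%), 95% confidence, and 80% power, this works out to roughly 39,500 visitors per variant, or about 79,000 in total for a two-variant test.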
How long should an A/B test run?
There is no universal number of days that every A/B test should run. Test duration depends on traffic, conversion rate, and the size of the effect you are trying to measure.
A high-traffic website with a strong baseline conversion rate may be able to reach significance relatively quickly. A lower-traffic website or a test targeting a small uplift may need much longer to collect enough data.
As a general principle, a test should run long enough to:
- reach the required sample size
- capture normal variation across weekdays and weekends
- avoid calling results too early based on temporary swings
Stopping a test before these conditions are met can lead to poor decisions and wasted development effort.
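To make this concrete: continuing the example above, a two-variant test needing about 79,000 total visitors on a page receiving 5,000 visitors a day would take roughly 16 days, which most teams would round up to three full weeks so the data covers complete weekly cycles.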
Why test duration matters
Getting test duration right is one of the most important parts of experimentation planning. If a test is too short, you risk acting on unreliable data. If it runs unnecessarily long, you slow down your roadmap and reduce the number of experiments you can run over time.
Estimating duration before launch helps teams:
- set realistic expectations with stakeholders
- prioritise tests with meaningful upside
- avoid launching experiments that are unlikely to conclude properly
- build a more structured experimentation program
What affects A/B test duration?
Several factors influence how long an experiment needs to run.
Traffic volume
More traffic usually means faster results, because you can reach the required sample size sooner.
Baseline conversion rate
Pages with higher baseline conversion rates accumulate conversion events faster, so tests on them typically reach useful conclusions sooner than tests on low-converting pages.
Minimum detectable effect
If you want to detect a very small uplift, you usually need a much larger sample size and a longer test duration.
Confidence level
Higher confidence requirements generally require more data.
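For example, moving from 95% to 99% confidence (holding power at 80%) raises the critical z-value from about 1.96 to about 2.58, which increases the required sample size by roughly half.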
Variation split
If traffic is split across multiple variants, each variation receives fewer visitors, which can extend test duration.
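For example, with an even three-way split each variation receives only a third of your traffic, so reaching the same per-variant sample size takes roughly 1.5 times as long as a 50/50 two-way test.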
Common mistakes when estimating test duration
Stopping tests too early
One of the most common errors in experimentation is ending a test as soon as a winner appears. Early results are often unstable and can reverse as more data comes in.
Ignoring traffic realities
Some experiments look promising in theory but are unrealistic given the site’s traffic volume. Estimating duration before launch helps prevent this.
Testing for tiny uplifts on low-traffic pages
If traffic is low and the uplift you want to detect is very small, the required duration may be impractical.
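As a rule of thumb, required sample size scales with the inverse square of the effect size: halving the uplift you want to detect roughly quadruples the traffic you need.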
Forgetting seasonality and behavioural variation
User behaviour can vary significantly by day of week, promotion period, or campaign activity. Tests should run long enough to avoid being distorted by short-term anomalies.
Why sample size estimation matters in CRO
Sample size estimation is not just a statistical exercise. It directly affects how useful your CRO program will be in practice.
If you know how long tests are likely to take and which experiments are realistic, you can build a roadmap based on achievable learning rather than guesswork. This helps teams spend more time on high-value tests and less time on experiments that are unlikely to reach a clear conclusion.
Frequently asked questions
What does an A/B test duration calculator do?
It estimates the sample size required for an experiment and how long the test may need to run based on your traffic and conversion assumptions.
Why is sample size important in A/B testing?
Sample size is important because it helps determine whether your result is likely to be reliable rather than a random fluctuation.
How long should I run an A/B test?
You should run an A/B test until it reaches the required sample size and has covered enough time to reflect normal user behaviour patterns.
Can I stop a test early if one variation looks like it is winning?
Stopping early is risky because early winners often change as more data comes in. It is usually better to wait until the test has enough data to support a reliable conclusion.
What if my site does not have enough traffic?
If your site has limited traffic, you may need to test bigger changes, use stronger hypotheses, focus on higher-traffic pages, or rely more on qualitative research and lower-volume learning methods.
Plan smarter experiments
Good experimentation is not just about launching tests. It is about choosing the right opportunities, estimating feasibility, and making decisions based on reliable data. If you want help building a stronger experimentation and CRO program, Kraken Data can help.