CustomFit.ai โ€” Website personalization, A/B testing and CRO for Shopify and D2C
Product
Features
โœฑ
Website Personalization
Adapt to each visitor's behavior & intent
โง–
A/B & Multivariate Testing
Rigorous experimentation
โœจ
AI CopilotNEW
Personalize with a prompt
๐Ÿค–
AI WingmanNEW
Auto-optimize toward winners
๐ŸŽฏ
AI Conversion OptimizerNEW
GPT-grade test ideas
โœŽ
No-Code Visual Editor
Drag-and-drop edit any element
โ–ฆ
Product Recommendations
Personalized recs that lift AOV
โš‘
Feature Flags
Ship safely with kill-switches
โ—ง
Chrome Extension
Edit your store in the browser
โง‰
Shopify, WooCommerce & more
All platform integrations
View all features โ†’
Use Cases
$
Price A/B Testing
Test price points to maximize revenue
โ–ฆ
Theme A/B Testing
Compare whole layouts & designs
๐Ÿ—‚
Template A/B Testing
Test whole PDP/PLP templates
๐Ÿท
Discount A/B Testing
Find the offer that converts
๐Ÿšš
Shipping A/B Testing
Thresholds, speed & copy
โœ
Content A/B Testing
Copy, images & reviews
๐Ÿ’ณ
Checkout Gateway A/B
Payments & one-click
โŒ–
Geo-Based Personalization
Per-location content & offers
โšก
Buyer-Intent Nudges
Exit-intent & retargeting
โ†”
Split-URL / Redirection
Full-page redirect tests
View all use cases โ†’
Solutions & Guides
โคข
Conversion Rate Optimization
The complete CRO guide
โง–
A/B Testing Software
Buyer's guide for D2C
๐Ÿ›’
Cart Abandonment Recovery
Win back lost carts
๐Ÿ“ฐ
Landing Page Optimization
Convert more paid traffic
S
Shopify A/B Testing
Test your store, no code
S
Shopify Personalization
Tailor the store per shopper
โ—”
First-Time Visitor Offers
Convert new shoppers with trust & offers
โ˜…
Repeat-Customer Experiences
Reward and re-engage loyal buyers
โ—Ž
Campaign-Matched Pages
Match the landing page to the ad
โŒ–
Location-Based Experiences
Currency, language & regional offers
Explore CRO โ†’
Customer stories
GIVA
+32%
conversion via personalized recs
GIVA
Mamaearth
+18%
revenue lift from PDP A/B tests
ME
The Sleep Company
+24%
AOV from product recommendations
TSC
Read customer stories โ†’
Integrations
SWsfGA+15
โœฆ
Not sure where to start?
Let AI Copilot pick your first tests

โ€œWe wake up to evidence-backed tests ready to deploy โ€” not a backlog of maybe ideas.โ€

AN
Anirudh S.
Growth ยท Chargebee
โ˜…โ˜…โ˜…โ˜…โ˜…4.8on G2 ยท 2,400+ brands
Talk to our team โ†’
Widgets
Integrations
Ecommerce & Checkout
Shopify
Shopline
Shoplazza
GoKwik
ShopFlo
Razorpay Magic Checkout
Breeze
Shiprocket
View all integrations โ†’
Analytics & Behavior
Google Analytics 4
Microsoft Clarity
Hotjar
Mixpanel
Amplitude
Heap
Adobe Analytics
Segment (CDP)
View all integrations โ†’
Engagement, CRM & More
Klaviyo
MoEngage
CleverTap
WebEngage
HubSpot
Salesforce
Slack
Meta Ads
View all integrations โ†’
CustomersPricing
Resources
CRO
โ–ค
Playbooks
Proven strategies to boost conversions
๐ŸŽ™
Interviews
D2C leaders & marketing experts
โ–ถ
Webinars
Live deep dives & product sessions
Learn
โœŽ
Blog
Tips, experiments & best practices
๐Ÿ“•
Free E-Books
Mastering personalization
๐Ÿ“–
Conversion Glossary
Every CRO term, defined
โœฆAI CopilotNEWLog inBook a demo
Start free trial
Select your platform โ€” Install in 2 minsWe'll tailor the setup
โšก Risk-free 14-day trial ยท No credit card ยท Cancel anytime
S
Shopify
Install from Shopify App Store
โ€บ
W
WooCommerce
Install the WooCommerce plugin
โ€บ
B
BigCommerce
Install from BigCommerce App Marketplace
โ€บ
SL
Shopline
Install from Shopline App Store
โ€บ
M
Salesforce / Magento
Install from the marketplace
โ€บ
SZ
Shoplazza
Install from Shoplazza App Store
โ€บ
WP
WordPress / Webflow
Install plugin or paste the script
โ€บ
โ—ง
Others
Custom-built on React, Next.js, etc.
โ€บ
Tip: pick your platform โ€” we handle the restBook a demo โ†’
Product
Website PersonalizationA/B & Multivariate TestingAI CopilotAI WingmanAI Conversion OptimizerNo-Code Visual EditorProduct RecommendationsFeature FlagsView all features โ†’
Use Cases
Price A/B TestingTheme A/B TestingTemplate A/B TestingDiscount A/B TestingShipping A/B TestingContent A/B TestingCheckout Gateway A/BGeo-Based PersonalizationBuyer-Intent NudgesSplit-URL / Redirection
Solutions & Guides
Conversion Rate OptimizationA/B Testing SoftwareCart Abandonment RecoveryLanding Page OptimizationShopify A/B TestingShopify Personalization
Explore
WidgetsIntegrationsCustomersPricing
Resources
BlogPlaybooksWebinarsInterviewsE-BooksConversion Glossary
Platforms
ShopifyShoplineShoplazzaChrome ExtensionAll integrations
Start free trialBook a demo
Homeโ€บBlogโ€บab testingโ€บHow Long Should You Run an A/B Test?

How Long Should You Run an A/B Test?

SKSharan KumarCo-Founder & CTO, CustomFit.aiJanuary 15, 20257 min read
On this page
  1. Why Test Duration Matters More Than Statistical Significance
  2. The Right Formula: How to Calculate Test Duration
  3. Minimum Duration Rules: Always Apply These
  4. Common Mistakes Indian D2C Brands Make with Test Duration
  5. How CustomFit.ai Handles Test Duration
  6. Tips / Best Practices
  7. Key Takeaways
0%
How Long Should You Run an A/B Test?

From the conversion glossary

Concepts referenced in this article, defined.

Definition
What Is Sample Size? Definition & Guide
Definition
What Is Baseline? Definition, Formula & Guide
Definition
What Is Significance? Definition, Formula & Guide
Definition
What Is Variant? Definition, Formula & Guide
Definition
What Is Lift? Definition, Formula & Guide
โ† Back to Ab Testing guide
Try CustomFit.ai

Run A/B tests and personalize your store without code. 14-day free trial, no credit card.

Start free trial โ†’
Share
XLinkedInEmail

Related articles

ab testing

Statistical Significance in A/B Testing: A Plain-English Guide

Statistical significance in A/B testing means there's less than a 5% chance your result is random. Here's what p-values, confidence levels, and sample size mean for your tests.

Sapna Joharยท 12 min read
ab testing

How A/B Testing Works: Step-by-Step Explained

A/B testing works by splitting traffic between two versions of a page, measuring which performs better on a conversion metric, and declaring a winner at statistical significance.

Sapna Joharยท 10 min read
ab testing

A/B Testing vs Split Testing: What's the Difference?

A/B testing and split testing are the same thing โ€” two names for the same experiment. Here's why the terms are used interchangeably and what actually matters.

Sapna Joharยท 7 min read

Start lifting conversions today.

Run rigorous A/B tests and personalize every visit on Shopify or any storefront โ€” no engineers required.

Start free trialBook a demo

Built for every D2C category

๐Ÿงด
Skincare
๐Ÿ’„
Beauty
๐ŸŒฟ
Wellness
โ˜•
F&B
๐Ÿ‘Ÿ
Apparel
๐Ÿ’
Jewelry
๐Ÿ›‹๏ธ
Home
๐Ÿผ
Baby
Live ยท Right now
Mamaearth โ€” free-shipping band +12.4% AOVGIVA โ€” festive collection page +34% revenueBellavita โ€” PDP CTA test +27.4% CVRKapiva โ€” Quiz-driven recs +9.48% CTRThe Sleep Co โ€” landing personalized 2ร— capturesPlum โ€” Returning shopper swap +18.2% CVRMamaearth โ€” free-shipping band +12.4% AOVGIVA โ€” festive collection page +34% revenueBellavita โ€” PDP CTA test +27.4% CVRKapiva โ€” Quiz-driven recs +9.48% CTRThe Sleep Co โ€” landing personalized 2ร— capturesPlum โ€” Returning shopper swap +18.2% CVR
Get in touch

Tell us about your store.

We reply within an hour during business hours. No sales pitch, no spam โ€” just answers from someone who's seen 2,400+ D2C stores.

โœ“ Reply within 1 hourโœ“ No spam, everโœ“ Free demo & setup help
โœ“ Thanks! We'll be in touch shortly.
CustomFit.ai

The all-in-one website personalization, A/B testing & CRO platform for high-growth D2C brands. Made by marketers, fueled by coffee.

in๐•โ—Žโ–ถf
Product
  • Features
  • A/B Testing
  • Personalization
  • AI Copilot
  • AI Wingman
  • AI Conversion Optimizer
  • Feature Flags
  • Widgets
  • Integrations
  • ROI Calculator
Platforms
  • Shopify
  • Shopline
  • Shoplazza
  • Salesforce
  • Chrome Extension
  • All Integrations
Resources
  • Blog
  • Playbooks
  • Webinars
  • GrowthFit Interviews
  • Free E-Books
  • Conversion Glossary
  • Case Studies
Compare
  • vs VWO
  • vs Optimizely
  • vs Google Optimize
  • vs Mutiny
  • vs Intelligems
  • vs Shoplift
  • vs AB Tasty
  • vs Convert
  • vs Kameleoon
Company
  • About Us
  • Partners
  • CustomFit Awards
  • Recognition
  • Contact
  • Privacy Policy
  • Terms & Conditions
ยฉ 2026 CustomFit.ai ยท Valley Monks Pvt Ltd ยท Made by marketers, fueled by coffee, and obsessed with conversions.
SOC 2 Type II ยท GDPR ยท CCPA ยท ISO 27001

Run your A/B test long enough to collect a statistically valid sample โ€” typically a minimum of 7 days and until you reach your pre-calculated sample size. Stopping early because one variant looks like it's winning is the single most common mistake in A/B testing. The answer depends on your traffic volume, baseline conversion rate, and the minimum detectable effect you care about.

Most D2C brands in India make this mistake constantly: they see Variant B performing 20% better after two days and declare victory โ€” only to watch conversions revert after rolling out the change. This guide gives you the exact framework to know when your test is truly done.

Why Test Duration Matters More Than Statistical Significance

Statistical significance is a probability, not a certainty. At 95% confidence, you're accepting a 1-in-20 chance that your result is random noise. The earlier you stop, the worse this gets.

The peeking problem: Every time you check results and consider stopping, you're running an implicit hypothesis test. If you check daily for two weeks, you've run 14 implicit tests โ€” not one. This dramatically inflates your false-positive rate.

Day-of-week effects: Consumer behavior on weekdays differs from weekends. Indian shoppers buying beauty products on Nykaa or Plum's website behave differently on Saturday evenings versus Tuesday mornings. A test running only 3 days may capture only weekday behavior.

Novelty effects: When you change something on your site, some visitors click it simply because it's new. This inflates early results. Running a test for at least one full business cycle smooths this out.

Seasonality and campaigns: A test launched during a sale or festive season (Diwali, Holi, Raksha Bandhan) captures atypical behavior. Either avoid launching tests during major promotions or explicitly account for this in your analysis.

The Right Formula: How to Calculate Test Duration

Peeking

Use this process before launching any test:

Step 1: Establish your baseline conversion rate Pull your current conversion rate for the specific goal you're testing. If you're testing a product page add-to-cart button, your baseline might be 4.2%. Use at least 30 days of historical data.

Step 2: Define your Minimum Detectable Effect (MDE) This is the smallest improvement worth detecting. If you need at least a 15% relative lift (e.g., 4.2% โ†’ 4.83%) to justify the effort, set that as your MDE. Smaller MDEs require larger samples.

Step 3: Set your statistical parameters

  • Confidence level: 95% (standard)
  • Statistical power: 80% (standard) โ€” this means an 80% chance of detecting a real effect

Step 4: Calculate required sample size For a baseline CVR of 4%, MDE of 15%, 95% confidence, 80% power:

  • Required visitors per variant: ~5,000
  • Total visitors needed: ~10,000

Step 5: Divide by daily traffic If your page gets 500 visitors/day, you need 20 days minimum. Round up to the nearest full week โ€” so 3 weeks.

Quick reference table:

Daily VisitorsBaseline CVRMDETest Duration
2003%20%6โ€“8 weeks
5004%15%3โ€“4 weeks
1,0005%10%3 weeks
2,0005%10%1โ€“2 weeks
5,000+5%10%7โ€“10 days

Minimum Duration Rules: Always Apply These

Regardless of what your sample size calculator says, follow these non-negotiable minimums:

1. Always run for at least 7 full days This captures at least one complete weekly cycle. A Kapiva ayurvedic supplement brand, for example, sees very different conversion patterns on weekdays (research intent) versus weekends (purchase intent).

2. Run through at least one full business cycle If your brand runs weekly email campaigns, run the test long enough to include two send cycles. If you do UPI cashback promotions every fortnight, include both cycles.

3. Don't run longer than 4โ€“6 weeks Extended tests get contaminated by seasonality shifts, competitor actions, and user learning effects. If you can't reach significance in 6 weeks, either increase traffic to the test (via paid promotion) or increase your MDE.

4. Pre-commit to your duration before launch Write it down. "This test runs from March 1โ€“21 and will be evaluated on March 22, regardless of interim results." This prevents peeking-induced bias.

Common Mistakes Indian D2C Brands Make with Test Duration

Sample size

Stopping during a sale spike: Mamaearth or mCaffeine brands often run tests during their sale events and declare winners based on inflated sale-period CVRs. The winner often fails after the sale ends because it was optimized for a different customer cohort.

Ignoring COD vs prepaid split: Indian ecommerce has a unique COD (cash on delivery) behavior. COD customers have different purchase patterns and return rates. If your test shifts the COD/prepaid ratio, your CVR lift may be artificial.

Testing on too-narrow segments: Running a test only on mobile visitors but applying results to all devices is a common error. Always segment your results by device and validate before full rollout.

Confusing sessions with visitors: Some analytics tools report sessions, not unique visitors. A single visitor might have 3 sessions. Use unique visitors for sample size calculations.

How CustomFit.ai Handles Test Duration

CustomFit.ai runs on your Shopify store and includes a built-in sample size calculator that tells you exactly how long to run each test before you launch. The platform:

  • Flags if you're about to stop a test prematurely
  • Shows confidence intervals, not just a "winning" label
  • Automatically pauses tests when reaching the pre-set sample size
  • Sends alerts when tests hit 95% confidence

This is especially useful for brands like Bellavita, which achieved an 11% CVR improvement โ€” those results came from tests that ran to statistical completion, not from early winners being called.

Tips / Best Practices

  1. Use a sample size calculator every single time โ€” don't guess. Tools like Evan Miller's calculator or CustomFit.ai's built-in tool take 2 minutes.

  2. Write your test plan before launch โ€” include start date, end date, sample size target, and what "winning" means in absolute numbers, not just percentages.

  3. Never peek at results and adjust the duration โ€” if you extend a test because the current variant is losing, you've invalidated the test.

  4. Run one full festive cycle minimum for seasonal businesses โ€” for brands selling during Diwali, Holi, or Valentine's Day, test during the season and validate outside it too.

  5. Split traffic 50/50 unless you have strong reasons not to โ€” unequal splits require larger total sample sizes and increase test duration.

  6. Document novelty effects โ€” for major redesigns, watch your data for a novelty spike in the first 3โ€“5 days and weight later data more heavily.

  7. Validate on a holdout group โ€” after declaring a winner, roll out to 80% of traffic and keep 20% on control for 1 week to confirm the lift holds.

Key Takeaways

  • Never stop an A/B test just because one variant looks like it's winning โ€” always complete your pre-determined sample size
  • Run a minimum of 7 days to capture weekly traffic variation, even on high-traffic sites
  • Calculate required sample size before launching using your baseline CVR, MDE, confidence level, and power
  • Indian D2C brands must account for COD behavior, festive season effects, and weekly promotional cycles when setting test duration
  • Cap tests at 4โ€“6 weeks maximum to avoid seasonal contamination
  • Pre-commit to your test end date before launch to eliminate peeking bias

Related reading: A/B Testing Statistical Significance | Sample Size Calculator | A/B Testing Metrics | What Is Sample Ratio Mismatch | A/B Testing Pillar Guide