CustomFit.ai โ€” Website personalization, A/B testing and CRO for Shopify and D2C
Product
Features
โœฑ
Website Personalization
Adapt to each visitor's behavior & intent
โง–
A/B & Multivariate Testing
Rigorous experimentation
โœจ
AI CopilotNEW
Personalize with a prompt
๐Ÿค–
AI WingmanNEW
Auto-optimize toward winners
๐ŸŽฏ
AI Conversion OptimizerNEW
GPT-grade test ideas
โœŽ
No-Code Visual Editor
Drag-and-drop edit any element
โ–ฆ
Product Recommendations
Personalized recs that lift AOV
โš‘
Feature Flags
Ship safely with kill-switches
โ—ง
Chrome Extension
Edit your store in the browser
โง‰
Shopify, WooCommerce & more
All platform integrations
View all features โ†’
Use Cases
$
Price A/B Testing
Test price points to maximize revenue
โ–ฆ
Theme A/B Testing
Compare whole layouts & designs
๐Ÿ—‚
Template A/B Testing
Test whole PDP/PLP templates
๐Ÿท
Discount A/B Testing
Find the offer that converts
๐Ÿšš
Shipping A/B Testing
Thresholds, speed & copy
โœ
Content A/B Testing
Copy, images & reviews
๐Ÿ’ณ
Checkout Gateway A/B
Payments & one-click
โŒ–
Geo-Based Personalization
Per-location content & offers
โšก
Buyer-Intent Nudges
Exit-intent & retargeting
โ†”
Split-URL / Redirection
Full-page redirect tests
View all use cases โ†’
Solutions & Guides
โคข
Conversion Rate Optimization
The complete CRO guide
โง–
A/B Testing Software
Buyer's guide for D2C
๐Ÿ›’
Cart Abandonment Recovery
Win back lost carts
๐Ÿ“ฐ
Landing Page Optimization
Convert more paid traffic
S
Shopify A/B Testing
Test your store, no code
S
Shopify Personalization
Tailor the store per shopper
โ—”
First-Time Visitor Offers
Convert new shoppers with trust & offers
โ˜…
Repeat-Customer Experiences
Reward and re-engage loyal buyers
โ—Ž
Campaign-Matched Pages
Match the landing page to the ad
โŒ–
Location-Based Experiences
Currency, language & regional offers
Explore CRO โ†’
Customer stories
GIVA
+32%
conversion via personalized recs
GIVA
Mamaearth
+18%
revenue lift from PDP A/B tests
ME
The Sleep Company
+24%
AOV from product recommendations
TSC
Read customer stories โ†’
Integrations
SWsfGA+15
โœฆ
Not sure where to start?
Let AI Copilot pick your first tests

โ€œWe wake up to evidence-backed tests ready to deploy โ€” not a backlog of maybe ideas.โ€

AN
Anirudh S.
Growth ยท Chargebee
โ˜…โ˜…โ˜…โ˜…โ˜…4.8on G2 ยท 2,400+ brands
Talk to our team โ†’
Widgets
Integrations
Ecommerce & Checkout
Shopify
Shopline
Shoplazza
GoKwik
ShopFlo
Razorpay Magic Checkout
Breeze
Shiprocket
View all integrations โ†’
Analytics & Behavior
Google Analytics 4
Microsoft Clarity
Hotjar
Mixpanel
Amplitude
Heap
Adobe Analytics
Segment (CDP)
View all integrations โ†’
Engagement, CRM & More
Klaviyo
MoEngage
CleverTap
WebEngage
HubSpot
Salesforce
Slack
Meta Ads
View all integrations โ†’
CustomersPricing
Resources
CRO
โ–ค
Playbooks
Proven strategies to boost conversions
๐ŸŽ™
Interviews
D2C leaders & marketing experts
โ–ถ
Webinars
Live deep dives & product sessions
Learn
โœŽ
Blog
Tips, experiments & best practices
๐Ÿ“•
Free E-Books
Mastering personalization
๐Ÿ“–
Conversion Glossary
Every CRO term, defined
โœฆAI CopilotNEWLog inBook a demo
Start free trial
Select your platform โ€” Install in 2 minsWe'll tailor the setup
โšก Risk-free 14-day trial ยท No credit card ยท Cancel anytime
S
Shopify
Install from Shopify App Store
โ€บ
W
WooCommerce
Install the WooCommerce plugin
โ€บ
B
BigCommerce
Install from BigCommerce App Marketplace
โ€บ
SL
Shopline
Install from Shopline App Store
โ€บ
M
Salesforce / Magento
Install from the marketplace
โ€บ
SZ
Shoplazza
Install from Shoplazza App Store
โ€บ
WP
WordPress / Webflow
Install plugin or paste the script
โ€บ
โ—ง
Others
Custom-built on React, Next.js, etc.
โ€บ
Tip: pick your platform โ€” we handle the restBook a demo โ†’
Product
Website PersonalizationA/B & Multivariate TestingAI CopilotAI WingmanAI Conversion OptimizerNo-Code Visual EditorProduct RecommendationsFeature FlagsView all features โ†’
Use Cases
Price A/B TestingTheme A/B TestingTemplate A/B TestingDiscount A/B TestingShipping A/B TestingContent A/B TestingCheckout Gateway A/BGeo-Based PersonalizationBuyer-Intent NudgesSplit-URL / Redirection
Solutions & Guides
Conversion Rate OptimizationA/B Testing SoftwareCart Abandonment RecoveryLanding Page OptimizationShopify A/B TestingShopify Personalization
Explore
WidgetsIntegrationsCustomersPricing
Resources
BlogPlaybooksWebinarsInterviewsE-BooksConversion Glossary
Platforms
ShopifyShoplineShoplazzaChrome ExtensionAll integrations
Start free trialBook a demo
Homeโ€บBlogโ€บab testingโ€บA/B Testing Documentation Template (Free)

A/B Testing Documentation Template (Free)

SKSharan KumarCo-Founder & CTO, CustomFit.aiJanuary 15, 20259 min read
On this page
  1. Why Documentation Transforms Testing Programs
  2. The Complete A/B Testing Documentation Template
  3. 1. Test Overview
  4. 2. Hypothesis Statement
  5. 3. Test Setup
  6. 4. What Changes in the Variant
  7. 5. Metrics
  8. 6. QA Checklist
  9. 7. Results
  10. 8. Segment Analysis
  11. 9. Decision and Implementation
  12. 10. Learnings and Next Steps
  13. Where to Store Your A/B Test Documentation
  14. Tips and Best Practices
  15. Key Takeaways
0%
A/B Testing Documentation Template (Free)

From the conversion glossary

Concepts referenced in this article, defined.

Definition
What Is Variant? Definition, Formula & Guide
Definition
What Is Control? Definition, Formula & Guide
Definition
What Is Hypothesis? Definition & Guide
Definition
What Is Baseline? Definition, Formula & Guide
Definition
What Is Significance? Definition, Formula & Guide
โ† Back to Ab Testing guide
Try CustomFit.ai

Run A/B tests and personalize your store without code. 14-day free trial, no credit card.

Start free trial โ†’
Share
XLinkedInEmail

Related articles

ab testing

Statistical Significance in A/B Testing: A Plain-English Guide

Statistical significance in A/B testing means there's less than a 5% chance your result is random. Here's what p-values, confidence levels, and sample size mean for your tests.

Sapna Joharยท 12 min read
ab testing

How A/B Testing Works: Step-by-Step Explained

A/B testing works by splitting traffic between two versions of a page, measuring which performs better on a conversion metric, and declaring a winner at statistical significance.

Sapna Joharยท 10 min read
ab testing

A/B Testing vs Split Testing: What's the Difference?

A/B testing and split testing are the same thing โ€” two names for the same experiment. Here's why the terms are used interchangeably and what actually matters.

Sapna Joharยท 7 min read

Start lifting conversions today.

Run rigorous A/B tests and personalize every visit on Shopify or any storefront โ€” no engineers required.

Start free trialBook a demo

Built for every D2C category

๐Ÿงด
Skincare
๐Ÿ’„
Beauty
๐ŸŒฟ
Wellness
โ˜•
F&B
๐Ÿ‘Ÿ
Apparel
๐Ÿ’
Jewelry
๐Ÿ›‹๏ธ
Home
๐Ÿผ
Baby
Live ยท Right now
Mamaearth โ€” free-shipping band +12.4% AOVGIVA โ€” festive collection page +34% revenueBellavita โ€” PDP CTA test +27.4% CVRKapiva โ€” Quiz-driven recs +9.48% CTRThe Sleep Co โ€” landing personalized 2ร— capturesPlum โ€” Returning shopper swap +18.2% CVRMamaearth โ€” free-shipping band +12.4% AOVGIVA โ€” festive collection page +34% revenueBellavita โ€” PDP CTA test +27.4% CVRKapiva โ€” Quiz-driven recs +9.48% CTRThe Sleep Co โ€” landing personalized 2ร— capturesPlum โ€” Returning shopper swap +18.2% CVR
Get in touch

Tell us about your store.

We reply within an hour during business hours. No sales pitch, no spam โ€” just answers from someone who's seen 2,400+ D2C stores.

โœ“ Reply within 1 hourโœ“ No spam, everโœ“ Free demo & setup help
โœ“ Thanks! We'll be in touch shortly.
CustomFit.ai

The all-in-one website personalization, A/B testing & CRO platform for high-growth D2C brands. Made by marketers, fueled by coffee.

in๐•โ—Žโ–ถf
Product
  • Features
  • A/B Testing
  • Personalization
  • AI Copilot
  • AI Wingman
  • AI Conversion Optimizer
  • Feature Flags
  • Widgets
  • Integrations
  • ROI Calculator
Platforms
  • Shopify
  • Shopline
  • Shoplazza
  • Salesforce
  • Chrome Extension
  • All Integrations
Resources
  • Blog
  • Playbooks
  • Webinars
  • GrowthFit Interviews
  • Free E-Books
  • Conversion Glossary
  • Case Studies
Compare
  • vs VWO
  • vs Optimizely
  • vs Google Optimize
  • vs Mutiny
  • vs Intelligems
  • vs Shoplift
  • vs AB Tasty
  • vs Convert
  • vs Kameleoon
Company
  • About Us
  • Partners
  • CustomFit Awards
  • Recognition
  • Contact
  • Privacy Policy
  • Terms & Conditions
ยฉ 2026 CustomFit.ai ยท Valley Monks Pvt Ltd ยท Made by marketers, fueled by coffee, and obsessed with conversions.
SOC 2 Type II ยท GDPR ยท CCPA ยท ISO 27001

A/B testing without documentation is an activity. A/B testing with documentation is a program. Documentation captures why you ran each test, what you expected, what actually happened, and what you learned โ€” building institutional knowledge that compounds over time. This free template covers everything your team needs to document before, during, and after every test. Copy and use it.

Why Documentation Transforms Testing Programs

Most D2C teams run A/B tests without proper documentation. The result: they re-test the same ideas years later (because no one remembers the results), they can't explain to new team members why the site looks the way it does, and they can't build on previous learnings because those learnings were never captured.

Teams with good documentation do the opposite. They know exactly which elements on their site have been tested, what the results were, and what hypotheses remain to be tested. New team members onboard faster. Investors can see a rigorous experimentation process. The testing program becomes a strategic asset, not just a set of experiments.

The Complete A/B Testing Documentation Template

Example

Below is the full template. Copy this into Notion, Google Docs, or Airtable for each test.

1. Test Overview

Test Name: [Descriptive name, e.g., "Homepage Hero โ€” Value Prop vs Social Proof Focus"]
Test ID: [ID from testing tool]
Created by: [Name]
Creation date: [Date]
Status: Planning / Running / Completed / Abandoned

2. Hypothesis Statement

The most important part of any test document. A good hypothesis has three components: observation, proposed change, and expected outcome.

Observation:
[What did you see in your data, session recordings, or customer feedback
that suggests this change is worth testing?]

Example: "Heatmap data shows 78% of mobile visitors never scroll past the
hero section on the homepage. The current hero focuses on product imagery
with no explicit value proposition. Customer interviews indicate that
new visitors don't understand what differentiates us from competitors."

Proposed Change:
[What specifically will you change in the variant?]

Example: "Replace the current product-image-focused hero with a headline
stating our core value proposition ('India's fastest-delivered skincare,
with a 30-day returns guarantee'), with one product image and a prominent CTA."

Expected Outcome:
[What metric do you expect to improve, by approximately how much, and why?]

Example: "We expect the variant to improve homepage-to-product-page
click-through rate by 15โ€“20%, because communicating the value proposition
clearly reduces the information gap that causes visitors to bounce without
engaging further."

3. Test Setup

Test Type: A/B / Multivariate / Personalization

Pages in scope:
- [List every URL or page template included in the test]

Traffic allocation:
- Control: [X]%
- Variant A: [Y]%
- Variant B (if multivariate): [Z]%

Target audience:
- All visitors / New visitors only / Returning visitors only /
  Mobile only / [Custom segment definition]

Testing tool: [CustomFit.ai / VWO / etc.]
Test ID in tool: [ID]

Launch date: [Date]
Planned end date: [Date]
Maximum duration: [e.g., 30 days]

Sample size target: [Required visitors per variant for significance]
Statistical significance target: [95% / 90% confidence]

4. What Changes in the Variant

Variant description:
[Clear written description of every change made in each variant.
Be specific enough that someone could recreate the variant from the description alone.]

Variant A changes:
1. [Change 1 โ€” element, change made, reason]
2. [Change 2]
3. [Change 3]

What STAYS THE SAME:
[List elements that are intentionally kept identical between control and variant.
This is important for isolating the effect of the changes.]

5. Metrics

Primary metric (decision metric):
[One metric that determines whether the variant wins or loses]
Example: "Purchase completion rate (orders / sessions)"

Secondary metrics (diagnostic):
[2โ€“4 additional metrics you'll track to understand how the primary
metric changed and why]
Example:
- Homepage to product page click-through rate
- Add-to-cart rate
- Revenue per session

Guardrail metrics:
[Metrics you're monitoring to ensure the variant doesn't cause harm
in other areas]
Example:
- Return rate (we don't want a misleading variant to increase returns)
- Customer support contact rate (we don't want confused customers)
- Page load time (we don't want the variant to slow the page significantly)

Baseline (pre-test) values:
- Primary metric baseline: [X%] (measured over [date range])
- Secondary metric 1 baseline: [Y%]
- Secondary metric 2 baseline: [Z%]

6. QA Checklist

Sections

[ ] Desktop render verified (Chrome)
[ ] Mobile render verified (iPhone)
[ ] Mobile render verified (Android)
[ ] Conversion tracking fires correctly
[ ] Traffic split verified (both variants receiving visitors)
[ ] Anti-flicker functioning (no visible flicker on throttled connection)
[ ] No JavaScript console errors
[ ] Checkout flow complete walkthrough done
[ ] Form submissions functional
[ ] Page speed impact checked (baseline vs. test)

QA completed by: [Name]
QA date: [Date]
QA notes: [Any issues found and resolved]

7. Results

(Complete after test ends)

Test end date: [Date]
Reason for ending: [Reached significance / Reached max duration / Abandoned]

Final sample sizes:
- Control: [N visitors]
- Variant A: [N visitors]
- Variant B (if applicable): [N visitors]

Primary metric results:
- Control: [X%]
- Variant A: [Y%]
- Absolute difference: [+/-Z percentage points]
- Relative difference: [+/-W%]
- Statistical significance: [X% confidence]
- Result: Control wins / Variant A wins / Inconclusive

Secondary metrics:
- [Metric 1]: Control [X%] vs Variant [Y%] โ€” [direction]
- [Metric 2]: Control [X%] vs Variant [Y%] โ€” [direction]

Revenue impact estimate:
- Estimated incremental revenue from winner (if implemented):
  [CVR improvement] ร— [monthly sessions] ร— [average order value] = [โ‚นX/month]

8. Segment Analysis

Device type results:
- Mobile: Control [X%] vs Variant [Y%] โ€” [Win/Loss/Neutral]
- Desktop: Control [X%] vs Variant [Y%] โ€” [Win/Loss/Neutral]

Traffic source results:
- Paid traffic: Control [X%] vs Variant [Y%]
- Organic: Control [X%] vs Variant [Y%]
- Direct: Control [X%] vs Variant [Y%]

New vs returning visitor results:
- New visitors: Control [X%] vs Variant [Y%]
- Returning visitors: Control [X%] vs Variant [Y%]

Key segment finding:
[1โ€“2 sentences summarizing the most important segment insight]

9. Decision and Implementation

Decision: Implement winner / Stay on control / Run follow-up test / Needs more data

If implementing:
- Implementation date: [Date]
- Implementation method: [CustomFit.ai rollout / Developer implementation]
- Implemented by: [Name]

If not implementing:
- Reason: [Why the winner wasn't implemented despite statistical significance]

10. Learnings and Next Steps

What we learned:
[2โ€“4 bullet points of genuine insight from this test โ€”
applicable beyond just this specific test]

Example:
โ€ข Adding specific delivery promise ("arrives by [date]") outperforms
  generic "fast shipping" messaging for mobile visitors โ€” suggests
  specificity matters more for mobile where context is limited.
โ€ข Trust signals work differently for paid vs organic traffic โ€” paid
  traffic needs more reassurance before converting.

What we'd do differently:
[Honest reflection on test design, setup, or analysis that could be improved]

Follow-up tests generated:
1. [Hypothesis for test inspired by this result]
2. [Hypothesis]

Test ID linked to this result: [If a follow-up test has already been created]

Where to Store Your A/B Test Documentation

Notion: Best for teams that want rich documentation with linked databases. Create a master "Experiments" database where each record is one test, with all 10 sections as properties or sub-pages.

Airtable: Best for teams that want spreadsheet-style filtering and reporting alongside rich documentation. The grid view lets you filter tests by status, page, primary metric, or result.

Google Sheets: Simplest starting point. A single spreadsheet with one row per test and key columns covers the essentials. Not great for rich text fields but works for smaller teams.

Confluence: Common in larger organizations with existing Atlassian tooling. Good integration with Jira for test backlog management.

The right tool is whichever one your team will actually use consistently. Start with Google Sheets if you're unsure โ€” you can migrate to a richer tool as your documentation practice matures.

Tips and Best Practices

Write the hypothesis before setting up the test. The temptation is to set up the test in your testing tool first, then write documentation later. Reverse this order. Writing the hypothesis forces clarity about what you're testing and why, which often reveals test design issues before you waste time on a poorly-specified experiment.

Document null results as thoroughly as winners. A test that shows no significant difference between control and variant is still valuable โ€” it tells you the change didn't matter, which is itself a finding. "We tested X, it didn't work, here's what we learned" is exactly the institutional knowledge that prevents repeated effort.

Review test documentation in retrospective. Monthly or quarterly, review recent completed tests as a team. Look for patterns: do tests on mobile consistently outperform desktop? Do tests on paid traffic pages behave differently from organic pages? These patterns generate new hypotheses.

Link related tests. When test B was inspired by learning from test A, link them in your documentation. This creates a searchable chain of reasoning that helps new team members understand the experimentation history.

Key Takeaways

  • A/B test documentation transforms a series of experiments into a compounding learning program.
  • The hypothesis statement โ€” observation, proposed change, expected outcome โ€” is the most important document to write before any test.
  • Document all 10 sections: hypothesis, setup, variant changes, metrics, QA, results, segments, decision, and learnings.
  • Document null results and losing tests as thoroughly as winning ones โ€” the learning is equally valuable.
  • Use CustomFit.ai's dashboard for test management and pair it with Notion or Airtable for documentation that your whole team can access and build on.