What Is P-Value? Definition & Guide

The p-value is the probability that the observed difference between your A/B test variants occurred by random chance, assuming there is no real difference between them. A p-value of 0.05 means there's a 5% probability your result is a fluke. Conventionally, when the p-value drops below 0.05, the result is considered statistically significant — meaning you can reject the null hypothesis that the variants perform equally.

Formula / How to Calculate P-Value

The p-value is derived from a test statistic (such as a z-score or chi-squared statistic). For a two-proportion z-test commonly used in A/B testing:

z = (p1 − p2) / SE

Where SE (standard error) = √(p̂(1 − p̂)(1/n1 + 1/n2)), and p̂ is the pooled conversion rate.

The p-value is then looked up from the z-distribution (or calculated by your testing tool). In most A/B testing platforms, p-value is displayed directly so you don't need to compute it manually.

Why P-Value Matters for Ecommerce

The p-value is the number that separates a gut feeling from a defensible business decision. Without it, you're just pattern-matching on noisy data — which is how brands make costly mistakes like rolling out a "winning" variant that was just lucky during a three-day weekend traffic spike. Indian D2C brands running tests during high-traffic periods like sales events need to be especially cautious: festive traffic behaves differently from normal traffic, and a low p-value during a Diwali sale might not hold in regular weeks. Understanding p-value helps you know when your test result is genuinely trustworthy.

Real-World Example

A Shopify store selling fitness supplements tested two versions of their landing page for Meta ad traffic. After 5 days, Variant B showed a 14% higher conversion rate. But when the team checked the p-value, it was 0.18 — meaning there was an 18% probability this difference was just noise. They kept the test running. By day 18, with 6,200 sessions per variant, the p-value dropped to 0.03, confirming the result was real. The lesson: a promising-looking lift with a high p-value is still an inconclusive test.

How to Improve / Optimize P-Value Outcomes

Understand what p-value is NOT: It does not tell you the probability that the null hypothesis is true, and it does not tell you the size of the effect. It only measures evidence against randomness.
Choose your significance threshold before the test: Industry standard is p < 0.05, but for low-risk UI changes, some teams accept p < 0.10. For major product changes, require p < 0.01.
Don't stop a test the moment p < 0.05: If you check daily and stop as soon as significance is reached, your actual false positive rate is much higher than 5% due to peeking.
Run tests long enough to get sufficient sample size: The fastest way to achieve a low p-value honestly is to expose enough visitors to each variant — not to stop testing when you get lucky early.
Combine p-value with effect size: A statistically significant result with a p-value of 0.04 but only a 0.3% absolute improvement may not be worth acting on. Significance and magnitude are both important.

P-Value in A/B Testing

P-value is the core output of hypothesis testing in every A/B test. Most experimentation platforms present it as a confidence percentage (95% confidence = p-value of 0.05) to make it more intuitive. Whether you're reading raw p-values or confidence percentages, the underlying math and meaning are identical.

Run smarter A/B tests with CustomFit.ai — 14-day free trial, no credit card required.

Put this into practice

Run A/B tests and personalize your store without code. 14-day free trial, no credit card.

Start free trial →

Articles about What Is P-Value? Definition & Guide

In-depth guides and case studies where this concept is put to work.

A/B Testing: The Complete Guide for Ecommerce & D2C Brands
Master A/B testing — from hypothesis to statistical significance — with practical examples for Shopify, WooCommerce, and D2C brand…

← Back to Conversion Glossary

Formula / How to Calculate P-Value

The p-value is derived from a test statistic (such as a z-score or chi-squared statistic). For a two-proportion z-test commonly used in A/B testing:

z = (p1 − p2) / SE

Where SE (standard error) = √(p̂(1 − p̂)(1/n1 + 1/n2)), and p̂ is the pooled conversion rate.

The p-value is then looked up from the z-distribution (or calculated by your testing tool). In most A/B testing platforms, p-value is displayed directly so you don't need to compute it manually.

Why P-Value Matters for Ecommerce

Real-World Example

How to Improve / Optimize P-Value Outcomes

Understand what p-value is NOT: It does not tell you the probability that the null hypothesis is true, and it does not tell you the size of the effect. It only measures evidence against randomness.

Choose your significance threshold before the test: Industry standard is p < 0.05, but for low-risk UI changes, some teams accept p < 0.10. For major product changes, require p < 0.01.

Don't stop a test the moment p < 0.05: If you check daily and stop as soon as significance is reached, your actual false positive rate is much higher than 5% due to peeking.

Run tests long enough to get sufficient sample size: The fastest way to achieve a low p-value honestly is to expose enough visitors to each variant — not to stop testing when you get lucky early.

Combine p-value with effect size: A statistically significant result with a p-value of 0.04 but only a 0.3% absolute improvement may not be worth acting on. Significance and magnitude are both important.

P-Value in A/B Testing

Formula / How to Calculate P-Value

Why P-Value Matters for Ecommerce

Real-World Example

How to Improve / Optimize P-Value Outcomes

P-Value in A/B Testing

Related Terms

Put this into practice

Articles about What Is P-Value? Definition & Guide

Built for every D2C category

Formula / How to Calculate P-Value

Why P-Value Matters for Ecommerce

Real-World Example

How to Improve / Optimize P-Value Outcomes

P-Value in A/B Testing

Related Terms