Why A/B Testing matters for Marketing Analysts
A/B testing lets Marketing Analysts prove what actually works—landing pages that convert, ads that lower CPA, emails that get more clicks. You move strategy from opinions to measurable impact. With solid tests, you can prioritize budgets, reduce risk, and explain results clearly to stakeholders.
Jargon to know
- Variant: Competing versions (A = control, B = treatment).
- Primary metric: The success metric tied to your goal (e.g., conversion rate).
- Guardrail metric: A metric that must not get worse while you optimize the primary metric (e.g., bounce rate, refund rate).
- MDE: Minimum Detectable Effect, the smallest lift you care about detecting (e.g., +10% relative).
- Power & alpha: Statistical settings that control false negatives (power) and false positives (alpha).
- Sample size: Required traffic per variant to reach reliable conclusions.
Who this is for and prerequisites
- Who this is for: Marketing Analysts, growth and performance marketers, and product-minded analysts who influence campaigns, landing pages, or funnels.
- Prerequisites: Basic spreadsheet skills, comfort with ratios and percentages, and familiarity with core marketing metrics (CTR, CVR, CPA, AOV). SQL is a plus but not required.
Quick wins you can do today
- [ ] Pick one high-traffic page or campaign and define a sharp hypothesis (what, why, expected change).
- [ ] Choose one primary metric and 1–2 guardrails. Write them down before launching.
- [ ] Estimate sample size using a simple formula (see Examples) and set a realistic run time.
- [ ] Pre-commit stop rules: run until you reach the target sample size and a minimum of N days, whichever comes later.
- [ ] Create a one-slide decision rubric: ship if effect ≥ MDE and guardrails stable.
Roadmap: from idea to decision
1) Groundwork
- Identify a bottleneck (e.g., low CVR on the mobile landing page).
- Draft a hypothesis: which change, why it should help, and how much.
- Pick primary/guardrail metrics; define audience and traffic allocation.
2) Design
- Set MDE and compute sample size per variant.
- Decide duration and ramp-up plan (e.g., 20% ➜ 50% ➜ 100%).
- Predefine exclusions (bots, internal IPs) and assignment unit (user/session).
3) Implement
- Randomize consistently (the same user always sees the same variant); see the assignment sketch after this roadmap.
- Track exposure and outcomes with timestamps.
- Log guardrail metrics and segment keys (device, channel, geo).
4) Run
- Monitor data quality daily (traffic balance, event fires).
- Do not make decisions on early peeks; stick to the pre-committed stop rules.
- Pause only for data quality issues or severe guardrail breaks.
5) Analyze
- Compute effect sizes and confidence intervals (or another measure of uncertainty).
- Check randomization balance on pre-experiment metrics.
- Segment only for insights, not to fish for significance.
6) Decide and Iterate
- Ship, roll back, or iterate based on the pre-set rubric.
- Document results, caveats, and next test ideas.
- Update your experimentation backlog and playbooks.
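A minimal sketch of the consistent, user-level randomization from step 3, assuming a stable user_id (the experiment name and the 50/50 split are illustrative, not tied to any particular testing tool):
# Deterministic assignment: hash the user_id with a per-experiment salt so the
# same user always lands in the same variant, no matter when they return.
import hashlib
def assign_variant(user_id: str, experiment: str, split: float = 0.5) -> str:
    digest = hashlib.sha256(f"{experiment}:{user_id}".encode()).hexdigest()
    bucket = int(digest[:8], 16) / 0xFFFFFFFF  # maps the hash to [0, 1]
    return "A" if bucket < split else "B"
print(assign_variant("user_123", "landing_headline_test"))  # same output on every call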
Worked examples
Example 1: Sample size for a landing page conversion test
Baseline conversion rate (p): 4% (0.04). MDE: +20% relative ➜ absolute difference d = 0.04 × 0.20 = 0.008. Use a quick approximation for two-proportion tests with 95% confidence and 80% power:
Approximate n per variant ≈ 16 × p × (1 − p) / d²
Compute: 16 × 0.04 × 0.96 = 0.6144. d² = 0.008² = 0.000064. n ≈ 0.6144 / 0.000064 = 9,600 per variant (rough). Plan at least ~10k visitors per variant and run long enough to reach it. Note: This is a rule-of-thumb approximation; precise calculators may vary slightly.
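If you want a scriptable check, here is a minimal Python sketch of the standard two-proportion approximation (z-values for 95% confidence and 80% power are hard-coded; treat it as a rough check, not a replacement for a vetted calculator):
# Approximate sample size per variant for a two-proportion test
# (two-sided alpha = 0.05 -> z = 1.96, power = 0.80 -> z = 0.84).
import math
def sample_size_per_variant(baseline, relative_mde, z_alpha=1.96, z_power=0.84):
    p1 = baseline
    p2 = baseline * (1 + relative_mde)
    d = p2 - p1
    variance = p1 * (1 - p1) + p2 * (1 - p2)
    return math.ceil((z_alpha + z_power) ** 2 * variance / d ** 2)
print(sample_size_per_variant(0.04, 0.20))  # ~10,300; a bit above the 9,600 rule of thumb because it uses both variants' variance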
Example 2: Analyze results (SQL-style aggregation)
Goal: Compare conversion and revenue across variants.
-- Assume a table 'events' with user_id, variant ('A'/'B'), converted (0/1), revenue (numeric)
SELECT
  variant,
  COUNT(DISTINCT user_id) AS users,
  SUM(converted) AS conversions,
  AVG(converted)::numeric(10,4) AS cvr,
  SUM(revenue) AS rev,
  (SUM(revenue) / NULLIF(SUM(converted),0))::numeric(10,2) AS aov
FROM events
GROUP BY variant;
Then compute lift: lift = (CVR_B − CVR_A) / CVR_A. For statistical testing, use a two-proportion z-test or a vetted stats tool, and report confidence intervals, not just p-values.
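A minimal Python sketch of that z-test plus a 95% confidence interval for the absolute difference, using only the standard library (the counts in the example call are illustrative, not real results):
# Two-proportion z-test plus a 95% CI for the absolute difference (B minus A).
import math
def two_proportion_test(conv_a, users_a, conv_b, users_b):
    p_a, p_b = conv_a / users_a, conv_b / users_b
    pooled = (conv_a + conv_b) / (users_a + users_b)
    se_pooled = math.sqrt(pooled * (1 - pooled) * (1 / users_a + 1 / users_b))
    z = (p_b - p_a) / se_pooled
    p_value = math.erfc(abs(z) / math.sqrt(2))  # two-sided p-value
    se_diff = math.sqrt(p_a * (1 - p_a) / users_a + p_b * (1 - p_b) / users_b)
    ci_low, ci_high = (p_b - p_a) - 1.96 * se_diff, (p_b - p_a) + 1.96 * se_diff
    return z, p_value, (ci_low, ci_high)
z, p, ci = two_proportion_test(400, 10000, 460, 10000)  # illustrative counts only
print(f"z={z:.2f}, p={p:.4f}, 95% CI for B-A: [{ci[0]:.4f}, {ci[1]:.4f}]")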
Example 3: Email subject line test (open rate)
- A: 20,000 sent, 5,200 opens ➜ 26.0%
- B: 20,000 sent, 5,460 opens ➜ 27.3%
Relative lift = (0.273 − 0.260) / 0.260 ≈ 5.0%. Check significance with a two-proportion test. If significant and no guardrail worsens (e.g., unsubscribes), ship B. If not, keep learning and test the pre-header next.
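To run that significance check, the same two-proportion approach applies; a compact, self-contained sketch with this example's open counts:
# Two-proportion z-test for the subject line test: A = 5,200/20,000 opens, B = 5,460/20,000 opens.
import math
opens_a, opens_b, sends = 5200, 5460, 20000
p_a, p_b = opens_a / sends, opens_b / sends
pooled = (opens_a + opens_b) / (2 * sends)
z = (p_b - p_a) / math.sqrt(pooled * (1 - pooled) * 2 / sends)
print(f"z={z:.2f}, two-sided p={math.erfc(abs(z) / math.sqrt(2)):.4f}")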
Example 4: Paid ads creative test (CPA and guardrails)
- Spend: A $5,000, B $5,000; Conversions: A 200 (CPA $25), B 240 (CPA $20.83).
- Guardrail: Post-click bounce rate must not increase by >2 p.p.
If B’s bounce rate rose from 45% to 48% (+3 p.p.), the guardrail is breached, so investigate quality before scaling. You might ramp B with a spend cap and run a follow-up test focused on landing page relevance.
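A small sketch of the CPA and guardrail check, using the figures from this example (the +2 p.p. threshold comes from the guardrail above):
# CPA comparison with a bounce-rate guardrail, using the figures above.
spend = {"A": 5000, "B": 5000}
conversions = {"A": 200, "B": 240}
bounce = {"A": 0.45, "B": 0.48}
cpa = {v: spend[v] / conversions[v] for v in ("A", "B")}
guardrail_ok = (bounce["B"] - bounce["A"]) <= 0.02  # allow at most +2 p.p.
print(f"CPA A=${cpa['A']:.2f}, B=${cpa['B']:.2f}, guardrail ok: {guardrail_ok}")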
Example 5: Audience targeting vs holdout (incrementality)
- Targeted group CVR: 3.2% (exposed to ads)
- Holdout CVR: 2.7% (not exposed)
Incremental lift = 3.2% − 2.7% = 0.5 p.p. Relative = 0.5 / 2.7 ≈ 18.5%. If cost per incremental conversion meets target, scale. If not, refine the audience or creative.
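A short sketch of the incrementality math; the exposed-audience size and spend are hypothetical placeholders because the example does not give them:
# Incrementality math from the CVRs above; exposed audience size and spend are
# hypothetical placeholders -- replace them with your real campaign numbers.
targeted_cvr, holdout_cvr = 0.032, 0.027
lift_pp = targeted_cvr - holdout_cvr            # 0.5 p.p. absolute lift
relative_lift = lift_pp / holdout_cvr           # ~18.5% relative lift
exposed_users = 100_000                         # hypothetical
spend = 30_000                                  # hypothetical
incremental_conversions = lift_pp * exposed_users
cost_per_incremental = spend / incremental_conversions
print(f"incremental conversions={incremental_conversions:.0f}, cost per incremental=${cost_per_incremental:.2f}")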
Drills and exercises
- [ ] Write three hypotheses: one for a landing page, one email, one ad creative. Include expected direction and MDE.
- [ ] For each hypothesis, compute rough sample size with the quick formula.
- [ ] Define one primary and two guardrail metrics for each test.
- [ ] Draft a stop rule that avoids p-hacking (e.g., run to N visits per variant and at least 14 days).
- [ ] Build a simple results table template: variant, traffic, conversions, CVR, lift, CI, guardrails, decision.
Common mistakes and how to debug
- Stopping early on a good day: Pre-commit stop rules and stick to them.
- Multiple peeks, shifting goals: Decide metrics and analysis plan upfront.
- Broken randomization: Check pre-test covariates; if imbalanced, pause and fix.
- Too many variants with little traffic: Prefer fewer, higher-powered tests.
- Ignoring guardrails: Winning CVR with worse refunds or bounce may hurt the business.
- Mixing units: Assigning by user but measuring by session can bias results. Keep the assignment unit and the measurement unit aligned.
Debug checklist
- [ ] Variant exposure counts roughly equal (given allocation)?
- [ ] Event fires present and consistent across devices?
- [ ] Spike analysis: did traffic sources or device mix shift?
- [ ] Any analytics filters or bots skewing numbers?
- [ ] Re-compute metrics from raw logs to confirm dashboard figures.
Mini project: Ship a landing page A/B test end-to-end
- Pick a high-traffic landing page and a single clear change (e.g., headline).
- Write the hypothesis with target MDE, primary metric (CVR), and guardrails (bounce, refund rate).
- Estimate sample size per variant and the planned run time.
- Implement consistent user-level randomization and tracking of exposure and conversions.
- Run to completion. Monitor only for data quality or severe guardrail breaks.
- Analyze: report CVR, lift, confidence interval, guardrails, and a ship/iterate decision.
- Publish a one-page readout and list the next two follow-up tests.
Learning path
- Start with hypothesis writing and landing page basics.
- Learn creative and messaging tests for ads and email.
- Practice audience/targeting tests and budget splits.
- Master guardrail metrics and quality checks.
- Get comfortable with run time, sample size, and stop rules.
- Refine reporting for clear stakeholder decisions.
Subskills
- Marketing Hypothesis Definition
- Landing Page Experiment Basics
- Creative And Messaging Tests
- Audience And Targeting Tests
- Budget Split Testing Basics
- Guardrail Metrics And Quality Checks
- Running Time And Sample Size Basics
- Reporting Results For Stakeholders
Next steps
- Pick one real test to run this week using the roadmap above.
- Practice analysis with the drills, then take the skill exam below.
- Document results and convert learnings into a reusable playbook.
- Move on to deeper experimentation topics after you pass the exam.
Note on progress: Anyone can take the exam for free. Logged-in users will have their progress saved automatically.