The Art of A/B Testing
@mike_greenfield
numeratechoir.com
500 Startups, 2013-05-08
What’s an A/B Test?
Definition: An A/B Test is a means by which
a product’s users are randomly given
one of two or more experiences.
Usage: Companies use A/B Tests to
discover which experience is most
effective.
A/B Testing Overview
Why You Should A/B Test
What You Should A/B Test
When You Should A/B Test
How You Should A/B Test
What I’ve Learned from A/B Testing
Why You Should A/B Test
Humans’ quantitative intuition is poor
Beliefs:
1) “I have great product intuition”
2) “This is business, not science”
3) “It leads to a local maximum”
Reality:
At Internet scale, testing and measuring
properly has a huge return on investment.
“I have great product intuition”
• Improved signup flow: often a disaster
• Sharing on Facebook: can be good or
bad
• Home page changes: usually a mixed
bag
…but product changes != progress
“This is business, not science”
• Requires oversight and sleuthing
• Only works for big improvements
• Requires other factors to stay the same
…but change-then-measure is flawed
“It leads to a local maximum”
Test | Test Type | Goal
Change Button Color | Optimization | Increased clickthrough
Old Site Design vs. New Site Design | Holistic (aka don’t shoot yourself in the foot) | Make an informed decision on old vs. new
…so test holistically
What You Should A/B Test
Test an almost viral flow
1000 signups → 300 inviting friends → 3000 invitations → 1500 invitation clicks → 900 signups
K-factor = 0.9
☑ A/B Test this flow!
Don’t test a non-viral flow
1000 signups → 100 inviting friends → 600 invitations → 200 invitation clicks → 80 signups
K-factor = 0.08
Spend time elsewhere
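The K-factors on the two slides above follow directly from the funnel counts: resulting signups divided by initial signups. A minimal Python sketch of that arithmetic, where the function names `k_factor` and `step_rates` are illustrative and not from the talk:

```python
def k_factor(funnel):
    """K-factor = signups produced by the loop / signups that entered it.

    `funnel` is an ordered list of (label, count) pairs, starting with the
    initial signups and ending with the signups those users generated.
    """
    return funnel[-1][1] / funnel[0][1]


def step_rates(funnel):
    """Ratio of each funnel step to the step before it."""
    return [
        (f"{prev_label} -> {label}", count / prev_count)
        for (prev_label, prev_count), (label, count) in zip(funnel, funnel[1:])
    ]


# Counts copied from the two slides.
almost_viral = [
    ("signups", 1000),
    ("inviting friends", 300),
    ("invitations", 3000),
    ("invitation clicks", 1500),
    ("new signups", 900),
]
non_viral = [
    ("signups", 1000),
    ("inviting friends", 100),
    ("invitations", 600),
    ("invitation clicks", 200),
    ("new signups", 80),
]

for name, funnel in [("almost viral", almost_viral), ("non-viral", non_viral)]:
    print(name, "K-factor:", k_factor(funnel))
    for step, rate in step_rates(funnel):
        print("   ", step, round(rate, 3))
```

Printing the per-step ratios alongside K shows which step of the loop is weakest, which is one way to pick the step to test first.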
Test email subject lines
Subject: The Craziest Thing My Child Has Done
vs.
Subject: 5 Embarrassing Kid Moments
+153%
Improve purchase funnel
Get 2011 Picks
vs.
Get Bracketbrains
+30%
Validate your new design*
* if you have scale
Scale | What to Do | Reasoning
Early stage | Just change it. | Nothing to lose, no data to test.
Something to lose | A/B test it. | The existing product is probably more effective than you think. Understand the consequences of an “upgrade.”
When You Should A/B Test
Test only if you can get to
statistical significance
(Google: “split test calculator”)
Big: 200,000 emails with a 3% CTR option and a 4% CTR option: 6000 vs. 8000 clicks
Small: 200 emails with a 3% CTR option and a 4% CTR option: 6 vs. 8 clicks
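To make the big-versus-small contrast concrete, here is a rough Python sketch of the kind of check a split test calculator might perform. It uses a two-sided two-proportion z-test, which is an assumed choice here and not something the slides specify. It also assumes each option is sent to the stated number of emails, since that is the reading under which 3% and 4% give 6000 vs. 8000 and 6 vs. 8.

```python
import math


def two_proportion_p_value(clicks_a, sent_a, clicks_b, sent_b):
    """Two-sided p-value for the difference between two click-through rates."""
    rate_a = clicks_a / sent_a
    rate_b = clicks_b / sent_b
    # Pooled rate under the null hypothesis that both options perform the same.
    pooled = (clicks_a + clicks_b) / (sent_a + sent_b)
    std_err = math.sqrt(pooled * (1 - pooled) * (1 / sent_a + 1 / sent_b))
    z = (rate_b - rate_a) / std_err
    # P(|Z| > |z|) for a standard normal Z.
    return math.erfc(abs(z) / math.sqrt(2))


p_big = two_proportion_p_value(6000, 200_000, 8000, 200_000)
p_small = two_proportion_p_value(6, 200, 8, 200)
print("big list p-value:  ", p_big)
print("small list p-value:", p_small)
```

With the same 3% vs. 4% rates, the big list gives a p-value far below the usual 0.05 threshold, while the small list's p-value is well above it: 6 vs. 8 clicks is indistinguishable from noise.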
Test only if you can get to business
significance
• Only things that can cumulatively
have a meaningful impact on your
business
• For emails, a small list means small
improvements: don’t test
• For virality, small changes matter if and
only if you’re close to K=1
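One way to see why small virality gains only matter near K=1: if K is assumed to stay constant across generations of invitations (a simplifying assumption, not a claim from the talk), each seed signup eventually yields 1 + K + K² + … = 1/(1 - K) total signups for K < 1. A short Python sketch with the illustrative helper `total_signups_per_seed`:

```python
def total_signups_per_seed(k):
    """Total signups per seed user, summing the geometric series 1 + k + k^2 + ...

    Only valid for 0 <= k < 1; at k >= 1 the series diverges (true viral growth).
    """
    if not 0 <= k < 1:
        raise ValueError("series only converges for 0 <= k < 1")
    return 1 / (1 - k)


# The same +0.05 improvement in K, far from 1 and close to 1.
for before, after in [(0.08, 0.13), (0.90, 0.95)]:
    print(
        f"K {before:.2f} -> {after:.2f}: "
        f"{total_signups_per_seed(before):.2f}x -> {total_signups_per_seed(after):.2f}x"
    )
```

Under this model, moving K from 0.90 to 0.95 doubles the multiplier from about 10x to about 20x, while moving from 0.08 to 0.13 barely changes it.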
Rule of thumb: test every user-facing change that will be seen
by 10,000-100,000 people
How You Should A/B Test
Okay Choice: Use Commercial
Tools
• MixPanel
• Unbounce
• Optimizely
• Google Analytics
Best Choice: Build Your Own
Framework
• Yep, it’s work with no immediate
payoff
• Your mom won’t care
• Your users won’t care
But…
• There are simple ways to get started
• It gives you tons of flexibility
Why Build Your Own
• Incorporate tests in many places
(page ordering, new designs, email
content, email strategy, mobile)
• Look at results holistically
• Go back and see how any test
influences anything, not just the stats
you’re tracking
Code
It needs to be super simple to create a test
{ab_test_if test="signup_reason" option="awesome" user=$viewer}
because it’s awesome!
{/ab_test_if}
{ab_test_if test="signup_reason" option="free" user=$viewer}
because it’s free!
{/ab_test_if}
Data Structure
AB_TESTS (id, name, time_created)
AB_TEST_OPTIONS (id, ab_test_id, weight, name)
USER_AB_TEST_OPTIONS (id, user_id/visitor_id, ab_test_option_id, time_created)
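As a rough illustration of how these three tables could back a call like `ab_test_if`, here is a Python sketch using an in-memory SQLite database. The column types, the helper names `create_test` and `option_for_user`, and the weighted random draw are all assumptions for illustration; the talk gives only the table and column names. For brevity the sketch stores a plain `user_id` and ignores `visitor_id`.

```python
import random
import sqlite3
import time

# Table and column names follow the slide; the types are assumed.
SCHEMA = """
CREATE TABLE AB_TESTS (id INTEGER PRIMARY KEY, name TEXT UNIQUE, time_created REAL);
CREATE TABLE AB_TEST_OPTIONS (
    id INTEGER PRIMARY KEY, ab_test_id INTEGER, weight REAL, name TEXT);
CREATE TABLE USER_AB_TEST_OPTIONS (
    id INTEGER PRIMARY KEY, user_id INTEGER, ab_test_option_id INTEGER,
    time_created REAL);
"""


def create_test(db, name, weighted_options):
    """Register a test; `weighted_options` maps option name -> relative weight."""
    cur = db.execute(
        "INSERT INTO AB_TESTS (name, time_created) VALUES (?, ?)",
        (name, time.time()),
    )
    db.executemany(
        "INSERT INTO AB_TEST_OPTIONS (ab_test_id, weight, name) VALUES (?, ?, ?)",
        [(cur.lastrowid, weight, opt) for opt, weight in weighted_options.items()],
    )


def option_for_user(db, test_name, user_id, rng=random):
    """Return the user's option, assigning one by weight the first time."""
    row = db.execute(
        """SELECT o.name FROM USER_AB_TEST_OPTIONS u
           JOIN AB_TEST_OPTIONS o ON o.id = u.ab_test_option_id
           JOIN AB_TESTS t ON t.id = o.ab_test_id
           WHERE t.name = ? AND u.user_id = ?""",
        (test_name, user_id),
    ).fetchone()
    if row:
        return row[0]  # already assigned: keep the user's experience stable
    options = db.execute(
        """SELECT o.id, o.name, o.weight FROM AB_TEST_OPTIONS o
           JOIN AB_TESTS t ON t.id = o.ab_test_id WHERE t.name = ?""",
        (test_name,),
    ).fetchall()
    opt_id, opt_name, _ = rng.choices(options, weights=[o[2] for o in options])[0]
    db.execute(
        "INSERT INTO USER_AB_TEST_OPTIONS (user_id, ab_test_option_id, time_created)"
        " VALUES (?, ?, ?)",
        (user_id, opt_id, time.time()),
    )
    return opt_name


db = sqlite3.connect(":memory:")
db.executescript(SCHEMA)
create_test(db, "signup_reason", {"awesome": 1, "free": 1})
first = option_for_user(db, "signup_reason", user_id=42)
print(first, option_for_user(db, "signup_reason", user_id=42))
```

Storing the assignment with its `time_created` keeps a returning user in the same option and lets reporting count only activity that happened after the assignment.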
Reporting
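select
  b.AB_TEST_OPTION_ID,
  a.ACTIVITY_ID,
  -- USER_ID exists in both tables, so it must be qualified
  count(distinct a.USER_ID) as USERS_DOING_ACTIVITY
from USER_ACTIVITY a
join USER_AB_TEST_OPTIONS b on a.USER_ID = b.USER_ID
where b.AB_TEST_OPTION_ID in (…)
  and a.ACTIVITY_ID in (…)
  -- count only activity that happened after the user was assigned an option
  and a.time_created > b.time_created
group by b.AB_TEST_OPTION_ID, a.ACTIVITY_ID;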
Scaling
• An A/B testing system can yield a lot of
DB writes
• Reporting means many long-running
SQL queries
• Need to batch several aspects
Run Some A/A Tests
• Test two versions of the same thing
• If results are wildly off, something’s
wrong with the testing system
• Deciding too early is a major issue: it’s
usually best to be conservative before
choosing a winner
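The cost of deciding too early can be illustrated with a simulated A/A test. The Python sketch below is an assumption-laden illustration, not the author's framework: both arms share the same true conversion rate, a two-proportion z-score is checked at regular "peeks", and a run counts as a false winner if any peek crosses |z| > 1.96. All parameter values are made up for the example.

```python
import math
import random


def z_score(conv_a, n_a, conv_b, n_b):
    """Two-proportion z-score; returns 0.0 when there is no variance yet."""
    pooled = (conv_a + conv_b) / (n_a + n_b)
    std_err = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    return 0.0 if std_err == 0 else (conv_b / n_b - conv_a / n_a) / std_err


def simulate_aa(runs=200, users_per_arm=2000, rate=0.05, peek_every=100, seed=0):
    """Return (share of runs 'significant' at any peek, share at the end only)."""
    rng = random.Random(seed)
    any_peek = at_end = 0
    for _ in range(runs):
        conv_a = conv_b = 0
        crossed = False
        for n in range(1, users_per_arm + 1):
            # Both arms are identical: same true conversion rate.
            conv_a += rng.random() < rate
            conv_b += rng.random() < rate
            if n % peek_every == 0 and abs(z_score(conv_a, n, conv_b, n)) > 1.96:
                crossed = True
        any_peek += crossed
        at_end += abs(z_score(conv_a, users_per_arm, conv_b, users_per_arm)) > 1.96
    return any_peek / runs, at_end / runs


peeking_rate, final_rate = simulate_aa()
print("false winners when stopping at any peek:", peeking_rate)
print("false winners when judging only at end: ", final_rate)
```

Because the final check is also one of the peeks, the peeking rate can never be lower than the end-only rate. Running the simulation shows how much higher it gets when a winner is declared at the first significant-looking peek.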
More framework implementation
details: bit.ly/artofabtesting
What I’ve learned from A/B testing
Focus on 1 item in emails
• Clear subject focused on that item (Why
the Giants will win the World Series)
• Body of text focused on that item
(peripheral content is okay on the
periphery)
• Clear, big clickthrough action in the
email body (See why the Giants will win)
• Require clickthrough to get the full story
In signup, minimize distraction
• Provide context/messaging of what
the product is, but don’t make it
clickable
• Clear “next” or “continue” steps to
guide user through the process
• Remove unnecessary navigation
Highlight friends, not your product
• Most effective: your friends are doing
something; you should join them
• Unless you’re Apple, no one cares
about your new feature or new design
• People probably don’t care about
your fancy new social network
• “Join my circle because I trust you”
beats “check out this great product”
When possible,
TELL USERS WHAT TO DO
3 Things to Remember
A/B testing = good culture
• Data trump opinions
• Iterate quickly but intelligently
• Everyone gets better at predicting
product success
Test changes if they’re likely to
have both statistical and business
significance
Validate the Big Stuff
A/B Test Holistically; testing
isn’t a substitute for product
vision.
Optimize the Small Stuff
The details matter more than you
think.
Thanks.
mike@mikegreenfield.com
@mike_greenfield
