SlideShare a Scribd company logo
1 of 87
Download to read offline
So hey
My name’s Dan McKinley. I’m a general purpose web …ish programmer guy.
I got my start in the financial industry back in the early 2000’s. I did that for a few years and 
then I freaked out and wound up at a startup called Etsy, in Brooklyn. That was back in 2007.
Pretty much the whole talk will be about my time at Etsy. I left earlier this year to start a new 
company with some other folks from Etsy. But I still think Etsy is awesome. I had a great time 
there and learned an awful lot. So I have a few talks left to do about this I guess.
I realize we’re on the west coast and people may not be familiar with Etsy, so I’ll give you a 
little background. Etsy’s a marketplace for handmade and vintage goods. It gets about 4 
million uniques a day, and it sold well over a billion dollars in goods in 2013. 
What I’m trying to do with this slide is convince you that Etsy is pretty freaking big, even if you 
haven’t heard of it. It’s certainly within the top 100 websites in the US.
But back when I started in 2007, it was tiny. Less than 20 people.
That journey was a wild ride. It was tumultuous. There were three CEO changes while I was 
there. There was plenty of arguing about plenty of things.
And the most contentious issue of all had to be this. What should we spend our time doing? 
I’m sure this is a hot topic at most companies. Well it was at Etsy too.
I’d say at all points, this was more or less the company line. We should be data driven. 
Someone heard that Google was data driven, and we wanted to be like Google. So we should 
be data driven. There was one problem with this.
And that was that nobody knew what that meant, or what it would imply. You’d get this advice 
to “be data driven” all the time, with no additional context or advice.
Or even worse, you’d get this. If you wanted to get a project approved, you’d dress it up in 
data. You’d sprinkle some numbers on it and wave a dead chicken at it to show you were 
being data driven. It was display behavior.
But eventually I think I started to figure some of this out. I started to figure out what it would 
mean to be data driven while you’re picking priorities. 
And I also realized that it’s not that complicated. 
That’s what this talk is about.
I want to go through the story of how I figured some of this out in more detail.
And that story starts back in 2007.
Etsy in its early days was a magical place. I would not want anyone to get the wrong 
impression here and think I’m talking smack about Etsy in the early days. It was awesome. I 
miss 2007 all the time.
But I think it’s also important to be honest. Etsy grew geometrically because it was a thing that 
was out there waiting to be discovered. It didn’t grow that way because of carefully planned 
product and marketing efforts. 
Sure, there was agency in the creation of the site. And people worked hard. But the massive 
growth that happened wasn’t strongly connected to the actions of employees. This is utterly 
obvious to those of us that were there.
But few of us, as human beings, are hardwired to see things that way. Instead we all tend to 
think that if your work takes off, it means you’re smart and your actions are good. 
We assume that the methods that precede success caused it.
We assumed that the success was our doing, and not happenstance. And I dunno, maybe we 
developed outsized egos.
We didn’t really question ourselves. We created a thing and people loved it. And we believed 
that to be a repeatable process.
If you build it, they will come, in other words.
And you know what, if the site’s growth is really insane, it looks like it’s working. You can 
release things and as long as they don’t completely destroy everything it will look like you’re a 
genius. All the graphs will go up and to the right. 
And that’s awesome for as long as you don’t think about it too hard.
But of course we went and spoiled all of that by thinking about it too hard.
A thing I noticed (and I’m sure I wasn’t the only person to notice this) was that we had been 
consistently deleting features after a while because nobody wound up using them. 
We’d release a feature, and a year or two later it would need maintenance or become a 
support headache. At that point we’d look to see if anyone was using it, and pretty often the 
answer would be “no.” So we’d just kill the thing.
A good example of that is a feature called Alchemy. This was a feature that let you describe 
an item you wanted, and then have Etsy shop owners bid on making it for you. 
That’s a neat idea, and it was in the New York times a few times and was generally 
considered awesome when it came out back in 2008.
But then we took it down for good in 2011. I looked into it at the time, and I think the statistic 
was that in three years Alchemy sold about as many items as the rest of the site sells in less 
than a day. It was a giant bust.
Around that time I started selfishly wondering if there was some way I could avoid 
participating in projects like alchemy. Because there’s this weird thing about me: I prefer not to 
work on things that aren’t going to be used and are just destined to be taken down after a year 
or two.
Around the same time we started A/B testing things in earnest. And I latched onto that as a 
possible solution. Instead of pushing things out to everyone all at once we’d do a split test and 
try to get a quantitative measure of how we were doing.
The thing that A/B testing revealed right away was: holy crap! We’ve been delusional about 
our abilities up to this point. 
A huge percentage of products that we tested either had no effect, or made things worse. 
Usually only slightly worse, but still. And we certainly weren’t having the positive impact we 
imagined.
For example in early 2011 I was involved in redesigning our homepage. Five of us put about 
four months of engineering effort into it. The CEO worked on it with us. We imagined that this 
would be really important, being the homepage and all. 
But we released it and its primary effect was: zilch. And it slightly reduced the number of 
people signing up. We wound up just throwing all of this work away. 
And that was a really crappy experience that I didn’t want to repeat.
At this point our problem seemed to be that we were still picking projects based on which one 
sounded the best.
But the way we decided whether or not things were working after the fact had changed. We 
were evaluating our results with A/B testing. And that turned out to be really important.
Once we were looking at how we were really performing when we were releasing products, 
we’d opened Pandora’s box. 
It would be pretty disingenuous for us to just ignore this problem and go back to not A/B 
testing. 
But we weren’t happy releasing products that had a minor, neutral, or even negative impact.
Our batting average with products was terrible. But over time I did figure out a way to get 
slightly better at this.
Back at the beginning we started with this. We’d have an idea, we’d code it, and then we’d 
push it out.
Then in 2010 we tacked this A/B testing step on to the end.
But that was blowing up in our faces and pretty often it really looked more like this. We’d have 
to spend a lot more time in testing than we’d planned on, because our metrics got worse.
And occasionally we couldn’t resolve that, and we might not even release stuff.
But around 2012 I settled into a process that looked more like this. It’s a little more 
complicated, but it worked a lot better. 
The premise of this is basically, “hey, maybe we should incorporate data earlier on in the 
process.”
The notable feature here is an explicit step at the beginning where I try to validate ideas using 
data before doing anything else. I’m going to spend most of the rest of the talk discussing this.
The other thing I started trying to do in earnest was to avoid over-committing. We’d explicitly 
try to build minimal versions of things and A/B test with those.
Doing that took a lot of discipline sometimes. For example this is a screenshot of one of my 
prototypes that was notoriously ugly, even when it wasn’t trying to sell you rhinoceros beetle 
taxidermy. 
I would come into work every day while this experiment was running and have a discussion 
with a designer or a support person about how crappy it looked in some edge cases. I’d smile 
and nod and say we’d fix the rough edges after we were sure it was going to last. And 
eventually we did that.
If a project survived the first round of A/B tests, you would go back and apply some polish to 
it.
The nice thing about this process is that it gives me at least two relatively inexpensive places 
where I can decide to give up on a project that isn’t working. Of course I still have the option of 
scrapping it at the very end, but I’m less likely have to exercise that.
One thing about following this process is that you tend to have a lot of downtime while you’re 
waiting for A/B tests to finish running. 
So I started pipelining other projects. I’d get the process going on something else while I was 
waiting for experimental results. It wasn’t uncommon for me to have two or three or five 
running experiments at once.
Obviously that’s a bookkeeping challenge. You have many oars in the water at once. You 
have to keep meticulous notes just to remember what you were doing.
It’s also a challenge to get over the feeling that analysis isn’t work, at least not in the same 
way that coding features is work. 
Time spent doing analysis can be a lot more valuable than time spent coding. But that’s 
contrary to all of our instincts as engineers.
Instead of just leaving you with that abstract stuff, I’ll go through a couple of examples of 
doing the validation on a potential project.
I’ll go through two ideas, they’re both real things that came up at some point. I think they both 
sound like pretty good ideas when you first hear them. And early on that would have been 
enough for someone to work on them.
The first idea I’ll go through is building a landing page for furniture that’s local to the visitor.
The general idea would be: Etsy already has these landing pages for all of its categories. And 
furniture is pretty hard to buy on the internet, because shipping it is difficult. 
Maybe we could improve the experience by making the pages show items that are 
geographically close to the visitor.
To get stated, we assess is how large the audience for this is, and what their current behavior 
is like.
I’d start by looking at the overall fraction of page views that are on the page we’d be talking 
about changing. And in this case, these pages have quite a bit of traffic. It’s not the most 
important page, but it’s something.
And of course not all of those pageviews are for furniture, but a decent amount of them are. 
So far so good.
Going one step further, I'd see how many people buy things that they first encountered on 
these pages. And If you do that the project looks a little less attractive, because these pages 
really don’t sell many items. 
That doesn’t necessarily mean that the browse pages are bad. It’s probably just that the traffic 
on those pages tends to be pretty removed from the purchasing decision. 
Compare that to the search page, which has a ton of traffic but also sells a ton of items. Once 
you’ve gone to the trouble of typing a search, you’re a lot more likely to be in buying mode.
But wait: furniture is expensive. So maybe this is still worth digging into. 
Sure, we won’t sell many more items total, but the ones we do sell will cost more. 
If the average order on Etsy is $40—which by the way it’s not, I am making up financial details 
in this talk—let’s guess that the average order we’ll create will be ten times that. Let’s guess 
$400.
From there, we can start to answer what changing the state of the world would imply.
You get a formula like this. We have a certain number of visitors to the site, some percentage 
of them convert into purchases, and each purchase is worth some average amount. And then 
we multiply that by how much we’re going to improve matters.
I just talked through estimating the first three terms. Now we’re going to take some wild 
guesses at the fourth.
And if you do that with this project, it looks like this. I’ve picked a half percent through a two 
percent increase in sales, which based on our past experience might be even a little nuts. A 
2% increase in conversions on a page like this would be relatively unheard of. 
But let’s say we really hit it out of the park and managed a 2% increase. That would mean 
Etsy would sell $1000 more per day.
There’s one thing to consider here, which is that Etsy doesn’t keep all of the money when an 
item sells. Etsy only takes 3.5% of the purchase price as a fee. So if we include that it adds 
some additional context. 
The net benefit to Etsy for this feature starts to look pretty bad. If two engineers work on this 
for a month and they have a designer for half that time, then this feature would take a long 
time to earn back their salaries. Not to mention HR overhead, managerial overhead, the 
incremental electricity required to run the feature, etc.
Another thing we can look at is how long we’d have to run this experiment to expect to get a 
statistically significant result. In this example the experiment would have to run for the rest of 
the decade. That is not ideal.
I think in this case it’s clear that we should spend time on something else. And the work I did 
to determine this only took me about an afternoon. I didn’t waste a month of my life on it, so I 
count that as a victory.
Let’s go through another example. This time we’ll model sending emails to people.
Like most e-commerce sites Etsy has a cart with a couple of steps. You add something to 
your cart, you fill out your credit card and shipping address, and so on. 
People can quit at any step along the way. The idea here would be to wait five days and then 
send people an email asking if they really meant to buy that thing they started buying.
The equation in this case looks a little different. We have a set of people who are eligible each 
day. They have an average value of stuff in their carts. And we hope to reactivate a 
percentage of them.
So if there are about 20,000 people eligible for this email every day, and their average 
purchase is $40, our model looks like this. 
I plugged in some guesses for how many people would complete the purchase, starting at 1% 
on the lower end. I think 1% is a reasonable guess because these people are already pretty 
far down the checkout funnel. 
On the low end of things, we’d sell $8,000 more per day.
Again we have to relate that to how much the company actually makes. And this time around 
it looks much better: about $100K per year in revenue.
In this case our exploration found a project that looks like it’s worth doing.
I actually built that feature while I was at Etsy. It wound up being between a half and one 
percent of total sales. That might not sound like a lot, but remember that huge percentage 
gains are really hard to come by. This was a really big deal, as far as these things go.
So I hope you can see that even crude arithmetic can make one project look like a turkey and 
another worth doing, when they both sound cool without the arithmetic.
If I can extrapolate for a few minutes,
The metaphor that I have in my head when I think about this kind of project validation is 
Archimedes with his lever. (Not to be grandiose or anything.) Archimedes said that given a 
place to stand, he could move the Earth. 
Picking products based on data is "finding your place to stand” before you apply your effort.
Your ability to move the needle is a function of volume and the audience. This is the 
relationship you're really trying to reason about. 
By volume I mean how many people there are, and by audience I mean the characteristics of 
those people. How often they buy. How much they spend. etc. 
In the furniture landing page example I gave you there was a lot of volume, but the audience 
was made up of people that didn’t buy very much. That made it an unattractive project.
To be successful at this kind of work, it really helps to be able to recall your core business 
metrics without digging them up. 
Obviously these are the ones you care about for an e-commerce site like Etsy. You would 
care about different metrics for other kinds of sites.
A really good way to keep those numbers top of mind is to make it impossible for people not 
to see them. At Etsy we had this toolbar that employees could see. I wrote some code to stick 
page-specific business metrics into it. 
So people would go about their business on the site every day, and they’d just be involuntarily 
exposed to relevant numbers. There was no escape.
Remember we’re making educated guesses when we prioritize. Just getting to within an order 
of magnitude of the right answer is your goal. 
Without doing this, this your hunches are unlikely to even be that accurate.
I don’t want to come across as saying this is the one true way to choose projects. I do not 
believe that this method is always practical or appropriate.
If your company is brand new, you won’t have any data, so you can’t do this. Everything you 
do will probably be some kind of crap shoot. 
More mature companies should have tons of data, and could theoretically do this for every 
project. But even then that’s probably still not what you should do.
I don’t think there’s anything wrong with working on features that aren’t going to move the 
needle.
As long as you know that you’re working on a feature that isn’t going to move the needle. 
What I think is a tragedy is to work on something thinking it will have a big impact, when you 
should have known better.
So I’d like to leave you with just a couple of thoughts.
As I’ve tried to demonstrate, a lot of product ideas sound awesome. 
I think that “sounding awesome” is a completely unremarkable feature of a product idea.
I think that it’s pretty common for engineers to assume that people are doing this work for 
them. That was certainly my implicit assumption when I started my career. And most of the 
engineers I’ve worked with have had the same notion. 
But people might not be doing anything like this when they’re assigning you projects. You 
should make sure that they are. Or you should do it yourself.
I don’t encourage open rebellion or anything. But I think that engineers and product managers 
should realize that not only are they capable of doing the work, they might be the people 
MOST capable of doing the work. If your data tooling isn’t sophisticated, engineers might be 
the only people who can get some of these numbers.
I've met so many engineers in my career that "just want to build cool stuff." That’s what I want 
too! And personally I think having an impact is pretty cool. 
(h/t John Miles White)
Data driven-products-now

More Related Content

What's hot

Driving Data Intelligence in the Supply Chain Through the Data Catalog at TJX
Driving Data Intelligence in the Supply Chain Through the Data Catalog at TJXDriving Data Intelligence in the Supply Chain Through the Data Catalog at TJX
Driving Data Intelligence in the Supply Chain Through the Data Catalog at TJXDATAVERSITY
 
What is data engineering?
What is data engineering?What is data engineering?
What is data engineering?yongdam kim
 
Re-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and SparkRe-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and SparkDatabricks
 
Getting Started with Delta Lake on Databricks
Getting Started with Delta Lake on DatabricksGetting Started with Delta Lake on Databricks
Getting Started with Delta Lake on DatabricksKnoldus Inc.
 
State of Data Governance in 2021
State of Data Governance in 2021State of Data Governance in 2021
State of Data Governance in 2021DATAVERSITY
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureDATAVERSITY
 
Data weekender4.2 azure purview erwin de kreuk
Data weekender4.2  azure purview erwin de kreukData weekender4.2  azure purview erwin de kreuk
Data weekender4.2 azure purview erwin de kreukErwin de Kreuk
 
Cost Efficiency Strategies for Managed Apache Spark Service
Cost Efficiency Strategies for Managed Apache Spark ServiceCost Efficiency Strategies for Managed Apache Spark Service
Cost Efficiency Strategies for Managed Apache Spark ServiceDatabricks
 
RWDG: Measuring Data Governance Performance
RWDG: Measuring Data Governance PerformanceRWDG: Measuring Data Governance Performance
RWDG: Measuring Data Governance PerformanceDATAVERSITY
 
Intro to Data Vault 2.0 on Snowflake
Intro to Data Vault 2.0 on SnowflakeIntro to Data Vault 2.0 on Snowflake
Intro to Data Vault 2.0 on SnowflakeKent Graziano
 
[Webinar Slides] Developing a Successful Data Retention Policy
[Webinar Slides] Developing a Successful Data Retention Policy [Webinar Slides] Developing a Successful Data Retention Policy
[Webinar Slides] Developing a Successful Data Retention Policy AIIM International
 
Building Lakehouses on Delta Lake with SQL Analytics Primer
Building Lakehouses on Delta Lake with SQL Analytics PrimerBuilding Lakehouses on Delta Lake with SQL Analytics Primer
Building Lakehouses on Delta Lake with SQL Analytics PrimerDatabricks
 
Data Quality Management - Data Issue Management & Resolutionn / Practical App...
Data Quality Management - Data Issue Management & Resolutionn / Practical App...Data Quality Management - Data Issue Management & Resolutionn / Practical App...
Data Quality Management - Data Issue Management & Resolutionn / Practical App...Burak S. Arikan
 
A Thorough Comparison of Delta Lake, Iceberg and Hudi
A Thorough Comparison of Delta Lake, Iceberg and HudiA Thorough Comparison of Delta Lake, Iceberg and Hudi
A Thorough Comparison of Delta Lake, Iceberg and HudiDatabricks
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureDatabricks
 
Data Modeling & Metadata Management
Data Modeling & Metadata ManagementData Modeling & Metadata Management
Data Modeling & Metadata ManagementDATAVERSITY
 
Big data architectures and the data lake
Big data architectures and the data lakeBig data architectures and the data lake
Big data architectures and the data lakeJames Serra
 
Data Catalog for Better Data Discovery and Governance
Data Catalog for Better Data Discovery and GovernanceData Catalog for Better Data Discovery and Governance
Data Catalog for Better Data Discovery and GovernanceDenodo
 
Introduction SQL Analytics on Lakehouse Architecture
Introduction SQL Analytics on Lakehouse ArchitectureIntroduction SQL Analytics on Lakehouse Architecture
Introduction SQL Analytics on Lakehouse ArchitectureDatabricks
 
Hadoop Ecosystem | Hadoop Ecosystem Tutorial | Hadoop Tutorial For Beginners ...
Hadoop Ecosystem | Hadoop Ecosystem Tutorial | Hadoop Tutorial For Beginners ...Hadoop Ecosystem | Hadoop Ecosystem Tutorial | Hadoop Tutorial For Beginners ...
Hadoop Ecosystem | Hadoop Ecosystem Tutorial | Hadoop Tutorial For Beginners ...Simplilearn
 

What's hot (20)

Driving Data Intelligence in the Supply Chain Through the Data Catalog at TJX
Driving Data Intelligence in the Supply Chain Through the Data Catalog at TJXDriving Data Intelligence in the Supply Chain Through the Data Catalog at TJX
Driving Data Intelligence in the Supply Chain Through the Data Catalog at TJX
 
What is data engineering?
What is data engineering?What is data engineering?
What is data engineering?
 
Re-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and SparkRe-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and Spark
 
Getting Started with Delta Lake on Databricks
Getting Started with Delta Lake on DatabricksGetting Started with Delta Lake on Databricks
Getting Started with Delta Lake on Databricks
 
State of Data Governance in 2021
State of Data Governance in 2021State of Data Governance in 2021
State of Data Governance in 2021
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data Architecture
 
Data weekender4.2 azure purview erwin de kreuk
Data weekender4.2  azure purview erwin de kreukData weekender4.2  azure purview erwin de kreuk
Data weekender4.2 azure purview erwin de kreuk
 
Cost Efficiency Strategies for Managed Apache Spark Service
Cost Efficiency Strategies for Managed Apache Spark ServiceCost Efficiency Strategies for Managed Apache Spark Service
Cost Efficiency Strategies for Managed Apache Spark Service
 
RWDG: Measuring Data Governance Performance
RWDG: Measuring Data Governance PerformanceRWDG: Measuring Data Governance Performance
RWDG: Measuring Data Governance Performance
 
Intro to Data Vault 2.0 on Snowflake
Intro to Data Vault 2.0 on SnowflakeIntro to Data Vault 2.0 on Snowflake
Intro to Data Vault 2.0 on Snowflake
 
[Webinar Slides] Developing a Successful Data Retention Policy
[Webinar Slides] Developing a Successful Data Retention Policy [Webinar Slides] Developing a Successful Data Retention Policy
[Webinar Slides] Developing a Successful Data Retention Policy
 
Building Lakehouses on Delta Lake with SQL Analytics Primer
Building Lakehouses on Delta Lake with SQL Analytics PrimerBuilding Lakehouses on Delta Lake with SQL Analytics Primer
Building Lakehouses on Delta Lake with SQL Analytics Primer
 
Data Quality Management - Data Issue Management & Resolutionn / Practical App...
Data Quality Management - Data Issue Management & Resolutionn / Practical App...Data Quality Management - Data Issue Management & Resolutionn / Practical App...
Data Quality Management - Data Issue Management & Resolutionn / Practical App...
 
A Thorough Comparison of Delta Lake, Iceberg and Hudi
A Thorough Comparison of Delta Lake, Iceberg and HudiA Thorough Comparison of Delta Lake, Iceberg and Hudi
A Thorough Comparison of Delta Lake, Iceberg and Hudi
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh Architecture
 
Data Modeling & Metadata Management
Data Modeling & Metadata ManagementData Modeling & Metadata Management
Data Modeling & Metadata Management
 
Big data architectures and the data lake
Big data architectures and the data lakeBig data architectures and the data lake
Big data architectures and the data lake
 
Data Catalog for Better Data Discovery and Governance
Data Catalog for Better Data Discovery and GovernanceData Catalog for Better Data Discovery and Governance
Data Catalog for Better Data Discovery and Governance
 
Introduction SQL Analytics on Lakehouse Architecture
Introduction SQL Analytics on Lakehouse ArchitectureIntroduction SQL Analytics on Lakehouse Architecture
Introduction SQL Analytics on Lakehouse Architecture
 
Hadoop Ecosystem | Hadoop Ecosystem Tutorial | Hadoop Tutorial For Beginners ...
Hadoop Ecosystem | Hadoop Ecosystem Tutorial | Hadoop Tutorial For Beginners ...Hadoop Ecosystem | Hadoop Ecosystem Tutorial | Hadoop Tutorial For Beginners ...
Hadoop Ecosystem | Hadoop Ecosystem Tutorial | Hadoop Tutorial For Beginners ...
 

Similar to Data driven-products-now

Product Management: Wrangling People
Product Management: Wrangling PeopleProduct Management: Wrangling People
Product Management: Wrangling PeopleJanet Brunckhorst
 
Pretotype it (first pretotype edition) - ProductCamp Nuremberg 2014
Pretotype it (first pretotype edition) - ProductCamp Nuremberg 2014Pretotype it (first pretotype edition) - ProductCamp Nuremberg 2014
Pretotype it (first pretotype edition) - ProductCamp Nuremberg 2014pcampger
 
Five Ways to Get Better Data From Our Users
Five Ways to Get Better Data From Our UsersFive Ways to Get Better Data From Our Users
Five Ways to Get Better Data From Our UsersSajid Reshamwala
 
L3 cmpt y2 evaluation template (1)
L3 cmpt y2 evaluation template (1)L3 cmpt y2 evaluation template (1)
L3 cmpt y2 evaluation template (1)Cobabb
 
How PBworks Used Lean Startup Techniques
How PBworks Used Lean Startup TechniquesHow PBworks Used Lean Startup Techniques
How PBworks Used Lean Startup TechniquesDavid E. Weekly
 
Fmp evaluation v1.8
Fmp evaluation v1.8Fmp evaluation v1.8
Fmp evaluation v1.8Will Cave
 
Fmp evaluation v1.9
Fmp    evaluation v1.9Fmp    evaluation v1.9
Fmp evaluation v1.9Will Cave
 
L3 cmpt y2 evaluation template
L3 cmpt y2 evaluation templateL3 cmpt y2 evaluation template
L3 cmpt y2 evaluation templateEllieOlivellaBell
 
If you want to be creative, and self improvement you.pdf
If you want to be creative, and self improvement you.pdfIf you want to be creative, and self improvement you.pdf
If you want to be creative, and self improvement you.pdfcakepearls0
 
How to increase productivity and eliminate mistakes fast
How to increase productivity and eliminate mistakes fastHow to increase productivity and eliminate mistakes fast
How to increase productivity and eliminate mistakes fastthebeachbumentrepreneur
 
L3 cmpt y2 evaluation template (2) copy
L3 cmpt y2 evaluation template (2)   copyL3 cmpt y2 evaluation template (2)   copy
L3 cmpt y2 evaluation template (2) copyLily809411
 
Good Testers are Often Lucky
Good Testers are Often LuckyGood Testers are Often Lucky
Good Testers are Often LuckyTEST Huddle
 
The 3 Major New Google Ranking Factors That No One Is Talking About Yet
The 3 Major New Google Ranking Factors That No One Is Talking About YetThe 3 Major New Google Ranking Factors That No One Is Talking About Yet
The 3 Major New Google Ranking Factors That No One Is Talking About YetMatthew Woodward
 
Justin Jackson's Build & Launch eBook pdf
 Justin Jackson's Build & Launch eBook pdf Justin Jackson's Build & Launch eBook pdf
Justin Jackson's Build & Launch eBook pdfzubeditufail
 
Cap Watkins, Designing for Scale, WarmGun 2013
Cap Watkins, Designing for Scale, WarmGun 2013Cap Watkins, Designing for Scale, WarmGun 2013
Cap Watkins, Designing for Scale, WarmGun 2013500 Startups
 

Similar to Data driven-products-now (20)

Product Management: Wrangling People
Product Management: Wrangling PeopleProduct Management: Wrangling People
Product Management: Wrangling People
 
Scalding at Etsy
Scalding at EtsyScalding at Etsy
Scalding at Etsy
 
Pretotype it (first pretotype edition) - ProductCamp Nuremberg 2014
Pretotype it (first pretotype edition) - ProductCamp Nuremberg 2014Pretotype it (first pretotype edition) - ProductCamp Nuremberg 2014
Pretotype it (first pretotype edition) - ProductCamp Nuremberg 2014
 
Five Ways to Get Better Data From Our Users
Five Ways to Get Better Data From Our UsersFive Ways to Get Better Data From Our Users
Five Ways to Get Better Data From Our Users
 
L3 cmpt y2 evaluation template (1)
L3 cmpt y2 evaluation template (1)L3 cmpt y2 evaluation template (1)
L3 cmpt y2 evaluation template (1)
 
How PBworks Used Lean Startup Techniques
How PBworks Used Lean Startup TechniquesHow PBworks Used Lean Startup Techniques
How PBworks Used Lean Startup Techniques
 
Fmp evaluation v1.8
Fmp evaluation v1.8Fmp evaluation v1.8
Fmp evaluation v1.8
 
Evaluation fmp
Evaluation fmpEvaluation fmp
Evaluation fmp
 
Fmp evaluation v1.9
Fmp    evaluation v1.9Fmp    evaluation v1.9
Fmp evaluation v1.9
 
Fmp problem solving
Fmp problem solvingFmp problem solving
Fmp problem solving
 
The Physical Web
The Physical WebThe Physical Web
The Physical Web
 
L3 cmpt y2 evaluation template
L3 cmpt y2 evaluation templateL3 cmpt y2 evaluation template
L3 cmpt y2 evaluation template
 
If you want to be creative, and self improvement you.pdf
If you want to be creative, and self improvement you.pdfIf you want to be creative, and self improvement you.pdf
If you want to be creative, and self improvement you.pdf
 
How to increase productivity and eliminate mistakes fast
How to increase productivity and eliminate mistakes fastHow to increase productivity and eliminate mistakes fast
How to increase productivity and eliminate mistakes fast
 
L3 cmpt y2 evaluation template (2) copy
L3 cmpt y2 evaluation template (2)   copyL3 cmpt y2 evaluation template (2)   copy
L3 cmpt y2 evaluation template (2) copy
 
7 products in_7_days
7 products in_7_days7 products in_7_days
7 products in_7_days
 
Good Testers are Often Lucky
Good Testers are Often LuckyGood Testers are Often Lucky
Good Testers are Often Lucky
 
The 3 Major New Google Ranking Factors That No One Is Talking About Yet
The 3 Major New Google Ranking Factors That No One Is Talking About YetThe 3 Major New Google Ranking Factors That No One Is Talking About Yet
The 3 Major New Google Ranking Factors That No One Is Talking About Yet
 
Justin Jackson's Build & Launch eBook pdf
 Justin Jackson's Build & Launch eBook pdf Justin Jackson's Build & Launch eBook pdf
Justin Jackson's Build & Launch eBook pdf
 
Cap Watkins, Designing for Scale, WarmGun 2013
Cap Watkins, Designing for Scale, WarmGun 2013Cap Watkins, Designing for Scale, WarmGun 2013
Cap Watkins, Designing for Scale, WarmGun 2013
 

Recently uploaded

(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...panagenda
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 

Recently uploaded (20)

(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 

Data driven-products-now

  • 1.
  • 3. My name’s Dan McKinley. I’m a general purpose web …ish programmer guy.
  • 4. I got my start in the financial industry back in the early 2000’s. I did that for a few years and then I freaked out and wound up at a startup called Etsy, in Brooklyn. That was back in 2007.
  • 5. Pretty much the whole talk will be about my time at Etsy. I left earlier this year to start a new company with some other folks from Etsy. But I still think Etsy is awesome. I had a great time there and learned an awful lot. So I have a few talks left to do about this I guess.
  • 6. I realize we’re on the west coast and people may not be familiar with Etsy, so I’ll give you a little background. Etsy’s a marketplace for handmade and vintage goods. It gets about 4 million uniques a day, and it sold well over a billion dollars in goods in 2013. What I’m trying to do with this slide is convince you that Etsy is pretty freaking big, even if you haven’t heard of it. It’s certainly within the top 100 websites in the US.
  • 7. But back when I started in 2007, it was tiny. Less than 20 people.
  • 8. That journey was a wild ride. It was tumultuous. There were three CEO changes while I was there. There was plenty of arguing about plenty of things.
  • 9. And the most contentious issue of all had to be this. What should we spend our time doing? I’m sure this is a hot topic at most companies. Well it was at Etsy too.
  • 10. I’d say at all points, this was more or less the company line. We should be data driven. Someone heard that Google was data driven, and we wanted to be like Google. So we should be data driven. There was one problem with this.
  • 11. And that was that nobody knew what that meant, or what it would imply. You’d get this advice to “be data driven” all the time, with no additional context or advice.
  • 12. Or even worse, you’d get this. If you wanted to get a project approved, you’d dress it up in data. You’d sprinkle some numbers on it and wave a dead chicken at it to show you were being data driven. It was display behavior.
  • 13. But eventually I think I started to figure some of this out. I started to figure out what it would mean to be data driven while you’re picking priorities. And I also realized that it’s not that complicated. That’s what this talk is about.
  • 14. I want to go through the story of how I figured some of this out in more detail.
  • 15. And that story starts back in 2007.
  • 16. Etsy in its early days was a magical place. I would not want anyone to get the wrong impression here and think I’m talking smack about Etsy in the early days. It was awesome. I miss 2007 all the time.
  • 17. But I think it’s also important to be honest. Etsy grew geometrically because it was a thing that was out there waiting to be discovered. It didn’t grow that way because of carefully planned product and marketing efforts. Sure, there was agency in the creation of the site. And people worked hard. But the massive growth that happened wasn’t strongly connected to the actions of employees. This is utterly obvious to those of us that were there.
  • 18. But few of us, as human beings, are hardwired to see things that way. Instead we all tend to think that if your work takes off, it means you’re smart and your actions are good. We assume that the methods that precede success caused it.
  • 19. We assumed that the success was our doing, and not happenstance. And I dunno, maybe we developed outsized egos.
  • 20. We didn’t really question ourselves. We created a thing and people loved it. And we believed that to be a repeatable process.
  • 21. If you build it, they will come, in other words.
  • 22. And you know what, if the site’s growth is really insane, it looks like it’s working. You can release things and as long as they don’t completely destroy everything it will look like you’re a genius. All the graphs will go up and to the right. And that’s awesome for as long as you don’t think about it too hard.
  • 23. But of course we went and spoiled all of that by thinking about it too hard.
  • 24. A thing I noticed (and I’m sure I wasn’t the only person to notice this) was that we had been consistently deleting features after a while because nobody wound up using them. We’d release a feature, and a year or two later it would need maintenance or become a support headache. At that point we’d look to see if anyone was using it, and pretty often the answer would be “no.” So we’d just kill the thing.
  • 25. A good example of that is a feature called Alchemy. This was a feature that let you describe an item you wanted, and then have Etsy shop owners bid on making it for you. That’s a neat idea, and it was in the New York times a few times and was generally considered awesome when it came out back in 2008.
  • 26. But then we took it down for good in 2011. I looked into it at the time, and I think the statistic was that in three years Alchemy sold about as many items as the rest of the site sells in less than a day. It was a giant bust.
  • 27. Around that time I started selfishly wondering if there was some way I could avoid participating in projects like alchemy. Because there’s this weird thing about me: I prefer not to work on things that aren’t going to be used and are just destined to be taken down after a year or two.
  • 28. Around the same time we started A/B testing things in earnest. And I latched onto that as a possible solution. Instead of pushing things out to everyone all at once we’d do a split test and try to get a quantitative measure of how we were doing.
  • 29. The thing that A/B testing revealed right away was: holy crap! We’ve been delusional about our abilities up to this point. A huge percentage of products that we tested either had no effect, or made things worse. Usually only slightly worse, but still. And we certainly weren’t having the positive impact we imagined.
  • 30. For example in early 2011 I was involved in redesigning our homepage. Five of us put about four months of engineering effort into it. The CEO worked on it with us. We imagined that this would be really important, being the homepage and all. But we released it and its primary effect was: zilch. And it slightly reduced the number of people signing up. We wound up just throwing all of this work away. And that was a really crappy experience that I didn’t want to repeat.
  • 31. At this point our problem seemed to be that we were still picking projects based on which one sounded the best.
  • 32. But the way we decided whether or not things were working after the fact had changed. We were evaluating our results with A/B testing. And that turned out to be really important.
  • 33. Once we were looking at how we were really performing when we were releasing products, we’d opened Pandora’s box. It would be pretty disingenuous for us to just ignore this problem and go back to not A/B testing. But we weren’t happy releasing products that had a minor, neutral, or even negative impact.
  • 34. Our batting average with products was terrible. But over time I did figure out a way to get slightly better at this.
  • 35. Back at the beginning we started with this. We’d have an idea, we’d code it, and then we’d push it out.
  • 36. Then in 2010 we tacked this A/B testing step on to the end.
  • 37. But that was blowing up in our faces and pretty often it really looked more like this. We’d have to spend a lot more time in testing than we’d planned on, because our metrics got worse.
  • 38. And occasionally we couldn’t resolve that, and we might not even release stuff.
  • 39. But around 2012 I settled into a process that looked more like this. It’s a little more complicated, but it worked a lot better. The premise of this is basically, “hey, maybe we should incorporate data earlier on in the process.”
  • 40. The notable feature here is an explicit step at the beginning where I try to validate ideas using data before doing anything else. I’m going to spend most of the rest of the talk discussing this.
  • 41. The other thing I started trying to do in earnest was to avoid over-committing. We’d explicitly try to build minimal versions of things and A/B test with those.
  • 42. Doing that took a lot of discipline sometimes. For example this is a screenshot of one of my prototypes that was notoriously ugly, even when it wasn’t trying to sell you rhinoceros beetle taxidermy. I would come into work every day while this experiment was running and have a discussion with a designer or a support person about how crappy it looked in some edge cases. I’d smile and nod and say we’d fix the rough edges after we were sure it was going to last. And eventually we did that.
  • 43. If a project survived the first round of A/B tests, you would go back and apply some polish to it.
  • 44. The nice thing about this process is that it gives me at least two relatively inexpensive places where I can decide to give up on a project that isn’t working. Of course I still have the option of scrapping it at the very end, but I’m less likely have to exercise that.
  • 45. One thing about following this process is that you tend to have a lot of downtime while you’re waiting for A/B tests to finish running. So I started pipelining other projects. I’d get the process going on something else while I was waiting for experimental results. It wasn’t uncommon for me to have two or three or five running experiments at once.
  • 46. Obviously that’s a bookkeeping challenge. You have many oars in the water at once. You have to keep meticulous notes just to remember what you were doing.
  • 47. It’s also a challenge to get over the feeling that analysis isn’t work, at least not in the same way that coding features is work. Time spent doing analysis can be a lot more valuable than time spent coding. But that’s contrary to all of our instincts as engineers.
  • 48. Instead of just leaving you with that abstract stuff, I’ll go through a couple of examples of doing the validation on a potential project.
  • 49. I’ll go through two ideas, they’re both real things that came up at some point. I think they both sound like pretty good ideas when you first hear them. And early on that would have been enough for someone to work on them.
  • 50. The first idea I’ll go through is building a landing page for furniture that’s local to the visitor.
  • 51. The general idea would be: Etsy already has these landing pages for all of its categories. And furniture is pretty hard to buy on the internet, because shipping it is difficult. Maybe we could improve the experience by making the pages show items that are geographically close to the visitor.
  • 52. To get stated, we assess is how large the audience for this is, and what their current behavior is like.
  • 53. I’d start by looking at the overall fraction of page views that are on the page we’d be talking about changing. And in this case, these pages have quite a bit of traffic. It’s not the most important page, but it’s something.
  • 54. And of course not all of those pageviews are for furniture, but a decent amount of them are. So far so good.
  • 55. Going one step further, I'd see how many people buy things that they first encountered on these pages. And If you do that the project looks a little less attractive, because these pages really don’t sell many items. That doesn’t necessarily mean that the browse pages are bad. It’s probably just that the traffic on those pages tends to be pretty removed from the purchasing decision. Compare that to the search page, which has a ton of traffic but also sells a ton of items. Once you’ve gone to the trouble of typing a search, you’re a lot more likely to be in buying mode.
  • 56. But wait: furniture is expensive. So maybe this is still worth digging into. Sure, we won’t sell many more items total, but the ones we do sell will cost more. If the average order on Etsy is $40—which by the way it’s not, I am making up financial details in this talk—let’s guess that the average order we’ll create will be ten times that. Let’s guess $400.
  • 57. From there, we can start to answer what changing the state of the world would imply.
  • 58. You get a formula like this. We have a certain number of visitors to the site, some percentage of them convert into purchases, and each purchase is worth some average amount. And then we multiply that by how much we’re going to improve matters.
  • 59. I just talked through estimating the first three terms. Now we’re going to take some wild guesses at the fourth.
  • 60. And if you do that with this project, it looks like this. I’ve picked a half percent through a two percent increase in sales, which based on our past experience might be even a little nuts. A 2% increase in conversions on a page like this would be relatively unheard of. But let’s say we really hit it out of the park and managed a 2% increase. That would mean Etsy would sell $1000 more per day.
  • 61. There’s one thing to consider here, which is that Etsy doesn’t keep all of the money when an item sells. Etsy only takes 3.5% of the purchase price as a fee. So if we include that it adds some additional context. The net benefit to Etsy for this feature starts to look pretty bad. If two engineers work on this for a month and they have a designer for half that time, then this feature would take a long time to earn back their salaries. Not to mention HR overhead, managerial overhead, the incremental electricity required to run the feature, etc.
  • 62. Another thing we can look at is how long we’d have to run this experiment to expect to get a statistically significant result. In this example the experiment would have to run for the rest of the decade. That is not ideal.
  • 63. I think in this case it’s clear that we should spend time on something else. And the work I did to determine this only took me about an afternoon. I didn’t waste a month of my life on it, so I count that as a victory.
  • 64. Let’s go through another example. This time we’ll model sending emails to people.
  • 65. Like most e-commerce sites Etsy has a cart with a couple of steps. You add something to your cart, you fill out your credit card and shipping address, and so on. People can quit at any step along the way. The idea here would be to wait five days and then send people an email asking if they really meant to buy that thing they started buying.
  • 66. The equation in this case looks a little different. We have a set of people who are eligible each day. They have an average value of stuff in their carts. And we hope to reactivate a percentage of them.
  • 67. So if there are about 20,000 people eligible for this email every day, and their average purchase is $40, our model looks like this. I plugged in some guesses for how many people would complete the purchase, starting at 1% on the lower end. I think 1% is a reasonable guess because these people are already pretty far down the checkout funnel. On the low end of things, we’d sell $8,000 more per day.
  • 68. Again we have to relate that to how much the company actually makes. And this time around it looks much better: about $100K per year in revenue.
  • 69. In this case our exploration found a project that looks like it’s worth doing.
  • 70. I actually built that feature while I was at Etsy. It wound up being between a half and one percent of total sales. That might not sound like a lot, but remember that huge percentage gains are really hard to come by. This was a really big deal, as far as these things go.
  • 71. So I hope you can see that even crude arithmetic can make one project look like a turkey and another worth doing, when they both sound cool without the arithmetic.
  • 72. If I can extrapolate for a few minutes,
  • 73. The metaphor that I have in my head when I think about this kind of project validation is Archimedes with his lever. (Not to be grandiose or anything.) Archimedes said that given a place to stand, he could move the Earth. Picking products based on data is "finding your place to stand” before you apply your effort.
  • 74. Your ability to move the needle is a function of volume and the audience. This is the relationship you're really trying to reason about. By volume I mean how many people there are, and by audience I mean the characteristics of those people. How often they buy. How much they spend. etc. In the furniture landing page example I gave you there was a lot of volume, but the audience was made up of people that didn’t buy very much. That made it an unattractive project.
  • 75. To be successful at this kind of work, it really helps to be able to recall your core business metrics without digging them up. Obviously these are the ones you care about for an e-commerce site like Etsy. You would care about different metrics for other kinds of sites.
  • 76. A really good way to keep those numbers top of mind is to make it impossible for people not to see them. At Etsy we had this toolbar that employees could see. I wrote some code to stick page-specific business metrics into it. So people would go about their business on the site every day, and they’d just be involuntarily exposed to relevant numbers. There was no escape.
  • 77. Remember we’re making educated guesses when we prioritize. Just getting to within an order of magnitude of the right answer is your goal. Without doing this, this your hunches are unlikely to even be that accurate.
  • 78. I don’t want to come across as saying this is the one true way to choose projects. I do not believe that this method is always practical or appropriate.
  • 79. If your company is brand new, you won’t have any data, so you can’t do this. Everything you do will probably be some kind of crap shoot. More mature companies should have tons of data, and could theoretically do this for every project. But even then that’s probably still not what you should do.
  • 80. I don’t think there’s anything wrong with working on features that aren’t going to move the needle.
  • 81. As long as you know that you’re working on a feature that isn’t going to move the needle. What I think is a tragedy is to work on something thinking it will have a big impact, when you should have known better.
  • 82. So I’d like to leave you with just a couple of thoughts.
  • 83. As I’ve tried to demonstrate, a lot of product ideas sound awesome. I think that “sounding awesome” is a completely unremarkable feature of a product idea.
  • 84. I think that it’s pretty common for engineers to assume that people are doing this work for them. That was certainly my implicit assumption when I started my career. And most of the engineers I’ve worked with have had the same notion. But people might not be doing anything like this when they’re assigning you projects. You should make sure that they are. Or you should do it yourself.
  • 85. I don’t encourage open rebellion or anything. But I think that engineers and product managers should realize that not only are they capable of doing the work, they might be the people MOST capable of doing the work. If your data tooling isn’t sophisticated, engineers might be the only people who can get some of these numbers.
  • 86. I've met so many engineers in my career that "just want to build cool stuff." That’s what I want too! And personally I think having an impact is pretty cool. (h/t John Miles White)