What is your data strategy
and why is it wrong?
Dylan Gregersen
Data Engineering Meetup Aug 2018
My name is Dylan Gregersen
I like these things... You can find me at…
dylangregersen
I am the lead data
scientist at...
How do you define Data
Science?
Rachel Schutt & Cathy O’Neil in Doing Data Science: Straight Talk From the Frontline
Data Science is the process of
collecting, cleaning, analyzing,
visualizing, and communicating
data in order to solve problems
in the real world.
Data science is...
What people think data science is...
People often think data science
is all about mathematics,
algorithms, and something called
“machine learning”
Rachel Schutt & Cathy O’Neil in Doing Data Science: Straight Talk From the Frontline
What most data science is...
Data science actually consists
mostly of data collection,
cleaning, and organization
(often 80% of the work)
Rachel Schutt & Cathy O’Neil in Doing Data Science: Straight Talk From the Frontline
What people forget that data science is
People tend to forget the skills
needed in data science to
communicate results so someone
can take an action in the real
world
Rachel Schutt & Cathy O’Neil in Doing Data Science: Straight Talk From the Frontline
Rachel Schutt & Cathy O’Neil in Doing Data Science: Straight Talk From the Frontline
Data science is a process
When doing data science we...
1. Collect Data: We must first collect
and store information about real
world phenomena
2. Structure Data: Then we structure
that data into conceptual models of
the phenomena.
3. Extract Insight: We use our data
model to understand something
about the phenomena
4. Solve Problems: We apply our
understanding to solve a problem
by taking an action
Data science is successful when you learn
something about the real world which
helps you solve a problem by taking an
action.
Data Strategy #1
Know what problem you
are trying to solve
“Can I have the number
of X for last month?”
Identifying the problem
U: What is my conference room utilization?
Me: What problem are you trying to solve?
U: I want to know which rooms are underutilized
Me: Why do you want to know?
U: To improve the efficiency of conference room use
Me: What are you going to do with that information?
A: Repurpose rooms whose meeting usage is less than 50%
Problem: Conference rooms should be used efficiently
Action: repurpose rooms with usage less than 50%, also heavily used areas
Metric: room utilization = hours in use / available hours per day
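A minimal sketch of the metric above, in Python. The `Booking` record, the room names, and the 8 available hours per day are hypothetical illustrations, not data from the talk; the 50% threshold comes from the action statement.

```python
from dataclasses import dataclass


@dataclass
class Booking:
    room: str
    hours: float  # hours the room was in use for one meeting (hypothetical field)


def room_utilization(bookings, available_hours):
    """Return {room: hours in use / available hours} for a single day."""
    in_use = {}
    for booking in bookings:
        in_use[booking.room] = in_use.get(booking.room, 0.0) + booking.hours
    # Rooms with no bookings still appear, with a utilization of 0.0.
    return {
        room: in_use.get(room, 0.0) / hours
        for room, hours in available_hours.items()
    }


def underutilized(utilization, threshold=0.5):
    """Rooms below the threshold are the candidates for repurposing."""
    return sorted(room for room, value in utilization.items() if value < threshold)


# Made-up sample data for one day.
bookings = [Booking("Aspen", 1.0), Booking("Aspen", 2.0), Booking("Birch", 6.0)]
available = {"Aspen": 8.0, "Birch": 8.0, "Cedar": 8.0}

utilization = room_utilization(bookings, available)
candidates = underutilized(utilization)
print(candidates)
```

The point of the sketch is that the action (repurpose rooms under 50%) decides what gets computed: the code ends in a list of rooms to act on, not just a number.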
Identifying the problem
U: What is my conference room utilization?
Me: What problem are you trying to solve?
U: I want to know which departments are using the rooms the most.
Me: Why do you want to know?
U: To adjust the rooms to meet their needs
Me: What are you going to do with that information?
A: Buy new technology or furniture to better meet those needs
Problem: Change meeting rooms to fit the needs of department
Action: make purchasing decisions about technology or furniture
Metrics: room utilization, organizer’s department, occupancy size,
technology or furniture used
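The same opening question now leads to different metrics. A rough sketch, assuming each meeting is a dictionary with hypothetical `department`, `hours`, and `attendees` fields (the sample rows are invented for illustration):

```python
from collections import defaultdict


def usage_by_department(meetings):
    """Summarize meeting count, total hours, and largest meeting per department."""
    summary = defaultdict(lambda: {"meetings": 0, "hours": 0.0, "max_attendees": 0})
    for meeting in meetings:
        stats = summary[meeting["department"]]
        stats["meetings"] += 1
        stats["hours"] += meeting["hours"]
        stats["max_attendees"] = max(stats["max_attendees"], meeting["attendees"])
    return dict(summary)


# Made-up sample data.
meetings = [
    {"room": "Aspen", "department": "Sales", "hours": 1.0, "attendees": 4},
    {"room": "Aspen", "department": "Sales", "hours": 2.0, "attendees": 6},
    {"room": "Birch", "department": "Engineering", "hours": 1.5, "attendees": 3},
]

usage = usage_by_department(meetings)
print(usage["Sales"])
```

Grouping by the organizer's department rather than by room is what makes the result usable for a purchasing decision.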
What problem are you
trying to solve?
What action will you take
with this number?
Data Strategy #2
Start simple and mature
complexity over time
“Can you predict which
customers will renew?”
The data science hierarchy
of needs describes the
stages of data complexity
and insights
Say hello to….
The Data Science Process
First point of value
Descriptive Analytics are your first
stage where you can actually solve a
problem and take an action.
Especially important for business end
users who want to apply the results
of your analysis.
Your early projects should not try to
extend beyond this stage
First point of value
Focus first on counting
These will be...
● Easier to explain to your
stakeholders
● Faster to build and for
stakeholders to realize value
● Easier to focus on good
infrastructure and process,
including tests and alerting
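As an illustration of counting first, here is a small sketch for the renewal question: before predicting which customers will renew, count how many renewed each month. The `(month, renewed)` record shape and the sample values are assumptions made for this example.

```python
from collections import Counter


def renewal_counts(records):
    """Return {month: (renewed, due)} from (month, renewed) pairs."""
    due = Counter(month for month, _ in records)
    renewed = Counter(month for month, did_renew in records if did_renew)
    # Counter returns 0 for months in which nobody renewed.
    return {month: (renewed[month], due[month]) for month in sorted(due)}


# Made-up sample data: one row per customer whose renewal came due.
records = [
    ("2018-06", True),
    ("2018-06", False),
    ("2018-07", True),
    ("2018-07", True),
]

counts = renewal_counts(records)
for month, (renewed, due) in counts.items():
    print(month, f"{renewed}/{due} renewed")
```

A table like this is easy to explain to stakeholders and easy to test, which is the argument for starting here rather than with a predictive model.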
First point of value
Businesses spend 1-3
months to get this into
production the first time
They spend 1-3 years to
really get this right
1-2 years to do this well
1-2 years to integrate these
1+ years modeling to
integrate optimizations
Data Strategy #3
Practice good product
development and iterate
“Can you also
include...?”
Traditional product development lifecycle
Developing a data product is the same as developing any other product.
Having this process in place will mean more success in your
data endeavours.
[Lifecycle diagram: Concept (Idea Generation, Research) -> Assess (Opportunity Analysis, Business Assessment) -> Develop (Create) -> Launch (Delivery)]
Understanding the problem to solve
Know the problem to solve and what action will be taken
● Identify the stakeholders
● Document the possible questions your stakeholders have
● Dive deep to find the root problem the stakeholders need to solve
● Identify the action they’re going to take once they have the information
Determine the scope of what is needed to answer the question and
figure out who needs to be involved
● What are the short-term and long-term goals for data?
● Who are the supporters and who are the opponents?
● Assuming we do this perfectly, what will we build first?
● What is the most evil thing which can be done?
Assess what other opportunities there are
Create a requirements document which outlines what
you plan to deliver
● Determine your project’s definition of success: when are you successful?
● Do product, design, and architecture reviews
● Determine team dependencies and business requirements
● Estimate costs, timelines and milestones
Figure out a plan for answering the question
As you develop the end deliverables you’re also building the
infrastructure, testing & QA, alerting
● Stay focused
● Document other questions and possible data sources
● Build good architecture with testing and alerts
● Manage quality, only let clean data in!
● Backup and security
Create something magical!
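One way to read "only let clean data in" together with "testing and alerts" is a validation gate in front of storage. The sketch below is an assumption about how that could look, not the speaker's implementation; the `room` and `hours` fields and the `alert` callback are hypothetical.

```python
def validate(record):
    """Return a list of problems; an empty list means the record is clean."""
    problems = []
    if not record.get("room"):
        problems.append("missing room")
    hours = record.get("hours")
    if not isinstance(hours, (int, float)) or not 0 < hours <= 24:
        problems.append("hours out of range")
    return problems


def load_clean(records, alert=print):
    """Keep only clean records and raise an alert for each rejected one."""
    clean = []
    for record in records:
        problems = validate(record)
        if problems:
            alert(f"rejected {record!r}: {', '.join(problems)}")
        else:
            clean.append(record)
    return clean


# Made-up sample data: one clean record and two bad ones.
incoming = [
    {"room": "Aspen", "hours": 2.0},
    {"room": "", "hours": 1.0},
    {"room": "Birch", "hours": 30},
]

alerts = []
clean = load_clean(incoming, alert=alerts.append)
print(len(clean), "clean,", len(alerts), "rejected")
```

Passing `alert` as a parameter keeps the gate testable: in production it could post to a paging or chat system, while a test can simply collect the messages in a list.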
Once you’ve completed the work, you need to package and deliver it in a way
which your stakeholders can utilize
● Learn to speak the language of your stakeholders (executives or engineers)
● Review with stakeholders
● Evaluate expectations
Communicate your insights
[Lifecycle diagram: Concept (Idea Generation, Research) -> Assess (Opportunity Analysis, Business Assessment) -> Develop (Create) -> Launch + Maintain (Delivery)]
Data reports can become irrelevant and errors can arise so it
is important to do ongoing reviews of the data
● Review dashboards: is data still relevant and actionable?
● Metrics meetings: does everyone still understand the data and are there new
definitions which need to be evaluated?
● Domain specific reviews: meet with stakeholders and see what data is
valuable to them and what actions they take.
Plan to review the value of your insights
You win by continuing the product development lifecycle,
starting with data basics, and progressing data complexity
over time.
Rinse and Repeat
Strategies:
● Know what problem you are trying to solve
● Start simple and mature complexity over time
● Practice good product development and iterate
Data science is successful when you learn something about the real
world which helps you solve a problem by taking an action.
References and Resources
● Rachel Schutt & Cathy O’Neil (2013) Doing Data Science: Straight Talk From the
Frontline, Sebastopol, CA: O’Reilly
● DJ Patil & Hilary Mason (2015) Data Driven. Sebastopol, CA: O’Reilly
● DJ Patil (2011) Building Data Science Teams. Sebastopol, CA: O’Reilly
● Monica Rogati (2017) The AI Hierarchy of Needs
● Nick Crocker (2014) Thirty Things I’ve Learned
● Tavish Srivastava (2015) 13 Tips to make you awesome in Data Science / Analytics Jobs
● Daniel Tunkelang (2017) 10 Things Everyone Should Know About Machine Learning
● DJ Patil - Everything We Wish We'd Known About Building Data Products

What is your data strategy and why is it wrong?

  • 1. What is your data strategy and why is it wrong? Dylan Gregersen Data Engineering Meetup Aug 2018
  • 2. My name is Dylan Gregersen I like these things... You can find me at… dylangregersen I am the lead data scientist at...
  • 3. How do you define Data Science?
  • 4. Rachel Schutt & Cathy O’Neil in Doing Data Science: Straight Talk From the Frontline Data Science is the process of collecting, cleaning, analyzing, visualizing, and communicating data in order to solve problems in the real world. Data science is...
  • 5. What people think data science is... People often think data science is all about mathematics, algorithms, and something call “machine learning” Rachel Schutt & Cathy O’Neil in Doing Data Science: Straight Talk From the Frontline
  • 6. What most data science is... Data science actually consists mostly of data collection, cleaning, and organization (often 80% of the work) Rachel Schutt & Cathy O’Neil in Doing Data Science: Straight Talk From the Frontline
  • 7. What people forget that data science is People tend to forget the skills needed in data science to communicate results so someone can take an action in the real worldRachel Schutt & Cathy O’Neil in Doing Data Science: Straight Talk From the Frontline
  • 8. Rachel Schutt & Cathy O’Neil in Doing Data Science: Straight Talk From the Frontline Data science is a process When doing data science we... 1. Collect Data: We must first collect and store information about real world phenomena 2. Structure Data: Then we structure that data into conceptual models of the phenomena. 3. Extract Insight: We use our data model to understand something about the phenomena 4. Solve Problems: We apply our understanding to solve a problem by taking an action
  • 9. Data science is successful when you learn something about the real world which helps you solve a problem by taking an action.
  • 10. Data Strategy #1 Know what problem you are trying to solve
  • 11. “Can I have the number of X for last month?”
  • 12. Identifying the problem U: What is my conference room utilization?
  • 13. Identifying the problem U: What is my conference room utilization? Me: What problem are you trying to solve? U: I want to know which rooms are underutilized. Me: Why do you want to know? U: To improve the efficiency of conference room use. Me: What are you going to do with that information? U: Repurpose rooms whose meeting usage is less than 50%.
  • 14. Identifying the problem U: What is my conference room utilization? Me: What problem are you trying to solve? U: I want to know which rooms are underutilized. Me: Why do you want to know? U: To improve the efficiency of conference room use. Me: What are you going to do with that information? U: Repurpose rooms whose meeting usage is less than 50%. Problem: Conference rooms should be used efficiently. Action: Repurpose rooms with usage less than 50%, also heavily used areas. Metric: room utilization = hours in use / available hours per day
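
The metric and action on slide 14 can be sketched in a few lines of Python. This is only an illustration, not code from the talk: the `RoomUsage` record, the room names, and the sample hours are assumptions. Only the formula and the 50% threshold come from the slide.

```python
from dataclasses import dataclass


@dataclass
class RoomUsage:
    """Hypothetical daily usage record for one conference room."""
    room: str
    hours_in_use: float
    available_hours: float


def utilization(usage: RoomUsage) -> float:
    """room utilization = hours in use / available hours per day"""
    if usage.available_hours <= 0:
        raise ValueError("available_hours must be positive")
    return usage.hours_in_use / usage.available_hours


def rooms_to_repurpose(usages, threshold=0.5):
    """Return the rooms whose utilization is below the threshold."""
    return [u.room for u in usages if utilization(u) < threshold]


# Made-up sample data for illustration only.
rooms = [
    RoomUsage("Aspen", 6.0, 8.0),  # 75% utilized
    RoomUsage("Birch", 2.0, 8.0),  # 25% utilized
    RoomUsage("Cedar", 4.0, 8.0),  # exactly 50%, so not below the threshold
]

print(rooms_to_repurpose(rooms))  # ['Birch']
```

The threshold is a parameter because it encodes the action the stakeholder plans to take; a different action would likely need a different cutoff.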
  • 15. Identifying the problem U: What is my conference room utilization?
  • 16. Identifying the problem U: What is my conference room utilization? Me: What problem are you trying to solve? U: I want to know which departments are using the rooms the most. Me: Why do you want to know? U: To adjust the rooms to meet their needs. Me: What are you going to do with that information? U: Buy new technology or furniture to better meet those needs.
  • 17. Identifying the problem U: What is my conference room utilization? Me: What problem are you trying to solve? U: I want to know which departments are using the rooms the most. Me: Why do you want to know? U: To adjust the rooms to meet their needs. Me: What are you going to do with that information? U: Buy new technology or furniture to better meet those needs. Problem: Change meeting rooms to fit the needs of each department. Action: Make purchasing decisions about technology or furniture. Metrics: room utilization, organizer’s department, occupancy size, technology or furniture used
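
The same opening question leads to different metrics once the underlying problem changes. As a rough sketch, the department-level view from slide 17 might be a simple aggregation like the one below. The record layout, department names, and numbers are all hypothetical, and technology or furniture usage is left out for brevity.

```python
from collections import defaultdict

# Hypothetical meeting records:
# (organizer_department, room, hours, attendees)
meetings = [
    ("Sales", "Aspen", 1.0, 4),
    ("Sales", "Birch", 2.0, 8),
    ("Engineering", "Aspen", 1.5, 3),
    ("Engineering", "Aspen", 0.5, 5),
]


def summarize_by_department(records):
    """Count meetings, total hours, and average occupancy per department."""
    totals = defaultdict(lambda: {"meetings": 0, "hours": 0.0, "attendees": 0})
    for dept, _room, hours, attendees in records:
        t = totals[dept]
        t["meetings"] += 1
        t["hours"] += hours
        t["attendees"] += attendees
    return {
        dept: {
            "meetings": t["meetings"],
            "hours": t["hours"],
            "avg_occupancy": t["attendees"] / t["meetings"],
        }
        for dept, t in totals.items()
    }


summary = summarize_by_department(meetings)
for dept, stats in summary.items():
    print(dept, stats)
```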
  • 18. What problem are you trying to solve? What action will you take with this number?
  • 20. Data Strategy #2 Start simple and mature complexity over time
  • 21. “Can you predict which customers will renew?”
  • 22. Say hello to… the data science hierarchy of needs, which describes the stages of data complexity and insights.
  • 23. The Data Science Process
  • 27. First point of value Descriptive Analytics are your first stage where you can actually solve a problem and take an action. Especially important for business end users who want to apply the results of your analysis.
  • 28. First point of value Descriptive Analytics are your first stage where you can actually solve a problem and take an action. Especially important for business end users who want to apply the results of your analysis. Your early projects should not try to extend beyond this stage
  • 29. First point of value Focus first on counting These will be... ● Easier to explain to your stakeholders ● Faster to build and for stakeholders to realize value ● Easier to focus on good infrastructure and process. Including tests and alerting.
  • 30. First point of value Descriptive Analytics are your first stage where you can actually solve a problem and take an action. Businesses spend 1-3 months to get this into production the first time. They spend 1-3 years to really get this right.
  • 31. Businesses spend 1-3 months to get this into production the first time. They spend 1-3 years to really get this right. 1-2 years to do this well. 1-2 years to integrate these. 1+ years of modeling to integrate optimizations.
  • 33. Data Strategy #3 Practice good product development and iterate
  • 35. Traditional product development lifecycle Developing a data product is the same as any product. Having this process in place will mean more success in your data endeavours. Concept Idea Generation Research Assess Opportunity Analysis Business Assessment Develop Create Launch Delivery
  • 36. Understanding the problem to solve Concept Idea Generation Research Assess Opportunity Analysis Business Assessment Develop Create Launch Delivery Know the problem to solve and what action will be taken ● Identify the stakeholders ● Document the possible questions your stakeholders have ● Dive deep to find the root problem the stakeholders need to solve ● Identify the action they’re going to take once they have the information
  • 37. Assess what other opportunities there are Concept Idea Generation Research Assess Opportunity Analysis Business Assessment Develop Create Launch Delivery Determine the scope of what is needed to answer the question and figure out who needs to be involved ● What are the short-term and long-term goals for data? ● Who are the supporters and who are the opponents? ● Assuming we do this perfectly, what will we build first? ● What is the most evil thing which can be done?
  • 38. Figure out a plan for answering the question Concept Idea Generation Research Assess Opportunity Analysis Business Assessment Develop Create Launch Delivery Create requirements documentation which outlines what you plan to deliver ● Determine your project’s definition of success: when are you successful? ● Do product, design, and architecture reviews ● Determine team dependencies and business requirements ● Estimate costs, timelines, and milestones
  • 39. Concept Idea Generation Research Assess Opportunity Analysis Business Assessment Develop Create Launch Delivery As you develop the end deliverables you’re also building the infrastructure, testing & QA, alerting ● Stay focused ● Document other questions and possible data sources ● Build good architecture with testing and alerts ● Manage quality, only let clean data in! ● Backup and security Create something magical!
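
One way to read "only let clean data in" together with "testing and alerts" is a validation gate in front of the data store. The sketch below is an assumption about what such a gate could look like, not the speaker's implementation. The field names, the 0-24 hour rule, and the `alert` callback are all hypothetical.

```python
def validate_record(record):
    """Return a list of problems found in a record; empty means clean."""
    problems = []
    if not record.get("room"):
        problems.append("missing room")
    hours = record.get("hours_in_use")
    if isinstance(hours, bool) or not isinstance(hours, (int, float)):
        problems.append("hours_in_use is not a number")
    elif not 0 <= hours <= 24:
        problems.append("hours_in_use outside 0-24")
    return problems


def load_clean(records, alert=print):
    """Keep only clean records and send an alert for each rejected one."""
    clean = []
    for record in records:
        problems = validate_record(record)
        if problems:
            alert(f"rejected {record!r}: {', '.join(problems)}")
        else:
            clean.append(record)
    return clean


# Made-up incoming data: one clean record and two bad ones.
incoming = [
    {"room": "Aspen", "hours_in_use": 6.0},
    {"room": "", "hours_in_use": 2.0},
    {"room": "Cedar", "hours_in_use": 30},
]

alerts = []
clean = load_clean(incoming, alert=alerts.append)
print(clean)        # only the Aspen record remains
print(len(alerts))  # 2
```

Passing the alert handler in as a parameter keeps the gate easy to test, and the same function can later be pointed at whatever alerting channel the team actually uses.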
  • 40. Communicate your insights Concept Idea Generation Research Assess Opportunity Analysis Business Assessment Develop Create Launch Delivery Once you’ve completed the work, you need to package and deliver it in a way which your stakeholders can utilize ● Learn to speak the language of your stakeholders (executives or engineers) ● Review with stakeholders ● Evaluate expectations
  • 41. Concept Idea Generation Research Assess Opportunity Analysis Business Assessment Develop Create Launch + Maintain Delivery Data reports can become irrelevant and errors can arise so it is important to do ongoing reviews of the data ● Review dashboards: is data still relevant and actionable? ● Metrics meetings: does everyone still understand the data and are there new definitions which need to be evaluated? ● Domain specific reviews: meet with stakeholders and see what data is valuable to them and what actions they take. Plan to review the value of your insights
  • 42. You win by continuing the product development lifecycle, starting with data basics, and progressing data complexity over time. Concept Idea Generation Research Assess Opportunity Analysis Business Assessment Develop Create Launch + Maintain Delivery
  • 43. Rinse and Repeat Concept Idea Generation Research Assess Opportunity Analysis Business Assessment Develop Create Launch + Maintain Delivery
  • 44. Strategies: 1. Know what problem you are trying to solve 2. Start simple and mature complexity over time 3. Practice good product development and iterate
  • 45. Strategies: 1. Know what problem you are trying to solve 2. Start simple and mature complexity over time 3. Practice good product development and iterate Concept Idea Generation Research Assess Opportunity Analysis Business Assessment Develop Create Launch + Maintain Delivery Data science is successful when you learn something about the real world which helps you solve a problem by taking an action.
  • 46. References and Resources ● Rachel Schutt & Cathy O’Neil (2013) Doing Data Science: Straight Talk From the Frontline, Sebastopol, CA: O’Reilly ● DJ Patil & Hilary Mason (2015) Data Driven. Sebastopol, CA: O’Reilly ● DJ Patil (2011) Building Data Science Teams. Sebastopol, CA: O’Reilly ● Monica Rogati (2017) The AI Hierarchy of Needs ● Nick Crocker (2014) Thirty Things I’ve Learned ● Tavish Srivastava (2015) 13 Tips to make you awesome in Data Science / Analytics Jobs ● Daniel Tunkelang (2017) 10 Things Everyone Should Know About Machine Learning ● DJ Patil - Everything We Wish We'd Known About Building Data Products