Simo Ahava discusses data quality and the importance of a data-driven process and culture. He advocates for breaking down silos between teams by implementing a shared data layer and involving stakeholders from all teams in iterative development through a definition of done that incorporates data tracking requirements. Empowering developers to facilitate data collection and analysis, and hiring hybrid profiles with both business and technical skills can also improve data quality.
2. Simo Ahava
Senior Data Advocate, Reaktor
Google Developer Expert, Google Analytics
Blogger, developer, www.simoahava.com
Twitter-er, @SimoAhava
Google+:er, +SimoAhava
3. Data quality isn’t fixed.
Depending on the
hypothesis, a single data
set can shift from
useless to incredibly
insightful without a
single datum changing
shape, size, form, or
function.
#1 Data is subjective
10. The root of all evil
@SimoAhava from @ReaktorNow | #SPWK | 5 Feb 2016
11. The root of all evil
The "project"
@SimoAhava from @ReaktorNow | #SPWK | 5 Feb 2016
12. Your organization is
creating absurd
amounts of data with
every passing second,
and it’s very difficult to
adapt to the fluctuations
without an agile,
process-driven mindset.
#2 Data is a process
13. The project is often a
series of handovers,
breeding non-
involvement.
26. Silos, so what?
As long as the work gets done, right?
@SimoAhava from @ReaktorNow | #SPWK | 5 Feb 2016
27. Data is the lifeblood of the
organization. It flows
through all departments,
across job titles,
permeating the very
fabric of the organization,
reinforcing its
foundations for growth.
#3 Data abhors silos
29. Monthly reports which lack relevance, are rife with generic suggestions that lack research
in the context of your business, reiteration of previous month’s points, even if there are
solid reasons why they weren’t addressed.
30. Ridiculously ugly and ineffective JavaScript hacks for measurement points which should
be tackled in the Data Layer.
31. Hiding behind data, and passing blame to other silos.
Could someone fix the
Bounce Rate metric on our
site?
32. Analytics feature requests are deprioritized, and deployed extremely infrequently.
Fix
transactionRevenue
to show revenue, not
customer weight.
33. Communication is difficult due to the overhead of meeting face-to-face, project plans are
set in stone during sales, and it’s difficult to change existing project goals or set new ones
due to consultants being hired as "extra pairs of hands" rather than advisors.
35. Cure I: The Data Layer
@SimoAhava from @ReaktorNow | #SPWK | 5 Feb 2016
36. Cure I: The Data Layer
Using technology to solve communication problems
@SimoAhava from @ReaktorNow | #SPWK | 5 Feb 2016
37. Typically, there are three
definitions
of Data Layer that we use in
the digital world.
38.
39. 1. Set of business
requirements
for tracking
digital assets,
visits, and
visitors.
40. 1. Set of business
requirements
for tracking
digital assets,
visits, and
visitors.
2. Encoded, global
data structure,
accessed and
modified by
connected
platforms.
41. 1. Set of business
requirements
for tracking
digital assets,
visits, and
visitors.
2. Encoded, global
data structure,
accessed and
modified by
connected
platforms.
2. Data model of a
connected
platform, which
copies or digests
information in the
global structure.
42. 1. Set of business
requirements
for tracking
digital assets,
visits, and
visitors.
2. Encoded, global
data structure,
accessed and
modified by
connected
platforms.
2. Data model of a
connected
platform, which
copies or digests
information in the
global structure.
dataLayer.push({
'pageType' : 'home'
});
google_tag_manager['GTM-123']
.dataLayer
.set('pageType', 'home');
43. Across all three definitions,
the purpose of a Data Layer
is simple:
44. DMP / DWH / TMS / etc.
X X
Actions Presentation
Data Layer
45. DMP / DWH / TMS / etc.
X X
Actions Presentation
Data Layer
The purpose of a Data Layer is to provide a bilateral layer on the digital
asset, which decouples, normalises, and uniformly encodes semantic
information passed through and stored within.
46. The Data Layer is a joint
venture, where people and
systems communicate
across silos.
54. Definition of Done
Developed features
do not impede
measurement.
Developed features
are trackable.
Sprint
If necessary, feature
is encoded with
tracking attributes.
If necessary, feature
is linked to a Data
Layer object.
Feature
55. Definition of Done
Developed features
do not impede
measurement.
Developed features
are trackable.
Sprint
If necessary, feature
is encoded with
tracking attributes.
If necessary, feature
is linked to a Data
Layer object.
Feature
Attribute syntax is
correct for tracking.
Data Layer object
syntax is correct.
Task
64. 1. JavaScript: www.codecademy.com, www.codeschool.com,
Professional JavaScript for Web Developers, DOM
Enlightenment…
2. Digital analytics: www.kaushik.net, www.simoahava.com,
Successful Analytics, Practical Google Analytics and Google Tag
Manager for Developers…
3. Training, courses, certifications: Digital Analytics Association,
Digital Analytics Fundamentals (Google), Market Motive…
4. Conferences: MeasureCamp, SMX, eMetrics, Digital Analytics
Hub, ConversionXL, Superweek, All Things Data…
67. + Passionate, actively interested
+ Understands ever-changing requirements
+ Good grasp of digital tech
+ Statistical mindset
+ Knows the product / service inside and out
+ Critical about the present, curious about the future
69. + Dedicated sandbox
+ Website or blog to test
new ideas on
+ Test and debug setups in
Google Analytics and
Google Tag Manager
+ Utilization of GTM
environments