SlideShare a Scribd company logo
1 of 3
Download to read offline
SOLUTION BRIEF

The Virtual Data Steward
Data Management 3.0
Empower Your Data Stewards to do More With Less

Are You Serious About Data
Governance?
Every company that is serious about data
governance needs data stewards. Data
stewards connect business information
requirements and processes with
information technology capabilities. This
function is essential to bridging data
management policies and standards to
day-to-day operational practices.
Data stewards improve the reusability,
accessibility, and quality of an
organization’s data. It is the data
steward’s responsibility to approve
business-naming standards, develop
consistent data definitions, document
business rules, monitor the quality of the
data in the data repository, and define
security requirements. A common and
seemingly simple example is looking at
two records and determining if they are
identical entities after a computer system
cannot confidently make that decision.
As critical as the role is, many companies
struggle with proper data stewardship, and
consequently overall data governance.
Midsize companies often make due
without data stewards early in their
growth and pay the price later with data
completeness and consistency issues.
Large organizations, by their nature, have
large amounts of data accumulated over
many years. Data may be out of date
or inconsistent across applications or
company divisions. In each of these
instances, organizations that struggle
with proper data stewardship eventually
face the challenge of potentially basing
critical business decisions off of bad or
incomplete data.

Large companies that grow via acquisition
face yet another problem. Aligning the
data between the acquiring company and
the acquired company can be a daunting
task; it may take several quarters, even
several years to fix data problems, and
often requires temporary staff to be hired
and assigned.
Despite decades of software development,
differences in naming standards and
definitions inevitably cause problems that
only humans can resolve. And with the
amount and variety of data growing as
fast as ever – from adding social media
handles to geolocation data – what is
a company to do? Crowdsourcing is a
means to gain access to millions of
people willing to perform work for pay,
and is the basis for recruiting and training
thousands of data stewards to work for
your organization.

What is Crowdsourcing?
Coined by Wired reporter Jeff Howe,
“crowdsourcing” is the act of taking a job
traditionally performed by a designated
person (usually an employee) and
outsourcing it to an undefined, generally
large group of people in the form of an
open call. Some people use the term
crowdsourcing broadly to describe many
different models, such as crowd funding,
crowd design contests, and crowd
ideation platforms. For the purposes of
this article, we limit crowdsourcing to the
act of distributing small, simple tasks
– microtasks – among a large group of
people online.

Virtual Data Steward
Advantages
•	 Easily scale throughput
up and down
•	 Leverage local knowledge
globally
•	 Increase efficiency and
quality
Microtasking is the act of dividing a
large task into smaller and well-defined
microtasks. For example, dividing a
customer record to be verified into
discrete fields, such as company name,
street address, phone number, company
website URL, and LinkedIn profile, is
microtasking. The idea is that verifying 10
URLs is faster and simpler than verifying
10 complete customer records. Once a
person gets good at the URL verification
task, they can do it faster and with greater
accuracy. Other people in the crowd can
take care of verifying street addresses
and phone numbers. Yet other people
can set off on researching and verifying
LinkedIn profiles.
Microtasks require human intelligence
and therefore are performed online by
a person, usually with some amount of
research, as opposed to being automated
algorithmically. The benefit to microtasking
is that a large volume of work can be
completed through the crowd with minimal
training.
Microtasking has many use cases, but
works best for low-complexity, high-volume
work. Some common uses are:

Data Collection and Enhancement
•• Finding or appending existing business
data with updated information

Data Categorization
•• Organizing data into predefined
categories

Content Creation and Moderation
•• Creating or reviewing short-form
content, such as product descriptions

Sentiment Analysis
•• Collecting public sentiment on a
particular product or service, typically
from social media sources
So, how then can crowdsourcing help
data stewardship? The answer is using
the crowd to augment internal data
stewardship, in what I term the virtual
data steward.

Virtual Data Steward
The virtual data steward is a person or set
of people in the crowd who completes

microtasks assigned to them by an
internal data steward. Using virtual data
stewards has several advantages:

Scale Throughput Up and Down
An organization can quickly process
a large volume of data – backlogs
resulting from system migrations or high
transactional volumes, for example – and
hire the crowd virtual data stewards to
process only those tasks. It can scale
back down afterward.

Leverage Local Knowledge
Crowd workers are located in more than
200 countries and have knowledge
of regional address conventions,
neighborhoods, phone syntax and all
kinds of local knowledge an outsourcer in
a single country cannot match.

Language Skills
Crowd workers can speak hundreds
of languages and many are capable of
translation or transliteration.

Increase Efficiency
A variable workforce is a less expensive
workforce, and is usually more cost
effective than hiring employees or
outsourced consultants.

Increase Quality
An option with virtual data stewards is
plurality – multiple people completing and
verifying individual data elements, which
improves overall quality.

A Win for the Data Steward
Internal data stewards should welcome,
rather than fear the emergence of the
virtual data steward. Mixing internal and
virtual data stewards means:

Increase Bandwidth
Many internal data stewards are
overwhelmed with data and can barely
keep up. Virtual data stewards free them
up to complete their work.

Focus on higher value work
Virtual data stewards can take the lower
complexity, or country- or languagespecific, work off the plates of internal
data stewards. This allows internal
staff to work on higher value and higher
complexity work, such as business rule
definition.

“Autodesk
processes
approximately
100,000 records
a month via virtual
data stewards.”
Ultimately, having help from virtual
data stewards makes the internal data
steward’s day-to-day job more fulfilling by
reducing some of the monotonous work.
Perhaps most interestingly, virtual data
stewards make possible the acquisition
and validation of entirely new data – data
valuable to the organization – such as
social media handle or GPS location
information, to name a few.

Virtual Data Stewards at
Autodesk: Enhancing Sales
Leads
Autodesk is one of the world’s 25 largest
software companies. The company
provides design, engineering and
entertainment software to customers in
architecture, manufacturing, building, and
media and entertainment. In a move from
selling individual products to end-to-end
solutions, Autodesk needed a better way
to identify its most promising sales leads
to incentivize its sales team to pursue
solution sales.
To do this, Autodesk’s CRM system
needed complete data for every lead:
industry, company size, parent and child
companies, website URL, executive team
bios and contact information. This data
historically came from multiple sources
with varying quality. One source was
Dun & Bradstreet, but it could provide
enhancement for just 70 percent of
Autodesk’s CRM database. Almost a third
of Autodesk’s potential sales were not
being used to incentivize its sales force.

To boost data quality, Autodesk turned
to crowdsourcing. Autodesk has a small
staff of internal data stewards, and uses
the crowd as virtual data stewards.
Autodesk funnels business records
missing key data into the CrowdFlower’s
platform via a direct API connection.
Virtual data stewards from the crowd first
work on cleaning bad data, then enrich
business records with company hierarchy
and categorization information to provide
critical support for targeting and solution
selling. They also cross-check and match
entries with the different data sources,
de-duplicate redundant information, and
categorize by business industry code,
allowing accurate reporting of customers
and sales by industry. The results are
automatically transmitted to Autodesk,
where its internal data steward oversees
results.
Autodesk is currently processing
approximately 100,000 records a
month via virtual data stewards. To
date, Autodesk improved the its data
completeness from 70 percent to 85
percent, at a cost that is 75 percent less
than paying an outsourcing company.
As a side benefit, instead of licensing
from data providers as it did in the past,
Autodesk retains the data it receives
back from the crowd, avoiding annual
data licensing fees.

Getting Started
Companies interested in
learning how to leverage
the crowd as virtual
data stewards should
speak to a CrowdFlower
crowdsourcing specialist. A
specialist can review data
requirements and make a
recommendation on the
best approach to creating
virtual data stewards with
our crowd. CrowdFlower
offers customers the choice
of a managed service –
with monthly quality and
throughput SLAs – or a
license to our technology
platform to manage the
process internally. Our
system integration partners
also offer a combination
of crowdsourcing and data
management expertise.

About CrowdFlower
CrowdFlower combines human intelligence with the scalability and efficiency of computer algorithms to offer quality-ensured
processing of business information. CrowdFlower’s platform provides solutions to a wide variety of data needs for enterprises
such as product catalog enhancement, content generation, image moderation, and business listing enrichment. With 5 million
Crowd Contributors completing millions of judgments each month for over 500 customers, CrowdFlower is the leader in enterprise
crowdsourcing. The company has successfully worked with well-respected enterprises including Apple, AT&T, Autodesk, eBay, Ford,
LinkedIn, Microsoft, Sears, Toshiba and Twitter.

For more information, visit www.crowdflower.com or email sales@crowdflower.com.

CrowdFlower, Inc. • 2111 Mission Street, Suite 302, San Francisco, CA 94110 • (415) 471-1920
Copyright © 2013 CrowdFlower. All rights reserved. CrowdFlower is a registered trademark in the U.S.A. and certain other countries. All other
trademarks or registered trademarks, product names and company names or logos cited are the property of their respective owners.

More Related Content

What's hot

Trends 2011 and_beyond_business_intelligence
Trends 2011 and_beyond_business_intelligenceTrends 2011 and_beyond_business_intelligence
Trends 2011 and_beyond_business_intelligencedivjeev
 
Big Data Journeys: Review of roadmaps taken by early adopters to achieve thei...
Big Data Journeys: Review of roadmaps taken by early adopters to achieve thei...Big Data Journeys: Review of roadmaps taken by early adopters to achieve thei...
Big Data Journeys: Review of roadmaps taken by early adopters to achieve thei...Krishnan Parasuraman
 
The data value map for GDPR - How to extract Business Value from your GDPR Pr...
The data value map for GDPR - How to extract Business Value from your GDPR Pr...The data value map for GDPR - How to extract Business Value from your GDPR Pr...
The data value map for GDPR - How to extract Business Value from your GDPR Pr...Ken O'Connor
 
Top 3 Hot Data Security And Privacy Technologies
Top 3 Hot Data Security And Privacy TechnologiesTop 3 Hot Data Security And Privacy Technologies
Top 3 Hot Data Security And Privacy TechnologiesTyrone Systems
 
Information Management best_practice_guide
Information Management best_practice_guideInformation Management best_practice_guide
Information Management best_practice_guideChristopher Bradley
 
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...Denodo
 
Applications of AI in Supply Chain Management: Hype versus Reality
Applications of AI in Supply Chain Management: Hype versus RealityApplications of AI in Supply Chain Management: Hype versus Reality
Applications of AI in Supply Chain Management: Hype versus RealityGanes Kesari
 
Estimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics PlatformEstimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics PlatformDATAVERSITY
 
Demystifying Big Data for Associations
Demystifying Big Data for AssociationsDemystifying Big Data for Associations
Demystifying Big Data for AssociationsPatrick Dorsey
 
Big Data Pushes Enterprises into Data-Driven Mode, Makes Demands for More App...
Big Data Pushes Enterprises into Data-Driven Mode, Makes Demands for More App...Big Data Pushes Enterprises into Data-Driven Mode, Makes Demands for More App...
Big Data Pushes Enterprises into Data-Driven Mode, Makes Demands for More App...Dana Gardner
 
Data Modelling is NOT just for RDBMS's
Data Modelling is NOT just for RDBMS'sData Modelling is NOT just for RDBMS's
Data Modelling is NOT just for RDBMS'sChristopher Bradley
 
Big Data 101 - Creating Real Value from the Data Lifecycle - Happiest Minds
 Big Data 101 - Creating Real Value from the Data Lifecycle - Happiest Minds Big Data 101 - Creating Real Value from the Data Lifecycle - Happiest Minds
Big Data 101 - Creating Real Value from the Data Lifecycle - Happiest Mindshappiestmindstech
 
Whitepaper: Big Data 101 - Creating Real Value from the Data Lifecycle - Happ...
Whitepaper: Big Data 101 - Creating Real Value from the Data Lifecycle - Happ...Whitepaper: Big Data 101 - Creating Real Value from the Data Lifecycle - Happ...
Whitepaper: Big Data 101 - Creating Real Value from the Data Lifecycle - Happ...Happiest Minds Technologies
 
Big Data in Financial Services: How to Improve Performance with Data-Driven D...
Big Data in Financial Services: How to Improve Performance with Data-Driven D...Big Data in Financial Services: How to Improve Performance with Data-Driven D...
Big Data in Financial Services: How to Improve Performance with Data-Driven D...Perficient, Inc.
 
data-to-insight-to-action-taking-a-business-process-view-for-analytics-to-del...
data-to-insight-to-action-taking-a-business-process-view-for-analytics-to-del...data-to-insight-to-action-taking-a-business-process-view-for-analytics-to-del...
data-to-insight-to-action-taking-a-business-process-view-for-analytics-to-del...Sokho TRINH
 
The Open Group Conference Panel Explores How the Big Data Era Now Challenges ...
The Open Group Conference Panel Explores How the Big Data Era Now Challenges ...The Open Group Conference Panel Explores How the Big Data Era Now Challenges ...
The Open Group Conference Panel Explores How the Big Data Era Now Challenges ...Dana Gardner
 

What's hot (20)

Big Data at a Glance
Big Data at a GlanceBig Data at a Glance
Big Data at a Glance
 
Trends 2011 and_beyond_business_intelligence
Trends 2011 and_beyond_business_intelligenceTrends 2011 and_beyond_business_intelligence
Trends 2011 and_beyond_business_intelligence
 
Big Data Journeys: Review of roadmaps taken by early adopters to achieve thei...
Big Data Journeys: Review of roadmaps taken by early adopters to achieve thei...Big Data Journeys: Review of roadmaps taken by early adopters to achieve thei...
Big Data Journeys: Review of roadmaps taken by early adopters to achieve thei...
 
The data value map for GDPR - How to extract Business Value from your GDPR Pr...
The data value map for GDPR - How to extract Business Value from your GDPR Pr...The data value map for GDPR - How to extract Business Value from your GDPR Pr...
The data value map for GDPR - How to extract Business Value from your GDPR Pr...
 
Top 3 Hot Data Security And Privacy Technologies
Top 3 Hot Data Security And Privacy TechnologiesTop 3 Hot Data Security And Privacy Technologies
Top 3 Hot Data Security And Privacy Technologies
 
Information Management best_practice_guide
Information Management best_practice_guideInformation Management best_practice_guide
Information Management best_practice_guide
 
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...
 
Applications of AI in Supply Chain Management: Hype versus Reality
Applications of AI in Supply Chain Management: Hype versus RealityApplications of AI in Supply Chain Management: Hype versus Reality
Applications of AI in Supply Chain Management: Hype versus Reality
 
Estimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics PlatformEstimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics Platform
 
Data Management for Dummies
Data Management for DummiesData Management for Dummies
Data Management for Dummies
 
Demystifying Big Data for Associations
Demystifying Big Data for AssociationsDemystifying Big Data for Associations
Demystifying Big Data for Associations
 
Big Data Pushes Enterprises into Data-Driven Mode, Makes Demands for More App...
Big Data Pushes Enterprises into Data-Driven Mode, Makes Demands for More App...Big Data Pushes Enterprises into Data-Driven Mode, Makes Demands for More App...
Big Data Pushes Enterprises into Data-Driven Mode, Makes Demands for More App...
 
Data Management
Data Management Data Management
Data Management
 
Data Modelling is NOT just for RDBMS's
Data Modelling is NOT just for RDBMS'sData Modelling is NOT just for RDBMS's
Data Modelling is NOT just for RDBMS's
 
Big Data 101 - Creating Real Value from the Data Lifecycle - Happiest Minds
 Big Data 101 - Creating Real Value from the Data Lifecycle - Happiest Minds Big Data 101 - Creating Real Value from the Data Lifecycle - Happiest Minds
Big Data 101 - Creating Real Value from the Data Lifecycle - Happiest Minds
 
Whitepaper: Big Data 101 - Creating Real Value from the Data Lifecycle - Happ...
Whitepaper: Big Data 101 - Creating Real Value from the Data Lifecycle - Happ...Whitepaper: Big Data 101 - Creating Real Value from the Data Lifecycle - Happ...
Whitepaper: Big Data 101 - Creating Real Value from the Data Lifecycle - Happ...
 
Big Data in Financial Services: How to Improve Performance with Data-Driven D...
Big Data in Financial Services: How to Improve Performance with Data-Driven D...Big Data in Financial Services: How to Improve Performance with Data-Driven D...
Big Data in Financial Services: How to Improve Performance with Data-Driven D...
 
SegmentOfOne
SegmentOfOneSegmentOfOne
SegmentOfOne
 
data-to-insight-to-action-taking-a-business-process-view-for-analytics-to-del...
data-to-insight-to-action-taking-a-business-process-view-for-analytics-to-del...data-to-insight-to-action-taking-a-business-process-view-for-analytics-to-del...
data-to-insight-to-action-taking-a-business-process-view-for-analytics-to-del...
 
The Open Group Conference Panel Explores How the Big Data Era Now Challenges ...
The Open Group Conference Panel Explores How the Big Data Era Now Challenges ...The Open Group Conference Panel Explores How the Big Data Era Now Challenges ...
The Open Group Conference Panel Explores How the Big Data Era Now Challenges ...
 

Viewers also liked

Sap increase your return on information by focusing on data governance - ma...
Sap   increase your return on information by focusing on data governance - ma...Sap   increase your return on information by focusing on data governance - ma...
Sap increase your return on information by focusing on data governance - ma...Bertille Laudoux
 
Using the information server toolset to deliver end to end traceability
Using the information server toolset to deliver end to end traceabilityUsing the information server toolset to deliver end to end traceability
Using the information server toolset to deliver end to end traceabilityIBM Sverige
 
Bridging the Data Security Gap
Bridging the Data Security GapBridging the Data Security Gap
Bridging the Data Security Gapxband
 
World of Watson 2016 - Data lake or Data Swamp
World of Watson 2016 - Data lake or Data SwampWorld of Watson 2016 - Data lake or Data Swamp
World of Watson 2016 - Data lake or Data SwampKeith Redman
 
Real-World Data Governance - Tools of Data Governance - Purchased and Develop...
Real-World Data Governance - Tools of Data Governance - Purchased and Develop...Real-World Data Governance - Tools of Data Governance - Purchased and Develop...
Real-World Data Governance - Tools of Data Governance - Purchased and Develop...DATAVERSITY
 
Real-World Data Governance: Business Glossaries and Data Governance
Real-World Data Governance: Business Glossaries and Data GovernanceReal-World Data Governance: Business Glossaries and Data Governance
Real-World Data Governance: Business Glossaries and Data GovernanceDATAVERSITY
 
"Modell Deutschland" - Infografik
"Modell Deutschland" - Infografik"Modell Deutschland" - Infografik
"Modell Deutschland" - InfografikWWF Deutschland
 
Successful stewardship Presentation
Successful stewardship PresentationSuccessful stewardship Presentation
Successful stewardship PresentationCertus Solutions
 
Leveraging Information Steward
Leveraging Information StewardLeveraging Information Steward
Leveraging Information StewardMethod360
 
IBM InfoSphere Stewardship Center for iis dqec
IBM InfoSphere Stewardship Center for iis dqecIBM InfoSphere Stewardship Center for iis dqec
IBM InfoSphere Stewardship Center for iis dqecIBMInfoSphereUGFR
 
MDM Architecture - SAP
MDM Architecture - SAPMDM Architecture - SAP
MDM Architecture - SAPCapgemini
 
Présentation IBM InfoSphere Information Server 11.3
Présentation IBM InfoSphere Information Server 11.3Présentation IBM InfoSphere Information Server 11.3
Présentation IBM InfoSphere Information Server 11.3IBMInfoSphereUGFR
 
Business objects data services in an sap landscape
Business objects data services in an sap landscapeBusiness objects data services in an sap landscape
Business objects data services in an sap landscapePradeep Ketoli
 
Bhawani prasad data integration-ppt
Bhawani prasad data integration-pptBhawani prasad data integration-ppt
Bhawani prasad data integration-pptBhawani N Prasad
 
593 Managing Enterprise Data Quality Using SAP Information Steward
593 Managing Enterprise Data Quality Using SAP Information Steward593 Managing Enterprise Data Quality Using SAP Information Steward
593 Managing Enterprise Data Quality Using SAP Information StewardVinny (Gurvinder) Ahuja
 
Sap information steward
Sap information stewardSap information steward
Sap information stewardytrhvk
 
Data Architecture for Data Governance
Data Architecture for Data GovernanceData Architecture for Data Governance
Data Architecture for Data GovernanceDATAVERSITY
 

Viewers also liked (20)

New Data Governance Lambda architecute
New Data Governance Lambda architecuteNew Data Governance Lambda architecute
New Data Governance Lambda architecute
 
Sap increase your return on information by focusing on data governance - ma...
Sap   increase your return on information by focusing on data governance - ma...Sap   increase your return on information by focusing on data governance - ma...
Sap increase your return on information by focusing on data governance - ma...
 
Datastewards
DatastewardsDatastewards
Datastewards
 
Using the information server toolset to deliver end to end traceability
Using the information server toolset to deliver end to end traceabilityUsing the information server toolset to deliver end to end traceability
Using the information server toolset to deliver end to end traceability
 
Bridging the Data Security Gap
Bridging the Data Security GapBridging the Data Security Gap
Bridging the Data Security Gap
 
World of Watson 2016 - Data lake or Data Swamp
World of Watson 2016 - Data lake or Data SwampWorld of Watson 2016 - Data lake or Data Swamp
World of Watson 2016 - Data lake or Data Swamp
 
Real-World Data Governance - Tools of Data Governance - Purchased and Develop...
Real-World Data Governance - Tools of Data Governance - Purchased and Develop...Real-World Data Governance - Tools of Data Governance - Purchased and Develop...
Real-World Data Governance - Tools of Data Governance - Purchased and Develop...
 
Real-World Data Governance: Business Glossaries and Data Governance
Real-World Data Governance: Business Glossaries and Data GovernanceReal-World Data Governance: Business Glossaries and Data Governance
Real-World Data Governance: Business Glossaries and Data Governance
 
"Modell Deutschland" - Infografik
"Modell Deutschland" - Infografik"Modell Deutschland" - Infografik
"Modell Deutschland" - Infografik
 
Successful stewardship Presentation
Successful stewardship PresentationSuccessful stewardship Presentation
Successful stewardship Presentation
 
BP_SAP_MDM
BP_SAP_MDMBP_SAP_MDM
BP_SAP_MDM
 
Leveraging Information Steward
Leveraging Information StewardLeveraging Information Steward
Leveraging Information Steward
 
IBM InfoSphere Stewardship Center for iis dqec
IBM InfoSphere Stewardship Center for iis dqecIBM InfoSphere Stewardship Center for iis dqec
IBM InfoSphere Stewardship Center for iis dqec
 
MDM Architecture - SAP
MDM Architecture - SAPMDM Architecture - SAP
MDM Architecture - SAP
 
Présentation IBM InfoSphere Information Server 11.3
Présentation IBM InfoSphere Information Server 11.3Présentation IBM InfoSphere Information Server 11.3
Présentation IBM InfoSphere Information Server 11.3
 
Business objects data services in an sap landscape
Business objects data services in an sap landscapeBusiness objects data services in an sap landscape
Business objects data services in an sap landscape
 
Bhawani prasad data integration-ppt
Bhawani prasad data integration-pptBhawani prasad data integration-ppt
Bhawani prasad data integration-ppt
 
593 Managing Enterprise Data Quality Using SAP Information Steward
593 Managing Enterprise Data Quality Using SAP Information Steward593 Managing Enterprise Data Quality Using SAP Information Steward
593 Managing Enterprise Data Quality Using SAP Information Steward
 
Sap information steward
Sap information stewardSap information steward
Sap information steward
 
Data Architecture for Data Governance
Data Architecture for Data GovernanceData Architecture for Data Governance
Data Architecture for Data Governance
 

Similar to Virtual Data Steward: Data Management 3.0

Data as a Service (DaaS): The What, Why, How, Who, and When
Data as a Service (DaaS): The What, Why, How, Who, and WhenData as a Service (DaaS): The What, Why, How, Who, and When
Data as a Service (DaaS): The What, Why, How, Who, and WhenRocketSource
 
Data Mining Services in various types
Data Mining Services in various typesData Mining Services in various types
Data Mining Services in various typesloginworks software
 
what-is-datafication-and-why-is-it-the-future-of-business-in-2023.pdf
what-is-datafication-and-why-is-it-the-future-of-business-in-2023.pdfwhat-is-datafication-and-why-is-it-the-future-of-business-in-2023.pdf
what-is-datafication-and-why-is-it-the-future-of-business-in-2023.pdfTemok IT Services
 
Data Analytics And Business Decision.pdf
Data Analytics And Business Decision.pdfData Analytics And Business Decision.pdf
Data Analytics And Business Decision.pdfCiente
 
Data Analytics And Business Decision.pdf
Data Analytics And Business Decision.pdfData Analytics And Business Decision.pdf
Data Analytics And Business Decision.pdfCiente
 
Accelerating Time to Success for Your Big Data Initiatives
Accelerating Time to Success for Your Big Data InitiativesAccelerating Time to Success for Your Big Data Initiatives
Accelerating Time to Success for Your Big Data Initiatives☁Jake Weaver ☁
 
Top 10 Digital Transformation Trends For Business
Top 10 Digital Transformation Trends For BusinessTop 10 Digital Transformation Trends For Business
Top 10 Digital Transformation Trends For BusinessAlbiorix Technology
 
Big-Data-The-Case-for-Customer-Experience
Big-Data-The-Case-for-Customer-ExperienceBig-Data-The-Case-for-Customer-Experience
Big-Data-The-Case-for-Customer-ExperienceAndrew Smith
 
Data Mining Services in various types
Data Mining Services in various typesData Mining Services in various types
Data Mining Services in various typesloginworks software
 
To Become a Data-Driven Enterprise, Data Democratization is Essential
To Become a Data-Driven Enterprise, Data Democratization is EssentialTo Become a Data-Driven Enterprise, Data Democratization is Essential
To Become a Data-Driven Enterprise, Data Democratization is EssentialCognizant
 
Big Data is Here for Financial Services White Paper
Big Data is Here for Financial Services White PaperBig Data is Here for Financial Services White Paper
Big Data is Here for Financial Services White PaperExperian
 
Analytics big data ibm
Analytics big data ibmAnalytics big data ibm
Analytics big data ibmAccenture
 
Current trends in enterprise application integration
Current trends in enterprise application integrationCurrent trends in enterprise application integration
Current trends in enterprise application integrationVisionet Systems, Inc.
 
Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...
Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...
Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...Stuart Blair
 
Why Big Data Automation is Important for Your Business.pdf
Why Big Data Automation is Important for Your Business.pdfWhy Big Data Automation is Important for Your Business.pdf
Why Big Data Automation is Important for Your Business.pdfData Science Council of America
 
Why Big Data Automation is Important for Your Business.pdf
Why Big Data Automation is Important for Your Business.pdfWhy Big Data Automation is Important for Your Business.pdf
Why Big Data Automation is Important for Your Business.pdfData Science Council of America
 

Similar to Virtual Data Steward: Data Management 3.0 (20)

Data as a Service (DaaS): The What, Why, How, Who, and When
Data as a Service (DaaS): The What, Why, How, Who, and WhenData as a Service (DaaS): The What, Why, How, Who, and When
Data as a Service (DaaS): The What, Why, How, Who, and When
 
Data Mining Services in various types
Data Mining Services in various typesData Mining Services in various types
Data Mining Services in various types
 
Cloud Analytics Playbook
Cloud Analytics PlaybookCloud Analytics Playbook
Cloud Analytics Playbook
 
what-is-datafication-and-why-is-it-the-future-of-business-in-2023.pdf
what-is-datafication-and-why-is-it-the-future-of-business-in-2023.pdfwhat-is-datafication-and-why-is-it-the-future-of-business-in-2023.pdf
what-is-datafication-and-why-is-it-the-future-of-business-in-2023.pdf
 
Data Analytics And Business Decision.pdf
Data Analytics And Business Decision.pdfData Analytics And Business Decision.pdf
Data Analytics And Business Decision.pdf
 
Data Analytics And Business Decision.pdf
Data Analytics And Business Decision.pdfData Analytics And Business Decision.pdf
Data Analytics And Business Decision.pdf
 
6 Reasons to Use Data Analytics
6 Reasons to Use Data Analytics6 Reasons to Use Data Analytics
6 Reasons to Use Data Analytics
 
Accelerating Time to Success for Your Big Data Initiatives
Accelerating Time to Success for Your Big Data InitiativesAccelerating Time to Success for Your Big Data Initiatives
Accelerating Time to Success for Your Big Data Initiatives
 
Data Analytics.pptx
Data Analytics.pptxData Analytics.pptx
Data Analytics.pptx
 
Hybrid IT
Hybrid ITHybrid IT
Hybrid IT
 
Top 10 Digital Transformation Trends For Business
Top 10 Digital Transformation Trends For BusinessTop 10 Digital Transformation Trends For Business
Top 10 Digital Transformation Trends For Business
 
Big-Data-The-Case-for-Customer-Experience
Big-Data-The-Case-for-Customer-ExperienceBig-Data-The-Case-for-Customer-Experience
Big-Data-The-Case-for-Customer-Experience
 
Data Mining Services in various types
Data Mining Services in various typesData Mining Services in various types
Data Mining Services in various types
 
To Become a Data-Driven Enterprise, Data Democratization is Essential
To Become a Data-Driven Enterprise, Data Democratization is EssentialTo Become a Data-Driven Enterprise, Data Democratization is Essential
To Become a Data-Driven Enterprise, Data Democratization is Essential
 
Big Data is Here for Financial Services White Paper
Big Data is Here for Financial Services White PaperBig Data is Here for Financial Services White Paper
Big Data is Here for Financial Services White Paper
 
Analytics big data ibm
Analytics big data ibmAnalytics big data ibm
Analytics big data ibm
 
Current trends in enterprise application integration
Current trends in enterprise application integrationCurrent trends in enterprise application integration
Current trends in enterprise application integration
 
Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...
Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...
Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...
 
Why Big Data Automation is Important for Your Business.pdf
Why Big Data Automation is Important for Your Business.pdfWhy Big Data Automation is Important for Your Business.pdf
Why Big Data Automation is Important for Your Business.pdf
 
Why Big Data Automation is Important for Your Business.pdf
Why Big Data Automation is Important for Your Business.pdfWhy Big Data Automation is Important for Your Business.pdf
Why Big Data Automation is Important for Your Business.pdf
 

More from CrowdFlower

Building Better Models Faster Using Active Learning
Building Better Models Faster Using Active LearningBuilding Better Models Faster Using Active Learning
Building Better Models Faster Using Active LearningCrowdFlower
 
Active Learning and Human-in-the-Loop
Active Learning and Human-in-the-LoopActive Learning and Human-in-the-Loop
Active Learning and Human-in-the-LoopCrowdFlower
 
CrowdFlower NDA Crowds - Secure, exceptional tasking at a massive scale.
CrowdFlower NDA Crowds - Secure, exceptional tasking at a massive scale. CrowdFlower NDA Crowds - Secure, exceptional tasking at a massive scale.
CrowdFlower NDA Crowds - Secure, exceptional tasking at a massive scale. CrowdFlower
 
CrowdFlower Product Webinar - Graphical Editor and Visual Reports
CrowdFlower Product Webinar - Graphical Editor and Visual ReportsCrowdFlower Product Webinar - Graphical Editor and Visual Reports
CrowdFlower Product Webinar - Graphical Editor and Visual ReportsCrowdFlower
 
How Oracle Uses CrowdFlower For Sentiment Analysis
How Oracle Uses CrowdFlower For Sentiment AnalysisHow Oracle Uses CrowdFlower For Sentiment Analysis
How Oracle Uses CrowdFlower For Sentiment AnalysisCrowdFlower
 
Humanizing The Machine
Humanizing The MachineHumanizing The Machine
Humanizing The MachineCrowdFlower
 
Open Data Science Conference 2015
Open Data Science Conference 2015Open Data Science Conference 2015
Open Data Science Conference 2015CrowdFlower
 
Productive Out-of-the-Box | Tooling with Yeoman to Rapidly Develop Ember.js A...
Productive Out-of-the-Box | Tooling with Yeoman to Rapidly Develop Ember.js A...Productive Out-of-the-Box | Tooling with Yeoman to Rapidly Develop Ember.js A...
Productive Out-of-the-Box | Tooling with Yeoman to Rapidly Develop Ember.js A...CrowdFlower
 
Expert Crowdsourcing with Flash Teams | CrowdConf 2013 poster
Expert Crowdsourcing with Flash Teams | CrowdConf 2013 posterExpert Crowdsourcing with Flash Teams | CrowdConf 2013 poster
Expert Crowdsourcing with Flash Teams | CrowdConf 2013 posterCrowdFlower
 
The State of Enterprise Crowdsourcing 2013
The State of Enterprise Crowdsourcing 2013The State of Enterprise Crowdsourcing 2013
The State of Enterprise Crowdsourcing 2013CrowdFlower
 
CrowdFlower University Oct. 21 2013
CrowdFlower University Oct. 21 2013CrowdFlower University Oct. 21 2013
CrowdFlower University Oct. 21 2013CrowdFlower
 

More from CrowdFlower (12)

7 Myths of AI
7 Myths of AI7 Myths of AI
7 Myths of AI
 
Building Better Models Faster Using Active Learning
Building Better Models Faster Using Active LearningBuilding Better Models Faster Using Active Learning
Building Better Models Faster Using Active Learning
 
Active Learning and Human-in-the-Loop
Active Learning and Human-in-the-LoopActive Learning and Human-in-the-Loop
Active Learning and Human-in-the-Loop
 
CrowdFlower NDA Crowds - Secure, exceptional tasking at a massive scale.
CrowdFlower NDA Crowds - Secure, exceptional tasking at a massive scale. CrowdFlower NDA Crowds - Secure, exceptional tasking at a massive scale.
CrowdFlower NDA Crowds - Secure, exceptional tasking at a massive scale.
 
CrowdFlower Product Webinar - Graphical Editor and Visual Reports
CrowdFlower Product Webinar - Graphical Editor and Visual ReportsCrowdFlower Product Webinar - Graphical Editor and Visual Reports
CrowdFlower Product Webinar - Graphical Editor and Visual Reports
 
How Oracle Uses CrowdFlower For Sentiment Analysis
How Oracle Uses CrowdFlower For Sentiment AnalysisHow Oracle Uses CrowdFlower For Sentiment Analysis
How Oracle Uses CrowdFlower For Sentiment Analysis
 
Humanizing The Machine
Humanizing The MachineHumanizing The Machine
Humanizing The Machine
 
Open Data Science Conference 2015
Open Data Science Conference 2015Open Data Science Conference 2015
Open Data Science Conference 2015
 
Productive Out-of-the-Box | Tooling with Yeoman to Rapidly Develop Ember.js A...
Productive Out-of-the-Box | Tooling with Yeoman to Rapidly Develop Ember.js A...Productive Out-of-the-Box | Tooling with Yeoman to Rapidly Develop Ember.js A...
Productive Out-of-the-Box | Tooling with Yeoman to Rapidly Develop Ember.js A...
 
Expert Crowdsourcing with Flash Teams | CrowdConf 2013 poster
Expert Crowdsourcing with Flash Teams | CrowdConf 2013 posterExpert Crowdsourcing with Flash Teams | CrowdConf 2013 poster
Expert Crowdsourcing with Flash Teams | CrowdConf 2013 poster
 
The State of Enterprise Crowdsourcing 2013
The State of Enterprise Crowdsourcing 2013The State of Enterprise Crowdsourcing 2013
The State of Enterprise Crowdsourcing 2013
 
CrowdFlower University Oct. 21 2013
CrowdFlower University Oct. 21 2013CrowdFlower University Oct. 21 2013
CrowdFlower University Oct. 21 2013
 

Recently uploaded

How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 

Recently uploaded (20)

How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 

Virtual Data Steward: Data Management 3.0

  • 1. SOLUTION BRIEF The Virtual Data Steward Data Management 3.0 Empower Your Data Stewards to do More With Less Are You Serious About Data Governance? Every company that is serious about data governance needs data stewards. Data stewards connect business information requirements and processes with information technology capabilities. This function is essential to bridging data management policies and standards to day-to-day operational practices. Data stewards improve the reusability, accessibility, and quality of an organization’s data. It is the data steward’s responsibility to approve business-naming standards, develop consistent data definitions, document business rules, monitor the quality of the data in the data repository, and define security requirements. A common and seemingly simple example is looking at two records and determining if they are identical entities after a computer system cannot confidently make that decision. As critical as the role is, many companies struggle with proper data stewardship, and consequently overall data governance. Midsize companies often make due without data stewards early in their growth and pay the price later with data completeness and consistency issues. Large organizations, by their nature, have large amounts of data accumulated over many years. Data may be out of date or inconsistent across applications or company divisions. In each of these instances, organizations that struggle with proper data stewardship eventually face the challenge of potentially basing critical business decisions off of bad or incomplete data. Large companies that grow via acquisition face yet another problem. Aligning the data between the acquiring company and the acquired company can be a daunting task; it may take several quarters, even several years to fix data problems, and often requires temporary staff to be hired and assigned. Despite decades of software development, differences in naming standards and definitions inevitably cause problems that only humans can resolve. And with the amount and variety of data growing as fast as ever – from adding social media handles to geolocation data – what is a company to do? Crowdsourcing is a means to gain access to millions of people willing to perform work for pay, and is the basis for recruiting and training thousands of data stewards to work for your organization. What is Crowdsourcing? Coined by Wired reporter Jeff Howe, “crowdsourcing” is the act of taking a job traditionally performed by a designated person (usually an employee) and outsourcing it to an undefined, generally large group of people in the form of an open call. Some people use the term crowdsourcing broadly to describe many different models, such as crowd funding, crowd design contests, and crowd ideation platforms. For the purposes of this article, we limit crowdsourcing to the act of distributing small, simple tasks – microtasks – among a large group of people online. Virtual Data Steward Advantages • Easily scale throughput up and down • Leverage local knowledge globally • Increase efficiency and quality
  • 2. Microtasking is the act of dividing a large task into smaller and well-defined microtasks. For example, dividing a customer record to be verified into discrete fields, such as company name, street address, phone number, company website URL, and LinkedIn profile, is microtasking. The idea is that verifying 10 URLs is faster and simpler than verifying 10 complete customer records. Once a person gets good at the URL verification task, they can do it faster and with greater accuracy. Other people in the crowd can take care of verifying street addresses and phone numbers. Yet other people can set off on researching and verifying LinkedIn profiles. Microtasks require human intelligence and therefore are performed online by a person, usually with some amount of research, as opposed to being automated algorithmically. The benefit to microtasking is that a large volume of work can be completed through the crowd with minimal training. Microtasking has many use cases, but works best for low-complexity, high-volume work. Some common uses are: Data Collection and Enhancement •• Finding or appending existing business data with updated information Data Categorization •• Organizing data into predefined categories Content Creation and Moderation •• Creating or reviewing short-form content, such as product descriptions Sentiment Analysis •• Collecting public sentiment on a particular product or service, typically from social media sources So, how then can crowdsourcing help data stewardship? The answer is using the crowd to augment internal data stewardship, in what I term the virtual data steward. Virtual Data Steward The virtual data steward is a person or set of people in the crowd who completes microtasks assigned to them by an internal data steward. Using virtual data stewards has several advantages: Scale Throughput Up and Down An organization can quickly process a large volume of data – backlogs resulting from system migrations or high transactional volumes, for example – and hire the crowd virtual data stewards to process only those tasks. It can scale back down afterward. Leverage Local Knowledge Crowd workers are located in more than 200 countries and have knowledge of regional address conventions, neighborhoods, phone syntax and all kinds of local knowledge an outsourcer in a single country cannot match. Language Skills Crowd workers can speak hundreds of languages and many are capable of translation or transliteration. Increase Efficiency A variable workforce is a less expensive workforce, and is usually more cost effective than hiring employees or outsourced consultants. Increase Quality An option with virtual data stewards is plurality – multiple people completing and verifying individual data elements, which improves overall quality. A Win for the Data Steward Internal data stewards should welcome, rather than fear the emergence of the virtual data steward. Mixing internal and virtual data stewards means: Increase Bandwidth Many internal data stewards are overwhelmed with data and can barely keep up. Virtual data stewards free them up to complete their work. Focus on higher value work Virtual data stewards can take the lower complexity, or country- or languagespecific, work off the plates of internal data stewards. This allows internal staff to work on higher value and higher complexity work, such as business rule definition. “Autodesk processes approximately 100,000 records a month via virtual data stewards.”
  • 3. Ultimately, having help from virtual data stewards makes the internal data steward’s day-to-day job more fulfilling by reducing some of the monotonous work. Perhaps most interestingly, virtual data stewards make possible the acquisition and validation of entirely new data – data valuable to the organization – such as social media handle or GPS location information, to name a few. Virtual Data Stewards at Autodesk: Enhancing Sales Leads Autodesk is one of the world’s 25 largest software companies. The company provides design, engineering and entertainment software to customers in architecture, manufacturing, building, and media and entertainment. In a move from selling individual products to end-to-end solutions, Autodesk needed a better way to identify its most promising sales leads to incentivize its sales team to pursue solution sales. To do this, Autodesk’s CRM system needed complete data for every lead: industry, company size, parent and child companies, website URL, executive team bios and contact information. This data historically came from multiple sources with varying quality. One source was Dun & Bradstreet, but it could provide enhancement for just 70 percent of Autodesk’s CRM database. Almost a third of Autodesk’s potential sales were not being used to incentivize its sales force. To boost data quality, Autodesk turned to crowdsourcing. Autodesk has a small staff of internal data stewards, and uses the crowd as virtual data stewards. Autodesk funnels business records missing key data into the CrowdFlower’s platform via a direct API connection. Virtual data stewards from the crowd first work on cleaning bad data, then enrich business records with company hierarchy and categorization information to provide critical support for targeting and solution selling. They also cross-check and match entries with the different data sources, de-duplicate redundant information, and categorize by business industry code, allowing accurate reporting of customers and sales by industry. The results are automatically transmitted to Autodesk, where its internal data steward oversees results. Autodesk is currently processing approximately 100,000 records a month via virtual data stewards. To date, Autodesk improved the its data completeness from 70 percent to 85 percent, at a cost that is 75 percent less than paying an outsourcing company. As a side benefit, instead of licensing from data providers as it did in the past, Autodesk retains the data it receives back from the crowd, avoiding annual data licensing fees. Getting Started Companies interested in learning how to leverage the crowd as virtual data stewards should speak to a CrowdFlower crowdsourcing specialist. A specialist can review data requirements and make a recommendation on the best approach to creating virtual data stewards with our crowd. CrowdFlower offers customers the choice of a managed service – with monthly quality and throughput SLAs – or a license to our technology platform to manage the process internally. Our system integration partners also offer a combination of crowdsourcing and data management expertise. About CrowdFlower CrowdFlower combines human intelligence with the scalability and efficiency of computer algorithms to offer quality-ensured processing of business information. CrowdFlower’s platform provides solutions to a wide variety of data needs for enterprises such as product catalog enhancement, content generation, image moderation, and business listing enrichment. With 5 million Crowd Contributors completing millions of judgments each month for over 500 customers, CrowdFlower is the leader in enterprise crowdsourcing. The company has successfully worked with well-respected enterprises including Apple, AT&T, Autodesk, eBay, Ford, LinkedIn, Microsoft, Sears, Toshiba and Twitter. For more information, visit www.crowdflower.com or email sales@crowdflower.com. CrowdFlower, Inc. • 2111 Mission Street, Suite 302, San Francisco, CA 94110 • (415) 471-1920 Copyright © 2013 CrowdFlower. All rights reserved. CrowdFlower is a registered trademark in the U.S.A. and certain other countries. All other trademarks or registered trademarks, product names and company names or logos cited are the property of their respective owners.