March 26, 2014
#gltrain
Welcome
Steve Ressler
Founder & President
GovLoop
- Tweets: #gltrain
- Slides, Video archive & other resources will be emailed to
you later this week.
- Check out the GovLoop VIP
program: http://www.govloop.com/TrainingVIPs
- Be sure to fill out the evaluation at the end of the event to
obtain your 3 CPEs.
The Data Revolution
“As we sit on the cusp of remarkable
innovations, we must remember that
this time, modern innovations are
powered by data.”
Our Goals - GovLoop Research
• Our research: empower and educate
• Our basic formula:
– Survey of GovLoop Audience
– Case study from state, local and federal government
– Best practices, challenges
– Cheat sheet
• Our mission: Help you do your job better
What You’ll Find in the Report
• A local government spotlight
showing how the city of Louisville,
Ky., has leveraged data to improve
services
• A federal government case study
highlighting the Army’s Enterprise
Management Decision Support
program.
• Industry insights on the current
big data landscape.
• 8 strategies and best practices for
smart big data adoption and
analysis.
• GovLoop’s big data cheat sheet.
GovLoop Survey Data
Defining Big Data
“Big Data refers to the massive amounts
of data that collect over time that are difficult to
analyze and handle using common database
management tools.” – PC Magazine
The Method for an Integrated
Knowledge Environment (MIKE) open-source
project: The MIKE project argues that
big data is not a function of the size of
a data set but its complexity.
Consequently, it is the high degree of
permutations and interactions within a
data set that defines big data.
The National Institute of Standards
and Technology (NIST): NIST argues that big
data is data which “exceed(s) the
capacity or capability of current or
conventional methods and systems.”
In other words, the notion of “big” is
relative to the current standard of
computation.
Definitions from: The Big Data Conundrum: How to Define It?
A Working Definition
“Big data is a term describing the storage
and analysis of large and or complex data
sets using a series of techniques
including, but not limited to: NoSQL,
MapReduce and machine learning.”
See full study: Undefined By Data: A Survey of Big Data Definitions
- Jonathan Stuart Ward and Adam Barker, School of Computer
Science, University of St. Andrews, UK.
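The working definition names MapReduce as one of the techniques. As a rough illustration only, and not drawn from the study or from any particular framework, the sketch below counts words in a few records using the map, shuffle, and reduce steps in plain Python. The function names and sample records are hypothetical.

```python
from collections import defaultdict
from itertools import chain


def map_phase(record):
    """Map step: emit one (word, 1) pair per word in the record."""
    return [(word.lower(), 1) for word in record.split()]


def shuffle(pairs):
    """Shuffle step: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce step: combine the values for one key into a single total."""
    return key, sum(values)


def word_count(records):
    """Run map, shuffle, and reduce in sequence on a list of strings."""
    mapped = chain.from_iterable(map_phase(r) for r in records)
    grouped = shuffle(mapped)
    return dict(reduce_phase(k, v) for k, v in grouped.items())


# Hypothetical sample input; a real system would spread records across machines.
records = ["big data is data", "data is big"]
counts = word_count(records)
print(counts)
```

In a production setting the map and reduce steps would run in parallel across many nodes. Here they run in one process so the flow of data is easy to follow.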
Do we care?
• Leveraging data in new ways to meet
mission need.
• Unlocking new insights by synthesizing
data across your department
• Collaborating and sharing resources
Convergence of Technology
You can’t talk about big data without
thinking about cloud and mobile.
3 Big Data Case Studies
Social Welfare
• The oldest case is the Famine Early Warning Systems
Network (FEWS NET), developed by the U.S. Agency for
International Development in 1986.
• The $25 million program helps optimize the
distribution of up to $1.5 billion per year in USAID
Food for Peace assistance.
Smarter Healthcare
• Predicting the likelihood of hospitalization or death
within 90 days, the Patient Care Assessment System
(PCAS) calculates the Care Assessment Needs (CAN)
Score.
• This score allows the Veterans Health Administration
(VHA) to focus care teams and proactively care for
their patients.
• The system collects 120 unique elements for 5.25
million patients and is supported by an 80-terabyte
corporate data warehouse.
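To give a sense of scale, the figures on this slide can be combined in a quick back-of-the-envelope calculation. This is a sketch under stated assumptions: it treats a terabyte as 10^12 bytes and divides the whole warehouse evenly across patients. The warehouse presumably holds more than the 120 elements, so the per-patient figure is only a rough ceiling.

```python
# Figures taken from the slide.
ELEMENTS_PER_PATIENT = 120
PATIENTS = 5_250_000
WAREHOUSE_TERABYTES = 80

# Assumption: decimal terabytes (10**12 bytes each).
WAREHOUSE_BYTES = WAREHOUSE_TERABYTES * 10**12

# Total number of element values tracked across all patients.
total_values = ELEMENTS_PER_PATIENT * PATIENTS

# Average warehouse storage per patient, if split evenly (an assumption).
bytes_per_patient = WAREHOUSE_BYTES / PATIENTS

print(f"{total_values:,} element values")
print(f"{bytes_per_patient / 1e6:.1f} MB of warehouse storage per patient")
```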
Biometric Intelligence
• Since 2003, US armed forces have collected biometric
information from non-US citizens in Iraq and Afghanistan.
• This information is used to identify enemy combatants and to manage access to
controlled areas.
• The system, known as the Automated Biometric
Identification System (ABIS), stores 4.4 million unique
identities and has identified over 3,000 enemy
combatants, added 190,000 identities to the watch list,
and protected the welfare of the United States and its
allies.
Big Data’s Challenges – From the
Survey
• Data governance
• Data locked in legacy systems
• Unclear mission and goals
• Lack of support
• Lack of clarity on metrics
Big Data’s Core Challenge
How do we find the needle in the
haystack…when the haystack keeps
getting exponentially bigger?
Our Big Data Best Practices
1: Executive Leadership
2: Business Before Treasure
3: Know Thyself: Define Use Cases
4: Leverage Existing Resources & Augment
5: Integrate Legacy Systems
6: Partner With Industry
7: Focus on Governance and Data Quality
8: Have a Deep Bench - Train
Where to begin?
• What problem are we trying to solve?
• How do we engage the right people?
• How do we break down silos?
• What kinds of data do we need?
• What do I need to be able to do with the data?
• Who needs access and when?
• Can I leverage existing technology my agency has?
• What does success look like?
Before we wrap up…
Today, public service
offers a unique
opportunity
Photo license: by Pa1nt, Flickr Creative Commons

Examining the Big Data Frontier
