BIG DATA, LITTLE DEVICES
WHAT IT WILL DO TO US AND FOR US
WHAT IS BIG DATA?
0 - 2003: 5 exabytes
2011: 2.5 exabytes per day
PERSPECTIVES
1 MB, 1 GB, 1 TB, 2 PB, 5 EB
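To put the two data-volume figures above side by side, here is a small arithmetic sketch. It assumes decimal (SI) units, so 1 EB = 10^18 bytes; the slides do not say which convention they use, and the variable names are illustrative only.

```python
# Decimal (SI) byte units; an assumption, the slides do not state a convention.
MB, GB, TB, PB, EB = (10 ** n for n in (6, 9, 12, 15, 18))

total_through_2003 = 5 * EB   # everything created up to 2003, per the slide
daily_rate_2011 = 2.5 * EB    # created per day in 2011, per the slide

# Days needed at the 2011 rate to match everything created through 2003.
days_to_match = total_through_2003 / daily_rate_2011

# How many 1 TB drives it would take to hold 5 EB.
drives_needed = total_through_2003 // TB

print(days_to_match)   # 2.0
print(drives_needed)   # 5000000
```

Under these assumptions, the world produced in two days of 2011 what it had produced in all the years through 2003.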
WHERE’S IT COMING FROM?
Source: domo.com 2012
WHAT DOES IT LOOK LIKE?
DEFINITIONS
• Big Data: unstructured data; you don’t yet know what the questions are
• Business Intelligence: structured data; you know what questions you want answered
• Statistics: structured data, not real-time, no action taken as a result
• Machine Learning: creating algorithms and applying them to data sets in an attempt to learn from the data
• Predictive Analytics: extracting information from existing data to predict trends
WHY NOW?
• 2003: Doug Cutting & Mike Cafarella, Nutch
• 2004: Google Labs: MapReduce
• 2006: Doug Cutting moves to Yahoo and creates Hadoop
• 2008: Yahoo open sources Hadoop, Apache Software Foundation
• 2009: Matei Zaharia starts Spark at UC Berkeley
• 2013: Spark moves to the Apache Software Foundation
MAPREDUCE
Traditional / Sequential
Map
Reduce
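The contrast between traditional sequential processing and Map / Reduce can be sketched with a word count, the usual teaching example. This is a single-process illustration of the idea only, not Hadoop's or Google's API; the function names (`map_phase`, `shuffle`, `reduce_phase`) and the sample documents are hypothetical.

```python
from collections import defaultdict


def map_phase(document):
    """Emit a (word, 1) pair for every word in one document."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs):
    """Group emitted values by key, the step a framework does between phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Combine all values for one key into a single count."""
    return key, sum(values)


def word_count(documents):
    # Each map_phase call is independent, so a cluster could run them in parallel.
    mapped = [pair for doc in documents for pair in map_phase(doc)]
    grouped = shuffle(mapped)
    # Each key is reduced independently as well.
    return dict(reduce_phase(key, values) for key, values in grouped.items())


docs = ["big data little devices", "big data big questions"]
counts = word_count(docs)
print(counts)
```

Because no map call depends on another, and no key's reduction depends on another key, the work can be spread across many machines instead of running as one sequential loop.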
SPARK
x 100
Map
Reduce
CASES
WHAT IT WILL DO TO US
SECURITY - PRIVACY
NSA PRISM
PROFILING
VULNERABILITY
• Target
• Home Depot
• Michaels
• Blue Cross Blue Shield
• Sony Entertainment
SOCIETY
COMMERCE
AMAZON DASH
COMMERCE
AMAZON
CASES
WHAT IT WILL DO FOR US
SPORTS
SABERMETRICS (MONEYBALL)
95%
5%
PRODUCTIVITY
GOOGLE NOW
POLITICS
OBAMA CAMPAIGN 2012
SCIENCE
MONTEREY BAY AQUARIUM RESEARCH INSTITUTE
HEALTH
APPLE RESEARCHKIT
xt, Stanford says that it would normally take a national year-long effort to get that kind of scale. The flood of dat
MORE READING
• http://www.domo.com/blog/2014/04/data-never-sleeps-2-0/
• http://www.redorbit.com/education/reference_library/general-2/history-of/1113190638/the-history-of-mobile-phone-technology/
• http://www.forbes.com/sites/gilpress/2013/05/09/a-very-short-history-of-big-data/
• http://www.wired.com/2015/04/robots-roam-earths-imperiled-oceans/?mbid=nl_041315
• http://www.allbusiness.com/what-does-your-supermarket-know-about-you-15611312-1.html
• http://www.geekwire.com/2015/baseball-analytics-mystery-mlb-team-uses-a-cray-supercomputer-to-crunch-data/
• http://www.geekwire.com/2015/this-big-data-startup-just-raised-cash-to-analyze-driver-behavior-creating-safety-scores-for-individual-motorists/?utm_source=GeekWire+Daily+Digest&utm_campaign=20eb1892b3-daily-digest-email&utm_medium=email&utm_term=04e93fc7dfd-20eb1892b3-233387065&mc_cid=20eb1892b3&mc_eid=7b61e5049a
• http://www.newyorker.com/culture/culture-desk/the-horror-of-amazons-new-dash-button
• https://www.amazon.com/oc/dash-button
• http://harvardmagazine.com/2014/03/why-big-data-is-a-big-deal
• http://www.businessinsider.com/big-data-is-growing-thanks-to-mobile-2013-1
• http://venturebeat.com/2015/04/03/how-microsofts-using-big-data-to-predict-traffic-jams-up-to-an-hour-in-advance/
• http://www.engadget.com/2015/04/13/ibm-watson-health-cloud/?utm_source=Feed_Classic_Full&utm_medium=feed&utm_campaign=Engadget&?ncid=rss_full
?

Big Data and Small Devices: What will it do for us and to us


Editor's Notes

  1. This is the ultimate Big Data scenario. It’s bigger than big data. When the NSA Prism data center in Utah was being built, there was talk of yottabyte-scale storage. Calculations at the time suggested that a storage array of that size would cost trillions to create.
  2. Each of these is a case of a data breach in which customer data was stolen. In most cases the stolen data is credit card or address data. With breaches like Blue Cross, the scenario starts to darken. This is only the beginning: the more of what represents who you are is online, the greater the risk of having that identity stolen.
  3. For starters, Bolding notes that 95 percent of baseball stats have been created over the last five years thanks to the growing amount of data sensors and innovative methods of analyzing players. “They are gathering so much data that a single person with an Excel spreadsheet can no longer analyze, in a sophisticated way, all the data they have,” Bolding said. “They need bigger and bigger computers to be able to analyze the data.” As popularized by Michael Lewis’ Moneyball and the subsequent movie, using baseball data to drive decisions about player personnel (and ultimately win more games) was a strategy first used successfully by the Oakland A’s in 2003.
  4. The intent of Media Optimizer was to enable much more targeted ad purchases. Prior to Media Optimizer, TV ad buys were based on broad demographics, which is both costly and inefficient. With Media Optimizer in place, the campaign could use statistical analysis to identify the target voters in the DNC database. Next, the voter data was enriched, both with demographics data from TV ratings as well as advertisement pricing data. Finally, the results were fed back into Vertica and reanalyzed for further tuning. With the overall picture combining likely voters for Obama, the shows they watch, and the prices of the ads -- as well as the analysis feedback loop -- it was much easier to determine the most efficient ad buys. One result was that the Obama campaign purchased twice the number of cable TV advertisements as the Romney campaign, many during niche programs, aimed at the precise demographic slices the Obama campaign was trying to reach.
  5. MBARI has a fleet of them, three different kinds—autonomous machines that prowl the open oceans gathering data, allowing researchers to monitor it in real time. The machines do not tire, and they cannot drown. They survive shark bites. They can roam for months on end, beaming a steady stream of data to scientists sitting safely onshore.