SlideShare a Scribd company logo
1 of 38
Data complexity: variety and velocity
Petabytes
Massive Compute
and Storage
Deployment
expertise
Data of all Volume
Variety, Velocity
Speed Scale Economics
Always Up,
Always On Open and flexible
Time to value
Big Data
HDInsight
Script SQL NoSQL StreamingBatch
Map
reduce
In Memory
Core Engine
• Microsoft’s cloud Hadoop offering
• 100% open source Apache Hadoop
• Built on the latest releases for Hadoop
• Up and running in minutes with no hardware to deploy
• .NET and Java skills and deep integration to Visual Studio
• Utilize familiar BI tools for analysis including Microsoft Excel
• 99.9% Enterprise Service Level Agreement
HDInsight
Script SQL NoSQL StreamingBatch
Map
reduce
In Memory
Core Engine
HDInsight
Script SQL NoSQL StreamingBatch
Map
reduce
In Memory
Core Engine
Microsoft contribution to
Apache code
Data Node Data Node Data Node Data Node
Task Tracker Task Tracker Task Tracker Task Tracker
Name Node
Job Tracker
HMaster
Coordination
Region Server Region Server Region Server Region Server
• Random, fast (realtime) read/write access to your Big Data.
• Host very large tables (billions of rows X millions of columns) on clusters of
commodity hardware.
• Runs on top of the Hadoop Distributed File System (HDFS)
• Provides flexibility in that new columns can be added to column families at any
time
HDInsight
Script SQL NoSQL StreamingBatch
Map
reduce
In Memory
Core Engine
Stream
processin
g
Search and query
Data analytics (Excel)
Web/thick client
dashboards
Devices to take action
RabbitMQ /
ActiveMQ
HDInsight
Script SQL NoSQL StreamingBatch
Map
reduce
In Memory
Core Engine
• Single execution model for multiple tasks (SQL queries, Streaming, Machine
Learning, and Graph)
• Processing up to 100x faster performance
• Developer friendly (Java, Python, Scala)
• BI tool of choice (Power BI, Tabelau, Qlik, SAP)
• Notebook experience (Jupyter/iPython, Zeppelin)
HDInsight
Script SQL NoSQL StreamingBatch
Map
reduce
In Memory
Core Engine
Spark SQL Spark
Streaming
Machine
Learning
Graph
HDInsight
Script SQL NoSQL StreamingBatch
Map
reduce
In Memory
Core Engine
Spark for Azure HDInsight
In-memory computation engine – Fully managed
• Managed & supported by Microsoft
• Familiarity of Windows
• Re-use common tools, documentation, samples from Hadoop/Linux ecosystem
• Add Hadoop projects that were authored on Linux to HDInsight
• Easier transition from on-premise to cloud
Partner Spotlight: AtScale
Analysts Use Traditional BI Tools Against HDInsight
• HDFS For the Cloud
• Unlimited Storage, Petabyte Files
• Optimized for Massive Throughput
• High frequency, low latency, read immediately
• Managed and secured
PB
TB GB
PB
TB
Neudesic partnered with one of the nation's largest utility companies that recently
deployed Smart Utility Meters for power customers, nearly a million meters sending
usage data every 15 minutes.
The result: an Azure hybrid big data processing solution that enabled the customer
to perform gap analytics: a process for identifying gaps that exist in the power
usage readings, over 7x faster than their previous solution! Billions of Smart Meter
reads get processed to identify the nature and duration of the gaps to mitigate
revenue losses.
Smart Meters Business Rules
Processing
BI Layer
Blob Storage
HDInsightInput Processed Output data
ELT
Local SQL DB for Customer
and other confidential data
Extract processed data from
blob storage
AZCopy
AZCopy SSIS
Input files
Big Data in Retail
• Clickstream analytics
• Online recommendation engine
• 360° view of the customer
• Analyze brand sentiment
• Localized, personalized promotions
• Optimal store layout
Leading computer
manufacturer in world
• Use clickstream to deliver custom
website ecommerce experience
• Targeted ads for abandoned carts
• Use unstructured data from
website and social for data mining
• Combine w/sales data for 360 view
• Gather data from table-side
devices at restaurants
• Predict promotions/offers and
content to upsell to guests
• Gather social media sentiment
from customer feedback
• Combined with POS data, can
determine right product mix
Leading Multi-national
Retailer
• Track weather information
(temperature/forecast) to predict
shelf space for different seasons
• Sentiment analysis on feedback
Leading clothing online
retailer
• Use clickstream to understand who
is viewing their site
• Building recommendation engine
based on users’ clickpaths
Ziosk turned to Microsoft gold partner, Artis
Consulting to deploy a hybrid deployment
consisting of the Analytics Platform System, Azure
HDInsight, Power BI, and Azure Machine Learning
“Until now, we haven’t had the ability to
optimize the guest experience based on
their specific interactions with the devices.
With Azure, we can close the loop.”
Kevin Mowry
Ziosk
Chief Software Architect
Big Data in Health
• Predictive Analysis of Patient Health
& Clinical Decision Support
• Population, risk, and Care
management
• Real-time quality measures to assist
providers w/regulatory requirements
• Medical research data (eg. genomics)
• Recruit cohorts for pharmaceutical
trials
• Process large volumes of data from
any healthcare provider EHR
system
• Assist in showing compliance
• Store 7-30 years of data to meet
audit requirements
• Scan handwritten notes and do
natural language processing
• Analyze if symptoms might map to
bigger outbreak
• Collect clinical trial data (from
automated equipment, sensors)
• Find patterns on this data
(chemical compositions, enzymes)
• Process 6 years worth of data in a
few hours without any
infrastructure
Big Data Financial
Services
• New account risk screens
• Fraud prevention
• Trading risk
• Maximize deposit spread
• Insurance underwriting
• Accelerate loan processing
• Actively monitor currencies used by UK
manufacturers in supply chain to do risk analysis
• Monitor UK GDP to help customers stay on top of
economic trends
• Needed to handle increasing amounts of finance,
compliance, and legal data from trading operations
• Trading data drives strategic decisions
• Track customer feedback on social media and on
their blog posts/website to understand loyalty
• Predict at-risks clients to reach out to
• Process data for actuaries to analyze results to
understand risks for insurance companies
• Milliman’s application understands relationships
between people, process, and technology to
manage risk
Tangerine partners with Microsoft to build a
solution with Analytics Platform System for the
data warehouse and uses PolyBase to query Azure
HDInsight in the cloud.
“With pre-built integration using PolyBase
to query both the relational data
warehouse and Hadoop in the cloud, the
solution will allow us to reap the benefits of
both relational and non-relational data
regardless of where it lives.”
http://azure.microsoft.com/en-us/documentation/services/hdinsight/
http://azure.microsoft.com/en-us/documentation/articles/hdinsight-learn-map/
http://www.microsoftvirtualacademy.com/training-courses/getting-started-with-microsoft-big-data
http://channel9.msdn.com/Shows/Data-Exposed
http://azure.microsoft.com/en-us/pricing/free-trial/
Data complexity: variety and velocity with HDInsight

More Related Content

What's hot

Scaling Data Science on Big Data
Scaling Data Science on Big DataScaling Data Science on Big Data
Scaling Data Science on Big DataDataWorks Summit
 
Db2 analytics accelerator on ibm integrated analytics system technical over...
Db2 analytics accelerator on ibm integrated analytics system   technical over...Db2 analytics accelerator on ibm integrated analytics system   technical over...
Db2 analytics accelerator on ibm integrated analytics system technical over...Daniel Martin
 
Sidecars and a Microservices Mesh
Sidecars and a Microservices MeshSidecars and a Microservices Mesh
Sidecars and a Microservices MeshRed Hat Developers
 
Choosing technologies for a big data solution in the cloud
Choosing technologies for a big data solution in the cloudChoosing technologies for a big data solution in the cloud
Choosing technologies for a big data solution in the cloudJames Serra
 
Oracle Big data at work
Oracle Big data at workOracle Big data at work
Oracle Big data at worksolarisyougood
 
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data avanttic Consultoría Tecnológica
 
Modern Data Warehousing with the Microsoft Analytics Platform System
Modern Data Warehousing with the Microsoft Analytics Platform SystemModern Data Warehousing with the Microsoft Analytics Platform System
Modern Data Warehousing with the Microsoft Analytics Platform SystemJames Serra
 
Cloudera Analytics and Machine Learning Platform - Optimized for Cloud
Cloudera Analytics and Machine Learning Platform - Optimized for Cloud Cloudera Analytics and Machine Learning Platform - Optimized for Cloud
Cloudera Analytics and Machine Learning Platform - Optimized for Cloud Stefan Lipp
 
Enterprise Data Warehouse Optimization: 7 Keys to Success
Enterprise Data Warehouse Optimization: 7 Keys to SuccessEnterprise Data Warehouse Optimization: 7 Keys to Success
Enterprise Data Warehouse Optimization: 7 Keys to SuccessHortonworks
 
Red hat ceph storage customer presentation
Red hat ceph storage customer presentationRed hat ceph storage customer presentation
Red hat ceph storage customer presentationRodrigo Missiaggia
 
How Apache Spark and Apache Hadoop are being used to keep banking regulators ...
How Apache Spark and Apache Hadoop are being used to keep banking regulators ...How Apache Spark and Apache Hadoop are being used to keep banking regulators ...
How Apache Spark and Apache Hadoop are being used to keep banking regulators ...DataWorks Summit
 
Simplifying Big Data Integration with Syncsort DMX and DMX-h
Simplifying Big Data Integration with Syncsort DMX and DMX-hSimplifying Big Data Integration with Syncsort DMX and DMX-h
Simplifying Big Data Integration with Syncsort DMX and DMX-hPrecisely
 
Empowering you with Democratized Data Access, Data Science and Machine Learning
Empowering you with Democratized Data Access, Data Science and Machine LearningEmpowering you with Democratized Data Access, Data Science and Machine Learning
Empowering you with Democratized Data Access, Data Science and Machine LearningDataWorks Summit
 
How Experian increased insights with Hadoop
How Experian increased insights with HadoopHow Experian increased insights with Hadoop
How Experian increased insights with HadoopPrecisely
 

What's hot (20)

Scaling Data Science on Big Data
Scaling Data Science on Big DataScaling Data Science on Big Data
Scaling Data Science on Big Data
 
Db2 analytics accelerator on ibm integrated analytics system technical over...
Db2 analytics accelerator on ibm integrated analytics system   technical over...Db2 analytics accelerator on ibm integrated analytics system   technical over...
Db2 analytics accelerator on ibm integrated analytics system technical over...
 
Sidecars and a Microservices Mesh
Sidecars and a Microservices MeshSidecars and a Microservices Mesh
Sidecars and a Microservices Mesh
 
Choosing technologies for a big data solution in the cloud
Choosing technologies for a big data solution in the cloudChoosing technologies for a big data solution in the cloud
Choosing technologies for a big data solution in the cloud
 
Oracle Big data at work
Oracle Big data at workOracle Big data at work
Oracle Big data at work
 
Big Data at your Desk with KNIME
Big Data at your Desk with KNIMEBig Data at your Desk with KNIME
Big Data at your Desk with KNIME
 
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data
 
Big Data: Myths and Realities
Big Data: Myths and RealitiesBig Data: Myths and Realities
Big Data: Myths and Realities
 
Modern Data Warehousing with the Microsoft Analytics Platform System
Modern Data Warehousing with the Microsoft Analytics Platform SystemModern Data Warehousing with the Microsoft Analytics Platform System
Modern Data Warehousing with the Microsoft Analytics Platform System
 
Cloudera Analytics and Machine Learning Platform - Optimized for Cloud
Cloudera Analytics and Machine Learning Platform - Optimized for Cloud Cloudera Analytics and Machine Learning Platform - Optimized for Cloud
Cloudera Analytics and Machine Learning Platform - Optimized for Cloud
 
Hadoop for the Masses
Hadoop for the MassesHadoop for the Masses
Hadoop for the Masses
 
Enterprise Data Warehouse Optimization: 7 Keys to Success
Enterprise Data Warehouse Optimization: 7 Keys to SuccessEnterprise Data Warehouse Optimization: 7 Keys to Success
Enterprise Data Warehouse Optimization: 7 Keys to Success
 
IBM Power8 announce
IBM Power8 announceIBM Power8 announce
IBM Power8 announce
 
Red hat ceph storage customer presentation
Red hat ceph storage customer presentationRed hat ceph storage customer presentation
Red hat ceph storage customer presentation
 
How Apache Spark and Apache Hadoop are being used to keep banking regulators ...
How Apache Spark and Apache Hadoop are being used to keep banking regulators ...How Apache Spark and Apache Hadoop are being used to keep banking regulators ...
How Apache Spark and Apache Hadoop are being used to keep banking regulators ...
 
Simplifying Big Data Integration with Syncsort DMX and DMX-h
Simplifying Big Data Integration with Syncsort DMX and DMX-hSimplifying Big Data Integration with Syncsort DMX and DMX-h
Simplifying Big Data Integration with Syncsort DMX and DMX-h
 
Empowering you with Democratized Data Access, Data Science and Machine Learning
Empowering you with Democratized Data Access, Data Science and Machine LearningEmpowering you with Democratized Data Access, Data Science and Machine Learning
Empowering you with Democratized Data Access, Data Science and Machine Learning
 
Instrumenting your Instruments
Instrumenting your Instruments Instrumenting your Instruments
Instrumenting your Instruments
 
Data-In-Motion Unleashed
Data-In-Motion UnleashedData-In-Motion Unleashed
Data-In-Motion Unleashed
 
How Experian increased insights with Hadoop
How Experian increased insights with HadoopHow Experian increased insights with Hadoop
How Experian increased insights with Hadoop
 

Viewers also liked

Cortana Analytics Suite
Cortana Analytics SuiteCortana Analytics Suite
Cortana Analytics SuiteJames Serra
 
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...MSAdvAnalytics
 
Azure Stream Analytics
Azure Stream AnalyticsAzure Stream Analytics
Azure Stream AnalyticsJames Serra
 
IBM Watson Analytics Presentation
IBM Watson Analytics PresentationIBM Watson Analytics Presentation
IBM Watson Analytics PresentationIan Balina
 
Accelerating Business Intelligence Solutions with Microsoft Azure pass
Accelerating Business Intelligence Solutions with Microsoft Azure   passAccelerating Business Intelligence Solutions with Microsoft Azure   pass
Accelerating Business Intelligence Solutions with Microsoft Azure passJason Strate
 
Georgia Azure Event - Scalable cloud games using Microsoft Azure
Georgia Azure Event - Scalable cloud games using Microsoft AzureGeorgia Azure Event - Scalable cloud games using Microsoft Azure
Georgia Azure Event - Scalable cloud games using Microsoft AzureMicrosoft
 
OpenPOWER Roadmap Toward CORAL
OpenPOWER Roadmap Toward CORALOpenPOWER Roadmap Toward CORAL
OpenPOWER Roadmap Toward CORALinside-BigData.com
 
Presentacin webinar move_up_to_power8_with_scale_out_servers_final
Presentacin webinar move_up_to_power8_with_scale_out_servers_finalPresentacin webinar move_up_to_power8_with_scale_out_servers_final
Presentacin webinar move_up_to_power8_with_scale_out_servers_finalDiego Alberto Tamayo
 
Oracle Solaris Software Integration
Oracle Solaris Software IntegrationOracle Solaris Software Integration
Oracle Solaris Software IntegrationOTN Systems Hub
 
Open Innovation with Power Systems
Open Innovation with Power Systems Open Innovation with Power Systems
Open Innovation with Power Systems IBM Power Systems
 
Expert summit SQL Server 2016
Expert summit   SQL Server 2016Expert summit   SQL Server 2016
Expert summit SQL Server 2016Łukasz Grala
 
Oracle Solaris Secure Cloud Infrastructure
Oracle Solaris Secure Cloud InfrastructureOracle Solaris Secure Cloud Infrastructure
Oracle Solaris Secure Cloud InfrastructureOTN Systems Hub
 
Oracle Solaris Build and Run Applications Better on 11.3
Oracle Solaris  Build and Run Applications Better on 11.3Oracle Solaris  Build and Run Applications Better on 11.3
Oracle Solaris Build and Run Applications Better on 11.3OTN Systems Hub
 

Viewers also liked (19)

Cortana Analytics Suite
Cortana Analytics SuiteCortana Analytics Suite
Cortana Analytics Suite
 
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
 
Azure Stream Analytics
Azure Stream AnalyticsAzure Stream Analytics
Azure Stream Analytics
 
IBM Watson Analytics Presentation
IBM Watson Analytics PresentationIBM Watson Analytics Presentation
IBM Watson Analytics Presentation
 
Accelerating Business Intelligence Solutions with Microsoft Azure pass
Accelerating Business Intelligence Solutions with Microsoft Azure   passAccelerating Business Intelligence Solutions with Microsoft Azure   pass
Accelerating Business Intelligence Solutions with Microsoft Azure pass
 
Georgia Azure Event - Scalable cloud games using Microsoft Azure
Georgia Azure Event - Scalable cloud games using Microsoft AzureGeorgia Azure Event - Scalable cloud games using Microsoft Azure
Georgia Azure Event - Scalable cloud games using Microsoft Azure
 
OpenPOWER Roadmap Toward CORAL
OpenPOWER Roadmap Toward CORALOpenPOWER Roadmap Toward CORAL
OpenPOWER Roadmap Toward CORAL
 
The State of Linux Containers
The State of Linux ContainersThe State of Linux Containers
The State of Linux Containers
 
OpenPOWER Update
OpenPOWER UpdateOpenPOWER Update
OpenPOWER Update
 
IBM POWER8 as an HPC platform
IBM POWER8 as an HPC platformIBM POWER8 as an HPC platform
IBM POWER8 as an HPC platform
 
Presentacin webinar move_up_to_power8_with_scale_out_servers_final
Presentacin webinar move_up_to_power8_with_scale_out_servers_finalPresentacin webinar move_up_to_power8_with_scale_out_servers_final
Presentacin webinar move_up_to_power8_with_scale_out_servers_final
 
Bitcoin explained
Bitcoin explainedBitcoin explained
Bitcoin explained
 
Blockchain
BlockchainBlockchain
Blockchain
 
Oracle Solaris Software Integration
Oracle Solaris Software IntegrationOracle Solaris Software Integration
Oracle Solaris Software Integration
 
Open Innovation with Power Systems
Open Innovation with Power Systems Open Innovation with Power Systems
Open Innovation with Power Systems
 
Expert summit SQL Server 2016
Expert summit   SQL Server 2016Expert summit   SQL Server 2016
Expert summit SQL Server 2016
 
Puppet + Windows Nano Server
Puppet + Windows Nano ServerPuppet + Windows Nano Server
Puppet + Windows Nano Server
 
Oracle Solaris Secure Cloud Infrastructure
Oracle Solaris Secure Cloud InfrastructureOracle Solaris Secure Cloud Infrastructure
Oracle Solaris Secure Cloud Infrastructure
 
Oracle Solaris Build and Run Applications Better on 11.3
Oracle Solaris  Build and Run Applications Better on 11.3Oracle Solaris  Build and Run Applications Better on 11.3
Oracle Solaris Build and Run Applications Better on 11.3
 

Similar to Data complexity: variety and velocity with HDInsight

Hadoop in the Cloud: Common Architectural Patterns
Hadoop in the Cloud: Common Architectural PatternsHadoop in the Cloud: Common Architectural Patterns
Hadoop in the Cloud: Common Architectural PatternsDataWorks Summit
 
NYC Data Amp - Microsoft Azure and Data Services Overview
NYC Data Amp - Microsoft Azure and Data Services OverviewNYC Data Amp - Microsoft Azure and Data Services Overview
NYC Data Amp - Microsoft Azure and Data Services OverviewTravis Wright
 
Skillwise Big Data part 2
Skillwise Big Data part 2Skillwise Big Data part 2
Skillwise Big Data part 2Skillwise Group
 
How does Microsoft solve Big Data?
How does Microsoft solve Big Data?How does Microsoft solve Big Data?
How does Microsoft solve Big Data?James Serra
 
Customer value analysis of big data products
Customer value analysis of big data productsCustomer value analysis of big data products
Customer value analysis of big data productsVikas Sardana
 
Girish Juneja - Intel Big Data & Cloud Summit 2013
Girish Juneja - Intel Big Data & Cloud Summit 2013Girish Juneja - Intel Big Data & Cloud Summit 2013
Girish Juneja - Intel Big Data & Cloud Summit 2013IntelAPAC
 
ISV Showcase: End-to-end Machine Learning using H2O on Azure
ISV Showcase: End-to-end Machine Learning using H2O on AzureISV Showcase: End-to-end Machine Learning using H2O on Azure
ISV Showcase: End-to-end Machine Learning using H2O on AzureMicrosoft Tech Community
 
OC Big Data Monthly Meetup #6 - Session 1 - IBM
OC Big Data Monthly Meetup #6 - Session 1 - IBMOC Big Data Monthly Meetup #6 - Session 1 - IBM
OC Big Data Monthly Meetup #6 - Session 1 - IBMBig Data Joe™ Rossi
 
SD Big Data Monthly Meetup #4 - Session 1 - IBM
SD Big Data Monthly Meetup #4 - Session 1 - IBMSD Big Data Monthly Meetup #4 - Session 1 - IBM
SD Big Data Monthly Meetup #4 - Session 1 - IBMBig Data Joe™ Rossi
 
利用 Amazon QuickSight 視覺化分析服務剖析資料
利用 Amazon QuickSight 視覺化分析服務剖析資料利用 Amazon QuickSight 視覺化分析服務剖析資料
利用 Amazon QuickSight 視覺化分析服務剖析資料Amazon Web Services
 
AWS Webcast - Sales Productivity Solutions with MicroStrategy and Redshift
AWS Webcast - Sales Productivity Solutions with MicroStrategy and RedshiftAWS Webcast - Sales Productivity Solutions with MicroStrategy and Redshift
AWS Webcast - Sales Productivity Solutions with MicroStrategy and RedshiftAmazon Web Services
 
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...Cloudera, Inc.
 
Intro to Product Development
Intro to Product DevelopmentIntro to Product Development
Intro to Product DevelopmentPuja Pramudya
 
Deteo. Data science, Big Data expertise
Deteo. Data science, Big Data expertise Deteo. Data science, Big Data expertise
Deteo. Data science, Big Data expertise deteo
 
Cisco_Big_Data_Webinar_At-A-Glance_ABSOLUTE_FINAL_VERSION
Cisco_Big_Data_Webinar_At-A-Glance_ABSOLUTE_FINAL_VERSIONCisco_Big_Data_Webinar_At-A-Glance_ABSOLUTE_FINAL_VERSION
Cisco_Big_Data_Webinar_At-A-Glance_ABSOLUTE_FINAL_VERSIONRenee Yao
 
[Webinar] Getting to Insights Faster: A Framework for Agile Big Data
[Webinar] Getting to Insights Faster: A Framework for Agile Big Data[Webinar] Getting to Insights Faster: A Framework for Agile Big Data
[Webinar] Getting to Insights Faster: A Framework for Agile Big DataInfochimps, a CSC Big Data Business
 
Using real time big data analytics for competitive advantage
 Using real time big data analytics for competitive advantage Using real time big data analytics for competitive advantage
Using real time big data analytics for competitive advantageAmazon Web Services
 

Similar to Data complexity: variety and velocity with HDInsight (20)

Hadoop in the Cloud: Common Architectural Patterns
Hadoop in the Cloud: Common Architectural PatternsHadoop in the Cloud: Common Architectural Patterns
Hadoop in the Cloud: Common Architectural Patterns
 
Big Data in Azure
Big Data in AzureBig Data in Azure
Big Data in Azure
 
NYC Data Amp - Microsoft Azure and Data Services Overview
NYC Data Amp - Microsoft Azure and Data Services OverviewNYC Data Amp - Microsoft Azure and Data Services Overview
NYC Data Amp - Microsoft Azure and Data Services Overview
 
Skilwise Big data
Skilwise Big dataSkilwise Big data
Skilwise Big data
 
Skillwise Big Data part 2
Skillwise Big Data part 2Skillwise Big Data part 2
Skillwise Big Data part 2
 
How does Microsoft solve Big Data?
How does Microsoft solve Big Data?How does Microsoft solve Big Data?
How does Microsoft solve Big Data?
 
Customer value analysis of big data products
Customer value analysis of big data productsCustomer value analysis of big data products
Customer value analysis of big data products
 
Girish Juneja - Intel Big Data & Cloud Summit 2013
Girish Juneja - Intel Big Data & Cloud Summit 2013Girish Juneja - Intel Big Data & Cloud Summit 2013
Girish Juneja - Intel Big Data & Cloud Summit 2013
 
ISV Showcase: End-to-end Machine Learning using H2O on Azure
ISV Showcase: End-to-end Machine Learning using H2O on AzureISV Showcase: End-to-end Machine Learning using H2O on Azure
ISV Showcase: End-to-end Machine Learning using H2O on Azure
 
OC Big Data Monthly Meetup #6 - Session 1 - IBM
OC Big Data Monthly Meetup #6 - Session 1 - IBMOC Big Data Monthly Meetup #6 - Session 1 - IBM
OC Big Data Monthly Meetup #6 - Session 1 - IBM
 
SD Big Data Monthly Meetup #4 - Session 1 - IBM
SD Big Data Monthly Meetup #4 - Session 1 - IBMSD Big Data Monthly Meetup #4 - Session 1 - IBM
SD Big Data Monthly Meetup #4 - Session 1 - IBM
 
利用 Amazon QuickSight 視覺化分析服務剖析資料
利用 Amazon QuickSight 視覺化分析服務剖析資料利用 Amazon QuickSight 視覺化分析服務剖析資料
利用 Amazon QuickSight 視覺化分析服務剖析資料
 
AWS Webcast - Sales Productivity Solutions with MicroStrategy and Redshift
AWS Webcast - Sales Productivity Solutions with MicroStrategy and RedshiftAWS Webcast - Sales Productivity Solutions with MicroStrategy and Redshift
AWS Webcast - Sales Productivity Solutions with MicroStrategy and Redshift
 
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
 
Microsoft Big Data
Microsoft Big DataMicrosoft Big Data
Microsoft Big Data
 
Intro to Product Development
Intro to Product DevelopmentIntro to Product Development
Intro to Product Development
 
Deteo. Data science, Big Data expertise
Deteo. Data science, Big Data expertise Deteo. Data science, Big Data expertise
Deteo. Data science, Big Data expertise
 
Cisco_Big_Data_Webinar_At-A-Glance_ABSOLUTE_FINAL_VERSION
Cisco_Big_Data_Webinar_At-A-Glance_ABSOLUTE_FINAL_VERSIONCisco_Big_Data_Webinar_At-A-Glance_ABSOLUTE_FINAL_VERSION
Cisco_Big_Data_Webinar_At-A-Glance_ABSOLUTE_FINAL_VERSION
 
[Webinar] Getting to Insights Faster: A Framework for Agile Big Data
[Webinar] Getting to Insights Faster: A Framework for Agile Big Data[Webinar] Getting to Insights Faster: A Framework for Agile Big Data
[Webinar] Getting to Insights Faster: A Framework for Agile Big Data
 
Using real time big data analytics for competitive advantage
 Using real time big data analytics for competitive advantage Using real time big data analytics for competitive advantage
Using real time big data analytics for competitive advantage
 

More from MSAdvAnalytics

Cortana Analytics Workshop: Predictive Maintenance in the IoT Era
Cortana Analytics Workshop: Predictive Maintenance in the IoT EraCortana Analytics Workshop: Predictive Maintenance in the IoT Era
Cortana Analytics Workshop: Predictive Maintenance in the IoT EraMSAdvAnalytics
 
Cortana Analytics Workshop: Cortana Analytics for Retail
Cortana Analytics Workshop: Cortana Analytics for RetailCortana Analytics Workshop: Cortana Analytics for Retail
Cortana Analytics Workshop: Cortana Analytics for RetailMSAdvAnalytics
 
Cortana Analytics Workshop: Cortana Analytics for Marketing
Cortana Analytics Workshop: Cortana Analytics for MarketingCortana Analytics Workshop: Cortana Analytics for Marketing
Cortana Analytics Workshop: Cortana Analytics for MarketingMSAdvAnalytics
 
Cortana Analytics Workshop: Real-Time Data Processing -- How Do I Choose the ...
Cortana Analytics Workshop: Real-Time Data Processing -- How Do I Choose the ...Cortana Analytics Workshop: Real-Time Data Processing -- How Do I Choose the ...
Cortana Analytics Workshop: Real-Time Data Processing -- How Do I Choose the ...MSAdvAnalytics
 
Cortana Analytics Workshop: Operationalizing Your End-to-End Analytics Solution
Cortana Analytics Workshop: Operationalizing Your End-to-End Analytics SolutionCortana Analytics Workshop: Operationalizing Your End-to-End Analytics Solution
Cortana Analytics Workshop: Operationalizing Your End-to-End Analytics SolutionMSAdvAnalytics
 
Cortana Analytics Workshop: Azure Data Catalog
Cortana Analytics Workshop: Azure Data CatalogCortana Analytics Workshop: Azure Data Catalog
Cortana Analytics Workshop: Azure Data CatalogMSAdvAnalytics
 
Cortana Analytics Workshop: Connecting Cortana Analytics Faster -- Any Source...
Cortana Analytics Workshop: Connecting Cortana Analytics Faster -- Any Source...Cortana Analytics Workshop: Connecting Cortana Analytics Faster -- Any Source...
Cortana Analytics Workshop: Connecting Cortana Analytics Faster -- Any Source...MSAdvAnalytics
 
Cortana Analytics Workshop: Real-World Data Collection for Cortana Analytics
Cortana Analytics Workshop: Real-World Data Collection for Cortana AnalyticsCortana Analytics Workshop: Real-World Data Collection for Cortana Analytics
Cortana Analytics Workshop: Real-World Data Collection for Cortana AnalyticsMSAdvAnalytics
 
Cortana Analytics Workshop: Insights and Predictions -- Integrating and Deplo...
Cortana Analytics Workshop: Insights and Predictions -- Integrating and Deplo...Cortana Analytics Workshop: Insights and Predictions -- Integrating and Deplo...
Cortana Analytics Workshop: Insights and Predictions -- Integrating and Deplo...MSAdvAnalytics
 
Cortana Analytics Workshop: Cortana Analytics -- Security, Privacy & Compliance
Cortana Analytics Workshop: Cortana Analytics -- Security, Privacy & ComplianceCortana Analytics Workshop: Cortana Analytics -- Security, Privacy & Compliance
Cortana Analytics Workshop: Cortana Analytics -- Security, Privacy & ComplianceMSAdvAnalytics
 
Cortana Analytics Workshop: Developing for Power BI
Cortana Analytics Workshop: Developing for Power BICortana Analytics Workshop: Developing for Power BI
Cortana Analytics Workshop: Developing for Power BIMSAdvAnalytics
 
Cortana Analytics Workshop: Milliman Integrate for Cortana Analytics
Cortana Analytics Workshop: Milliman Integrate for Cortana AnalyticsCortana Analytics Workshop: Milliman Integrate for Cortana Analytics
Cortana Analytics Workshop: Milliman Integrate for Cortana AnalyticsMSAdvAnalytics
 
Cortana Analytics Workshop: Intelligent Retail -- The Machine Learning Approach
Cortana Analytics Workshop: Intelligent Retail -- The Machine Learning ApproachCortana Analytics Workshop: Intelligent Retail -- The Machine Learning Approach
Cortana Analytics Workshop: Intelligent Retail -- The Machine Learning ApproachMSAdvAnalytics
 
Cortana Analytics Workshop: Azure Data Lake
Cortana Analytics Workshop: Azure Data LakeCortana Analytics Workshop: Azure Data Lake
Cortana Analytics Workshop: Azure Data LakeMSAdvAnalytics
 
Cortana Analytics Workshop: Using the Cortana Analytics Process
Cortana Analytics Workshop: Using the Cortana Analytics ProcessCortana Analytics Workshop: Using the Cortana Analytics Process
Cortana Analytics Workshop: Using the Cortana Analytics ProcessMSAdvAnalytics
 
Cortana Analytics Workshop: Building Next-Generation Smart Grids
Cortana Analytics Workshop: Building Next-Generation Smart GridsCortana Analytics Workshop: Building Next-Generation Smart Grids
Cortana Analytics Workshop: Building Next-Generation Smart GridsMSAdvAnalytics
 
Cortana Analytics Workshop: Deep Neural Networks
Cortana Analytics Workshop: Deep Neural NetworksCortana Analytics Workshop: Deep Neural Networks
Cortana Analytics Workshop: Deep Neural NetworksMSAdvAnalytics
 
Cortana Analytics Workshop: AI -- Assistive Intelligence
Cortana Analytics Workshop: AI -- Assistive IntelligenceCortana Analytics Workshop: AI -- Assistive Intelligence
Cortana Analytics Workshop: AI -- Assistive IntelligenceMSAdvAnalytics
 
Cortana Analytics Workshop: Big Data @ Microsoft
Cortana Analytics Workshop: Big Data @ MicrosoftCortana Analytics Workshop: Big Data @ Microsoft
Cortana Analytics Workshop: Big Data @ MicrosoftMSAdvAnalytics
 
Cortana Analytics Workshop: Power BI 2.0
Cortana Analytics Workshop: Power BI 2.0Cortana Analytics Workshop: Power BI 2.0
Cortana Analytics Workshop: Power BI 2.0MSAdvAnalytics
 

More from MSAdvAnalytics (20)

Cortana Analytics Workshop: Predictive Maintenance in the IoT Era
Cortana Analytics Workshop: Predictive Maintenance in the IoT EraCortana Analytics Workshop: Predictive Maintenance in the IoT Era
Cortana Analytics Workshop: Predictive Maintenance in the IoT Era
 
Cortana Analytics Workshop: Cortana Analytics for Retail
Cortana Analytics Workshop: Cortana Analytics for RetailCortana Analytics Workshop: Cortana Analytics for Retail
Cortana Analytics Workshop: Cortana Analytics for Retail
 
Cortana Analytics Workshop: Cortana Analytics for Marketing
Cortana Analytics Workshop: Cortana Analytics for MarketingCortana Analytics Workshop: Cortana Analytics for Marketing
Cortana Analytics Workshop: Cortana Analytics for Marketing
 
Cortana Analytics Workshop: Real-Time Data Processing -- How Do I Choose the ...
Cortana Analytics Workshop: Real-Time Data Processing -- How Do I Choose the ...Cortana Analytics Workshop: Real-Time Data Processing -- How Do I Choose the ...
Cortana Analytics Workshop: Real-Time Data Processing -- How Do I Choose the ...
 
Cortana Analytics Workshop: Operationalizing Your End-to-End Analytics Solution
Cortana Analytics Workshop: Operationalizing Your End-to-End Analytics SolutionCortana Analytics Workshop: Operationalizing Your End-to-End Analytics Solution
Cortana Analytics Workshop: Operationalizing Your End-to-End Analytics Solution
 
Cortana Analytics Workshop: Azure Data Catalog
Cortana Analytics Workshop: Azure Data CatalogCortana Analytics Workshop: Azure Data Catalog
Cortana Analytics Workshop: Azure Data Catalog
 
Cortana Analytics Workshop: Connecting Cortana Analytics Faster -- Any Source...
Cortana Analytics Workshop: Connecting Cortana Analytics Faster -- Any Source...Cortana Analytics Workshop: Connecting Cortana Analytics Faster -- Any Source...
Cortana Analytics Workshop: Connecting Cortana Analytics Faster -- Any Source...
 
Cortana Analytics Workshop: Real-World Data Collection for Cortana Analytics
Cortana Analytics Workshop: Real-World Data Collection for Cortana AnalyticsCortana Analytics Workshop: Real-World Data Collection for Cortana Analytics
Cortana Analytics Workshop: Real-World Data Collection for Cortana Analytics
 
Cortana Analytics Workshop: Insights and Predictions -- Integrating and Deplo...
Cortana Analytics Workshop: Insights and Predictions -- Integrating and Deplo...Cortana Analytics Workshop: Insights and Predictions -- Integrating and Deplo...
Cortana Analytics Workshop: Insights and Predictions -- Integrating and Deplo...
 
Cortana Analytics Workshop: Cortana Analytics -- Security, Privacy & Compliance
Cortana Analytics Workshop: Cortana Analytics -- Security, Privacy & ComplianceCortana Analytics Workshop: Cortana Analytics -- Security, Privacy & Compliance
Cortana Analytics Workshop: Cortana Analytics -- Security, Privacy & Compliance
 
Cortana Analytics Workshop: Developing for Power BI
Cortana Analytics Workshop: Developing for Power BICortana Analytics Workshop: Developing for Power BI
Cortana Analytics Workshop: Developing for Power BI
 
Cortana Analytics Workshop: Milliman Integrate for Cortana Analytics
Cortana Analytics Workshop: Milliman Integrate for Cortana AnalyticsCortana Analytics Workshop: Milliman Integrate for Cortana Analytics
Cortana Analytics Workshop: Milliman Integrate for Cortana Analytics
 
Cortana Analytics Workshop: Intelligent Retail -- The Machine Learning Approach
Cortana Analytics Workshop: Intelligent Retail -- The Machine Learning ApproachCortana Analytics Workshop: Intelligent Retail -- The Machine Learning Approach
Cortana Analytics Workshop: Intelligent Retail -- The Machine Learning Approach
 
Cortana Analytics Workshop: Azure Data Lake
Cortana Analytics Workshop: Azure Data LakeCortana Analytics Workshop: Azure Data Lake
Cortana Analytics Workshop: Azure Data Lake
 
Cortana Analytics Workshop: Using the Cortana Analytics Process
Cortana Analytics Workshop: Using the Cortana Analytics ProcessCortana Analytics Workshop: Using the Cortana Analytics Process
Cortana Analytics Workshop: Using the Cortana Analytics Process
 
Cortana Analytics Workshop: Building Next-Generation Smart Grids
Cortana Analytics Workshop: Building Next-Generation Smart GridsCortana Analytics Workshop: Building Next-Generation Smart Grids
Cortana Analytics Workshop: Building Next-Generation Smart Grids
 
Cortana Analytics Workshop: Deep Neural Networks
Cortana Analytics Workshop: Deep Neural NetworksCortana Analytics Workshop: Deep Neural Networks
Cortana Analytics Workshop: Deep Neural Networks
 
Cortana Analytics Workshop: AI -- Assistive Intelligence
Cortana Analytics Workshop: AI -- Assistive IntelligenceCortana Analytics Workshop: AI -- Assistive Intelligence
Cortana Analytics Workshop: AI -- Assistive Intelligence
 
Cortana Analytics Workshop: Big Data @ Microsoft
Cortana Analytics Workshop: Big Data @ MicrosoftCortana Analytics Workshop: Big Data @ Microsoft
Cortana Analytics Workshop: Big Data @ Microsoft
 
Cortana Analytics Workshop: Power BI 2.0
Cortana Analytics Workshop: Power BI 2.0Cortana Analytics Workshop: Power BI 2.0
Cortana Analytics Workshop: Power BI 2.0
 

Recently uploaded

Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesTimothy Spann
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...limedy534
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Thomas Poetter
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanMYRABACSAFRA2
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxMike Bennett
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degreeyuu sss
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfchwongval
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...Amil Baba Dawood bangali
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxaleedritatuxx
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 

Recently uploaded (20)

Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptx
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdf
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 

Data complexity: variety and velocity with HDInsight

  • 1.
  • 2.
  • 3.
  • 4.
  • 5. Data complexity: variety and velocity Petabytes
  • 6.
  • 7. Massive Compute and Storage Deployment expertise Data of all Volume Variety, Velocity Speed Scale Economics Always Up, Always On Open and flexible Time to value
  • 9. HDInsight Script SQL NoSQL StreamingBatch Map reduce In Memory Core Engine
  • 10. • Microsoft’s cloud Hadoop offering • 100% open source Apache Hadoop • Built on the latest releases for Hadoop • Up and running in minutes with no hardware to deploy • .NET and Java skills and deep integration to Visual Studio • Utilize familiar BI tools for analysis including Microsoft Excel • 99.9% Enterprise Service Level Agreement HDInsight Script SQL NoSQL StreamingBatch Map reduce In Memory Core Engine
  • 11. HDInsight Script SQL NoSQL StreamingBatch Map reduce In Memory Core Engine Microsoft contribution to Apache code
  • 12. Data Node Data Node Data Node Data Node Task Tracker Task Tracker Task Tracker Task Tracker Name Node Job Tracker HMaster Coordination Region Server Region Server Region Server Region Server • Random, fast (realtime) read/write access to your Big Data. • Host very large tables (billions of rows X millions of columns) on clusters of commodity hardware. • Runs on top of the Hadoop Distributed File System (HDFS) • Provides flexibility in that new columns can be added to column families at any time HDInsight Script SQL NoSQL StreamingBatch Map reduce In Memory Core Engine
  • 13. Stream processin g Search and query Data analytics (Excel) Web/thick client dashboards Devices to take action RabbitMQ / ActiveMQ HDInsight Script SQL NoSQL StreamingBatch Map reduce In Memory Core Engine
  • 14. • Single execution model for multiple tasks (SQL queries, Streaming, Machine Learning, and Graph) • Processing up to 100x faster performance • Developer friendly (Java, Python, Scala) • BI tool of choice (Power BI, Tabelau, Qlik, SAP) • Notebook experience (Jupyter/iPython, Zeppelin) HDInsight Script SQL NoSQL StreamingBatch Map reduce In Memory Core Engine Spark SQL Spark Streaming Machine Learning Graph HDInsight Script SQL NoSQL StreamingBatch Map reduce In Memory Core Engine
  • 15. Spark for Azure HDInsight In-memory computation engine – Fully managed
  • 16.
  • 17. • Managed & supported by Microsoft • Familiarity of Windows • Re-use common tools, documentation, samples from Hadoop/Linux ecosystem • Add Hadoop projects that were authored on Linux to HDInsight • Easier transition from on-premise to cloud
  • 18.
  • 19.
  • 20. Partner Spotlight: AtScale Analysts Use Traditional BI Tools Against HDInsight
  • 21. • HDFS For the Cloud • Unlimited Storage, Petabyte Files • Optimized for Massive Throughput • High frequency, low latency, read immediately • Managed and secured
  • 22.
  • 24.
  • 25.
  • 26.
  • 27.
  • 28. Neudesic partnered with one of the nation's largest utility companies that recently deployed Smart Utility Meters for power customers, nearly a million meters sending usage data every 15 minutes. The result: an Azure hybrid big data processing solution that enabled the customer to perform gap analytics: a process for identifying gaps that exist in the power usage readings, over 7x faster than their previous solution! Billions of Smart Meter reads get processed to identify the nature and duration of the gaps to mitigate revenue losses. Smart Meters Business Rules Processing BI Layer Blob Storage HDInsightInput Processed Output data ELT Local SQL DB for Customer and other confidential data Extract processed data from blob storage AZCopy AZCopy SSIS Input files
  • 29.
  • 30. Big Data in Retail • Clickstream analytics • Online recommendation engine • 360° view of the customer • Analyze brand sentiment • Localized, personalized promotions • Optimal store layout Leading computer manufacturer in world • Use clickstream to deliver custom website ecommerce experience • Targeted ads for abandoned carts • Use unstructured data from website and social for data mining • Combine w/sales data for 360 view • Gather data from table-side devices at restaurants • Predict promotions/offers and content to upsell to guests • Gather social media sentiment from customer feedback • Combined with POS data, can determine right product mix Leading Multi-national Retailer • Track weather information (temperature/forecast) to predict shelf space for different seasons • Sentiment analysis on feedback Leading clothing online retailer • Use clickstream to understand who is viewing their site • Building recommendation engine based on users’ clickpaths
  • 31. Ziosk turned to Microsoft gold partner, Artis Consulting to deploy a hybrid deployment consisting of the Analytics Platform System, Azure HDInsight, Power BI, and Azure Machine Learning “Until now, we haven’t had the ability to optimize the guest experience based on their specific interactions with the devices. With Azure, we can close the loop.” Kevin Mowry Ziosk Chief Software Architect
  • 32. Big Data in Health • Predictive Analysis of Patient Health & Clinical Decision Support • Population, risk, and Care management • Real-time quality measures to assist providers w/regulatory requirements • Medical research data (eg. genomics) • Recruit cohorts for pharmaceutical trials • Process large volumes of data from any healthcare provider EHR system • Assist in showing compliance • Store 7-30 years of data to meet audit requirements • Scan handwritten notes and do natural language processing • Analyze if symptoms might map to bigger outbreak • Collect clinical trial data (from automated equipment, sensors) • Find patterns on this data (chemical compositions, enzymes) • Process 6 years worth of data in a few hours without any infrastructure
  • 33. Big Data Financial Services • New account risk screens • Fraud prevention • Trading risk • Maximize deposit spread • Insurance underwriting • Accelerate loan processing • Actively monitor currencies used by UK manufacturers in supply chain to do risk analysis • Monitor UK GDP to help customers stay on top of economic trends • Needed to handle increasing amounts of finance, compliance, and legal data from trading operations • Trading data drives strategic decisions • Track customer feedback on social media and on their blog posts/website to understand loyalty • Predict at-risks clients to reach out to • Process data for actuaries to analyze results to understand risks for insurance companies • Milliman’s application understands relationships between people, process, and technology to manage risk
  • 34. Tangerine partners with Microsoft to build a solution with Analytics Platform System for the data warehouse and uses PolyBase to query Azure HDInsight in the cloud. “With pre-built integration using PolyBase to query both the relational data warehouse and Hadoop in the cloud, the solution will allow us to reap the benefits of both relational and non-relational data regardless of where it lives.”
  • 35.
  • 36.