SlideShare a Scribd company logo
1 of 39
Robin Vermeirsch
Securing SaaS applications
Who is using SaaS applications today?
Who knows what users are doing in the cloud?
Lack of visibility
Compliance
Threat prevention
Data security
Security in the SaaS world
• Security Policies/requirements are developed for on premises
solutions.
• In many cases SaaS applications are a initiated by the business
• SaaS providers implement ‘some’ security, but does it fit my needs?
• Limited control/visibility what users are doing in the cloud.
• No visibility over anomalies over different applications.
Security in the SaaS world
Does not meet
requirements
SHOWSTOPPER
Requirements
met by adding
control
COMPENSATED
Requirements
met by SaaS
provider
ACCEPTABLE
Change architecture
Adjustment expectations
Src: http://www.gartner.com/webinar/3100619
Evolution in security
Transport
• IP Firewalling
• Segmentation
Protocol inspection
• Proxies
• Deep inspection
Application Protection
• MDM
• Web Application
Firewalls
Data Centric
Audit & Protection
(DCAP)
• CASB
• SPSM
• CDPG
+ Unmanaged devices
Shadow IT
Company data is spread over multiple providers
 How to protect DATA?
Note trend of
ABAC in DEV
What do we need?
But how?
CASB (Gartner)
• on-premises, or cloud-based security policy enforcement points
• placed between cloud service consumers and cloud service providers
• to combine and interject enterprise security policies as the cloud-
based resources are accessed.
• consolidate multiple types of security policy enforcement.
http://www.gartner.com/it-glossary/cloud-access-security-
brokers-casbs
Options to add security
SaaS
IaaS /PaaS
SPSM
Saas Platform Security
Management
CASB
Cloud Access Security
Brokers
CDPG
Cloud Data Protection Gateway
Encryption
Tokenization
Masking
User activity
monitoring
Data discovery
DLP
Remediation
Usage discovery
User activity monitoring
DLP (passive and active)
User activity blocking (real time)
Data discovery
SSO
Vendors: http://www.gartner.com/webinar/3100619
REALTIME RETROACTIVE
Architecture Options
• Using forward proxies
• Integration existing proxies
• Placing Reverse proxies
• Using Endpoint agents
+ IDaaS/MDM/Log
integrations
Hackers/
unkown
endpoints
Approved
Endpoint
Unknown SaaSApproved SaaS
Reverse Proxy
Forward Proxy
Control
Access
&
Actions
Existing Proxy
Block
Actions
Architecture Solutions
• CASB (Cloud Access Security Brokers)
• Forward Proxy
• Reverse Proxy
• API Integration
• CDPG (Cloud Data Prot. GW)
• Forward Proxy
• Reverse Proxy
• SPSM (SaaS Platform Mgmt)
• API integration
Hackers/
unkown
endpoints
Approved
Endpoint
Unknown SaaSApproved SaaS
Reverse Proxy
Forward Proxy
Control
Access
&
Actions
Existing Proxy
Block
Actions
CASB
SPSM
CDPG
Where should you look at?
Impact on
functionality &
operational risk
Src: http://www.gartner.com/webinar/3100619
Implementation strategy
Implementation strategy
Src: http://www.gartner.com/webinar/3100619
Start small and add functionality
Benefits implementing CASB
Call to action
• Detect shadow IT today (=High Risk)
• Start controlling access to SaaS applications
• Get visibility over user activity in SaaS applications
• Protect your company data in SaaS applications
Xylos Cloud Services
PaaS: Debunking myths on
data & analytics in the cloud
Tim Jacobs – 25 Feb 2016
Agenda
• PaaS?
• Myth #1
• Myth #2
• Myth #3
• Conclusions
PaaS?
• Provides a platform for:
• Development (cloud native apps)
• Content distribution (media / CDN)
• Internet of Things
• Automation
• Data processing & analytics
Data and data analytics?
Prescriptive
analytics
Predictive
Analytics
Diagnostic Analysis
Descriptive Analytics
Data Collection
Big Data
IoT
Hypes linked together
Analytics
IoTBig Data
Debunking myths on data & analytics in the cloud
• Myth #1 – Predictive analytics & big data are just BI on steroids
• Myth #2 – All my data needs to go to the cloud! Y0u f00lz cr4zy?
• Myth #3 – You need to hold 3 PhD’s to do predictive analytics
Myth
confirmed?
Is it
plausible?
Blow everything up
No No
Yes
Yes
Myth Debunking Flowchart
Agenda
• PaaS?
• Myth #1
“Predictive analytics & big data are just BI on steroids”
• Myth #2
• Myth #3
• Conclusions
New in the data landscape…
1. “Big” data
2. “Artificial Intelligence” & learning from data
3. Fast & ubiquitious network connectivity
Evolution of data
(R)Evolution in data, the questions & tooling
Standard reports
Ad-hoc reports
Query & drilldown
Alerts
Statistical analysis
Forecasting/extrapolation
Predictive modeling
Optimization
Degree of intelligence
Value
Descriptive
Analytics
Predictive
Analytics
What happened?
How many? How often? Where?
Where exactly is the problem?
What actions are needed?
Why is this happening?
What if these trends continue?
What will happen next?
What is the best that can happen?
Traditional BI questions
ETL Tools, SQL & variants
Big data, or not.
New type of questions
New tooling, ELT,
machine learning, …
Big Data Traditional BI
Predictive
Analytics
• BI and Predictive Analytics worlds are converging :
• BI platform extensions to Big Data-esque & Advanced Analytics-y operations
• Big Data tooling gets SQL-like interfaces:
Drill, Impala, Hive, SparkSQL, HAWQ, Presto, Vortex, …
• Big Data tooling can do descriptive and predictive analytics:
MLLib, H2O, Oryx, Mahout, SAMOA, FlinkML, …
(R)Evolution & convolution
Agenda
• PaaS?
• Myth #1
• Myth #2
“All my data needs to go to the cloud! Y0u f00lz cr4zy?”
• Myth #3
• Conclusions
On-premises or cloud?
• Advantages of cloud:
• Start fast & fail fast
• Easy consumption of created data models
• Democratic in pricing & availability of algorithms
• Attention points for cloud (mostly exceptions!):
• Data privacy: legislation ↔ provider
• Data volume & velocity: bandwidth
Getting data to the cloud
• Transfer existing data:
• “Just upload the CSV”
• Azure Data Factory
(SQL, Oracle, DB2, MySQL, Sybas, PostgreSQL, ODBC, HDFS, … )
• Capture event/streaming data:
• Eventhub / IoT hub
Scheduling / transformation
Event
Hub
Stream
A.
Azure Data Factory
Blob
Storage
Data Lake
Data
Warehouse
Data BaseDirect
Data
Mgmt
Gateway
File
Data Base
Data
Warehouse IoT IoT
IoT IoT
© The Cloud ®™
Conclusion
• Compliant solutions available through provider
• Subsetting & anonimization easily possible with data transfer tools
Agenda
• PaaS?
• Myth #1
• Myth #2
• Myth #3
“You need to hold 3 PhD’s to do advanced analytics”
• Conclusions
Predictive Analytics
• Azure ML studio has a low learning curve
• Modular, drag & drop
• Pre-built machine learning algoritms with meaningful default settings
• Use case: very easy to publish
“predictive engine” for your
own applications
• Do you need expert knowledge?
• Is the out of the box 70% accuracy sufficient?
• Or do you need 95% prediction accuracy?
Example: predicting Belgian house prices
Model Features Prediction
accuracy
Linear 1 Just based on m2 living area 48,40%
Linear 2 m2 living area & postal code 69,43%
Linear 3 m2 living area, postal code, #
bedrooms, house type
70,36%
Decision Tree 1 m2 living area, postal code, #
bedrooms, house type
70,41%
Linear 4 Linear in: postal code, # bedrooms,
house type
3rd power in: m2 living area
71,17%
Agenda
• PaaS?
• Myth #1
• Myth #2
• Myth #3
• Conclusions
Conclusions
• Three valid use cases for data in the cloud:
• Reporting & analytics on big data sets, with new types of intelligence
• Storing and synchronizing (subsets of) your data in the cloud
• Adding intelligence to existing applications you develop
• Advantages of cloud:
• Easy to start, quick to get to results, fast decommissioning once completed
• Democratizing of tools & algorithmes lowers starting threshold
• Xylos can help with:
• advanced expertise (data scientists)
• data collection & storage expertise
• data consumption / visualization expertise
Xylos Cloud Services

More Related Content

What's hot

Cloud security, Cloud security Access broker, CSAB's 4 pillar, deployment mode
Cloud security, Cloud security Access broker, CSAB's 4 pillar, deployment modeCloud security, Cloud security Access broker, CSAB's 4 pillar, deployment mode
Cloud security, Cloud security Access broker, CSAB's 4 pillar, deployment modeHimani Singh
 
5 Highest-Impact CASB Use Cases
5 Highest-Impact CASB Use Cases5 Highest-Impact CASB Use Cases
5 Highest-Impact CASB Use CasesNetskope
 
Unleash Team Productivity with Real-Time Operations (DEV203-S) - AWS re:Inven...
Unleash Team Productivity with Real-Time Operations (DEV203-S) - AWS re:Inven...Unleash Team Productivity with Real-Time Operations (DEV203-S) - AWS re:Inven...
Unleash Team Productivity with Real-Time Operations (DEV203-S) - AWS re:Inven...Amazon Web Services
 
Managing Effective Security Policies Across Hybrid and Multi-Cloud Environment
Managing Effective Security Policies Across Hybrid and Multi-Cloud EnvironmentManaging Effective Security Policies Across Hybrid and Multi-Cloud Environment
Managing Effective Security Policies Across Hybrid and Multi-Cloud EnvironmentAlgoSec
 
The evolution of IT in a cloud world
The evolution of IT in a cloud worldThe evolution of IT in a cloud world
The evolution of IT in a cloud worldZscaler
 
63 Requirements for CASB
63 Requirements for CASB63 Requirements for CASB
63 Requirements for CASBKyle Watson
 
Cloud Security 101 by Madhav Chablani
Cloud Security 101 by Madhav ChablaniCloud Security 101 by Madhav Chablani
Cloud Security 101 by Madhav ChablaniOWASP Delhi
 
Rethinking Cybersecurity for the Digital Transformation Era
Rethinking Cybersecurity for the Digital Transformation EraRethinking Cybersecurity for the Digital Transformation Era
Rethinking Cybersecurity for the Digital Transformation EraZscaler
 
Workshop on CASB Part 2
Workshop on CASB Part 2Workshop on CASB Part 2
Workshop on CASB Part 2Priyanka Aash
 
Maximize your cloud app control with Microsoft MCAS and Zscaler
Maximize your cloud app control with Microsoft MCAS and ZscalerMaximize your cloud app control with Microsoft MCAS and Zscaler
Maximize your cloud app control with Microsoft MCAS and ZscalerAnkit Dua
 
Webinar compiled powerpoint
Webinar compiled powerpointWebinar compiled powerpoint
Webinar compiled powerpointCloudPassage
 
Security Breakout Session
Security Breakout Session Security Breakout Session
Security Breakout Session Splunk
 
Best Practices for Driving Software Quality through a Federated Application S...
Best Practices for Driving Software Quality through a Federated Application S...Best Practices for Driving Software Quality through a Federated Application S...
Best Practices for Driving Software Quality through a Federated Application S...DevOps.com
 
Secure Data Sharing in OpenShift Environments
Secure Data Sharing in OpenShift EnvironmentsSecure Data Sharing in OpenShift Environments
Secure Data Sharing in OpenShift EnvironmentsDevOps.com
 
The secure, direct to-internet branch
The secure, direct to-internet branchThe secure, direct to-internet branch
The secure, direct to-internet branchZscaler
 
Capgemini Oracle Cloud Access Security Broker
Capgemini Oracle Cloud Access Security BrokerCapgemini Oracle Cloud Access Security Broker
Capgemini Oracle Cloud Access Security BrokerJohan Louwers
 
Secure remote access to AWS your users will love
Secure remote access to AWS your users will loveSecure remote access to AWS your users will love
Secure remote access to AWS your users will loveZscaler
 

What's hot (20)

Biznet Gio Presentation - Database Security
Biznet Gio Presentation - Database SecurityBiznet Gio Presentation - Database Security
Biznet Gio Presentation - Database Security
 
Biznet Gio Presentation - Cloud Computing
Biznet Gio Presentation - Cloud ComputingBiznet Gio Presentation - Cloud Computing
Biznet Gio Presentation - Cloud Computing
 
Cloud security, Cloud security Access broker, CSAB's 4 pillar, deployment mode
Cloud security, Cloud security Access broker, CSAB's 4 pillar, deployment modeCloud security, Cloud security Access broker, CSAB's 4 pillar, deployment mode
Cloud security, Cloud security Access broker, CSAB's 4 pillar, deployment mode
 
5 Highest-Impact CASB Use Cases
5 Highest-Impact CASB Use Cases5 Highest-Impact CASB Use Cases
5 Highest-Impact CASB Use Cases
 
Unleash Team Productivity with Real-Time Operations (DEV203-S) - AWS re:Inven...
Unleash Team Productivity with Real-Time Operations (DEV203-S) - AWS re:Inven...Unleash Team Productivity with Real-Time Operations (DEV203-S) - AWS re:Inven...
Unleash Team Productivity with Real-Time Operations (DEV203-S) - AWS re:Inven...
 
Managing Effective Security Policies Across Hybrid and Multi-Cloud Environment
Managing Effective Security Policies Across Hybrid and Multi-Cloud EnvironmentManaging Effective Security Policies Across Hybrid and Multi-Cloud Environment
Managing Effective Security Policies Across Hybrid and Multi-Cloud Environment
 
The evolution of IT in a cloud world
The evolution of IT in a cloud worldThe evolution of IT in a cloud world
The evolution of IT in a cloud world
 
Governing in the Cloud
Governing in the CloudGoverning in the Cloud
Governing in the Cloud
 
63 Requirements for CASB
63 Requirements for CASB63 Requirements for CASB
63 Requirements for CASB
 
Cloud Security 101 by Madhav Chablani
Cloud Security 101 by Madhav ChablaniCloud Security 101 by Madhav Chablani
Cloud Security 101 by Madhav Chablani
 
Rethinking Cybersecurity for the Digital Transformation Era
Rethinking Cybersecurity for the Digital Transformation EraRethinking Cybersecurity for the Digital Transformation Era
Rethinking Cybersecurity for the Digital Transformation Era
 
Workshop on CASB Part 2
Workshop on CASB Part 2Workshop on CASB Part 2
Workshop on CASB Part 2
 
Maximize your cloud app control with Microsoft MCAS and Zscaler
Maximize your cloud app control with Microsoft MCAS and ZscalerMaximize your cloud app control with Microsoft MCAS and Zscaler
Maximize your cloud app control with Microsoft MCAS and Zscaler
 
Webinar compiled powerpoint
Webinar compiled powerpointWebinar compiled powerpoint
Webinar compiled powerpoint
 
Security Breakout Session
Security Breakout Session Security Breakout Session
Security Breakout Session
 
Best Practices for Driving Software Quality through a Federated Application S...
Best Practices for Driving Software Quality through a Federated Application S...Best Practices for Driving Software Quality through a Federated Application S...
Best Practices for Driving Software Quality through a Federated Application S...
 
Secure Data Sharing in OpenShift Environments
Secure Data Sharing in OpenShift EnvironmentsSecure Data Sharing in OpenShift Environments
Secure Data Sharing in OpenShift Environments
 
The secure, direct to-internet branch
The secure, direct to-internet branchThe secure, direct to-internet branch
The secure, direct to-internet branch
 
Capgemini Oracle Cloud Access Security Broker
Capgemini Oracle Cloud Access Security BrokerCapgemini Oracle Cloud Access Security Broker
Capgemini Oracle Cloud Access Security Broker
 
Secure remote access to AWS your users will love
Secure remote access to AWS your users will loveSecure remote access to AWS your users will love
Secure remote access to AWS your users will love
 

Similar to Securing SaaS Apps with CASB and Cloud Security

Bridging the Gap: Analyzing Data in and Below the Cloud
Bridging the Gap: Analyzing Data in and Below the CloudBridging the Gap: Analyzing Data in and Below the Cloud
Bridging the Gap: Analyzing Data in and Below the CloudInside Analysis
 
Challenges of Operationalising Data Science in Production
Challenges of Operationalising Data Science in ProductionChallenges of Operationalising Data Science in Production
Challenges of Operationalising Data Science in Productioniguazio
 
TechEvent 2019: More Agile, More AI, More Cloud! Less Work?!; Oliver Dörr - T...
TechEvent 2019: More Agile, More AI, More Cloud! Less Work?!; Oliver Dörr - T...TechEvent 2019: More Agile, More AI, More Cloud! Less Work?!; Oliver Dörr - T...
TechEvent 2019: More Agile, More AI, More Cloud! Less Work?!; Oliver Dörr - T...Trivadis
 
Building a Real-Time Security Application Using Log Data and Machine Learning...
Building a Real-Time Security Application Using Log Data and Machine Learning...Building a Real-Time Security Application Using Log Data and Machine Learning...
Building a Real-Time Security Application Using Log Data and Machine Learning...Sri Ambati
 
RightScale Roadtrip - Accelerate to Cloud
RightScale Roadtrip - Accelerate to CloudRightScale Roadtrip - Accelerate to Cloud
RightScale Roadtrip - Accelerate to CloudRightScale
 
Moving Targets: Harnessing Real-time Value from Data in Motion
Moving Targets: Harnessing Real-time Value from Data in Motion Moving Targets: Harnessing Real-time Value from Data in Motion
Moving Targets: Harnessing Real-time Value from Data in Motion Inside Analysis
 
[Webinar] Getting to Insights Faster: A Framework for Agile Big Data
[Webinar] Getting to Insights Faster: A Framework for Agile Big Data[Webinar] Getting to Insights Faster: A Framework for Agile Big Data
[Webinar] Getting to Insights Faster: A Framework for Agile Big DataInfochimps, a CSC Big Data Business
 
Using Machine Learning to Understand Kafka Runtime Behavior (Shivanath Babu, ...
Using Machine Learning to Understand Kafka Runtime Behavior (Shivanath Babu, ...Using Machine Learning to Understand Kafka Runtime Behavior (Shivanath Babu, ...
Using Machine Learning to Understand Kafka Runtime Behavior (Shivanath Babu, ...confluent
 
What is Data as a Service by T-Mobile Principle Technical PM
What is Data as a Service by T-Mobile Principle Technical PMWhat is Data as a Service by T-Mobile Principle Technical PM
What is Data as a Service by T-Mobile Principle Technical PMProduct School
 
Power to the People: A Stack to Empower Every User to Make Data-Driven Decisions
Power to the People: A Stack to Empower Every User to Make Data-Driven DecisionsPower to the People: A Stack to Empower Every User to Make Data-Driven Decisions
Power to the People: A Stack to Empower Every User to Make Data-Driven DecisionsLooker
 
2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics2022 Trends in Enterprise Analytics
2022 Trends in Enterprise AnalyticsDATAVERSITY
 
RightScale Roadtrip - Accelerate To Cloud
RightScale Roadtrip - Accelerate To CloudRightScale Roadtrip - Accelerate To Cloud
RightScale Roadtrip - Accelerate To CloudRightScale
 
Multi-Cloud Breaks IT Ops: Best Practices to De-Risk Your Cloud Strategy
Multi-Cloud Breaks IT Ops: Best Practices to De-Risk Your Cloud StrategyMulti-Cloud Breaks IT Ops: Best Practices to De-Risk Your Cloud Strategy
Multi-Cloud Breaks IT Ops: Best Practices to De-Risk Your Cloud StrategyThousandEyes
 
Feature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine LearningFeature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine LearningProvectus
 
Near realtime AI deployment with huge data and super low latency - Levi Brack...
Near realtime AI deployment with huge data and super low latency - Levi Brack...Near realtime AI deployment with huge data and super low latency - Levi Brack...
Near realtime AI deployment with huge data and super low latency - Levi Brack...Sri Ambati
 
Tech trends - Get some of these skills to stay current
Tech trends - Get some of these skills to stay currentTech trends - Get some of these skills to stay current
Tech trends - Get some of these skills to stay currentSandeep Bhatnagar
 
Finding the needle in the haystack: how Nestle is leveraging big data to defe...
Finding the needle in the haystack: how Nestle is leveraging big data to defe...Finding the needle in the haystack: how Nestle is leveraging big data to defe...
Finding the needle in the haystack: how Nestle is leveraging big data to defe...Big Data Spain
 
DT Company Overview January 2013
DT Company Overview January 2013DT Company Overview January 2013
DT Company Overview January 2013DataTactics
 
Introduction to Data Engineering
Introduction to Data EngineeringIntroduction to Data Engineering
Introduction to Data EngineeringDurga Gadiraju
 

Similar to Securing SaaS Apps with CASB and Cloud Security (20)

Bridging the Gap: Analyzing Data in and Below the Cloud
Bridging the Gap: Analyzing Data in and Below the CloudBridging the Gap: Analyzing Data in and Below the Cloud
Bridging the Gap: Analyzing Data in and Below the Cloud
 
Challenges of Operationalising Data Science in Production
Challenges of Operationalising Data Science in ProductionChallenges of Operationalising Data Science in Production
Challenges of Operationalising Data Science in Production
 
TechEvent 2019: More Agile, More AI, More Cloud! Less Work?!; Oliver Dörr - T...
TechEvent 2019: More Agile, More AI, More Cloud! Less Work?!; Oliver Dörr - T...TechEvent 2019: More Agile, More AI, More Cloud! Less Work?!; Oliver Dörr - T...
TechEvent 2019: More Agile, More AI, More Cloud! Less Work?!; Oliver Dörr - T...
 
Building a Real-Time Security Application Using Log Data and Machine Learning...
Building a Real-Time Security Application Using Log Data and Machine Learning...Building a Real-Time Security Application Using Log Data and Machine Learning...
Building a Real-Time Security Application Using Log Data and Machine Learning...
 
RightScale Roadtrip - Accelerate to Cloud
RightScale Roadtrip - Accelerate to CloudRightScale Roadtrip - Accelerate to Cloud
RightScale Roadtrip - Accelerate to Cloud
 
Moving Targets: Harnessing Real-time Value from Data in Motion
Moving Targets: Harnessing Real-time Value from Data in Motion Moving Targets: Harnessing Real-time Value from Data in Motion
Moving Targets: Harnessing Real-time Value from Data in Motion
 
[Webinar] Getting to Insights Faster: A Framework for Agile Big Data
[Webinar] Getting to Insights Faster: A Framework for Agile Big Data[Webinar] Getting to Insights Faster: A Framework for Agile Big Data
[Webinar] Getting to Insights Faster: A Framework for Agile Big Data
 
Using Machine Learning to Understand Kafka Runtime Behavior (Shivanath Babu, ...
Using Machine Learning to Understand Kafka Runtime Behavior (Shivanath Babu, ...Using Machine Learning to Understand Kafka Runtime Behavior (Shivanath Babu, ...
Using Machine Learning to Understand Kafka Runtime Behavior (Shivanath Babu, ...
 
What is Data as a Service by T-Mobile Principle Technical PM
What is Data as a Service by T-Mobile Principle Technical PMWhat is Data as a Service by T-Mobile Principle Technical PM
What is Data as a Service by T-Mobile Principle Technical PM
 
Power to the People: A Stack to Empower Every User to Make Data-Driven Decisions
Power to the People: A Stack to Empower Every User to Make Data-Driven DecisionsPower to the People: A Stack to Empower Every User to Make Data-Driven Decisions
Power to the People: A Stack to Empower Every User to Make Data-Driven Decisions
 
2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics
 
RightScale Roadtrip - Accelerate To Cloud
RightScale Roadtrip - Accelerate To CloudRightScale Roadtrip - Accelerate To Cloud
RightScale Roadtrip - Accelerate To Cloud
 
Multi-Cloud Breaks IT Ops: Best Practices to De-Risk Your Cloud Strategy
Multi-Cloud Breaks IT Ops: Best Practices to De-Risk Your Cloud StrategyMulti-Cloud Breaks IT Ops: Best Practices to De-Risk Your Cloud Strategy
Multi-Cloud Breaks IT Ops: Best Practices to De-Risk Your Cloud Strategy
 
Euro IT Group
Euro IT GroupEuro IT Group
Euro IT Group
 
Feature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine LearningFeature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine Learning
 
Near realtime AI deployment with huge data and super low latency - Levi Brack...
Near realtime AI deployment with huge data and super low latency - Levi Brack...Near realtime AI deployment with huge data and super low latency - Levi Brack...
Near realtime AI deployment with huge data and super low latency - Levi Brack...
 
Tech trends - Get some of these skills to stay current
Tech trends - Get some of these skills to stay currentTech trends - Get some of these skills to stay current
Tech trends - Get some of these skills to stay current
 
Finding the needle in the haystack: how Nestle is leveraging big data to defe...
Finding the needle in the haystack: how Nestle is leveraging big data to defe...Finding the needle in the haystack: how Nestle is leveraging big data to defe...
Finding the needle in the haystack: how Nestle is leveraging big data to defe...
 
DT Company Overview January 2013
DT Company Overview January 2013DT Company Overview January 2013
DT Company Overview January 2013
 
Introduction to Data Engineering
Introduction to Data EngineeringIntroduction to Data Engineering
Introduction to Data Engineering
 

Recently uploaded

Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...
Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...
Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...NETWAYS
 
Call Girls in Sarojini Nagar Market Delhi 💯 Call Us 🔝8264348440🔝
Call Girls in Sarojini Nagar Market Delhi 💯 Call Us 🔝8264348440🔝Call Girls in Sarojini Nagar Market Delhi 💯 Call Us 🔝8264348440🔝
Call Girls in Sarojini Nagar Market Delhi 💯 Call Us 🔝8264348440🔝soniya singh
 
Motivation and Theory Maslow and Murray pdf
Motivation and Theory Maslow and Murray pdfMotivation and Theory Maslow and Murray pdf
Motivation and Theory Maslow and Murray pdfakankshagupta7348026
 
OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...
OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...
OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...NETWAYS
 
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara ServicesVVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara ServicesPooja Nehwal
 
call girls in delhi malviya nagar @9811711561@
call girls in delhi malviya nagar @9811711561@call girls in delhi malviya nagar @9811711561@
call girls in delhi malviya nagar @9811711561@vikas rana
 
WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )
WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )
WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )Pooja Nehwal
 
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024eCommerce Institute
 
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night EnjoyCall Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night EnjoyPooja Nehwal
 
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdfCTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdfhenrik385807
 
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...Pooja Nehwal
 
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779Delhi Call girls
 
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...henrik385807
 
ANCHORING SCRIPT FOR A CULTURAL EVENT.docx
ANCHORING SCRIPT FOR A CULTURAL EVENT.docxANCHORING SCRIPT FOR A CULTURAL EVENT.docx
ANCHORING SCRIPT FOR A CULTURAL EVENT.docxNikitaBankoti2
 
Open Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdf
Open Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdfOpen Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdf
Open Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdfhenrik385807
 
Russian Call Girls in Kolkata Vaishnavi 🤌 8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Vaishnavi 🤌  8250192130 🚀 Vip Call Girls KolkataRussian Call Girls in Kolkata Vaishnavi 🤌  8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Vaishnavi 🤌 8250192130 🚀 Vip Call Girls Kolkataanamikaraghav4
 
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptx
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptxMohammad_Alnahdi_Oral_Presentation_Assignment.pptx
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptxmohammadalnahdi22
 
Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...
Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...
Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...Hasting Chen
 
Microsoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AIMicrosoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AITatiana Gurgel
 
Genesis part 2 Isaiah Scudder 04-24-2024.pptx
Genesis part 2 Isaiah Scudder 04-24-2024.pptxGenesis part 2 Isaiah Scudder 04-24-2024.pptx
Genesis part 2 Isaiah Scudder 04-24-2024.pptxFamilyWorshipCenterD
 

Recently uploaded (20)

Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...
Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...
Open Source Camp Kubernetes 2024 | Monitoring Kubernetes With Icinga by Eric ...
 
Call Girls in Sarojini Nagar Market Delhi 💯 Call Us 🔝8264348440🔝
Call Girls in Sarojini Nagar Market Delhi 💯 Call Us 🔝8264348440🔝Call Girls in Sarojini Nagar Market Delhi 💯 Call Us 🔝8264348440🔝
Call Girls in Sarojini Nagar Market Delhi 💯 Call Us 🔝8264348440🔝
 
Motivation and Theory Maslow and Murray pdf
Motivation and Theory Maslow and Murray pdfMotivation and Theory Maslow and Murray pdf
Motivation and Theory Maslow and Murray pdf
 
OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...
OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...
OSCamp Kubernetes 2024 | SRE Challenges in Monolith to Microservices Shift at...
 
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara ServicesVVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
 
call girls in delhi malviya nagar @9811711561@
call girls in delhi malviya nagar @9811711561@call girls in delhi malviya nagar @9811711561@
call girls in delhi malviya nagar @9811711561@
 
WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )
WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )
WhatsApp 📞 9892124323 ✅Call Girls In Juhu ( Mumbai )
 
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
 
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night EnjoyCall Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
 
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdfCTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf
CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf
 
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...
 
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
 
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...
CTAC 2024 Valencia - Sven Zoelle - Most Crucial Invest to Digitalisation_slid...
 
ANCHORING SCRIPT FOR A CULTURAL EVENT.docx
ANCHORING SCRIPT FOR A CULTURAL EVENT.docxANCHORING SCRIPT FOR A CULTURAL EVENT.docx
ANCHORING SCRIPT FOR A CULTURAL EVENT.docx
 
Open Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdf
Open Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdfOpen Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdf
Open Source Strategy in Logistics 2015_Henrik Hankedvz-d-nl-log-conference.pdf
 
Russian Call Girls in Kolkata Vaishnavi 🤌 8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Vaishnavi 🤌  8250192130 🚀 Vip Call Girls KolkataRussian Call Girls in Kolkata Vaishnavi 🤌  8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Vaishnavi 🤌 8250192130 🚀 Vip Call Girls Kolkata
 
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptx
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptxMohammad_Alnahdi_Oral_Presentation_Assignment.pptx
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptx
 
Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...
Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...
Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...
 
Microsoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AIMicrosoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AI
 
Genesis part 2 Isaiah Scudder 04-24-2024.pptx
Genesis part 2 Isaiah Scudder 04-24-2024.pptxGenesis part 2 Isaiah Scudder 04-24-2024.pptx
Genesis part 2 Isaiah Scudder 04-24-2024.pptx
 

Securing SaaS Apps with CASB and Cloud Security

  • 2. Who is using SaaS applications today?
  • 3. Who knows what users are doing in the cloud?
  • 4. Lack of visibility Compliance Threat prevention Data security
  • 5. Security in the SaaS world • Security Policies/requirements are developed for on premises solutions. • In many cases SaaS applications are a initiated by the business • SaaS providers implement ‘some’ security, but does it fit my needs? • Limited control/visibility what users are doing in the cloud. • No visibility over anomalies over different applications.
  • 6. Security in the SaaS world Does not meet requirements SHOWSTOPPER Requirements met by adding control COMPENSATED Requirements met by SaaS provider ACCEPTABLE Change architecture Adjustment expectations Src: http://www.gartner.com/webinar/3100619
  • 7. Evolution in security Transport • IP Firewalling • Segmentation Protocol inspection • Proxies • Deep inspection Application Protection • MDM • Web Application Firewalls Data Centric Audit & Protection (DCAP) • CASB • SPSM • CDPG + Unmanaged devices Shadow IT Company data is spread over multiple providers  How to protect DATA? Note trend of ABAC in DEV
  • 8. What do we need?
  • 10. CASB (Gartner) • on-premises, or cloud-based security policy enforcement points • placed between cloud service consumers and cloud service providers • to combine and interject enterprise security policies as the cloud- based resources are accessed. • consolidate multiple types of security policy enforcement. http://www.gartner.com/it-glossary/cloud-access-security- brokers-casbs
  • 11. Options to add security SaaS IaaS /PaaS SPSM Saas Platform Security Management CASB Cloud Access Security Brokers CDPG Cloud Data Protection Gateway Encryption Tokenization Masking User activity monitoring Data discovery DLP Remediation Usage discovery User activity monitoring DLP (passive and active) User activity blocking (real time) Data discovery SSO Vendors: http://www.gartner.com/webinar/3100619 REALTIME RETROACTIVE
  • 12. Architecture Options • Using forward proxies • Integration existing proxies • Placing Reverse proxies • Using Endpoint agents + IDaaS/MDM/Log integrations Hackers/ unkown endpoints Approved Endpoint Unknown SaaSApproved SaaS Reverse Proxy Forward Proxy Control Access & Actions Existing Proxy Block Actions
  • 13. Architecture Solutions • CASB (Cloud Access Security Brokers) • Forward Proxy • Reverse Proxy • API Integration • CDPG (Cloud Data Prot. GW) • Forward Proxy • Reverse Proxy • SPSM (SaaS Platform Mgmt) • API integration Hackers/ unkown endpoints Approved Endpoint Unknown SaaSApproved SaaS Reverse Proxy Forward Proxy Control Access & Actions Existing Proxy Block Actions CASB SPSM CDPG
  • 14. Where should you look at? Impact on functionality & operational risk Src: http://www.gartner.com/webinar/3100619
  • 18. Call to action • Detect shadow IT today (=High Risk) • Start controlling access to SaaS applications • Get visibility over user activity in SaaS applications • Protect your company data in SaaS applications
  • 20. PaaS: Debunking myths on data & analytics in the cloud Tim Jacobs – 25 Feb 2016
  • 21. Agenda • PaaS? • Myth #1 • Myth #2 • Myth #3 • Conclusions
  • 22. PaaS? • Provides a platform for: • Development (cloud native apps) • Content distribution (media / CDN) • Internet of Things • Automation • Data processing & analytics
  • 23. Data and data analytics? Prescriptive analytics Predictive Analytics Diagnostic Analysis Descriptive Analytics Data Collection Big Data IoT
  • 25. Debunking myths on data & analytics in the cloud • Myth #1 – Predictive analytics & big data are just BI on steroids • Myth #2 – All my data needs to go to the cloud! Y0u f00lz cr4zy? • Myth #3 – You need to hold 3 PhD’s to do predictive analytics Myth confirmed? Is it plausible? Blow everything up No No Yes Yes Myth Debunking Flowchart
  • 26. Agenda • PaaS? • Myth #1 “Predictive analytics & big data are just BI on steroids” • Myth #2 • Myth #3 • Conclusions
  • 27. New in the data landscape… 1. “Big” data 2. “Artificial Intelligence” & learning from data 3. Fast & ubiquitious network connectivity Evolution of data
  • 28. (R)Evolution in data, the questions & tooling Standard reports Ad-hoc reports Query & drilldown Alerts Statistical analysis Forecasting/extrapolation Predictive modeling Optimization Degree of intelligence Value Descriptive Analytics Predictive Analytics What happened? How many? How often? Where? Where exactly is the problem? What actions are needed? Why is this happening? What if these trends continue? What will happen next? What is the best that can happen? Traditional BI questions ETL Tools, SQL & variants Big data, or not. New type of questions New tooling, ELT, machine learning, …
  • 29. Big Data Traditional BI Predictive Analytics • BI and Predictive Analytics worlds are converging : • BI platform extensions to Big Data-esque & Advanced Analytics-y operations • Big Data tooling gets SQL-like interfaces: Drill, Impala, Hive, SparkSQL, HAWQ, Presto, Vortex, … • Big Data tooling can do descriptive and predictive analytics: MLLib, H2O, Oryx, Mahout, SAMOA, FlinkML, … (R)Evolution & convolution
  • 30. Agenda • PaaS? • Myth #1 • Myth #2 “All my data needs to go to the cloud! Y0u f00lz cr4zy?” • Myth #3 • Conclusions
  • 31. On-premises or cloud? • Advantages of cloud: • Start fast & fail fast • Easy consumption of created data models • Democratic in pricing & availability of algorithms • Attention points for cloud (mostly exceptions!): • Data privacy: legislation ↔ provider • Data volume & velocity: bandwidth
  • 32. Getting data to the cloud • Transfer existing data: • “Just upload the CSV” • Azure Data Factory (SQL, Oracle, DB2, MySQL, Sybas, PostgreSQL, ODBC, HDFS, … ) • Capture event/streaming data: • Eventhub / IoT hub Scheduling / transformation Event Hub Stream A. Azure Data Factory Blob Storage Data Lake Data Warehouse Data BaseDirect Data Mgmt Gateway File Data Base Data Warehouse IoT IoT IoT IoT © The Cloud ®™
  • 33. Conclusion • Compliant solutions available through provider • Subsetting & anonimization easily possible with data transfer tools
  • 34. Agenda • PaaS? • Myth #1 • Myth #2 • Myth #3 “You need to hold 3 PhD’s to do advanced analytics” • Conclusions
  • 35. Predictive Analytics • Azure ML studio has a low learning curve • Modular, drag & drop • Pre-built machine learning algoritms with meaningful default settings • Use case: very easy to publish “predictive engine” for your own applications • Do you need expert knowledge? • Is the out of the box 70% accuracy sufficient? • Or do you need 95% prediction accuracy?
  • 36. Example: predicting Belgian house prices Model Features Prediction accuracy Linear 1 Just based on m2 living area 48,40% Linear 2 m2 living area & postal code 69,43% Linear 3 m2 living area, postal code, # bedrooms, house type 70,36% Decision Tree 1 m2 living area, postal code, # bedrooms, house type 70,41% Linear 4 Linear in: postal code, # bedrooms, house type 3rd power in: m2 living area 71,17%
  • 37. Agenda • PaaS? • Myth #1 • Myth #2 • Myth #3 • Conclusions
  • 38. Conclusions • Three valid use cases for data in the cloud: • Reporting & analytics on big data sets, with new types of intelligence • Storing and synchronizing (subsets of) your data in the cloud • Adding intelligence to existing applications you develop • Advantages of cloud: • Easy to start, quick to get to results, fast decommissioning once completed • Democratizing of tools & algorithmes lowers starting threshold • Xylos can help with: • advanced expertise (data scientists) • data collection & storage expertise • data consumption / visualization expertise

Editor's Notes

  1. Both known and unknown applications
  2. CSP = Cloud Service Provider
  3. Andere segmeten  Uitleggen dat het segmentatie per functionaleit zetten
  4. From now on: All is CASB
  5. PaaS in a nutshell: the provider gives you everything you need, a platform, to get started with applications & data… What do you use that platform for? - You see some example use cases here - They all have in common that you still need to do something “developish” to get it to work. The platform gives you the building blocks to construct a solution – it is not a ready to use piece of software by itself, that’s the SaaS world… There are many interesting use cases for all these scenarios that you can use Platform As A Service for, but we’ll focus on data in the next 15 mins.
  6. Collect data Then do some basic statistics on that collected data, that is descriptive analytics like “what was the average revenu per sales last year” You can also go further and not only look at the data from the past, but also learn from it… and based on what you learn, you can make predictions of the future… that is the realm of predictive analytics with methods like statistical learning or machine learning. This is of course very related to other trends such as Internet of Things and Big Data… -> You probably want to collect information from your things -> If there are a lot of things, it can become a big data problem, where again you can do descriptive & predictive analytics. Important: -> Internet of Things discussion is mostly about hardware, which is nice & fun – we like that a lot as geeks – but the business value is of course in what you do with the data … what you do with the data is analytics. -> Big Data platforms provide you with the tools to process terabytes & petabytes of data, but the real value is not in the tooling or the development inside these tools – also interesting & necessary – but rather again what you do, this time with the humungeous amount of data…
  7. So it is all related, but for us, the analytics is in the center… it’s the driver why you do IoT or Big Data…
  8. Myth 1 -- big data & analytics is nothing new… hrmmmppfffff Myth 2 – All your data needs to go to the cloud, are you crazy? … Crazy… yes… the rest…. Hrmmmppfffff Myth 3 – In order to do predictive analytics, you need to have at least 3 PhD’s in mathematics…. Hrmmmppfffff So, let’s approach this MythBuster style, and see if we can blow something up…
  9. Big Data -> Not per se the volume, which is why some people prefer to drop the word “big” … But certainly the VELOCITY of the new data (social media, clickstream), and the VARIETY aspects. Is this useful? Different story, but there is certainly a difference in how data is treated & what strategic value it has, has opposed to earlier. Artificial Intelligence, like self-driving cars and computers that will take over the world… -> Much more compute power available so now it is feasible to do training of algorithms and continuously learning from data & making predictions on the spot This is all related to cheaper hardware (storage for “big” data, compute for AI) but adding to that the fact that we have fast network connectivity almost everywhere, makes every device a possible data source & data consumer. There is certainly something going on…
  10. Some people call it a revolution, others will call it an evolution, but certainly there is a difference in the tooling and in the questions you can ask. Subtle but important difference: ETL versus ELT -> ETL: typically in a data warehouse context, where you take the data, structure it and then do things with -> ELT: more the data lake approach: store the raw data, and see if and how you structure it afterwards – that is because inserting structure upfront limits what you can do with the data afterwards, it can insert some bias in what you can do with the data. Example: if you structurally remove timestamps, you cannot ask any time related questions anymore afterwards = TRIVIAL More subtle: if you remove faulty records, then you cannot ask details about how often corrupt data was sent, or how often a device failed…
  11. Some people call it a revolution, others will call it an evolution, but certainly there is a difference in the tooling and in the questions you can ask. Conclusion: they have different origin, but are growing closer together – however, there are also some differences in philosophy and approach that ML tools open source: http://journalofbigdata.springeropen.com/articles/10.1186/s40537-015-0032-1
  12. Advantages: - You can get started right away, without prior installation of hardware or software. If it doesn’t work or the experiment you are doing fails, you can get rid of it equally fast. The example shown was created & published in 2 hours (!). - No upfront investment in hardware/software, and the pricing is actually cheap, there is a good value for money. - Because your data & data processing is in the cloud, you can easily publish it to applications, mobile devices, 3rd parties, … Attention points: - Data privacy: just to be clear, the major cloud platform vendors provide a better security layer than any of you can ever do. This is not an attention point about whether your data can be leaked or hacked – but really about government regulation that prevents certain types of data to be placed internationally or across European borders. This CAN be an issue, typically for highly regulated verticals such as healthcare or government. For most of you, it will most likely NOT be a legal issue but rather a “trust” issue between you and the provider. In my personal opinion (yet IANAL), this is a transient problem – if this regulation impedes innovation, then either the regulation will change, or the major cloud providers will work around it. The market opportunity is too big to just let it pass. - Data volume & velocity CAN be an attention point, in particular if you need a lot of data (volume) or fast data (velocity) to be copied e.g. on-premises. Getting a lot of data inside the cloud is usually easy, getting it out requires more thought. This CAN be an issue in case you need access to raw, unprocessed data. Typically, if you need to store in your on-premises BI platform or data warehouse just summarized data, then there is not really an issue.
  13. Many possible outputs: blob storage, a data base, data warehouse, data lake… You can upload the data directly to the target, in the file format that is necessary to do so… Or, you can use the Azure Data Factory to do the necessary transformations on your data. It uses an on-premises component to capture the data and put it in the destination of your choice. Finally, for streaming data there are also options but let’s not go into details here. The transformation of the data is key to understand the myth – in this process you can: - take subsets of data – so certainly not all your data needs to go the cloud - anonimize your data – typically for descriptive statistics, you need aggregated data (so no specifics about individual items), and for predictive statistics, typically you need numeric data on a case/person, without knowing who that person is.
  14. To summarize It looks like a scene from the Mad Max movie, but it really is Mythbusters…
  15. Here, the “intelligence” is that we do not explicitly tell our program to multiply for example €1000 per m2 and substract/add a predefined value based on the postal code. Instead, based on the data we feed, we let the algorithm decide by itself what are the appropriate prices per m2 or the added value of having more bedrooms, without defining this “business logic” ourselves. We see that out of the box, literally in a 2 minute exercise, we can publish a model that gets approximately 50% of the predictions right. This can be enough for your application – for example, if you are writing a game and need for your virtual characters/enemies to predict what the human player characters are going to do… if you get 50% right, it will already be a very though game to play… If you bring in experts, they will probably tell you to start playing around with: - More data (increase accuracy to about 70%) - Use different models (slight increase compared to just more data) - Start tweaking & tuning the model complexity The latter is really a battle to conquer every additional percentage of accuracy in predictions. Obviously, to get high degrees of prediction, you will need more skilled people… Adding more intelligence to your application can be done very easily… so the myth is partially true…