SlideShare a Scribd company logo
1 of 40
Bangalore
Mining Information from Data on Cloud 
Ganesh Raja, Solutions Architect 
Amazon Internet Services
What is Big Data ? 
When your data sets become 
so large that you have to start innovating how to Collect, 
Store, Organize, Analyze and Share it 
Its tough because of 
Velocity, Volume and Variety
380,000,000 Tweets/Day 
200,000,000+ New Photos / Day 
Processes 1.5M+ log events per second 
80% Of Data on WWW – Last 2 Years
The cost of data generation 
is falling
Generation 
Collection & storage 
Analytics & computation 
Collaboration & sharing 
Lower cost, 
higher 
throughput 
Highly 
constrained
Cloud Computing 
Elastic & Highly Scalable 
+ 
No capital expense 
+ 
Pay-per-use 
+ 
On-demand 
$0 
= Remove constraints
Generation 
Collection & storage 
Analytics & computation 
Collaboration & sharing 
Accelerated
Big data and AWS Cloud computing 
Big Data AWS Cloud Computing 
Variety, volume, and velocity 
requiring new tools 
Variety of compute, storage, and 
networking options 
Massive datasets Massive, virtually unlimited capacity 
Iterative, experimental style of data 
Iterative, experimental style of 
manipulation and analysis 
infrastructure deployment/usage 
Frequently not steady-state workload; 
peaks and valleys 
At its most efficient with highly 
variable workloads
Big Data Technology 
Technologies and techniques for 
working productively with data, 
at any scale
One tool to rule 
them all
Big Data & Analytics @ AWS 
COLLECT STORE ANALYZE SHARE 
Direct Connect S3 
Import Export 
S3 EC2 
DynamoDB Redshift 
Glacier 
EMR 
Data Pipeline 
AWS BIG DATA 
PORTFOLIO 
Amazon Kinesis
Store anything 
Object storage 
Scalable 
99.999999999% durability 
Amazon 
S3
Real-time processing 
High throughput; elastic 
Easy to use 
EMR, S3, Redshift, DynamoDB 
Integrations 
Amazon 
Kinesis
NoSQL Database 
Seamless scalability 
Zero admin 
Single digit millisecond latency 
Amazon 
DynamoDB
Relational data warehouse 
Massively parallel 
Petabyte scale 
Fully managed 
$1,000/TB/Year 
Amazon 
Redshift
Hadoop/HDFS clusters 
Hive, Pig, Impala, Hbase 
Easy to use; fully managed 
On-demand and spot pricing 
Tight integration with S3, 
DynamoDB, and Kinesis 
Amazon 
Elastic 
MapReduce
The right tools. 
At the right scale. 
At the right time.
Bangalore
Panel Miner: Data Mining and 
Visualization using AWS 
Ganesh Sankaralingam
LatentView at a Glance 
Build Analytics 
Centers of Excellence (COEs) 
Analyze Business problems both Qualitatively & 
Quantitatively and provide Actionable Insights 
Onsite-offshore global delivery model that 
helps in-house teams do more with less 
Identified as “Cool Vendor” in 
Analytics by Gartner 2014 
Won the Deloitte Technology 
Fast 50 India award for 5 
consecutive years (2009 – 13) 
‘Top Innovator’ awarded to 
LatentView by Developer Week 
(Conference & Festival 2013) 
Recognized as a global 'Market 
Leader‘ in the Analytics space 
by SourcingLine 
Top Finalist in the ‘We Love Our 
Workplace 2013’ category.
Business Pain Points: 
Required to combine different types of data to make Business decisions 
Within the firewall Outside the firewall 
Internal 
Structured 
Data 
External 
Structured 
Data 
Within the firewall Outside the firewall 
External 
Unstructured 
Data 
Internal 
Unstructured 
Data 
o ERP, Legacy data 
o RDBMS or excel 
format 
o Email text, Customer 
service notes, Yammer 
o Webserver logs 
o Survey 
o Market Research 
o Macroeconomics 
o Promotions 
o Social media, News 
articles, Panel data 
o Real time visualization 
of Machine logs (IOT)
Technical Pain Points: 
Required to combine different types of data 
Transform unstructured data into 
structured data queried using SQL 
statements 
Automated scalable framework to 
process > 500K small files in 
constant time 
Achieve high efficiency converting 
unstructured data to structured 
data 
Control Security and access for 
different business users 
Minimize the costs and time 
running distributed jobs 
Store and Retrieve data for Analysis 
purposes in the cost and time 
efficient manner 
Track various processes in the 
AWS platform
Why AWS? 
Cost of Ownership 
Scalable, Easy to use 
Easy to acquire additional machines based on needs 
PetaByte level scalability (1 000 000 000 000 000 Bytes) 
Data Security 
High Availability 
Technology Breadth and Technical support
PANEL MINER 
Converting Unstructured to Structured Data using AWS Infrastructure 
Unstructured 
Data 
Data Collection 
EC2, S3 
Download, 
Extract, Clean 
and Stage Data 
for Processing 
Python 
Parser to Convert 
Unstructured 
Data into 
Structured Data 
EMR 
Hadoop 
Optimized Data 
Processing 
Redshift 
Data 
Warehousing, 
and Reporting 
Structured 
Data 
Analysis Using 
excel, tableau 
and other 
visualization tools
Key Benefits and Learnings with AWS
Bangalore
Analytics in the Cloud 
Leverage AWS to scale Big Data Analytics 
Shrirang Bapat, VP Engineering, PubMatic
Shrirang Bapat 
Data Enthusiast 
Innovation Agent 
Agile Evangelist 
VP Engineering at PubMatic 
Your Speaker Today
Every Ad 
Every Screen 
IAB Standard Banners 
IAB Rising Stars 
Native and custom units 
Mobile Applications 
Tablet Applications 
Rich media: MRAID 1 & 
2, ORRMA, interstitial 
Video: VAST, VPAID 
Mobile & Tablet 
Optimized Web 
Desktop Web 
One Platform 
Multi-Format, Multi-Screen, Multi-Channel 
Every Sales 
Channel 
Direct Sales Integration 
Programmatic Direct 
• Private Marketplace 
• Automated Guaranteed 
Open Auction 
Spot-buys
Premium at Scale, Across All Buying Channels 
33 
Programmatic 
Direct 
Channels Definition Value 
Automated 
Guaranteed 
Direct bought 
guaranteed inventory 
access, non-RTB 
Predictable and 
scalable high value 
placements 
Open Market 
RTB based inventory 
access in open 
marketplace 
Efficient and targeted 
audience buying 
Private 
Marketplace 
Direct bought RTB 
based inventory 
access 
Controlled buying with 
price agreements for 
bids
PubMatic is the Only 
Publisher-Focused Software Platform at Scale 
94.5% U.S. Reach, Larger Than Google 
(comScore March 2014) 
Industry’s Best Results, Independent & Flexible 
5 Data Centers, 4 Trillion RTB Requests Monthly 
500+ People Doing Business in 30 Countries
5PB 
35 
4,000,000,000,000+ 
Bids 
6AWS Regions 
350,000,000,000+ 
Impressions
Real-time Slice and Dice 
Hyper growth 
Older infrastructure 
Time to market 
New Architecture
• Big Data Pipeline 
• Real Time 
• EMR and HBASE 
37 
• Adserving 
PubMatic on AWS
Storage Database Stream Processing Compute 
EMR Networking Monitoring DNS 
38 
AWS Services
If you only take away 3 things… 
Ease of Use 
Reduced 
Time to Market 
DevOps
Mining Information from Data on Cloud

More Related Content

What's hot

Big Data Analytics & Architecture
Big Data Analytics & ArchitectureBig Data Analytics & Architecture
Big Data Analytics & ArchitectureAnjani Phuyal
 
Lightning-Fast, Interactive Business Intelligence Performance with MicroStrat...
Lightning-Fast, Interactive Business Intelligence Performance with MicroStrat...Lightning-Fast, Interactive Business Intelligence Performance with MicroStrat...
Lightning-Fast, Interactive Business Intelligence Performance with MicroStrat...Tyler Wishnoff
 
Introduction to Data Analysis, Storage & Processing Solutions
Introduction to Data Analysis, Storage & Processing SolutionsIntroduction to Data Analysis, Storage & Processing Solutions
Introduction to Data Analysis, Storage & Processing SolutionsAnjani Phuyal
 
AWS Enterprise Day | Big Data Analytics
AWS Enterprise Day | Big Data AnalyticsAWS Enterprise Day | Big Data Analytics
AWS Enterprise Day | Big Data AnalyticsAmazon Web Services
 
Using Big Data to Driving Big Engagement
Using Big Data to Driving Big EngagementUsing Big Data to Driving Big Engagement
Using Big Data to Driving Big EngagementAmazon Web Services
 
Customer Keynote: PIXNET Media Inc.- Business Intelligent and Analysis: Empir...
Customer Keynote: PIXNET Media Inc.- Business Intelligent and Analysis: Empir...Customer Keynote: PIXNET Media Inc.- Business Intelligent and Analysis: Empir...
Customer Keynote: PIXNET Media Inc.- Business Intelligent and Analysis: Empir...Amazon Web Services
 
Machine Learning & Data Lake for IoT scenarios on AWS
Machine Learning & Data Lake for IoT scenarios on AWSMachine Learning & Data Lake for IoT scenarios on AWS
Machine Learning & Data Lake for IoT scenarios on AWSAmazon Web Services
 
Scaling Production Machine Learning Pipelines with Databricks
Scaling Production Machine Learning Pipelines with DatabricksScaling Production Machine Learning Pipelines with Databricks
Scaling Production Machine Learning Pipelines with DatabricksDatabricks
 
Turn Big Data Into Big Value On Informatica and Amazon
Turn Big Data Into Big Value On Informatica and AmazonTurn Big Data Into Big Value On Informatica and Amazon
Turn Big Data Into Big Value On Informatica and AmazonAmazon Web Services
 
Machines and the Magic of Fast Learning
Machines and the Magic of Fast LearningMachines and the Magic of Fast Learning
Machines and the Magic of Fast LearningSingleStore
 
#DataOnCloud London event
#DataOnCloud London event#DataOnCloud London event
#DataOnCloud London eventHARMAN Services
 
Introduction to Google Cloud Platform for Big Data - Trusted Conf
Introduction to Google Cloud Platform for Big Data - Trusted ConfIntroduction to Google Cloud Platform for Big Data - Trusted Conf
Introduction to Google Cloud Platform for Big Data - Trusted ConfIn Marketing We Trust
 
MongoDB World 2019: re:Innovate from Siloed to Deep Insights on Your Data
MongoDB World 2019: re:Innovate from Siloed to Deep Insights on Your DataMongoDB World 2019: re:Innovate from Siloed to Deep Insights on Your Data
MongoDB World 2019: re:Innovate from Siloed to Deep Insights on Your DataMongoDB
 
Spark Summit East Keynote by Anjul Bhambhri
Spark Summit East Keynote by Anjul BhambhriSpark Summit East Keynote by Anjul Bhambhri
Spark Summit East Keynote by Anjul BhambhriJen Aman
 
There is more to Big Data than data
There is more to Big Data than dataThere is more to Big Data than data
There is more to Big Data than dataCapgemini
 
Importance of global certifications
Importance of global certificationsImportance of global certifications
Importance of global certificationsAnjani Phuyal
 
Building the Ideal Stack for Machine Learning
Building the Ideal Stack for Machine LearningBuilding the Ideal Stack for Machine Learning
Building the Ideal Stack for Machine LearningSingleStore
 
Explore Your Data Using Amazon QuickSight and Build Your First Machine Learni...
Explore Your Data Using Amazon QuickSight and Build Your First Machine Learni...Explore Your Data Using Amazon QuickSight and Build Your First Machine Learni...
Explore Your Data Using Amazon QuickSight and Build Your First Machine Learni...Amazon Web Services
 
Big Data: It’s all about the Use Cases
Big Data: It’s all about the Use CasesBig Data: It’s all about the Use Cases
Big Data: It’s all about the Use CasesJames Serra
 

What's hot (20)

Big Data Analytics & Architecture
Big Data Analytics & ArchitectureBig Data Analytics & Architecture
Big Data Analytics & Architecture
 
Lightning-Fast, Interactive Business Intelligence Performance with MicroStrat...
Lightning-Fast, Interactive Business Intelligence Performance with MicroStrat...Lightning-Fast, Interactive Business Intelligence Performance with MicroStrat...
Lightning-Fast, Interactive Business Intelligence Performance with MicroStrat...
 
Introduction to Data Analysis, Storage & Processing Solutions
Introduction to Data Analysis, Storage & Processing SolutionsIntroduction to Data Analysis, Storage & Processing Solutions
Introduction to Data Analysis, Storage & Processing Solutions
 
AWS Enterprise Day | Big Data Analytics
AWS Enterprise Day | Big Data AnalyticsAWS Enterprise Day | Big Data Analytics
AWS Enterprise Day | Big Data Analytics
 
Using Big Data to Driving Big Engagement
Using Big Data to Driving Big EngagementUsing Big Data to Driving Big Engagement
Using Big Data to Driving Big Engagement
 
Customer Keynote: PIXNET Media Inc.- Business Intelligent and Analysis: Empir...
Customer Keynote: PIXNET Media Inc.- Business Intelligent and Analysis: Empir...Customer Keynote: PIXNET Media Inc.- Business Intelligent and Analysis: Empir...
Customer Keynote: PIXNET Media Inc.- Business Intelligent and Analysis: Empir...
 
Machine Learning & Data Lake for IoT scenarios on AWS
Machine Learning & Data Lake for IoT scenarios on AWSMachine Learning & Data Lake for IoT scenarios on AWS
Machine Learning & Data Lake for IoT scenarios on AWS
 
Scaling Production Machine Learning Pipelines with Databricks
Scaling Production Machine Learning Pipelines with DatabricksScaling Production Machine Learning Pipelines with Databricks
Scaling Production Machine Learning Pipelines with Databricks
 
Turn Big Data Into Big Value On Informatica and Amazon
Turn Big Data Into Big Value On Informatica and AmazonTurn Big Data Into Big Value On Informatica and Amazon
Turn Big Data Into Big Value On Informatica and Amazon
 
Machines and the Magic of Fast Learning
Machines and the Magic of Fast LearningMachines and the Magic of Fast Learning
Machines and the Magic of Fast Learning
 
Data Warehouses and Data Lakes
Data Warehouses and Data LakesData Warehouses and Data Lakes
Data Warehouses and Data Lakes
 
#DataOnCloud London event
#DataOnCloud London event#DataOnCloud London event
#DataOnCloud London event
 
Introduction to Google Cloud Platform for Big Data - Trusted Conf
Introduction to Google Cloud Platform for Big Data - Trusted ConfIntroduction to Google Cloud Platform for Big Data - Trusted Conf
Introduction to Google Cloud Platform for Big Data - Trusted Conf
 
MongoDB World 2019: re:Innovate from Siloed to Deep Insights on Your Data
MongoDB World 2019: re:Innovate from Siloed to Deep Insights on Your DataMongoDB World 2019: re:Innovate from Siloed to Deep Insights on Your Data
MongoDB World 2019: re:Innovate from Siloed to Deep Insights on Your Data
 
Spark Summit East Keynote by Anjul Bhambhri
Spark Summit East Keynote by Anjul BhambhriSpark Summit East Keynote by Anjul Bhambhri
Spark Summit East Keynote by Anjul Bhambhri
 
There is more to Big Data than data
There is more to Big Data than dataThere is more to Big Data than data
There is more to Big Data than data
 
Importance of global certifications
Importance of global certificationsImportance of global certifications
Importance of global certifications
 
Building the Ideal Stack for Machine Learning
Building the Ideal Stack for Machine LearningBuilding the Ideal Stack for Machine Learning
Building the Ideal Stack for Machine Learning
 
Explore Your Data Using Amazon QuickSight and Build Your First Machine Learni...
Explore Your Data Using Amazon QuickSight and Build Your First Machine Learni...Explore Your Data Using Amazon QuickSight and Build Your First Machine Learni...
Explore Your Data Using Amazon QuickSight and Build Your First Machine Learni...
 
Big Data: It’s all about the Use Cases
Big Data: It’s all about the Use CasesBig Data: It’s all about the Use Cases
Big Data: It’s all about the Use Cases
 

Viewers also liked

Science of Facebook
Science of FacebookScience of Facebook
Science of FacebookDan Zarrella
 
Agile Tour 2016 Pune Agile innovation way - your toll road to success by Shr...
Agile Tour 2016 Pune  Agile innovation way - your toll road to success by Shr...Agile Tour 2016 Pune  Agile innovation way - your toll road to success by Shr...
Agile Tour 2016 Pune Agile innovation way - your toll road to success by Shr...India Scrum Enthusiasts Community
 
Agile Tour 2016 Pune - Agile innovation in learning and development by Sarmis...
Agile Tour 2016 Pune - Agile innovation in learning and development by Sarmis...Agile Tour 2016 Pune - Agile innovation in learning and development by Sarmis...
Agile Tour 2016 Pune - Agile innovation in learning and development by Sarmis...India Scrum Enthusiasts Community
 
Why Scale Matters and How the Cloud is Really Different (at scale)
Why Scale Matters and How the Cloud is Really Different (at scale)Why Scale Matters and How the Cloud is Really Different (at scale)
Why Scale Matters and How the Cloud is Really Different (at scale)Amazon Web Services
 
Journey Through the Cloud - Digital Media
Journey Through the Cloud - Digital MediaJourney Through the Cloud - Digital Media
Journey Through the Cloud - Digital MediaAmazon Web Services
 
Updating Security Operations for the Cloud - AWS Symposium 2014 - Washington ...
Updating Security Operations for the Cloud - AWS Symposium 2014 - Washington ...Updating Security Operations for the Cloud - AWS Symposium 2014 - Washington ...
Updating Security Operations for the Cloud - AWS Symposium 2014 - Washington ...Amazon Web Services
 
AWS Public Sector Symposium | Effective Security Response in the Cloud - Sess...
AWS Public Sector Symposium | Effective Security Response in the Cloud - Sess...AWS Public Sector Symposium | Effective Security Response in the Cloud - Sess...
AWS Public Sector Symposium | Effective Security Response in the Cloud - Sess...Amazon Web Services
 
(BIZ306) Migrating Trimble Sketchup 3D Warehouse to AWS | AWS re:Invent 2014
(BIZ306) Migrating Trimble Sketchup 3D Warehouse to AWS | AWS re:Invent 2014(BIZ306) Migrating Trimble Sketchup 3D Warehouse to AWS | AWS re:Invent 2014
(BIZ306) Migrating Trimble Sketchup 3D Warehouse to AWS | AWS re:Invent 2014Amazon Web Services
 
AWS Webcast - Emergency Preparedness and Interagency Collaboration Webinar
AWS Webcast - Emergency Preparedness and Interagency Collaboration Webinar AWS Webcast - Emergency Preparedness and Interagency Collaboration Webinar
AWS Webcast - Emergency Preparedness and Interagency Collaboration Webinar Amazon Web Services
 
AWS Webcast - Webinar Series for State and Local Government #2: Discover the ...
AWS Webcast - Webinar Series for State and Local Government #2: Discover the ...AWS Webcast - Webinar Series for State and Local Government #2: Discover the ...
AWS Webcast - Webinar Series for State and Local Government #2: Discover the ...Amazon Web Services
 
(SPOT207) State of the Union: AWS Database Services | AWS re:Invent 2014
(SPOT207) State of the Union: AWS Database Services | AWS re:Invent 2014(SPOT207) State of the Union: AWS Database Services | AWS re:Invent 2014
(SPOT207) State of the Union: AWS Database Services | AWS re:Invent 2014Amazon Web Services
 
(BDT202) HPC Now Means 'High Personal Computing' | AWS re:Invent 2014
(BDT202) HPC Now Means 'High Personal Computing' | AWS re:Invent 2014(BDT202) HPC Now Means 'High Personal Computing' | AWS re:Invent 2014
(BDT202) HPC Now Means 'High Personal Computing' | AWS re:Invent 2014Amazon Web Services
 
Fostering a Culture of Innovation with Cloud
Fostering a Culture of Innovation with CloudFostering a Culture of Innovation with Cloud
Fostering a Culture of Innovation with CloudAmazon Web Services
 
AWS Government, Education, and Nonprofits Symposium London, United Kingdom L...
 AWS Government, Education, and Nonprofits Symposium London, United Kingdom L... AWS Government, Education, and Nonprofits Symposium London, United Kingdom L...
AWS Government, Education, and Nonprofits Symposium London, United Kingdom L...Amazon Web Services
 
AWS Paris Summit 2014 - Closing Keynote Werner Vogels - Beyond the fridge
AWS Paris Summit 2014 - Closing Keynote Werner Vogels - Beyond the fridgeAWS Paris Summit 2014 - Closing Keynote Werner Vogels - Beyond the fridge
AWS Paris Summit 2014 - Closing Keynote Werner Vogels - Beyond the fridgeAmazon Web Services
 
T2 – Continuous integration on aws
T2 – Continuous integration on awsT2 – Continuous integration on aws
T2 – Continuous integration on awsAmazon Web Services
 

Viewers also liked (20)

Science of Facebook
Science of FacebookScience of Facebook
Science of Facebook
 
Agile Tour 2016 Pune Agile innovation way - your toll road to success by Shr...
Agile Tour 2016 Pune  Agile innovation way - your toll road to success by Shr...Agile Tour 2016 Pune  Agile innovation way - your toll road to success by Shr...
Agile Tour 2016 Pune Agile innovation way - your toll road to success by Shr...
 
Creating a Cross-Corporate Knowledge Pool
Creating a Cross-Corporate Knowledge PoolCreating a Cross-Corporate Knowledge Pool
Creating a Cross-Corporate Knowledge Pool
 
Softex_ Clinic Manager
Softex_ Clinic ManagerSoftex_ Clinic Manager
Softex_ Clinic Manager
 
Agile Tour 2016 Pune - Agile innovation in learning and development by Sarmis...
Agile Tour 2016 Pune - Agile innovation in learning and development by Sarmis...Agile Tour 2016 Pune - Agile innovation in learning and development by Sarmis...
Agile Tour 2016 Pune - Agile innovation in learning and development by Sarmis...
 
Why Scale Matters and How the Cloud is Really Different (at scale)
Why Scale Matters and How the Cloud is Really Different (at scale)Why Scale Matters and How the Cloud is Really Different (at scale)
Why Scale Matters and How the Cloud is Really Different (at scale)
 
Journey Through the Cloud - Digital Media
Journey Through the Cloud - Digital MediaJourney Through the Cloud - Digital Media
Journey Through the Cloud - Digital Media
 
Updating Security Operations for the Cloud - AWS Symposium 2014 - Washington ...
Updating Security Operations for the Cloud - AWS Symposium 2014 - Washington ...Updating Security Operations for the Cloud - AWS Symposium 2014 - Washington ...
Updating Security Operations for the Cloud - AWS Symposium 2014 - Washington ...
 
AWS Public Sector Symposium | Effective Security Response in the Cloud - Sess...
AWS Public Sector Symposium | Effective Security Response in the Cloud - Sess...AWS Public Sector Symposium | Effective Security Response in the Cloud - Sess...
AWS Public Sector Symposium | Effective Security Response in the Cloud - Sess...
 
(BIZ306) Migrating Trimble Sketchup 3D Warehouse to AWS | AWS re:Invent 2014
(BIZ306) Migrating Trimble Sketchup 3D Warehouse to AWS | AWS re:Invent 2014(BIZ306) Migrating Trimble Sketchup 3D Warehouse to AWS | AWS re:Invent 2014
(BIZ306) Migrating Trimble Sketchup 3D Warehouse to AWS | AWS re:Invent 2014
 
Workshop part2 – Big Data
Workshop part2 – Big DataWorkshop part2 – Big Data
Workshop part2 – Big Data
 
AWS Webcast - Emergency Preparedness and Interagency Collaboration Webinar
AWS Webcast - Emergency Preparedness and Interagency Collaboration Webinar AWS Webcast - Emergency Preparedness and Interagency Collaboration Webinar
AWS Webcast - Emergency Preparedness and Interagency Collaboration Webinar
 
AWS Webcast - Webinar Series for State and Local Government #2: Discover the ...
AWS Webcast - Webinar Series for State and Local Government #2: Discover the ...AWS Webcast - Webinar Series for State and Local Government #2: Discover the ...
AWS Webcast - Webinar Series for State and Local Government #2: Discover the ...
 
(SPOT207) State of the Union: AWS Database Services | AWS re:Invent 2014
(SPOT207) State of the Union: AWS Database Services | AWS re:Invent 2014(SPOT207) State of the Union: AWS Database Services | AWS re:Invent 2014
(SPOT207) State of the Union: AWS Database Services | AWS re:Invent 2014
 
(BDT202) HPC Now Means 'High Personal Computing' | AWS re:Invent 2014
(BDT202) HPC Now Means 'High Personal Computing' | AWS re:Invent 2014(BDT202) HPC Now Means 'High Personal Computing' | AWS re:Invent 2014
(BDT202) HPC Now Means 'High Personal Computing' | AWS re:Invent 2014
 
Fostering a Culture of Innovation with Cloud
Fostering a Culture of Innovation with CloudFostering a Culture of Innovation with Cloud
Fostering a Culture of Innovation with Cloud
 
AWS Government, Education, and Nonprofits Symposium London, United Kingdom L...
 AWS Government, Education, and Nonprofits Symposium London, United Kingdom L... AWS Government, Education, and Nonprofits Symposium London, United Kingdom L...
AWS Government, Education, and Nonprofits Symposium London, United Kingdom L...
 
AWS Paris Summit 2014 - Closing Keynote Werner Vogels - Beyond the fridge
AWS Paris Summit 2014 - Closing Keynote Werner Vogels - Beyond the fridgeAWS Paris Summit 2014 - Closing Keynote Werner Vogels - Beyond the fridge
AWS Paris Summit 2014 - Closing Keynote Werner Vogels - Beyond the fridge
 
AWS Service Drill Downs
AWS Service Drill DownsAWS Service Drill Downs
AWS Service Drill Downs
 
T2 – Continuous integration on aws
T2 – Continuous integration on awsT2 – Continuous integration on aws
T2 – Continuous integration on aws
 

Similar to Mining Information from Data on Cloud

Using real time big data analytics for competitive advantage
 Using real time big data analytics for competitive advantage Using real time big data analytics for competitive advantage
Using real time big data analytics for competitive advantageAmazon Web Services
 
Tapping the cloud for real time data analytics
 Tapping the cloud for real time data analytics Tapping the cloud for real time data analytics
Tapping the cloud for real time data analyticsAmazon Web Services
 
AWS Webcast - Informatica - Big Data Solutions Showcase
AWS Webcast - Informatica - Big Data Solutions ShowcaseAWS Webcast - Informatica - Big Data Solutions Showcase
AWS Webcast - Informatica - Big Data Solutions ShowcaseAmazon Web Services
 
Big Data & Analytics - Innovating at the Speed of Light
Big Data & Analytics - Innovating at the Speed of LightBig Data & Analytics - Innovating at the Speed of Light
Big Data & Analytics - Innovating at the Speed of LightAmazon Web Services LATAM
 
AWS Summit 2013 | Singapore - Big Data Analytics, Presented by AWS, Intel and...
AWS Summit 2013 | Singapore - Big Data Analytics, Presented by AWS, Intel and...AWS Summit 2013 | Singapore - Big Data Analytics, Presented by AWS, Intel and...
AWS Summit 2013 | Singapore - Big Data Analytics, Presented by AWS, Intel and...Amazon Web Services
 
AWS Webcast - Sales Productivity Solutions with MicroStrategy and Redshift
AWS Webcast - Sales Productivity Solutions with MicroStrategy and RedshiftAWS Webcast - Sales Productivity Solutions with MicroStrategy and Redshift
AWS Webcast - Sales Productivity Solutions with MicroStrategy and RedshiftAmazon Web Services
 
Big Data Use Cases and Solutions in the AWS Cloud
Big Data Use Cases and Solutions in the AWS CloudBig Data Use Cases and Solutions in the AWS Cloud
Big Data Use Cases and Solutions in the AWS CloudAmazon Web Services
 
Big Data Solutions Day - Calgary
Big Data Solutions Day - CalgaryBig Data Solutions Day - Calgary
Big Data Solutions Day - CalgaryAmazon Web Services
 
Modern Data Architectures for Business Outcomes
Modern Data Architectures for Business OutcomesModern Data Architectures for Business Outcomes
Modern Data Architectures for Business OutcomesAmazon Web Services
 
Vancouver keynote - AWS Innovate - Sam Elmalak
Vancouver keynote - AWS Innovate - Sam ElmalakVancouver keynote - AWS Innovate - Sam Elmalak
Vancouver keynote - AWS Innovate - Sam ElmalakAmazon Web Services
 
High-performance database technology for rock-solid IoT solutions
High-performance database technology for rock-solid IoT solutionsHigh-performance database technology for rock-solid IoT solutions
High-performance database technology for rock-solid IoT solutionsClusterpoint
 
Modern Data Architectures for Business Outcomes
Modern Data Architectures for Business OutcomesModern Data Architectures for Business Outcomes
Modern Data Architectures for Business OutcomesAmazon Web Services
 
Amazon Web Services
Amazon Web ServicesAmazon Web Services
Amazon Web ServicesJisc
 
Bringing the Power of Big Data Computation to Salesforce
Bringing the Power of Big Data Computation to SalesforceBringing the Power of Big Data Computation to Salesforce
Bringing the Power of Big Data Computation to SalesforceSalesforce Developers
 
How does Microsoft solve Big Data?
How does Microsoft solve Big Data?How does Microsoft solve Big Data?
How does Microsoft solve Big Data?James Serra
 
WTIA Cloud Computing Series - Part II: Scaling into the Cloud with Amazon Web...
WTIA Cloud Computing Series - Part II: Scaling into the Cloud with Amazon Web...WTIA Cloud Computing Series - Part II: Scaling into the Cloud with Amazon Web...
WTIA Cloud Computing Series - Part II: Scaling into the Cloud with Amazon Web...Washington Technology Industry Association
 
Financial Services Analytics on AWS
Financial Services Analytics on AWSFinancial Services Analytics on AWS
Financial Services Analytics on AWSAmazon Web Services
 

Similar to Mining Information from Data on Cloud (20)

Using real time big data analytics for competitive advantage
 Using real time big data analytics for competitive advantage Using real time big data analytics for competitive advantage
Using real time big data analytics for competitive advantage
 
Tapping the cloud for real time data analytics
 Tapping the cloud for real time data analytics Tapping the cloud for real time data analytics
Tapping the cloud for real time data analytics
 
AWS Webcast - Informatica - Big Data Solutions Showcase
AWS Webcast - Informatica - Big Data Solutions ShowcaseAWS Webcast - Informatica - Big Data Solutions Showcase
AWS Webcast - Informatica - Big Data Solutions Showcase
 
Solving Big Data problems on AWS by Rajnish Malik
Solving Big Data problems on AWS by Rajnish MalikSolving Big Data problems on AWS by Rajnish Malik
Solving Big Data problems on AWS by Rajnish Malik
 
AWS Big Data Solution Days
AWS Big Data Solution DaysAWS Big Data Solution Days
AWS Big Data Solution Days
 
Big Data & Analytics - Innovating at the Speed of Light
Big Data & Analytics - Innovating at the Speed of LightBig Data & Analytics - Innovating at the Speed of Light
Big Data & Analytics - Innovating at the Speed of Light
 
AWS Summit 2013 | Singapore - Big Data Analytics, Presented by AWS, Intel and...
AWS Summit 2013 | Singapore - Big Data Analytics, Presented by AWS, Intel and...AWS Summit 2013 | Singapore - Big Data Analytics, Presented by AWS, Intel and...
AWS Summit 2013 | Singapore - Big Data Analytics, Presented by AWS, Intel and...
 
AWS Webcast - Sales Productivity Solutions with MicroStrategy and Redshift
AWS Webcast - Sales Productivity Solutions with MicroStrategy and RedshiftAWS Webcast - Sales Productivity Solutions with MicroStrategy and Redshift
AWS Webcast - Sales Productivity Solutions with MicroStrategy and Redshift
 
Big Data Use Cases and Solutions in the AWS Cloud
Big Data Use Cases and Solutions in the AWS CloudBig Data Use Cases and Solutions in the AWS Cloud
Big Data Use Cases and Solutions in the AWS Cloud
 
Big Data Solutions Day - Calgary
Big Data Solutions Day - CalgaryBig Data Solutions Day - Calgary
Big Data Solutions Day - Calgary
 
Modern Data Architectures for Business Outcomes
Modern Data Architectures for Business OutcomesModern Data Architectures for Business Outcomes
Modern Data Architectures for Business Outcomes
 
Vancouver keynote - AWS Innovate - Sam Elmalak
Vancouver keynote - AWS Innovate - Sam ElmalakVancouver keynote - AWS Innovate - Sam Elmalak
Vancouver keynote - AWS Innovate - Sam Elmalak
 
High-performance database technology for rock-solid IoT solutions
High-performance database technology for rock-solid IoT solutionsHigh-performance database technology for rock-solid IoT solutions
High-performance database technology for rock-solid IoT solutions
 
Modern Data Architectures for Business Outcomes
Modern Data Architectures for Business OutcomesModern Data Architectures for Business Outcomes
Modern Data Architectures for Business Outcomes
 
2016 AWS Big Data Solution Days
2016 AWS Big Data Solution Days2016 AWS Big Data Solution Days
2016 AWS Big Data Solution Days
 
Amazon Web Services
Amazon Web ServicesAmazon Web Services
Amazon Web Services
 
Bringing the Power of Big Data Computation to Salesforce
Bringing the Power of Big Data Computation to SalesforceBringing the Power of Big Data Computation to Salesforce
Bringing the Power of Big Data Computation to Salesforce
 
How does Microsoft solve Big Data?
How does Microsoft solve Big Data?How does Microsoft solve Big Data?
How does Microsoft solve Big Data?
 
WTIA Cloud Computing Series - Part II: Scaling into the Cloud with Amazon Web...
WTIA Cloud Computing Series - Part II: Scaling into the Cloud with Amazon Web...WTIA Cloud Computing Series - Part II: Scaling into the Cloud with Amazon Web...
WTIA Cloud Computing Series - Part II: Scaling into the Cloud with Amazon Web...
 
Financial Services Analytics on AWS
Financial Services Analytics on AWSFinancial Services Analytics on AWS
Financial Services Analytics on AWS
 

More from Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Recently uploaded

"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 

Recently uploaded (20)

"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 

Mining Information from Data on Cloud

  • 2. Mining Information from Data on Cloud Ganesh Raja, Solutions Architect Amazon Internet Services
  • 3. What is Big Data ? When your data sets become so large that you have to start innovating how to Collect, Store, Organize, Analyze and Share it Its tough because of Velocity, Volume and Variety
  • 4. 380,000,000 Tweets/Day 200,000,000+ New Photos / Day Processes 1.5M+ log events per second 80% Of Data on WWW – Last 2 Years
  • 5. The cost of data generation is falling
  • 6. Generation Collection & storage Analytics & computation Collaboration & sharing Lower cost, higher throughput Highly constrained
  • 7. Cloud Computing Elastic & Highly Scalable + No capital expense + Pay-per-use + On-demand $0 = Remove constraints
  • 8. Generation Collection & storage Analytics & computation Collaboration & sharing Accelerated
  • 9. Big data and AWS Cloud computing Big Data AWS Cloud Computing Variety, volume, and velocity requiring new tools Variety of compute, storage, and networking options Massive datasets Massive, virtually unlimited capacity Iterative, experimental style of data Iterative, experimental style of manipulation and analysis infrastructure deployment/usage Frequently not steady-state workload; peaks and valleys At its most efficient with highly variable workloads
  • 10. Big Data Technology Technologies and techniques for working productively with data, at any scale
  • 11. One tool to rule them all
  • 12. Big Data & Analytics @ AWS COLLECT STORE ANALYZE SHARE Direct Connect S3 Import Export S3 EC2 DynamoDB Redshift Glacier EMR Data Pipeline AWS BIG DATA PORTFOLIO Amazon Kinesis
  • 13. Store anything Object storage Scalable 99.999999999% durability Amazon S3
  • 14. Real-time processing High throughput; elastic Easy to use EMR, S3, Redshift, DynamoDB Integrations Amazon Kinesis
  • 15. NoSQL Database Seamless scalability Zero admin Single digit millisecond latency Amazon DynamoDB
  • 16. Relational data warehouse Massively parallel Petabyte scale Fully managed $1,000/TB/Year Amazon Redshift
  • 17. Hadoop/HDFS clusters Hive, Pig, Impala, Hbase Easy to use; fully managed On-demand and spot pricing Tight integration with S3, DynamoDB, and Kinesis Amazon Elastic MapReduce
  • 18. The right tools. At the right scale. At the right time.
  • 19.
  • 21. Panel Miner: Data Mining and Visualization using AWS Ganesh Sankaralingam
  • 22. LatentView at a Glance Build Analytics Centers of Excellence (COEs) Analyze Business problems both Qualitatively & Quantitatively and provide Actionable Insights Onsite-offshore global delivery model that helps in-house teams do more with less Identified as “Cool Vendor” in Analytics by Gartner 2014 Won the Deloitte Technology Fast 50 India award for 5 consecutive years (2009 – 13) ‘Top Innovator’ awarded to LatentView by Developer Week (Conference & Festival 2013) Recognized as a global 'Market Leader‘ in the Analytics space by SourcingLine Top Finalist in the ‘We Love Our Workplace 2013’ category.
  • 23. Business Pain Points: Required to combine different types of data to make Business decisions Within the firewall Outside the firewall Internal Structured Data External Structured Data Within the firewall Outside the firewall External Unstructured Data Internal Unstructured Data o ERP, Legacy data o RDBMS or excel format o Email text, Customer service notes, Yammer o Webserver logs o Survey o Market Research o Macroeconomics o Promotions o Social media, News articles, Panel data o Real time visualization of Machine logs (IOT)
  • 24. Technical Pain Points: Required to combine different types of data Transform unstructured data into structured data queried using SQL statements Automated scalable framework to process > 500K small files in constant time Achieve high efficiency converting unstructured data to structured data Control Security and access for different business users Minimize the costs and time running distributed jobs Store and Retrieve data for Analysis purposes in the cost and time efficient manner Track various processes in the AWS platform
  • 25. Why AWS? Cost of Ownership Scalable, Easy to use Easy to acquire additional machines based on needs PetaByte level scalability (1 000 000 000 000 000 Bytes) Data Security High Availability Technology Breadth and Technical support
  • 26. PANEL MINER Converting Unstructured to Structured Data using AWS Infrastructure Unstructured Data Data Collection EC2, S3 Download, Extract, Clean and Stage Data for Processing Python Parser to Convert Unstructured Data into Structured Data EMR Hadoop Optimized Data Processing Redshift Data Warehousing, and Reporting Structured Data Analysis Using excel, tableau and other visualization tools
  • 27. Key Benefits and Learnings with AWS
  • 28.
  • 30. Analytics in the Cloud Leverage AWS to scale Big Data Analytics Shrirang Bapat, VP Engineering, PubMatic
  • 31. Shrirang Bapat Data Enthusiast Innovation Agent Agile Evangelist VP Engineering at PubMatic Your Speaker Today
  • 32. Every Ad Every Screen IAB Standard Banners IAB Rising Stars Native and custom units Mobile Applications Tablet Applications Rich media: MRAID 1 & 2, ORRMA, interstitial Video: VAST, VPAID Mobile & Tablet Optimized Web Desktop Web One Platform Multi-Format, Multi-Screen, Multi-Channel Every Sales Channel Direct Sales Integration Programmatic Direct • Private Marketplace • Automated Guaranteed Open Auction Spot-buys
  • 33. Premium at Scale, Across All Buying Channels 33 Programmatic Direct Channels Definition Value Automated Guaranteed Direct bought guaranteed inventory access, non-RTB Predictable and scalable high value placements Open Market RTB based inventory access in open marketplace Efficient and targeted audience buying Private Marketplace Direct bought RTB based inventory access Controlled buying with price agreements for bids
  • 34. PubMatic is the Only Publisher-Focused Software Platform at Scale 94.5% U.S. Reach, Larger Than Google (comScore March 2014) Industry’s Best Results, Independent & Flexible 5 Data Centers, 4 Trillion RTB Requests Monthly 500+ People Doing Business in 30 Countries
  • 35. 5PB 35 4,000,000,000,000+ Bids 6AWS Regions 350,000,000,000+ Impressions
  • 36. Real-time Slice and Dice Hyper growth Older infrastructure Time to market New Architecture
  • 37. • Big Data Pipeline • Real Time • EMR and HBASE 37 • Adserving PubMatic on AWS
  • 38. Storage Database Stream Processing Compute EMR Networking Monitoring DNS 38 AWS Services
  • 39. If you only take away 3 things… Ease of Use Reduced Time to Market DevOps

Editor's Notes

  1. M2M, Sensor Data is the new King. Stephen Forbes defined it that we live in SPIME world (Space and Time) and data will be collected and connected – which bringsthe Internet Of Thing concept
  2. However, we don’t believe that there is one tool that can do everything, but rather if you use the right tools, you can build a highly configurable big data architcture to meet your specific needs.
  3. This is how the AWS Big Data portfolio looks like. We have tools like Direct Connect and Import Export that can bring in a lot of data. We can push that data into a number of sources from S3 and DynamoDB to EMR and RedShift for analysis. Amazon Redshift provides a fast, fully managed, petabyte-scale data warehouse for less than $1000 per terabyte per year. Amazon Elastic MapReduce provides a managed, easy to use analytics platform built around the powerful Hadoop framework. Recently we announced Amazon Kinesis, a managed service for real-time processing of streaming big data. Amazon Kinesis supports data throughput from megabytes to gigabytes of data per second and can scale seamlessly to handle streams from hundreds of thousands of different data sources. The tools to support big data collection, computation along with collaboration and sharing are all available in a couple of clicks, with AWS.
  4. Fundamental storage at internet scale, it can store any number of objects from 1 byte to 5 TB in size It is engineered for 11 9’s of durability replicating your data at least three times in three distinct physical data centers we call availability zones We have customers such as Dropbox, Spotify, Pinterest store billions of objects or files as photos, videos, songs, or any other type of file.
  5. Amazon Kinesis is a fully managed service for real-time processing of streaming data at massive scale. Amazon Kinesis can collect and process hundreds of terabytes of data per hour from hundreds of thousands of sources. For instance, instead of having to process log files in batch, you can have log events stream into Kinesis and then have workers with the Kinesis client library read from the stream and process the informaiton and drive a real time dashboard. Later on today, we will have the product manager, Adi Krishnan, for Amazon Kinesis give a deep dive into the service
  6. DynamoDB is a fast, fully managed NoSQL database service that makes it simple and cost-effective to store and retrieve any amount of data, and serve any level of request traffic. Its guaranteed throughput and single-digit millisecond latency make it a great fit for gaming, ad tech, mobile and many other applications. Runs on solid state hard drives for high speed performance at scale and you can provision reads and writes to a table without having to worry about the admin of scaling or sharding, it is done all behind the scenes for you.   “We were able to start with only a few requests per minute and scale to over 40,000, all with just a few button presses on the way up” said Charles Ju, co-founder of PennyPop.  “DynamoDB is easily the simplest and most scalable part of our application backend.”   “DynamoDB's biggest cost saving is not just the efficiencies and ease of use, but rather the opportunity cost of maintenance.  Building, maintaining, and sharding large live data-centric real-time projects is incredibly hard and requires many people to both create and maintain these projects.  DynamoDB has done an excellent job consolidating this within Amazon itself so the web community at large can focus on what it does best -- building great features and applications."   “We still only have 2 server engineers running a MMORPG where millions of players have come to enjoy the game…We are leaner than any other MMORPG at our size that I know of.”
  7. Provision a petabyte scale cluster to handle complex SQL queries in just a few minutes. You can get either a HDD drive based cluster or the recently introduced SSD based cluster that is smaller in total cluster size but higher performance per GB This data warehouse solution is about a tenth of what traditional solutions cost of comparable size. Redshift can drive business intelligence tools such as Jaspersoft or Microstrategy because it supports standard SQL and can connect using ODBC or JDBC drivers.
  8. When you think of big data these days, Hadoop is always an integral part. When you take the benefits of what the cloud can do along with the computational paradigm of MapReduce, you get Elastic MapReduce. Customers have launched millions of clusters to run big data workloads. Amazon Elastic MapReduce A key tool in the toolbox to help with ‘Big Data’ challenges Makes possible analytics processes previously not feasible Cost effective when leveraged with EC2 spot market
  9. In summary, AWS provides you the tools so you can pick the right one at the scale that you need when you need it.