SlideShare a Scribd company logo
1 of 32
Download to read offline
Why Everything You Know About bigdata
Is A Lie
-Delivering Data Driven Business Insights
Adopt
MarketInnovate
Sunil S Ranka
Director – Big Data and Advance Analytics
Key Topics
 About Jade
 About Me
 What is Big Data
 Key Myths
 Why everything is Lie
 Real World Example
 Next Steps
Technology Projects750+
200+ Customers
Referenceable
100%
98%Customer
Retention
500IT professional
worldwide
Services High-Tech Manufacturing Energy Social Media & Entertainment
5 Global Delivery
Centers
8 Offices
Worldwide
Atlanta
Pune
Noida
San Jose
Los Angeles
London
Hyderabad
San Diego
Global Delivery Model Serving Many Industries
Strategic
Partnerships
Salesforce.com
Sales, Service, Marketing,
force.com
Testing
Tools/Frameworks
QC, QTP, Selenium, LoadRunner,
JIRA Bugzilla, JUnit, TestNG
Microsoft
Dynamics, SharePoint,
Office 365, Lync, BI
Custom Development
Java, .Net, J2EE, Product
Engineering, Open Source
Technologies
Integration
Oracle SOA, Tibco, Weblogic,
Oracle Cloud Platform, ICS, JCS,
Mulesoft, Dell Boomi
Infrastructure
Management
IBM AIX, HP-UX, RHEL,
OEL Linux, Windows Server
Cloud Financials, Projects, SCM,
HCM and EBS Financials,
Procurement, Value Chain, CRM,
Demantra, Agile, GRC
Oracle EBS Suite
ServiceNow
IT Service Automation Applications,
CreateNow Development Suite,
Orchestration, Discovery
Big Data & Analytics
Hadoop, KNIME, R, Tableau, Hadoop
Jade Global Clientele (Representative list)
Dilbert On Big Data
During a Data Strategy Session
About Me
• Venture Partner : Investing and Advisor with early stage startups focusing on Data.
• Director – Big Data and Advance Analytics
• Oracle ACE (Business Intelligence with Proficiency in Big Data)
• Extensively worked with fortune 500 leaders.
• Held positions of Head Of Product Development, Architect, etc.
• http://sranka.wordpress.com, sunil_ranka
• Featured Tech writer for IT Next Magazine.
• Speaking engagements at following conferences :
• COLLABORATE ( 2009, 2010 , 2011 ,2012, 2013,2015)
• BIWA SIG TechCast Series (2010 , 2011 , 2012, 2013,2014,2016),
• NorCal OAUG-2010 at Santa Clara Convention Center, CA
• Session speaker at NoCouG in San Francisco
• Oracle Open World ( 2009 , 2010 , 2012)
My Tag Line :: “Superior BI is the antidote to Business Failure”
Why Data Is Important
Data is the new Oil. Data is just like crude. It’s
valuable, but if unrefined it cannot really be used.
– Clive Humby, DunnHumby
11
We have for the first time an economy based on
a key resource [Information] that is not only renewable,
but self-generating. Running out of it is not a problem,
but drowning in it is.
– John Naisbitt
Big Data and Analytics is Helping
Smarter Revenue
Management
Smarter Healtcare
Analytics
$16Billion
Reduced
Improper Payment
Smarter Crime
Prevention
Helps detect life
threatening conditions
up to 24 hours sooner
30%
Cut
serious crime
by
Tax Agency
* Courtesy - IBM
Analytics Maturity Pyramid
No Reporting
Struggling to get basic information
Reactive Analytics
Concerned with current Issues
What Happened ?
Diagnostic Analytics
Hindsight
Why it Happened ?
Predictive Analytics
Insight
What will Happened?
Prescriptive Analytics
Foresight
What should I do ?
What is Big Data
Big data Represents new data features created by today’s Data Driven Organization for Decision
Making
volume
Variety
Velocity
Value
Data At Scale
Terabyte To Petabyte of Data
Data In Many Forms
Structured, unstructured, text, Media
Data In Motion
Analysis of stream data to make decision in real time
Data with Insight
Deriving valuable insight from the data
Characteristicsofbigdata
Harnessing Big Data
 OLTP: Online Transaction Processing (DBMSs)
 OLAP: Online Analytical Processing (Data Warehousing)
 RTAP: Real-Time Analytics Processing (Big Data Architecture & technology)
15
Who’s Generating Big Data
Social media and networks
(all of us are generating data)
Scientific instruments
(collecting all sorts of data)
Mobile devices
(tracking all objects all the time)
Sensor technology and networks
(measuring all kinds of data)
 The progress and innovation is no longer hindered by the ability to
collect data
 But, by the ability to manage, analyze, summarize, visualize, and discover
knowledge from the collected data in a timely manner and in a scalable
fashion
16
The Model Has Changed…
 The Model of Generating/Consuming Data has
Changed
Old Model: Few companies are generating data, all others are consuming data
New Model: all of us are generating data, and all of us are consuming data
17
What are the Myths
Myths
 Big data will change everything.
 Big data means 'a lot' of data
 Data lake is big Data
 Hive can be used for reporting
 Big Data is Only for Large Corporations
 You Need to Hire a Big Data Scientist to Start With Big Data
 Big Data Technology Will Eliminate the Need for Data
Integration
 The only cost for big data is hardware and software.
Myths Continues…
 Big data applications require little or no performance optimization.
 I don’t have enough data for big data.
 Big Data predicts the future.
 Hadoop is the Holy Grail of big data.
 Big data is an IT matter.
 Data warehouses aren’t needed for advanced analytics.
 Hadoop will replace enterprise data warehouses
 With huge volumes of data, small data quality issues are acceptable
What Really works
Big Data Needs Diversified Skill Sets
Math and
Operations Research
Expertise
Develop analytic algorithms
Visualization
Expertise
Interpret data sets,
determine correlations and
present in meaningful ways
Tool Developers
Mask complexity and
analytics to lower skills
boundaries
Industry Vertical
Domain Expertise
Develop hypothesis, identify
relevant business issues,
ask the right questions
Data Experts
Data architecture, management,
governance, policy
Decision Making
Executive and
Management
Apply information to solve
business issues
"By 2015, big data demand will reach 4.4 million jobs globally, but only one-third of those jobs will be filled."
Source: Gartner "Gartner's Top Predictions for IT Organizations and Users, 2013 and Beyond: Balancing Economics, Risk, Opportunity and Innovation" 19 Oct 2012
Industry Implementation Trends
Hybrid Approach
(Large Enterprises)
• Building Hybrid environments as
they want to leverage their existing
investments in their traditional
environments
• Setting up their own internal cloud
environments for security,
regulatory issues as well as to
achieve cloud benefits of simplicity
and elasticity
Migrating Legacy Applications
(Medium Enterprises)
• All new investments are in the
cloud
• Migrating existing on premise to
cloud based on ROI & Business
Objectives
Starting with Cloud
(Small & Startup Enterprises)
• Embracing cloud as they do not
have any legacy systems
Different Phase
• Expand to multiple usecase
• Establish IT SLAs, ROI Metrics and growth Plans
• Expand to more advanced predictive capabilities
• Enable a platform capable of managing greater
volumes and variety of data
• Look to partners to simplify and modernize existing
platform with cost-effective delivery models
• Optimize and integrate apps on converged data
platform
• Establish digital business practices as the new normal
supported by all key executive sponsors
• Provide detailed business SLAs, revenue targets, and
other financial targets
• Normalize data lifecycle/governance, data
monetization, microservice development
• Work with Business and identify usecase
• Commit dedicated resources to development and
operations
• Develop an agile project plan
• Educate business users on analytics
• Accelerate analytics knowledge and skills required to
support to value creation
• Use partners to supplement analytic skills gaps
•Understanding capability of big data ecosystem
•Develop Basic Skills in big Data Management
•Create a Pilot Use Case
•Establish leadership commitment
•Establish working infrastructure
Phase1
(Experimental)
Phase2
(Implementation)
Phase3
(Expansion)
Phase4
(Optimization)
Real World
Data Lake Reference Architecture
Data Lake
Measure
Normalization
and
integration
Master
Metadata
Feature
Surrogate
Keys
Key
Exists
Exception
Handling
Feature DataSet
Customer
Institution
Accounts
Measure Data Set
Key
Accounts
Partnership
Sales
GL
Margins
Derived/Aggregated Fact
Gross Margin
Aggregates
Unified
Customer View
Unified Sales
Views
Unified Partner
Views
Data Staging
Company 1
Data
Company 2
Data
Company 3
Data
Company 4
Data
Predictive Analytics Layer
(Machine Learning)
Predictive Analytics Outcome
- Customer Retention
- Cross Sell Up Sell
- Customer Segmentations
- Customer 360
- Revenue Forecast
- Customer Churn
Reusable
Jade
Connectors
Data Service
Layer
Real-Time
Analytics
Hour/Daily
Report
Weekly/
Monthly
Report
API Layer
Reporting
Layer
Data Lake
Consumption
Zone
Source
System
File Data
OB Data
ETL Extracts
Streaming
Transient
Loading Zone
Raw Data
Refined
Data
Trusted
Data
Discovery
Sandbox
Original unaltered
data attributes
Tokenized Data
APIs
Reference Data Master Data
Data Wrangling
Data Discovery
Exploratory Analytics
Metadata Data Quality Data Catalog Security
Hadoop Data Lake
Integrate to
common format
Data Validation
Data Cleansing
Aggregations
OLP or ODS
Enterprise Data
Warehouse
Logs
(or other unstructured
data)
Cloud Services
Business Analysts
Researchers
Data Scientists
Data Lake Reference Architecture
Where Does Big Data Fit In
Analytics Cloud/OnPrem
Data Cloud/OnPrem
Hive Metastore
Elastic Cloud HDFS
Infinite Compute
Hadoop/Spark
Ingest Transform Analyze
External
Dashboards
Internal
Dashboards
Tableau Excel R Zeppelin
Web interface for distributed users
Data set definition
Social metadata dictionary
Export Web interface to dash-
boarding, query, and
data dictionary
Integrated ingestion,
transformation, and
query application for
business analysts
World-class, elastic
Big Data infrastructure
Hybrid Analytics Cloud/On Premises
Analytics Cloud/OnPrem
Analytics Cloud/OnPrem
Hive Metastore
Elastic Cloud HDFS
Infinite Compute
Hadoop/Spark
External
Dashboards
Internal
Dashboards
Tableau Excel R Zeppelin
Web interface for distributed users
Data set definition
Social metadata dictionary
Export Web interface to dash-
boarding, query, and data
dictionary
Integrated ingestion,
transformation, and query
application for business
analysts
World-class, elastic
Big Data infrastructure
Build reports
and
dashboards
Build outgoing
connectors
Ingest Transform Analyze
Business
Analytics, data
science
training
Write ETL and
perform data
engineering
Build
connectors
Hybrid Analytics Cloud/OnPrem
How We Can Help

More Related Content

What's hot

Importance of Big data for your Business
Importance of Big data for your BusinessImportance of Big data for your Business
Importance of Big data for your Businessazuyo.com
 
Death of the Dashboard
Death of the DashboardDeath of the Dashboard
Death of the DashboardDATAVERSITY
 
Lecture on Data Science in a Data-Driven Culture
Lecture on Data Science in a Data-Driven Culture Lecture on Data Science in a Data-Driven Culture
Lecture on Data Science in a Data-Driven Culture Johan Himberg
 
Role of Data in Digital Transformation
Role of Data in Digital TransformationRole of Data in Digital Transformation
Role of Data in Digital TransformationVMware Tanzu
 
Slides: Case Study — How J.B. Hunt is Driving Efficiency with AI and Real-Tim...
Slides: Case Study — How J.B. Hunt is Driving Efficiency with AI and Real-Tim...Slides: Case Study — How J.B. Hunt is Driving Efficiency with AI and Real-Tim...
Slides: Case Study — How J.B. Hunt is Driving Efficiency with AI and Real-Tim...DATAVERSITY
 
Big data and your career final
Big data and your career finalBig data and your career final
Big data and your career finalMarina Kerbel
 
Nasscomilf2014 thedigitalenterprise-bigdataandanalyticsleadtheway-thomashdave...
Nasscomilf2014 thedigitalenterprise-bigdataandanalyticsleadtheway-thomashdave...Nasscomilf2014 thedigitalenterprise-bigdataandanalyticsleadtheway-thomashdave...
Nasscomilf2014 thedigitalenterprise-bigdataandanalyticsleadtheway-thomashdave...Sandra Fernandes
 
3D Data Strategy Framework
3D Data Strategy Framework3D Data Strategy Framework
3D Data Strategy FrameworkDaniel Ren
 
Smarter analytics101 v2.0.1
Smarter analytics101 v2.0.1Smarter analytics101 v2.0.1
Smarter analytics101 v2.0.1Jenawahl
 
Shane Greenstein Future Assembly 11/17/2015
Shane Greenstein Future Assembly 11/17/2015Shane Greenstein Future Assembly 11/17/2015
Shane Greenstein Future Assembly 11/17/2015Adrienne Debigare
 
Building an Effective Organizational Analytics Capability
Building an Effective Organizational Analytics CapabilityBuilding an Effective Organizational Analytics Capability
Building an Effective Organizational Analytics CapabilityJeff Crawford
 
Big data-comes-of-age ema-9sight
Big data-comes-of-age ema-9sightBig data-comes-of-age ema-9sight
Big data-comes-of-age ema-9sightJyrki Määttä
 
Getting down to business on Big Data analytics
Getting down to business on Big Data analyticsGetting down to business on Big Data analytics
Getting down to business on Big Data analyticsThe Marketing Distillery
 
Career Prospects and Scope of Data Science in India
Career Prospects and Scope of Data Science in IndiaCareer Prospects and Scope of Data Science in India
Career Prospects and Scope of Data Science in Indiaachaljain11
 
Why Data Science Projects Fail
Why Data Science Projects FailWhy Data Science Projects Fail
Why Data Science Projects FailSense Corp
 
5 Steps to Transform into a Data-Driven Organization - Ganes Kesari - Gramen...
 5 Steps to Transform into a Data-Driven Organization - Ganes Kesari - Gramen... 5 Steps to Transform into a Data-Driven Organization - Ganes Kesari - Gramen...
5 Steps to Transform into a Data-Driven Organization - Ganes Kesari - Gramen...Ganes Kesari
 
Data strategy demistifying data
Data strategy demistifying dataData strategy demistifying data
Data strategy demistifying dataHans Verstraeten
 
Data Driven Decisions: Building an Insight Driven Culture
Data Driven Decisions: Building an Insight Driven CultureData Driven Decisions: Building an Insight Driven Culture
Data Driven Decisions: Building an Insight Driven CultureAmazon Web Services
 

What's hot (19)

Analytics3.0 e book
Analytics3.0 e bookAnalytics3.0 e book
Analytics3.0 e book
 
Importance of Big data for your Business
Importance of Big data for your BusinessImportance of Big data for your Business
Importance of Big data for your Business
 
Death of the Dashboard
Death of the DashboardDeath of the Dashboard
Death of the Dashboard
 
Lecture on Data Science in a Data-Driven Culture
Lecture on Data Science in a Data-Driven Culture Lecture on Data Science in a Data-Driven Culture
Lecture on Data Science in a Data-Driven Culture
 
Role of Data in Digital Transformation
Role of Data in Digital TransformationRole of Data in Digital Transformation
Role of Data in Digital Transformation
 
Slides: Case Study — How J.B. Hunt is Driving Efficiency with AI and Real-Tim...
Slides: Case Study — How J.B. Hunt is Driving Efficiency with AI and Real-Tim...Slides: Case Study — How J.B. Hunt is Driving Efficiency with AI and Real-Tim...
Slides: Case Study — How J.B. Hunt is Driving Efficiency with AI and Real-Tim...
 
Big data and your career final
Big data and your career finalBig data and your career final
Big data and your career final
 
Nasscomilf2014 thedigitalenterprise-bigdataandanalyticsleadtheway-thomashdave...
Nasscomilf2014 thedigitalenterprise-bigdataandanalyticsleadtheway-thomashdave...Nasscomilf2014 thedigitalenterprise-bigdataandanalyticsleadtheway-thomashdave...
Nasscomilf2014 thedigitalenterprise-bigdataandanalyticsleadtheway-thomashdave...
 
3D Data Strategy Framework
3D Data Strategy Framework3D Data Strategy Framework
3D Data Strategy Framework
 
Smarter analytics101 v2.0.1
Smarter analytics101 v2.0.1Smarter analytics101 v2.0.1
Smarter analytics101 v2.0.1
 
Shane Greenstein Future Assembly 11/17/2015
Shane Greenstein Future Assembly 11/17/2015Shane Greenstein Future Assembly 11/17/2015
Shane Greenstein Future Assembly 11/17/2015
 
Building an Effective Organizational Analytics Capability
Building an Effective Organizational Analytics CapabilityBuilding an Effective Organizational Analytics Capability
Building an Effective Organizational Analytics Capability
 
Big data-comes-of-age ema-9sight
Big data-comes-of-age ema-9sightBig data-comes-of-age ema-9sight
Big data-comes-of-age ema-9sight
 
Getting down to business on Big Data analytics
Getting down to business on Big Data analyticsGetting down to business on Big Data analytics
Getting down to business on Big Data analytics
 
Career Prospects and Scope of Data Science in India
Career Prospects and Scope of Data Science in IndiaCareer Prospects and Scope of Data Science in India
Career Prospects and Scope of Data Science in India
 
Why Data Science Projects Fail
Why Data Science Projects FailWhy Data Science Projects Fail
Why Data Science Projects Fail
 
5 Steps to Transform into a Data-Driven Organization - Ganes Kesari - Gramen...
 5 Steps to Transform into a Data-Driven Organization - Ganes Kesari - Gramen... 5 Steps to Transform into a Data-Driven Organization - Ganes Kesari - Gramen...
5 Steps to Transform into a Data-Driven Organization - Ganes Kesari - Gramen...
 
Data strategy demistifying data
Data strategy demistifying dataData strategy demistifying data
Data strategy demistifying data
 
Data Driven Decisions: Building an Insight Driven Culture
Data Driven Decisions: Building an Insight Driven CultureData Driven Decisions: Building an Insight Driven Culture
Data Driven Decisions: Building an Insight Driven Culture
 

Similar to Why Everything You Know About bigdata Is A Lie

Big Data : From HindSight to Insight to Foresight
Big Data : From HindSight to Insight to ForesightBig Data : From HindSight to Insight to Foresight
Big Data : From HindSight to Insight to ForesightSunil Ranka
 
It’s Not About Big Data – It’s About Big Insights - SAP Webinar - 20 Aug 201...
 It’s Not About Big Data – It’s About Big Insights - SAP Webinar - 20 Aug 201... It’s Not About Big Data – It’s About Big Insights - SAP Webinar - 20 Aug 201...
It’s Not About Big Data – It’s About Big Insights - SAP Webinar - 20 Aug 201...Edgar Alejandro Villegas
 
Day 2 aziz apj aziz_big_datakeynote_press
Day 2 aziz apj aziz_big_datakeynote_pressDay 2 aziz apj aziz_big_datakeynote_press
Day 2 aziz apj aziz_big_datakeynote_pressIntelAPAC
 
From Customer Insights to Action
From Customer Insights to ActionFrom Customer Insights to Action
From Customer Insights to ActionCapgemini
 
Big Data, Big Thinking: Untapped Opportunities
Big Data, Big Thinking: Untapped OpportunitiesBig Data, Big Thinking: Untapped Opportunities
Big Data, Big Thinking: Untapped OpportunitiesSAP Technology
 
big data analytics pgpmx2015
big data analytics pgpmx2015big data analytics pgpmx2015
big data analytics pgpmx2015Sanmeet Dhokay
 
final oracle presentation
final oracle presentationfinal oracle presentation
final oracle presentationPriyesh Patel
 
IBM Solutions Connect 2013 - Getting started with Big Data
IBM Solutions Connect 2013 - Getting started with Big DataIBM Solutions Connect 2013 - Getting started with Big Data
IBM Solutions Connect 2013 - Getting started with Big DataIBM Software India
 
Big Data Analytics in light of Financial Industry
Big Data Analytics in light of Financial Industry Big Data Analytics in light of Financial Industry
Big Data Analytics in light of Financial Industry Capgemini
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big dataRaul Chong
 
Big Data Trends and Challenges Report - Whitepaper
Big Data Trends and Challenges Report - WhitepaperBig Data Trends and Challenges Report - Whitepaper
Big Data Trends and Challenges Report - WhitepaperVasu S
 
Big Data - Bridging Technology and Humans
Big Data - Bridging Technology and HumansBig Data - Bridging Technology and Humans
Big Data - Bridging Technology and HumansMark Laurance
 
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...Denodo
 
Building a Business Case for Innovation: Project Considerations for Cloud, Mo...
Building a Business Case for Innovation: Project Considerations for Cloud, Mo...Building a Business Case for Innovation: Project Considerations for Cloud, Mo...
Building a Business Case for Innovation: Project Considerations for Cloud, Mo...Fred Isbell
 
Analytics Service Framework
Analytics Service Framework Analytics Service Framework
Analytics Service Framework Vishwanath Ramdas
 
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTXCustomer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTXtsigitnist02
 

Similar to Why Everything You Know About bigdata Is A Lie (20)

Big Data : From HindSight to Insight to Foresight
Big Data : From HindSight to Insight to ForesightBig Data : From HindSight to Insight to Foresight
Big Data : From HindSight to Insight to Foresight
 
It’s Not About Big Data – It’s About Big Insights - SAP Webinar - 20 Aug 201...
 It’s Not About Big Data – It’s About Big Insights - SAP Webinar - 20 Aug 201... It’s Not About Big Data – It’s About Big Insights - SAP Webinar - 20 Aug 201...
It’s Not About Big Data – It’s About Big Insights - SAP Webinar - 20 Aug 201...
 
Day 2 aziz apj aziz_big_datakeynote_press
Day 2 aziz apj aziz_big_datakeynote_pressDay 2 aziz apj aziz_big_datakeynote_press
Day 2 aziz apj aziz_big_datakeynote_press
 
From Customer Insights to Action
From Customer Insights to ActionFrom Customer Insights to Action
From Customer Insights to Action
 
Big Data, Big Thinking: Untapped Opportunities
Big Data, Big Thinking: Untapped OpportunitiesBig Data, Big Thinking: Untapped Opportunities
Big Data, Big Thinking: Untapped Opportunities
 
The value of our data
The value of our dataThe value of our data
The value of our data
 
big data analytics pgpmx2015
big data analytics pgpmx2015big data analytics pgpmx2015
big data analytics pgpmx2015
 
final oracle presentation
final oracle presentationfinal oracle presentation
final oracle presentation
 
IBM Solutions Connect 2013 - Getting started with Big Data
IBM Solutions Connect 2013 - Getting started with Big DataIBM Solutions Connect 2013 - Getting started with Big Data
IBM Solutions Connect 2013 - Getting started with Big Data
 
Big Data Analytics in light of Financial Industry
Big Data Analytics in light of Financial Industry Big Data Analytics in light of Financial Industry
Big Data Analytics in light of Financial Industry
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
 
Big Data Trends and Challenges Report - Whitepaper
Big Data Trends and Challenges Report - WhitepaperBig Data Trends and Challenges Report - Whitepaper
Big Data Trends and Challenges Report - Whitepaper
 
Big Data - Bridging Technology and Humans
Big Data - Bridging Technology and HumansBig Data - Bridging Technology and Humans
Big Data - Bridging Technology and Humans
 
Big Data at a Glance
Big Data at a GlanceBig Data at a Glance
Big Data at a Glance
 
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
 
Data is not the new snake oil
Data is not the new snake oilData is not the new snake oil
Data is not the new snake oil
 
Building a Business Case for Innovation: Project Considerations for Cloud, Mo...
Building a Business Case for Innovation: Project Considerations for Cloud, Mo...Building a Business Case for Innovation: Project Considerations for Cloud, Mo...
Building a Business Case for Innovation: Project Considerations for Cloud, Mo...
 
Analytics Service Framework
Analytics Service Framework Analytics Service Framework
Analytics Service Framework
 
Big data
Big dataBig data
Big data
 
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTXCustomer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
 

Recently uploaded

Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integrationmarketing932765
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)Mark Simos
 
Microservices, Docker deploy and Microservices source code in C#
Microservices, Docker deploy and Microservices source code in C#Microservices, Docker deploy and Microservices source code in C#
Microservices, Docker deploy and Microservices source code in C#Karmanjay Verma
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...Jeffrey Haguewood
 
Digital Tools & AI in Career Development
Digital Tools & AI in Career DevelopmentDigital Tools & AI in Career Development
Digital Tools & AI in Career DevelopmentMahmoud Rabie
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Kaya Weers
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructureitnewsafrica
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
Kuma Meshes Part I - The basics - A tutorial
Kuma Meshes Part I - The basics - A tutorialKuma Meshes Part I - The basics - A tutorial
Kuma Meshes Part I - The basics - A tutorialJoão Esperancinha
 
Accelerating Enterprise Software Engineering with Platformless
Accelerating Enterprise Software Engineering with PlatformlessAccelerating Enterprise Software Engineering with Platformless
Accelerating Enterprise Software Engineering with PlatformlessWSO2
 
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...itnewsafrica
 
4. Cobus Valentine- Cybersecurity Threats and Solutions for the Public Sector
4. Cobus Valentine- Cybersecurity Threats and Solutions for the Public Sector4. Cobus Valentine- Cybersecurity Threats and Solutions for the Public Sector
4. Cobus Valentine- Cybersecurity Threats and Solutions for the Public Sectoritnewsafrica
 
Infrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platformsInfrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platformsYoss Cohen
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkPixlogix Infotech
 

Recently uploaded (20)

Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
 
Microservices, Docker deploy and Microservices source code in C#
Microservices, Docker deploy and Microservices source code in C#Microservices, Docker deploy and Microservices source code in C#
Microservices, Docker deploy and Microservices source code in C#
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
 
Digital Tools & AI in Career Development
Digital Tools & AI in Career DevelopmentDigital Tools & AI in Career Development
Digital Tools & AI in Career Development
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
Kuma Meshes Part I - The basics - A tutorial
Kuma Meshes Part I - The basics - A tutorialKuma Meshes Part I - The basics - A tutorial
Kuma Meshes Part I - The basics - A tutorial
 
Accelerating Enterprise Software Engineering with Platformless
Accelerating Enterprise Software Engineering with PlatformlessAccelerating Enterprise Software Engineering with Platformless
Accelerating Enterprise Software Engineering with Platformless
 
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...
 
4. Cobus Valentine- Cybersecurity Threats and Solutions for the Public Sector
4. Cobus Valentine- Cybersecurity Threats and Solutions for the Public Sector4. Cobus Valentine- Cybersecurity Threats and Solutions for the Public Sector
4. Cobus Valentine- Cybersecurity Threats and Solutions for the Public Sector
 
Infrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platformsInfrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platforms
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App Framework
 

Why Everything You Know About bigdata Is A Lie

  • 1. Why Everything You Know About bigdata Is A Lie -Delivering Data Driven Business Insights Adopt MarketInnovate Sunil S Ranka Director – Big Data and Advance Analytics
  • 2. Key Topics  About Jade  About Me  What is Big Data  Key Myths  Why everything is Lie  Real World Example  Next Steps
  • 4. Services High-Tech Manufacturing Energy Social Media & Entertainment 5 Global Delivery Centers 8 Offices Worldwide Atlanta Pune Noida San Jose Los Angeles London Hyderabad San Diego Global Delivery Model Serving Many Industries
  • 5. Strategic Partnerships Salesforce.com Sales, Service, Marketing, force.com Testing Tools/Frameworks QC, QTP, Selenium, LoadRunner, JIRA Bugzilla, JUnit, TestNG Microsoft Dynamics, SharePoint, Office 365, Lync, BI Custom Development Java, .Net, J2EE, Product Engineering, Open Source Technologies Integration Oracle SOA, Tibco, Weblogic, Oracle Cloud Platform, ICS, JCS, Mulesoft, Dell Boomi Infrastructure Management IBM AIX, HP-UX, RHEL, OEL Linux, Windows Server Cloud Financials, Projects, SCM, HCM and EBS Financials, Procurement, Value Chain, CRM, Demantra, Agile, GRC Oracle EBS Suite ServiceNow IT Service Automation Applications, CreateNow Development Suite, Orchestration, Discovery Big Data & Analytics Hadoop, KNIME, R, Tableau, Hadoop
  • 6. Jade Global Clientele (Representative list)
  • 8. During a Data Strategy Session
  • 9. About Me • Venture Partner : Investing and Advisor with early stage startups focusing on Data. • Director – Big Data and Advance Analytics • Oracle ACE (Business Intelligence with Proficiency in Big Data) • Extensively worked with fortune 500 leaders. • Held positions of Head Of Product Development, Architect, etc. • http://sranka.wordpress.com, sunil_ranka • Featured Tech writer for IT Next Magazine. • Speaking engagements at following conferences : • COLLABORATE ( 2009, 2010 , 2011 ,2012, 2013,2015) • BIWA SIG TechCast Series (2010 , 2011 , 2012, 2013,2014,2016), • NorCal OAUG-2010 at Santa Clara Convention Center, CA • Session speaker at NoCouG in San Francisco • Oracle Open World ( 2009 , 2010 , 2012) My Tag Line :: “Superior BI is the antidote to Business Failure”
  • 10. Why Data Is Important
  • 11. Data is the new Oil. Data is just like crude. It’s valuable, but if unrefined it cannot really be used. – Clive Humby, DunnHumby 11 We have for the first time an economy based on a key resource [Information] that is not only renewable, but self-generating. Running out of it is not a problem, but drowning in it is. – John Naisbitt
  • 12. Big Data and Analytics is Helping Smarter Revenue Management Smarter Healtcare Analytics $16Billion Reduced Improper Payment Smarter Crime Prevention Helps detect life threatening conditions up to 24 hours sooner 30% Cut serious crime by Tax Agency * Courtesy - IBM
  • 13. Analytics Maturity Pyramid No Reporting Struggling to get basic information Reactive Analytics Concerned with current Issues What Happened ? Diagnostic Analytics Hindsight Why it Happened ? Predictive Analytics Insight What will Happened? Prescriptive Analytics Foresight What should I do ?
  • 14. What is Big Data Big data Represents new data features created by today’s Data Driven Organization for Decision Making volume Variety Velocity Value Data At Scale Terabyte To Petabyte of Data Data In Many Forms Structured, unstructured, text, Media Data In Motion Analysis of stream data to make decision in real time Data with Insight Deriving valuable insight from the data Characteristicsofbigdata
  • 15. Harnessing Big Data  OLTP: Online Transaction Processing (DBMSs)  OLAP: Online Analytical Processing (Data Warehousing)  RTAP: Real-Time Analytics Processing (Big Data Architecture & technology) 15
  • 16. Who’s Generating Big Data Social media and networks (all of us are generating data) Scientific instruments (collecting all sorts of data) Mobile devices (tracking all objects all the time) Sensor technology and networks (measuring all kinds of data)  The progress and innovation is no longer hindered by the ability to collect data  But, by the ability to manage, analyze, summarize, visualize, and discover knowledge from the collected data in a timely manner and in a scalable fashion 16
  • 17. The Model Has Changed…  The Model of Generating/Consuming Data has Changed Old Model: Few companies are generating data, all others are consuming data New Model: all of us are generating data, and all of us are consuming data 17
  • 18. What are the Myths
  • 19. Myths  Big data will change everything.  Big data means 'a lot' of data  Data lake is big Data  Hive can be used for reporting  Big Data is Only for Large Corporations  You Need to Hire a Big Data Scientist to Start With Big Data  Big Data Technology Will Eliminate the Need for Data Integration  The only cost for big data is hardware and software.
  • 20. Myths Continues…  Big data applications require little or no performance optimization.  I don’t have enough data for big data.  Big Data predicts the future.  Hadoop is the Holy Grail of big data.  Big data is an IT matter.  Data warehouses aren’t needed for advanced analytics.  Hadoop will replace enterprise data warehouses  With huge volumes of data, small data quality issues are acceptable
  • 22. Big Data Needs Diversified Skill Sets Math and Operations Research Expertise Develop analytic algorithms Visualization Expertise Interpret data sets, determine correlations and present in meaningful ways Tool Developers Mask complexity and analytics to lower skills boundaries Industry Vertical Domain Expertise Develop hypothesis, identify relevant business issues, ask the right questions Data Experts Data architecture, management, governance, policy Decision Making Executive and Management Apply information to solve business issues "By 2015, big data demand will reach 4.4 million jobs globally, but only one-third of those jobs will be filled." Source: Gartner "Gartner's Top Predictions for IT Organizations and Users, 2013 and Beyond: Balancing Economics, Risk, Opportunity and Innovation" 19 Oct 2012
  • 23. Industry Implementation Trends Hybrid Approach (Large Enterprises) • Building Hybrid environments as they want to leverage their existing investments in their traditional environments • Setting up their own internal cloud environments for security, regulatory issues as well as to achieve cloud benefits of simplicity and elasticity Migrating Legacy Applications (Medium Enterprises) • All new investments are in the cloud • Migrating existing on premise to cloud based on ROI & Business Objectives Starting with Cloud (Small & Startup Enterprises) • Embracing cloud as they do not have any legacy systems
  • 24. Different Phase • Expand to multiple usecase • Establish IT SLAs, ROI Metrics and growth Plans • Expand to more advanced predictive capabilities • Enable a platform capable of managing greater volumes and variety of data • Look to partners to simplify and modernize existing platform with cost-effective delivery models • Optimize and integrate apps on converged data platform • Establish digital business practices as the new normal supported by all key executive sponsors • Provide detailed business SLAs, revenue targets, and other financial targets • Normalize data lifecycle/governance, data monetization, microservice development • Work with Business and identify usecase • Commit dedicated resources to development and operations • Develop an agile project plan • Educate business users on analytics • Accelerate analytics knowledge and skills required to support to value creation • Use partners to supplement analytic skills gaps •Understanding capability of big data ecosystem •Develop Basic Skills in big Data Management •Create a Pilot Use Case •Establish leadership commitment •Establish working infrastructure Phase1 (Experimental) Phase2 (Implementation) Phase3 (Expansion) Phase4 (Optimization)
  • 26. Data Lake Reference Architecture Data Lake Measure Normalization and integration Master Metadata Feature Surrogate Keys Key Exists Exception Handling Feature DataSet Customer Institution Accounts Measure Data Set Key Accounts Partnership Sales GL Margins Derived/Aggregated Fact Gross Margin Aggregates Unified Customer View Unified Sales Views Unified Partner Views Data Staging Company 1 Data Company 2 Data Company 3 Data Company 4 Data Predictive Analytics Layer (Machine Learning) Predictive Analytics Outcome - Customer Retention - Cross Sell Up Sell - Customer Segmentations - Customer 360 - Revenue Forecast - Customer Churn Reusable Jade Connectors Data Service Layer Real-Time Analytics Hour/Daily Report Weekly/ Monthly Report API Layer Reporting Layer Data Lake
  • 27. Consumption Zone Source System File Data OB Data ETL Extracts Streaming Transient Loading Zone Raw Data Refined Data Trusted Data Discovery Sandbox Original unaltered data attributes Tokenized Data APIs Reference Data Master Data Data Wrangling Data Discovery Exploratory Analytics Metadata Data Quality Data Catalog Security Hadoop Data Lake Integrate to common format Data Validation Data Cleansing Aggregations OLP or ODS Enterprise Data Warehouse Logs (or other unstructured data) Cloud Services Business Analysts Researchers Data Scientists Data Lake Reference Architecture
  • 28. Where Does Big Data Fit In
  • 29.
  • 30. Analytics Cloud/OnPrem Data Cloud/OnPrem Hive Metastore Elastic Cloud HDFS Infinite Compute Hadoop/Spark Ingest Transform Analyze External Dashboards Internal Dashboards Tableau Excel R Zeppelin Web interface for distributed users Data set definition Social metadata dictionary Export Web interface to dash- boarding, query, and data dictionary Integrated ingestion, transformation, and query application for business analysts World-class, elastic Big Data infrastructure Hybrid Analytics Cloud/On Premises
  • 31. Analytics Cloud/OnPrem Analytics Cloud/OnPrem Hive Metastore Elastic Cloud HDFS Infinite Compute Hadoop/Spark External Dashboards Internal Dashboards Tableau Excel R Zeppelin Web interface for distributed users Data set definition Social metadata dictionary Export Web interface to dash- boarding, query, and data dictionary Integrated ingestion, transformation, and query application for business analysts World-class, elastic Big Data infrastructure Build reports and dashboards Build outgoing connectors Ingest Transform Analyze Business Analytics, data science training Write ETL and perform data engineering Build connectors Hybrid Analytics Cloud/OnPrem
  • 32. How We Can Help

Editor's Notes

  1. Oil which is the fuel for modern economy for centuries. However, Oil in its raw form has little value. It needs to be refined and separated into a large number of consumer products, from petrol and kerosene to asphalt and chemical reagents used to make plastics and pharmaceuticals. It is also used in manufacturing a wide variety of materials. Big Data is just like oil, in it’s raw form it provide no value to enterprise, until it is processed and valuable and actionable business insights are “distilled”. Just like the technology that made available 100 years ago to discover oil and process it to consumable products. Big Data technology is going to transform and revolutionize the way enterprise get and use.