© Copyright 2000-2014 TIBCO Software Inc.
Hadoop and Data Warehouse –
Friends, Enemies or Profiteers?
What about Real Time?
Kai Wähner
kwaehner@tibco.com
@KaiWaehner
www.kai-waehner.de
Disclaimer
These opinions are my own and do not necessarily represent those of my employer.
Key Messages
Big Data is not just Hadoop, concentrate on Business Value!
A good Big Data Architecture combines DWH, Hadoop and Real Time!
The Integration Layer is getting even more important in the Big Data Era!
Agenda


•  Terminology 
•  Data Warehouse and Business Intelligence
•  Big Data Processing with Hadoop
•  Big Data Processing in Real Time
Big Data Architecture
[Diagram: DWH / BI + Hadoop + Real Time = Big Data Architecture]
DWH means analyzing OLAP Cubes
http://www.exforsys.com/tutorials/msas/data-warehouse-database-and-oltp-database.html
Big Data means analyzing Everything
http://blogs.teradata.com/international/tag/hadoop/
•  Store everything
•  Even without structure
•  Use whatever you need (now or later)
Big Data: Three shifts in the Way we analyze Information
•  Messiness: Using ALL data, not just samples
–  Also bad data (e.g. Word spell checker, Google auto-complete and „did you mean...“ recommendation)
•  Correlations: Instead of causalities
–  May not tell us WHY something is happening, but THAT it is happening
–  In many situations, this is good enough
–  What drug substance cures cancer? When should I buy an airplane ticket?

•  Datafication: Store, process, combine, reuse, enhance all data!
–  Digitalisation (Amazon Kindle → Read) vs. Datafication (Google Books → Read, Search, Process, ...)
–  Words become data: Google Books: not just read, but also search, analyse, etc.
–  Locations become data: GPS: not just navigation, but also insurance costs, economic routes, etc.
What is Big Data? The combined Vs of Big Data
[Diagram: Volume (terabytes, petabytes) X Variety (social networks, blog posts, logs, sensors, etc.) X Velocity (real time), combined into Value]
Real Time
Wikipedia Definition:
•  Real time programs must guarantee response within strict time constraints, often referred to as "deadlines". Real time responses are often understood to be in the order of milliseconds, and sometimes microseconds.
•  The term "near real time" refers to the time delay introduced by automated data processing or network transmission.
•  The distinction between the terms "near real time" and "real time" is somewhat nebulous and must be defined for the situation at hand.
For this talk, I define:
–  Real time == response in nanoseconds || microseconds || milliseconds || <= one second
–  Near real time == (response time > one second)
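The working definition above can be written down as a tiny classifier. This is only a sketch of the talk's own threshold (one second); the function name `classify_latency` and the sample values are assumptions made for illustration.

```python
def classify_latency(response_seconds: float) -> str:
    """Classify a response time using the talk's working definition."""
    if response_seconds < 0:
        raise ValueError("response time cannot be negative")
    # Everything up to and including one second counts as "real time".
    return "real time" if response_seconds <= 1.0 else "near real time"


# Illustrative sample latencies in seconds (not measurements).
for latency in (0.000002, 0.25, 1.0, 4.0):
    print(latency, "->", classify_latency(latency))
```

With this definition a Hadoop query that answers in four seconds is "near real time", while a stream processing engine that reacts in milliseconds is "real time".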
DWH vs. BI
•  Data Warehouse (DWH) → Storage
•  Business Intelligence (BI) → Analytics

•  Both terms are often used as synonyms, i.e. when someone talks about a DWH, this might include analytics
•  BI can be used without a DWH
Typical DWH Process
http://wikibon.org/blog/not-your-fathers-data-analytics/

A DWH is „Business Case driven“:
•  Reporting
•  Dashboards
•  Drill Down Analytics

Different DWH Options:
•  Enterprise DWH (== EDW)
•  Department / Project DWH
•  Embedded BI (into Applications)
BI == Reporting + Statistics + Data Discovery
[Diagram labels: DWH, BI]
BI Visualization
Products
DWH 
•  SQL: e.g. MySQL
•  MPP: e.g. Teradata, EMC Greenplum, IBM Netezza 
–  Scale very well (almost linear), very high performance, hardware / software costs
also increase a lot

BI 
•  Microsoft Excel
•  BI Tools: e.g. TIBCO Spotfire, Tableau, MicroStrategy

Hint: Good BI tools
•  allow data discovery / visualization using different sources, not just DWH
•  are easy to use
BI Tool Example: TIBCO Spotfire

The whole team needs analytics. Spotfire is for everyone, helping users with a variety of skill levels to visualize, explore and share information. It has:

•  At-a-glance business facts for managers
•  Dashboards for front-line decision-makers
•  Visual discovery for business users
•  Deep data exploration for analysts
•  Advanced predictive analytics for statisticians
•  And beautiful visualizations to impress your executives
Example: TIBCO Spotfire
Live Demo
„TIBCO Spotfire“ in action...
DWH Real World Use Case
http://spotfire.tibco.com/resources/content-center?Content%20Type=Case%20Studies
Embedded BI Real World Use Case
https://www.jaspersoft.com/embeddedShowcase/periscope.html
Problems of a DWH
No flexibility / agility
•  Just structured data
•  Just some (maybe aggregated) historical data
•  Just good for already known business cases

Low speed
•  ETL is batch, usually takes hours or sometimes even days
•  No proactive reactions possible → “too late architecture”

High costs (per GB)
•  Just selected data
•  Data that is too old is often moved to archives
Classic BI vs. Big Data BI
Why no longer DWH, but Hadoop?
Hadoop was built to solve problems of RDBMS and DWH…

Benefits of Hadoop:
•  Store and analyze all data
–  all data == not just selected (maybe aggregated) data
–  all data == structured + semi-structured + unstructured
→ be more flexible, adapt to changing business cases
•  Better performance (massively parallel)
•  Ad hoc data discovery – also for big data volumes
•  Save money (commodity hardware, open source software)
What is Hadoop?
Apache Hadoop, an open-source software library, is a
framework that allows for the distributed processing of
large data sets across clusters of commodity hardware
using simple programming models. It is designed to scale
up from single servers to thousands of machines, each
offering local computation and storage.
MapReduce
Simple example:

•  Input: (very large) text files with lists of strings, such as:
„318, 0043012650999991949032412004...0500001N9+01111+99999999999...“
•  We are interested in just some of the content: year and temperature (marked in red on the original slide)
•  The MapReduce function has to compute the maximum temperature for every year
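The example above can be sketched in plain Python to show the three phases (map, shuffle, reduce). This is not Hadoop code and does not parse the record format shown on the slide: it assumes a simplified, hypothetical layout in which the first four characters are the year and the rest is a signed temperature. All function names are illustrative.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_record(line: str) -> Iterator[Tuple[str, int]]:
    """Map phase: emit (year, temperature) for one input record.

    Assumed layout (hypothetical): chars 0-3 = year, rest = signed temperature.
    """
    yield line[:4], int(line[4:])


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Shuffle phase: group all emitted values by key (the year)."""
    grouped: Dict[str, List[int]] = defaultdict(list)
    for year, temperature in pairs:
        grouped[year].append(temperature)
    return grouped


def reduce_max(year: str, temperatures: List[int]) -> Tuple[str, int]:
    """Reduce phase: keep only the maximum temperature per year."""
    return year, max(temperatures)


def max_temperature_per_year(lines: Iterable[str]) -> Dict[str, int]:
    pairs = (pair for line in lines for pair in map_record(line))
    return dict(reduce_max(y, temps) for y, temps in shuffle(pairs).items())


# Illustrative records, not real weather data.
records = ["1949+0111", "1949+0078", "1950-0011", "1950+0022"]
print(max_temperature_per_year(records))
```

In a real Hadoop job the map and reduce functions run in parallel on many machines and the framework performs the shuffle; the single-process version here only shows the data flow.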
  
Hadoop Products
[Diagram: products ordered by features included (few → many). Apache Hadoop: MapReduce, HDFS, Ecosystem]
Hadoop Ecosystem
Hadoop Products
[Diagram: products ordered by features included (few → many). Apache Hadoop (MapReduce, HDFS, Ecosystem) → Hadoop Distribution (+ Packaging, Deployment-Tooling, Support)]
Hadoop Distributions
[Vendor logos, including EMR] (… some more arising)
Hadoop Products
[Diagram: products ordered by features included (few → many). Apache Hadoop (MapReduce, HDFS, Ecosystem) → Hadoop Distribution (+ Packaging, Deployment-Tooling, Support) → Big Data Suite (+ Tooling / Modeling, Code Generation, Scheduling, Integration)]
Big Data Integration Suite: TIBCO BusinessWorks
Live Demo
„TIBCO BusinessWorks“ in action...
Hadoop Real World Use Case:
Replace ETL to improve Performance
“The advantage of their new system is that they can now look at their
data [from their log processing system] in anyway they want:
•  Nightly MapReduce jobs collect statistics about their mail system such as spam counts by
domain, bytes transferred and number of logins. 
•  When they wanted to find out which part of the world their customers logged in from, a quick
[ad hoc] MapReduce job was created and they had the answer within a few hours. Not really
possible in your typical ETL system.”

http://highscalability.com/how-rackspace-now-uses-mapreduce-and-hadoop-query-terabytes-data
(no TIBCO reference)
Hadoop Real World Use Case:
Storage to reduce Costs
Global Parcel Service
•  A lot of data must be stored „forever“
•  Numbers increase exponentially
•  Goal: As cheap as possible
•  Problem: Queries must still be possible (compliance!)
•  Solution: Commodity servers and „Hadoop querying“
http://archive.org/stream/BigDataImPraxiseinsatz-SzenarienBeispieleEffekte/Big_Data_BITKOM-Leitfaden_Sept.2012#page/n0/mode/2up
(no TIBCO reference)
DWH or Hadoop?

            DWH                                         Hadoop
Data        Structured                                  All data
Maturity    Established in Enterprise                   New concepts
Tooling     Installed, good knowledge and experience    New tools, coding required, business can still use SQL-like queries or same BI tool
Costs       High (per GB)                               Low (per GB)
DWH plus Hadoop?
DWH and Hadoop complement each other very well
•  Store all data in Hadoop (cheap per GB)
•  ETL from Hadoop to DWH (expensive per GB)
•  Create specific reports / dashboards in DWH (leverage existing products and knowledge)
•  Do Ad Hoc (Big) Data Discovery directly in Hadoop, no DWH needed

Good BI tools support both, DWH and Hadoop!

For example, TIBCO Spotfire has connectors to: 
•  RDBMS (e.g. MySQL)
•  MPP (e.g. Teradata, IBM Netezza, Greenplum)
•  Hadoop (e.g. Hive, Impala)
•  In-Memory (e.g. TIBCO ActiveSpaces, SAP HANA)
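The division of labour described on this slide (keep all raw data in Hadoop, load only aggregates into the DWH, build reports there) can be sketched as follows. This is a toy stand-in under stated assumptions: a Python list plays the role of raw data in Hadoop, an in-memory SQLite database plays the role of the DWH, and the table name `logins_by_country` and all sample events are hypothetical.

```python
import sqlite3
from collections import Counter

# Stand-in for raw, loosely structured data kept cheaply in Hadoop.
raw_events = [
    {"country": "DE", "action": "login"},
    {"country": "DE", "action": "login"},
    {"country": "US", "action": "login"},
    {"country": "US", "action": "logout", "note": "free text is kept too"},
]


def aggregate_logins(events):
    """ETL transform step: reduce raw events to one count per country."""
    return Counter(e["country"] for e in events if e["action"] == "login")


def load_into_dwh(conn, counts):
    """ETL load step: write only the small aggregate into the DWH table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS logins_by_country "
        "(country TEXT PRIMARY KEY, logins INTEGER)"
    )
    conn.executemany(
        "INSERT OR REPLACE INTO logins_by_country VALUES (?, ?)",
        sorted(counts.items()),
    )
    conn.commit()


dwh = sqlite3.connect(":memory:")  # stand-in for the (expensive per GB) DWH
load_into_dwh(dwh, aggregate_logins(raw_events))

# A "specific report" is then a plain SQL query against the DWH.
report = dwh.execute(
    "SELECT country, logins FROM logins_by_country ORDER BY country"
).fetchall()
print(report)
```

The raw events, including fields nobody has a report for yet, stay available for ad hoc discovery; only the aggregate occupies DWH storage.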
Recommendation DWH vs. Hadoop vs. XYZ
•  Short term:
Use Hadoop (only) when you can save (a lot of) money or when you cannot solve your business problem without Hadoop. A lot of things have to be improved, e.g. governance, security, performance, and tool support.

•  Long term:
Hadoop can replace DWH (as you can create a DWH on top of Hadoop with SQL interface already today)!

•  Be aware:
A lot of other options emerge for analyzing big data besides Hadoop, e.g.
–  Analytical databases with SQL interface (MemSQL, Citus Data)
–  Log Analytics (Splunk, TIBCO LogLogic)
–  Graph databases (Neo4j, InfiniteGraph)
Vendors Strategy...
Hadoop vendors push Hadoop as DWH replacement
→ Called e.g. „Enterprise Data Hub“ (Cloudera) or „Data Lake“ (Hortonworks)

http://gigaom.com/2013/10/29/clouderas-plan-to-become-the-center-of-your-data-universe/
http://hortonworks.com/wp-content/uploads/downloads/2013/04/Hortonworks.ApacheHadoopPatternsOfUse.v1.0.pdf
Vendors Strategy...
MPP / DWH vendors add Hadoop support as a complementary add-on to their DWH
→  Reason (probably): Market pressure!

→  Benefit: One platform (including tooling and support) for DWH and Hadoop
Example: EMC combines DWH and Hadoop
http://wikibon.org/wiki/v/EMC_Integrates_Greenplum_DB_and_Hadoop_with_Pivotal_HD
http://www.gopivotal.com/big-data/pivotal-hd
Example: Teradata combines DWH and Hadoop
http://www.teradata.com/Teradata-Enterprise-Access-for-Hadoop/
http://gigaom.com/2014/04/07/teradata-says-hadoop-is-good-for-business-but-for-how-long/
Hadoop evolving from Batch to Near Real Time 

Hadoop is MapReduce == Batch (== hours, minutes, seconds)
•  Good for complex transformations / computations of big data volumes 
•  Not so good for ad hoc data exploration
•  Improvements: Hive Stinger (Hortonworks) etc.

Non-MapReduce processing engines added in the meantime (YARN makes it possible) 
•  Ad hoc data discovery (== seconds)
•  Hive / Pig with Apache Tez replacing MapReduce under the hood for data processing
•  New Query engines, e.g. Impala (Cloudera) or Apache Drill (MapR)

MPP vendors (e.g. Teradata, EMC Greenplum) also add own query engines 
•  Offer fast data exploration (without MapReduce)

Some Hadoop problems remain
•  No good, easy tooling (Hadoop ecosystem) → might be solved in the coming years
•  Missing maturity (alpha / beta versions) → might be solved in the coming years
•  No “real time” (== ms, ns), but “near real time” (> 1 sec) → “too late architecture”
Real Time: “The Two-Second Advantage”
“A little bit of the right information, just a little bit beforehand – whether it is a couple of seconds, minutes or hours – is more valuable than all of the information in the world six months later… this is the two-second advantage.”
Vivek Ranadivé, Founder and CEO of TIBCO
The Value of Data decreases over Time
What is Big Data? The combined Vs of Big Data
[Diagram: Volume (terabytes, petabytes) X Variety (social networks, blog posts, logs, sensors, etc.) X Velocity (real time), combined into Fast Data]
Real Time Architecture?
[Diagram: two architectures, each drawn as EVENTS, Mainframe/ERP/DB/App, ACTION]
•  Transaction Based Architectures (Transaction): Not Elastic, Doesn’t Scale, “Always Late” architecture and analytics
•  Behavior Based Architectures (Data, Event and Analytics): Elastic, Scales, Real time architecture (Events, Data and Analytics)
Complex Event / Stream Processing / In-Memory
Concepts 
•  Streams: Monitoring millions of events in a specific time window to react proactively
•  Stateful: Collect, filter and correlate events with state to anticipate outcomes and react proactively 
•  Transactional: Highly performant transactional event processing
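The first two concepts (watching events inside a time window, and keeping state across events so that a reaction can be triggered) can be illustrated with a small sliding-window counter. This is a minimal single-process sketch, not how any of the products named on this slide are implemented; the class name, the 30 second window and the threshold of 3 are assumptions, and timestamps are assumed to arrive in order.

```python
from collections import deque


class SlidingWindowCounter:
    """Keeps the timestamps of recent events and flags bursts."""

    def __init__(self, window_seconds: float, threshold: int) -> None:
        self.window_seconds = window_seconds
        self.threshold = threshold
        self._timestamps = deque()  # state carried from event to event

    def on_event(self, timestamp: float) -> bool:
        """Register one event; return True if the window holds a burst."""
        self._timestamps.append(timestamp)
        # Evict events that have fallen out of the time window.
        while self._timestamps[0] <= timestamp - self.window_seconds:
            self._timestamps.popleft()
        return len(self._timestamps) >= self.threshold


monitor = SlidingWindowCounter(window_seconds=30.0, threshold=3)
# Illustrative event times in seconds; only the last three form a burst.
alerts = [monitor.on_event(t) for t in (0.0, 10.0, 50.0, 55.0, 60.0)]
print(alerts)
```

The decision is made while the event is being processed, which is the difference to a batch job that would find the same burst hours later.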


Products vs. Frameworks
•  Products are mature, mission-critical, in production, e.g. TIBCO StreamBase, IBM InfoSphere Streams
•  Open Source Frameworks, e.g. “Apache Spark” and “Apache Storm” 
–  Future will tell us about performance, tooling, support, etc.
–  Can be combined with Hadoop
–  Are complementary to Products such as TIBCO StreamBase


In-Memory 
•  Can also be used for “big data” (Terabytes possible!)
•  Usually complementary, i.e. they can be / have to be combined with stream processing / complex event
processing
Stream Processing Architecture
[Diagram labels: LiveView Datamart; Continuous Query Processor; Continuous Query; Ad Hoc Query; Alerts; CEP; Messaging (low latency); Messaging (JMS); ESB Integration; JDBC; In-Memory (ActiveSpaces); data sources: Social Media Data, Market Data, Sensor Data, Historical Data, Enterprise data]
Stream Processing Architecture (Example: TIBCO StreamBase)
[Diagram labels: TIBCO StreamBase; TIBCO LiveView; Continuous Query Processor; Continuous Query; Ad Hoc Query; Alerts; Active Tables: Trading Signal, Transaction Cost, Orders / Executions, Market Data, Alert Setting]
•  Snapshot AND always-live updates
•  Quickly connect to streams
•  Anticipate opportunities, proactive action
Example: TIBCO StreamBase Tooling
StreamBase Development Studio 
•  Visual Development
•  Visual Debugging
•  Feed Simulation
•  Unit Testing
StreamBase LiveView
•  Real Time Analytics and Visualization
•  Ad hoc queries
•  Alerts and Notifications
•  Web, Mobile and API Integration
Real World: Real-Time Trade Surveillance
[Diagram labels: Inputs; Outputs; Adapters and Handlers (on both sides); StreamBase Server(s); Applications: Integration, Normalization, Aggregation, Correlation, Rules, Alerts, Automation; StreamBase Studio for Developing EventFlow Applications; Data Management; Persistence Stores; Logs; Market Data; Trade Data; Static Data; Systems Data; Performance Benchmarks; Automation; Desktop Alerts]
Real Time (Stream Processing) Real World Use Case

Real-Time Fraud Detection

“The firm needs to monitor machine-driven algorithms, and look for suspicious patterns. Sounds simple, right? Not so simple! In this case, the patterns of interest required correlation of 5 streams of real-time data. Patterns happen within 15-30 second windows, during which thousands of dollars could be lost. Attacks come in bursts.
The data required to find these patterns was loaded into a data warehouse and reports were checked each day. Decisions to act were made every day.
LiveView now intercepts the data before it hit the warehouse by connecting LiveView to the source of data. It took 3 days to integrate these sources because it took that long to find someone who knew where 3 of the data streams came from!
StreamBase detects fraud patterns in milliseconds. But the really interesting part came next.
Once this firm could see patterns of fraud, they were faced with a new challenge: what to DO about it? How many times did the pattern need to be repeated until active surveillance is started? Should the action be quarantined for a period, or halted immediately? All these questions were new, and the answers to them keeps changing.
The fact that the answers keep changing highlights the importance of ease of use. Analytics must be changed quickly and be made available to fraud experts - in some cases, in hours - as understanding deepens, and as the bad guys change their tactics.
Better, higher value-add customer service for highly automated industries. Knowledge workers who anticipate sales opportunities. Spotting fraud in high-speed transactions streams and taking action.”

Some more use cases:
http://streambase.typepad.com/streambase_stream_process/2012/04/streambase-liveview-10-3-stories-from-the-trenches.html
Real Time (CEP + In-Memory) Real World Use Case
“With 38 million fans, MGM knows how to put its customers first, it takes more than a smile too. Customers want a personalized, tailored experience, one that knows their name and can anticipate their needs. With the help of TIBCO technologies that leverage big data and give customers a digital identity, MGM can send personalized offers directly to customers, save them a seat, and have their favorite drink on the way. With multiple customer touch points and channels, MGM can reach customers in more ways, and in more places, than ever before.”
https://www.youtube.com/watch?v=X-7S3kCOx9k
CEP:
•  Correlate
•  Analyze
•  Action
In-Memory:
•  Enable Real Time
•  Only customers that have checked in
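The interplay listed above (CEP correlates events and triggers an action, while the in-memory store holds only the state needed to decide fast) might look roughly like this. It is a hypothetical sketch, not the actual solution from the case study: the event types, the guest profiles and the `on_event` handler are all invented for illustration.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class Guest:
    name: str
    favorite_drink: str
    checked_in: bool = False


# In-memory state: hypothetical guest profiles keyed by guest id.
guests: Dict[str, Guest] = {
    "g1": Guest("Alice", "espresso"),
    "g2": Guest("Bob", "lemonade"),
}


def on_event(event: dict) -> Optional[str]:
    """Correlate an incoming event with in-memory state and pick an action."""
    guest = guests.get(event["guest_id"])
    if guest is None:
        return None
    if event["type"] == "check_in":
        guest.checked_in = True  # update state, no action yet
        return None
    # Only guests that have checked in are eligible for an action.
    if event["type"] == "seated" and guest.checked_in:
        return f"Bring {guest.favorite_drink} to {guest.name}"
    return None


events = [
    {"guest_id": "g1", "type": "check_in"},
    {"guest_id": "g1", "type": "seated"},
    {"guest_id": "g2", "type": "seated"},  # never checked in: no action
]
actions = [on_event(e) for e in events]
print(actions)
```

Because the lookup is a dictionary access in memory rather than a query against a warehouse, the action can be chosen while the guest is still at the table.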
  
Live Demo
„TIBCO StreamBase“ in action...
Real Time plus Hadoop?
Hadoop:
•  Storage
•  Complex computing (MapReduce)
Real Time:
•  Immediate (proactive) reactions – automated or manually by user
•  Monitor streaming data in Real Time
Example:
TIBCO StreamBase and its Apache Flume connector for reading streaming data from Hadoop / HDFS or sending streaming data to Hadoop / HDFS
Real Time plus Hadoop Real World Use Case
Use Case:
•  Predict pricing movement in live bets

Hadoop:
•  Store all history information about all past bets
•  Use MapReduce to precompute odds for new matches, based on all history data
TIBCO StreamBase:
•  Compute new odds in real time to react within a live game after events (e.g. when a team scores a goal)
•  Monitor stream data in real time dashboards
http://www.casestudyu.com/news/2014/04/04/7762652.htm
http://vimeo.com/91461315
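The split between a batch step and a real time step in this use case can be sketched with a deliberately naive toy model. Everything in it is an assumption for illustration: the win-rate "model", the fixed shift of 0.15 per goal, the clamping bounds and the sample matches are invented and say nothing about how the system in the case study computes odds.

```python
def precompute_win_probability(history):
    """Batch step (Hadoop's role): share of past matches won by the home team."""
    wins = sum(1 for m in history if m["home_goals"] > m["away_goals"])
    return wins / len(history)


def update_on_goal(probability, scored_by_home, shift=0.15):
    """Real time step (stream processing's role): adjust after a live goal.

    The fixed shift is a hypothetical placeholder for a real pricing model.
    """
    probability += shift if scored_by_home else -shift
    return min(0.95, max(0.05, probability))  # keep the value a valid probability


def decimal_odds(probability):
    """Convert a probability to decimal odds, ignoring any bookmaker margin."""
    return round(1.0 / probability, 2)


# Illustrative history, not real match data.
history = [
    {"home_goals": 2, "away_goals": 1},
    {"home_goals": 0, "away_goals": 0},
    {"home_goals": 1, "away_goals": 3},
    {"home_goals": 3, "away_goals": 0},
]

prior = precompute_win_probability(history)       # computed before kick-off
live = update_on_goal(prior, scored_by_home=True)  # computed during the game
print(decimal_odds(prior), decimal_odds(live))
```

The expensive pass over all history runs once in batch; the in-game update touches only the precomputed value and the latest event, which is what keeps it fast enough for live betting.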
  
Recap: Big Data Architecture
[Diagram: DWH / BI + Hadoop + Real Time = Big Data Architecture]
Off Topic

What about Integration?
Off Topic
Integration is not a talking point in this session… However:
It gets even more important in the future!
The number of different data sources and technologies increases even more than in the past
–  CRM, ERP, Host, B2B, etc. will not disappear
–  DWH, Hadoop cluster, event / streaming server, In-Memory DB have to communicate
–  Cloud, Mobile, Internet of Things are not optional, but our future!
Recap: Key Messages
Big Data is not just Hadoop, concentrate on Business Value!
A good Big Data Architecture combines DWH, Hadoop and Real Time!
The Integration Layer is getting even more important in the Big Data Era!
Questions?
Kai Wähner
kwaehner@tibco.com, @KaiWaehner, www.kai-waehner.de

More Related Content

What's hot

Learn to Use Databricks for Data Science
Learn to Use Databricks for Data ScienceLearn to Use Databricks for Data Science
Learn to Use Databricks for Data ScienceDatabricks
 
Azure Synapse Analytics Overview (r1)
Azure Synapse Analytics Overview (r1)Azure Synapse Analytics Overview (r1)
Azure Synapse Analytics Overview (r1)James Serra
 
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...Cathrine Wilhelmsen
 
Data Warehousing Trends, Best Practices, and Future Outlook
Data Warehousing Trends, Best Practices, and Future OutlookData Warehousing Trends, Best Practices, and Future Outlook
Data Warehousing Trends, Best Practices, and Future OutlookJames Serra
 
Big Data & Analytics Architecture
Big Data & Analytics ArchitectureBig Data & Analytics Architecture
Big Data & Analytics ArchitectureArvind Sathi
 
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...Cathrine Wilhelmsen
 
Big Data and Data Warehousing Together with Azure Synapse Analytics (SQLBits ...
Big Data and Data Warehousing Together with Azure Synapse Analytics (SQLBits ...Big Data and Data Warehousing Together with Azure Synapse Analytics (SQLBits ...
Big Data and Data Warehousing Together with Azure Synapse Analytics (SQLBits ...Michael Rys
 
Vector Search for Data Scientists.pdf
Vector Search for Data Scientists.pdfVector Search for Data Scientists.pdf
Vector Search for Data Scientists.pdfConnorShorten2
 
Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...
Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...
Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...Hortonworks
 
Modern Data architecture Design
Modern Data architecture DesignModern Data architecture Design
Modern Data architecture DesignKujambu Murugesan
 
Building an Effective Data Warehouse Architecture
Building an Effective Data Warehouse ArchitectureBuilding an Effective Data Warehouse Architecture
Building an Effective Data Warehouse ArchitectureJames Serra
 
Introduction to Cloud Computing and Big Data
Introduction to Cloud Computing and Big DataIntroduction to Cloud Computing and Big Data
Introduction to Cloud Computing and Big Datawaheed751
 
Azure Synapse 101 Webinar Presentation
Azure Synapse 101 Webinar PresentationAzure Synapse 101 Webinar Presentation
Azure Synapse 101 Webinar PresentationMatthew W. Bowers
 
Azure+Databricks+Course+Slide+Deck+V4.pdf
Azure+Databricks+Course+Slide+Deck+V4.pdfAzure+Databricks+Course+Slide+Deck+V4.pdf
Azure+Databricks+Course+Slide+Deck+V4.pdfChitresh Kaushik
 
Modern Data Warehousing with the Microsoft Analytics Platform System
Modern Data Warehousing with the Microsoft Analytics Platform SystemModern Data Warehousing with the Microsoft Analytics Platform System
Modern Data Warehousing with the Microsoft Analytics Platform SystemJames Serra
 
Data Warehouse Design and Best Practices
Data Warehouse Design and Best PracticesData Warehouse Design and Best Practices
Data Warehouse Design and Best PracticesIvo Andreev
 

What's hot (20)

Big Data Ecosystem
Big Data EcosystemBig Data Ecosystem
Big Data Ecosystem
 
Learn to Use Databricks for Data Science
Learn to Use Databricks for Data ScienceLearn to Use Databricks for Data Science
Learn to Use Databricks for Data Science
 
Azure Synapse Analytics Overview (r1)
Azure Synapse Analytics Overview (r1)Azure Synapse Analytics Overview (r1)
Azure Synapse Analytics Overview (r1)
 
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
"Hadoop and Data Warehouse (DWH) – Friends, Enemies or Profiteers? What about Real Time?" - Slides (including TIBCO Examples) from JAX 2014 Online

  • 1. © Copyright 2000-2014 TIBCO Software Inc. Hadoop and Data Warehouse – Friends, Enemies or Profiteers? What about Real Time? Kai Wähner kwaehner@tibco.com @KaiWaehner www.kai-waehner.de
  • 2. Disclaimer: These opinions are my own and do not necessarily represent my employer.
  • 3. Key Messages: Big Data is not just Hadoop; concentrate on Business Value! A good Big Data Architecture combines DWH, Hadoop and Real Time! The Integration Layer is getting even more important in the Big Data Era!
  • 4. Agenda • Terminology • Data Warehouse and Business Intelligence • Big Data Processing with Hadoop • Big Data Processing in Real Time
  • 5. Agenda • Terminology • Data Warehouse and Business Intelligence • Big Data Processing with Hadoop • Big Data Processing in Real Time
  • 6. Big Data Architecture: DWH / BI, Hadoop, Real Time
  • 7. DWH means analyzing OLAP Cubes http://www.exforsys.com/tutorials/msas/data-warehouse-database-and-oltp-database.html
  • 8. Big Data means analyzing Everything http://blogs.teradata.com/international/tag/hadoop/ • Store everything • Even without structure • Use whatever you need (now or later)
  • 9. Big Data: Three shifts in the Way we analyze Information • Messiness: Using ALL data, not just samples – Also bad data (e.g. Word spell checker, Google auto-complete and "did you mean..." recommendation) • Correlations: Instead of causalities – May not tell us WHY something is happening, but THAT it is happening – In many situations, this is good enough – What drug substance cures cancer? When should I buy an airplane ticket? • Datafication: Store, process, combine, reuse, enhance all data! – Digitalisation (Amazon Kindle → Read) vs. Datafication (Google Books → Read, Search, Process, ...) – Words become data: Google Books: not just read, but also search, analyse, etc. – Locations become data: GPS: not just navigation, but also insurance costs, economic routes, etc.
  • 10. What is Big Data? The combined Vs of Big Data: Volume (terabytes, petabytes) × Variety (social networks, blog posts, logs, sensors, etc.) × Velocity (real time) = Value
  • 11. Real Time • Wikipedia Definition: – Real time programs must guarantee response within strict time constraints, often referred to as "deadlines". Real time responses are often understood to be in the order of milliseconds, and sometimes microseconds. – The term "near real time" refers to the time delay introduced by automated data processing or network transmission. – The distinction between the terms "near real time" and "real time" is somewhat nebulous and must be defined for the situation at hand. • Hereby, for this talk, I define: – Real time == response in nanoseconds || microseconds || milliseconds || <= one second – Near real time == (response time > one second)
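The talk's working definition on slide 11 boils down to a single threshold at one second. A minimal sketch of that rule follows; the function name `classify_response` and the constant `REAL_TIME_LIMIT` are made up for illustration and do not come from the slides.

```python
from datetime import timedelta

# Threshold taken from slide 11: anything up to one second counts as real time.
REAL_TIME_LIMIT = timedelta(seconds=1)


def classify_response(latency: timedelta) -> str:
    """Classify a response time using the definition given for this talk."""
    if latency < timedelta(0):
        raise ValueError("latency must not be negative")
    # "<= one second" is real time, "> one second" is near real time.
    return "real time" if latency <= REAL_TIME_LIMIT else "near real time"
```

With this rule, a 5 ms response and a response of exactly one second both count as real time, while a two-second response is near real time.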
  • 12. Agenda • Terminology • Data Warehouse and Business Intelligence • Big Data Processing with Hadoop • Big Data Processing in Real Time
  • 13. Big Data Architecture • DWH / BI • Hadoop • Real Time
  • 14. DWH vs. BI • Data Warehouse (DWH) → Storage • Business Intelligence (BI) → Analytics • Both terms are often used as synonyms, i.e. when someone talks about a DWH, this might include analytics • BI can be used without a DWH
  • 15. Typical DWH Process http://wikibon.org/blog/not-your-fathers-data-analytics/ • A DWH is "Business Case driven": – Reporting – Dashboards – Drill Down Analytics • Different DWH Options: – Enterprise DWH (== EDW) – Department / Project DWH – Embedded BI (into Applications)
  • 16. BI == Reporting + Statistics + Data Discovery (diagram labels: DWH, BI)
  • 17. BI Visualization
  • 18. Products • DWH – SQL: e.g. MySQL – MPP: e.g. Teradata, EMC Greenplum, IBM Netezza (scale very well (almost linear), very high performance, hardware / software costs also increase a lot) • BI – Microsoft Excel – BI Tools: e.g. TIBCO Spotfire, Tableau, MicroStrategy • Hint: Good BI tools – allow data discovery / visualization using different sources, not just DWH – are easy to use
  • 19. BI Tool Example: TIBCO Spotfire
  • 20. BI Tool Example: TIBCO Spotfire. The whole team needs analytics. Spotfire is for everyone, helping users with a variety of skill levels to visualize, explore and share information. It has: • At-a-glance business facts for managers • Dashboards for front-line decision-makers • Visual discovery for business users • Deep data exploration for analysts • Advanced predictive analytics for statisticians • And beautiful visualizations to impress your executives
  • 21. Example: TIBCO Spotfire
  • 22. Live Demo: "TIBCO Spotfire" in action...
  • 23. DWH Real World Use Case http://spotfire.tibco.com/resources/content-center?Content%20Type=Case%20Studies
  • 24. DWH Real World Use Case http://spotfire.tibco.com/resources/content-center?Content%20Type=Case%20Studies
  • 25. Embedded BI Real World Use Case https://www.jaspersoft.com/embeddedShowcase/periscope.html
  • 26. Problems of a DWH • No flexibility / agility – Just structured data – Just some (maybe aggregated) history data – Just good for already known business cases • Low speed – ETL is batch, usually takes hours or sometimes even days – No proactive reactions possible → "too late architecture" • High costs (per GB) – Just selected data – Data that is too old is often moved to archives
  • 27. Classic BI vs. Big Data BI
  • 28. Agenda • Terminology • Data Warehouse and Business Intelligence • Big Data Processing with Hadoop • Big Data Processing in Real Time
  • 29. Big Data Architecture • DWH / BI • Hadoop • Real Time
  • 30. Why no longer DWH, but Hadoop? Hadoop was built to solve problems of RDBMS and DWH… Benefits of Hadoop: • Store and analyze all data – all data == not just selected (maybe aggregated) data – all data == structured + semi-structured + unstructured → be more flexible, adapt to changing business cases • Better performance (massively parallel) • Ad hoc data discovery – also for big data volumes • Save money (commodity hardware, open source software)
  • 31. What is Hadoop? Apache Hadoop, an open-source software library, is a framework that allows for the distributed processing of large data sets across clusters of commodity hardware using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage.
  • 32. MapReduce. Simple example: • Input: (very large) text files with lists of strings, such as: "318, 0043012650999991949032412004...0500001N9+01111+99999999999..." • We are interested just in some content: year and temperature (marked in red) • The MapReduce job has to compute the maximum temperature for every year
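The maximum-temperature example from slide 32 can be sketched in plain Python to show the map, shuffle and reduce phases. This is not Hadoop code, and it does not parse the record format quoted on the slide, which is elided there. It assumes a simplified, hypothetical input of one `year,temperature` pair per line; all function names are illustrative.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_record(line: str) -> Iterator[Tuple[str, int]]:
    """Map phase: emit (year, temperature) for one well-formed input line.

    Assumes the hypothetical format "year,temperature", not the raw
    weather record shown on the slide.
    """
    parts = line.strip().split(",")
    if len(parts) != 2:
        return  # skip malformed lines instead of failing the whole job
    year, temperature = parts
    try:
        yield year, int(temperature)
    except ValueError:
        return  # temperature was not an integer


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Shuffle phase: group all emitted values by key (the year)."""
    groups: Dict[str, List[int]] = defaultdict(list)
    for year, temperature in pairs:
        groups[year].append(temperature)
    return groups


def reduce_max(year: str, temperatures: List[int]) -> Tuple[str, int]:
    """Reduce phase: keep only the maximum temperature of one year."""
    return year, max(temperatures)


def max_temperature_per_year(lines: Iterable[str]) -> Dict[str, int]:
    """Run map, shuffle and reduce in sequence on a single machine."""
    pairs = (pair for line in lines for pair in map_record(line))
    return dict(reduce_max(y, temps) for y, temps in shuffle(pairs).items())
```

In a real cluster the framework runs many mappers and reducers in parallel and performs the shuffle itself; the sketch only mirrors the data flow.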
  • 33. Hadoop Products (diagram labels) • MapReduce • HDFS • Ecosystem • Features included: few to many • Apache Hadoop
  • 34. Hadoop Ecosystem
  • 35. Hadoop Products (diagram labels) • MapReduce • HDFS • Ecosystem • Features included: few to many • Apache Hadoop • Hadoop Distribution: + Packaging, Deployment-Tooling, Support
  • 36. Hadoop Distributions (… some more arising) • EMR
  • 37. Hadoop Products (diagram labels) • MapReduce • HDFS • Ecosystem • Features included: few to many • Apache Hadoop • Hadoop Distribution: + Packaging, Deployment-Tooling, Support • Big Data Suite: + Tooling / Modeling, Code Generation, Scheduling, Integration
  • 38. Big Data Integration Suite: TIBCO BusinessWorks
  • 39. Live Demo: "TIBCO BusinessWorks" in action...
  • 40. Hadoop Real World Use Case: Replace ETL to improve Performance (no TIBCO reference) "The advantage of their new system is that they can now look at their data [from their log processing system] in anyway they want: • Nightly MapReduce jobs collect statistics about their mail system such as spam counts by domain, bytes transferred and number of logins. • When they wanted to find out which part of the world their customers logged in from, a quick [ad hoc] MapReduce job was created and they had the answer within a few hours. Not really possible in your typical ETL system." http://highscalability.com/how-rackspace-now-uses-mapreduce-and-hadoop-query-terabytes-data
  • 41. Hadoop Real World Use Case: Storage to reduce Costs (no TIBCO reference) • Global Parcel Service • A lot of data must be stored "forever" • Numbers increase exponentially • Goal: As cheap as possible • Problem: Queries must still be possible (compliance!) • Solution: Commodity servers and "Hadoop querying" http://archive.org/stream/BigDataImPraxiseinsatz-SzenarienBeispieleEffekte/Big_Data_BITKOM-Leitfaden_Sept.2012#page/n0/mode/2up
  • 42. DWH or Hadoop?
        Criterion | DWH | Hadoop
        Data | Structured | All data
        Maturity | Established in Enterprise | New concepts
        Tooling | Installed, good knowledge and experience | New tools, coding required; business can still use SQL-similar queries or the same BI tool
        Costs | High (per GB) | Low (per GB)
  • 43. DWH plus Hadoop? DWH and Hadoop complement each other very well • Store all data in Hadoop (cheap per GB) • ETL from Hadoop to DWH (expensive per GB) • Create specific reports / dashboards in DWH (leverage existing products and knowledge) • Do Ad Hoc (Big) Data Discovery directly in Hadoop, no DWH needed. Good BI tools support both DWH and Hadoop! For example, TIBCO Spotfire has connectors to: • RDBMS (e.g. MySQL) • MPP (e.g. Teradata, IBM Netezza, Greenplum) • Hadoop (e.g. Hive, Impala) • In-Memory (e.g. TIBCO ActiveSpaces, SAP HANA)
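The division of labour on slide 43 (keep all raw data in cheap storage, move only aggregates into the DWH for reporting) can be sketched as below. Everything here is a stand-in chosen for illustration: a list of JSON lines plays the role of raw files in Hadoop, an in-memory SQLite database plays the role of the DWH, and the event fields `day` and `country` as well as the table name `logins_per_day` are hypothetical.

```python
import json
import sqlite3
from collections import Counter
from typing import Iterable


def aggregate_raw_events(raw_lines: Iterable[str]) -> Counter:
    """Transform step: count login events per (day, country).

    Malformed raw lines are skipped, because the raw store keeps
    everything, including bad data.
    """
    counts: Counter = Counter()
    for line in raw_lines:
        try:
            event = json.loads(line)
            key = (event["day"], event["country"])
        except (json.JSONDecodeError, KeyError, TypeError):
            continue
        counts[key] += 1
    return counts


def load_into_dwh(conn: sqlite3.Connection, counts: Counter) -> None:
    """Load step: write only the small aggregate into the reporting table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS logins_per_day ("
        "day TEXT, country TEXT, logins INTEGER, "
        "PRIMARY KEY (day, country))"
    )
    conn.executemany(
        "INSERT OR REPLACE INTO logins_per_day VALUES (?, ?, ?)",
        [(day, country, n) for (day, country), n in counts.items()],
    )
    conn.commit()
```

Reports and dashboards would then query the small `logins_per_day` table, while ad hoc discovery would still run against the raw events.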
  • 44. Recommendation DWH vs. Hadoop vs. XYZ • Short term: Use Hadoop (only) when you can save (a lot of) money or when you cannot solve your business problem without Hadoop. A lot of things have to be improved, e.g. governance, security, performance, and tool support. • Long term: Hadoop can replace DWH (as you can create a DWH on top of Hadoop with SQL interface already today)! • Be aware: A lot of other options emerge for analyzing big data besides Hadoop, e.g. – Analytical databases with SQL interface (MemSQL, Citus Data) – Log Analytics (Splunk, TIBCO LogLogic) – Graph databases (Neo4j, InfiniteGraph)
  • 45. Vendors Strategy... Hadoop vendors push Hadoop as DWH replacement → Called e.g. "Enterprise Data Hub" (Cloudera) or "Data Lake" (Hortonworks) http://gigaom.com/2013/10/29/clouderas-plan-to-become-the-center-of-your-data-universe/ http://hortonworks.com/wp-content/uploads/downloads/2013/04/Hortonworks.ApacheHadoopPatternsOfUse.v1.0.pdf
  • 46. Vendors Strategy... MPP / DWH vendors add Hadoop support as a complementary add-on to their DWH → Reason (probably): Market pressure! → Benefit: One platform (including tooling and support) for DWH and Hadoop
  • 47. Example: EMC combines DWH and Hadoop http://wikibon.org/wiki/v/EMC_Integrates_Greenplum_DB_and_Hadoop_with_Pivotal_HD http://www.gopivotal.com/big-data/pivotal-hd
  • 48. Example: Teradata combines DWH and Hadoop http://www.teradata.com/Teradata-Enterprise-Access-for-Hadoop/ http://gigaom.com/2014/04/07/teradata-says-hadoop-is-good-for-business-but-for-how-long/
  • 49. Hadoop evolving from Batch to Near Real Time • Hadoop is MapReduce == Batch (== hours, minutes, seconds) – Good for complex transformations / computations of big data volumes – Not so good for ad hoc data exploration – Improvements: Hive Stinger (Hortonworks) etc. • Non-MapReduce processing engines added in the meantime (YARN makes it possible) – Ad hoc data discovery (== seconds) – Hive / Pig with Apache Tez replacing MapReduce under the hood for data processing – New query engines, e.g. Impala (Cloudera) or Apache Drill (MapR) • MPP vendors (e.g. Teradata, EMC Greenplum) also add their own query engines – Offer fast data exploration (without MapReduce) • Some Hadoop problems remain – No good, easy tooling (Hadoop ecosystem) → might be solved in the coming years – Missing maturity (alpha / beta versions) → might be solved in the coming years – No "real time" (== ms, ns), but "near real time" (> 1 sec) → "too late architecture"
  • 50. Agenda • Terminology • Data Warehouse and Business Intelligence • Big Data Processing with Hadoop • Big Data Processing in Real Time
  • 51. Big Data Architecture • DWH / BI • Hadoop • Real Time
  • 52. Real Time: "The Two-Second Advantage" "A little bit of the right information, just a little bit beforehand – whether it is a couple of seconds, minutes or hours – is more valuable than all of the information in the world six months later… this is the two-second advantage." Vivek Ranadivé, Founder and CEO of TIBCO
  • 53. The Value of Data decreases over Time
  • 54. What is Big Data? The combined Vs of Big Data: Volume (terabytes, petabytes) × Variety (social networks, blog posts, logs, sensors, etc.) × Velocity (real time) = Fast Data
  • 55. Real Time Architecture? • Transaction Based Architectures (diagram labels: EVENTS, Mainframe/ERP/DB/App, ACTION; Transaction): not elastic, doesn't scale, "always late" architecture and analytics • Behavior Based Architectures (diagram labels: EVENTS, Mainframe/ERP/DB/App, ACTION; Data, Event and Analytics): elastic, scales, real time architecture (events, data and analytics)
  • 56. Complex Event / Stream Processing / In-Memory • Concepts – Streams: Monitoring millions of events in a specific time window to react proactively – Stateful: Collect, filter and correlate events with state to anticipate outcomes and react proactively – Transactional: Highly performant transactional event processing • Products vs. Frameworks – Products are mature, mission-critical, in production, e.g. TIBCO StreamBase, IBM InfoSphere Streams – Open Source Frameworks, e.g. "Apache Spark" and "Apache Storm": future will tell us about performance, tooling, support, etc.; can be combined with Hadoop; are complementary to products such as TIBCO StreamBase • In-Memory – Can also be used for "big data" (Terabytes possible!) – Usually complementary, i.e. they can be / have to be combined with stream processing / complex event processing
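The "Streams" concept on slide 56 (watching events inside a specific time window so that a reaction can happen while it still matters) can be illustrated with a small sliding-window counter. This is a generic sketch, not how TIBCO StreamBase or any other product named on the slide is implemented. The class name and parameters are hypothetical, and it assumes events arrive in timestamp order.

```python
from collections import deque
from typing import Deque


class SlidingWindowCounter:
    """Count events inside a moving time window and flag bursts."""

    def __init__(self, window_seconds: float, threshold: int) -> None:
        self.window_seconds = window_seconds
        self.threshold = threshold
        self._timestamps: Deque[float] = deque()

    def on_event(self, timestamp: float) -> bool:
        """Record one event; return True if the window holds enough
        events to warrant an alert.

        Assumes timestamps are non-decreasing (in-order arrival).
        """
        self._timestamps.append(timestamp)
        # Evict events that have fallen out of the window.
        oldest_allowed = timestamp - self.window_seconds
        while self._timestamps and self._timestamps[0] <= oldest_allowed:
            self._timestamps.popleft()
        return len(self._timestamps) >= self.threshold
```

State is kept only for the current window, which is what keeps the check cheap enough to run on every incoming event rather than in a nightly batch.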
  • 57. Stream Processing Architecture (diagram labels) • LiveView Datamart • Continuous Query • Continuous Query Processor • Ad Hoc Query • Alerts • CEP • Messaging (low latency) • Messaging (JMS) • Social Media Data • Market Data • In-Memory • ESB Integration • Sensor Data • Historical Data • JDBC • ActiveSpaces • Enterprise data
  • 58. Stream Processing Architecture (Example: TIBCO StreamBase) (diagram labels) • TIBCO StreamBase • Continuous Query • Continuous Query Processor • Ad Hoc Query • Alerts • Active Tables • Trading Signal • Transaction Cost • Orders / Executions • Market Data • Alert Setting • TIBCO LiveView • Snapshot AND always-live updates • Quickly connect to streams • Anticipate opportunities, proactive action
  • 59. Example: TIBCO StreamBase Tooling • StreamBase Development Studio – Visual Development – Visual Debugging – Feed Simulation – Unit Testing • StreamBase LiveView – Real Time Analytics and Visualization – Ad hoc queries – Alerts and Notifications – Web, Mobile and API Integration
  • 60. Real World: Real-Time Trade Surveillance (diagram labels) • Applications • Integration • Normalization • Aggregation • Correlation • Rules • Alerts • Automation • Adapters and Handlers • StreamBase Server(s) • StreamBase Studio for Developing EventFlow Applications • Data Management • Persistence Stores • Logs • Market Data • Trade Data • Static Data • Systems Data • Performance Benchmarks • Automation • Desktop Alerts • Inputs • Outputs
  • 61. Real Time (Stream Processing) Real World Use Case: Real-Time Fraud Detection "The firm needs to monitor machine-driven algorithms, and look for suspicious patterns. Sounds simple, right? Not so simple! In this case, the patterns of interest required correlation of 5 streams of real-time data. Patterns happen within 15-30 second windows, during which thousands of dollars could be lost. Attacks come in bursts. The data required to find these patterns was loaded into a data warehouse and reports were checked each day. Decisions to act were made every day. LiveView now intercepts the data before it hit the warehouse by connecting LiveView to the source of data. It took 3 days to integrate these sources because it took that long to find someone who knew where 3 of the data streams came from! StreamBase detects fraud patterns in milliseconds. But the really interesting part came next. Once this firm could see patterns of fraud, they were faced with a new challenge: what to DO about it? How many times did the pattern need to be repeated until active surveillance is started? Should the action be quarantined for a period, or halted immediately? All these questions were new, and the answers to them keeps changing. The fact that the answers keep changing highlights the importance of ease of use. Analytics must be changed quickly and be made available to fraud experts - in some cases, in hours - as understanding deepens, and as the bad guys change their tactics. Better, higher value-add customer service for highly automated industries. Knowledge workers who anticipate sales opportunities. Spotting fraud in high-speed transactions streams and taking action." Some more use cases: http://streambase.typepad.com/streambase_stream_process/2012/04/streambase-liveview-10-3-stories-from-the-trenches.html
  • 62. Real Time (CEP + In-Memory) Real World Use Case "With 38 million fans, MGM knows how to put its customers first, it takes more than a smile too. Customers want a personalized, tailored experience, one that knows their name and can anticipate their needs. With the help of TIBCO technologies that leverage big data and give customers a digital identity, MGM can send personalized offers directly to customers, save them a seat, and have their favorite drink on the way. With multiple customer touch points and channels, MGM can reach customers in more ways, and in more places, than ever before." https://www.youtube.com/watch?v=X-7S3kCOx9k • CEP: – Correlate – Analyze – Action • In-Memory: – Enable Real Time – Only customers that have checked in
  • 63. Live Demo: "TIBCO StreamBase" in action...
  • 64. Real Time plus Hadoop? • Hadoop: – Storage – Complex computing (MapReduce) • Real Time: – Immediate (proactive) reactions, automated or manually by user – Monitor streaming data in Real Time • Example: TIBCO StreamBase and its Apache Flume connector for reading streaming data from Hadoop / HDFS or sending streaming data to Hadoop / HDFS
  • 65. Real Time plus Hadoop Real World Use Case • Use Case: – Predict pricing movement in live bets • Hadoop: – Store all history information about all past bets – Use MapReduce to precompute odds for new matches, based on all history data • TIBCO StreamBase: – Compute new odds in real time to react within a live game after events (e.g. when a team scores a goal) – Monitor stream data in real time dashboards http://www.casestudyu.com/news/2014/04/04/7762652.htm http://vimeo.com/91461315
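The split described on slide 65 (a batch job precomputes figures from all history, a streaming component updates odds when something happens in a live game) can be sketched as follows. The pricing model is a deliberately naive assumption made up for this example and is not the model from the case study: the batch step derives the historical home-win rate per goal difference, and the live step looks that rate up after each goal and converts it to decimal odds. All names are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple


def precompute_win_rates(history: Iterable[Tuple[int, bool]]) -> Dict[int, float]:
    """Batch step (would be a MapReduce job over all past matches).

    Each observation is (goal_difference, home_team_won).
    Returns the share of home wins seen for each goal difference.
    """
    wins: Dict[int, int] = defaultdict(int)
    totals: Dict[int, int] = defaultdict(int)
    for goal_difference, home_won in history:
        totals[goal_difference] += 1
        if home_won:
            wins[goal_difference] += 1
    return {diff: wins[diff] / totals[diff] for diff in totals}


class LiveOdds:
    """Real time step: react to goals using the precomputed table."""

    def __init__(self, win_rates: Dict[int, float], default_rate: float = 0.5) -> None:
        self.win_rates = win_rates
        self.default_rate = default_rate  # used when history has no data
        self.goal_difference = 0

    def decimal_odds(self) -> float:
        """Convert the looked-up win probability into decimal odds."""
        p = self.win_rates.get(self.goal_difference, self.default_rate)
        p = min(max(p, 0.01), 0.99)  # avoid division by zero and odds of 1.0
        return round(1 / p, 2)

    def on_goal(self, scored_by_home: bool) -> float:
        """Update the state after a goal event and return the new odds."""
        self.goal_difference += 1 if scored_by_home else -1
        return self.decimal_odds()
```

The expensive pass over history happens once, ahead of the match; the per-event work is a dictionary lookup, which is what makes a reaction within the live game feasible.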
  • 66. Recap: Big Data Architecture • DWH / BI • Hadoop • Real Time
  • 67. Off Topic: What about Integration?
  • 68. Off Topic: Integration is no talking point in this session… However: It gets even more important in the future! The number of different data sources and technologies increases even more than in the past – CRM, ERP, Host, B2B, etc. will not disappear – DWH, Hadoop cluster, event / streaming server, In-Memory DB have to communicate – Cloud, Mobile, Internet of Things are not optional, but our future!
  • 69. Recap: Key Messages • Big Data is not just Hadoop, concentrate on Business Value! • A good Big Data Architecture combines DWH, Hadoop and Real Time! • The Integration Layer is getting even more important in the Big Data Era!
  • 70. Questions? Kai Wähner kwaehner@tibco.com, @KaiWaehner, www.kai-waehner.de