SlideShare a Scribd company logo
1 of 20
Download to read offline
Sai Paravastu
Principal, BAR360
Open Data Platform
Why open source has
taken precedence in
making a common
data platform for
enterprises ?
DAMA Sydney Chapter
11th August 2015
Sai Paravastu
Principal
360°
view of
business
Insights
Data
Modeling
Business
Analytics
Portal
Development
Data
Services
Business
Intelligence
Data
Analysis
Confidential
BAR360 is an Australian DATA services business since 2005. We understand the information management
needs of Australian businesses and cater tailored business DATA solutions by integrating and improving their
data capabilities. BIRT is well open source Business Intelligence engine and we deliver development services,
integration, maintenance and training offerings tailored to business needs in the Australia and New Zealand
region. We also work with major BIRT OEM vendors like IBM, Micro focus, Schneider-Electric to name a few.
We have started our practice in Hadoop and NoSQL for data ingestion and processing in the world of BIG
Data
Our Value Proposition
We Provide Data Cleansing, Quality, Loading, Processing and Reporting services
We source and manage DATA solutions focused on achieving ROI.
We bring reliability and trustworthiness through simplicity in our engagements with clients.
We develop, training and support in implementations.
We evangelize in open source technologies.
We source local IT resources in projects and stand by our team.
Who is BAR360
Confidential
Extended Partners
We have provided training and professional services to Open source BIRT adopters and BIRT OEM
vendors across Australia.
Clients who used our Training and Professional Services
We are planning to grow and develop the open source community as well as building integrated
solutions using open source technologies.
Extended our professional services to other signed partnership agreements with BIRT
OEM vendors and open source adopters for Training and Professional services.
Confidential
Business Challenges Achieved
Business Intelligence
• Need to empower users with easy-to use,
self-service BI
– Users need actionable insight
– when they need it
– where ever they are
– in the form required
– Delivered based on role and security
levels
– Straightforward enough for non-IT staff to
use
• Reducing delivery time
– Single platform for all systems
– Easy-to-use for a range of skills
Business Analytics
Improve Member Loyalty
Reduce Churn
• Who are our loyal members ?
• Does member cover type has impact on their
likelihood of leaving the fund?
• Does member claiming has an impact on their
leaving the fund?
• What role does demographics play in churn?
• Is time with fund a factor ?
– Case study developed on “Member
Retention Analysis”
– Case study developed “Customer
Segmentation and Next Best Offer”
Confidential
Profile of the Principal
Confidential
Sai is an experienced technology consultant with a very good track record of being instrumental
in delivering projects Experienced Business & Architecture focused Solution Integration
Architect, a strategic thinker and a change agent.
Experience in Formulating the IT Architecture and driving solutions in line with EA and
presenting to senior management. Manage engagements through to successful completion of
projects with in the timelines, meeting all requirements including ROI, business benefits and
customer satisfaction. With over 2 decades of experience in technology.
Sai is a strategic planner around enterprise data strategy and system development life cycle
improvement. He has very good understanding of the challenges faced by the IT as well as
business stakeholders in the information management space.
Experienced across Manufacturing, Education, Public sector, Banking and Insurance verticals.
Associated with many consulting service companies in solution based sales for the last 7 years. I
am a Strategic Implementation partner with OpenText and Hortonworks in ANZ region.
Big Data Definition
Big data is a collection of data sets so large and complex that it becomes difficult to process using
currently on-hand database management tools or traditional data processing applications
Source: Wikipedia
Web Logs RFID Sensors Social Networks
Internet Text Searches Call Detail Records Astronomy
Atmospheric
Info
Genomics Biogeochemical Biological
Military Surveillance Medical Records E-Commerce Video
Traditional Data vs. Big Data
Traditional Data Big Data
Gigabytes to Terabytes Petabytes to Exabytes
Centralized Distributed
Structured Semi-structured to Unstructured
Stable Data Model Flat Schemas
Known Complex Interrelationships Few Complex Interrelationships
Source: Wikibon Community
When to Use Big Data vs. Relational
Big Data Relational
Analysis Type
Exploratory analysis to uncover
value in the data
Operational analysis of what was
discovered
Data Granularity
Store HUGE amounts of highly
granular data
Store transform (sometimes)
aggregated data
Timeframe
Data flows in BIG Data
 “real-time” monitoring
Long term trending analysis
Is Big Data a replacement for Relational Data?
Why BIG Data
In a nutshell, the quest for Big Data is directly
attributable to analytics, which has evolved from being a
business initiative to a business imperative.
Many vendors are talking about Big Data, but we’re not
seeing much more than the ability to store large
volumes of data, leaving the organization to “roll
their own” applications without much help to make
sense of it all. Real value can only emerge from a
consumable analytics platform that saves you from
having to build applications from scratch one that
effectively flattens the time-to-insight curve.
In my opinion BIG Data is truly all about analytics.
Confidential
New Approaches To Big Data Processing & Analytics
Traditional tools and technologies are straining
• New approaches to data processing
– Commodity hardware to scale
– Parallel processing techniques
– Non-relational data storage capabilities
– Unstructured, semi-structured data
• Better analytics
– Advanced visualization
– Data mining
Source: Wikibon Community
Confidential
New Approaches to Big Data Processing & Analytics
– Hadoop Approach
• Data broken into “parts”
• Loaded into file system
• Multiple nodes
• MapReduce
• Batch-style historical analysis
– NoSQL
• Cassandra, MongoDB, CouchDB, HBase*
• Discrete data stored among large volumes
• Higher performance than relational data sources
– Massively Parallel Analytic Databases
• Quickly ingest mostly structured data
• Minimal data modeling
• Scale to petabytes of data
• Near real-time results to complex SQL
Source: Wikibon Community
Confidential
Big Data Growth Drivers
• Increased awareness of the Big Data benefits
– Not just web, financial services, pharmaceuticals, retail
• Increased maturity of Big Data software
– Data stores, analytical engines
• Increased availability of professional services
– Supporting business use cases
• Increased investment in infrastructure
– Google, Facebook, Amazon
Source: Wikibon Community
Confidential
Top Big Data Challenges
• Data integration
– Top challenge
– Integrating disparate data, different sources, different formats is difficult
• Getting started with the right project
– Building the right team
– Determine the top business problem
• Architecting a big data system.
– High volume, high frequency data
– Build unified information architecture
• Lack of skills or staff
– Some hire externally / university hires.
– Others try to re-train from within.
– Cross pollinate skills from another part of the organization
– Build centers of excellence that help with the training
Source:TDWI
• Data privacy, governance and compliance issues
• How it can help business
• Integrating legacy systems
• The cost of implementation
Confidential
Apache Software Foundation – ASF
There are currently 300+ open source initiatives at the ASF:
• 163 committees managing 273 projects
• 5 special committees
• 43 incubating podlings
Source: ASF
Confidential
Open Data Platform
Enabling BIG Data solutions to flourish atop a common core platform
• The Open Data Platform Initiative (ODP) is an enterprise-focused shared industry effort focused on simplifying adoption
and promoting the use and advancing the state of Apache Hadoop® and Big Data technologies for the enterprise. It is a
non-profit organization being created by folks that help to create: Apache, Eclipse, Linux, OpenStack, OpenDaylight, Open
Networking Foundation, OSGI, WSI, UDDI , OASIS, Cloud Foundry Foundation and many others.
• Under the governance of the Apache Software Foundation community to innovate and deliver a common data platform
for enterprises as it brings the largest number of developers together to commit far faster than any single vendor could
achieve and in a way that is free of friction for the enterprise and vendors build extension on the core of the ODP.
Source: ASF
HIVE
Query
PIG
Scripting
MAHOUT
Machine Learning
MAP REDUCE
Distributed processing
YARN
Resource scheduling and negotiation
HDFS
Distributed Storage
HCATALOG
Metadata mgmt
HBASE
NoSQL database
SQOOP
Import/Export
FLUME/STORM
stream
KAFKA
Sub/Pub
ZOOKEEPER
Coordination
OOZIE
WFautomation
AMBARI
DRILL
Interactive
SPARK / FLINK
FALCON
KNOX
TEZ
Interactive
ARVO
dataserialization
Confidential
Benefits of ODP
Enabling BIG Data solutions to flourish atop a common core platform
The ODP core is a set of open source Hadoop technologies designed to provide a standardized core that big data solution
providers software and hardware developers can use to deliver compatible solutions rooted in open source that unlock
customer choice.
Source: ODP
How do we benefit:
ASF
- 100% focus on enabling collaboration between developers
- does not recognize corporations
- projects are on completely asynchronous development cycles
ODP
- Enables collaboration between vendors
- Focused on developing a platform , but does not supersede governance
- creates complimentary brand value for integrated platform
- focused on enterprise use case for hadoop
Confidential
ODP Core will initially focus on Apache Hadoop (inclusive of HDFS, YARN, and MapReduce) and Apache Ambari.
Once the ODP members and processes are well established, the scope of the ODP Core may expand to include
other open source projects.
The ODP Core will deliver the following benefits:
• For Apache Hadoop technology vendors, reduced R&D costs that come from a shared qualification effort
• For Big Data application solution providers, reduced R&D costs that come from more predictable and better
qualified releases
• Improved interoperability within the platform and simplified integration with existing systems in support of a
broad set of use cases
• Less friction and confusion for Enterprise customers and vendors
• Ability to redirect resources towards higher value efforts
Benefits of ODP
Source: ODP Confidential
1. Provide a stable base against which Big Data solutions providers can qualify solutions.
2. Support community development and outreach activities that accelerate the rollout of modern data
architectures that leverage Apache Hadoop
3. Contribute to ASF projects in accordance with ASF processes and Intellectual Property guidelines.
4. Accelerate the delivery of Big Data solutions by providing a well-defined core platform to target.
5. Define, integrate, test, and certify a standard "ODP Core" of compatible versions of select Big Data open
source projects.
6. Produce a set of tools and methods that enable members to create and test differentiated offerings based
on the ODP Core.
7. Reinforce the role of the Apache Software Foundation (ASF) in the development and governance of
upstream projects.
8. Help minimize the fragmentation and duplication of effort within the industry
ODP Delivers
The ODP Core will take the guesswork out of the process and accelerate many use cases by running on a
common platform.
Freeing up enterprises and ecosystem vendors to focus on building business driven applications.
Source: ODP Confidential
Thank You Partners
Confidential
BAR360
Sai Paravastu
Principal
+61 402 449 524
sai@bar360.com.au
www.bar360.com.au

More Related Content

What's hot

Hadoop 2015: what we larned -Think Big, A Teradata Company
Hadoop 2015: what we larned -Think Big, A Teradata CompanyHadoop 2015: what we larned -Think Big, A Teradata Company
Hadoop 2015: what we larned -Think Big, A Teradata CompanyDataWorks Summit
 
Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...
Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...
Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...StampedeCon
 
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...Perficient, Inc.
 
Big Data's Impact on the Enterprise
Big Data's Impact on the EnterpriseBig Data's Impact on the Enterprise
Big Data's Impact on the EnterpriseCaserta
 
Data Governance, Compliance and Security in Hadoop with Cloudera
Data Governance, Compliance and Security in Hadoop with ClouderaData Governance, Compliance and Security in Hadoop with Cloudera
Data Governance, Compliance and Security in Hadoop with ClouderaCaserta
 
The Emerging Role of the Data Lake
The Emerging Role of the Data LakeThe Emerging Role of the Data Lake
The Emerging Role of the Data LakeCaserta
 
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...Capgemini
 
Balancing Data Governance and Innovation
Balancing Data Governance and InnovationBalancing Data Governance and Innovation
Balancing Data Governance and InnovationCaserta
 
You're the New CDO, Now What?
You're the New CDO, Now What?You're the New CDO, Now What?
You're the New CDO, Now What?Caserta
 
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing KeynoteArchitecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing KeynoteCaserta
 
The Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubThe Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubCloudera, Inc.
 
A modern, flexible approach to Hadoop implementation incorporating innovation...
A modern, flexible approach to Hadoop implementation incorporating innovation...A modern, flexible approach to Hadoop implementation incorporating innovation...
A modern, flexible approach to Hadoop implementation incorporating innovation...DataWorks Summit
 
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...Caserta
 
Data architecture for modern enterprise
Data architecture for modern enterpriseData architecture for modern enterprise
Data architecture for modern enterprisekayalvizhi kandasamy
 
The Rise of the CDO in Today's Enterprise
The Rise of the CDO in Today's EnterpriseThe Rise of the CDO in Today's Enterprise
The Rise of the CDO in Today's EnterpriseCaserta
 
Bringing Strategy to Life: Using an Intelligent Data Platform to Become Data ...
Bringing Strategy to Life: Using an Intelligent Data Platform to Become Data ...Bringing Strategy to Life: Using an Intelligent Data Platform to Become Data ...
Bringing Strategy to Life: Using an Intelligent Data Platform to Become Data ...DLT Solutions
 
Data Mesh at CMC Markets: Past, Present and Future
Data Mesh at CMC Markets: Past, Present and FutureData Mesh at CMC Markets: Past, Present and Future
Data Mesh at CMC Markets: Past, Present and FutureLorenzo Nicora
 
Applying Big Data Superpowers to Healthcare
Applying Big Data Superpowers to HealthcareApplying Big Data Superpowers to Healthcare
Applying Big Data Superpowers to HealthcarePaul Boal
 
Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017
Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017
Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017Caserta
 
Modern Integrated Data Environment - Whitepaper | Qubole
Modern Integrated Data Environment - Whitepaper | QuboleModern Integrated Data Environment - Whitepaper | Qubole
Modern Integrated Data Environment - Whitepaper | QuboleVasu S
 

What's hot (20)

Hadoop 2015: what we larned -Think Big, A Teradata Company
Hadoop 2015: what we larned -Think Big, A Teradata CompanyHadoop 2015: what we larned -Think Big, A Teradata Company
Hadoop 2015: what we larned -Think Big, A Teradata Company
 
Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...
Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...
Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...
 
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
 
Big Data's Impact on the Enterprise
Big Data's Impact on the EnterpriseBig Data's Impact on the Enterprise
Big Data's Impact on the Enterprise
 
Data Governance, Compliance and Security in Hadoop with Cloudera
Data Governance, Compliance and Security in Hadoop with ClouderaData Governance, Compliance and Security in Hadoop with Cloudera
Data Governance, Compliance and Security in Hadoop with Cloudera
 
The Emerging Role of the Data Lake
The Emerging Role of the Data LakeThe Emerging Role of the Data Lake
The Emerging Role of the Data Lake
 
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
EMC World 2014 Breakout: Move to the Business Data Lake – Not as Hard as It S...
 
Balancing Data Governance and Innovation
Balancing Data Governance and InnovationBalancing Data Governance and Innovation
Balancing Data Governance and Innovation
 
You're the New CDO, Now What?
You're the New CDO, Now What?You're the New CDO, Now What?
You're the New CDO, Now What?
 
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing KeynoteArchitecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
 
The Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubThe Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data Hub
 
A modern, flexible approach to Hadoop implementation incorporating innovation...
A modern, flexible approach to Hadoop implementation incorporating innovation...A modern, flexible approach to Hadoop implementation incorporating innovation...
A modern, flexible approach to Hadoop implementation incorporating innovation...
 
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
 
Data architecture for modern enterprise
Data architecture for modern enterpriseData architecture for modern enterprise
Data architecture for modern enterprise
 
The Rise of the CDO in Today's Enterprise
The Rise of the CDO in Today's EnterpriseThe Rise of the CDO in Today's Enterprise
The Rise of the CDO in Today's Enterprise
 
Bringing Strategy to Life: Using an Intelligent Data Platform to Become Data ...
Bringing Strategy to Life: Using an Intelligent Data Platform to Become Data ...Bringing Strategy to Life: Using an Intelligent Data Platform to Become Data ...
Bringing Strategy to Life: Using an Intelligent Data Platform to Become Data ...
 
Data Mesh at CMC Markets: Past, Present and Future
Data Mesh at CMC Markets: Past, Present and FutureData Mesh at CMC Markets: Past, Present and Future
Data Mesh at CMC Markets: Past, Present and Future
 
Applying Big Data Superpowers to Healthcare
Applying Big Data Superpowers to HealthcareApplying Big Data Superpowers to Healthcare
Applying Big Data Superpowers to Healthcare
 
Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017
Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017
Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017
 
Modern Integrated Data Environment - Whitepaper | Qubole
Modern Integrated Data Environment - Whitepaper | QuboleModern Integrated Data Environment - Whitepaper | Qubole
Modern Integrated Data Environment - Whitepaper | Qubole
 

Viewers also liked

Uso de mineração de dados e textos para cálculo de preços de referência em co...
Uso de mineração de dados e textos para cálculo de preços de referência em co...Uso de mineração de dados e textos para cálculo de preços de referência em co...
Uso de mineração de dados e textos para cálculo de preços de referência em co...Rommel Carvalho
 
Filiação partidária e risco de corrupção de servidores públicos federais
Filiação partidária e risco de corrupção de servidores públicos federaisFiliação partidária e risco de corrupção de servidores públicos federais
Filiação partidária e risco de corrupção de servidores públicos federaisRommel Carvalho
 
Hazardous Situation Ontology Design Pattern
Hazardous Situation Ontology Design Pattern Hazardous Situation Ontology Design Pattern
Hazardous Situation Ontology Design Pattern Agnieszka Ławrynowicz
 
Cirugía de odontomas, frenillos.
Cirugía de odontomas, frenillos.Cirugía de odontomas, frenillos.
Cirugía de odontomas, frenillos.Yasminne Rodríguez
 
Detecção preventiva de fracionamento de compras
Detecção preventiva de fracionamento de comprasDetecção preventiva de fracionamento de compras
Detecção preventiva de fracionamento de comprasRommel Carvalho
 
Aplicação de técnicas de mineração de textos para classificação automática de...
Aplicação de técnicas de mineração de textos para classificação automática de...Aplicação de técnicas de mineração de textos para classificação automática de...
Aplicação de técnicas de mineração de textos para classificação automática de...Rommel Carvalho
 

Viewers also liked (6)

Uso de mineração de dados e textos para cálculo de preços de referência em co...
Uso de mineração de dados e textos para cálculo de preços de referência em co...Uso de mineração de dados e textos para cálculo de preços de referência em co...
Uso de mineração de dados e textos para cálculo de preços de referência em co...
 
Filiação partidária e risco de corrupção de servidores públicos federais
Filiação partidária e risco de corrupção de servidores públicos federaisFiliação partidária e risco de corrupção de servidores públicos federais
Filiação partidária e risco de corrupção de servidores públicos federais
 
Hazardous Situation Ontology Design Pattern
Hazardous Situation Ontology Design Pattern Hazardous Situation Ontology Design Pattern
Hazardous Situation Ontology Design Pattern
 
Cirugía de odontomas, frenillos.
Cirugía de odontomas, frenillos.Cirugía de odontomas, frenillos.
Cirugía de odontomas, frenillos.
 
Detecção preventiva de fracionamento de compras
Detecção preventiva de fracionamento de comprasDetecção preventiva de fracionamento de compras
Detecção preventiva de fracionamento de compras
 
Aplicação de técnicas de mineração de textos para classificação automática de...
Aplicação de técnicas de mineração de textos para classificação automática de...Aplicação de técnicas de mineração de textos para classificação automática de...
Aplicação de técnicas de mineração de textos para classificação automática de...
 

Similar to BAR360 open data platform presentation at DAMA, Sydney

Keyrus US Information
Keyrus US InformationKeyrus US Information
Keyrus US InformationJulian Tong
 
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)Denodo
 
Creating a Next-Generation Big Data Architecture
Creating a Next-Generation Big Data ArchitectureCreating a Next-Generation Big Data Architecture
Creating a Next-Generation Big Data ArchitecturePerficient, Inc.
 
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02email2jl
 
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Nathan Bijnens
 
Building the Artificially Intelligent Enterprise
Building the Artificially Intelligent EnterpriseBuilding the Artificially Intelligent Enterprise
Building the Artificially Intelligent EnterpriseDatabricks
 
Architecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment OptionsArchitecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment OptionsCaserta
 
How to Quickly and Easily Draw Value from Big Data Sources_Q3 symposia(Moa)
How to Quickly and Easily Draw Value  from Big Data Sources_Q3 symposia(Moa)How to Quickly and Easily Draw Value  from Big Data Sources_Q3 symposia(Moa)
How to Quickly and Easily Draw Value from Big Data Sources_Q3 symposia(Moa)Moacyr Passador
 
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationAccelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationDenodo
 
Incorporating the Data Lake into Your Analytic Architecture
Incorporating the Data Lake into Your Analytic ArchitectureIncorporating the Data Lake into Your Analytic Architecture
Incorporating the Data Lake into Your Analytic ArchitectureCaserta
 
Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)Denodo
 
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationAccelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationDenodo
 
Building a New Platform for Customer Analytics
Building a New Platform for Customer Analytics Building a New Platform for Customer Analytics
Building a New Platform for Customer Analytics Caserta
 
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIAugmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIDenodo
 
Microsoft cloud big data strategy
Microsoft cloud big data strategyMicrosoft cloud big data strategy
Microsoft cloud big data strategyJames Serra
 
SIMPosium presentation_Bardess Qlik
SIMPosium presentation_Bardess QlikSIMPosium presentation_Bardess Qlik
SIMPosium presentation_Bardess QlikBardess Group
 
Revolution in Business Analytics-Zika Virus Example
Revolution in Business Analytics-Zika Virus ExampleRevolution in Business Analytics-Zika Virus Example
Revolution in Business Analytics-Zika Virus ExampleBardess Group
 
zData BI & Advanced Analytics Platform + 8 Week Pilot Programs
zData BI & Advanced Analytics Platform + 8 Week Pilot ProgramszData BI & Advanced Analytics Platform + 8 Week Pilot Programs
zData BI & Advanced Analytics Platform + 8 Week Pilot ProgramszData Inc.
 

Similar to BAR360 open data platform presentation at DAMA, Sydney (20)

Big Data in Azure
Big Data in AzureBig Data in Azure
Big Data in Azure
 
Keyrus US Information
Keyrus US InformationKeyrus US Information
Keyrus US Information
 
Keyrus US Information
Keyrus US InformationKeyrus US Information
Keyrus US Information
 
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
 
Creating a Next-Generation Big Data Architecture
Creating a Next-Generation Big Data ArchitectureCreating a Next-Generation Big Data Architecture
Creating a Next-Generation Big Data Architecture
 
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02
 
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)
 
Building the Artificially Intelligent Enterprise
Building the Artificially Intelligent EnterpriseBuilding the Artificially Intelligent Enterprise
Building the Artificially Intelligent Enterprise
 
Architecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment OptionsArchitecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment Options
 
How to Quickly and Easily Draw Value from Big Data Sources_Q3 symposia(Moa)
How to Quickly and Easily Draw Value  from Big Data Sources_Q3 symposia(Moa)How to Quickly and Easily Draw Value  from Big Data Sources_Q3 symposia(Moa)
How to Quickly and Easily Draw Value from Big Data Sources_Q3 symposia(Moa)
 
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationAccelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and Visualization
 
Incorporating the Data Lake into Your Analytic Architecture
Incorporating the Data Lake into Your Analytic ArchitectureIncorporating the Data Lake into Your Analytic Architecture
Incorporating the Data Lake into Your Analytic Architecture
 
Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)Data Virtualization. An Introduction (ASEAN)
Data Virtualization. An Introduction (ASEAN)
 
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationAccelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and Visualization
 
Building a New Platform for Customer Analytics
Building a New Platform for Customer Analytics Building a New Platform for Customer Analytics
Building a New Platform for Customer Analytics
 
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIAugmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
 
Microsoft cloud big data strategy
Microsoft cloud big data strategyMicrosoft cloud big data strategy
Microsoft cloud big data strategy
 
SIMPosium presentation_Bardess Qlik
SIMPosium presentation_Bardess QlikSIMPosium presentation_Bardess Qlik
SIMPosium presentation_Bardess Qlik
 
Revolution in Business Analytics-Zika Virus Example
Revolution in Business Analytics-Zika Virus ExampleRevolution in Business Analytics-Zika Virus Example
Revolution in Business Analytics-Zika Virus Example
 
zData BI & Advanced Analytics Platform + 8 Week Pilot Programs
zData BI & Advanced Analytics Platform + 8 Week Pilot ProgramszData BI & Advanced Analytics Platform + 8 Week Pilot Programs
zData BI & Advanced Analytics Platform + 8 Week Pilot Programs
 

Recently uploaded

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 

Recently uploaded (20)

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 

BAR360 open data platform presentation at DAMA, Sydney

  • 1. Sai Paravastu Principal, BAR360 Open Data Platform Why open source has taken precedence in making a common data platform for enterprises ? DAMA Sydney Chapter 11th August 2015
  • 3. BAR360 is an Australian DATA services business since 2005. We understand the information management needs of Australian businesses and cater tailored business DATA solutions by integrating and improving their data capabilities. BIRT is well open source Business Intelligence engine and we deliver development services, integration, maintenance and training offerings tailored to business needs in the Australia and New Zealand region. We also work with major BIRT OEM vendors like IBM, Micro focus, Schneider-Electric to name a few. We have started our practice in Hadoop and NoSQL for data ingestion and processing in the world of BIG Data Our Value Proposition We Provide Data Cleansing, Quality, Loading, Processing and Reporting services We source and manage DATA solutions focused on achieving ROI. We bring reliability and trustworthiness through simplicity in our engagements with clients. We develop, training and support in implementations. We evangelize in open source technologies. We source local IT resources in projects and stand by our team. Who is BAR360 Confidential
  • 4. Extended Partners We have provided training and professional services to Open source BIRT adopters and BIRT OEM vendors across Australia. Clients who used our Training and Professional Services We are planning to grow and develop the open source community as well as building integrated solutions using open source technologies. Extended our professional services to other signed partnership agreements with BIRT OEM vendors and open source adopters for Training and Professional services. Confidential
  • 5. Business Challenges Achieved Business Intelligence • Need to empower users with easy-to use, self-service BI – Users need actionable insight – when they need it – where ever they are – in the form required – Delivered based on role and security levels – Straightforward enough for non-IT staff to use • Reducing delivery time – Single platform for all systems – Easy-to-use for a range of skills Business Analytics Improve Member Loyalty Reduce Churn • Who are our loyal members ? • Does member cover type has impact on their likelihood of leaving the fund? • Does member claiming has an impact on their leaving the fund? • What role does demographics play in churn? • Is time with fund a factor ? – Case study developed on “Member Retention Analysis” – Case study developed “Customer Segmentation and Next Best Offer” Confidential
  • 6. Profile of the Principal Confidential Sai is an experienced technology consultant with a very good track record of being instrumental in delivering projects Experienced Business & Architecture focused Solution Integration Architect, a strategic thinker and a change agent. Experience in Formulating the IT Architecture and driving solutions in line with EA and presenting to senior management. Manage engagements through to successful completion of projects with in the timelines, meeting all requirements including ROI, business benefits and customer satisfaction. With over 2 decades of experience in technology. Sai is a strategic planner around enterprise data strategy and system development life cycle improvement. He has very good understanding of the challenges faced by the IT as well as business stakeholders in the information management space. Experienced across Manufacturing, Education, Public sector, Banking and Insurance verticals. Associated with many consulting service companies in solution based sales for the last 7 years. I am a Strategic Implementation partner with OpenText and Hortonworks in ANZ region.
  • 7. Big Data Definition Big data is a collection of data sets so large and complex that it becomes difficult to process using currently on-hand database management tools or traditional data processing applications Source: Wikipedia Web Logs RFID Sensors Social Networks Internet Text Searches Call Detail Records Astronomy Atmospheric Info Genomics Biogeochemical Biological Military Surveillance Medical Records E-Commerce Video
  • 8. Traditional Data vs. Big Data Traditional Data Big Data Gigabytes to Terabytes Petabytes to Exabytes Centralized Distributed Structured Semi-structured to Unstructured Stable Data Model Flat Schemas Known Complex Interrelationships Few Complex Interrelationships Source: Wikibon Community
  • 9. When to Use Big Data vs. Relational Big Data Relational Analysis Type Exploratory analysis to uncover value in the data Operational analysis of what was discovered Data Granularity Store HUGE amounts of highly granular data Store transform (sometimes) aggregated data Timeframe Data flows in BIG Data  “real-time” monitoring Long term trending analysis Is Big Data a replacement for Relational Data?
  • 10. Why BIG Data In a nutshell, the quest for Big Data is directly attributable to analytics, which has evolved from being a business initiative to a business imperative. Many vendors are talking about Big Data, but we’re not seeing much more than the ability to store large volumes of data, leaving the organization to “roll their own” applications without much help to make sense of it all. Real value can only emerge from a consumable analytics platform that saves you from having to build applications from scratch one that effectively flattens the time-to-insight curve. In my opinion BIG Data is truly all about analytics. Confidential
  • 11. New Approaches To Big Data Processing & Analytics Traditional tools and technologies are straining • New approaches to data processing – Commodity hardware to scale – Parallel processing techniques – Non-relational data storage capabilities – Unstructured, semi-structured data • Better analytics – Advanced visualization – Data mining Source: Wikibon Community Confidential
  • 12. New Approaches to Big Data Processing & Analytics – Hadoop Approach • Data broken into “parts” • Loaded into file system • Multiple nodes • MapReduce • Batch-style historical analysis – NoSQL • Cassandra, MongoDB, CouchDB, HBase* • Discrete data stored among large volumes • Higher performance than relational data sources – Massively Parallel Analytic Databases • Quickly ingest mostly structured data • Minimal data modeling • Scale to petabytes of data • Near real-time results to complex SQL Source: Wikibon Community Confidential
  • 13. Big Data Growth Drivers • Increased awareness of the Big Data benefits – Not just web, financial services, pharmaceuticals, retail • Increased maturity of Big Data software – Data stores, analytical engines • Increased availability of professional services – Supporting business use cases • Increased investment in infrastructure – Google, Facebook, Amazon Source: Wikibon Community Confidential
  • 14. Top Big Data Challenges • Data integration – Top challenge – Integrating disparate data, different sources, different formats is difficult • Getting started with the right project – Building the right team – Determine the top business problem • Architecting a big data system. – High volume, high frequency data – Build unified information architecture • Lack of skills or staff – Some hire externally / university hires. – Others try to re-train from within. – Cross pollinate skills from another part of the organization – Build centers of excellence that help with the training Source:TDWI • Data privacy, governance and compliance issues • How it can help business • Integrating legacy systems • The cost of implementation Confidential
  • 15. Apache Software Foundation – ASF There are currently 300+ open source initiatives at the ASF: • 163 committees managing 273 projects • 5 special committees • 43 incubating podlings Source: ASF Confidential
  • 16. Open Data Platform Enabling BIG Data solutions to flourish atop a common core platform • The Open Data Platform Initiative (ODP) is an enterprise-focused shared industry effort focused on simplifying adoption and promoting the use and advancing the state of Apache Hadoop® and Big Data technologies for the enterprise. It is a non-profit organization being created by folks that help to create: Apache, Eclipse, Linux, OpenStack, OpenDaylight, Open Networking Foundation, OSGI, WSI, UDDI , OASIS, Cloud Foundry Foundation and many others. • Under the governance of the Apache Software Foundation community to innovate and deliver a common data platform for enterprises as it brings the largest number of developers together to commit far faster than any single vendor could achieve and in a way that is free of friction for the enterprise and vendors build extension on the core of the ODP. Source: ASF HIVE Query PIG Scripting MAHOUT Machine Learning MAP REDUCE Distributed processing YARN Resource scheduling and negotiation HDFS Distributed Storage HCATALOG Metadata mgmt HBASE NoSQL database SQOOP Import/Export FLUME/STORM stream KAFKA Sub/Pub ZOOKEEPER Coordination OOZIE WFautomation AMBARI DRILL Interactive SPARK / FLINK FALCON KNOX TEZ Interactive ARVO dataserialization Confidential
  • 17. Benefits of ODP Enabling BIG Data solutions to flourish atop a common core platform The ODP core is a set of open source Hadoop technologies designed to provide a standardized core that big data solution providers software and hardware developers can use to deliver compatible solutions rooted in open source that unlock customer choice. Source: ODP How do we benefit: ASF - 100% focus on enabling collaboration between developers - does not recognize corporations - projects are on completely asynchronous development cycles ODP - Enables collaboration between vendors - Focused on developing a platform , but does not supersede governance - creates complimentary brand value for integrated platform - focused on enterprise use case for hadoop Confidential
  • 18. ODP Core will initially focus on Apache Hadoop (inclusive of HDFS, YARN, and MapReduce) and Apache Ambari. Once the ODP members and processes are well established, the scope of the ODP Core may expand to include other open source projects. The ODP Core will deliver the following benefits: • For Apache Hadoop technology vendors, reduced R&D costs that come from a shared qualification effort • For Big Data application solution providers, reduced R&D costs that come from more predictable and better qualified releases • Improved interoperability within the platform and simplified integration with existing systems in support of a broad set of use cases • Less friction and confusion for Enterprise customers and vendors • Ability to redirect resources towards higher value efforts Benefits of ODP Source: ODP Confidential
  • 19. 1. Provide a stable base against which Big Data solutions providers can qualify solutions. 2. Support community development and outreach activities that accelerate the rollout of modern data architectures that leverage Apache Hadoop 3. Contribute to ASF projects in accordance with ASF processes and Intellectual Property guidelines. 4. Accelerate the delivery of Big Data solutions by providing a well-defined core platform to target. 5. Define, integrate, test, and certify a standard "ODP Core" of compatible versions of select Big Data open source projects. 6. Produce a set of tools and methods that enable members to create and test differentiated offerings based on the ODP Core. 7. Reinforce the role of the Apache Software Foundation (ASF) in the development and governance of upstream projects. 8. Help minimize the fragmentation and duplication of effort within the industry ODP Delivers The ODP Core will take the guesswork out of the process and accelerate many use cases by running on a common platform. Freeing up enterprises and ecosystem vendors to focus on building business driven applications. Source: ODP Confidential
  • 20. Thank You Partners Confidential BAR360 Sai Paravastu Principal +61 402 449 524 sai@bar360.com.au www.bar360.com.au