SlideShare a Scribd company logo
1 of 20
Download to read offline
BESTPEER++: A PEER-TO-PEER BASED
LARGE-SCALE DATA PROCESSING
PLATFORM
Submitted by,
PRABHUDEV R
4NI12IS416
1
AGENDA
1. Introduction.
2. Overview of the BestPeer++ system.
3. Bootstrap peer.
4. Normal peer.
5. Benchmarking.
6. Advantages.
7. Conclusion.
2
1. INTRODUCTION
 Corporate network shares information with participating
companies of a common interest.
 Companies reduce their operational cost and increase
the revenue.
 Delivers elastic data sharing services.
 Provides economical , flexible and scalable platform.
 Based on pay as you go business model. 3
2. OVERVIEW OF THE BESTPEER++
SYSTEM
BestPeer data management platform
 Adaptive join query processing
 distributed online aggregation techniques to provide efficient
query processing.
BestPeer++, cloud enabled evolution of BestPeer
 Distributed access control
 Multiple types of indexes
 Pay-as-you-go query processing for delivering elastic data
sharing services in the cloud.
 The software components of BestPeer++ are separated
into two parts:-
1. Core.
2. Adapter. 4
AMAZON CLOUD ADAPTER
 Elastic hardware infrastructure for BestPeer++ to
operate on by using Amazon Cloud services.
 Launching/terminating dedicated MySQL database
servers and monitoring/ backup/auto-scaling those
servers.
 Finally, the Amazon Cloud Adapter also provides
automatic fail-over service.
5
THE BESTPEER++ CORE
 Platform-independent
logic, including query
processing and P2P
overlay.
 Cloud adapter and
consists of two software
components:
1. Bootstrap peer.
2. Normal peer.
6
3. BOOTSTRAP PEER
 The bootstrap peer is run by the BestPeer++ service
provider, and its main functionality is to manage the
BestPeer+ + network
1. Managing Normal Peer Join/Departure.
2. Auto Fail-Over and Auto-Scaling.
7
MANAGING NORMAL PEER JOIN/DEPARTURE
 Each normal peer intends to join an existing
corporate network must first connect to the
bootstrap peer.
 The joined peer will receive the corporate network
information including the current participants, global
schema, role definitions, and an issued certificate.
 When a normal peer needs to leave the network, it
also notifies the bootstrap peer first.
8
AUTO FAIL-OVER AND AUTO-SCALING
 In addition to managing peer join and peer
departure.
 The bootstrap peer spends most of its running-time
on monitoring the health of normal peers.
 Scheduling fail-over and auto-scaling events.
9
4.NORMAL PEER
Offline data flow
The data are extracted
periodically by a data
loader from the business
production system to the
normal peer instance.
Online data flow
 The query processor
performs user queries
using a fetch and
process strategy.
10
SCHEMA MAPPING
 Defines the mapping between the local schema of
each production system and the global shared
schema.
 The mapping consists of metadata mappings and
value mappings and also support instance level
mapping.
11
DATA LOADER
 Extracts data from production systems to normal peer
instances according to the result of schema mapping.
 The data loader also creates a snapshot of the newly
inserted data.
 At interval times, re-extracts data from the production system
to create a new snapshot.
 This snapshot is then compared to the previously stored one
to detect data changes.
 Finally, the changes are used to update the MySQL database
hosted in the normal peer. 12
DATA INDEXER
 BATON
 The first range, R0, is the subdomain maintained by
the node.
 The second range, R1, is the domain of the sub tree
rooted at the node. 13
DISTRIBUTED ACCESS CONTROL
 The basic idea is to use roles as templates to
capture common data access privileges and allow
businesses to override these privileges to meet
their specific needs.
 The information of the users created at one peer is
forwarded to the bootstrap peer and then
broadcasted to other normal peers also.
 The local administrator at this peer can easily
define the role-based access control for any user.
14
PAY-AS-YOU-GO QUERY PROCESSING
 BestPeer++ provides two services for the
participants:
1. Storage service
2. Search service
 After data are exported from the local business
system into a BestPeer++ instance, we apply the
schema mapping rules to transform them into the
predefined formats.
15
5.BENCHMARKING
 This section evaluates the performance and throughput
of BestPeer++ on Amazon cloud platform.
1. For the performance benchmark, they compare the
query latency of BestPeer++ with HadoopDB using
five queries selected from typical corporate network
applications workloads.
2. For the throughput benchmark, they create a simple
supply-chain network consisting of suppliers and
retailers and study the query throughput of the
system.
16
6.ADVANTAGES OF BESTPEER++
1. Deliver near linear query throughput as the number of
normal peers grows.
2. BestPeer++ adopts the pay-as-you-go business model
popularized by cloud computing.
3. The role-based access control for the inherent distributed
environment of corporate networks.
4. P2P technology to retrieve data between business
partners.
5. Efficient data sharing within corporate networks. 17
7.CONCLUSION
 The benchmark conducted on Amazon EC2 cloud
platform shows that our system can efficiently
handle typical workloads in a corporate network
and can deliver near linear query throughput as the
number of normal peers grows.
 Therefore, BestPeer++ is a promising solution for
efficient data sharing within corporate networks.
18
REFERENCES
1. S. Wu, Q.H. Vu, J. Li, and K.-L. Tan, “Adaptive Multi-Join
Query Processing in PDBMS,” Proc. IEEE Int’l Conf. Data
Eng. (ICDE ’09), pp. 1239-1242, 2009.
2. D. Bermbach and S. Tai, “Eventual Consistency: How Soon
is Eventual? An Evaluation of Amazon s3’s Consistency
Behavior,” in Proc. 6th Workshop Middleware Serv. Oriented
Comput. (MW4SOC ’11), pp. 1:1-1:6, NY, USA, 2011.
3. B. Cooper, A. Silberstein, E. Tam, R. Ramakrishnan, and R.
Sears, “Benchmarking Cloud Serving Systems with YCSB,”
Proc. First ACM Symp. Cloud Computing, pp. 143-154,
2010.
4. Oracle Inc., “Achieving the Cloud Computing Vision,” White
Paper, 2010. 19
20

More Related Content

What's hot

Building a Turbo-fast Data Warehousing Platform with Databricks
Building a Turbo-fast Data Warehousing Platform with DatabricksBuilding a Turbo-fast Data Warehousing Platform with Databricks
Building a Turbo-fast Data Warehousing Platform with DatabricksDatabricks
 
Cognos Analytics Performance Tuning: Tips & Tricks to Rev Performance
Cognos Analytics Performance Tuning: Tips & Tricks to Rev Performance Cognos Analytics Performance Tuning: Tips & Tricks to Rev Performance
Cognos Analytics Performance Tuning: Tips & Tricks to Rev Performance Senturus
 
Evolution of Big Data at Intel - Crawl, Walk and Run Approach
Evolution of Big Data at Intel - Crawl, Walk and Run ApproachEvolution of Big Data at Intel - Crawl, Walk and Run Approach
Evolution of Big Data at Intel - Crawl, Walk and Run ApproachDataWorks Summit
 
Whitepaper : Working with Greenplum Database using Toad for Data Analysts
Whitepaper : Working with Greenplum Database using Toad for Data Analysts Whitepaper : Working with Greenplum Database using Toad for Data Analysts
Whitepaper : Working with Greenplum Database using Toad for Data Analysts EMC
 
The Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture ViewThe Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture ViewDataWorks Summit/Hadoop Summit
 
Building and managing complex dependencies pipeline using Apache Oozie
Building and managing complex dependencies pipeline using Apache OozieBuilding and managing complex dependencies pipeline using Apache Oozie
Building and managing complex dependencies pipeline using Apache OozieDataWorks Summit/Hadoop Summit
 
Protecting your Critical Hadoop Clusters Against Disasters
Protecting your Critical Hadoop Clusters Against DisastersProtecting your Critical Hadoop Clusters Against Disasters
Protecting your Critical Hadoop Clusters Against DisastersDataWorks Summit
 

What's hot (8)

Building a Turbo-fast Data Warehousing Platform with Databricks
Building a Turbo-fast Data Warehousing Platform with DatabricksBuilding a Turbo-fast Data Warehousing Platform with Databricks
Building a Turbo-fast Data Warehousing Platform with Databricks
 
Cognos Analytics Performance Tuning: Tips & Tricks to Rev Performance
Cognos Analytics Performance Tuning: Tips & Tricks to Rev Performance Cognos Analytics Performance Tuning: Tips & Tricks to Rev Performance
Cognos Analytics Performance Tuning: Tips & Tricks to Rev Performance
 
Evolution of Big Data at Intel - Crawl, Walk and Run Approach
Evolution of Big Data at Intel - Crawl, Walk and Run ApproachEvolution of Big Data at Intel - Crawl, Walk and Run Approach
Evolution of Big Data at Intel - Crawl, Walk and Run Approach
 
Whitepaper : Working with Greenplum Database using Toad for Data Analysts
Whitepaper : Working with Greenplum Database using Toad for Data Analysts Whitepaper : Working with Greenplum Database using Toad for Data Analysts
Whitepaper : Working with Greenplum Database using Toad for Data Analysts
 
The Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture ViewThe Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture View
 
Building and managing complex dependencies pipeline using Apache Oozie
Building and managing complex dependencies pipeline using Apache OozieBuilding and managing complex dependencies pipeline using Apache Oozie
Building and managing complex dependencies pipeline using Apache Oozie
 
Apache Hive 2.0: SQL, Speed, Scale
Apache Hive 2.0: SQL, Speed, ScaleApache Hive 2.0: SQL, Speed, Scale
Apache Hive 2.0: SQL, Speed, Scale
 
Protecting your Critical Hadoop Clusters Against Disasters
Protecting your Critical Hadoop Clusters Against DisastersProtecting your Critical Hadoop Clusters Against Disasters
Protecting your Critical Hadoop Clusters Against Disasters
 

Viewers also liked

श्रीराम समर्थ Dr shriniwas kashalikar
श्रीराम समर्थ  Dr shriniwas kashalikarश्रीराम समर्थ  Dr shriniwas kashalikar
श्रीराम समर्थ Dr shriniwas kashalikarshriniwas kashalikar
 
Project Management Experience - Anthony D. Braunscheidel
Project Management Experience - Anthony D. BraunscheidelProject Management Experience - Anthony D. Braunscheidel
Project Management Experience - Anthony D. BraunscheidelAnthony Braunscheidel
 
Six Tips for Great Customer Interviews
Six Tips for Great Customer InterviewsSix Tips for Great Customer Interviews
Six Tips for Great Customer InterviewsAnne Piessens
 
Superliving (concise edition) dr. shriniwas kashalikar
Superliving (concise edition) dr. shriniwas kashalikarSuperliving (concise edition) dr. shriniwas kashalikar
Superliving (concise edition) dr. shriniwas kashalikarshriniwas kashalikar
 
Alfabeto de nomes h
Alfabeto de nomes   hAlfabeto de nomes   h
Alfabeto de nomes hDário Reis
 
Przepisy na zdrowe posiłki dla dzieci, młodzieży, dorosłych - Łukasz Konik - ...
Przepisy na zdrowe posiłki dla dzieci, młodzieży, dorosłych - Łukasz Konik - ...Przepisy na zdrowe posiłki dla dzieci, młodzieży, dorosłych - Łukasz Konik - ...
Przepisy na zdrowe posiłki dla dzieci, młodzieży, dorosłych - Łukasz Konik - ...SERVIER POLSKA
 
La Regolazione per i Sistemi di Distribuzione Chiusi (SDC)
La Regolazione per i Sistemi di Distribuzione Chiusi (SDC)La Regolazione per i Sistemi di Distribuzione Chiusi (SDC)
La Regolazione per i Sistemi di Distribuzione Chiusi (SDC)ARERA
 
Financing the water sector
Financing the water sectorFinancing the water sector
Financing the water sectorARERA
 
Broker de Forex RoboForex
Broker de Forex RoboForexBroker de Forex RoboForex
Broker de Forex RoboForexRaul Canessa
 
Estrategia de trading Nihilist para Forex basada en MT4
Estrategia de trading Nihilist para Forex basada en MT4Estrategia de trading Nihilist para Forex basada en MT4
Estrategia de trading Nihilist para Forex basada en MT4Raul Canessa
 
Indagine conoscitiva avviata con deliberazione 595/2015/R/idr sulle strategie...
Indagine conoscitiva avviata con deliberazione 595/2015/R/idr sulle strategie...Indagine conoscitiva avviata con deliberazione 595/2015/R/idr sulle strategie...
Indagine conoscitiva avviata con deliberazione 595/2015/R/idr sulle strategie...ARERA
 
An Overview of Scenario Planning - Introduction, Overview and Examples
An Overview of Scenario Planning - Introduction, Overview and ExamplesAn Overview of Scenario Planning - Introduction, Overview and Examples
An Overview of Scenario Planning - Introduction, Overview and ExamplesAxiom EPM
 
Artificial Intelligence and the Law of Armed Conflict
Artificial Intelligence and the Law of Armed ConflictArtificial Intelligence and the Law of Armed Conflict
Artificial Intelligence and the Law of Armed ConflictDr. Lydia Kostopoulos
 

Viewers also liked (19)

श्रीराम समर्थ Dr shriniwas kashalikar
श्रीराम समर्थ  Dr shriniwas kashalikarश्रीराम समर्थ  Dr shriniwas kashalikar
श्रीराम समर्थ Dr shriniwas kashalikar
 
The Presentation Secrets of Steve Jobs
The Presentation Secrets of Steve JobsThe Presentation Secrets of Steve Jobs
The Presentation Secrets of Steve Jobs
 
Edi
EdiEdi
Edi
 
Project Management Experience - Anthony D. Braunscheidel
Project Management Experience - Anthony D. BraunscheidelProject Management Experience - Anthony D. Braunscheidel
Project Management Experience - Anthony D. Braunscheidel
 
Ebay
EbayEbay
Ebay
 
Six Tips for Great Customer Interviews
Six Tips for Great Customer InterviewsSix Tips for Great Customer Interviews
Six Tips for Great Customer Interviews
 
Joy of life
Joy of lifeJoy of life
Joy of life
 
Superliving (concise edition) dr. shriniwas kashalikar
Superliving (concise edition) dr. shriniwas kashalikarSuperliving (concise edition) dr. shriniwas kashalikar
Superliving (concise edition) dr. shriniwas kashalikar
 
Alfabeto de nomes h
Alfabeto de nomes   hAlfabeto de nomes   h
Alfabeto de nomes h
 
Przepisy na zdrowe posiłki dla dzieci, młodzieży, dorosłych - Łukasz Konik - ...
Przepisy na zdrowe posiłki dla dzieci, młodzieży, dorosłych - Łukasz Konik - ...Przepisy na zdrowe posiłki dla dzieci, młodzieży, dorosłych - Łukasz Konik - ...
Przepisy na zdrowe posiłki dla dzieci, młodzieży, dorosłych - Łukasz Konik - ...
 
La Regolazione per i Sistemi di Distribuzione Chiusi (SDC)
La Regolazione per i Sistemi di Distribuzione Chiusi (SDC)La Regolazione per i Sistemi di Distribuzione Chiusi (SDC)
La Regolazione per i Sistemi di Distribuzione Chiusi (SDC)
 
Financing the water sector
Financing the water sectorFinancing the water sector
Financing the water sector
 
Broker de Forex RoboForex
Broker de Forex RoboForexBroker de Forex RoboForex
Broker de Forex RoboForex
 
Estrategia de trading Nihilist para Forex basada en MT4
Estrategia de trading Nihilist para Forex basada en MT4Estrategia de trading Nihilist para Forex basada en MT4
Estrategia de trading Nihilist para Forex basada en MT4
 
PPt on internet
PPt on internetPPt on internet
PPt on internet
 
Indagine conoscitiva avviata con deliberazione 595/2015/R/idr sulle strategie...
Indagine conoscitiva avviata con deliberazione 595/2015/R/idr sulle strategie...Indagine conoscitiva avviata con deliberazione 595/2015/R/idr sulle strategie...
Indagine conoscitiva avviata con deliberazione 595/2015/R/idr sulle strategie...
 
An Overview of Scenario Planning - Introduction, Overview and Examples
An Overview of Scenario Planning - Introduction, Overview and ExamplesAn Overview of Scenario Planning - Introduction, Overview and Examples
An Overview of Scenario Planning - Introduction, Overview and Examples
 
Artificial Intelligence and the Law of Armed Conflict
Artificial Intelligence and the Law of Armed ConflictArtificial Intelligence and the Law of Armed Conflict
Artificial Intelligence and the Law of Armed Conflict
 
Sustainability Hub Presentation
Sustainability Hub PresentationSustainability Hub Presentation
Sustainability Hub Presentation
 

Similar to Best peer++

Bestpeer++ a peer to-peer based large-scale data processing platform
Bestpeer++ a peer to-peer based large-scale data processing platformBestpeer++ a peer to-peer based large-scale data processing platform
Bestpeer++ a peer to-peer based large-scale data processing platformPapitha Velumani
 
JPJ1416 BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing Platform
JPJ1416 BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing PlatformJPJ1416 BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing Platform
JPJ1416 BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing Platformchennaijp
 
best peer++ a peer-to-peer based large-scale data processing platform
best peer++ a peer-to-peer based large-scale data processing platformbest peer++ a peer-to-peer based large-scale data processing platform
best peer++ a peer-to-peer based large-scale data processing platformswathi78
 
IEEE 2014 JAVA DATA MINING PROJECTS Best peer++ a peer to-peer based large-sc...
IEEE 2014 JAVA DATA MINING PROJECTS Best peer++ a peer to-peer based large-sc...IEEE 2014 JAVA DATA MINING PROJECTS Best peer++ a peer to-peer based large-sc...
IEEE 2014 JAVA DATA MINING PROJECTS Best peer++ a peer to-peer based large-sc...IEEEFINALYEARSTUDENTPROJECTS
 
2014 IEEE JAVA DATA MINING PROJECT Best peer++ a peer to-peer based large-sca...
2014 IEEE JAVA DATA MINING PROJECT Best peer++ a peer to-peer based large-sca...2014 IEEE JAVA DATA MINING PROJECT Best peer++ a peer to-peer based large-sca...
2014 IEEE JAVA DATA MINING PROJECT Best peer++ a peer to-peer based large-sca...IEEEMEMTECHSTUDENTSPROJECTS
 
Orca: A Modular Query Optimizer Architecture for Big Data
Orca: A Modular Query Optimizer Architecture for Big DataOrca: A Modular Query Optimizer Architecture for Big Data
Orca: A Modular Query Optimizer Architecture for Big DataEMC
 
IDC WHITE PAPER - IBM PureFlex System Ready for Cloud
IDC WHITE PAPER - IBM PureFlex System Ready for CloudIDC WHITE PAPER - IBM PureFlex System Ready for Cloud
IDC WHITE PAPER - IBM PureFlex System Ready for CloudAngel Villar Garea
 
Artur Borycki - Beyond Lambda - how to get from logical to physical - code.ta...
Artur Borycki - Beyond Lambda - how to get from logical to physical - code.ta...Artur Borycki - Beyond Lambda - how to get from logical to physical - code.ta...
Artur Borycki - Beyond Lambda - how to get from logical to physical - code.ta...AboutYouGmbH
 
Information Virtualization: Query Federation on Data Lakes
Information Virtualization: Query Federation on Data LakesInformation Virtualization: Query Federation on Data Lakes
Information Virtualization: Query Federation on Data LakesDataWorks Summit
 
Tuning database performance
Tuning database performanceTuning database performance
Tuning database performanceBinay Acharya
 
TSOLogic_I-P_Overview-2016-08-16
TSOLogic_I-P_Overview-2016-08-16TSOLogic_I-P_Overview-2016-08-16
TSOLogic_I-P_Overview-2016-08-16Terence White
 
Optimized Systems: Matching technologies for business success.
Optimized Systems: Matching technologies for business success.Optimized Systems: Matching technologies for business success.
Optimized Systems: Matching technologies for business success.Karl Roche
 
Demystifying Data Virtualization (ASEAN)
Demystifying Data Virtualization (ASEAN)Demystifying Data Virtualization (ASEAN)
Demystifying Data Virtualization (ASEAN)Denodo
 
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...Denodo
 
Which Change Data Capture Strategy is Right for You?
Which Change Data Capture Strategy is Right for You?Which Change Data Capture Strategy is Right for You?
Which Change Data Capture Strategy is Right for You?Precisely
 
IBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & Wieck
IBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & WieckIBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & Wieck
IBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & WieckIBM Events
 
Cisco Big Data Warehouse Expansion Featuring MapR Distribution
Cisco Big Data Warehouse Expansion Featuring MapR DistributionCisco Big Data Warehouse Expansion Featuring MapR Distribution
Cisco Big Data Warehouse Expansion Featuring MapR DistributionAppfluent Technology
 

Similar to Best peer++ (20)

Bestpeer++ a peer to-peer based large-scale data processing platform
Bestpeer++ a peer to-peer based large-scale data processing platformBestpeer++ a peer to-peer based large-scale data processing platform
Bestpeer++ a peer to-peer based large-scale data processing platform
 
JPJ1416 BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing Platform
JPJ1416 BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing PlatformJPJ1416 BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing Platform
JPJ1416 BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing Platform
 
About CDAP
About CDAPAbout CDAP
About CDAP
 
best peer++ a peer-to-peer based large-scale data processing platform
best peer++ a peer-to-peer based large-scale data processing platformbest peer++ a peer-to-peer based large-scale data processing platform
best peer++ a peer-to-peer based large-scale data processing platform
 
IEEE 2014 JAVA DATA MINING PROJECTS Best peer++ a peer to-peer based large-sc...
IEEE 2014 JAVA DATA MINING PROJECTS Best peer++ a peer to-peer based large-sc...IEEE 2014 JAVA DATA MINING PROJECTS Best peer++ a peer to-peer based large-sc...
IEEE 2014 JAVA DATA MINING PROJECTS Best peer++ a peer to-peer based large-sc...
 
2014 IEEE JAVA DATA MINING PROJECT Best peer++ a peer to-peer based large-sca...
2014 IEEE JAVA DATA MINING PROJECT Best peer++ a peer to-peer based large-sca...2014 IEEE JAVA DATA MINING PROJECT Best peer++ a peer to-peer based large-sca...
2014 IEEE JAVA DATA MINING PROJECT Best peer++ a peer to-peer based large-sca...
 
Orca: A Modular Query Optimizer Architecture for Big Data
Orca: A Modular Query Optimizer Architecture for Big DataOrca: A Modular Query Optimizer Architecture for Big Data
Orca: A Modular Query Optimizer Architecture for Big Data
 
IDC WHITE PAPER - IBM PureFlex System Ready for Cloud
IDC WHITE PAPER - IBM PureFlex System Ready for CloudIDC WHITE PAPER - IBM PureFlex System Ready for Cloud
IDC WHITE PAPER - IBM PureFlex System Ready for Cloud
 
Artur Borycki - Beyond Lambda - how to get from logical to physical - code.ta...
Artur Borycki - Beyond Lambda - how to get from logical to physical - code.ta...Artur Borycki - Beyond Lambda - how to get from logical to physical - code.ta...
Artur Borycki - Beyond Lambda - how to get from logical to physical - code.ta...
 
Information Virtualization: Query Federation on Data Lakes
Information Virtualization: Query Federation on Data LakesInformation Virtualization: Query Federation on Data Lakes
Information Virtualization: Query Federation on Data Lakes
 
Tuning database performance
Tuning database performanceTuning database performance
Tuning database performance
 
TSOLogic_I-P_Overview-2016-08-16
TSOLogic_I-P_Overview-2016-08-16TSOLogic_I-P_Overview-2016-08-16
TSOLogic_I-P_Overview-2016-08-16
 
Optimized Systems: Matching technologies for business success.
Optimized Systems: Matching technologies for business success.Optimized Systems: Matching technologies for business success.
Optimized Systems: Matching technologies for business success.
 
Demystifying Data Virtualization (ASEAN)
Demystifying Data Virtualization (ASEAN)Demystifying Data Virtualization (ASEAN)
Demystifying Data Virtualization (ASEAN)
 
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
 
IBM Cloud pak for data brochure
IBM Cloud pak for data   brochureIBM Cloud pak for data   brochure
IBM Cloud pak for data brochure
 
Which Change Data Capture Strategy is Right for You?
Which Change Data Capture Strategy is Right for You?Which Change Data Capture Strategy is Right for You?
Which Change Data Capture Strategy is Right for You?
 
IBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & Wieck
IBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & WieckIBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & Wieck
IBM InterConnect 2013 Expert Integrated Systems Keynote: Sotiropoulos & Wieck
 
Cisco Big Data Warehouse Expansion Featuring MapR Distribution
Cisco Big Data Warehouse Expansion Featuring MapR DistributionCisco Big Data Warehouse Expansion Featuring MapR Distribution
Cisco Big Data Warehouse Expansion Featuring MapR Distribution
 
MKHCV2
MKHCV2MKHCV2
MKHCV2
 

Recently uploaded

How to Manage Buy 3 Get 1 Free in Odoo 17
How to Manage Buy 3 Get 1 Free in Odoo 17How to Manage Buy 3 Get 1 Free in Odoo 17
How to Manage Buy 3 Get 1 Free in Odoo 17Celine George
 
DBMSArchitecture_QueryProcessingandOptimization.pdf
DBMSArchitecture_QueryProcessingandOptimization.pdfDBMSArchitecture_QueryProcessingandOptimization.pdf
DBMSArchitecture_QueryProcessingandOptimization.pdfChristalin Nelson
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...Nguyen Thanh Tu Collection
 
4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptxmary850239
 
CLASSIFICATION OF ANTI - CANCER DRUGS.pptx
CLASSIFICATION OF ANTI - CANCER DRUGS.pptxCLASSIFICATION OF ANTI - CANCER DRUGS.pptx
CLASSIFICATION OF ANTI - CANCER DRUGS.pptxAnupam32727
 
4.9.24 Social Capital and Social Exclusion.pptx
4.9.24 Social Capital and Social Exclusion.pptx4.9.24 Social Capital and Social Exclusion.pptx
4.9.24 Social Capital and Social Exclusion.pptxmary850239
 
An Overview of the Calendar App in Odoo 17 ERP
An Overview of the Calendar App in Odoo 17 ERPAn Overview of the Calendar App in Odoo 17 ERP
An Overview of the Calendar App in Odoo 17 ERPCeline George
 
The role of Geography in climate education: science and active citizenship
The role of Geography in climate education: science and active citizenshipThe role of Geography in climate education: science and active citizenship
The role of Geography in climate education: science and active citizenshipKarl Donert
 
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptxBIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptxSayali Powar
 
Narcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfNarcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfPrerana Jadhav
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationdeepaannamalai16
 
Tree View Decoration Attribute in the Odoo 17
Tree View Decoration Attribute in the Odoo 17Tree View Decoration Attribute in the Odoo 17
Tree View Decoration Attribute in the Odoo 17Celine George
 
PART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFE
PART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFEPART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFE
PART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFEMISSRITIMABIOLOGYEXP
 
Indexing Structures in Database Management system.pdf
Indexing Structures in Database Management system.pdfIndexing Structures in Database Management system.pdf
Indexing Structures in Database Management system.pdfChristalin Nelson
 
How to Uninstall a Module in Odoo 17 Using Command Line
How to Uninstall a Module in Odoo 17 Using Command LineHow to Uninstall a Module in Odoo 17 Using Command Line
How to Uninstall a Module in Odoo 17 Using Command LineCeline George
 
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQ-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQuiz Club NITW
 
Mythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWMythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWQuiz Club NITW
 
MS4 level being good citizen -imperative- (1) (1).pdf
MS4 level   being good citizen -imperative- (1) (1).pdfMS4 level   being good citizen -imperative- (1) (1).pdf
MS4 level being good citizen -imperative- (1) (1).pdfMr Bounab Samir
 

Recently uploaded (20)

How to Manage Buy 3 Get 1 Free in Odoo 17
How to Manage Buy 3 Get 1 Free in Odoo 17How to Manage Buy 3 Get 1 Free in Odoo 17
How to Manage Buy 3 Get 1 Free in Odoo 17
 
DBMSArchitecture_QueryProcessingandOptimization.pdf
DBMSArchitecture_QueryProcessingandOptimization.pdfDBMSArchitecture_QueryProcessingandOptimization.pdf
DBMSArchitecture_QueryProcessingandOptimization.pdf
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...
 
4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx
 
CLASSIFICATION OF ANTI - CANCER DRUGS.pptx
CLASSIFICATION OF ANTI - CANCER DRUGS.pptxCLASSIFICATION OF ANTI - CANCER DRUGS.pptx
CLASSIFICATION OF ANTI - CANCER DRUGS.pptx
 
4.9.24 Social Capital and Social Exclusion.pptx
4.9.24 Social Capital and Social Exclusion.pptx4.9.24 Social Capital and Social Exclusion.pptx
4.9.24 Social Capital and Social Exclusion.pptx
 
An Overview of the Calendar App in Odoo 17 ERP
An Overview of the Calendar App in Odoo 17 ERPAn Overview of the Calendar App in Odoo 17 ERP
An Overview of the Calendar App in Odoo 17 ERP
 
The role of Geography in climate education: science and active citizenship
The role of Geography in climate education: science and active citizenshipThe role of Geography in climate education: science and active citizenship
The role of Geography in climate education: science and active citizenship
 
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptxBIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
 
Narcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfNarcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdf
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentation
 
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
 
Tree View Decoration Attribute in the Odoo 17
Tree View Decoration Attribute in the Odoo 17Tree View Decoration Attribute in the Odoo 17
Tree View Decoration Attribute in the Odoo 17
 
PART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFE
PART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFEPART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFE
PART 1 - CHAPTER 1 - CELL THE FUNDAMENTAL UNIT OF LIFE
 
Indexing Structures in Database Management system.pdf
Indexing Structures in Database Management system.pdfIndexing Structures in Database Management system.pdf
Indexing Structures in Database Management system.pdf
 
How to Uninstall a Module in Odoo 17 Using Command Line
How to Uninstall a Module in Odoo 17 Using Command LineHow to Uninstall a Module in Odoo 17 Using Command Line
How to Uninstall a Module in Odoo 17 Using Command Line
 
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQ-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
 
CARNAVAL COM MAGIA E EUFORIA _
CARNAVAL COM MAGIA E EUFORIA            _CARNAVAL COM MAGIA E EUFORIA            _
CARNAVAL COM MAGIA E EUFORIA _
 
Mythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWMythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITW
 
MS4 level being good citizen -imperative- (1) (1).pdf
MS4 level   being good citizen -imperative- (1) (1).pdfMS4 level   being good citizen -imperative- (1) (1).pdf
MS4 level being good citizen -imperative- (1) (1).pdf
 

Best peer++

  • 1. BESTPEER++: A PEER-TO-PEER BASED LARGE-SCALE DATA PROCESSING PLATFORM Submitted by, PRABHUDEV R 4NI12IS416 1
  • 2. AGENDA 1. Introduction. 2. Overview of the BestPeer++ system. 3. Bootstrap peer. 4. Normal peer. 5. Benchmarking. 6. Advantages. 7. Conclusion. 2
  • 3. 1. INTRODUCTION  Corporate network shares information with participating companies of a common interest.  Companies reduce their operational cost and increase the revenue.  Delivers elastic data sharing services.  Provides economical , flexible and scalable platform.  Based on pay as you go business model. 3
  • 4. 2. OVERVIEW OF THE BESTPEER++ SYSTEM BestPeer data management platform  Adaptive join query processing  distributed online aggregation techniques to provide efficient query processing. BestPeer++, cloud enabled evolution of BestPeer  Distributed access control  Multiple types of indexes  Pay-as-you-go query processing for delivering elastic data sharing services in the cloud.  The software components of BestPeer++ are separated into two parts:- 1. Core. 2. Adapter. 4
  • 5. AMAZON CLOUD ADAPTER  Elastic hardware infrastructure for BestPeer++ to operate on by using Amazon Cloud services.  Launching/terminating dedicated MySQL database servers and monitoring/ backup/auto-scaling those servers.  Finally, the Amazon Cloud Adapter also provides automatic fail-over service. 5
  • 6. THE BESTPEER++ CORE  Platform-independent logic, including query processing and P2P overlay.  Cloud adapter and consists of two software components: 1. Bootstrap peer. 2. Normal peer. 6
  • 7. 3. BOOTSTRAP PEER  The bootstrap peer is run by the BestPeer++ service provider, and its main functionality is to manage the BestPeer+ + network 1. Managing Normal Peer Join/Departure. 2. Auto Fail-Over and Auto-Scaling. 7
  • 8. MANAGING NORMAL PEER JOIN/DEPARTURE  Each normal peer intends to join an existing corporate network must first connect to the bootstrap peer.  The joined peer will receive the corporate network information including the current participants, global schema, role definitions, and an issued certificate.  When a normal peer needs to leave the network, it also notifies the bootstrap peer first. 8
  • 9. AUTO FAIL-OVER AND AUTO-SCALING  In addition to managing peer join and peer departure.  The bootstrap peer spends most of its running-time on monitoring the health of normal peers.  Scheduling fail-over and auto-scaling events. 9
  • 10. 4.NORMAL PEER Offline data flow The data are extracted periodically by a data loader from the business production system to the normal peer instance. Online data flow  The query processor performs user queries using a fetch and process strategy. 10
  • 11. SCHEMA MAPPING  Defines the mapping between the local schema of each production system and the global shared schema.  The mapping consists of metadata mappings and value mappings and also support instance level mapping. 11
  • 12. DATA LOADER  Extracts data from production systems to normal peer instances according to the result of schema mapping.  The data loader also creates a snapshot of the newly inserted data.  At interval times, re-extracts data from the production system to create a new snapshot.  This snapshot is then compared to the previously stored one to detect data changes.  Finally, the changes are used to update the MySQL database hosted in the normal peer. 12
  • 13. DATA INDEXER  BATON  The first range, R0, is the subdomain maintained by the node.  The second range, R1, is the domain of the sub tree rooted at the node. 13
  • 14. DISTRIBUTED ACCESS CONTROL  The basic idea is to use roles as templates to capture common data access privileges and allow businesses to override these privileges to meet their specific needs.  The information of the users created at one peer is forwarded to the bootstrap peer and then broadcasted to other normal peers also.  The local administrator at this peer can easily define the role-based access control for any user. 14
  • 15. PAY-AS-YOU-GO QUERY PROCESSING  BestPeer++ provides two services for the participants: 1. Storage service 2. Search service  After data are exported from the local business system into a BestPeer++ instance, we apply the schema mapping rules to transform them into the predefined formats. 15
  • 16. 5.BENCHMARKING  This section evaluates the performance and throughput of BestPeer++ on Amazon cloud platform. 1. For the performance benchmark, they compare the query latency of BestPeer++ with HadoopDB using five queries selected from typical corporate network applications workloads. 2. For the throughput benchmark, they create a simple supply-chain network consisting of suppliers and retailers and study the query throughput of the system. 16
  • 17. 6.ADVANTAGES OF BESTPEER++ 1. Deliver near linear query throughput as the number of normal peers grows. 2. BestPeer++ adopts the pay-as-you-go business model popularized by cloud computing. 3. The role-based access control for the inherent distributed environment of corporate networks. 4. P2P technology to retrieve data between business partners. 5. Efficient data sharing within corporate networks. 17
  • 18. 7.CONCLUSION  The benchmark conducted on Amazon EC2 cloud platform shows that our system can efficiently handle typical workloads in a corporate network and can deliver near linear query throughput as the number of normal peers grows.  Therefore, BestPeer++ is a promising solution for efficient data sharing within corporate networks. 18
  • 19. REFERENCES 1. S. Wu, Q.H. Vu, J. Li, and K.-L. Tan, “Adaptive Multi-Join Query Processing in PDBMS,” Proc. IEEE Int’l Conf. Data Eng. (ICDE ’09), pp. 1239-1242, 2009. 2. D. Bermbach and S. Tai, “Eventual Consistency: How Soon is Eventual? An Evaluation of Amazon s3’s Consistency Behavior,” in Proc. 6th Workshop Middleware Serv. Oriented Comput. (MW4SOC ’11), pp. 1:1-1:6, NY, USA, 2011. 3. B. Cooper, A. Silberstein, E. Tam, R. Ramakrishnan, and R. Sears, “Benchmarking Cloud Serving Systems with YCSB,” Proc. First ACM Symp. Cloud Computing, pp. 143-154, 2010. 4. Oracle Inc., “Achieving the Cloud Computing Vision,” White Paper, 2010. 19
  • 20. 20