SlideShare a Scribd company logo
1 of 31
Presentation
on
STUDY AND IMPLEMENTATION OF
TEMPORAL AGGREGATION FUNCTIONS
PRESENTED BY:
NITISH SHARMA
SUPERVISED BY:
Objective
 To study various techniques of Temporal Aggregation.
 To implement User Defined Aggregate functions (UDA) for adjacent, overlapping
or near by tuples.
2
Base Paper
Title: Parsimonious Temporal Aggregation
Author(s): Juozas Gordeviˇcius · Johann Gamper · Michael Böhlen
The VLDB Journal (2012) 21:309–332
DOI : 10.1007/s00778-011-0243-9
3
Temporal aggregation merges the temporal extents of value-equivalent
tuples. A temporal extent is usually coalesced offline and stored since
aggregation is an expensive operation.
• User defined aggregates (UDA) function are developed for the
aggregation of tuples with adjacent, overlapping or near by timestamps
through which the temporal aggregation process becomes stress-free
operation in temporal database.
 the underlying temporal data model supports tuple time-stamping and
the valid time.
 the insert and the update operations preserve the temporal order of
tuples in the modeled reality.
4
Introduction
Temporal aggregation is used to summarize large sets of such data by aggregating
specific attribute values over all tuples that hold at a time point or a time interval.
Aggregation Types:
• In instant temporal aggregation (ITA) the aggregate value at a time instant t is
computed from the set of all tuples whose timestamp contains t.
• Span temporal aggregation (STA) - allows an application to specify in the query
the time intervals for which to report result tuples
5
Candidates for Temporal Aggregation
 Adjacent Tuples
 Near-by Tuples
 Overlapping Tuples
6
Definition
• Adjacent tuples
Let s be a sequential relation with schema
S = (A1, . . . , Ak , B1, . . . , Bp, T)
&
grouping attributes A = {A1, . . . , Ak }.
Two tuples si , s j ∈ s are adjacent, si ≺ s j , iff the following holds:
(1) si .A = s j .A,
(2) si .te = s j .tb − 1.
7
Contd.
• Merge operator
Let s be an ITA result relation with schema
S = (A1, . . . , Ak , B1, . . . , Bp, T), where
A = {A1, . . . , Ak } are the grouping attributes and
B = {B1, . . . , Bp} store the aggregate values.
The merge, ⊕, of two adjacent tuples, si , s j ∈ s, si ≺ s j , is defined as
si ⊕ s j = (si .A1, . . . , si .Ak, v1, . . . , vp, [si .Tb, s j .Te]),
The merge operator produces a new tuple.
8
Problems in Temporal Aggregation operation
 A projection of a coalesced temporal relation may produce an un-
coalesced result
 Update and Insert operations may not enforce coalescing due to
efficiency concerns
 Time union operators cannot be expressed in terms of traditional
relational algebra
9
Temporal Aggregation approaches
• Run-time:
During execution of query, tuples are coalesced as needed
• Insert:
During insertion of tuple in the relation
• Update:
Tuples are coalesced, when existing data is modified
10
User Defined Aggregates (UDA)
• UDA is a native SQL extension.
• UDAs can be used to support windows, time-series queries, and
sequence queries on data streams.
• UDA approach beats the traditional pure SQL coalescing queries in
performance.
• UDA, requires one single scan of the input tuples.
11
Algorithm
“User Defined Aggregate function for aggregation of adjacent or overlapping tuples”.
1: Define table Temp (TSTART, TEND) to store the current coalesced period, initially
empty;
2: Insert the first tuple’s TSTART and TEND value into Temp;
3: for every new input tuple T do
4: if T.TSTART <= Temp.TEND then
5: //new tuple coalescable with current period
6: Update Temp.TEND with T.TEND;
7: else
8: //current coalesced period ends, a new coalescing period begins
9: Output the tuple in Temp, then update Temp with T.TSTART and T.TEND;
10: end if
11: end for
12: Output the tuple in Temp;
12
Algorithm
“User Defined Aggregate function for aggregation of near by tuples”.
STEPS:
Input: A nearby threshold α; a sorted list of intervals S = {s1, s2, • • • , sm}
Output: A list of coalesced intervals T
1: T ← ∅
2: t ← s1 #t is a working variable for intermediate coalescing result
3: i ← 2
4: while i <= |S| do
5: if si.B > t.E + α ∨ si.W ≠ t.W then #not α-coalescible
6: T ← T ∪ {t}
7: t ← si
8: else if si.E > t.E ∧ si.W = t.W then #coalesce
9: t.E ← si.E
10: end if
11: i ← i + 1.
12: end while
13: return T
13
Screenshots
14
Overlapping Tuples
15
1 2 3 4
After aggregation of overlapping tuples
1 2 3 4
Relation - Before Aggregation16
Relation - After Aggregation of Tuples
17
18
Tuples are aggregated:
1. Six tuples related to “sumit” aggregated into one single tuple.
2. Six tuples related to “nitish” aggregated into one single scan.
Adjacent Tuples
19
1 2 3 4
After aggregation of overlapping tuples
1 2 3 4
5 6
5 6
Relation - Before Aggregation20
Relation - After Aggregation of Tuples
21
22
Tuples are aggregated:
Two tuples related to “Krishna” aggregated into one single tuple.
Near By Tuples
(alpha-coalescing)
23
1 2 3 4
After aggregation of overlapping tuples
1 2 3 4
5 6
5 6
here, alpha=2
Relation - Before Aggregation24
Relation - After Aggregation of Tuples
25
26
Tuples are aggregated:
Two tuples related to “Krishna” aggregated into one single tuple.
Literature Survey
Parsimonious temporal aggregation[1].
A novel temporal aggregation operator, termed parsimonious temporal
aggregation (PTA), that overcomes major limitations of existing approaches. PTA
takes the result of instant temporal aggregation (ITA) of size n, which might be up
to twice as large as the argument relation, and merges similar tuples until a given
error (e) or size (c) bound is reached. The new operator is data-adaptive and allows
the user to control the trade-off between the result size and the error introduced
by merging. For the precise evaluation of PTA queries, we propose two dynamic
programming-based algorithms for size and error-bounded queries, respectively,
with a worst-case complexity that is quadratic in n. For the quick computation of
an approximate PTA answer, we propose an efficient greedy merging strategy with
a precision that is upper bounded by O(log n). We present two algorithms that
implement this strategy and begin to merge as ITA tuples are produced. They
require O(n log(c + β)) time and O(c + β) space, where β is the size of a read-ahead
buffer and is typically very small.
27
Contd.
Efficient temporal coalescing query support in relational database systems[3].
An SQL: 2003-based query algorithm and a native relational user defined
aggregates (UDA) approach – both approaches only require a single scan of the
database. They conclude that temporal queries can be best supported by OLAP
functions supported in the current SQL: 2003 standards. These new findings
demonstrate that the current RDBMSs are mature enough to directly support
efficient temporal queries, and provide a new paradigm for temporal database
research and implementation.
28
Conclusion
• User defined aggregates (UDA) are developed for the aggregation of
tuples with adjacent, overlapping or near by timestamps through
which the temporal aggregation process becomes stress-free
operation in temporal database. Two additional tracing attributes are
associated with each temporal attribute of a relation which are used
to trace the change in value of the corresponding temporal attribute.
• UDA approach beats the traditional pure SQL coalescing queries in
performance.
29
Future work
Try to optimize cost of temporal aggregate operator.
30
Reference
[1] Gordevičius, Juozas, Johann Gamper, and Michael Böhlen. "Parsimonious temporal
aggregation." The VLDB Journal 21.3 (2012): 309-332.
[2] Pilman, Markus, et al. "ParTime: Parallel Temporal Aggregation." Proceedings of the 2016
International Conference on Management of Data. ACM, 2016.
[3] Zhou, Xin, Fusheng Wang, and Carlo Zaniolo. "Efficient temporal coalescing query support
in relational database systems." International Conference on Database and Expert Systems
Applications. Springer Berlin Heidelberg, 2006.
[4] Cheng, Kai. "Approximate Temporal Aggregation with Nearby Coalescing." International
Conference on Database and Expert Systems Applications. Springer International Publishing,
2016.
31

More Related Content

What's hot

A statistical comparative study of
A statistical comparative study ofA statistical comparative study of
A statistical comparative study ofijfcstjournal
 
IJCAI13 Paper review: Large-scale spectral clustering on graphs
IJCAI13 Paper review: Large-scale spectral clustering on graphsIJCAI13 Paper review: Large-scale spectral clustering on graphs
IJCAI13 Paper review: Large-scale spectral clustering on graphsAkisato Kimura
 
Seismic Analysis of Structures - III
Seismic Analysis of Structures - IIISeismic Analysis of Structures - III
Seismic Analysis of Structures - IIItushardatta
 
Ashfaq Munshi, ML7 Fellow, Pepperdata
Ashfaq Munshi, ML7 Fellow, PepperdataAshfaq Munshi, ML7 Fellow, Pepperdata
Ashfaq Munshi, ML7 Fellow, PepperdataMLconf
 
A Scalable Dataflow Implementation of Curran's Approximation Algorithm
A Scalable Dataflow Implementation of Curran's Approximation AlgorithmA Scalable Dataflow Implementation of Curran's Approximation Algorithm
A Scalable Dataflow Implementation of Curran's Approximation AlgorithmNECST Lab @ Politecnico di Milano
 
Determining the k in k-means with MapReduce
Determining the k in k-means with MapReduceDetermining the k in k-means with MapReduce
Determining the k in k-means with MapReduceThibault Debatty
 
Time and Reliability Optimization Bat Algorithm for Scheduling Workflow in Cloud
Time and Reliability Optimization Bat Algorithm for Scheduling Workflow in CloudTime and Reliability Optimization Bat Algorithm for Scheduling Workflow in Cloud
Time and Reliability Optimization Bat Algorithm for Scheduling Workflow in CloudIRJET Journal
 
Fuzzy clustering and merging
Fuzzy clustering and mergingFuzzy clustering and merging
Fuzzy clustering and mergingabc
 
On selection of periodic kernels parameters in time series prediction
On selection of periodic kernels parameters in time series predictionOn selection of periodic kernels parameters in time series prediction
On selection of periodic kernels parameters in time series predictioncsandit
 
Temporal workload analysis and its application to power aware scheduling
Temporal workload analysis and its application to power aware schedulingTemporal workload analysis and its application to power aware scheduling
Temporal workload analysis and its application to power aware schedulingijesajournal
 
Scalable Graph Clustering with Pregel
Scalable Graph Clustering with PregelScalable Graph Clustering with Pregel
Scalable Graph Clustering with PregelSqrrl
 
Workflow Allocations and Scheduling on IaaS Platforms, from Theory to Practice
Workflow Allocations and Scheduling on IaaS Platforms, from Theory to PracticeWorkflow Allocations and Scheduling on IaaS Platforms, from Theory to Practice
Workflow Allocations and Scheduling on IaaS Platforms, from Theory to PracticeFrederic Desprez
 

What's hot (17)

Master thesis report
Master thesis reportMaster thesis report
Master thesis report
 
A statistical comparative study of
A statistical comparative study ofA statistical comparative study of
A statistical comparative study of
 
IJCAI13 Paper review: Large-scale spectral clustering on graphs
IJCAI13 Paper review: Large-scale spectral clustering on graphsIJCAI13 Paper review: Large-scale spectral clustering on graphs
IJCAI13 Paper review: Large-scale spectral clustering on graphs
 
Seismic Analysis of Structures - III
Seismic Analysis of Structures - IIISeismic Analysis of Structures - III
Seismic Analysis of Structures - III
 
Ashfaq Munshi, ML7 Fellow, Pepperdata
Ashfaq Munshi, ML7 Fellow, PepperdataAshfaq Munshi, ML7 Fellow, Pepperdata
Ashfaq Munshi, ML7 Fellow, Pepperdata
 
A Scalable Dataflow Implementation of Curran's Approximation Algorithm
A Scalable Dataflow Implementation of Curran's Approximation AlgorithmA Scalable Dataflow Implementation of Curran's Approximation Algorithm
A Scalable Dataflow Implementation of Curran's Approximation Algorithm
 
Determining the k in k-means with MapReduce
Determining the k in k-means with MapReduceDetermining the k in k-means with MapReduce
Determining the k in k-means with MapReduce
 
ECE 565 presentation
ECE 565 presentationECE 565 presentation
ECE 565 presentation
 
Seminar9
Seminar9Seminar9
Seminar9
 
Time and Reliability Optimization Bat Algorithm for Scheduling Workflow in Cloud
Time and Reliability Optimization Bat Algorithm for Scheduling Workflow in CloudTime and Reliability Optimization Bat Algorithm for Scheduling Workflow in Cloud
Time and Reliability Optimization Bat Algorithm for Scheduling Workflow in Cloud
 
Principal component analysis
Principal component analysisPrincipal component analysis
Principal component analysis
 
Chapter26
Chapter26Chapter26
Chapter26
 
Fuzzy clustering and merging
Fuzzy clustering and mergingFuzzy clustering and merging
Fuzzy clustering and merging
 
On selection of periodic kernels parameters in time series prediction
On selection of periodic kernels parameters in time series predictionOn selection of periodic kernels parameters in time series prediction
On selection of periodic kernels parameters in time series prediction
 
Temporal workload analysis and its application to power aware scheduling
Temporal workload analysis and its application to power aware schedulingTemporal workload analysis and its application to power aware scheduling
Temporal workload analysis and its application to power aware scheduling
 
Scalable Graph Clustering with Pregel
Scalable Graph Clustering with PregelScalable Graph Clustering with Pregel
Scalable Graph Clustering with Pregel
 
Workflow Allocations and Scheduling on IaaS Platforms, from Theory to Practice
Workflow Allocations and Scheduling on IaaS Platforms, from Theory to PracticeWorkflow Allocations and Scheduling on IaaS Platforms, from Theory to Practice
Workflow Allocations and Scheduling on IaaS Platforms, from Theory to Practice
 

Viewers also liked

Database system-DBMS
Database system-DBMSDatabase system-DBMS
Database system-DBMSikjsamuel
 
Characteristic of dabase approach
Characteristic of dabase approachCharacteristic of dabase approach
Characteristic of dabase approachLuina Pani
 
Database , 8 Query Optimization
Database , 8 Query OptimizationDatabase , 8 Query Optimization
Database , 8 Query OptimizationAli Usman
 
physical and logical data independence
physical and logical data independencephysical and logical data independence
physical and logical data independenceapoorva_upadhyay
 
6 Data Modeling for NoSQL 2/2
6 Data Modeling for NoSQL 2/26 Data Modeling for NoSQL 2/2
6 Data Modeling for NoSQL 2/2Fabio Fumarola
 
14. Query Optimization in DBMS
14. Query Optimization in DBMS14. Query Optimization in DBMS
14. Query Optimization in DBMSkoolkampus
 
MS Sql Server: Introduction To Database Concepts
MS Sql Server: Introduction To Database ConceptsMS Sql Server: Introduction To Database Concepts
MS Sql Server: Introduction To Database ConceptsDataminingTools Inc
 
Query optimization
Query optimizationQuery optimization
Query optimizationdixitdavey
 
Database administrator
Database administratorDatabase administrator
Database administratorTech_MX
 
Single User v/s Multi User Databases
Single User v/s Multi User DatabasesSingle User v/s Multi User Databases
Single User v/s Multi User DatabasesRaminder Pal Singh
 
Chapter 1 Fundamentals of Database Management System
Chapter 1 Fundamentals of Database Management SystemChapter 1 Fundamentals of Database Management System
Chapter 1 Fundamentals of Database Management SystemEddyzulham Mahluzydde
 
13. Query Processing in DBMS
13. Query Processing in DBMS13. Query Processing in DBMS
13. Query Processing in DBMSkoolkampus
 
2. Entity Relationship Model in DBMS
2. Entity Relationship Model in DBMS2. Entity Relationship Model in DBMS
2. Entity Relationship Model in DBMSkoolkampus
 
Relational algebra1
Relational algebra1Relational algebra1
Relational algebra1Tianlu Wang
 
Sql Authorization
Sql AuthorizationSql Authorization
Sql AuthorizationFhuy
 
MYSQL Aggregate Functions
MYSQL Aggregate FunctionsMYSQL Aggregate Functions
MYSQL Aggregate FunctionsLeroy Blair
 

Viewers also liked (20)

Database system-DBMS
Database system-DBMSDatabase system-DBMS
Database system-DBMS
 
Characteristic of dabase approach
Characteristic of dabase approachCharacteristic of dabase approach
Characteristic of dabase approach
 
Data model and entity relationship
Data model and entity relationshipData model and entity relationship
Data model and entity relationship
 
Database , 8 Query Optimization
Database , 8 Query OptimizationDatabase , 8 Query Optimization
Database , 8 Query Optimization
 
physical and logical data independence
physical and logical data independencephysical and logical data independence
physical and logical data independence
 
6 Data Modeling for NoSQL 2/2
6 Data Modeling for NoSQL 2/26 Data Modeling for NoSQL 2/2
6 Data Modeling for NoSQL 2/2
 
14. Query Optimization in DBMS
14. Query Optimization in DBMS14. Query Optimization in DBMS
14. Query Optimization in DBMS
 
MS Sql Server: Introduction To Database Concepts
MS Sql Server: Introduction To Database ConceptsMS Sql Server: Introduction To Database Concepts
MS Sql Server: Introduction To Database Concepts
 
Query optimization
Query optimizationQuery optimization
Query optimization
 
Database administrator
Database administratorDatabase administrator
Database administrator
 
Dbms slides
Dbms slidesDbms slides
Dbms slides
 
Single User v/s Multi User Databases
Single User v/s Multi User DatabasesSingle User v/s Multi User Databases
Single User v/s Multi User Databases
 
Chapter 1 Fundamentals of Database Management System
Chapter 1 Fundamentals of Database Management SystemChapter 1 Fundamentals of Database Management System
Chapter 1 Fundamentals of Database Management System
 
13. Query Processing in DBMS
13. Query Processing in DBMS13. Query Processing in DBMS
13. Query Processing in DBMS
 
Databases: Normalisation
Databases: NormalisationDatabases: Normalisation
Databases: Normalisation
 
2. Entity Relationship Model in DBMS
2. Entity Relationship Model in DBMS2. Entity Relationship Model in DBMS
2. Entity Relationship Model in DBMS
 
Relational algebra1
Relational algebra1Relational algebra1
Relational algebra1
 
Commands
CommandsCommands
Commands
 
Sql Authorization
Sql AuthorizationSql Authorization
Sql Authorization
 
MYSQL Aggregate Functions
MYSQL Aggregate FunctionsMYSQL Aggregate Functions
MYSQL Aggregate Functions
 

Similar to STUDY AND IMPLEMENTATION OF TEMPORAL AGGREGATION FUNCTIONS

NS-CUK Seminar: S.T.Nguyen, Review on "Continuous-Time Sequential Recommendat...
NS-CUK Seminar: S.T.Nguyen, Review on "Continuous-Time Sequential Recommendat...NS-CUK Seminar: S.T.Nguyen, Review on "Continuous-Time Sequential Recommendat...
NS-CUK Seminar: S.T.Nguyen, Review on "Continuous-Time Sequential Recommendat...ssuser4b1f48
 
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETSFAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETScsandit
 
Temporal workload analysis and its application to power aware scheduling
Temporal workload analysis and its application to power aware schedulingTemporal workload analysis and its application to power aware scheduling
Temporal workload analysis and its application to power aware schedulingijesajournal
 
Scilab Finite element solver for stationary and incompressible navier-stokes ...
Scilab Finite element solver for stationary and incompressible navier-stokes ...Scilab Finite element solver for stationary and incompressible navier-stokes ...
Scilab Finite element solver for stationary and incompressible navier-stokes ...Scilab
 
Accelerated Particle Swarm Optimization and Support Vector Machine for Busine...
Accelerated Particle Swarm Optimization and Support Vector Machine for Busine...Accelerated Particle Swarm Optimization and Support Vector Machine for Busine...
Accelerated Particle Swarm Optimization and Support Vector Machine for Busine...Xin-She Yang
 
Dynamic Economic Dispatch Assessment Using Particle Swarm Optimization Technique
Dynamic Economic Dispatch Assessment Using Particle Swarm Optimization TechniqueDynamic Economic Dispatch Assessment Using Particle Swarm Optimization Technique
Dynamic Economic Dispatch Assessment Using Particle Swarm Optimization TechniquejournalBEEI
 
15.sp.dictionary_draft.pdf
15.sp.dictionary_draft.pdf15.sp.dictionary_draft.pdf
15.sp.dictionary_draft.pdfAllanKelvinSales
 
Parallel Batch-Dynamic Graphs: Algorithms and Lower Bounds
Parallel Batch-Dynamic Graphs: Algorithms and Lower BoundsParallel Batch-Dynamic Graphs: Algorithms and Lower Bounds
Parallel Batch-Dynamic Graphs: Algorithms and Lower BoundsSubhajit Sahu
 
Parallel Batch-Dynamic Graphs: Algorithms and Lower Bounds
Parallel Batch-Dynamic Graphs: Algorithms and Lower BoundsParallel Batch-Dynamic Graphs: Algorithms and Lower Bounds
Parallel Batch-Dynamic Graphs: Algorithms and Lower BoundsSubhajit Sahu
 
A weighted-sum-technique-for-the-joint-optimization-of-performance-and-power-...
A weighted-sum-technique-for-the-joint-optimization-of-performance-and-power-...A weighted-sum-technique-for-the-joint-optimization-of-performance-and-power-...
A weighted-sum-technique-for-the-joint-optimization-of-performance-and-power-...Cemal Ardil
 
REDUCING TIMED AUTOMATA: A NEW APPROACH
REDUCING TIMED AUTOMATA: A NEW APPROACHREDUCING TIMED AUTOMATA: A NEW APPROACH
REDUCING TIMED AUTOMATA: A NEW APPROACHijistjournal
 
REDUCING TIMED AUTOMATA : A NEW APPROACH
REDUCING TIMED AUTOMATA : A NEW APPROACHREDUCING TIMED AUTOMATA : A NEW APPROACH
REDUCING TIMED AUTOMATA : A NEW APPROACHijistjournal
 
DCE: A NOVEL DELAY CORRELATION MEASUREMENT FOR TOMOGRAPHY WITH PASSIVE REAL...
DCE: A NOVEL DELAY CORRELATION  MEASUREMENT FOR TOMOGRAPHY WITH PASSIVE  REAL...DCE: A NOVEL DELAY CORRELATION  MEASUREMENT FOR TOMOGRAPHY WITH PASSIVE  REAL...
DCE: A NOVEL DELAY CORRELATION MEASUREMENT FOR TOMOGRAPHY WITH PASSIVE REAL...ijdpsjournal
 
Scheduling and Allocation Algorithm for an Elliptic Filter
Scheduling and Allocation Algorithm for an Elliptic FilterScheduling and Allocation Algorithm for an Elliptic Filter
Scheduling and Allocation Algorithm for an Elliptic Filterijait
 
SCIENTIFIC WORKFLOW CLUSTERING BASED ON MOTIF DISCOVERY
SCIENTIFIC WORKFLOW CLUSTERING BASED ON MOTIF DISCOVERYSCIENTIFIC WORKFLOW CLUSTERING BASED ON MOTIF DISCOVERY
SCIENTIFIC WORKFLOW CLUSTERING BASED ON MOTIF DISCOVERYijcseit
 
SCIENTIFIC WORKFLOW CLUSTERING BASED ON MOTIF DISCOVERY
SCIENTIFIC WORKFLOW CLUSTERING BASED ON MOTIF DISCOVERYSCIENTIFIC WORKFLOW CLUSTERING BASED ON MOTIF DISCOVERY
SCIENTIFIC WORKFLOW CLUSTERING BASED ON MOTIF DISCOVERYijcseit
 
A unique sorting algorithm with linear time &amp; space complexity
A unique sorting algorithm with linear time &amp; space complexityA unique sorting algorithm with linear time &amp; space complexity
A unique sorting algorithm with linear time &amp; space complexityeSAT Journals
 
USE OF ADAPTIVE COLOURED PETRI NETWORK IN SUPPORT OF DECISIONMAKING
USE OF ADAPTIVE COLOURED PETRI NETWORK IN SUPPORT OF DECISIONMAKINGUSE OF ADAPTIVE COLOURED PETRI NETWORK IN SUPPORT OF DECISIONMAKING
USE OF ADAPTIVE COLOURED PETRI NETWORK IN SUPPORT OF DECISIONMAKINGcsandit
 

Similar to STUDY AND IMPLEMENTATION OF TEMPORAL AGGREGATION FUNCTIONS (20)

NS-CUK Seminar: S.T.Nguyen, Review on "Continuous-Time Sequential Recommendat...
NS-CUK Seminar: S.T.Nguyen, Review on "Continuous-Time Sequential Recommendat...NS-CUK Seminar: S.T.Nguyen, Review on "Continuous-Time Sequential Recommendat...
NS-CUK Seminar: S.T.Nguyen, Review on "Continuous-Time Sequential Recommendat...
 
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETSFAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
FAST ALGORITHMS FOR UNSUPERVISED LEARNING IN LARGE DATA SETS
 
Temporal workload analysis and its application to power aware scheduling
Temporal workload analysis and its application to power aware schedulingTemporal workload analysis and its application to power aware scheduling
Temporal workload analysis and its application to power aware scheduling
 
Scilab Finite element solver for stationary and incompressible navier-stokes ...
Scilab Finite element solver for stationary and incompressible navier-stokes ...Scilab Finite element solver for stationary and incompressible navier-stokes ...
Scilab Finite element solver for stationary and incompressible navier-stokes ...
 
Accelerated Particle Swarm Optimization and Support Vector Machine for Busine...
Accelerated Particle Swarm Optimization and Support Vector Machine for Busine...Accelerated Particle Swarm Optimization and Support Vector Machine for Busine...
Accelerated Particle Swarm Optimization and Support Vector Machine for Busine...
 
Dynamic Economic Dispatch Assessment Using Particle Swarm Optimization Technique
Dynamic Economic Dispatch Assessment Using Particle Swarm Optimization TechniqueDynamic Economic Dispatch Assessment Using Particle Swarm Optimization Technique
Dynamic Economic Dispatch Assessment Using Particle Swarm Optimization Technique
 
15.sp.dictionary_draft.pdf
15.sp.dictionary_draft.pdf15.sp.dictionary_draft.pdf
15.sp.dictionary_draft.pdf
 
Parallel Batch-Dynamic Graphs: Algorithms and Lower Bounds
Parallel Batch-Dynamic Graphs: Algorithms and Lower BoundsParallel Batch-Dynamic Graphs: Algorithms and Lower Bounds
Parallel Batch-Dynamic Graphs: Algorithms and Lower Bounds
 
Parallel Batch-Dynamic Graphs: Algorithms and Lower Bounds
Parallel Batch-Dynamic Graphs: Algorithms and Lower BoundsParallel Batch-Dynamic Graphs: Algorithms and Lower Bounds
Parallel Batch-Dynamic Graphs: Algorithms and Lower Bounds
 
Flex ch
Flex chFlex ch
Flex ch
 
A weighted-sum-technique-for-the-joint-optimization-of-performance-and-power-...
A weighted-sum-technique-for-the-joint-optimization-of-performance-and-power-...A weighted-sum-technique-for-the-joint-optimization-of-performance-and-power-...
A weighted-sum-technique-for-the-joint-optimization-of-performance-and-power-...
 
REDUCING TIMED AUTOMATA: A NEW APPROACH
REDUCING TIMED AUTOMATA: A NEW APPROACHREDUCING TIMED AUTOMATA: A NEW APPROACH
REDUCING TIMED AUTOMATA: A NEW APPROACH
 
REDUCING TIMED AUTOMATA : A NEW APPROACH
REDUCING TIMED AUTOMATA : A NEW APPROACHREDUCING TIMED AUTOMATA : A NEW APPROACH
REDUCING TIMED AUTOMATA : A NEW APPROACH
 
DCE: A NOVEL DELAY CORRELATION MEASUREMENT FOR TOMOGRAPHY WITH PASSIVE REAL...
DCE: A NOVEL DELAY CORRELATION  MEASUREMENT FOR TOMOGRAPHY WITH PASSIVE  REAL...DCE: A NOVEL DELAY CORRELATION  MEASUREMENT FOR TOMOGRAPHY WITH PASSIVE  REAL...
DCE: A NOVEL DELAY CORRELATION MEASUREMENT FOR TOMOGRAPHY WITH PASSIVE REAL...
 
Scheduling and Allocation Algorithm for an Elliptic Filter
Scheduling and Allocation Algorithm for an Elliptic FilterScheduling and Allocation Algorithm for an Elliptic Filter
Scheduling and Allocation Algorithm for an Elliptic Filter
 
SCIENTIFIC WORKFLOW CLUSTERING BASED ON MOTIF DISCOVERY
SCIENTIFIC WORKFLOW CLUSTERING BASED ON MOTIF DISCOVERYSCIENTIFIC WORKFLOW CLUSTERING BASED ON MOTIF DISCOVERY
SCIENTIFIC WORKFLOW CLUSTERING BASED ON MOTIF DISCOVERY
 
SCIENTIFIC WORKFLOW CLUSTERING BASED ON MOTIF DISCOVERY
SCIENTIFIC WORKFLOW CLUSTERING BASED ON MOTIF DISCOVERYSCIENTIFIC WORKFLOW CLUSTERING BASED ON MOTIF DISCOVERY
SCIENTIFIC WORKFLOW CLUSTERING BASED ON MOTIF DISCOVERY
 
A unique sorting algorithm with linear time &amp; space complexity
A unique sorting algorithm with linear time &amp; space complexityA unique sorting algorithm with linear time &amp; space complexity
A unique sorting algorithm with linear time &amp; space complexity
 
USE OF ADAPTIVE COLOURED PETRI NETWORK IN SUPPORT OF DECISIONMAKING
USE OF ADAPTIVE COLOURED PETRI NETWORK IN SUPPORT OF DECISIONMAKINGUSE OF ADAPTIVE COLOURED PETRI NETWORK IN SUPPORT OF DECISIONMAKING
USE OF ADAPTIVE COLOURED PETRI NETWORK IN SUPPORT OF DECISIONMAKING
 
TEXT CLUSTERING.doc
TEXT CLUSTERING.docTEXT CLUSTERING.doc
TEXT CLUSTERING.doc
 

Recently uploaded

Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data VisualizationKianJazayeri1
 
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelDecoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelBoston Institute of Analytics
 
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...Jack Cole
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxaleedritatuxx
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxHaritikaChhatwal1
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Boston Institute of Analytics
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBoston Institute of Analytics
 
Rithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdfRithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdfrahulyadav957181
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxMike Bennett
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxHimangsuNath
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxSimranPal17
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...Amil Baba Dawood bangali
 
IBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaIBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaManalVerma4
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksdeepakthakur548787
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfblazblazml
 

Recently uploaded (20)

Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data Visualization
 
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelDecoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
 
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptx
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
 
Rithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdfRithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdf
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptx
 
Data Analysis Project: Stroke Prediction
Data Analysis Project: Stroke PredictionData Analysis Project: Stroke Prediction
Data Analysis Project: Stroke Prediction
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptx
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptx
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
 
IBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaIBEF report on the Insurance market in India
IBEF report on the Insurance market in India
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing works
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
 

STUDY AND IMPLEMENTATION OF TEMPORAL AGGREGATION FUNCTIONS

  • 1. Presentation on STUDY AND IMPLEMENTATION OF TEMPORAL AGGREGATION FUNCTIONS PRESENTED BY: NITISH SHARMA SUPERVISED BY:
  • 2. Objective  To study various techniques of Temporal Aggregation.  To implement User Defined Aggregate functions (UDA) for adjacent, overlapping or near by tuples. 2
  • 3. Base Paper Title: Parsimonious Temporal Aggregation Author(s): Juozas Gordeviˇcius · Johann Gamper · Michael Böhlen The VLDB Journal (2012) 21:309–332 DOI : 10.1007/s00778-011-0243-9 3
  • 4. Temporal aggregation merges the temporal extents of value-equivalent tuples. A temporal extent is usually coalesced offline and stored since aggregation is an expensive operation. • User defined aggregates (UDA) function are developed for the aggregation of tuples with adjacent, overlapping or near by timestamps through which the temporal aggregation process becomes stress-free operation in temporal database.  the underlying temporal data model supports tuple time-stamping and the valid time.  the insert and the update operations preserve the temporal order of tuples in the modeled reality. 4
  • 5. Introduction Temporal aggregation is used to summarize large sets of such data by aggregating specific attribute values over all tuples that hold at a time point or a time interval. Aggregation Types: • In instant temporal aggregation (ITA) the aggregate value at a time instant t is computed from the set of all tuples whose timestamp contains t. • Span temporal aggregation (STA) - allows an application to specify in the query the time intervals for which to report result tuples 5
  • 6. Candidates for Temporal Aggregation  Adjacent Tuples  Near-by Tuples  Overlapping Tuples 6
  • 7. Definition • Adjacent tuples Let s be a sequential relation with schema S = (A1, . . . , Ak , B1, . . . , Bp, T) & grouping attributes A = {A1, . . . , Ak }. Two tuples si , s j ∈ s are adjacent, si ≺ s j , iff the following holds: (1) si .A = s j .A, (2) si .te = s j .tb − 1. 7
  • 8. Contd. • Merge operator Let s be an ITA result relation with schema S = (A1, . . . , Ak , B1, . . . , Bp, T), where A = {A1, . . . , Ak } are the grouping attributes and B = {B1, . . . , Bp} store the aggregate values. The merge, ⊕, of two adjacent tuples, si , s j ∈ s, si ≺ s j , is defined as si ⊕ s j = (si .A1, . . . , si .Ak, v1, . . . , vp, [si .Tb, s j .Te]), The merge operator produces a new tuple. 8
  • 9. Problems in Temporal Aggregation operation  A projection of a coalesced temporal relation may produce an un- coalesced result  Update and Insert operations may not enforce coalescing due to efficiency concerns  Time union operators cannot be expressed in terms of traditional relational algebra 9
  • 10. Temporal Aggregation approaches • Run-time: During execution of query, tuples are coalesced as needed • Insert: During insertion of tuple in the relation • Update: Tuples are coalesced, when existing data is modified 10
  • 11. User Defined Aggregates (UDA) • UDA is a native SQL extension. • UDAs can be used to support windows, time-series queries, and sequence queries on data streams. • UDA approach beats the traditional pure SQL coalescing queries in performance. • UDA, requires one single scan of the input tuples. 11
  • 12. Algorithm “User Defined Aggregate function for aggregation of adjacent or overlapping tuples”. 1: Define table Temp (TSTART, TEND) to store the current coalesced period, initially empty; 2: Insert the first tuple’s TSTART and TEND value into Temp; 3: for every new input tuple T do 4: if T.TSTART <= Temp.TEND then 5: //new tuple coalescable with current period 6: Update Temp.TEND with T.TEND; 7: else 8: //current coalesced period ends, a new coalescing period begins 9: Output the tuple in Temp, then update Temp with T.TSTART and T.TEND; 10: end if 11: end for 12: Output the tuple in Temp; 12
  • 13. Algorithm “User Defined Aggregate function for aggregation of near by tuples”. STEPS: Input: A nearby threshold α; a sorted list of intervals S = {s1, s2, • • • , sm} Output: A list of coalesced intervals T 1: T ← ∅ 2: t ← s1 #t is a working variable for intermediate coalescing result 3: i ← 2 4: while i <= |S| do 5: if si.B > t.E + α ∨ si.W ≠ t.W then #not α-coalescible 6: T ← T ∪ {t} 7: t ← si 8: else if si.E > t.E ∧ si.W = t.W then #coalesce 9: t.E ← si.E 10: end if 11: i ← i + 1. 12: end while 13: return T 13
  • 15. Overlapping Tuples 15 1 2 3 4 After aggregation of overlapping tuples 1 2 3 4
  • 16. Relation - Before Aggregation16
  • 17. Relation - After Aggregation of Tuples 17
  • 18. 18 Tuples are aggregated: 1. Six tuples related to “sumit” aggregated into one single tuple. 2. Six tuples related to “nitish” aggregated into one single scan.
  • 19. Adjacent Tuples 19 1 2 3 4 After aggregation of overlapping tuples 1 2 3 4 5 6 5 6
  • 20. Relation - Before Aggregation20
  • 21. Relation - After Aggregation of Tuples 21
  • 22. 22 Tuples are aggregated: Two tuples related to “Krishna” aggregated into one single tuple.
  • 23. Near By Tuples (alpha-coalescing) 23 1 2 3 4 After aggregation of overlapping tuples 1 2 3 4 5 6 5 6 here, alpha=2
  • 24. Relation - Before Aggregation24
  • 25. Relation - After Aggregation of Tuples 25
  • 26. 26 Tuples are aggregated: Two tuples related to “Krishna” aggregated into one single tuple.
  • 27. Literature Survey Parsimonious temporal aggregation[1]. A novel temporal aggregation operator, termed parsimonious temporal aggregation (PTA), that overcomes major limitations of existing approaches. PTA takes the result of instant temporal aggregation (ITA) of size n, which might be up to twice as large as the argument relation, and merges similar tuples until a given error (e) or size (c) bound is reached. The new operator is data-adaptive and allows the user to control the trade-off between the result size and the error introduced by merging. For the precise evaluation of PTA queries, we propose two dynamic programming-based algorithms for size and error-bounded queries, respectively, with a worst-case complexity that is quadratic in n. For the quick computation of an approximate PTA answer, we propose an efficient greedy merging strategy with a precision that is upper bounded by O(log n). We present two algorithms that implement this strategy and begin to merge as ITA tuples are produced. They require O(n log(c + β)) time and O(c + β) space, where β is the size of a read-ahead buffer and is typically very small. 27
  • 28. Contd. Efficient temporal coalescing query support in relational database systems[3]. An SQL: 2003-based query algorithm and a native relational user defined aggregates (UDA) approach – both approaches only require a single scan of the database. They conclude that temporal queries can be best supported by OLAP functions supported in the current SQL: 2003 standards. These new findings demonstrate that the current RDBMSs are mature enough to directly support efficient temporal queries, and provide a new paradigm for temporal database research and implementation. 28
  • 29. Conclusion • User defined aggregates (UDA) are developed for the aggregation of tuples with adjacent, overlapping or near by timestamps through which the temporal aggregation process becomes stress-free operation in temporal database. Two additional tracing attributes are associated with each temporal attribute of a relation which are used to trace the change in value of the corresponding temporal attribute. • UDA approach beats the traditional pure SQL coalescing queries in performance. 29
  • 30. Future work Try to optimize cost of temporal aggregate operator. 30
  • 31. Reference [1] Gordevičius, Juozas, Johann Gamper, and Michael Böhlen. "Parsimonious temporal aggregation." The VLDB Journal 21.3 (2012): 309-332. [2] Pilman, Markus, et al. "ParTime: Parallel Temporal Aggregation." Proceedings of the 2016 International Conference on Management of Data. ACM, 2016. [3] Zhou, Xin, Fusheng Wang, and Carlo Zaniolo. "Efficient temporal coalescing query support in relational database systems." International Conference on Database and Expert Systems Applications. Springer Berlin Heidelberg, 2006. [4] Cheng, Kai. "Approximate Temporal Aggregation with Nearby Coalescing." International Conference on Database and Expert Systems Applications. Springer International Publishing, 2016. 31