SlideShare a Scribd company logo
1 of 66
Download to read offline
Time Series Data Mining Challenges
Time Series Data Mining Challenges
Jose A. Lozano
Basque Center for Applied Mathematics (BCAM)
University of the Basque Country UPV/EHU
Time Series Days, Rennes, March 25-26, 2019
Time Series Data Mining Challenges
Outline of the presentation
1 Time Series Data Mining Activities
2 Clustering
3 (Early) Supervised Classification
4 Outlier/Anomaly Detection
5 Conclusions and Future Work
Time Series Data Mining Challenges
Time Series Data Mining Activities
Outline of the presentation
1 Time Series Data Mining Activities
2 Clustering
3 (Early) Supervised Classification
4 Outlier/Anomaly Detection
5 Conclusions and Future Work
Time Series Data Mining Challenges
Time Series Data Mining Activities
Time series all around
Temporal
correlation
High dimen-
sionality
Noisy
Industry 4.0
Bio Signals
Weather Forecasting
Shapes
Time Series Data Mining Challenges
Time Series Data Mining Activities
Time series forecasting
Time Series Data Mining Challenges
Time Series Data Mining Activities
Time series data base: our object of study
A set of time series
(usually big)
Different lengths
Multidimensional
Time Series Data Mining Challenges
Time Series Data Mining Activities
Time series clustering. Examples
CLUSTERING
ALGORITHM
Time Series Data Mining Challenges
Time Series Data Mining Activities
Supervised classification of time series
C1
C2
C3
C2
C3
C1
ALGORITHM
CLASSIFIER
? C2
TRAINING SET
Time Series Data Mining Challenges
Time Series Data Mining Activities
Anomaly/outlier detection
Time Series Data Mining Challenges
Time Series Data Mining Activities
Segmentation
Time Series Data Mining Challenges
Clustering
Outline of the presentation
1 Time Series Data Mining Activities
2 Clustering
3 (Early) Supervised Classification
4 Outlier/Anomaly Detection
5 Conclusions and Future Work
Time Series Data Mining Challenges
Clustering
Time series clustering. Examples
CLUSTERING
ALGORITHM
Time Series Data Mining Challenges
Clustering
Time series clustering: hierarchical, partitional
we need a
DISTANCE
0
100
200
300
400
Series
1
Series
2
Series
3
Series
4
Series
5
Series
6
Series
7
Series
8
Series
9
k-means
Time Series Data Mining Challenges
Clustering
Distance between time series
Rigid Distance Flexible Distance
Time Series Data Mining Challenges
Clustering
Euclidean Distance (ED)
D(X, Y) =
qPn
i=1(xi − yi)2
Easy to compute
Only for series with the same distance
Does not consider the time
Sensitivity to noise
Time Series Data Mining Challenges
Clustering
Dynamic Time Warping (DTW)
Takes into account the ordered
sequence (time)
It can deal with series of different
sizes
Computationally expensive
O(min{m, n}2)
Time Series Data Mining Challenges
Clustering
Euclidean Distance vs Dynamic Time Warping
EUCLIDEAN DTW
6
6
Time Series Data Mining Challenges
Clustering
Remark on distances
More on elastic distances
Cheap versions of dynamic time warping (Sakoe-chiba,
bounding)
Edit distance for real sequences (EDR)
Mori et al 2016, R journal
On-line versions (Oregi et al 2019, PR)
Alternatives to calculate distances
Represent each series by means of a set of features and
calculate the distance between the features
Learn a parametric model for each series and calculate the
distance between the parameters
Time Series Data Mining Challenges
Clustering
Distances between series
Remarks
There is not best distance (no free lunch)
Each problem requires a different distance
The distance to be used needs to be in agreement with our
knowledge about what is far and what is close
Hint: try with several distances
Challenge:
Design a method to the (semi)automatic selection of a distance
(e.g. Mori et al. 2016, TKDE)
Time Series Data Mining Challenges
Clustering
...Coming back to clustering: K-means
k-medois
k-means
k-medoids
Time Series Data Mining Challenges
Clustering
Remarks on clustering
Recent papers on the computation of a mean series
Alternate clustering methods: graph-based, spectral,
model-based,...
Multivariate time series clustering
Time Series Data Mining Challenges
(Early) Supervised Classification
Outline of the presentation
1 Time Series Data Mining Activities
2 Clustering
3 (Early) Supervised Classification
4 Outlier/Anomaly Detection
5 Conclusions and Future Work
Time Series Data Mining Challenges
(Early) Supervised Classification
Supervised Classification of Time Series
General-purpose classifiers Specific TS classifiers
FEATURES C FEATURES
CLASSIFIER
SERIES C SERIES
CLASSIFIER
Time Series Data Mining Challenges
(Early) Supervised Classification
General-purpose classifiers
Each series is considered an instance
Each time stamp is considered a feature
t1 t2 t3 . . . tn C
x11 x12 x13 . . . x1n c1
x21 x22 x23 . . . x2n c2
. . . . . . . . . . . . . . . . . .
xm1 xm2 xm3 . . . xmn c2
Time Series Data Mining Challenges
(Early) Supervised Classification
General-purpose classifiers
Each series is considered an instance
Each time stamp is considered a feature
t2 t1 t3 . . . tn C
x12 x11 x13 . . . x1n c1
x22 x21 x23 . . . x2n c2
. . . . . . . . . . . . . . . . . .
xm2 xm1 xm3 . . . xmn c2
Time Series Data Mining Challenges
(Early) Supervised Classification
General-purpose classifiers
Each series is considered an instance
Each time stamp is considered a feature
t2 t1 t3 . . . tn C
x12 x11 x13 . . . x1n c1
x22 x21 x23 . . . x2n c2
. . . . . . . . . . . . . . . . . .
xm2 xm1 xm3 . . . xmn c2
CHALLENGE
When to use general-purpose and when time-series specific?
Time Series Data Mining Challenges
(Early) Supervised Classification
What is relevant in TSC?
PROBLEM I PROBLEM II
Time Series Data Mining Challenges
(Early) Supervised Classification
What is relevant in TSC?
PROBLEM I PROBLEM II
SHAPE
Time Series Data Mining Challenges
(Early) Supervised Classification
What is relevant in TSC?
PROBLEM I PROBLEM II
SHAPE LOCATION
Time Series Data Mining Challenges
(Early) Supervised Classification
A taxonomy of time series classification methods
Taxonomy
Distance-based classifiers
Model-based classifiers
Feature-based classifiers
Time Series Data Mining Challenges
(Early) Supervised Classification
Taxonomy of distance-based TSC (Abanda et al.
2019, DAMI)
Time Series Data Mining Challenges
(Early) Supervised Classification
1-Nearest Neighbour (1-NN)
C1
C2
C3
C2
C3
C1
? C2
(d1 d2 d3 d4 d5 d6)
MINIMUM
DISTANCE
Time Series Data Mining Challenges
(Early) Supervised Classification
1-Nearest Neighbour (1-NN)
Easy to understand
Better results with higher
number of series
Computational cost
Challenge: What
distance???
C1
C2
C3
C2
C3
C1
? C2
(d1 d2 d3 d4 d5 d6)
MINIMUM
DISTANCE
Time Series Data Mining Challenges
(Early) Supervised Classification
Distance-based. Distance features
CLASIFICADOR
SERIES C DISTANCE MATRIX
...
...
...
...
...
...
C
SERIES
CLASSIFIER
DISTANCES
...
Time Series Data Mining Challenges
(Early) Supervised Classification
Distance-based. Distance features. Global
CLASIFICADOR
SERIES C DISTANCE MATRIX
...
...
...
...
...
...
C
SERIES
CLASSIFIER
DISTANCES
...
VECTORS!
Time Series Data Mining Challenges
(Early) Supervised Classification
Distance-based. Distance features. Global
Any general-purpose
algorithm could be
applied
It depends on the
number of series in
training
Computationally
expensive
Difficult to transfer to the
on-line setting
CLASIFICADOR
SERIES C DISTANCE MATRIX
...
...
...
...
...
...
C
SERIES
CLASSIFIER
DISTANCES
...
VECTORS!
Time Series Data Mining Challenges
(Early) Supervised Classification
Distance-based. Distance features. Local
Lij could be distance or
presence
Computationally expensive
When the shapelets are
relevant extremely good
results
Easy to interpret
Shapelet 2
Time Series Data Mining Challenges
(Early) Supervised Classification
Distance-based. Embedding
Time Series Data Mining Challenges
(Early) Supervised Classification
Distance-based. Embedding
Many classifiers defined
in Euclidean spaces
Computational
complexity
Prediction
Time Series Data Mining Challenges
(Early) Supervised Classification
Distance Kernels
Definite (PSD) Kernel
All the SVM machinery
works
Difficult to define/check
Indefinite
Theoretical properties are
lost
Easy to define
Some methods can not be
applied
Time Series Data Mining Challenges
(Early) Supervised Classification
Distance Kernels. Indefinite
Gaussian Distance Substitution Kernels
GDSd (x, x0
) = exp

−
d(x, x0)2
σ2

where d = DTW, ..
Time Series Data Mining Challenges
(Early) Supervised Classification
Feature-based time series classification
CLASIFICADOR
SERIES C FEATURES
...
...
...
...
...
...
C
SERIES
CLASSIFIER
FEATURES
...
Time Series Data Mining Challenges
(Early) Supervised Classification
Feature-based time series classification
Features
Statistics: mean,
variance
Autorregresive
coefficients
Fourier coefficients
Shift, trend, ...
CLASIFICADOR
SERIES C FEATURES
...
...
...
...
...
...
C
SERIES
CLASSIFIER
FEATURES
...
Time Series Data Mining Challenges
(Early) Supervised Classification
Feature-based time series classification
Representation
independent on the
number of series
Interpretable
representation
Challenge: what
features to use?
CLASIFICADOR
SERIES C FEATURES
...
...
...
...
...
...
C
SERIES
CLASSIFIER
FEATURES
...
Time Series Data Mining Challenges
(Early) Supervised Classification
Model-based time series classification
SERIES C
SERIES
PREDICTION
MODEL I
PREDICTION
MODEL II
PREDICTION
MODEL III
What is the most
probable model?
Time Series Data Mining Challenges
(Early) Supervised Classification
Model-based time series classification
Good results with an
appropriate model
Choice of model
Existence of model
SERIES C
SERIES
PREDICTION
MODEL I
PREDICTION
MODEL II
PREDICTION
MODEL III
What is the most
probable model?
Time Series Data Mining Challenges
(Early) Supervised Classification
Early time series classification
Examples
Early activity recognition
Early disease recognition in electrocardiograms
Early detection of sepsis in newborn
Early detection of failures in machines (predictive
maintenance)
Time Series Data Mining Challenges
(Early) Supervised Classification
Early time series classification
Balance between
accuracy and
earlyness
C1
C2
C3
C2
C3
C1
ALGORITHM
C2
TRAINING SET
?
?
Wait for
more data
CLASSIFIER
EARLY
Time Series Data Mining Challenges
(Early) Supervised Classification
Early time series classificationc (Mori et al 2017,
DAMI, TNNLS)
t1 2
t
C1 C2 C3
t3 ...
...
...
T1 T2 T3
Time Series Data Mining Challenges
(Early) Supervised Classification
Early time series classification
t1 2
t
C1 C2 C3
t3 ...
...
...
T1 T2 T3
Time Series Data Mining Challenges
(Early) Supervised Classification
Early time series classification
t1 2
t
C1 C2 C3
t3 ...
...
...
T1 T2 T3
Output BLUE class
Time Series Data Mining Challenges
(Early) Supervised Classification
Multivariate time series classification
CHALLENGE
Time Series Data Mining Challenges
Outlier/Anomaly Detection
Outline of the presentation
1 Time Series Data Mining Activities
2 Clustering
3 (Early) Supervised Classification
4 Outlier/Anomaly Detection
5 Conclusions and Future Work
Time Series Data Mining Challenges
Outlier/Anomaly Detection
Outlier vs Anomaly
Time Series Data Mining Challenges
Outlier/Anomaly Detection
Type of outlier: point outlier
Time Series Data Mining Challenges
Outlier/Anomaly Detection
Type of outlier: subsequence outlier
Time Series Data Mining Challenges
Outlier/Anomaly Detection
Type of outlier: series outlier
Time Series Data Mining Challenges
Outlier/Anomaly Detection
Outlier detection method: basic
|xt − x̂t|  τ
Time Series Data Mining Challenges
Outlier/Anomaly Detection
Outlier detection method: basic
|xt − x̂t|  τ Median
Time Series Data Mining Challenges
Outlier/Anomaly Detection
Outlier detection method: basic
|xt − x̂t|  τ MAD
Time Series Data Mining Challenges
Outlier/Anomaly Detection
Outlier detection method: basic
|xt − x̂t|  τ Model
Time Series Data Mining Challenges
Outlier/Anomaly Detection
An overview of outlier/anomaly detection
Time Series Data Mining Challenges
Conclusions and Future Work
Outline of the presentation
1 Time Series Data Mining Activities
2 Clustering
3 (Early) Supervised Classification
4 Outlier/Anomaly Detection
5 Conclusions and Future Work
Time Series Data Mining Challenges
Conclusions and Future Work
Not too explored lands
Challenges
Time series subset selection
Learning in weakly environments: semi-supervised,
multi-label, crowd learning
Theoretical bounds on learning: assumptions on the
generating model
Time Series Data Mining Challenges
Conclusions and Future Work
Collaboration
Usue Mori (UPV/EHU), Amaia Abanda (BCAM)
Ane Blazquez (Ikerlan), Angel Conde (Ikerlan)
Aritz Perez (BCAM), Izaskun Oregui (Tecnalia), Javier del
Ser (Tecnalia)
Josu Ircio (Ikerlan), Aizea Lojo (Ikerlan)
Time Series Data Mining Challenges
Conclusions and Future Work
Time Series Data Mining Challenges
Jose A. Lozano
Basque Center for Applied Mathematics (BCAM)
University of the Basque Country UPV/EHU
Time Series Days, Rennes, March 25-26, 2019

More Related Content

Similar to Una introducción a la minería de series temporales

Time Series Analysis and Mining with R
Time Series Analysis and Mining with RTime Series Analysis and Mining with R
Time Series Analysis and Mining with RYanchang Zhao
 
A study on improving speaker diarization system = Nghiên cứu phương pháp cải ...
A study on improving speaker diarization system = Nghiên cứu phương pháp cải ...A study on improving speaker diarization system = Nghiên cứu phương pháp cải ...
A study on improving speaker diarization system = Nghiên cứu phương pháp cải ...Man_Ebook
 
A study on improving speaker diarization system = Nghiên cứu phương pháp cải ...
A study on improving speaker diarization system = Nghiên cứu phương pháp cải ...A study on improving speaker diarization system = Nghiên cứu phương pháp cải ...
A study on improving speaker diarization system = Nghiên cứu phương pháp cải ...Man_Ebook
 
Efficient Implementation of Self-Organizing Map for Sparse Input Data
Efficient Implementation of Self-Organizing Map for Sparse Input DataEfficient Implementation of Self-Organizing Map for Sparse Input Data
Efficient Implementation of Self-Organizing Map for Sparse Input Dataymelka
 
Crash course on data streaming (with examples using Apache Flink)
Crash course on data streaming (with examples using Apache Flink)Crash course on data streaming (with examples using Apache Flink)
Crash course on data streaming (with examples using Apache Flink)Vincenzo Gulisano
 
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...KAIST
 
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...KAIST
 
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017StampedeCon
 
Introduction to deep learning based voice activity detection
Introduction to deep learning based voice activity detectionIntroduction to deep learning based voice activity detection
Introduction to deep learning based voice activity detectionNAVER Engineering
 
CARI-2020, Application of LSTM architectures for next frame forecasting in Se...
CARI-2020, Application of LSTM architectures for next frame forecasting in Se...CARI-2020, Application of LSTM architectures for next frame forecasting in Se...
CARI-2020, Application of LSTM architectures for next frame forecasting in Se...Mokhtar SELLAMI
 
Weight watcher Bay Area ACM Feb 28, 2022
Weight watcher Bay Area ACM Feb 28, 2022 Weight watcher Bay Area ACM Feb 28, 2022
Weight watcher Bay Area ACM Feb 28, 2022 Charles Martin
 
Finite-Length Performance Analysis of Slotted ALOHA-Thesis.pdf
Finite-Length Performance Analysis of Slotted ALOHA-Thesis.pdfFinite-Length Performance Analysis of Slotted ALOHA-Thesis.pdf
Finite-Length Performance Analysis of Slotted ALOHA-Thesis.pdfYAAKOVSOLOMON1
 
Machine learning for_finance
Machine learning for_financeMachine learning for_finance
Machine learning for_financeStefan Duprey
 
Anomaly Detection in Temporal data Using Kmeans Clustering with C5.0
Anomaly Detection in Temporal data Using Kmeans Clustering with C5.0Anomaly Detection in Temporal data Using Kmeans Clustering with C5.0
Anomaly Detection in Temporal data Using Kmeans Clustering with C5.0theijes
 
Discrete-event simulation: best practices and implementation details in Pytho...
Discrete-event simulation: best practices and implementation details in Pytho...Discrete-event simulation: best practices and implementation details in Pytho...
Discrete-event simulation: best practices and implementation details in Pytho...Carlos Natalino da Silva
 

Similar to Una introducción a la minería de series temporales (20)

20191107 breizh data_day
20191107 breizh data_day20191107 breizh data_day
20191107 breizh data_day
 
Time Series Analysis and Mining with R
Time Series Analysis and Mining with RTime Series Analysis and Mining with R
Time Series Analysis and Mining with R
 
sequencea.ppt
sequencea.pptsequencea.ppt
sequencea.ppt
 
A study on improving speaker diarization system = Nghiên cứu phương pháp cải ...
A study on improving speaker diarization system = Nghiên cứu phương pháp cải ...A study on improving speaker diarization system = Nghiên cứu phương pháp cải ...
A study on improving speaker diarization system = Nghiên cứu phương pháp cải ...
 
A study on improving speaker diarization system = Nghiên cứu phương pháp cải ...
A study on improving speaker diarization system = Nghiên cứu phương pháp cải ...A study on improving speaker diarization system = Nghiên cứu phương pháp cải ...
A study on improving speaker diarization system = Nghiên cứu phương pháp cải ...
 
Efficient Implementation of Self-Organizing Map for Sparse Input Data
Efficient Implementation of Self-Organizing Map for Sparse Input DataEfficient Implementation of Self-Organizing Map for Sparse Input Data
Efficient Implementation of Self-Organizing Map for Sparse Input Data
 
AUDIBERT_Julien_2021.pdf
AUDIBERT_Julien_2021.pdfAUDIBERT_Julien_2021.pdf
AUDIBERT_Julien_2021.pdf
 
Crash course on data streaming (with examples using Apache Flink)
Crash course on data streaming (with examples using Apache Flink)Crash course on data streaming (with examples using Apache Flink)
Crash course on data streaming (with examples using Apache Flink)
 
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
 
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
[AAAI2018] Multispectral Transfer Network: Unsupervised Depth Estimation for ...
 
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
 
Introduction to deep learning based voice activity detection
Introduction to deep learning based voice activity detectionIntroduction to deep learning based voice activity detection
Introduction to deep learning based voice activity detection
 
CARI-2020, Application of LSTM architectures for next frame forecasting in Se...
CARI-2020, Application of LSTM architectures for next frame forecasting in Se...CARI-2020, Application of LSTM architectures for next frame forecasting in Se...
CARI-2020, Application of LSTM architectures for next frame forecasting in Se...
 
BIRTE-13-Kawashima
BIRTE-13-KawashimaBIRTE-13-Kawashima
BIRTE-13-Kawashima
 
Weight watcher Bay Area ACM Feb 28, 2022
Weight watcher Bay Area ACM Feb 28, 2022 Weight watcher Bay Area ACM Feb 28, 2022
Weight watcher Bay Area ACM Feb 28, 2022
 
Finite-Length Performance Analysis of Slotted ALOHA-Thesis.pdf
Finite-Length Performance Analysis of Slotted ALOHA-Thesis.pdfFinite-Length Performance Analysis of Slotted ALOHA-Thesis.pdf
Finite-Length Performance Analysis of Slotted ALOHA-Thesis.pdf
 
Machine learning for_finance
Machine learning for_financeMachine learning for_finance
Machine learning for_finance
 
Anomaly Detection in Temporal data Using Kmeans Clustering with C5.0
Anomaly Detection in Temporal data Using Kmeans Clustering with C5.0Anomaly Detection in Temporal data Using Kmeans Clustering with C5.0
Anomaly Detection in Temporal data Using Kmeans Clustering with C5.0
 
Csmr10a.ppt
Csmr10a.pptCsmr10a.ppt
Csmr10a.ppt
 
Discrete-event simulation: best practices and implementation details in Pytho...
Discrete-event simulation: best practices and implementation details in Pytho...Discrete-event simulation: best practices and implementation details in Pytho...
Discrete-event simulation: best practices and implementation details in Pytho...
 

More from Facultad de Informática UCM

¿Por qué debemos seguir trabajando en álgebra lineal?
¿Por qué debemos seguir trabajando en álgebra lineal?¿Por qué debemos seguir trabajando en álgebra lineal?
¿Por qué debemos seguir trabajando en álgebra lineal?Facultad de Informática UCM
 
TECNOPOLÍTICA Y ACTIVISMO DE DATOS: EL MAPEO COMO FORMA DE RESILIENCIA ANTE L...
TECNOPOLÍTICA Y ACTIVISMO DE DATOS: EL MAPEO COMO FORMA DE RESILIENCIA ANTE L...TECNOPOLÍTICA Y ACTIVISMO DE DATOS: EL MAPEO COMO FORMA DE RESILIENCIA ANTE L...
TECNOPOLÍTICA Y ACTIVISMO DE DATOS: EL MAPEO COMO FORMA DE RESILIENCIA ANTE L...Facultad de Informática UCM
 
DRAC: Designing RISC-V-based Accelerators for next generation Computers
DRAC: Designing RISC-V-based Accelerators for next generation ComputersDRAC: Designing RISC-V-based Accelerators for next generation Computers
DRAC: Designing RISC-V-based Accelerators for next generation ComputersFacultad de Informática UCM
 
Tendencias en el diseño de procesadores con arquitectura Arm
Tendencias en el diseño de procesadores con arquitectura ArmTendencias en el diseño de procesadores con arquitectura Arm
Tendencias en el diseño de procesadores con arquitectura ArmFacultad de Informática UCM
 
Introduction to Quantum Computing and Quantum Service Oriented Computing
Introduction to Quantum Computing and Quantum Service Oriented ComputingIntroduction to Quantum Computing and Quantum Service Oriented Computing
Introduction to Quantum Computing and Quantum Service Oriented ComputingFacultad de Informática UCM
 
Inteligencia Artificial en la atención sanitaria del futuro
Inteligencia Artificial en la atención sanitaria del futuroInteligencia Artificial en la atención sanitaria del futuro
Inteligencia Artificial en la atención sanitaria del futuroFacultad de Informática UCM
 
Design Automation Approaches for Real-Time Edge Computing for Science Applic...
 Design Automation Approaches for Real-Time Edge Computing for Science Applic... Design Automation Approaches for Real-Time Edge Computing for Science Applic...
Design Automation Approaches for Real-Time Edge Computing for Science Applic...Facultad de Informática UCM
 
Estrategias de navegación para robótica móvil de campo: caso de estudio proye...
Estrategias de navegación para robótica móvil de campo: caso de estudio proye...Estrategias de navegación para robótica móvil de campo: caso de estudio proye...
Estrategias de navegación para robótica móvil de campo: caso de estudio proye...Facultad de Informática UCM
 
Fault-tolerance Quantum computation and Quantum Error Correction
Fault-tolerance Quantum computation and Quantum Error CorrectionFault-tolerance Quantum computation and Quantum Error Correction
Fault-tolerance Quantum computation and Quantum Error CorrectionFacultad de Informática UCM
 
Cómo construir un chatbot inteligente sin morir en el intento
Cómo construir un chatbot inteligente sin morir en el intentoCómo construir un chatbot inteligente sin morir en el intento
Cómo construir un chatbot inteligente sin morir en el intentoFacultad de Informática UCM
 
Automatic generation of hardware memory architectures for HPC
Automatic generation of hardware memory architectures for HPCAutomatic generation of hardware memory architectures for HPC
Automatic generation of hardware memory architectures for HPCFacultad de Informática UCM
 
Hardware/software security contracts: Principled foundations for building sec...
Hardware/software security contracts: Principled foundations for building sec...Hardware/software security contracts: Principled foundations for building sec...
Hardware/software security contracts: Principled foundations for building sec...Facultad de Informática UCM
 
Jose carlossancho slidesLa seguridad en el desarrollo de software implementad...
Jose carlossancho slidesLa seguridad en el desarrollo de software implementad...Jose carlossancho slidesLa seguridad en el desarrollo de software implementad...
Jose carlossancho slidesLa seguridad en el desarrollo de software implementad...Facultad de Informática UCM
 
Redes neuronales y reinforcement learning. Aplicación en energía eólica.
Redes neuronales y reinforcement learning. Aplicación en energía eólica.Redes neuronales y reinforcement learning. Aplicación en energía eólica.
Redes neuronales y reinforcement learning. Aplicación en energía eólica.Facultad de Informática UCM
 
Challenges and Opportunities for AI and Data analytics in Offshore wind
Challenges and Opportunities for AI and Data analytics in Offshore windChallenges and Opportunities for AI and Data analytics in Offshore wind
Challenges and Opportunities for AI and Data analytics in Offshore windFacultad de Informática UCM
 

More from Facultad de Informática UCM (20)

¿Por qué debemos seguir trabajando en álgebra lineal?
¿Por qué debemos seguir trabajando en álgebra lineal?¿Por qué debemos seguir trabajando en álgebra lineal?
¿Por qué debemos seguir trabajando en álgebra lineal?
 
TECNOPOLÍTICA Y ACTIVISMO DE DATOS: EL MAPEO COMO FORMA DE RESILIENCIA ANTE L...
TECNOPOLÍTICA Y ACTIVISMO DE DATOS: EL MAPEO COMO FORMA DE RESILIENCIA ANTE L...TECNOPOLÍTICA Y ACTIVISMO DE DATOS: EL MAPEO COMO FORMA DE RESILIENCIA ANTE L...
TECNOPOLÍTICA Y ACTIVISMO DE DATOS: EL MAPEO COMO FORMA DE RESILIENCIA ANTE L...
 
DRAC: Designing RISC-V-based Accelerators for next generation Computers
DRAC: Designing RISC-V-based Accelerators for next generation ComputersDRAC: Designing RISC-V-based Accelerators for next generation Computers
DRAC: Designing RISC-V-based Accelerators for next generation Computers
 
uElectronics ongoing activities at ESA
uElectronics ongoing activities at ESAuElectronics ongoing activities at ESA
uElectronics ongoing activities at ESA
 
Tendencias en el diseño de procesadores con arquitectura Arm
Tendencias en el diseño de procesadores con arquitectura ArmTendencias en el diseño de procesadores con arquitectura Arm
Tendencias en el diseño de procesadores con arquitectura Arm
 
Formalizing Mathematics in Lean
Formalizing Mathematics in LeanFormalizing Mathematics in Lean
Formalizing Mathematics in Lean
 
Introduction to Quantum Computing and Quantum Service Oriented Computing
Introduction to Quantum Computing and Quantum Service Oriented ComputingIntroduction to Quantum Computing and Quantum Service Oriented Computing
Introduction to Quantum Computing and Quantum Service Oriented Computing
 
Computer Design Concepts for Machine Learning
Computer Design Concepts for Machine LearningComputer Design Concepts for Machine Learning
Computer Design Concepts for Machine Learning
 
Inteligencia Artificial en la atención sanitaria del futuro
Inteligencia Artificial en la atención sanitaria del futuroInteligencia Artificial en la atención sanitaria del futuro
Inteligencia Artificial en la atención sanitaria del futuro
 
Design Automation Approaches for Real-Time Edge Computing for Science Applic...
 Design Automation Approaches for Real-Time Edge Computing for Science Applic... Design Automation Approaches for Real-Time Edge Computing for Science Applic...
Design Automation Approaches for Real-Time Edge Computing for Science Applic...
 
Estrategias de navegación para robótica móvil de campo: caso de estudio proye...
Estrategias de navegación para robótica móvil de campo: caso de estudio proye...Estrategias de navegación para robótica móvil de campo: caso de estudio proye...
Estrategias de navegación para robótica móvil de campo: caso de estudio proye...
 
Fault-tolerance Quantum computation and Quantum Error Correction
Fault-tolerance Quantum computation and Quantum Error CorrectionFault-tolerance Quantum computation and Quantum Error Correction
Fault-tolerance Quantum computation and Quantum Error Correction
 
Cómo construir un chatbot inteligente sin morir en el intento
Cómo construir un chatbot inteligente sin morir en el intentoCómo construir un chatbot inteligente sin morir en el intento
Cómo construir un chatbot inteligente sin morir en el intento
 
Automatic generation of hardware memory architectures for HPC
Automatic generation of hardware memory architectures for HPCAutomatic generation of hardware memory architectures for HPC
Automatic generation of hardware memory architectures for HPC
 
Type and proof structures for concurrency
Type and proof structures for concurrencyType and proof structures for concurrency
Type and proof structures for concurrency
 
Hardware/software security contracts: Principled foundations for building sec...
Hardware/software security contracts: Principled foundations for building sec...Hardware/software security contracts: Principled foundations for building sec...
Hardware/software security contracts: Principled foundations for building sec...
 
Jose carlossancho slidesLa seguridad en el desarrollo de software implementad...
Jose carlossancho slidesLa seguridad en el desarrollo de software implementad...Jose carlossancho slidesLa seguridad en el desarrollo de software implementad...
Jose carlossancho slidesLa seguridad en el desarrollo de software implementad...
 
Do you trust your artificial intelligence system?
Do you trust your artificial intelligence system?Do you trust your artificial intelligence system?
Do you trust your artificial intelligence system?
 
Redes neuronales y reinforcement learning. Aplicación en energía eólica.
Redes neuronales y reinforcement learning. Aplicación en energía eólica.Redes neuronales y reinforcement learning. Aplicación en energía eólica.
Redes neuronales y reinforcement learning. Aplicación en energía eólica.
 
Challenges and Opportunities for AI and Data analytics in Offshore wind
Challenges and Opportunities for AI and Data analytics in Offshore windChallenges and Opportunities for AI and Data analytics in Offshore wind
Challenges and Opportunities for AI and Data analytics in Offshore wind
 

Recently uploaded

XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXssuser89054b
 
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
 
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapRishantSharmaFr
 
Unit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdfUnit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdfRagavanV2
 
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoorTop Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoordharasingh5698
 
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank  Design by Working Stress - IS Method.pdfIntze Overhead Water Tank  Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdfSuman Jyoti
 
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...tanu pandey
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...Call Girls in Nagpur High Profile
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startQuintin Balsdon
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdfankushspencer015
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VDineshKumar4165
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Bookingdharasingh5698
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...SUHANI PANDEY
 
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...Arindam Chakraborty, Ph.D., P.E. (CA, TX)
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdfKamal Acharya
 

Recently uploaded (20)

(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
 
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
 
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leap
 
Unit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdfUnit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdf
 
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoorTop Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
 
NFPA 5000 2024 standard .
NFPA 5000 2024 standard                                  .NFPA 5000 2024 standard                                  .
NFPA 5000 2024 standard .
 
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank  Design by Working Stress - IS Method.pdfIntze Overhead Water Tank  Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
 
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the start
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
 
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdf
 

Una introducción a la minería de series temporales

  • 1. Time Series Data Mining Challenges Time Series Data Mining Challenges Jose A. Lozano Basque Center for Applied Mathematics (BCAM) University of the Basque Country UPV/EHU Time Series Days, Rennes, March 25-26, 2019
  • 2. Time Series Data Mining Challenges Outline of the presentation 1 Time Series Data Mining Activities 2 Clustering 3 (Early) Supervised Classification 4 Outlier/Anomaly Detection 5 Conclusions and Future Work
  • 3. Time Series Data Mining Challenges Time Series Data Mining Activities Outline of the presentation 1 Time Series Data Mining Activities 2 Clustering 3 (Early) Supervised Classification 4 Outlier/Anomaly Detection 5 Conclusions and Future Work
  • 4. Time Series Data Mining Challenges Time Series Data Mining Activities Time series all around Temporal correlation High dimen- sionality Noisy Industry 4.0 Bio Signals Weather Forecasting Shapes
  • 5. Time Series Data Mining Challenges Time Series Data Mining Activities Time series forecasting
  • 6. Time Series Data Mining Challenges Time Series Data Mining Activities Time series data base: our object of study A set of time series (usually big) Different lengths Multidimensional
  • 7. Time Series Data Mining Challenges Time Series Data Mining Activities Time series clustering. Examples CLUSTERING ALGORITHM
  • 8. Time Series Data Mining Challenges Time Series Data Mining Activities Supervised classification of time series C1 C2 C3 C2 C3 C1 ALGORITHM CLASSIFIER ? C2 TRAINING SET
  • 9. Time Series Data Mining Challenges Time Series Data Mining Activities Anomaly/outlier detection
  • 10. Time Series Data Mining Challenges Time Series Data Mining Activities Segmentation
  • 11. Time Series Data Mining Challenges Clustering Outline of the presentation 1 Time Series Data Mining Activities 2 Clustering 3 (Early) Supervised Classification 4 Outlier/Anomaly Detection 5 Conclusions and Future Work
  • 12. Time Series Data Mining Challenges Clustering Time series clustering. Examples CLUSTERING ALGORITHM
  • 13. Time Series Data Mining Challenges Clustering Time series clustering: hierarchical, partitional we need a DISTANCE 0 100 200 300 400 Series 1 Series 2 Series 3 Series 4 Series 5 Series 6 Series 7 Series 8 Series 9 k-means
  • 14. Time Series Data Mining Challenges Clustering Distance between time series Rigid Distance Flexible Distance
  • 15. Time Series Data Mining Challenges Clustering Euclidean Distance (ED) D(X, Y) = qPn i=1(xi − yi)2 Easy to compute Only for series with the same distance Does not consider the time Sensitivity to noise
  • 16. Time Series Data Mining Challenges Clustering Dynamic Time Warping (DTW) Takes into account the ordered sequence (time) It can deal with series of different sizes Computationally expensive O(min{m, n}2)
  • 17. Time Series Data Mining Challenges Clustering Euclidean Distance vs Dynamic Time Warping EUCLIDEAN DTW 6 6
  • 18. Time Series Data Mining Challenges Clustering Remark on distances More on elastic distances Cheap versions of dynamic time warping (Sakoe-chiba, bounding) Edit distance for real sequences (EDR) Mori et al 2016, R journal On-line versions (Oregi et al 2019, PR) Alternatives to calculate distances Represent each series by means of a set of features and calculate the distance between the features Learn a parametric model for each series and calculate the distance between the parameters
  • 19. Time Series Data Mining Challenges Clustering Distances between series Remarks There is not best distance (no free lunch) Each problem requires a different distance The distance to be used needs to be in agreement with our knowledge about what is far and what is close Hint: try with several distances Challenge: Design a method to the (semi)automatic selection of a distance (e.g. Mori et al. 2016, TKDE)
  • 20. Time Series Data Mining Challenges Clustering ...Coming back to clustering: K-means k-medois k-means k-medoids
  • 21. Time Series Data Mining Challenges Clustering Remarks on clustering Recent papers on the computation of a mean series Alternate clustering methods: graph-based, spectral, model-based,... Multivariate time series clustering
  • 22. Time Series Data Mining Challenges (Early) Supervised Classification Outline of the presentation 1 Time Series Data Mining Activities 2 Clustering 3 (Early) Supervised Classification 4 Outlier/Anomaly Detection 5 Conclusions and Future Work
  • 23. Time Series Data Mining Challenges (Early) Supervised Classification Supervised Classification of Time Series General-purpose classifiers Specific TS classifiers FEATURES C FEATURES CLASSIFIER SERIES C SERIES CLASSIFIER
  • 24. Time Series Data Mining Challenges (Early) Supervised Classification General-purpose classifiers Each series is considered an instance Each time stamp is considered a feature t1 t2 t3 . . . tn C x11 x12 x13 . . . x1n c1 x21 x22 x23 . . . x2n c2 . . . . . . . . . . . . . . . . . . xm1 xm2 xm3 . . . xmn c2
  • 25. Time Series Data Mining Challenges (Early) Supervised Classification General-purpose classifiers Each series is considered an instance Each time stamp is considered a feature t2 t1 t3 . . . tn C x12 x11 x13 . . . x1n c1 x22 x21 x23 . . . x2n c2 . . . . . . . . . . . . . . . . . . xm2 xm1 xm3 . . . xmn c2
  • 26. Time Series Data Mining Challenges (Early) Supervised Classification General-purpose classifiers Each series is considered an instance Each time stamp is considered a feature t2 t1 t3 . . . tn C x12 x11 x13 . . . x1n c1 x22 x21 x23 . . . x2n c2 . . . . . . . . . . . . . . . . . . xm2 xm1 xm3 . . . xmn c2 CHALLENGE When to use general-purpose and when time-series specific?
  • 27. Time Series Data Mining Challenges (Early) Supervised Classification What is relevant in TSC? PROBLEM I PROBLEM II
  • 28. Time Series Data Mining Challenges (Early) Supervised Classification What is relevant in TSC? PROBLEM I PROBLEM II SHAPE
  • 29. Time Series Data Mining Challenges (Early) Supervised Classification What is relevant in TSC? PROBLEM I PROBLEM II SHAPE LOCATION
  • 30. Time Series Data Mining Challenges (Early) Supervised Classification A taxonomy of time series classification methods Taxonomy Distance-based classifiers Model-based classifiers Feature-based classifiers
  • 31. Time Series Data Mining Challenges (Early) Supervised Classification Taxonomy of distance-based TSC (Abanda et al. 2019, DAMI)
  • 32. Time Series Data Mining Challenges (Early) Supervised Classification 1-Nearest Neighbour (1-NN) C1 C2 C3 C2 C3 C1 ? C2 (d1 d2 d3 d4 d5 d6) MINIMUM DISTANCE
  • 33. Time Series Data Mining Challenges (Early) Supervised Classification 1-Nearest Neighbour (1-NN) Easy to understand Better results with higher number of series Computational cost Challenge: What distance??? C1 C2 C3 C2 C3 C1 ? C2 (d1 d2 d3 d4 d5 d6) MINIMUM DISTANCE
  • 34. Time Series Data Mining Challenges (Early) Supervised Classification Distance-based. Distance features CLASIFICADOR SERIES C DISTANCE MATRIX ... ... ... ... ... ... C SERIES CLASSIFIER DISTANCES ...
  • 35. Time Series Data Mining Challenges (Early) Supervised Classification Distance-based. Distance features. Global CLASIFICADOR SERIES C DISTANCE MATRIX ... ... ... ... ... ... C SERIES CLASSIFIER DISTANCES ... VECTORS!
  • 36. Time Series Data Mining Challenges (Early) Supervised Classification Distance-based. Distance features. Global Any general-purpose algorithm could be applied It depends on the number of series in training Computationally expensive Difficult to transfer to the on-line setting CLASIFICADOR SERIES C DISTANCE MATRIX ... ... ... ... ... ... C SERIES CLASSIFIER DISTANCES ... VECTORS!
  • 37. Time Series Data Mining Challenges (Early) Supervised Classification Distance-based. Distance features. Local Lij could be distance or presence Computationally expensive When the shapelets are relevant extremely good results Easy to interpret Shapelet 2
  • 38. Time Series Data Mining Challenges (Early) Supervised Classification Distance-based. Embedding
  • 39. Time Series Data Mining Challenges (Early) Supervised Classification Distance-based. Embedding Many classifiers defined in Euclidean spaces Computational complexity Prediction
  • 40. Time Series Data Mining Challenges (Early) Supervised Classification Distance Kernels Definite (PSD) Kernel All the SVM machinery works Difficult to define/check Indefinite Theoretical properties are lost Easy to define Some methods can not be applied
  • 41. Time Series Data Mining Challenges (Early) Supervised Classification Distance Kernels. Indefinite Gaussian Distance Substitution Kernels GDSd (x, x0 ) = exp − d(x, x0)2 σ2 where d = DTW, ..
  • 42. Time Series Data Mining Challenges (Early) Supervised Classification Feature-based time series classification CLASIFICADOR SERIES C FEATURES ... ... ... ... ... ... C SERIES CLASSIFIER FEATURES ...
  • 43. Time Series Data Mining Challenges (Early) Supervised Classification Feature-based time series classification Features Statistics: mean, variance Autorregresive coefficients Fourier coefficients Shift, trend, ... CLASIFICADOR SERIES C FEATURES ... ... ... ... ... ... C SERIES CLASSIFIER FEATURES ...
  • 44. Time Series Data Mining Challenges (Early) Supervised Classification Feature-based time series classification Representation independent on the number of series Interpretable representation Challenge: what features to use? CLASIFICADOR SERIES C FEATURES ... ... ... ... ... ... C SERIES CLASSIFIER FEATURES ...
  • 45. Time Series Data Mining Challenges (Early) Supervised Classification Model-based time series classification SERIES C SERIES PREDICTION MODEL I PREDICTION MODEL II PREDICTION MODEL III What is the most probable model?
  • 46. Time Series Data Mining Challenges (Early) Supervised Classification Model-based time series classification Good results with an appropriate model Choice of model Existence of model SERIES C SERIES PREDICTION MODEL I PREDICTION MODEL II PREDICTION MODEL III What is the most probable model?
  • 47. Time Series Data Mining Challenges (Early) Supervised Classification Early time series classification Examples Early activity recognition Early disease recognition in electrocardiograms Early detection of sepsis in newborn Early detection of failures in machines (predictive maintenance)
  • 48. Time Series Data Mining Challenges (Early) Supervised Classification Early time series classification Balance between accuracy and earlyness C1 C2 C3 C2 C3 C1 ALGORITHM C2 TRAINING SET ? ? Wait for more data CLASSIFIER EARLY
  • 49. Time Series Data Mining Challenges (Early) Supervised Classification Early time series classificationc (Mori et al 2017, DAMI, TNNLS) t1 2 t C1 C2 C3 t3 ... ... ... T1 T2 T3
  • 50. Time Series Data Mining Challenges (Early) Supervised Classification Early time series classification t1 2 t C1 C2 C3 t3 ... ... ... T1 T2 T3
  • 51. Time Series Data Mining Challenges (Early) Supervised Classification Early time series classification t1 2 t C1 C2 C3 t3 ... ... ... T1 T2 T3 Output BLUE class
  • 52. Time Series Data Mining Challenges (Early) Supervised Classification Multivariate time series classification CHALLENGE
  • 53. Time Series Data Mining Challenges Outlier/Anomaly Detection Outline of the presentation 1 Time Series Data Mining Activities 2 Clustering 3 (Early) Supervised Classification 4 Outlier/Anomaly Detection 5 Conclusions and Future Work
  • 54. Time Series Data Mining Challenges Outlier/Anomaly Detection Outlier vs Anomaly
  • 55. Time Series Data Mining Challenges Outlier/Anomaly Detection Type of outlier: point outlier
  • 56. Time Series Data Mining Challenges Outlier/Anomaly Detection Type of outlier: subsequence outlier
  • 57. Time Series Data Mining Challenges Outlier/Anomaly Detection Type of outlier: series outlier
  • 58. Time Series Data Mining Challenges Outlier/Anomaly Detection Outlier detection method: basic |xt − x̂t| τ
  • 59. Time Series Data Mining Challenges Outlier/Anomaly Detection Outlier detection method: basic |xt − x̂t| τ Median
  • 60. Time Series Data Mining Challenges Outlier/Anomaly Detection Outlier detection method: basic |xt − x̂t| τ MAD
  • 61. Time Series Data Mining Challenges Outlier/Anomaly Detection Outlier detection method: basic |xt − x̂t| τ Model
  • 62. Time Series Data Mining Challenges Outlier/Anomaly Detection An overview of outlier/anomaly detection
  • 63. Time Series Data Mining Challenges Conclusions and Future Work Outline of the presentation 1 Time Series Data Mining Activities 2 Clustering 3 (Early) Supervised Classification 4 Outlier/Anomaly Detection 5 Conclusions and Future Work
  • 64. Time Series Data Mining Challenges Conclusions and Future Work Not too explored lands Challenges Time series subset selection Learning in weakly environments: semi-supervised, multi-label, crowd learning Theoretical bounds on learning: assumptions on the generating model
  • 65. Time Series Data Mining Challenges Conclusions and Future Work Collaboration Usue Mori (UPV/EHU), Amaia Abanda (BCAM) Ane Blazquez (Ikerlan), Angel Conde (Ikerlan) Aritz Perez (BCAM), Izaskun Oregui (Tecnalia), Javier del Ser (Tecnalia) Josu Ircio (Ikerlan), Aizea Lojo (Ikerlan)
  • 66. Time Series Data Mining Challenges Conclusions and Future Work Time Series Data Mining Challenges Jose A. Lozano Basque Center for Applied Mathematics (BCAM) University of the Basque Country UPV/EHU Time Series Days, Rennes, March 25-26, 2019