SlideShare a Scribd company logo
1 of 25
Download to read offline
Introduction to financial time series clustering
Empirical results from the clustering stability study
Conclusion
On the Stability of Clustering Financial Time
Series – How to investigate?
IEEE ICMLA Miami, Florida, USA, December 9-11, 2015
Gautier Marti, Philippe Very, Philippe Donnat, Frank Nielsen
9 December 2015
Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
Introduction to financial time series clustering
Empirical results from the clustering stability study
Conclusion
1 Introduction to financial time series clustering
2 Empirical results from the clustering stability study
3 Conclusion
Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
Introduction to financial time series clustering
Empirical results from the clustering stability study
Conclusion
Financial time series (data from www.datagrapple.com)
Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
Introduction to financial time series clustering
Empirical results from the clustering stability study
Conclusion
Clustering?
Definition
Clustering is the task of grouping a set of objects in such a way
that objects in the same group (cluster) are more similar to each
other than those in different groups.
French banks (blue) and
building materials (red)
CDS over 2006-2015
Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
Introduction to financial time series clustering
Empirical results from the clustering stability study
Conclusion
Why clustering?
Mathematical finance: Use of variance-covariance matrices
(e.g., Markowitz, Value-at-Risk)
Stylized fact: Empirical
variance-covariance matrices
estimated on financial time
series are very noisy
(Random Matrix Theory,
Noise Dressing of Financial
Correlation Matrices, Laloux
et al, 1999)
0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0
λ
0.0
0.2
0.4
0.6
0.8
1.0
1.2
1.4
1.6
1.8
ρ(λ)
Marchenko-Pastur distribution vs.
empirical eigenvalues distribution
of the correlation matrix
How to filter these variance-covariance matrices?
Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
Introduction to financial time series clustering
Empirical results from the clustering stability study
Conclusion
For filtering, clustering!
Mantegna (1999) et al’s work:
0 100 200 300 400 500
0
100
200
300
400
500
0 100 200 300 400 500
0
100
200
300
400
500
0 100 200 300 400 500
0
100
200
300
400
500
(left) empirical correlation matrix
(center) the same matrix seriated using a hierarchical clustering
(right) correlations filtered using the clustering structure
N.B. other applications: statarb, alternative risk measures
Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
Introduction to financial time series clustering
Empirical results from the clustering stability study
Conclusion
Why stability?
statistical consistency of
the clustering method
requires assumptions that
may not hold in practice:
e.g. returns are i.i.d.,
underlying elliptical copula,
enough data is available
stability is a weaker
property: reproducibility of
results across a wide range
of slight data perturbations
Clusters obtained at time t, t + 1,
t + 2; Is the difference between the
successive clusters a“true”signal?
Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
Introduction to financial time series clustering
Empirical results from the clustering stability study
Conclusion
Is the clustering of financial time series stable?
According to [2], clusters are not stable
with respect to the clustering algorithm,
but only a squared Euclidean distance was considered which is not
relevant for clustering assets from their returns (cf. [4]).
Idea: A more relevant distance should increase stability
We investigate the clustering stability resulting from using:
an Euclidean distance
a Pearson correlation distance [3]
a Spearman correlation distance
a distance for comparing two dependent random variables [4]
Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
Introduction to financial time series clustering
Empirical results from the clustering stability study
Conclusion
Some usual distances for clustering financial time series
(Pi
t )t≥0
Si
t+1 = log Pi
t+1 −log Pi
t
(Si
t )t≥1
Euclidean distance:
d(Si , Sj ) = T
t=1(Si
t − Sj
t )2
Pearson correl.: ρ(Si , Sj ) =
T
t=1(Si
t −Si )(Sj
t −Sj )
T
t=1(Si
t −Si )2 T
t=1(Sj
t −Sj )2
Spearman correl.: ρS (Si , Sj ) =
1 − 6
T(T2−1)
T
t=1(Si
(t) − Si
(t))2
Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
Introduction to financial time series clustering
Empirical results from the clustering stability study
Conclusion
Generic Non-Parametric Distance [4]
d2
θ (Xi , Xj ) = θ3E |Pi (Xi ) − Pj (Xj )|2
+ (1 − θ)
1
2 R
dPi
dλ
−
dPj
dλ
2
dλ
(i) 0 ≤ dθ ≤ 1, (ii) 0 < θ < 1, dθ metric,
(iii) dθ is invariant under diffeomorphism
Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
Introduction to financial time series clustering
Empirical results from the clustering stability study
Conclusion
Generic Non-Parametric Distance [4]
d2
0 : 1
2 R
dPi
dλ −
dPj
dλ
2
dλ = Hellinger2
d2
1 : 3E |Pi (Xi ) − Pj (Xj )|2
=
1 − ρS
2
= 2−6
1
0
1
0
C(u, v)dudv
Remark: If
f (x, θ) = c(F1(x1; ν1), . . . , FN(xN; νN); θc)
N
i=1
fi (xi ; νi )
then with CML hypothesis
ds2
= ds2
copula +
N
i=1
ds2
margins
Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
Introduction to financial time series clustering
Empirical results from the clustering stability study
Conclusion
1 Introduction to financial time series clustering
2 Empirical results from the clustering stability study
3 Conclusion
Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
Introduction to financial time series clustering
Empirical results from the clustering stability study
Conclusion
Sliding Window
PCA stability curve (red) vs.
Euclidean Clusters stability curve as
a function of time using results from
[1] for fair comparison: clusters are
more stable
most basic perturbation:
traders face it everyday
when monitoring their
indicators
we do not want to overfit
our analysis to this
particular stability goal
stability perf.: dist. [4]
Spearman Pearson
Euclidean
Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
Introduction to financial time series clustering
Empirical results from the clustering stability study
Conclusion
Odd vs. Even
A clustering al-
gorithm applied
on two samples
describing the same
phenomenon should
yield the same
results.
How to obtain two
of these samples? (un)Stability of
clusters with L2
distance
Stability of clusters
with the proposed
distance [4]
Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
Introduction to financial time series clustering
Empirical results from the clustering stability study
Conclusion
Economic Regimes
AXA 5-year CDS spread over 2006-2015
Average of the pairwise
correlations; correlation
skyrockets during crises
Is the clustering structure persistent?
Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
Introduction to financial time series clustering
Empirical results from the clustering stability study
Conclusion
Economic Regimes Clustering Stability
Pearson (top left), Spearman (top right),
Euclidean (bottom left), corr+distr (bottom right)
Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
Introduction to financial time series clustering
Empirical results from the clustering stability study
Conclusion
Heart vs. Tails Clustering Stability
≈ orange+red vs. green+yellow periods
Pearson (top left), Spearman (top right),
Euclidean (bottom left), corr+distr (bottom right)
Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
Introduction to financial time series clustering
Empirical results from the clustering stability study
Conclusion
Multiscale
Is the clustering structure persistent to different sampling frequencies?
Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
Introduction to financial time series clustering
Empirical results from the clustering stability study
Conclusion
Multiscale Clustering Stability
Pearson (top left), Spearman (top right),
Euclidean (bottom left), corr+distr (bottom right)
Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
Introduction to financial time series clustering
Empirical results from the clustering stability study
Conclusion
Maturities & Term Structure
An asset is described by several time series whose dynamics are similar:
Nokia Oyj is described here by the cost of insurance against its default
for {1, 3, 5, 7, 10} years
Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
Introduction to financial time series clustering
Empirical results from the clustering stability study
Conclusion
Maturities & Term Structure Clustering Stability
Pearson (top left), Spearman (top right),
Euclidean (bottom left), corr+distr (bottom right)
Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
Introduction to financial time series clustering
Empirical results from the clustering stability study
Conclusion
1 Introduction to financial time series clustering
2 Empirical results from the clustering stability study
3 Conclusion
Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
Introduction to financial time series clustering
Empirical results from the clustering stability study
Conclusion
Discussion and questions?
A given clustering algorithm yields a particular clustering
structure, but with a relevant distance it can be more stable
The perturbations presented can be readily extended (e.g.
using different CDS datasets)
Disclosing stability results is interesting since complex
models often perform poorly (the many parameters are
somewhat overfitted) and cannot be used by practitioners
Correlation+distribution distance (presented in [4]) may work
for your applications (which ones?)
Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
Introduction to financial time series clustering
Empirical results from the clustering stability study
Conclusion
C. Ding and X. He.
K-means clustering via principal component analysis.
In Proceedings of the twenty-first international conference on
Machine learning, page 29. ACM, 2004.
V. Lemieux, P. S. Rahmdel, R. Walker, B. Wong, and
M. Flood.
Clustering techniques and their effect on portfolio formation
and risk analysis.
In Proceedings of the International Workshop on Data Science
for Macro-Modeling, pages 1–6. ACM, 2014.
R. N. Mantegna and H. E. Stanley.
Introduction to econophysics: correlations and complexity in
finance.
Cambridge university press, 1999.
G. Marti, P. Very, and P. Donnat.
Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
Introduction to financial time series clustering
Empirical results from the clustering stability study
Conclusion
Toward a generic representation of random variables for
machine learning.
Pattern Recognition Letters, 2015.
Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series

More Related Content

What's hot

[DL輪読会]Deep Learning 第4章 数値計算
[DL輪読会]Deep Learning 第4章 数値計算[DL輪読会]Deep Learning 第4章 数値計算
[DL輪読会]Deep Learning 第4章 数値計算Deep Learning JP
 
Neural text-to-speech and voice conversion
Neural text-to-speech and voice conversionNeural text-to-speech and voice conversion
Neural text-to-speech and voice conversionYuki Saito
 
分布あるいはモーメント間距離最小化に基づく統計的音声合成
分布あるいはモーメント間距離最小化に基づく統計的音声合成分布あるいはモーメント間距離最小化に基づく統計的音声合成
分布あるいはモーメント間距離最小化に基づく統計的音声合成Shinnosuke Takamichi
 
GAN-based statistical speech synthesis (in Japanese)
GAN-based statistical speech synthesis (in Japanese)GAN-based statistical speech synthesis (in Japanese)
GAN-based statistical speech synthesis (in Japanese)Yuki Saito
 
[DL輪読会]Deep Learning 第20章 深層生成モデル
[DL輪読会]Deep Learning 第20章 深層生成モデル[DL輪読会]Deep Learning 第20章 深層生成モデル
[DL輪読会]Deep Learning 第20章 深層生成モデルDeep Learning JP
 
第1回文献紹介勉強会20140826
第1回文献紹介勉強会20140826第1回文献紹介勉強会20140826
第1回文献紹介勉強会20140826Masakazu Sano
 
(DL hacks輪読) Difference Target Propagation
(DL hacks輪読) Difference Target Propagation(DL hacks輪読) Difference Target Propagation
(DL hacks輪読) Difference Target PropagationMasahiro Suzuki
 
Rainbow의 혈관 속 탐험 (The Rainbow's adventure in the vessel) (RL Korea)
Rainbow의 혈관 속 탐험 (The Rainbow's adventure in the vessel) (RL Korea)Rainbow의 혈관 속 탐험 (The Rainbow's adventure in the vessel) (RL Korea)
Rainbow의 혈관 속 탐험 (The Rainbow's adventure in the vessel) (RL Korea)Kyunghwan Kim
 
[Ridge-i 論文読み会] ICLR2019における不完全ラベル学習
[Ridge-i 論文読み会] ICLR2019における不完全ラベル学習[Ridge-i 論文読み会] ICLR2019における不完全ラベル学習
[Ridge-i 論文読み会] ICLR2019における不完全ラベル学習Masanari Kimura
 
Sliced Wasserstein距離と生成モデル
Sliced Wasserstein距離と生成モデルSliced Wasserstein距離と生成モデル
Sliced Wasserstein距離と生成モデルohken
 
8.4 グラフィカルモデルによる推論
8.4 グラフィカルモデルによる推論8.4 グラフィカルモデルによる推論
8.4 グラフィカルモデルによる推論sleepy_yoshi
 
[DL輪読会]Scalable Training of Inference Networks for Gaussian-Process Models
[DL輪読会]Scalable Training of Inference Networks for Gaussian-Process Models[DL輪読会]Scalable Training of Inference Networks for Gaussian-Process Models
[DL輪読会]Scalable Training of Inference Networks for Gaussian-Process ModelsDeep Learning JP
 
最適輸送の解き方
最適輸送の解き方最適輸送の解き方
最適輸送の解き方joisino
 
大学生及び大学院生の研究時間とメンタルヘルス
大学生及び大学院生の研究時間とメンタルヘルス大学生及び大学院生の研究時間とメンタルヘルス
大学生及び大学院生の研究時間とメンタルヘルスAtsuto ONODA
 
Graphic Notes on Linear Algebra and Data Science
Graphic Notes on Linear Algebra and Data ScienceGraphic Notes on Linear Algebra and Data Science
Graphic Notes on Linear Algebra and Data ScienceKenji Hiranabe
 
「世界モデル」と関連研究について
「世界モデル」と関連研究について「世界モデル」と関連研究について
「世界モデル」と関連研究についてMasahiro Suzuki
 
20170422 数学カフェ Part2
20170422 数学カフェ Part220170422 数学カフェ Part2
20170422 数学カフェ Part2Kenta Oono
 
行列計算を利用したデータ解析技術
行列計算を利用したデータ解析技術行列計算を利用したデータ解析技術
行列計算を利用したデータ解析技術Yoshihiro Mizoguchi
 

What's hot (20)

[DL輪読会]Deep Learning 第4章 数値計算
[DL輪読会]Deep Learning 第4章 数値計算[DL輪読会]Deep Learning 第4章 数値計算
[DL輪読会]Deep Learning 第4章 数値計算
 
Neural text-to-speech and voice conversion
Neural text-to-speech and voice conversionNeural text-to-speech and voice conversion
Neural text-to-speech and voice conversion
 
Up to GLOW
Up to GLOWUp to GLOW
Up to GLOW
 
分布あるいはモーメント間距離最小化に基づく統計的音声合成
分布あるいはモーメント間距離最小化に基づく統計的音声合成分布あるいはモーメント間距離最小化に基づく統計的音声合成
分布あるいはモーメント間距離最小化に基づく統計的音声合成
 
GAN-based statistical speech synthesis (in Japanese)
GAN-based statistical speech synthesis (in Japanese)GAN-based statistical speech synthesis (in Japanese)
GAN-based statistical speech synthesis (in Japanese)
 
[DL輪読会]Deep Learning 第20章 深層生成モデル
[DL輪読会]Deep Learning 第20章 深層生成モデル[DL輪読会]Deep Learning 第20章 深層生成モデル
[DL輪読会]Deep Learning 第20章 深層生成モデル
 
第1回文献紹介勉強会20140826
第1回文献紹介勉強会20140826第1回文献紹介勉強会20140826
第1回文献紹介勉強会20140826
 
(DL hacks輪読) Difference Target Propagation
(DL hacks輪読) Difference Target Propagation(DL hacks輪読) Difference Target Propagation
(DL hacks輪読) Difference Target Propagation
 
Rainbow의 혈관 속 탐험 (The Rainbow's adventure in the vessel) (RL Korea)
Rainbow의 혈관 속 탐험 (The Rainbow's adventure in the vessel) (RL Korea)Rainbow의 혈관 속 탐험 (The Rainbow's adventure in the vessel) (RL Korea)
Rainbow의 혈관 속 탐험 (The Rainbow's adventure in the vessel) (RL Korea)
 
[Ridge-i 論文読み会] ICLR2019における不完全ラベル学習
[Ridge-i 論文読み会] ICLR2019における不完全ラベル学習[Ridge-i 論文読み会] ICLR2019における不完全ラベル学習
[Ridge-i 論文読み会] ICLR2019における不完全ラベル学習
 
大規模凸最適化問題に対する勾配法
大規模凸最適化問題に対する勾配法大規模凸最適化問題に対する勾配法
大規模凸最適化問題に対する勾配法
 
Sliced Wasserstein距離と生成モデル
Sliced Wasserstein距離と生成モデルSliced Wasserstein距離と生成モデル
Sliced Wasserstein距離と生成モデル
 
8.4 グラフィカルモデルによる推論
8.4 グラフィカルモデルによる推論8.4 グラフィカルモデルによる推論
8.4 グラフィカルモデルによる推論
 
[DL輪読会]Scalable Training of Inference Networks for Gaussian-Process Models
[DL輪読会]Scalable Training of Inference Networks for Gaussian-Process Models[DL輪読会]Scalable Training of Inference Networks for Gaussian-Process Models
[DL輪読会]Scalable Training of Inference Networks for Gaussian-Process Models
 
最適輸送の解き方
最適輸送の解き方最適輸送の解き方
最適輸送の解き方
 
大学生及び大学院生の研究時間とメンタルヘルス
大学生及び大学院生の研究時間とメンタルヘルス大学生及び大学院生の研究時間とメンタルヘルス
大学生及び大学院生の研究時間とメンタルヘルス
 
Graphic Notes on Linear Algebra and Data Science
Graphic Notes on Linear Algebra and Data ScienceGraphic Notes on Linear Algebra and Data Science
Graphic Notes on Linear Algebra and Data Science
 
「世界モデル」と関連研究について
「世界モデル」と関連研究について「世界モデル」と関連研究について
「世界モデル」と関連研究について
 
20170422 数学カフェ Part2
20170422 数学カフェ Part220170422 数学カフェ Part2
20170422 数学カフェ Part2
 
行列計算を利用したデータ解析技術
行列計算を利用したデータ解析技術行列計算を利用したデータ解析技術
行列計算を利用したデータ解析技術
 

Viewers also liked

IBM - Security Intelligence para PYMES
IBM - Security Intelligence para PYMESIBM - Security Intelligence para PYMES
IBM - Security Intelligence para PYMESFernando M. Imperiale
 
Health & safety officer performance appraisal
Health & safety officer performance appraisalHealth & safety officer performance appraisal
Health & safety officer performance appraisalsandersjamie999
 
Nutrifit parcial vane
Nutrifit parcial vaneNutrifit parcial vane
Nutrifit parcial vanevanessaghia12
 
Fernando Imperiale - Una aguja en el pajar
Fernando Imperiale - Una aguja en el pajarFernando Imperiale - Una aguja en el pajar
Fernando Imperiale - Una aguja en el pajarFernando M. Imperiale
 
Yazeed kay-ghazi
Yazeed kay-ghaziYazeed kay-ghazi
Yazeed kay-ghaziYounas Aziz
 
Yasemin yilmazer latifepalta_zeynepucar
Yasemin yilmazer latifepalta_zeynepucarYasemin yilmazer latifepalta_zeynepucar
Yasemin yilmazer latifepalta_zeynepucarzeynepucarr
 
Fernando Imperiale - Security Intelligence para PYMES
Fernando Imperiale - Security Intelligence para PYMESFernando Imperiale - Security Intelligence para PYMES
Fernando Imperiale - Security Intelligence para PYMESFernando M. Imperiale
 
Geography 372 Final Presentation
Geography 372 Final PresentationGeography 372 Final Presentation
Geography 372 Final PresentationMac Ferrick
 
Clustering CDS: algorithms, distances, stability and convergence rates
Clustering CDS: algorithms, distances, stability and convergence ratesClustering CDS: algorithms, distances, stability and convergence rates
Clustering CDS: algorithms, distances, stability and convergence ratesGautier Marti
 
Here be dragons
Here be dragonsHere be dragons
Here be dragonsdeelay1
 
Diapo bourse aux sports
Diapo bourse aux sportsDiapo bourse aux sports
Diapo bourse aux sportsmfrfye
 
Searching for the grey gold - 2013
Searching for the grey gold - 2013Searching for the grey gold - 2013
Searching for the grey gold - 2013Olle Bergendahl
 

Viewers also liked (19)

Magento News @ Magento Meetup Wien 17
Magento News @ Magento Meetup Wien 17Magento News @ Magento Meetup Wien 17
Magento News @ Magento Meetup Wien 17
 
IBM - Security Intelligence para PYMES
IBM - Security Intelligence para PYMESIBM - Security Intelligence para PYMES
IBM - Security Intelligence para PYMES
 
Health & safety officer performance appraisal
Health & safety officer performance appraisalHealth & safety officer performance appraisal
Health & safety officer performance appraisal
 
Nutrifit parcial vane
Nutrifit parcial vaneNutrifit parcial vane
Nutrifit parcial vane
 
bala.resume
bala.resumebala.resume
bala.resume
 
Prezentacja1
Prezentacja1Prezentacja1
Prezentacja1
 
Fernando Imperiale - Una aguja en el pajar
Fernando Imperiale - Una aguja en el pajarFernando Imperiale - Una aguja en el pajar
Fernando Imperiale - Una aguja en el pajar
 
Prabhu Sundaramurthi (4)
Prabhu Sundaramurthi (4)Prabhu Sundaramurthi (4)
Prabhu Sundaramurthi (4)
 
Yazeed kay-ghazi
Yazeed kay-ghaziYazeed kay-ghazi
Yazeed kay-ghazi
 
Yasemin yilmazer latifepalta_zeynepucar
Yasemin yilmazer latifepalta_zeynepucarYasemin yilmazer latifepalta_zeynepucar
Yasemin yilmazer latifepalta_zeynepucar
 
Fernando Imperiale - Security Intelligence para PYMES
Fernando Imperiale - Security Intelligence para PYMESFernando Imperiale - Security Intelligence para PYMES
Fernando Imperiale - Security Intelligence para PYMES
 
Geography 372 Final Presentation
Geography 372 Final PresentationGeography 372 Final Presentation
Geography 372 Final Presentation
 
Clustering CDS: algorithms, distances, stability and convergence rates
Clustering CDS: algorithms, distances, stability and convergence ratesClustering CDS: algorithms, distances, stability and convergence rates
Clustering CDS: algorithms, distances, stability and convergence rates
 
Here be dragons
Here be dragonsHere be dragons
Here be dragons
 
Prevenzione
PrevenzionePrevenzione
Prevenzione
 
Diapo bourse aux sports
Diapo bourse aux sportsDiapo bourse aux sports
Diapo bourse aux sports
 
EColi_CaseStudyRoughDraft.docx
EColi_CaseStudyRoughDraft.docxEColi_CaseStudyRoughDraft.docx
EColi_CaseStudyRoughDraft.docx
 
Searching for the grey gold - 2013
Searching for the grey gold - 2013Searching for the grey gold - 2013
Searching for the grey gold - 2013
 
NSO_cv_20160511
NSO_cv_20160511NSO_cv_20160511
NSO_cv_20160511
 

Similar to On the stability of clustering financial time series

Dr. Syed Muhammad Ali Tirmizi - Special topics in finance lec 5
Dr. Syed Muhammad Ali Tirmizi - Special topics in finance   lec 5Dr. Syed Muhammad Ali Tirmizi - Special topics in finance   lec 5
Dr. Syed Muhammad Ali Tirmizi - Special topics in finance lec 5Dr. Muhammad Ali Tirmizi., Ph.D.
 
Clustering Financial Time Series: How Long is Enough?
Clustering Financial Time Series: How Long is Enough?Clustering Financial Time Series: How Long is Enough?
Clustering Financial Time Series: How Long is Enough?Gautier Marti
 
Financial Time Series Analysis Using R
Financial Time Series Analysis Using RFinancial Time Series Analysis Using R
Financial Time Series Analysis Using RMajeed Simaan
 
On clustering financial time series - A need for distances between dependent ...
On clustering financial time series - A need for distances between dependent ...On clustering financial time series - A need for distances between dependent ...
On clustering financial time series - A need for distances between dependent ...Gautier Marti
 
The dangers of policy experiments Initial beliefs under adaptive learning
The dangers of policy experiments Initial beliefs under adaptive learningThe dangers of policy experiments Initial beliefs under adaptive learning
The dangers of policy experiments Initial beliefs under adaptive learningGRAPE
 
Putting the cycle back into business cycle analysis
Putting the cycle back into business cycle analysisPutting the cycle back into business cycle analysis
Putting the cycle back into business cycle analysisADEMU_Project
 
Banque de France's Workshop on Granularity: Basile Grassi's slides, June 2016
Banque de France's Workshop on Granularity: Basile Grassi's slides, June 2016 Banque de France's Workshop on Granularity: Basile Grassi's slides, June 2016
Banque de France's Workshop on Granularity: Basile Grassi's slides, June 2016 Soledad Zignago
 
03.time series presentation
03.time series presentation03.time series presentation
03.time series presentationDr. Hari Arora
 
The Cyclical Behavior of the Markups in the New Keynesian Models
The Cyclical Behavior of the Markups in the New Keynesian ModelsThe Cyclical Behavior of the Markups in the New Keynesian Models
The Cyclical Behavior of the Markups in the New Keynesian ModelsJEAN BLAISE NLEMFU MUKOKO
 
Multivariate high-order-fuzzy-time-series-forecasting-for-car-road-accidents
Multivariate high-order-fuzzy-time-series-forecasting-for-car-road-accidentsMultivariate high-order-fuzzy-time-series-forecasting-for-car-road-accidents
Multivariate high-order-fuzzy-time-series-forecasting-for-car-road-accidentsCemal Ardil
 
Cointegration among biotech stocks
Cointegration among biotech stocksCointegration among biotech stocks
Cointegration among biotech stocksPeter Zobel
 
Forecasting Slides
Forecasting SlidesForecasting Slides
Forecasting Slidesknksmart
 
A Framework for Analyzing the Impact of Business Cycles on Endogenous Growth
A Framework for Analyzing the Impact of Business Cycles on Endogenous GrowthA Framework for Analyzing the Impact of Business Cycles on Endogenous Growth
A Framework for Analyzing the Impact of Business Cycles on Endogenous GrowthGRAPE
 
Bagging-Clustering Methods to Forecast Time Series
Bagging-Clustering Methods to Forecast Time SeriesBagging-Clustering Methods to Forecast Time Series
Bagging-Clustering Methods to Forecast Time SeriesTiago Mendes Dantas
 
Enterprise_Planning_TimeSeries_And_Components
Enterprise_Planning_TimeSeries_And_ComponentsEnterprise_Planning_TimeSeries_And_Components
Enterprise_Planning_TimeSeries_And_Componentsnanfei
 
"Correlated Volatility Shocks" by Dr. Xiao Qiao, Researcher at SummerHaven In...
"Correlated Volatility Shocks" by Dr. Xiao Qiao, Researcher at SummerHaven In..."Correlated Volatility Shocks" by Dr. Xiao Qiao, Researcher at SummerHaven In...
"Correlated Volatility Shocks" by Dr. Xiao Qiao, Researcher at SummerHaven In...Quantopian
 
Statistical Arbitrage Pairs Trading, Long-Short Strategy
Statistical Arbitrage Pairs Trading, Long-Short StrategyStatistical Arbitrage Pairs Trading, Long-Short Strategy
Statistical Arbitrage Pairs Trading, Long-Short Strategyz-score
 
Demand forecasting methods2 gp
Demand forecasting methods2 gpDemand forecasting methods2 gp
Demand forecasting methods2 gpPUTTU GURU PRASAD
 

Similar to On the stability of clustering financial time series (20)

Master_Thesis_Harihara_Subramanyam_Sreenivasan
Master_Thesis_Harihara_Subramanyam_SreenivasanMaster_Thesis_Harihara_Subramanyam_Sreenivasan
Master_Thesis_Harihara_Subramanyam_Sreenivasan
 
Econometrics
EconometricsEconometrics
Econometrics
 
Dr. Syed Muhammad Ali Tirmizi - Special topics in finance lec 5
Dr. Syed Muhammad Ali Tirmizi - Special topics in finance   lec 5Dr. Syed Muhammad Ali Tirmizi - Special topics in finance   lec 5
Dr. Syed Muhammad Ali Tirmizi - Special topics in finance lec 5
 
Clustering Financial Time Series: How Long is Enough?
Clustering Financial Time Series: How Long is Enough?Clustering Financial Time Series: How Long is Enough?
Clustering Financial Time Series: How Long is Enough?
 
Financial Time Series Analysis Using R
Financial Time Series Analysis Using RFinancial Time Series Analysis Using R
Financial Time Series Analysis Using R
 
On clustering financial time series - A need for distances between dependent ...
On clustering financial time series - A need for distances between dependent ...On clustering financial time series - A need for distances between dependent ...
On clustering financial time series - A need for distances between dependent ...
 
The dangers of policy experiments Initial beliefs under adaptive learning
The dangers of policy experiments Initial beliefs under adaptive learningThe dangers of policy experiments Initial beliefs under adaptive learning
The dangers of policy experiments Initial beliefs under adaptive learning
 
Putting the cycle back into business cycle analysis
Putting the cycle back into business cycle analysisPutting the cycle back into business cycle analysis
Putting the cycle back into business cycle analysis
 
Banque de France's Workshop on Granularity: Basile Grassi's slides, June 2016
Banque de France's Workshop on Granularity: Basile Grassi's slides, June 2016 Banque de France's Workshop on Granularity: Basile Grassi's slides, June 2016
Banque de France's Workshop on Granularity: Basile Grassi's slides, June 2016
 
03.time series presentation
03.time series presentation03.time series presentation
03.time series presentation
 
The Cyclical Behavior of the Markups in the New Keynesian Models
The Cyclical Behavior of the Markups in the New Keynesian ModelsThe Cyclical Behavior of the Markups in the New Keynesian Models
The Cyclical Behavior of the Markups in the New Keynesian Models
 
Multivariate high-order-fuzzy-time-series-forecasting-for-car-road-accidents
Multivariate high-order-fuzzy-time-series-forecasting-for-car-road-accidentsMultivariate high-order-fuzzy-time-series-forecasting-for-car-road-accidents
Multivariate high-order-fuzzy-time-series-forecasting-for-car-road-accidents
 
Cointegration among biotech stocks
Cointegration among biotech stocksCointegration among biotech stocks
Cointegration among biotech stocks
 
Forecasting Slides
Forecasting SlidesForecasting Slides
Forecasting Slides
 
A Framework for Analyzing the Impact of Business Cycles on Endogenous Growth
A Framework for Analyzing the Impact of Business Cycles on Endogenous GrowthA Framework for Analyzing the Impact of Business Cycles on Endogenous Growth
A Framework for Analyzing the Impact of Business Cycles on Endogenous Growth
 
Bagging-Clustering Methods to Forecast Time Series
Bagging-Clustering Methods to Forecast Time SeriesBagging-Clustering Methods to Forecast Time Series
Bagging-Clustering Methods to Forecast Time Series
 
Enterprise_Planning_TimeSeries_And_Components
Enterprise_Planning_TimeSeries_And_ComponentsEnterprise_Planning_TimeSeries_And_Components
Enterprise_Planning_TimeSeries_And_Components
 
"Correlated Volatility Shocks" by Dr. Xiao Qiao, Researcher at SummerHaven In...
"Correlated Volatility Shocks" by Dr. Xiao Qiao, Researcher at SummerHaven In..."Correlated Volatility Shocks" by Dr. Xiao Qiao, Researcher at SummerHaven In...
"Correlated Volatility Shocks" by Dr. Xiao Qiao, Researcher at SummerHaven In...
 
Statistical Arbitrage Pairs Trading, Long-Short Strategy
Statistical Arbitrage Pairs Trading, Long-Short StrategyStatistical Arbitrage Pairs Trading, Long-Short Strategy
Statistical Arbitrage Pairs Trading, Long-Short Strategy
 
Demand forecasting methods2 gp
Demand forecasting methods2 gpDemand forecasting methods2 gp
Demand forecasting methods2 gp
 

More from Gautier Marti

Using Large Language Models in 10 Lines of Code
Using Large Language Models in 10 Lines of CodeUsing Large Language Models in 10 Lines of Code
Using Large Language Models in 10 Lines of CodeGautier Marti
 
What deep learning can bring to...
What deep learning can bring to...What deep learning can bring to...
What deep learning can bring to...Gautier Marti
 
A quick demo of Top2Vec With application on 2020 10-K business descriptions
A quick demo of Top2Vec With application on 2020 10-K business descriptionsA quick demo of Top2Vec With application on 2020 10-K business descriptions
A quick demo of Top2Vec With application on 2020 10-K business descriptionsGautier Marti
 
cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...
cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...
cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...Gautier Marti
 
How deep generative models can help quants reduce the risk of overfitting?
How deep generative models can help quants reduce the risk of overfitting?How deep generative models can help quants reduce the risk of overfitting?
How deep generative models can help quants reduce the risk of overfitting?Gautier Marti
 
Generating Realistic Synthetic Data in Finance
Generating Realistic Synthetic Data in FinanceGenerating Realistic Synthetic Data in Finance
Generating Realistic Synthetic Data in FinanceGautier Marti
 
Applications of GANs in Finance
Applications of GANs in FinanceApplications of GANs in Finance
Applications of GANs in FinanceGautier Marti
 
My recent attempts at using GANs for simulating realistic stocks returns
My recent attempts at using GANs for simulating realistic stocks returnsMy recent attempts at using GANs for simulating realistic stocks returns
My recent attempts at using GANs for simulating realistic stocks returnsGautier Marti
 
Takeaways from ICML 2019, Long Beach, California
Takeaways from ICML 2019, Long Beach, CaliforniaTakeaways from ICML 2019, Long Beach, California
Takeaways from ICML 2019, Long Beach, CaliforniaGautier Marti
 
A review of two decades of correlations, hierarchies, networks and clustering...
A review of two decades of correlations, hierarchies, networks and clustering...A review of two decades of correlations, hierarchies, networks and clustering...
A review of two decades of correlations, hierarchies, networks and clustering...Gautier Marti
 
Autoregressive Convolutional Neural Networks for Asynchronous Time Series
Autoregressive Convolutional Neural Networks for Asynchronous Time SeriesAutoregressive Convolutional Neural Networks for Asynchronous Time Series
Autoregressive Convolutional Neural Networks for Asynchronous Time SeriesGautier Marti
 
Clustering Financial Time Series using their Correlations and their Distribut...
Clustering Financial Time Series using their Correlations and their Distribut...Clustering Financial Time Series using their Correlations and their Distribut...
Clustering Financial Time Series using their Correlations and their Distribut...Gautier Marti
 
A closer look at correlations
A closer look at correlationsA closer look at correlations
A closer look at correlationsGautier Marti
 
Optimal Transport vs. Fisher-Rao distance between Copulas
Optimal Transport vs. Fisher-Rao distance between CopulasOptimal Transport vs. Fisher-Rao distance between Copulas
Optimal Transport vs. Fisher-Rao distance between CopulasGautier Marti
 
On Clustering Financial Time Series - Beyond Correlation
On Clustering Financial Time Series - Beyond CorrelationOn Clustering Financial Time Series - Beyond Correlation
On Clustering Financial Time Series - Beyond CorrelationGautier Marti
 
Optimal Transport between Copulas for Clustering Time Series
Optimal Transport between Copulas for Clustering Time SeriesOptimal Transport between Copulas for Clustering Time Series
Optimal Transport between Copulas for Clustering Time SeriesGautier Marti
 
Clustering Random Walk Time Series
Clustering Random Walk Time SeriesClustering Random Walk Time Series
Clustering Random Walk Time SeriesGautier Marti
 

More from Gautier Marti (17)

Using Large Language Models in 10 Lines of Code
Using Large Language Models in 10 Lines of CodeUsing Large Language Models in 10 Lines of Code
Using Large Language Models in 10 Lines of Code
 
What deep learning can bring to...
What deep learning can bring to...What deep learning can bring to...
What deep learning can bring to...
 
A quick demo of Top2Vec With application on 2020 10-K business descriptions
A quick demo of Top2Vec With application on 2020 10-K business descriptionsA quick demo of Top2Vec With application on 2020 10-K business descriptions
A quick demo of Top2Vec With application on 2020 10-K business descriptions
 
cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...
cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...
cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...
 
How deep generative models can help quants reduce the risk of overfitting?
How deep generative models can help quants reduce the risk of overfitting?How deep generative models can help quants reduce the risk of overfitting?
How deep generative models can help quants reduce the risk of overfitting?
 
Generating Realistic Synthetic Data in Finance
Generating Realistic Synthetic Data in FinanceGenerating Realistic Synthetic Data in Finance
Generating Realistic Synthetic Data in Finance
 
Applications of GANs in Finance
Applications of GANs in FinanceApplications of GANs in Finance
Applications of GANs in Finance
 
My recent attempts at using GANs for simulating realistic stocks returns
My recent attempts at using GANs for simulating realistic stocks returnsMy recent attempts at using GANs for simulating realistic stocks returns
My recent attempts at using GANs for simulating realistic stocks returns
 
Takeaways from ICML 2019, Long Beach, California
Takeaways from ICML 2019, Long Beach, CaliforniaTakeaways from ICML 2019, Long Beach, California
Takeaways from ICML 2019, Long Beach, California
 
A review of two decades of correlations, hierarchies, networks and clustering...
A review of two decades of correlations, hierarchies, networks and clustering...A review of two decades of correlations, hierarchies, networks and clustering...
A review of two decades of correlations, hierarchies, networks and clustering...
 
Autoregressive Convolutional Neural Networks for Asynchronous Time Series
Autoregressive Convolutional Neural Networks for Asynchronous Time SeriesAutoregressive Convolutional Neural Networks for Asynchronous Time Series
Autoregressive Convolutional Neural Networks for Asynchronous Time Series
 
Clustering Financial Time Series using their Correlations and their Distribut...
Clustering Financial Time Series using their Correlations and their Distribut...Clustering Financial Time Series using their Correlations and their Distribut...
Clustering Financial Time Series using their Correlations and their Distribut...
 
A closer look at correlations
A closer look at correlationsA closer look at correlations
A closer look at correlations
 
Optimal Transport vs. Fisher-Rao distance between Copulas
Optimal Transport vs. Fisher-Rao distance between CopulasOptimal Transport vs. Fisher-Rao distance between Copulas
Optimal Transport vs. Fisher-Rao distance between Copulas
 
On Clustering Financial Time Series - Beyond Correlation
On Clustering Financial Time Series - Beyond CorrelationOn Clustering Financial Time Series - Beyond Correlation
On Clustering Financial Time Series - Beyond Correlation
 
Optimal Transport between Copulas for Clustering Time Series
Optimal Transport between Copulas for Clustering Time SeriesOptimal Transport between Copulas for Clustering Time Series
Optimal Transport between Copulas for Clustering Time Series
 
Clustering Random Walk Time Series
Clustering Random Walk Time SeriesClustering Random Walk Time Series
Clustering Random Walk Time Series
 

Recently uploaded

Advances in AI-driven Image Recognition for Early Detection of Cancer
Advances in AI-driven Image Recognition for Early Detection of CancerAdvances in AI-driven Image Recognition for Early Detection of Cancer
Advances in AI-driven Image Recognition for Early Detection of CancerLuis Miguel Chong Chong
 
Probability.pptx, Types of Probability, UG
Probability.pptx, Types of Probability, UGProbability.pptx, Types of Probability, UG
Probability.pptx, Types of Probability, UGSoniaBajaj10
 
CHROMATOGRAPHY PALLAVI RAWAT.pptx
CHROMATOGRAPHY  PALLAVI RAWAT.pptxCHROMATOGRAPHY  PALLAVI RAWAT.pptx
CHROMATOGRAPHY PALLAVI RAWAT.pptxpallavirawat456
 
BACTERIAL DEFENSE SYSTEM by Dr. Chayanika Das
BACTERIAL DEFENSE SYSTEM by Dr. Chayanika DasBACTERIAL DEFENSE SYSTEM by Dr. Chayanika Das
BACTERIAL DEFENSE SYSTEM by Dr. Chayanika DasChayanika Das
 
Total Legal: A “Joint” Journey into the Chemistry of Cannabinoids
Total Legal: A “Joint” Journey into the Chemistry of CannabinoidsTotal Legal: A “Joint” Journey into the Chemistry of Cannabinoids
Total Legal: A “Joint” Journey into the Chemistry of CannabinoidsMarkus Roggen
 
Environmental Acoustics- Speech interference level, acoustics calibrator.pptx
Environmental Acoustics- Speech interference level, acoustics calibrator.pptxEnvironmental Acoustics- Speech interference level, acoustics calibrator.pptx
Environmental Acoustics- Speech interference level, acoustics calibrator.pptxpriyankatabhane
 
GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024Jene van der Heide
 
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...Christina Parmionova
 
FBI Profiling - Forensic Psychology.pptx
FBI Profiling - Forensic Psychology.pptxFBI Profiling - Forensic Psychology.pptx
FBI Profiling - Forensic Psychology.pptxPayal Shrivastava
 
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdf
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdfKDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdf
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdfGABYFIORELAMALPARTID1
 
Measures of Central Tendency.pptx for UG
Measures of Central Tendency.pptx for UGMeasures of Central Tendency.pptx for UG
Measures of Central Tendency.pptx for UGSoniaBajaj10
 
Pests of Sunflower_Binomics_Identification_Dr.UPR
Pests of Sunflower_Binomics_Identification_Dr.UPRPests of Sunflower_Binomics_Identification_Dr.UPR
Pests of Sunflower_Binomics_Identification_Dr.UPRPirithiRaju
 
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learning
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep LearningCombining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learning
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learningvschiavoni
 
Gas-ExchangeS-in-Plants-and-Animals.pptx
Gas-ExchangeS-in-Plants-and-Animals.pptxGas-ExchangeS-in-Plants-and-Animals.pptx
Gas-ExchangeS-in-Plants-and-Animals.pptxGiovaniTrinidad
 
ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...
ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...
ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...Chayanika Das
 

Recently uploaded (20)

PLASMODIUM. PPTX
PLASMODIUM. PPTXPLASMODIUM. PPTX
PLASMODIUM. PPTX
 
Advances in AI-driven Image Recognition for Early Detection of Cancer
Advances in AI-driven Image Recognition for Early Detection of CancerAdvances in AI-driven Image Recognition for Early Detection of Cancer
Advances in AI-driven Image Recognition for Early Detection of Cancer
 
Probability.pptx, Types of Probability, UG
Probability.pptx, Types of Probability, UGProbability.pptx, Types of Probability, UG
Probability.pptx, Types of Probability, UG
 
CHROMATOGRAPHY PALLAVI RAWAT.pptx
CHROMATOGRAPHY  PALLAVI RAWAT.pptxCHROMATOGRAPHY  PALLAVI RAWAT.pptx
CHROMATOGRAPHY PALLAVI RAWAT.pptx
 
BACTERIAL DEFENSE SYSTEM by Dr. Chayanika Das
BACTERIAL DEFENSE SYSTEM by Dr. Chayanika DasBACTERIAL DEFENSE SYSTEM by Dr. Chayanika Das
BACTERIAL DEFENSE SYSTEM by Dr. Chayanika Das
 
Total Legal: A “Joint” Journey into the Chemistry of Cannabinoids
Total Legal: A “Joint” Journey into the Chemistry of CannabinoidsTotal Legal: A “Joint” Journey into the Chemistry of Cannabinoids
Total Legal: A “Joint” Journey into the Chemistry of Cannabinoids
 
Environmental Acoustics- Speech interference level, acoustics calibrator.pptx
Environmental Acoustics- Speech interference level, acoustics calibrator.pptxEnvironmental Acoustics- Speech interference level, acoustics calibrator.pptx
Environmental Acoustics- Speech interference level, acoustics calibrator.pptx
 
GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024
 
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
Charateristics of the Angara-A5 spacecraft launched from the Vostochny Cosmod...
 
Introduction Classification Of Alkaloids
Introduction Classification Of AlkaloidsIntroduction Classification Of Alkaloids
Introduction Classification Of Alkaloids
 
FBI Profiling - Forensic Psychology.pptx
FBI Profiling - Forensic Psychology.pptxFBI Profiling - Forensic Psychology.pptx
FBI Profiling - Forensic Psychology.pptx
 
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdf
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdfKDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdf
KDIGO-2023-CKD-Guideline-Public-Review-Draft_5-July-2023.pdf
 
Interferons.pptx.
Interferons.pptx.Interferons.pptx.
Interferons.pptx.
 
AZOTOBACTER AS BIOFERILIZER.PPTX
AZOTOBACTER AS BIOFERILIZER.PPTXAZOTOBACTER AS BIOFERILIZER.PPTX
AZOTOBACTER AS BIOFERILIZER.PPTX
 
Measures of Central Tendency.pptx for UG
Measures of Central Tendency.pptx for UGMeasures of Central Tendency.pptx for UG
Measures of Central Tendency.pptx for UG
 
Pests of Sunflower_Binomics_Identification_Dr.UPR
Pests of Sunflower_Binomics_Identification_Dr.UPRPests of Sunflower_Binomics_Identification_Dr.UPR
Pests of Sunflower_Binomics_Identification_Dr.UPR
 
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learning
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep LearningCombining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learning
Combining Asynchronous Task Parallelism and Intel SGX for Secure Deep Learning
 
Ultrastructure and functions of Chloroplast.pptx
Ultrastructure and functions of Chloroplast.pptxUltrastructure and functions of Chloroplast.pptx
Ultrastructure and functions of Chloroplast.pptx
 
Gas-ExchangeS-in-Plants-and-Animals.pptx
Gas-ExchangeS-in-Plants-and-Animals.pptxGas-ExchangeS-in-Plants-and-Animals.pptx
Gas-ExchangeS-in-Plants-and-Animals.pptx
 
ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...
ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...
ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...
 

On the stability of clustering financial time series

  • 1. Introduction to financial time series clustering Empirical results from the clustering stability study Conclusion On the Stability of Clustering Financial Time Series – How to investigate? IEEE ICMLA Miami, Florida, USA, December 9-11, 2015 Gautier Marti, Philippe Very, Philippe Donnat, Frank Nielsen 9 December 2015 Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
  • 2. Introduction to financial time series clustering Empirical results from the clustering stability study Conclusion 1 Introduction to financial time series clustering 2 Empirical results from the clustering stability study 3 Conclusion Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
  • 3. Introduction to financial time series clustering Empirical results from the clustering stability study Conclusion Financial time series (data from www.datagrapple.com) Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
  • 4. Introduction to financial time series clustering Empirical results from the clustering stability study Conclusion Clustering? Definition Clustering is the task of grouping a set of objects in such a way that objects in the same group (cluster) are more similar to each other than those in different groups. French banks (blue) and building materials (red) CDS over 2006-2015 Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
  • 5. Introduction to financial time series clustering Empirical results from the clustering stability study Conclusion Why clustering? Mathematical finance: Use of variance-covariance matrices (e.g., Markowitz, Value-at-Risk) Stylized fact: Empirical variance-covariance matrices estimated on financial time series are very noisy (Random Matrix Theory, Noise Dressing of Financial Correlation Matrices, Laloux et al, 1999) 0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 4.0 λ 0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8 ρ(λ) Marchenko-Pastur distribution vs. empirical eigenvalues distribution of the correlation matrix How to filter these variance-covariance matrices? Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
  • 6. Introduction to financial time series clustering Empirical results from the clustering stability study Conclusion For filtering, clustering! Mantegna (1999) et al’s work: 0 100 200 300 400 500 0 100 200 300 400 500 0 100 200 300 400 500 0 100 200 300 400 500 0 100 200 300 400 500 0 100 200 300 400 500 (left) empirical correlation matrix (center) the same matrix seriated using a hierarchical clustering (right) correlations filtered using the clustering structure N.B. other applications: statarb, alternative risk measures Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
  • 7. Introduction to financial time series clustering Empirical results from the clustering stability study Conclusion Why stability? statistical consistency of the clustering method requires assumptions that may not hold in practice: e.g. returns are i.i.d., underlying elliptical copula, enough data is available stability is a weaker property: reproducibility of results across a wide range of slight data perturbations Clusters obtained at time t, t + 1, t + 2; Is the difference between the successive clusters a“true”signal? Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
  • 8. Introduction to financial time series clustering Empirical results from the clustering stability study Conclusion Is the clustering of financial time series stable? According to [2], clusters are not stable with respect to the clustering algorithm, but only a squared Euclidean distance was considered which is not relevant for clustering assets from their returns (cf. [4]). Idea: A more relevant distance should increase stability We investigate the clustering stability resulting from using: an Euclidean distance a Pearson correlation distance [3] a Spearman correlation distance a distance for comparing two dependent random variables [4] Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
  • 9. Introduction to financial time series clustering Empirical results from the clustering stability study Conclusion Some usual distances for clustering financial time series (Pi t )t≥0 Si t+1 = log Pi t+1 −log Pi t (Si t )t≥1 Euclidean distance: d(Si , Sj ) = T t=1(Si t − Sj t )2 Pearson correl.: ρ(Si , Sj ) = T t=1(Si t −Si )(Sj t −Sj ) T t=1(Si t −Si )2 T t=1(Sj t −Sj )2 Spearman correl.: ρS (Si , Sj ) = 1 − 6 T(T2−1) T t=1(Si (t) − Si (t))2 Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
  • 10. Introduction to financial time series clustering Empirical results from the clustering stability study Conclusion Generic Non-Parametric Distance [4] d2 θ (Xi , Xj ) = θ3E |Pi (Xi ) − Pj (Xj )|2 + (1 − θ) 1 2 R dPi dλ − dPj dλ 2 dλ (i) 0 ≤ dθ ≤ 1, (ii) 0 < θ < 1, dθ metric, (iii) dθ is invariant under diffeomorphism Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
  • 11. Introduction to financial time series clustering Empirical results from the clustering stability study Conclusion Generic Non-Parametric Distance [4] d2 0 : 1 2 R dPi dλ − dPj dλ 2 dλ = Hellinger2 d2 1 : 3E |Pi (Xi ) − Pj (Xj )|2 = 1 − ρS 2 = 2−6 1 0 1 0 C(u, v)dudv Remark: If f (x, θ) = c(F1(x1; ν1), . . . , FN(xN; νN); θc) N i=1 fi (xi ; νi ) then with CML hypothesis ds2 = ds2 copula + N i=1 ds2 margins Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
  • 12. Introduction to financial time series clustering Empirical results from the clustering stability study Conclusion 1 Introduction to financial time series clustering 2 Empirical results from the clustering stability study 3 Conclusion Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
  • 13. Introduction to financial time series clustering Empirical results from the clustering stability study Conclusion Sliding Window PCA stability curve (red) vs. Euclidean Clusters stability curve as a function of time using results from [1] for fair comparison: clusters are more stable most basic perturbation: traders face it everyday when monitoring their indicators we do not want to overfit our analysis to this particular stability goal stability perf.: dist. [4] Spearman Pearson Euclidean Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
  • 14. Introduction to financial time series clustering Empirical results from the clustering stability study Conclusion Odd vs. Even A clustering al- gorithm applied on two samples describing the same phenomenon should yield the same results. How to obtain two of these samples? (un)Stability of clusters with L2 distance Stability of clusters with the proposed distance [4] Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
  • 15. Introduction to financial time series clustering Empirical results from the clustering stability study Conclusion Economic Regimes AXA 5-year CDS spread over 2006-2015 Average of the pairwise correlations; correlation skyrockets during crises Is the clustering structure persistent? Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
  • 16. Introduction to financial time series clustering Empirical results from the clustering stability study Conclusion Economic Regimes Clustering Stability Pearson (top left), Spearman (top right), Euclidean (bottom left), corr+distr (bottom right) Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
  • 17. Introduction to financial time series clustering Empirical results from the clustering stability study Conclusion Heart vs. Tails Clustering Stability ≈ orange+red vs. green+yellow periods Pearson (top left), Spearman (top right), Euclidean (bottom left), corr+distr (bottom right) Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
  • 18. Introduction to financial time series clustering Empirical results from the clustering stability study Conclusion Multiscale Is the clustering structure persistent to different sampling frequencies? Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
  • 19. Introduction to financial time series clustering Empirical results from the clustering stability study Conclusion Multiscale Clustering Stability Pearson (top left), Spearman (top right), Euclidean (bottom left), corr+distr (bottom right) Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
  • 20. Introduction to financial time series clustering Empirical results from the clustering stability study Conclusion Maturities & Term Structure An asset is described by several time series whose dynamics are similar: Nokia Oyj is described here by the cost of insurance against its default for {1, 3, 5, 7, 10} years Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
  • 21. Introduction to financial time series clustering Empirical results from the clustering stability study Conclusion Maturities & Term Structure Clustering Stability Pearson (top left), Spearman (top right), Euclidean (bottom left), corr+distr (bottom right) Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
  • 22. Introduction to financial time series clustering Empirical results from the clustering stability study Conclusion 1 Introduction to financial time series clustering 2 Empirical results from the clustering stability study 3 Conclusion Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
  • 23. Introduction to financial time series clustering Empirical results from the clustering stability study Conclusion Discussion and questions? A given clustering algorithm yields a particular clustering structure, but with a relevant distance it can be more stable The perturbations presented can be readily extended (e.g. using different CDS datasets) Disclosing stability results is interesting since complex models often perform poorly (the many parameters are somewhat overfitted) and cannot be used by practitioners Correlation+distribution distance (presented in [4]) may work for your applications (which ones?) Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
  • 24. Introduction to financial time series clustering Empirical results from the clustering stability study Conclusion C. Ding and X. He. K-means clustering via principal component analysis. In Proceedings of the twenty-first international conference on Machine learning, page 29. ACM, 2004. V. Lemieux, P. S. Rahmdel, R. Walker, B. Wong, and M. Flood. Clustering techniques and their effect on portfolio formation and risk analysis. In Proceedings of the International Workshop on Data Science for Macro-Modeling, pages 1–6. ACM, 2014. R. N. Mantegna and H. E. Stanley. Introduction to econophysics: correlations and complexity in finance. Cambridge university press, 1999. G. Marti, P. Very, and P. Donnat. Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series
  • 25. Introduction to financial time series clustering Empirical results from the clustering stability study Conclusion Toward a generic representation of random variables for machine learning. Pattern Recognition Letters, 2015. Gautier Marti, Philippe Donnat On the Stability of Clustering Financial Time Series