SlideShare a Scribd company logo
1 of 35
Download to read offline
Introduction
How to define a distance between two random walks?
Applications
Conclusion
How to cluster random walks?
Paris Machine Learning #5 Season 2: Time Series and FinTech
Philippe Donnat1 Gautier Marti1,2
Frank Nielsen2 Philippe Very1
1Hellebore Capital Management
2Ecole Polytechnique
th January 
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
1 Introduction
Data Science for the CDS market
How to group random walks?
What is a clustering program?
2 How to define a distance between two random walks?
Standard approach on time series
Comovements and distributions
GNPR: the best of both worlds
3 Applications
Results on synthetic datasets
Clustering Credit Default Swaps
4 Conclusion
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
Data Science for the CDS market
How to group random walks?
What is a clustering program?
1 Introduction
Data Science for the CDS market
How to group random walks?
What is a clustering program?
2 How to define a distance between two random walks?
Standard approach on time series
Comovements and distributions
GNPR: the best of both worlds
3 Applications
Results on synthetic datasets
Clustering Credit Default Swaps
4 Conclusion
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
Data Science for the CDS market
How to group random walks?
What is a clustering program?
Hellebore Capital Management & Data Science
Current R&D projects in Data Science:
Data mining: parsing & natural language processing
Inference: incomplete data sources
Portfolio & Risk analysis: understanding joint behaviours
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
Data Science for the CDS market
How to group random walks?
What is a clustering program?
Do you see clusters?
Random walks
French banks and building materials
CDS over 2006-2010
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
Data Science for the CDS market
How to group random walks?
What is a clustering program?
Do you see clusters?
Random walks
French banks and building materials
CDS over 2006-2010
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
Data Science for the CDS market
How to group random walks?
What is a clustering program?
Do you see clusters?
Random walks
French banks and building materials
CDS over 2006-2010
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
Data Science for the CDS market
How to group random walks?
What is a clustering program?
Do you see clusters?
Random walks
French banks and building materials
CDS over 2006-2010
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
Data Science for the CDS market
How to group random walks?
What is a clustering program?
Do you see clusters?
Random walks
French banks and building materials
CDS over 2006-2010
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
Data Science for the CDS market
How to group random walks?
What is a clustering program?
Do you see clusters?
Random walks
French banks and building materials
CDS over 2006-2010
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
Data Science for the CDS market
How to group random walks?
What is a clustering program?
Do you see clusters?
Random walks
French banks and building materials
CDS over 2006-2015
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
Data Science for the CDS market
How to group random walks?
What is a clustering program?
Do you see clusters?
Random walks
French banks and building materials
CDS over 2006-2015
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
Data Science for the CDS market
How to group random walks?
What is a clustering program?
What is a clustering program?
Definition
Clustering is the task of grouping a set of objects in such a way
that objects in the same group (cluster) are more similar to each
other than those in different groups.
Definition
We aim at finding K groups by positioning K group centers
{c1, . . . , cK } such that data points {x1, . . . , xn} minimize
min
c1,...,cK
n
i=1
K
min
j=1
d(xi , cj )2
But, what is the distance d between two random walks?
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
Standard approach on time series
Comovements and distributions
GNPR: the best of both worlds
1 Introduction
Data Science for the CDS market
How to group random walks?
What is a clustering program?
2 How to define a distance between two random walks?
Standard approach on time series
Comovements and distributions
GNPR: the best of both worlds
3 Applications
Results on synthetic datasets
Clustering Credit Default Swaps
4 Conclusion
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
Standard approach on time series
Comovements and distributions
GNPR: the best of both worlds
Naive distance between two random walks
random walks Y ,Y
d·
−→
increments X,X covariance scatterplot
X = (y2 − y1, . . . , yT − yT−1)
X,X points in RT : ||X − X ||2 = T−1
i=1 (Xi − Xi )2
apply normalizations: e.g. (X − µ)/σ, (X − min)/(max − min)
capture rather well comovements
drawbacks: not robust to outliers, blind to signal shape
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
Standard approach on time series
Comovements and distributions
GNPR: the best of both worlds
Our approach: split comovements and distributions
?
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
Standard approach on time series
Comovements and distributions
GNPR: the best of both worlds
GNPR: A suitable representation
Definition
GNPR (Generic Non-Parametric Representation) projection:
T : VN
→ UN
× GN
(1)
X → (GX (X), GX )
GX : x → P[X ≤ x] cumulative distribution function
GX (X) ∼ U[0, 1]
1
T rank(Xt) = 1
T k≤T 1{Xk ≤ Xt} →T∞ P[X ≤ Xt] = GX (Xt)
Property
T is a bijection.
N.B. It replicates Sklar’s theorem, the seminal result of Copula Theory.
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
Standard approach on time series
Comovements and distributions
GNPR: the best of both worlds
A distance dθ leveraging GNPR
Definition
Let (X, Y ) ∈ V2. Let GX , GY be vectors of marginal cdf. Let
θ ∈ [0, 1]. We define the following distance
d2
θ (X, Y ) = θd2
1 (GX (X), GY (Y )) + (1 − θ)d2
0 (GX , GY ), (2)
where
d2
1 (GX (X), GY (Y )) = 3E[|GX (X) − GY (Y )|2
], (3)
and
d2
0 (GX , GY ) =
1
2 R
dGX
dλ
−
dGY
dλ
2
dλ. (4)
d0 Hellinger;
d1 = (1 − ρS )/2, with ρS the rank correlation between X and Y .
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
Standard approach on time series
Comovements and distributions
GNPR: the best of both worlds
GNPR θ = 1: Increase of correlation
Correlation
Density
−0.2 0.0 0.2 0.4 0.6 0.8 1.0
0.00.51.01.52.02.53.03.5
Pearson Correlation
Spearman Correlation
Distribution of Correlations
10% more correlation, in average, using GNPR θ = 1 (rank statistics)
instead of standard correlation
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
Standard approach on time series
Comovements and distributions
GNPR: the best of both worlds
GNPR θ = 0: Find distribution peculiarities
Parametric modelling: Real-life CDS variations:
Which distribution?
Nokia
−310 −280 −250 −220 −190 −160 −130 −100 −70 −40 −10 10 30 50 70 90 110 130 150 170 190
Distribution Nokia
IncrementLogDensity
5e−051e−042e−045e−041e−032e−035e−031e−022e−02
020040060080010001200
Nokia 5Y CDS
Time
CDSSpread
Jan−2006 Oct−2006 Jul−2007 Apr−2008 Feb−2009 Nov−2009 Sep−2010 Jul−2011 Apr−2012 Jan−2013 Oct−2013 Jul−2014
Telecom Italia
−62 −56 −50 −44 −38 −32 −26 −20 −14 −8 −2 2 6 10 16 22 28 34 40 46 52 58 64
Distribution Telecom Italia
Increment
LogDensity
2e−045e−041e−032e−035e−031e−022e−025e−021e−01
100200300400500
Telecom Italia 5Y CDS
Time
CDSSpread
Jan−2006 Dec−2006 Nov−2007 Oct−2008 Sep−2009 Jul−2010 Jun−2011 May−2012 Apr−2013 Mar−2014
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
Results on synthetic datasets
Clustering Credit Default Swaps
1 Introduction
Data Science for the CDS market
How to group random walks?
What is a clustering program?
2 How to define a distance between two random walks?
Standard approach on time series
Comovements and distributions
GNPR: the best of both worlds
3 Applications
Results on synthetic datasets
Clustering Credit Default Swaps
4 Conclusion
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
Results on synthetic datasets
Clustering Credit Default Swaps
Description of the testing datasets
We define some interesting test case datasets to study:
distribution clustering (dataset A),
dependence clustering (dataset B),
a mix of both (dataset C).
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
Results on synthetic datasets
Clustering Credit Default Swaps
Results: GNPR works!
Adjusted Rand Index
Representation Algorithm A B C
X Ward 0 0.94 0.42
(X − µX )/σX Ward 0 0.94 0.42
(X − min)/(max − min) Ward 0 0.48 0.45
GNPR θ = 0 Ward 1 0 0.47
GNPR θ = 1 Ward 0 0.91 0.72
GNPR θ Ward 1 0.92 1
X k-means++ 0 0.90 0.44
(X − µX )/σX k-means++ 0 0.91 0.45
(X − min)/(max − min) k-means++ 0.11 0.55 0.47
GNPR θ = 0 k-means++ 1 0 0.53
GNPR θ = 1 k-means++ 0.06 0.99 0.80
GNPR θ k-means++ 1 0.99 1
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
Results on synthetic datasets
Clustering Credit Default Swaps
HCMapper: Compare Hierarchical Clustering
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
Results on synthetic datasets
Clustering Credit Default Swaps
HCMapper: Compare Hierarchical Clustering
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
Results on synthetic datasets
Clustering Credit Default Swaps
“Western sovereigns”cluster
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
1 Introduction
Data Science for the CDS market
How to group random walks?
What is a clustering program?
2 How to define a distance between two random walks?
Standard approach on time series
Comovements and distributions
GNPR: the best of both worlds
3 Applications
Results on synthetic datasets
Clustering Credit Default Swaps
4 Conclusion
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
Conclusion: Take Home Message
GNPR is a way to deal separately with
dependence information,
distribution information,
without loosing any.
Avenue for research:
better aggregation: generalized means?
consistency proof?
any idea of interesting random walks outside finance?
Check out www.datagrapple.com
to follow our R&D stuff and news about the CDS market!
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
Internships
If interested, please contact
gautier.marti@helleborecapital.com
philippe.very@helleborecapital.com
for further details and application.
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
References I
Shai Ben-David and Ulrike Von Luxburg, Relating clustering
stability to properties of cluster boundaries., COLT, vol. 2008,
2008, pp. 379–390.
Shai Ben-David, Ulrike Von Luxburg, and D´avid P´al, A sober
look at clustering stability, Learning theory, Springer, 2006,
pp. 5–19.
Asa Ben-Hur, Andre Elisseeff, and Isabelle Guyon, A stability
based method for discovering structure in clustered data,
Pacific symposium on biocomputing, vol. 7, 2001, pp. 6–17.
Tilman Lange, Volker Roth, Mikio L Braun, and Joachim M
Buhmann, Stability-based validation of clustering solutions,
Neural computation 16 (2004), no. 6, 1299–1323.
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
How to find θ ?
Clustering stability using perturbations due to:
bootstrap
time-sliding window
draws from an oracle
stability pros [BHEG01, LRBB04]
stability cons [BDVLP06, BDVL08]
theta
T
accuracy
Accuracy landscape
0.4
0.5
0.6
0.7
0.8
0.9
1.0
0.0 0.2 0.4 0.6 0.8 1.0
ARI
θ
Accuracy and stability of clustering using GNPR
Accuracy
Bootstrap stability
Time stability
Cross-datasets stability
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
Properties of dθ
Property
∀θ ∈ [0, 1], 0 ≤ dθ ≤ 1
Property
For 0 < θ < 1, dθ is a metric
For θ ∈ {0, 1},
U ∼ U[0, 1] = 1 − U, yet d0(U, 1 − U) = 0
V = 2V , yet d1(V , 2V ) = 0
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
Description of the testing datasets
For 1 ≤ i ≤ N = pK S
s=1 Ks, we define
Xi =
S
s=1
Ks
k=1
βs
k,i Y s
k +
K
k=1
αk,i Zi
k, (5)
where
a) αk,i = 1, if i ≡ k − 1 (mod K), 0 otherwise;
b) βs
k ∈ [0, 1],
c) βs
k,i = βs
k, if iKs/N = k, 0 otherwise.
(Xi )N
i=1 are partitioned into Q = K S
s=1 Ks clusters of p random
variables each.
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
OTC data flow processing
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
Introduction
How to define a distance between two random walks?
Applications
Conclusion
Conjecture: consistency of clustering random walks
200
400
600
0 500 1000 1500 2000
T
N
0.4
0.6
0.8
1.0
ARI
0.0
0.2
0.4
0.6
0.8
1.0
0 500 1000 1500 2000
ARI
T
Clustering convergence to the ground-truth partition
Clustering distribution θ = 0
Clustering dependence θ = 1
Clustering total information θ = θ*
Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?

More Related Content

What's hot

A closer look at correlations
A closer look at correlationsA closer look at correlations
A closer look at correlationsGautier Marti
 
Parameter Uncertainty and Learning in Dynamic Financial Decisions
Parameter Uncertainty and Learning in Dynamic Financial DecisionsParameter Uncertainty and Learning in Dynamic Financial Decisions
Parameter Uncertainty and Learning in Dynamic Financial DecisionsDaniel Bruggisser
 
Using Vector Clocks to Visualize Communication Flow
Using Vector Clocks to Visualize Communication FlowUsing Vector Clocks to Visualize Communication Flow
Using Vector Clocks to Visualize Communication FlowMartin Harrigan
 
Csr2011 june16 15_45_meer
Csr2011 june16 15_45_meerCsr2011 june16 15_45_meer
Csr2011 june16 15_45_meerCSR2011
 
MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...
MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...
MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...Chiheb Ben Hammouda
 
Numerical smoothing and hierarchical approximations for efficient option pric...
Numerical smoothing and hierarchical approximations for efficient option pric...Numerical smoothing and hierarchical approximations for efficient option pric...
Numerical smoothing and hierarchical approximations for efficient option pric...Chiheb Ben Hammouda
 
11.the comparative study of finite difference method and monte carlo method f...
11.the comparative study of finite difference method and monte carlo method f...11.the comparative study of finite difference method and monte carlo method f...
11.the comparative study of finite difference method and monte carlo method f...Alexander Decker
 
Pairwise and Problem-Specific Distance Metrics in the Linkage Tree Genetic Al...
Pairwise and Problem-Specific Distance Metrics in the Linkage Tree Genetic Al...Pairwise and Problem-Specific Distance Metrics in the Linkage Tree Genetic Al...
Pairwise and Problem-Specific Distance Metrics in the Linkage Tree Genetic Al...Martin Pelikan
 
MCMC and likelihood-free methods
MCMC and likelihood-free methodsMCMC and likelihood-free methods
MCMC and likelihood-free methodsChristian Robert
 
Hierarchical Deterministic Quadrature Methods for Option Pricing under the Ro...
Hierarchical Deterministic Quadrature Methods for Option Pricing under the Ro...Hierarchical Deterministic Quadrature Methods for Option Pricing under the Ro...
Hierarchical Deterministic Quadrature Methods for Option Pricing under the Ro...Chiheb Ben Hammouda
 
CARI-2020, Application of LSTM architectures for next frame forecasting in Se...
CARI-2020, Application of LSTM architectures for next frame forecasting in Se...CARI-2020, Application of LSTM architectures for next frame forecasting in Se...
CARI-2020, Application of LSTM architectures for next frame forecasting in Se...Mokhtar SELLAMI
 
ABC and empirical likelihood
ABC and empirical likelihoodABC and empirical likelihood
ABC and empirical likelihoodChristian Robert
 
ABC and empirical likelihood
ABC and empirical likelihoodABC and empirical likelihood
ABC and empirical likelihoodChristian Robert
 
Spillover Dynamics for Systemic Risk Measurement Using Spatial Financial Time...
Spillover Dynamics for Systemic Risk Measurement Using Spatial Financial Time...Spillover Dynamics for Systemic Risk Measurement Using Spatial Financial Time...
Spillover Dynamics for Systemic Risk Measurement Using Spatial Financial Time...SYRTO Project
 
Bayesian model choice in cosmology
Bayesian model choice in cosmologyBayesian model choice in cosmology
Bayesian model choice in cosmologyChristian Robert
 
Computational Information Geometry on Matrix Manifolds (ICTP 2013)
Computational Information Geometry on Matrix Manifolds (ICTP 2013)Computational Information Geometry on Matrix Manifolds (ICTP 2013)
Computational Information Geometry on Matrix Manifolds (ICTP 2013)Frank Nielsen
 
Dependent processes in Bayesian Nonparametrics
Dependent processes in Bayesian NonparametricsDependent processes in Bayesian Nonparametrics
Dependent processes in Bayesian NonparametricsJulyan Arbel
 
Scalable inference for a full multivariate stochastic volatility
Scalable inference for a full multivariate stochastic volatilityScalable inference for a full multivariate stochastic volatility
Scalable inference for a full multivariate stochastic volatilitySYRTO Project
 
Uncertain Volatility Models
Uncertain Volatility ModelsUncertain Volatility Models
Uncertain Volatility ModelsSwati Mital
 
Pricing Exotics using Change of Numeraire
Pricing Exotics using Change of NumerairePricing Exotics using Change of Numeraire
Pricing Exotics using Change of NumeraireSwati Mital
 

What's hot (20)

A closer look at correlations
A closer look at correlationsA closer look at correlations
A closer look at correlations
 
Parameter Uncertainty and Learning in Dynamic Financial Decisions
Parameter Uncertainty and Learning in Dynamic Financial DecisionsParameter Uncertainty and Learning in Dynamic Financial Decisions
Parameter Uncertainty and Learning in Dynamic Financial Decisions
 
Using Vector Clocks to Visualize Communication Flow
Using Vector Clocks to Visualize Communication FlowUsing Vector Clocks to Visualize Communication Flow
Using Vector Clocks to Visualize Communication Flow
 
Csr2011 june16 15_45_meer
Csr2011 june16 15_45_meerCsr2011 june16 15_45_meer
Csr2011 june16 15_45_meer
 
MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...
MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...
MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...
 
Numerical smoothing and hierarchical approximations for efficient option pric...
Numerical smoothing and hierarchical approximations for efficient option pric...Numerical smoothing and hierarchical approximations for efficient option pric...
Numerical smoothing and hierarchical approximations for efficient option pric...
 
11.the comparative study of finite difference method and monte carlo method f...
11.the comparative study of finite difference method and monte carlo method f...11.the comparative study of finite difference method and monte carlo method f...
11.the comparative study of finite difference method and monte carlo method f...
 
Pairwise and Problem-Specific Distance Metrics in the Linkage Tree Genetic Al...
Pairwise and Problem-Specific Distance Metrics in the Linkage Tree Genetic Al...Pairwise and Problem-Specific Distance Metrics in the Linkage Tree Genetic Al...
Pairwise and Problem-Specific Distance Metrics in the Linkage Tree Genetic Al...
 
MCMC and likelihood-free methods
MCMC and likelihood-free methodsMCMC and likelihood-free methods
MCMC and likelihood-free methods
 
Hierarchical Deterministic Quadrature Methods for Option Pricing under the Ro...
Hierarchical Deterministic Quadrature Methods for Option Pricing under the Ro...Hierarchical Deterministic Quadrature Methods for Option Pricing under the Ro...
Hierarchical Deterministic Quadrature Methods for Option Pricing under the Ro...
 
CARI-2020, Application of LSTM architectures for next frame forecasting in Se...
CARI-2020, Application of LSTM architectures for next frame forecasting in Se...CARI-2020, Application of LSTM architectures for next frame forecasting in Se...
CARI-2020, Application of LSTM architectures for next frame forecasting in Se...
 
ABC and empirical likelihood
ABC and empirical likelihoodABC and empirical likelihood
ABC and empirical likelihood
 
ABC and empirical likelihood
ABC and empirical likelihoodABC and empirical likelihood
ABC and empirical likelihood
 
Spillover Dynamics for Systemic Risk Measurement Using Spatial Financial Time...
Spillover Dynamics for Systemic Risk Measurement Using Spatial Financial Time...Spillover Dynamics for Systemic Risk Measurement Using Spatial Financial Time...
Spillover Dynamics for Systemic Risk Measurement Using Spatial Financial Time...
 
Bayesian model choice in cosmology
Bayesian model choice in cosmologyBayesian model choice in cosmology
Bayesian model choice in cosmology
 
Computational Information Geometry on Matrix Manifolds (ICTP 2013)
Computational Information Geometry on Matrix Manifolds (ICTP 2013)Computational Information Geometry on Matrix Manifolds (ICTP 2013)
Computational Information Geometry on Matrix Manifolds (ICTP 2013)
 
Dependent processes in Bayesian Nonparametrics
Dependent processes in Bayesian NonparametricsDependent processes in Bayesian Nonparametrics
Dependent processes in Bayesian Nonparametrics
 
Scalable inference for a full multivariate stochastic volatility
Scalable inference for a full multivariate stochastic volatilityScalable inference for a full multivariate stochastic volatility
Scalable inference for a full multivariate stochastic volatility
 
Uncertain Volatility Models
Uncertain Volatility ModelsUncertain Volatility Models
Uncertain Volatility Models
 
Pricing Exotics using Change of Numeraire
Pricing Exotics using Change of NumerairePricing Exotics using Change of Numeraire
Pricing Exotics using Change of Numeraire
 

Viewers also liked

IBM - Security Intelligence para PYMES
IBM - Security Intelligence para PYMESIBM - Security Intelligence para PYMES
IBM - Security Intelligence para PYMESFernando M. Imperiale
 
Searching for the grey gold - 2013
Searching for the grey gold - 2013Searching for the grey gold - 2013
Searching for the grey gold - 2013Olle Bergendahl
 
Here be dragons
Here be dragonsHere be dragons
Here be dragonsdeelay1
 
Diapo bourse aux sports
Diapo bourse aux sportsDiapo bourse aux sports
Diapo bourse aux sportsmfrfye
 
Fernando Imperiale - Security Intelligence para PYMES
Fernando Imperiale - Security Intelligence para PYMESFernando Imperiale - Security Intelligence para PYMES
Fernando Imperiale - Security Intelligence para PYMESFernando M. Imperiale
 
Yasemin yilmazer latifepalta_zeynepucar
Yasemin yilmazer latifepalta_zeynepucarYasemin yilmazer latifepalta_zeynepucar
Yasemin yilmazer latifepalta_zeynepucarzeynepucarr
 
2015年3月の中国からGitHubへのDDoS攻撃(MITM)の概要
2015年3月の中国からGitHubへのDDoS攻撃(MITM)の概要2015年3月の中国からGitHubへのDDoS攻撃(MITM)の概要
2015年3月の中国からGitHubへのDDoS攻撃(MITM)の概要Zimb_
 
Carla Casilli - Cineca + open badges - May 2015
Carla Casilli - Cineca + open badges - May 2015Carla Casilli - Cineca + open badges - May 2015
Carla Casilli - Cineca + open badges - May 2015Bestr
 
Nutrifit parcial vane
Nutrifit parcial vaneNutrifit parcial vane
Nutrifit parcial vanevanessaghia12
 
National Development 5.15.15
National Development 5.15.15National Development 5.15.15
National Development 5.15.15Jack Murray III
 
Health & safety officer performance appraisal
Health & safety officer performance appraisalHealth & safety officer performance appraisal
Health & safety officer performance appraisalsandersjamie999
 
Bartha_Éva_Lili-A_matroid_és_gráfelmélet_összefüggései - MSc_Diplomamunka
Bartha_Éva_Lili-A_matroid_és_gráfelmélet_összefüggései - MSc_DiplomamunkaBartha_Éva_Lili-A_matroid_és_gráfelmélet_összefüggései - MSc_Diplomamunka
Bartha_Éva_Lili-A_matroid_és_gráfelmélet_összefüggései - MSc_DiplomamunkaLili Eva Bartha
 
SuperMaraton - Session 9 - IBM AppScan - Fernando Imperiale v2
SuperMaraton - Session 9 - IBM AppScan - Fernando Imperiale v2SuperMaraton - Session 9 - IBM AppScan - Fernando Imperiale v2
SuperMaraton - Session 9 - IBM AppScan - Fernando Imperiale v2Fernando M. Imperiale
 
integrating climate risks in agricultural value chains enamul haque
integrating climate risks in agricultural value chains   enamul haqueintegrating climate risks in agricultural value chains   enamul haque
integrating climate risks in agricultural value chains enamul haqueEnamul Haque
 

Viewers also liked (20)

IBM - Security Intelligence para PYMES
IBM - Security Intelligence para PYMESIBM - Security Intelligence para PYMES
IBM - Security Intelligence para PYMES
 
Searching for the grey gold - 2013
Searching for the grey gold - 2013Searching for the grey gold - 2013
Searching for the grey gold - 2013
 
Here be dragons
Here be dragonsHere be dragons
Here be dragons
 
Prezentacja1
Prezentacja1Prezentacja1
Prezentacja1
 
Diapo bourse aux sports
Diapo bourse aux sportsDiapo bourse aux sports
Diapo bourse aux sports
 
EColi_CaseStudyRoughDraft.docx
EColi_CaseStudyRoughDraft.docxEColi_CaseStudyRoughDraft.docx
EColi_CaseStudyRoughDraft.docx
 
Fernando Imperiale - Security Intelligence para PYMES
Fernando Imperiale - Security Intelligence para PYMESFernando Imperiale - Security Intelligence para PYMES
Fernando Imperiale - Security Intelligence para PYMES
 
Yasemin yilmazer latifepalta_zeynepucar
Yasemin yilmazer latifepalta_zeynepucarYasemin yilmazer latifepalta_zeynepucar
Yasemin yilmazer latifepalta_zeynepucar
 
2015年3月の中国からGitHubへのDDoS攻撃(MITM)の概要
2015年3月の中国からGitHubへのDDoS攻撃(MITM)の概要2015年3月の中国からGitHubへのDDoS攻撃(MITM)の概要
2015年3月の中国からGitHubへのDDoS攻撃(MITM)の概要
 
Carla Casilli - Cineca + open badges - May 2015
Carla Casilli - Cineca + open badges - May 2015Carla Casilli - Cineca + open badges - May 2015
Carla Casilli - Cineca + open badges - May 2015
 
bala.resume
bala.resumebala.resume
bala.resume
 
Nutrifit parcial vane
Nutrifit parcial vaneNutrifit parcial vane
Nutrifit parcial vane
 
National Development 5.15.15
National Development 5.15.15National Development 5.15.15
National Development 5.15.15
 
Health & safety officer performance appraisal
Health & safety officer performance appraisalHealth & safety officer performance appraisal
Health & safety officer performance appraisal
 
NSO_cv_20160511
NSO_cv_20160511NSO_cv_20160511
NSO_cv_20160511
 
Cv bank pa
Cv bank paCv bank pa
Cv bank pa
 
Bartha_Éva_Lili-A_matroid_és_gráfelmélet_összefüggései - MSc_Diplomamunka
Bartha_Éva_Lili-A_matroid_és_gráfelmélet_összefüggései - MSc_DiplomamunkaBartha_Éva_Lili-A_matroid_és_gráfelmélet_összefüggései - MSc_Diplomamunka
Bartha_Éva_Lili-A_matroid_és_gráfelmélet_összefüggései - MSc_Diplomamunka
 
SuperMaraton - Session 9 - IBM AppScan - Fernando Imperiale v2
SuperMaraton - Session 9 - IBM AppScan - Fernando Imperiale v2SuperMaraton - Session 9 - IBM AppScan - Fernando Imperiale v2
SuperMaraton - Session 9 - IBM AppScan - Fernando Imperiale v2
 
Magento News @ Magento Meetup Wien 17
Magento News @ Magento Meetup Wien 17Magento News @ Magento Meetup Wien 17
Magento News @ Magento Meetup Wien 17
 
integrating climate risks in agricultural value chains enamul haque
integrating climate risks in agricultural value chains   enamul haqueintegrating climate risks in agricultural value chains   enamul haque
integrating climate risks in agricultural value chains enamul haque
 

Similar to Cluster Random Walks

flat_presentation_time_evolving_OD_matrix_estimation
flat_presentation_time_evolving_OD_matrix_estimationflat_presentation_time_evolving_OD_matrix_estimation
flat_presentation_time_evolving_OD_matrix_estimationLuís Moreira-Matias
 
Who Cares about Others' Privacy: Personalized Anonymization of Moving Object ...
Who Cares about Others' Privacy: Personalized Anonymization of Moving Object ...Who Cares about Others' Privacy: Personalized Anonymization of Moving Object ...
Who Cares about Others' Privacy: Personalized Anonymization of Moving Object ...e-Bi Lab
 
Learning from (dis)similarity data
Learning from (dis)similarity dataLearning from (dis)similarity data
Learning from (dis)similarity datatuxette
 
Hyper variance and autonomous bus
Hyper variance and autonomous busHyper variance and autonomous bus
Hyper variance and autonomous busJun Steed Huang
 
Making the Crowd Wiser: (Re)combination through Teaming in Crowdsourcing
Making the Crowd Wiser: (Re)combination through Teaming in CrowdsourcingMaking the Crowd Wiser: (Re)combination through Teaming in Crowdsourcing
Making the Crowd Wiser: (Re)combination through Teaming in CrowdsourcingJungpil Hahn
 
Summarizing Cluster Evolution in Dynamic Environments
Summarizing Cluster Evolution in Dynamic EnvironmentsSummarizing Cluster Evolution in Dynamic Environments
Summarizing Cluster Evolution in Dynamic EnvironmentsEirini Ntoutsi
 
FAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTING
FAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTINGFAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTING
FAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTINGIJNSA Journal
 
USE OF BARNES-HUT ALGORITHM TO ATTACK COVID-19 VIRUS
USE OF BARNES-HUT ALGORITHM TO ATTACK COVID-19 VIRUSUSE OF BARNES-HUT ALGORITHM TO ATTACK COVID-19 VIRUS
USE OF BARNES-HUT ALGORITHM TO ATTACK COVID-19 VIRUSIJCI JOURNAL
 
Credit card fraud detection and concept drift adaptation with delayed supervi...
Credit card fraud detection and concept drift adaptation with delayed supervi...Credit card fraud detection and concept drift adaptation with delayed supervi...
Credit card fraud detection and concept drift adaptation with delayed supervi...Andrea Dal Pozzolo
 
Dijkstra Shortest Path Visualization
Dijkstra Shortest Path VisualizationDijkstra Shortest Path Visualization
Dijkstra Shortest Path VisualizationIRJET Journal
 
A Lagrangian-Based Branch-And-Bound Algorithm For The Two-Level Uncapacitated...
A Lagrangian-Based Branch-And-Bound Algorithm For The Two-Level Uncapacitated...A Lagrangian-Based Branch-And-Bound Algorithm For The Two-Level Uncapacitated...
A Lagrangian-Based Branch-And-Bound Algorithm For The Two-Level Uncapacitated...Lori Head
 
FAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTING
FAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTINGFAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTING
FAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTINGIJNSA Journal
 
Una introducción a la minería de series temporales
Una introducción a la minería de series temporalesUna introducción a la minería de series temporales
Una introducción a la minería de series temporalesFacultad de Informática UCM
 
Options on Quantum Money: Quantum Path- Integral With Serial Shocks
Options on Quantum Money: Quantum Path- Integral With Serial ShocksOptions on Quantum Money: Quantum Path- Integral With Serial Shocks
Options on Quantum Money: Quantum Path- Integral With Serial ShocksAM Publications,India
 
Clustering in dynamic causal networks as a measure of systemic risk on the eu...
Clustering in dynamic causal networks as a measure of systemic risk on the eu...Clustering in dynamic causal networks as a measure of systemic risk on the eu...
Clustering in dynamic causal networks as a measure of systemic risk on the eu...SYRTO Project
 
THIC MedIX Summer 2015 Poster
THIC MedIX Summer 2015 PosterTHIC MedIX Summer 2015 Poster
THIC MedIX Summer 2015 PosterDiana Zajac
 

Similar to Cluster Random Walks (20)

flat_presentation_time_evolving_OD_matrix_estimation
flat_presentation_time_evolving_OD_matrix_estimationflat_presentation_time_evolving_OD_matrix_estimation
flat_presentation_time_evolving_OD_matrix_estimation
 
Who Cares about Others' Privacy: Personalized Anonymization of Moving Object ...
Who Cares about Others' Privacy: Personalized Anonymization of Moving Object ...Who Cares about Others' Privacy: Personalized Anonymization of Moving Object ...
Who Cares about Others' Privacy: Personalized Anonymization of Moving Object ...
 
Learning from (dis)similarity data
Learning from (dis)similarity dataLearning from (dis)similarity data
Learning from (dis)similarity data
 
Hyper variance and autonomous bus
Hyper variance and autonomous busHyper variance and autonomous bus
Hyper variance and autonomous bus
 
Making the Crowd Wiser: (Re)combination through Teaming in Crowdsourcing
Making the Crowd Wiser: (Re)combination through Teaming in CrowdsourcingMaking the Crowd Wiser: (Re)combination through Teaming in Crowdsourcing
Making the Crowd Wiser: (Re)combination through Teaming in Crowdsourcing
 
Summarizing Cluster Evolution in Dynamic Environments
Summarizing Cluster Evolution in Dynamic EnvironmentsSummarizing Cluster Evolution in Dynamic Environments
Summarizing Cluster Evolution in Dynamic Environments
 
FAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTING
FAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTINGFAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTING
FAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTING
 
USE OF BARNES-HUT ALGORITHM TO ATTACK COVID-19 VIRUS
USE OF BARNES-HUT ALGORITHM TO ATTACK COVID-19 VIRUSUSE OF BARNES-HUT ALGORITHM TO ATTACK COVID-19 VIRUS
USE OF BARNES-HUT ALGORITHM TO ATTACK COVID-19 VIRUS
 
Credit card fraud detection and concept drift adaptation with delayed supervi...
Credit card fraud detection and concept drift adaptation with delayed supervi...Credit card fraud detection and concept drift adaptation with delayed supervi...
Credit card fraud detection and concept drift adaptation with delayed supervi...
 
Dijkstra Shortest Path Visualization
Dijkstra Shortest Path VisualizationDijkstra Shortest Path Visualization
Dijkstra Shortest Path Visualization
 
A Lagrangian-Based Branch-And-Bound Algorithm For The Two-Level Uncapacitated...
A Lagrangian-Based Branch-And-Bound Algorithm For The Two-Level Uncapacitated...A Lagrangian-Based Branch-And-Bound Algorithm For The Two-Level Uncapacitated...
A Lagrangian-Based Branch-And-Bound Algorithm For The Two-Level Uncapacitated...
 
FAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTING
FAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTINGFAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTING
FAST DETECTION OF DDOS ATTACKS USING NON-ADAPTIVE GROUP TESTING
 
Una introducción a la minería de series temporales
Una introducción a la minería de series temporalesUna introducción a la minería de series temporales
Una introducción a la minería de series temporales
 
Data science
Data scienceData science
Data science
 
Options on Quantum Money: Quantum Path- Integral With Serial Shocks
Options on Quantum Money: Quantum Path- Integral With Serial ShocksOptions on Quantum Money: Quantum Path- Integral With Serial Shocks
Options on Quantum Money: Quantum Path- Integral With Serial Shocks
 
Clustering in dynamic causal networks as a measure of systemic risk on the eu...
Clustering in dynamic causal networks as a measure of systemic risk on the eu...Clustering in dynamic causal networks as a measure of systemic risk on the eu...
Clustering in dynamic causal networks as a measure of systemic risk on the eu...
 
THIC MedIX Summer 2015 Poster
THIC MedIX Summer 2015 PosterTHIC MedIX Summer 2015 Poster
THIC MedIX Summer 2015 Poster
 
Multimodal Deep Learning
Multimodal Deep LearningMultimodal Deep Learning
Multimodal Deep Learning
 
The impact of attribute preferences on adoption timing of hotel distribution ...
The impact of attribute preferences on adoption timing of hotel distribution ...The impact of attribute preferences on adoption timing of hotel distribution ...
The impact of attribute preferences on adoption timing of hotel distribution ...
 
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
 

More from Gautier Marti

Using Large Language Models in 10 Lines of Code
Using Large Language Models in 10 Lines of CodeUsing Large Language Models in 10 Lines of Code
Using Large Language Models in 10 Lines of CodeGautier Marti
 
What deep learning can bring to...
What deep learning can bring to...What deep learning can bring to...
What deep learning can bring to...Gautier Marti
 
A quick demo of Top2Vec With application on 2020 10-K business descriptions
A quick demo of Top2Vec With application on 2020 10-K business descriptionsA quick demo of Top2Vec With application on 2020 10-K business descriptions
A quick demo of Top2Vec With application on 2020 10-K business descriptionsGautier Marti
 
cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...
cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...
cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...Gautier Marti
 
How deep generative models can help quants reduce the risk of overfitting?
How deep generative models can help quants reduce the risk of overfitting?How deep generative models can help quants reduce the risk of overfitting?
How deep generative models can help quants reduce the risk of overfitting?Gautier Marti
 
Generating Realistic Synthetic Data in Finance
Generating Realistic Synthetic Data in FinanceGenerating Realistic Synthetic Data in Finance
Generating Realistic Synthetic Data in FinanceGautier Marti
 
Applications of GANs in Finance
Applications of GANs in FinanceApplications of GANs in Finance
Applications of GANs in FinanceGautier Marti
 
My recent attempts at using GANs for simulating realistic stocks returns
My recent attempts at using GANs for simulating realistic stocks returnsMy recent attempts at using GANs for simulating realistic stocks returns
My recent attempts at using GANs for simulating realistic stocks returnsGautier Marti
 
Takeaways from ICML 2019, Long Beach, California
Takeaways from ICML 2019, Long Beach, CaliforniaTakeaways from ICML 2019, Long Beach, California
Takeaways from ICML 2019, Long Beach, CaliforniaGautier Marti
 
A review of two decades of correlations, hierarchies, networks and clustering...
A review of two decades of correlations, hierarchies, networks and clustering...A review of two decades of correlations, hierarchies, networks and clustering...
A review of two decades of correlations, hierarchies, networks and clustering...Gautier Marti
 
Autoregressive Convolutional Neural Networks for Asynchronous Time Series
Autoregressive Convolutional Neural Networks for Asynchronous Time SeriesAutoregressive Convolutional Neural Networks for Asynchronous Time Series
Autoregressive Convolutional Neural Networks for Asynchronous Time SeriesGautier Marti
 
On Clustering Financial Time Series - Beyond Correlation
On Clustering Financial Time Series - Beyond CorrelationOn Clustering Financial Time Series - Beyond Correlation
On Clustering Financial Time Series - Beyond CorrelationGautier Marti
 

More from Gautier Marti (12)

Using Large Language Models in 10 Lines of Code
Using Large Language Models in 10 Lines of CodeUsing Large Language Models in 10 Lines of Code
Using Large Language Models in 10 Lines of Code
 
What deep learning can bring to...
What deep learning can bring to...What deep learning can bring to...
What deep learning can bring to...
 
A quick demo of Top2Vec With application on 2020 10-K business descriptions
A quick demo of Top2Vec With application on 2020 10-K business descriptionsA quick demo of Top2Vec With application on 2020 10-K business descriptions
A quick demo of Top2Vec With application on 2020 10-K business descriptions
 
cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...
cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...
cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...
 
How deep generative models can help quants reduce the risk of overfitting?
How deep generative models can help quants reduce the risk of overfitting?How deep generative models can help quants reduce the risk of overfitting?
How deep generative models can help quants reduce the risk of overfitting?
 
Generating Realistic Synthetic Data in Finance
Generating Realistic Synthetic Data in FinanceGenerating Realistic Synthetic Data in Finance
Generating Realistic Synthetic Data in Finance
 
Applications of GANs in Finance
Applications of GANs in FinanceApplications of GANs in Finance
Applications of GANs in Finance
 
My recent attempts at using GANs for simulating realistic stocks returns
My recent attempts at using GANs for simulating realistic stocks returnsMy recent attempts at using GANs for simulating realistic stocks returns
My recent attempts at using GANs for simulating realistic stocks returns
 
Takeaways from ICML 2019, Long Beach, California
Takeaways from ICML 2019, Long Beach, CaliforniaTakeaways from ICML 2019, Long Beach, California
Takeaways from ICML 2019, Long Beach, California
 
A review of two decades of correlations, hierarchies, networks and clustering...
A review of two decades of correlations, hierarchies, networks and clustering...A review of two decades of correlations, hierarchies, networks and clustering...
A review of two decades of correlations, hierarchies, networks and clustering...
 
Autoregressive Convolutional Neural Networks for Asynchronous Time Series
Autoregressive Convolutional Neural Networks for Asynchronous Time SeriesAutoregressive Convolutional Neural Networks for Asynchronous Time Series
Autoregressive Convolutional Neural Networks for Asynchronous Time Series
 
On Clustering Financial Time Series - Beyond Correlation
On Clustering Financial Time Series - Beyond CorrelationOn Clustering Financial Time Series - Beyond Correlation
On Clustering Financial Time Series - Beyond Correlation
 

Recently uploaded

专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home ServiceSapana Sha
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
IMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptxIMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptxdolaknnilon
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfchwongval
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
MK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxMK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxUnduhUnggah1
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 

Recently uploaded (20)

专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
IMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptxIMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptx
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdf
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
MK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxMK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docx
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 

Cluster Random Walks

  • 1. Introduction How to define a distance between two random walks? Applications Conclusion How to cluster random walks? Paris Machine Learning #5 Season 2: Time Series and FinTech Philippe Donnat1 Gautier Marti1,2 Frank Nielsen2 Philippe Very1 1Hellebore Capital Management 2Ecole Polytechnique th January  Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 2. Introduction How to define a distance between two random walks? Applications Conclusion 1 Introduction Data Science for the CDS market How to group random walks? What is a clustering program? 2 How to define a distance between two random walks? Standard approach on time series Comovements and distributions GNPR: the best of both worlds 3 Applications Results on synthetic datasets Clustering Credit Default Swaps 4 Conclusion Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 3. Introduction How to define a distance between two random walks? Applications Conclusion Data Science for the CDS market How to group random walks? What is a clustering program? 1 Introduction Data Science for the CDS market How to group random walks? What is a clustering program? 2 How to define a distance between two random walks? Standard approach on time series Comovements and distributions GNPR: the best of both worlds 3 Applications Results on synthetic datasets Clustering Credit Default Swaps 4 Conclusion Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 4. Introduction How to define a distance between two random walks? Applications Conclusion Data Science for the CDS market How to group random walks? What is a clustering program? Hellebore Capital Management & Data Science Current R&D projects in Data Science: Data mining: parsing & natural language processing Inference: incomplete data sources Portfolio & Risk analysis: understanding joint behaviours Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 5. Introduction How to define a distance between two random walks? Applications Conclusion Data Science for the CDS market How to group random walks? What is a clustering program? Do you see clusters? Random walks French banks and building materials CDS over 2006-2010 Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 6. Introduction How to define a distance between two random walks? Applications Conclusion Data Science for the CDS market How to group random walks? What is a clustering program? Do you see clusters? Random walks French banks and building materials CDS over 2006-2010 Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 7. Introduction How to define a distance between two random walks? Applications Conclusion Data Science for the CDS market How to group random walks? What is a clustering program? Do you see clusters? Random walks French banks and building materials CDS over 2006-2010 Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 8. Introduction How to define a distance between two random walks? Applications Conclusion Data Science for the CDS market How to group random walks? What is a clustering program? Do you see clusters? Random walks French banks and building materials CDS over 2006-2010 Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 9. Introduction How to define a distance between two random walks? Applications Conclusion Data Science for the CDS market How to group random walks? What is a clustering program? Do you see clusters? Random walks French banks and building materials CDS over 2006-2010 Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 10. Introduction How to define a distance between two random walks? Applications Conclusion Data Science for the CDS market How to group random walks? What is a clustering program? Do you see clusters? Random walks French banks and building materials CDS over 2006-2010 Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 11. Introduction How to define a distance between two random walks? Applications Conclusion Data Science for the CDS market How to group random walks? What is a clustering program? Do you see clusters? Random walks French banks and building materials CDS over 2006-2015 Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 12. Introduction How to define a distance between two random walks? Applications Conclusion Data Science for the CDS market How to group random walks? What is a clustering program? Do you see clusters? Random walks French banks and building materials CDS over 2006-2015 Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 13. Introduction How to define a distance between two random walks? Applications Conclusion Data Science for the CDS market How to group random walks? What is a clustering program? What is a clustering program? Definition Clustering is the task of grouping a set of objects in such a way that objects in the same group (cluster) are more similar to each other than those in different groups. Definition We aim at finding K groups by positioning K group centers {c1, . . . , cK } such that data points {x1, . . . , xn} minimize min c1,...,cK n i=1 K min j=1 d(xi , cj )2 But, what is the distance d between two random walks? Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 14. Introduction How to define a distance between two random walks? Applications Conclusion Standard approach on time series Comovements and distributions GNPR: the best of both worlds 1 Introduction Data Science for the CDS market How to group random walks? What is a clustering program? 2 How to define a distance between two random walks? Standard approach on time series Comovements and distributions GNPR: the best of both worlds 3 Applications Results on synthetic datasets Clustering Credit Default Swaps 4 Conclusion Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 15. Introduction How to define a distance between two random walks? Applications Conclusion Standard approach on time series Comovements and distributions GNPR: the best of both worlds Naive distance between two random walks random walks Y ,Y d· −→ increments X,X covariance scatterplot X = (y2 − y1, . . . , yT − yT−1) X,X points in RT : ||X − X ||2 = T−1 i=1 (Xi − Xi )2 apply normalizations: e.g. (X − µ)/σ, (X − min)/(max − min) capture rather well comovements drawbacks: not robust to outliers, blind to signal shape Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 16. Introduction How to define a distance between two random walks? Applications Conclusion Standard approach on time series Comovements and distributions GNPR: the best of both worlds Our approach: split comovements and distributions ? Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 17. Introduction How to define a distance between two random walks? Applications Conclusion Standard approach on time series Comovements and distributions GNPR: the best of both worlds GNPR: A suitable representation Definition GNPR (Generic Non-Parametric Representation) projection: T : VN → UN × GN (1) X → (GX (X), GX ) GX : x → P[X ≤ x] cumulative distribution function GX (X) ∼ U[0, 1] 1 T rank(Xt) = 1 T k≤T 1{Xk ≤ Xt} →T∞ P[X ≤ Xt] = GX (Xt) Property T is a bijection. N.B. It replicates Sklar’s theorem, the seminal result of Copula Theory. Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 18. Introduction How to define a distance between two random walks? Applications Conclusion Standard approach on time series Comovements and distributions GNPR: the best of both worlds A distance dθ leveraging GNPR Definition Let (X, Y ) ∈ V2. Let GX , GY be vectors of marginal cdf. Let θ ∈ [0, 1]. We define the following distance d2 θ (X, Y ) = θd2 1 (GX (X), GY (Y )) + (1 − θ)d2 0 (GX , GY ), (2) where d2 1 (GX (X), GY (Y )) = 3E[|GX (X) − GY (Y )|2 ], (3) and d2 0 (GX , GY ) = 1 2 R dGX dλ − dGY dλ 2 dλ. (4) d0 Hellinger; d1 = (1 − ρS )/2, with ρS the rank correlation between X and Y . Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 19. Introduction How to define a distance between two random walks? Applications Conclusion Standard approach on time series Comovements and distributions GNPR: the best of both worlds GNPR θ = 1: Increase of correlation Correlation Density −0.2 0.0 0.2 0.4 0.6 0.8 1.0 0.00.51.01.52.02.53.03.5 Pearson Correlation Spearman Correlation Distribution of Correlations 10% more correlation, in average, using GNPR θ = 1 (rank statistics) instead of standard correlation Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 20. Introduction How to define a distance between two random walks? Applications Conclusion Standard approach on time series Comovements and distributions GNPR: the best of both worlds GNPR θ = 0: Find distribution peculiarities Parametric modelling: Real-life CDS variations: Which distribution? Nokia −310 −280 −250 −220 −190 −160 −130 −100 −70 −40 −10 10 30 50 70 90 110 130 150 170 190 Distribution Nokia IncrementLogDensity 5e−051e−042e−045e−041e−032e−035e−031e−022e−02 020040060080010001200 Nokia 5Y CDS Time CDSSpread Jan−2006 Oct−2006 Jul−2007 Apr−2008 Feb−2009 Nov−2009 Sep−2010 Jul−2011 Apr−2012 Jan−2013 Oct−2013 Jul−2014 Telecom Italia −62 −56 −50 −44 −38 −32 −26 −20 −14 −8 −2 2 6 10 16 22 28 34 40 46 52 58 64 Distribution Telecom Italia Increment LogDensity 2e−045e−041e−032e−035e−031e−022e−025e−021e−01 100200300400500 Telecom Italia 5Y CDS Time CDSSpread Jan−2006 Dec−2006 Nov−2007 Oct−2008 Sep−2009 Jul−2010 Jun−2011 May−2012 Apr−2013 Mar−2014 Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 21. Introduction How to define a distance between two random walks? Applications Conclusion Results on synthetic datasets Clustering Credit Default Swaps 1 Introduction Data Science for the CDS market How to group random walks? What is a clustering program? 2 How to define a distance between two random walks? Standard approach on time series Comovements and distributions GNPR: the best of both worlds 3 Applications Results on synthetic datasets Clustering Credit Default Swaps 4 Conclusion Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 22. Introduction How to define a distance between two random walks? Applications Conclusion Results on synthetic datasets Clustering Credit Default Swaps Description of the testing datasets We define some interesting test case datasets to study: distribution clustering (dataset A), dependence clustering (dataset B), a mix of both (dataset C). Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 23. Introduction How to define a distance between two random walks? Applications Conclusion Results on synthetic datasets Clustering Credit Default Swaps Results: GNPR works! Adjusted Rand Index Representation Algorithm A B C X Ward 0 0.94 0.42 (X − µX )/σX Ward 0 0.94 0.42 (X − min)/(max − min) Ward 0 0.48 0.45 GNPR θ = 0 Ward 1 0 0.47 GNPR θ = 1 Ward 0 0.91 0.72 GNPR θ Ward 1 0.92 1 X k-means++ 0 0.90 0.44 (X − µX )/σX k-means++ 0 0.91 0.45 (X − min)/(max − min) k-means++ 0.11 0.55 0.47 GNPR θ = 0 k-means++ 1 0 0.53 GNPR θ = 1 k-means++ 0.06 0.99 0.80 GNPR θ k-means++ 1 0.99 1 Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 24. Introduction How to define a distance between two random walks? Applications Conclusion Results on synthetic datasets Clustering Credit Default Swaps HCMapper: Compare Hierarchical Clustering Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 25. Introduction How to define a distance between two random walks? Applications Conclusion Results on synthetic datasets Clustering Credit Default Swaps HCMapper: Compare Hierarchical Clustering Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 26. Introduction How to define a distance between two random walks? Applications Conclusion Results on synthetic datasets Clustering Credit Default Swaps “Western sovereigns”cluster Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 27. Introduction How to define a distance between two random walks? Applications Conclusion 1 Introduction Data Science for the CDS market How to group random walks? What is a clustering program? 2 How to define a distance between two random walks? Standard approach on time series Comovements and distributions GNPR: the best of both worlds 3 Applications Results on synthetic datasets Clustering Credit Default Swaps 4 Conclusion Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 28. Introduction How to define a distance between two random walks? Applications Conclusion Conclusion: Take Home Message GNPR is a way to deal separately with dependence information, distribution information, without loosing any. Avenue for research: better aggregation: generalized means? consistency proof? any idea of interesting random walks outside finance? Check out www.datagrapple.com to follow our R&D stuff and news about the CDS market! Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 29. Introduction How to define a distance between two random walks? Applications Conclusion Internships If interested, please contact gautier.marti@helleborecapital.com philippe.very@helleborecapital.com for further details and application. Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 30. Introduction How to define a distance between two random walks? Applications Conclusion References I Shai Ben-David and Ulrike Von Luxburg, Relating clustering stability to properties of cluster boundaries., COLT, vol. 2008, 2008, pp. 379–390. Shai Ben-David, Ulrike Von Luxburg, and D´avid P´al, A sober look at clustering stability, Learning theory, Springer, 2006, pp. 5–19. Asa Ben-Hur, Andre Elisseeff, and Isabelle Guyon, A stability based method for discovering structure in clustered data, Pacific symposium on biocomputing, vol. 7, 2001, pp. 6–17. Tilman Lange, Volker Roth, Mikio L Braun, and Joachim M Buhmann, Stability-based validation of clustering solutions, Neural computation 16 (2004), no. 6, 1299–1323. Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 31. Introduction How to define a distance between two random walks? Applications Conclusion How to find θ ? Clustering stability using perturbations due to: bootstrap time-sliding window draws from an oracle stability pros [BHEG01, LRBB04] stability cons [BDVLP06, BDVL08] theta T accuracy Accuracy landscape 0.4 0.5 0.6 0.7 0.8 0.9 1.0 0.0 0.2 0.4 0.6 0.8 1.0 ARI θ Accuracy and stability of clustering using GNPR Accuracy Bootstrap stability Time stability Cross-datasets stability Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 32. Introduction How to define a distance between two random walks? Applications Conclusion Properties of dθ Property ∀θ ∈ [0, 1], 0 ≤ dθ ≤ 1 Property For 0 < θ < 1, dθ is a metric For θ ∈ {0, 1}, U ∼ U[0, 1] = 1 − U, yet d0(U, 1 − U) = 0 V = 2V , yet d1(V , 2V ) = 0 Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 33. Introduction How to define a distance between two random walks? Applications Conclusion Description of the testing datasets For 1 ≤ i ≤ N = pK S s=1 Ks, we define Xi = S s=1 Ks k=1 βs k,i Y s k + K k=1 αk,i Zi k, (5) where a) αk,i = 1, if i ≡ k − 1 (mod K), 0 otherwise; b) βs k ∈ [0, 1], c) βs k,i = βs k, if iKs/N = k, 0 otherwise. (Xi )N i=1 are partitioned into Q = K S s=1 Ks clusters of p random variables each. Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 34. Introduction How to define a distance between two random walks? Applications Conclusion OTC data flow processing Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?
  • 35. Introduction How to define a distance between two random walks? Applications Conclusion Conjecture: consistency of clustering random walks 200 400 600 0 500 1000 1500 2000 T N 0.4 0.6 0.8 1.0 ARI 0.0 0.2 0.4 0.6 0.8 1.0 0 500 1000 1500 2000 ARI T Clustering convergence to the ground-truth partition Clustering distribution θ = 0 Clustering dependence θ = 1 Clustering total information θ = θ* Philippe Donnat, Gautier Marti, Frank Nielsen, Philippe Very How to cluster random walks?