SlideShare a Scribd company logo
1 of 64
Download to read offline
Generating Realistic Synthetic Data in Finance
Applications of GANs
Talk at IHS Markit Webinar
Gautier Marti
HKML Research
15 October 2020
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 1 / 64
Table of contents
1 Introduction
2 GANs explained
GANs Milestones & Major Achievements
How do GANs work?
3 Applications of GANs in Finance
Applications for quants
Generating Synthetic Datasets to Avoid Strategy Overfitting
Generating Alternative Realistic Historical Paths for Risk Estimation
Applications for cloud computing providers
Training Machine Learning Models in the Cloud on Synthetic Data
Applications for data vendors
A Larger Data Market: Synthetic Datasets, A New Product
Applications for auditors and regulators
Deepfakes of Financial Statements and Tools to Find Them
Current State of the Art and Limitations
4 Conclusion and Questions?
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 2 / 64
Section 1
Introduction
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 3 / 64
Generative Adversarial Network’s Definition & Goal
Definition: Generative Adversarial Network (GAN)
A generative adversarial network (GAN) is a class of machine learning frame-
works where two neural networks (usually known as the generator and the
discriminator) contest with each other in a zero-sum game (i.e. one agent’s
gain is another agent’s loss).
Main purpose of a GAN: Generating realistic synthetic data
Given a training set, this technique learns to generate new data with the
same statistics as the training set.
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 4 / 64
Introduction (3 min. video)
https://www.youtube.com/watch?v=97B8tuHwLY0
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 5 / 64
Scope of the presentation
In this talk, we will focus on
doing a review of GANs success so far (mostly image generation),
explaining how Generative Adversarial Networks (GANs) work.
Then, we will present
applications of these models for generating finance-related data,
and the associated ’business’ use cases.
Finally, we shall briefly discuss
the current limitations and challenges to be overcome for a broader
adoption of these models in the industry.
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 6 / 64
Section 2
GANs explained
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 7 / 64
Subsection 1
GANs Milestones & Major Achievements
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 8 / 64
GANs: A relatively new Deep Learning model (2014)
Goodfellow, Ian, et al. “Generative adversarial nets.”
Advances in neural information processing systems. 2014.
Cited by 22354 papers as of 18 September 2020.
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 9 / 64
Text to Photo-realistic Image Synthesis (2016)
https://arxiv.org/pdf/1612.03242.pdf
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 10 / 64
Deepfakes (2017)
Ctrl Shift Face (YouTube channel)
https://www.youtube.com/channel/UCKpH0CKltc73e4wh0_pgL3g
You Won’t Believe What Obama Says In This Video!
https://www.youtube.com/watch?v=cQ54GDm1eL0
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 11 / 64
Edmond de Belamy (2018)
Created by 3 students
First artwork created using
Artificial Intelligence to be
featured in a Christie’s auction
Sold for USD 432,500
Signed
minG maxD Ex [log(D(x))] +
Ez [log(1 − D(G(z)))]
In French, “bel ami” means
“good fellow”, a pun-tribute to
Ian Goodfellow, the creator of
GANs
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 12 / 64
StyleGAN (2019) & StyleGAN2 (2020)
GANs can generate realistic fake human faces:
Give it a try: https://thispersondoesnotexist.com/
NVIDIA StyleGAN paper: https://arxiv.org/pdf/1812.04948.pdf
NVIDIA StyleGAN2 paper: https://arxiv.org/pdf/1912.04958.pdf
Remark. In December 2019, Facebook took down a network of accounts
with false identities, and mentioned that some of them had used profile
pictures created with artificial intelligence.
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 13 / 64
Speech2Face & Wav2Pix (2019)
GANs conditioned on speech of a person can output a realistic face with
its correct gender, ethnicity, and approximate age:
Relevant papers:
Wav2Pix: https://arxiv.org/pdf/1903.10195.pdf
Speech2Face: https://arxiv.org/pdf/1905.09773.pdf
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 14 / 64
GameGAN (2020)
GameGAN is able to learn Pac-Man dynamics and produce a visually
consistent simulation of the game:
NVIDIA paper https://arxiv.org/pdf/2005.12126.pdf
https://www.youtube.com/watch?v=4OzJUNsPx60
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 15 / 64
Subsection 2
How do GANs work?
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 16 / 64
A GAN basic architecture
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 17 / 64
GAN training – step by step tutorial
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 18 / 64
GAN training – step by step tutorial
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 19 / 64
GAN training – step by step tutorial
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 20 / 64
GAN training – step by step tutorial
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 21 / 64
GAN training – step by step tutorial
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 22 / 64
GAN training – step by step tutorial
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 23 / 64
GAN training – step by step tutorial
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 24 / 64
GAN training – step by step tutorial
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 25 / 64
GAN training – step by step tutorial
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 26 / 64
GAN training – step by step tutorial
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 27 / 64
GAN training – step by step tutorial
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 28 / 64
GAN training – step by step tutorial
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 29 / 64
GAN training – step by step tutorial
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 30 / 64
GAN training – step by step tutorial
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 31 / 64
GAN training – step by step tutorial
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 32 / 64
GAN training – step by step tutorial
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 33 / 64
GAN training – step by step tutorial
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 34 / 64
GAN training – step by step tutorial
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 35 / 64
GAN training – step by step tutorial
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 36 / 64
GAN training – step by step tutorial
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 37 / 64
GAN training – step by step tutorial
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 38 / 64
GAN training – step by step tutorial
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 39 / 64
Conditional Generative Adversarial Nets
Seminal paper: https://arxiv.org/pdf/1411.1784.pdf
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 40 / 64
Conditional Generative Adversarial Nets
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 41 / 64
Section 3
Applications of GANs in Finance
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 42 / 64
Subsection 1
Applications for quants
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 43 / 64
Strategy Overfitting
Scenario
In September 2020, a naive or unscrupulous strat presents to his
manager a new strategy: The strategy would have initiated a massive
short in February 2020, and would have bet big on a rally starting late
March 2020.
The strat sells the strategy to his manager as able to pick-up early
signals of market sell-offs and bounce-backs thanks to advanced
machine learning and alternative data (obviously, what else?).
The manager, excited, cannot wait but to deploy capital to it.
How to fight against this industry-wide fallacy?
Well-thought and carefully designed incentives schemes
(e.g. reward a sound research framework rather than impressive in-sample backtests)
Counterfactual thinking, realistic simulations of alternative paths
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 44 / 64
Strategy Overfitting – Simulations of Alternative Paths (1)
Some alternative data: Second-hand car market in Hong Kong
A tabular dataset X
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 45 / 64
Strategy Overfitting – Simulations of Alternative Paths (2)
econ quant idea: When inventory builds up (more people are selling their
cars than buying), then the HSI plummets over the next quarter.
math quant idea: My black-box ML algorithm predicts that HSI decreases
whenever two Ferraris are sold right after a Bugatti Veyron comes to
market, based on the given dataset.
Using GANs, we can sample new datasets ˆX ∼ X, or even ( ˆX, ˆy) ∼ (X, y),
where y can be the HSI performance or any variable we aim at predicting
(e.g. market+industry residualized returns of the respective carmakers).
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 46 / 64
Strategy Overfitting – Simulations of Alternative Paths (3)
Method 1: Strategy stability
Use a GAN to learn the distribution (X, y) of the tabular data
(cf. https://arxiv.org/pdf/1907.00503.pdf)
From the calibrated GAN, generate ˆX(n)
, ˆy(n)
N
n=1
synthetic datasets
Backtest the strategy on the N datasets, and collect perf. metrics
Analyze the perf. metrics; Discard the strategy if not stable.
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 47 / 64
Strategy Overfitting – Simulations of Alternative Paths (4)
In Koshiyama et al. (https://arxiv.org/pdf/1901.01751.pdf):
Generate N synthetic datasets:
Method 2: Strategy fine-tuning
Find the parameters which maximize the average perf. over the N datasets
Method 2bis: Strategy combination (ensemble models)
Fit a model on each of the N datasets
Predict using an ensemble of the N trained models
GAN vs. Stationary Bootstrap (for generating the N synthetic datasets)
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 48 / 64
Risk-based Portfolio Allocation (1)
Scenario
You want to systematically allocate capital to your strategies.
Literature claims that such or such method works better than all
others by providing a dubious backtest.
When you horse race the various allocation methods, results are not
stable across periods and universes.
How can we conclude anything useful? Realistic simulations!
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 49 / 64
Risk-based Portfolio Allocation (2)
Method 3: Testing Portfolio Allocation on Alternative Historical Paths
Use a GAN to generate realistic synthetic correlation matrices
(cf. https://arxiv.org/pdf/1910.09504.pdf)
Generate time series verifying the correlation structure
Estimate portfolio weights (in-sample), measure out-of-sample risk
Analyze performance of the portfolio allocation methods
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 50 / 64
Risk-based Portfolio Allocation (3)
Question: Can we predict when a method (say the Hierarchical Risk
Parity (HRP)) outperforms another one (say naive risk parity)? And why?
Method 4: Understanding the outperformance of a method over another
Extract features from the underlying correlation matrix
Fit a ML model features → outperformance
(using train, validation, test datasets)
Verify if there is any good predictability
Explain the predictions as a function of features
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 51 / 64
Risk-based Portfolio Allocation (4)
Concrete example: HRP vs. naive risk parity
High values for the cophenetic correlation coefficient are characteristic of a strong
hierarchical structure. Thus, HRP outperforms naive risk parity when the
underlying DGP has a strong hierarchical structure (nested clusters).
cf. Jochen Papenbrock recent work for similar applications of Explainable AI (XAI) in finance (https://firamis.de/)
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 52 / 64
Subsection 2
Applications for cloud computing providers
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 53 / 64
Training Machine Learning Models in the Cloud on
Synthetic Data
Scenario
Your company forbids the transfer of sensitive data (e.g. trades &
positions) to the Cloud
It would be more relevant and cost-effective to train large and recent
ML models in the Cloud (e.g. Amazon SageMaker)
Method 5: Training in the Cloud on synthetic data
Generate anonymized synthetic versions of the sensitive data
Send the GAN-generated non-sensitive synthetic datasets to the Cloud
Train Machine Learning models on these datasets in the Cloud
Download the Machine Learning models on premise
Fine-tune and apply the ML models on the original data
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 54 / 64
Subsection 3
Applications for data vendors
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 55 / 64
A Larger Data Market: Synthetic Datasets, A New Product
As a data vendor:
Use Case 1
You may gather interests from research labs and startups which
cannot afford the price tag a hedge fund can for a dataset
You cannot sell them the original premium dataset at a hard discount
But you could sell them anonymized synthetic datasets based on the
original one at a fraction of the price
In some cases, realistic synthetic datasets may be sufficient,
e.g. a researcher studying the structural properties of a supply-chain
network rather than trying to predict markets from it
cf. this paper https://arxiv.org/pdf/2002.02271.pdf (Feb 2020) from American Express researchers
=⇒ A broader and more diverse base of clients
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 56 / 64
A Larger Data Market: Synthetic Datasets, A New Product
As a data vendor, you should be aware that:
Use Case 2
Quantitative trading firms are afraid of over-fitting
Besides the original dataset, they may be interested in buying realistic
synthetic versions of it:
1 Original dataset will be used in production for trading
2 Realistic synthetic versions can be used at the research stage
Managers can distribute the synthetic datasets to their strat quants, then
they can check for consistent results across the synthetic datasets and the
original one (cf. Methods 1, 2 and 2bis previously discussed).
=⇒ A new product offered at a premium on top of the original dataset
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 57 / 64
A Larger Data Market: Synthetic Datasets, A New Product
As a data vendor, you know that:
Use Case 3
Getting the client’s legal and compliance departments approval can be
a long process, even for a simple trial
In some cases, it can result in the loss of business to a competitor
So that the prospect has a quick first overview, you may be able to send
over a synthetic dataset. This should not raise the scrutiny of legal (e.g.
no contractual terms to check) or compliance (e.g. no material non-public
information in anonymized synthetic data by construction).
=⇒ Easier to maintain customer engagement and pitch new datasets
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 58 / 64
Subsection 4
Applications for auditors and regulators
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 59 / 64
Deepfakes of Financial Statements and Tools to Find Them
An an auditor:
Scenario
You have to assess journal entries comprising millions of transactions
You use ’Computer Assisted Audit Techniques’ which range from
rule-based tests designed according to past frauds to basic statistical
methods for detecting accounting anomalies
Fraudsters may adapt deepfakes to business accounting
cf. this paper https://arxiv.org/pdf/1910.03810.pdf (Oct 2019) about deepfakes in accounting
=⇒ Auditors and regulators should learn the techniques to uncover these
special types of deepfakes.
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 60 / 64
Subsection 5
Current State of the Art and Limitations
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 61 / 64
Current State of the Art and Limitations
This discussion is by nature technical, but we can highlight the following:
GANs can generate realistic tabular datasets (2019), but models
trained on synthetic data only were shown inferior to the ones trained
on the original data
GANs can offer a high degree of anonymization, but not all of them
are built to be differentially private meaning they might leak
information about the original data
We know how to GAN-generate realistic synthetic financial time series
(e.g. S&P 500 returns);
We know how to GAN-generate realistic synthetic financial correlation
matrices (of the S&P 500 constituents, for example);
However, we do not know yet how to GAN-generate realistic
multivariate financial time series, i.e. verifying both the time series
and the cross-sectional stylized facts
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 62 / 64
Section 4
Conclusion and Questions?
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 63 / 64
Conclusion and Questions?
Deep generative models and GANs in particular are an exciting new
technology.
In finance, they are actively researched in a few places but results are
not widely advertised.
We have to rely on top tech companies and academic labs to drive
the fundamental understanding and improvements of these models.
contact@hkml-research.com
Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 64 / 64

More Related Content

What's hot

Machine Learning Platformization & AutoML: Adopting ML at Scale in the Enterp...
Machine Learning Platformization & AutoML: Adopting ML at Scale in the Enterp...Machine Learning Platformization & AutoML: Adopting ML at Scale in the Enterp...
Machine Learning Platformization & AutoML: Adopting ML at Scale in the Enterp...
Ed Fernandez
 
DataOps, MLOps, and DevOps.pdf
DataOps, MLOps, and DevOps.pdfDataOps, MLOps, and DevOps.pdf
DataOps, MLOps, and DevOps.pdf
Peace546228
 
Machine Learning Course | Edureka
Machine Learning Course | EdurekaMachine Learning Course | Edureka
Machine Learning Course | Edureka
Edureka!
 
A Data Modelling Framework to Unify Cyber Security Knowledge
A Data Modelling Framework to Unify Cyber Security KnowledgeA Data Modelling Framework to Unify Cyber Security Knowledge
A Data Modelling Framework to Unify Cyber Security Knowledge
Vaticle
 
Introduction on Data Science
Introduction on Data ScienceIntroduction on Data Science
Introduction on Data Science
Edureka!
 

What's hot (20)

Detection of fraud in financial blockchain-based transactions through big dat...
Detection of fraud in financial blockchain-based transactions through big dat...Detection of fraud in financial blockchain-based transactions through big dat...
Detection of fraud in financial blockchain-based transactions through big dat...
 
Machine Learning Platformization & AutoML: Adopting ML at Scale in the Enterp...
Machine Learning Platformization & AutoML: Adopting ML at Scale in the Enterp...Machine Learning Platformization & AutoML: Adopting ML at Scale in the Enterp...
Machine Learning Platformization & AutoML: Adopting ML at Scale in the Enterp...
 
Responsible AI in Industry: Practical Challenges and Lessons Learned
Responsible AI in Industry: Practical Challenges and Lessons LearnedResponsible AI in Industry: Practical Challenges and Lessons Learned
Responsible AI in Industry: Practical Challenges and Lessons Learned
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
How to Become a Data Scientist
How to Become a Data ScientistHow to Become a Data Scientist
How to Become a Data Scientist
 
DataOps, MLOps, and DevOps.pdf
DataOps, MLOps, and DevOps.pdfDataOps, MLOps, and DevOps.pdf
DataOps, MLOps, and DevOps.pdf
 
KNIME Data Science Learnathon: From Raw Data To Deployment
KNIME Data Science Learnathon: From Raw Data To DeploymentKNIME Data Science Learnathon: From Raw Data To Deployment
KNIME Data Science Learnathon: From Raw Data To Deployment
 
Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Big Data [sorry] & Data Science: What Does a Data Scientist Do?Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Big Data [sorry] & Data Science: What Does a Data Scientist Do?
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
 
Introduction to Big Data Analytics and Data Science
Introduction to Big Data Analytics and Data ScienceIntroduction to Big Data Analytics and Data Science
Introduction to Big Data Analytics and Data Science
 
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
 
Unified Approach to Interpret Machine Learning Model: SHAP + LIME
Unified Approach to Interpret Machine Learning Model: SHAP + LIMEUnified Approach to Interpret Machine Learning Model: SHAP + LIME
Unified Approach to Interpret Machine Learning Model: SHAP + LIME
 
Synthetic data generation
Synthetic data generationSynthetic data generation
Synthetic data generation
 
Machine Learning Course | Edureka
Machine Learning Course | EdurekaMachine Learning Course | Edureka
Machine Learning Course | Edureka
 
Data Science Project Lifecycle and Skill Set
Data Science Project Lifecycle and Skill SetData Science Project Lifecycle and Skill Set
Data Science Project Lifecycle and Skill Set
 
Clustering for Stream and Parallelism (DATA ANALYTICS)
Clustering for Stream and Parallelism (DATA ANALYTICS)Clustering for Stream and Parallelism (DATA ANALYTICS)
Clustering for Stream and Parallelism (DATA ANALYTICS)
 
Building and deploying LLM applications with Apache Airflow
Building and deploying LLM applications with Apache AirflowBuilding and deploying LLM applications with Apache Airflow
Building and deploying LLM applications with Apache Airflow
 
Anomaly Detection
Anomaly DetectionAnomaly Detection
Anomaly Detection
 
A Data Modelling Framework to Unify Cyber Security Knowledge
A Data Modelling Framework to Unify Cyber Security KnowledgeA Data Modelling Framework to Unify Cyber Security Knowledge
A Data Modelling Framework to Unify Cyber Security Knowledge
 
Introduction on Data Science
Introduction on Data ScienceIntroduction on Data Science
Introduction on Data Science
 

Similar to Generating Realistic Synthetic Data in Finance

AylorB_Unit10a1_ResearchPaper_FinalPoject
AylorB_Unit10a1_ResearchPaper_FinalPojectAylorB_Unit10a1_ResearchPaper_FinalPoject
AylorB_Unit10a1_ResearchPaper_FinalPoject
Renee Reagan
 

Similar to Generating Realistic Synthetic Data in Finance (20)

Applications of GANs in Finance
Applications of GANs in FinanceApplications of GANs in Finance
Applications of GANs in Finance
 
How deep generative models can help quants reduce the risk of overfitting?
How deep generative models can help quants reduce the risk of overfitting?How deep generative models can help quants reduce the risk of overfitting?
How deep generative models can help quants reduce the risk of overfitting?
 
Master Class: GANS with Applications in Synthetic Data Generation
Master Class:   GANS with  Applications in  Synthetic Data GenerationMaster Class:   GANS with  Applications in  Synthetic Data Generation
Master Class: GANS with Applications in Synthetic Data Generation
 
IoT digital disruption and new IoT business models
IoT digital disruption and new IoT business modelsIoT digital disruption and new IoT business models
IoT digital disruption and new IoT business models
 
MarkNtel_Global Digital Identity Management Market Analysis, 2020_Sample.pdf
MarkNtel_Global Digital Identity Management Market Analysis, 2020_Sample.pdfMarkNtel_Global Digital Identity Management Market Analysis, 2020_Sample.pdf
MarkNtel_Global Digital Identity Management Market Analysis, 2020_Sample.pdf
 
Global Digital Identity Management Market Analysis, 2020 - MarkNtel Advisors
Global Digital Identity Management Market Analysis, 2020 - MarkNtel AdvisorsGlobal Digital Identity Management Market Analysis, 2020 - MarkNtel Advisors
Global Digital Identity Management Market Analysis, 2020 - MarkNtel Advisors
 
How to Use FouAnalytics For Marketers
How to Use FouAnalytics   For MarketersHow to Use FouAnalytics   For Marketers
How to Use FouAnalytics For Marketers
 
B9MG121-Smurfit Kappaaaaaaaaaaaaaaaaaaaaa
B9MG121-Smurfit KappaaaaaaaaaaaaaaaaaaaaaB9MG121-Smurfit Kappaaaaaaaaaaaaaaaaaaaaa
B9MG121-Smurfit Kappaaaaaaaaaaaaaaaaaaaaa
 
Gartner Go to Market Strategy Assumptions
Gartner Go to Market Strategy AssumptionsGartner Go to Market Strategy Assumptions
Gartner Go to Market Strategy Assumptions
 
Industry 5.0 Revolution towards an Imminent Future Driven Society
Industry 5.0 Revolution towards an Imminent Future Driven SocietyIndustry 5.0 Revolution towards an Imminent Future Driven Society
Industry 5.0 Revolution towards an Imminent Future Driven Society
 
Ultimaker's Future Strategy
Ultimaker's Future StrategyUltimaker's Future Strategy
Ultimaker's Future Strategy
 
[Fórum 2020 - Global Edition] Dados e Al - A galinha dos ovos de ouro no E-Co...
[Fórum 2020 - Global Edition] Dados e Al - A galinha dos ovos de ouro no E-Co...[Fórum 2020 - Global Edition] Dados e Al - A galinha dos ovos de ouro no E-Co...
[Fórum 2020 - Global Edition] Dados e Al - A galinha dos ovos de ouro no E-Co...
 
Strategic market expansion path en
Strategic market expansion path enStrategic market expansion path en
Strategic market expansion path en
 
Refocus on the agile developer
Refocus on the agile developerRefocus on the agile developer
Refocus on the agile developer
 
AylorB_Unit10a1_ResearchPaper_FinalPoject
AylorB_Unit10a1_ResearchPaper_FinalPojectAylorB_Unit10a1_ResearchPaper_FinalPoject
AylorB_Unit10a1_ResearchPaper_FinalPoject
 
Big Data Pilot Days
Big Data Pilot DaysBig Data Pilot Days
Big Data Pilot Days
 
Artificial Intelligence In Marketing A Systematic Literature Review
Artificial Intelligence In Marketing  A Systematic Literature ReviewArtificial Intelligence In Marketing  A Systematic Literature Review
Artificial Intelligence In Marketing A Systematic Literature Review
 
Infinitech, BigDataStack & BDVA webinar 2020
Infinitech, BigDataStack & BDVA webinar 2020Infinitech, BigDataStack & BDVA webinar 2020
Infinitech, BigDataStack & BDVA webinar 2020
 
India 2047_Vision & Strategic Roadmap for Technical Textiles in India_KPMG (1...
India 2047_Vision & Strategic Roadmap for Technical Textiles in India_KPMG (1...India 2047_Vision & Strategic Roadmap for Technical Textiles in India_KPMG (1...
India 2047_Vision & Strategic Roadmap for Technical Textiles in India_KPMG (1...
 
Global pleated filters market(1)
Global pleated filters market(1)Global pleated filters market(1)
Global pleated filters market(1)
 

More from Gautier Marti

On clustering financial time series - A need for distances between dependent ...
On clustering financial time series - A need for distances between dependent ...On clustering financial time series - A need for distances between dependent ...
On clustering financial time series - A need for distances between dependent ...
Gautier Marti
 

More from Gautier Marti (19)

Using Large Language Models in 10 Lines of Code
Using Large Language Models in 10 Lines of CodeUsing Large Language Models in 10 Lines of Code
Using Large Language Models in 10 Lines of Code
 
What deep learning can bring to...
What deep learning can bring to...What deep learning can bring to...
What deep learning can bring to...
 
A quick demo of Top2Vec With application on 2020 10-K business descriptions
A quick demo of Top2Vec With application on 2020 10-K business descriptionsA quick demo of Top2Vec With application on 2020 10-K business descriptions
A quick demo of Top2Vec With application on 2020 10-K business descriptions
 
cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...
cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...
cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...
 
My recent attempts at using GANs for simulating realistic stocks returns
My recent attempts at using GANs for simulating realistic stocks returnsMy recent attempts at using GANs for simulating realistic stocks returns
My recent attempts at using GANs for simulating realistic stocks returns
 
Takeaways from ICML 2019, Long Beach, California
Takeaways from ICML 2019, Long Beach, CaliforniaTakeaways from ICML 2019, Long Beach, California
Takeaways from ICML 2019, Long Beach, California
 
A review of two decades of correlations, hierarchies, networks and clustering...
A review of two decades of correlations, hierarchies, networks and clustering...A review of two decades of correlations, hierarchies, networks and clustering...
A review of two decades of correlations, hierarchies, networks and clustering...
 
Autoregressive Convolutional Neural Networks for Asynchronous Time Series
Autoregressive Convolutional Neural Networks for Asynchronous Time SeriesAutoregressive Convolutional Neural Networks for Asynchronous Time Series
Autoregressive Convolutional Neural Networks for Asynchronous Time Series
 
Some contributions to the clustering of financial time series - Applications ...
Some contributions to the clustering of financial time series - Applications ...Some contributions to the clustering of financial time series - Applications ...
Some contributions to the clustering of financial time series - Applications ...
 
Clustering CDS: algorithms, distances, stability and convergence rates
Clustering CDS: algorithms, distances, stability and convergence ratesClustering CDS: algorithms, distances, stability and convergence rates
Clustering CDS: algorithms, distances, stability and convergence rates
 
Clustering Financial Time Series using their Correlations and their Distribut...
Clustering Financial Time Series using their Correlations and their Distribut...Clustering Financial Time Series using their Correlations and their Distribut...
Clustering Financial Time Series using their Correlations and their Distribut...
 
A closer look at correlations
A closer look at correlationsA closer look at correlations
A closer look at correlations
 
Clustering Financial Time Series: How Long is Enough?
Clustering Financial Time Series: How Long is Enough?Clustering Financial Time Series: How Long is Enough?
Clustering Financial Time Series: How Long is Enough?
 
Optimal Transport vs. Fisher-Rao distance between Copulas
Optimal Transport vs. Fisher-Rao distance between CopulasOptimal Transport vs. Fisher-Rao distance between Copulas
Optimal Transport vs. Fisher-Rao distance between Copulas
 
On Clustering Financial Time Series - Beyond Correlation
On Clustering Financial Time Series - Beyond CorrelationOn Clustering Financial Time Series - Beyond Correlation
On Clustering Financial Time Series - Beyond Correlation
 
Optimal Transport between Copulas for Clustering Time Series
Optimal Transport between Copulas for Clustering Time SeriesOptimal Transport between Copulas for Clustering Time Series
Optimal Transport between Copulas for Clustering Time Series
 
On the stability of clustering financial time series
On the stability of clustering financial time seriesOn the stability of clustering financial time series
On the stability of clustering financial time series
 
Clustering Random Walk Time Series
Clustering Random Walk Time SeriesClustering Random Walk Time Series
Clustering Random Walk Time Series
 
On clustering financial time series - A need for distances between dependent ...
On clustering financial time series - A need for distances between dependent ...On clustering financial time series - A need for distances between dependent ...
On clustering financial time series - A need for distances between dependent ...
 

Recently uploaded

Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
amitlee9823
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
amitlee9823
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
JoseMangaJr1
 
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
karishmasinghjnh
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men 🔝Thrissur🔝 Escor...
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men  🔝Thrissur🔝   Escor...➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men  🔝Thrissur🔝   Escor...
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men 🔝Thrissur🔝 Escor...
amitlee9823
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
amitlee9823
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 

Recently uploaded (20)

Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men 🔝Thrissur🔝 Escor...
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men  🔝Thrissur🔝   Escor...➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men  🔝Thrissur🔝   Escor...
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men 🔝Thrissur🔝 Escor...
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
 
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
 

Generating Realistic Synthetic Data in Finance

  • 1. Generating Realistic Synthetic Data in Finance Applications of GANs Talk at IHS Markit Webinar Gautier Marti HKML Research 15 October 2020 Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 1 / 64
  • 2. Table of contents 1 Introduction 2 GANs explained GANs Milestones & Major Achievements How do GANs work? 3 Applications of GANs in Finance Applications for quants Generating Synthetic Datasets to Avoid Strategy Overfitting Generating Alternative Realistic Historical Paths for Risk Estimation Applications for cloud computing providers Training Machine Learning Models in the Cloud on Synthetic Data Applications for data vendors A Larger Data Market: Synthetic Datasets, A New Product Applications for auditors and regulators Deepfakes of Financial Statements and Tools to Find Them Current State of the Art and Limitations 4 Conclusion and Questions? Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 2 / 64
  • 3. Section 1 Introduction Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 3 / 64
  • 4. Generative Adversarial Network’s Definition & Goal Definition: Generative Adversarial Network (GAN) A generative adversarial network (GAN) is a class of machine learning frame- works where two neural networks (usually known as the generator and the discriminator) contest with each other in a zero-sum game (i.e. one agent’s gain is another agent’s loss). Main purpose of a GAN: Generating realistic synthetic data Given a training set, this technique learns to generate new data with the same statistics as the training set. Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 4 / 64
  • 5. Introduction (3 min. video) https://www.youtube.com/watch?v=97B8tuHwLY0 Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 5 / 64
  • 6. Scope of the presentation In this talk, we will focus on doing a review of GANs success so far (mostly image generation), explaining how Generative Adversarial Networks (GANs) work. Then, we will present applications of these models for generating finance-related data, and the associated ’business’ use cases. Finally, we shall briefly discuss the current limitations and challenges to be overcome for a broader adoption of these models in the industry. Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 6 / 64
  • 7. Section 2 GANs explained Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 7 / 64
  • 8. Subsection 1 GANs Milestones & Major Achievements Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 8 / 64
  • 9. GANs: A relatively new Deep Learning model (2014) Goodfellow, Ian, et al. “Generative adversarial nets.” Advances in neural information processing systems. 2014. Cited by 22354 papers as of 18 September 2020. Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 9 / 64
  • 10. Text to Photo-realistic Image Synthesis (2016) https://arxiv.org/pdf/1612.03242.pdf Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 10 / 64
  • 11. Deepfakes (2017) Ctrl Shift Face (YouTube channel) https://www.youtube.com/channel/UCKpH0CKltc73e4wh0_pgL3g You Won’t Believe What Obama Says In This Video! https://www.youtube.com/watch?v=cQ54GDm1eL0 Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 11 / 64
  • 12. Edmond de Belamy (2018) Created by 3 students First artwork created using Artificial Intelligence to be featured in a Christie’s auction Sold for USD 432,500 Signed minG maxD Ex [log(D(x))] + Ez [log(1 − D(G(z)))] In French, “bel ami” means “good fellow”, a pun-tribute to Ian Goodfellow, the creator of GANs Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 12 / 64
  • 13. StyleGAN (2019) & StyleGAN2 (2020) GANs can generate realistic fake human faces: Give it a try: https://thispersondoesnotexist.com/ NVIDIA StyleGAN paper: https://arxiv.org/pdf/1812.04948.pdf NVIDIA StyleGAN2 paper: https://arxiv.org/pdf/1912.04958.pdf Remark. In December 2019, Facebook took down a network of accounts with false identities, and mentioned that some of them had used profile pictures created with artificial intelligence. Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 13 / 64
  • 14. Speech2Face & Wav2Pix (2019) GANs conditioned on speech of a person can output a realistic face with its correct gender, ethnicity, and approximate age: Relevant papers: Wav2Pix: https://arxiv.org/pdf/1903.10195.pdf Speech2Face: https://arxiv.org/pdf/1905.09773.pdf Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 14 / 64
  • 15. GameGAN (2020) GameGAN is able to learn Pac-Man dynamics and produce a visually consistent simulation of the game: NVIDIA paper https://arxiv.org/pdf/2005.12126.pdf https://www.youtube.com/watch?v=4OzJUNsPx60 Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 15 / 64
  • 16. Subsection 2 How do GANs work? Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 16 / 64
  • 17. A GAN basic architecture Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 17 / 64
  • 18. GAN training – step by step tutorial Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 18 / 64
  • 19. GAN training – step by step tutorial Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 19 / 64
  • 20. GAN training – step by step tutorial Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 20 / 64
  • 21. GAN training – step by step tutorial Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 21 / 64
  • 22. GAN training – step by step tutorial Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 22 / 64
  • 23. GAN training – step by step tutorial Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 23 / 64
  • 24. GAN training – step by step tutorial Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 24 / 64
  • 25. GAN training – step by step tutorial Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 25 / 64
  • 26. GAN training – step by step tutorial Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 26 / 64
  • 27. GAN training – step by step tutorial Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 27 / 64
  • 28. GAN training – step by step tutorial Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 28 / 64
  • 29. GAN training – step by step tutorial Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 29 / 64
  • 30. GAN training – step by step tutorial Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 30 / 64
  • 31. GAN training – step by step tutorial Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 31 / 64
  • 32. GAN training – step by step tutorial Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 32 / 64
  • 33. GAN training – step by step tutorial Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 33 / 64
  • 34. GAN training – step by step tutorial Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 34 / 64
  • 35. GAN training – step by step tutorial Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 35 / 64
  • 36. GAN training – step by step tutorial Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 36 / 64
  • 37. GAN training – step by step tutorial Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 37 / 64
  • 38. GAN training – step by step tutorial Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 38 / 64
  • 39. GAN training – step by step tutorial Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 39 / 64
  • 40. Conditional Generative Adversarial Nets Seminal paper: https://arxiv.org/pdf/1411.1784.pdf Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 40 / 64
  • 41. Conditional Generative Adversarial Nets Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 41 / 64
  • 42. Section 3 Applications of GANs in Finance Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 42 / 64
  • 43. Subsection 1 Applications for quants Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 43 / 64
  • 44. Strategy Overfitting Scenario In September 2020, a naive or unscrupulous strat presents to his manager a new strategy: The strategy would have initiated a massive short in February 2020, and would have bet big on a rally starting late March 2020. The strat sells the strategy to his manager as able to pick-up early signals of market sell-offs and bounce-backs thanks to advanced machine learning and alternative data (obviously, what else?). The manager, excited, cannot wait but to deploy capital to it. How to fight against this industry-wide fallacy? Well-thought and carefully designed incentives schemes (e.g. reward a sound research framework rather than impressive in-sample backtests) Counterfactual thinking, realistic simulations of alternative paths Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 44 / 64
  • 45. Strategy Overfitting – Simulations of Alternative Paths (1) Some alternative data: Second-hand car market in Hong Kong A tabular dataset X Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 45 / 64
  • 46. Strategy Overfitting – Simulations of Alternative Paths (2) econ quant idea: When inventory builds up (more people are selling their cars than buying), then the HSI plummets over the next quarter. math quant idea: My black-box ML algorithm predicts that HSI decreases whenever two Ferraris are sold right after a Bugatti Veyron comes to market, based on the given dataset. Using GANs, we can sample new datasets ˆX ∼ X, or even ( ˆX, ˆy) ∼ (X, y), where y can be the HSI performance or any variable we aim at predicting (e.g. market+industry residualized returns of the respective carmakers). Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 46 / 64
  • 47. Strategy Overfitting – Simulations of Alternative Paths (3) Method 1: Strategy stability Use a GAN to learn the distribution (X, y) of the tabular data (cf. https://arxiv.org/pdf/1907.00503.pdf) From the calibrated GAN, generate ˆX(n) , ˆy(n) N n=1 synthetic datasets Backtest the strategy on the N datasets, and collect perf. metrics Analyze the perf. metrics; Discard the strategy if not stable. Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 47 / 64
  • 48. Strategy Overfitting – Simulations of Alternative Paths (4) In Koshiyama et al. (https://arxiv.org/pdf/1901.01751.pdf): Generate N synthetic datasets: Method 2: Strategy fine-tuning Find the parameters which maximize the average perf. over the N datasets Method 2bis: Strategy combination (ensemble models) Fit a model on each of the N datasets Predict using an ensemble of the N trained models GAN vs. Stationary Bootstrap (for generating the N synthetic datasets) Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 48 / 64
  • 49. Risk-based Portfolio Allocation (1) Scenario You want to systematically allocate capital to your strategies. Literature claims that such or such method works better than all others by providing a dubious backtest. When you horse race the various allocation methods, results are not stable across periods and universes. How can we conclude anything useful? Realistic simulations! Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 49 / 64
  • 50. Risk-based Portfolio Allocation (2) Method 3: Testing Portfolio Allocation on Alternative Historical Paths Use a GAN to generate realistic synthetic correlation matrices (cf. https://arxiv.org/pdf/1910.09504.pdf) Generate time series verifying the correlation structure Estimate portfolio weights (in-sample), measure out-of-sample risk Analyze performance of the portfolio allocation methods Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 50 / 64
  • 51. Risk-based Portfolio Allocation (3) Question: Can we predict when a method (say the Hierarchical Risk Parity (HRP)) outperforms another one (say naive risk parity)? And why? Method 4: Understanding the outperformance of a method over another Extract features from the underlying correlation matrix Fit a ML model features → outperformance (using train, validation, test datasets) Verify if there is any good predictability Explain the predictions as a function of features Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 51 / 64
  • 52. Risk-based Portfolio Allocation (4) Concrete example: HRP vs. naive risk parity High values for the cophenetic correlation coefficient are characteristic of a strong hierarchical structure. Thus, HRP outperforms naive risk parity when the underlying DGP has a strong hierarchical structure (nested clusters). cf. Jochen Papenbrock recent work for similar applications of Explainable AI (XAI) in finance (https://firamis.de/) Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 52 / 64
  • 53. Subsection 2 Applications for cloud computing providers Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 53 / 64
  • 54. Training Machine Learning Models in the Cloud on Synthetic Data Scenario Your company forbids the transfer of sensitive data (e.g. trades & positions) to the Cloud It would be more relevant and cost-effective to train large and recent ML models in the Cloud (e.g. Amazon SageMaker) Method 5: Training in the Cloud on synthetic data Generate anonymized synthetic versions of the sensitive data Send the GAN-generated non-sensitive synthetic datasets to the Cloud Train Machine Learning models on these datasets in the Cloud Download the Machine Learning models on premise Fine-tune and apply the ML models on the original data Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 54 / 64
  • 55. Subsection 3 Applications for data vendors Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 55 / 64
  • 56. A Larger Data Market: Synthetic Datasets, A New Product As a data vendor: Use Case 1 You may gather interests from research labs and startups which cannot afford the price tag a hedge fund can for a dataset You cannot sell them the original premium dataset at a hard discount But you could sell them anonymized synthetic datasets based on the original one at a fraction of the price In some cases, realistic synthetic datasets may be sufficient, e.g. a researcher studying the structural properties of a supply-chain network rather than trying to predict markets from it cf. this paper https://arxiv.org/pdf/2002.02271.pdf (Feb 2020) from American Express researchers =⇒ A broader and more diverse base of clients Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 56 / 64
  • 57. A Larger Data Market: Synthetic Datasets, A New Product As a data vendor, you should be aware that: Use Case 2 Quantitative trading firms are afraid of over-fitting Besides the original dataset, they may be interested in buying realistic synthetic versions of it: 1 Original dataset will be used in production for trading 2 Realistic synthetic versions can be used at the research stage Managers can distribute the synthetic datasets to their strat quants, then they can check for consistent results across the synthetic datasets and the original one (cf. Methods 1, 2 and 2bis previously discussed). =⇒ A new product offered at a premium on top of the original dataset Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 57 / 64
  • 58. A Larger Data Market: Synthetic Datasets, A New Product As a data vendor, you know that: Use Case 3 Getting the client’s legal and compliance departments approval can be a long process, even for a simple trial In some cases, it can result in the loss of business to a competitor So that the prospect has a quick first overview, you may be able to send over a synthetic dataset. This should not raise the scrutiny of legal (e.g. no contractual terms to check) or compliance (e.g. no material non-public information in anonymized synthetic data by construction). =⇒ Easier to maintain customer engagement and pitch new datasets Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 58 / 64
  • 59. Subsection 4 Applications for auditors and regulators Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 59 / 64
  • 60. Deepfakes of Financial Statements and Tools to Find Them An an auditor: Scenario You have to assess journal entries comprising millions of transactions You use ’Computer Assisted Audit Techniques’ which range from rule-based tests designed according to past frauds to basic statistical methods for detecting accounting anomalies Fraudsters may adapt deepfakes to business accounting cf. this paper https://arxiv.org/pdf/1910.03810.pdf (Oct 2019) about deepfakes in accounting =⇒ Auditors and regulators should learn the techniques to uncover these special types of deepfakes. Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 60 / 64
  • 61. Subsection 5 Current State of the Art and Limitations Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 61 / 64
  • 62. Current State of the Art and Limitations This discussion is by nature technical, but we can highlight the following: GANs can generate realistic tabular datasets (2019), but models trained on synthetic data only were shown inferior to the ones trained on the original data GANs can offer a high degree of anonymization, but not all of them are built to be differentially private meaning they might leak information about the original data We know how to GAN-generate realistic synthetic financial time series (e.g. S&P 500 returns); We know how to GAN-generate realistic synthetic financial correlation matrices (of the S&P 500 constituents, for example); However, we do not know yet how to GAN-generate realistic multivariate financial time series, i.e. verifying both the time series and the cross-sectional stylized facts Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 62 / 64
  • 63. Section 4 Conclusion and Questions? Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 63 / 64
  • 64. Conclusion and Questions? Deep generative models and GANs in particular are an exciting new technology. In finance, they are actively researched in a few places but results are not widely advertised. We have to rely on top tech companies and academic labs to drive the fundamental understanding and improvements of these models. contact@hkml-research.com Gautier Marti (HKML Research) Applications of GANs in Finance 15 October 2020 64 / 64