Takeaways from ICML 2019, Long Beach, California

Takeaways from ICML 2019
Hong Kong Machine Learning Meetup
Season 1 Episode 12 – Season Finale
Gautier Marti
Gautier Marti (Shell Street Labs) Takeaways from ICML 2019 1 / 41

Table of contents
1 Day 1 - Tutorials
2 Day 2 - U.S. Census, Time Series, Hawkes Processes, Shapley values,
Topological Data Analysis, Optimal Transport for Graphs
3 Day 3 - Robotics, Gaussian Processes, Learning with noisy labels
4 Day 4 - Interpretability, Natural Language Processing
5 Day 5 - Workshop Time Series

Disclaimer
I was sent to the ICML 2019 conference by my employer Shell Street Labs.
However, the opinions expressed in this presentation are my own and do
not reﬂect in any ways the view of my employer.

Section 1
Day 1 - Tutorials

Subsection 1
Safe Machine Learning

Safe Machine Learning

COMPAS - An example of ML biases & misspecification
We show, however, that the widely used commercial risk
assessment software COMPAS is no more accurate or fair than
predictions made by people with little or no criminal justice
expertise.
The accuracy, fairness, and limits of predicting recidivism
https://advances.sciencemag.org/content/4/1/eaao5580
Our analysis of Northpointe’s tool, called COMPAS (which
stands for Correctional Offender Management Profiling for
Alternative Sanctions), found that black defendants were far
more likely than white defendants to be incorrectly judged to be
at a higher risk of recidivism, while white defendants were more
likely than black defendants to be incorrectly flagged as low risk.
How We Analyzed the COMPAS Recidivism Algorithm
https://www.propublica.org/article/
how-we-analyzed-the-compas-recidivism-algorithm
GitHub: https://github.com/propublica/compas-analysis

RL in the Wild - Other examples of misspeciﬁcations
Reinforcement learning algorithms can break in surprising,
counterintuitive ways.
Faulty Reward Functions in the Wild
https://openai.com/blog/faulty-reward-functions/

Robustness - Adversarial attacks
Robust Physical-World Attacks on
Deep Learning Visual Classiﬁca-
tion
https://arxiv.org/pdf/
1707.08945.pdf
Fooling automated surveillance
cameras: adversarial patches to
attack person detection
https://arxiv.org/pdf/
1904.08653.pdf

Subsection 2
Active Learning: From Theory to Practice

Active Learning: From Theory to Practice
Slides for the tutorial: http://nowak.ece.wisc.edu/ActiveML.html
Active Learning tries to answer the question:
Can we train machines with less labeled data and less human
supervision?

Active Learning

Rethinking classical model generalization

Subsection 3
A Tutorial on Attention in Deep Learning

A Tutorial on Attention in Deep Learning
Slides: http://alex.smola.org/talks/ICML19-attention.pdf
https://www.d2l.ai/
https://github.com/d2l-ai

Section 2
Day 2 - U.S. Census, Time Series, Hawkes Processes,
Shapley values, Topological Data Analysis, Optimal
Transport for Graphs

Subsection 1
Diﬀerential privacy

U.S. Census & Diﬀerential privacy
Good related read: https://www.sciencemag.org/news/2019/01/
can-set-equations-keep-us-census-data-private

Subsection 2
Time Series

Deep Factors for Forecasting
https://arxiv.org/pdf/1905.12417.pdf

Hawkes Processes
ICML 2018 tutorial: http://learning.mpi-sws.org/tpp-icml18/
http://proceedings.mlr.press/v97/trouleau19a/trouleau19a.pdf
cf. tick for practitioners: https://github.com/X-DataInitiative/tick

Subsection 3
Shapley values: Explainability & Data Valuation

Shapley values
A new trend in ML based on:
Shapley, Lloyd S. A value for n-person games. Contributions to the
Theory of Games 2.28 (1953). 158 citations.
φi (v) =
S⊆N{i}
|S|!(N − |S| − 1)!
N!
(v(S ∪ {i}) − v(S))
Recent applications:
Explainability: A Uniﬁed Approach to Interpreting Model Predictions
http://papers.nips.cc/paper/
7062-a-unified-approach-to-interpreting-model-predictions.
pdf
Data valuation: Data Shapley: Equitable Valuation of Data for
Machine Learning
http://proceedings.mlr.press/v97/ghorbani19c/ghorbani19c.pdf
Since computationally intensive, many papers try to approximate these
values fast...

Subsection 4
Topological Data Analysis

Topological Data Analysis
Already a tutorial at ICML 2013: Topological Data Analysis and Machine
Learning http://www2.stat.duke.edu/~sayan/Primoz/ICML.pdf
A round-up of TDA papers at ICML 2019:
https://bastian.rieck.me/blog/posts/2019/icml_tda_roundup/
(6 TDA-related papers)
Take-home message: TDA can provide robust features to Machine
Learning models.

Subsection 5
Optimal Transport

Optimal Transport
Trending in the ML community (at least 5 ICML 2019 papers) since
Cuturi’s 2013 NIPS paper: Sinkhorn Distances: Lightspeed Computation
of Optimal Transport, which made Optimal Transport for Machine
Learning possible in practice. Way too slow before!
Theoretical contributions:
On Eﬃcient Optimal Transport: An Analysis of Greedy and
Accelerated Mirror Descent Algorithms
Methodology:
Optimal Transport for structured data with application on graphs

Section 3
Day 3 - Robotics, Gaussian Processes, Learning with
noisy labels

Subsection 1
Machine Learning with Application to Robotics

Machine Learning with Application to Robotics
Solving complex PDEs and other stochastic control problems in real time
is not really feasible as of today. Machine Learning (supervised learning of
trajectories, learning from demonstration, etc.) can help robotics.
Recorded talk:
https://www.facebook.com/icml.imls/videos/2368059266588651/
Lab: http://lasa.epfl.ch/

Subsection 2
Gaussian Processes

Gaussian Processes
One ﬂagship project: The Automatic Statistician
https://www.automaticstatistician.com/index/
Discovering Latent Covariance Structures for Multiple Time Series

Subsection 3
Labels. . .

Labels. . .
Learning Dependency Structures for Weak Supervision Models

Section 4
Day 4 - Interpretability, Natural Language Processing

Subsection 1
Interpretability

Interpretability
Towards a Deep and Uniﬁed Understanding of Deep Neural Models in NLP
http://proceedings.mlr.press/v97/guan19a/guan19a.pdf
Explaining Deep Neural Networks with a Polynomial Time Algorithm for
Shapley Values Approximation
http://proceedings.mlr.press/v97/ancona19a/ancona19a.pdf
https://icml.cc/media/Slides/icml/2019/grandball(13-09-00)
-13-09-25-4776-explaining_deep.pdf

Subsection 2
Natural Language Processing

Natural Language Processing
MeanSum: A Neural Model for Unsupervised Multi-Document
Abstractive Summarization
https://icml.cc/media/Slides/icml/2019/104(13-11-00)
-13-12-10-4891-meansum_a_neur.pdf

Section 5
Day 5 - Workshop Time Series

GluonTS
GluonTS: Probabilistic Time Series Models in Python
https://github.com/awslabs/gluon-ts

Takeaways from ICML 2019, Long Beach, California

Recommended

Recommended

More Related Content

Similar to Takeaways from ICML 2019, Long Beach, California

Similar to Takeaways from ICML 2019, Long Beach, California (17)

More from Gautier Marti

More from Gautier Marti (15)

Recently uploaded

Recently uploaded (20)

Takeaways from ICML 2019, Long Beach, California