Learning from Unpaired Data

2020/12/14
1
Learning with Unpaired Data
Jiebo Luo
University of Rochester
Keynote at ICMLA 2020
December 14, 2020
Introduction
● The recent successes by deep learning have been largely driven by
annotated big data

2020/12/14
2
Introduction
● Learning from more or less data
Learning from
Unpaired Data
Introduction
● The mapping from an input to a structured output is desired in ML
French→English
Une fusillade a eu lieu a`
l’aeroport international de
Los ´ Angeles.
There was a shooting in Los
Angeles International Airport.
Horse→Zebra Image→Caption
a black and white cat sitting
in a bathroom sink

2020/12/14
3
Introduction
● The mapping from an input to a structured output is desired in ML & applications
Image Enhancement Image Dehazing Artifact Reduction
Introduction
● Usually, paired training data are collected to train such a mapping model
● Collecting paired data is expensive
Collecting paired data Possible Expensive
Machine Translation ✅ 💲
Horse→Zebra ❌
Image Captioning ✅ 💲
Image Enhancement ✅ 💲💲
Image Dehazing ❓ 💲💲💲
Artifact Reduction ❓ 💲💲💲

2020/12/14
4
Introduction
● Unpaired data are much easier to collect
○ No annotation needed, or
○ Light-weight annotation
Introduction
● The new challenge: how to align the unpaired data
○ Reconstruction loss
○ GAN loss
Domain 1 Domain 2

2020/12/14
5
Introduction
● Going into details
○ Machine Translation
○ Horse→Zebra
○ Image Captioning
○ Image Dehazing
○ Artifact Reduction
Unsupervised Neural Machine Translation (NMT)
● Key elements
○ Dual structure (same model for L1-L2 and L2-L1)
○ Shared encoder (same for L1/L2)
○ Fixed embeddings in the encoder (pre-trained cross-lingual embeddings vs. random initialization)
Mikel Artetxe, Gorka Labaka, Eneko Agirre, and Kyunghyun Cho. "Unsupervised neural machine translation." arXiv preprint arXiv:1710.11041
(2017).

2020/12/14
6
On-the-fly back-translation
Guillaume Lample, Alexis Conneau, Ludovic Denoyer, and Marc'Aurelio Ranzato. "Unsupervised machine translation using monolingual corpora
only." arXiv preprint arXiv:1711.00043 (2017).
Results
● BLEU scores on newstest2014
Mikel Artetxe, Gorka Labaka, Eneko Agirre, and Kyunghyun Cho. "Unsupervised neural machine translation." arXiv preprint arXiv:1710.11041
(2017).

2020/12/14
7
Cycle GAN
● It is (nearly) impossible to collect paired images in some cases
Jun-Yan Zhu, Taesung Park, Phillip Isola, and Alexei A. Efros. "Unpaired image-to-image translation using cycle-consistent adversarial
networks." In Proceedings of the IEEE international conference on computer vision, pp. 2223-2232. 2017.
Cycle GAN

2020/12/14
8
Cycle GAN
● Adversarial Loss
● Cycle Consistency Loss
● Full Objective
Cycle GAN

2020/12/14
9
Unsupervised Image Captioning
a. Supervised
b. Novel object
c. Domain adaptation
d. Pivot
e. Semi-supervised
f. Unsupervised
Yang Feng, Lin Ma, Wei Liu, and Jiebo Luo. "Unsupervised image captioning." In Proceedings of the IEEE conference on computer vision and
pattern recognition, pp. 4125-4134. 2019.

2020/12/14
10
● Images are from MSCOCO dataset
● We crawl a large-scale image description corpus consisting of 2 million
natural sentences from Shutterstock to facilitate unsupervised image
captioning
● Object detectors trained on the OpenImage dataset are used

2020/12/14
11
● Results
● Results

2020/12/14
12
Image Dehazing
Atmosphere scattering model:
I(x) is the observed hazy image, J(x) is the scene radiance (haze-free image),
t(x) is the medium transmission map, A is the global atmospheric light.
Goal: recover J(x), t(x) and A from I(x).
Xitong Yang, Zheng Xu, and Jiebo Luo. "Towards Perceptual Image Dehazing by Physics-Based Disentanglement and Adversarial Training." In
AAAI, pp. 7485-7492. 2018.
Image Dehazing
● Uses synthetic hazy image pairs in
previous methods
● Generates realistic haze-free images
using only unpaired supervision in the
proposed method
AAAI, pp. 7485-7492. 2018.

2020/12/14
13
Image Dehazing
● Synthetic datasets
○ D-HAZY dataset
○ NYU-Depth dataset
● Real image datasets
○ HazyCity dataset
○ Crawled photos of Beijing
○ Annotating hazy or not
○ 845 natural hazy images
○ 1891 haze-free images
AAAI, pp. 7485-7492. 2018.
Image Dehazing
● Results
AAAI, pp. 7485-7492. 2018.

2020/12/14
14
Image Dehazing
● Results
AAAI, pp. 7485-7492. 2018.
Artifact Reduction
Haofu Liao, Wei-An Lin, S. Kevin Zhou, and Jiebo Luo. "ADN: Artifact disentanglement network for unsupervised metal artifact reduction." IEEE
Transactions on Medical Imaging 39, no. 3 (2019): 634-643.
Model
LossUnpaired
clinical data
● Existing models require paired data
for training, which are synthesized
● When applied to clinical data,
models trained on synthesized data
generalize poorly
● We aim to learn metal artifact
reduction (MAR) directly from
clinical data without using paired
data
input output
artifact-free

2020/12/14
15
Artifact Disentanglement
decodingencoding
artifact reduced!
: artifact space
: content space
with artifact
bones, soft tissues,
lesions, implants, etc.
dark shadings,
streaks, noises, etc.
Artifact Disentanglement Network

2020/12/14
16
content
encoder
content
decoder
Adversarial
Loss
artifact
reduced
with
artifact
without
artifact
content
encoder
artifact
encoder
artifact &
content
decoder
Adversarial
Loss
with artifact
artifact
transferred
without
artifact
with
artifact

2020/12/14
17
artifact &
content
decoder
self-
reconstruction
Reconstruction
Loss (L1)
artifact
encoder
content
encoder
with
artifact
with
artifact
content
encoder
content
decoder
self-
reconstruction
Reconstruction
Loss (L1)
without
artifact
without
artifact

2020/12/14
18
-
-
Artifact Consistency
Loss (L1)
artifact
reduced
Self-reduction
Loss (L1)
artifact
transferred
without
artifact
with
artifact
without
artifact

2020/12/14
19
Quantitative Results on Synthetic Data
● Results
Qualitative Results on Clinical Data

2020/12/14
20
Conclusions
● Learning with unpaired data can greatly reduce the effort in collecting data.
● Some level of explicit disentanglement of compounding components is the
key to improved performance beyond using cycle consistency.
● For machine translation and image captioning, the performance of unpaired
method is still lower than fully supervised methods.
● Compared with data synthesis, unpaired learning can achieve promising
results on many machine learning tasks.
Open Issues and Future Directions
● Learning with the combination of unpaired data and small paired data to
achieve better performance
● Going beyond text and image modalities, for example, to video and sound
modalities

2020/12/14
21
Acknowledgments
Yang Feng (Google Cloud) Haofu Liao (AWS)
https://www.yangfeng.name http://liaohaofu.com
Visual Intelligence and Social Multimedia Analytics

Learning from Unpaired Data

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Learning from Unpaired Data

Similar to Learning from Unpaired Data (20)

More from Goergen Institute for Data Science

More from Goergen Institute for Data Science (14)

Recently uploaded

Recently uploaded (20)

Learning from Unpaired Data