The document appears to be slides from DeNA TechCon 2020. It discusses several topics relating to AI and computer vision, including DRIVE CHART which is DeNA's AI platform, work on object detection, optical flow estimation, and the use of neural networks like CNNs and RNNs in areas like computer vision. It also references prior work on generative adversarial networks and improving discriminators and generators in these networks.
3. DeNA TechCon 2020
#denatechcon
(CV: Computer Vision)
2.2. Prior art
Much of the work on GAN architectures has focused on improving the discriminator by, e.g., using multiple discriminators [18, 47, 11], multiresolution discrimination [60, 55], or self-attention [63]. The work on the generator side has mostly focused on the exact distribution in the input latent space [5] or shaping the input latent space via Gaussian mixture models [4], clustering [48], or encouraging convexity [52].
Recent conditional generators feed the class identifier through a separate embedding network to a large number of layers in the generator [46], while the latent is still provided through the input layer. A few authors have considered feeding parts of the latent code to multiple generator layers [9, 5]. In parallel work, Chen et al. [6] "self modulate" the generator using AdaINs, similarly to our work, but do not consider an intermediate latent space or noise inputs.
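The AdaIN ("adaptive instance normalization") operation referenced above can be sketched minimally as follows; the function name and NumPy formulation are illustrative assumptions, not the implementation from any of the cited papers:

```python
import numpy as np

def adain(x, style_scale, style_bias, eps=1e-5):
    """Adaptive instance normalization: normalize each channel of each
    instance's feature map to zero mean and unit variance, then apply a
    style-dependent scale and bias (e.g. predicted from a latent code).

    x: feature maps of shape (N, C, H, W)
    style_scale, style_bias: per-instance, per-channel parameters of shape (N, C)
    """
    mean = x.mean(axis=(2, 3), keepdims=True)   # per-instance, per-channel mean
    std = x.std(axis=(2, 3), keepdims=True)     # per-instance, per-channel std
    normalized = (x - mean) / (std + eps)
    return style_scale[:, :, None, None] * normalized + style_bias[:, :, None, None]
```

After this operation, each channel's statistics are controlled entirely by the style parameters, which is what lets a style/latent code "modulate" the generator at every layer it feeds.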
*Karras et al., "A Style-Based Generator Architecture for Generative Adversarial Networks," in Proc. of CVPR, 2019.
Figure 1: System configuration for data collection (screen, camera, and head coordinate systems).
LCD monitor, and these cameras capture images in a synchronized manner via a software trigger controlled by the host computer. Intrinsic and extrinsic camera parameters are calibrated beforehand, and the 3D position of the monitor
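Calibrated intrinsic and extrinsic parameters are what let the system map 3D points into each camera's image. A minimal pinhole-projection sketch (illustrative only; the paper's actual calibration pipeline is not shown here):

```python
import numpy as np

def project(point_3d, K, R, t):
    """Project a 3D world point into pixel coordinates using an intrinsic
    matrix K and extrinsic rotation/translation (R, t), as in a calibrated
    multi-camera rig."""
    p_cam = R @ np.asarray(point_3d, dtype=float) + t  # world -> camera frame
    uvw = K @ p_cam                                    # homogeneous pixel coords
    return uvw[:2] / uvw[2]                            # perspective divide

# Example: a camera at the world origin looking down +Z,
# focal length 500 px, principal point (320, 240).
K = np.array([[500.0, 0.0, 320.0],
              [0.0, 500.0, 240.0],
              [0.0, 0.0, 1.0]])
R, t = np.eye(3), np.zeros(3)
```

A point on the optical axis, e.g. (0, 0, 2), projects to the principal point (320, 240) under these parameters.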
Figure 2: Definition of head pose. The head coordinate system is defined based on a triangle connecting three midpoints of the eyes and mouth (midpoints of 3D facial landmarks).
poses of the subjects. As illustrated in Fig. 2, the head coordinate
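Constructing an orthonormal head coordinate frame from the triangle of three facial midpoints can be sketched as follows. The point names and axis conventions below are assumptions for illustration; the paper defines the frame from this triangle but its exact axis choices are not reproduced here:

```python
import numpy as np

def head_coordinate_frame(p_right_eye, p_left_eye, p_mouth):
    """Build a right-handed orthonormal frame from three 3D midpoints
    (right eye, left eye, mouth). Returns a 3x3 rotation matrix whose
    columns are the x, y, z axes, plus the triangle centroid as origin."""
    p_r, p_l, p_m = (np.asarray(p, dtype=float)
                     for p in (p_right_eye, p_left_eye, p_mouth))
    x = p_l - p_r                       # x-axis: from right eye to left eye
    x /= np.linalg.norm(x)
    z = np.cross(x, p_m - p_r)          # z-axis: normal of the triangle plane
    z /= np.linalg.norm(z)
    y = np.cross(z, x)                  # y-axis completes the right-handed frame
    origin = (p_r + p_l + p_m) / 3.0
    return np.stack([x, y, z], axis=1), origin
```

The resulting matrix is a proper rotation (orthonormal columns, determinant +1), so it can directly serve as the head pose rotation relative to the camera coordinate system.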
Y. Sugano et al., "Learning-by-Synthesis for Appearance-Based 3D Gaze Estimation," in Proc. of CVPR, 2014.
A. Bulling et al., "Wearable EOG goggles: Eye-based interaction in everyday environments," in Proc. of CHI, 2009.