Predict Molecular Properties with Graph Neural Networks
1. Towards Predicting Molecular Properties with Graph Neural Networks
Shion HONDA
The Graduate School of Information Science and Technology,
The University of Tokyo
@ National Taiwan University
2. Contents
• Basics of molecular property prediction
• Basics of graph theory
• Introduction to graph convolution
• Graph convolutional networks
• Recent advancements
3. Who Am I?
• Studying cheminformatics, i.e., ML applications to drug discovery
• My interests: NLP, CV, GANs, RL, etc.
• SNS
@shion_honda
@shionhonda
4. Why Graphs?
• Many kinds of data can be represented as graphs
• Web
• Traffic network
• Social network
• Citation network
• Neuronal network
• Molecules
[Figures: Zachary’s karate club (social network); salicylic acid (molecule)]
6. Graphs Are Difficult
• Unaligned structure
• Graphs are not aligned like images/texts
• Different structures and different tasks
• Directed vs undirected
• Weighted vs unweighted etc.
• Link prediction vs graph embedding
• Scalability
• Some graphs (e.g., web, SNS) are huge
• Domain knowledge
• In the case of molecules, node degrees are at most around 5
• In the case of SNS, links are unevenly distributed
7. Preliminaries and Definitions
Symbol Meaning
V, E: Set of nodes, edges
N, M: Number of nodes, edges
G = (V, E): Graph
F^V, F^E: Feature vectors of nodes, edges
A: Adjacency matrix
D: Degree matrix
L = D - A: Laplacian matrix
H: Hidden vectors
L̃ = D^(-1/2) L D^(-1/2): Normalized Laplacian matrix
Θ: Learnable parameters
N_k(i): k-hop neighbors of node i
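To make the notation concrete, here is a minimal NumPy sketch that builds these matrices for a small, hypothetical example graph (the graph and variable names are only illustrative):

```python
import numpy as np

# A toy undirected graph with N = 4 nodes: a path 0-1-2-3 plus the edge 1-3.
edges = [(0, 1), (1, 2), (2, 3), (1, 3)]

N = 4
A = np.zeros((N, N))                       # adjacency matrix
for i, j in edges:
    A[i, j] = A[j, i] = 1.0

D = np.diag(A.sum(axis=1))                 # degree matrix
L = D - A                                  # (unnormalized) Laplacian
D_inv_sqrt = np.diag(1.0 / np.sqrt(np.diag(D)))
L_norm = D_inv_sqrt @ L @ D_inv_sqrt       # normalized Laplacian L~ = D^(-1/2) L D^(-1/2)

print(L_norm)
```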
8. Graph Fourier Transform
• Eigendecomposition of the normalized Laplacian
• U is a unitary (orthogonal) matrix, since the normalized Laplacian is a real symmetric (Hermitian) matrix
• The graph Fourier transform of a signal on a graph is defined by multiplying with U^T
• The inverse operation multiplies with U (see the sketch below)
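For reference, a sketch of the standard definitions these bullets refer to, assuming the normalized Laplacian L̃ from the previous slide and a graph signal x ∈ R^N:

```latex
\tilde{L} = U \Lambda U^{\top}
  \qquad \text{(eigendecomposition; } U \text{ orthogonal, } \Lambda \text{ diagonal)}

\hat{x} = U^{\top} x
  \qquad \text{(graph Fourier transform)}

x = U \hat{x}
  \qquad \text{(inverse graph Fourier transform)}
```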
9. Graph Convolution
• Convolution theorem: convolution in the node domain equals element-wise multiplication in the Fourier domain
• Convolution with a filter g, where the Fourier-domain coefficients are collected in a diagonal matrix g_θ = diag(θ)
• Graph convolution is then defined as a product of matrices (see the sketch below)
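Written out (a sketch of the usual spectral-convolution identity; g_θ = diag(θ) collects the Fourier-domain filter coefficients):

```latex
x \ast_G g
  = U \big( (U^{\top} g) \odot (U^{\top} x) \big)
  = U \, g_{\theta} \, U^{\top} x,
  \qquad g_{\theta} = \mathrm{diag}(\theta)
```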
10. Naïve GCN
• The diagonal matrix Θ of spectral coefficients acts as a filter
->Analogy of CNNs for images
• An arbitrary network can be constructed by repeating this operation (see the sketch after this list)
• Problems
• Eigendecomposition has at least O(N^2) time complexity
• Parameters cannot be shared across graphs of different sizes
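A sketch of the resulting layer in its simplest single-channel form (assuming H^(l) are the hidden vectors and Θ^(l) is a learnable diagonal spectral filter, per the notation on slide 7):

```latex
H^{(l+1)} = \sigma\!\left( U \, \Theta^{(l)} \, U^{\top} H^{(l)} \right)
```

Since Θ^(l) holds one parameter per eigenvalue, its size is tied to N, which is why parameters cannot be shared across graphs of different sizes.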
11. ChebNet
• Eigendecomposition can be avoided by using Chebyshev polynomials
• The learnable parameters are the coefficients θ_0, ..., θ_K
• Then graph convolution is defined as follows (see the sketch below):
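In equations (a sketch of the ChebNet parameterization; T_k are the Chebyshev polynomials, λ_max is the largest eigenvalue of L̃, and θ_0, ..., θ_K are the learnable coefficients):

```latex
g_{\theta}(\Lambda) \approx \sum_{k=0}^{K} \theta_k \, T_k(\tilde{\Lambda}),
  \qquad \tilde{\Lambda} = \frac{2}{\lambda_{\max}} \Lambda - I

x \ast_G g_{\theta} \approx \sum_{k=0}^{K} \theta_k \,
  T_k\!\left( \frac{2}{\lambda_{\max}} \tilde{L} - I \right) x
```

Because T_k(·)x can be computed by repeated sparse matrix-vector products, no eigendecomposition is needed, and the filter is localized to K-hop neighborhoods.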
12. GCN (Kipf & Welling)
• ChebNet with K = 1 and λ_max ≈ 2
• By assuming θ = θ_0 = -θ_1, graph convolution becomes θ (I + D^(-1/2) A D^(-1/2)) x
• By the renormalization trick Ã = A + I (with D̃_ii = Σ_j Ã_ij) and stacking over channels:
H = σ( D̃^(-1/2) Ã D̃^(-1/2) X Θ )
where Θ is the matrix of learnable filter parameters
-> Θ is independent of N, so parameters can be shared across graphs of different sizes
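A minimal NumPy sketch of this propagation rule (the function name gcn_layer is hypothetical; a practical implementation would use sparse matrices and an autodiff framework, so this is only to illustrate the formula):

```python
import numpy as np

def gcn_layer(A, H, W, activation=np.tanh):
    """One graph-convolution layer with the renormalization trick:
    H' = activation( D~^(-1/2) A~ D~^(-1/2) H W ), where A~ = A + I."""
    N = A.shape[0]
    A_tilde = A + np.eye(N)                      # add self-loops
    d_tilde = A_tilde.sum(axis=1)                # degrees of A~
    D_inv_sqrt = np.diag(1.0 / np.sqrt(d_tilde))
    A_hat = D_inv_sqrt @ A_tilde @ D_inv_sqrt    # normalized adjacency
    return activation(A_hat @ H @ W)

# Toy usage: 4 nodes, 3 input features, 2 output features.
A = np.array([[0, 1, 0, 0],
              [1, 0, 1, 1],
              [0, 1, 0, 1],
              [0, 1, 1, 0]], dtype=float)
X = np.random.randn(4, 3)     # node feature matrix
W = np.random.randn(3, 2)     # learnable weights (Theta), independent of N
H1 = gcn_layer(A, X, W)
print(H1.shape)               # (4, 2)
```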
14. Recent Advances
• Autoencoder/VAE
• Decode in a non-parametric manner
• GAN + reinforcement learning
• Generate drug candidates
• e.g., MolGAN, GCPN
• Dynamic graphs
• In most real problems, graphs are dynamic
• e.g., web, electricity, SNS
• GCN+LSTM
15. Summary
• Graph convolution can be defined with graph
Fourier transform
• A Chebyshev polynomial approximation is used to avoid the computationally expensive eigendecomposition
• Several other GCNs have been proposed
• Next direction: Generative models and
dynamic graphs
16. References
[1] Z. Zhang et al., Deep Learning on Graphs: A Survey, arXiv, 2018.
[2] J. Zhou et al., Graph Neural Networks: A Review of Methods and Applications, arXiv, 2018.
[3] Z. Wu et al., A Comprehensive Survey on Graph Neural Networks, arXiv, 2019.
[4] D. I. Shuman et al., The Emerging Field of Signal Processing on Graphs: Extending High-Dimensional Data Analysis to Networks and Other Irregular Domains, IEEE Signal Processing Magazine, 2013.
[5] "Graph Laplacian" (グラフラプラシアン), 初級Mathマニアの寝言 (blog, in Japanese).
Editor's Notes
But first of all, let me introduce myself briefly.
I’m a master’s student, studying cheminformatics, i.e., ML applications to drug discovery.
My interests span ML broadly: NLP, CV, GANs, RL, etc.
Let’s move on to the topic.
Why are graph neural networks important? Why do they attract many people?
Because many kinds of data can be represented as graphs, and they have not benefited from deep learning as much as NLP and CV have.
One of the applications is chemistry: molecular property prediction, molecule generation, and so on.
And I’m working on these topics.
I’ll now briefly tell you about the background of molecular property prediction.
When predicting molecular properties with a model, vector representations, or embeddings, are useful.
These feature vectors are called fingerprints in chemistry.
Fingerprints have been hand-crafted for a long time. For example, MACCS keys are binary vectors that represent the presence of pre-defined substructures.
However, hand-crafted fingerprints are sparse and do not perform well as inputs to ML models.
Recent research tries to learn better fingerprints by applying seq2seq models to the string representation called SMILES.
Or by simply applying GNNs. Today I focus on the GNN approaches.
Generally speaking, graphs are more complicated than texts and images.
This is firstly due to their unaligned structure: there is no order among nodes, so even judging whether two given graphs are the same is difficult.
Second, there are several types of graphs, such as directed vs. undirected…
Scalability and domain knowledge are also issues.
Before moving on to graph convolution, let me give you definitions.
To follow the rest of the talk, you need to remember the adjacency matrix, the degree matrix, the Laplacian matrix, and the normalized Laplacian matrix.
Graph convolution is defined through the graph Fourier transform.
Consider the eigendecomposition of the normalized Laplacian.
Here, U is a unitary matrix because the Laplacian is a real symmetric matrix.
Now, the graph Fourier transform of a signal x on the graph (blue pillars) is defined by multiplying with the transpose of U.
The inverse operation is simple.
The convolution theorem holds just as it does for convolution on images and time-series data: convolution in the spatial domain is equal to the element-wise product in the Fourier domain.
Let theta be a diagonal matrix like this; then convolution with a filter can be written as follows.
Then, Graph convolution is defined as a product of matrices
There is a beautiful theory behind the formula, so if you are interested, please check it out afterwards.
Now, a “naïve” graph convolutional network can be designed by directly imitating CNNs for images.
You can compose an arbitrary network by repeating this operation.
However, there are two major problems.
First, eigendecomposition has at least O(N^2) time complexity.
Second, parameters cannot be shared across graphs of different sizes, because theta depends on the size of U, i.e., the number of nodes.
But don’t worry: Chebyshev saves the day!
ChebNet avoids eigendecomposition by using degree-K Chebyshev polynomials.
They set the learnable parameters as in this formula.
Then graph convolution is defined without eigendecomposition.
Kipf and Welling further approximate it by considering the K=1 case.
By assuming the thetas are tied, graph convolution becomes:
And lastly, by applying the renormalization trick and stacking over channels, theta becomes independent of the graph size N.
Subsequent work proposed variants of GCNs.
Here are some examples.
I didn’t have enough time to step into recent topics such as autoencoder/VAE, GANs, RL, and dynamic graphs.
If you are interested, I can give you some relevant literature.