SlideShare a Scribd company logo
1 of 39
Download to read offline
A Deep Learning Computational
Chemistry “AI”
Making chemical predictions with minimal expert knowledge
GARRETT GOH (@garrettbgoh)
May 17, 2017 1
High Performance Computing Group,
Advanced Computing, Mathematics & Data Division,
Pacific Northwest National Laboratory
CHEMical InCEPTION
May 17, 2017 2
@garrettbgoh
Recent Trends in Deep Learning
May 17, 2017 3
@garrettbgoh
Recent Trends in Deep Learning
May 17, 2017 4
@garrettbgoh
What is Deep Learning?
Deep Learning = Multi-layer artificial neural network
May 17, 2017 5
@garrettbgoh
What is Deep Learning?
Deep Learning = Multi-layer artificial neural network
May 17, 2017 6
Input
(Features)
Output
(Prediction)
@garrettbgoh
What is Deep Learning?
Deep Learning = Multi-layer artificial neural network
May 17, 2017 7
Input
(Features)
Output
(Prediction)
Many Hidden Layers
@garrettbgoh
Why Deep Learning today?
What has changed from the past?
Substantial increase of data (particularly
from internet)
Improved algorithms for training deep
neural networks
GPU-accelerated deep learning at
reasonable cost
May 17, 2017 8Glorot, X.; Bordes, A.; Bengio, Y. Proc. of the 14th Int. Conf. on Artificial Intelligence and Statistics 2011
Srivastava, N.; Hinton, G.; Krizhevsky, A.; Sutskever, I.; Salakhutdinov, R. J. Mach. Learn Res. 2014, 15, 1929
@garrettbgoh
What makes Deep Learning better than
traditional/shallow Machine Learning?
Representation Learning  Automated Feature Engineering
May 17, 2017 9
http://www.nature.com/news/computer-science-the-learning-machines-1.14481
@garrettbgoh
A Case Study of Deep Learning Success
in Computer Vision
Human-level performance in image classification within 3 years
Manual feature engineering has been mostly replaced by deep neural
networks
May 17, 2017 10
Goh, G.B.; Hodas, N.O. Vishnu, A. J. Comp. Chem., 2017, 38, 1291
Deep Learning for Chemistry
May 17, 2017 11
@garrettbgoh
A Short History on
Feature Engineering in Chemistry
1880s: First concepts of “molecular structure” emerged
1940s: First modern molecular descriptors (i.e. engineered features of
molecules/chemicals) emerged
1960s: First modern QSAR/QSPR models developed (i.e. simple regression
models that predict a chemical’s activity or property)
1980s: Modern machine learning algorithms adopted (linear regression  SVMs
 RF)
2010s: First deep learning models using molecular descriptors for chemistry
developed
May 17, 2017 12
“Feature engineering in chemistry has been
going on for a while….”
@garrettbgoh
A Short History on
Feature Engineering in Chemistry
1880s: First concepts of “molecular structure” emerged
1940s: First modern molecular descriptors (i.e. engineered features of
molecules/chemicals) emerged
1960s: First modern QSAR/QSPR models developed (i.e. simple regression
models that predict a chemical’s activity or property)
1980s: Modern machine learning algorithms adopted (linear regression  SVMs
 RF)
2010s: First deep learning models using molecular descriptors for chemistry
developed
Today: First deep learning models using “raw image data” for chemistry
developed
May 17, 2017 13
“How much chemistry do you need to know to
predict chemistry?”
@garrettbgoh
Deep Learning for Computational
Chemistry
Deep Learning trained on molecular descriptors outperformed traditional
ML in the Merck Kaggle challenge in 2012 (activity prediction) and
Tox21 challenge (toxicity prediction) in 2014
May 17, 2017 14
Mayr, A.; Klambauer, G.; Unterthiner, T.; Hochreiter, S. Front. Env. Sci. 2016, 3, 1.
Ramsundar, B.; Kearnes, S.; Riley, P.; Webster, D.; Konerding, D.; Pande, V. 2015 https://arxiv.org/abs/1502.02072
Dahl, G. E.; Jaitly, N.; Salakhutdinov, R. 2014 https://arxiv.org/abs/1406.1231
@garrettbgoh
Deep Learning as a Machine Learning
Tool in Scientific (Chemistry) Research
May 17, 2017 15
Goh, G.B.; Siegel, C.; Vishnu, A.; Hodas, N.O.; Baker, N.A. 2017, in preparation
@garrettbgoh
Deep Learning as “Machine Intelligence”
in Scientific (Chemistry) Research
May 17, 2017 16
Goh, G.B.; Siegel, C.; Vishnu, A.; Hodas, N.O.; Baker, N.A. 2017, in preparation
aka…“Siri for chemists”
@garrettbgoh
Designing a Deep Learning Framework
with Minimal Chemistry Knowledge
May 17, 2017 17
Draw MoleculesHigh School Students
Goh, G.B.; Siegel, C.; Vishnu, A.; Hodas, N.O.; Baker, N.A. 2017, in preparation
@garrettbgoh
Deep Learning predicts Physiological,
Biochemical & Physical Properties
May 17, 2017 18
Physiological
e.g. Toxicity
Binary Classification
10,000 images
Biochemical
e.g. Activity
Binary Classification
40,000 images
Physical
e.g. Solvation
Regression
500 images
Deep Learning
@garrettbgoh
Experiments with Different
Deep Neural Network Architectures
AlexNet: Linear topology
ResNet: Linear topology with residual links
GoogleNet: Branched topology
May 17, 2017 19
Krizhevsky, A.; Sutskever, I.; Hinton, G. E. Advances in Neural Information Processing Systems 2012.
He, K.; Zhang, X.; Ren, S.; Sun, J. 2015 https://arxiv.org/abs/1512.03385
Szegedy, C.; et. al. 2014 https://arxiv.org/abs/1409.4842
@garrettbgoh
Experiments with Different
Deep Neural Network Architectures
In the regime of limited data, the are limits to the size (depth & breadth)
of deep neural networks
May 17, 2017 20
@garrettbgoh
Chemception Deep Neural Network
Based off Inception-ResNet v2 architectural template
May 17, 2017 21
Goh, G.B.; Siegel, C.; Vishnu, A.; Hodas, N.O.; Baker, N.A. 2017, in preparation
@garrettbgoh
Tweaking Chemception
(Depth & Width)
Chemception T3_F16 (~150,000 parameters, 45
layers), was empirically determined to be the optimal
neural network architecture
Tested depth from 21 to 69 layers
Tested width from 16 to 64 convolutional filters/layer
No. of parameters varied from ~70,000 to 2.4 million
Deep & skinny neural network seems to work best for
small datasets of chemical images
May 17, 2017 22
n=3
n=3
n=3
Goh, G.B.; Siegel, C.; Vishnu, A.; Hodas, N.O.; Baker, N.A. 2017, in preparation
@garrettbgoh
Benchmarking Chemception Performance
May 17, 2017 23
vs
aka engineered features
Goh, G.B.; Siegel, C.; Vishnu, A.; Hodas, N.O.; Baker, N.A. 2017, in preparation
Non-Chemception Data from Wu, Z., et. al. 2017, https://arxiv.org/abs/1703.00564
@garrettbgoh
Chemception + Raw Images
Activity Prediction Results
Slightly outperforms traditional ML using engineered features
Outperforms DL (MLP) using engineered features
May 17, 2017 24Goh, G.B.; Siegel, C.; Vishnu, A.; Hodas, N.O.; Baker, N.A. 2017, in preparation
Non-Chemception Data from Wu, Z., et. al. 2017, https://arxiv.org/abs/1703.00564
Uses engineered
features
Uses engineered
features
Ours Ours
@garrettbgoh
Chemception + Raw Images
Toxicity Prediction Results
Outperforms traditional ML using engineered features
Slightly underperforms DL (MLP) using engineered features
May 17, 2017 25Goh, G.B.; Siegel, C.; Vishnu, A.; Hodas, N.O.; Baker, N.A. 2017, in preparation
Non-Chemception Data from Wu, Z., et. al. 2017, https://arxiv.org/abs/1703.00564
Uses engineered
features
Uses engineered
features
Ours Ours
@garrettbgoh
Chemception + Raw Images
Solvation Prediction Results
Outperforms DL (MLP) using engineered features
Slightly underperforms physics-based models
May 17, 2017 26Goh, G.B.; Siegel, C.; Vishnu, A.; Hodas, N.O.; Baker, N.A. 2017, in preparation
Non-Chemception Data from Wu, Z., et. al. 2017, https://arxiv.org/abs/1703.00564
Uses engineered
features
Uses engineered
features
Ours
Ours
@garrettbgoh
Improving Chemception Performance
DATA: High quality labeled data is expensive and limited in technical
sciences
From Greyscale to Color Augmented Images:
Encoded domain-specific information into image channels
May 17, 2017 27
Chemistry property #1
Chemistry property #2
Chemistry property #3
@garrettbgoh
Designing a Deep Learning Framework
with Minimal Chemistry Knowledge
May 17, 2017 28
Draw MoleculesHigh School Students
Annotate Drawings with Basic
Chemistry Knowledge
@garrettbgoh
Chemception + Augmented Images
Activity Prediction Results
Outperforms traditional ML using engineered features
Outperforms DL (MLP) using engineered features
May 17, 2017 29Goh, G.B.; Siegel, C.; Vishnu, A.; Hodas, N.O.; Baker, N.A. 2017, in preparation
Non-Chemception Data from Wu, Z., et. al. 2017, https://arxiv.org/abs/1703.00564
Uses engineered
features
Uses engineered
features
@garrettbgoh
Chemception + Augmented Images
Toxicity Prediction Results
Outperforms traditional ML using engineered features
Outperforms DL (MLP) using engineered features
May 17, 2017 30Goh, G.B.; Siegel, C.; Vishnu, A.; Hodas, N.O.; Baker, N.A. 2017, in preparation
Non-Chemception Data from Wu, Z., et. al. 2017, https://arxiv.org/abs/1703.00564
Uses engineered
features
Uses engineered
features
@garrettbgoh
Chemception + Augmented Images
Solvation Prediction Results
Outperforms DL (MLP) using engineered features
Outperforms physics-based models!
May 17, 2017 31Goh, G.B.; Siegel, C.; Vishnu, A.; Hodas, N.O.; Baker, N.A. 2017, in preparation
Non-Chemception Data from Wu, Z., et. al. 2017, https://arxiv.org/abs/1703.00564
Uses engineered
features
Uses engineered
features
@garrettbgoh
Conclusion
Chemception: A deep neural network that predict chemical properties
just as well as expert-developed models, but with minimal chemical
knowledge
When trained with augmented images, Chemception outperforms both
ML & DL models that uses engineered features
A general (i.e. not domain specific) framework that represents a
“proof of concept” for using a deep learning machine intelligence
in research
May 17, 2017 32
@garrettbgoh
Conclusion
Q: How much chemistry do you need to know to predict
chemistry?
A: Not a lot…
May 17, 2017 33
@garrettbgoh
Conclusion
Q: How much chemistry <insert your interest here> do you need to
know to predict chemistry <insert your interest here>?
A: (Probably) Not a lot…
Caveat for using CNNs: As long as there is a systematic image
representation of your data from which the property to predict can be
inferred
May 17, 2017 34
@garrettbgoh
Conclusion
Q: How much chemistry <insert your interest here> do you need to
know to predict chemistry <insert your interest here>?
A: (Probably) Not a lot…
Caveat for using CNNs: As long as there is a systematic image
representation of your data from which the property to predict can be
inferred
May 17, 2017 35
Weather prediction? Traffic prediction?
@garrettbgoh
How do we deal with the “small labeled data” problem?
Will an “expert chemist” neural network do better? How do we train one?
Future Challenges
May 17, 2017 36
?
@garrettbgoh
How do we start using “machine intelligence” with human intelligence to
tackle previously “unexplainable/unsolvable” problems in science?
Future Challenges
May 17, 2017 37
“Creativity”
“Imagination”
“Stamina”
“Logical”
@garrettbgoh
Acknowledgements
Deep Learning for Computational Chemistry Team
Funding / Resources
May 17, 2017 38
Nathan HodasAbhinav Vishnu Nathan BakerCharles Siegel
Questions?
(@garrettbgoh)
May 17, 2017 39

More Related Content

Viewers also liked

Jacob Eisenstein, Assistant Professor, School of Interactive Computing, Georg...
Jacob Eisenstein, Assistant Professor, School of Interactive Computing, Georg...Jacob Eisenstein, Assistant Professor, School of Interactive Computing, Georg...
Jacob Eisenstein, Assistant Professor, School of Interactive Computing, Georg...MLconf
 
Jeremy Nixon, Machine Learning Engineer, Spark Technology Center at MLconf AT...
Jeremy Nixon, Machine Learning Engineer, Spark Technology Center at MLconf AT...Jeremy Nixon, Machine Learning Engineer, Spark Technology Center at MLconf AT...
Jeremy Nixon, Machine Learning Engineer, Spark Technology Center at MLconf AT...MLconf
 
Rahul Mehrotra, Product Manager, Maluuba at The AI Conference 2017
Rahul Mehrotra, Product Manager, Maluuba at The AI Conference 2017Rahul Mehrotra, Product Manager, Maluuba at The AI Conference 2017
Rahul Mehrotra, Product Manager, Maluuba at The AI Conference 2017MLconf
 
Will Murphy, VP of Business Development & Co-Founder, Talla at The AI Confere...
Will Murphy, VP of Business Development & Co-Founder, Talla at The AI Confere...Will Murphy, VP of Business Development & Co-Founder, Talla at The AI Confere...
Will Murphy, VP of Business Development & Co-Founder, Talla at The AI Confere...MLconf
 
Aran Khanna, Software Engineer, Amazon Web Services at MLconf ATL 2017
Aran Khanna, Software Engineer, Amazon Web Services at MLconf ATL 2017Aran Khanna, Software Engineer, Amazon Web Services at MLconf ATL 2017
Aran Khanna, Software Engineer, Amazon Web Services at MLconf ATL 2017MLconf
 
Qiaoling Liu, Lead Data Scientist, CareerBuilder at MLconf ATL 2017
Qiaoling Liu, Lead Data Scientist, CareerBuilder at MLconf ATL 2017Qiaoling Liu, Lead Data Scientist, CareerBuilder at MLconf ATL 2017
Qiaoling Liu, Lead Data Scientist, CareerBuilder at MLconf ATL 2017MLconf
 
Artemy Malkov, CEO, Data Monsters at The AI Conference 2017
Artemy Malkov, CEO, Data Monsters at The AI Conference 2017 Artemy Malkov, CEO, Data Monsters at The AI Conference 2017
Artemy Malkov, CEO, Data Monsters at The AI Conference 2017 MLconf
 
Ryan West, Machine Learning Engineer, Nexosis at MLconf ATL 2017
Ryan West, Machine Learning Engineer, Nexosis at MLconf ATL 2017Ryan West, Machine Learning Engineer, Nexosis at MLconf ATL 2017
Ryan West, Machine Learning Engineer, Nexosis at MLconf ATL 2017MLconf
 
Jennifer Marsman, Principal Software Development Engineer, Microsoft at MLcon...
Jennifer Marsman, Principal Software Development Engineer, Microsoft at MLcon...Jennifer Marsman, Principal Software Development Engineer, Microsoft at MLcon...
Jennifer Marsman, Principal Software Development Engineer, Microsoft at MLcon...MLconf
 
Venkatesh Ramanathan, Data Scientist, PayPal at MLconf ATL 2017
Venkatesh Ramanathan, Data Scientist, PayPal at MLconf ATL 2017Venkatesh Ramanathan, Data Scientist, PayPal at MLconf ATL 2017
Venkatesh Ramanathan, Data Scientist, PayPal at MLconf ATL 2017MLconf
 
Daniel Shank, Data Scientist, Talla at MLconf SF 2017
Daniel Shank, Data Scientist, Talla at MLconf SF 2017Daniel Shank, Data Scientist, Talla at MLconf SF 2017
Daniel Shank, Data Scientist, Talla at MLconf SF 2017MLconf
 
Jonas Schneider, Head of Engineering for Robotics, OpenAI
Jonas Schneider, Head of Engineering for Robotics, OpenAIJonas Schneider, Head of Engineering for Robotics, OpenAI
Jonas Schneider, Head of Engineering for Robotics, OpenAIMLconf
 
Byron Galbraith, Chief Data Scientist, Talla, at MLconf SEA 2017
Byron Galbraith, Chief Data Scientist, Talla, at MLconf SEA 2017 Byron Galbraith, Chief Data Scientist, Talla, at MLconf SEA 2017
Byron Galbraith, Chief Data Scientist, Talla, at MLconf SEA 2017 MLconf
 
Scott Clark, CEO, SigOpt, at MLconf Seattle 2017
Scott Clark, CEO, SigOpt, at MLconf Seattle 2017Scott Clark, CEO, SigOpt, at MLconf Seattle 2017
Scott Clark, CEO, SigOpt, at MLconf Seattle 2017MLconf
 
Rajat Monga, Engineering Director, TensorFlow, Google at MLconf 2016
Rajat Monga, Engineering Director, TensorFlow, Google at MLconf 2016Rajat Monga, Engineering Director, TensorFlow, Google at MLconf 2016
Rajat Monga, Engineering Director, TensorFlow, Google at MLconf 2016MLconf
 
Irina Rish, Researcher, IBM Watson, at MLconf NYC 2017
Irina Rish, Researcher, IBM Watson, at MLconf NYC 2017Irina Rish, Researcher, IBM Watson, at MLconf NYC 2017
Irina Rish, Researcher, IBM Watson, at MLconf NYC 2017MLconf
 
Josh Patterson, Advisor, Skymind – Deep learning for Industry at MLconf ATL 2016
Josh Patterson, Advisor, Skymind – Deep learning for Industry at MLconf ATL 2016Josh Patterson, Advisor, Skymind – Deep learning for Industry at MLconf ATL 2016
Josh Patterson, Advisor, Skymind – Deep learning for Industry at MLconf ATL 2016MLconf
 

Viewers also liked (18)

Jacob Eisenstein, Assistant Professor, School of Interactive Computing, Georg...
Jacob Eisenstein, Assistant Professor, School of Interactive Computing, Georg...Jacob Eisenstein, Assistant Professor, School of Interactive Computing, Georg...
Jacob Eisenstein, Assistant Professor, School of Interactive Computing, Georg...
 
Jeremy Nixon, Machine Learning Engineer, Spark Technology Center at MLconf AT...
Jeremy Nixon, Machine Learning Engineer, Spark Technology Center at MLconf AT...Jeremy Nixon, Machine Learning Engineer, Spark Technology Center at MLconf AT...
Jeremy Nixon, Machine Learning Engineer, Spark Technology Center at MLconf AT...
 
Rahul Mehrotra, Product Manager, Maluuba at The AI Conference 2017
Rahul Mehrotra, Product Manager, Maluuba at The AI Conference 2017Rahul Mehrotra, Product Manager, Maluuba at The AI Conference 2017
Rahul Mehrotra, Product Manager, Maluuba at The AI Conference 2017
 
Will Murphy, VP of Business Development & Co-Founder, Talla at The AI Confere...
Will Murphy, VP of Business Development & Co-Founder, Talla at The AI Confere...Will Murphy, VP of Business Development & Co-Founder, Talla at The AI Confere...
Will Murphy, VP of Business Development & Co-Founder, Talla at The AI Confere...
 
Aran Khanna, Software Engineer, Amazon Web Services at MLconf ATL 2017
Aran Khanna, Software Engineer, Amazon Web Services at MLconf ATL 2017Aran Khanna, Software Engineer, Amazon Web Services at MLconf ATL 2017
Aran Khanna, Software Engineer, Amazon Web Services at MLconf ATL 2017
 
Qiaoling Liu, Lead Data Scientist, CareerBuilder at MLconf ATL 2017
Qiaoling Liu, Lead Data Scientist, CareerBuilder at MLconf ATL 2017Qiaoling Liu, Lead Data Scientist, CareerBuilder at MLconf ATL 2017
Qiaoling Liu, Lead Data Scientist, CareerBuilder at MLconf ATL 2017
 
Artemy Malkov, CEO, Data Monsters at The AI Conference 2017
Artemy Malkov, CEO, Data Monsters at The AI Conference 2017 Artemy Malkov, CEO, Data Monsters at The AI Conference 2017
Artemy Malkov, CEO, Data Monsters at The AI Conference 2017
 
Ryan West, Machine Learning Engineer, Nexosis at MLconf ATL 2017
Ryan West, Machine Learning Engineer, Nexosis at MLconf ATL 2017Ryan West, Machine Learning Engineer, Nexosis at MLconf ATL 2017
Ryan West, Machine Learning Engineer, Nexosis at MLconf ATL 2017
 
Jennifer Marsman, Principal Software Development Engineer, Microsoft at MLcon...
Jennifer Marsman, Principal Software Development Engineer, Microsoft at MLcon...Jennifer Marsman, Principal Software Development Engineer, Microsoft at MLcon...
Jennifer Marsman, Principal Software Development Engineer, Microsoft at MLcon...
 
Venkatesh Ramanathan, Data Scientist, PayPal at MLconf ATL 2017
Venkatesh Ramanathan, Data Scientist, PayPal at MLconf ATL 2017Venkatesh Ramanathan, Data Scientist, PayPal at MLconf ATL 2017
Venkatesh Ramanathan, Data Scientist, PayPal at MLconf ATL 2017
 
Daniel Shank, Data Scientist, Talla at MLconf SF 2017
Daniel Shank, Data Scientist, Talla at MLconf SF 2017Daniel Shank, Data Scientist, Talla at MLconf SF 2017
Daniel Shank, Data Scientist, Talla at MLconf SF 2017
 
Jonas Schneider, Head of Engineering for Robotics, OpenAI
Jonas Schneider, Head of Engineering for Robotics, OpenAIJonas Schneider, Head of Engineering for Robotics, OpenAI
Jonas Schneider, Head of Engineering for Robotics, OpenAI
 
Byron Galbraith, Chief Data Scientist, Talla, at MLconf SEA 2017
Byron Galbraith, Chief Data Scientist, Talla, at MLconf SEA 2017 Byron Galbraith, Chief Data Scientist, Talla, at MLconf SEA 2017
Byron Galbraith, Chief Data Scientist, Talla, at MLconf SEA 2017
 
ML to cure the world
ML to cure the worldML to cure the world
ML to cure the world
 
Scott Clark, CEO, SigOpt, at MLconf Seattle 2017
Scott Clark, CEO, SigOpt, at MLconf Seattle 2017Scott Clark, CEO, SigOpt, at MLconf Seattle 2017
Scott Clark, CEO, SigOpt, at MLconf Seattle 2017
 
Rajat Monga, Engineering Director, TensorFlow, Google at MLconf 2016
Rajat Monga, Engineering Director, TensorFlow, Google at MLconf 2016Rajat Monga, Engineering Director, TensorFlow, Google at MLconf 2016
Rajat Monga, Engineering Director, TensorFlow, Google at MLconf 2016
 
Irina Rish, Researcher, IBM Watson, at MLconf NYC 2017
Irina Rish, Researcher, IBM Watson, at MLconf NYC 2017Irina Rish, Researcher, IBM Watson, at MLconf NYC 2017
Irina Rish, Researcher, IBM Watson, at MLconf NYC 2017
 
Josh Patterson, Advisor, Skymind – Deep learning for Industry at MLconf ATL 2016
Josh Patterson, Advisor, Skymind – Deep learning for Industry at MLconf ATL 2016Josh Patterson, Advisor, Skymind – Deep learning for Industry at MLconf ATL 2016
Josh Patterson, Advisor, Skymind – Deep learning for Industry at MLconf ATL 2016
 

Similar to Garrett Goh, Scientist, Pacific Northwest National Lab

Towards reproducibility and maximally-open data
Towards reproducibility and maximally-open dataTowards reproducibility and maximally-open data
Towards reproducibility and maximally-open dataPablo Bernabeu
 
Construction and Querying of Dynamic Knowledge Graphs
Construction and Querying of Dynamic Knowledge GraphsConstruction and Querying of Dynamic Knowledge Graphs
Construction and Querying of Dynamic Knowledge GraphsSutanay Choudhury
 
BioContainers on ELIXIR All Hands 2017
BioContainers on ELIXIR All Hands 2017BioContainers on ELIXIR All Hands 2017
BioContainers on ELIXIR All Hands 2017Yasset Perez-Riverol
 
Providing Research Graph data in JSON-LD using Schema.org
Providing Research Graph data in JSON-LD using Schema.orgProviding Research Graph data in JSON-LD using Schema.org
Providing Research Graph data in JSON-LD using Schema.orgJingbo Wang
 
A proteomics data “gold mine” at your disposal: Now that the data is there, w...
A proteomics data “gold mine” at your disposal: Now that the data is there, w...A proteomics data “gold mine” at your disposal: Now that the data is there, w...
A proteomics data “gold mine” at your disposal: Now that the data is there, w...Juan Antonio Vizcaino
 
Discovering and Exploring New Materials through the Materials Project
Discovering and Exploring New Materials through the Materials ProjectDiscovering and Exploring New Materials through the Materials Project
Discovering and Exploring New Materials through the Materials ProjectAnubhav Jain
 
Cs231n 2017 lecture13 Generative Model
Cs231n 2017 lecture13 Generative ModelCs231n 2017 lecture13 Generative Model
Cs231n 2017 lecture13 Generative ModelYanbin Kong
 

Similar to Garrett Goh, Scientist, Pacific Northwest National Lab (8)

Using Knowledge Graph for Promoting Cognitive Computing
Using Knowledge Graph for Promoting Cognitive ComputingUsing Knowledge Graph for Promoting Cognitive Computing
Using Knowledge Graph for Promoting Cognitive Computing
 
Towards reproducibility and maximally-open data
Towards reproducibility and maximally-open dataTowards reproducibility and maximally-open data
Towards reproducibility and maximally-open data
 
Construction and Querying of Dynamic Knowledge Graphs
Construction and Querying of Dynamic Knowledge GraphsConstruction and Querying of Dynamic Knowledge Graphs
Construction and Querying of Dynamic Knowledge Graphs
 
BioContainers on ELIXIR All Hands 2017
BioContainers on ELIXIR All Hands 2017BioContainers on ELIXIR All Hands 2017
BioContainers on ELIXIR All Hands 2017
 
Providing Research Graph data in JSON-LD using Schema.org
Providing Research Graph data in JSON-LD using Schema.orgProviding Research Graph data in JSON-LD using Schema.org
Providing Research Graph data in JSON-LD using Schema.org
 
A proteomics data “gold mine” at your disposal: Now that the data is there, w...
A proteomics data “gold mine” at your disposal: Now that the data is there, w...A proteomics data “gold mine” at your disposal: Now that the data is there, w...
A proteomics data “gold mine” at your disposal: Now that the data is there, w...
 
Discovering and Exploring New Materials through the Materials Project
Discovering and Exploring New Materials through the Materials ProjectDiscovering and Exploring New Materials through the Materials Project
Discovering and Exploring New Materials through the Materials Project
 
Cs231n 2017 lecture13 Generative Model
Cs231n 2017 lecture13 Generative ModelCs231n 2017 lecture13 Generative Model
Cs231n 2017 lecture13 Generative Model
 

More from MLconf

Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...MLconf
 
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingTed Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingMLconf
 
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...MLconf
 
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushIgor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushMLconf
 
Josh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceJosh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceMLconf
 
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...MLconf
 
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...MLconf
 
Meghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMeghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMLconf
 
Noam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionNoam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionMLconf
 
June Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLJune Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLMLconf
 
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksSneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksMLconf
 
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...MLconf
 
Vito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldVito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldMLconf
 
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...MLconf
 
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...MLconf
 
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...MLconf
 
Neel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeNeel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeMLconf
 
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...MLconf
 
Soumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareSoumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareMLconf
 
Roy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesRoy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesMLconf
 

More from MLconf (20)

Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
Jamila Smith-Loud - Understanding Human Impact: Social and Equity Assessments...
 
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language UnderstandingTed Willke - The Brain’s Guide to Dealing with Context in Language Understanding
Ted Willke - The Brain’s Guide to Dealing with Context in Language Understanding
 
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
Justin Armstrong - Applying Computer Vision to Reduce Contamination in the Re...
 
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold RushIgor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
Igor Markov - Quantum Computing: a Treasure Hunt, not a Gold Rush
 
Josh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceJosh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious Experience
 
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
 
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
 
Meghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMeghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the Cheap
 
Noam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionNoam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data Collection
 
June Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLJune Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of ML
 
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksSneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
 
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
 
Vito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldVito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI World
 
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
 
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
 
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
 
Neel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeNeel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to code
 
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
 
Soumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareSoumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better Software
 
Roy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesRoy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime Changes
 

Recently uploaded

Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 

Recently uploaded (20)

Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 

Garrett Goh, Scientist, Pacific Northwest National Lab

  • 1. A Deep Learning Computational Chemistry “AI” Making chemical predictions with minimal expert knowledge GARRETT GOH (@garrettbgoh) May 17, 2017 1 High Performance Computing Group, Advanced Computing, Mathematics & Data Division, Pacific Northwest National Laboratory
  • 3. @garrettbgoh Recent Trends in Deep Learning May 17, 2017 3
  • 4. @garrettbgoh Recent Trends in Deep Learning May 17, 2017 4
  • 5. @garrettbgoh What is Deep Learning? Deep Learning = Multi-layer artificial neural network May 17, 2017 5
  • 6. @garrettbgoh What is Deep Learning? Deep Learning = Multi-layer artificial neural network May 17, 2017 6 Input (Features) Output (Prediction)
  • 7. @garrettbgoh What is Deep Learning? Deep Learning = Multi-layer artificial neural network May 17, 2017 7 Input (Features) Output (Prediction) Many Hidden Layers
  • 8. @garrettbgoh Why Deep Learning today? What has changed from the past? Substantial increase of data (particularly from internet) Improved algorithms for training deep neural networks GPU-accelerated deep learning at reasonable cost May 17, 2017 8Glorot, X.; Bordes, A.; Bengio, Y. Proc. of the 14th Int. Conf. on Artificial Intelligence and Statistics 2011 Srivastava, N.; Hinton, G.; Krizhevsky, A.; Sutskever, I.; Salakhutdinov, R. J. Mach. Learn Res. 2014, 15, 1929
  • 9. @garrettbgoh What makes Deep Learning better than traditional/shallow Machine Learning? Representation Learning  Automated Feature Engineering May 17, 2017 9 http://www.nature.com/news/computer-science-the-learning-machines-1.14481
  • 10. @garrettbgoh A Case Study of Deep Learning Success in Computer Vision Human-level performance in image classification within 3 years Manual feature engineering has been mostly replaced by deep neural networks May 17, 2017 10 Goh, G.B.; Hodas, N.O. Vishnu, A. J. Comp. Chem., 2017, 38, 1291
  • 11. Deep Learning for Chemistry May 17, 2017 11
  • 12. @garrettbgoh A Short History on Feature Engineering in Chemistry 1880s: First concepts of “molecular structure” emerged 1940s: First modern molecular descriptors (i.e. engineered features of molecules/chemicals) emerged 1960s: First modern QSAR/QSPR models developed (i.e. simple regression models that predict a chemical’s activity or property) 1980s: Modern machine learning algorithms adopted (linear regression  SVMs  RF) 2010s: First deep learning models using molecular descriptors for chemistry developed May 17, 2017 12 “Feature engineering in chemistry has been going on for a while….”
  • 13. @garrettbgoh A Short History on Feature Engineering in Chemistry 1880s: First concepts of “molecular structure” emerged 1940s: First modern molecular descriptors (i.e. engineered features of molecules/chemicals) emerged 1960s: First modern QSAR/QSPR models developed (i.e. simple regression models that predict a chemical’s activity or property) 1980s: Modern machine learning algorithms adopted (linear regression  SVMs  RF) 2010s: First deep learning models using molecular descriptors for chemistry developed Today: First deep learning models using “raw image data” for chemistry developed May 17, 2017 13 “How much chemistry do you need to know to predict chemistry?”
  • 14. @garrettbgoh Deep Learning for Computational Chemistry Deep Learning trained on molecular descriptors outperformed traditional ML in the Merck Kaggle challenge in 2012 (activity prediction) and Tox21 challenge (toxicity prediction) in 2014 May 17, 2017 14 Mayr, A.; Klambauer, G.; Unterthiner, T.; Hochreiter, S. Front. Env. Sci. 2016, 3, 1. Ramsundar, B.; Kearnes, S.; Riley, P.; Webster, D.; Konerding, D.; Pande, V. 2015 https://arxiv.org/abs/1502.02072 Dahl, G. E.; Jaitly, N.; Salakhutdinov, R. 2014 https://arxiv.org/abs/1406.1231
  • 15. @garrettbgoh Deep Learning as a Machine Learning Tool in Scientific (Chemistry) Research May 17, 2017 15 Goh, G.B.; Siegel, C.; Vishnu, A.; Hodas, N.O.; Baker, N.A. 2017, in preparation
  • 16. @garrettbgoh Deep Learning as “Machine Intelligence” in Scientific (Chemistry) Research May 17, 2017 16 Goh, G.B.; Siegel, C.; Vishnu, A.; Hodas, N.O.; Baker, N.A. 2017, in preparation aka…“Siri for chemists”
  • 17. @garrettbgoh Designing a Deep Learning Framework with Minimal Chemistry Knowledge May 17, 2017 17 Draw MoleculesHigh School Students Goh, G.B.; Siegel, C.; Vishnu, A.; Hodas, N.O.; Baker, N.A. 2017, in preparation
  • 18. @garrettbgoh Deep Learning predicts Physiological, Biochemical & Physical Properties May 17, 2017 18 Physiological e.g. Toxicity Binary Classification 10,000 images Biochemical e.g. Activity Binary Classification 40,000 images Physical e.g. Solvation Regression 500 images Deep Learning
  • 19. @garrettbgoh Experiments with Different Deep Neural Network Architectures AlexNet: Linear topology ResNet: Linear topology with residual links GoogleNet: Branched topology May 17, 2017 19 Krizhevsky, A.; Sutskever, I.; Hinton, G. E. Advances in Neural Information Processing Systems 2012. He, K.; Zhang, X.; Ren, S.; Sun, J. 2015 https://arxiv.org/abs/1512.03385 Szegedy, C.; et. al. 2014 https://arxiv.org/abs/1409.4842
  • 20. @garrettbgoh Experiments with Different Deep Neural Network Architectures In the regime of limited data, the are limits to the size (depth & breadth) of deep neural networks May 17, 2017 20
  • 21. @garrettbgoh Chemception Deep Neural Network Based off Inception-ResNet v2 architectural template May 17, 2017 21 Goh, G.B.; Siegel, C.; Vishnu, A.; Hodas, N.O.; Baker, N.A. 2017, in preparation
  • 22. @garrettbgoh Tweaking Chemception (Depth & Width) Chemception T3_F16 (~150,000 parameters, 45 layers), was empirically determined to be the optimal neural network architecture Tested depth from 21 to 69 layers Tested width from 16 to 64 convolutional filters/layer No. of parameters varied from ~70,000 to 2.4 million Deep & skinny neural network seems to work best for small datasets of chemical images May 17, 2017 22 n=3 n=3 n=3 Goh, G.B.; Siegel, C.; Vishnu, A.; Hodas, N.O.; Baker, N.A. 2017, in preparation
  • 23. @garrettbgoh Benchmarking Chemception Performance May 17, 2017 23 vs aka engineered features Goh, G.B.; Siegel, C.; Vishnu, A.; Hodas, N.O.; Baker, N.A. 2017, in preparation Non-Chemception Data from Wu, Z., et. al. 2017, https://arxiv.org/abs/1703.00564
  • 24. @garrettbgoh Chemception + Raw Images Activity Prediction Results Slightly outperforms traditional ML using engineered features Outperforms DL (MLP) using engineered features May 17, 2017 24Goh, G.B.; Siegel, C.; Vishnu, A.; Hodas, N.O.; Baker, N.A. 2017, in preparation Non-Chemception Data from Wu, Z., et. al. 2017, https://arxiv.org/abs/1703.00564 Uses engineered features Uses engineered features Ours Ours
  • 25. @garrettbgoh Chemception + Raw Images Toxicity Prediction Results Outperforms traditional ML using engineered features Slightly underperforms DL (MLP) using engineered features May 17, 2017 25Goh, G.B.; Siegel, C.; Vishnu, A.; Hodas, N.O.; Baker, N.A. 2017, in preparation Non-Chemception Data from Wu, Z., et. al. 2017, https://arxiv.org/abs/1703.00564 Uses engineered features Uses engineered features Ours Ours
  • 26. @garrettbgoh Chemception + Raw Images Solvation Prediction Results Outperforms DL (MLP) using engineered features Slightly underperforms physics-based models May 17, 2017 26Goh, G.B.; Siegel, C.; Vishnu, A.; Hodas, N.O.; Baker, N.A. 2017, in preparation Non-Chemception Data from Wu, Z., et. al. 2017, https://arxiv.org/abs/1703.00564 Uses engineered features Uses engineered features Ours Ours
  • 27. @garrettbgoh Improving Chemception Performance DATA: High quality labeled data is expensive and limited in technical sciences From Greyscale to Color Augmented Images: Encoded domain-specific information into image channels May 17, 2017 27 Chemistry property #1 Chemistry property #2 Chemistry property #3
  • 28. @garrettbgoh Designing a Deep Learning Framework with Minimal Chemistry Knowledge May 17, 2017 28 Draw MoleculesHigh School Students Annotate Drawings with Basic Chemistry Knowledge
  • 29. @garrettbgoh Chemception + Augmented Images Activity Prediction Results Outperforms traditional ML using engineered features Outperforms DL (MLP) using engineered features May 17, 2017 29Goh, G.B.; Siegel, C.; Vishnu, A.; Hodas, N.O.; Baker, N.A. 2017, in preparation Non-Chemception Data from Wu, Z., et. al. 2017, https://arxiv.org/abs/1703.00564 Uses engineered features Uses engineered features
  • 30. @garrettbgoh Chemception + Augmented Images Toxicity Prediction Results Outperforms traditional ML using engineered features Outperforms DL (MLP) using engineered features May 17, 2017 30Goh, G.B.; Siegel, C.; Vishnu, A.; Hodas, N.O.; Baker, N.A. 2017, in preparation Non-Chemception Data from Wu, Z., et. al. 2017, https://arxiv.org/abs/1703.00564 Uses engineered features Uses engineered features
  • 31. @garrettbgoh Chemception + Augmented Images Solvation Prediction Results Outperforms DL (MLP) using engineered features Outperforms physics-based models! May 17, 2017 31Goh, G.B.; Siegel, C.; Vishnu, A.; Hodas, N.O.; Baker, N.A. 2017, in preparation Non-Chemception Data from Wu, Z., et. al. 2017, https://arxiv.org/abs/1703.00564 Uses engineered features Uses engineered features
  • 32. @garrettbgoh Conclusion Chemception: A deep neural network that predict chemical properties just as well as expert-developed models, but with minimal chemical knowledge When trained with augmented images, Chemception outperforms both ML & DL models that uses engineered features A general (i.e. not domain specific) framework that represents a “proof of concept” for using a deep learning machine intelligence in research May 17, 2017 32
  • 33. @garrettbgoh Conclusion Q: How much chemistry do you need to know to predict chemistry? A: Not a lot… May 17, 2017 33
  • 34. @garrettbgoh Conclusion Q: How much chemistry <insert your interest here> do you need to know to predict chemistry <insert your interest here>? A: (Probably) Not a lot… Caveat for using CNNs: As long as there is a systematic image representation of your data from which the property to predict can be inferred May 17, 2017 34
  • 35. @garrettbgoh Conclusion Q: How much chemistry <insert your interest here> do you need to know to predict chemistry <insert your interest here>? A: (Probably) Not a lot… Caveat for using CNNs: As long as there is a systematic image representation of your data from which the property to predict can be inferred May 17, 2017 35 Weather prediction? Traffic prediction?
  • 36. @garrettbgoh How do we deal with the “small labeled data” problem? Will an “expert chemist” neural network do better? How do we train one? Future Challenges May 17, 2017 36 ?
  • 37. @garrettbgoh How do we start using “machine intelligence” with human intelligence to tackle previously “unexplainable/unsolvable” problems in science? Future Challenges May 17, 2017 37 “Creativity” “Imagination” “Stamina” “Logical”
  • 38. @garrettbgoh Acknowledgements Deep Learning for Computational Chemistry Team Funding / Resources May 17, 2017 38 Nathan HodasAbhinav Vishnu Nathan BakerCharles Siegel