SlideShare a Scribd company logo
1 of 41
Download to read offline
New Target Prediction and Visualization Tools
Incorporating Open Source Molecular Fingerprints
For TB Mobile Version 2.0
Alex M. Clark1 Malabika Sarker2 and Sean Ekins3, 4
1Molecular Materials Informatics, 1900 St. Jacques #302, Montreal Quebec, Canada H3J 2S1
2SRI International, 333 Ravenswood Avenue, Menlo Park, CA 94025, USA.
3Collaborative Drug Discovery, 1633 Bayshore Highway, Suite 342, Burlingame, CA 94010, USA.
4Collaborations in Chemistry, 5616 Hilltop Needmore Road, Fuquay-Varina, NC 27526, USA.
.
Tuberculosis kills 1.6-1.7m/yr (~1 every 8 seconds)
1/3rd of worlds population infected!!!!
streptomycin (1943)
para-aminosalicyclic acid (1949)
isoniazid (1952)
pyrazinamide (1954)
cycloserine (1955)
ethambutol (1962)
rifampicin (1967)
• Multi drug resistance in 4.3% of
cases
• Extensively drug resistant
increasing incidence
• One new drug (bedaquiline) in 40
yrs
TB key points
Tested >350,000 molecules Tested ~2M 2M >300,000
>1500 active and non toxic Published 177 100s 800
Big Data: Screening for New Tuberculosis Treatments
TBDA screened over 1 million, 1 million
more to go
TB Alliance + Japanese pharma screens
How many will become a new drug?
How do we learn from this big data?
What are the targets for these molecules?
Over 6 years analyzed in vitro data and built models
Top scoring molecules
assayed for
Mtb growth inhibition
Mtb screening
molecule
database/s
High-throughput
phenotypic
Mtb screening
Descriptors + Bioactivity (+Cytotoxicity)
Bayesian Machine Learning classification Mtb Model
Molecule Database
(e.g. GSK malaria
actives)
virtually scored
using Bayesian Models
New bioactivity data
may enhance models
Identify in vitro hits and test models3 x published prospective tests ~750
molecules were tested in vitro
198 actives were identified
>20 % hit rate
Multiple retrospective tests 3-10 fold
enrichment
N
H
S
N
Ekins et al., Pharm Res 31: 414-435, 2014
Ekins, et al., Tuberculosis 94; 162-169, 2014
Ekins, et al., PLOSONE 8; e63240, 2013
Ekins, et al., Chem Biol 20: 370-378, 2013
Ekins, et al., JCIM, 53: 3054−3063, 2013
Ekins and Freundlich, Pharm Res, 28, 1859-1869, 2011
Ekins et al., Mol BioSyst, 6: 840-851, 2010
Ekins, et al., Mol. Biosyst. 6, 2316-2324, 2010,
Pathway analysis
Binding site similarity to Mtb proteins
Docking
Bayesian Models - ligand similarity
Predicting the target/s for small molecules
Multi-step process
1. Identification of essential in vivo enzymes of Mtb involved intensive
literature mining and manual curation, to extract all the genes essential
for Mtb growth in vivo across species.
2. Homolog information was collated from other studies.
3. Collection of metabolic pathway information involved using TBDB.
4. Identifying molecules and drugs with known or predicted targets
involved searching the CDD databases for manually curated data. The
structures and data were exported for combination with the other data.
5. All data were combined with URL links to literature and TBDB and
deposited in the CDD database.
Initially over 700 molecules in dataset
Dataset Curation: TB molecules and target information
database connects molecule, gene, pathway and literature
Sarker et al., Pharm Res 2012, 29, 2115-2127.
TB molecules and target information database connects
molecule, gene, pathway and literature
• mobile is revolutionary: a clean break
 entirely new user interface
 no backward compatibility
 highly constrained resources
 applicable to entirely new situations
mainframes
minicomputers
personal computers
portable laptops
mobile tablets
smartphones
?
The future of computing for science is mobile
 Create an app that would be useful to TB
researchers
 An active compound from phenotypic screens–
but what is target
 Predict TB targets by similarity, mine published
data
 Push the boundaries of what can be done in a
mobile app
Cheminformatics and Bioinformatics in an App
http://goo.gl/vPOKS
http://goo.gl/iDJFR
TB Mobile 2– Is on iTunes and TB Mobile 1 is on Google
play and are FREE
TB Mobile 2. layout on iPhone
About CDD
Molecule search
Filters
Action Menu Molecule prediction Clustering About TB Mobile
Control block
Compound list
Text search
TB Mobile 2. iPhone vs TB Mobile 1. Android
Molecule Detail and Links
iPhone Android
Bookmark
copy
open-in
cluster
close
TB Mobile 2. iPhone vs TB Mobile 1.
Android Similarity Searching in the app
iPhone Android
TB Mobile 2. – Filtering and Sharing
Functions
Data can also be filtered by target name, pathway name,
essentiality and human ortholog
TB Mobile 2. iPhone Filtering and Sharing Functions
Each molecule can be copied to the clipboard then opened
with other apps (e.g. MMDS, MolPrime, MolSync,
ChemSpider, and from these exported via Twitter or email)
or shared via Dropbox.
PCA of 745 compounds with Mtb
targets (blue) and 60 new
compounds (yellow)
Chemical property space of screening hits and
molecules evaluated in TB Mobile 2.
PCA of 745 compounds with Mtb
targets (blue) and 20 new test
compounds (yellow)
Gene Count Gene Count
Rv0283 1 Rv0678 1
Rv1211 1 Rv1685c 1
Rv1885c 4 Rv3160c 1
Rv3161c 1 TB27.3 (Rv0577) 2
ald (Rv2780) 2 alr (Rv3423c) 8
aroD (Rv2537c) 14 aspS (Rv2572c) 1
atpE (Rv1305) 2 blaC (Rv2068c) 1
clpB (Rv0384c) 1 clpC (Rv3596c) 1
cyp121 (Rv2276) 2 cyp130 (Rv1256c) 2
cyp51 (Rv0764c) 2 cysH (Rv2392) 10
cysS (Rv2130c) 1 dacB2 (Rv2911) 1
dapA (Rv2753c) 12 deaD (Rv1253) 1
def (Rv0429c) 14 dfrA (Rv2763c) 3
dinG (Rv1329c)" (count=1) 1 dlaT (Rv2215) 2
dnaA (Rv0001)" (count=1) 1 dnaB (Rv0058) 1
dnaE2 (Rv3370c)" (count=1) 1 dprE1 (Rv3790) 8
dprE2" (count=1) 1 drpE2 (Rv3791) 2
dxr (Rv2870c) 1 dxs1 (Rv2682C) 29
embA (Rv3794) 2 embB (Rv3795) 1
embC (Rv3793) 1 engA (Rv1713) 1
era (Rv2364c) 1 ethA (Rv3854c) 1
fabG (Rv0242c) 2 fabH (Rv0533) 48
fadD32 (Rv3801c) 5 fbpC (Rv0129C) 21
folP1 (Rv3608C) 1 folP2 (Rv1207) 1
frdA (Rv1552) 1 ftsZ (Rv2150c) 3
fusA1 (Rv0684) 3 fusA2 (Rv0120c) 3
glcB (Rv837c) 1 glf (Rv3809c) 40
glmU (Rv1018c) 1 guab2 (Rv3411) 1
gyrA (Rv0006) 24 gyrB (Rv0005) 9
ilvG (Rv1820) 1 infB (Rv2839c) 1
inhA (Rv1484) 157 kasA (Rv2245) 9
kasB (Rv2246) 5 ldtMt1 (Rv0116c) 4
ldtMt2 (Rv2518c) 1 lpd (Rv0462) 5
lppS (Rv2515c) 1 mbtA (Rv2384) 95
mca (Rv1082) 27 mfd (Rv1020) 1
mmpL3 (Rv0206c) 15 moeW (Rv2338c) 1
mshB (Rv1170) 4 murB (Rv0482) 1
murD (Rv2155c) 2 nadB (Rv1595) 1
ndhA (Rv0392c) 1 nrdR (Rv2718c) 2
pH Homeostasis 5 panC (Rv3602c) 20
pks13 (Rv3800c) 3 proteasome 2
ptpA (Rv2234) 38 ptpB (Rv0153c) 3
purU (Rv2964) 2 qcrB (Rv2196) 5
quinol oxidase 1 recG (Rv2973c) 1
rplC (Rv0701) 2 rplJ (Rv0651) 3
rpoB (Rv0667) 4 sahH (Rv3248c) 2
thiL (Rv2977c) 106 tlyA (Rv1694) 2
Target distribution
in TB Mobile 2.
MoDELS RESIDE IN PAPERS
NOT ACCESSIBLE…THIS IS
UNDESIRABLE
How do we share them?
How do we use Them?
Open Extended Connectivity Fingerprints
ECFP_6 FCFP_6
• Collected,
deduplicated,
hashed
• Sparse integers
• Invented for Pipeline Pilot: public method, proprietary details
• Often used with Bayesian models: many published papers
• Built a new implementation: open source, Java, CDK
– stable: fingerprints don't change with each new toolkit release
– well defined: easy to document precise steps
– easy to port: already migrated to iOS (Objective-C) for TB Mobile app
• Provides core basis feature for CDD open source model
service
•Clark et al., J Cheminformatics 6: 38 (2014)
Testing the fingerprints – comparing to published data
Dataset Leave one out ROC
Published
Reference Leave one out
ROC in this study
Combined model (5304
molecules) ECFP_6
fingerprints
N/A N/A 0.77
Combined model (5304
molecules) FCFP_6
fingerprints
0.71 J Chem Inf
Model
53:3054-
3063.
0.77
MLSMR dual event model
(2273 molecules) and
ECFP_6 fingerprints
N/A N/A 0.84
MLSMR dual event model
(2273 molecules) and
FCFP_6 fingerprints
0.86 PLOSONE
8:e63240
0.83
Published models also include 8 additional descriptors as well as fingerprints
•Clark et al., J Cheminformatics 6: 38 (2014)
Predictions for the InhA target: (a) the ROC curve with ECFP_6 and FCFP_6
fingerprints; (b) modified Bayesian estimators for active and inactive
compounds; (c) structures of selected binders.
For each listed target with at least two binders, it is first assumed that all of the
molecules in the collection that do not indicate this as one of their targets are
inactive.
In the app we used ECFP_6 fingerprints
Building Bayesian models for each target
Predict targets
Cluster molecules
Open in MMDS
Bayesian predictions, data export and clustering
Clark et al., J Cheminformatics, 6: 38 (2014)
 Draw structures either in app, paste or open from
other apps e.g. MMDS
 TB Mobile ranks content
 TB Mobile can use built in target Bayesian models to
predict target
 Take a screenshot of results
 Output bayesian model predictions to MMDS
 Compare to published data
 Annotate results, tabulate
Process used to evaluate TB Mobile
We have curated an additional set of 20 molecules that have activity
against Mtb and were identified by HTS or other methods
Several targets were not in the database
Molecules active against Mtb evaluated in TB Mobile app
•Clark et al., J Cheminformatics 6: 38 (2014)
 Continue to update with more data
 Outreach to increase awareness of app and data
 Add machine learning algorithms to predict Mtb activity
(in vitro and in vivo whole cell activity)
 Could we appify similar target data for other neglected
diseases/ targets e.g. malaria
What next ?
Uses Bayesian algorithm and FCFP_6 fingerprints
Model with >300,000 compounds
Copyright © 2013 All Rights Reserved Collaborative Drug Discovery
MM4TB: 25 organizations
New
Old
Neuroscience
Kinetoplastid Drug Development
Consortium
 Exposure of CDD content from collaboration with SRI
 More visibility for brand in new places
 Experiment in small database with focus on content
delivery
 A functional app to reach scientists that may not have
cheminformatics or bioinformatics training
 Used in several collaborations e.g. at Rutgers etc.
 Test bed for developing technologies for main product
 Lead to CDD Models
 on Github look for the tools section, for class
org.openscience.cdk.fingerprint.model.Bayesian.
 Opened door to models in other apps
Benefits of creating TB Mobile
Papers published on TB Mobile or using it
Ekins et al., Tuberculosis 94: 162-169 (2014)
Ekins et al., J Chem Inf Model 53: 3054 (2013)
Clark et al., J Cheminformatics 6:38 (2014)
Ekins et al., J Cheminformatics 5:13 (2013)
Krishna Dole and all at CDD, SRI, IDRI and many others …Funding: 2R42AI088893-02 NIAID, NIH;
9R44TR000942-02 NCATS, NIH; CDD TB has been developed thanks to funding from the Bill and Melinda
Gates Foundation (Grant#49852)
Freundlich Lab
Drop by and visit CDD in Booth 1155.
App papers
• Ekins S, Clark AM, Swamidass SJ, Litterman N and Williams AJ, Bigger data, collaborative tools
and the future of predictive drug discovery, J Comp-Aided Mol Design, 54:2157-65, 2014.
• Clark AM, Sarker M and Ekins S, New target predictions and visualization tools incorporating
open source molecular fingerprints for TB Mobile 2.0, J Cheminform 6: 38, 2014.
• Ekins S, Casey A.C, Roberts D, Parish T. and Bunin BA, Bayesian Models for Screening and TB
Mobile for Target Inference with Mycobacterium tuberculosis, Tuberculosis, 94:162-9, 2014.
• Clark AM, Williams AJ and Ekins S, Cheminformatics workflows using mobile apps, Chem-Bio
Informatics Journal, 13: 1-18 2013.
• Ekins S, Clark AM and Williams AJ, Incorporating Green Chemistry Concepts into Mobile
Applications and their potential uses, ACS Sustain Chem Eng, 1. 8-13, 2013.
• Ekins S, Freundlich JS and Reynolds RC, Fusing dual-event datasets for Mycobacterium
Tuberculosis machine learning models and their evaluation, J Chem Inf Model, 53(11):3054-
63, 2013.
• Ekins S, Clark AM and Williams AJ, Open Drug Discovery Teams: A Chemistry Mobile App for
Collaboration, Mol Informatics, 31: 585-597, 2012.
• Clark, AM, Ekins S and Williams AJ, Redefining cheminformatics with intuitive collaborative
mobile apps, Mol Informatics, 31: 569-584, 2012.
• Williams, AJ, Ekins S, Clark AM, Jack JJ and Apodaca RL, Mobile apps for chemistry in the
world of drug discovery, Drug Disc Today, 16:928-939, 2011.
Extra slides
14 First line drugs active against Mtb evaluated in TB
Mobile app and the top 3 molecules shown
Confirms all in TB Mobile and retrieved
Ekins et al., Tuberculosis 94: 162-169 (2014)
Predicted targets
using TB Mobile
No verification
yet
iPhone Android
TB Mobile 1. layout on iPhone and Android
~745 compounds with targets
Predicted targets of GSK TB hits months
earlier using TB Mobile
GSK report hits Dec 2012
24th Jan 2013 http://goo.gl/9LKrPZ
GSK predict targets Oct 2013
PCA of 745 compounds
with Mtb targets (blue)
and 1200 Mtb active and
non cytotoxic hits
compounds (yellow)
Chemical property space of TB Mobile compounds
Ekins et al., Tuberculosis 94: 162-169 (2014)
PCA of 745 compounds
with Mtb targets (blue)
and 177 GSK Mtb leads
(yellow)
Ekins et al., J Chem Inf Model 53: 3054 (2013)

More Related Content

What's hot

AMR Collaborative Grants at the University of Bristol
AMR Collaborative Grants at the University of BristolAMR Collaborative Grants at the University of Bristol
AMR Collaborative Grants at the University of Bristolwarwick_amr
 
Vanderwall cheminformatics Drexel Part 1
Vanderwall cheminformatics Drexel Part 1Vanderwall cheminformatics Drexel Part 1
Vanderwall cheminformatics Drexel Part 1Jean-Claude Bradley
 
NetBioSIG2013-Talk Thomas Kelder
NetBioSIG2013-Talk Thomas KelderNetBioSIG2013-Talk Thomas Kelder
NetBioSIG2013-Talk Thomas KelderAlexander Pico
 
The Role of Bioinformatics in The Drug Discovery Process
The Role of Bioinformatics in The Drug Discovery ProcessThe Role of Bioinformatics in The Drug Discovery Process
The Role of Bioinformatics in The Drug Discovery ProcessAdebowale Qazeem
 
Mel Reichman on Pool Shark’s Cues for More Efficient Drug Discovery
Mel Reichman on Pool Shark’s Cues for More Efficient Drug DiscoveryMel Reichman on Pool Shark’s Cues for More Efficient Drug Discovery
Mel Reichman on Pool Shark’s Cues for More Efficient Drug DiscoveryJean-Claude Bradley
 
How Can We Make Genomic Epidemiology a Widespread Reality? - William Hsiao
How Can We Make Genomic Epidemiology a Widespread Reality?  - William HsiaoHow Can We Make Genomic Epidemiology a Widespread Reality?  - William Hsiao
How Can We Make Genomic Epidemiology a Widespread Reality? - William HsiaoWilliam Hsiao
 
dkNET Webinar: Population-Based Approaches to Investigate Endocrine Communica...
dkNET Webinar: Population-Based Approaches to Investigate Endocrine Communica...dkNET Webinar: Population-Based Approaches to Investigate Endocrine Communica...
dkNET Webinar: Population-Based Approaches to Investigate Endocrine Communica...dkNET
 
Role of bioinformatics in drug designing
Role of bioinformatics in drug designingRole of bioinformatics in drug designing
Role of bioinformatics in drug designingW Roseybala Devi
 
Slas talk 2016
Slas talk 2016Slas talk 2016
Slas talk 2016Sean Ekins
 
Computational design of novel candidate drug molecules for schistosomiasis
Computational design of novel candidate drug molecules for schistosomiasisComputational design of novel candidate drug molecules for schistosomiasis
Computational design of novel candidate drug molecules for schistosomiasisAlexander Decker
 
Effect of the Gayatri Mantra Playing on Microbial Load in Room Air
Effect of the Gayatri Mantra Playing on Microbial Load in Room AirEffect of the Gayatri Mantra Playing on Microbial Load in Room Air
Effect of the Gayatri Mantra Playing on Microbial Load in Room AirBhoj Raj Singh
 
Modelling the Kinetic of UV Water Disinfection
Modelling the Kinetic of UV Water DisinfectionModelling the Kinetic of UV Water Disinfection
Modelling the Kinetic of UV Water DisinfectionMichael George
 
Bioc strucvariant seattle_11_09
Bioc strucvariant seattle_11_09Bioc strucvariant seattle_11_09
Bioc strucvariant seattle_11_09Sean Davis
 
Using Neo4j technologies for the management of systems biology models
Using Neo4j technologies for the management of systems biology modelsUsing Neo4j technologies for the management of systems biology models
Using Neo4j technologies for the management of systems biology modelsUniversity Medicine Greifswald
 
Quantitative Proteomics: From Instrument To Browser
Quantitative Proteomics: From Instrument To BrowserQuantitative Proteomics: From Instrument To Browser
Quantitative Proteomics: From Instrument To BrowserNeil Swainston
 

What's hot (20)

AMR Collaborative Grants at the University of Bristol
AMR Collaborative Grants at the University of BristolAMR Collaborative Grants at the University of Bristol
AMR Collaborative Grants at the University of Bristol
 
Kishor Presentation
Kishor PresentationKishor Presentation
Kishor Presentation
 
2016 Analyst MALDI Env Microb_ICS_ASAP
2016 Analyst MALDI Env Microb_ICS_ASAP2016 Analyst MALDI Env Microb_ICS_ASAP
2016 Analyst MALDI Env Microb_ICS_ASAP
 
Vanderwall cheminformatics Drexel Part 1
Vanderwall cheminformatics Drexel Part 1Vanderwall cheminformatics Drexel Part 1
Vanderwall cheminformatics Drexel Part 1
 
NetBioSIG2013-Talk Thomas Kelder
NetBioSIG2013-Talk Thomas KelderNetBioSIG2013-Talk Thomas Kelder
NetBioSIG2013-Talk Thomas Kelder
 
The Role of Bioinformatics in The Drug Discovery Process
The Role of Bioinformatics in The Drug Discovery ProcessThe Role of Bioinformatics in The Drug Discovery Process
The Role of Bioinformatics in The Drug Discovery Process
 
GoTermsAnalysisWithR
GoTermsAnalysisWithRGoTermsAnalysisWithR
GoTermsAnalysisWithR
 
Mel Reichman on Pool Shark’s Cues for More Efficient Drug Discovery
Mel Reichman on Pool Shark’s Cues for More Efficient Drug DiscoveryMel Reichman on Pool Shark’s Cues for More Efficient Drug Discovery
Mel Reichman on Pool Shark’s Cues for More Efficient Drug Discovery
 
How Can We Make Genomic Epidemiology a Widespread Reality? - William Hsiao
How Can We Make Genomic Epidemiology a Widespread Reality?  - William HsiaoHow Can We Make Genomic Epidemiology a Widespread Reality?  - William Hsiao
How Can We Make Genomic Epidemiology a Widespread Reality? - William Hsiao
 
dkNET Webinar: Population-Based Approaches to Investigate Endocrine Communica...
dkNET Webinar: Population-Based Approaches to Investigate Endocrine Communica...dkNET Webinar: Population-Based Approaches to Investigate Endocrine Communica...
dkNET Webinar: Population-Based Approaches to Investigate Endocrine Communica...
 
Role of bioinformatics in drug designing
Role of bioinformatics in drug designingRole of bioinformatics in drug designing
Role of bioinformatics in drug designing
 
Slas talk 2016
Slas talk 2016Slas talk 2016
Slas talk 2016
 
Computational design of novel candidate drug molecules for schistosomiasis
Computational design of novel candidate drug molecules for schistosomiasisComputational design of novel candidate drug molecules for schistosomiasis
Computational design of novel candidate drug molecules for schistosomiasis
 
Effect of the Gayatri Mantra Playing on Microbial Load in Room Air
Effect of the Gayatri Mantra Playing on Microbial Load in Room AirEffect of the Gayatri Mantra Playing on Microbial Load in Room Air
Effect of the Gayatri Mantra Playing on Microbial Load in Room Air
 
Modelling the Kinetic of UV Water Disinfection
Modelling the Kinetic of UV Water DisinfectionModelling the Kinetic of UV Water Disinfection
Modelling the Kinetic of UV Water Disinfection
 
Bioc strucvariant seattle_11_09
Bioc strucvariant seattle_11_09Bioc strucvariant seattle_11_09
Bioc strucvariant seattle_11_09
 
Using Neo4j technologies for the management of systems biology models
Using Neo4j technologies for the management of systems biology modelsUsing Neo4j technologies for the management of systems biology models
Using Neo4j technologies for the management of systems biology models
 
Quantitative Proteomics: From Instrument To Browser
Quantitative Proteomics: From Instrument To BrowserQuantitative Proteomics: From Instrument To Browser
Quantitative Proteomics: From Instrument To Browser
 
Enriching Scholarship Personal Genomics presentation
Enriching Scholarship Personal Genomics presentationEnriching Scholarship Personal Genomics presentation
Enriching Scholarship Personal Genomics presentation
 
The Future of Computational Models for Predicting Human Toxicities
The Future of Computational Models for Predicting Human ToxicitiesThe Future of Computational Models for Predicting Human Toxicities
The Future of Computational Models for Predicting Human Toxicities
 

Viewers also liked

Drug Discovery Today March 2017 special issue
Drug Discovery Today March 2017 special issueDrug Discovery Today March 2017 special issue
Drug Discovery Today March 2017 special issueSean Ekins
 
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan DiseasesUsing In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan DiseasesSean Ekins
 
Pasteurized packed milk
Pasteurized packed milkPasteurized packed milk
Pasteurized packed milkAbhinav Kp
 
Milk microbiology standards of milk and milk product22222 (2)
Milk microbiology  standards of milk and milk product22222 (2)Milk microbiology  standards of milk and milk product22222 (2)
Milk microbiology standards of milk and milk product22222 (2)shivnam
 
Food Packaging of Dairy Products
Food Packaging of Dairy ProductsFood Packaging of Dairy Products
Food Packaging of Dairy ProductsIlyana Causing
 
Milk packaging yesterday, today, tomorrrow
Milk packaging  yesterday, today, tomorrrowMilk packaging  yesterday, today, tomorrrow
Milk packaging yesterday, today, tomorrrowkarthipack12
 
Packaging of MILK & CHEESE
Packaging of MILK & CHEESEPackaging of MILK & CHEESE
Packaging of MILK & CHEESEAmmar Babar
 
Microbiology of milk
Microbiology of milkMicrobiology of milk
Microbiology of milkYash Gajwani
 
Dairy Microbiology. Methods of preservation of milk and Milk Products
Dairy Microbiology. Methods of preservation of milk and Milk ProductsDairy Microbiology. Methods of preservation of milk and Milk Products
Dairy Microbiology. Methods of preservation of milk and Milk ProductsSaugat Bhattacharjee
 
Marketing - Packaging Project CBSE
Marketing - Packaging Project CBSEMarketing - Packaging Project CBSE
Marketing - Packaging Project CBSESaish Gaonkar
 
Fermented milk products
Fermented milk products Fermented milk products
Fermented milk products Ankita Patil
 
Milk and milk products
Milk and milk productsMilk and milk products
Milk and milk productsEkta Belwal
 

Viewers also liked (15)

ECDC MRSA molecular typing initiative: current status, Daniel Palm (ECDC)
ECDC MRSA molecular typing initiative: current status, Daniel Palm (ECDC)ECDC MRSA molecular typing initiative: current status, Daniel Palm (ECDC)
ECDC MRSA molecular typing initiative: current status, Daniel Palm (ECDC)
 
Drug Discovery Today March 2017 special issue
Drug Discovery Today March 2017 special issueDrug Discovery Today March 2017 special issue
Drug Discovery Today March 2017 special issue
 
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan DiseasesUsing In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
 
Pasteurized packed milk
Pasteurized packed milkPasteurized packed milk
Pasteurized packed milk
 
Milk microbiology standards of milk and milk product22222 (2)
Milk microbiology  standards of milk and milk product22222 (2)Milk microbiology  standards of milk and milk product22222 (2)
Milk microbiology standards of milk and milk product22222 (2)
 
Food Packaging of Dairy Products
Food Packaging of Dairy ProductsFood Packaging of Dairy Products
Food Packaging of Dairy Products
 
Milk packaging yesterday, today, tomorrrow
Milk packaging  yesterday, today, tomorrrowMilk packaging  yesterday, today, tomorrrow
Milk packaging yesterday, today, tomorrrow
 
Packaging of MILK & CHEESE
Packaging of MILK & CHEESEPackaging of MILK & CHEESE
Packaging of MILK & CHEESE
 
Microbiology of milk
Microbiology of milkMicrobiology of milk
Microbiology of milk
 
Milk Microbiology
Milk MicrobiologyMilk Microbiology
Milk Microbiology
 
Dairy Microbiology. Methods of preservation of milk and Milk Products
Dairy Microbiology. Methods of preservation of milk and Milk ProductsDairy Microbiology. Methods of preservation of milk and Milk Products
Dairy Microbiology. Methods of preservation of milk and Milk Products
 
Marketing - Packaging Project CBSE
Marketing - Packaging Project CBSEMarketing - Packaging Project CBSE
Marketing - Packaging Project CBSE
 
Fermented milk products
Fermented milk products Fermented milk products
Fermented milk products
 
Milk and milk products
Milk and milk productsMilk and milk products
Milk and milk products
 
analysis of milk
analysis of milkanalysis of milk
analysis of milk
 

Similar to New Target Prediction and Visualization Tools Incorporating Open Source Molecular Fingerprints For TB Mobile Version 2.0

Exploiting bigger data and collaborative tools for predictive drug discovery
Exploiting bigger data and collaborative tools for predictive drug discovery Exploiting bigger data and collaborative tools for predictive drug discovery
Exploiting bigger data and collaborative tools for predictive drug discovery Sean Ekins
 
CDD: Vault, CDD: Vision and CDD: Models for Drug Discovery Collaborations
CDD: Vault, CDD: Vision and CDD: Models for Drug Discovery CollaborationsCDD: Vault, CDD: Vision and CDD: Models for Drug Discovery Collaborations
CDD: Vault, CDD: Vision and CDD: Models for Drug Discovery CollaborationsSean Ekins
 
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...Sean Ekins
 
Grafström - Lush Prize Conference 2014
Grafström - Lush Prize Conference 2014Grafström - Lush Prize Conference 2014
Grafström - Lush Prize Conference 2014LushPrize
 
Bigger Data to Increase Drug Discovery
Bigger Data to Increase Drug DiscoveryBigger Data to Increase Drug Discovery
Bigger Data to Increase Drug DiscoverySean Ekins
 
Computational Biology thesis defense
Computational Biology thesis defenseComputational Biology thesis defense
Computational Biology thesis defensecsfunk
 
Cheminformatics Workflows Using Mobile Apps for Drug Discovery
Cheminformatics Workflows Using Mobile Apps for Drug DiscoveryCheminformatics Workflows Using Mobile Apps for Drug Discovery
Cheminformatics Workflows Using Mobile Apps for Drug DiscoverySean Ekins
 
Slides for st judes
Slides for st judesSlides for st judes
Slides for st judesSean Ekins
 
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning Models
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning ModelsMining 'Bigger' Datasets to Create, Validate and Share Machine Learning Models
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning ModelsSean Ekins
 
TB Mobile: Appifying data on antituberculosis molecule targets
TB Mobile: Appifying data on antituberculosis molecule targetsTB Mobile: Appifying data on antituberculosis molecule targets
TB Mobile: Appifying data on antituberculosis molecule targetsSean Ekins
 
Grand round whsiao_may2015
Grand round whsiao_may2015Grand round whsiao_may2015
Grand round whsiao_may2015IRIDA_community
 
Development of machine learning-based prediction models for chemical modulato...
Development of machine learning-based prediction models for chemical modulato...Development of machine learning-based prediction models for chemical modulato...
Development of machine learning-based prediction models for chemical modulato...Sunghwan Kim
 
T-BioInfo Methods and Approaches
T-BioInfo Methods and ApproachesT-BioInfo Methods and Approaches
T-BioInfo Methods and ApproachesElia Brodsky
 
UKSG 2023 - Will artificial intelligence change how readers use the research ...
UKSG 2023 - Will artificial intelligence change how readers use the research ...UKSG 2023 - Will artificial intelligence change how readers use the research ...
UKSG 2023 - Will artificial intelligence change how readers use the research ...UKSG: connecting the knowledge community
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Ian Foster
 
Nc state lecture v2 Computational Toxicology
Nc state lecture v2 Computational ToxicologyNc state lecture v2 Computational Toxicology
Nc state lecture v2 Computational ToxicologySean Ekins
 
Medicinal Chemistry Due Diligence: Computational Predictions of an expert’s e...
Medicinal Chemistry Due Diligence: Computational Predictions of an expert’s e...Medicinal Chemistry Due Diligence: Computational Predictions of an expert’s e...
Medicinal Chemistry Due Diligence: Computational Predictions of an expert’s e...Sean Ekins
 
Combining Metabolite-Based Pharmacophores with Bayesian Machine Learning Mode...
Combining Metabolite-Based Pharmacophores with Bayesian Machine Learning Mode...Combining Metabolite-Based Pharmacophores with Bayesian Machine Learning Mode...
Combining Metabolite-Based Pharmacophores with Bayesian Machine Learning Mode...Sean Ekins
 
WEBINAR: The Yosemite Project PART 6 -- Data-Driven Biomedical Research with ...
WEBINAR: The Yosemite Project PART 6 -- Data-Driven Biomedical Research with ...WEBINAR: The Yosemite Project PART 6 -- Data-Driven Biomedical Research with ...
WEBINAR: The Yosemite Project PART 6 -- Data-Driven Biomedical Research with ...DATAVERSITY
 

Similar to New Target Prediction and Visualization Tools Incorporating Open Source Molecular Fingerprints For TB Mobile Version 2.0 (20)

Exploiting bigger data and collaborative tools for predictive drug discovery
Exploiting bigger data and collaborative tools for predictive drug discovery Exploiting bigger data and collaborative tools for predictive drug discovery
Exploiting bigger data and collaborative tools for predictive drug discovery
 
CDD: Vault, CDD: Vision and CDD: Models for Drug Discovery Collaborations
CDD: Vault, CDD: Vision and CDD: Models for Drug Discovery CollaborationsCDD: Vault, CDD: Vision and CDD: Models for Drug Discovery Collaborations
CDD: Vault, CDD: Vision and CDD: Models for Drug Discovery Collaborations
 
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
 
Grafström - Lush Prize Conference 2014
Grafström - Lush Prize Conference 2014Grafström - Lush Prize Conference 2014
Grafström - Lush Prize Conference 2014
 
Bigger Data to Increase Drug Discovery
Bigger Data to Increase Drug DiscoveryBigger Data to Increase Drug Discovery
Bigger Data to Increase Drug Discovery
 
Computational Biology thesis defense
Computational Biology thesis defenseComputational Biology thesis defense
Computational Biology thesis defense
 
Cheminformatics Workflows Using Mobile Apps for Drug Discovery
Cheminformatics Workflows Using Mobile Apps for Drug DiscoveryCheminformatics Workflows Using Mobile Apps for Drug Discovery
Cheminformatics Workflows Using Mobile Apps for Drug Discovery
 
Slides for st judes
Slides for st judesSlides for st judes
Slides for st judes
 
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning Models
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning ModelsMining 'Bigger' Datasets to Create, Validate and Share Machine Learning Models
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning Models
 
TB Mobile: Appifying data on antituberculosis molecule targets
TB Mobile: Appifying data on antituberculosis molecule targetsTB Mobile: Appifying data on antituberculosis molecule targets
TB Mobile: Appifying data on antituberculosis molecule targets
 
Grand round whsiao_may2015
Grand round whsiao_may2015Grand round whsiao_may2015
Grand round whsiao_may2015
 
Development of machine learning-based prediction models for chemical modulato...
Development of machine learning-based prediction models for chemical modulato...Development of machine learning-based prediction models for chemical modulato...
Development of machine learning-based prediction models for chemical modulato...
 
T-BioInfo Methods and Approaches
T-BioInfo Methods and ApproachesT-BioInfo Methods and Approaches
T-BioInfo Methods and Approaches
 
T-bioinfo overview
T-bioinfo overviewT-bioinfo overview
T-bioinfo overview
 
UKSG 2023 - Will artificial intelligence change how readers use the research ...
UKSG 2023 - Will artificial intelligence change how readers use the research ...UKSG 2023 - Will artificial intelligence change how readers use the research ...
UKSG 2023 - Will artificial intelligence change how readers use the research ...
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009
 
Nc state lecture v2 Computational Toxicology
Nc state lecture v2 Computational ToxicologyNc state lecture v2 Computational Toxicology
Nc state lecture v2 Computational Toxicology
 
Medicinal Chemistry Due Diligence: Computational Predictions of an expert’s e...
Medicinal Chemistry Due Diligence: Computational Predictions of an expert’s e...Medicinal Chemistry Due Diligence: Computational Predictions of an expert’s e...
Medicinal Chemistry Due Diligence: Computational Predictions of an expert’s e...
 
Combining Metabolite-Based Pharmacophores with Bayesian Machine Learning Mode...
Combining Metabolite-Based Pharmacophores with Bayesian Machine Learning Mode...Combining Metabolite-Based Pharmacophores with Bayesian Machine Learning Mode...
Combining Metabolite-Based Pharmacophores with Bayesian Machine Learning Mode...
 
WEBINAR: The Yosemite Project PART 6 -- Data-Driven Biomedical Research with ...
WEBINAR: The Yosemite Project PART 6 -- Data-Driven Biomedical Research with ...WEBINAR: The Yosemite Project PART 6 -- Data-Driven Biomedical Research with ...
WEBINAR: The Yosemite Project PART 6 -- Data-Driven Biomedical Research with ...
 

More from Sean Ekins

How to Win a small business grant.pptx
How to Win a small business grant.pptxHow to Win a small business grant.pptx
How to Win a small business grant.pptxSean Ekins
 
Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...
Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...
Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...Sean Ekins
 
A presentation at the Global Genes rare drug development symposium on governm...
A presentation at the Global Genes rare drug development symposium on governm...A presentation at the Global Genes rare drug development symposium on governm...
A presentation at the Global Genes rare drug development symposium on governm...Sean Ekins
 
Leveraging Science Communication and Social Media to Build Your Brand and Ele...
Leveraging Science Communication and Social Media to Build Your Brand and Ele...Leveraging Science Communication and Social Media to Build Your Brand and Ele...
Leveraging Science Communication and Social Media to Build Your Brand and Ele...Sean Ekins
 
Bayesian Models for Chagas Disease
Bayesian Models for Chagas DiseaseBayesian Models for Chagas Disease
Bayesian Models for Chagas DiseaseSean Ekins
 
Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...
Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...
Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...Sean Ekins
 
Five Ways to Use Social Media to Raise Awareness for Your Paper or Research
Five Ways to Use Social Media to Raise Awareness for Your Paper or ResearchFive Ways to Use Social Media to Raise Awareness for Your Paper or Research
Five Ways to Use Social Media to Raise Awareness for Your Paper or ResearchSean Ekins
 
Open zika presentation
Open zika presentation Open zika presentation
Open zika presentation Sean Ekins
 
academic / small company collaborations for rare and neglected diseasesv2
 academic / small company collaborations for rare and neglected diseasesv2 academic / small company collaborations for rare and neglected diseasesv2
academic / small company collaborations for rare and neglected diseasesv2Sean Ekins
 
CDD models case study #3
CDD models case study #3 CDD models case study #3
CDD models case study #3 Sean Ekins
 
CDD models case study #2
CDD models case study #2 CDD models case study #2
CDD models case study #2 Sean Ekins
 
CDD Models case study #1
CDD Models case study #1 CDD Models case study #1
CDD Models case study #1 Sean Ekins
 
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...Sean Ekins
 
The future of computational chemistry b ig
The future of computational chemistry b igThe future of computational chemistry b ig
The future of computational chemistry b igSean Ekins
 
#ZikaOpen: Homology Models -
#ZikaOpen: Homology Models - #ZikaOpen: Homology Models -
#ZikaOpen: Homology Models - Sean Ekins
 
Pros and cons of social networking for scientists
Pros and cons of social networking for scientistsPros and cons of social networking for scientists
Pros and cons of social networking for scientistsSean Ekins
 
Rare pediatric and neglected tropical diseases priority review voucher and tr...
Rare pediatric and neglected tropical diseases priority review voucher and tr...Rare pediatric and neglected tropical diseases priority review voucher and tr...
Rare pediatric and neglected tropical diseases priority review voucher and tr...Sean Ekins
 
C&E news talk sept 16
C&E news talk sept 16C&E news talk sept 16
C&E news talk sept 16Sean Ekins
 
Infographic for Sanfilippo Syndrome IIIC and IIID
Infographic for Sanfilippo Syndrome IIIC and IIIDInfographic for Sanfilippo Syndrome IIIC and IIID
Infographic for Sanfilippo Syndrome IIIC and IIIDSean Ekins
 
Cmt update-summer-newsletter-2015
Cmt update-summer-newsletter-2015Cmt update-summer-newsletter-2015
Cmt update-summer-newsletter-2015Sean Ekins
 

More from Sean Ekins (20)

How to Win a small business grant.pptx
How to Win a small business grant.pptxHow to Win a small business grant.pptx
How to Win a small business grant.pptx
 
Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...
Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...
Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...
 
A presentation at the Global Genes rare drug development symposium on governm...
A presentation at the Global Genes rare drug development symposium on governm...A presentation at the Global Genes rare drug development symposium on governm...
A presentation at the Global Genes rare drug development symposium on governm...
 
Leveraging Science Communication and Social Media to Build Your Brand and Ele...
Leveraging Science Communication and Social Media to Build Your Brand and Ele...Leveraging Science Communication and Social Media to Build Your Brand and Ele...
Leveraging Science Communication and Social Media to Build Your Brand and Ele...
 
Bayesian Models for Chagas Disease
Bayesian Models for Chagas DiseaseBayesian Models for Chagas Disease
Bayesian Models for Chagas Disease
 
Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...
Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...
Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...
 
Five Ways to Use Social Media to Raise Awareness for Your Paper or Research
Five Ways to Use Social Media to Raise Awareness for Your Paper or ResearchFive Ways to Use Social Media to Raise Awareness for Your Paper or Research
Five Ways to Use Social Media to Raise Awareness for Your Paper or Research
 
Open zika presentation
Open zika presentation Open zika presentation
Open zika presentation
 
academic / small company collaborations for rare and neglected diseasesv2
 academic / small company collaborations for rare and neglected diseasesv2 academic / small company collaborations for rare and neglected diseasesv2
academic / small company collaborations for rare and neglected diseasesv2
 
CDD models case study #3
CDD models case study #3 CDD models case study #3
CDD models case study #3
 
CDD models case study #2
CDD models case study #2 CDD models case study #2
CDD models case study #2
 
CDD Models case study #1
CDD Models case study #1 CDD Models case study #1
CDD Models case study #1
 
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...
 
The future of computational chemistry b ig
The future of computational chemistry b igThe future of computational chemistry b ig
The future of computational chemistry b ig
 
#ZikaOpen: Homology Models -
#ZikaOpen: Homology Models - #ZikaOpen: Homology Models -
#ZikaOpen: Homology Models -
 
Pros and cons of social networking for scientists
Pros and cons of social networking for scientistsPros and cons of social networking for scientists
Pros and cons of social networking for scientists
 
Rare pediatric and neglected tropical diseases priority review voucher and tr...
Rare pediatric and neglected tropical diseases priority review voucher and tr...Rare pediatric and neglected tropical diseases priority review voucher and tr...
Rare pediatric and neglected tropical diseases priority review voucher and tr...
 
C&E news talk sept 16
C&E news talk sept 16C&E news talk sept 16
C&E news talk sept 16
 
Infographic for Sanfilippo Syndrome IIIC and IIID
Infographic for Sanfilippo Syndrome IIIC and IIIDInfographic for Sanfilippo Syndrome IIIC and IIID
Infographic for Sanfilippo Syndrome IIIC and IIID
 
Cmt update-summer-newsletter-2015
Cmt update-summer-newsletter-2015Cmt update-summer-newsletter-2015
Cmt update-summer-newsletter-2015
 

New Target Prediction and Visualization Tools Incorporating Open Source Molecular Fingerprints For TB Mobile Version 2.0

  • 1. New Target Prediction and Visualization Tools Incorporating Open Source Molecular Fingerprints For TB Mobile Version 2.0 Alex M. Clark1 Malabika Sarker2 and Sean Ekins3, 4 1Molecular Materials Informatics, 1900 St. Jacques #302, Montreal Quebec, Canada H3J 2S1 2SRI International, 333 Ravenswood Avenue, Menlo Park, CA 94025, USA. 3Collaborative Drug Discovery, 1633 Bayshore Highway, Suite 342, Burlingame, CA 94010, USA. 4Collaborations in Chemistry, 5616 Hilltop Needmore Road, Fuquay-Varina, NC 27526, USA. .
  • 2. Tuberculosis kills 1.6-1.7m/yr (~1 every 8 seconds) 1/3rd of worlds population infected!!!! streptomycin (1943) para-aminosalicyclic acid (1949) isoniazid (1952) pyrazinamide (1954) cycloserine (1955) ethambutol (1962) rifampicin (1967) • Multi drug resistance in 4.3% of cases • Extensively drug resistant increasing incidence • One new drug (bedaquiline) in 40 yrs TB key points
  • 3. Tested >350,000 molecules Tested ~2M 2M >300,000 >1500 active and non toxic Published 177 100s 800 Big Data: Screening for New Tuberculosis Treatments TBDA screened over 1 million, 1 million more to go TB Alliance + Japanese pharma screens How many will become a new drug? How do we learn from this big data? What are the targets for these molecules?
  • 4. Over 6 years analyzed in vitro data and built models Top scoring molecules assayed for Mtb growth inhibition Mtb screening molecule database/s High-throughput phenotypic Mtb screening Descriptors + Bioactivity (+Cytotoxicity) Bayesian Machine Learning classification Mtb Model Molecule Database (e.g. GSK malaria actives) virtually scored using Bayesian Models New bioactivity data may enhance models Identify in vitro hits and test models3 x published prospective tests ~750 molecules were tested in vitro 198 actives were identified >20 % hit rate Multiple retrospective tests 3-10 fold enrichment N H S N Ekins et al., Pharm Res 31: 414-435, 2014 Ekins, et al., Tuberculosis 94; 162-169, 2014 Ekins, et al., PLOSONE 8; e63240, 2013 Ekins, et al., Chem Biol 20: 370-378, 2013 Ekins, et al., JCIM, 53: 3054−3063, 2013 Ekins and Freundlich, Pharm Res, 28, 1859-1869, 2011 Ekins et al., Mol BioSyst, 6: 840-851, 2010 Ekins, et al., Mol. Biosyst. 6, 2316-2324, 2010,
  • 5. Pathway analysis Binding site similarity to Mtb proteins Docking Bayesian Models - ligand similarity Predicting the target/s for small molecules
  • 6. Multi-step process 1. Identification of essential in vivo enzymes of Mtb involved intensive literature mining and manual curation, to extract all the genes essential for Mtb growth in vivo across species. 2. Homolog information was collated from other studies. 3. Collection of metabolic pathway information involved using TBDB. 4. Identifying molecules and drugs with known or predicted targets involved searching the CDD databases for manually curated data. The structures and data were exported for combination with the other data. 5. All data were combined with URL links to literature and TBDB and deposited in the CDD database. Initially over 700 molecules in dataset Dataset Curation: TB molecules and target information database connects molecule, gene, pathway and literature Sarker et al., Pharm Res 2012, 29, 2115-2127.
  • 7. TB molecules and target information database connects molecule, gene, pathway and literature
  • 8. • mobile is revolutionary: a clean break  entirely new user interface  no backward compatibility  highly constrained resources  applicable to entirely new situations mainframes minicomputers personal computers portable laptops mobile tablets smartphones ? The future of computing for science is mobile
  • 9.  Create an app that would be useful to TB researchers  An active compound from phenotypic screens– but what is target  Predict TB targets by similarity, mine published data  Push the boundaries of what can be done in a mobile app Cheminformatics and Bioinformatics in an App
  • 10. http://goo.gl/vPOKS http://goo.gl/iDJFR TB Mobile 2– Is on iTunes and TB Mobile 1 is on Google play and are FREE
  • 11. TB Mobile 2. layout on iPhone About CDD Molecule search Filters Action Menu Molecule prediction Clustering About TB Mobile Control block Compound list Text search
  • 12. TB Mobile 2. iPhone vs TB Mobile 1. Android Molecule Detail and Links iPhone Android Bookmark copy open-in cluster close
  • 13. TB Mobile 2. iPhone vs TB Mobile 1. Android Similarity Searching in the app iPhone Android
  • 14. TB Mobile 2. – Filtering and Sharing Functions Data can also be filtered by target name, pathway name, essentiality and human ortholog
  • 15. TB Mobile 2. iPhone Filtering and Sharing Functions Each molecule can be copied to the clipboard then opened with other apps (e.g. MMDS, MolPrime, MolSync, ChemSpider, and from these exported via Twitter or email) or shared via Dropbox.
  • 16. PCA of 745 compounds with Mtb targets (blue) and 60 new compounds (yellow) Chemical property space of screening hits and molecules evaluated in TB Mobile 2. PCA of 745 compounds with Mtb targets (blue) and 20 new test compounds (yellow)
  • 17. Gene Count Gene Count Rv0283 1 Rv0678 1 Rv1211 1 Rv1685c 1 Rv1885c 4 Rv3160c 1 Rv3161c 1 TB27.3 (Rv0577) 2 ald (Rv2780) 2 alr (Rv3423c) 8 aroD (Rv2537c) 14 aspS (Rv2572c) 1 atpE (Rv1305) 2 blaC (Rv2068c) 1 clpB (Rv0384c) 1 clpC (Rv3596c) 1 cyp121 (Rv2276) 2 cyp130 (Rv1256c) 2 cyp51 (Rv0764c) 2 cysH (Rv2392) 10 cysS (Rv2130c) 1 dacB2 (Rv2911) 1 dapA (Rv2753c) 12 deaD (Rv1253) 1 def (Rv0429c) 14 dfrA (Rv2763c) 3 dinG (Rv1329c)" (count=1) 1 dlaT (Rv2215) 2 dnaA (Rv0001)" (count=1) 1 dnaB (Rv0058) 1 dnaE2 (Rv3370c)" (count=1) 1 dprE1 (Rv3790) 8 dprE2" (count=1) 1 drpE2 (Rv3791) 2 dxr (Rv2870c) 1 dxs1 (Rv2682C) 29 embA (Rv3794) 2 embB (Rv3795) 1 embC (Rv3793) 1 engA (Rv1713) 1 era (Rv2364c) 1 ethA (Rv3854c) 1 fabG (Rv0242c) 2 fabH (Rv0533) 48 fadD32 (Rv3801c) 5 fbpC (Rv0129C) 21 folP1 (Rv3608C) 1 folP2 (Rv1207) 1 frdA (Rv1552) 1 ftsZ (Rv2150c) 3 fusA1 (Rv0684) 3 fusA2 (Rv0120c) 3 glcB (Rv837c) 1 glf (Rv3809c) 40 glmU (Rv1018c) 1 guab2 (Rv3411) 1 gyrA (Rv0006) 24 gyrB (Rv0005) 9 ilvG (Rv1820) 1 infB (Rv2839c) 1 inhA (Rv1484) 157 kasA (Rv2245) 9 kasB (Rv2246) 5 ldtMt1 (Rv0116c) 4 ldtMt2 (Rv2518c) 1 lpd (Rv0462) 5 lppS (Rv2515c) 1 mbtA (Rv2384) 95 mca (Rv1082) 27 mfd (Rv1020) 1 mmpL3 (Rv0206c) 15 moeW (Rv2338c) 1 mshB (Rv1170) 4 murB (Rv0482) 1 murD (Rv2155c) 2 nadB (Rv1595) 1 ndhA (Rv0392c) 1 nrdR (Rv2718c) 2 pH Homeostasis 5 panC (Rv3602c) 20 pks13 (Rv3800c) 3 proteasome 2 ptpA (Rv2234) 38 ptpB (Rv0153c) 3 purU (Rv2964) 2 qcrB (Rv2196) 5 quinol oxidase 1 recG (Rv2973c) 1 rplC (Rv0701) 2 rplJ (Rv0651) 3 rpoB (Rv0667) 4 sahH (Rv3248c) 2 thiL (Rv2977c) 106 tlyA (Rv1694) 2 Target distribution in TB Mobile 2.
  • 18. MoDELS RESIDE IN PAPERS NOT ACCESSIBLE…THIS IS UNDESIRABLE How do we share them? How do we use Them?
  • 19. Open Extended Connectivity Fingerprints ECFP_6 FCFP_6 • Collected, deduplicated, hashed • Sparse integers • Invented for Pipeline Pilot: public method, proprietary details • Often used with Bayesian models: many published papers • Built a new implementation: open source, Java, CDK – stable: fingerprints don't change with each new toolkit release – well defined: easy to document precise steps – easy to port: already migrated to iOS (Objective-C) for TB Mobile app • Provides core basis feature for CDD open source model service •Clark et al., J Cheminformatics 6: 38 (2014)
  • 20. Testing the fingerprints – comparing to published data Dataset Leave one out ROC Published Reference Leave one out ROC in this study Combined model (5304 molecules) ECFP_6 fingerprints N/A N/A 0.77 Combined model (5304 molecules) FCFP_6 fingerprints 0.71 J Chem Inf Model 53:3054- 3063. 0.77 MLSMR dual event model (2273 molecules) and ECFP_6 fingerprints N/A N/A 0.84 MLSMR dual event model (2273 molecules) and FCFP_6 fingerprints 0.86 PLOSONE 8:e63240 0.83 Published models also include 8 additional descriptors as well as fingerprints •Clark et al., J Cheminformatics 6: 38 (2014)
  • 21. Predictions for the InhA target: (a) the ROC curve with ECFP_6 and FCFP_6 fingerprints; (b) modified Bayesian estimators for active and inactive compounds; (c) structures of selected binders. For each listed target with at least two binders, it is first assumed that all of the molecules in the collection that do not indicate this as one of their targets are inactive. In the app we used ECFP_6 fingerprints Building Bayesian models for each target
  • 22. Predict targets Cluster molecules Open in MMDS Bayesian predictions, data export and clustering Clark et al., J Cheminformatics, 6: 38 (2014)
  • 23.  Draw structures either in app, paste or open from other apps e.g. MMDS  TB Mobile ranks content  TB Mobile can use built in target Bayesian models to predict target  Take a screenshot of results  Output bayesian model predictions to MMDS  Compare to published data  Annotate results, tabulate Process used to evaluate TB Mobile
  • 24. We have curated an additional set of 20 molecules that have activity against Mtb and were identified by HTS or other methods Several targets were not in the database Molecules active against Mtb evaluated in TB Mobile app •Clark et al., J Cheminformatics 6: 38 (2014)
  • 25.  Continue to update with more data  Outreach to increase awareness of app and data  Add machine learning algorithms to predict Mtb activity (in vitro and in vivo whole cell activity)  Could we appify similar target data for other neglected diseases/ targets e.g. malaria What next ?
  • 26. Uses Bayesian algorithm and FCFP_6 fingerprints Model with >300,000 compounds
  • 27. Copyright © 2013 All Rights Reserved Collaborative Drug Discovery MM4TB: 25 organizations New Old Neuroscience Kinetoplastid Drug Development Consortium
  • 28.
  • 29.
  • 30.
  • 31.
  • 32.  Exposure of CDD content from collaboration with SRI  More visibility for brand in new places  Experiment in small database with focus on content delivery  A functional app to reach scientists that may not have cheminformatics or bioinformatics training  Used in several collaborations e.g. at Rutgers etc.  Test bed for developing technologies for main product  Lead to CDD Models  on Github look for the tools section, for class org.openscience.cdk.fingerprint.model.Bayesian.  Opened door to models in other apps Benefits of creating TB Mobile
  • 33. Papers published on TB Mobile or using it Ekins et al., Tuberculosis 94: 162-169 (2014) Ekins et al., J Chem Inf Model 53: 3054 (2013) Clark et al., J Cheminformatics 6:38 (2014) Ekins et al., J Cheminformatics 5:13 (2013)
  • 34. Krishna Dole and all at CDD, SRI, IDRI and many others …Funding: 2R42AI088893-02 NIAID, NIH; 9R44TR000942-02 NCATS, NIH; CDD TB has been developed thanks to funding from the Bill and Melinda Gates Foundation (Grant#49852) Freundlich Lab Drop by and visit CDD in Booth 1155.
  • 35. App papers • Ekins S, Clark AM, Swamidass SJ, Litterman N and Williams AJ, Bigger data, collaborative tools and the future of predictive drug discovery, J Comp-Aided Mol Design, 54:2157-65, 2014. • Clark AM, Sarker M and Ekins S, New target predictions and visualization tools incorporating open source molecular fingerprints for TB Mobile 2.0, J Cheminform 6: 38, 2014. • Ekins S, Casey A.C, Roberts D, Parish T. and Bunin BA, Bayesian Models for Screening and TB Mobile for Target Inference with Mycobacterium tuberculosis, Tuberculosis, 94:162-9, 2014. • Clark AM, Williams AJ and Ekins S, Cheminformatics workflows using mobile apps, Chem-Bio Informatics Journal, 13: 1-18 2013. • Ekins S, Clark AM and Williams AJ, Incorporating Green Chemistry Concepts into Mobile Applications and their potential uses, ACS Sustain Chem Eng, 1. 8-13, 2013. • Ekins S, Freundlich JS and Reynolds RC, Fusing dual-event datasets for Mycobacterium Tuberculosis machine learning models and their evaluation, J Chem Inf Model, 53(11):3054- 63, 2013. • Ekins S, Clark AM and Williams AJ, Open Drug Discovery Teams: A Chemistry Mobile App for Collaboration, Mol Informatics, 31: 585-597, 2012. • Clark, AM, Ekins S and Williams AJ, Redefining cheminformatics with intuitive collaborative mobile apps, Mol Informatics, 31: 569-584, 2012. • Williams, AJ, Ekins S, Clark AM, Jack JJ and Apodaca RL, Mobile apps for chemistry in the world of drug discovery, Drug Disc Today, 16:928-939, 2011.
  • 37. 14 First line drugs active against Mtb evaluated in TB Mobile app and the top 3 molecules shown Confirms all in TB Mobile and retrieved
  • 38. Ekins et al., Tuberculosis 94: 162-169 (2014) Predicted targets using TB Mobile No verification yet
  • 39. iPhone Android TB Mobile 1. layout on iPhone and Android ~745 compounds with targets
  • 40. Predicted targets of GSK TB hits months earlier using TB Mobile GSK report hits Dec 2012 24th Jan 2013 http://goo.gl/9LKrPZ GSK predict targets Oct 2013
  • 41. PCA of 745 compounds with Mtb targets (blue) and 1200 Mtb active and non cytotoxic hits compounds (yellow) Chemical property space of TB Mobile compounds Ekins et al., Tuberculosis 94: 162-169 (2014) PCA of 745 compounds with Mtb targets (blue) and 177 GSK Mtb leads (yellow) Ekins et al., J Chem Inf Model 53: 3054 (2013)