SlideShare a Scribd company logo
1 of 14
Challenges of curating approved medicines: 
Will the real drugs please stand up? 
Chris Southan, representing the Database Team 
NC-IUPHAR/BPS/GTPdb Biannual Meeting, Paris, October 2014 
1
What is the total for approved drug structures? 
Take your pick ….. 
2
Discordance between sources inside PubChem 
3
Explanations 
• Discordance: distinctly different drug molecular representations from 
different sources that we would recognise canonically as the same 
bioactive substance 
• These are merged into multiple CIDs per drug (i.e. “multiplexed”) via the 
PubChem chemistry rules due to: 
– Permutation of R/S stereo centers 
– Salt forms 
– Mixtures 
– Unresolved E/Z bonds 
– Tautomers 
– Isotopic derivatives including deuteration 
4
Causes of drug structure multiplexing 
• Inherent challenges and complexities of chemical representation 
• Utility of PubChem depends on advanced rules applied to a submission-based 
system 
• Drug companies never verify their own structures in public databases 
• Legacy of structure image primacy in documents 
• No clear accountability for correctness of public approved drug structures 
(companies? FDA? WHO(INN)? AMA(USAN)? Wikipedia? CAS?) 
• Structural variants enter databases from general source proliferation, 
large-scale patent extractions, chemical vendor submissions and 
repeated exemplifications in journals 
• The net effect is an inexorable increase in multiplexing but not necessarily 
erroneous structures per se 
5
A case of the wrong name > structure 
6
Fixing errors: doing our bit 
7
Taxol: a challenging example 
8
Finding the links: multiplexed to 129 CIDs 
9
Reading the links for alternative taxols: 
different structures > 20 sets of assay results 
10
Virtual deuteration: compounding drug multiplexing 
11
Scale of the issue for approved drugs in PubChem: 
multiplexing expansion from 2005 to 2014 
12
So how are we doing in our database? 
• Sets were salt-stripped for this comparison 
• GTPdb (Oct 2014) has 983 approved drug CIDs concordant with either 
ChEMBL or DrugBank 
• But only 723 are 4-way concordant 
• We will inspect the 152, 192 and 180 sectors for consensus expansion 
13
Consequences and possible solutions to the 
drug multiplexing issue 
• Our drugs annotation Committee cannot magic these issues away 
but their support is crucial 
• Our consensus approach is useful and statistical defendable 
• In the GTPdb we add curator comments and cross-pointers for key 
multiplexed examples 
• Sources that make the effort to collate drug structure sets should 
cross-corroborate more 
• A canonical approach to merging drug structure-to-bioactivity 
mappings could be considered 
• The inner connectivity layer of the InChIKey goes some way towards 
this 
14

More Related Content

What's hot

MDC Connects Series 2021 | A Guide to Complex Medicines: What is the commerci...
MDC Connects Series 2021 | A Guide to Complex Medicines: What is the commerci...MDC Connects Series 2021 | A Guide to Complex Medicines: What is the commerci...
MDC Connects Series 2021 | A Guide to Complex Medicines: What is the commerci...Medicines Discovery Catapult
 
A Fully Compliant, DDI Solution
A Fully Compliant, DDI SolutionA Fully Compliant, DDI Solution
A Fully Compliant, DDI SolutionCovance
 
Active Pharmaceutical Ingredient (API); Impact of Primary / Alternate Source ...
Active Pharmaceutical Ingredient (API); Impact of Primary / Alternate Source ...Active Pharmaceutical Ingredient (API); Impact of Primary / Alternate Source ...
Active Pharmaceutical Ingredient (API); Impact of Primary / Alternate Source ...Dr.Faheem Lakhani
 
Base Pair Case Study: Competitive and Comparative SELEX
Base Pair Case Study: Competitive and Comparative SELEXBase Pair Case Study: Competitive and Comparative SELEX
Base Pair Case Study: Competitive and Comparative SELEXLisa Thurston
 
Analysing targets and drugs to populate the GToP database
Analysing  targets and drugs to populate the GToP databaseAnalysing  targets and drugs to populate the GToP database
Analysing targets and drugs to populate the GToP databaseChris Southan
 
Patent chemisty big bang: utilities for SMEs
Patent chemisty big bang: utilities for SMEsPatent chemisty big bang: utilities for SMEs
Patent chemisty big bang: utilities for SMEsChris Southan
 
Vemurafenib - Comprehensive patent search
Vemurafenib   - Comprehensive patent searchVemurafenib   - Comprehensive patent search
Vemurafenib - Comprehensive patent searchReportsnReports
 
Vilazodone - Comprehensive patent search
Vilazodone  - Comprehensive patent searchVilazodone  - Comprehensive patent search
Vilazodone - Comprehensive patent searchReportsnReports
 
Stability Testing of Pharmaceuticals and Supplements
Stability Testing of Pharmaceuticals and SupplementsStability Testing of Pharmaceuticals and Supplements
Stability Testing of Pharmaceuticals and SupplementsEMMAIntl
 
Treprostinil - Comprehensive patent search
Treprostinil  - Comprehensive patent searchTreprostinil  - Comprehensive patent search
Treprostinil - Comprehensive patent searchReportsnReports
 
Biomarker Exchange Standards
Biomarker Exchange StandardsBiomarker Exchange Standards
Biomarker Exchange StandardsPistoia Alliance
 
Rifaximin - Comprehensive patent search
Rifaximin  - Comprehensive patent searchRifaximin  - Comprehensive patent search
Rifaximin - Comprehensive patent searchReportsnReports
 
USUGM 2014 - Gerald Wyckoff (Chemalytics): Development of the Chemalytics Pl...
USUGM 2014 -  Gerald Wyckoff (Chemalytics): Development of the Chemalytics Pl...USUGM 2014 -  Gerald Wyckoff (Chemalytics): Development of the Chemalytics Pl...
USUGM 2014 - Gerald Wyckoff (Chemalytics): Development of the Chemalytics Pl...ChemAxon
 
Liraglutide - Comprehensive patent search
Liraglutide  - Comprehensive patent searchLiraglutide  - Comprehensive patent search
Liraglutide - Comprehensive patent searchReportsnReports
 
Bacteremia – pipeline review, h1 2012
Bacteremia – pipeline review, h1 2012Bacteremia – pipeline review, h1 2012
Bacteremia – pipeline review, h1 2012Rose088
 

What's hot (18)

MDC Connects Series 2021 | A Guide to Complex Medicines: What is the commerci...
MDC Connects Series 2021 | A Guide to Complex Medicines: What is the commerci...MDC Connects Series 2021 | A Guide to Complex Medicines: What is the commerci...
MDC Connects Series 2021 | A Guide to Complex Medicines: What is the commerci...
 
A Fully Compliant, DDI Solution
A Fully Compliant, DDI SolutionA Fully Compliant, DDI Solution
A Fully Compliant, DDI Solution
 
Active Pharmaceutical Ingredient (API); Impact of Primary / Alternate Source ...
Active Pharmaceutical Ingredient (API); Impact of Primary / Alternate Source ...Active Pharmaceutical Ingredient (API); Impact of Primary / Alternate Source ...
Active Pharmaceutical Ingredient (API); Impact of Primary / Alternate Source ...
 
Base Pair Case Study: Competitive and Comparative SELEX
Base Pair Case Study: Competitive and Comparative SELEXBase Pair Case Study: Competitive and Comparative SELEX
Base Pair Case Study: Competitive and Comparative SELEX
 
1 click docking
1 click docking1 click docking
1 click docking
 
Analysing targets and drugs to populate the GToP database
Analysing  targets and drugs to populate the GToP databaseAnalysing  targets and drugs to populate the GToP database
Analysing targets and drugs to populate the GToP database
 
Patent chemisty big bang: utilities for SMEs
Patent chemisty big bang: utilities for SMEsPatent chemisty big bang: utilities for SMEs
Patent chemisty big bang: utilities for SMEs
 
Vemurafenib - Comprehensive patent search
Vemurafenib   - Comprehensive patent searchVemurafenib   - Comprehensive patent search
Vemurafenib - Comprehensive patent search
 
Vilazodone - Comprehensive patent search
Vilazodone  - Comprehensive patent searchVilazodone  - Comprehensive patent search
Vilazodone - Comprehensive patent search
 
Stability Testing of Pharmaceuticals and Supplements
Stability Testing of Pharmaceuticals and SupplementsStability Testing of Pharmaceuticals and Supplements
Stability Testing of Pharmaceuticals and Supplements
 
Treprostinil - Comprehensive patent search
Treprostinil  - Comprehensive patent searchTreprostinil  - Comprehensive patent search
Treprostinil - Comprehensive patent search
 
Biomarker Exchange Standards
Biomarker Exchange StandardsBiomarker Exchange Standards
Biomarker Exchange Standards
 
Rifaximin - Comprehensive patent search
Rifaximin  - Comprehensive patent searchRifaximin  - Comprehensive patent search
Rifaximin - Comprehensive patent search
 
USUGM 2014 - Gerald Wyckoff (Chemalytics): Development of the Chemalytics Pl...
USUGM 2014 -  Gerald Wyckoff (Chemalytics): Development of the Chemalytics Pl...USUGM 2014 -  Gerald Wyckoff (Chemalytics): Development of the Chemalytics Pl...
USUGM 2014 - Gerald Wyckoff (Chemalytics): Development of the Chemalytics Pl...
 
Adc
AdcAdc
Adc
 
Liraglutide - Comprehensive patent search
Liraglutide  - Comprehensive patent searchLiraglutide  - Comprehensive patent search
Liraglutide - Comprehensive patent search
 
29. Prof. Sergey Tulyandin - Russian Scientific Oncology N.N. Blokhin Center
29. Prof. Sergey Tulyandin - Russian Scientific Oncology N.N. Blokhin Center29. Prof. Sergey Tulyandin - Russian Scientific Oncology N.N. Blokhin Center
29. Prof. Sergey Tulyandin - Russian Scientific Oncology N.N. Blokhin Center
 
Bacteremia – pipeline review, h1 2012
Bacteremia – pipeline review, h1 2012Bacteremia – pipeline review, h1 2012
Bacteremia – pipeline review, h1 2012
 

Viewers also liked

Closing the gap between chemistry and biology: Joining between text tombs and...
Closing the gap between chemistry and biology: Joining between text tombs and...Closing the gap between chemistry and biology: Joining between text tombs and...
Closing the gap between chemistry and biology: Joining between text tombs and...Chris Southan
 
20 million public patent structures: looking at the gift horse
20 million public patent structures: looking at the gift horse20 million public patent structures: looking at the gift horse
20 million public patent structures: looking at the gift horseChris Southan
 
EUGM 2013 - Christopher Southan (TW2Informatics): Chemicalize.org, SureChemOp...
EUGM 2013 - Christopher Southan (TW2Informatics): Chemicalize.org, SureChemOp...EUGM 2013 - Christopher Southan (TW2Informatics): Chemicalize.org, SureChemOp...
EUGM 2013 - Christopher Southan (TW2Informatics): Chemicalize.org, SureChemOp...ChemAxon
 
Correct drug structures for pharmacology
Correct drug structures for pharmacologyCorrect drug structures for pharmacology
Correct drug structures for pharmacologyChris Southan
 

Viewers also liked (6)

Closing the gap between chemistry and biology: Joining between text tombs and...
Closing the gap between chemistry and biology: Joining between text tombs and...Closing the gap between chemistry and biology: Joining between text tombs and...
Closing the gap between chemistry and biology: Joining between text tombs and...
 
biologydriven
biologydrivenbiologydriven
biologydriven
 
20 million public patent structures: looking at the gift horse
20 million public patent structures: looking at the gift horse20 million public patent structures: looking at the gift horse
20 million public patent structures: looking at the gift horse
 
Presentation given at UCSF Precision Medicine meeting 4/11/2015
Presentation given at UCSF Precision Medicine meeting 4/11/2015 Presentation given at UCSF Precision Medicine meeting 4/11/2015
Presentation given at UCSF Precision Medicine meeting 4/11/2015
 
EUGM 2013 - Christopher Southan (TW2Informatics): Chemicalize.org, SureChemOp...
EUGM 2013 - Christopher Southan (TW2Informatics): Chemicalize.org, SureChemOp...EUGM 2013 - Christopher Southan (TW2Informatics): Chemicalize.org, SureChemOp...
EUGM 2013 - Christopher Southan (TW2Informatics): Chemicalize.org, SureChemOp...
 
Correct drug structures for pharmacology
Correct drug structures for pharmacologyCorrect drug structures for pharmacology
Correct drug structures for pharmacology
 

Similar to Southan real drugs_paris_oct_11_2014

Evolving consensus-based curatorial strategies
Evolving consensus-based curatorial strategiesEvolving consensus-based curatorial strategies
Evolving consensus-based curatorial strategiesChris Southan
 
Sorting bioactive wheat from database chaff
Sorting bioactive wheat from database chaffSorting bioactive wheat from database chaff
Sorting bioactive wheat from database chaffChris Southan
 
Sorting bioactive wheat from database chaff: Challenges of discerning correct...
Sorting bioactive wheat from database chaff: Challenges of discerning correct...Sorting bioactive wheat from database chaff: Challenges of discerning correct...
Sorting bioactive wheat from database chaff: Challenges of discerning correct...Guide to PHARMACOLOGY
 
The Many Roles of Computation in Drug Discovery
The Many Roles of Computation in Drug DiscoveryThe Many Roles of Computation in Drug Discovery
The Many Roles of Computation in Drug DiscoveryNishoanth Ramanathan
 
PubChem for drug discovery and chemical biology
PubChem for drug discovery and chemical biologyPubChem for drug discovery and chemical biology
PubChem for drug discovery and chemical biologyChris Southan
 
Extending the "Web of Drug Identity" with knowledge extracted from United Sta...
Extending the "Web of Drug Identity" with knowledge extracted from United Sta...Extending the "Web of Drug Identity" with knowledge extracted from United Sta...
Extending the "Web of Drug Identity" with knowledge extracted from United Sta...Richard Boyce, PhD
 
Linked-spls-initial-hcls-presentation-06082012
Linked-spls-initial-hcls-presentation-06082012Linked-spls-initial-hcls-presentation-06082012
Linked-spls-initial-hcls-presentation-06082012Richard Boyce, PhD
 
Drug Discovery subject (clinical research)
Drug Discovery subject (clinical research)Drug Discovery subject (clinical research)
Drug Discovery subject (clinical research)Jannat985397
 
citc-rwe-8dec2021_v2.pdf
citc-rwe-8dec2021_v2.pdfcitc-rwe-8dec2021_v2.pdf
citc-rwe-8dec2021_v2.pdfssuser660bb1
 
Applications Of Bioinformatics In Drug Discovery And Process
Applications Of Bioinformatics In Drug Discovery And ProcessApplications Of Bioinformatics In Drug Discovery And Process
Applications Of Bioinformatics In Drug Discovery And ProcessProf. Dr. Basavaraj Nanjwade
 
Chemistry-to-Protein Relastionship Quality
Chemistry-to-Protein Relastionship QualityChemistry-to-Protein Relastionship Quality
Chemistry-to-Protein Relastionship QualityChris Southan
 
The Impact of Real-World Data in Pharmacovigilance and Regulatory Decision-Ma...
The Impact of Real-World Data in Pharmacovigilance and Regulatory Decision-Ma...The Impact of Real-World Data in Pharmacovigilance and Regulatory Decision-Ma...
The Impact of Real-World Data in Pharmacovigilance and Regulatory Decision-Ma...ClinosolIndia
 
OUTSOURCING TO BE AND BA final (1).pptx
OUTSOURCING TO BE AND BA final (1).pptxOUTSOURCING TO BE AND BA final (1).pptx
OUTSOURCING TO BE AND BA final (1).pptxDhanaa Dhoni
 
Computational Drug Discovery: Machine Learning for Making Sense of Big Data i...
Computational Drug Discovery: Machine Learning for Making Sense of Big Data i...Computational Drug Discovery: Machine Learning for Making Sense of Big Data i...
Computational Drug Discovery: Machine Learning for Making Sense of Big Data i...Chanin Nantasenamat
 
Metabolite Likeness for selection of pharmaceutical drug libraries
Metabolite Likeness for selection of pharmaceutical drug librariesMetabolite Likeness for selection of pharmaceutical drug libraries
Metabolite Likeness for selection of pharmaceutical drug librariesAnkit Dhaundiyal
 
Drug Discovery Method (Bioinformatics)
Drug Discovery Method (Bioinformatics)Drug Discovery Method (Bioinformatics)
Drug Discovery Method (Bioinformatics)Hemantkrdu
 
Bioinformatics role in Pharmaceutical industries
Bioinformatics role in Pharmaceutical industriesBioinformatics role in Pharmaceutical industries
Bioinformatics role in Pharmaceutical industriesMuzna Kashaf
 
Pharmacists in Drug Discovery & Development
Pharmacists in Drug Discovery & Development Pharmacists in Drug Discovery & Development
Pharmacists in Drug Discovery & Development Bhaswat Chakraborty
 

Similar to Southan real drugs_paris_oct_11_2014 (20)

Evolving consensus-based curatorial strategies
Evolving consensus-based curatorial strategiesEvolving consensus-based curatorial strategies
Evolving consensus-based curatorial strategies
 
Sorting bioactive wheat from database chaff
Sorting bioactive wheat from database chaffSorting bioactive wheat from database chaff
Sorting bioactive wheat from database chaff
 
Sorting bioactive wheat from database chaff: Challenges of discerning correct...
Sorting bioactive wheat from database chaff: Challenges of discerning correct...Sorting bioactive wheat from database chaff: Challenges of discerning correct...
Sorting bioactive wheat from database chaff: Challenges of discerning correct...
 
The Many Roles of Computation in Drug Discovery
The Many Roles of Computation in Drug DiscoveryThe Many Roles of Computation in Drug Discovery
The Many Roles of Computation in Drug Discovery
 
PubChem for drug discovery and chemical biology
PubChem for drug discovery and chemical biologyPubChem for drug discovery and chemical biology
PubChem for drug discovery and chemical biology
 
Extending the "Web of Drug Identity" with knowledge extracted from United Sta...
Extending the "Web of Drug Identity" with knowledge extracted from United Sta...Extending the "Web of Drug Identity" with knowledge extracted from United Sta...
Extending the "Web of Drug Identity" with knowledge extracted from United Sta...
 
Linked-spls-initial-hcls-presentation-06082012
Linked-spls-initial-hcls-presentation-06082012Linked-spls-initial-hcls-presentation-06082012
Linked-spls-initial-hcls-presentation-06082012
 
Drug Discovery subject (clinical research)
Drug Discovery subject (clinical research)Drug Discovery subject (clinical research)
Drug Discovery subject (clinical research)
 
Cadd
CaddCadd
Cadd
 
citc-rwe-8dec2021_v2.pdf
citc-rwe-8dec2021_v2.pdfcitc-rwe-8dec2021_v2.pdf
citc-rwe-8dec2021_v2.pdf
 
Applications Of Bioinformatics In Drug Discovery And Process
Applications Of Bioinformatics In Drug Discovery And ProcessApplications Of Bioinformatics In Drug Discovery And Process
Applications Of Bioinformatics In Drug Discovery And Process
 
Chemistry-to-Protein Relastionship Quality
Chemistry-to-Protein Relastionship QualityChemistry-to-Protein Relastionship Quality
Chemistry-to-Protein Relastionship Quality
 
The Impact of Real-World Data in Pharmacovigilance and Regulatory Decision-Ma...
The Impact of Real-World Data in Pharmacovigilance and Regulatory Decision-Ma...The Impact of Real-World Data in Pharmacovigilance and Regulatory Decision-Ma...
The Impact of Real-World Data in Pharmacovigilance and Regulatory Decision-Ma...
 
OUTSOURCING TO BE AND BA final (1).pptx
OUTSOURCING TO BE AND BA final (1).pptxOUTSOURCING TO BE AND BA final (1).pptx
OUTSOURCING TO BE AND BA final (1).pptx
 
ABT 609 PPT
ABT 609 PPTABT 609 PPT
ABT 609 PPT
 
Computational Drug Discovery: Machine Learning for Making Sense of Big Data i...
Computational Drug Discovery: Machine Learning for Making Sense of Big Data i...Computational Drug Discovery: Machine Learning for Making Sense of Big Data i...
Computational Drug Discovery: Machine Learning for Making Sense of Big Data i...
 
Metabolite Likeness for selection of pharmaceutical drug libraries
Metabolite Likeness for selection of pharmaceutical drug librariesMetabolite Likeness for selection of pharmaceutical drug libraries
Metabolite Likeness for selection of pharmaceutical drug libraries
 
Drug Discovery Method (Bioinformatics)
Drug Discovery Method (Bioinformatics)Drug Discovery Method (Bioinformatics)
Drug Discovery Method (Bioinformatics)
 
Bioinformatics role in Pharmaceutical industries
Bioinformatics role in Pharmaceutical industriesBioinformatics role in Pharmaceutical industries
Bioinformatics role in Pharmaceutical industries
 
Pharmacists in Drug Discovery & Development
Pharmacists in Drug Discovery & Development Pharmacists in Drug Discovery & Development
Pharmacists in Drug Discovery & Development
 

More from Chris Southan

FAIR connectivity for DARCP
FAIR  connectivity for DARCPFAIR  connectivity for DARCP
FAIR connectivity for DARCPChris Southan
 
Connectivity > documents > structures > bioactivity
Connectivity > documents > structures > bioactivityConnectivity > documents > structures > bioactivity
Connectivity > documents > structures > bioactivityChris Southan
 
Peptide tribulations
Peptide tribulationsPeptide tribulations
Peptide tribulationsChris Southan
 
Vicissitudes of target validation for BACE1 and BACE2
Vicissitudes of target validation for BACE1 and BACE2 Vicissitudes of target validation for BACE1 and BACE2
Vicissitudes of target validation for BACE1 and BACE2 Chris Southan
 
Guide to Pharmacology database: ELIXIR updae
Guide to Pharmacology database: ELIXIR updaeGuide to Pharmacology database: ELIXIR updae
Guide to Pharmacology database: ELIXIR updaeChris Southan
 
In silico 360 Analysis for Drug Development
In silico 360 Analysis for Drug DevelopmentIn silico 360 Analysis for Drug Development
In silico 360 Analysis for Drug DevelopmentChris Southan
 
Will the correct BACE ORFs please stand up?
Will the correct BACE ORFs please stand up?Will the correct BACE ORFs please stand up?
Will the correct BACE ORFs please stand up?Chris Southan
 
Desperately seeking DARCP
Desperately seeking DARCPDesperately seeking DARCP
Desperately seeking DARCPChris Southan
 
Seeking glimmers of light in Pharos “Tdark” proteins
Seeking glimmers of light in  Pharos “Tdark” proteinsSeeking glimmers of light in  Pharos “Tdark” proteins
Seeking glimmers of light in Pharos “Tdark” proteinsChris Southan
 
5HT2A modulators update for SAFER
5HT2A modulators update for SAFER5HT2A modulators update for SAFER
5HT2A modulators update for SAFERChris Southan
 
Quality and noise in big chemistry databases
Quality and noise in big chemistry databasesQuality and noise in big chemistry databases
Quality and noise in big chemistry databasesChris Southan
 
Connecting chemistry-to-biology
Connecting chemistry-to-biology Connecting chemistry-to-biology
Connecting chemistry-to-biology Chris Southan
 
GtoPdb June 2019 poster
GtoPdb June 2019 posterGtoPdb June 2019 poster
GtoPdb June 2019 posterChris Southan
 
PubChem as a source of systems biology perturbagens
PubChem as a source of  systems biology perturbagensPubChem as a source of  systems biology perturbagens
PubChem as a source of systems biology perturbagensChris Southan
 
Will the real proteins please stand up
Will the real proteins please stand upWill the real proteins please stand up
Will the real proteins please stand upChris Southan
 
Peptide Tribulations
Peptide TribulationsPeptide Tribulations
Peptide TribulationsChris Southan
 
Looking at chemistry - protein - papers connectivity in ELIXIR
Looking at chemistry - protein - papers connectivity in ELIXIRLooking at chemistry - protein - papers connectivity in ELIXIR
Looking at chemistry - protein - papers connectivity in ELIXIRChris Southan
 
Guide to Immunopharmacology update
Guide to Immunopharmacology updateGuide to Immunopharmacology update
Guide to Immunopharmacology updateChris Southan
 
Druggable Proteome sources in UniProt
Druggable Proteome sources in UniProtDruggable Proteome sources in UniProt
Druggable Proteome sources in UniProtChris Southan
 
Peptide Tribulations in GtoPdb
Peptide Tribulations in GtoPdbPeptide Tribulations in GtoPdb
Peptide Tribulations in GtoPdbChris Southan
 

More from Chris Southan (20)

FAIR connectivity for DARCP
FAIR  connectivity for DARCPFAIR  connectivity for DARCP
FAIR connectivity for DARCP
 
Connectivity > documents > structures > bioactivity
Connectivity > documents > structures > bioactivityConnectivity > documents > structures > bioactivity
Connectivity > documents > structures > bioactivity
 
Peptide tribulations
Peptide tribulationsPeptide tribulations
Peptide tribulations
 
Vicissitudes of target validation for BACE1 and BACE2
Vicissitudes of target validation for BACE1 and BACE2 Vicissitudes of target validation for BACE1 and BACE2
Vicissitudes of target validation for BACE1 and BACE2
 
Guide to Pharmacology database: ELIXIR updae
Guide to Pharmacology database: ELIXIR updaeGuide to Pharmacology database: ELIXIR updae
Guide to Pharmacology database: ELIXIR updae
 
In silico 360 Analysis for Drug Development
In silico 360 Analysis for Drug DevelopmentIn silico 360 Analysis for Drug Development
In silico 360 Analysis for Drug Development
 
Will the correct BACE ORFs please stand up?
Will the correct BACE ORFs please stand up?Will the correct BACE ORFs please stand up?
Will the correct BACE ORFs please stand up?
 
Desperately seeking DARCP
Desperately seeking DARCPDesperately seeking DARCP
Desperately seeking DARCP
 
Seeking glimmers of light in Pharos “Tdark” proteins
Seeking glimmers of light in  Pharos “Tdark” proteinsSeeking glimmers of light in  Pharos “Tdark” proteins
Seeking glimmers of light in Pharos “Tdark” proteins
 
5HT2A modulators update for SAFER
5HT2A modulators update for SAFER5HT2A modulators update for SAFER
5HT2A modulators update for SAFER
 
Quality and noise in big chemistry databases
Quality and noise in big chemistry databasesQuality and noise in big chemistry databases
Quality and noise in big chemistry databases
 
Connecting chemistry-to-biology
Connecting chemistry-to-biology Connecting chemistry-to-biology
Connecting chemistry-to-biology
 
GtoPdb June 2019 poster
GtoPdb June 2019 posterGtoPdb June 2019 poster
GtoPdb June 2019 poster
 
PubChem as a source of systems biology perturbagens
PubChem as a source of  systems biology perturbagensPubChem as a source of  systems biology perturbagens
PubChem as a source of systems biology perturbagens
 
Will the real proteins please stand up
Will the real proteins please stand upWill the real proteins please stand up
Will the real proteins please stand up
 
Peptide Tribulations
Peptide TribulationsPeptide Tribulations
Peptide Tribulations
 
Looking at chemistry - protein - papers connectivity in ELIXIR
Looking at chemistry - protein - papers connectivity in ELIXIRLooking at chemistry - protein - papers connectivity in ELIXIR
Looking at chemistry - protein - papers connectivity in ELIXIR
 
Guide to Immunopharmacology update
Guide to Immunopharmacology updateGuide to Immunopharmacology update
Guide to Immunopharmacology update
 
Druggable Proteome sources in UniProt
Druggable Proteome sources in UniProtDruggable Proteome sources in UniProt
Druggable Proteome sources in UniProt
 
Peptide Tribulations in GtoPdb
Peptide Tribulations in GtoPdbPeptide Tribulations in GtoPdb
Peptide Tribulations in GtoPdb
 

Southan real drugs_paris_oct_11_2014

  • 1. Challenges of curating approved medicines: Will the real drugs please stand up? Chris Southan, representing the Database Team NC-IUPHAR/BPS/GTPdb Biannual Meeting, Paris, October 2014 1
  • 2. What is the total for approved drug structures? Take your pick ….. 2
  • 3. Discordance between sources inside PubChem 3
  • 4. Explanations • Discordance: distinctly different drug molecular representations from different sources that we would recognise canonically as the same bioactive substance • These are merged into multiple CIDs per drug (i.e. “multiplexed”) via the PubChem chemistry rules due to: – Permutation of R/S stereo centers – Salt forms – Mixtures – Unresolved E/Z bonds – Tautomers – Isotopic derivatives including deuteration 4
  • 5. Causes of drug structure multiplexing • Inherent challenges and complexities of chemical representation • Utility of PubChem depends on advanced rules applied to a submission-based system • Drug companies never verify their own structures in public databases • Legacy of structure image primacy in documents • No clear accountability for correctness of public approved drug structures (companies? FDA? WHO(INN)? AMA(USAN)? Wikipedia? CAS?) • Structural variants enter databases from general source proliferation, large-scale patent extractions, chemical vendor submissions and repeated exemplifications in journals • The net effect is an inexorable increase in multiplexing but not necessarily erroneous structures per se 5
  • 6. A case of the wrong name > structure 6
  • 9. Finding the links: multiplexed to 129 CIDs 9
  • 10. Reading the links for alternative taxols: different structures > 20 sets of assay results 10
  • 11. Virtual deuteration: compounding drug multiplexing 11
  • 12. Scale of the issue for approved drugs in PubChem: multiplexing expansion from 2005 to 2014 12
  • 13. So how are we doing in our database? • Sets were salt-stripped for this comparison • GTPdb (Oct 2014) has 983 approved drug CIDs concordant with either ChEMBL or DrugBank • But only 723 are 4-way concordant • We will inspect the 152, 192 and 180 sectors for consensus expansion 13
  • 14. Consequences and possible solutions to the drug multiplexing issue • Our drugs annotation Committee cannot magic these issues away but their support is crucial • Our consensus approach is useful and statistical defendable • In the GTPdb we add curator comments and cross-pointers for key multiplexed examples • Sources that make the effort to collate drug structure sets should cross-corroborate more • A canonical approach to merging drug structure-to-bioactivity mappings could be considered • The inner connectivity layer of the InChIKey goes some way towards this 14