SlideShare a Scribd company logo
1 of 18
Download to read offline
Translating research data into
Gene Ontology annotations
Pascale Gaudet
SIB – Swiss Institute of Bioinformatics
GO Consortium
Ontology Annotations Model of biology
Gene Ontology Consortium
What we provide
A structured representation
of biology, composed of:
• Classes
• Relations
• Definitions
+ =
- Antigen binding
- Adaptive
immune response
- Extracellular
IGHA1		
Immunoglobulin	heavy	constant	alpha	1
- Glutamine-tRNA
ligase activity
- Translation
- Cytoplasm
QARS
Gln tRNA synthetase
Statements about the
functions of specific gene
products.
3 aspects:
• Molecular function
• Biological process
• Cellular component
Representation of current
knowledge in a manner
that is:
• Human
understandable
• Machine computable
GO “annotations”
§ An annotation is a statement linking a gene to
some aspect of its function (a GO ontology term)
§ Each annotation is based on some evidence,
recorded as part of the annotation
§ Evidence code (type of evidence)
§ Reference (published journal article)
Examples:
Annotation	1:	INSR	+	‘receptor	activity’
Annotation	2:	INSR	+	‘plasma	membrane’
Annotation	3:	INSR	+	‘insulin	receptor	signaling	pathway’
Semantics of a GO annotation
The association of a GO class with a gene product
is a statement that means:
§ molecular function: molecular activities of gene
products
§ cellular component: where gene products are active
§ biological process: pathways and larger processes
made up of the activities of multiple gene products.
§ In other words, annotations represent the
normal, in vivo biological role of gene products
Manual	- Literature-based Manual	- Sequence-based Algorithmic	(unreviewed)
How are annotations generated?
An	computer	program	
analyses	a	sequences	and	
make	a	prediction	based	
on	some	decision	criteria,	
for	example:	
-protein	domain	
(InterPro2GO)
- sequence	similarity	
(BLAST2GO)
An	expert	reviews	the	
literature	and	assigns	
functions,	processes	and	
cellular	components	to	
genes	products	
>	500,000	annotations >	65M	annotations
An	expert	analyses	a	
sequence	and	makes	a	
prediction concerning	the	
gene	function	based	on	
known	functions	of	
related	sequences
The	predictions	can	be	
based	on	the	known	
function	of	evolutionarily	
related	sequences	
(phylogenetic	
relationships)	
>	3M	annotations
Manual	- Literature-based
Evidence types
Chibucos MC,	Siegele DA,	Hu	JC,	Giglio M.	(2017)	Evidence	and	conclusion	ontology	PMID:	27812948	
Manual	- Sequence-based Algorithmic	(unreviewed)
EXP
experimental	evidence
IDA
inferred	from	direct	assay
IPI
inferred	from	physical	
interaction
IMP
inferred	from	mutant	
phenotype
ISS
inferred	from	sequence	
similarity
ISO
inferred	from	sequence	
ortholog
IBA
inferred	from	biological	
aspect	of	ancestor
IEA
inferred	from	electronic	
annotation
Who produces GO annotations?
• Model organism databases (SGD, FlyBase,
wormbase, MGI, etc)
• Generalist databases, for eg UniProtKB, IntAct
• Domain-specific projects: Cardiovascular project
(UCL), synapse project (VU), etc.
• Anyone who wishes to contribute their expertise
and data to the project
Best practices for generating
literature-based GO annotations
§ Ensure consistency of usage across a
broad consortium of contributors
§ Improve inferencing capabilities
Focus on the research hypothesis
§ Use prior knowledge to understand the hypothesis
being tested and its relation to the experimental
observation
Protein Known	roles Hypothesis Assay Result Conclusion	for GO
DDFB	(O76075) DNase	 The	nuclease	activity	of	
DDFB	is	required	for	
nuclear	DNA	
fragmentation	during	
apoptosis
Apoptotic	DNA	
fragmentation	
increased	in	the	
presence	of	DDFB
DDFB	mediates	nuclear	DNA	
fragmentation	during	
apoptosis
=	apoptotic	DNA	
fragmentation	
(GO:0006309)
FOXL2	(P58012) Transcription	
factor
Mutations	in	FOXL2	are	
known	to	cause	
premature	ovarian	
failure,	which	may	be	
due	to			increased	
apoptosis	
Apoptotic	DNA	
fragmentation	
increased	in	the	
presence	of	FOXL2
FOXL2	increases	the	rate	of	
apoptosis	
=	positive	regulation	of	
apoptotic	process	
(GO:0043065)
Annotate the conclusion, not the assay
1) rubidium if often used to assay potassium transport,
because the radioactive form is more readily available;
- the physiologically relevant substrate is potassium
2) Protein kinases are often tested with non-physiologically
relevant substrates, such as histone
- if the authors do not discuss the physiological relevance,
one cannot annotate the substrate
On the in vivo relevance of phenotypes
• Phenotypes can help understand the function of proteins
• Phenotypes can insights into mechanisms leading to disease
• The scope of the GO, though, is to capture the normal function of
proteins
Indirect effects of a mutation
- RNA polymerase affects essentially all cellular processes (cell
proliferation, development, etc) but does not mediate these
processes
Lack of hypothesis for a role of a protein in a process:
- Knockdown of Tmem234 in zebrafish results defects in pronephric
glomerulus formation. Annotation by IMP to glomerulus formation is
not supported by any cellular/molecular data
Get the wider perspective
• Favor a gene-by-gene or pathway-by-pathway
approach for curation rather than paper-by-paper
• Read recent publications
• Remove incorrect annotations based on invalidated
hypothesis
Guidelines for high quality
annotations
• Annotate the conclusion of the experiment
• Use the biological context to interpret the
experiments
• Carefully select publications. Read recent
publications
• Ensure consistency with existing annotations
• Keep annotation up-to date: Remove obsolete
annotations
Other approaches for quality
control
• Annotation consistency exercises
• Taxonomic constraints
• Co-occurrence of annotations
• Phylogenetic annotations
• User feedback
- from GO website
- from PubMed
- from databases
GO annotations in PubMed
Annotations for a paper
This talk was based upon
Acknowledgments
• GO PIs
• Judy Blake
• Mike Cherry
• Suzanna Lewis
• Paul Sternberg
• Paul Thomas
• GO Handbook
contributors
• Christophe Dessimoz
• Jim Hu
• Nives Skunca
• Sylvain Poux
• Funding
• NIH HG002273 (GO)

More Related Content

What's hot

epigenomics for crop improvement
epigenomics for crop improvementepigenomics for crop improvement
epigenomics for crop improvementanjaligoud
 
2014 DURF CONFERENCE POSTER
2014 DURF CONFERENCE POSTER2014 DURF CONFERENCE POSTER
2014 DURF CONFERENCE POSTERSudeep Pisipaty
 
Rigor Mortis and Intestinal Necrosis During C. elegans Death
Rigor Mortis and Intestinal Necrosis During C. elegans DeathRigor Mortis and Intestinal Necrosis During C. elegans Death
Rigor Mortis and Intestinal Necrosis During C. elegans Deathjunaedrx
 
L09 cell cycle
L09 cell cycleL09 cell cycle
L09 cell cycleMUBOSScz
 
2.4. Alterations in Genome
2.4. Alterations in Genome 2.4. Alterations in Genome
2.4. Alterations in Genome Garry D. Lasaga
 
Site-Directed Mutagenesis of β-2 Microglobulin Research Symposium Poster
Site-Directed Mutagenesis of β-2 Microglobulin Research Symposium PosterSite-Directed Mutagenesis of β-2 Microglobulin Research Symposium Poster
Site-Directed Mutagenesis of β-2 Microglobulin Research Symposium PosterTyler Liang
 
Gfp application in bacterial dynamics and disease diagnosis
Gfp application in bacterial dynamics and disease diagnosisGfp application in bacterial dynamics and disease diagnosis
Gfp application in bacterial dynamics and disease diagnosisgarima shrinet
 
Biotechnology Chapter Five Lecture- Proteins (part b)
Biotechnology Chapter Five Lecture- Proteins (part b)Biotechnology Chapter Five Lecture- Proteins (part b)
Biotechnology Chapter Five Lecture- Proteins (part b)Mary Beth Smith
 
poster for Bio dept UMB 36x40 rev 5
poster for Bio dept UMB 36x40 rev 5poster for Bio dept UMB 36x40 rev 5
poster for Bio dept UMB 36x40 rev 5Rudy Matheson
 
Summer Scholarship 2014 Outcomes
Summer Scholarship 2014 OutcomesSummer Scholarship 2014 Outcomes
Summer Scholarship 2014 OutcomesRose Upton
 
Radiation Protection : Phospholipase A
Radiation Protection : Phospholipase ARadiation Protection : Phospholipase A
Radiation Protection : Phospholipase ADmitri Popov
 

What's hot (19)

epigenomics for crop improvement
epigenomics for crop improvementepigenomics for crop improvement
epigenomics for crop improvement
 
PHd defense presentation Final RIVES
PHd defense presentation Final RIVESPHd defense presentation Final RIVES
PHd defense presentation Final RIVES
 
2014 DURF CONFERENCE POSTER
2014 DURF CONFERENCE POSTER2014 DURF CONFERENCE POSTER
2014 DURF CONFERENCE POSTER
 
Rigor Mortis and Intestinal Necrosis During C. elegans Death
Rigor Mortis and Intestinal Necrosis During C. elegans DeathRigor Mortis and Intestinal Necrosis During C. elegans Death
Rigor Mortis and Intestinal Necrosis During C. elegans Death
 
L09 cell cycle
L09 cell cycleL09 cell cycle
L09 cell cycle
 
2.4. Alterations in Genome
2.4. Alterations in Genome 2.4. Alterations in Genome
2.4. Alterations in Genome
 
Mutation
MutationMutation
Mutation
 
Site-Directed Mutagenesis of β-2 Microglobulin Research Symposium Poster
Site-Directed Mutagenesis of β-2 Microglobulin Research Symposium PosterSite-Directed Mutagenesis of β-2 Microglobulin Research Symposium Poster
Site-Directed Mutagenesis of β-2 Microglobulin Research Symposium Poster
 
Gfp application in bacterial dynamics and disease diagnosis
Gfp application in bacterial dynamics and disease diagnosisGfp application in bacterial dynamics and disease diagnosis
Gfp application in bacterial dynamics and disease diagnosis
 
Articulo 4
Articulo 4Articulo 4
Articulo 4
 
Biotechnology Chapter Five Lecture- Proteins (part b)
Biotechnology Chapter Five Lecture- Proteins (part b)Biotechnology Chapter Five Lecture- Proteins (part b)
Biotechnology Chapter Five Lecture- Proteins (part b)
 
Biochem mutations
Biochem    mutationsBiochem    mutations
Biochem mutations
 
poster for Bio dept UMB 36x40 rev 5
poster for Bio dept UMB 36x40 rev 5poster for Bio dept UMB 36x40 rev 5
poster for Bio dept UMB 36x40 rev 5
 
Abzymes
AbzymesAbzymes
Abzymes
 
Summer Scholarship 2014 Outcomes
Summer Scholarship 2014 OutcomesSummer Scholarship 2014 Outcomes
Summer Scholarship 2014 Outcomes
 
Molecular evolution
Molecular evolutionMolecular evolution
Molecular evolution
 
Gene technology
Gene technologyGene technology
Gene technology
 
emm201548a
emm201548aemm201548a
emm201548a
 
Radiation Protection : Phospholipase A
Radiation Protection : Phospholipase ARadiation Protection : Phospholipase A
Radiation Protection : Phospholipase A
 

Viewers also liked

2016 03-02 EyesOnALZ All-hands meeting_mohammad haft_p1
2016 03-02 EyesOnALZ All-hands meeting_mohammad haft_p12016 03-02 EyesOnALZ All-hands meeting_mohammad haft_p1
2016 03-02 EyesOnALZ All-hands meeting_mohammad haft_p1EyesOnALZ
 
EUDAT Webinar "Organise, retrieve and aggregate data using annotations with B...
EUDAT Webinar "Organise, retrieve and aggregate data using annotations with B...EUDAT Webinar "Organise, retrieve and aggregate data using annotations with B...
EUDAT Webinar "Organise, retrieve and aggregate data using annotations with B...EUDAT
 
Editing Functionality - Apollo Workshop
Editing Functionality - Apollo WorkshopEditing Functionality - Apollo Workshop
Editing Functionality - Apollo WorkshopMonica Munoz-Torres
 
From Sequence to Knowledge (Phage Genomics Workshop Intro at the 22nd Biennia...
From Sequence to Knowledge (Phage Genomics Workshop Intro at the 22nd Biennia...From Sequence to Knowledge (Phage Genomics Workshop Intro at the 22nd Biennia...
From Sequence to Knowledge (Phage Genomics Workshop Intro at the 22nd Biennia...Ramy K. Aziz
 
Data Science with Humans in the Loop
Data Science with Humans in the LoopData Science with Humans in the Loop
Data Science with Humans in the LoopLora Aroyo
 

Viewers also liked (6)

2016 03-02 EyesOnALZ All-hands meeting_mohammad haft_p1
2016 03-02 EyesOnALZ All-hands meeting_mohammad haft_p12016 03-02 EyesOnALZ All-hands meeting_mohammad haft_p1
2016 03-02 EyesOnALZ All-hands meeting_mohammad haft_p1
 
EUDAT Webinar "Organise, retrieve and aggregate data using annotations with B...
EUDAT Webinar "Organise, retrieve and aggregate data using annotations with B...EUDAT Webinar "Organise, retrieve and aggregate data using annotations with B...
EUDAT Webinar "Organise, retrieve and aggregate data using annotations with B...
 
Editing Functionality - Apollo Workshop
Editing Functionality - Apollo WorkshopEditing Functionality - Apollo Workshop
Editing Functionality - Apollo Workshop
 
From Sequence to Knowledge (Phage Genomics Workshop Intro at the 22nd Biennia...
From Sequence to Knowledge (Phage Genomics Workshop Intro at the 22nd Biennia...From Sequence to Knowledge (Phage Genomics Workshop Intro at the 22nd Biennia...
From Sequence to Knowledge (Phage Genomics Workshop Intro at the 22nd Biennia...
 
Data Science with Humans in the Loop
Data Science with Humans in the LoopData Science with Humans in the Loop
Data Science with Humans in the Loop
 
10 facts about jobs in the future
10 facts about jobs in the future10 facts about jobs in the future
10 facts about jobs in the future
 

Similar to Translating research data into Gene Ontology annotations

KDM5 epigenetic modifiers as a focus for drug discovery
KDM5 epigenetic modifiers as a focus for drug discoveryKDM5 epigenetic modifiers as a focus for drug discovery
KDM5 epigenetic modifiers as a focus for drug discoveryChristopher Wynder
 
Summer Research Poster
Summer Research PosterSummer Research Poster
Summer Research PosterAlan Kim
 
Radiation Protection: Phospholipase C, LAMP and Phopholipase C, LAMP inhibition.
Radiation Protection: Phospholipase C, LAMP and Phopholipase C, LAMP inhibition.Radiation Protection: Phospholipase C, LAMP and Phopholipase C, LAMP inhibition.
Radiation Protection: Phospholipase C, LAMP and Phopholipase C, LAMP inhibition.Dmitri Popov
 
2014 DURF CONFERENCE POSTER
2014 DURF CONFERENCE POSTER2014 DURF CONFERENCE POSTER
2014 DURF CONFERENCE POSTERSudeep Pisipaty
 
PIIS1552526009012771(1)(3)
PIIS1552526009012771(1)(3)PIIS1552526009012771(1)(3)
PIIS1552526009012771(1)(3)Marco Garza
 
Oncology: Spatial Localization of Ras proteins
Oncology: Spatial Localization of Ras proteinsOncology: Spatial Localization of Ras proteins
Oncology: Spatial Localization of Ras proteinsNachiket Vartak
 
Wnt-Signalling in Endocrinology
Wnt-Signalling in EndocrinologyWnt-Signalling in Endocrinology
Wnt-Signalling in EndocrinologyShinjan Patra
 
Linking t rna localization with activation
Linking t rna localization with activationLinking t rna localization with activation
Linking t rna localization with activationJoy Maria Mitchell
 
Fingolimod the path from a fungal metabolite to fda approved drug-biom255-sp...
Fingolimod  the path from a fungal metabolite to fda approved drug-biom255-sp...Fingolimod  the path from a fungal metabolite to fda approved drug-biom255-sp...
Fingolimod the path from a fungal metabolite to fda approved drug-biom255-sp...People with Multiple Sclerosis (Vic) Inc.
 
Functional profile of the pre- to post-mortem transition in blood
Functional profile of the pre- to post-mortem transition in bloodFunctional profile of the pre- to post-mortem transition in blood
Functional profile of the pre- to post-mortem transition in bloodJoaquin Dopazo
 
Regulation of gene expression in eukaryotes
Regulation of gene expression in eukaryotesRegulation of gene expression in eukaryotes
Regulation of gene expression in eukaryotesKristu Jayanti College
 
Sub-optimal phenotypes of double-knockout of E.coli
Sub-optimal phenotypes of double-knockout of E.coliSub-optimal phenotypes of double-knockout of E.coli
Sub-optimal phenotypes of double-knockout of E.coliDr Fatumina Abukar
 
GENE Expression TARGETED THERAPY.pdf
GENE Expression  TARGETED THERAPY.pdfGENE Expression  TARGETED THERAPY.pdf
GENE Expression TARGETED THERAPY.pdfmohieeldien elsayed
 
NetBioSIG2013-KEYNOTE Stefan Schuster
NetBioSIG2013-KEYNOTE Stefan SchusterNetBioSIG2013-KEYNOTE Stefan Schuster
NetBioSIG2013-KEYNOTE Stefan SchusterAlexander Pico
 
J Neurosci 2006
J Neurosci 2006J Neurosci 2006
J Neurosci 2006Raul Pardo
 
molecular-basis-of-mutation.pdf
molecular-basis-of-mutation.pdfmolecular-basis-of-mutation.pdf
molecular-basis-of-mutation.pdfssuserf7c19f
 

Similar to Translating research data into Gene Ontology annotations (20)

KDM5 epigenetic modifiers as a focus for drug discovery
KDM5 epigenetic modifiers as a focus for drug discoveryKDM5 epigenetic modifiers as a focus for drug discovery
KDM5 epigenetic modifiers as a focus for drug discovery
 
Lucas...Cowell 2014
Lucas...Cowell 2014Lucas...Cowell 2014
Lucas...Cowell 2014
 
Summer Research Poster
Summer Research PosterSummer Research Poster
Summer Research Poster
 
Radiation Protection: Phospholipase C, LAMP and Phopholipase C, LAMP inhibition.
Radiation Protection: Phospholipase C, LAMP and Phopholipase C, LAMP inhibition.Radiation Protection: Phospholipase C, LAMP and Phopholipase C, LAMP inhibition.
Radiation Protection: Phospholipase C, LAMP and Phopholipase C, LAMP inhibition.
 
2014 DURF CONFERENCE POSTER
2014 DURF CONFERENCE POSTER2014 DURF CONFERENCE POSTER
2014 DURF CONFERENCE POSTER
 
PIIS1552526009012771(1)(3)
PIIS1552526009012771(1)(3)PIIS1552526009012771(1)(3)
PIIS1552526009012771(1)(3)
 
Oncology: Spatial Localization of Ras proteins
Oncology: Spatial Localization of Ras proteinsOncology: Spatial Localization of Ras proteins
Oncology: Spatial Localization of Ras proteins
 
Wnt-Signalling in Endocrinology
Wnt-Signalling in EndocrinologyWnt-Signalling in Endocrinology
Wnt-Signalling in Endocrinology
 
Linking t rna localization with activation
Linking t rna localization with activationLinking t rna localization with activation
Linking t rna localization with activation
 
Fingolimod the path from a fungal metabolite to fda approved drug-biom255-sp...
Fingolimod  the path from a fungal metabolite to fda approved drug-biom255-sp...Fingolimod  the path from a fungal metabolite to fda approved drug-biom255-sp...
Fingolimod the path from a fungal metabolite to fda approved drug-biom255-sp...
 
Functional profile of the pre- to post-mortem transition in blood
Functional profile of the pre- to post-mortem transition in bloodFunctional profile of the pre- to post-mortem transition in blood
Functional profile of the pre- to post-mortem transition in blood
 
Regulation of gene expression in eukaryotes
Regulation of gene expression in eukaryotesRegulation of gene expression in eukaryotes
Regulation of gene expression in eukaryotes
 
Molecular Basis of Mutation
Molecular Basis of MutationMolecular Basis of Mutation
Molecular Basis of Mutation
 
Sub-optimal phenotypes of double-knockout of E.coli
Sub-optimal phenotypes of double-knockout of E.coliSub-optimal phenotypes of double-knockout of E.coli
Sub-optimal phenotypes of double-knockout of E.coli
 
Presentation final
Presentation finalPresentation final
Presentation final
 
GENE Expression TARGETED THERAPY.pdf
GENE Expression  TARGETED THERAPY.pdfGENE Expression  TARGETED THERAPY.pdf
GENE Expression TARGETED THERAPY.pdf
 
NetBioSIG2013-KEYNOTE Stefan Schuster
NetBioSIG2013-KEYNOTE Stefan SchusterNetBioSIG2013-KEYNOTE Stefan Schuster
NetBioSIG2013-KEYNOTE Stefan Schuster
 
Genes,brain & behavior1
Genes,brain & behavior1Genes,brain & behavior1
Genes,brain & behavior1
 
J Neurosci 2006
J Neurosci 2006J Neurosci 2006
J Neurosci 2006
 
molecular-basis-of-mutation.pdf
molecular-basis-of-mutation.pdfmolecular-basis-of-mutation.pdf
molecular-basis-of-mutation.pdf
 

Recently uploaded

Total Legal: A “Joint” Journey into the Chemistry of Cannabinoids
Total Legal: A “Joint” Journey into the Chemistry of CannabinoidsTotal Legal: A “Joint” Journey into the Chemistry of Cannabinoids
Total Legal: A “Joint” Journey into the Chemistry of CannabinoidsMarkus Roggen
 
GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024Jene van der Heide
 
Q4-Mod-1c-Quiz-Projectile-333344444.pptx
Q4-Mod-1c-Quiz-Projectile-333344444.pptxQ4-Mod-1c-Quiz-Projectile-333344444.pptx
Q4-Mod-1c-Quiz-Projectile-333344444.pptxtuking87
 
Pests of Sunflower_Binomics_Identification_Dr.UPR
Pests of Sunflower_Binomics_Identification_Dr.UPRPests of Sunflower_Binomics_Identification_Dr.UPR
Pests of Sunflower_Binomics_Identification_Dr.UPRPirithiRaju
 
FBI Profiling - Forensic Psychology.pptx
FBI Profiling - Forensic Psychology.pptxFBI Profiling - Forensic Psychology.pptx
FBI Profiling - Forensic Psychology.pptxPayal Shrivastava
 
Role of Gibberellins, mode of action and external applications.pptx
Role of Gibberellins, mode of action and external applications.pptxRole of Gibberellins, mode of action and external applications.pptx
Role of Gibberellins, mode of action and external applications.pptxjana861314
 
Introduction of Human Body & Structure of cell.pptx
Introduction of Human Body & Structure of cell.pptxIntroduction of Human Body & Structure of cell.pptx
Introduction of Human Body & Structure of cell.pptxMedical College
 
Production technology of Brinjal -Solanum melongena
Production technology of Brinjal -Solanum melongenaProduction technology of Brinjal -Solanum melongena
Production technology of Brinjal -Solanum melongenajana861314
 
GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests Glycosides
GLYCOSIDES Classification Of GLYCOSIDES  Chemical Tests GlycosidesGLYCOSIDES Classification Of GLYCOSIDES  Chemical Tests Glycosides
GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests GlycosidesNandakishor Bhaurao Deshmukh
 
ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...
ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...
ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...Chayanika Das
 
CHROMATOGRAPHY PALLAVI RAWAT.pptx
CHROMATOGRAPHY  PALLAVI RAWAT.pptxCHROMATOGRAPHY  PALLAVI RAWAT.pptx
CHROMATOGRAPHY PALLAVI RAWAT.pptxpallavirawat456
 
Observational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive starsObservational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive starsSérgio Sacani
 
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
6.2 Pests of Sesame_Identification_Binomics_Dr.UPRPirithiRaju
 
DNA isolation molecular biology practical.pptx
DNA isolation molecular biology practical.pptxDNA isolation molecular biology practical.pptx
DNA isolation molecular biology practical.pptxGiDMOh
 
cybrids.pptx production_advanges_limitation
cybrids.pptx production_advanges_limitationcybrids.pptx production_advanges_limitation
cybrids.pptx production_advanges_limitationSanghamitraMohapatra5
 
EGYPTIAN IMPRINT IN SPAIN Lecture by Dr Abeer Zahana
EGYPTIAN IMPRINT IN SPAIN Lecture by Dr Abeer ZahanaEGYPTIAN IMPRINT IN SPAIN Lecture by Dr Abeer Zahana
EGYPTIAN IMPRINT IN SPAIN Lecture by Dr Abeer ZahanaDr.Mahmoud Abbas
 
Harry Coumnas Thinks That Human Teleportation May Ensure Humanity's Survival
Harry Coumnas Thinks That Human Teleportation May Ensure Humanity's SurvivalHarry Coumnas Thinks That Human Teleportation May Ensure Humanity's Survival
Harry Coumnas Thinks That Human Teleportation May Ensure Humanity's Survivalkevin8smith
 
Immunoblott technique for protein detection.ppt
Immunoblott technique for protein detection.pptImmunoblott technique for protein detection.ppt
Immunoblott technique for protein detection.pptAmirRaziq1
 

Recently uploaded (20)

Total Legal: A “Joint” Journey into the Chemistry of Cannabinoids
Total Legal: A “Joint” Journey into the Chemistry of CannabinoidsTotal Legal: A “Joint” Journey into the Chemistry of Cannabinoids
Total Legal: A “Joint” Journey into the Chemistry of Cannabinoids
 
GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024GenAI talk for Young at Wageningen University & Research (WUR) March 2024
GenAI talk for Young at Wageningen University & Research (WUR) March 2024
 
Q4-Mod-1c-Quiz-Projectile-333344444.pptx
Q4-Mod-1c-Quiz-Projectile-333344444.pptxQ4-Mod-1c-Quiz-Projectile-333344444.pptx
Q4-Mod-1c-Quiz-Projectile-333344444.pptx
 
Pests of Sunflower_Binomics_Identification_Dr.UPR
Pests of Sunflower_Binomics_Identification_Dr.UPRPests of Sunflower_Binomics_Identification_Dr.UPR
Pests of Sunflower_Binomics_Identification_Dr.UPR
 
FBI Profiling - Forensic Psychology.pptx
FBI Profiling - Forensic Psychology.pptxFBI Profiling - Forensic Psychology.pptx
FBI Profiling - Forensic Psychology.pptx
 
Role of Gibberellins, mode of action and external applications.pptx
Role of Gibberellins, mode of action and external applications.pptxRole of Gibberellins, mode of action and external applications.pptx
Role of Gibberellins, mode of action and external applications.pptx
 
Introduction of Human Body & Structure of cell.pptx
Introduction of Human Body & Structure of cell.pptxIntroduction of Human Body & Structure of cell.pptx
Introduction of Human Body & Structure of cell.pptx
 
Production technology of Brinjal -Solanum melongena
Production technology of Brinjal -Solanum melongenaProduction technology of Brinjal -Solanum melongena
Production technology of Brinjal -Solanum melongena
 
Ultrastructure and functions of Chloroplast.pptx
Ultrastructure and functions of Chloroplast.pptxUltrastructure and functions of Chloroplast.pptx
Ultrastructure and functions of Chloroplast.pptx
 
GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests Glycosides
GLYCOSIDES Classification Of GLYCOSIDES  Chemical Tests GlycosidesGLYCOSIDES Classification Of GLYCOSIDES  Chemical Tests Glycosides
GLYCOSIDES Classification Of GLYCOSIDES Chemical Tests Glycosides
 
Interferons.pptx.
Interferons.pptx.Interferons.pptx.
Interferons.pptx.
 
ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...
ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...
ESSENTIAL FEATURES REQUIRED FOR ESTABLISHING FOUR TYPES OF BIOSAFETY LABORATO...
 
CHROMATOGRAPHY PALLAVI RAWAT.pptx
CHROMATOGRAPHY  PALLAVI RAWAT.pptxCHROMATOGRAPHY  PALLAVI RAWAT.pptx
CHROMATOGRAPHY PALLAVI RAWAT.pptx
 
Observational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive starsObservational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive stars
 
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
6.2 Pests of Sesame_Identification_Binomics_Dr.UPR
 
DNA isolation molecular biology practical.pptx
DNA isolation molecular biology practical.pptxDNA isolation molecular biology practical.pptx
DNA isolation molecular biology practical.pptx
 
cybrids.pptx production_advanges_limitation
cybrids.pptx production_advanges_limitationcybrids.pptx production_advanges_limitation
cybrids.pptx production_advanges_limitation
 
EGYPTIAN IMPRINT IN SPAIN Lecture by Dr Abeer Zahana
EGYPTIAN IMPRINT IN SPAIN Lecture by Dr Abeer ZahanaEGYPTIAN IMPRINT IN SPAIN Lecture by Dr Abeer Zahana
EGYPTIAN IMPRINT IN SPAIN Lecture by Dr Abeer Zahana
 
Harry Coumnas Thinks That Human Teleportation May Ensure Humanity's Survival
Harry Coumnas Thinks That Human Teleportation May Ensure Humanity's SurvivalHarry Coumnas Thinks That Human Teleportation May Ensure Humanity's Survival
Harry Coumnas Thinks That Human Teleportation May Ensure Humanity's Survival
 
Immunoblott technique for protein detection.ppt
Immunoblott technique for protein detection.pptImmunoblott technique for protein detection.ppt
Immunoblott technique for protein detection.ppt
 

Translating research data into Gene Ontology annotations

  • 1. Translating research data into Gene Ontology annotations Pascale Gaudet SIB – Swiss Institute of Bioinformatics GO Consortium
  • 2. Ontology Annotations Model of biology Gene Ontology Consortium What we provide A structured representation of biology, composed of: • Classes • Relations • Definitions + = - Antigen binding - Adaptive immune response - Extracellular IGHA1 Immunoglobulin heavy constant alpha 1 - Glutamine-tRNA ligase activity - Translation - Cytoplasm QARS Gln tRNA synthetase Statements about the functions of specific gene products. 3 aspects: • Molecular function • Biological process • Cellular component Representation of current knowledge in a manner that is: • Human understandable • Machine computable
  • 3. GO “annotations” § An annotation is a statement linking a gene to some aspect of its function (a GO ontology term) § Each annotation is based on some evidence, recorded as part of the annotation § Evidence code (type of evidence) § Reference (published journal article) Examples: Annotation 1: INSR + ‘receptor activity’ Annotation 2: INSR + ‘plasma membrane’ Annotation 3: INSR + ‘insulin receptor signaling pathway’
  • 4. Semantics of a GO annotation The association of a GO class with a gene product is a statement that means: § molecular function: molecular activities of gene products § cellular component: where gene products are active § biological process: pathways and larger processes made up of the activities of multiple gene products. § In other words, annotations represent the normal, in vivo biological role of gene products
  • 5. Manual - Literature-based Manual - Sequence-based Algorithmic (unreviewed) How are annotations generated? An computer program analyses a sequences and make a prediction based on some decision criteria, for example: -protein domain (InterPro2GO) - sequence similarity (BLAST2GO) An expert reviews the literature and assigns functions, processes and cellular components to genes products > 500,000 annotations > 65M annotations An expert analyses a sequence and makes a prediction concerning the gene function based on known functions of related sequences The predictions can be based on the known function of evolutionarily related sequences (phylogenetic relationships) > 3M annotations
  • 6. Manual - Literature-based Evidence types Chibucos MC, Siegele DA, Hu JC, Giglio M. (2017) Evidence and conclusion ontology PMID: 27812948 Manual - Sequence-based Algorithmic (unreviewed) EXP experimental evidence IDA inferred from direct assay IPI inferred from physical interaction IMP inferred from mutant phenotype ISS inferred from sequence similarity ISO inferred from sequence ortholog IBA inferred from biological aspect of ancestor IEA inferred from electronic annotation
  • 7. Who produces GO annotations? • Model organism databases (SGD, FlyBase, wormbase, MGI, etc) • Generalist databases, for eg UniProtKB, IntAct • Domain-specific projects: Cardiovascular project (UCL), synapse project (VU), etc. • Anyone who wishes to contribute their expertise and data to the project
  • 8. Best practices for generating literature-based GO annotations § Ensure consistency of usage across a broad consortium of contributors § Improve inferencing capabilities
  • 9. Focus on the research hypothesis § Use prior knowledge to understand the hypothesis being tested and its relation to the experimental observation Protein Known roles Hypothesis Assay Result Conclusion for GO DDFB (O76075) DNase The nuclease activity of DDFB is required for nuclear DNA fragmentation during apoptosis Apoptotic DNA fragmentation increased in the presence of DDFB DDFB mediates nuclear DNA fragmentation during apoptosis = apoptotic DNA fragmentation (GO:0006309) FOXL2 (P58012) Transcription factor Mutations in FOXL2 are known to cause premature ovarian failure, which may be due to increased apoptosis Apoptotic DNA fragmentation increased in the presence of FOXL2 FOXL2 increases the rate of apoptosis = positive regulation of apoptotic process (GO:0043065)
  • 10. Annotate the conclusion, not the assay 1) rubidium if often used to assay potassium transport, because the radioactive form is more readily available; - the physiologically relevant substrate is potassium 2) Protein kinases are often tested with non-physiologically relevant substrates, such as histone - if the authors do not discuss the physiological relevance, one cannot annotate the substrate
  • 11. On the in vivo relevance of phenotypes • Phenotypes can help understand the function of proteins • Phenotypes can insights into mechanisms leading to disease • The scope of the GO, though, is to capture the normal function of proteins Indirect effects of a mutation - RNA polymerase affects essentially all cellular processes (cell proliferation, development, etc) but does not mediate these processes Lack of hypothesis for a role of a protein in a process: - Knockdown of Tmem234 in zebrafish results defects in pronephric glomerulus formation. Annotation by IMP to glomerulus formation is not supported by any cellular/molecular data
  • 12. Get the wider perspective • Favor a gene-by-gene or pathway-by-pathway approach for curation rather than paper-by-paper • Read recent publications • Remove incorrect annotations based on invalidated hypothesis
  • 13. Guidelines for high quality annotations • Annotate the conclusion of the experiment • Use the biological context to interpret the experiments • Carefully select publications. Read recent publications • Ensure consistency with existing annotations • Keep annotation up-to date: Remove obsolete annotations
  • 14. Other approaches for quality control • Annotation consistency exercises • Taxonomic constraints • Co-occurrence of annotations • Phylogenetic annotations • User feedback - from GO website - from PubMed - from databases
  • 17. This talk was based upon
  • 18. Acknowledgments • GO PIs • Judy Blake • Mike Cherry • Suzanna Lewis • Paul Sternberg • Paul Thomas • GO Handbook contributors • Christophe Dessimoz • Jim Hu • Nives Skunca • Sylvain Poux • Funding • NIH HG002273 (GO)