SlideShare a Scribd company logo
1 of 29
Download to read offline
Ontology-centric




Knowledge Navigation
.. of the scientific literature
      Christopher J. O. Baker
   Institute for InfoComm Research,
          A*STAR, Singapore
Motivation
•   Scientists typically need to integrate a spectrum of
    information to successfully complete a task.

•   On average a scientist or knowledge worker
    spends 1 day per week searching for, integrating
    and analyzing information, 50% of which is
    unstructured digital formats.

•   Access to information structured according to
    explicit knowledge representations or taxonomies
    is a fundamental concern of all scientists.

•   Moving beyond keyword search requires tools that
    provide lexical matching to semantic, conceptual
    and contextual levels of information and this
    entails an infrastructure for indexing text segments
    according to domain-specific metadata
In the future ….
•   Users will be involved in the design of information systems

•   Publishers will charge users for value added search:
    (who will build such search systems)

•   Users will search across semantically integration data
    sources and data types (how to facilitate system creation /
    adoption)

•   Knowledge driven systems - rapidly built and deployed with
    the engagement of domain experts in a knowledge
    engineering team
Literature-driven, Ontology-centric
  Knowledge Integration and Navigation

                                                 Visual Query
                                   Reasoning



                                   Ontology                        50
                                                                sentences
500 documents,                                                   to read
blogs, newsfeeds
                                  Ontology
to browse                         Population
                   Text Mining




              Content delivery using expressive semantics
W3C Semantic Web Technologies
•   URI / LSID
•   Ontologies
•   Reasoners
•   Query Languages
•   Web Services
•   Service Registries
•   Agents
•   Multi Agent Systems
•   Workflows Engines
•   GRID / Semantic GRID
•   Text Mining
•   Service Oriented Architecture
Controlled Vocabularies Ontologies
Catalog/ Thesauri                            Formal                             General
ID       “narrower term”                      is-a         Frames                 logical
           Controlled vocabularies           part-of     (properties)         constraints


      Terms/                      Informal          Formal       Value
 Glossary/Controlled                 is-a          instance    restrictions
    vocabularies                   part-of

Capture knowledge:                            Make the content in
The meaning of important vocabulary           information sources explicit.
(classes, properties/relations
and instance data in a domain model).
                                              Common domain terminology
Index and query model                         Basis for interoperability
to a repository of information.
                                              between information systems.
Lipid Ontology
> Implementation:
   OWL-DL
> DL Expressivity
    ALCHIQ
> Uses LIPIDMAPS
    systematic
    nomenclature
> 560 Named classes
> 352 Lipid subclasses
    71 Object properties
    (inc inv.)
> 4 Datatype
    properties
> Lipid instance:                           Graph fragment
                           DL Axioms
   LIPIDMAPS
    systematic name
                                       Lipid Hierarchy
> Depth: 8 levels
Domain Knowledge vs
    information                                          Concept Definitions
    system metadata
Ontologies Online
Ontology-centric knowledge architecture
Ontology-centric Knowledge Integration
                     • Content Delivery Platform - Automated
                       Document delivery from online databases
                       Tools for conversion to text-minable text
 Content
Acquisition
                     • Text Mining - Customized and Automated
                       Regular Expressions, Named Entities,
                       Relations,

 Domain              • Knowledge Engineering – Ontology Creation
 specific              Domain Modeling / Customized Rapid
 raw text               Prototyping


                     • Ontology Population – Automated Instantiation
                       Sentences as instances / Co-occurrence and
                       named relations (Rules)
Domian Ontology vs Mixed Metadata:
     a literature specification
Ontology Population Workflow
             •   Ontology based information retrieval
                 applies NLP to link documents to
                 existing ontologies
             •   Ontology-driven NLP - NLP that
                 actively uses ontological resources for
                 NLP tasks
             •   Ontological NLP - ontologies used as a
                 knowledge base for NLP tasks while
                 also exporting the results of NLP
                 analyses into an ontology that can then
                 subsequent semantic queries to the
                 ontology using description logic
                 reasoners and a box reasoning
             •   Ontology based NLP - the results of
                 NLP are exported to another ontology,
                 using external resources for text
                 processing,
                                           Witte etal. 2007
Text Mining
• Class Instance Generation from full text
   – Named entity recognition (gazetteer based)
   – Dictionary based matching of text tokens to domain
     specific vocabularies i.e. (LipidBank, Lipidmaps,
     KEGG, IUPAC) and curated Swissprot terms and disease
     ontology of CGM
   – Normalization and grounding to canonical names
• Relation Detection - Role Assertions:
   – Co-occurrence and Rule-based relation detection of binary
     pairs from which knowledgebase instances are generated.
     Primary set of binary interactions mined from text:
   – Lipid-Protein, Lipid-Disease, Protein-Disease
   – Domain specific library of curated biological relations.
Knowledgebase Instantiation
1) Rule based identification of Sentences containing target keywords
2) Instantiation with JENA API http://jena.sourceforge.net/ for this purpose.

Target keywords found in sentences are instantiated to corresponding
  ontology class
•   Lipid / Protein / Disease instances are instantiated to the respective ontology
    classes (as tagged by the gazetteer)
•   Binary pairs instantiated to the respective Object Properties as role assertions
•   Sentences instantiated to the respective Data type properties.

For each lipid identified in a sentence the corresponding data
  are instantiated to the ontology from Lipid Data Warehouse records
   requiring no further text processing.
•   Lipid - LIPIDMAPS Systematic Name and its associated
•   Lipid - IUPAC Name, Lipid – synonyms, Lipid - Database ID.
Knowledgebase Instantiation
Rule Based Sentence Processing
<Lipid> AND <Protein> AND LipidProteinInteraction-TriggerWord e.g. quot;interactquot;, quot;bindquot;, quot;mediatequot;
<Lipid> AND <Disease> AND LipidDiseaseInteraction-TriggerWord e.g quot;involvequot;, quot;causequot;
                   Lipid Class                                              Protein
                                                                            Instance


                                               Lipid Instance




                                                                   Lipid Instance
Knowledge Integration and Query
       User input query                        Search            Web content or
                                               Engine            Full text papers                          NLP tagging
Papers identified: 262
    121 papers with no lipid protein relations
     141 papers contributed to ontology instantiation
     186 lipid names
                                                                                                                 docs
     528 protein names
                                                                                                                tagged
After normalisation and grounding:
                                                                                                                  with
     92 Lipidmaps systematic names                                                                             relevant
     52 IUPAC names, 412 exact synonyms, 6 broad synonyms, 319 protein names                                     name
    Cross link to 59 Lipidbank entries                                                                          entities
Sentences:
    Co-occurrence before rules 1356 Sentences, After rules 683 Interaction sentences
     92 Lipidmaps names instantiated to 35 classes (2.6 lipids per class)

              Instantiation Time: 22 seconds
                                                                                                              Ontology
                                               Knowledge        “Instantiated ontology”                     instantiation
User             Output for end user           Navigation             Baker CJ, Kanagasabai R, Ang WT, Veeramani A, Low HS, and
                                                 vehicle              Wenk MR. Towards ontology-driven navigation of the lipid
                                                                      bibliosphere. BMC Bioinformatics. 2008;9 Suppl 1:S5.
Knowledge Integration and Query
       User input query              Search       Web content or
                                     Engine       Full text papers                         NLP tagging




                                                                                                 docs
                                                                                                tagged
                                                                                                  with
                                                                                               relevant
                                                                                                 name
                                                                                                entities




                                                                                              Ontology
                                     Knowledge    “Instantiated ontology”                   instantiation
User           Output for end user   Navigation       Baker CJ, Kanagasabai R, Ang WT, Veeramani A, Low HS, and
                                       vehicle        Wenk MR. Towards ontology-driven navigation of the lipid
                                                      bibliosphere. BMC Bioinformatics. 2008;9 Suppl 1:S5.
Knowlegator

                       Query Composition Panel

                                  Results Panel


Ontology
Content
Query
Syntax
                                  Query Engine
 Concept                            Dialogue
Properties
Overview
Complex Query Generation


                                                     rma tician
                                            In f o


         x pert
    ain e
D om




Find documents and sentences describing proteins-
lipid interaction and corresponding lipid synonyms.
Pathway Discovery Algorithm
Finds transitive paths
across the graph:
between source and
target concepts. Can
define path length
and result size


                         … paths between any object
                         properties or a user defined
                         object properties only e.g.
                         protein interacts with protein
Pathway Knowledge Discovery



2 concepts or keywords




                                                                                                ... across
Results with                                                                                     multiple
                          Kanagasabai R. Low HS ,Ang WT, Wenk MR, Baker CJO.
semantic labelling        Ontology-centric navigation of pathway information mined from text,
                          Bio-Ontologies SIG: Knowledge in Biology, ISMB July 2008
                                                                                                relations
Pathway Knowledge Discovery 2
Navigation of Cancer Pathways
1 search term
 (instance or
 concept)
 generates a
 list of natural
 language
 questions
 answerable by
 the ontology

     and a
     direct link
     to answers

Ang WT, Kanagasabai R, Baker CJ.
Knowledge Translation: Computing the
query potential of bio-ontologies,
Genome Informatics Workshop 2008
Submitted …..
Application Workflow
Semantic Technologies Architecture
Knowledge Services: Development
Knowledge Worker involved in Discovery                 Navigation Paradigms


         Ontology Engineering                    Quality                        Semantic
                                               Evolution                          Data
                                              Maintenance
                                                                               Integration



                                                                  NLP &
                                                                   Text        Databases
                                                                  Mining




                     Multi-user involvement
                                                                Domain Expert
                                                                        Text Mining
                                              Ontology                  Engineer
Phase 1             Phase 2
                                                                              Semantics
                                                    Ontology Engineer         Engineer
Annotation Services
Acknowledgements
 Semantic Technology Group

      Christopher J. O. Baker
      Kanagasabi Rajaraman
        Menaka Rajapakse
        Anitha Veeramani
         Ang Wee Tiong
    Alexander Garcia (Alumnus)

         Collaborators
      Markus R Wenk, NUS
      Low Hong-Sang, NUS
       Choo Kar Heng, I2R
     Shoba Ranganathan NUS
        Suisheng Tan, I2R

More Related Content

What's hot

The Semantic Web #8 - Ontology
The Semantic Web #8 - OntologyThe Semantic Web #8 - Ontology
The Semantic Web #8 - OntologyMyungjin Lee
 
Sasa Nesic - PhD Dissertation Defense
Sasa Nesic - PhD Dissertation DefenseSasa Nesic - PhD Dissertation Defense
Sasa Nesic - PhD Dissertation DefenseSasa Nesic
 
Deep Content Learning in Traffic Prediction and Text Classification
Deep Content Learning in Traffic Prediction and Text ClassificationDeep Content Learning in Traffic Prediction and Text Classification
Deep Content Learning in Traffic Prediction and Text ClassificationHPCC Systems
 
State Of The Art - Part 2 Products Projects
State Of The Art - Part 2 Products ProjectsState Of The Art - Part 2 Products Projects
State Of The Art - Part 2 Products ProjectsPascal Cottereau
 
Model-Driven Software Development with Semantic Web Technologies
Model-Driven Software Development with Semantic Web TechnologiesModel-Driven Software Development with Semantic Web Technologies
Model-Driven Software Development with Semantic Web TechnologiesFernando Silva Parreiras
 
2012-10-08 Practical Semantics In The Pharmaceutical Industry - The Open PHAC...
2012-10-08 Practical Semantics In The Pharmaceutical Industry - The Open PHAC...2012-10-08 Practical Semantics In The Pharmaceutical Industry - The Open PHAC...
2012-10-08 Practical Semantics In The Pharmaceutical Industry - The Open PHAC...open_phacts
 
Open hpi semweb-06-part4
Open hpi semweb-06-part4Open hpi semweb-06-part4
Open hpi semweb-06-part4Nadine Ludwig
 

What's hot (9)

Tutorial kcc-2011
Tutorial kcc-2011Tutorial kcc-2011
Tutorial kcc-2011
 
The Semantic Web #8 - Ontology
The Semantic Web #8 - OntologyThe Semantic Web #8 - Ontology
The Semantic Web #8 - Ontology
 
Sasa Nesic - PhD Dissertation Defense
Sasa Nesic - PhD Dissertation DefenseSasa Nesic - PhD Dissertation Defense
Sasa Nesic - PhD Dissertation Defense
 
Jmora.di.oeg.3x1e
Jmora.di.oeg.3x1eJmora.di.oeg.3x1e
Jmora.di.oeg.3x1e
 
Deep Content Learning in Traffic Prediction and Text Classification
Deep Content Learning in Traffic Prediction and Text ClassificationDeep Content Learning in Traffic Prediction and Text Classification
Deep Content Learning in Traffic Prediction and Text Classification
 
State Of The Art - Part 2 Products Projects
State Of The Art - Part 2 Products ProjectsState Of The Art - Part 2 Products Projects
State Of The Art - Part 2 Products Projects
 
Model-Driven Software Development with Semantic Web Technologies
Model-Driven Software Development with Semantic Web TechnologiesModel-Driven Software Development with Semantic Web Technologies
Model-Driven Software Development with Semantic Web Technologies
 
2012-10-08 Practical Semantics In The Pharmaceutical Industry - The Open PHAC...
2012-10-08 Practical Semantics In The Pharmaceutical Industry - The Open PHAC...2012-10-08 Practical Semantics In The Pharmaceutical Industry - The Open PHAC...
2012-10-08 Practical Semantics In The Pharmaceutical Industry - The Open PHAC...
 
Open hpi semweb-06-part4
Open hpi semweb-06-part4Open hpi semweb-06-part4
Open hpi semweb-06-part4
 

Viewers also liked

Embedded Human Computation for Knowledge Extraction and Evaluation
Embedded Human Computation for Knowledge Extraction and EvaluationEmbedded Human Computation for Knowledge Extraction and Evaluation
Embedded Human Computation for Knowledge Extraction and EvaluationwebLyzard technology
 
SAS University Edition - Getting Started
SAS University Edition - Getting StartedSAS University Edition - Getting Started
SAS University Edition - Getting StartedCraig Trim
 
Measuring Knowledge Commercialization/ Denzil Doyle
Measuring Knowledge Commercialization/ Denzil DoyleMeasuring Knowledge Commercialization/ Denzil Doyle
Measuring Knowledge Commercialization/ Denzil Doyleclaplante
 
Spay.Neuter Road Map Conf
Spay.Neuter Road Map ConfSpay.Neuter Road Map Conf
Spay.Neuter Road Map Confjlandsman
 
39 Clues Book - 15th Anniversary Gift
39 Clues Book - 15th Anniversary Gift39 Clues Book - 15th Anniversary Gift
39 Clues Book - 15th Anniversary GiftScott Studham
 
Experinces Deploying Shared Services
Experinces Deploying Shared ServicesExperinces Deploying Shared Services
Experinces Deploying Shared ServicesScott Studham
 
Ilya interview leven#14
Ilya interview leven#14Ilya interview leven#14
Ilya interview leven#14Ilya van Marle
 
PloneFormGen: Past, Present, Future
PloneFormGen: Past, Present, FuturePloneFormGen: Past, Present, Future
PloneFormGen: Past, Present, FutureSteve McMahon
 
presentation Health 2.0 Amsterdam
presentation Health 2.0 Amsterdampresentation Health 2.0 Amsterdam
presentation Health 2.0 AmsterdamValentina Rao
 
Some ideas for the evaluation of cross media interaction
Some ideas for the evaluation of cross media interactionSome ideas for the evaluation of cross media interaction
Some ideas for the evaluation of cross media interactionValentina Rao
 

Viewers also liked (20)

Embedded Human Computation for Knowledge Extraction and Evaluation
Embedded Human Computation for Knowledge Extraction and EvaluationEmbedded Human Computation for Knowledge Extraction and Evaluation
Embedded Human Computation for Knowledge Extraction and Evaluation
 
Iatefl2007
Iatefl2007Iatefl2007
Iatefl2007
 
Group discussion
Group discussionGroup discussion
Group discussion
 
SAS University Edition - Getting Started
SAS University Edition - Getting StartedSAS University Edition - Getting Started
SAS University Edition - Getting Started
 
ENGAGE Group Discussion
ENGAGE Group DiscussionENGAGE Group Discussion
ENGAGE Group Discussion
 
WeConnect
WeConnectWeConnect
WeConnect
 
How diazo works
How diazo worksHow diazo works
How diazo works
 
The Mashup Library
The Mashup LibraryThe Mashup Library
The Mashup Library
 
Measuring Knowledge Commercialization/ Denzil Doyle
Measuring Knowledge Commercialization/ Denzil DoyleMeasuring Knowledge Commercialization/ Denzil Doyle
Measuring Knowledge Commercialization/ Denzil Doyle
 
Spay.Neuter Road Map Conf
Spay.Neuter Road Map ConfSpay.Neuter Road Map Conf
Spay.Neuter Road Map Conf
 
Ornl IT
Ornl ITOrnl IT
Ornl IT
 
39 Clues Book - 15th Anniversary Gift
39 Clues Book - 15th Anniversary Gift39 Clues Book - 15th Anniversary Gift
39 Clues Book - 15th Anniversary Gift
 
Rao bc2013
Rao bc2013Rao bc2013
Rao bc2013
 
Experinces Deploying Shared Services
Experinces Deploying Shared ServicesExperinces Deploying Shared Services
Experinces Deploying Shared Services
 
Intro Nuevos Medios
Intro Nuevos MediosIntro Nuevos Medios
Intro Nuevos Medios
 
Security
SecuritySecurity
Security
 
Ilya interview leven#14
Ilya interview leven#14Ilya interview leven#14
Ilya interview leven#14
 
PloneFormGen: Past, Present, Future
PloneFormGen: Past, Present, FuturePloneFormGen: Past, Present, Future
PloneFormGen: Past, Present, Future
 
presentation Health 2.0 Amsterdam
presentation Health 2.0 Amsterdampresentation Health 2.0 Amsterdam
presentation Health 2.0 Amsterdam
 
Some ideas for the evaluation of cross media interaction
Some ideas for the evaluation of cross media interactionSome ideas for the evaluation of cross media interaction
Some ideas for the evaluation of cross media interaction
 

Similar to "Ontology-centric navigation of the scientific literature"

Towards a Simple, Standards-Compliant, and Generic Phylogenetic Database
Towards a Simple, Standards-Compliant, and Generic Phylogenetic DatabaseTowards a Simple, Standards-Compliant, and Generic Phylogenetic Database
Towards a Simple, Standards-Compliant, and Generic Phylogenetic DatabaseHilmar Lapp
 
Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003robertstevens65
 
Ontologies and semantic web
Ontologies and semantic webOntologies and semantic web
Ontologies and semantic webStanley Wang
 
NLP in Web Data Extraction (Omer Gunes)
NLP in Web Data Extraction (Omer Gunes)NLP in Web Data Extraction (Omer Gunes)
NLP in Web Data Extraction (Omer Gunes)timfu
 
Usability-focused Clinical Decision Support with the Help of Semantic Technol...
Usability-focused Clinical Decision Support with the Help of Semantic Technol...Usability-focused Clinical Decision Support with the Help of Semantic Technol...
Usability-focused Clinical Decision Support with the Help of Semantic Technol...Plan de Calidad para el SNS
 
Using ontologies to do integrative systems biology
Using ontologies to do integrative systems biologyUsing ontologies to do integrative systems biology
Using ontologies to do integrative systems biologyChris Evelo
 
NIFSTD and NeuroLex: A Comprehensive Ontology Development Based on Multiple B...
NIFSTD and NeuroLex: A Comprehensive Ontology Development Based on Multiple B...NIFSTD and NeuroLex: A Comprehensive Ontology Development Based on Multiple B...
NIFSTD and NeuroLex: A Comprehensive Ontology Development Based on Multiple B...Neuroscience Information Framework
 
Case Study in Linked Data and Semantic Web: Human Genome
Case Study in Linked Data and Semantic Web: Human GenomeCase Study in Linked Data and Semantic Web: Human Genome
Case Study in Linked Data and Semantic Web: Human GenomeDavid Portnoy
 
MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 20133 Round Stones
 
20120419 linkedopendataandteamsciencemcguinnesschicago
20120419 linkedopendataandteamsciencemcguinnesschicago20120419 linkedopendataandteamsciencemcguinnesschicago
20120419 linkedopendataandteamsciencemcguinnesschicagoDeborah McGuinness
 
Primary, secondary, tertiary biological database
Primary, secondary, tertiary biological databasePrimary, secondary, tertiary biological database
Primary, secondary, tertiary biological databaseKAUSHAL SAHU
 
Roeder rocky 2011_46
Roeder rocky 2011_46Roeder rocky 2011_46
Roeder rocky 2011_46Chris Roeder
 
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...Amit Sheth
 
Semantic Web for 360-degree Health: State-of-the-Art & Vision for Better Inte...
Semantic Web for 360-degree Health: State-of-the-Art & Vision for Better Inte...Semantic Web for 360-degree Health: State-of-the-Art & Vision for Better Inte...
Semantic Web for 360-degree Health: State-of-the-Art & Vision for Better Inte...Amit Sheth
 
20120411 travelalliancemcguinnessfinal
20120411 travelalliancemcguinnessfinal20120411 travelalliancemcguinnessfinal
20120411 travelalliancemcguinnessfinalDeborah McGuinness
 
SooryaKiran Bioinformatics
SooryaKiran BioinformaticsSooryaKiran Bioinformatics
SooryaKiran Bioinformaticscontactsoorya
 
EiTESAL eHealth Conference 14&15 May 2017
EiTESAL eHealth Conference 14&15 May 2017 EiTESAL eHealth Conference 14&15 May 2017
EiTESAL eHealth Conference 14&15 May 2017 EITESANGO
 
Knowledge Organization System (KOS) for biodiversity information resources, G...
Knowledge Organization System (KOS) for biodiversity information resources, G...Knowledge Organization System (KOS) for biodiversity information resources, G...
Knowledge Organization System (KOS) for biodiversity information resources, G...Dag Endresen
 

Similar to "Ontology-centric navigation of the scientific literature" (20)

Towards a Simple, Standards-Compliant, and Generic Phylogenetic Database
Towards a Simple, Standards-Compliant, and Generic Phylogenetic DatabaseTowards a Simple, Standards-Compliant, and Generic Phylogenetic Database
Towards a Simple, Standards-Compliant, and Generic Phylogenetic Database
 
Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003
 
Ontologies and semantic web
Ontologies and semantic webOntologies and semantic web
Ontologies and semantic web
 
NLP in Web Data Extraction (Omer Gunes)
NLP in Web Data Extraction (Omer Gunes)NLP in Web Data Extraction (Omer Gunes)
NLP in Web Data Extraction (Omer Gunes)
 
Important protein databases and proteomics softwares
Important protein databases and proteomics softwaresImportant protein databases and proteomics softwares
Important protein databases and proteomics softwares
 
Usability-focused Clinical Decision Support with the Help of Semantic Technol...
Usability-focused Clinical Decision Support with the Help of Semantic Technol...Usability-focused Clinical Decision Support with the Help of Semantic Technol...
Usability-focused Clinical Decision Support with the Help of Semantic Technol...
 
Using ontologies to do integrative systems biology
Using ontologies to do integrative systems biologyUsing ontologies to do integrative systems biology
Using ontologies to do integrative systems biology
 
NIFSTD and NeuroLex: A Comprehensive Ontology Development Based on Multiple B...
NIFSTD and NeuroLex: A Comprehensive Ontology Development Based on Multiple B...NIFSTD and NeuroLex: A Comprehensive Ontology Development Based on Multiple B...
NIFSTD and NeuroLex: A Comprehensive Ontology Development Based on Multiple B...
 
Case Study in Linked Data and Semantic Web: Human Genome
Case Study in Linked Data and Semantic Web: Human GenomeCase Study in Linked Data and Semantic Web: Human Genome
Case Study in Linked Data and Semantic Web: Human Genome
 
MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
 
20120419 linkedopendataandteamsciencemcguinnesschicago
20120419 linkedopendataandteamsciencemcguinnesschicago20120419 linkedopendataandteamsciencemcguinnesschicago
20120419 linkedopendataandteamsciencemcguinnesschicago
 
Primary, secondary, tertiary biological database
Primary, secondary, tertiary biological databasePrimary, secondary, tertiary biological database
Primary, secondary, tertiary biological database
 
Ontology at Manchester
Ontology at ManchesterOntology at Manchester
Ontology at Manchester
 
Roeder rocky 2011_46
Roeder rocky 2011_46Roeder rocky 2011_46
Roeder rocky 2011_46
 
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
 
Semantic Web for 360-degree Health: State-of-the-Art & Vision for Better Inte...
Semantic Web for 360-degree Health: State-of-the-Art & Vision for Better Inte...Semantic Web for 360-degree Health: State-of-the-Art & Vision for Better Inte...
Semantic Web for 360-degree Health: State-of-the-Art & Vision for Better Inte...
 
20120411 travelalliancemcguinnessfinal
20120411 travelalliancemcguinnessfinal20120411 travelalliancemcguinnessfinal
20120411 travelalliancemcguinnessfinal
 
SooryaKiran Bioinformatics
SooryaKiran BioinformaticsSooryaKiran Bioinformatics
SooryaKiran Bioinformatics
 
EiTESAL eHealth Conference 14&15 May 2017
EiTESAL eHealth Conference 14&15 May 2017 EiTESAL eHealth Conference 14&15 May 2017
EiTESAL eHealth Conference 14&15 May 2017
 
Knowledge Organization System (KOS) for biodiversity information resources, G...
Knowledge Organization System (KOS) for biodiversity information resources, G...Knowledge Organization System (KOS) for biodiversity information resources, G...
Knowledge Organization System (KOS) for biodiversity information resources, G...
 

More from bridgingworlds2008

“Libraries as common denominator: from the citizen, country and global perspe...
“Libraries as common denominator: from the citizen, country and global perspe...“Libraries as common denominator: from the citizen, country and global perspe...
“Libraries as common denominator: from the citizen, country and global perspe...bridgingworlds2008
 
“How do you provide for everyone: success with diverse populations in the UK ...
“How do you provide for everyone: success with diverse populations in the UK ...“How do you provide for everyone: success with diverse populations in the UK ...
“How do you provide for everyone: success with diverse populations in the UK ...bridgingworlds2008
 
“Developing a Metrics-based Online Strategy”
“Developing a Metrics-based Online Strategy”“Developing a Metrics-based Online Strategy”
“Developing a Metrics-based Online Strategy”bridgingworlds2008
 
“How do you provide for everyone: success with diverse populations in the UK ...
“How do you provide for everyone: success with diverse populations in the UK ...“How do you provide for everyone: success with diverse populations in the UK ...
“How do you provide for everyone: success with diverse populations in the UK ...bridgingworlds2008
 
“Virtual Communities in Europe: the cultural mix and how the European Library...
“Virtual Communities in Europe: the cultural mix and how the European Library...“Virtual Communities in Europe: the cultural mix and how the European Library...
“Virtual Communities in Europe: the cultural mix and how the European Library...bridgingworlds2008
 
“The physical library in the 2.0 age and beyond - a UK perspective”
“The physical library in the 2.0 age and beyond - a UK perspective”“The physical library in the 2.0 age and beyond - a UK perspective”
“The physical library in the 2.0 age and beyond - a UK perspective”bridgingworlds2008
 
“Open Source, Crowd Source: harnessing the power of the people behind our lib...
“Open Source, Crowd Source: harnessing the power of the people behind our lib...“Open Source, Crowd Source: harnessing the power of the people behind our lib...
“Open Source, Crowd Source: harnessing the power of the people behind our lib...bridgingworlds2008
 
“New spaces, activities and challenges: village kids in the library”
“New spaces, activities and challenges: village kids in the library”“New spaces, activities and challenges: village kids in the library”
“New spaces, activities and challenges: village kids in the library”bridgingworlds2008
 
“Search Engine 101: Crawling, Ranking, Coverage and Freshness”
“Search Engine 101: Crawling, Ranking, Coverage and Freshness”“Search Engine 101: Crawling, Ranking, Coverage and Freshness”
“Search Engine 101: Crawling, Ranking, Coverage and Freshness”bridgingworlds2008
 
“Public Library 2.0: Making it Happen”
“Public Library 2.0: Making it Happen”“Public Library 2.0: Making it Happen”
“Public Library 2.0: Making it Happen”bridgingworlds2008
 
“Library 2.0: Balancing the Risks and Benefits to Maximise the Dividends”
“Library 2.0: Balancing the Risks and Benefits to Maximise the Dividends”“Library 2.0: Balancing the Risks and Benefits to Maximise the Dividends”
“Library 2.0: Balancing the Risks and Benefits to Maximise the Dividends”bridgingworlds2008
 
“Information Literacy and Web 2.0 : is it just hype?”
“Information Literacy and Web 2.0 :  is it just hype?”“Information Literacy and Web 2.0 :  is it just hype?”
“Information Literacy and Web 2.0 : is it just hype?”bridgingworlds2008
 
“Library spaces in the knowledge society – knotting together global and local”
“Library spaces in the knowledge society – knotting together global and local”“Library spaces in the knowledge society – knotting together global and local”
“Library spaces in the knowledge society – knotting together global and local”bridgingworlds2008
 
“Agency in a socially networked world: library clients increase their room to...
“Agency in a socially networked world: library clients increase their room to...“Agency in a socially networked world: library clients increase their room to...
“Agency in a socially networked world: library clients increase their room to...bridgingworlds2008
 
"Avatars: HVX Silverstar | HVX Voyager & HVX Shephard in Teen Second Life"
"Avatars: HVX Silverstar | HVX Voyager & HVX Shephard in Teen Second Life""Avatars: HVX Silverstar | HVX Voyager & HVX Shephard in Teen Second Life"
"Avatars: HVX Silverstar | HVX Voyager & HVX Shephard in Teen Second Life"bridgingworlds2008
 
"Innovation and Wealth Creation: the British Library Perspective”
"Innovation and Wealth Creation: the British Library Perspective”"Innovation and Wealth Creation: the British Library Perspective”
"Innovation and Wealth Creation: the British Library Perspective”bridgingworlds2008
 
“Librarians 2.0: Sowing Padi into (the) SEA”
“Librarians 2.0: Sowing Padi into (the) SEA”“Librarians 2.0: Sowing Padi into (the) SEA”
“Librarians 2.0: Sowing Padi into (the) SEA”bridgingworlds2008
 
“Library 2.0: Let's get connected!”
“Library 2.0: Let's get connected!”“Library 2.0: Let's get connected!”
“Library 2.0: Let's get connected!”bridgingworlds2008
 
“Librarian 2.0 - New Breed or Just Another Day at the Office?”
“Librarian 2.0 - New Breed or Just Another Day at the Office?”“Librarian 2.0 - New Breed or Just Another Day at the Office?”
“Librarian 2.0 - New Breed or Just Another Day at the Office?”bridgingworlds2008
 

More from bridgingworlds2008 (19)

“Libraries as common denominator: from the citizen, country and global perspe...
“Libraries as common denominator: from the citizen, country and global perspe...“Libraries as common denominator: from the citizen, country and global perspe...
“Libraries as common denominator: from the citizen, country and global perspe...
 
“How do you provide for everyone: success with diverse populations in the UK ...
“How do you provide for everyone: success with diverse populations in the UK ...“How do you provide for everyone: success with diverse populations in the UK ...
“How do you provide for everyone: success with diverse populations in the UK ...
 
“Developing a Metrics-based Online Strategy”
“Developing a Metrics-based Online Strategy”“Developing a Metrics-based Online Strategy”
“Developing a Metrics-based Online Strategy”
 
“How do you provide for everyone: success with diverse populations in the UK ...
“How do you provide for everyone: success with diverse populations in the UK ...“How do you provide for everyone: success with diverse populations in the UK ...
“How do you provide for everyone: success with diverse populations in the UK ...
 
“Virtual Communities in Europe: the cultural mix and how the European Library...
“Virtual Communities in Europe: the cultural mix and how the European Library...“Virtual Communities in Europe: the cultural mix and how the European Library...
“Virtual Communities in Europe: the cultural mix and how the European Library...
 
“The physical library in the 2.0 age and beyond - a UK perspective”
“The physical library in the 2.0 age and beyond - a UK perspective”“The physical library in the 2.0 age and beyond - a UK perspective”
“The physical library in the 2.0 age and beyond - a UK perspective”
 
“Open Source, Crowd Source: harnessing the power of the people behind our lib...
“Open Source, Crowd Source: harnessing the power of the people behind our lib...“Open Source, Crowd Source: harnessing the power of the people behind our lib...
“Open Source, Crowd Source: harnessing the power of the people behind our lib...
 
“New spaces, activities and challenges: village kids in the library”
“New spaces, activities and challenges: village kids in the library”“New spaces, activities and challenges: village kids in the library”
“New spaces, activities and challenges: village kids in the library”
 
“Search Engine 101: Crawling, Ranking, Coverage and Freshness”
“Search Engine 101: Crawling, Ranking, Coverage and Freshness”“Search Engine 101: Crawling, Ranking, Coverage and Freshness”
“Search Engine 101: Crawling, Ranking, Coverage and Freshness”
 
“Public Library 2.0: Making it Happen”
“Public Library 2.0: Making it Happen”“Public Library 2.0: Making it Happen”
“Public Library 2.0: Making it Happen”
 
“Library 2.0: Balancing the Risks and Benefits to Maximise the Dividends”
“Library 2.0: Balancing the Risks and Benefits to Maximise the Dividends”“Library 2.0: Balancing the Risks and Benefits to Maximise the Dividends”
“Library 2.0: Balancing the Risks and Benefits to Maximise the Dividends”
 
“Information Literacy and Web 2.0 : is it just hype?”
“Information Literacy and Web 2.0 :  is it just hype?”“Information Literacy and Web 2.0 :  is it just hype?”
“Information Literacy and Web 2.0 : is it just hype?”
 
“Library spaces in the knowledge society – knotting together global and local”
“Library spaces in the knowledge society – knotting together global and local”“Library spaces in the knowledge society – knotting together global and local”
“Library spaces in the knowledge society – knotting together global and local”
 
“Agency in a socially networked world: library clients increase their room to...
“Agency in a socially networked world: library clients increase their room to...“Agency in a socially networked world: library clients increase their room to...
“Agency in a socially networked world: library clients increase their room to...
 
"Avatars: HVX Silverstar | HVX Voyager & HVX Shephard in Teen Second Life"
"Avatars: HVX Silverstar | HVX Voyager & HVX Shephard in Teen Second Life""Avatars: HVX Silverstar | HVX Voyager & HVX Shephard in Teen Second Life"
"Avatars: HVX Silverstar | HVX Voyager & HVX Shephard in Teen Second Life"
 
"Innovation and Wealth Creation: the British Library Perspective”
"Innovation and Wealth Creation: the British Library Perspective”"Innovation and Wealth Creation: the British Library Perspective”
"Innovation and Wealth Creation: the British Library Perspective”
 
“Librarians 2.0: Sowing Padi into (the) SEA”
“Librarians 2.0: Sowing Padi into (the) SEA”“Librarians 2.0: Sowing Padi into (the) SEA”
“Librarians 2.0: Sowing Padi into (the) SEA”
 
“Library 2.0: Let's get connected!”
“Library 2.0: Let's get connected!”“Library 2.0: Let's get connected!”
“Library 2.0: Let's get connected!”
 
“Librarian 2.0 - New Breed or Just Another Day at the Office?”
“Librarian 2.0 - New Breed or Just Another Day at the Office?”“Librarian 2.0 - New Breed or Just Another Day at the Office?”
“Librarian 2.0 - New Breed or Just Another Day at the Office?”
 

Recently uploaded

Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 

Recently uploaded (20)

Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 

"Ontology-centric navigation of the scientific literature"

  • 1. Ontology-centric Knowledge Navigation .. of the scientific literature Christopher J. O. Baker Institute for InfoComm Research, A*STAR, Singapore
  • 2. Motivation • Scientists typically need to integrate a spectrum of information to successfully complete a task. • On average a scientist or knowledge worker spends 1 day per week searching for, integrating and analyzing information, 50% of which is unstructured digital formats. • Access to information structured according to explicit knowledge representations or taxonomies is a fundamental concern of all scientists. • Moving beyond keyword search requires tools that provide lexical matching to semantic, conceptual and contextual levels of information and this entails an infrastructure for indexing text segments according to domain-specific metadata
  • 3. In the future …. • Users will be involved in the design of information systems • Publishers will charge users for value added search: (who will build such search systems) • Users will search across semantically integration data sources and data types (how to facilitate system creation / adoption) • Knowledge driven systems - rapidly built and deployed with the engagement of domain experts in a knowledge engineering team
  • 4. Literature-driven, Ontology-centric Knowledge Integration and Navigation Visual Query Reasoning Ontology 50 sentences 500 documents, to read blogs, newsfeeds Ontology to browse Population Text Mining Content delivery using expressive semantics
  • 5. W3C Semantic Web Technologies • URI / LSID • Ontologies • Reasoners • Query Languages • Web Services • Service Registries • Agents • Multi Agent Systems • Workflows Engines • GRID / Semantic GRID • Text Mining • Service Oriented Architecture
  • 6. Controlled Vocabularies Ontologies Catalog/ Thesauri Formal General ID “narrower term” is-a Frames logical Controlled vocabularies part-of (properties) constraints Terms/ Informal Formal Value Glossary/Controlled is-a instance restrictions vocabularies part-of Capture knowledge: Make the content in The meaning of important vocabulary information sources explicit. (classes, properties/relations and instance data in a domain model). Common domain terminology Index and query model Basis for interoperability to a repository of information. between information systems.
  • 7. Lipid Ontology > Implementation: OWL-DL > DL Expressivity ALCHIQ > Uses LIPIDMAPS systematic nomenclature > 560 Named classes > 352 Lipid subclasses 71 Object properties (inc inv.) > 4 Datatype properties > Lipid instance: Graph fragment DL Axioms LIPIDMAPS systematic name Lipid Hierarchy > Depth: 8 levels Domain Knowledge vs information Concept Definitions system metadata
  • 10. Ontology-centric Knowledge Integration • Content Delivery Platform - Automated Document delivery from online databases Tools for conversion to text-minable text Content Acquisition • Text Mining - Customized and Automated Regular Expressions, Named Entities, Relations, Domain • Knowledge Engineering – Ontology Creation specific Domain Modeling / Customized Rapid raw text Prototyping • Ontology Population – Automated Instantiation Sentences as instances / Co-occurrence and named relations (Rules)
  • 11. Domian Ontology vs Mixed Metadata: a literature specification
  • 12. Ontology Population Workflow • Ontology based information retrieval applies NLP to link documents to existing ontologies • Ontology-driven NLP - NLP that actively uses ontological resources for NLP tasks • Ontological NLP - ontologies used as a knowledge base for NLP tasks while also exporting the results of NLP analyses into an ontology that can then subsequent semantic queries to the ontology using description logic reasoners and a box reasoning • Ontology based NLP - the results of NLP are exported to another ontology, using external resources for text processing, Witte etal. 2007
  • 13. Text Mining • Class Instance Generation from full text – Named entity recognition (gazetteer based) – Dictionary based matching of text tokens to domain specific vocabularies i.e. (LipidBank, Lipidmaps, KEGG, IUPAC) and curated Swissprot terms and disease ontology of CGM – Normalization and grounding to canonical names • Relation Detection - Role Assertions: – Co-occurrence and Rule-based relation detection of binary pairs from which knowledgebase instances are generated. Primary set of binary interactions mined from text: – Lipid-Protein, Lipid-Disease, Protein-Disease – Domain specific library of curated biological relations.
  • 14. Knowledgebase Instantiation 1) Rule based identification of Sentences containing target keywords 2) Instantiation with JENA API http://jena.sourceforge.net/ for this purpose. Target keywords found in sentences are instantiated to corresponding ontology class • Lipid / Protein / Disease instances are instantiated to the respective ontology classes (as tagged by the gazetteer) • Binary pairs instantiated to the respective Object Properties as role assertions • Sentences instantiated to the respective Data type properties. For each lipid identified in a sentence the corresponding data are instantiated to the ontology from Lipid Data Warehouse records requiring no further text processing. • Lipid - LIPIDMAPS Systematic Name and its associated • Lipid - IUPAC Name, Lipid – synonyms, Lipid - Database ID.
  • 15. Knowledgebase Instantiation Rule Based Sentence Processing <Lipid> AND <Protein> AND LipidProteinInteraction-TriggerWord e.g. quot;interactquot;, quot;bindquot;, quot;mediatequot; <Lipid> AND <Disease> AND LipidDiseaseInteraction-TriggerWord e.g quot;involvequot;, quot;causequot; Lipid Class Protein Instance Lipid Instance Lipid Instance
  • 16. Knowledge Integration and Query User input query Search Web content or Engine Full text papers NLP tagging Papers identified: 262 121 papers with no lipid protein relations 141 papers contributed to ontology instantiation 186 lipid names docs 528 protein names tagged After normalisation and grounding: with 92 Lipidmaps systematic names relevant 52 IUPAC names, 412 exact synonyms, 6 broad synonyms, 319 protein names name Cross link to 59 Lipidbank entries entities Sentences: Co-occurrence before rules 1356 Sentences, After rules 683 Interaction sentences 92 Lipidmaps names instantiated to 35 classes (2.6 lipids per class) Instantiation Time: 22 seconds Ontology Knowledge “Instantiated ontology” instantiation User Output for end user Navigation Baker CJ, Kanagasabai R, Ang WT, Veeramani A, Low HS, and vehicle Wenk MR. Towards ontology-driven navigation of the lipid bibliosphere. BMC Bioinformatics. 2008;9 Suppl 1:S5.
  • 17. Knowledge Integration and Query User input query Search Web content or Engine Full text papers NLP tagging docs tagged with relevant name entities Ontology Knowledge “Instantiated ontology” instantiation User Output for end user Navigation Baker CJ, Kanagasabai R, Ang WT, Veeramani A, Low HS, and vehicle Wenk MR. Towards ontology-driven navigation of the lipid bibliosphere. BMC Bioinformatics. 2008;9 Suppl 1:S5.
  • 18. Knowlegator Query Composition Panel Results Panel Ontology Content Query Syntax Query Engine Concept Dialogue Properties Overview
  • 19. Complex Query Generation rma tician In f o x pert ain e D om Find documents and sentences describing proteins- lipid interaction and corresponding lipid synonyms.
  • 20. Pathway Discovery Algorithm Finds transitive paths across the graph: between source and target concepts. Can define path length and result size … paths between any object properties or a user defined object properties only e.g. protein interacts with protein
  • 21. Pathway Knowledge Discovery 2 concepts or keywords ... across Results with multiple Kanagasabai R. Low HS ,Ang WT, Wenk MR, Baker CJO. semantic labelling Ontology-centric navigation of pathway information mined from text, Bio-Ontologies SIG: Knowledge in Biology, ISMB July 2008 relations
  • 24. 1 search term (instance or concept) generates a list of natural language questions answerable by the ontology and a direct link to answers Ang WT, Kanagasabai R, Baker CJ. Knowledge Translation: Computing the query potential of bio-ontologies, Genome Informatics Workshop 2008 Submitted …..
  • 27. Knowledge Services: Development Knowledge Worker involved in Discovery Navigation Paradigms Ontology Engineering Quality Semantic Evolution Data Maintenance Integration NLP & Text Databases Mining Multi-user involvement Domain Expert Text Mining Ontology Engineer Phase 1 Phase 2 Semantics Ontology Engineer Engineer
  • 29. Acknowledgements Semantic Technology Group Christopher J. O. Baker Kanagasabi Rajaraman Menaka Rajapakse Anitha Veeramani Ang Wee Tiong Alexander Garcia (Alumnus) Collaborators Markus R Wenk, NUS Low Hong-Sang, NUS Choo Kar Heng, I2R Shoba Ranganathan NUS Suisheng Tan, I2R