SlideShare a Scribd company logo
1 of 12
Download to read offline
Semantic-assisted Analysis and 
Search in Customer Specifications 
Martin Voigt, Daniel Hladky 
September 2014 
1 
ONTOS LINKED DATA INFORMATION WORKBENCH 
Extraction & Analysis 
Indexing 
Information & 
Knowledge Management 
Search 
Engineer 
Storage 
Sales 
Portal 
Multilingual 
Specifications
I speakabout… 
The Problem, 
Our Solution, 
Insights & Further Work. 
2
The Problem 
AviComp Controls GmbH 
 leading engineering contractor 
for rotating machinery controls 
3 
Customers 
Engineers 
Sales 
> 100k Technical 
Specifications 
http://www.avicomp.com/capabilities/turbo-compressor-controls.html
The Problem 
Analysis: 1) task, 2) current solution, 3) ideas 
Problems 
Multiple, inefficient tools 
Heterogeneity 
Knowledge management & transfer 
4 
http://answerhub.com/article/ the-cost-of-knowledge-loss/
Our Solution 
5 
ONTOS LINKED DATA INFORMATION WORKBENCH 
Extraction & Analysis 
Indexing 
Information & 
Knowledge Management 
Search 
Engineer 
Storage 
Sales 
Portal 
Multilingual 
Specifications 
http://www.ontos.com/products/ontosldiw/
Our Solution 
Extraction& Analysis 
Homogenization: PDF conversion (Apache POI) & OCR (CuneiForm) 
Text extraction (Apache Tika) 
Language detection (language-detection API) 
Text preparation, e.g., remove headers & footers 
SKOS-based concept identification 
6 
Lorem ipsum dolor sit amet, consetetursadipscing 
elitr, seddiamnonumyeirmodtemporinviduntutlaboreet doloremagna aliquyam 
erat, seddiamvoluptua. At veroeoset accusamet justoduo doloreset earebum. Stet 
clitakasdgubergren, no sea takimata 
sanctusestLorem ipsum dolor sit 
elitr, seddiamnonumyeirmodtemporinviduntutlaboreet doloremagna aliquyam 
erat, seddiamvoluptua. At veroeoset accusamet justoduo doloreset earebum. Stet 
clitakasdgubergren, no sea takimata 
elitr, seddiamnonumyeirmodtemporinviduntutlaboreet doloremagna aliquyam 
erat, seddiamvoluptua. At veroeoset accusamet justoduo doloreset earebum. Stet 
clitakasdgubergren, no sea takimata ONTOS LINKED DATA INFORMATION WORKBENCHExtraction & AnalysisIndexingInformation & Knowledge ManagementSearchEngineer Storage Sales Portal MultilingualSpecifications
Our Solution 
Storage via OntoQUAD 
 Triple and/or QuadStore, SPARQL 1.1, … 
Indexing 
 Full text search, result grouping, faceted browsing, 
SKOS-based label expansion, … 
 Apache Solr with lucene-skos plugin 
(https://github.com/behas/lucene-SKOS) 
7 
ONTOS LINKED DATA INFORMATION WORKBENCH 
Extraction & Analysis 
Indexing 
Information & 
Knowledge Management 
Search 
Engineer 
Storage 
Sales 
Portal 
Multilingual 
Specifications
Our Solution 
Knowledge Management 
via OntoDixbut SKOS-only 
8 
ONTOS LINKED DATA INFORMATION WORKBENCHExtraction & AnalysisIndexingInformation & Knowledge ManagementSearchEngineer Storage Sales Portal MultilingualSpecifications
Our Solution 
Search 
via AJAX Solr(https://github.com/evolvingweb/ajax-solr) 
9 
ONTOS LINKED DATA INFORMATION WORKBENCHExtraction & AnalysisIndexingInformation & Knowledge ManagementSearchEngineer Storage Sales Portal MultilingualSpecifications
Insights & Further Work 
Iterative development with early customer testing lowers usage barrier 
Lessons learned 
Development of a knowledge base 
Faceted search user interface 
Faceted search on RDF 
Multilingual disambiguationmechanisms 
10
Q&A 
Martin Voigt 
Ontos AG / GmbH 
Nidau(CH) / Leipzig (DE) 
T:+49 341 21559-10 
M:+49 178 40 222 58 
E: martin.voigt@ontos.com 
11
About Ontos 
12 
12 
DoW – CTI Project 
Ontos Group 
Key Facts 
- Established 2001 
- 15+ employees 
- Share in Eventos RU 
(30 people) 
- 5± Mio CHF turnover 
Industry 
- Media/News 
- Law Enforcement 
- Government 
- (Russia)

More Related Content

Similar to Semantic analysis and search of customer specifications

Microsoft Power BI and Cortana Analytics user group meetings with Alteryx
Microsoft Power BI and Cortana Analytics user group meetings with AlteryxMicrosoft Power BI and Cortana Analytics user group meetings with Alteryx
Microsoft Power BI and Cortana Analytics user group meetings with AlteryxHåkan Söderbom
 
apidays LIVE Paris 2021 - Building an analytics API by David Wobrock, Botify
apidays LIVE Paris 2021 - Building an analytics API by David Wobrock, Botifyapidays LIVE Paris 2021 - Building an analytics API by David Wobrock, Botify
apidays LIVE Paris 2021 - Building an analytics API by David Wobrock, Botifyapidays
 
Vertex AI - Unified ML Platform for the entire AI workflow on Google Cloud
Vertex AI - Unified ML Platform for the entire AI workflow on Google CloudVertex AI - Unified ML Platform for the entire AI workflow on Google Cloud
Vertex AI - Unified ML Platform for the entire AI workflow on Google CloudMárton Kodok
 
An AI-Powered Chatbot to Simplify Apache Spark Performance Management
An AI-Powered Chatbot to Simplify Apache Spark Performance ManagementAn AI-Powered Chatbot to Simplify Apache Spark Performance Management
An AI-Powered Chatbot to Simplify Apache Spark Performance ManagementDatabricks
 
ActiveWarehouse/ETL - BI & DW for Ruby/Rails
ActiveWarehouse/ETL - BI & DW for Ruby/RailsActiveWarehouse/ETL - BI & DW for Ruby/Rails
ActiveWarehouse/ETL - BI & DW for Ruby/RailsPaul Gallagher
 
Apache Spark – The New Enterprise Backbone for ETL, Batch Processing and Real...
Apache Spark – The New Enterprise Backbone for ETL, Batch Processing and Real...Apache Spark – The New Enterprise Backbone for ETL, Batch Processing and Real...
Apache Spark – The New Enterprise Backbone for ETL, Batch Processing and Real...Impetus Technologies
 
Semtech 2011 impressions
Semtech 2011 impressionsSemtech 2011 impressions
Semtech 2011 impressionsGeorge Roth
 
Webinar: Open Source Business Intelligence Intro
Webinar: Open Source Business Intelligence IntroWebinar: Open Source Business Intelligence Intro
Webinar: Open Source Business Intelligence IntroSpagoWorld
 
Berlin buzzwords 2020-feature-store-dowling
Berlin buzzwords 2020-feature-store-dowlingBerlin buzzwords 2020-feature-store-dowling
Berlin buzzwords 2020-feature-store-dowlingJim Dowling
 
The power of faceted search in alfresco
The power of faceted search in alfrescoThe power of faceted search in alfresco
The power of faceted search in alfrescoXeniT Solutions nv
 
Open Source Enterprise Search meets Open Source Enterprise CMS - Apache Solr ...
Open Source Enterprise Search meets Open Source Enterprise CMS - Apache Solr ...Open Source Enterprise Search meets Open Source Enterprise CMS - Apache Solr ...
Open Source Enterprise Search meets Open Source Enterprise CMS - Apache Solr ...Ingo Renner
 
Sharepoint 2013-applied architecture from the field v3 (public)
Sharepoint 2013-applied architecture from the field v3 (public)Sharepoint 2013-applied architecture from the field v3 (public)
Sharepoint 2013-applied architecture from the field v3 (public)Tihomir Ignatov
 
TDC2018SP | Trilha Computacao Cognitiva - Sentiment Analysis com Power BI e C...
TDC2018SP | Trilha Computacao Cognitiva - Sentiment Analysis com Power BI e C...TDC2018SP | Trilha Computacao Cognitiva - Sentiment Analysis com Power BI e C...
TDC2018SP | Trilha Computacao Cognitiva - Sentiment Analysis com Power BI e C...tdc-globalcode
 
TDC2018SP | Trilha Comp Cognitiva - Sentiment Analysis com Power BI e Cogniti...
TDC2018SP | Trilha Comp Cognitiva - Sentiment Analysis com Power BI e Cogniti...TDC2018SP | Trilha Comp Cognitiva - Sentiment Analysis com Power BI e Cogniti...
TDC2018SP | Trilha Comp Cognitiva - Sentiment Analysis com Power BI e Cogniti...tdc-globalcode
 
SharePoint 2013 Dev Features
SharePoint 2013 Dev FeaturesSharePoint 2013 Dev Features
SharePoint 2013 Dev FeaturesRicardo Wilkins
 
SharePoint Advanced Administration with Joel Oleson, Shane Young and Mike Watson
SharePoint Advanced Administration with Joel Oleson, Shane Young and Mike WatsonSharePoint Advanced Administration with Joel Oleson, Shane Young and Mike Watson
SharePoint Advanced Administration with Joel Oleson, Shane Young and Mike WatsonJoel Oleson
 
OFF SHORE RECRUITER TRAINING
OFF SHORE RECRUITER TRAININGOFF SHORE RECRUITER TRAINING
OFF SHORE RECRUITER TRAININGsatish_kumar646
 
Discussion for Anomaly & Prediction Engine
Discussion for Anomaly & Prediction EngineDiscussion for Anomaly & Prediction Engine
Discussion for Anomaly & Prediction EngineHisashiOsanai
 
Data Engineer's Lunch #81: Reverse ETL Tools for Modern Data Platforms
Data Engineer's Lunch #81: Reverse ETL Tools for Modern Data PlatformsData Engineer's Lunch #81: Reverse ETL Tools for Modern Data Platforms
Data Engineer's Lunch #81: Reverse ETL Tools for Modern Data PlatformsAnant Corporation
 

Similar to Semantic analysis and search of customer specifications (20)

Microsoft Power BI and Cortana Analytics user group meetings with Alteryx
Microsoft Power BI and Cortana Analytics user group meetings with AlteryxMicrosoft Power BI and Cortana Analytics user group meetings with Alteryx
Microsoft Power BI and Cortana Analytics user group meetings with Alteryx
 
apidays LIVE Paris 2021 - Building an analytics API by David Wobrock, Botify
apidays LIVE Paris 2021 - Building an analytics API by David Wobrock, Botifyapidays LIVE Paris 2021 - Building an analytics API by David Wobrock, Botify
apidays LIVE Paris 2021 - Building an analytics API by David Wobrock, Botify
 
Vertex AI - Unified ML Platform for the entire AI workflow on Google Cloud
Vertex AI - Unified ML Platform for the entire AI workflow on Google CloudVertex AI - Unified ML Platform for the entire AI workflow on Google Cloud
Vertex AI - Unified ML Platform for the entire AI workflow on Google Cloud
 
PoolParty Overview
PoolParty OverviewPoolParty Overview
PoolParty Overview
 
An AI-Powered Chatbot to Simplify Apache Spark Performance Management
An AI-Powered Chatbot to Simplify Apache Spark Performance ManagementAn AI-Powered Chatbot to Simplify Apache Spark Performance Management
An AI-Powered Chatbot to Simplify Apache Spark Performance Management
 
ActiveWarehouse/ETL - BI & DW for Ruby/Rails
ActiveWarehouse/ETL - BI & DW for Ruby/RailsActiveWarehouse/ETL - BI & DW for Ruby/Rails
ActiveWarehouse/ETL - BI & DW for Ruby/Rails
 
Apache Spark – The New Enterprise Backbone for ETL, Batch Processing and Real...
Apache Spark – The New Enterprise Backbone for ETL, Batch Processing and Real...Apache Spark – The New Enterprise Backbone for ETL, Batch Processing and Real...
Apache Spark – The New Enterprise Backbone for ETL, Batch Processing and Real...
 
Semtech 2011 impressions
Semtech 2011 impressionsSemtech 2011 impressions
Semtech 2011 impressions
 
Webinar: Open Source Business Intelligence Intro
Webinar: Open Source Business Intelligence IntroWebinar: Open Source Business Intelligence Intro
Webinar: Open Source Business Intelligence Intro
 
Berlin buzzwords 2020-feature-store-dowling
Berlin buzzwords 2020-feature-store-dowlingBerlin buzzwords 2020-feature-store-dowling
Berlin buzzwords 2020-feature-store-dowling
 
The power of faceted search in alfresco
The power of faceted search in alfrescoThe power of faceted search in alfresco
The power of faceted search in alfresco
 
Open Source Enterprise Search meets Open Source Enterprise CMS - Apache Solr ...
Open Source Enterprise Search meets Open Source Enterprise CMS - Apache Solr ...Open Source Enterprise Search meets Open Source Enterprise CMS - Apache Solr ...
Open Source Enterprise Search meets Open Source Enterprise CMS - Apache Solr ...
 
Sharepoint 2013-applied architecture from the field v3 (public)
Sharepoint 2013-applied architecture from the field v3 (public)Sharepoint 2013-applied architecture from the field v3 (public)
Sharepoint 2013-applied architecture from the field v3 (public)
 
TDC2018SP | Trilha Computacao Cognitiva - Sentiment Analysis com Power BI e C...
TDC2018SP | Trilha Computacao Cognitiva - Sentiment Analysis com Power BI e C...TDC2018SP | Trilha Computacao Cognitiva - Sentiment Analysis com Power BI e C...
TDC2018SP | Trilha Computacao Cognitiva - Sentiment Analysis com Power BI e C...
 
TDC2018SP | Trilha Comp Cognitiva - Sentiment Analysis com Power BI e Cogniti...
TDC2018SP | Trilha Comp Cognitiva - Sentiment Analysis com Power BI e Cogniti...TDC2018SP | Trilha Comp Cognitiva - Sentiment Analysis com Power BI e Cogniti...
TDC2018SP | Trilha Comp Cognitiva - Sentiment Analysis com Power BI e Cogniti...
 
SharePoint 2013 Dev Features
SharePoint 2013 Dev FeaturesSharePoint 2013 Dev Features
SharePoint 2013 Dev Features
 
SharePoint Advanced Administration with Joel Oleson, Shane Young and Mike Watson
SharePoint Advanced Administration with Joel Oleson, Shane Young and Mike WatsonSharePoint Advanced Administration with Joel Oleson, Shane Young and Mike Watson
SharePoint Advanced Administration with Joel Oleson, Shane Young and Mike Watson
 
OFF SHORE RECRUITER TRAINING
OFF SHORE RECRUITER TRAININGOFF SHORE RECRUITER TRAINING
OFF SHORE RECRUITER TRAINING
 
Discussion for Anomaly & Prediction Engine
Discussion for Anomaly & Prediction EngineDiscussion for Anomaly & Prediction Engine
Discussion for Anomaly & Prediction Engine
 
Data Engineer's Lunch #81: Reverse ETL Tools for Modern Data Platforms
Data Engineer's Lunch #81: Reverse ETL Tools for Modern Data PlatformsData Engineer's Lunch #81: Reverse ETL Tools for Modern Data Platforms
Data Engineer's Lunch #81: Reverse ETL Tools for Modern Data Platforms
 

Recently uploaded

Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio, Inc.
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxTier1 app
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureDinusha Kumarasiri
 
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...OnePlan Solutions
 
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Angel Borroy López
 
How to Track Employee Performance A Comprehensive Guide.pdf
How to Track Employee Performance A Comprehensive Guide.pdfHow to Track Employee Performance A Comprehensive Guide.pdf
How to Track Employee Performance A Comprehensive Guide.pdfLivetecs LLC
 
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...OnePlan Solutions
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmSujith Sukumaran
 
Cyber security and its impact on E commerce
Cyber security and its impact on E commerceCyber security and its impact on E commerce
Cyber security and its impact on E commercemanigoyal112
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityNeo4j
 
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfGOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfAlina Yurenko
 
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...confluent
 
What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....kzayra69
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEOrtus Solutions, Corp
 
Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Hr365.us smith
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaHanief Utama
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Cizo Technology Services
 
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Matt Ray
 
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024StefanoLambiase
 
Introduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdfIntroduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdfFerryKemperman
 

Recently uploaded (20)

Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with Azure
 
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
 
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
 
How to Track Employee Performance A Comprehensive Guide.pdf
How to Track Employee Performance A Comprehensive Guide.pdfHow to Track Employee Performance A Comprehensive Guide.pdf
How to Track Employee Performance A Comprehensive Guide.pdf
 
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalm
 
Cyber security and its impact on E commerce
Cyber security and its impact on E commerceCyber security and its impact on E commerce
Cyber security and its impact on E commerce
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfGOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
 
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
 
What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
 
Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief Utama
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
 
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
 
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
 
Introduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdfIntroduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdf
 

Semantic analysis and search of customer specifications

  • 1. Semantic-assisted Analysis and Search in Customer Specifications Martin Voigt, Daniel Hladky September 2014 1 ONTOS LINKED DATA INFORMATION WORKBENCH Extraction & Analysis Indexing Information & Knowledge Management Search Engineer Storage Sales Portal Multilingual Specifications
  • 2. I speakabout… The Problem, Our Solution, Insights & Further Work. 2
  • 3. The Problem AviComp Controls GmbH  leading engineering contractor for rotating machinery controls 3 Customers Engineers Sales > 100k Technical Specifications http://www.avicomp.com/capabilities/turbo-compressor-controls.html
  • 4. The Problem Analysis: 1) task, 2) current solution, 3) ideas Problems Multiple, inefficient tools Heterogeneity Knowledge management & transfer 4 http://answerhub.com/article/ the-cost-of-knowledge-loss/
  • 5. Our Solution 5 ONTOS LINKED DATA INFORMATION WORKBENCH Extraction & Analysis Indexing Information & Knowledge Management Search Engineer Storage Sales Portal Multilingual Specifications http://www.ontos.com/products/ontosldiw/
  • 6. Our Solution Extraction& Analysis Homogenization: PDF conversion (Apache POI) & OCR (CuneiForm) Text extraction (Apache Tika) Language detection (language-detection API) Text preparation, e.g., remove headers & footers SKOS-based concept identification 6 Lorem ipsum dolor sit amet, consetetursadipscing elitr, seddiamnonumyeirmodtemporinviduntutlaboreet doloremagna aliquyam erat, seddiamvoluptua. At veroeoset accusamet justoduo doloreset earebum. Stet clitakasdgubergren, no sea takimata sanctusestLorem ipsum dolor sit elitr, seddiamnonumyeirmodtemporinviduntutlaboreet doloremagna aliquyam erat, seddiamvoluptua. At veroeoset accusamet justoduo doloreset earebum. Stet clitakasdgubergren, no sea takimata elitr, seddiamnonumyeirmodtemporinviduntutlaboreet doloremagna aliquyam erat, seddiamvoluptua. At veroeoset accusamet justoduo doloreset earebum. Stet clitakasdgubergren, no sea takimata ONTOS LINKED DATA INFORMATION WORKBENCHExtraction & AnalysisIndexingInformation & Knowledge ManagementSearchEngineer Storage Sales Portal MultilingualSpecifications
  • 7. Our Solution Storage via OntoQUAD  Triple and/or QuadStore, SPARQL 1.1, … Indexing  Full text search, result grouping, faceted browsing, SKOS-based label expansion, …  Apache Solr with lucene-skos plugin (https://github.com/behas/lucene-SKOS) 7 ONTOS LINKED DATA INFORMATION WORKBENCH Extraction & Analysis Indexing Information & Knowledge Management Search Engineer Storage Sales Portal Multilingual Specifications
  • 8. Our Solution Knowledge Management via OntoDixbut SKOS-only 8 ONTOS LINKED DATA INFORMATION WORKBENCHExtraction & AnalysisIndexingInformation & Knowledge ManagementSearchEngineer Storage Sales Portal MultilingualSpecifications
  • 9. Our Solution Search via AJAX Solr(https://github.com/evolvingweb/ajax-solr) 9 ONTOS LINKED DATA INFORMATION WORKBENCHExtraction & AnalysisIndexingInformation & Knowledge ManagementSearchEngineer Storage Sales Portal MultilingualSpecifications
  • 10. Insights & Further Work Iterative development with early customer testing lowers usage barrier Lessons learned Development of a knowledge base Faceted search user interface Faceted search on RDF Multilingual disambiguationmechanisms 10
  • 11. Q&A Martin Voigt Ontos AG / GmbH Nidau(CH) / Leipzig (DE) T:+49 341 21559-10 M:+49 178 40 222 58 E: martin.voigt@ontos.com 11
  • 12. About Ontos 12 12 DoW – CTI Project Ontos Group Key Facts - Established 2001 - 15+ employees - Share in Eventos RU (30 people) - 5± Mio CHF turnover Industry - Media/News - Law Enforcement - Government - (Russia)