SlideShare a Scribd company logo
1 of 31
Download to read offline
The Power of Faceted Search
in Alfresco

Roxana Angheluta
Willem Van den Eynde
This presentation

●   Who are we ?
●   Alfresco
●   Faceted Search
●   Demo
Willem Van den Eynde

● 2004 - 2007
   ○ Bachelor applied informatics, KHLeuven

● 2006 - 2007
   ○ LiU Erasmus Sweden

● 2007
   ○ internship in Paris

● 2007 - 2010
   ○ Master in applied informatics KULeuven

● 2010 - current
   ○ Software Engineer XeniT Leuven
Roxana Angheluta

● 1995 - 1999
   ○ Bachelor informatics, University of Bucharest

● 2000 - 2001
   ○ Erasmus student KULeuven

● 2001-2004
   ○ Assistant researcher KULeuven

● 2003-2004
   ○ Master in Artificial Intelligence KULeuven

● 2004 - 2012
   ○ Software Engineer Attentio Brussels

● 2012 - 2013
   ○ Software Engineer XeniT Leuven
Introducing XeniT

              Managing content in a
              smart way




2009 - Proprietary and Confidential Information of Xenit Solutions
2009 - Proprietary and Confidential Information of Xenit Solutions



From our home base

          With an enthusiastic and experienced team




                     In collaboration with our customers
2009 - Proprietary and Confidential Information of Xenit Solutions



The corporate story of XeniT




 IWT project
 Concurrent
collaboration




                3.5 M
                docs
                                                                                          Alfresco-
                                 8M                                                         As-A-
                                 docs                                                      Service


2007            2008    2009   2010     2011                     2012                    2013
Maidenhead, UK Global Headquarters   Atlanta, US Headquarters




Alfresco is the largest private, pure-
play open source software company
in the world.
4 million+ downloads of Alfresco community
75,000+ sites running community
2000+ Enterprise customers from 43+ countries
200+ channel partners
20 consecutive quarters of revenue growth
founded in 2005
What is Alfresco



Alfresco is an open source enterprise content
management system
What is Alfresco ?


●   Enterprise Content Management (ECM)
      is a formalized means of organizing and storing
     an organization's documents, and other content,
      that relate to the organization's processes. The
        term encompasses strategies, methods, and
          tools used throughout the lifecycle of the
                          content.
Classification and Retrieval

●   Classification
●   Retrieval
FAQ

●   How does an open-source company like Alfresco
    generate revenue ?
●   Alfresco vs Microsoft SharePoint
2009 - Proprietary and Confidential Information of Xenit Solutions




Alfresco demo
Search in Alfresco

● Many search engines out there, few engines
  really good, fewer open source
● Requirements:
   ○   accurate
   ○   performant
   ○   flexible
   ○   cross-platform
   ○   scalable
   ○   mature
● Lucene
   ○ https://lucene.apache.org/
● Starting with Alfresco 4.0 => Solr
   ○ http://lucene.apache.org/solr/
Lucene

● Java-based indexing and search library, as
  well as spellchecking, hit highlighting and
  advanced analysis/tokenization capabilities

● History
  Doug Cutting originally wrote Lucene in 1999.[2] It was initially available for download from its home at the SourceForge
  web site. It joined the Apache Software Foundation's Jakarta family of open-source Java products in September 2001
  and became its own top-level Apache project in February 2005.




● Many projects based on Lucene: Solr,
  Nutch, Elasticsearch
Lucene

Indexing
 ●   over 150GB/hour on modern hardware
 ●   small RAM requirements -- only 1MB heap
 ●   incremental indexing as fast as batch indexing
 ●   index size roughly 20-30% the size of text indexed


Searching
 ●   ranked searching -- best results returned first
 ●   many powerful query types: phrase queries, wildcard queries, proximity queries, range queries and more
 ●   fielded searching (e.g. title, author, contents)
 ●   sorting by any field
 ●   multiple-index searching with merged results
 ●   allows simultaneous update and searching
 ●   flexible faceting, highlighting, joins and result grouping
 ●   fast, memory-efficient and typo-tolerant suggesters
 ●   pluggable ranking models, including the Vector Space Model and Okapi BM25
 ●   configurable storage engine (codecs)
2009 - Proprietary and Confidential Information of Xenit Solutions




Lucene in Alfresco
http://www.slideshare.net/JM.Pascal/alfresco-search-tutorial-presentation
2009 - Proprietary and Confidential Information of Xenit Solutions




Lucene in Alfresco
http://www.slideshare.net/JM.Pascal/alfresco-search-tutorial-presentation
2009 - Proprietary and Confidential Information of Xenit Solutions




Lucene in Alfresco
http://www.slideshare.net/JM.Pascal/alfresco-search-tutorial-presentation
2009 - Proprietary and Confidential Information of Xenit Solutions




Lucene in Alfresco
http://www.slideshare.net/JM.Pascal/alfresco-search-tutorial-presentation
2009 - Proprietary and Confidential Information of Xenit Solutions




Lucene in Alfresco
http://www.slideshare.net/JM.Pascal/alfresco-search-tutorial-presentation
2009 - Proprietary and Confidential Information of Xenit Solutions




Lucene in Alfresco
http://www.slideshare.net/JM.Pascal/alfresco-search-tutorial-presentation




The way to preserve information in the Lucene
index is specified in Alfresco's data models
Main concept: tokenization
2009 - Proprietary and Confidential Information of Xenit Solutions




 Lucene in Alfresco
 http://www.slideshare.net/JM.Pascal/alfresco-search-tutorial-presentation




with




 without tokenization                                                        with tokenization
2009 - Proprietary and Confidential Information of Xenit Solutions




Lucene in Alfresco


● Out of the box search:
  ○ search in all items, in a certain property or in the
    content (full text search)

  ○ additionally: PATH, ASPECT, CATEGORY searches

  ○ Lucene syntax allowed:
    ■ boolean queries
    ■ wildcard queries
    ■ range queries
Solr

● Standalone full-text search server within a
  servlet container such as Tomcat. Uses
  Lucene library and has REST-like
  HTTP/XML and JSON API. Has an
  extensive plugin architecture.
●  In 2004, Solr was created by Yonik Seeley at CNET_Networks and in January 2006 the source code was donated to the
   Apache Software Foundation under the Lucene top-level project. In March 2010, the Lucene and Solr projects merged and
   consequently in 2011, the Solr version number scheme was changed in order to match that of Lucene.


● Many users:
   ○       http://wiki.apache.org/solr/PublicServers
Solr

 ●   Uses the Lucene library for full-text search
 ●   Faceted navigation
 ●   Hit highlighting
 ●   Query language supports structured as well as textual search
 ●   JSON, XML, PHP, Ruby, Python, XSLT, Velocity and custom Java binary output formats over HTTP
 ●   HTML administration interface
 ●   Replication to other Solr servers - enables scaling QPS
 ●   Distributed Search through Sharding - enables scaling content volume
 ●   Search results clustering based on Carrot2
 ●   Extensible through plugins
 ●   Pluggable relevance - boost through formula
 ●   Caching
 ●   Embeddable in a Java Application
Faceted Search in Alfresco

● A way to navigate through the documents,
  showing counts per property value and
  offering the possibility to drill down in the
  data
● Faceted search supported by Lucene/Solr,
  not yet supported by Alfresco
● Implemented by Xenit in Fred
Faceted Search in Alfresco
Faceted Search in Alfresco

● Questions
   ○ which fields should be facetable?
     ■ only the ones with a limited set of possible values
     ■ only the ones which are untokenized
     ■ plus ranges: dates and numbers
   ○ how to navigate inside facets?


● Current implementation
   ○ facetable fields configurable in a file
   ○ date ranges and number ranges not supported yet
   ○ drilling-down in a single value possible
Faceted Search in Fred: mockup
Demo

More Related Content

Similar to The power of faceted search in alfresco

Introduction to Alfresco
Introduction to AlfrescoIntroduction to Alfresco
Introduction to Alfresco
Wildan Maulana
 
A fresh approach_to_content_management
A fresh approach_to_content_managementA fresh approach_to_content_management
A fresh approach_to_content_management
ECNU
 
Integrating Alfresco with Liferay Portal for Document-Centric Use Cases
Integrating Alfresco with Liferay Portal for Document-Centric Use CasesIntegrating Alfresco with Liferay Portal for Document-Centric Use Cases
Integrating Alfresco with Liferay Portal for Document-Centric Use Cases
rivetlogic
 
S. Bartoli & F. Pompermaier – A Semantic Big Data Companion
S. Bartoli & F. Pompermaier – A Semantic Big Data CompanionS. Bartoli & F. Pompermaier – A Semantic Big Data Companion
S. Bartoli & F. Pompermaier – A Semantic Big Data Companion
Flink Forward
 

Similar to The power of faceted search in alfresco (20)

Introduction to Alfresco
Introduction to AlfrescoIntroduction to Alfresco
Introduction to Alfresco
 
A fresh approach_to_content_management
A fresh approach_to_content_managementA fresh approach_to_content_management
A fresh approach_to_content_management
 
WCM-5 WCM Solutions with Drupal and Alfresco
WCM-5 WCM Solutions with Drupal and AlfrescoWCM-5 WCM Solutions with Drupal and Alfresco
WCM-5 WCM Solutions with Drupal and Alfresco
 
Phil Ritchie | Putting Standards into Action: Multilingual and Semantic Enric...
Phil Ritchie | Putting Standards into Action: Multilingual and Semantic Enric...Phil Ritchie | Putting Standards into Action: Multilingual and Semantic Enric...
Phil Ritchie | Putting Standards into Action: Multilingual and Semantic Enric...
 
Integrating Alfresco with Liferay Portal for Document-Centric Use Cases
Integrating Alfresco with Liferay Portal for Document-Centric Use CasesIntegrating Alfresco with Liferay Portal for Document-Centric Use Cases
Integrating Alfresco with Liferay Portal for Document-Centric Use Cases
 
EasySOA introduction and video demos - fOSSa 2011
EasySOA introduction and video demos - fOSSa 2011EasySOA introduction and video demos - fOSSa 2011
EasySOA introduction and video demos - fOSSa 2011
 
Whats new in alfresco community 3.4
Whats new in alfresco community 3.4Whats new in alfresco community 3.4
Whats new in alfresco community 3.4
 
Alfresco 4.0 - A Complete Introduction
 Alfresco 4.0 - A Complete Introduction Alfresco 4.0 - A Complete Introduction
Alfresco 4.0 - A Complete Introduction
 
Appnovation Intro Deck
Appnovation Intro DeckAppnovation Intro Deck
Appnovation Intro Deck
 
Key topics when migrating from FAST to Solr, EuroCon 2010
Key topics when migrating from FAST to Solr, EuroCon 2010Key topics when migrating from FAST to Solr, EuroCon 2010
Key topics when migrating from FAST to Solr, EuroCon 2010
 
Mission to NARs with Apache NiFi
Mission to NARs with Apache NiFiMission to NARs with Apache NiFi
Mission to NARs with Apache NiFi
 
Web Experience Management with Alfresco and Crafter rivet
Web Experience Management with Alfresco and Crafter rivetWeb Experience Management with Alfresco and Crafter rivet
Web Experience Management with Alfresco and Crafter rivet
 
Html5 Taller Campus Party Vfinal2l
Html5 Taller Campus Party Vfinal2lHtml5 Taller Campus Party Vfinal2l
Html5 Taller Campus Party Vfinal2l
 
Deep learning on HDP 2018 Prague
Deep learning on HDP 2018 PragueDeep learning on HDP 2018 Prague
Deep learning on HDP 2018 Prague
 
Semantic-assisted Analysis and Search in Customer Specifications
Semantic-assisted Analysis and Search in Customer SpecificationsSemantic-assisted Analysis and Search in Customer Specifications
Semantic-assisted Analysis and Search in Customer Specifications
 
FluentD for end to end monitoring
FluentD for end to end monitoringFluentD for end to end monitoring
FluentD for end to end monitoring
 
Migrating Fast to Solr
Migrating Fast to SolrMigrating Fast to Solr
Migrating Fast to Solr
 
S. Bartoli & F. Pompermaier – A Semantic Big Data Companion
S. Bartoli & F. Pompermaier – A Semantic Big Data CompanionS. Bartoli & F. Pompermaier – A Semantic Big Data Companion
S. Bartoli & F. Pompermaier – A Semantic Big Data Companion
 
Freme at feisgiltt 2015 freme & linked data & localisers
Freme at feisgiltt 2015   freme & linked data & localisersFreme at feisgiltt 2015   freme & linked data & localisers
Freme at feisgiltt 2015 freme & linked data & localisers
 
Fremeatfeisgiltt2015 fremelinkeddatalocalisers-150603090934-lva1-app6891
Fremeatfeisgiltt2015 fremelinkeddatalocalisers-150603090934-lva1-app6891Fremeatfeisgiltt2015 fremelinkeddatalocalisers-150603090934-lva1-app6891
Fremeatfeisgiltt2015 fremelinkeddatalocalisers-150603090934-lva1-app6891
 

More from XeniT Solutions nv

More from XeniT Solutions nv (20)

Data Security in the Insurance Industry: what you need to know about data pro...
Data Security in the Insurance Industry: what you need to know about data pro...Data Security in the Insurance Industry: what you need to know about data pro...
Data Security in the Insurance Industry: what you need to know about data pro...
 
Driving full-scale productivity and collaboration with the Alfresco connector...
Driving full-scale productivity and collaboration with the Alfresco connector...Driving full-scale productivity and collaboration with the Alfresco connector...
Driving full-scale productivity and collaboration with the Alfresco connector...
 
How to solve your toughest performance issues in Alfresco
How to solve your toughest performance issues in AlfrescoHow to solve your toughest performance issues in Alfresco
How to solve your toughest performance issues in Alfresco
 
How do you secure an electronic signature?
How do you secure an electronic signature?How do you secure an electronic signature?
How do you secure an electronic signature?
 
How to increase user's productivity with Alfred Desktop and Alfred Finder
How to increase user's productivity with Alfred Desktop and Alfred FinderHow to increase user's productivity with Alfred Desktop and Alfred Finder
How to increase user's productivity with Alfred Desktop and Alfred Finder
 
How to Scale Information Dissemination to the Virtual Digital Workspace
How to Scale Information Dissemination to the Virtual Digital WorkspaceHow to Scale Information Dissemination to the Virtual Digital Workspace
How to Scale Information Dissemination to the Virtual Digital Workspace
 
THE ALFRESCO FOUNDATION ARCHITECTURE FOR INTEGRATED FULL DIGITAL INSURANCE PR...
THE ALFRESCO FOUNDATION ARCHITECTURE FOR INTEGRATED FULL DIGITAL INSURANCE PR...THE ALFRESCO FOUNDATION ARCHITECTURE FOR INTEGRATED FULL DIGITAL INSURANCE PR...
THE ALFRESCO FOUNDATION ARCHITECTURE FOR INTEGRATED FULL DIGITAL INSURANCE PR...
 
Webinar | New release Alfred Desktop 3.7
Webinar | New release Alfred Desktop 3.7Webinar | New release Alfred Desktop 3.7
Webinar | New release Alfred Desktop 3.7
 
Webinar: How to turn Alfresco Digital Business Platform into a Managed Service
Webinar: How to turn Alfresco Digital Business Platform into a Managed ServiceWebinar: How to turn Alfresco Digital Business Platform into a Managed Service
Webinar: How to turn Alfresco Digital Business Platform into a Managed Service
 
Key points quality leaders should know about intelligent information manageme...
Key points quality leaders should know about intelligent information manageme...Key points quality leaders should know about intelligent information manageme...
Key points quality leaders should know about intelligent information manageme...
 
Decouple and simplify access to Alfresco with Alfred Edge - Webinar September...
Decouple and simplify access to Alfresco with Alfred Edge - Webinar September...Decouple and simplify access to Alfresco with Alfred Edge - Webinar September...
Decouple and simplify access to Alfresco with Alfred Edge - Webinar September...
 
Leuven European actuarial journal conference 20180911
Leuven European actuarial journal conference 20180911Leuven European actuarial journal conference 20180911
Leuven European actuarial journal conference 20180911
 
How to configure alfred desktop in your alfresco project in two days
How to configure alfred desktop in your alfresco project in two daysHow to configure alfred desktop in your alfresco project in two days
How to configure alfred desktop in your alfresco project in two days
 
Xenit diary dev con 2018
Xenit diary dev con 2018Xenit diary dev con 2018
Xenit diary dev con 2018
 
GDPR READY SOLUTION FOR UNSTRUCTURED DATA
GDPR READY SOLUTION FOR UNSTRUCTURED DATAGDPR READY SOLUTION FOR UNSTRUCTURED DATA
GDPR READY SOLUTION FOR UNSTRUCTURED DATA
 
REDUCING TOTAL COST OF OWNERSHIP AND INCREASING SCALABILITY WITH XENIT SOLUTI...
REDUCING TOTAL COST OF OWNERSHIP AND INCREASING SCALABILITY WITH XENIT SOLUTI...REDUCING TOTAL COST OF OWNERSHIP AND INCREASING SCALABILITY WITH XENIT SOLUTI...
REDUCING TOTAL COST OF OWNERSHIP AND INCREASING SCALABILITY WITH XENIT SOLUTI...
 
Introducing Alfred Desktop 3.6
Introducing Alfred Desktop 3.6 Introducing Alfred Desktop 3.6
Introducing Alfred Desktop 3.6
 
How to implement gdpr in your document repository
How to implement gdpr in your document repository How to implement gdpr in your document repository
How to implement gdpr in your document repository
 
Introducing Alfred Finder 2.0
Introducing Alfred Finder 2.0 Introducing Alfred Finder 2.0
Introducing Alfred Finder 2.0
 
20151201 swm elba_alfresco_user_day_wien
20151201 swm elba_alfresco_user_day_wien20151201 swm elba_alfresco_user_day_wien
20151201 swm elba_alfresco_user_day_wien
 

Recently uploaded

Call Girls in New Ashok Nagar, (delhi) call me [9953056974] escort service 24X7
Call Girls in New Ashok Nagar, (delhi) call me [9953056974] escort service 24X7Call Girls in New Ashok Nagar, (delhi) call me [9953056974] escort service 24X7
Call Girls in New Ashok Nagar, (delhi) call me [9953056974] escort service 24X7
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
VIP Call Girl in Mumbai 💧 9920725232 ( Call Me ) Get A New Crush Everyday Wit...
VIP Call Girl in Mumbai 💧 9920725232 ( Call Me ) Get A New Crush Everyday Wit...VIP Call Girl in Mumbai 💧 9920725232 ( Call Me ) Get A New Crush Everyday Wit...
VIP Call Girl in Mumbai 💧 9920725232 ( Call Me ) Get A New Crush Everyday Wit...
dipikadinghjn ( Why You Choose Us? ) Escorts
 
VIP Call Girl Service Andheri West ⚡ 9920725232 What It Takes To Be The Best ...
VIP Call Girl Service Andheri West ⚡ 9920725232 What It Takes To Be The Best ...VIP Call Girl Service Andheri West ⚡ 9920725232 What It Takes To Be The Best ...
VIP Call Girl Service Andheri West ⚡ 9920725232 What It Takes To Be The Best ...
dipikadinghjn ( Why You Choose Us? ) Escorts
 
VIP Independent Call Girls in Bandra West 🌹 9920725232 ( Call Me ) Mumbai Esc...
VIP Independent Call Girls in Bandra West 🌹 9920725232 ( Call Me ) Mumbai Esc...VIP Independent Call Girls in Bandra West 🌹 9920725232 ( Call Me ) Mumbai Esc...
VIP Independent Call Girls in Bandra West 🌹 9920725232 ( Call Me ) Mumbai Esc...
dipikadinghjn ( Why You Choose Us? ) Escorts
 
VIP Independent Call Girls in Taloja 🌹 9920725232 ( Call Me ) Mumbai Escorts ...
VIP Independent Call Girls in Taloja 🌹 9920725232 ( Call Me ) Mumbai Escorts ...VIP Independent Call Girls in Taloja 🌹 9920725232 ( Call Me ) Mumbai Escorts ...
VIP Independent Call Girls in Taloja 🌹 9920725232 ( Call Me ) Mumbai Escorts ...
dipikadinghjn ( Why You Choose Us? ) Escorts
 

Recently uploaded (20)

Indore Real Estate Market Trends Report.pdf
Indore Real Estate Market Trends Report.pdfIndore Real Estate Market Trends Report.pdf
Indore Real Estate Market Trends Report.pdf
 
Booking open Available Pune Call Girls Shivane 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Shivane  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Shivane  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Shivane 6297143586 Call Hot Indian Gi...
 
Top Rated Pune Call Girls Aundh ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
Top Rated  Pune Call Girls Aundh ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...Top Rated  Pune Call Girls Aundh ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
Top Rated Pune Call Girls Aundh ⟟ 6297143586 ⟟ Call Me For Genuine Sex Servi...
 
8377087607, Door Step Call Girls In Kalkaji (Locanto) 24/7 Available
8377087607, Door Step Call Girls In Kalkaji (Locanto) 24/7 Available8377087607, Door Step Call Girls In Kalkaji (Locanto) 24/7 Available
8377087607, Door Step Call Girls In Kalkaji (Locanto) 24/7 Available
 
Call Girls in New Ashok Nagar, (delhi) call me [9953056974] escort service 24X7
Call Girls in New Ashok Nagar, (delhi) call me [9953056974] escort service 24X7Call Girls in New Ashok Nagar, (delhi) call me [9953056974] escort service 24X7
Call Girls in New Ashok Nagar, (delhi) call me [9953056974] escort service 24X7
 
(Vedika) Low Rate Call Girls in Pune Call Now 8250077686 Pune Escorts 24x7
(Vedika) Low Rate Call Girls in Pune Call Now 8250077686 Pune Escorts 24x7(Vedika) Low Rate Call Girls in Pune Call Now 8250077686 Pune Escorts 24x7
(Vedika) Low Rate Call Girls in Pune Call Now 8250077686 Pune Escorts 24x7
 
VIP Call Girl in Mumbai 💧 9920725232 ( Call Me ) Get A New Crush Everyday Wit...
VIP Call Girl in Mumbai 💧 9920725232 ( Call Me ) Get A New Crush Everyday Wit...VIP Call Girl in Mumbai 💧 9920725232 ( Call Me ) Get A New Crush Everyday Wit...
VIP Call Girl in Mumbai 💧 9920725232 ( Call Me ) Get A New Crush Everyday Wit...
 
Call Girls Service Pune ₹7.5k Pick Up & Drop With Cash Payment 9352852248 Cal...
Call Girls Service Pune ₹7.5k Pick Up & Drop With Cash Payment 9352852248 Cal...Call Girls Service Pune ₹7.5k Pick Up & Drop With Cash Payment 9352852248 Cal...
Call Girls Service Pune ₹7.5k Pick Up & Drop With Cash Payment 9352852248 Cal...
 
Top Rated Pune Call Girls Shikrapur ⟟ 6297143586 ⟟ Call Me For Genuine Sex S...
Top Rated  Pune Call Girls Shikrapur ⟟ 6297143586 ⟟ Call Me For Genuine Sex S...Top Rated  Pune Call Girls Shikrapur ⟟ 6297143586 ⟟ Call Me For Genuine Sex S...
Top Rated Pune Call Girls Shikrapur ⟟ 6297143586 ⟟ Call Me For Genuine Sex S...
 
Booking open Available Pune Call Girls Talegaon Dabhade 6297143586 Call Hot ...
Booking open Available Pune Call Girls Talegaon Dabhade  6297143586 Call Hot ...Booking open Available Pune Call Girls Talegaon Dabhade  6297143586 Call Hot ...
Booking open Available Pune Call Girls Talegaon Dabhade 6297143586 Call Hot ...
 
VIP Call Girl Service Andheri West ⚡ 9920725232 What It Takes To Be The Best ...
VIP Call Girl Service Andheri West ⚡ 9920725232 What It Takes To Be The Best ...VIP Call Girl Service Andheri West ⚡ 9920725232 What It Takes To Be The Best ...
VIP Call Girl Service Andheri West ⚡ 9920725232 What It Takes To Be The Best ...
 
Webinar on E-Invoicing for Fintech Belgium
Webinar on E-Invoicing for Fintech BelgiumWebinar on E-Invoicing for Fintech Belgium
Webinar on E-Invoicing for Fintech Belgium
 
Top Rated Pune Call Girls Sinhagad Road ⟟ 6297143586 ⟟ Call Me For Genuine S...
Top Rated  Pune Call Girls Sinhagad Road ⟟ 6297143586 ⟟ Call Me For Genuine S...Top Rated  Pune Call Girls Sinhagad Road ⟟ 6297143586 ⟟ Call Me For Genuine S...
Top Rated Pune Call Girls Sinhagad Road ⟟ 6297143586 ⟟ Call Me For Genuine S...
 
VIP Independent Call Girls in Bandra West 🌹 9920725232 ( Call Me ) Mumbai Esc...
VIP Independent Call Girls in Bandra West 🌹 9920725232 ( Call Me ) Mumbai Esc...VIP Independent Call Girls in Bandra West 🌹 9920725232 ( Call Me ) Mumbai Esc...
VIP Independent Call Girls in Bandra West 🌹 9920725232 ( Call Me ) Mumbai Esc...
 
Stock Market Brief Deck (Under Pressure).pdf
Stock Market Brief Deck (Under Pressure).pdfStock Market Brief Deck (Under Pressure).pdf
Stock Market Brief Deck (Under Pressure).pdf
 
Solution Manual for Principles of Corporate Finance 14th Edition by Richard B...
Solution Manual for Principles of Corporate Finance 14th Edition by Richard B...Solution Manual for Principles of Corporate Finance 14th Edition by Richard B...
Solution Manual for Principles of Corporate Finance 14th Edition by Richard B...
 
Diva-Thane European Call Girls Number-9833754194-Diva Busty Professional Call...
Diva-Thane European Call Girls Number-9833754194-Diva Busty Professional Call...Diva-Thane European Call Girls Number-9833754194-Diva Busty Professional Call...
Diva-Thane European Call Girls Number-9833754194-Diva Busty Professional Call...
 
VIP Independent Call Girls in Taloja 🌹 9920725232 ( Call Me ) Mumbai Escorts ...
VIP Independent Call Girls in Taloja 🌹 9920725232 ( Call Me ) Mumbai Escorts ...VIP Independent Call Girls in Taloja 🌹 9920725232 ( Call Me ) Mumbai Escorts ...
VIP Independent Call Girls in Taloja 🌹 9920725232 ( Call Me ) Mumbai Escorts ...
 
Top Rated Pune Call Girls Lohegaon ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...
Top Rated  Pune Call Girls Lohegaon ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...Top Rated  Pune Call Girls Lohegaon ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...
Top Rated Pune Call Girls Lohegaon ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...
 
Mira Road Memorable Call Grls Number-9833754194-Bhayandar Speciallty Call Gir...
Mira Road Memorable Call Grls Number-9833754194-Bhayandar Speciallty Call Gir...Mira Road Memorable Call Grls Number-9833754194-Bhayandar Speciallty Call Gir...
Mira Road Memorable Call Grls Number-9833754194-Bhayandar Speciallty Call Gir...
 

The power of faceted search in alfresco

  • 1. The Power of Faceted Search in Alfresco Roxana Angheluta Willem Van den Eynde
  • 2. This presentation ● Who are we ? ● Alfresco ● Faceted Search ● Demo
  • 3. Willem Van den Eynde ● 2004 - 2007 ○ Bachelor applied informatics, KHLeuven ● 2006 - 2007 ○ LiU Erasmus Sweden ● 2007 ○ internship in Paris ● 2007 - 2010 ○ Master in applied informatics KULeuven ● 2010 - current ○ Software Engineer XeniT Leuven
  • 4. Roxana Angheluta ● 1995 - 1999 ○ Bachelor informatics, University of Bucharest ● 2000 - 2001 ○ Erasmus student KULeuven ● 2001-2004 ○ Assistant researcher KULeuven ● 2003-2004 ○ Master in Artificial Intelligence KULeuven ● 2004 - 2012 ○ Software Engineer Attentio Brussels ● 2012 - 2013 ○ Software Engineer XeniT Leuven
  • 5. Introducing XeniT Managing content in a smart way 2009 - Proprietary and Confidential Information of Xenit Solutions
  • 6. 2009 - Proprietary and Confidential Information of Xenit Solutions From our home base With an enthusiastic and experienced team In collaboration with our customers
  • 7. 2009 - Proprietary and Confidential Information of Xenit Solutions The corporate story of XeniT IWT project Concurrent collaboration 3.5 M docs Alfresco- 8M As-A- docs Service 2007 2008 2009 2010 2011 2012 2013
  • 8. Maidenhead, UK Global Headquarters Atlanta, US Headquarters Alfresco is the largest private, pure- play open source software company in the world. 4 million+ downloads of Alfresco community 75,000+ sites running community 2000+ Enterprise customers from 43+ countries 200+ channel partners 20 consecutive quarters of revenue growth founded in 2005
  • 9. What is Alfresco Alfresco is an open source enterprise content management system
  • 10. What is Alfresco ? ● Enterprise Content Management (ECM) is a formalized means of organizing and storing an organization's documents, and other content, that relate to the organization's processes. The term encompasses strategies, methods, and tools used throughout the lifecycle of the content.
  • 11. Classification and Retrieval ● Classification ● Retrieval
  • 12. FAQ ● How does an open-source company like Alfresco generate revenue ? ● Alfresco vs Microsoft SharePoint
  • 13. 2009 - Proprietary and Confidential Information of Xenit Solutions Alfresco demo
  • 14. Search in Alfresco ● Many search engines out there, few engines really good, fewer open source ● Requirements: ○ accurate ○ performant ○ flexible ○ cross-platform ○ scalable ○ mature ● Lucene ○ https://lucene.apache.org/ ● Starting with Alfresco 4.0 => Solr ○ http://lucene.apache.org/solr/
  • 15. Lucene ● Java-based indexing and search library, as well as spellchecking, hit highlighting and advanced analysis/tokenization capabilities ● History Doug Cutting originally wrote Lucene in 1999.[2] It was initially available for download from its home at the SourceForge web site. It joined the Apache Software Foundation's Jakarta family of open-source Java products in September 2001 and became its own top-level Apache project in February 2005. ● Many projects based on Lucene: Solr, Nutch, Elasticsearch
  • 16. Lucene Indexing ● over 150GB/hour on modern hardware ● small RAM requirements -- only 1MB heap ● incremental indexing as fast as batch indexing ● index size roughly 20-30% the size of text indexed Searching ● ranked searching -- best results returned first ● many powerful query types: phrase queries, wildcard queries, proximity queries, range queries and more ● fielded searching (e.g. title, author, contents) ● sorting by any field ● multiple-index searching with merged results ● allows simultaneous update and searching ● flexible faceting, highlighting, joins and result grouping ● fast, memory-efficient and typo-tolerant suggesters ● pluggable ranking models, including the Vector Space Model and Okapi BM25 ● configurable storage engine (codecs)
  • 17. 2009 - Proprietary and Confidential Information of Xenit Solutions Lucene in Alfresco http://www.slideshare.net/JM.Pascal/alfresco-search-tutorial-presentation
  • 18. 2009 - Proprietary and Confidential Information of Xenit Solutions Lucene in Alfresco http://www.slideshare.net/JM.Pascal/alfresco-search-tutorial-presentation
  • 19. 2009 - Proprietary and Confidential Information of Xenit Solutions Lucene in Alfresco http://www.slideshare.net/JM.Pascal/alfresco-search-tutorial-presentation
  • 20. 2009 - Proprietary and Confidential Information of Xenit Solutions Lucene in Alfresco http://www.slideshare.net/JM.Pascal/alfresco-search-tutorial-presentation
  • 21. 2009 - Proprietary and Confidential Information of Xenit Solutions Lucene in Alfresco http://www.slideshare.net/JM.Pascal/alfresco-search-tutorial-presentation
  • 22. 2009 - Proprietary and Confidential Information of Xenit Solutions Lucene in Alfresco http://www.slideshare.net/JM.Pascal/alfresco-search-tutorial-presentation The way to preserve information in the Lucene index is specified in Alfresco's data models Main concept: tokenization
  • 23. 2009 - Proprietary and Confidential Information of Xenit Solutions Lucene in Alfresco http://www.slideshare.net/JM.Pascal/alfresco-search-tutorial-presentation with without tokenization with tokenization
  • 24. 2009 - Proprietary and Confidential Information of Xenit Solutions Lucene in Alfresco ● Out of the box search: ○ search in all items, in a certain property or in the content (full text search) ○ additionally: PATH, ASPECT, CATEGORY searches ○ Lucene syntax allowed: ■ boolean queries ■ wildcard queries ■ range queries
  • 25. Solr ● Standalone full-text search server within a servlet container such as Tomcat. Uses Lucene library and has REST-like HTTP/XML and JSON API. Has an extensive plugin architecture. ● In 2004, Solr was created by Yonik Seeley at CNET_Networks and in January 2006 the source code was donated to the Apache Software Foundation under the Lucene top-level project. In March 2010, the Lucene and Solr projects merged and consequently in 2011, the Solr version number scheme was changed in order to match that of Lucene. ● Many users: ○ http://wiki.apache.org/solr/PublicServers
  • 26. Solr ● Uses the Lucene library for full-text search ● Faceted navigation ● Hit highlighting ● Query language supports structured as well as textual search ● JSON, XML, PHP, Ruby, Python, XSLT, Velocity and custom Java binary output formats over HTTP ● HTML administration interface ● Replication to other Solr servers - enables scaling QPS ● Distributed Search through Sharding - enables scaling content volume ● Search results clustering based on Carrot2 ● Extensible through plugins ● Pluggable relevance - boost through formula ● Caching ● Embeddable in a Java Application
  • 27. Faceted Search in Alfresco ● A way to navigate through the documents, showing counts per property value and offering the possibility to drill down in the data ● Faceted search supported by Lucene/Solr, not yet supported by Alfresco ● Implemented by Xenit in Fred
  • 28. Faceted Search in Alfresco
  • 29. Faceted Search in Alfresco ● Questions ○ which fields should be facetable? ■ only the ones with a limited set of possible values ■ only the ones which are untokenized ■ plus ranges: dates and numbers ○ how to navigate inside facets? ● Current implementation ○ facetable fields configurable in a file ○ date ranges and number ranges not supported yet ○ drilling-down in a single value possible
  • 30. Faceted Search in Fred: mockup
  • 31. Demo