SlideShare a Scribd company logo
1 of 48
Download to read offline
Rated Ranking Evaluator Enterprise:
the Next Generation of Free Search
Quality Evaluation Tools



Alessandro Benedetti, Director
Andrea Gazzarini, Co-Founder
15th
September 2021
‣ Born in Tarquinia(ancient Etruscan city)
‣ R&D Software Engineer
‣ Director
‣ Master in Computer Science
‣ Apache Lucene/Solr PMC member/committer
‣ Elasticsearch expert
‣ Semantic, NLP, Machine Learning
technologies passionate
‣ Beach Volleyball player and Snowboarder
Who We Are
Alessandro Benedetti
‣ Born in Viterbo
‣ Hermit Software Engineer
‣ Master in Economy
‣ Programming Passionate
‣ RRE Creator
‣ Apache Lucene/Solr Expert
‣ Elasticsearch Expert
‣ Apache Qpid Committer
‣ Father, Husband
‣ Bass player, aspiring (still frustrated at the moment) Chapman
Stick player
Who We Are
Andrea Gazzarini
‣ Headquarter in London/distributed
‣ Open Source Enthusiasts
‣ Apache Lucene/Solr experts
‣ Elasticsearch experts
‣ Community Contributors
‣ Active Researchers
‣ Hot Trends : Learning To Rank,
Document Similarity,
Search Quality Evaluation,
Relevancy Tuning
Search Services
www.sease.io
Clients
Agenda
RRE Open Source
RRE Enterprise (RREE)
Query Discovery
Rating Generation
Explore Evaluation Results
Agenda
RRE Open Source
RRE Enterprise (RREE)
Query Discovery
Rating Generation
Explore Evaluation Results
‣ Open Source library for Search Quality Evaluation
‣ Ratings are expected in input (Json supported)
‣ Many offline metrics available out-of-the-box
(Precision@k, NDCG@k, F-Measure…)
‣ Apache Solr and Elasticsearch support
‣ Development-centric approach
‣ Evaluation on the fly and results in various formats
‣ Community building up!
‣ RRE-User mailing list:
https://groups.google.com/g/rre-user
‣ https://github.com/SeaseLtd/rated-ranking-evaluator
Rated Ranking Evaluator : RRE Open Source
What is it ?
Rated Ranking Evaluator : The Genesis
2018
Search Consultancy Project
A customer explicitly asked for a
rudimental search quality evaluation
tool while we were working on its
search infrastructure.
Jun
Search Quality Evaluation
A Developer Perspective
0.9
Search Quality Evaluation
Tools And Techniques
Oct
1.0
to be continued...
mumble mumble
Rated Ranking Evaluator: The Idea
RRE has been thought as a development tool that
executes search quality evaluations as part of a
project build process.
It’s like a read–eval–print loop (REPL) tied on top of an
Information Retrieval subsystem that encourages an
incremental / iterative approach.
The underlying idea
New system Existing system
Here are the requirements
Ok
V1.0 has been released
Cool!
a month later…
We have a change request.
We found a bug
We need to improve our search
system, users are complaining about
junk in search results.
Ok
v0.1
…
v0.9
v1.1
v1.2
v1.3
…
v2.0
v2.0
In terms of retrieval effectiveness,
how can we know the system
performance across various versions?
Rated Ranking Evaluator: Domain Model
RRE Domain Model is organized into a composite /
tree-like structure where the relationships between
entities are always 1 to many.
The top level entity is a placeholder representing an
evaluation execution.
Versioned metrics are computed at query level and
then reported, using an aggregation function, at
upper levels.
The benefit of having a composite structure is clear:
we can see a metric value at different levels (e.g. a
query, all queries belonging to a query group, all
queries belonging to a topic or at corpus level)
Domain Model
Evaluation
Corpus
1..*
v1.0
P@10
NDCG
AP
F-MEASURE
….
v1.1
P@10
NDCG
AP
F-MEASURE
….
v1.2
P@10
NDCG
AP
F-MEASURE
….
v1.n
P@10
NDCG
AP
F-MEASURE
….
Topic
Query Group
Query
1..*
1..*
1..*
…
Top level domain entity
Test dataset / collection
Information need
Query variants
Queries
Rated Ranking Evaluator : How it works
Data
Configuration
Ratings
Search Platform
uses a
produces
Evaluation Data
INPUT LAYER EVALUATION LAYER OUTPUT LAYER
JSON
RRE Console
…
used for generating
Explainability - Why is important in Information Retrieval?
Dev, tune & Build
Check evaluation results
We are thinking about how
to fill a third monitor
Agenda
RRE Open Source
RRE Enterprise (RREE)
Query Discovery
Rating Generation
Explore Evaluation Results
RRE Enterprise : The Genesis
2018
Search Consultancy Project
A customer explicitly asked for a rudimental search quality evaluation
tool while we were working on its search infrastructure.
2019
Rated Ranking Evaluator
An Open Source Approach
to be continued...
The first sketches depicting an idea
about an enterprise-level version of
RRE.
The development starts few months
later.
Rated Ranking Evaluator : The Genesis
2018
Search Consultancy Project
A customer explicitly asked for a rudimental search quality evaluation
tool while we were working on its search infrastructure.
2019
to be continued...
2020
2021
JBCP
ID Discovery
Query Discovery
Agenda
RRE Open Source
RRE Enterprise (RREE)
Query Discovery
Rating Generation
Explore Evaluation Results
RRE Open Source Recap: How it works (½)
Data
Configuration
Ratings
targets produces
Evaluation Data
INPUT LAYER EVALUATION LAYER OUTPUT LAYER
JSON
RRE Console
…
used for generating
OR
RRE Open Source Recap: How it works (2/2)
Data
Configuration
Ratings
targets produces
Evaluation Data
INPUT LAYER EVALUATION LAYER OUTPUT LAYER
JSON
RRE Console
…
used for generating
OR
RRE Enterprise: Query Discovery?
API
Problem: I have an intermediate Search-API that builds complex Apache Solr/Elasticsearch queries
RRE Open Source: No Query Discovery
targets
OR
1
Evaluation Data
produces
2
Query Discovery
RRE Enterprise: Query Discovery & Evaluation
2
Evaluation Data
produces
3
targets
OR
{SEARCH API}
1
correlates on
targets
Input Rating: RRE Open Source vs RREE
RRE Enterprise
RRE Open Source
Topic
Query Group
Query
Information need
Query variant
(Search Engine) Query
+ 3
Rated Document
Topic
Query Group
API Request
Information need
Query variant
(Search API) Request
+ 3
Rated Document
Query (Search Engine) Query
Agenda
RRE Open Source
RRE Enterprise (RREE)
Query Discovery
Rating Generation
Explore Evaluation Results
RRE Enterprise: Rating Generation
A fundamental requirement of
offline search quality
evaluation is to gather
<query,document,rating>
triples that represent the
relevance(rating) of a
document given a user
information need(query).
Before assessing the retrieval
effectiveness of a system it is
necessary to associate a
relevance rating to each pair
<query, document> involved in
our evaluation.
RRE Enterprise: Explicit Ratings (1/2)
• Explicitly provided by domain experts
• High accuracy
• High effort / time / resources
• RRE Open Source accepts only explicit ratings
Explicit Ratings
RRE Rating Structure
Topic
Query Group
Query
Information need
Query variant
Query
+ 3
Rated Document
Question: how to minimize the relevant effort required
to provide explicit ratings?
RRE Enterprise: Explicit Ratings (2/2)
• Chrome plugin which applies an
evaluation layer on top of an arbitrary
website
• Ratings are generated using directly the
customer website
• Users Lowest learning curve
• Generated ratings are sent to RREE
through a specific endpoint
• An “ID Discovery” component translates
the received data (rated web items) in RRE
Ratings (rated Solr/Elasticsearch
documents)
Judgment Collector
Implicit Feedback - Click
{
"collection": "papers",
"query": "interleaving",
"blackBoxQueryRequest": "GET
/query?q=interleaving HTTP/1.1rnHost:
localhost:5063",
"documentId": "1",
"click": 1,
"timestamp":
"2021-06-23T16:10:49Z",
"queryDocumentPosition": 0
}
Click
http://vps-933d20b7.vps.ovh.net:8080/1.0
/rre-enterprise-api/input-api/interaction
Search Result Page
Implicit Feedback - Add To Cart
{
"collection": "papers",
"query": "interleaving",
"blackBoxQueryRequest": "GET
/query?q=interleaving HTTP/1.1rnHost:
localhost:5063",
"documentId": "1",
"addToCart": 1,
"timestamp":
"2021-06-23T16:10:49Z",
"queryDocumentPosition": 0
}
Add To Cart
http://vps-933d20b7.vps.ovh.net:8080/1.0
/rre-enterprise-api/input-api/interaction
Search Result Page
Implicit Feedback - Sale
{
"collection": "papers",
"query": "interleaving",
"blackBoxQueryRequest": "GET
/query?q=interleaving HTTP/1.1rnHost:
localhost:5063",
"documentId": "1",
"sale": 1,
"timestamp":
"2021-06-23T16:10:49Z",
"queryDocumentPosition": 0
}
Sale
http://vps-933d20b7.vps.ovh.net:8080/1.0
/rre-enterprise-api/input-api/interaction
Search Result Page
Implicit Feedback - Revenue
{
"collection": "papers",
"query": "interleaving",
"blackBoxQueryRequest": "GET
/query?q=interleaving HTTP/1.1rnHost:
localhost:5063",
"documentId": "1",
"revenue": 100,
"timestamp":
"2021-06-23T16:10:49Z",
"queryDocumentPosition": 0
}
Revenue
http://vps-933d20b7.vps.ovh.net:8080/1.0
/rre-enterprise-api/input-api/interaction
Search Result Page
Implicit Feedback - Storage
{
"collection": "papers",
"query": "interleaving",
"blackBoxQueryRequest": "GET
/query?q=interleaving HTTP/1.1rnHost:
localhost:5063",
"documentId": "1",
"revenue": 100,
"timestamp":
"2021-06-23T16:10:49Z",
"queryDocumentPosition": 0
}
http://vps-933d20b7.vps.ovh.net:8080/1.0
/rre-enterprise-api/input-api/interaction
Interactions
Implicit Feedback - Calculate Online Metrics
Interactions
<query,document>
Implicit Feedback - Estimate Relevance
Global Max: 1 Local Max: [0...1]
Global Min: 0 Local Min: [0...1]
Simple Click Model
0 1 2 3 4
Rating
Metric Score
0 1
0.5
Agenda
RRE Open Source
RRE Enterprise (RREE)
Query Discovery
Rating Generation
Explore Evaluation Results
‣ UI using React library
‣ Configuration support
‣ Navigation of Evaluation Results
‣ Overview - quick view for Business Stakeholders
‣ Explore (an evaluation) - deep view for Software
Engineers
‣ Compare (two evaluations) - deep comparison for
Software Engineers
Explore Evaluation Results: UI
Evaluation Results - Overview
https://rre-enterprise.netlify.app/overview
Evaluation Results - Overview Expanded
Evaluation Results - Overview Zoom
Evaluation Results - Explore
Evaluation Results - Explore Query Info
Evaluation Results - Explore Query Info 2
Evaluation Results - Compare (1/3)
Evaluation Results - Compare (2/3)
Evaluation Results - Compare (3/3)
Evaluation Results - Compare Query Info
‣ Release with free usage plan
‣ Configuration support
‣ Support for multimedia document properties
‣ Intelligent insights on weak performing queries,
groups, topics
‣ Improvements on Click Modelling for Implicit
Relevance estimation
RRE Enterprise: Future Work
Thank You!

More Related Content

What's hot

Learning to Rank in Solr: Presented by Michael Nilsson & Diego Ceccarelli, Bl...
Learning to Rank in Solr: Presented by Michael Nilsson & Diego Ceccarelli, Bl...Learning to Rank in Solr: Presented by Michael Nilsson & Diego Ceccarelli, Bl...
Learning to Rank in Solr: Presented by Michael Nilsson & Diego Ceccarelli, Bl...
Lucidworks
 

What's hot (20)

Anatomy of an eCommerce Search Engine by Mayur Datar
Anatomy of an eCommerce Search Engine by Mayur DatarAnatomy of an eCommerce Search Engine by Mayur Datar
Anatomy of an eCommerce Search Engine by Mayur Datar
 
Find it! Nail it! Boosting e-commerce search conversions with machine learnin...
Find it! Nail it!Boosting e-commerce search conversions with machine learnin...Find it! Nail it!Boosting e-commerce search conversions with machine learnin...
Find it! Nail it! Boosting e-commerce search conversions with machine learnin...
 
Near Real Time Indexing: Presented by Umesh Prasad & Thejus V M, Flipkart
Near Real Time Indexing: Presented by Umesh Prasad & Thejus V M, FlipkartNear Real Time Indexing: Presented by Umesh Prasad & Thejus V M, Flipkart
Near Real Time Indexing: Presented by Umesh Prasad & Thejus V M, Flipkart
 
Consuming RealTime Signals in Solr
Consuming RealTime Signals in Solr Consuming RealTime Signals in Solr
Consuming RealTime Signals in Solr
 
How to Build a Semantic Search System
How to Build a Semantic Search SystemHow to Build a Semantic Search System
How to Build a Semantic Search System
 
Slash n near real time indexing
Slash n   near real time indexingSlash n   near real time indexing
Slash n near real time indexing
 
Learning to Rank in Solr: Presented by Michael Nilsson & Diego Ceccarelli, Bl...
Learning to Rank in Solr: Presented by Michael Nilsson & Diego Ceccarelli, Bl...Learning to Rank in Solr: Presented by Michael Nilsson & Diego Ceccarelli, Bl...
Learning to Rank in Solr: Presented by Michael Nilsson & Diego Ceccarelli, Bl...
 
Recommender system
Recommender systemRecommender system
Recommender system
 
Integrating Spark and Solr-(Timothy Potter, Lucidworks)
Integrating Spark and Solr-(Timothy Potter, Lucidworks)Integrating Spark and Solr-(Timothy Potter, Lucidworks)
Integrating Spark and Solr-(Timothy Potter, Lucidworks)
 
Near RealTime search @Flipkart
Near RealTime search @FlipkartNear RealTime search @Flipkart
Near RealTime search @Flipkart
 
Deep Learning for Recommender Systems RecSys2017 Tutorial
Deep Learning for Recommender Systems RecSys2017 Tutorial Deep Learning for Recommender Systems RecSys2017 Tutorial
Deep Learning for Recommender Systems RecSys2017 Tutorial
 
Haystack 2019 - Query relaxation - a rewriting technique between search and r...
Haystack 2019 - Query relaxation - a rewriting technique between search and r...Haystack 2019 - Query relaxation - a rewriting technique between search and r...
Haystack 2019 - Query relaxation - a rewriting technique between search and r...
 
Learning to rank
Learning to rankLearning to rank
Learning to rank
 
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
 
Search summit-2018-ltr-presentation
Search summit-2018-ltr-presentationSearch summit-2018-ltr-presentation
Search summit-2018-ltr-presentation
 
A Learning to Rank Project on a Daily Song Ranking Problem
A Learning to Rank Project on a Daily Song Ranking ProblemA Learning to Rank Project on a Daily Song Ranking Problem
A Learning to Rank Project on a Daily Song Ranking Problem
 
Haystack 2019 - Solving for Satisfaction: Introduction to Click Models - Eliz...
Haystack 2019 - Solving for Satisfaction: Introduction to Click Models - Eliz...Haystack 2019 - Solving for Satisfaction: Introduction to Click Models - Eliz...
Haystack 2019 - Solving for Satisfaction: Introduction to Click Models - Eliz...
 
Advantages of metadata
Advantages of metadataAdvantages of metadata
Advantages of metadata
 
Mobile First to AI First: How User Signals Change SEO | SMX19
Mobile First to AI First: How User Signals Change SEO | SMX19Mobile First to AI First: How User Signals Change SEO | SMX19
Mobile First to AI First: How User Signals Change SEO | SMX19
 
Improving Search Relevance in Elasticsearch Using Machine Learning - Milorad ...
Improving Search Relevance in Elasticsearch Using Machine Learning - Milorad ...Improving Search Relevance in Elasticsearch Using Machine Learning - Milorad ...
Improving Search Relevance in Elasticsearch Using Machine Learning - Milorad ...
 

Similar to Rated Ranking Evaluator Enterprise: the next generation of free Search Quality Evaluation Tools

Rated Ranking Evaluator: An Open Source Approach for Search Quality Evaluation
Rated Ranking Evaluator: An Open Source Approach for Search Quality EvaluationRated Ranking Evaluator: An Open Source Approach for Search Quality Evaluation
Rated Ranking Evaluator: An Open Source Approach for Search Quality Evaluation
Alessandro Benedetti
 
Haystack 2019 - Rated Ranking Evaluator: an Open Source Approach for Search Q...
Haystack 2019 - Rated Ranking Evaluator: an Open Source Approach for Search Q...Haystack 2019 - Rated Ranking Evaluator: an Open Source Approach for Search Q...
Haystack 2019 - Rated Ranking Evaluator: an Open Source Approach for Search Q...
OpenSource Connections
 
Rated Ranking Evaluator: An Open Source Approach for Search Quality Evaluation
Rated Ranking Evaluator: An Open Source Approach for Search Quality EvaluationRated Ranking Evaluator: An Open Source Approach for Search Quality Evaluation
Rated Ranking Evaluator: An Open Source Approach for Search Quality Evaluation
Alessandro Benedetti
 
LSP ( Logic Score Preference ) _ Rajan_Dhabalia_San Francisco State University
LSP ( Logic Score Preference ) _ Rajan_Dhabalia_San Francisco State UniversityLSP ( Logic Score Preference ) _ Rajan_Dhabalia_San Francisco State University
LSP ( Logic Score Preference ) _ Rajan_Dhabalia_San Francisco State University
dhabalia
 
Web Rec Final Report
Web Rec Final ReportWeb Rec Final Report
Web Rec Final Report
weichen
 
Catalog maintenance and epo services at ta
Catalog maintenance and epo services at taCatalog maintenance and epo services at ta
Catalog maintenance and epo services at ta
thinkahead.net
 
Catalogue Maintenance & Editorial Process Outsourcing
Catalogue Maintenance &  Editorial Process Outsourcing Catalogue Maintenance &  Editorial Process Outsourcing
Catalogue Maintenance & Editorial Process Outsourcing
thinkahead.net
 
FinalInnovationWeek-Dec12
FinalInnovationWeek-Dec12FinalInnovationWeek-Dec12
FinalInnovationWeek-Dec12
Praneet Mhatre
 

Similar to Rated Ranking Evaluator Enterprise: the next generation of free Search Quality Evaluation Tools (20)

Rated Ranking Evaluator: An Open Source Approach for Search Quality Evaluation
Rated Ranking Evaluator: An Open Source Approach for Search Quality EvaluationRated Ranking Evaluator: An Open Source Approach for Search Quality Evaluation
Rated Ranking Evaluator: An Open Source Approach for Search Quality Evaluation
 
Haystack 2019 - Rated Ranking Evaluator: an Open Source Approach for Search Q...
Haystack 2019 - Rated Ranking Evaluator: an Open Source Approach for Search Q...Haystack 2019 - Rated Ranking Evaluator: an Open Source Approach for Search Q...
Haystack 2019 - Rated Ranking Evaluator: an Open Source Approach for Search Q...
 
Rated Ranking Evaluator: an Open Source Approach for Search Quality Evaluation
Rated Ranking Evaluator: an Open Source Approach for Search Quality EvaluationRated Ranking Evaluator: an Open Source Approach for Search Quality Evaluation
Rated Ranking Evaluator: an Open Source Approach for Search Quality Evaluation
 
Search Quality Evaluation: Tools and Techniques
Search Quality Evaluation: Tools and TechniquesSearch Quality Evaluation: Tools and Techniques
Search Quality Evaluation: Tools and Techniques
 
Haystack London - Search Quality Evaluation, Tools and Techniques
Haystack London - Search Quality Evaluation, Tools and Techniques Haystack London - Search Quality Evaluation, Tools and Techniques
Haystack London - Search Quality Evaluation, Tools and Techniques
 
Search Quality Evaluation to Help Reproducibility: An Open-source Approach
Search Quality Evaluation to Help Reproducibility: An Open-source ApproachSearch Quality Evaluation to Help Reproducibility: An Open-source Approach
Search Quality Evaluation to Help Reproducibility: An Open-source Approach
 
Search Quality Evaluation to Help Reproducibility : an Open Source Approach
Search Quality Evaluation to Help Reproducibility : an Open Source ApproachSearch Quality Evaluation to Help Reproducibility : an Open Source Approach
Search Quality Evaluation to Help Reproducibility : an Open Source Approach
 
Rated Ranking Evaluator: An Open Source Approach for Search Quality Evaluation
Rated Ranking Evaluator: An Open Source Approach for Search Quality EvaluationRated Ranking Evaluator: An Open Source Approach for Search Quality Evaluation
Rated Ranking Evaluator: An Open Source Approach for Search Quality Evaluation
 
Search Quality Evaluation: a Developer Perspective
Search Quality Evaluation: a Developer PerspectiveSearch Quality Evaluation: a Developer Perspective
Search Quality Evaluation: a Developer Perspective
 
LSP ( Logic Score Preference ) _ Rajan_Dhabalia_San Francisco State University
LSP ( Logic Score Preference ) _ Rajan_Dhabalia_San Francisco State UniversityLSP ( Logic Score Preference ) _ Rajan_Dhabalia_San Francisco State University
LSP ( Logic Score Preference ) _ Rajan_Dhabalia_San Francisco State University
 
Web Rec Final Report
Web Rec Final ReportWeb Rec Final Report
Web Rec Final Report
 
How To Implement Your Online Search Quality Evaluation With Kibana
How To Implement Your Online Search Quality Evaluation With KibanaHow To Implement Your Online Search Quality Evaluation With Kibana
How To Implement Your Online Search Quality Evaluation With Kibana
 
E-Commerce Product Rating Based on Customer Review
E-Commerce Product Rating Based on Customer ReviewE-Commerce Product Rating Based on Customer Review
E-Commerce Product Rating Based on Customer Review
 
Ramesh Selenium
Ramesh SeleniumRamesh Selenium
Ramesh Selenium
 
SharePoint 2013 Search Topology and Optimization
SharePoint 2013 Search Topology and OptimizationSharePoint 2013 Search Topology and Optimization
SharePoint 2013 Search Topology and Optimization
 
Feature Based Opinion Mining from Amazon Reviews
Feature Based Opinion Mining from Amazon ReviewsFeature Based Opinion Mining from Amazon Reviews
Feature Based Opinion Mining from Amazon Reviews
 
#SPSPhilly search topology & optimization
#SPSPhilly search topology & optimization#SPSPhilly search topology & optimization
#SPSPhilly search topology & optimization
 
Catalog maintenance and epo services at ta
Catalog maintenance and epo services at taCatalog maintenance and epo services at ta
Catalog maintenance and epo services at ta
 
Catalogue Maintenance & Editorial Process Outsourcing
Catalogue Maintenance &  Editorial Process Outsourcing Catalogue Maintenance &  Editorial Process Outsourcing
Catalogue Maintenance & Editorial Process Outsourcing
 
FinalInnovationWeek-Dec12
FinalInnovationWeek-Dec12FinalInnovationWeek-Dec12
FinalInnovationWeek-Dec12
 

More from Sease

When SDMX meets AI-Leveraging Open Source LLMs To Make Official Statistics Mo...
When SDMX meets AI-Leveraging Open Source LLMs To Make Official Statistics Mo...When SDMX meets AI-Leveraging Open Source LLMs To Make Official Statistics Mo...
When SDMX meets AI-Leveraging Open Source LLMs To Make Official Statistics Mo...
Sease
 
Stat-weight Improving the Estimator of Interleaved Methods Outcomes with Stat...
Stat-weight Improving the Estimator of Interleaved Methods Outcomes with Stat...Stat-weight Improving the Estimator of Interleaved Methods Outcomes with Stat...
Stat-weight Improving the Estimator of Interleaved Methods Outcomes with Stat...
Sease
 
Neural Search Comes to Apache Solr
Neural Search Comes to Apache SolrNeural Search Comes to Apache Solr
Neural Search Comes to Apache Solr
Sease
 
Evaluating Your Learning to Rank Model: Dos and Don’ts in Offline/Online Eval...
Evaluating Your Learning to Rank Model: Dos and Don’ts in Offline/Online Eval...Evaluating Your Learning to Rank Model: Dos and Don’ts in Offline/Online Eval...
Evaluating Your Learning to Rank Model: Dos and Don’ts in Offline/Online Eval...
Sease
 

More from Sease (20)

Multi Valued Vectors Lucene
Multi Valued Vectors LuceneMulti Valued Vectors Lucene
Multi Valued Vectors Lucene
 
When SDMX meets AI-Leveraging Open Source LLMs To Make Official Statistics Mo...
When SDMX meets AI-Leveraging Open Source LLMs To Make Official Statistics Mo...When SDMX meets AI-Leveraging Open Source LLMs To Make Official Statistics Mo...
When SDMX meets AI-Leveraging Open Source LLMs To Make Official Statistics Mo...
 
Introducing Multi Valued Vectors Fields in Apache Lucene
Introducing Multi Valued Vectors Fields in Apache LuceneIntroducing Multi Valued Vectors Fields in Apache Lucene
Introducing Multi Valued Vectors Fields in Apache Lucene
 
Stat-weight Improving the Estimator of Interleaved Methods Outcomes with Stat...
Stat-weight Improving the Estimator of Interleaved Methods Outcomes with Stat...Stat-weight Improving the Estimator of Interleaved Methods Outcomes with Stat...
Stat-weight Improving the Estimator of Interleaved Methods Outcomes with Stat...
 
How does ChatGPT work: an Information Retrieval perspective
How does ChatGPT work: an Information Retrieval perspectiveHow does ChatGPT work: an Information Retrieval perspective
How does ChatGPT work: an Information Retrieval perspective
 
How To Implement Your Online Search Quality Evaluation With Kibana
How To Implement Your Online Search Quality Evaluation With KibanaHow To Implement Your Online Search Quality Evaluation With Kibana
How To Implement Your Online Search Quality Evaluation With Kibana
 
Neural Search Comes to Apache Solr
Neural Search Comes to Apache SolrNeural Search Comes to Apache Solr
Neural Search Comes to Apache Solr
 
Large Scale Indexing
Large Scale IndexingLarge Scale Indexing
Large Scale Indexing
 
Neural Search Comes to Apache Solr_ Approximate Nearest Neighbor, BERT and Mo...
Neural Search Comes to Apache Solr_ Approximate Nearest Neighbor, BERT and Mo...Neural Search Comes to Apache Solr_ Approximate Nearest Neighbor, BERT and Mo...
Neural Search Comes to Apache Solr_ Approximate Nearest Neighbor, BERT and Mo...
 
Word2Vec model to generate synonyms on the fly in Apache Lucene.pdf
Word2Vec model to generate synonyms on the fly in Apache Lucene.pdfWord2Vec model to generate synonyms on the fly in Apache Lucene.pdf
Word2Vec model to generate synonyms on the fly in Apache Lucene.pdf
 
How to cache your searches_ an open source implementation.pptx
How to cache your searches_ an open source implementation.pptxHow to cache your searches_ an open source implementation.pptx
How to cache your searches_ an open source implementation.pptx
 
Online Testing Learning to Rank with Solr Interleaving
Online Testing Learning to Rank with Solr InterleavingOnline Testing Learning to Rank with Solr Interleaving
Online Testing Learning to Rank with Solr Interleaving
 
Apache Lucene/Solr Document Classification
Apache Lucene/Solr Document ClassificationApache Lucene/Solr Document Classification
Apache Lucene/Solr Document Classification
 
Advanced Document Similarity with Apache Lucene
Advanced Document Similarity with Apache LuceneAdvanced Document Similarity with Apache Lucene
Advanced Document Similarity with Apache Lucene
 
Introduction to Music Information Retrieval
Introduction to Music Information RetrievalIntroduction to Music Information Retrieval
Introduction to Music Information Retrieval
 
Explainability for Learning to Rank
Explainability for Learning to RankExplainability for Learning to Rank
Explainability for Learning to Rank
 
Interactive Questions and Answers - London Information Retrieval Meetup
Interactive Questions and Answers - London Information Retrieval MeetupInteractive Questions and Answers - London Information Retrieval Meetup
Interactive Questions and Answers - London Information Retrieval Meetup
 
Feature Extraction for Large-Scale Text Collections
Feature Extraction for Large-Scale Text CollectionsFeature Extraction for Large-Scale Text Collections
Feature Extraction for Large-Scale Text Collections
 
Enterprise Search – How Relevant Is Relevance?
Enterprise Search – How Relevant Is Relevance?Enterprise Search – How Relevant Is Relevance?
Enterprise Search – How Relevant Is Relevance?
 
Evaluating Your Learning to Rank Model: Dos and Don’ts in Offline/Online Eval...
Evaluating Your Learning to Rank Model: Dos and Don’ts in Offline/Online Eval...Evaluating Your Learning to Rank Model: Dos and Don’ts in Offline/Online Eval...
Evaluating Your Learning to Rank Model: Dos and Don’ts in Offline/Online Eval...
 

Recently uploaded

Recently uploaded (20)

A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 

Rated Ranking Evaluator Enterprise: the next generation of free Search Quality Evaluation Tools

  • 1. Rated Ranking Evaluator Enterprise: the Next Generation of Free Search Quality Evaluation Tools
 
 Alessandro Benedetti, Director Andrea Gazzarini, Co-Founder 15th September 2021
  • 2. ‣ Born in Tarquinia(ancient Etruscan city) ‣ R&D Software Engineer ‣ Director ‣ Master in Computer Science ‣ Apache Lucene/Solr PMC member/committer ‣ Elasticsearch expert ‣ Semantic, NLP, Machine Learning technologies passionate ‣ Beach Volleyball player and Snowboarder Who We Are Alessandro Benedetti
  • 3. ‣ Born in Viterbo ‣ Hermit Software Engineer ‣ Master in Economy ‣ Programming Passionate ‣ RRE Creator ‣ Apache Lucene/Solr Expert ‣ Elasticsearch Expert ‣ Apache Qpid Committer ‣ Father, Husband ‣ Bass player, aspiring (still frustrated at the moment) Chapman Stick player Who We Are Andrea Gazzarini
  • 4. ‣ Headquarter in London/distributed ‣ Open Source Enthusiasts ‣ Apache Lucene/Solr experts ‣ Elasticsearch experts ‣ Community Contributors ‣ Active Researchers ‣ Hot Trends : Learning To Rank, Document Similarity, Search Quality Evaluation, Relevancy Tuning Search Services www.sease.io
  • 6. Agenda RRE Open Source RRE Enterprise (RREE) Query Discovery Rating Generation Explore Evaluation Results
  • 7. Agenda RRE Open Source RRE Enterprise (RREE) Query Discovery Rating Generation Explore Evaluation Results
  • 8. ‣ Open Source library for Search Quality Evaluation ‣ Ratings are expected in input (Json supported) ‣ Many offline metrics available out-of-the-box (Precision@k, NDCG@k, F-Measure…) ‣ Apache Solr and Elasticsearch support ‣ Development-centric approach ‣ Evaluation on the fly and results in various formats ‣ Community building up! ‣ RRE-User mailing list: https://groups.google.com/g/rre-user ‣ https://github.com/SeaseLtd/rated-ranking-evaluator Rated Ranking Evaluator : RRE Open Source What is it ?
  • 9. Rated Ranking Evaluator : The Genesis 2018 Search Consultancy Project A customer explicitly asked for a rudimental search quality evaluation tool while we were working on its search infrastructure. Jun Search Quality Evaluation A Developer Perspective 0.9 Search Quality Evaluation Tools And Techniques Oct 1.0 to be continued... mumble mumble
  • 10. Rated Ranking Evaluator: The Idea RRE has been thought as a development tool that executes search quality evaluations as part of a project build process. It’s like a read–eval–print loop (REPL) tied on top of an Information Retrieval subsystem that encourages an incremental / iterative approach. The underlying idea New system Existing system Here are the requirements Ok V1.0 has been released Cool! a month later… We have a change request. We found a bug We need to improve our search system, users are complaining about junk in search results. Ok v0.1 … v0.9 v1.1 v1.2 v1.3 … v2.0 v2.0 In terms of retrieval effectiveness, how can we know the system performance across various versions?
  • 11. Rated Ranking Evaluator: Domain Model RRE Domain Model is organized into a composite / tree-like structure where the relationships between entities are always 1 to many. The top level entity is a placeholder representing an evaluation execution. Versioned metrics are computed at query level and then reported, using an aggregation function, at upper levels. The benefit of having a composite structure is clear: we can see a metric value at different levels (e.g. a query, all queries belonging to a query group, all queries belonging to a topic or at corpus level) Domain Model Evaluation Corpus 1..* v1.0 P@10 NDCG AP F-MEASURE …. v1.1 P@10 NDCG AP F-MEASURE …. v1.2 P@10 NDCG AP F-MEASURE …. v1.n P@10 NDCG AP F-MEASURE …. Topic Query Group Query 1..* 1..* 1..* … Top level domain entity Test dataset / collection Information need Query variants Queries
  • 12. Rated Ranking Evaluator : How it works Data Configuration Ratings Search Platform uses a produces Evaluation Data INPUT LAYER EVALUATION LAYER OUTPUT LAYER JSON RRE Console … used for generating
  • 13. Explainability - Why is important in Information Retrieval? Dev, tune & Build Check evaluation results We are thinking about how to fill a third monitor
  • 14. Agenda RRE Open Source RRE Enterprise (RREE) Query Discovery Rating Generation Explore Evaluation Results
  • 15. RRE Enterprise : The Genesis 2018 Search Consultancy Project A customer explicitly asked for a rudimental search quality evaluation tool while we were working on its search infrastructure. 2019 Rated Ranking Evaluator An Open Source Approach to be continued... The first sketches depicting an idea about an enterprise-level version of RRE. The development starts few months later.
  • 16. Rated Ranking Evaluator : The Genesis 2018 Search Consultancy Project A customer explicitly asked for a rudimental search quality evaluation tool while we were working on its search infrastructure. 2019 to be continued... 2020 2021 JBCP ID Discovery Query Discovery
  • 17. Agenda RRE Open Source RRE Enterprise (RREE) Query Discovery Rating Generation Explore Evaluation Results
  • 18. RRE Open Source Recap: How it works (½) Data Configuration Ratings targets produces Evaluation Data INPUT LAYER EVALUATION LAYER OUTPUT LAYER JSON RRE Console … used for generating OR
  • 19. RRE Open Source Recap: How it works (2/2) Data Configuration Ratings targets produces Evaluation Data INPUT LAYER EVALUATION LAYER OUTPUT LAYER JSON RRE Console … used for generating OR
  • 20. RRE Enterprise: Query Discovery? API Problem: I have an intermediate Search-API that builds complex Apache Solr/Elasticsearch queries
  • 21. RRE Open Source: No Query Discovery targets OR 1 Evaluation Data produces 2
  • 22. Query Discovery RRE Enterprise: Query Discovery & Evaluation 2 Evaluation Data produces 3 targets OR {SEARCH API} 1 correlates on targets
  • 23. Input Rating: RRE Open Source vs RREE RRE Enterprise RRE Open Source Topic Query Group Query Information need Query variant (Search Engine) Query + 3 Rated Document Topic Query Group API Request Information need Query variant (Search API) Request + 3 Rated Document Query (Search Engine) Query
  • 24. Agenda RRE Open Source RRE Enterprise (RREE) Query Discovery Rating Generation Explore Evaluation Results
  • 25. RRE Enterprise: Rating Generation A fundamental requirement of offline search quality evaluation is to gather <query,document,rating> triples that represent the relevance(rating) of a document given a user information need(query). Before assessing the retrieval effectiveness of a system it is necessary to associate a relevance rating to each pair <query, document> involved in our evaluation.
  • 26. RRE Enterprise: Explicit Ratings (1/2) • Explicitly provided by domain experts • High accuracy • High effort / time / resources • RRE Open Source accepts only explicit ratings Explicit Ratings RRE Rating Structure Topic Query Group Query Information need Query variant Query + 3 Rated Document Question: how to minimize the relevant effort required to provide explicit ratings?
  • 27. RRE Enterprise: Explicit Ratings (2/2) • Chrome plugin which applies an evaluation layer on top of an arbitrary website • Ratings are generated using directly the customer website • Users Lowest learning curve • Generated ratings are sent to RREE through a specific endpoint • An “ID Discovery” component translates the received data (rated web items) in RRE Ratings (rated Solr/Elasticsearch documents) Judgment Collector
  • 28. Implicit Feedback - Click { "collection": "papers", "query": "interleaving", "blackBoxQueryRequest": "GET /query?q=interleaving HTTP/1.1rnHost: localhost:5063", "documentId": "1", "click": 1, "timestamp": "2021-06-23T16:10:49Z", "queryDocumentPosition": 0 } Click http://vps-933d20b7.vps.ovh.net:8080/1.0 /rre-enterprise-api/input-api/interaction Search Result Page
  • 29. Implicit Feedback - Add To Cart { "collection": "papers", "query": "interleaving", "blackBoxQueryRequest": "GET /query?q=interleaving HTTP/1.1rnHost: localhost:5063", "documentId": "1", "addToCart": 1, "timestamp": "2021-06-23T16:10:49Z", "queryDocumentPosition": 0 } Add To Cart http://vps-933d20b7.vps.ovh.net:8080/1.0 /rre-enterprise-api/input-api/interaction Search Result Page
  • 30. Implicit Feedback - Sale { "collection": "papers", "query": "interleaving", "blackBoxQueryRequest": "GET /query?q=interleaving HTTP/1.1rnHost: localhost:5063", "documentId": "1", "sale": 1, "timestamp": "2021-06-23T16:10:49Z", "queryDocumentPosition": 0 } Sale http://vps-933d20b7.vps.ovh.net:8080/1.0 /rre-enterprise-api/input-api/interaction Search Result Page
  • 31. Implicit Feedback - Revenue { "collection": "papers", "query": "interleaving", "blackBoxQueryRequest": "GET /query?q=interleaving HTTP/1.1rnHost: localhost:5063", "documentId": "1", "revenue": 100, "timestamp": "2021-06-23T16:10:49Z", "queryDocumentPosition": 0 } Revenue http://vps-933d20b7.vps.ovh.net:8080/1.0 /rre-enterprise-api/input-api/interaction Search Result Page
  • 32. Implicit Feedback - Storage { "collection": "papers", "query": "interleaving", "blackBoxQueryRequest": "GET /query?q=interleaving HTTP/1.1rnHost: localhost:5063", "documentId": "1", "revenue": 100, "timestamp": "2021-06-23T16:10:49Z", "queryDocumentPosition": 0 } http://vps-933d20b7.vps.ovh.net:8080/1.0 /rre-enterprise-api/input-api/interaction Interactions
  • 33. Implicit Feedback - Calculate Online Metrics Interactions <query,document>
  • 34. Implicit Feedback - Estimate Relevance Global Max: 1 Local Max: [0...1] Global Min: 0 Local Min: [0...1] Simple Click Model 0 1 2 3 4 Rating Metric Score 0 1 0.5
  • 35. Agenda RRE Open Source RRE Enterprise (RREE) Query Discovery Rating Generation Explore Evaluation Results
  • 36. ‣ UI using React library ‣ Configuration support ‣ Navigation of Evaluation Results ‣ Overview - quick view for Business Stakeholders ‣ Explore (an evaluation) - deep view for Software Engineers ‣ Compare (two evaluations) - deep comparison for Software Engineers Explore Evaluation Results: UI
  • 37. Evaluation Results - Overview https://rre-enterprise.netlify.app/overview
  • 38. Evaluation Results - Overview Expanded
  • 39. Evaluation Results - Overview Zoom
  • 41. Evaluation Results - Explore Query Info
  • 42. Evaluation Results - Explore Query Info 2
  • 43. Evaluation Results - Compare (1/3)
  • 44. Evaluation Results - Compare (2/3)
  • 45. Evaluation Results - Compare (3/3)
  • 46. Evaluation Results - Compare Query Info
  • 47. ‣ Release with free usage plan ‣ Configuration support ‣ Support for multimedia document properties ‣ Intelligent insights on weak performing queries, groups, topics ‣ Improvements on Click Modelling for Implicit Relevance estimation RRE Enterprise: Future Work