SlideShare a Scribd company logo
1 of 53
Download to read offline
Encores? - Going beyond
matching and ranking of
search results
Berlin Buzzwords 2021
Eric Pugh, René Kriegler
Who we are
@renekrie @dep4b
Combined 30 years of
experience in search
Open Source enthusiasts
ASF member, Committers on:
Solr, Querqy, SMUI, Quepid,
https://mices.co
Search - beyond matching and ranking
We tend to focus on matching and ranking. Other search features are almost
treated like an afterthought, like ‘encores’ that follow the main performance:
● Facets
● Query auto-completion
● Spelling correction
● Query relaxation
=> BUT: These are essential features that help the user formulate the query,
understand and narrow down the results
Our main act today (not encores!)
● Facets
● Query auto-completion
● Spelling correction
● Query relaxation
Learn about solutions that come out of the box (in Solr)
Typical challenges and how to overcome them
Advanced solutions: understand the concepts, create your own
The Art of Facets
Facets help the user ...
● understand the search results (see ‘what is there’, learn about the domain)
● narrow down search results
Chorus Electronics Project: https://github.com/querqy/chorus
Try the Demo Ecommerce Shop: http://chorus.dev.o19s.com:4000/
Facets help the user ...
● understand the search results (see ‘what is there’, learn about the domain)
● narrow down search results
Challenges
Getting the counts right in e-commerce search
Showing the best facets in the best order
Selecting the facet values to show
“Qui numerare incipit errare incipit”
Facet Counts
Facets and filters
A trivial example:
query=t-shirts
filter=color:black
Challenge: We still need to count all colours in the facets, even if the search result
contains only black t-shirts
Solution: Tagging and exclusion of filters
Facets and filters: tagging and exclusion
Tagging:
fq={!tag=f_color}color:black
Exclusion:
Facet param
facet.field={!ex=f_color}color
JSON facets
"facet": {
"color": {
"type": "terms",
"field": "color",
"domain": {
"excludeTags":"f_color"
}
}
}
Challenge: product variants
Product ID: 9739, brand: “inteemate”
Size: XS
Price: 11.99
Size: XL
Price: 11.99
Size: S
Price: 12.99
Size: S
Price: 12.99
Size: M
Price: 13.99
Size: L
Price: 13.99
Challenge: product variants
Product ID: 9739, brand: “inteemate”
Size: XS
Price: 11.99
Size: XL
Price: 11.99
Size: S
Price: 12.99
Size: S
Price: 12.99
Size: M
Price: 13.99
Size: L
Price: 13.99
color: [green, yellow, blue]
size: [XS, S, M, L, XL]
price: [11.99, 12.99, 13.99]
Merge into single document??
Facets would work great but
false matches for filter color:green AND size:M
Challenge: product variants
Best solution (in our opinion):
● Index one document for each variant
● Group variants at query time using the collapse query parser:
fq={!collapse field=productId}
=> Boolean filters work as expected
=> Great flexibility for counting facets
=> Fast enough
Challenge: product variants
Size: XS
Price: 11.99
Product: 9739
Brand: inteemate
Size: XL
Price: 11.99
Product: 9739
Brand: inteemate
Size: S
Price: 12.99
Product: 9739
Brand: inteemate
Size: S
Price: 12.99
Product: 9739
Brand: inteemate
Size: M
Price: 13.99
Product: 9739
Brand: inteemate
Size: L
Price: 13.99
Product: 9739
Brand: inteemate
filter query
fq=brand:inteemate
Challenge: product variants
Size: XS
Price: 11.99
Product: 9739
Brand: inteemate
Size: XL
Price: 11.99
Product: 9739
Brand: inteemate
Size: S
Price: 12.99
Product: 9739
Brand: inteemate
Size: S
Price: 12.99
Product: 9739
Brand: inteemate
Size: M
Price: 13.99
Product: 9739
Brand: inteemate
Size: L
Price: 13.99
Product: 9739
Brand: inteemate
filter query
fq=brand:inteemate
filter query
fq=color:blue
Challenge: product variants
Size: XS
Price: 11.99
Product: 9739
Brand: inteemate
Size: XL
Price: 11.99
Product: 9739
Brand: inteemate
Size: S
Price: 12.99
Product: 9739
Brand: inteemate
Size: S
Price: 12.99
Product: 9739
Brand: inteemate
Size: M
Price: 13.99
Product: 9739
Brand: inteemate
Size: L
Price: 13.99
Product: 9739
Brand: inteemate
filter query
fq=brand:inteemate
filter query
fq=color:blue
query
q=t-shirt
Challenge: product variants
Size: XS
Price: 11.99
Product: 9739
Brand: inteemate
Size: XL
Price: 11.99
Product: 9739
Brand: inteemate
Size: S
Price: 12.99
Product: 9739
Brand: inteemate
Size: S
Price: 12.99
Product: 9739
Brand: inteemate
Size: M
Price: 13.99
Product: 9739
Brand: inteemate
Size: L
Price: 13.99
Product: 9739
Brand: inteemate
filter query
fq=brand:inteemate
filter query
fq=color:blue
query
q=t-shirt
post filter query
fq={!collapse
field=productId}
Challenge: product variants in facets
Size: S
Price: 12.99
Product: 9739
Brand: inteemate
Size: M
Price: 13.99
Product: 9739
Brand: inteemate
Size: L
Price: 13.99
Product: 9739
Brand: inteemate
...
post filter query
fq={!collapse
field=productId}
Facet counts will be correct for
product attributes (“brand”)
Challenge: product variants in facets
Size: S
Price: 12.99
Product: 9739
Brand: inteemate
Size: M
Price: 13.99
Product: 9739
Brand: inteemate
Size: L
Price: 13.99
Product: 9739
Brand: inteemate
...
post filter query
fq={!collapse
field=productId
tag=coll}
For facet counts of variant
attributes we’ll have to tag and
exclude collapse filter:
"facet": {
"size": {
"type": "terms",
"field": "size",
"domain": {
"excludeTags":"coll"
}
}
}
Sizes S, M, L all shown as ‘1 result’ in facets ✅
Challenge: product variants in facets
Size: S
Price: 12.99
Product: 9739
Brand: inteemate
Size: M
Price: 13.99
Product: 9739
Brand: inteemate
Size: L
Price: 13.99
Product: 9739
Brand: inteemate
...
post filter query
fq={!collapse
field=productId
tag=coll}
For facet counts of variant
attributes we’ll have to tag and
exclude collapse filter:
"facet": {
"color": {
"type": "terms",
"field": "color",
"domain": {
"excludeTags":"coll"
}
}
}
Sizes S, M, L all shown as ‘1 result’ in facets ✅
Color ‘Blue’ shown as ‘3 results’ in facets ❌
Challenge: product variants in facets
Size: S
Price: 12.99
Product: 9739
Brand: inteemate
Size: M
Price: 13.99
Product: 9739
Brand: inteemate
Size: L
Price: 13.99
Product: 9739
Brand: inteemate
...
post filter query
fq={!collapse
field=productId
tag=coll}
For facet counts of variant
attributes we’ll have to tag and
exclude collapse filter:
"facet": {
"color": {
"type": "terms",
"field": "color",
"domain": {
"excludeTags":"coll"
},
facet: {
"numProducts":"unique(productId)"
}
}
}
Sizes S, M, L all shown as ‘1 result’ in facets ✅
Color ‘Blue’ shown as ‘1 result’ in facets ✅
Collapse query parser - notes on implementation
● Beware of high cardinality of product IDs.
○ If you have 10M different product IDs in your index, the collapse query parser will allocate
heap space for 2 arrays (float/int) x 10M elements (ca. 80 MB) per request!
○ Solution:
■ Many products have just 1 variant. It’s better to leave the productId empty in this case.
■ Combine with nullPolicy=expand, which avoids reserving array space for products
without a productId:
fq={!collapse field=productId nullPolicy=expand}
● All variants of a product must be indexed to the same shard
Facet Selection
Which facets should we show?
Some domains are rich in attributes. For example, electronics could use 10k
different attributes.
Even if we reduced the number of attributes to be used in facets at index time, we
could be left with several hundreds of candidates for facetting.
Building a request for hundreds of facets is not feasible. We’ll show a simple
solution, that will just use the search engine to select facets.
At the other end of the spectrum, you could train a model, that predicts which
facets to show for a given query.
Which facets should we show? - Solution
Index a field multivalued field that holds the names of the facettable fields...
Doc1:
screenSize: 17, ....
facettableFields: [
“screenSize”, “ramGB”, “height”, “width”, ...
]
Doc2:
...
facettableFields: [
“screenSize”, “numHDMIPorts”, “height”, “width”, ...
]
Which facets should we show? - Solution
... and execute an additional, prior facet request on this field. Add the facet values
returned by this request as facet parameters to the main request:
"facet": {
"facettable_fields": {
"type": "terms",
"field": "facettable_fields"
}
}
(query/filter queries are the same like in
the ‘main request’)
"facets":{
"facettable_fields":{
"buckets":[{
"val":"screenSize",
"count":12},
{
"val":"ramGB",
"count":4},
{
"val":"height",
"count":3},
]
}
}
Which facets should we show? - Solution
Index additional information together with the names of facettable fields
facettableFields: [
"00010;screenSize;Screen size ",
"00100;ramGB;Memory (GB) ",
"00005;height;Height ",
"00005;width;Width ",
...]
Importance
(padding makes
values sortable!)
Field name Label
Facet Value Selection
Which facet values?
Category Pills being dynamically included IF the entropy model says they are
meaningful for filtering the data
Shannon’s Entropy Worksheet
https://bit.ly/measure-diversity
(https://en.wikipedia.org/wiki/Entropy_(information_theory))
Auto-completion & spelling correction
Autocompletion - Using a Suggester
<searchComponent name="suggest" class="solr.SuggestComponent">
<lst name="suggester">
<str name="name">mySuggester</str>
<str name="lookupImpl">FuzzyLookupFactory</str>
<str name="suggestAnalyzerFieldType">text_general</str>
<str name="buildOnCommit">true</str>
<str name="field">dictionary</str>
</lst>
</searchComponent>
Experiment with combinations of Lookups & Dictionary implementations.
Spellchecking - Using Solr component
Two flavours: “cofffee --> coffee”, collations: “expresso machine”
-->“espresso machine”
<str name="spellcheck">true</str>
<str name="spellcheck.dictionary">title</str>
<str name="spellcheck.onlyMorePopular">true</str>
<str name="spellcheck.extendedResults">true</str>
<str name="spellcheck.collate">true</str>
<str name="spellcheck.maxCollations">100</str>
<str name="spellcheck.maxCollationTries">5</str>
<str name="spellcheck.count">5</str>
<str name="spellcheck.collateParam.mm">100%</str>
<searchComponent name="suggest" class="solr.SuggestComponent">
<lst name="suggester">
<str name="name">mySuggester</str>
<str name="lookupImpl">FuzzyLookupFactory</str>
<str name="suggestAnalyzerFieldType">text_general</str>
<str name="buildOnCommit">true</str>
<str name="field">dictionary</str>
</lst>
</searchComponent>
Good collations are
what we want!
Autocompletion - using a query index
User enters: ja
Show the best query completions
in the best order!
Using ideas from Joshua Bacher/Christine Bellstedt, Search
Suggestions - The Underestimated Killer Feature of your Online Shop.
Berlin Buzzwords 2018
Autocompletion - using a query index
User enters: ja
Match prefix in field “match”
q=*:*&fq=match:ja
indexed with EdgeNGramFilter,
lowercase, remove accents/ASCII
folding, ...
Optionally index and match
spelling variants (jacket/jakcet)
Autocompletion - using a query index
Sort by “weight desc”
q=*:*&fq=match:ja
&sort=weight desc
Sorting might get slow for short
prefixes if the query index is large -
tag the top N queries for lengths 1 and
2 and add another filter (fewer
matches to sort, nicely cacheable):
q=*:*&fq=match:ja
&sort=weight desc
&fq=top_len_2:true
Autocompletion - using a query index
If two queries have the same
fingerprint, drop the one with the
lower weight
Fingerprint: concatenated sorted,
normalised query tokens
This increases the diversity of the
suggestions.
Autocompletion - using a query index
Suggest the Labels as query
completions!
Autocompletion - using a query index
You can show the best matching
category for disambiguation and
affirmation:
* jacket
* jacket in Fashion
Spelling correction - using a query index
Structure similar to query index for
autocompletion
Copy of the ‘match’ field indexed
as n-grams
Spelling correction - using a query index
Filters on edit distance and rank
based on n-grams (via TF*IDF)
q=jakc jones
&defType=edismax
&qf=match_ngram
&sow=false
&fq=match:jakc jones~2
Spelling correction - using a query index
Add boost by weight (or a function
of it)
q=jakc jones
&defType=edismax
&qf=match_ngram
&sow=false
&fq=match:jakc jones~2
&boost=weight
General model for spelling correction &
autocompletion
Noisy Channel Model / Bayesian Inference
(Kernighan et. al., 1990; Jurafsky & Martin, 2009)
Our ‘Weight’ field
Edit distance, n-gram
model, keyboard layout
(Symspell!), ... prefix
match for autocompletion
Query relaxation
Query relaxation
Which query term should we drop if we can’t match all of them together?
jacket xs green
jacket xs green
jacket xs green
jacket xs green
iphone 12
iphone 12
iphone 12
Query relaxation - ‘mm’ anti-pattern
Loosening ‘minimum should match’ (mm) constraint to < 100%
iphone 12
iphone 12
You’ll get matches for “12”
She will just see probably imprecise results that don’t match her
query exactly.
You cannot tell the user what happened and which term you
dropped. She wouldn’t know what to do in order to improve the
query.
Don’t do this!
At least not in e-commerce search
Query relaxation - Solutions
René.Kriegler, Query Relaxation - a rewriting technique
between search and recommendations. Haystack
Conference 2019
Query relaxation - Solutions
Try searching with each term individually and drop the one from
the query that yields the fewest results (might require additional
rules to avoid just keeping number terms)
Query relaxation - Solutions
Multi-layer Neural Network,
Word embeddings as input to represent terms
Encores?
Facets, autocompletion, spelling correction, query relaxation are important
features of a search application.
We’ve shown simple out-of-the-box solutions and a path to implement more
advanced approaches.
Thank you!

More Related Content

What's hot

The Apache Solr Semantic Knowledge Graph
The Apache Solr Semantic Knowledge GraphThe Apache Solr Semantic Knowledge Graph
The Apache Solr Semantic Knowledge GraphTrey Grainger
 
Extreme Apache Spark: how in 3 months we created a pipeline that can process ...
Extreme Apache Spark: how in 3 months we created a pipeline that can process ...Extreme Apache Spark: how in 3 months we created a pipeline that can process ...
Extreme Apache Spark: how in 3 months we created a pipeline that can process ...Josef A. Habdank
 
Elasticsearch for Data Analytics
Elasticsearch for Data AnalyticsElasticsearch for Data Analytics
Elasticsearch for Data AnalyticsFelipe
 
Apache Spark Data Source V2 with Wenchen Fan and Gengliang Wang
Apache Spark Data Source V2 with Wenchen Fan and Gengliang WangApache Spark Data Source V2 with Wenchen Fan and Gengliang Wang
Apache Spark Data Source V2 with Wenchen Fan and Gengliang WangDatabricks
 
Elasticsearch Tutorial | Getting Started with Elasticsearch | ELK Stack Train...
Elasticsearch Tutorial | Getting Started with Elasticsearch | ELK Stack Train...Elasticsearch Tutorial | Getting Started with Elasticsearch | ELK Stack Train...
Elasticsearch Tutorial | Getting Started with Elasticsearch | ELK Stack Train...Edureka!
 
An Introduction to Elastic Search.
An Introduction to Elastic Search.An Introduction to Elastic Search.
An Introduction to Elastic Search.Jurriaan Persyn
 
CS6007 information retrieval - 5 units notes
CS6007   information retrieval - 5 units notesCS6007   information retrieval - 5 units notes
CS6007 information retrieval - 5 units notesAnandh Arumugakan
 
Large-Scale Text Processing Pipeline with Spark ML and GraphFrames: Spark Sum...
Large-Scale Text Processing Pipeline with Spark ML and GraphFrames: Spark Sum...Large-Scale Text Processing Pipeline with Spark ML and GraphFrames: Spark Sum...
Large-Scale Text Processing Pipeline with Spark ML and GraphFrames: Spark Sum...Spark Summit
 
Introduction to Elasticsearch with basics of Lucene
Introduction to Elasticsearch with basics of LuceneIntroduction to Elasticsearch with basics of Lucene
Introduction to Elasticsearch with basics of LuceneRahul Jain
 
ElasticSearch Basic Introduction
ElasticSearch Basic IntroductionElasticSearch Basic Introduction
ElasticSearch Basic IntroductionMayur Rathod
 
Deep Dive Into Elasticsearch
Deep Dive Into ElasticsearchDeep Dive Into Elasticsearch
Deep Dive Into ElasticsearchKnoldus Inc.
 
OCR with MXNet Gluon
OCR with MXNet GluonOCR with MXNet Gluon
OCR with MXNet GluonApache MXNet
 
Data Mining: Mining ,associations, and correlations
Data Mining: Mining ,associations, and correlationsData Mining: Mining ,associations, and correlations
Data Mining: Mining ,associations, and correlationsDatamining Tools
 
Elasticsearch development case
Elasticsearch development caseElasticsearch development case
Elasticsearch development case일규 최
 
Inside the InfluxDB storage engine
Inside the InfluxDB storage engineInside the InfluxDB storage engine
Inside the InfluxDB storage engineInfluxData
 
Parquet Strata/Hadoop World, New York 2013
Parquet Strata/Hadoop World, New York 2013Parquet Strata/Hadoop World, New York 2013
Parquet Strata/Hadoop World, New York 2013Julien Le Dem
 
Elasticsearch presentation 1
Elasticsearch presentation 1Elasticsearch presentation 1
Elasticsearch presentation 1Maruf Hassan
 
Query DSL In Elasticsearch
Query DSL In ElasticsearchQuery DSL In Elasticsearch
Query DSL In ElasticsearchKnoldus Inc.
 

What's hot (20)

The Apache Solr Semantic Knowledge Graph
The Apache Solr Semantic Knowledge GraphThe Apache Solr Semantic Knowledge Graph
The Apache Solr Semantic Knowledge Graph
 
Extreme Apache Spark: how in 3 months we created a pipeline that can process ...
Extreme Apache Spark: how in 3 months we created a pipeline that can process ...Extreme Apache Spark: how in 3 months we created a pipeline that can process ...
Extreme Apache Spark: how in 3 months we created a pipeline that can process ...
 
Elasticsearch for Data Analytics
Elasticsearch for Data AnalyticsElasticsearch for Data Analytics
Elasticsearch for Data Analytics
 
Apache Spark Data Source V2 with Wenchen Fan and Gengliang Wang
Apache Spark Data Source V2 with Wenchen Fan and Gengliang WangApache Spark Data Source V2 with Wenchen Fan and Gengliang Wang
Apache Spark Data Source V2 with Wenchen Fan and Gengliang Wang
 
Elasticsearch Tutorial | Getting Started with Elasticsearch | ELK Stack Train...
Elasticsearch Tutorial | Getting Started with Elasticsearch | ELK Stack Train...Elasticsearch Tutorial | Getting Started with Elasticsearch | ELK Stack Train...
Elasticsearch Tutorial | Getting Started with Elasticsearch | ELK Stack Train...
 
An Introduction to Elastic Search.
An Introduction to Elastic Search.An Introduction to Elastic Search.
An Introduction to Elastic Search.
 
Elasticsearch
ElasticsearchElasticsearch
Elasticsearch
 
CS6007 information retrieval - 5 units notes
CS6007   information retrieval - 5 units notesCS6007   information retrieval - 5 units notes
CS6007 information retrieval - 5 units notes
 
Large-Scale Text Processing Pipeline with Spark ML and GraphFrames: Spark Sum...
Large-Scale Text Processing Pipeline with Spark ML and GraphFrames: Spark Sum...Large-Scale Text Processing Pipeline with Spark ML and GraphFrames: Spark Sum...
Large-Scale Text Processing Pipeline with Spark ML and GraphFrames: Spark Sum...
 
Introduction to Elasticsearch with basics of Lucene
Introduction to Elasticsearch with basics of LuceneIntroduction to Elasticsearch with basics of Lucene
Introduction to Elasticsearch with basics of Lucene
 
ElasticSearch Basic Introduction
ElasticSearch Basic IntroductionElasticSearch Basic Introduction
ElasticSearch Basic Introduction
 
Deep Dive Into Elasticsearch
Deep Dive Into ElasticsearchDeep Dive Into Elasticsearch
Deep Dive Into Elasticsearch
 
OCR with MXNet Gluon
OCR with MXNet GluonOCR with MXNet Gluon
OCR with MXNet Gluon
 
Data Mining: Mining ,associations, and correlations
Data Mining: Mining ,associations, and correlationsData Mining: Mining ,associations, and correlations
Data Mining: Mining ,associations, and correlations
 
Elasticsearch
ElasticsearchElasticsearch
Elasticsearch
 
Elasticsearch development case
Elasticsearch development caseElasticsearch development case
Elasticsearch development case
 
Inside the InfluxDB storage engine
Inside the InfluxDB storage engineInside the InfluxDB storage engine
Inside the InfluxDB storage engine
 
Parquet Strata/Hadoop World, New York 2013
Parquet Strata/Hadoop World, New York 2013Parquet Strata/Hadoop World, New York 2013
Parquet Strata/Hadoop World, New York 2013
 
Elasticsearch presentation 1
Elasticsearch presentation 1Elasticsearch presentation 1
Elasticsearch presentation 1
 
Query DSL In Elasticsearch
Query DSL In ElasticsearchQuery DSL In Elasticsearch
Query DSL In Elasticsearch
 

Similar to Encores

Strategies for using alternative queries to mitigate zero results
Strategies for using alternative queries to mitigate zero resultsStrategies for using alternative queries to mitigate zero results
Strategies for using alternative queries to mitigate zero resultsJean Silva
 
Feature surfacing - meetup
Feature surfacing  - meetupFeature surfacing  - meetup
Feature surfacing - meetupPredicSis
 
Product Line And Product Development Exercises
Product Line And Product Development ExercisesProduct Line And Product Development Exercises
Product Line And Product Development ExercisesRoland Frasier
 
The product is not "the product". Who owns it anyway?
The product is not "the product". Who owns it anyway? The product is not "the product". Who owns it anyway?
The product is not "the product". Who owns it anyway? donato mangialardo
 
Design handoffs with design systems in mind
Design handoffs with design systems in mindDesign handoffs with design systems in mind
Design handoffs with design systems in mindRoland Lösslein
 
Make money with industrial design. Jukola7 presentation 2014
Make money with industrial design. Jukola7 presentation 2014Make money with industrial design. Jukola7 presentation 2014
Make money with industrial design. Jukola7 presentation 2014Lauri Aaltio
 
QM-008-Design for Six Sigma 1
QM-008-Design for Six Sigma 1QM-008-Design for Six Sigma 1
QM-008-Design for Six Sigma 1handbook
 
Applying NLP to product comparison at visual meta
Applying NLP to product comparison at visual metaApplying NLP to product comparison at visual meta
Applying NLP to product comparison at visual metaRoss Turner
 
Retail referencearchitecture productcatalog
Retail referencearchitecture productcatalogRetail referencearchitecture productcatalog
Retail referencearchitecture productcatalogMongoDB
 
Samples of Database and Website Design
Samples of Database and Website DesignSamples of Database and Website Design
Samples of Database and Website DesignSherri Orwick Ogden
 
Writing a specification
Writing a specificationWriting a specification
Writing a specificationDavid Hassett
 
Boosting Product Categorization with Machine Learning
Boosting Product Categorization with Machine LearningBoosting Product Categorization with Machine Learning
Boosting Product Categorization with Machine LearningAmadeus Magrabi
 
Just ask: Designing intent-driven algos
Just ask: Designing intent-driven algosJust ask: Designing intent-driven algos
Just ask: Designing intent-driven algosAnna Schneider
 
Building complex UI on Android
Building complex UI on AndroidBuilding complex UI on Android
Building complex UI on AndroidMaciej Witowski
 
4.Data-Visualization.pptx
4.Data-Visualization.pptx4.Data-Visualization.pptx
4.Data-Visualization.pptxPratyushJain37
 
Marketing Analytics with R Lifting Campaign Success Rates
Marketing Analytics with R Lifting Campaign Success RatesMarketing Analytics with R Lifting Campaign Success Rates
Marketing Analytics with R Lifting Campaign Success RatesRevolution Analytics
 
New Principles for Digital Experiences That Perform
New Principles for Digital Experiences That PerformNew Principles for Digital Experiences That Perform
New Principles for Digital Experiences That PerformOptimizely
 

Similar to Encores (20)

Strategies for using alternative queries to mitigate zero results
Strategies for using alternative queries to mitigate zero resultsStrategies for using alternative queries to mitigate zero results
Strategies for using alternative queries to mitigate zero results
 
Feature surfacing - meetup
Feature surfacing  - meetupFeature surfacing  - meetup
Feature surfacing - meetup
 
Product Line And Product Development Exercises
Product Line And Product Development ExercisesProduct Line And Product Development Exercises
Product Line And Product Development Exercises
 
The product is not "the product". Who owns it anyway?
The product is not "the product". Who owns it anyway? The product is not "the product". Who owns it anyway?
The product is not "the product". Who owns it anyway?
 
Design handoffs with design systems in mind
Design handoffs with design systems in mindDesign handoffs with design systems in mind
Design handoffs with design systems in mind
 
Make money with industrial design. Jukola7 presentation 2014
Make money with industrial design. Jukola7 presentation 2014Make money with industrial design. Jukola7 presentation 2014
Make money with industrial design. Jukola7 presentation 2014
 
QM-008-Design for Six Sigma 1
QM-008-Design for Six Sigma 1QM-008-Design for Six Sigma 1
QM-008-Design for Six Sigma 1
 
Applying NLP to product comparison at visual meta
Applying NLP to product comparison at visual metaApplying NLP to product comparison at visual meta
Applying NLP to product comparison at visual meta
 
Retail referencearchitecture productcatalog
Retail referencearchitecture productcatalogRetail referencearchitecture productcatalog
Retail referencearchitecture productcatalog
 
Samples of Database and Website Design
Samples of Database and Website DesignSamples of Database and Website Design
Samples of Database and Website Design
 
Writing a specification
Writing a specificationWriting a specification
Writing a specification
 
DC Group
DC Group DC Group
DC Group
 
Boosting Product Categorization with Machine Learning
Boosting Product Categorization with Machine LearningBoosting Product Categorization with Machine Learning
Boosting Product Categorization with Machine Learning
 
Just ask: Designing intent-driven algos
Just ask: Designing intent-driven algosJust ask: Designing intent-driven algos
Just ask: Designing intent-driven algos
 
Vb introduction.
Vb introduction.Vb introduction.
Vb introduction.
 
SXSW After Party
SXSW After PartySXSW After Party
SXSW After Party
 
Building complex UI on Android
Building complex UI on AndroidBuilding complex UI on Android
Building complex UI on Android
 
4.Data-Visualization.pptx
4.Data-Visualization.pptx4.Data-Visualization.pptx
4.Data-Visualization.pptx
 
Marketing Analytics with R Lifting Campaign Success Rates
Marketing Analytics with R Lifting Campaign Success RatesMarketing Analytics with R Lifting Campaign Success Rates
Marketing Analytics with R Lifting Campaign Success Rates
 
New Principles for Digital Experiences That Perform
New Principles for Digital Experiences That PerformNew Principles for Digital Experiences That Perform
New Principles for Digital Experiences That Perform
 

More from OpenSource Connections

How To Structure Your Search Team for Success
How To Structure Your Search Team for SuccessHow To Structure Your Search Team for Success
How To Structure Your Search Team for SuccessOpenSource Connections
 
The right path to making search relevant - Taxonomy Bootcamp London 2019
The right path to making search relevant  - Taxonomy Bootcamp London 2019The right path to making search relevant  - Taxonomy Bootcamp London 2019
The right path to making search relevant - Taxonomy Bootcamp London 2019OpenSource Connections
 
Haystack 2019 Lightning Talk - The Future of Quepid - Charlie Hull
Haystack 2019 Lightning Talk - The Future of Quepid - Charlie HullHaystack 2019 Lightning Talk - The Future of Quepid - Charlie Hull
Haystack 2019 Lightning Talk - The Future of Quepid - Charlie HullOpenSource Connections
 
Haystack 2019 Lightning Talk - State of Apache Tika - Tim Allison
Haystack 2019 Lightning Talk - State of Apache Tika - Tim AllisonHaystack 2019 Lightning Talk - State of Apache Tika - Tim Allison
Haystack 2019 Lightning Talk - State of Apache Tika - Tim AllisonOpenSource Connections
 
Haystack 2019 Lightning Talk - Relevance on 17 million full text documents - ...
Haystack 2019 Lightning Talk - Relevance on 17 million full text documents - ...Haystack 2019 Lightning Talk - Relevance on 17 million full text documents - ...
Haystack 2019 Lightning Talk - Relevance on 17 million full text documents - ...OpenSource Connections
 
Haystack 2019 Lightning Talk - Solr Cloud on Kubernetes - Manoj Bharadwaj
Haystack 2019 Lightning Talk - Solr Cloud on Kubernetes - Manoj BharadwajHaystack 2019 Lightning Talk - Solr Cloud on Kubernetes - Manoj Bharadwaj
Haystack 2019 Lightning Talk - Solr Cloud on Kubernetes - Manoj BharadwajOpenSource Connections
 
Haystack 2019 Lightning Talk - Quaerite a Search relevance evaluation toolkit...
Haystack 2019 Lightning Talk - Quaerite a Search relevance evaluation toolkit...Haystack 2019 Lightning Talk - Quaerite a Search relevance evaluation toolkit...
Haystack 2019 Lightning Talk - Quaerite a Search relevance evaluation toolkit...OpenSource Connections
 
Haystack 2019 - Search-based recommendations at Politico - Ryan Kohl
Haystack 2019 - Search-based recommendations at Politico - Ryan KohlHaystack 2019 - Search-based recommendations at Politico - Ryan Kohl
Haystack 2019 - Search-based recommendations at Politico - Ryan KohlOpenSource Connections
 
Haystack 2019 - Search with Vectors - Simon Hughes
Haystack 2019 - Search with Vectors - Simon HughesHaystack 2019 - Search with Vectors - Simon Hughes
Haystack 2019 - Search with Vectors - Simon HughesOpenSource Connections
 
Haystack 2019 - Natural Language Search with Knowledge Graphs - Trey Grainger
Haystack 2019 - Natural Language Search with Knowledge Graphs - Trey GraingerHaystack 2019 - Natural Language Search with Knowledge Graphs - Trey Grainger
Haystack 2019 - Natural Language Search with Knowledge Graphs - Trey GraingerOpenSource Connections
 
Haystack 2019 - Search Logs + Machine Learning = Auto-Tagging Inventory - Joh...
Haystack 2019 - Search Logs + Machine Learning = Auto-Tagging Inventory - Joh...Haystack 2019 - Search Logs + Machine Learning = Auto-Tagging Inventory - Joh...
Haystack 2019 - Search Logs + Machine Learning = Auto-Tagging Inventory - Joh...OpenSource Connections
 
Haystack 2019 - Improving Search Relevance with Numeric Features in Elasticse...
Haystack 2019 - Improving Search Relevance with Numeric Features in Elasticse...Haystack 2019 - Improving Search Relevance with Numeric Features in Elasticse...
Haystack 2019 - Improving Search Relevance with Numeric Features in Elasticse...OpenSource Connections
 
Haystack 2019 - Architectural considerations on search relevancy in the conte...
Haystack 2019 - Architectural considerations on search relevancy in the conte...Haystack 2019 - Architectural considerations on search relevancy in the conte...
Haystack 2019 - Architectural considerations on search relevancy in the conte...OpenSource Connections
 
Haystack 2019 - Custom Solr Query Parser Design Option, and Pros & Cons - Ber...
Haystack 2019 - Custom Solr Query Parser Design Option, and Pros & Cons - Ber...Haystack 2019 - Custom Solr Query Parser Design Option, and Pros & Cons - Ber...
Haystack 2019 - Custom Solr Query Parser Design Option, and Pros & Cons - Ber...OpenSource Connections
 
Haystack 2019 - Establishing a relevance focused culture in a large organizat...
Haystack 2019 - Establishing a relevance focused culture in a large organizat...Haystack 2019 - Establishing a relevance focused culture in a large organizat...
Haystack 2019 - Establishing a relevance focused culture in a large organizat...OpenSource Connections
 
Haystack 2019 - Solving for Satisfaction: Introduction to Click Models - Eliz...
Haystack 2019 - Solving for Satisfaction: Introduction to Click Models - Eliz...Haystack 2019 - Solving for Satisfaction: Introduction to Click Models - Eliz...
Haystack 2019 - Solving for Satisfaction: Introduction to Click Models - Eliz...OpenSource Connections
 
2019 Haystack - How The New York Times Tackles Relevance - Jeremiah Via
2019 Haystack - How The New York Times Tackles Relevance - Jeremiah Via2019 Haystack - How The New York Times Tackles Relevance - Jeremiah Via
2019 Haystack - How The New York Times Tackles Relevance - Jeremiah ViaOpenSource Connections
 
Haystack 2019 - Addressing variance in AB tests: Interleaved evaluation of ra...
Haystack 2019 - Addressing variance in AB tests: Interleaved evaluation of ra...Haystack 2019 - Addressing variance in AB tests: Interleaved evaluation of ra...
Haystack 2019 - Addressing variance in AB tests: Interleaved evaluation of ra...OpenSource Connections
 

More from OpenSource Connections (20)

Test driven relevancy
Test driven relevancyTest driven relevancy
Test driven relevancy
 
How To Structure Your Search Team for Success
How To Structure Your Search Team for SuccessHow To Structure Your Search Team for Success
How To Structure Your Search Team for Success
 
The right path to making search relevant - Taxonomy Bootcamp London 2019
The right path to making search relevant  - Taxonomy Bootcamp London 2019The right path to making search relevant  - Taxonomy Bootcamp London 2019
The right path to making search relevant - Taxonomy Bootcamp London 2019
 
Payloads and OCR with Solr
Payloads and OCR with SolrPayloads and OCR with Solr
Payloads and OCR with Solr
 
Haystack 2019 Lightning Talk - The Future of Quepid - Charlie Hull
Haystack 2019 Lightning Talk - The Future of Quepid - Charlie HullHaystack 2019 Lightning Talk - The Future of Quepid - Charlie Hull
Haystack 2019 Lightning Talk - The Future of Quepid - Charlie Hull
 
Haystack 2019 Lightning Talk - State of Apache Tika - Tim Allison
Haystack 2019 Lightning Talk - State of Apache Tika - Tim AllisonHaystack 2019 Lightning Talk - State of Apache Tika - Tim Allison
Haystack 2019 Lightning Talk - State of Apache Tika - Tim Allison
 
Haystack 2019 Lightning Talk - Relevance on 17 million full text documents - ...
Haystack 2019 Lightning Talk - Relevance on 17 million full text documents - ...Haystack 2019 Lightning Talk - Relevance on 17 million full text documents - ...
Haystack 2019 Lightning Talk - Relevance on 17 million full text documents - ...
 
Haystack 2019 Lightning Talk - Solr Cloud on Kubernetes - Manoj Bharadwaj
Haystack 2019 Lightning Talk - Solr Cloud on Kubernetes - Manoj BharadwajHaystack 2019 Lightning Talk - Solr Cloud on Kubernetes - Manoj Bharadwaj
Haystack 2019 Lightning Talk - Solr Cloud on Kubernetes - Manoj Bharadwaj
 
Haystack 2019 Lightning Talk - Quaerite a Search relevance evaluation toolkit...
Haystack 2019 Lightning Talk - Quaerite a Search relevance evaluation toolkit...Haystack 2019 Lightning Talk - Quaerite a Search relevance evaluation toolkit...
Haystack 2019 Lightning Talk - Quaerite a Search relevance evaluation toolkit...
 
Haystack 2019 - Search-based recommendations at Politico - Ryan Kohl
Haystack 2019 - Search-based recommendations at Politico - Ryan KohlHaystack 2019 - Search-based recommendations at Politico - Ryan Kohl
Haystack 2019 - Search-based recommendations at Politico - Ryan Kohl
 
Haystack 2019 - Search with Vectors - Simon Hughes
Haystack 2019 - Search with Vectors - Simon HughesHaystack 2019 - Search with Vectors - Simon Hughes
Haystack 2019 - Search with Vectors - Simon Hughes
 
Haystack 2019 - Natural Language Search with Knowledge Graphs - Trey Grainger
Haystack 2019 - Natural Language Search with Knowledge Graphs - Trey GraingerHaystack 2019 - Natural Language Search with Knowledge Graphs - Trey Grainger
Haystack 2019 - Natural Language Search with Knowledge Graphs - Trey Grainger
 
Haystack 2019 - Search Logs + Machine Learning = Auto-Tagging Inventory - Joh...
Haystack 2019 - Search Logs + Machine Learning = Auto-Tagging Inventory - Joh...Haystack 2019 - Search Logs + Machine Learning = Auto-Tagging Inventory - Joh...
Haystack 2019 - Search Logs + Machine Learning = Auto-Tagging Inventory - Joh...
 
Haystack 2019 - Improving Search Relevance with Numeric Features in Elasticse...
Haystack 2019 - Improving Search Relevance with Numeric Features in Elasticse...Haystack 2019 - Improving Search Relevance with Numeric Features in Elasticse...
Haystack 2019 - Improving Search Relevance with Numeric Features in Elasticse...
 
Haystack 2019 - Architectural considerations on search relevancy in the conte...
Haystack 2019 - Architectural considerations on search relevancy in the conte...Haystack 2019 - Architectural considerations on search relevancy in the conte...
Haystack 2019 - Architectural considerations on search relevancy in the conte...
 
Haystack 2019 - Custom Solr Query Parser Design Option, and Pros & Cons - Ber...
Haystack 2019 - Custom Solr Query Parser Design Option, and Pros & Cons - Ber...Haystack 2019 - Custom Solr Query Parser Design Option, and Pros & Cons - Ber...
Haystack 2019 - Custom Solr Query Parser Design Option, and Pros & Cons - Ber...
 
Haystack 2019 - Establishing a relevance focused culture in a large organizat...
Haystack 2019 - Establishing a relevance focused culture in a large organizat...Haystack 2019 - Establishing a relevance focused culture in a large organizat...
Haystack 2019 - Establishing a relevance focused culture in a large organizat...
 
Haystack 2019 - Solving for Satisfaction: Introduction to Click Models - Eliz...
Haystack 2019 - Solving for Satisfaction: Introduction to Click Models - Eliz...Haystack 2019 - Solving for Satisfaction: Introduction to Click Models - Eliz...
Haystack 2019 - Solving for Satisfaction: Introduction to Click Models - Eliz...
 
2019 Haystack - How The New York Times Tackles Relevance - Jeremiah Via
2019 Haystack - How The New York Times Tackles Relevance - Jeremiah Via2019 Haystack - How The New York Times Tackles Relevance - Jeremiah Via
2019 Haystack - How The New York Times Tackles Relevance - Jeremiah Via
 
Haystack 2019 - Addressing variance in AB tests: Interleaved evaluation of ra...
Haystack 2019 - Addressing variance in AB tests: Interleaved evaluation of ra...Haystack 2019 - Addressing variance in AB tests: Interleaved evaluation of ra...
Haystack 2019 - Addressing variance in AB tests: Interleaved evaluation of ra...
 

Recently uploaded

Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 

Recently uploaded (20)

Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 

Encores

  • 1. Encores? - Going beyond matching and ranking of search results Berlin Buzzwords 2021 Eric Pugh, René Kriegler
  • 2. Who we are @renekrie @dep4b Combined 30 years of experience in search Open Source enthusiasts ASF member, Committers on: Solr, Querqy, SMUI, Quepid,
  • 4. Search - beyond matching and ranking We tend to focus on matching and ranking. Other search features are almost treated like an afterthought, like ‘encores’ that follow the main performance: ● Facets ● Query auto-completion ● Spelling correction ● Query relaxation => BUT: These are essential features that help the user formulate the query, understand and narrow down the results
  • 5. Our main act today (not encores!) ● Facets ● Query auto-completion ● Spelling correction ● Query relaxation Learn about solutions that come out of the box (in Solr) Typical challenges and how to overcome them Advanced solutions: understand the concepts, create your own
  • 6. The Art of Facets
  • 7. Facets help the user ... ● understand the search results (see ‘what is there’, learn about the domain) ● narrow down search results Chorus Electronics Project: https://github.com/querqy/chorus Try the Demo Ecommerce Shop: http://chorus.dev.o19s.com:4000/
  • 8. Facets help the user ... ● understand the search results (see ‘what is there’, learn about the domain) ● narrow down search results
  • 9. Challenges Getting the counts right in e-commerce search Showing the best facets in the best order Selecting the facet values to show
  • 10. “Qui numerare incipit errare incipit” Facet Counts
  • 11. Facets and filters A trivial example: query=t-shirts filter=color:black Challenge: We still need to count all colours in the facets, even if the search result contains only black t-shirts Solution: Tagging and exclusion of filters
  • 12. Facets and filters: tagging and exclusion Tagging: fq={!tag=f_color}color:black Exclusion: Facet param facet.field={!ex=f_color}color JSON facets "facet": { "color": { "type": "terms", "field": "color", "domain": { "excludeTags":"f_color" } } }
  • 13. Challenge: product variants Product ID: 9739, brand: “inteemate” Size: XS Price: 11.99 Size: XL Price: 11.99 Size: S Price: 12.99 Size: S Price: 12.99 Size: M Price: 13.99 Size: L Price: 13.99
  • 14. Challenge: product variants Product ID: 9739, brand: “inteemate” Size: XS Price: 11.99 Size: XL Price: 11.99 Size: S Price: 12.99 Size: S Price: 12.99 Size: M Price: 13.99 Size: L Price: 13.99 color: [green, yellow, blue] size: [XS, S, M, L, XL] price: [11.99, 12.99, 13.99] Merge into single document?? Facets would work great but false matches for filter color:green AND size:M
  • 15. Challenge: product variants Best solution (in our opinion): ● Index one document for each variant ● Group variants at query time using the collapse query parser: fq={!collapse field=productId} => Boolean filters work as expected => Great flexibility for counting facets => Fast enough
  • 16. Challenge: product variants Size: XS Price: 11.99 Product: 9739 Brand: inteemate Size: XL Price: 11.99 Product: 9739 Brand: inteemate Size: S Price: 12.99 Product: 9739 Brand: inteemate Size: S Price: 12.99 Product: 9739 Brand: inteemate Size: M Price: 13.99 Product: 9739 Brand: inteemate Size: L Price: 13.99 Product: 9739 Brand: inteemate filter query fq=brand:inteemate
  • 17. Challenge: product variants Size: XS Price: 11.99 Product: 9739 Brand: inteemate Size: XL Price: 11.99 Product: 9739 Brand: inteemate Size: S Price: 12.99 Product: 9739 Brand: inteemate Size: S Price: 12.99 Product: 9739 Brand: inteemate Size: M Price: 13.99 Product: 9739 Brand: inteemate Size: L Price: 13.99 Product: 9739 Brand: inteemate filter query fq=brand:inteemate filter query fq=color:blue
  • 18. Challenge: product variants Size: XS Price: 11.99 Product: 9739 Brand: inteemate Size: XL Price: 11.99 Product: 9739 Brand: inteemate Size: S Price: 12.99 Product: 9739 Brand: inteemate Size: S Price: 12.99 Product: 9739 Brand: inteemate Size: M Price: 13.99 Product: 9739 Brand: inteemate Size: L Price: 13.99 Product: 9739 Brand: inteemate filter query fq=brand:inteemate filter query fq=color:blue query q=t-shirt
  • 19. Challenge: product variants Size: XS Price: 11.99 Product: 9739 Brand: inteemate Size: XL Price: 11.99 Product: 9739 Brand: inteemate Size: S Price: 12.99 Product: 9739 Brand: inteemate Size: S Price: 12.99 Product: 9739 Brand: inteemate Size: M Price: 13.99 Product: 9739 Brand: inteemate Size: L Price: 13.99 Product: 9739 Brand: inteemate filter query fq=brand:inteemate filter query fq=color:blue query q=t-shirt post filter query fq={!collapse field=productId}
  • 20. Challenge: product variants in facets Size: S Price: 12.99 Product: 9739 Brand: inteemate Size: M Price: 13.99 Product: 9739 Brand: inteemate Size: L Price: 13.99 Product: 9739 Brand: inteemate ... post filter query fq={!collapse field=productId} Facet counts will be correct for product attributes (“brand”)
  • 21. Challenge: product variants in facets Size: S Price: 12.99 Product: 9739 Brand: inteemate Size: M Price: 13.99 Product: 9739 Brand: inteemate Size: L Price: 13.99 Product: 9739 Brand: inteemate ... post filter query fq={!collapse field=productId tag=coll} For facet counts of variant attributes we’ll have to tag and exclude collapse filter: "facet": { "size": { "type": "terms", "field": "size", "domain": { "excludeTags":"coll" } } } Sizes S, M, L all shown as ‘1 result’ in facets ✅
  • 22. Challenge: product variants in facets Size: S Price: 12.99 Product: 9739 Brand: inteemate Size: M Price: 13.99 Product: 9739 Brand: inteemate Size: L Price: 13.99 Product: 9739 Brand: inteemate ... post filter query fq={!collapse field=productId tag=coll} For facet counts of variant attributes we’ll have to tag and exclude collapse filter: "facet": { "color": { "type": "terms", "field": "color", "domain": { "excludeTags":"coll" } } } Sizes S, M, L all shown as ‘1 result’ in facets ✅ Color ‘Blue’ shown as ‘3 results’ in facets ❌
  • 23. Challenge: product variants in facets Size: S Price: 12.99 Product: 9739 Brand: inteemate Size: M Price: 13.99 Product: 9739 Brand: inteemate Size: L Price: 13.99 Product: 9739 Brand: inteemate ... post filter query fq={!collapse field=productId tag=coll} For facet counts of variant attributes we’ll have to tag and exclude collapse filter: "facet": { "color": { "type": "terms", "field": "color", "domain": { "excludeTags":"coll" }, facet: { "numProducts":"unique(productId)" } } } Sizes S, M, L all shown as ‘1 result’ in facets ✅ Color ‘Blue’ shown as ‘1 result’ in facets ✅
  • 24. Collapse query parser - notes on implementation ● Beware of high cardinality of product IDs. ○ If you have 10M different product IDs in your index, the collapse query parser will allocate heap space for 2 arrays (float/int) x 10M elements (ca. 80 MB) per request! ○ Solution: ■ Many products have just 1 variant. It’s better to leave the productId empty in this case. ■ Combine with nullPolicy=expand, which avoids reserving array space for products without a productId: fq={!collapse field=productId nullPolicy=expand} ● All variants of a product must be indexed to the same shard
  • 26. Which facets should we show? Some domains are rich in attributes. For example, electronics could use 10k different attributes. Even if we reduced the number of attributes to be used in facets at index time, we could be left with several hundreds of candidates for facetting. Building a request for hundreds of facets is not feasible. We’ll show a simple solution, that will just use the search engine to select facets. At the other end of the spectrum, you could train a model, that predicts which facets to show for a given query.
  • 27. Which facets should we show? - Solution Index a field multivalued field that holds the names of the facettable fields... Doc1: screenSize: 17, .... facettableFields: [ “screenSize”, “ramGB”, “height”, “width”, ... ] Doc2: ... facettableFields: [ “screenSize”, “numHDMIPorts”, “height”, “width”, ... ]
  • 28. Which facets should we show? - Solution ... and execute an additional, prior facet request on this field. Add the facet values returned by this request as facet parameters to the main request: "facet": { "facettable_fields": { "type": "terms", "field": "facettable_fields" } } (query/filter queries are the same like in the ‘main request’) "facets":{ "facettable_fields":{ "buckets":[{ "val":"screenSize", "count":12}, { "val":"ramGB", "count":4}, { "val":"height", "count":3}, ] } }
  • 29. Which facets should we show? - Solution Index additional information together with the names of facettable fields facettableFields: [ "00010;screenSize;Screen size ", "00100;ramGB;Memory (GB) ", "00005;height;Height ", "00005;width;Width ", ...] Importance (padding makes values sortable!) Field name Label
  • 31. Which facet values? Category Pills being dynamically included IF the entropy model says they are meaningful for filtering the data
  • 34. Autocompletion - Using a Suggester <searchComponent name="suggest" class="solr.SuggestComponent"> <lst name="suggester"> <str name="name">mySuggester</str> <str name="lookupImpl">FuzzyLookupFactory</str> <str name="suggestAnalyzerFieldType">text_general</str> <str name="buildOnCommit">true</str> <str name="field">dictionary</str> </lst> </searchComponent> Experiment with combinations of Lookups & Dictionary implementations.
  • 35. Spellchecking - Using Solr component Two flavours: “cofffee --> coffee”, collations: “expresso machine” -->“espresso machine” <str name="spellcheck">true</str> <str name="spellcheck.dictionary">title</str> <str name="spellcheck.onlyMorePopular">true</str> <str name="spellcheck.extendedResults">true</str> <str name="spellcheck.collate">true</str> <str name="spellcheck.maxCollations">100</str> <str name="spellcheck.maxCollationTries">5</str> <str name="spellcheck.count">5</str> <str name="spellcheck.collateParam.mm">100%</str> <searchComponent name="suggest" class="solr.SuggestComponent"> <lst name="suggester"> <str name="name">mySuggester</str> <str name="lookupImpl">FuzzyLookupFactory</str> <str name="suggestAnalyzerFieldType">text_general</str> <str name="buildOnCommit">true</str> <str name="field">dictionary</str> </lst> </searchComponent> Good collations are what we want!
  • 36. Autocompletion - using a query index User enters: ja Show the best query completions in the best order! Using ideas from Joshua Bacher/Christine Bellstedt, Search Suggestions - The Underestimated Killer Feature of your Online Shop. Berlin Buzzwords 2018
  • 37. Autocompletion - using a query index User enters: ja Match prefix in field “match” q=*:*&fq=match:ja indexed with EdgeNGramFilter, lowercase, remove accents/ASCII folding, ... Optionally index and match spelling variants (jacket/jakcet)
  • 38. Autocompletion - using a query index Sort by “weight desc” q=*:*&fq=match:ja &sort=weight desc Sorting might get slow for short prefixes if the query index is large - tag the top N queries for lengths 1 and 2 and add another filter (fewer matches to sort, nicely cacheable): q=*:*&fq=match:ja &sort=weight desc &fq=top_len_2:true
  • 39. Autocompletion - using a query index If two queries have the same fingerprint, drop the one with the lower weight Fingerprint: concatenated sorted, normalised query tokens This increases the diversity of the suggestions.
  • 40. Autocompletion - using a query index Suggest the Labels as query completions!
  • 41. Autocompletion - using a query index You can show the best matching category for disambiguation and affirmation: * jacket * jacket in Fashion
  • 42. Spelling correction - using a query index Structure similar to query index for autocompletion Copy of the ‘match’ field indexed as n-grams
  • 43. Spelling correction - using a query index Filters on edit distance and rank based on n-grams (via TF*IDF) q=jakc jones &defType=edismax &qf=match_ngram &sow=false &fq=match:jakc jones~2
  • 44. Spelling correction - using a query index Add boost by weight (or a function of it) q=jakc jones &defType=edismax &qf=match_ngram &sow=false &fq=match:jakc jones~2 &boost=weight
  • 45. General model for spelling correction & autocompletion Noisy Channel Model / Bayesian Inference (Kernighan et. al., 1990; Jurafsky & Martin, 2009) Our ‘Weight’ field Edit distance, n-gram model, keyboard layout (Symspell!), ... prefix match for autocompletion
  • 47. Query relaxation Which query term should we drop if we can’t match all of them together? jacket xs green jacket xs green jacket xs green jacket xs green iphone 12 iphone 12 iphone 12
  • 48. Query relaxation - ‘mm’ anti-pattern Loosening ‘minimum should match’ (mm) constraint to < 100% iphone 12 iphone 12 You’ll get matches for “12” She will just see probably imprecise results that don’t match her query exactly. You cannot tell the user what happened and which term you dropped. She wouldn’t know what to do in order to improve the query. Don’t do this! At least not in e-commerce search
  • 49. Query relaxation - Solutions René.Kriegler, Query Relaxation - a rewriting technique between search and recommendations. Haystack Conference 2019
  • 50. Query relaxation - Solutions Try searching with each term individually and drop the one from the query that yields the fewest results (might require additional rules to avoid just keeping number terms)
  • 51. Query relaxation - Solutions Multi-layer Neural Network, Word embeddings as input to represent terms
  • 52. Encores? Facets, autocompletion, spelling correction, query relaxation are important features of a search application. We’ve shown simple out-of-the-box solutions and a path to implement more advanced approaches.