Deep Semantic Similarity Model for Recommendation Systems

Deep Semantic Similarity Model (DSSM) is a deep learning technique that represents text strings in a continuous semantic space to model semantic similarity. DSSM maps queries and documents to feature vectors using a neural network and computes cosine similarity between the vectors to rank documents. DSSM is trained using word hashing, convolutional and max-pooling layers, and negative sampling to maximize the likelihood of clicked documents. Evaluation on search engine queries showed DSSM improved ranking quality over baselines, and it has also been applied to recommendations by modeling similarities across domains.

Deep Semantic Similarity Model
Po-Sen Huang, CIKM 2013
Presenter: Shuai Zhang, CSE, UNSW

Content
Introduction
Framework of DSSM
Training Techniques
DSSM for Recommender System

Introduction
DSSM stands for Deep Structured Semantic Model, also called the Deep Semantic Similarity Model.
It is a deep neural network modelling technique for representing text strings in a continuous semantic space and modelling the semantic similarity between two text strings.
Applications:
• web search ranking
• question answering
• knowledge inference
• image captioning
• machine translation
• recommendation

Introduction
• Image captioning: automatically generating image descriptions
• Recommendation: recommending target documents likely to interest a user, based on the source document she is reading

Motivation
Fuzzy keyword matching
q: cold home remedy
Spelling correction
q: cold remeedies
Query alteration/expansion
q: flu treatment
Query/document semantic matching
q: how to deal with stuffy nose
d: best home remedies for cold and flu

Structure of DSSM
• Compute semantic similarity between two text strings X and Y
• Map X and Y to feature vectors in a latent semantic space via a deep neural network
• Compute the cosine similarity between the feature vectors

Structure of DSSM
1. Get the semantic representation vectors of the two text strings
2. Normalize the two semantic vectors
3. Compute their similarity
4. Use the semantic similarity to rank documents
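The normalize, compare, and rank steps can be sketched in a few lines of Python. This is a minimal illustration, not the authors' code: the vectors below are hypothetical stand-ins for the semantic representations that the neural network would produce, and the function names are assumptions.

```python
import numpy as np

def cosine_similarity(a, b):
    """Normalize both vectors to unit length, then take their dot product."""
    a_unit = a / np.linalg.norm(a)
    b_unit = b / np.linalg.norm(b)
    return float(np.dot(a_unit, b_unit))

def rank_documents(query_vec, doc_vecs):
    """Return document indices ordered from most to least similar, plus the scores."""
    scores = [cosine_similarity(query_vec, d) for d in doc_vecs]
    order = sorted(range(len(doc_vecs)), key=lambda i: scores[i], reverse=True)
    return order, scores

# Hypothetical semantic vectors (in DSSM these come from the network).
query = np.array([1.0, 0.0, 1.0])
docs = [
    np.array([0.0, 1.0, 0.0]),  # orthogonal to the query
    np.array([2.0, 0.0, 2.0]),  # same direction as the query
    np.array([1.0, 1.0, 0.0]),  # partially aligned with the query
]
order, scores = rank_documents(query, docs)
print(order)
```

Because the vectors are normalized first, the document that points in the same direction as the query ranks first regardless of its length.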

Structure of DSSM
Semantic relevance score between a query Q and a document D:
R(Q, D) = cosine(y_Q, y_D) = (y_Q · y_D) / (‖y_Q‖ ‖y_D‖)
x: input term vector
y: output semantic vector
l: hidden layers
f: activation function
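How x, l, f, and y fit together can be sketched as a small feed-forward pass. This is only an illustration under stated assumptions: tanh is assumed for the activation f, the layer sizes are hypothetical and far smaller than a real model, the weights are random rather than trained, and one set of weights is shared between query and document for brevity.

```python
import numpy as np

def forward(x, layers):
    """Apply l_i = f(W_i l_{i-1} + b_i) layer by layer; the last activation is y."""
    l = x
    for W, b in layers:
        l = np.tanh(W @ l + b)  # f is assumed to be tanh in this sketch
    return l

def relevance(y_q, y_d):
    """Cosine similarity between the query and document semantic vectors."""
    return float(y_q @ y_d / (np.linalg.norm(y_q) * np.linalg.norm(y_d)))

rng = np.random.default_rng(0)
sizes = [30, 16, 8]  # hypothetical: input, hidden, and output dimensions
layers = [
    (rng.normal(scale=0.1, size=(n_out, n_in)), np.zeros(n_out))
    for n_in, n_out in zip(sizes[:-1], sizes[1:])
]

x_q = rng.random(30)  # stand-in for a query's input term vector
x_d = rng.random(30)  # stand-in for a document's input term vector
y_q = forward(x_q, layers)
y_d = forward(x_d, layers)
score = relevance(y_q, y_d)
```

The score always lies in [-1, 1], and a text compared with itself scores 1.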

Structure of DSSM
Supervised model: we assume that a query is relevant to the documents that are clicked on for that query.
The posterior probability of a document given a query:
P(D|Q) = exp(γ R(Q, D)) / Σ_{D′ ∈ D} exp(γ R(Q, D′))
Where γ is the smoothing factor in the softmax function.
D denotes the set of candidate documents to be ranked.

Structure of DSSM
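Maximize the likelihood of the clicked documents, P(D+|Q), over the training queries.
Equivalently, we need to minimize the following loss function:
L(Λ) = −log Π_{(Q, D+)} P(D+|Q)
where Λ denotes the model parameters.
The model is trained using gradient-based numerical optimization algorithms.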
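The posterior and the loss for a single query can be sketched as follows. This assumes the usual softmax form, in which each candidate's relevance score is scaled by γ and normalized over the candidate set; the scores and the value of γ below are hypothetical.

```python
import numpy as np

def posterior(scores, gamma):
    """Softmax over gamma-scaled relevance scores of all candidate documents."""
    z = gamma * np.asarray(scores, dtype=float)
    z -= z.max()  # subtracting the max improves stability and leaves the result unchanged
    e = np.exp(z)
    return e / e.sum()

def clicked_loss(scores, clicked_index, gamma):
    """Negative log-likelihood of the clicked document for one query."""
    return float(-np.log(posterior(scores, gamma)[clicked_index]))

# Hypothetical cosine scores; index 0 is the clicked document.
scores = [0.9, 0.2, 0.1, -0.3]
gamma = 10.0  # illustrative smoothing factor, normally tuned on held-out data
probs = posterior(scores, gamma)
loss = clicked_loss(scores, 0, gamma)
```

Summing this loss over all (query, clicked document) pairs gives the quantity that gradient-based training would minimize.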

Training Techniques
Word hashing: use sub-word units (e.g., letter n-grams) as raw input to handle very large vocabularies.
Letter-trigram Representation
cat → #cat# → #-c-a, c-a-t, a-t-#
Only around 50K letter-trigrams in English
Advantages
• Capture sub-word semantics
• Control the dimensionality of the input space
• Words with small typos have similar raw representations
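A minimal sketch of letter-trigram extraction, following the cat → #cat# example above. The function names are illustrative, and a real system would additionally map each trigram to an index in a fixed trigram vocabulary.

```python
from collections import Counter

def letter_trigrams(word):
    """Add word-boundary markers, then slide a 3-character window over the word."""
    marked = f"#{word.lower()}#"
    return [marked[i:i + 3] for i in range(len(marked) - 2)]

def hash_text(text):
    """Bag of letter-trigram counts for a whitespace-tokenized text."""
    counts = Counter()
    for word in text.split():
        counts.update(letter_trigrams(word))
    return counts

print(letter_trigrams("cat"))  # ['#ca', 'cat', 'at#']

# A misspelling shares most of its trigrams with the correct word.
shared = set(letter_trigrams("remedies")) & set(letter_trigrams("remeedies"))
```

Here "remedies" and the misspelled "remeedies" from the Motivation slide overlap in all but a few trigrams, which is why small typos yield similar raw representations.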

Training Techniques
Convolutional and max-pooling layers: identify key words or concepts
• Extract local features using a convolutional layer
• Generate global features using max-pooling
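The two steps can be sketched with NumPy. This is an illustration under assumptions rather than the model's actual layer: the window size of 3, the tanh non-linearity, the zero padding, and all dimensions are hypothetical choices, and the word vectors are random stand-ins for word-hashed inputs.

```python
import numpy as np

def conv_max_pool(word_vectors, W, b, window=3):
    """Project each sliding window of words to local features, then max-pool over positions."""
    n_words, dim = word_vectors.shape
    pad = window // 2
    # Zero-pad both ends so that every word is the centre of one window.
    padded = np.vstack([np.zeros((pad, dim)), word_vectors, np.zeros((pad, dim))])
    local = []
    for t in range(n_words):
        context = padded[t:t + window].reshape(-1)  # concatenate the window's word vectors
        local.append(np.tanh(W @ context + b))      # local feature vector at position t
    local = np.stack(local)                          # shape: (n_words, n_filters)
    return local.max(axis=0)                         # global features: (n_filters,)

rng = np.random.default_rng(0)
dim, n_filters, window = 4, 6, 3
W = rng.normal(scale=0.5, size=(n_filters, window * dim))
b = np.zeros(n_filters)

short_text = rng.random((5, dim))  # 5 words
long_text = rng.random((9, dim))   # 9 words
g_short = conv_max_pool(short_text, W, b, window)
g_long = conv_max_pool(long_text, W, b, window)
```

Max-pooling keeps, for each filter, the strongest response found anywhere in the text, so inputs of different lengths map to a global feature vector of the same size.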

Training Techniques
Negative Sampling
Recall the posterior probability P(D|Q), where γ is the smoothing factor in the softmax function and D denotes the set of candidate documents to be ranked.
Ideally, D should contain all possible documents.
In practice, we usually approximate D by including the clicked document set D+ and some randomly selected documents.
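Building the approximate candidate set for one training example might look like the sketch below. The document identifiers, the function name, and the default of four negatives are illustrative assumptions; the only filtering done here is to exclude the clicked document itself.

```python
import random

def build_candidates(clicked_id, all_doc_ids, num_negatives=4, seed=None):
    """Return the clicked document followed by randomly sampled other documents."""
    rng = random.Random(seed)
    pool = [d for d in all_doc_ids if d != clicked_id]  # never sample the clicked document
    negatives = rng.sample(pool, num_negatives)         # sampling without replacement
    return [clicked_id] + negatives

# Hypothetical corpus of 100 document ids.
corpus = [f"doc_{i}" for i in range(100)]
candidates = build_candidates("doc_7", corpus, num_negatives=4, seed=0)
```

The softmax is then computed over these few candidates instead of the whole corpus, which keeps each training step cheap.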

Evaluation Results
NDCG: Normalized Discounted Cumulative Gain
A measure of ranking quality
Measures the usefulness of a document based on its position in the result list
The evaluation data set contains 16,510 English queries sampled from one-year query log files of a commercial search engine.
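For readers unfamiliar with the metric, NDCG at a cutoff k can be sketched as below. This assumes one common formulation (gain 2^rel − 1 with a log2 position discount); the relevance labels are hypothetical and the exact variant used in the evaluation may differ.

```python
import math

def dcg(relevances, k):
    """Discounted cumulative gain of the top-k results, in ranked order."""
    return sum(
        (2 ** rel - 1) / math.log2(position + 2)  # position 0 gets discount log2(2) = 1
        for position, rel in enumerate(relevances[:k])
    )

def ndcg(relevances, k):
    """DCG divided by the DCG of the ideal (descending) ordering."""
    ideal = dcg(sorted(relevances, reverse=True), k)
    return dcg(relevances, k) / ideal if ideal > 0 else 0.0

# Hypothetical graded relevance labels, listed in the order the ranker returned them.
perfect = ndcg([3, 2, 1, 0], k=4)
reversed_order = ndcg([0, 1, 2, 3], k=4)
```

A ranking that places the most relevant documents first scores 1, and pushing relevant documents down the list lowers the score.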

DSSM for Recommendation
DSPR: deep-semantic similarity-based personalized recommendation

DSSM for Recommendation
Multi-View Deep Neural Network for Cross Domain Recommendation:
• Search Engine logs
• News article browsing history
• App download logs
• Movie/TV view logs

References
1. https://en.wikipedia.org/wiki/N-gram
2. https://www.microsoft.com/en-us/research/wp-content/uploads/2016/02/wsdm2015.v3.pdf
3. https://www.microsoft.com/en-us/research/project/dssm/
4. https://www.microsoft.com/en-us/research/wp-content/uploads/2016/02/cikm2013_DSSM_fullversion.pdf

Editor's Notes

Named Entity Mining from Click-Through Data Using Weakly Supervised Latent Dirichlet Allocation Topic Regression Multi-Modal Latent Dirichlet Allocation for Image Annotation
