SlideShare a Scribd company logo
1 of 36
Download to read offline
THE VOICE INTERFACE
REVOLUTION_
GENERAL
TRAITS
+ INTRODUCTION
WHY NOW?_
‣ WEB SERVICES AND IoT EXPLOTION
‣ HARDWARE NOW SUPPORTS FAR-FIELD VOICE INPUT PROCESSING
‣ SCIENCE BEHIND THE SCENES IS NOW ACCESIBLE
‣ AUTOMATIC SPEECH RECOGNITION, NATURAL LANGUAGE
UNDERSTANDING, TEXT TO SPEECH
‣ ARTIFICIAL INTELLIGENCE IS MAKING VOICE INTERFACES SMARTER
‣ PERSONALIZATION TO USER SPEECH, CONTEXTS AND
PREFERENCES
BENEFITS_
‣ MOST NATURAL INTERFACE FOR HUMANS
‣ INSTANT VALUE FOR QUICK DEMANDS
‣ SUITABLE FOR NON-TECHNOLOGICAL USERS
‣ LOW HARDWARE NEED, NO SCREEN REQUIRED
‣ ACCESIBILITY FOR LOW VISION CAPABILITIES AND HAND
DISABILITY
DRAWBACKS_
‣ ERRORS IN SPEECH RECOGNITION
‣ DIFFERENT SPEECH RECOGNITION/GENERATION ACCURACY
AMONG LANGUAGES
‣ BACKGROUND NOISE SUSCEPTIBILITY
‣ NON-ACCESIBLE FOR DEAF/MUTE
‣ RESPONSES WITH SLOW DATA EXPOSURE
USE CASES_
‣ SUITABLE
‣ QUICK LOW-PARAMETERIZED INFORMATION DEMANDS
‣ LOW-PARAMETERIZED NON-CRITICAL TRANSACTIONS
‣ RESPONSES WITH REDUCED AMOUNT OF DATA
‣ NON-SUITABLE
‣ HIGH-PARAMETERIZED QUESTIONS OR TRANSACTIONS
‣ CRITICAL TRANSACTIONS DUE TO ERROR POSSIBILITIES
‣ RESPONSES WITH LARGE AMOUNT OF DATA
DESIGN
+ INTERFACE
BASICS_
‣ CHOOSE YOUR CHANNEL WISELY
‣ CUSTOM APPLICATION
‣ GENERAL ASSISTANT
‣ STUDY YOUR DOMAIN
‣ VOICE-ONLY?
‣ BEST OPTION IS USALLY COMBINED GRAPHIC AND VOICE
INTERFACE
RECOMMEDATIONS_
‣ SHORT INTERACTIONS
‣ SHORTER THAN TEXT-BASED EXPERIENCES
‣ NO LONG FUNNELS MORE THAN TWO STEPS
‣ MEANINGFUL RESPONSES WITH VALUE
‣ TRANSPARENCY
‣ ENGAGEMENT
‣ TAIL QUESTIONS
‣ NOTIFICATIONS
ENTRANCE_
‣ DRIVEN FIRST INTERACTION
‣ SINGLE POINT ALLOW MORE CONTROL
‣ QUICK WELCOME
CUT THE BULLSHIT_
‣ VALUE OVER SMALLTALK
‣ VALUE OVER PERSONALITY
‣ VALUE OVER HUMOUR
‣ BE HONEST
CONVERSATIONS BASICS_
‣ Turn-taking
‣ Threading
‣ Leveraging inherent efficiency of language
‣ Anticipating variable user behaviour
‣ Understanding cooperative behaviour
‣ Cooperative principle
‣ Paul Grice’s Maxims
‣ Use everyday language
‣ Instilling user confidence
GRICE’S MAXIMS_
‣ The maxim of quantity, where one tries to be as informative as one
possibly can, and gives as much information as is needed, and no
more.
‣ The maxim of quality, where one tries to be truthful, and does not
give information that is false or that is not supported by evidence.
‣ The maxim of relation, where one tries to be relevant, and says things
that are pertinent to the discussion.
‣ The maxim of manner, when one tries to be as clear, as brief, and as
orderly as one can in what one says, and where one avoids obscurity
and ambiguity.
DEVELOPMENT
+ TECHNICAL
BASICS_
‣ SPEECH RECOGNITION/GENERATION
‣ AUTOMATIC IN GENERAL ASSISTANTS
‣ SERVICE OR LIBRARY BASED IN CUSTOM ASSISTANTS
‣ CORE COMPONENT IS THE DIALOG ENGINE
‣ GOOGLE DIALOGFLOW
‣ MICROSOFT BOT FRAMEWORK
‣ IBM WATSON ASSISTANT
‣ YOUR OWN
EXAMPLE DIALOG ENGINE DIAGRAM_
‣ NLU Platform to receive requests and converts them to intents,
parameters
RECOMMEDATIONS_
‣ NODEJS AS BACKEND TECHNOLOGY
‣ IDEAL FOR PaaS AND EVEN FaaS
‣ OWN USER SYSTEM
‣ MIXED CONTEXT STRATEGY:
‣ KEEP CONVERSATIONS ON MEMORY
‣ KEEP MEANINGFUL, ACTIONABLE DATA ON DATABASE
‣ PRINCIPLES OF MODULARITY AND COMPONENTIZATION
PATTERNS_
‣ ADAPTER
‣ IDEAL FOR CUSTOM INPUT ENTRIES:
‣ OWN/THIRD PARTY WEBHOOK
‣ MESSAGE SYSTEM LIBRARY
‣ IDEAL FOR CUSTOM OUTPUT EXITS
‣ MIDDLEWARE
‣ FOR USER INPUT
‣ FOR OUTPUT GENERATION
SPEECH RECOGNITION_
‣ ACCURACY IS DOWN TO 4.9 ERROR PERCENTAGE BY GOOGLE
THANKS TO AI TECHNIQUES LIKE DEEP LEARNING
‣ THREE MODELS WORK TOGETHER IN A GRAPH:
‣ ACOUSTIC: WAVEFORM TO EACH SOUND FRAGMENT
‣ PRONUNCIATION: SOUNDS TO WORDS
‣ LANGUAGE: WORDS TO SENTENCES
‣ STANDARD DATASET TO MEASURE ACCURACY IS NIST 2000
SWITCHBOARD
SPEECH RECOGNITION APIS_
‣ GOOGLE CLOUD SPEECH
‣ Converts audio to text, synchronously and asynchronously in 80+
different languages with a high degree of accuracy
‣ https://cloud.google.com/speech/docs
‣ MICROSOFT LUIS
‣ Interprets intents and extract entities, with built-in trained ones
‣ https://www.luis.ai/home
‣ IBM WATSON SPEECH-TO-TEXT
‣ https://www.ibm.com/watson/services/speech-to-text/
‣ AMAZON TRANSCRIBE
‣ https://aws.amazon.com/es/transcribe/
GOOGLE CLOUD API_
‣ NATURAL LANGUAGE
‣ Provides natural language understanding technologies to developers.
Examples include sentiment analysis, entity recognition, entity
sentiment analysis, and text annotations.
‣ https://cloud.google.com/natural-language/docs/reference/rest
‣ TRANSLATION
‣ Translates over 80+ languages and detect language from speech.
‣ https://cloud.google.com/translate/docs/reference/rest
GOOGLE
ASSISTANT
DEVELOPMENT
+ EXAMPLE
ACTIONS ON GOOGLE_
‣ Platform to build actions invoked by users to fulfill some need
‣ Easy way with Dialogflow integration
‣ Custom way with ACTIONS SDK
‣ How it works:
‣ User requests an action “Talk to my Hotel Concierge”
‣ Assistant asks Actions on Google to invoke the particular app
‣ The conversation between the user and the app begins
‣ Subsequent user input is sent directly to app until the app
fulfills the intent and ends
INTENTS_
‣ Represent a mapping between what a user says and what action
should be taken by your software.
‣ User Says (Expressions)
‣ Natural language expressions annotated with parameters that
are linked to entities
‣ Actions
‣ Trigger-name with associated parameters to perform an action
on the app
‣ Response
‣ You can add Simple Text or Rich Response depending on platform
‣ Contexts
‣ Passing info from other intents or external. Input are prerequisite
ENTITIES_
‣ Significant data extracted from user input in form of parameter
value
‣ Entities are associated to particular actions
‣ There are three types:
‣ System
‣ Pre-built entities provided by API.AI in order to facilitate
handling common concepts (colors, locations,…)
‣ Developer
‣ Custom entities created with Reference Value plus Synonyms
‣ User Entities
‣ Defined for the session, specific playlists for instance
CONTEXTS_
‣ Persisted information that can be used through intents
‣ It can be internal like a particular movie the user is asking for
‣ Or external like the user data retrieved from a user system
‣ Lifespan:
‣ By default they last for 5 requests or 10 minutes
‣ Input Context:
‣ Limit intents to be matched only when certain contexts are set
‣ For example when you need specific info to perform action
‣ Output Context:
‣ They are tied to user sessions, is shared by the intent
‣ Automatically added to follow-up intents
EVENTS & DIALOGS_
‣ Events is a feature that allows you to invoke intents by an event
name instead of a user query
‣ Dialogs
‣ Linear
‣ With Slot Filling you define required parameters with prompts
and order them. Agent will ask for them until has all info.
‣ Non-linear
‣ Complex dialogs are formed from context routing, removing
Output Context for Intent Responses, and adding new Output
Context that is matched for next question
GENERAL
ASSISTANTS
+ CHANNELS
GENERAL ASSISTANTS MAP_
GOOGLE ASSISTANT_
‣ ACTIONS ON GOOGLE ALLOWS BUILDING APPS
‣ GOOGLE HOME, HOME MINI, ANDROID, ANDROID AUTO, WEAR OS
‣ FUTURE UP TO 80% OF WORLD MOBILE

MARKET
‣ CUSTOM DIALOG ENGINE DIALOGFLOW
‣ BEST SPEECH RECOGNITION
‣ BEST POSSIBLE FUTURE INTEGRATION WITH OTHER GOOGLE
SERVICES
ALEXA_
‣ ALEXA SKILLS FOR THIRD PARTY INTEGRATIONS
‣ AMAZON ECHO, DOT ECHO
‣ LARGEST SALES CHANNEL
‣ LARGEST CURRENT MARKET SHARE THANKS

TO EARLY TIME-TO-MARKET
‣ IOT INTEGRATION THROUGH ALEXA VOICE SERVICE
HOME DEVICES_
‣ SMART SPEAKERS WILL BE THE CENTER OF HOME USAGE
‣ HOME AUTOMATION
‣ IoT DEVICES
‣ GENERAL USE CASES:
‣ PURCHASES
‣ INFORMATION DEMAND
‣ AGENDA
‣ CONTROL OVER OTHER DEVICES
US HOME DEVICES MARKET SHARE_
FUTURE
+ ADVANCED
CURRENT USAGE & PREDICTIONS_
‣ 50% OF ALL SEARCHES WILL BE VOICE-BASED BY 2020
‣ 22M SMART SPEAKERS IN US BY 2020
‣ 400M DEVICES WITH ACCESS TO GOOGLE ASSISTANT THIS YEAR
‣ A GOOGLE HOME IS SOLD EVERY SECOND IN US
‣ 40% ADULTS USE VOICE SEARCH
THANK YOU

More Related Content

Similar to The Voice Interface Revolution

Designing and Developing Custom Mobile Applications
Designing and Developing Custom Mobile ApplicationsDesigning and Developing Custom Mobile Applications
Designing and Developing Custom Mobile ApplicationsDunn Solutions Group
 
"API Design: From User Need to Finished Spec" by Andrew Jordan, ex-Product @T...
"API Design: From User Need to Finished Spec" by Andrew Jordan, ex-Product @T..."API Design: From User Need to Finished Spec" by Andrew Jordan, ex-Product @T...
"API Design: From User Need to Finished Spec" by Andrew Jordan, ex-Product @T...TheFamily
 
Overview and Walkthrough of the Application Programming Model with SAP Cloud ...
Overview and Walkthrough of the Application Programming Model with SAP Cloud ...Overview and Walkthrough of the Application Programming Model with SAP Cloud ...
Overview and Walkthrough of the Application Programming Model with SAP Cloud ...SAP Cloud Platform
 
ICT, Importance of programming and programming languages
ICT, Importance of programming and programming languagesICT, Importance of programming and programming languages
ICT, Importance of programming and programming languagesEbin Robinson
 
RailswayCon 2010 - Command Your Domain
RailswayCon 2010 - Command Your DomainRailswayCon 2010 - Command Your Domain
RailswayCon 2010 - Command Your DomainLourens Naudé
 
Android 103 - Firebase and Architecture Components
Android 103 - Firebase and Architecture ComponentsAndroid 103 - Firebase and Architecture Components
Android 103 - Firebase and Architecture ComponentsKai Koenig
 
The Laws of User Experience: Making it or breaking it with the UX Factor
The Laws of User Experience: Making it or breaking it with the UX FactorThe Laws of User Experience: Making it or breaking it with the UX Factor
The Laws of User Experience: Making it or breaking it with the UX FactorEffectiveUI
 
The Laws of User Experience: Making it or Breaking It with the UX Factor
The Laws of User Experience: Making it or Breaking It with the UX FactorThe Laws of User Experience: Making it or Breaking It with the UX Factor
The Laws of User Experience: Making it or Breaking It with the UX FactorEffective
 
SAP Cloud Platform Community NL Kick-off
SAP Cloud Platform Community NL Kick-offSAP Cloud Platform Community NL Kick-off
SAP Cloud Platform Community NL Kick-offJan Penninkhof
 
"Building Cross-platform Without Sacrificing Performance" by Simon Sturmer (K...
"Building Cross-platform Without Sacrificing Performance" by Simon Sturmer (K..."Building Cross-platform Without Sacrificing Performance" by Simon Sturmer (K...
"Building Cross-platform Without Sacrificing Performance" by Simon Sturmer (K...Tech in Asia ID
 
Red Hat Mobile
Red Hat MobileRed Hat Mobile
Red Hat MobileRed Hat
 
Losant craig baldwin cwin18_toulouse
Losant craig baldwin cwin18_toulouseLosant craig baldwin cwin18_toulouse
Losant craig baldwin cwin18_toulouseCapgemini
 
Achieving Business Transformation with UiPath RPA
Achieving Business Transformation with UiPath RPAAchieving Business Transformation with UiPath RPA
Achieving Business Transformation with UiPath RPACelonis
 
Secure, Strengthen, Automate, and Scale Modern Workloads with Red Hat & NGINX
Secure, Strengthen, Automate, and Scale Modern Workloads with Red Hat & NGINXSecure, Strengthen, Automate, and Scale Modern Workloads with Red Hat & NGINX
Secure, Strengthen, Automate, and Scale Modern Workloads with Red Hat & NGINXNGINX, Inc.
 
Improving Clinical Data Accuracy: How to Streamline a Data Pipeline Using Nod...
Improving Clinical Data Accuracy: How to Streamline a Data Pipeline Using Nod...Improving Clinical Data Accuracy: How to Streamline a Data Pipeline Using Nod...
Improving Clinical Data Accuracy: How to Streamline a Data Pipeline Using Nod...InfluxData
 
SAP Cloud Platform - Your Innovation Platform in the Cloud - L1
SAP Cloud Platform - Your Innovation Platform in the Cloud - L1SAP Cloud Platform - Your Innovation Platform in the Cloud - L1
SAP Cloud Platform - Your Innovation Platform in the Cloud - L1SAP Cloud Platform
 
Appear IQ The Business Case for hybrid html5 mobile apps
Appear IQ The Business Case for hybrid html5 mobile appsAppear IQ The Business Case for hybrid html5 mobile apps
Appear IQ The Business Case for hybrid html5 mobile appsAppear
 

Similar to The Voice Interface Revolution (20)

Designing and Developing Custom Mobile Applications
Designing and Developing Custom Mobile ApplicationsDesigning and Developing Custom Mobile Applications
Designing and Developing Custom Mobile Applications
 
"API Design: From User Need to Finished Spec" by Andrew Jordan, ex-Product @T...
"API Design: From User Need to Finished Spec" by Andrew Jordan, ex-Product @T..."API Design: From User Need to Finished Spec" by Andrew Jordan, ex-Product @T...
"API Design: From User Need to Finished Spec" by Andrew Jordan, ex-Product @T...
 
Tick
TickTick
Tick
 
Overview and Walkthrough of the Application Programming Model with SAP Cloud ...
Overview and Walkthrough of the Application Programming Model with SAP Cloud ...Overview and Walkthrough of the Application Programming Model with SAP Cloud ...
Overview and Walkthrough of the Application Programming Model with SAP Cloud ...
 
Druid deck
Druid deckDruid deck
Druid deck
 
ICT, Importance of programming and programming languages
ICT, Importance of programming and programming languagesICT, Importance of programming and programming languages
ICT, Importance of programming and programming languages
 
RailswayCon 2010 - Command Your Domain
RailswayCon 2010 - Command Your DomainRailswayCon 2010 - Command Your Domain
RailswayCon 2010 - Command Your Domain
 
Android 103 - Firebase and Architecture Components
Android 103 - Firebase and Architecture ComponentsAndroid 103 - Firebase and Architecture Components
Android 103 - Firebase and Architecture Components
 
The Laws of User Experience: Making it or breaking it with the UX Factor
The Laws of User Experience: Making it or breaking it with the UX FactorThe Laws of User Experience: Making it or breaking it with the UX Factor
The Laws of User Experience: Making it or breaking it with the UX Factor
 
The Laws of User Experience: Making it or Breaking It with the UX Factor
The Laws of User Experience: Making it or Breaking It with the UX FactorThe Laws of User Experience: Making it or Breaking It with the UX Factor
The Laws of User Experience: Making it or Breaking It with the UX Factor
 
SAP Cloud Platform Community NL Kick-off
SAP Cloud Platform Community NL Kick-offSAP Cloud Platform Community NL Kick-off
SAP Cloud Platform Community NL Kick-off
 
"Building Cross-platform Without Sacrificing Performance" by Simon Sturmer (K...
"Building Cross-platform Without Sacrificing Performance" by Simon Sturmer (K..."Building Cross-platform Without Sacrificing Performance" by Simon Sturmer (K...
"Building Cross-platform Without Sacrificing Performance" by Simon Sturmer (K...
 
Moving To SaaS
Moving To SaaSMoving To SaaS
Moving To SaaS
 
Red Hat Mobile
Red Hat MobileRed Hat Mobile
Red Hat Mobile
 
Losant craig baldwin cwin18_toulouse
Losant craig baldwin cwin18_toulouseLosant craig baldwin cwin18_toulouse
Losant craig baldwin cwin18_toulouse
 
Achieving Business Transformation with UiPath RPA
Achieving Business Transformation with UiPath RPAAchieving Business Transformation with UiPath RPA
Achieving Business Transformation with UiPath RPA
 
Secure, Strengthen, Automate, and Scale Modern Workloads with Red Hat & NGINX
Secure, Strengthen, Automate, and Scale Modern Workloads with Red Hat & NGINXSecure, Strengthen, Automate, and Scale Modern Workloads with Red Hat & NGINX
Secure, Strengthen, Automate, and Scale Modern Workloads with Red Hat & NGINX
 
Improving Clinical Data Accuracy: How to Streamline a Data Pipeline Using Nod...
Improving Clinical Data Accuracy: How to Streamline a Data Pipeline Using Nod...Improving Clinical Data Accuracy: How to Streamline a Data Pipeline Using Nod...
Improving Clinical Data Accuracy: How to Streamline a Data Pipeline Using Nod...
 
SAP Cloud Platform - Your Innovation Platform in the Cloud - L1
SAP Cloud Platform - Your Innovation Platform in the Cloud - L1SAP Cloud Platform - Your Innovation Platform in the Cloud - L1
SAP Cloud Platform - Your Innovation Platform in the Cloud - L1
 
Appear IQ The Business Case for hybrid html5 mobile apps
Appear IQ The Business Case for hybrid html5 mobile appsAppear IQ The Business Case for hybrid html5 mobile apps
Appear IQ The Business Case for hybrid html5 mobile apps
 

More from Rafael Casuso Romate

Rise and Fall of the Frontend Developer
Rise and Fall of the Frontend DeveloperRise and Fall of the Frontend Developer
Rise and Fall of the Frontend DeveloperRafael Casuso Romate
 
Nuxt Avanzado (de Scaffolding a MVP)
Nuxt Avanzado (de Scaffolding a MVP)Nuxt Avanzado (de Scaffolding a MVP)
Nuxt Avanzado (de Scaffolding a MVP)Rafael Casuso Romate
 
Solid NodeJS with TypeScript, Jest & NestJS
Solid NodeJS with TypeScript, Jest & NestJSSolid NodeJS with TypeScript, Jest & NestJS
Solid NodeJS with TypeScript, Jest & NestJSRafael Casuso Romate
 
Introduction to Weex: Mobile Apps with VueJS
Introduction to Weex: Mobile Apps with VueJSIntroduction to Weex: Mobile Apps with VueJS
Introduction to Weex: Mobile Apps with VueJSRafael Casuso Romate
 
Component-Oriented Progressive Web Applications with VueJS
Component-Oriented Progressive Web Applications with VueJSComponent-Oriented Progressive Web Applications with VueJS
Component-Oriented Progressive Web Applications with VueJSRafael Casuso Romate
 
JavaScript Editions ES7, ES8 and ES9 vs V8
JavaScript Editions ES7, ES8 and ES9 vs V8JavaScript Editions ES7, ES8 and ES9 vs V8
JavaScript Editions ES7, ES8 and ES9 vs V8Rafael Casuso Romate
 

More from Rafael Casuso Romate (10)

Rise and Fall of the Frontend Developer
Rise and Fall of the Frontend DeveloperRise and Fall of the Frontend Developer
Rise and Fall of the Frontend Developer
 
Nuxt Avanzado (de Scaffolding a MVP)
Nuxt Avanzado (de Scaffolding a MVP)Nuxt Avanzado (de Scaffolding a MVP)
Nuxt Avanzado (de Scaffolding a MVP)
 
The Core of Agile
The Core of AgileThe Core of Agile
The Core of Agile
 
Solid NodeJS with TypeScript, Jest & NestJS
Solid NodeJS with TypeScript, Jest & NestJSSolid NodeJS with TypeScript, Jest & NestJS
Solid NodeJS with TypeScript, Jest & NestJS
 
Introduction to Weex: Mobile Apps with VueJS
Introduction to Weex: Mobile Apps with VueJSIntroduction to Weex: Mobile Apps with VueJS
Introduction to Weex: Mobile Apps with VueJS
 
Component-Oriented Progressive Web Applications with VueJS
Component-Oriented Progressive Web Applications with VueJSComponent-Oriented Progressive Web Applications with VueJS
Component-Oriented Progressive Web Applications with VueJS
 
Intro to VueJS Workshop
Intro to VueJS WorkshopIntro to VueJS Workshop
Intro to VueJS Workshop
 
VueJS in Action
VueJS in ActionVueJS in Action
VueJS in Action
 
JavaScript Editions ES7, ES8 and ES9 vs V8
JavaScript Editions ES7, ES8 and ES9 vs V8JavaScript Editions ES7, ES8 and ES9 vs V8
JavaScript Editions ES7, ES8 and ES9 vs V8
 
VueJS: The Simple Revolution
VueJS: The Simple RevolutionVueJS: The Simple Revolution
VueJS: The Simple Revolution
 

Recently uploaded

Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfkalichargn70th171
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...MyIntelliSource, Inc.
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVshikhaohhpro
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️Delhi Call girls
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...gurkirankumar98700
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdfWave PLM
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxComplianceQuest1
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsArshad QA
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...harshavardhanraghave
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerThousandEyes
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...ICS
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfkalichargn70th171
 
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️anilsa9823
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AIABDERRAOUF MEHENNI
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...kellynguyen01
 
Clustering techniques data mining book ....
Clustering techniques data mining book ....Clustering techniques data mining book ....
Clustering techniques data mining book ....ShaimaaMohamedGalal
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...MyIntelliSource, Inc.
 

Recently uploaded (20)

Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS LiveVip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
 
Clustering techniques data mining book ....
Clustering techniques data mining book ....Clustering techniques data mining book ....
Clustering techniques data mining book ....
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
 

The Voice Interface Revolution

  • 3. WHY NOW?_ ‣ WEB SERVICES AND IoT EXPLOTION ‣ HARDWARE NOW SUPPORTS FAR-FIELD VOICE INPUT PROCESSING ‣ SCIENCE BEHIND THE SCENES IS NOW ACCESIBLE ‣ AUTOMATIC SPEECH RECOGNITION, NATURAL LANGUAGE UNDERSTANDING, TEXT TO SPEECH ‣ ARTIFICIAL INTELLIGENCE IS MAKING VOICE INTERFACES SMARTER ‣ PERSONALIZATION TO USER SPEECH, CONTEXTS AND PREFERENCES
  • 4. BENEFITS_ ‣ MOST NATURAL INTERFACE FOR HUMANS ‣ INSTANT VALUE FOR QUICK DEMANDS ‣ SUITABLE FOR NON-TECHNOLOGICAL USERS ‣ LOW HARDWARE NEED, NO SCREEN REQUIRED ‣ ACCESIBILITY FOR LOW VISION CAPABILITIES AND HAND DISABILITY
  • 5. DRAWBACKS_ ‣ ERRORS IN SPEECH RECOGNITION ‣ DIFFERENT SPEECH RECOGNITION/GENERATION ACCURACY AMONG LANGUAGES ‣ BACKGROUND NOISE SUSCEPTIBILITY ‣ NON-ACCESIBLE FOR DEAF/MUTE ‣ RESPONSES WITH SLOW DATA EXPOSURE
  • 6. USE CASES_ ‣ SUITABLE ‣ QUICK LOW-PARAMETERIZED INFORMATION DEMANDS ‣ LOW-PARAMETERIZED NON-CRITICAL TRANSACTIONS ‣ RESPONSES WITH REDUCED AMOUNT OF DATA ‣ NON-SUITABLE ‣ HIGH-PARAMETERIZED QUESTIONS OR TRANSACTIONS ‣ CRITICAL TRANSACTIONS DUE TO ERROR POSSIBILITIES ‣ RESPONSES WITH LARGE AMOUNT OF DATA
  • 8. BASICS_ ‣ CHOOSE YOUR CHANNEL WISELY ‣ CUSTOM APPLICATION ‣ GENERAL ASSISTANT ‣ STUDY YOUR DOMAIN ‣ VOICE-ONLY? ‣ BEST OPTION IS USALLY COMBINED GRAPHIC AND VOICE INTERFACE
  • 9. RECOMMEDATIONS_ ‣ SHORT INTERACTIONS ‣ SHORTER THAN TEXT-BASED EXPERIENCES ‣ NO LONG FUNNELS MORE THAN TWO STEPS ‣ MEANINGFUL RESPONSES WITH VALUE ‣ TRANSPARENCY ‣ ENGAGEMENT ‣ TAIL QUESTIONS ‣ NOTIFICATIONS
  • 10. ENTRANCE_ ‣ DRIVEN FIRST INTERACTION ‣ SINGLE POINT ALLOW MORE CONTROL ‣ QUICK WELCOME
  • 11. CUT THE BULLSHIT_ ‣ VALUE OVER SMALLTALK ‣ VALUE OVER PERSONALITY ‣ VALUE OVER HUMOUR ‣ BE HONEST
  • 12. CONVERSATIONS BASICS_ ‣ Turn-taking ‣ Threading ‣ Leveraging inherent efficiency of language ‣ Anticipating variable user behaviour ‣ Understanding cooperative behaviour ‣ Cooperative principle ‣ Paul Grice’s Maxims ‣ Use everyday language ‣ Instilling user confidence
  • 13. GRICE’S MAXIMS_ ‣ The maxim of quantity, where one tries to be as informative as one possibly can, and gives as much information as is needed, and no more. ‣ The maxim of quality, where one tries to be truthful, and does not give information that is false or that is not supported by evidence. ‣ The maxim of relation, where one tries to be relevant, and says things that are pertinent to the discussion. ‣ The maxim of manner, when one tries to be as clear, as brief, and as orderly as one can in what one says, and where one avoids obscurity and ambiguity.
  • 15. BASICS_ ‣ SPEECH RECOGNITION/GENERATION ‣ AUTOMATIC IN GENERAL ASSISTANTS ‣ SERVICE OR LIBRARY BASED IN CUSTOM ASSISTANTS ‣ CORE COMPONENT IS THE DIALOG ENGINE ‣ GOOGLE DIALOGFLOW ‣ MICROSOFT BOT FRAMEWORK ‣ IBM WATSON ASSISTANT ‣ YOUR OWN
  • 16. EXAMPLE DIALOG ENGINE DIAGRAM_ ‣ NLU Platform to receive requests and converts them to intents, parameters
  • 17. RECOMMEDATIONS_ ‣ NODEJS AS BACKEND TECHNOLOGY ‣ IDEAL FOR PaaS AND EVEN FaaS ‣ OWN USER SYSTEM ‣ MIXED CONTEXT STRATEGY: ‣ KEEP CONVERSATIONS ON MEMORY ‣ KEEP MEANINGFUL, ACTIONABLE DATA ON DATABASE ‣ PRINCIPLES OF MODULARITY AND COMPONENTIZATION
  • 18. PATTERNS_ ‣ ADAPTER ‣ IDEAL FOR CUSTOM INPUT ENTRIES: ‣ OWN/THIRD PARTY WEBHOOK ‣ MESSAGE SYSTEM LIBRARY ‣ IDEAL FOR CUSTOM OUTPUT EXITS ‣ MIDDLEWARE ‣ FOR USER INPUT ‣ FOR OUTPUT GENERATION
  • 19. SPEECH RECOGNITION_ ‣ ACCURACY IS DOWN TO 4.9 ERROR PERCENTAGE BY GOOGLE THANKS TO AI TECHNIQUES LIKE DEEP LEARNING ‣ THREE MODELS WORK TOGETHER IN A GRAPH: ‣ ACOUSTIC: WAVEFORM TO EACH SOUND FRAGMENT ‣ PRONUNCIATION: SOUNDS TO WORDS ‣ LANGUAGE: WORDS TO SENTENCES ‣ STANDARD DATASET TO MEASURE ACCURACY IS NIST 2000 SWITCHBOARD
  • 20. SPEECH RECOGNITION APIS_ ‣ GOOGLE CLOUD SPEECH ‣ Converts audio to text, synchronously and asynchronously in 80+ different languages with a high degree of accuracy ‣ https://cloud.google.com/speech/docs ‣ MICROSOFT LUIS ‣ Interprets intents and extract entities, with built-in trained ones ‣ https://www.luis.ai/home ‣ IBM WATSON SPEECH-TO-TEXT ‣ https://www.ibm.com/watson/services/speech-to-text/ ‣ AMAZON TRANSCRIBE ‣ https://aws.amazon.com/es/transcribe/
  • 21. GOOGLE CLOUD API_ ‣ NATURAL LANGUAGE ‣ Provides natural language understanding technologies to developers. Examples include sentiment analysis, entity recognition, entity sentiment analysis, and text annotations. ‣ https://cloud.google.com/natural-language/docs/reference/rest ‣ TRANSLATION ‣ Translates over 80+ languages and detect language from speech. ‣ https://cloud.google.com/translate/docs/reference/rest
  • 23. ACTIONS ON GOOGLE_ ‣ Platform to build actions invoked by users to fulfill some need ‣ Easy way with Dialogflow integration ‣ Custom way with ACTIONS SDK ‣ How it works: ‣ User requests an action “Talk to my Hotel Concierge” ‣ Assistant asks Actions on Google to invoke the particular app ‣ The conversation between the user and the app begins ‣ Subsequent user input is sent directly to app until the app fulfills the intent and ends
  • 24. INTENTS_ ‣ Represent a mapping between what a user says and what action should be taken by your software. ‣ User Says (Expressions) ‣ Natural language expressions annotated with parameters that are linked to entities ‣ Actions ‣ Trigger-name with associated parameters to perform an action on the app ‣ Response ‣ You can add Simple Text or Rich Response depending on platform ‣ Contexts ‣ Passing info from other intents or external. Input are prerequisite
  • 25. ENTITIES_ ‣ Significant data extracted from user input in form of parameter value ‣ Entities are associated to particular actions ‣ There are three types: ‣ System ‣ Pre-built entities provided by API.AI in order to facilitate handling common concepts (colors, locations,…) ‣ Developer ‣ Custom entities created with Reference Value plus Synonyms ‣ User Entities ‣ Defined for the session, specific playlists for instance
  • 26. CONTEXTS_ ‣ Persisted information that can be used through intents ‣ It can be internal like a particular movie the user is asking for ‣ Or external like the user data retrieved from a user system ‣ Lifespan: ‣ By default they last for 5 requests or 10 minutes ‣ Input Context: ‣ Limit intents to be matched only when certain contexts are set ‣ For example when you need specific info to perform action ‣ Output Context: ‣ They are tied to user sessions, is shared by the intent ‣ Automatically added to follow-up intents
  • 27. EVENTS & DIALOGS_ ‣ Events is a feature that allows you to invoke intents by an event name instead of a user query ‣ Dialogs ‣ Linear ‣ With Slot Filling you define required parameters with prompts and order them. Agent will ask for them until has all info. ‣ Non-linear ‣ Complex dialogs are formed from context routing, removing Output Context for Intent Responses, and adding new Output Context that is matched for next question
  • 30. GOOGLE ASSISTANT_ ‣ ACTIONS ON GOOGLE ALLOWS BUILDING APPS ‣ GOOGLE HOME, HOME MINI, ANDROID, ANDROID AUTO, WEAR OS ‣ FUTURE UP TO 80% OF WORLD MOBILE
 MARKET ‣ CUSTOM DIALOG ENGINE DIALOGFLOW ‣ BEST SPEECH RECOGNITION ‣ BEST POSSIBLE FUTURE INTEGRATION WITH OTHER GOOGLE SERVICES
  • 31. ALEXA_ ‣ ALEXA SKILLS FOR THIRD PARTY INTEGRATIONS ‣ AMAZON ECHO, DOT ECHO ‣ LARGEST SALES CHANNEL ‣ LARGEST CURRENT MARKET SHARE THANKS
 TO EARLY TIME-TO-MARKET ‣ IOT INTEGRATION THROUGH ALEXA VOICE SERVICE
  • 32. HOME DEVICES_ ‣ SMART SPEAKERS WILL BE THE CENTER OF HOME USAGE ‣ HOME AUTOMATION ‣ IoT DEVICES ‣ GENERAL USE CASES: ‣ PURCHASES ‣ INFORMATION DEMAND ‣ AGENDA ‣ CONTROL OVER OTHER DEVICES
  • 33. US HOME DEVICES MARKET SHARE_
  • 35. CURRENT USAGE & PREDICTIONS_ ‣ 50% OF ALL SEARCHES WILL BE VOICE-BASED BY 2020 ‣ 22M SMART SPEAKERS IN US BY 2020 ‣ 400M DEVICES WITH ACCESS TO GOOGLE ASSISTANT THIS YEAR ‣ A GOOGLE HOME IS SOLD EVERY SECOND IN US ‣ 40% ADULTS USE VOICE SEARCH