SlideShare a Scribd company logo
1 of 27
LCore: A Language Acquisition Framework for Robots
From Grounded Language Acquisition to Spoken Dialogues
2013/12/13

Komei Sugiura and Naoto Iwahashi
National Institute of Information and Communication Technology, Japan
komei.sugiura@nict.go.jp
Open problem: grounded language processing
• Language processing based on non-verbal information (vision,
motion, context, experience, …) is still very difficult
– e.g. “Put the blue cup away”, “Give me the usual”
• What is missing in dialog processing for robots?
– Physical situatedness / symbol grounding
– Shared experience

“blue cup”: multiple candidates
2

“the usual”: umbrella, remote, drink,..
Spoken dialogue system + Robot ≠ Robot dialogue
• Robot dialogue

– Categorization/prediction of real-world information
– Handling real-world properties
– Linguistic interaction

• Why is this difficult?

– Machine learning, CV, manipulation, symbol grounding problem,
speech recognition,…
Tableware

Cup

Tea cup

Cutlery

Fork

Plate

Knife
Robot Language Acquisition Framework

[Iwahashi 10, “Robots That Learn to Communicate: A Developmental Approach…”]

• Task: Object manipulation dialogues
• Key features
– Fully grounded vocabulary
– Imitation learning
– Incremental & interactive learning
– Language independent

4
LCore functions
Phoneme learning

Learning question answering

Word learning

Visual feature learning

Grammar learning

Affordance learning

Disambiguation of word ellipsis

Imitation learning

Utterance understanding

Role reversal imitation

Robot-directed utterance
detection

Active-learning-based dialogue
5
Learning modules
Word

Grammar

Motion-object
relationship

• Learning nouns/adjectives
• Learning verbs
• Learning probabilistic distributions of • Estimation of related objects
visual features
• Learning trajectories
• Learning phoneme sequences
• Learning phoneme sequences
Symbol grounding: Learning nouns and adjectives
• Visual features modeled by Gaussians
– Input: visual features of objects
• Out-of-vocabulary word = phoneme sequence + waveform
– Voice conversion (Eigenvoice GMM) to robot voice

Generative models

BLUE

Unknown object
RED
Imitation learning of object manipulation [Sugiura+ 07]
• Difficulty: Clustering trajectories in the world coordinate system does not work
• Proposed method
– Input: Position sequences of all objects
– Estimation of reference point and coordinate system by EM algorithm
– Number of state is optimized by cross-validation

Place A on B
Imitation learning using reference-point-dependent HMMs
[Sugiura+ 07][Sugiura+ 11]

Searching optimal coordinate system
Coordinate system
type

:Position at time t
…
=

Reference object ID

HMM
parameters

• Delta parameters

=

…

* Sugiura, K. et al, “Learning, Recognition, and Generation of Motion by …”, Advanced Robotics, Vol.25, No.17, 2011
Results: motion learning
No verb is estimated to have WCS
-> Reference-point-dependent verb

Velocity
Motion “place-on”

Log likelihood

Position

Place-on Move-closer

Raise

Jump-over Move-away

Rotate

Move-down

Training-set likelihood
Transformation of reference-point-dependent HMMs [Sugiura+ 11]
• What is the problem?
– Simple HMMs do not generate continuous trajectories
– Situation dependent trajectories
• Reference-point-dependent HMM
– Input: (motion ID, object ID) e.g. <place-on, Object 1, Object 3>
– Output: Maximum likelihood trajectory

Situation

HMM “Place-on”
World CS

Place X on Y

* Sugiura, K. (2011), “Learning, Generation, and Recognition of Reference-Point-Dependent Probabilistic…”
Generating continuous trajectory using delta parameters
[Tokuda+ 00]

Maximum likelihood trajectory

: time series of
(position,velocity,acceleration)

: state sequence
: HMM parameters

: filter (

)

: matrix of covariance
matrices of each OPDF
: time series of position
: vector of mean vectors

*Tokuda, K. et al, “Speech parameter generation algorithms for HMM-based speech synthesis”, 2000
Quantitative results
• Evaluation measure
– Euclidian distance
– Normalized by frame number T

Trajectory by Subject
Trajectory by proposed method
SPOKEN LANGUAGE UNDERSTANDING
USING NON-LINGUISTIC INFORMATION
Utterance understanding in LCore (1)
• User utterances are understood by using multimodal
information learned in a statistical learning framework

Vision

Motion

(Bayesian
learning of a
Gaussian)

(HMM)

Speech
(HMM)

Motion-object
relationship
(Bayesian learning
of a Gaussian)

Shared
belief

Context
(MCE Learning)
15
Integration of multimodal information
• Shared belief Ψ: weighted sum of five modules
utterance

action

scene

context

Speech
Motion
Vision
Motion-object relationship

Context
16
Inter-module learning
Multimodal
understanding

Confidence
learning

Utterance/Motion
generation

Place Elmo on box
Place Elmo
Place it

User intension

17
Grounded utterance disambiguation
Where to?
• Simple dialog systems
Which “cup”?
U: “Place the cup (on the table).”
R: “You said place the cup.”
-> Risk of motion failure
• Generating confirmation utterances using physical information
R: “I’ll place the red cup on the table, is it OK?”
Multimodal utterance understanding

Place-on Elmo
30th
1st

2nd

…
1st

2nd

…
30th

Sugiura, K. et al, "Situated Spoken Dialogue with Robots Using Active Learning", Advanced Robotics, 2011

19
Multimodal utterance understanding

Place-on Elmo
30th

Margin
1st

2nd

…
1st

2nd

…
30th

Sugiura, K. et al, "Situated Spoken Dialogue with Robots Using Active Learning", Advanced Robotics, 2011

20
Confirmation by paraphrasing user’s utterance

• Learning phase
• Bayesian Logistic Regression
• Input: Margin(d), Output: probability

• Execution phase
– Decision-making on responses
based on expected utility
Probability

21

Margin
Quantitative result: Risk reduction
Baseline

Proposed

Decreased to 1/4
Failure rate
Rejection rate
Confirmation rate
# of confirmation utt

22
Reduction of motion failure in learning phase [Sugiura+ 11]
• So far…
– Learning utterance understanding probabilities
• Idea
• Learning-by-asking
Phase

Operator

Motion executor

Active Learning

Robot

User

(Passive) learning

User

Robot

Execution

User

Robot

Sugiura, K. et al, "Situated Spoken Dialogue with Robots Using Active Learning", Advanced Robotics, Vol. 25, No. 17, 2011
Reduction of motion failure in learning phase
• Problem:
– Motion failure is required in learning
phase to avoid over-fitting

Active Learning
phase
Motion
failure

Motion
success

Learning phase

“Safe” training
data
Motion
failure

Execution phase

Motion
success
What kind of commands are effective for learning?
• Proposed method: Active Learning-based command generation
• Objective: Reduce the number of interactions
• [Input = image], [Output = utterance]
• Expected Log Loss Reduction(ELLR[Roy, 2001]) is used to select
the optimal utterance
Active Learning : A form of supervised learning in which inputs can be
selected by the algorithm
Target action

Robot utterance

Loss

Act=A, Objs = <1,3>

“Place-on Elmo blue box”

35.8

Act=A, Objs = <1,3>

“Place-on Elmo”

12.3

Act=A, Objs= <1, 2>

“Place-on Elmo”

28.1

:

:

:

“Raise box”

332.3

:

:

Act=B, Objs=<2>
:
Utterance generation by ELLR
Reduction of motion failure in learning phase

Test-set likelihood

(1) Proposed
(2) Baseline
Number of episodes

Motion failure risk
reduced
# of motion failure

Fast convergence

Proposed Baseline

More Related Content

Viewers also liked

Η εξέλιξη των παιχνιδομηχανών
Η εξέλιξη των παιχνιδομηχανώνΗ εξέλιξη των παιχνιδομηχανών
Η εξέλιξη των παιχνιδομηχανώνAthanasios Palanis
 
Introduction To Operating In Second Life Macpa
Introduction To Operating In Second Life MacpaIntroduction To Operating In Second Life Macpa
Introduction To Operating In Second Life MacpaTom Hood, CPA,CITP,CGMA
 
3 d pie chart circular puzzle with hole in center process 10 stages style 2 p...
3 d pie chart circular puzzle with hole in center process 10 stages style 2 p...3 d pie chart circular puzzle with hole in center process 10 stages style 2 p...
3 d pie chart circular puzzle with hole in center process 10 stages style 2 p...SlideTeam.net
 
Recruitment strategy for startup business
Recruitment strategy for startup businessRecruitment strategy for startup business
Recruitment strategy for startup businessCrowdinvest
 
A Decade Of Contract Cheating - Examining The Changes That Have Been Made To ...
A Decade Of Contract Cheating - Examining The Changes That Have Been Made To ...A Decade Of Contract Cheating - Examining The Changes That Have Been Made To ...
A Decade Of Contract Cheating - Examining The Changes That Have Been Made To ...Thomas Lancaster
 
The Roles and Functions of Educational Technology in the 21st Century Education
The Roles and Functions of Educational Technology in the 21st Century EducationThe Roles and Functions of Educational Technology in the 21st Century Education
The Roles and Functions of Educational Technology in the 21st Century Educationtrajanokharine89
 
Optimizing Engagement with Persona-Based Marketing - Inbound 2014
Optimizing Engagement with Persona-Based Marketing - Inbound 2014Optimizing Engagement with Persona-Based Marketing - Inbound 2014
Optimizing Engagement with Persona-Based Marketing - Inbound 2014Lattice Engines
 
Jonathan E. Brandt, Aia Leed Ap Portfolio 2011
Jonathan E. Brandt, Aia Leed Ap Portfolio 2011Jonathan E. Brandt, Aia Leed Ap Portfolio 2011
Jonathan E. Brandt, Aia Leed Ap Portfolio 2011Jonathan Brandt
 
Confidence intervals: a useful statistical tool to estimate effect sizes in t...
Confidence intervals: a useful statistical tool to estimate effect sizes in t...Confidence intervals: a useful statistical tool to estimate effect sizes in t...
Confidence intervals: a useful statistical tool to estimate effect sizes in t...Cecilia M. Patino-Sutton, MD MeD PhD
 
The Boys' Club of New York
The Boys' Club of New YorkThe Boys' Club of New York
The Boys' Club of New YorkAmanda Starbuck
 
Welcome to CYP Nursing at
Welcome to CYP Nursing atWelcome to CYP Nursing at
Welcome to CYP Nursing atRuth Davies
 
The six essential skills of professional social media practitioners
The six essential skills of professional social media practitionersThe six essential skills of professional social media practitioners
The six essential skills of professional social media practitionersMarketingatBahrain
 

Viewers also liked (15)

Η εξέλιξη των παιχνιδομηχανών
Η εξέλιξη των παιχνιδομηχανώνΗ εξέλιξη των παιχνιδομηχανών
Η εξέλιξη των παιχνιδομηχανών
 
Introduction To Operating In Second Life Macpa
Introduction To Operating In Second Life MacpaIntroduction To Operating In Second Life Macpa
Introduction To Operating In Second Life Macpa
 
3 d pie chart circular puzzle with hole in center process 10 stages style 2 p...
3 d pie chart circular puzzle with hole in center process 10 stages style 2 p...3 d pie chart circular puzzle with hole in center process 10 stages style 2 p...
3 d pie chart circular puzzle with hole in center process 10 stages style 2 p...
 
Recruitment strategy for startup business
Recruitment strategy for startup businessRecruitment strategy for startup business
Recruitment strategy for startup business
 
A Decade Of Contract Cheating - Examining The Changes That Have Been Made To ...
A Decade Of Contract Cheating - Examining The Changes That Have Been Made To ...A Decade Of Contract Cheating - Examining The Changes That Have Been Made To ...
A Decade Of Contract Cheating - Examining The Changes That Have Been Made To ...
 
The Roles and Functions of Educational Technology in the 21st Century Education
The Roles and Functions of Educational Technology in the 21st Century EducationThe Roles and Functions of Educational Technology in the 21st Century Education
The Roles and Functions of Educational Technology in the 21st Century Education
 
大球場
大球場大球場
大球場
 
Optimizing Engagement with Persona-Based Marketing - Inbound 2014
Optimizing Engagement with Persona-Based Marketing - Inbound 2014Optimizing Engagement with Persona-Based Marketing - Inbound 2014
Optimizing Engagement with Persona-Based Marketing - Inbound 2014
 
Jonathan E. Brandt, Aia Leed Ap Portfolio 2011
Jonathan E. Brandt, Aia Leed Ap Portfolio 2011Jonathan E. Brandt, Aia Leed Ap Portfolio 2011
Jonathan E. Brandt, Aia Leed Ap Portfolio 2011
 
Confidence intervals: a useful statistical tool to estimate effect sizes in t...
Confidence intervals: a useful statistical tool to estimate effect sizes in t...Confidence intervals: a useful statistical tool to estimate effect sizes in t...
Confidence intervals: a useful statistical tool to estimate effect sizes in t...
 
Animada
AnimadaAnimada
Animada
 
The Boys' Club of New York
The Boys' Club of New YorkThe Boys' Club of New York
The Boys' Club of New York
 
Welcome to CYP Nursing at
Welcome to CYP Nursing atWelcome to CYP Nursing at
Welcome to CYP Nursing at
 
栗子
栗子栗子
栗子
 
The six essential skills of professional social media practitioners
The six essential skills of professional social media practitionersThe six essential skills of professional social media practitioners
The six essential skills of professional social media practitioners
 

Similar to Language acquisition framework for robots: From grounded language acquisition to spoken dialogues

Real world-experiences paper given in Orlando Sloan-C conference 2013
Real world-experiences paper given in Orlando Sloan-C conference 2013Real world-experiences paper given in Orlando Sloan-C conference 2013
Real world-experiences paper given in Orlando Sloan-C conference 2013John Bourne
 
RoboCup@HomeEDU AI-Focused Robotics Education by Home Service Robot DIY | Vic...
RoboCup@HomeEDU AI-Focused Robotics Education by Home Service Robot DIY | Vic...RoboCup@HomeEDU AI-Focused Robotics Education by Home Service Robot DIY | Vic...
RoboCup@HomeEDU AI-Focused Robotics Education by Home Service Robot DIY | Vic...Jeffrey Too Chuan TAN
 
RoboCup@HomeEDU AI-Focused Robotics Education by Home Service Robot DIY | Mon...
RoboCup@HomeEDU AI-Focused Robotics Education by Home Service Robot DIY | Mon...RoboCup@HomeEDU AI-Focused Robotics Education by Home Service Robot DIY | Mon...
RoboCup@HomeEDU AI-Focused Robotics Education by Home Service Robot DIY | Mon...Jeffrey Too Chuan TAN
 
Presentation for elearning_from_diane_and_barry_wout_video[1]
Presentation for elearning_from_diane_and_barry_wout_video[1]Presentation for elearning_from_diane_and_barry_wout_video[1]
Presentation for elearning_from_diane_and_barry_wout_video[1]elearningassignment
 
Your Learners ARE Mobile. Is Your Learning?
Your Learners ARE Mobile. Is Your Learning?Your Learners ARE Mobile. Is Your Learning?
Your Learners ARE Mobile. Is Your Learning?Judy Brown
 
Robot Treasure Hunt
Robot Treasure HuntRobot Treasure Hunt
Robot Treasure HuntJon Rosewell
 
CubexMotion - Psychomotor Game Learning Using Leap Motion Technology
CubexMotion - Psychomotor Game Learning Using Leap Motion TechnologyCubexMotion - Psychomotor Game Learning Using Leap Motion Technology
CubexMotion - Psychomotor Game Learning Using Leap Motion TechnologyRizal Asrul Pambudi
 
Screencasting and Presenting for Engineers
Screencasting and Presenting for EngineersScreencasting and Presenting for Engineers
Screencasting and Presenting for EngineersKunal Johar
 
From ICT to Computing. Presentation for the inaugral meeting of the Calderdal...
From ICT to Computing. Presentation for the inaugral meeting of the Calderdal...From ICT to Computing. Presentation for the inaugral meeting of the Calderdal...
From ICT to Computing. Presentation for the inaugral meeting of the Calderdal...Pete Bell
 
Cognitive trainingrobot20140923 munsangkim
Cognitive trainingrobot20140923 munsangkimCognitive trainingrobot20140923 munsangkim
Cognitive trainingrobot20140923 munsangkimAlbert Yefimov
 
Multimodal Learning Analytics
Multimodal Learning AnalyticsMultimodal Learning Analytics
Multimodal Learning AnalyticsXavier Ochoa
 
05 distance learning standards-scorm research
05 distance learning standards-scorm research05 distance learning standards-scorm research
05 distance learning standards-scorm research宥均 林
 
Introduction to Reinforcement Learning - Code Heroku
Introduction to Reinforcement Learning - Code HerokuIntroduction to Reinforcement Learning - Code Heroku
Introduction to Reinforcement Learning - Code Herokucodeheroku
 
Introduction to Reinforcement Learning
Introduction to Reinforcement LearningIntroduction to Reinforcement Learning
Introduction to Reinforcement LearningMihir Thakkar
 
Designing for Immersive Worlds: Enhancing Experience to Accelerate Learning
Designing for Immersive Worlds: Enhancing Experience to Accelerate LearningDesigning for Immersive Worlds: Enhancing Experience to Accelerate Learning
Designing for Immersive Worlds: Enhancing Experience to Accelerate LearningNiki Lambropoulos PhD
 
Ijcai nyc ai summit 20140224 v1
Ijcai nyc ai summit 20140224 v1Ijcai nyc ai summit 20140224 v1
Ijcai nyc ai summit 20140224 v1ISSIP
 

Similar to Language acquisition framework for robots: From grounded language acquisition to spoken dialogues (20)

Real world-experiences paper given in Orlando Sloan-C conference 2013
Real world-experiences paper given in Orlando Sloan-C conference 2013Real world-experiences paper given in Orlando Sloan-C conference 2013
Real world-experiences paper given in Orlando Sloan-C conference 2013
 
RoboCup@HomeEDU AI-Focused Robotics Education by Home Service Robot DIY | Vic...
RoboCup@HomeEDU AI-Focused Robotics Education by Home Service Robot DIY | Vic...RoboCup@HomeEDU AI-Focused Robotics Education by Home Service Robot DIY | Vic...
RoboCup@HomeEDU AI-Focused Robotics Education by Home Service Robot DIY | Vic...
 
RoboCup@HomeEDU AI-Focused Robotics Education by Home Service Robot DIY | Mon...
RoboCup@HomeEDU AI-Focused Robotics Education by Home Service Robot DIY | Mon...RoboCup@HomeEDU AI-Focused Robotics Education by Home Service Robot DIY | Mon...
RoboCup@HomeEDU AI-Focused Robotics Education by Home Service Robot DIY | Mon...
 
Laird ibm-small
Laird ibm-smallLaird ibm-small
Laird ibm-small
 
Presentation for elearning_from_diane_and_barry_wout_video[1]
Presentation for elearning_from_diane_and_barry_wout_video[1]Presentation for elearning_from_diane_and_barry_wout_video[1]
Presentation for elearning_from_diane_and_barry_wout_video[1]
 
ML-02.pdf
ML-02.pdfML-02.pdf
ML-02.pdf
 
Your Learners ARE Mobile. Is Your Learning?
Your Learners ARE Mobile. Is Your Learning?Your Learners ARE Mobile. Is Your Learning?
Your Learners ARE Mobile. Is Your Learning?
 
Robot Treasure Hunt
Robot Treasure HuntRobot Treasure Hunt
Robot Treasure Hunt
 
CubexMotion - Psychomotor Game Learning Using Leap Motion Technology
CubexMotion - Psychomotor Game Learning Using Leap Motion TechnologyCubexMotion - Psychomotor Game Learning Using Leap Motion Technology
CubexMotion - Psychomotor Game Learning Using Leap Motion Technology
 
Lecture 01: Introduction
Lecture 01: IntroductionLecture 01: Introduction
Lecture 01: Introduction
 
Screencasting and Presenting for Engineers
Screencasting and Presenting for EngineersScreencasting and Presenting for Engineers
Screencasting and Presenting for Engineers
 
SAX-VSM
SAX-VSMSAX-VSM
SAX-VSM
 
From ICT to Computing. Presentation for the inaugral meeting of the Calderdal...
From ICT to Computing. Presentation for the inaugral meeting of the Calderdal...From ICT to Computing. Presentation for the inaugral meeting of the Calderdal...
From ICT to Computing. Presentation for the inaugral meeting of the Calderdal...
 
Cognitive trainingrobot20140923 munsangkim
Cognitive trainingrobot20140923 munsangkimCognitive trainingrobot20140923 munsangkim
Cognitive trainingrobot20140923 munsangkim
 
Multimodal Learning Analytics
Multimodal Learning AnalyticsMultimodal Learning Analytics
Multimodal Learning Analytics
 
05 distance learning standards-scorm research
05 distance learning standards-scorm research05 distance learning standards-scorm research
05 distance learning standards-scorm research
 
Introduction to Reinforcement Learning - Code Heroku
Introduction to Reinforcement Learning - Code HerokuIntroduction to Reinforcement Learning - Code Heroku
Introduction to Reinforcement Learning - Code Heroku
 
Introduction to Reinforcement Learning
Introduction to Reinforcement LearningIntroduction to Reinforcement Learning
Introduction to Reinforcement Learning
 
Designing for Immersive Worlds: Enhancing Experience to Accelerate Learning
Designing for Immersive Worlds: Enhancing Experience to Accelerate LearningDesigning for Immersive Worlds: Enhancing Experience to Accelerate Learning
Designing for Immersive Worlds: Enhancing Experience to Accelerate Learning
 
Ijcai nyc ai summit 20140224 v1
Ijcai nyc ai summit 20140224 v1Ijcai nyc ai summit 20140224 v1
Ijcai nyc ai summit 20140224 v1
 

More from Komei Sugiura

ロボティクスにおける言語の利活用
ロボティクスにおける言語の利活用ロボティクスにおける言語の利活用
ロボティクスにおける言語の利活用Komei Sugiura
 
生活支援ロボットにおける 大規模データ収集に向けて
生活支援ロボットにおける大規模データ収集に向けて生活支援ロボットにおける大規模データ収集に向けて
生活支援ロボットにおける 大規模データ収集に向けてKomei Sugiura
 
生活支援ロボットのマルチモーダル言語理解技術
生活支援ロボットのマルチモーダル言語理解技術生活支援ロボットのマルチモーダル言語理解技術
生活支援ロボットのマルチモーダル言語理解技術Komei Sugiura
 
SuMo-SS: Submodular Optimization Sensor Scattering for Deploying Sensor Netwo...
SuMo-SS: Submodular Optimization Sensor Scattering for Deploying Sensor Netwo...SuMo-SS: Submodular Optimization Sensor Scattering for Deploying Sensor Netwo...
SuMo-SS: Submodular Optimization Sensor Scattering for Deploying Sensor Netwo...Komei Sugiura
 
ロボットの音声コミュニケーション技術:言葉や能力の壁を越えるデータ指向知能に向けて
ロボットの音声コミュニケーション技術:言葉や能力の壁を越えるデータ指向知能に向けてロボットの音声コミュニケーション技術:言葉や能力の壁を越えるデータ指向知能に向けて
ロボットの音声コミュニケーション技術:言葉や能力の壁を越えるデータ指向知能に向けてKomei Sugiura
 
Spatio-Temporal Pseudo Relevance Feedback for Large-Scale and Heterogeneous S...
Spatio-Temporal Pseudo Relevance Feedback for Large-Scale and Heterogeneous S...Spatio-Temporal Pseudo Relevance Feedback for Large-Scale and Heterogeneous S...
Spatio-Temporal Pseudo Relevance Feedback for Large-Scale and Heterogeneous S...Komei Sugiura
 
言葉や能力の壁を越えるデータ指向知能
言葉や能力の壁を越えるデータ指向知能言葉や能力の壁を越えるデータ指向知能
言葉や能力の壁を越えるデータ指向知能Komei Sugiura
 
20160907rsj16ロボット聴覚OS
20160907rsj16ロボット聴覚OS20160907rsj16ロボット聴覚OS
20160907rsj16ロボット聴覚OSKomei Sugiura
 
20160606劣モジュラ性を利用したドローンによるばらまき型センサ配置
20160606劣モジュラ性を利用したドローンによるばらまき型センサ配置20160606劣モジュラ性を利用したドローンによるばらまき型センサ配置
20160606劣モジュラ性を利用したドローンによるばらまき型センサ配置Komei Sugiura
 
20160221statistic imitation learning and human-robot communication
20160221statistic imitation learning and human-robot communication20160221statistic imitation learning and human-robot communication
20160221statistic imitation learning and human-robot communicationKomei Sugiura
 
20140513大規模異分野データ横断検索における時空間情報を用いた擬似適合性フィードバック
20140513大規模異分野データ横断検索における時空間情報を用いた擬似適合性フィードバック20140513大規模異分野データ横断検索における時空間情報を用いた擬似適合性フィードバック
20140513大規模異分野データ横断検索における時空間情報を用いた擬似適合性フィードバックKomei Sugiura
 
20150531Deep Recurrent Neural Networkによる環境モニタリングデータの予測
20150531Deep Recurrent Neural Networkによる環境モニタリングデータの予測20150531Deep Recurrent Neural Networkによる環境モニタリングデータの予測
20150531Deep Recurrent Neural Networkによる環境モニタリングデータの予測Komei Sugiura
 
階層型評価構造に基づく観光スポット推薦システムの構築と長期実証実験
階層型評価構造に基づく観光スポット推薦システムの構築と長期実証実験階層型評価構造に基づく観光スポット推薦システムの構築と長期実証実験
階層型評価構造に基づく観光スポット推薦システムの構築と長期実証実験Komei Sugiura
 
実世界の意味を扱う理論と機械知能の構築
実世界の意味を扱う理論と機械知能の構築実世界の意味を扱う理論と機械知能の構築
実世界の意味を扱う理論と機械知能の構築Komei Sugiura
 
Cloud Robotics for Human-Robot Dialogues
Cloud Robotics for Human-Robot DialoguesCloud Robotics for Human-Robot Dialogues
Cloud Robotics for Human-Robot DialoguesKomei Sugiura
 
20151129インテリジェントホームロボティクス研究会
20151129インテリジェントホームロボティクス研究会20151129インテリジェントホームロボティクス研究会
20151129インテリジェントホームロボティクス研究会Komei Sugiura
 
Japan Robot Week 2014けいはんなロボットフォーラム
Japan Robot Week 2014けいはんなロボットフォーラムJapan Robot Week 2014けいはんなロボットフォーラム
Japan Robot Week 2014けいはんなロボットフォーラムKomei Sugiura
 
rospeex: a cloud-based speech communication toolkit for ROS
rospeex: a cloud-based speech communication toolkit for ROSrospeex: a cloud-based speech communication toolkit for ROS
rospeex: a cloud-based speech communication toolkit for ROSKomei Sugiura
 
Introduction to RoboCup@Home
Introduction to RoboCup@HomeIntroduction to RoboCup@Home
Introduction to RoboCup@HomeKomei Sugiura
 

More from Komei Sugiura (20)

ロボティクスにおける言語の利活用
ロボティクスにおける言語の利活用ロボティクスにおける言語の利活用
ロボティクスにおける言語の利活用
 
生活支援ロボットにおける 大規模データ収集に向けて
生活支援ロボットにおける大規模データ収集に向けて生活支援ロボットにおける大規模データ収集に向けて
生活支援ロボットにおける 大規模データ収集に向けて
 
生活支援ロボットのマルチモーダル言語理解技術
生活支援ロボットのマルチモーダル言語理解技術生活支援ロボットのマルチモーダル言語理解技術
生活支援ロボットのマルチモーダル言語理解技術
 
SuMo-SS: Submodular Optimization Sensor Scattering for Deploying Sensor Netwo...
SuMo-SS: Submodular Optimization Sensor Scattering for Deploying Sensor Netwo...SuMo-SS: Submodular Optimization Sensor Scattering for Deploying Sensor Netwo...
SuMo-SS: Submodular Optimization Sensor Scattering for Deploying Sensor Netwo...
 
ロボットの音声コミュニケーション技術:言葉や能力の壁を越えるデータ指向知能に向けて
ロボットの音声コミュニケーション技術:言葉や能力の壁を越えるデータ指向知能に向けてロボットの音声コミュニケーション技術:言葉や能力の壁を越えるデータ指向知能に向けて
ロボットの音声コミュニケーション技術:言葉や能力の壁を越えるデータ指向知能に向けて
 
Spatio-Temporal Pseudo Relevance Feedback for Large-Scale and Heterogeneous S...
Spatio-Temporal Pseudo Relevance Feedback for Large-Scale and Heterogeneous S...Spatio-Temporal Pseudo Relevance Feedback for Large-Scale and Heterogeneous S...
Spatio-Temporal Pseudo Relevance Feedback for Large-Scale and Heterogeneous S...
 
言葉や能力の壁を越えるデータ指向知能
言葉や能力の壁を越えるデータ指向知能言葉や能力の壁を越えるデータ指向知能
言葉や能力の壁を越えるデータ指向知能
 
20161014IROS_WS
20161014IROS_WS20161014IROS_WS
20161014IROS_WS
 
20160907rsj16ロボット聴覚OS
20160907rsj16ロボット聴覚OS20160907rsj16ロボット聴覚OS
20160907rsj16ロボット聴覚OS
 
20160606劣モジュラ性を利用したドローンによるばらまき型センサ配置
20160606劣モジュラ性を利用したドローンによるばらまき型センサ配置20160606劣モジュラ性を利用したドローンによるばらまき型センサ配置
20160606劣モジュラ性を利用したドローンによるばらまき型センサ配置
 
20160221statistic imitation learning and human-robot communication
20160221statistic imitation learning and human-robot communication20160221statistic imitation learning and human-robot communication
20160221statistic imitation learning and human-robot communication
 
20140513大規模異分野データ横断検索における時空間情報を用いた擬似適合性フィードバック
20140513大規模異分野データ横断検索における時空間情報を用いた擬似適合性フィードバック20140513大規模異分野データ横断検索における時空間情報を用いた擬似適合性フィードバック
20140513大規模異分野データ横断検索における時空間情報を用いた擬似適合性フィードバック
 
20150531Deep Recurrent Neural Networkによる環境モニタリングデータの予測
20150531Deep Recurrent Neural Networkによる環境モニタリングデータの予測20150531Deep Recurrent Neural Networkによる環境モニタリングデータの予測
20150531Deep Recurrent Neural Networkによる環境モニタリングデータの予測
 
階層型評価構造に基づく観光スポット推薦システムの構築と長期実証実験
階層型評価構造に基づく観光スポット推薦システムの構築と長期実証実験階層型評価構造に基づく観光スポット推薦システムの構築と長期実証実験
階層型評価構造に基づく観光スポット推薦システムの構築と長期実証実験
 
実世界の意味を扱う理論と機械知能の構築
実世界の意味を扱う理論と機械知能の構築実世界の意味を扱う理論と機械知能の構築
実世界の意味を扱う理論と機械知能の構築
 
Cloud Robotics for Human-Robot Dialogues
Cloud Robotics for Human-Robot DialoguesCloud Robotics for Human-Robot Dialogues
Cloud Robotics for Human-Robot Dialogues
 
20151129インテリジェントホームロボティクス研究会
20151129インテリジェントホームロボティクス研究会20151129インテリジェントホームロボティクス研究会
20151129インテリジェントホームロボティクス研究会
 
Japan Robot Week 2014けいはんなロボットフォーラム
Japan Robot Week 2014けいはんなロボットフォーラムJapan Robot Week 2014けいはんなロボットフォーラム
Japan Robot Week 2014けいはんなロボットフォーラム
 
rospeex: a cloud-based speech communication toolkit for ROS
rospeex: a cloud-based speech communication toolkit for ROSrospeex: a cloud-based speech communication toolkit for ROS
rospeex: a cloud-based speech communication toolkit for ROS
 
Introduction to RoboCup@Home
Introduction to RoboCup@HomeIntroduction to RoboCup@Home
Introduction to RoboCup@Home
 

Recently uploaded

Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfhans926745
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 

Recently uploaded (20)

Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 

Language acquisition framework for robots: From grounded language acquisition to spoken dialogues

  • 1. LCore: A Language Acquisition Framework for Robots From Grounded Language Acquisition to Spoken Dialogues 2013/12/13 Komei Sugiura and Naoto Iwahashi National Institute of Information and Communication Technology, Japan komei.sugiura@nict.go.jp
  • 2. Open problem: grounded language processing • Language processing based on non-verbal information (vision, motion, context, experience, …) is still very difficult – e.g. “Put the blue cup away”, “Give me the usual” • What is missing in dialog processing for robots? – Physical situatedness / symbol grounding – Shared experience “blue cup”: multiple candidates 2 “the usual”: umbrella, remote, drink,..
  • 3. Spoken dialogue system + Robot ≠ Robot dialogue • Robot dialogue – Categorization/prediction of real-world information – Handling real-world properties – Linguistic interaction • Why is this difficult? – Machine learning, CV, manipulation, symbol grounding problem, speech recognition,… Tableware Cup Tea cup Cutlery Fork Plate Knife
  • 4. Robot Language Acquisition Framework [Iwahashi 10, “Robots That Learn to Communicate: A Developmental Approach…”] • Task: Object manipulation dialogues • Key features – Fully grounded vocabulary – Imitation learning – Incremental & interactive learning – Language independent 4
  • 5. LCore functions Phoneme learning Learning question answering Word learning Visual feature learning Grammar learning Affordance learning Disambiguation of word ellipsis Imitation learning Utterance understanding Role reversal imitation Robot-directed utterance detection Active-learning-based dialogue 5
  • 6. Learning modules Word Grammar Motion-object relationship • Learning nouns/adjectives • Learning verbs • Learning probabilistic distributions of • Estimation of related objects visual features • Learning trajectories • Learning phoneme sequences • Learning phoneme sequences
  • 7. Symbol grounding: Learning nouns and adjectives • Visual features modeled by Gaussians – Input: visual features of objects • Out-of-vocabulary word = phoneme sequence + waveform – Voice conversion (Eigenvoice GMM) to robot voice Generative models BLUE Unknown object RED
  • 8. Imitation learning of object manipulation [Sugiura+ 07] • Difficulty: Clustering trajectories in the world coordinate system does not work • Proposed method – Input: Position sequences of all objects – Estimation of reference point and coordinate system by EM algorithm – Number of state is optimized by cross-validation Place A on B
  • 9. Imitation learning using reference-point-dependent HMMs [Sugiura+ 07][Sugiura+ 11] Searching optimal coordinate system Coordinate system type :Position at time t … = Reference object ID HMM parameters • Delta parameters = … * Sugiura, K. et al, “Learning, Recognition, and Generation of Motion by …”, Advanced Robotics, Vol.25, No.17, 2011
  • 10. Results: motion learning No verb is estimated to have WCS -> Reference-point-dependent verb Velocity Motion “place-on” Log likelihood Position Place-on Move-closer Raise Jump-over Move-away Rotate Move-down Training-set likelihood
  • 11. Transformation of reference-point-dependent HMMs [Sugiura+ 11] • What is the problem? – Simple HMMs do not generate continuous trajectories – Situation dependent trajectories • Reference-point-dependent HMM – Input: (motion ID, object ID) e.g. <place-on, Object 1, Object 3> – Output: Maximum likelihood trajectory Situation HMM “Place-on” World CS Place X on Y * Sugiura, K. (2011), “Learning, Generation, and Recognition of Reference-Point-Dependent Probabilistic…”
  • 12. Generating continuous trajectory using delta parameters [Tokuda+ 00] Maximum likelihood trajectory : time series of (position,velocity,acceleration) : state sequence : HMM parameters : filter ( ) : matrix of covariance matrices of each OPDF : time series of position : vector of mean vectors *Tokuda, K. et al, “Speech parameter generation algorithms for HMM-based speech synthesis”, 2000
  • 13. Quantitative results • Evaluation measure – Euclidian distance – Normalized by frame number T Trajectory by Subject Trajectory by proposed method
  • 14. SPOKEN LANGUAGE UNDERSTANDING USING NON-LINGUISTIC INFORMATION
  • 15. Utterance understanding in LCore (1) • User utterances are understood by using multimodal information learned in a statistical learning framework Vision Motion (Bayesian learning of a Gaussian) (HMM) Speech (HMM) Motion-object relationship (Bayesian learning of a Gaussian) Shared belief Context (MCE Learning) 15
  • 16. Integration of multimodal information • Shared belief Ψ: weighted sum of five modules utterance action scene context Speech Motion Vision Motion-object relationship Context 16
  • 18. Grounded utterance disambiguation Where to? • Simple dialog systems Which “cup”? U: “Place the cup (on the table).” R: “You said place the cup.” -> Risk of motion failure • Generating confirmation utterances using physical information R: “I’ll place the red cup on the table, is it OK?”
  • 19. Multimodal utterance understanding Place-on Elmo 30th 1st 2nd … 1st 2nd … 30th Sugiura, K. et al, "Situated Spoken Dialogue with Robots Using Active Learning", Advanced Robotics, 2011 19
  • 20. Multimodal utterance understanding Place-on Elmo 30th Margin 1st 2nd … 1st 2nd … 30th Sugiura, K. et al, "Situated Spoken Dialogue with Robots Using Active Learning", Advanced Robotics, 2011 20
  • 21. Confirmation by paraphrasing user’s utterance • Learning phase • Bayesian Logistic Regression • Input: Margin(d), Output: probability • Execution phase – Decision-making on responses based on expected utility Probability 21 Margin
  • 22. Quantitative result: Risk reduction Baseline Proposed Decreased to 1/4 Failure rate Rejection rate Confirmation rate # of confirmation utt 22
  • 23. Reduction of motion failure in learning phase [Sugiura+ 11] • So far… – Learning utterance understanding probabilities • Idea • Learning-by-asking Phase Operator Motion executor Active Learning Robot User (Passive) learning User Robot Execution User Robot Sugiura, K. et al, "Situated Spoken Dialogue with Robots Using Active Learning", Advanced Robotics, Vol. 25, No. 17, 2011
  • 24. Reduction of motion failure in learning phase • Problem: – Motion failure is required in learning phase to avoid over-fitting Active Learning phase Motion failure Motion success Learning phase “Safe” training data Motion failure Execution phase Motion success
  • 25. What kind of commands are effective for learning? • Proposed method: Active Learning-based command generation • Objective: Reduce the number of interactions • [Input = image], [Output = utterance] • Expected Log Loss Reduction(ELLR[Roy, 2001]) is used to select the optimal utterance Active Learning : A form of supervised learning in which inputs can be selected by the algorithm Target action Robot utterance Loss Act=A, Objs = <1,3> “Place-on Elmo blue box” 35.8 Act=A, Objs = <1,3> “Place-on Elmo” 12.3 Act=A, Objs= <1, 2> “Place-on Elmo” 28.1 : : : “Raise box” 332.3 : : Act=B, Objs=<2> :
  • 27. Reduction of motion failure in learning phase Test-set likelihood (1) Proposed (2) Baseline Number of episodes Motion failure risk reduced # of motion failure Fast convergence Proposed Baseline