Intelligent Agents: Technology and Applications
Multi-agent Learning
IST 597B, Spring 2003
John Yen
Learning Objectives
Multi-Agent Learning
Examples
Predator/Prey (Pursuit) Domain
Taxonomy of MAS
1. Homogeneous, Non-Communicating Agents
1: Reactive vs. Deliberative Agents
2: Local vs. Global Perspective
3: Modeling of Other Agents
4: How to Affect Others
5: Further Learning Opportunities
2. Heterogeneous, Non-Communicating Agents
1: Benevolence vs. Competitiveness
2: Fixed vs. Learning Agents
3: Modeling of Other Agents
4: Resource Management
5: Social Conventions
3. Homogeneous, Communicating Agents
4. Heterogeneous, Communicating Agents
1: Understanding Each Other
2: Planning Communication Acts
3: Negotiation
4: Commitment/Decommitment
5: Further Learning Opportunities
Q Learning
The Q value
R: the reward.
P_xy: the probability of reaching state y from state x by taking action a.
Gamma: the discount factor (between 0 and 1).
V*(y): the expected total discounted return starting in y and following the optimal policy π*.
Policy: a mapping from each state to the action to take there.
The expected total discounted return V for a state is the maximal Q value among all actions that can be taken at that state (following the policy from then on).
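The slide's equation did not survive as text. Assuming it follows the standard definition built from the quantities listed above (writing a for the action, x and y for states, and R_x(a) for the immediate reward, which is notation assumed here), the Q value and its relation to V can be written as:

```latex
Q^*(x, a) = R_x(a) + \gamma \sum_{y} P_{xy}(a)\, V^*(y),
\qquad
V^*(x) = \max_{a} Q^*(x, a)
```

The first equation reads: the value of taking action a in state x is the immediate reward plus the discounted value of whichever state y is reached, weighted by how likely each y is.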
 
Learning Rule for the Q Value
Alpha: the learning rate.
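The rule itself is missing from the extracted text. A sketch of the standard one-step Q-learning update, assuming that is what the slide showed (r is the observed reward, y the state reached from x after action a, and b ranges over the actions available in y):

```latex
Q(x, a) \leftarrow (1 - \alpha)\, Q(x, a) + \alpha \left[ r + \gamma\, V(y) \right],
\qquad
V(y) = \max_{b} Q(y, b)
```

With alpha close to 0 the old estimate dominates; with alpha close to 1 the estimate is almost replaced by the newest sample.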
1. Initialize … and … for all x and a.
2. Do forever:
   - x ← the current state.
   - Choose the action a that maximizes Q(x, a) over all a.
   - Carry out action a in the world. Let the short-term reward be r, and the new state be y.
   - For each state-action pair, do: …
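The loop above can be sketched in Python. This is a minimal illustration, not the slide's exact algorithm: it uses plain one-step tabular Q-learning (updating only the visited state-action pair rather than every pair), adds epsilon-greedy exploration, and runs on a hypothetical five-state chain world invented for the example. All function and parameter names are assumptions.

```python
import random


def q_learning(n_states=5, episodes=500, alpha=0.5, gamma=0.9,
               epsilon=0.1, seed=0):
    """One-step tabular Q-learning on a hypothetical chain world.

    States are 0..n_states-1. Action 1 moves right, action 0 moves left
    (staying put at state 0). Entering the last state gives reward 1 and
    ends the episode; every other step gives reward 0.
    """
    rng = random.Random(seed)
    goal = n_states - 1
    q = [[0.0, 0.0] for _ in range(n_states)]  # q[state][action]

    for _ in range(episodes):
        x = 0  # the current state
        while x != goal:
            # Mostly pick the action that maximizes Q(x, a); explore
            # with probability epsilon and break ties at random.
            if rng.random() < epsilon:
                a = rng.randrange(2)
            else:
                best = max(q[x])
                a = rng.choice([i for i, v in enumerate(q[x]) if v == best])

            # Carry out the action; observe reward r and new state y.
            y = min(x + 1, goal) if a == 1 else max(x - 1, 0)
            r = 1.0 if y == goal else 0.0

            # V(y) is the best Q value in y (0 once the episode has ended).
            v_y = 0.0 if y == goal else max(q[y])
            q[x][a] = (1 - alpha) * q[x][a] + alpha * (r + gamma * v_y)
            x = y
    return q


if __name__ == "__main__":
    for state, values in enumerate(q_learning()):
        print(state, [round(v, 3) for v in values])
```

After training, moving right has the higher Q value in every non-goal state, which is the optimal policy for this toy world.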
Probability for the agent to select action a_i based on Q values
T: “temperature” parameter that determines the randomness of decisions.
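The formula is missing from the extracted text. A temperature parameter suggests the Boltzmann (softmax) distribution, in which the probability of action a_i is exp(Q(x, a_i)/T) divided by the sum of exp(Q(x, a_k)/T) over all actions a_k. The sketch below assumes that form; the function names are hypothetical.

```python
import math
import random


def boltzmann_probabilities(q_values, temperature):
    """Return selection probabilities proportional to exp(Q / T)."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    # Subtracting the maximum leaves the result unchanged but avoids
    # overflow in exp() when Q / T is large.
    top = max(q_values)
    weights = [math.exp((q - top) / temperature) for q in q_values]
    total = sum(weights)
    return [w / total for w in weights]


def select_action(q_values, temperature, rng=random):
    """Sample an action index according to the Boltzmann probabilities."""
    probs = boltzmann_probabilities(q_values, temperature)
    return rng.choices(range(len(q_values)), weights=probs, k=1)[0]


if __name__ == "__main__":
    q = [1.0, 2.0, 3.0]
    for t in (0.1, 1.0, 10.0):
        print(t, [round(p, 3) for p in boltzmann_probabilities(q, t)])
```

A high T makes the choice nearly uniform (more exploration); a low T makes it nearly greedy with respect to the Q values.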
Towards Collaborative and Adversarial Learning: A Case Study in Robotic Soccer
Peter Stone & Manuela Veloso
Introduction
Simple Behavior
Parameters
Fixed Ball Motion
Neural Network
Results
Varying Ball Speed
Varying Ball’s Trajectory
Moving the Goal
Cooperative Learning
Adversarial Learning
References
