SlideShare a Scribd company logo
1 of 13
Artificial Intelligence
Learning Agents
Introduction
Artificial Intelligence Training
Created by Andrew Ferlitsch
Instructor
October, 2017
Introduction
• An agent (e.g., robot) interacts with a dynamic
environment.
• An agent learns from interacting with the environment
the best actions to take.
• Four Types of Agents (in increasing capability):
• Simple Reflex agents
• Model-based agents
• Goal-based agents
• Utility-based agents
Simple Reflex Agent
Simple Reflex Agent
Sensors
Actuators
Environment
Senses the environment (e.g., camera,
audio, LIDAR, GPS, ultrasonic)
Room, Street,
Warehouse, etc.
Modifies the environment
(e.g., walk, pickup, drive)
State
Actions
Current State of the Agent
relative to the environment.
Actions the Agent
can take.
Rules
A set of pre-defined
rules that map a
state to an action.
A simple reflex agent always executes the same action for the same observation.
Works in Environments that are fully observable.
Actions / Environment (Simple Reflex)
Reflex Agent
Environment
Action Observation
Continuous Cycle:
Observe Environment,
Take Action,
Observe Environment,
Take Action
Actions are determined
based on predefined
rules.
Preprogrammed
StateRules
How the action
effected the agent
and environment.
Predefined Rules select
Action based on predicted
State.
Model-Based (Reflex) Agent
Model-Based Reflex Agent
Sensors
Actuators
Environment
Senses the environment (e.g., camera,
audio, LIDAR, GPS, ultrasonic)
Room, Street,
Warehouse, etc.
Modifies the environment
(e.g., walk, pickup, drive)
State
Actions
Presumed State of the Agent
relative to the environment.
Actions the Agent
can take.
Rules
A set of pre-defined
rules that map a
state to an action.
A model-based agent uses model to predict the unobserved portion of the environment.
Works in Environments that are only partially observable.
Past
Model of how the
environment responds
to predict unobserved
changes to the
Environments.
Model
Short-term memory
of past observations.
Actions / Environment (Model-Based)
Model-based Reflex Agent
Environment
Action Observation
Preprogrammed
StatePast
How the action
effected the agent
and environment.
History of Past Observations
Predefined Model of
how the environment
behaves.
The model
combines the past and
present observations
to predict the state of
the environment.
Predefined Rules select
Action based on predicted
State.
Rules
MODEL
Goal-Based Agent
Goal-Based Agent
Sensors
Actuators
Environment
Senses the environment (e.g., camera,
audio, LIDAR, GPS, ultrasonic)
Room, Street,
Warehouse, etc.
Modifies the environment
(e.g., walk, pickup, drive)
State
Actions
(Presumed/Known) State of the Agent
relative to the environment.
Actions the Agent
can take.
Rules
A set of pre-defined
rules that map a
state to an action.
A goal-based agent uses a goal(s) to evaluate how close to achieving the goal is the
next possible action.
Works in Environments which need to predict the future.
Past
Model of how the
environment responds
to predict unobserved
changes to the
Environments.
Model
Goal
Optional
A goal(s) to achieve,
when evaluating the
next action (i.e., how
closer to achieving the
goal).
Actions / Environment (Goal-Based)
Goal-based Agent
Environment
Action Observation
StatePast
How the action
effected the agent
and environment.
History of Past Observations
Predefined Model
of how the
environment
behaves.
Predefined Rules select
Action based on how close is
the predicted State to the goal.
Rules
MODEL
GOAL(s)
A Goal(s) for
evaluating how
close is an
action/state to
the goal.
Utility-Based (“Rational”) Agent
Utility-Based Agent
Sensors
Actuators
Environment
Senses the environment (e.g., camera,
audio, LIDAR, GPS, ultrasonic)
Room, Street,
Warehouse, etc.
Modifies the environment
(e.g., walk, pickup, drive)
State
Actions
(Presumed/Known) State of the Agent
relative to the environment.
Actions the Agent
can take.
Rules
A set of pre-defined
rules that map a
state to an action.
A utility-based agent uses a utility to measure the value of the next possible action
to achieving the goal.
Works in Environments which must optimize achieving the Goal.
Past
Model of how the
environment responds
to predict unobserved
changes to the
Environments.
Model
Goal
Optional
A goal(s) to achieve,
when evaluating the
next action (i.e., how
closer to achieving the
goal). Utility
A measurement of
the value of an action
towards the goal.
Actions / Environment (Utility-Based)
Utility-based Agent
Environment
Action Observation
StatePast
How the action
effected the agent
and environment.
History of Past Observations
Predefined Model
of how the
environment
behaves.
Predefined Rules select
Action based on the value of the
predicted State to achieving the goal.
Rules
MODEL
GOAL(s)
A Goal(s) for
evaluating how
close is an
action/state to
the goal.
(𝑺, 𝑨)
A utility for measuring
The value of an State/Action
Towards achieving a goal.
What’s Missing?
• There is no learning!
• Learn the Model (learn to model the environment)
• Learn the Utility (learn to measure the value of a state)
Learning (“Intelligent”) Agent
Learning Agent
Sensors
Actuators
Environment
Senses the environment (e.g., camera,
audio, LIDAR, GPS, ultrasonic)
Room, Street,
Warehouse, etc.
Modifies the environment
(e.g., walk, pickup, drive)
State
Actions
(Presumed/Known) State of the Agent
relative to the environment.
Actions the Agent
can take.
A learning agent dynamically learns a Policy to model the Environment and build
Action/State rules.
Works in Environments that are Dynamically Changing (Stochastic)
Policy
A learned model of
the environment
and learned
State/Action rules.
Goal
Critic
Utility
A goal(s) to achieve,
when evaluating the
next action (i.e., how
closer to achieving the
goal).
A measurement of
the value of an action
towards the goal.
A measurement on
how good an action
actually was.
State / Reward
Intelligent Agent
Environment
Action Observation
State Reward
How the action
effected the agent
and environment.
How positive or
negative is the
new state.
LEARN
What was
learned from
the reward.
Policy
Learned set of
rules of:
States -> Actions
Example Positive Reward:
Robot Stands Up,
Closer to Destination
Example Negative Reward:
Robot Falls Down,
Further from Destination
Reinforcement Learning

More Related Content

What's hot

AI Agents, Agents in Artificial Intelligence
AI Agents, Agents in Artificial IntelligenceAI Agents, Agents in Artificial Intelligence
AI Agents, Agents in Artificial IntelligenceKirti Verma
 
Intelligence Agent - Artificial Intelligent (AI)
Intelligence Agent - Artificial Intelligent (AI)Intelligence Agent - Artificial Intelligent (AI)
Intelligence Agent - Artificial Intelligent (AI)mufassirin
 
peas description of task environment with different types of properties
 peas description of task environment with different types of properties peas description of task environment with different types of properties
peas description of task environment with different types of propertiesmonircse2
 
Lecture 2 agent and environment
Lecture 2   agent and environmentLecture 2   agent and environment
Lecture 2 agent and environmentVajira Thambawita
 
State space search
State space searchState space search
State space searchchauhankapil
 
1.1 What are Agent and Environment.pptx
1.1 What are Agent and Environment.pptx1.1 What are Agent and Environment.pptx
1.1 What are Agent and Environment.pptxSuvamvlogs
 
Problem solving agents
Problem solving agentsProblem solving agents
Problem solving agentsMegha Sharma
 
Intelligent Agent PPT ON SLIDESHARE IN ARTIFICIAL INTELLIGENCE
Intelligent Agent PPT ON SLIDESHARE IN ARTIFICIAL INTELLIGENCEIntelligent Agent PPT ON SLIDESHARE IN ARTIFICIAL INTELLIGENCE
Intelligent Agent PPT ON SLIDESHARE IN ARTIFICIAL INTELLIGENCEKhushboo Pal
 
Unification and Lifting
Unification and LiftingUnification and Lifting
Unification and LiftingMegha Sharma
 
Types of environment in Artificial Intelligence
Types of environment in Artificial IntelligenceTypes of environment in Artificial Intelligence
Types of environment in Artificial IntelligenceNoman Ullah Khan
 
Introduction to Web Mining and Spatial Data Mining
Introduction to Web Mining and Spatial Data MiningIntroduction to Web Mining and Spatial Data Mining
Introduction to Web Mining and Spatial Data MiningAarshDhokai
 
Reinforcement Learning
Reinforcement LearningReinforcement Learning
Reinforcement LearningSalem-Kabbani
 
Artificial intelligence- Logic Agents
Artificial intelligence- Logic AgentsArtificial intelligence- Logic Agents
Artificial intelligence- Logic AgentsNuruzzaman Milon
 

What's hot (20)

AI Agents, Agents in Artificial Intelligence
AI Agents, Agents in Artificial IntelligenceAI Agents, Agents in Artificial Intelligence
AI Agents, Agents in Artificial Intelligence
 
Intelligence Agent - Artificial Intelligent (AI)
Intelligence Agent - Artificial Intelligent (AI)Intelligence Agent - Artificial Intelligent (AI)
Intelligence Agent - Artificial Intelligent (AI)
 
Intelligent Agents
Intelligent Agents Intelligent Agents
Intelligent Agents
 
peas description of task environment with different types of properties
 peas description of task environment with different types of properties peas description of task environment with different types of properties
peas description of task environment with different types of properties
 
AI Lecture 4 (informed search and exploration)
AI Lecture 4 (informed search and exploration)AI Lecture 4 (informed search and exploration)
AI Lecture 4 (informed search and exploration)
 
Agents1
Agents1Agents1
Agents1
 
Lecture 2 agent and environment
Lecture 2   agent and environmentLecture 2   agent and environment
Lecture 2 agent and environment
 
State space search
State space searchState space search
State space search
 
1.1 What are Agent and Environment.pptx
1.1 What are Agent and Environment.pptx1.1 What are Agent and Environment.pptx
1.1 What are Agent and Environment.pptx
 
Problem solving agents
Problem solving agentsProblem solving agents
Problem solving agents
 
Intelligent Agent PPT ON SLIDESHARE IN ARTIFICIAL INTELLIGENCE
Intelligent Agent PPT ON SLIDESHARE IN ARTIFICIAL INTELLIGENCEIntelligent Agent PPT ON SLIDESHARE IN ARTIFICIAL INTELLIGENCE
Intelligent Agent PPT ON SLIDESHARE IN ARTIFICIAL INTELLIGENCE
 
Informed search
Informed searchInformed search
Informed search
 
Unification and Lifting
Unification and LiftingUnification and Lifting
Unification and Lifting
 
Types of environment in Artificial Intelligence
Types of environment in Artificial IntelligenceTypes of environment in Artificial Intelligence
Types of environment in Artificial Intelligence
 
Introduction to Web Mining and Spatial Data Mining
Introduction to Web Mining and Spatial Data MiningIntroduction to Web Mining and Spatial Data Mining
Introduction to Web Mining and Spatial Data Mining
 
The structure of agents
The structure of agentsThe structure of agents
The structure of agents
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligence
 
Reinforcement Learning
Reinforcement LearningReinforcement Learning
Reinforcement Learning
 
Artificial intelligence- Logic Agents
Artificial intelligence- Logic AgentsArtificial intelligence- Logic Agents
Artificial intelligence- Logic Agents
 
Intelligent agents
Intelligent agentsIntelligent agents
Intelligent agents
 

Similar to AI Learning Agents Introduction

AI - Introduction to Reinforcement Learning
AI - Introduction to Reinforcement LearningAI - Introduction to Reinforcement Learning
AI - Introduction to Reinforcement LearningAndrew Ferlitsch
 
Intelligent Agents
Intelligent AgentsIntelligent Agents
Intelligent Agentsmarada0033
 
Jarrar.lecture notes.aai.2011s.ch2.intelligentagents
Jarrar.lecture notes.aai.2011s.ch2.intelligentagentsJarrar.lecture notes.aai.2011s.ch2.intelligentagents
Jarrar.lecture notes.aai.2011s.ch2.intelligentagentsPalGov
 
Jarrar.lecture notes.aai.2011s.ch2.intelligentagents
Jarrar.lecture notes.aai.2011s.ch2.intelligentagentsJarrar.lecture notes.aai.2011s.ch2.intelligentagents
Jarrar.lecture notes.aai.2011s.ch2.intelligentagentsPalGov
 
Agents-and-Problem-Solving-20022024-094442am.pdf
Agents-and-Problem-Solving-20022024-094442am.pdfAgents-and-Problem-Solving-20022024-094442am.pdf
Agents-and-Problem-Solving-20022024-094442am.pdfsyedhasanali293
 
introduction to inteligent IntelligentAgent.ppt
introduction to inteligent IntelligentAgent.pptintroduction to inteligent IntelligentAgent.ppt
introduction to inteligent IntelligentAgent.pptdejene3
 
2.IntelligentAgents.ppt
2.IntelligentAgents.ppt2.IntelligentAgents.ppt
2.IntelligentAgents.pptGopiSuresh8
 
Artificial Intelligence Chapter two agents
Artificial Intelligence Chapter two agentsArtificial Intelligence Chapter two agents
Artificial Intelligence Chapter two agentsEhsan Nowrouzi
 

Similar to AI Learning Agents Introduction (20)

AI - Introduction to Reinforcement Learning
AI - Introduction to Reinforcement LearningAI - Introduction to Reinforcement Learning
AI - Introduction to Reinforcement Learning
 
m2-agents.pptx
m2-agents.pptxm2-agents.pptx
m2-agents.pptx
 
Lecture 2
Lecture 2Lecture 2
Lecture 2
 
agents in ai ppt
agents in ai pptagents in ai ppt
agents in ai ppt
 
Intelligent Agents
Intelligent AgentsIntelligent Agents
Intelligent Agents
 
Jarrar.lecture notes.aai.2011s.ch2.intelligentagents
Jarrar.lecture notes.aai.2011s.ch2.intelligentagentsJarrar.lecture notes.aai.2011s.ch2.intelligentagents
Jarrar.lecture notes.aai.2011s.ch2.intelligentagents
 
Jarrar.lecture notes.aai.2011s.ch2.intelligentagents
Jarrar.lecture notes.aai.2011s.ch2.intelligentagentsJarrar.lecture notes.aai.2011s.ch2.intelligentagents
Jarrar.lecture notes.aai.2011s.ch2.intelligentagents
 
Agents-and-Problem-Solving-20022024-094442am.pdf
Agents-and-Problem-Solving-20022024-094442am.pdfAgents-and-Problem-Solving-20022024-094442am.pdf
Agents-and-Problem-Solving-20022024-094442am.pdf
 
introduction to inteligent IntelligentAgent.ppt
introduction to inteligent IntelligentAgent.pptintroduction to inteligent IntelligentAgent.ppt
introduction to inteligent IntelligentAgent.ppt
 
2.IntelligentAgents.ppt
2.IntelligentAgents.ppt2.IntelligentAgents.ppt
2.IntelligentAgents.ppt
 
Artificial Intelligence Chapter two agents
Artificial Intelligence Chapter two agentsArtificial Intelligence Chapter two agents
Artificial Intelligence Chapter two agents
 
m2-agents.ppt
m2-agents.pptm2-agents.ppt
m2-agents.ppt
 
Agents_AI.ppt
Agents_AI.pptAgents_AI.ppt
Agents_AI.ppt
 
AI Basic.pptx
AI Basic.pptxAI Basic.pptx
AI Basic.pptx
 
Lec 2 agents
Lec 2 agentsLec 2 agents
Lec 2 agents
 
Infosec
InfosecInfosec
Infosec
 
Lec 2-agents
Lec 2-agentsLec 2-agents
Lec 2-agents
 
Unit-1.pptx
Unit-1.pptxUnit-1.pptx
Unit-1.pptx
 
Intelligent agents
Intelligent agentsIntelligent agents
Intelligent agents
 
M2 agents
M2 agentsM2 agents
M2 agents
 

More from Andrew Ferlitsch

Pareto Principle Applied to QA
Pareto Principle Applied to QAPareto Principle Applied to QA
Pareto Principle Applied to QAAndrew Ferlitsch
 
Whiteboarding Coding Challenges in Python
Whiteboarding Coding Challenges in PythonWhiteboarding Coding Challenges in Python
Whiteboarding Coding Challenges in PythonAndrew Ferlitsch
 
Object Oriented Programming Principles
Object Oriented Programming PrinciplesObject Oriented Programming Principles
Object Oriented Programming PrinciplesAndrew Ferlitsch
 
Python - Installing and Using Python and Jupyter Notepad
Python - Installing and Using Python and Jupyter NotepadPython - Installing and Using Python and Jupyter Notepad
Python - Installing and Using Python and Jupyter NotepadAndrew Ferlitsch
 
Natural Language Processing - Groupings (Associations) Generation
Natural Language Processing - Groupings (Associations) GenerationNatural Language Processing - Groupings (Associations) Generation
Natural Language Processing - Groupings (Associations) GenerationAndrew Ferlitsch
 
Natural Language Provessing - Handling Narrarive Fields in Datasets for Class...
Natural Language Provessing - Handling Narrarive Fields in Datasets for Class...Natural Language Provessing - Handling Narrarive Fields in Datasets for Class...
Natural Language Provessing - Handling Narrarive Fields in Datasets for Class...Andrew Ferlitsch
 
Machine Learning - Introduction to Recurrent Neural Networks
Machine Learning - Introduction to Recurrent Neural NetworksMachine Learning - Introduction to Recurrent Neural Networks
Machine Learning - Introduction to Recurrent Neural NetworksAndrew Ferlitsch
 
Machine Learning - Introduction to Convolutional Neural Networks
Machine Learning - Introduction to Convolutional Neural NetworksMachine Learning - Introduction to Convolutional Neural Networks
Machine Learning - Introduction to Convolutional Neural NetworksAndrew Ferlitsch
 
Machine Learning - Introduction to Neural Networks
Machine Learning - Introduction to Neural NetworksMachine Learning - Introduction to Neural Networks
Machine Learning - Introduction to Neural NetworksAndrew Ferlitsch
 
Python - Numpy/Pandas/Matplot Machine Learning Libraries
Python - Numpy/Pandas/Matplot Machine Learning LibrariesPython - Numpy/Pandas/Matplot Machine Learning Libraries
Python - Numpy/Pandas/Matplot Machine Learning LibrariesAndrew Ferlitsch
 
Machine Learning - Accuracy and Confusion Matrix
Machine Learning - Accuracy and Confusion MatrixMachine Learning - Accuracy and Confusion Matrix
Machine Learning - Accuracy and Confusion MatrixAndrew Ferlitsch
 
Machine Learning - Ensemble Methods
Machine Learning - Ensemble MethodsMachine Learning - Ensemble Methods
Machine Learning - Ensemble MethodsAndrew Ferlitsch
 
ML - Multiple Linear Regression
ML - Multiple Linear RegressionML - Multiple Linear Regression
ML - Multiple Linear RegressionAndrew Ferlitsch
 
ML - Simple Linear Regression
ML - Simple Linear RegressionML - Simple Linear Regression
ML - Simple Linear RegressionAndrew Ferlitsch
 
Machine Learning - Dummy Variable Conversion
Machine Learning - Dummy Variable ConversionMachine Learning - Dummy Variable Conversion
Machine Learning - Dummy Variable ConversionAndrew Ferlitsch
 
Machine Learning - Splitting Datasets
Machine Learning - Splitting DatasetsMachine Learning - Splitting Datasets
Machine Learning - Splitting DatasetsAndrew Ferlitsch
 
Machine Learning - Dataset Preparation
Machine Learning - Dataset PreparationMachine Learning - Dataset Preparation
Machine Learning - Dataset PreparationAndrew Ferlitsch
 
Machine Learning - Introduction to Tensorflow
Machine Learning - Introduction to TensorflowMachine Learning - Introduction to Tensorflow
Machine Learning - Introduction to TensorflowAndrew Ferlitsch
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningAndrew Ferlitsch
 

More from Andrew Ferlitsch (20)

Pareto Principle Applied to QA
Pareto Principle Applied to QAPareto Principle Applied to QA
Pareto Principle Applied to QA
 
Whiteboarding Coding Challenges in Python
Whiteboarding Coding Challenges in PythonWhiteboarding Coding Challenges in Python
Whiteboarding Coding Challenges in Python
 
Object Oriented Programming Principles
Object Oriented Programming PrinciplesObject Oriented Programming Principles
Object Oriented Programming Principles
 
Python - OOP Programming
Python - OOP ProgrammingPython - OOP Programming
Python - OOP Programming
 
Python - Installing and Using Python and Jupyter Notepad
Python - Installing and Using Python and Jupyter NotepadPython - Installing and Using Python and Jupyter Notepad
Python - Installing and Using Python and Jupyter Notepad
 
Natural Language Processing - Groupings (Associations) Generation
Natural Language Processing - Groupings (Associations) GenerationNatural Language Processing - Groupings (Associations) Generation
Natural Language Processing - Groupings (Associations) Generation
 
Natural Language Provessing - Handling Narrarive Fields in Datasets for Class...
Natural Language Provessing - Handling Narrarive Fields in Datasets for Class...Natural Language Provessing - Handling Narrarive Fields in Datasets for Class...
Natural Language Provessing - Handling Narrarive Fields in Datasets for Class...
 
Machine Learning - Introduction to Recurrent Neural Networks
Machine Learning - Introduction to Recurrent Neural NetworksMachine Learning - Introduction to Recurrent Neural Networks
Machine Learning - Introduction to Recurrent Neural Networks
 
Machine Learning - Introduction to Convolutional Neural Networks
Machine Learning - Introduction to Convolutional Neural NetworksMachine Learning - Introduction to Convolutional Neural Networks
Machine Learning - Introduction to Convolutional Neural Networks
 
Machine Learning - Introduction to Neural Networks
Machine Learning - Introduction to Neural NetworksMachine Learning - Introduction to Neural Networks
Machine Learning - Introduction to Neural Networks
 
Python - Numpy/Pandas/Matplot Machine Learning Libraries
Python - Numpy/Pandas/Matplot Machine Learning LibrariesPython - Numpy/Pandas/Matplot Machine Learning Libraries
Python - Numpy/Pandas/Matplot Machine Learning Libraries
 
Machine Learning - Accuracy and Confusion Matrix
Machine Learning - Accuracy and Confusion MatrixMachine Learning - Accuracy and Confusion Matrix
Machine Learning - Accuracy and Confusion Matrix
 
Machine Learning - Ensemble Methods
Machine Learning - Ensemble MethodsMachine Learning - Ensemble Methods
Machine Learning - Ensemble Methods
 
ML - Multiple Linear Regression
ML - Multiple Linear RegressionML - Multiple Linear Regression
ML - Multiple Linear Regression
 
ML - Simple Linear Regression
ML - Simple Linear RegressionML - Simple Linear Regression
ML - Simple Linear Regression
 
Machine Learning - Dummy Variable Conversion
Machine Learning - Dummy Variable ConversionMachine Learning - Dummy Variable Conversion
Machine Learning - Dummy Variable Conversion
 
Machine Learning - Splitting Datasets
Machine Learning - Splitting DatasetsMachine Learning - Splitting Datasets
Machine Learning - Splitting Datasets
 
Machine Learning - Dataset Preparation
Machine Learning - Dataset PreparationMachine Learning - Dataset Preparation
Machine Learning - Dataset Preparation
 
Machine Learning - Introduction to Tensorflow
Machine Learning - Introduction to TensorflowMachine Learning - Introduction to Tensorflow
Machine Learning - Introduction to Tensorflow
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 

Recently uploaded

Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsAndrey Dotsenko
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsPrecisely
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 

Recently uploaded (20)

Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Unlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power SystemsUnlocking the Potential of the Cloud for IBM Power Systems
Unlocking the Potential of the Cloud for IBM Power Systems
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 

AI Learning Agents Introduction

  • 1. Artificial Intelligence Learning Agents Introduction Artificial Intelligence Training Created by Andrew Ferlitsch Instructor October, 2017
  • 2. Introduction • An agent (e.g., robot) interacts with a dynamic environment. • An agent learns from interacting with the environment the best actions to take. • Four Types of Agents (in increasing capability): • Simple Reflex agents • Model-based agents • Goal-based agents • Utility-based agents
  • 3. Simple Reflex Agent Simple Reflex Agent Sensors Actuators Environment Senses the environment (e.g., camera, audio, LIDAR, GPS, ultrasonic) Room, Street, Warehouse, etc. Modifies the environment (e.g., walk, pickup, drive) State Actions Current State of the Agent relative to the environment. Actions the Agent can take. Rules A set of pre-defined rules that map a state to an action. A simple reflex agent always executes the same action for the same observation. Works in Environments that are fully observable.
  • 4. Actions / Environment (Simple Reflex) Reflex Agent Environment Action Observation Continuous Cycle: Observe Environment, Take Action, Observe Environment, Take Action Actions are determined based on predefined rules. Preprogrammed StateRules How the action effected the agent and environment. Predefined Rules select Action based on predicted State.
  • 5. Model-Based (Reflex) Agent Model-Based Reflex Agent Sensors Actuators Environment Senses the environment (e.g., camera, audio, LIDAR, GPS, ultrasonic) Room, Street, Warehouse, etc. Modifies the environment (e.g., walk, pickup, drive) State Actions Presumed State of the Agent relative to the environment. Actions the Agent can take. Rules A set of pre-defined rules that map a state to an action. A model-based agent uses model to predict the unobserved portion of the environment. Works in Environments that are only partially observable. Past Model of how the environment responds to predict unobserved changes to the Environments. Model Short-term memory of past observations.
  • 6. Actions / Environment (Model-Based) Model-based Reflex Agent Environment Action Observation Preprogrammed StatePast How the action effected the agent and environment. History of Past Observations Predefined Model of how the environment behaves. The model combines the past and present observations to predict the state of the environment. Predefined Rules select Action based on predicted State. Rules MODEL
  • 7. Goal-Based Agent Goal-Based Agent Sensors Actuators Environment Senses the environment (e.g., camera, audio, LIDAR, GPS, ultrasonic) Room, Street, Warehouse, etc. Modifies the environment (e.g., walk, pickup, drive) State Actions (Presumed/Known) State of the Agent relative to the environment. Actions the Agent can take. Rules A set of pre-defined rules that map a state to an action. A goal-based agent uses a goal(s) to evaluate how close to achieving the goal is the next possible action. Works in Environments which need to predict the future. Past Model of how the environment responds to predict unobserved changes to the Environments. Model Goal Optional A goal(s) to achieve, when evaluating the next action (i.e., how closer to achieving the goal).
  • 8. Actions / Environment (Goal-Based) Goal-based Agent Environment Action Observation StatePast How the action effected the agent and environment. History of Past Observations Predefined Model of how the environment behaves. Predefined Rules select Action based on how close is the predicted State to the goal. Rules MODEL GOAL(s) A Goal(s) for evaluating how close is an action/state to the goal.
  • 9. Utility-Based (“Rational”) Agent Utility-Based Agent Sensors Actuators Environment Senses the environment (e.g., camera, audio, LIDAR, GPS, ultrasonic) Room, Street, Warehouse, etc. Modifies the environment (e.g., walk, pickup, drive) State Actions (Presumed/Known) State of the Agent relative to the environment. Actions the Agent can take. Rules A set of pre-defined rules that map a state to an action. A utility-based agent uses a utility to measure the value of the next possible action to achieving the goal. Works in Environments which must optimize achieving the Goal. Past Model of how the environment responds to predict unobserved changes to the Environments. Model Goal Optional A goal(s) to achieve, when evaluating the next action (i.e., how closer to achieving the goal). Utility A measurement of the value of an action towards the goal.
  • 10. Actions / Environment (Utility-Based) Utility-based Agent Environment Action Observation StatePast How the action effected the agent and environment. History of Past Observations Predefined Model of how the environment behaves. Predefined Rules select Action based on the value of the predicted State to achieving the goal. Rules MODEL GOAL(s) A Goal(s) for evaluating how close is an action/state to the goal. (𝑺, 𝑨) A utility for measuring The value of an State/Action Towards achieving a goal.
  • 11. What’s Missing? • There is no learning! • Learn the Model (learn to model the environment) • Learn the Utility (learn to measure the value of a state)
  • 12. Learning (“Intelligent”) Agent Learning Agent Sensors Actuators Environment Senses the environment (e.g., camera, audio, LIDAR, GPS, ultrasonic) Room, Street, Warehouse, etc. Modifies the environment (e.g., walk, pickup, drive) State Actions (Presumed/Known) State of the Agent relative to the environment. Actions the Agent can take. A learning agent dynamically learns a Policy to model the Environment and build Action/State rules. Works in Environments that are Dynamically Changing (Stochastic) Policy A learned model of the environment and learned State/Action rules. Goal Critic Utility A goal(s) to achieve, when evaluating the next action (i.e., how closer to achieving the goal). A measurement of the value of an action towards the goal. A measurement on how good an action actually was.
  • 13. State / Reward Intelligent Agent Environment Action Observation State Reward How the action effected the agent and environment. How positive or negative is the new state. LEARN What was learned from the reward. Policy Learned set of rules of: States -> Actions Example Positive Reward: Robot Stands Up, Closer to Destination Example Negative Reward: Robot Falls Down, Further from Destination Reinforcement Learning