StarCraft II AI Workshop
[DeepMind & Blizzard StarCraft II AI Workshop]
Teaching an Actor-Critic Agent Through Optimal Scripted Agent Trajectories
Chris Hoyean Song
StarCraft has 4 levels of difficulty.
Easy Level
Intermediate Level
Expert Level
and ...
The ultimate level
Korean
First, let me introduce my hottest girlfriend.
RL
Reinforcement Learning
TensorFlow "Newbie" Contributor
Microsoft AI MVP
TensorFlow-KR Admin (Korea's No. 1 ML community)
Ex-game developer (using Unity3D)
sjhshy@gmail.com
Posting StarCraft II Reinforcement Learning Tutorials on
http://chris-chris.ai
Kakao Corp. Data Engineer
- Data Pipeline management, Real-time log processing
- Business Intelligence, Marketing Intelligence
- Develop APIs & Dev-ops
Cool Chris presents
1. Problem Definition
2. Lessons learned from pysc2
3. Actor-Critic Imitation Learning Agent
1. Problem Definition
Level 3 :
Level 2 :
Level 4 :
Level 1 :
My topic
Step 1: Develop an Actor-Critic agent
Step 2: Get optimal trajectories
Step 3: Teach the agent with the optimal trajectories
Key idea 1: SAME padding + 1x1 convolution layer
Key idea 2: Base action masking + autoregressive sub-action selection
Example record for one step (the 32x32 binary observation grid is abbreviated here):
{"step": 1, "done": false, "obs": [[0, 0, ..., 0], ..., [0, 0, ..., 0]], "base_action": 1, "x": 12, "y": 17, "reward": 0}
https://youtu.be/6byY0sXEk6U
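A record in this shape can be read back with Python's standard json module. This is only a sketch: the field names come from the slide, while the 4x4 stand-in grid, the `load_step` helper, and the way the fields are grouped into an action tuple are assumptions.

```python
import json

# Hypothetical record in the same shape as the slide's example,
# with a 4x4 grid standing in for the 32x32 observation.
raw = ('{"step": 1, "done": false, '
       '"obs": [[0, 0, 0, 0], [0, 1, 0, 0], [0, 0, 0, 0], [0, 0, 1, 0]], '
       '"base_action": 1, "x": 2, "y": 3, "reward": 0}')


def load_step(line):
    """Parse one JSON record into (observation, action, reward, done)."""
    rec = json.loads(line)
    # Assumption: the action is the base action plus its screen coordinates.
    action = (rec["base_action"], rec["x"], rec["y"])
    return rec["obs"], action, rec["reward"], rec["done"]


obs, action, reward, done = load_step(raw)
print(action, reward, done)
```

Keeping one record per step makes it easy to stream recorded steps into a training batch.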
Idea 1. Supervised Learning
Source: Mastering the Game of Go with Deep Neural Networks and Tree Search, DeepMind
Idea 2. Hybrid?!
To be continued
2. Lessons learned from pysc2
Almost 100 million possible actions at each step
Multi-agent: agents should cooperate toward one common goal.
Complexity: the action and observation spaces are too large.
Each agent has to solve problems such as:
- Strategy
- Economy
- Production
- Tactics
- Recon
Lesson 1
one-hot encoding
player_relative layer: 1 = marine, 3 = mineral
Marine Layer: 1 = marine
Mineral Layer: 1 = mineral
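A minimal sketch of that split, assuming the IDs from the slide (1 = marine, 3 = mineral). The 3x3 grid and the `one_hot_layers` helper are made up for illustration.

```python
def one_hot_layers(grid, ids):
    """Split a 2D grid of integer IDs into one binary layer per ID."""
    return {
        i: [[1 if cell == i else 0 for cell in row] for row in grid]
        for i in ids
    }


# Made-up 3x3 player_relative grid: 1 = marine, 3 = mineral, 0 = empty.
player_relative = [
    [3, 0, 3],
    [0, 1, 0],
    [1, 0, 3],
]
layers = one_hot_layers(player_relative, ids=(1, 3))
marine_layer, mineral_layer = layers[1], layers[3]
print(marine_layer)
print(mineral_layer)
```

Each layer now only answers "is this unit type here?", so the network does not have to treat 3 as "three times" 1.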
Lesson 2
Make the model simple
14 policy network model: the agent can cover all possible actions in the StarCraft II RL environment.
7 policy network model: the agent can select a unit, handle control groups, and move them.
3 policy network model (now): the agent can recall a control group and move the unit.
Lesson 3
tf.clip_by_norm()
This operation keeps your model from exploding.
It is a very explicit way to handle model stability. Let's tune it!
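To show what the clipping does, here is a pure-Python sketch of the rule tf.clip_by_norm applies to a single tensor: if the L2 norm exceeds the threshold, the values are rescaled to have exactly that norm. Using it on gradients, the `grads` values, and the threshold of 1.0 are assumptions for illustration.

```python
import math


def clip_by_norm(values, clip_norm):
    """Rescale `values` so that its L2 norm is at most `clip_norm`."""
    norm = math.sqrt(sum(v * v for v in values))
    if norm <= clip_norm:
        return list(values)          # already small enough: unchanged
    scale = clip_norm / norm
    return [v * scale for v in values]


grads = [3.0, 4.0]                   # made-up values, L2 norm is 5.0
clipped = clip_by_norm(grads, 1.0)
print(clipped)
```

The direction of the vector is kept and only its length shrinks, which is why the threshold is worth tuning.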
Lesson 4
32x32 map size
The default map size is 64x64, but you don't need 4 pixels to represent one marine.
Reduce the map size as much as you can.
First trial
(up, down, left, right)
=> 4 action space
OpenAI Baselines: DQN
Started to learn
https://youtu.be/xpdQYnnxAko
Second trial
2 DQN networks
x => (32) action space
y => (32) action space
OpenAI Baselines: DQN
Failed
https://youtu.be/RyFcWKWSGJE
I tried to implement the baseline agent architecture from the paper.
Some of my architecture may be incorrect.
Oriol, please give me feedback if you find any mistakes.
Actor-critic model architecture 1
observation [?, 32, 32, 3]
-> 5x5 convolution, 16 filters [?, 32, 32, 16]
-> 3x3 convolution, 32 filters [?, 32, 32, 32]
-> 1x1 convolution, 1 filter [?, 32, 32, 1]
Actor-critic model architecture 2
16 filters [?, 32, 32, 16]
-> 3x3 convolution, 32 filters [?, 32, 32, 32]
-> 1x1 convolution, 1 filter [?, 32, 32, 1]
We use a 1x1 convolution layer for dimension reduction.
The 1x1 convolution layer squeezes the 32 channels into 1 channel.
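As a rough illustration of that squeeze, a single-filter 1x1 convolution is a weighted sum over the channels of each pixel. The sketch below uses a 2x2 map with 3 channels instead of 32x32 with 32; the weights and the `conv1x1` helper are made up.

```python
def conv1x1(feature_map, weights, bias=0.0):
    """Apply a single-filter 1x1 convolution to an [H][W][C] feature map.

    Each output pixel is a weighted sum over that pixel's C channels, so
    the spatial size is unchanged and the C channels collapse into one.
    """
    return [
        [sum(w * c for w, c in zip(weights, pixel)) + bias for pixel in row]
        for row in feature_map
    ]


# Made-up 2x2 feature map with 3 channels per pixel.
fmap = [
    [[1.0, 2.0, 3.0], [0.0, 0.0, 0.0]],
    [[1.0, 1.0, 1.0], [2.0, 0.0, 1.0]],
]
out = conv1x1(fmap, weights=[1.0, 0.5, -1.0])
print(out)
```

Because the kernel covers one pixel only, the output keeps one score per screen position.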
Actor-critic model architecture 3
32 filters [?, 32, 32, 32]
-> 1x1 convolution, 1 filter [?, 32, 32, 1]
-> conv_to_fc [?, 1024, 1]
-> argmax
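The argmax over the flattened map can be turned back into a screen position. This sketch assumes row-major flattening (index = y * width + x), which the slides do not state; the 4x4 size and the scores are made up.

```python
def argmax_to_xy(flat_scores, width):
    """Return the (x, y) screen position of the highest flattened score."""
    best = max(range(len(flat_scores)), key=flat_scores.__getitem__)
    y, x = divmod(best, width)   # assumed row-major: index = y * width + x
    return x, y


# 4x4 example instead of 32x32 (1024 entries).
scores = [0.0] * 16
scores[9] = 2.5                  # row 2, column 1
print(argmax_to_xy(scores, width=4))
```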
Actor-critic model architecture 4
32 filters [?, 32, 32, 32]
-> conv_to_fc [?, 32768, 1]
-> fully_connected [?, 256, 1]
-> fully_connected [?, 524, 1]
-> masking unavailable actions (available actions come from env)
-> masked policy [?, 524, 1]
-> argmax
base action: 4 / SELECT_CONTROL_GROUP
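One common way to mask unavailable actions is to run the softmax only over the actions the environment allows. The sketch below shows that idea on 5 made-up base actions instead of 524; `masked_policy` is a hypothetical helper, not the code from the talk.

```python
import math


def masked_policy(logits, available):
    """Softmax over `logits`, restricted to the indices in `available`."""
    allowed = set(available)
    top = max(logits[i] for i in allowed)          # for numerical stability
    exps = [math.exp(logit - top) if i in allowed else 0.0
            for i, logit in enumerate(logits)]
    total = sum(exps)
    return [e / total for e in exps]


logits = [2.0, 1.0, 0.5, 3.0, 1.0]   # made-up scores for 5 base actions
available = [0, 1, 4]                # action ids the environment allows
probs = masked_policy(logits, available)
base_action = max(range(len(probs)), key=probs.__getitem__)
print(base_action, probs)
```

Action 3 has the highest raw score, but it is unavailable, so it gets probability 0 and can never be chosen.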
Actor-critic model architecture 5
conv_to_fc [?, 32768, 1]
-> fully_connected [?, 256, 1]
-> fully_connected pi [?, 524, 1]
masked pi_sub4 [?, 5, 1] -> argmax -> sub_action 4 / control_group_act: Recall 0, Set 1, Append 2, SetAndSteal 3, AppendAndSteal 4
masked pi_sub5 [?, 10, 1] -> argmax -> sub_action 5 / control_group_id: 0 ~ 9
masked pi_sub11 [?, 10, 1] -> sub_action 11 / build_queue_id
If we do not use this result, do not back-propagate the errors on this layer.
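One possible way to keep errors from flowing into an unused sub-action head is to leave that head out of the loss. The sketch below sums cross-entropy only over the heads the chosen base action needs. The head names follow the slide, while the probabilities, the targets, and the `masked_sub_action_loss` helper are assumptions.

```python
import math


def masked_sub_action_loss(head_probs, targets, used):
    """Sum cross-entropy only over the sub-action heads that were used.

    head_probs: {head name: probability list}
    targets:    {head name: index of the chosen sub-action}
    used:       {head name: True if the base action needs that argument}
    """
    loss = 0.0
    for name, probs in head_probs.items():
        if used[name]:
            loss -= math.log(probs[targets[name]])
        # Unused heads add nothing to the loss, so they receive no error.
    return loss


head_probs = {
    "control_group_act": [0.5, 0.2, 0.1, 0.1, 0.1],
    "control_group_id": [0.1] * 10,
    "build_queue_id": [0.1] * 10,
}
targets = {"control_group_act": 0, "control_group_id": 3, "build_queue_id": 7}
# A control-group action needs the first two arguments but not the build queue.
used = {"control_group_act": True, "control_group_id": True, "build_queue_id": False}
loss = masked_sub_action_loss(head_probs, targets, used)
print(loss)
```

In an autodiff framework the same effect is usually obtained by multiplying each head's loss by a 0/1 mask.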
Actor-critic model architecture 6
32 filters [?, 32, 32, 32]
-> conv_to_fc [?, 32768, 1]
-> fully_connected [?, 256, 1]
-> vf [?, 1, 1]
Third trial
The agent can cover all possible actions in the StarCraft II RL environment.
14 policy network model
https://youtu.be/oMax7wq-KCs
Fourth trial
The agent can select a unit, handle control groups, and move them.
7 policy network model
https://youtu.be/gNp3OnPFSRc
Fifth trial
The agent can recall a control group and move the unit.
3 policy network model (now)
https://youtu.be/V0HqK1-VfLw
3. Hybrid Actor-Critic Imitation Learning
Question
I know how to develop optimal scripted agents, but how can I make an Actor-Critic agent learn from them?
Actor-Critic Agent (Learning Agent)
Optimal Scripted Agent (Optimal Agent)
It seems naive, but it intuitively makes sense.
I propose a new way to train an Actor-Critic agent through optimal scripted agent trajectories.
Actor-Critic Imitation Learning Agent
Agent 1, Agent 2, Agent 3, Agent 4: each runs the Policy
-> trajectories -> Actor-Critic Policy & Value Network
Actor-Critic Imitation Learning Agent
Agent 1, Agent 2, Agent 3, Agent 4: two run the Policy, two run the Optimal Script
-> trajectories -> Actor-Critic Policy & Value Network
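The data flow in the diagram can be sketched as a batch that mixes steps from scripted actors and policy actors. The talk does not spell out the update rule, so this shows only the collection side. Every name here (`scripted_action`, `policy_action`, `collect_batch`), the nearest-shard script, and the random stand-in policy are hypothetical.

```python
import random


def scripted_action(marine, minerals):
    """Hypothetical 'optimal script': target the nearest mineral shard."""
    return min(minerals,
               key=lambda m: abs(m[0] - marine[0]) + abs(m[1] - marine[1]))


def policy_action(marine, minerals, rng):
    """Stand-in for the learned policy: a random position on a 32x32 screen."""
    return (rng.randrange(32), rng.randrange(32))


def collect_batch(num_agents, num_scripts, state, rng):
    """Gather one step from every actor into a single training batch."""
    marine, minerals = state
    batch = []
    for i in range(num_agents + num_scripts):
        if i < num_scripts:
            source, action = "script", scripted_action(marine, minerals)
        else:
            source, action = "policy", policy_action(marine, minerals, rng)
        batch.append({"source": source, "obs": state, "action": action})
    return batch


state = ((5, 5), [(6, 5), (20, 20)])   # made-up marine and shard positions
batch = collect_batch(num_agents=2, num_scripts=2, state=state,
                      rng=random.Random(0))
print([(step["source"], step["action"]) for step in batch])
```

Both kinds of steps land in the same batch, so the Policy & Value Network sees scripted and self-generated trajectories side by side.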
Baseline: Pure A2C Agent
1. Learns faster
2. Higher max reward
3. Robustness
Learning rate: random (1e-5 ~ 1e-3)
X axis: episodes
Y axis: rewards
A detailed description of this idea will be covered in the paper or the blog post.
The source code is on my GitHub:
https://github.com/chris-chris/pysc2-examples
python train_mineral_shards.py --algorithm=a2c --num_agents=2 --num_scripts=2
Wrap up
1. Problem Definition
Goal: Make the RL agent learn from expert gameplay.
2. Lessons learned from pysc2
Simple model / one-hot encoding / tf.clip_by_norm() / Actor-Critic architecture
3. Hybrid Actor-Critic Imitation Learning
The hybrid agent learns faster and better, with robustness.
- Special thanks to
Seungil You (Google), who helped me understand the papers and the algorithms, find bugs, and improve my TensorFlow RL model.
- DeepMind and Blizzard team
Thank you for the StarCraft II Learning Environment.
- Thanks for the precious advice
Sungjoon Choi (Disney Research), Nako Sung (Naver), Woongwon Lee (RLCode), Doyun Lee (NC Soft).
Thank you
sjhshy@gmail.com
http://chris-chris.ai
https://www.facebook.com/ai.chris.chris
https://www.linkedin.com/in/chris-song-0bb03439/
