Game Analytics Cluster Scheduler

This document describes gascheduler, a library for executing distributed tasks across Erlang nodes. It provides a scheduler that distributes tasks to worker nodes with available capacity. The scheduler aims to be generic, simple, and operations friendly. It executes callbacks asynchronously, sending status messages to the client. Tasks should be side-effect free or idempotent for consistency. The scheduler separates business logic from infrastructure code and has few dependencies, allowing for reuse. Potential improvements include multi-master support and allowing clients to stop cleanly.

gascheduler
Game Analytics Cluster Scheduler
Erlounge, Berlin Erlang Meetup, July 29th 2015

Introduction
What are we trying to do?
● parallel and distributed computation
How do we do it?
● that is what this talk is about

Parallel Execution in Erlang
> lists:foreach(fun(N) ->
      spawn(fun() ->
          io:format("hello world ~p~n", [N])
      end)
  end, lists:seq(1,10)).
hello world 1
hello world 2
hello world 3
hello world 4
hello world 5
hello world 6
hello world 7
hello world 8
hello world 9
hello world 10
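
The spawned processes above run concurrently, so the order of the printed lines is not guaranteed in general. As a minimal sketch of how results can be sent back to the caller as messages (the module name `pmap_sketch` and the function `pmap/2` are hypothetical and not part of gascheduler), a parallel map could look like this:

```erlang
-module(pmap_sketch).
-export([pmap/2]).

%% Apply F to every element of List, each in its own process, and
%% return the results in the order of the input list.
%% Assumption: F does not crash. A crashing F would leave the caller
%% waiting forever, because this sketch has no monitoring or timeout.
pmap(F, List) ->
    Parent = self(),
    %% Tag every reply with a unique reference so replies cannot be mixed up.
    Refs = [begin
                Ref = make_ref(),
                spawn(fun() -> Parent ! {Ref, F(X)} end),
                Ref
            end || X <- List],
    %% A selective receive per reference restores the input order.
    [receive {Ref, Result} -> Result end || Ref <- Refs].
```

For example, `pmap_sketch:pmap(fun(X) -> X * 2 end, [1, 2, 3])` evaluates the three calls in separate processes and collects the replies in input order.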

gascheduler
A generic library for executing distributed tasks
[Architecture diagram: a client, a scheduler holding a pending queue and a running queue, and worker nodes 1 to n. Labels: execute(callback), add_worker_node(node), spawn(callback), stats, ok, error, node down, max retries, retry.]

Why our own scheduler?
● task manager rather than process manager
o we start the execution later
● asynchronous
o task status sent to client via messages
● multiple node distributed execution
● bounds on concurrent tasks

Design
refactored from existing code
generic for reuse
simple to minimize chance of bugs
operations friendly

Scheduling
execute callback
● node with least running tasks
pending queue
● unbounded
running queue per node
● bounded
worker retries on exception
● except for permanent failure
● possibly infinite times
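
The node selection rule above can be sketched as follows. This is an illustration under stated assumptions, not the gascheduler implementation: the module `pick_node_sketch`, the function `pick_node/2`, and the `{Node, RunningCount}` list are hypothetical names chosen for the example.

```erlang
-module(pick_node_sketch).
-export([pick_node/2]).

%% Running is a list of {Node, RunningCount} pairs (hypothetical
%% representation of the per-node running queues).
%% Returns {ok, Node} for the node with the fewest running tasks, or
%% full when every node already runs MaxWorkers tasks, in which case
%% the task would stay in the unbounded pending queue.
pick_node(Running, MaxWorkers) ->
    %% Keep only nodes below the bound, with the count first so that
    %% lists:min/1 orders by load.
    Free = [{Count, Node} || {Node, Count} <- Running, Count < MaxWorkers],
    case Free of
        [] ->
            full;
        _ ->
            {_Count, Node} = lists:min(Free),
            {ok, Node}
    end.
```

Ties between equally loaded nodes are broken by Erlang term order on the node name, which is an arbitrary choice made for this sketch.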

Tasks
● The scheduler executes a callback
o like the map of map reduce
o e.g. count word occurrence in a string
● What are the requirements of a task?
o Ideally function should be side effect free
o Or idempotent, i.e. f(f(state)) = f(state)
o Otherwise consistency must be handled externally
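
As a small sketch of a task that meets these requirements, the word count example mentioned above could be written as a pure function. The module name `wordcount_sketch` and the function `count/2` are hypothetical and chosen only for illustration.

```erlang
-module(wordcount_sketch).
-export([count/2]).

%% Count how often Word occurs in Text, splitting Text on spaces.
%% The function depends only on its arguments and changes no external
%% state, so running it again after a retry gives the same result.
count(Word, Text) ->
    Tokens = string:lexemes(Text, " "),
    length([T || T <- Tokens, T =:= Word]).
```

Because the result depends only on the arguments, it does not matter on which node the call runs or how many times it is retried.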

Starting tasks
Name = test,       %% Each gascheduler has its own name. There can be multiple gaschedulers.
Nodes = [...],     %% A list of nodes to execute work on. See also erlang:nodes().
MaxWorkers = 10,   %% Maximum number of workers per node.
MaxRetries = 10,   %% Maximum number of retries for a worker, i.e. when it throws an exception.
Client = self(),   %% Where to send scheduler status messages to.
%% Start the scheduler.
{ok, _} = gascheduler:start_link(Name, Nodes, Client, MaxWorkers, MaxRetries),
%% Execute hello:world(1) asynchronously. The hello module defines world(N) -> N.
ok = gascheduler:execute(Name, {hello, world, [1]}),
.....

Handling task status
%% Receive a single status message from a particular scheduler.
receive
    {Name, {ok, Result}, Node, _MFA = {_Mod, _Fun, _Args}} ->
        io:format("hello world ~p from ~p~n", [Result, Node]);
    {Name, {error, Reason}, Node, MFA = {_Mod, _Fun, _Args}} ->
        io:format("task ~p failed on ~p because ~p~n", [MFA, Node, Reason])
end
%% Task completed successfully.
hello world 1 from slave1@worker1
%% Task failed.
task {hello,world,[1]} failed on slave1@worker1 because max_retries
task {hello,world,[1]} failed on slave1@worker1 because permanent_failure
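
A client that submits several tasks usually needs to wait for one status message per task. The following sketch assumes the message shapes shown on the slide; the module `collect_sketch` and the function `collect/2` are hypothetical helpers and not part of gascheduler.

```erlang
-module(collect_sketch).
-export([collect/2]).

%% Wait for N status messages from the scheduler called Name.
%% Returns {Oks, Errors}, where Oks is a list of {MFA, Result} and
%% Errors is a list of {MFA, Reason}, both in arrival order.
%% Assumption: exactly one status message arrives per submitted task.
collect(Name, N) ->
    collect(Name, N, [], []).

collect(_Name, 0, Oks, Errors) ->
    {lists:reverse(Oks), lists:reverse(Errors)};
collect(Name, N, Oks, Errors) ->
    receive
        {Name, {ok, Result}, _Node, MFA} ->
            collect(Name, N - 1, [{MFA, Result} | Oks], Errors);
        {Name, {error, Reason}, _Node, MFA} ->
            collect(Name, N - 1, Oks, [{MFA, Reason} | Errors])
    end.
```

Matching on Name in the receive pattern keeps messages from other schedulers in the mailbox, which matters when one client process uses several gaschedulers.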

Advantages
● clean separation of business and infrastructure code
● very few dependencies
● code reuse

Possible Improvements
multi master
● distributed consensus required
allow client to stop cleanly in a generic way
● clients currently implement their own clean stop

Thanks for listening!
Questions?
http://github.com/GameAnalytics/gascheduler
