SlideShare a Scribd company logo
1 of 35
Download to read offline
GRID COMPUTING
Grid Scheduling & Resource Management

Sandeep Kumar Poonia
Head of Dept. CS/IT, Jagan Nath University, Jaipur
B.E., M. Tech., UGC-NET
LM-IAENG, LM-IACSIT,LM-CSTA, LM-AIRCC, LM-SCIEI, AM-UACEE
11/9/2013

Sandeep Kumar Poonia

1
OUTLINE
Introduction
Scheduling Paradigms
How Scheduling Works
A Review of Condor, SGE, PBS and LSF
Grid Scheduling with QoS
Introduction
Grid scheduling is a process of mapping Grid jobs to
resources over multiple administrative domains.
 A Grid job can be split into many small tasks.
 The scheduler has the responsibility of selecting
resources and scheduling jobs in such a way that the
user and application requirements are met, in terms of

overall execution time (throughput) and cost of the
resources utilized.
Introduction

Jobs, via Globus, can be submitted to systems managed
by Condor, the Sun Grid Engine (SGE), thr Portable Batch
System (PBS) and the Load Sharing Facility (LSF)
Scheduling Paradigms

 Centralized Scheduling
 Hierarchical Scheduling
 Distributed Scheduling
Centralized Scheduling
In a centralized scheduling environment, a central
machine (node) acts as a resource manager to
schedule jobs to all the surrounding nodes that are
part of the environment.

This scheduling paradigm is often used in situations
like a computing centre where resources have

similar characteristics and usage policies.
Centralized Scheduling

Here, jobs are first submitted to the central scheduler, which then

dispatches the jobs to the appropriate nodes. Those jobs that
cannot be started on a node are normally stored in a central job
queue for a later start.
Centralized Scheduling: Advantage & Disadvantage
 Centralized scheduling system may produce better scheduling
decisions because it has all necessary, and up-to-date,
information about the available resources.
 Centralized scheduling does not scale well with the increasing
size of the environment that it manages.
 The scheduler itself may well become a bottleneck, and if
there is a problem with the hardware or software of the
scheduler’s server, i.e. a failure,
 it presents a single point of failure in the environment.
Distributed Scheduling
 No central scheduler responsible for managing all the
jobs.

 It involves multiple localized schedulers, which interact
with each other in order to dispatch jobs to the
participating nodes.
 There are two mechanisms for
communicate with other schedulers

 Direct Communication
 Indirect Communication.

a

scheduler to
Distributed Scheduling: Direct Communication

 Each local scheduler can directly communicate with
other schedulers for job dispatching.
Distributed Scheduling: Direct Communication
 Each scheduler has a list of remote schedulers that they can
interact with, or there may exist a central directory that

maintains all the information related to each scheduler.
 If a job cannot be dispatched to its local resources, its
scheduler will communicate with other remote schedulers
to find resources appropriate and available for executing its
job.
 Each scheduler may maintain a local job queue(s) for job
management.
Distributed Scheduling: Indirect Communication
Communication via a central job pool

In this scenario, jobs that cannot be executed immediately
are sent to a central job pool.
Distributed Scheduling: Indirect Communication
Communication via a central job pool
 Compared with direct communication, the local
schedulers can potentially choose suitable jobs to
schedule on their resources.
Policies are required so that all the jobs in the pool are
executed at some time.
 This method can be modified, so that all jobs are
pushed directly in the job-pool after submission.
 This way all small jobs requiring few resources can
be used for utilizing free resources on all
machines.
Hierarchical scheduling

In hierarchical scheduling, a centralized scheduler interacts with
local schedulers for job submission. The centralized scheduler is a
kind of a meta-scheduler that dispatches submitted jobs to local
schedulers.
Hierarchical scheduling
 Similar to the centralized scheduling paradigm,
hierarchical scheduling can have scalability and
communication bottlenecks.
 However, compared with centralized scheduling,
one advantage of hierarchical scheduling is that
the global scheduler and local scheduler can have

different policies in scheduling jobs.
HOW SCHEDULING WORKS
Grid scheduling involves four main stages:
 resource discovery,
 resource selection,
 schedule generation and
 job execution
Resource discovery
Goal: identify a list of authenticated resources that are
available for job submission.
In order to cope with the dynamic nature of the Grid,
a scheduler needs to have some way of incorporating
dynamic state information about the available
resources into its decision-making process.
A Grid environment typically uses
a pull model,
a push model or
a push–pull model
for resource discovery.
Resource discovery : The pull model

A single daemon associated with the scheduler can query
Grid resources and collect state information such as CPU
loads or the available memory.
Resource discovery : The pull model
 The pull model for gathering resource information
incurs relatively small communication overhead,
but unless it requests resource information
frequently, it tends to provide fairly stale
information which is likely to be constantly out-ofdate, and potentially misleading.
 In
centralized
scheduling,
the
resource
discovery/query process could be rather intrusive
and begin to take significant amounts of time as
the environment being monitored gets larger and
larger.
Resource discovery : The push model
Resource discovery
 Each resource in the environment has a daemon
for gathering local state information,
 which will be sent to a centralized scheduler that
maintains a database to record each resource’s
activity.
 If the updates are frequent, an accurate view of
the system state can be maintained over time;
obviously, frequent updates to the database are
intrusive and consume network bandwidth.
Resource discovery : The push–pull model

The push–pull model lies somewhere between the pull model and
the push model.
Resource discovery : The push–pull model
 Each resource in the environment runs a daemon
that collects state information.
 Instead of directly sending this information to a
central scheduler, there exist some intermediate
nodes running daemons that aggregate state
information from different sub-resources that
respond to queries from the scheduler.
 A challenge of this model is to find out what
information is most useful, how often it should be
collected and how long this information should be
kept around.
Resource Selection
The second phase of the scheduling process :
 Select those resources that best suit the constraints
and conditions imposed by the user, such as CPU
usage, RAM available or disk storage.
 The result of resource selection is to identify a
resource list Rselected in which all resources can meet
the minimum requirements for a submitted job or a
job list.
 The relationship between resources available
Ravailable and resources selected Rselected is:
 Rselected ⊆ Ravailable
Resource Generation

The generation of schedules involves two
steps,
 selecting jobs and
 producing resource selection strategies.
Resource Generation : Job Selection
 The resource selection process is used to choose
resource(s) from the resource list Rselected for a given
job.
 Since all resources in the list Rselected could meet the
minimum requirements imposed by the job, an
algorithm is needed to choose the best resource(s) to
execute the job.
 Although random selection is a choice, it is not an
ideal resource selection policy.
 The resource selection algorithm should take into
account the current state of resources and choose the
best one based on a quantitative evaluation.
Resource Generation : Job Selection
A resource selection algorithm that only takes CPU and RAM into
account could be designed as follows:

where :
WCPU – the weight allocated to
CPU speed;
CPUload – the current CPU load;
CPUspeed – real CPU speed;
CPUmin – minimum CPU speed;

WRAM – the weight allocated to
RAM;
RAMusage – the current RAM
usage;
RAMsize – original RAM size; and
RAMmin – minimum RAM size.
Resource Generation : Job Selection
Example: Suppose that the total weighting used in the
algorithm is 10, where the CPU weight is 6 and the RAM weight
is 4. The minimum CPU speed is 1 GHz and minimum RAM size
is 256 MB. Resource information matrix is as follow:

Find the best resource for submitted job.
Resource Generation : Job Selection
Then, evaluation values for resources can be
calculated using the three formulas:

From the results we know Resource3 is the best choice
for the submitted job.
Resource Generation : Resource Selection
The goal of job selection is to select a job from a
job queue for execution. Four strategies that can
be used to select a job are given below.
 First come first serve
 Random Selection
 Priority-based Selection

 Backfilling Selection
Resource Generation : Resource Selection
First come first serve:
 The scheduler selects jobs for execution in the order of
their submissions.
 If there is no resource available for the selected job, the
scheduler will wait until the job can be started.
 The other jobs in the job queue have to wait.

There are two main drawbacks with this type of job selection.
1. It may waste resources when, for example, the job
selected needs more resources to be available before
it can start, which results in a long waiting time.
2. jobs with high priorities cannot get dispatched
immediately if a job with a low priority needs more
time to complete.
Resource Generation : Resource Selection
Random selection:
 The next job to be scheduled is randomly
selected from the job queue.
 Apart from the two drawbacks with the firstcome-first-serve strategy, jobs selection is not
fair and job submitted earlier may not be
scheduled until much later.
Resource Generation : Resource Selection
Priority-based selection:
 Jobs submitted to the scheduler have different
priorities.
 The next job to be scheduled is the job with the
highest priority in the job queue.
 A job priority can be set when the job is submitted.
 One drawback of this strategy is that it is hard to set
an optimal criterion for a job priority.
 A job with the highest priority may need more
resources than available and may also result in a long
waiting time and inability to make good use of the
available resources.
Resource Generation : Resource Selection

Backfilling selection:
 The backfilling strategy requires knowledge of
the expected execution time of a job to be
scheduled.

 If the next job in the job queue cannot be
started due to a lack of available resources,

backfilling tries to find another job in the queue
that can use the idle resources.
Job execution
 Once a job and a resource are selected, the next

step is to submit the job to the resource for
execution.
 Job execution may be as easy as running a single
command or as complicated as running a series
of scripts that may, or may not, include set up or

staging.

More Related Content

What's hot

GRID COMPUTING PRESENTATION
GRID COMPUTING PRESENTATION GRID COMPUTING PRESENTATION
GRID COMPUTING PRESENTATION
Ashok Mannai
 
Wireless routing protocols
Wireless routing protocolsWireless routing protocols
Wireless routing protocols
barodia_1437
 
ENCAPSULATION AND TUNNELING
ENCAPSULATION AND TUNNELINGENCAPSULATION AND TUNNELING
ENCAPSULATION AND TUNNELING
Mohammad Adil
 
Chapter 4 a interprocess communication
Chapter 4 a interprocess communicationChapter 4 a interprocess communication
Chapter 4 a interprocess communication
AbDul ThaYyal
 
Grid computing [2005]
Grid computing [2005]Grid computing [2005]
Grid computing [2005]
Raul Soto
 
distributed shared memory
 distributed shared memory distributed shared memory
distributed shared memory
Ashish Kumar
 
Grid computing Seminar PPT
Grid computing Seminar PPTGrid computing Seminar PPT
Grid computing Seminar PPT
Upender Upr
 

What's hot (20)

Security in distributed systems
Security in distributed systems Security in distributed systems
Security in distributed systems
 
distributed Computing system model
distributed Computing system modeldistributed Computing system model
distributed Computing system model
 
Grid computing
Grid computingGrid computing
Grid computing
 
Cs6703 grid and cloud computing unit 3
Cs6703 grid and cloud computing unit 3Cs6703 grid and cloud computing unit 3
Cs6703 grid and cloud computing unit 3
 
1. GRID COMPUTING
1. GRID COMPUTING1. GRID COMPUTING
1. GRID COMPUTING
 
Grid computing
Grid computingGrid computing
Grid computing
 
8. mutual exclusion in Distributed Operating Systems
8. mutual exclusion in Distributed Operating Systems8. mutual exclusion in Distributed Operating Systems
8. mutual exclusion in Distributed Operating Systems
 
GRID COMPUTING PRESENTATION
GRID COMPUTING PRESENTATION GRID COMPUTING PRESENTATION
GRID COMPUTING PRESENTATION
 
Wireless routing protocols
Wireless routing protocolsWireless routing protocols
Wireless routing protocols
 
file sharing semantics by Umar Danjuma Maiwada
file sharing semantics by Umar Danjuma Maiwada file sharing semantics by Umar Danjuma Maiwada
file sharing semantics by Umar Danjuma Maiwada
 
Cs6703 grid and cloud computing unit 5
Cs6703 grid and cloud computing unit 5Cs6703 grid and cloud computing unit 5
Cs6703 grid and cloud computing unit 5
 
ENCAPSULATION AND TUNNELING
ENCAPSULATION AND TUNNELINGENCAPSULATION AND TUNNELING
ENCAPSULATION AND TUNNELING
 
Chapter 4 a interprocess communication
Chapter 4 a interprocess communicationChapter 4 a interprocess communication
Chapter 4 a interprocess communication
 
Grid computing [2005]
Grid computing [2005]Grid computing [2005]
Grid computing [2005]
 
computer forensic tools-Hardware & Software tools
computer forensic tools-Hardware & Software toolscomputer forensic tools-Hardware & Software tools
computer forensic tools-Hardware & Software tools
 
Peer to Peer services and File systems
Peer to Peer services and File systemsPeer to Peer services and File systems
Peer to Peer services and File systems
 
Message passing in Distributed Computing Systems
Message passing in Distributed Computing SystemsMessage passing in Distributed Computing Systems
Message passing in Distributed Computing Systems
 
Load balancing in Distributed Systems
Load balancing in Distributed SystemsLoad balancing in Distributed Systems
Load balancing in Distributed Systems
 
distributed shared memory
 distributed shared memory distributed shared memory
distributed shared memory
 
Grid computing Seminar PPT
Grid computing Seminar PPTGrid computing Seminar PPT
Grid computing Seminar PPT
 

Viewers also liked

JOB SCHEDULING USING ANT COLONY OPTIMIZATION ALGORITHM
JOB SCHEDULING USING ANT COLONY OPTIMIZATION ALGORITHMJOB SCHEDULING USING ANT COLONY OPTIMIZATION ALGORITHM
JOB SCHEDULING USING ANT COLONY OPTIMIZATION ALGORITHM
mailjkb
 
Mercury: Hybrid Centralized and Distributed Scheduling in Large Shared Clusters
Mercury: Hybrid Centralized and Distributed Scheduling in Large Shared ClustersMercury: Hybrid Centralized and Distributed Scheduling in Large Shared Clusters
Mercury: Hybrid Centralized and Distributed Scheduling in Large Shared Clusters
DataWorks Summit
 

Viewers also liked (20)

Grid computing
Grid computingGrid computing
Grid computing
 
Grid computing ppt
Grid computing pptGrid computing ppt
Grid computing ppt
 
Genetic Algorithm by Example
Genetic Algorithm by ExampleGenetic Algorithm by Example
Genetic Algorithm by Example
 
Genetic algorithm
Genetic algorithmGenetic algorithm
Genetic algorithm
 
JOB SCHEDULING USING ANT COLONY OPTIMIZATION ALGORITHM
JOB SCHEDULING USING ANT COLONY OPTIMIZATION ALGORITHMJOB SCHEDULING USING ANT COLONY OPTIMIZATION ALGORITHM
JOB SCHEDULING USING ANT COLONY OPTIMIZATION ALGORITHM
 
A Review on Resource Allocation in Cloud Computing
A Review on Resource Allocation in Cloud ComputingA Review on Resource Allocation in Cloud Computing
A Review on Resource Allocation in Cloud Computing
 
GLOA:A New Job Scheduling Algorithm for Grid Computing
GLOA:A New Job Scheduling Algorithm for Grid ComputingGLOA:A New Job Scheduling Algorithm for Grid Computing
GLOA:A New Job Scheduling Algorithm for Grid Computing
 
Optimization Heuristics
Optimization HeuristicsOptimization Heuristics
Optimization Heuristics
 
Grid Computing - Collection of computer resources from multiple locations
Grid Computing - Collection of computer resources from multiple locationsGrid Computing - Collection of computer resources from multiple locations
Grid Computing - Collection of computer resources from multiple locations
 
Introduction to Grid Computing
Introduction to Grid ComputingIntroduction to Grid Computing
Introduction to Grid Computing
 
Mercury: Hybrid Centralized and Distributed Scheduling in Large Shared Clusters
Mercury: Hybrid Centralized and Distributed Scheduling in Large Shared ClustersMercury: Hybrid Centralized and Distributed Scheduling in Large Shared Clusters
Mercury: Hybrid Centralized and Distributed Scheduling in Large Shared Clusters
 
Grid Computing
Grid ComputingGrid Computing
Grid Computing
 
Grid computing notes
Grid computing notesGrid computing notes
Grid computing notes
 
Grid computing
Grid computingGrid computing
Grid computing
 
Introduction to Virtualization
Introduction to VirtualizationIntroduction to Virtualization
Introduction to Virtualization
 
Distributed Computing
Distributed ComputingDistributed Computing
Distributed Computing
 
Multiprocessor architecture
Multiprocessor architectureMultiprocessor architecture
Multiprocessor architecture
 
1.Introduction to virtualization
1.Introduction to virtualization1.Introduction to virtualization
1.Introduction to virtualization
 
Genetic Algorithms Made Easy
Genetic Algorithms Made EasyGenetic Algorithms Made Easy
Genetic Algorithms Made Easy
 
Introduction to Genetic Algorithms
Introduction to Genetic AlgorithmsIntroduction to Genetic Algorithms
Introduction to Genetic Algorithms
 

Similar to 11. grid scheduling and resource managament

GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...
GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...
GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...
ijgca
 
GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...
GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...
GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...
ijgca
 
Characterization and prediction of resource availability in grids
Characterization and prediction of resource availability in gridsCharacterization and prediction of resource availability in grids
Characterization and prediction of resource availability in grids
IAEME Publication
 

Similar to 11. grid scheduling and resource managament (20)

A Survey of Job Scheduling Algorithms Whit Hierarchical Structure to Load Ba...
A Survey of Job Scheduling Algorithms Whit  Hierarchical Structure to Load Ba...A Survey of Job Scheduling Algorithms Whit  Hierarchical Structure to Load Ba...
A Survey of Job Scheduling Algorithms Whit Hierarchical Structure to Load Ba...
 
Job Resource Ratio Based Priority Driven Scheduling in Cloud Computing
Job Resource Ratio Based Priority Driven Scheduling in Cloud ComputingJob Resource Ratio Based Priority Driven Scheduling in Cloud Computing
Job Resource Ratio Based Priority Driven Scheduling in Cloud Computing
 
Scheduling
SchedulingScheduling
Scheduling
 
A survey of various scheduling algorithm in cloud computing environment
A survey of various scheduling algorithm in cloud computing environmentA survey of various scheduling algorithm in cloud computing environment
A survey of various scheduling algorithm in cloud computing environment
 
A survey of various scheduling algorithm in cloud computing environment
A survey of various scheduling algorithm in cloud computing environmentA survey of various scheduling algorithm in cloud computing environment
A survey of various scheduling algorithm in cloud computing environment
 
Resource management
Resource managementResource management
Resource management
 
Resource Availability Prediction in the Grid: Taxonomy and Review of State of...
Resource Availability Prediction in the Grid: Taxonomy and Review of State of...Resource Availability Prediction in the Grid: Taxonomy and Review of State of...
Resource Availability Prediction in the Grid: Taxonomy and Review of State of...
 
An adaptive algorithm for task scheduling for computational grid
An adaptive algorithm for task scheduling for computational gridAn adaptive algorithm for task scheduling for computational grid
An adaptive algorithm for task scheduling for computational grid
 
Application level scheduling (app les) in grid with quality of service (qos)
Application level scheduling (app les) in grid with quality of service (qos)Application level scheduling (app les) in grid with quality of service (qos)
Application level scheduling (app les) in grid with quality of service (qos)
 
APPLICATION LEVEL SCHEDULING(APPLES) IN GRID WITH QUALITY OF SERVICE (QOS)
APPLICATION LEVEL SCHEDULING(APPLES) IN GRID WITH QUALITY OF SERVICE (QOS)APPLICATION LEVEL SCHEDULING(APPLES) IN GRID WITH QUALITY OF SERVICE (QOS)
APPLICATION LEVEL SCHEDULING(APPLES) IN GRID WITH QUALITY OF SERVICE (QOS)
 
GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...
GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...
GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...
 
GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...
GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...
GROUPING BASED JOB SCHEDULING ALGORITHM USING PRIORITY QUEUE AND HYBRID ALGOR...
 
C017241316
C017241316C017241316
C017241316
 
Comparative Analysis of Various Grid Based Scheduling Algorithms
Comparative Analysis of Various Grid Based Scheduling AlgorithmsComparative Analysis of Various Grid Based Scheduling Algorithms
Comparative Analysis of Various Grid Based Scheduling Algorithms
 
Fault Tollerant scheduling system for computational grid
Fault Tollerant scheduling system for computational gridFault Tollerant scheduling system for computational grid
Fault Tollerant scheduling system for computational grid
 
J0210053057
J0210053057J0210053057
J0210053057
 
Efficient Resource Management Mechanism with Fault Tolerant Model for Computa...
Efficient Resource Management Mechanism with Fault Tolerant Model for Computa...Efficient Resource Management Mechanism with Fault Tolerant Model for Computa...
Efficient Resource Management Mechanism with Fault Tolerant Model for Computa...
 
A bi objective workflow application
A bi objective workflow applicationA bi objective workflow application
A bi objective workflow application
 
Effective and Efficient Job Scheduling in Grid Computing
Effective and Efficient Job Scheduling in Grid ComputingEffective and Efficient Job Scheduling in Grid Computing
Effective and Efficient Job Scheduling in Grid Computing
 
Characterization and prediction of resource availability in grids
Characterization and prediction of resource availability in gridsCharacterization and prediction of resource availability in grids
Characterization and prediction of resource availability in grids
 

More from Dr Sandeep Kumar Poonia

Memetic search in differential evolution algorithm
Memetic search in differential evolution algorithmMemetic search in differential evolution algorithm
Memetic search in differential evolution algorithm
Dr Sandeep Kumar Poonia
 
Performance evaluation of different routing protocols in wsn using different ...
Performance evaluation of different routing protocols in wsn using different ...Performance evaluation of different routing protocols in wsn using different ...
Performance evaluation of different routing protocols in wsn using different ...
Dr Sandeep Kumar Poonia
 

More from Dr Sandeep Kumar Poonia (20)

Soft computing
Soft computingSoft computing
Soft computing
 
An improved memetic search in artificial bee colony algorithm
An improved memetic search in artificial bee colony algorithmAn improved memetic search in artificial bee colony algorithm
An improved memetic search in artificial bee colony algorithm
 
Modified position update in spider monkey optimization algorithm
Modified position update in spider monkey optimization algorithmModified position update in spider monkey optimization algorithm
Modified position update in spider monkey optimization algorithm
 
Enhanced local search in artificial bee colony algorithm
Enhanced local search in artificial bee colony algorithmEnhanced local search in artificial bee colony algorithm
Enhanced local search in artificial bee colony algorithm
 
RMABC
RMABCRMABC
RMABC
 
Memetic search in differential evolution algorithm
Memetic search in differential evolution algorithmMemetic search in differential evolution algorithm
Memetic search in differential evolution algorithm
 
Improved onlooker bee phase in artificial bee colony algorithm
Improved onlooker bee phase in artificial bee colony algorithmImproved onlooker bee phase in artificial bee colony algorithm
Improved onlooker bee phase in artificial bee colony algorithm
 
Comparative study of_hybrids_of_artificial_bee_colony_algorithm
Comparative study of_hybrids_of_artificial_bee_colony_algorithmComparative study of_hybrids_of_artificial_bee_colony_algorithm
Comparative study of_hybrids_of_artificial_bee_colony_algorithm
 
A novel hybrid crossover based abc algorithm
A novel hybrid crossover based abc algorithmA novel hybrid crossover based abc algorithm
A novel hybrid crossover based abc algorithm
 
Multiplication of two 3 d sparse matrices using 1d arrays and linked lists
Multiplication of two 3 d sparse matrices using 1d arrays and linked listsMultiplication of two 3 d sparse matrices using 1d arrays and linked lists
Multiplication of two 3 d sparse matrices using 1d arrays and linked lists
 
Sunzip user tool for data reduction using huffman algorithm
Sunzip user tool for data reduction using huffman algorithmSunzip user tool for data reduction using huffman algorithm
Sunzip user tool for data reduction using huffman algorithm
 
New Local Search Strategy in Artificial Bee Colony Algorithm
New Local Search Strategy in Artificial Bee Colony Algorithm New Local Search Strategy in Artificial Bee Colony Algorithm
New Local Search Strategy in Artificial Bee Colony Algorithm
 
A new approach of program slicing
A new approach of program slicingA new approach of program slicing
A new approach of program slicing
 
Performance evaluation of different routing protocols in wsn using different ...
Performance evaluation of different routing protocols in wsn using different ...Performance evaluation of different routing protocols in wsn using different ...
Performance evaluation of different routing protocols in wsn using different ...
 
Enhanced abc algo for tsp
Enhanced abc algo for tspEnhanced abc algo for tsp
Enhanced abc algo for tsp
 
Database aggregation using metadata
Database aggregation using metadataDatabase aggregation using metadata
Database aggregation using metadata
 
Performance evaluation of diff routing protocols in wsn using difft network p...
Performance evaluation of diff routing protocols in wsn using difft network p...Performance evaluation of diff routing protocols in wsn using difft network p...
Performance evaluation of diff routing protocols in wsn using difft network p...
 
Lecture28 tsp
Lecture28 tspLecture28 tsp
Lecture28 tsp
 
Lecture27 linear programming
Lecture27 linear programmingLecture27 linear programming
Lecture27 linear programming
 
Lecture26
Lecture26Lecture26
Lecture26
 

Recently uploaded

Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
AnaAcapella
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 

Recently uploaded (20)

Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Spatium Project Simulation student brief
Spatium Project Simulation student briefSpatium Project Simulation student brief
Spatium Project Simulation student brief
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptx
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 

11. grid scheduling and resource managament

  • 1. GRID COMPUTING Grid Scheduling & Resource Management Sandeep Kumar Poonia Head of Dept. CS/IT, Jagan Nath University, Jaipur B.E., M. Tech., UGC-NET LM-IAENG, LM-IACSIT,LM-CSTA, LM-AIRCC, LM-SCIEI, AM-UACEE 11/9/2013 Sandeep Kumar Poonia 1
  • 2. OUTLINE Introduction Scheduling Paradigms How Scheduling Works A Review of Condor, SGE, PBS and LSF Grid Scheduling with QoS
  • 3. Introduction Grid scheduling is a process of mapping Grid jobs to resources over multiple administrative domains.  A Grid job can be split into many small tasks.  The scheduler has the responsibility of selecting resources and scheduling jobs in such a way that the user and application requirements are met, in terms of overall execution time (throughput) and cost of the resources utilized.
  • 4. Introduction Jobs, via Globus, can be submitted to systems managed by Condor, the Sun Grid Engine (SGE), thr Portable Batch System (PBS) and the Load Sharing Facility (LSF)
  • 5. Scheduling Paradigms  Centralized Scheduling  Hierarchical Scheduling  Distributed Scheduling
  • 6. Centralized Scheduling In a centralized scheduling environment, a central machine (node) acts as a resource manager to schedule jobs to all the surrounding nodes that are part of the environment. This scheduling paradigm is often used in situations like a computing centre where resources have similar characteristics and usage policies.
  • 7. Centralized Scheduling Here, jobs are first submitted to the central scheduler, which then dispatches the jobs to the appropriate nodes. Those jobs that cannot be started on a node are normally stored in a central job queue for a later start.
  • 8. Centralized Scheduling: Advantage & Disadvantage  Centralized scheduling system may produce better scheduling decisions because it has all necessary, and up-to-date, information about the available resources.  Centralized scheduling does not scale well with the increasing size of the environment that it manages.  The scheduler itself may well become a bottleneck, and if there is a problem with the hardware or software of the scheduler’s server, i.e. a failure,  it presents a single point of failure in the environment.
  • 9. Distributed Scheduling  No central scheduler responsible for managing all the jobs.  It involves multiple localized schedulers, which interact with each other in order to dispatch jobs to the participating nodes.  There are two mechanisms for communicate with other schedulers  Direct Communication  Indirect Communication. a scheduler to
  • 10. Distributed Scheduling: Direct Communication  Each local scheduler can directly communicate with other schedulers for job dispatching.
  • 11. Distributed Scheduling: Direct Communication  Each scheduler has a list of remote schedulers that they can interact with, or there may exist a central directory that maintains all the information related to each scheduler.  If a job cannot be dispatched to its local resources, its scheduler will communicate with other remote schedulers to find resources appropriate and available for executing its job.  Each scheduler may maintain a local job queue(s) for job management.
  • 12. Distributed Scheduling: Indirect Communication Communication via a central job pool In this scenario, jobs that cannot be executed immediately are sent to a central job pool.
  • 13. Distributed Scheduling: Indirect Communication Communication via a central job pool  Compared with direct communication, the local schedulers can potentially choose suitable jobs to schedule on their resources. Policies are required so that all the jobs in the pool are executed at some time.  This method can be modified, so that all jobs are pushed directly in the job-pool after submission.  This way all small jobs requiring few resources can be used for utilizing free resources on all machines.
  • 14. Hierarchical scheduling In hierarchical scheduling, a centralized scheduler interacts with local schedulers for job submission. The centralized scheduler is a kind of a meta-scheduler that dispatches submitted jobs to local schedulers.
  • 15. Hierarchical scheduling  Similar to the centralized scheduling paradigm, hierarchical scheduling can have scalability and communication bottlenecks.  However, compared with centralized scheduling, one advantage of hierarchical scheduling is that the global scheduler and local scheduler can have different policies in scheduling jobs.
  • 16. HOW SCHEDULING WORKS Grid scheduling involves four main stages:  resource discovery,  resource selection,  schedule generation and  job execution
  • 17. Resource discovery Goal: identify a list of authenticated resources that are available for job submission. In order to cope with the dynamic nature of the Grid, a scheduler needs to have some way of incorporating dynamic state information about the available resources into its decision-making process. A Grid environment typically uses a pull model, a push model or a push–pull model for resource discovery.
  • 18. Resource discovery : The pull model A single daemon associated with the scheduler can query Grid resources and collect state information such as CPU loads or the available memory.
  • 19. Resource discovery : The pull model  The pull model for gathering resource information incurs relatively small communication overhead, but unless it requests resource information frequently, it tends to provide fairly stale information which is likely to be constantly out-ofdate, and potentially misleading.  In centralized scheduling, the resource discovery/query process could be rather intrusive and begin to take significant amounts of time as the environment being monitored gets larger and larger.
  • 20. Resource discovery : The push model
  • 21. Resource discovery  Each resource in the environment has a daemon for gathering local state information,  which will be sent to a centralized scheduler that maintains a database to record each resource’s activity.  If the updates are frequent, an accurate view of the system state can be maintained over time; obviously, frequent updates to the database are intrusive and consume network bandwidth.
  • 22. Resource discovery : The push–pull model The push–pull model lies somewhere between the pull model and the push model.
  • 23. Resource discovery : The push–pull model  Each resource in the environment runs a daemon that collects state information.  Instead of directly sending this information to a central scheduler, there exist some intermediate nodes running daemons that aggregate state information from different sub-resources that respond to queries from the scheduler.  A challenge of this model is to find out what information is most useful, how often it should be collected and how long this information should be kept around.
  • 24. Resource Selection The second phase of the scheduling process :  Select those resources that best suit the constraints and conditions imposed by the user, such as CPU usage, RAM available or disk storage.  The result of resource selection is to identify a resource list Rselected in which all resources can meet the minimum requirements for a submitted job or a job list.  The relationship between resources available Ravailable and resources selected Rselected is:  Rselected ⊆ Ravailable
  • 25. Resource Generation The generation of schedules involves two steps,  selecting jobs and  producing resource selection strategies.
  • 26. Resource Generation : Job Selection  The resource selection process is used to choose resource(s) from the resource list Rselected for a given job.  Since all resources in the list Rselected could meet the minimum requirements imposed by the job, an algorithm is needed to choose the best resource(s) to execute the job.  Although random selection is a choice, it is not an ideal resource selection policy.  The resource selection algorithm should take into account the current state of resources and choose the best one based on a quantitative evaluation.
  • 27. Resource Generation : Job Selection A resource selection algorithm that only takes CPU and RAM into account could be designed as follows: where : WCPU – the weight allocated to CPU speed; CPUload – the current CPU load; CPUspeed – real CPU speed; CPUmin – minimum CPU speed; WRAM – the weight allocated to RAM; RAMusage – the current RAM usage; RAMsize – original RAM size; and RAMmin – minimum RAM size.
  • 28. Resource Generation : Job Selection Example: Suppose that the total weighting used in the algorithm is 10, where the CPU weight is 6 and the RAM weight is 4. The minimum CPU speed is 1 GHz and minimum RAM size is 256 MB. Resource information matrix is as follow: Find the best resource for submitted job.
  • 29. Resource Generation : Job Selection Then, evaluation values for resources can be calculated using the three formulas: From the results we know Resource3 is the best choice for the submitted job.
  • 30. Resource Generation : Resource Selection The goal of job selection is to select a job from a job queue for execution. Four strategies that can be used to select a job are given below.  First come first serve  Random Selection  Priority-based Selection  Backfilling Selection
  • 31. Resource Generation : Resource Selection First come first serve:  The scheduler selects jobs for execution in the order of their submissions.  If there is no resource available for the selected job, the scheduler will wait until the job can be started.  The other jobs in the job queue have to wait. There are two main drawbacks with this type of job selection. 1. It may waste resources when, for example, the job selected needs more resources to be available before it can start, which results in a long waiting time. 2. jobs with high priorities cannot get dispatched immediately if a job with a low priority needs more time to complete.
  • 32. Resource Generation : Resource Selection Random selection:  The next job to be scheduled is randomly selected from the job queue.  Apart from the two drawbacks with the firstcome-first-serve strategy, jobs selection is not fair and job submitted earlier may not be scheduled until much later.
  • 33. Resource Generation : Resource Selection Priority-based selection:  Jobs submitted to the scheduler have different priorities.  The next job to be scheduled is the job with the highest priority in the job queue.  A job priority can be set when the job is submitted.  One drawback of this strategy is that it is hard to set an optimal criterion for a job priority.  A job with the highest priority may need more resources than available and may also result in a long waiting time and inability to make good use of the available resources.
  • 34. Resource Generation : Resource Selection Backfilling selection:  The backfilling strategy requires knowledge of the expected execution time of a job to be scheduled.  If the next job in the job queue cannot be started due to a lack of available resources, backfilling tries to find another job in the queue that can use the idle resources.
  • 35. Job execution  Once a job and a resource are selected, the next step is to submit the job to the resource for execution.  Job execution may be as easy as running a single command or as complicated as running a series of scripts that may, or may not, include set up or staging.