SlideShare a Scribd company logo
1 of 29
GrenchMark: A Framework for
Analyzing, Testing, and Comparing Grids




A. Iosup, D.H.J. Epema
PDS Group, ST/EWI, TU Delft




                              April 4, 2012
                              CCGrid 2006     1
Outline


•   Introduction and Motivation
•   The GrenchMark Framework
•   Past and Current Experience with GrenchMark
•   A GrenchMark Success Story
•   Future Work
•   Conclusions




                         April 4, 2012
                                                  2
The Generic Problem of
Analyzing, Testing, and Comparing Grids

• Use cases for automatically analyzing, testing, and
  comparing Grids
   •   Comparisons for system design and procurement
   •   Functionality testing and system tuning
   •   Performance testing/analysis of grid applications
   •   …

• For grids, this problem is hard !
   •   Testing in real environments is difficult
   •   Grids change rapidly
   •   Validity of tests
   •   …
                                April 4, 2012
                                                           3
A Generic Solution to
Analyzing, Testing, and Comparing Grids

• “ Generate and run synthetic grid workloads, based
  on real and synthetic applications “

• Current alternatives (not covering all problems)
   • Benchmarking with real/synthetic applications (representative?)
   • User-defined test management (statistically sound?)

• Advantages of using synthetic grid workloads
   • Statistically sound composition of benchmarks
   • Statistically sound test management
   • Generic: cover the use cases’ broad spectrum (to be shown)
                             April 4, 2012
                                                                       4
Outline


•   Introduction and Motivation
•   The GrenchMark Framework
•   Past and Current Experience with GrenchMark
•   A GrenchMark Success Story
•   Future Work
•   Conclusions




                         April 4, 2012
                                                  5
GrenchMark: a Framework for
Analyzing, Testing, and Comparing grids

•   What’s in a name?
    grid benchmark → working towards a generic tool for the
    whole community: help standardizing the testing procedures,
                              but benchmarks are too early; we use
                              synthetic grid workloads instead

•   What’s it about?
    A systematic approach to analyzing, testing, and comparing grid
    settings, based on synthetic workloads
    • A set of metrics for analyzing grid settings
    • A set of representative grid applications
        • Both real and synthetic
    • Easy-to-use tools to create synthetic grid workloads
    • Flexible, extensible framework


                                    April 4, 2012
                                                                      6
GrenchMark: Iterative Research Roadmap




                 April 4, 2012
                                         7
GrenchMark: Iterative Research Roadmap
             Simple functional system
 A.Iosup, J.Maassen, R.V.van Nieuwpoort, D.H.J.Epema,
    Synthetic Grid Workloads with Ibis, KOALA, and
          GrenchMark, CoreGRID IW, Nov 2005.




                        April 4, 2012
                                                        8
GrenchMark: Iterative Research Roadmap




   Open-
GrenchMark

Community
  Effort
                         Complex extensible system
                                   This work
                   April 4, 2012
                                                     9
GrenchMark Overview:
Generate and Run Synthetic Workloads




                 April 4, 2012
                                       10
GrenchMark Overview: Easy to
Generate and Run Synthetic Workloads




                 April 4, 2012
                                       11
… but More Complicated Than You Think

• Workload structure                        • Measurement methods
   • User-defined and                             • Long workloads
     statistical models                           • Saturated / non-saturated system
   • Dynamic jobs arrival                         • Start-up, production, and
   • Burstiness and self-similarity                 cool-down scenarios
   • Feedback, background load                    • Scaling workload to system
   • Machine usage assumptions              • Applications
   • Users, VOs                                   • Synthetic
• Metrics                                         • Real
   • A(W) Run/Wait/Resp. Time               • Workload definition language
   • Efficiency, MakeSpan                         • Base language layer
   • Failure rate [!]                             • Extended language layer
• (Grid) notions                            • Other
   • Co-allocation, interactive jobs,             • Can use the same workload for
     malleable, moldable, …                       both simulations and real
                                                  environments


                                  April 4, 2012
                                                                                    12
GrenchMark Overview:
Unitary and Composite Applications
• Unitary applications
   •   sequential, MPI, Java RMI, Ibis, …
• Composite applications
  •    Bag of tasks
  •    Chain of jobs
  •    Direct Acyclic Graph-based
       (Standard Task Graph Archive)




                             April 4, 2012
                                             13
GrenchMark Overview:
 Workload Description Files

 • Format:




Number of jobs and application and
   Composition Co-allocation types Language extensions
                                  Inter-arrival and
                 number of components
Combining four workloads into one start time
                        April 4, 2012
                                                         14
Using GrenchMark:
Grid System Analysis

• Performance testing: test the performance of an
  application (for sequential, MPI, Ibis applications)
   • Report runtimes, waiting times, grid middleware overhead
   • Automatic results analysis



• What-if analysis: evaluate potential situations
   • System change
   • Grid inter-operability
   • Special situations: spikes in demand


                             April 4, 2012
                                                                15
Using GrenchMark:
Functionality Testing in Grid Environments

• System functionality testing: show the ability of
  the system to run various types of applications
   • Report failure rates
     [ arguably, functionality in grids is even more important than
     performance !  10% job failure rate in a controlled system
     like the DAS ]




• Periodic system testing: evaluate the current state
  of the grid
   • Replay workloads
                             April 4, 2012
                                                                      16
Using GrenchMark:
Comparing Grid Settings

• Single-site vs. co-allocated jobs:
  compare the success rate of single-site and co-allocated
  jobs, in a system without reservation capabilities
   • Single-site jobs 20% better vs. small co-allocated jobs (<32 CPUs),
     30% better vs. large co-allocated jobs
     [setting and workload-dependent !]

• Unitary vs. composite jobs:
  compare the success rate of unitary and composite jobs,
  with and without failure handling mechanisms
   • Both 100% with simple retry mechanism
     [setting and workload-dependent !]


                             April 4, 2012
                                                                      17
A GrenchMark Success Story:
Releasing the Koala Grid Scheduler on the DAS
•   Koala [ http://www.st.ewi.tudelft.nl/koala/ ]
     • Grid Scheduler with co-allocation capabilities
•   DAS: The Dutch Grid, ~200 researchers
•   Initially
     • Koala, a tested (!) scheduler, pre-release version
•   Test specifics
     • 3 different job submission modules
     • Workloads with different jobs requirements,
       inter-arrival rates, co-allocated v. single site jobs…
     • Evaluate: job success rate, Koala overhead and bottlenecks
•   Results
     • 5,000+ jobs successfully run (all workloads); functionality tests
     • 2 major bugs first day, 10+ bugs overall (all fixed)
     • KOALA is now officially released on the DAS
        (full credit to KOALA developers, 10x for testing with GrenchMark)

                                  April 4, 2012
                                                                             18
GrenchMark’s Current Status:
pre-”Open-GrenchMark”
• Already done in Python [http://www.python.org]
  • Workload Generator
  • Generic Workload Submitter (Koala, Globus GRAM,
    option to extend for JSDL, Condor, PBS, LSF, SGE, …)
  • Applications
      • Unitary, 3 types: sequential, MPI, Ibis (Java)
      • +35 real and synthetic applications
      • Composite applications: DAG-based
• Extending modeling capabilities
A.Iosup, D.H.J.Epema (TU Delft), C. Franke, A. Papaspyrou,
L. Schley, B. Song, R. Yahyapour (U Dortmund), On modeling
synthetic workloads for Grid performance evaluation, 12th
Workshop on Job Scheduling Strategies for Parallel
Processing (JSSPP), held April 4, 2012
                         in conjunction with SIGMETRICS
2006, Saint Malo, France, June 2006 (accepted).              19
Towards Open-GrenchMark:
 Grid traces, Simulators, Benchmarks

 • Distributed testing
    • Integrate with DiPerF (C. Dumitrescu, I. Raicu, M. Ripeanu)
 • Grid traces analysis
    • Automatic tools for grid traces analysis
A. Iosup, C. Dumitrescu, D.H.J. Epema (TU Delft), H. Li,
L. Wolters (U Leiden), How are Real Grids Used? The Analysis
of Four Grid Traces and Its Implications, (submitted).

 • Use in conjunction with simulators
    • Ability to generate workloads which can be used in simulated
      environments (e.g., GangSim, GridSim, …)
 • Grid benchmarks
    • Analyze the requirements for domain-specific grid benchmarks
                              April 4, 2012
                                                                     20
Conclusion

• GrenchMark
  generates diverse grid workloads
  easy-to-use, flexible, portable, extensible, …


• Experience
  used GrenchMark to test KOALA’s functionality and performance.
  used GrenchMark to analyze, test, and compare grid settings.
  15,000+ jobs generated and run … and counting.


• (more) advertisement
  Have specific grid setting you would like to test?
                Test with GrenchMark!
                              April 4, 2012
                                                                   21
Thank you! Jason Maassen to Hashim Mohamed (Koala),
             Many thanks
                         and Rob van Nieuwpoort (Ibis).




       Questions? Remarks? Observations?
                   All welcome!

                           GrenchMark
              http://grenchmark.st.ewi.tudelft.nl/ [10x Paulo]


                          Alexandru IOSUP
                             TU Delft
                            A.Iosup@tudelft.nl
    http://www.pds.ewi.tudelft.nl/~iosup/index.html [google: “iosup”]
                              April 4, 2012
                                                                        22
April 4, 2012
                23
Outline


•   Introduction                        [here]
•   The GrenchMark framework
•   Experience with GrenchMark
•   Extending GrenchMark
•   Conclusions




                        April 4, 2012
                                                 24
Representative Grid applications (1/4)


• Unitary applications
  • Just one scheduling unit (otherwise recursive definition)
  • Examples:
    Sequential, MPI, Java RMI, Ibis, …

• Composite applications
  • Composed of several unitary or composite applications
  • Examples:
    Parameter sweeps, chains of tasks, DAGs, workflows, …


                        April 4, 2012
                                                           25
Representative Grid applications (2/4)
   Unitary: synthetic sequential
                                                     <<single site/single node/single
                                                      stdin     processor>>       stdout



[I] read I1 data elements from stdin
                                                       N+1:I1,N*I2       N+2:O1,N*O2,O3



[O] write O1 data elements to stdout                       System reqs

                                                  Processor (float*N*S*C)
for each N steps (i) # superstep                  Memory (float*(M+1)*S)
  [I] read I2 data elements from stdin
  for each M memory locations (j) # computation step
     [M] get item j of size S
     [P] compute C computation units per memory item (fmul),
        and store results into temp memory location
     [M] put values from temp to location j
  [O] write O2 data elements to stdout
[O] write O3 data elements to stdout          Also version with
                                                   computation and I/O
                                   April 4, 2012
                                                                                           26
Representative Grid applications (2/4)
    Unitary: synthetic parallel
                                                              <<MPI system/single site>>
                                                                <<single node/single processor>>
                                                           <<single node/single processor>>

[I] read I1 data from in$ProcID           <<single node/single processor>>
                                                app-input.i    app-output.i
[O] write O1 data to out$ProcID
for each N steps (i) # superstep           N+1:
                                        I1,N*I2
                                                         Process i          N+2:
                                                                            O1,N*O2,O3

   [B] synchronize with barrier
   [I] read I2 data from in$ProcID
   for each M memory locations (j)                       Machine i


     [M] get item j of size S                  Processor i (float*N*S*C)
                                                                   k




     [P] compute C fmul/mem                    Memory (float*N*2S)


     [M] put values from temp to j
  [C] communicate to all other processes X values and
     receive X values from every other processor
  [O] write O2 data elements to out$ProcID
[O] write O3 data elements to out$ProcID
                                           April 4, 2012
                                                                                                   27
Representative Grid applications (3/4)
Unitary: Ibis jobs
• No modeling, just real applications
 [ http://www.cs.vu.nl/ibis ]
  •   physical simulation
  •   parallel rendering
  •   computational mathematics
  •   state space search
  •   bioinformatics
  •   data compression
  •   grid methods
  •   optimization
                        April 4, 2012
                                         28
Representative Grid applications (3/4)
Composite: DAG-based
• DAG-based applications
  • Real DAG
  • Chain of tools
• Try to model real or predicted (use) cases
                       perf1.dat    <<final>>                    param2.in some-list.in
        param1.in                  out_1-2.dat       l1p.dat
                                                                                               <<final>>
                                                                                               out2.res



                    App1           out_1-1.dat   Linker1 huge-data.out           App2
                <<single>>                       <<link>>                      <<single>>

                                                                                               perf2.dat
                    Linker               Input           Identity
                                                         (one task’s output = other’s input, unmodified)

                    User task           Output           Final result


                                         April 4, 2012
                                                                                                           29

More Related Content

What's hot

Visual Data Analytics in the Cloud for Exploratory Science
Visual Data Analytics in the Cloud for Exploratory ScienceVisual Data Analytics in the Cloud for Exploratory Science
Visual Data Analytics in the Cloud for Exploratory ScienceUniversity of Washington
 
Semantic Web: The Inside Story
Semantic Web: The Inside StorySemantic Web: The Inside Story
Semantic Web: The Inside StoryJames Hendler
 
Applications of Machine Learning at USC
Applications of Machine Learning at USCApplications of Machine Learning at USC
Applications of Machine Learning at USCSri Ambati
 
Big Data Talent in Academic and Industry R&D
Big Data Talent in Academic and Industry R&DBig Data Talent in Academic and Industry R&D
Big Data Talent in Academic and Industry R&DUniversity of Washington
 
Big Data + Big Sim: Query Processing over Unstructured CFD Models
Big Data + Big Sim: Query Processing over Unstructured CFD ModelsBig Data + Big Sim: Query Processing over Unstructured CFD Models
Big Data + Big Sim: Query Processing over Unstructured CFD ModelsUniversity of Washington
 
Broad Data (India 2015)
Broad Data (India 2015)Broad Data (India 2015)
Broad Data (India 2015)James Hendler
 
On Beyond OWL: challenges for ontologies on the Web
On Beyond OWL: challenges for ontologies on the WebOn Beyond OWL: challenges for ontologies on the Web
On Beyond OWL: challenges for ontologies on the WebJames Hendler
 
The Semantic Web: It's for Real
The Semantic Web: It's for RealThe Semantic Web: It's for Real
The Semantic Web: It's for RealJames Hendler
 
Rabobank - There is something about Data
Rabobank - There is something about DataRabobank - There is something about Data
Rabobank - There is something about DataBigDataExpo
 
Big Data in NATO and Your Role
Big Data in NATO and Your RoleBig Data in NATO and Your Role
Big Data in NATO and Your RoleJay Gendron
 
AI-Driven Science and Engineering with the Global AI and Modeling Supercomput...
AI-Driven Science and Engineering with the Global AI and Modeling Supercomput...AI-Driven Science and Engineering with the Global AI and Modeling Supercomput...
AI-Driven Science and Engineering with the Global AI and Modeling Supercomput...Geoffrey Fox
 
The Unreasonable Effectiveness of Metadata
The Unreasonable Effectiveness of MetadataThe Unreasonable Effectiveness of Metadata
The Unreasonable Effectiveness of MetadataJames Hendler
 
Data Culture Series - Keynote & Panel - Birmingham - 8th April 2015
Data Culture Series  - Keynote & Panel - Birmingham - 8th April 2015Data Culture Series  - Keynote & Panel - Birmingham - 8th April 2015
Data Culture Series - Keynote & Panel - Birmingham - 8th April 2015Jonathan Woodward
 
A New Partnership for Cross-Scale, Cross-Domain eScience
A New Partnership for Cross-Scale, Cross-Domain eScienceA New Partnership for Cross-Scale, Cross-Domain eScience
A New Partnership for Cross-Scale, Cross-Domain eScienceUniversity of Washington
 
What is eScience, and where does it go from here?
What is eScience, and where does it go from here?What is eScience, and where does it go from here?
What is eScience, and where does it go from here?Daniel S. Katz
 
The Future(s) of the World Wide Web
The Future(s) of the World Wide WebThe Future(s) of the World Wide Web
The Future(s) of the World Wide WebJames Hendler
 
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015Big Data Spain
 

What's hot (20)

Visual Data Analytics in the Cloud for Exploratory Science
Visual Data Analytics in the Cloud for Exploratory ScienceVisual Data Analytics in the Cloud for Exploratory Science
Visual Data Analytics in the Cloud for Exploratory Science
 
Wither OWL
Wither OWLWither OWL
Wither OWL
 
Semantic Web: The Inside Story
Semantic Web: The Inside StorySemantic Web: The Inside Story
Semantic Web: The Inside Story
 
Applications of Machine Learning at USC
Applications of Machine Learning at USCApplications of Machine Learning at USC
Applications of Machine Learning at USC
 
Big Data Talent in Academic and Industry R&D
Big Data Talent in Academic and Industry R&DBig Data Talent in Academic and Industry R&D
Big Data Talent in Academic and Industry R&D
 
Broad Data
Broad DataBroad Data
Broad Data
 
End-to-End eScience
End-to-End eScienceEnd-to-End eScience
End-to-End eScience
 
Big Data + Big Sim: Query Processing over Unstructured CFD Models
Big Data + Big Sim: Query Processing over Unstructured CFD ModelsBig Data + Big Sim: Query Processing over Unstructured CFD Models
Big Data + Big Sim: Query Processing over Unstructured CFD Models
 
Broad Data (India 2015)
Broad Data (India 2015)Broad Data (India 2015)
Broad Data (India 2015)
 
On Beyond OWL: challenges for ontologies on the Web
On Beyond OWL: challenges for ontologies on the WebOn Beyond OWL: challenges for ontologies on the Web
On Beyond OWL: challenges for ontologies on the Web
 
The Semantic Web: It's for Real
The Semantic Web: It's for RealThe Semantic Web: It's for Real
The Semantic Web: It's for Real
 
Rabobank - There is something about Data
Rabobank - There is something about DataRabobank - There is something about Data
Rabobank - There is something about Data
 
Big Data in NATO and Your Role
Big Data in NATO and Your RoleBig Data in NATO and Your Role
Big Data in NATO and Your Role
 
AI-Driven Science and Engineering with the Global AI and Modeling Supercomput...
AI-Driven Science and Engineering with the Global AI and Modeling Supercomput...AI-Driven Science and Engineering with the Global AI and Modeling Supercomput...
AI-Driven Science and Engineering with the Global AI and Modeling Supercomput...
 
The Unreasonable Effectiveness of Metadata
The Unreasonable Effectiveness of MetadataThe Unreasonable Effectiveness of Metadata
The Unreasonable Effectiveness of Metadata
 
Data Culture Series - Keynote & Panel - Birmingham - 8th April 2015
Data Culture Series  - Keynote & Panel - Birmingham - 8th April 2015Data Culture Series  - Keynote & Panel - Birmingham - 8th April 2015
Data Culture Series - Keynote & Panel - Birmingham - 8th April 2015
 
A New Partnership for Cross-Scale, Cross-Domain eScience
A New Partnership for Cross-Scale, Cross-Domain eScienceA New Partnership for Cross-Scale, Cross-Domain eScience
A New Partnership for Cross-Scale, Cross-Domain eScience
 
What is eScience, and where does it go from here?
What is eScience, and where does it go from here?What is eScience, and where does it go from here?
What is eScience, and where does it go from here?
 
The Future(s) of the World Wide Web
The Future(s) of the World Wide WebThe Future(s) of the World Wide Web
The Future(s) of the World Wide Web
 
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015
Data Science in 2016: Moving up by Paco Nathan at Big Data Spain 2015
 

Similar to GrenchMark at CCGrid, May 2006.

Detecting common scientific workflow fragments using templates and execution ...
Detecting common scientific workflow fragments using templates and execution ...Detecting common scientific workflow fragments using templates and execution ...
Detecting common scientific workflow fragments using templates and execution ...dgarijo
 
MongoDB for Spatio-Behavioral Data Analysis and Visualization
MongoDB for Spatio-Behavioral Data Analysis and VisualizationMongoDB for Spatio-Behavioral Data Analysis and Visualization
MongoDB for Spatio-Behavioral Data Analysis and VisualizationMongoDB
 
Jumbune optimize hadoop-solutions
Jumbune optimize hadoop-solutionsJumbune optimize hadoop-solutions
Jumbune optimize hadoop-solutionsArpan Bhandari
 
Jumbune optimize hadoop-solutions
Jumbune optimize hadoop-solutionsJumbune optimize hadoop-solutions
Jumbune optimize hadoop-solutionsMahesh Nair
 
Big Machine Learning Libraries & Open Challenges
Big Machine Learning Libraries & Open ChallengesBig Machine Learning Libraries & Open Challenges
Big Machine Learning Libraries & Open ChallengesPetr Novotný
 
Practical cheminformatics workflows with mobile apps
Practical cheminformatics workflows with mobile appsPractical cheminformatics workflows with mobile apps
Practical cheminformatics workflows with mobile appsAlex Clark
 
Use Case: Apollo Group at Oracle Open World
Use Case: Apollo Group at Oracle Open WorldUse Case: Apollo Group at Oracle Open World
Use Case: Apollo Group at Oracle Open WorldMongoDB
 
Adding Performance Testing to a Software Development Project
Adding Performance Testing to a Software Development ProjectAdding Performance Testing to a Software Development Project
Adding Performance Testing to a Software Development ProjectCris Holdorph
 
Neo4j Theory and Practice - Tareq Abedrabbo @ GraphConnect London 2013
Neo4j Theory and Practice - Tareq Abedrabbo @ GraphConnect London 2013Neo4j Theory and Practice - Tareq Abedrabbo @ GraphConnect London 2013
Neo4j Theory and Practice - Tareq Abedrabbo @ GraphConnect London 2013Neo4j
 
Machine Learning Application Development
Machine Learning Application DevelopmentMachine Learning Application Development
Machine Learning Application DevelopmentLARCA UPC
 
The Analytics Frontier of the Hadoop Eco-System
The Analytics Frontier of the Hadoop Eco-SystemThe Analytics Frontier of the Hadoop Eco-System
The Analytics Frontier of the Hadoop Eco-Systeminside-BigData.com
 
Tools of the Trade: Load Testing - Ignite session at WebPerfDays NY 14
Tools of the Trade: Load Testing -  Ignite session at WebPerfDays NY 14Tools of the Trade: Load Testing -  Ignite session at WebPerfDays NY 14
Tools of the Trade: Load Testing - Ignite session at WebPerfDays NY 14Alexander Podelko
 
Proposal defense2 flat
Proposal defense2 flatProposal defense2 flat
Proposal defense2 flatSam Skalicky
 
Vehicle management system
Vehicle management systemVehicle management system
Vehicle management systemwethecodershelp
 
Vehicle management system
Vehicle management systemVehicle management system
Vehicle management systemWeTheCoders
 
The Decision Table Template For Geospatial Business Rules
The Decision Table Template For Geospatial Business RulesThe Decision Table Template For Geospatial Business Rules
The Decision Table Template For Geospatial Business RulesJacob Feldman
 
Crafting bigdatabenchmarks
Crafting bigdatabenchmarksCrafting bigdatabenchmarks
Crafting bigdatabenchmarksTilmann Rabl
 
OSGi Working Group Technical Progress Report 2007 - Enterprise
OSGi Working Group Technical Progress Report 2007 - EnterpriseOSGi Working Group Technical Progress Report 2007 - Enterprise
OSGi Working Group Technical Progress Report 2007 - Enterprisemfrancis
 
Webinar: How We Evaluated MongoDB as a Relational Database Replacement
Webinar: How We Evaluated MongoDB as a Relational Database ReplacementWebinar: How We Evaluated MongoDB as a Relational Database Replacement
Webinar: How We Evaluated MongoDB as a Relational Database ReplacementMongoDB
 
Student Industrial Training Presentation Slide
Student Industrial Training Presentation SlideStudent Industrial Training Presentation Slide
Student Industrial Training Presentation SlideKhairul Filhan
 

Similar to GrenchMark at CCGrid, May 2006. (20)

Detecting common scientific workflow fragments using templates and execution ...
Detecting common scientific workflow fragments using templates and execution ...Detecting common scientific workflow fragments using templates and execution ...
Detecting common scientific workflow fragments using templates and execution ...
 
MongoDB for Spatio-Behavioral Data Analysis and Visualization
MongoDB for Spatio-Behavioral Data Analysis and VisualizationMongoDB for Spatio-Behavioral Data Analysis and Visualization
MongoDB for Spatio-Behavioral Data Analysis and Visualization
 
Jumbune optimize hadoop-solutions
Jumbune optimize hadoop-solutionsJumbune optimize hadoop-solutions
Jumbune optimize hadoop-solutions
 
Jumbune optimize hadoop-solutions
Jumbune optimize hadoop-solutionsJumbune optimize hadoop-solutions
Jumbune optimize hadoop-solutions
 
Big Machine Learning Libraries & Open Challenges
Big Machine Learning Libraries & Open ChallengesBig Machine Learning Libraries & Open Challenges
Big Machine Learning Libraries & Open Challenges
 
Practical cheminformatics workflows with mobile apps
Practical cheminformatics workflows with mobile appsPractical cheminformatics workflows with mobile apps
Practical cheminformatics workflows with mobile apps
 
Use Case: Apollo Group at Oracle Open World
Use Case: Apollo Group at Oracle Open WorldUse Case: Apollo Group at Oracle Open World
Use Case: Apollo Group at Oracle Open World
 
Adding Performance Testing to a Software Development Project
Adding Performance Testing to a Software Development ProjectAdding Performance Testing to a Software Development Project
Adding Performance Testing to a Software Development Project
 
Neo4j Theory and Practice - Tareq Abedrabbo @ GraphConnect London 2013
Neo4j Theory and Practice - Tareq Abedrabbo @ GraphConnect London 2013Neo4j Theory and Practice - Tareq Abedrabbo @ GraphConnect London 2013
Neo4j Theory and Practice - Tareq Abedrabbo @ GraphConnect London 2013
 
Machine Learning Application Development
Machine Learning Application DevelopmentMachine Learning Application Development
Machine Learning Application Development
 
The Analytics Frontier of the Hadoop Eco-System
The Analytics Frontier of the Hadoop Eco-SystemThe Analytics Frontier of the Hadoop Eco-System
The Analytics Frontier of the Hadoop Eco-System
 
Tools of the Trade: Load Testing - Ignite session at WebPerfDays NY 14
Tools of the Trade: Load Testing -  Ignite session at WebPerfDays NY 14Tools of the Trade: Load Testing -  Ignite session at WebPerfDays NY 14
Tools of the Trade: Load Testing - Ignite session at WebPerfDays NY 14
 
Proposal defense2 flat
Proposal defense2 flatProposal defense2 flat
Proposal defense2 flat
 
Vehicle management system
Vehicle management systemVehicle management system
Vehicle management system
 
Vehicle management system
Vehicle management systemVehicle management system
Vehicle management system
 
The Decision Table Template For Geospatial Business Rules
The Decision Table Template For Geospatial Business RulesThe Decision Table Template For Geospatial Business Rules
The Decision Table Template For Geospatial Business Rules
 
Crafting bigdatabenchmarks
Crafting bigdatabenchmarksCrafting bigdatabenchmarks
Crafting bigdatabenchmarks
 
OSGi Working Group Technical Progress Report 2007 - Enterprise
OSGi Working Group Technical Progress Report 2007 - EnterpriseOSGi Working Group Technical Progress Report 2007 - Enterprise
OSGi Working Group Technical Progress Report 2007 - Enterprise
 
Webinar: How We Evaluated MongoDB as a Relational Database Replacement
Webinar: How We Evaluated MongoDB as a Relational Database ReplacementWebinar: How We Evaluated MongoDB as a Relational Database Replacement
Webinar: How We Evaluated MongoDB as a Relational Database Replacement
 
Student Industrial Training Presentation Slide
Student Industrial Training Presentation SlideStudent Industrial Training Presentation Slide
Student Industrial Training Presentation Slide
 

Recently uploaded

Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????blackmambaettijean
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 

Recently uploaded (20)

Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 

GrenchMark at CCGrid, May 2006.

  • 1. GrenchMark: A Framework for Analyzing, Testing, and Comparing Grids A. Iosup, D.H.J. Epema PDS Group, ST/EWI, TU Delft April 4, 2012 CCGrid 2006 1
  • 2. Outline • Introduction and Motivation • The GrenchMark Framework • Past and Current Experience with GrenchMark • A GrenchMark Success Story • Future Work • Conclusions April 4, 2012 2
  • 3. The Generic Problem of Analyzing, Testing, and Comparing Grids • Use cases for automatically analyzing, testing, and comparing Grids • Comparisons for system design and procurement • Functionality testing and system tuning • Performance testing/analysis of grid applications • … • For grids, this problem is hard ! • Testing in real environments is difficult • Grids change rapidly • Validity of tests • … April 4, 2012 3
  • 4. A Generic Solution to Analyzing, Testing, and Comparing Grids • “ Generate and run synthetic grid workloads, based on real and synthetic applications “ • Current alternatives (not covering all problems) • Benchmarking with real/synthetic applications (representative?) • User-defined test management (statistically sound?) • Advantages of using synthetic grid workloads • Statistically sound composition of benchmarks • Statistically sound test management • Generic: cover the use cases’ broad spectrum (to be shown) April 4, 2012 4
  • 5. Outline • Introduction and Motivation • The GrenchMark Framework • Past and Current Experience with GrenchMark • A GrenchMark Success Story • Future Work • Conclusions April 4, 2012 5
  • 6. GrenchMark: a Framework for Analyzing, Testing, and Comparing grids • What’s in a name? grid benchmark → working towards a generic tool for the whole community: help standardizing the testing procedures, but benchmarks are too early; we use synthetic grid workloads instead • What’s it about? A systematic approach to analyzing, testing, and comparing grid settings, based on synthetic workloads • A set of metrics for analyzing grid settings • A set of representative grid applications • Both real and synthetic • Easy-to-use tools to create synthetic grid workloads • Flexible, extensible framework April 4, 2012 6
  • 7. GrenchMark: Iterative Research Roadmap April 4, 2012 7
  • 8. GrenchMark: Iterative Research Roadmap Simple functional system A.Iosup, J.Maassen, R.V.van Nieuwpoort, D.H.J.Epema, Synthetic Grid Workloads with Ibis, KOALA, and GrenchMark, CoreGRID IW, Nov 2005. April 4, 2012 8
  • 9. GrenchMark: Iterative Research Roadmap Open- GrenchMark Community Effort Complex extensible system This work April 4, 2012 9
  • 10. GrenchMark Overview: Generate and Run Synthetic Workloads April 4, 2012 10
  • 11. GrenchMark Overview: Easy to Generate and Run Synthetic Workloads April 4, 2012 11
  • 12. … but More Complicated Than You Think • Workload structure • Measurement methods • User-defined and • Long workloads statistical models • Saturated / non-saturated system • Dynamic jobs arrival • Start-up, production, and • Burstiness and self-similarity cool-down scenarios • Feedback, background load • Scaling workload to system • Machine usage assumptions • Applications • Users, VOs • Synthetic • Metrics • Real • A(W) Run/Wait/Resp. Time • Workload definition language • Efficiency, MakeSpan • Base language layer • Failure rate [!] • Extended language layer • (Grid) notions • Other • Co-allocation, interactive jobs, • Can use the same workload for malleable, moldable, … both simulations and real environments April 4, 2012 12
  • 13. GrenchMark Overview: Unitary and Composite Applications • Unitary applications • sequential, MPI, Java RMI, Ibis, … • Composite applications • Bag of tasks • Chain of jobs • Direct Acyclic Graph-based (Standard Task Graph Archive) April 4, 2012 13
  • 14. GrenchMark Overview: Workload Description Files • Format: Number of jobs and application and Composition Co-allocation types Language extensions Inter-arrival and number of components Combining four workloads into one start time April 4, 2012 14
  • 15. Using GrenchMark: Grid System Analysis • Performance testing: test the performance of an application (for sequential, MPI, Ibis applications) • Report runtimes, waiting times, grid middleware overhead • Automatic results analysis • What-if analysis: evaluate potential situations • System change • Grid inter-operability • Special situations: spikes in demand April 4, 2012 15
  • 16. Using GrenchMark: Functionality Testing in Grid Environments • System functionality testing: show the ability of the system to run various types of applications • Report failure rates [ arguably, functionality in grids is even more important than performance !  10% job failure rate in a controlled system like the DAS ] • Periodic system testing: evaluate the current state of the grid • Replay workloads April 4, 2012 16
  • 17. Using GrenchMark: Comparing Grid Settings • Single-site vs. co-allocated jobs: compare the success rate of single-site and co-allocated jobs, in a system without reservation capabilities • Single-site jobs 20% better vs. small co-allocated jobs (<32 CPUs), 30% better vs. large co-allocated jobs [setting and workload-dependent !] • Unitary vs. composite jobs: compare the success rate of unitary and composite jobs, with and without failure handling mechanisms • Both 100% with simple retry mechanism [setting and workload-dependent !] April 4, 2012 17
  • 18. A GrenchMark Success Story: Releasing the Koala Grid Scheduler on the DAS • Koala [ http://www.st.ewi.tudelft.nl/koala/ ] • Grid Scheduler with co-allocation capabilities • DAS: The Dutch Grid, ~200 researchers • Initially • Koala, a tested (!) scheduler, pre-release version • Test specifics • 3 different job submission modules • Workloads with different jobs requirements, inter-arrival rates, co-allocated v. single site jobs… • Evaluate: job success rate, Koala overhead and bottlenecks • Results • 5,000+ jobs successfully run (all workloads); functionality tests • 2 major bugs first day, 10+ bugs overall (all fixed) • KOALA is now officially released on the DAS (full credit to KOALA developers, 10x for testing with GrenchMark) April 4, 2012 18
  • 19. GrenchMark’s Current Status: pre-”Open-GrenchMark” • Already done in Python [http://www.python.org] • Workload Generator • Generic Workload Submitter (Koala, Globus GRAM, option to extend for JSDL, Condor, PBS, LSF, SGE, …) • Applications • Unitary, 3 types: sequential, MPI, Ibis (Java) • +35 real and synthetic applications • Composite applications: DAG-based • Extending modeling capabilities A.Iosup, D.H.J.Epema (TU Delft), C. Franke, A. Papaspyrou, L. Schley, B. Song, R. Yahyapour (U Dortmund), On modeling synthetic workloads for Grid performance evaluation, 12th Workshop on Job Scheduling Strategies for Parallel Processing (JSSPP), held April 4, 2012 in conjunction with SIGMETRICS 2006, Saint Malo, France, June 2006 (accepted). 19
  • 20. Towards Open-GrenchMark: Grid traces, Simulators, Benchmarks • Distributed testing • Integrate with DiPerF (C. Dumitrescu, I. Raicu, M. Ripeanu) • Grid traces analysis • Automatic tools for grid traces analysis A. Iosup, C. Dumitrescu, D.H.J. Epema (TU Delft), H. Li, L. Wolters (U Leiden), How are Real Grids Used? The Analysis of Four Grid Traces and Its Implications, (submitted). • Use in conjunction with simulators • Ability to generate workloads which can be used in simulated environments (e.g., GangSim, GridSim, …) • Grid benchmarks • Analyze the requirements for domain-specific grid benchmarks April 4, 2012 20
  • 21. Conclusion • GrenchMark generates diverse grid workloads easy-to-use, flexible, portable, extensible, … • Experience used GrenchMark to test KOALA’s functionality and performance. used GrenchMark to analyze, test, and compare grid settings. 15,000+ jobs generated and run … and counting. • (more) advertisement Have specific grid setting you would like to test? Test with GrenchMark! April 4, 2012 21
  • 22. Thank you! Jason Maassen to Hashim Mohamed (Koala), Many thanks and Rob van Nieuwpoort (Ibis). Questions? Remarks? Observations? All welcome! GrenchMark http://grenchmark.st.ewi.tudelft.nl/ [10x Paulo] Alexandru IOSUP TU Delft A.Iosup@tudelft.nl http://www.pds.ewi.tudelft.nl/~iosup/index.html [google: “iosup”] April 4, 2012 22
  • 24. Outline • Introduction [here] • The GrenchMark framework • Experience with GrenchMark • Extending GrenchMark • Conclusions April 4, 2012 24
  • 25. Representative Grid applications (1/4) • Unitary applications • Just one scheduling unit (otherwise recursive definition) • Examples: Sequential, MPI, Java RMI, Ibis, … • Composite applications • Composed of several unitary or composite applications • Examples: Parameter sweeps, chains of tasks, DAGs, workflows, … April 4, 2012 25
  • 26. Representative Grid applications (2/4) Unitary: synthetic sequential <<single site/single node/single stdin processor>> stdout [I] read I1 data elements from stdin N+1:I1,N*I2 N+2:O1,N*O2,O3 [O] write O1 data elements to stdout System reqs Processor (float*N*S*C) for each N steps (i) # superstep Memory (float*(M+1)*S) [I] read I2 data elements from stdin for each M memory locations (j) # computation step [M] get item j of size S [P] compute C computation units per memory item (fmul), and store results into temp memory location [M] put values from temp to location j [O] write O2 data elements to stdout [O] write O3 data elements to stdout Also version with computation and I/O April 4, 2012 26
  • 27. Representative Grid applications (2/4) Unitary: synthetic parallel <<MPI system/single site>> <<single node/single processor>> <<single node/single processor>> [I] read I1 data from in$ProcID <<single node/single processor>> app-input.i app-output.i [O] write O1 data to out$ProcID for each N steps (i) # superstep N+1: I1,N*I2 Process i N+2: O1,N*O2,O3 [B] synchronize with barrier [I] read I2 data from in$ProcID for each M memory locations (j) Machine i [M] get item j of size S Processor i (float*N*S*C) k [P] compute C fmul/mem Memory (float*N*2S) [M] put values from temp to j [C] communicate to all other processes X values and receive X values from every other processor [O] write O2 data elements to out$ProcID [O] write O3 data elements to out$ProcID April 4, 2012 27
  • 28. Representative Grid applications (3/4) Unitary: Ibis jobs • No modeling, just real applications [ http://www.cs.vu.nl/ibis ] • physical simulation • parallel rendering • computational mathematics • state space search • bioinformatics • data compression • grid methods • optimization April 4, 2012 28
  • 29. Representative Grid applications (3/4) Composite: DAG-based • DAG-based applications • Real DAG • Chain of tools • Try to model real or predicted (use) cases perf1.dat <<final>> param2.in some-list.in param1.in out_1-2.dat l1p.dat <<final>> out2.res App1 out_1-1.dat Linker1 huge-data.out App2 <<single>> <<link>> <<single>> perf2.dat Linker Input Identity (one task’s output = other’s input, unmodified) User task Output Final result April 4, 2012 29

Editor's Notes

  1. Solution: Build synthetic grid workloads using real and synthetic applications, and run them in real or simulated environments.