SlideShare a Scribd company logo
1 of 23
Download to read offline
http://itconfidence2013.wordpress.com
Beyond The Statistical Average
2°°°°International Conference on
IT Data collection, Analysis and Benchmarking
Tokyo (Japan) - October 22, 2014
John Ogilvie
CEO, ISBSGCEO, ISBSG
BeyInsert here a pictureB
The KISIS Principle
“Keeping it Simple is
Stupid”
2IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com
Beyond the
Statistical Average
Goals of the presentation
G1. Understand the characteristics of the data available to
establish a productivity baseline
G2. Statistical considerations in establishing a
productivity baseline
G3. Statistical considerations in measuring actual
performance against a baseline
http://itconfidence2013.wordpress.com
Case Study
• ABC company is outsourcing their application
development and maintenance.
• They wish to establish a set of targets for annual
improvements in productivity based on what they
were achieving internally prior to outsourcing.
• The contract with the vendor specified shared
risk/reward
• bonus/penalty payments for over/under achievement
against targets.
http://itconfidence2013.wordpress.com
Case Study
• For each of New Development and Application
Enhancement projects ABC required 28
( 4 technologies X 7 FP size bands) performance
baselines in Hours/Function Point
• Annual % improvement targets were specified
• ABC had data from 128 internal projects. If there
were at least 5 in a particular segment, the baseline
was set as the average.
• Otherwise an industry data average was used.
http://itconfidence2013.wordpress.com
Case Study
• At the end of each quarter, the average actual
performance in each segment was calculated and
the bonus/penalty rules applied
• No minimum number of data points for the calculation
was specified
• In many cases only 1 or 2 actual projects in each category
After 12 months there was considerable conflict
between ABC and vendor with ABC threatening
legal action and vendor threatening to exit
contract .
• Both over and under achievements were challenged
6IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com
What Questions arise in Case Study
What are the characteristics of the data we have?
• Shape of distribution
• Handling of Outliers
Baseline:
• How much data is required
• Do performance segments make sense
Measurement:
• How do we determine how productivity has changed
• How much measurement data is required
http://itconfidence2013.wordpress.com
Data Used in This Presentation
For the purposes of this presentation, data was extracted
from the ISBSG Development and Enhancement Repository.
• Data Quality Rating: A or B
• Development Type: Enhancement
• Count Method: IFPUG
• Application Group: Business Application
• Development Platform: Mainframe/Midrange/Multi
Analyses and tables where produced using Minitab Statistical
Software
8IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com
ISBSG Relative Sizes
Categorises the Functional Size by relative sizes as follows:
Relative Size Functional Size
1. XXS Extra-extra-small => 0 and <10
2. XS Extra-small => 10 and <30
3. S Small => 30 and <100
4. M1 Medium1 => 100 and <300
5. M2 Medium2 => 300 and <1000
6. L Large => 1,000 and < 3,000
7. XL Extra-large => 3,000 and < 9,000
8. XXL Extra-extra-large => 9,000 and < 18,000
9. XXXL Extra-extra-extra-large => 18,000
9IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com
Examine the Data: Descriptive Statistics
Relative Total
Size Count Mean TrMean StDev Minimum Median Maximum
1. XXS 43 27.01 19.84 45.27 1.40 7.90 236.30
2. XS 157 19.06 14.25 29.83 0.90 10.90 271.60
3. S 424 16.34 12.23 30.27 0.40 10.00 424.90
4. M1 470 13.19 11.47 12.67 0.90 9.60 97.90
5. M2 187 14.52 13.10 12.68 0.80 11.10 80.70
6. L 31 11.69 10.25 11.63 1.00 9.10 42.90
7. XL 4 1.50 * 1.34 0.10 1.30 3.30
8. XXL 2 0.35 * 0.21 0.20 0.35 0.50
10IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com
Examine the Data
444422220000333366660000333300000000222244440000111188880000111122220000666600000000
PDR (afp)
Dotplot of PDR (afp)
Each symbol represents up to 17 observations.
http://itconfidence2013.wordpress.com
Examine the Data
444422220000333366660000333300000000222244440000111188880000111122220000666600000000
1111.... XXXXXXXXSSSS
2222.... XXXXSSSS
3333.... SSSS
4444.... MMMM1111
5555.... MMMM2222
6666.... LLLL
7777.... XXXXLLLL
8888.... XXXXXXXXLLLL
PDR (afp)
RelativeSize
Dotplot of PDR (afp)
Each symbol represents up to 16 observations.
12IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com
Handling Outliers
An outlier is an unusually large or small observation. Outliers can
have a disproportionate influence on statistical results, such as the
mean, which can result in misleading interpretations
A variety of techniques can be used
• Trim the data by removing the top and bottom 5% - simple to do
• Remove data more than 2 standard deviations from the mean (
simple to do but assumes data has normal distribution)
• Statistical test that all values in the sample are from the same,
normally distributed population. ( Need a tool and assumes data has
normal distribution)
• Graphically using a Boxplot
http://itconfidence2013.wordpress.com
222255550000222200000000111155550000111100000000555500000000
PDR (afp)
Boxplot of PDR (afp) for 1. XXS
14IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com
Boxplot
• “Box” shows values in from Quartile 1(Q1) to
Quartile 3(Q3)
• Inter Quartile Range (IQR) is from Q1 to Q3.
• Value is Q3 – Q1
• Mean and Median are shown
• “Whiskers” go to 1.5*IQR above and below the box
• An outlier is taken to be any value beyond the Whiskers
• Applying this to each of the size groups and removing
sizes 7&8 reduced the number of data points by 106
from 1,319 to 1,231
15IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com
Descriptive Statistics after Outliers Removed
Relative Total
Size Count Mean TrMean StDev Minimum Median Maximum
1. XXS 38 13.44 11.84 14.92 1.40 6.00 53.80
2. XS 147 13.08 12.31 10.12 0.90 10.30 44.80
3. S 388 10.62 10.13 7.24 0.40 8.60 32.70
4. M1 433 10.20 9.79 6.38 0.90 8.80 30.30
5. M2 178 12.50 11.89 8.78 0.80 10.15 38.90
6. L 28 8.64 8.20 7.04 1.00 6.30 27.70
16IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com
Data Distribution after Outliers Removed
444488884444000033332222222244441111666688880000
PDR (afp)
Dotplot of PDR (afp)
Each symbol represents up to 3 observations.
17IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com
How much data is required for Baseline &
Performance Measurement
The need in a baseline is to have sufficient data
points (n) such that their average will closely
estimate the population average .
One approach , based on the Central Limit Theorem
in statistical theory indicates:
• In general try to have n>30
• If data is highly skewed, ideally more data points
• If data is symmetric , less than 30 may suffice
5 data points was insufficient for establishing a
baseline in the case study
18IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com
How much data required for Baseline &
Performance Measurement
• In the Case Study, in addition to setting a
baseline, ABC wanted to determine if target
productivity was being met
• Statistically, the 95% Confidence Interval for the
true average of our performance is expressed as:
“ We are 95% certain that the true mean is
contained in the interval: CI=S±1.96σ/√n “
where:
S=sample mean, σ=sample standard deviation,
n=sample size
19IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com
Required Sample Sizes
For example, if we have 15 data points for size M1 projects, with average S,
we can be confident at a level of 95% that the true average of M1 projects
would be in the range of S±3
Therefore it is this range, not just the value of S which needs to be considered
If the productivity target is in the range then it has been achieved.
Standard
Deviation
Confidence
Interval
Size M1=6.38 95% 90%
± 1.0 159 112
± 1.5 72 51
± 2.0 42 30
± 2.5 28 20
± 3.0 20 15
± 3.5 16 11
± 4.0 13 9
Sample Size @
Confidence Level
20IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com
Baseline – Do segments make sense
In deciding what segmentation should be used in
establishing the baseline and subsequent
performance management, the question is
whether there is sufficient evidence that
performance is different in each segment.
Too much segmentation reduces the number of
data points in each segment which impacts the
Confidence Interval of the measurement, as
described earlier
21IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com
How to Determine if Segments Differ
6666.... LLLL5555.... MMMM22224444.... MMMM11113333.... SSSS2222.... XXXXSSSS1111.... XXXXXXXXSSSS
22220000....0000
11117777....5555
11115555....0000
11112222....5555
11110000....0000
7777....5555
5555....0000
Relative Size
PDR(afp)
5.90633
11.3651
11.2029
13.8004
9.59844
10.8039
9.90016
11.3457
11.426
14.7264
8.54161
18.3479
Interval Plot of PDR (afp)
99995555%%%% CCCCIIII ffffoooorrrr tttthhhheeee MMMMeeeeaaaannnn
Individual standard deviations were used to calculate the intervals.
22IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com
How to Determine if Segments Differ
• The Interval Plot indicates that S & M1 could be
combined
• The fact that XS and M2 are similar is
unexpected
• Possibly need to add further attributes to data
selection criteria
• The CI for XXS and L is too large to be useful
due to small numbers of data points ( 38 and 28
respectively)
23IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com
Recommendations
• Beware of basing conclusions on small numbers
of data points
• Check data for outliers
• Try and determine reason for outlier and do not
remove if likely to occur in your own data.
• Do not segment data unless you are confident
there is a real difference between segments.
• Your own data is always best. Industry data is a
valuable benchmark reference and can provide
data until you build up your own repository.
Measure what you do and improve

More Related Content

Viewers also liked

How to Scale your Analytics in a Maturing Organization
How to Scale your Analytics in a Maturing OrganizationHow to Scale your Analytics in a Maturing Organization
How to Scale your Analytics in a Maturing OrganizationKissmetrics on SlideShare
 
The Value of Infrastructure Asset Management
The Value of Infrastructure Asset ManagementThe Value of Infrastructure Asset Management
The Value of Infrastructure Asset ManagementC.S. Davidson
 
Measuring the value of your KM strategy
Measuring the value of your KM strategyMeasuring the value of your KM strategy
Measuring the value of your KM strategyChris Fletcher
 
CTO Summit 2016: Navigating Build vs. Buy at CleverTap
CTO Summit 2016: Navigating Build vs. Buy at CleverTapCTO Summit 2016: Navigating Build vs. Buy at CleverTap
CTO Summit 2016: Navigating Build vs. Buy at CleverTapCleverTap
 
Developing Metrics that Demonstrate the Value of Technical Communications
Developing Metrics that Demonstrate the Value of Technical CommunicationsDeveloping Metrics that Demonstrate the Value of Technical Communications
Developing Metrics that Demonstrate the Value of Technical CommunicationsVic Passion
 
Caught in a COTS Project - Strategies for Requirements Success 2011MAY17
Caught in a COTS Project - Strategies for Requirements Success 2011MAY17Caught in a COTS Project - Strategies for Requirements Success 2011MAY17
Caught in a COTS Project - Strategies for Requirements Success 2011MAY17Michael Roy
 
INPACT 2: How to avoid a failed project - Slidecast 2: Organisation’s capability
INPACT 2: How to avoid a failed project - Slidecast 2: Organisation’s capabilityINPACT 2: How to avoid a failed project - Slidecast 2: Organisation’s capability
INPACT 2: How to avoid a failed project - Slidecast 2: Organisation’s capabilityThe Imaginist Company
 
The future of software pricing excellence transaction pricing management
The future of software pricing excellence transaction pricing managementThe future of software pricing excellence transaction pricing management
The future of software pricing excellence transaction pricing managementVishal Sharma
 
Increasing the Business Value of Communications: Innovation, Strategy and Trust
Increasing the Business Value of Communications: Innovation, Strategy and TrustIncreasing the Business Value of Communications: Innovation, Strategy and Trust
Increasing the Business Value of Communications: Innovation, Strategy and TrustJeff Zwier
 
Next-generation integrated business planning: The Deloitte Perspective | Anaplan
Next-generation integrated business planning: The Deloitte Perspective | AnaplanNext-generation integrated business planning: The Deloitte Perspective | Anaplan
Next-generation integrated business planning: The Deloitte Perspective | AnaplanAnaplan
 
Anaplan and Valizant webinar: Improving your S&OP investment
Anaplan and Valizant webinar: Improving your S&OP investmentAnaplan and Valizant webinar: Improving your S&OP investment
Anaplan and Valizant webinar: Improving your S&OP investmentAnaplan
 

Viewers also liked (16)

How to Scale your Analytics in a Maturing Organization
How to Scale your Analytics in a Maturing OrganizationHow to Scale your Analytics in a Maturing Organization
How to Scale your Analytics in a Maturing Organization
 
The Value of Infrastructure Asset Management
The Value of Infrastructure Asset ManagementThe Value of Infrastructure Asset Management
The Value of Infrastructure Asset Management
 
Measuring the value of your KM strategy
Measuring the value of your KM strategyMeasuring the value of your KM strategy
Measuring the value of your KM strategy
 
CTO Summit 2016: Navigating Build vs. Buy at CleverTap
CTO Summit 2016: Navigating Build vs. Buy at CleverTapCTO Summit 2016: Navigating Build vs. Buy at CleverTap
CTO Summit 2016: Navigating Build vs. Buy at CleverTap
 
Developing Metrics that Demonstrate the Value of Technical Communications
Developing Metrics that Demonstrate the Value of Technical CommunicationsDeveloping Metrics that Demonstrate the Value of Technical Communications
Developing Metrics that Demonstrate the Value of Technical Communications
 
Green - sizing for estimating, measurement and benchmarking
Green - sizing for estimating, measurement and benchmarkingGreen - sizing for estimating, measurement and benchmarking
Green - sizing for estimating, measurement and benchmarking
 
Caught in a COTS Project - Strategies for Requirements Success 2011MAY17
Caught in a COTS Project - Strategies for Requirements Success 2011MAY17Caught in a COTS Project - Strategies for Requirements Success 2011MAY17
Caught in a COTS Project - Strategies for Requirements Success 2011MAY17
 
INPACT 2: How to avoid a failed project - Slidecast 2: Organisation’s capability
INPACT 2: How to avoid a failed project - Slidecast 2: Organisation’s capabilityINPACT 2: How to avoid a failed project - Slidecast 2: Organisation’s capability
INPACT 2: How to avoid a failed project - Slidecast 2: Organisation’s capability
 
Org Design
Org DesignOrg Design
Org Design
 
Maximizing ROI In Product Development
Maximizing ROI In Product DevelopmentMaximizing ROI In Product Development
Maximizing ROI In Product Development
 
The future of software pricing excellence transaction pricing management
The future of software pricing excellence transaction pricing managementThe future of software pricing excellence transaction pricing management
The future of software pricing excellence transaction pricing management
 
Increasing the Business Value of Communications: Innovation, Strategy and Trust
Increasing the Business Value of Communications: Innovation, Strategy and TrustIncreasing the Business Value of Communications: Innovation, Strategy and Trust
Increasing the Business Value of Communications: Innovation, Strategy and Trust
 
FSI_Third Party Risk Management_Deloitte PoV
FSI_Third Party Risk Management_Deloitte PoVFSI_Third Party Risk Management_Deloitte PoV
FSI_Third Party Risk Management_Deloitte PoV
 
Next-generation integrated business planning: The Deloitte Perspective | Anaplan
Next-generation integrated business planning: The Deloitte Perspective | AnaplanNext-generation integrated business planning: The Deloitte Perspective | Anaplan
Next-generation integrated business planning: The Deloitte Perspective | Anaplan
 
Zero budgeting
Zero budgetingZero budgeting
Zero budgeting
 
Anaplan and Valizant webinar: Improving your S&OP investment
Anaplan and Valizant webinar: Improving your S&OP investmentAnaplan and Valizant webinar: Improving your S&OP investment
Anaplan and Valizant webinar: Improving your S&OP investment
 

Similar to Ogilvie - Beyond the statistical average

Leveragin research, behavioural and demeographic data
Leveragin research, behavioural and demeographic dataLeveragin research, behavioural and demeographic data
Leveragin research, behavioural and demeographic dataMRS
 
Step by Step guide to executing an analytics project
Step by Step guide to executing an analytics projectStep by Step guide to executing an analytics project
Step by Step guide to executing an analytics projectRamkumar Ravichandran
 
Data Analysis by Multimedia University
Data Analysis by Multimedia UniversityData Analysis by Multimedia University
Data Analysis by Multimedia Universitysitecmy
 
Forward-Looking ALLL: Computing Qualitative Adjustments
Forward-Looking ALLL: Computing Qualitative AdjustmentsForward-Looking ALLL: Computing Qualitative Adjustments
Forward-Looking ALLL: Computing Qualitative AdjustmentsLibby Bierman
 
Data Integration and Marketing Attribution
Data Integration and Marketing Attribution Data Integration and Marketing Attribution
Data Integration and Marketing Attribution ROIVENUE™
 
Business and Data Analytics Collaborative April Meetup
Business and Data Analytics Collaborative April MeetupBusiness and Data Analytics Collaborative April Meetup
Business and Data Analytics Collaborative April MeetupKen Tucker
 
Customer Decision Support System
Customer Decision Support SystemCustomer Decision Support System
Customer Decision Support SystemIRJET Journal
 
Machine learning for customer classification
Machine learning for customer classificationMachine learning for customer classification
Machine learning for customer classificationAndrew Barnes
 
How to Use Data for Product Decisions by YouTube Product Manager
How to Use Data for Product Decisions by YouTube Product ManagerHow to Use Data for Product Decisions by YouTube Product Manager
How to Use Data for Product Decisions by YouTube Product ManagerProduct School
 
ASMD 2022 for class.pptx
ASMD 2022 for class.pptxASMD 2022 for class.pptx
ASMD 2022 for class.pptxMahekSinghania2
 
Descriptive Statistics
Descriptive StatisticsDescriptive Statistics
Descriptive StatisticsCIToolkit
 
Basic Analytics Module for Sponsors
Basic Analytics Module for SponsorsBasic Analytics Module for Sponsors
Basic Analytics Module for SponsorsDee Daley
 
Principles of Modern Marketing at NetSuite - Rob Israch (TOPO Demand Generati...
Principles of Modern Marketing at NetSuite - Rob Israch (TOPO Demand Generati...Principles of Modern Marketing at NetSuite - Rob Israch (TOPO Demand Generati...
Principles of Modern Marketing at NetSuite - Rob Israch (TOPO Demand Generati...TOPO
 
Iwsm2014 importance of benchmarking (john ogilvie & harold van heeringen)
Iwsm2014   importance of benchmarking (john ogilvie & harold van heeringen)Iwsm2014   importance of benchmarking (john ogilvie & harold van heeringen)
Iwsm2014 importance of benchmarking (john ogilvie & harold van heeringen)Nesma
 
The importance of benchmarking software projects - Van Heeringen and Ogilvie
The importance of benchmarking software projects - Van Heeringen and OgilvieThe importance of benchmarking software projects - Van Heeringen and Ogilvie
The importance of benchmarking software projects - Van Heeringen and OgilvieHarold van Heeringen
 
desai_wharton2002
desai_wharton2002desai_wharton2002
desai_wharton2002Vijay Desai
 

Similar to Ogilvie - Beyond the statistical average (20)

De la fuente and Castelo - Software Rates vs cost per Function Point: a cost ...
De la fuente and Castelo - Software Rates vs cost per Function Point: a cost ...De la fuente and Castelo - Software Rates vs cost per Function Point: a cost ...
De la fuente and Castelo - Software Rates vs cost per Function Point: a cost ...
 
Leveragin research, behavioural and demeographic data
Leveragin research, behavioural and demeographic dataLeveragin research, behavioural and demeographic data
Leveragin research, behavioural and demeographic data
 
Step by Step guide to executing an analytics project
Step by Step guide to executing an analytics projectStep by Step guide to executing an analytics project
Step by Step guide to executing an analytics project
 
Data Analysis by Multimedia University
Data Analysis by Multimedia UniversityData Analysis by Multimedia University
Data Analysis by Multimedia University
 
Forward-Looking ALLL: Computing Qualitative Adjustments
Forward-Looking ALLL: Computing Qualitative AdjustmentsForward-Looking ALLL: Computing Qualitative Adjustments
Forward-Looking ALLL: Computing Qualitative Adjustments
 
Data Integration and Marketing Attribution
Data Integration and Marketing Attribution Data Integration and Marketing Attribution
Data Integration and Marketing Attribution
 
Business and Data Analytics Collaborative April Meetup
Business and Data Analytics Collaborative April MeetupBusiness and Data Analytics Collaborative April Meetup
Business and Data Analytics Collaborative April Meetup
 
Customer Decision Support System
Customer Decision Support SystemCustomer Decision Support System
Customer Decision Support System
 
Machine learning for customer classification
Machine learning for customer classificationMachine learning for customer classification
Machine learning for customer classification
 
How to Use Data for Product Decisions by YouTube Product Manager
How to Use Data for Product Decisions by YouTube Product ManagerHow to Use Data for Product Decisions by YouTube Product Manager
How to Use Data for Product Decisions by YouTube Product Manager
 
Experiences with indicators
Experiences with indicatorsExperiences with indicators
Experiences with indicators
 
ASMD 2022 for class.pptx
ASMD 2022 for class.pptxASMD 2022 for class.pptx
ASMD 2022 for class.pptx
 
Descriptive Statistics
Descriptive StatisticsDescriptive Statistics
Descriptive Statistics
 
Basic Analytics Module for Sponsors
Basic Analytics Module for SponsorsBasic Analytics Module for Sponsors
Basic Analytics Module for Sponsors
 
Yu info 2015 final jg
Yu info 2015 final jgYu info 2015 final jg
Yu info 2015 final jg
 
Principles of Modern Marketing at NetSuite - Rob Israch (TOPO Demand Generati...
Principles of Modern Marketing at NetSuite - Rob Israch (TOPO Demand Generati...Principles of Modern Marketing at NetSuite - Rob Israch (TOPO Demand Generati...
Principles of Modern Marketing at NetSuite - Rob Israch (TOPO Demand Generati...
 
Iwsm2014 importance of benchmarking (john ogilvie & harold van heeringen)
Iwsm2014   importance of benchmarking (john ogilvie & harold van heeringen)Iwsm2014   importance of benchmarking (john ogilvie & harold van heeringen)
Iwsm2014 importance of benchmarking (john ogilvie & harold van heeringen)
 
The importance of benchmarking software projects - Van Heeringen and Ogilvie
The importance of benchmarking software projects - Van Heeringen and OgilvieThe importance of benchmarking software projects - Van Heeringen and Ogilvie
The importance of benchmarking software projects - Van Heeringen and Ogilvie
 
Strategic direction setting
Strategic direction settingStrategic direction setting
Strategic direction setting
 
desai_wharton2002
desai_wharton2002desai_wharton2002
desai_wharton2002
 

More from International Software Benchmarking Standards Group (ISBSG)

More from International Software Benchmarking Standards Group (ISBSG) (9)

Galorath - Why can't people estimate
Galorath - Why can't people estimateGalorath - Why can't people estimate
Galorath - Why can't people estimate
 
Furuyama - analysis of factors that affect productivity
Furuyama - analysis of factors that affect productivityFuruyama - analysis of factors that affect productivity
Furuyama - analysis of factors that affect productivity
 
Hill - Are we really bad? A look at software estimation accuracy
Hill - Are we really bad? A look at software estimation accuracyHill - Are we really bad? A look at software estimation accuracy
Hill - Are we really bad? A look at software estimation accuracy
 
Minkiewicz - Lessons Learned from the ISBSG Database
Minkiewicz - Lessons Learned from the ISBSG DatabaseMinkiewicz - Lessons Learned from the ISBSG Database
Minkiewicz - Lessons Learned from the ISBSG Database
 
Bertazo et al - Application Lifecycle Management and process monitoring throu...
Bertazo et al - Application Lifecycle Management and process monitoring throu...Bertazo et al - Application Lifecycle Management and process monitoring throu...
Bertazo et al - Application Lifecycle Management and process monitoring throu...
 
Silveira - KPIs used in a 6,000 Function Points Program
Silveira - KPIs used in a 6,000 Function Points ProgramSilveira - KPIs used in a 6,000 Function Points Program
Silveira - KPIs used in a 6,000 Function Points Program
 
Schofield - Using Benchmarks to Accelerate Process Improvement
Schofield - Using Benchmarks to Accelerate Process ImprovementSchofield - Using Benchmarks to Accelerate Process Improvement
Schofield - Using Benchmarks to Accelerate Process Improvement
 
Dekkers, T. - Software Estimation – The next level
Dekkers, T. - Software Estimation – The next levelDekkers, T. - Software Estimation – The next level
Dekkers, T. - Software Estimation – The next level
 
S Woodward - What is your quest for software analytics
S Woodward - What is your quest for software analyticsS Woodward - What is your quest for software analytics
S Woodward - What is your quest for software analytics
 

Recently uploaded

Neha Jhalani Hiranandani: A Guide to Her Life and Career
Neha Jhalani Hiranandani: A Guide to Her Life and CareerNeha Jhalani Hiranandani: A Guide to Her Life and Career
Neha Jhalani Hiranandani: A Guide to Her Life and Careerr98588472
 
Intermediate Accounting, Volume 2, 13th Canadian Edition by Donald E. Kieso t...
Intermediate Accounting, Volume 2, 13th Canadian Edition by Donald E. Kieso t...Intermediate Accounting, Volume 2, 13th Canadian Edition by Donald E. Kieso t...
Intermediate Accounting, Volume 2, 13th Canadian Edition by Donald E. Kieso t...ssuserf63bd7
 
WSMM Technology February.March Newsletter_vF.pdf
WSMM Technology February.March Newsletter_vF.pdfWSMM Technology February.March Newsletter_vF.pdf
WSMM Technology February.March Newsletter_vF.pdfJamesConcepcion7
 
Pitch Deck Teardown: Xpanceo's $40M Seed deck
Pitch Deck Teardown: Xpanceo's $40M Seed deckPitch Deck Teardown: Xpanceo's $40M Seed deck
Pitch Deck Teardown: Xpanceo's $40M Seed deckHajeJanKamps
 
Planetary and Vedic Yagyas Bring Positive Impacts in Life
Planetary and Vedic Yagyas Bring Positive Impacts in LifePlanetary and Vedic Yagyas Bring Positive Impacts in Life
Planetary and Vedic Yagyas Bring Positive Impacts in LifeBhavana Pujan Kendra
 
5-Step Framework to Convert Any Business into a Wealth Generation Machine.pdf
5-Step Framework to Convert Any Business into a Wealth Generation Machine.pdf5-Step Framework to Convert Any Business into a Wealth Generation Machine.pdf
5-Step Framework to Convert Any Business into a Wealth Generation Machine.pdfSherl Simon
 
Technical Leaders - Working with the Management Team
Technical Leaders - Working with the Management TeamTechnical Leaders - Working with the Management Team
Technical Leaders - Working with the Management TeamArik Fletcher
 
Rakhi sets symbolizing the bond of love.pptx
Rakhi sets symbolizing the bond of love.pptxRakhi sets symbolizing the bond of love.pptx
Rakhi sets symbolizing the bond of love.pptxRakhi Bazaar
 
Excvation Safety for safety officers reference
Excvation Safety for safety officers referenceExcvation Safety for safety officers reference
Excvation Safety for safety officers referencessuser2c065e
 
Paul Turovsky - Real Estate Professional
Paul Turovsky - Real Estate ProfessionalPaul Turovsky - Real Estate Professional
Paul Turovsky - Real Estate ProfessionalPaul Turovsky
 
Strategic Project Finance Essentials: A Project Manager’s Guide to Financial ...
Strategic Project Finance Essentials: A Project Manager’s Guide to Financial ...Strategic Project Finance Essentials: A Project Manager’s Guide to Financial ...
Strategic Project Finance Essentials: A Project Manager’s Guide to Financial ...Aggregage
 
WSMM Media and Entertainment Feb_March_Final.pdf
WSMM Media and Entertainment Feb_March_Final.pdfWSMM Media and Entertainment Feb_March_Final.pdf
WSMM Media and Entertainment Feb_March_Final.pdfJamesConcepcion7
 
Simplify Your Funding: Quick and Easy Business Loans
Simplify Your Funding: Quick and Easy Business LoansSimplify Your Funding: Quick and Easy Business Loans
Simplify Your Funding: Quick and Easy Business LoansNugget Global
 
Introducing the Analogic framework for business planning applications
Introducing the Analogic framework for business planning applicationsIntroducing the Analogic framework for business planning applications
Introducing the Analogic framework for business planning applicationsKnowledgeSeed
 
Go for Rakhi Bazaar and Pick the Latest Bhaiya Bhabhi Rakhi.pptx
Go for Rakhi Bazaar and Pick the Latest Bhaiya Bhabhi Rakhi.pptxGo for Rakhi Bazaar and Pick the Latest Bhaiya Bhabhi Rakhi.pptx
Go for Rakhi Bazaar and Pick the Latest Bhaiya Bhabhi Rakhi.pptxRakhi Bazaar
 
Fundamentals Welcome and Inclusive DEIB
Fundamentals Welcome and  Inclusive DEIBFundamentals Welcome and  Inclusive DEIB
Fundamentals Welcome and Inclusive DEIBGregory DeShields
 
Ivey Leveraging Information Technology MBA 2024
Ivey Leveraging Information Technology MBA 2024Ivey Leveraging Information Technology MBA 2024
Ivey Leveraging Information Technology MBA 2024Nihal Nishadul
 
Jewish Resources in the Family Resource Centre
Jewish Resources in the Family Resource CentreJewish Resources in the Family Resource Centre
Jewish Resources in the Family Resource CentreNZSG
 
Entrepreneurial ecosystem- Wider context
Entrepreneurial ecosystem- Wider contextEntrepreneurial ecosystem- Wider context
Entrepreneurial ecosystem- Wider contextP&CO
 

Recently uploaded (20)

Neha Jhalani Hiranandani: A Guide to Her Life and Career
Neha Jhalani Hiranandani: A Guide to Her Life and CareerNeha Jhalani Hiranandani: A Guide to Her Life and Career
Neha Jhalani Hiranandani: A Guide to Her Life and Career
 
Intermediate Accounting, Volume 2, 13th Canadian Edition by Donald E. Kieso t...
Intermediate Accounting, Volume 2, 13th Canadian Edition by Donald E. Kieso t...Intermediate Accounting, Volume 2, 13th Canadian Edition by Donald E. Kieso t...
Intermediate Accounting, Volume 2, 13th Canadian Edition by Donald E. Kieso t...
 
WSMM Technology February.March Newsletter_vF.pdf
WSMM Technology February.March Newsletter_vF.pdfWSMM Technology February.March Newsletter_vF.pdf
WSMM Technology February.March Newsletter_vF.pdf
 
Pitch Deck Teardown: Xpanceo's $40M Seed deck
Pitch Deck Teardown: Xpanceo's $40M Seed deckPitch Deck Teardown: Xpanceo's $40M Seed deck
Pitch Deck Teardown: Xpanceo's $40M Seed deck
 
Planetary and Vedic Yagyas Bring Positive Impacts in Life
Planetary and Vedic Yagyas Bring Positive Impacts in LifePlanetary and Vedic Yagyas Bring Positive Impacts in Life
Planetary and Vedic Yagyas Bring Positive Impacts in Life
 
5-Step Framework to Convert Any Business into a Wealth Generation Machine.pdf
5-Step Framework to Convert Any Business into a Wealth Generation Machine.pdf5-Step Framework to Convert Any Business into a Wealth Generation Machine.pdf
5-Step Framework to Convert Any Business into a Wealth Generation Machine.pdf
 
Technical Leaders - Working with the Management Team
Technical Leaders - Working with the Management TeamTechnical Leaders - Working with the Management Team
Technical Leaders - Working with the Management Team
 
Authentically Social - presented by Corey Perlman
Authentically Social - presented by Corey PerlmanAuthentically Social - presented by Corey Perlman
Authentically Social - presented by Corey Perlman
 
Rakhi sets symbolizing the bond of love.pptx
Rakhi sets symbolizing the bond of love.pptxRakhi sets symbolizing the bond of love.pptx
Rakhi sets symbolizing the bond of love.pptx
 
Excvation Safety for safety officers reference
Excvation Safety for safety officers referenceExcvation Safety for safety officers reference
Excvation Safety for safety officers reference
 
Paul Turovsky - Real Estate Professional
Paul Turovsky - Real Estate ProfessionalPaul Turovsky - Real Estate Professional
Paul Turovsky - Real Estate Professional
 
Strategic Project Finance Essentials: A Project Manager’s Guide to Financial ...
Strategic Project Finance Essentials: A Project Manager’s Guide to Financial ...Strategic Project Finance Essentials: A Project Manager’s Guide to Financial ...
Strategic Project Finance Essentials: A Project Manager’s Guide to Financial ...
 
WSMM Media and Entertainment Feb_March_Final.pdf
WSMM Media and Entertainment Feb_March_Final.pdfWSMM Media and Entertainment Feb_March_Final.pdf
WSMM Media and Entertainment Feb_March_Final.pdf
 
Simplify Your Funding: Quick and Easy Business Loans
Simplify Your Funding: Quick and Easy Business LoansSimplify Your Funding: Quick and Easy Business Loans
Simplify Your Funding: Quick and Easy Business Loans
 
Introducing the Analogic framework for business planning applications
Introducing the Analogic framework for business planning applicationsIntroducing the Analogic framework for business planning applications
Introducing the Analogic framework for business planning applications
 
Go for Rakhi Bazaar and Pick the Latest Bhaiya Bhabhi Rakhi.pptx
Go for Rakhi Bazaar and Pick the Latest Bhaiya Bhabhi Rakhi.pptxGo for Rakhi Bazaar and Pick the Latest Bhaiya Bhabhi Rakhi.pptx
Go for Rakhi Bazaar and Pick the Latest Bhaiya Bhabhi Rakhi.pptx
 
Fundamentals Welcome and Inclusive DEIB
Fundamentals Welcome and  Inclusive DEIBFundamentals Welcome and  Inclusive DEIB
Fundamentals Welcome and Inclusive DEIB
 
Ivey Leveraging Information Technology MBA 2024
Ivey Leveraging Information Technology MBA 2024Ivey Leveraging Information Technology MBA 2024
Ivey Leveraging Information Technology MBA 2024
 
Jewish Resources in the Family Resource Centre
Jewish Resources in the Family Resource CentreJewish Resources in the Family Resource Centre
Jewish Resources in the Family Resource Centre
 
Entrepreneurial ecosystem- Wider context
Entrepreneurial ecosystem- Wider contextEntrepreneurial ecosystem- Wider context
Entrepreneurial ecosystem- Wider context
 

Ogilvie - Beyond the statistical average

  • 1. http://itconfidence2013.wordpress.com Beyond The Statistical Average 2°°°°International Conference on IT Data collection, Analysis and Benchmarking Tokyo (Japan) - October 22, 2014 John Ogilvie CEO, ISBSGCEO, ISBSG BeyInsert here a pictureB The KISIS Principle “Keeping it Simple is Stupid”
  • 2. 2IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com Beyond the Statistical Average Goals of the presentation G1. Understand the characteristics of the data available to establish a productivity baseline G2. Statistical considerations in establishing a productivity baseline G3. Statistical considerations in measuring actual performance against a baseline
  • 3. http://itconfidence2013.wordpress.com Case Study • ABC company is outsourcing their application development and maintenance. • They wish to establish a set of targets for annual improvements in productivity based on what they were achieving internally prior to outsourcing. • The contract with the vendor specified shared risk/reward • bonus/penalty payments for over/under achievement against targets.
  • 4. http://itconfidence2013.wordpress.com Case Study • For each of New Development and Application Enhancement projects ABC required 28 ( 4 technologies X 7 FP size bands) performance baselines in Hours/Function Point • Annual % improvement targets were specified • ABC had data from 128 internal projects. If there were at least 5 in a particular segment, the baseline was set as the average. • Otherwise an industry data average was used.
  • 5. http://itconfidence2013.wordpress.com Case Study • At the end of each quarter, the average actual performance in each segment was calculated and the bonus/penalty rules applied • No minimum number of data points for the calculation was specified • In many cases only 1 or 2 actual projects in each category After 12 months there was considerable conflict between ABC and vendor with ABC threatening legal action and vendor threatening to exit contract . • Both over and under achievements were challenged
  • 6. 6IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com What Questions arise in Case Study What are the characteristics of the data we have? • Shape of distribution • Handling of Outliers Baseline: • How much data is required • Do performance segments make sense Measurement: • How do we determine how productivity has changed • How much measurement data is required
  • 7. http://itconfidence2013.wordpress.com Data Used in This Presentation For the purposes of this presentation, data was extracted from the ISBSG Development and Enhancement Repository. • Data Quality Rating: A or B • Development Type: Enhancement • Count Method: IFPUG • Application Group: Business Application • Development Platform: Mainframe/Midrange/Multi Analyses and tables where produced using Minitab Statistical Software
  • 8. 8IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com ISBSG Relative Sizes Categorises the Functional Size by relative sizes as follows: Relative Size Functional Size 1. XXS Extra-extra-small => 0 and <10 2. XS Extra-small => 10 and <30 3. S Small => 30 and <100 4. M1 Medium1 => 100 and <300 5. M2 Medium2 => 300 and <1000 6. L Large => 1,000 and < 3,000 7. XL Extra-large => 3,000 and < 9,000 8. XXL Extra-extra-large => 9,000 and < 18,000 9. XXXL Extra-extra-extra-large => 18,000
  • 9. 9IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com Examine the Data: Descriptive Statistics Relative Total Size Count Mean TrMean StDev Minimum Median Maximum 1. XXS 43 27.01 19.84 45.27 1.40 7.90 236.30 2. XS 157 19.06 14.25 29.83 0.90 10.90 271.60 3. S 424 16.34 12.23 30.27 0.40 10.00 424.90 4. M1 470 13.19 11.47 12.67 0.90 9.60 97.90 5. M2 187 14.52 13.10 12.68 0.80 11.10 80.70 6. L 31 11.69 10.25 11.63 1.00 9.10 42.90 7. XL 4 1.50 * 1.34 0.10 1.30 3.30 8. XXL 2 0.35 * 0.21 0.20 0.35 0.50
  • 10. 10IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com Examine the Data 444422220000333366660000333300000000222244440000111188880000111122220000666600000000 PDR (afp) Dotplot of PDR (afp) Each symbol represents up to 17 observations.
  • 11. http://itconfidence2013.wordpress.com Examine the Data 444422220000333366660000333300000000222244440000111188880000111122220000666600000000 1111.... XXXXXXXXSSSS 2222.... XXXXSSSS 3333.... SSSS 4444.... MMMM1111 5555.... MMMM2222 6666.... LLLL 7777.... XXXXLLLL 8888.... XXXXXXXXLLLL PDR (afp) RelativeSize Dotplot of PDR (afp) Each symbol represents up to 16 observations.
  • 12. 12IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com Handling Outliers An outlier is an unusually large or small observation. Outliers can have a disproportionate influence on statistical results, such as the mean, which can result in misleading interpretations A variety of techniques can be used • Trim the data by removing the top and bottom 5% - simple to do • Remove data more than 2 standard deviations from the mean ( simple to do but assumes data has normal distribution) • Statistical test that all values in the sample are from the same, normally distributed population. ( Need a tool and assumes data has normal distribution) • Graphically using a Boxplot
  • 14. 14IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com Boxplot • “Box” shows values in from Quartile 1(Q1) to Quartile 3(Q3) • Inter Quartile Range (IQR) is from Q1 to Q3. • Value is Q3 – Q1 • Mean and Median are shown • “Whiskers” go to 1.5*IQR above and below the box • An outlier is taken to be any value beyond the Whiskers • Applying this to each of the size groups and removing sizes 7&8 reduced the number of data points by 106 from 1,319 to 1,231
  • 15. 15IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com Descriptive Statistics after Outliers Removed Relative Total Size Count Mean TrMean StDev Minimum Median Maximum 1. XXS 38 13.44 11.84 14.92 1.40 6.00 53.80 2. XS 147 13.08 12.31 10.12 0.90 10.30 44.80 3. S 388 10.62 10.13 7.24 0.40 8.60 32.70 4. M1 433 10.20 9.79 6.38 0.90 8.80 30.30 5. M2 178 12.50 11.89 8.78 0.80 10.15 38.90 6. L 28 8.64 8.20 7.04 1.00 6.30 27.70
  • 16. 16IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com Data Distribution after Outliers Removed 444488884444000033332222222244441111666688880000 PDR (afp) Dotplot of PDR (afp) Each symbol represents up to 3 observations.
  • 17. 17IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com How much data is required for Baseline & Performance Measurement The need in a baseline is to have sufficient data points (n) such that their average will closely estimate the population average . One approach , based on the Central Limit Theorem in statistical theory indicates: • In general try to have n>30 • If data is highly skewed, ideally more data points • If data is symmetric , less than 30 may suffice 5 data points was insufficient for establishing a baseline in the case study
  • 18. 18IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com How much data required for Baseline & Performance Measurement • In the Case Study, in addition to setting a baseline, ABC wanted to determine if target productivity was being met • Statistically, the 95% Confidence Interval for the true average of our performance is expressed as: “ We are 95% certain that the true mean is contained in the interval: CI=S±1.96σ/√n “ where: S=sample mean, σ=sample standard deviation, n=sample size
  • 19. 19IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com Required Sample Sizes For example, if we have 15 data points for size M1 projects, with average S, we can be confident at a level of 95% that the true average of M1 projects would be in the range of S±3 Therefore it is this range, not just the value of S which needs to be considered If the productivity target is in the range then it has been achieved. Standard Deviation Confidence Interval Size M1=6.38 95% 90% ± 1.0 159 112 ± 1.5 72 51 ± 2.0 42 30 ± 2.5 28 20 ± 3.0 20 15 ± 3.5 16 11 ± 4.0 13 9 Sample Size @ Confidence Level
  • 20. 20IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com Baseline – Do segments make sense In deciding what segmentation should be used in establishing the baseline and subsequent performance management, the question is whether there is sufficient evidence that performance is different in each segment. Too much segmentation reduces the number of data points in each segment which impacts the Confidence Interval of the measurement, as described earlier
  • 21. 21IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com How to Determine if Segments Differ 6666.... LLLL5555.... MMMM22224444.... MMMM11113333.... SSSS2222.... XXXXSSSS1111.... XXXXXXXXSSSS 22220000....0000 11117777....5555 11115555....0000 11112222....5555 11110000....0000 7777....5555 5555....0000 Relative Size PDR(afp) 5.90633 11.3651 11.2029 13.8004 9.59844 10.8039 9.90016 11.3457 11.426 14.7264 8.54161 18.3479 Interval Plot of PDR (afp) 99995555%%%% CCCCIIII ffffoooorrrr tttthhhheeee MMMMeeeeaaaannnn Individual standard deviations were used to calculate the intervals.
  • 22. 22IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com How to Determine if Segments Differ • The Interval Plot indicates that S & M1 could be combined • The fact that XS and M2 are similar is unexpected • Possibly need to add further attributes to data selection criteria • The CI for XXS and L is too large to be useful due to small numbers of data points ( 38 and 28 respectively)
  • 23. 23IT Confidence 2014 – October 22, 2014 http://itconfidence2014.wordpress.com Recommendations • Beware of basing conclusions on small numbers of data points • Check data for outliers • Try and determine reason for outlier and do not remove if likely to occur in your own data. • Do not segment data unless you are confident there is a real difference between segments. • Your own data is always best. Industry data is a valuable benchmark reference and can provide data until you build up your own repository. Measure what you do and improve