52 - The Impact of Test Ownership and Team Structure on the Reliability and Effectiveness of Quality Test Runs

The Impact of
Test Ownership and Team Structure on
Test Reliability and Effectiveness
Kim Herzig, Nachiappan Nagappan

Improving Development Processes
Quality / Risk
$
Speed
R
Cost
Development Environment
(should be well balanced)
Product / Service
Microsoft aims for shorter release cycles
• Speed up development processes (e.g. code velocity)
• Maintaining / increasing product quality
Empirical data to support & drive decisions
Joint effort by MSR & product teams
• MSR Cambridge: Brendan Murphy, Kim Herzig
• MSR Redmond: Tom Zimmermann, Chris Bird, Nachiappan Nagappan
• TSE Redmond: Jacek Czerwonka, Michaela Greiler
• Windows, Windows Phone, Office, Dynamics product teams
Technology
changes
Legacy
changes
New product
features

A Simple Test Scenario
Development branch
 Each developer runs “his” tests
 Most tests check functional correctness
 Mostly pre-check-in verification
 Failures are isolated: no impact on other engineers

Let’s go big: Complex Test Scenario for Windows
Version Development Control Branch Tree
Branch level 3
Branch level 2
Branch level 1
Branch level 0
branch
Multi-area
branch
Multi-component
branch
main trunk
branch
Quality gate
(component testing)
Quality gate
(component testing)
Quality gate
(component testing)
• Developer have to pass quality gates (system tests)
• Many tests check system constraints (e.g. compatibility or performance)
• Failures not isolated (involve human inspections, causes development freeze for corresponding branch, etc.)

Test Effectiveness & Reliability
Effectiveness
Test suite effectivenesslow
• Number of bugs found and fixes due to test failures (NumBugs)
• We trace test failures to bug report to bug fixes. Using CODEMINE [1].
Reliability
high
High cost,
unknown value
• Bug detection ratio per test suite run (BugsPerExec)
$$$$
High cost,
low value
$$$$
• Eliminates correlation to number of executions.
• However, absolute number for more important as each defect might be cause a disaster.
Test suite Low reliability
cost,
good value
• Number of false test failures (FpPerExec)
$$
Low cost,
low value
$
high low
• Test failures due to any other reason than code defects, e.g. test and infrastructure issues.
Test execution
fails
True positive
failure
False positive
failure
Mapped to
bug report?
yes
Bug report
fixed?
yes
Resolved via
code change?
yes
no
[1] Czerwonka, J., Nagappan, N., Schulte, W., and Murphy, B. CODEMINE: Building a Software Development Data Analytics Platform at Microsoft. Software, IEEE, 30, 4 (2013), 64--71.

Distributed Test Ownership
Test sui te
Test case 1
Test case 2
Test case 3
Test case n
Quality gates combine tests from different sources
• Side effects between test cases
• Limited synchronization over new tests and their interactions.
Tests have impact on other teams
• Engineers might not know what the test is doing
• Low quality tests slow down product development
Test evolve over time
• Is the test still up-to-date? Can we remove it?
• Who wrote the test and is that person still in our organization?
• Who is maintaining the test right now?
Ownership versus Responsibility: Engineers may own the test step but not the quality gate

Mapping Ownership to Organization Tree
Test sui te
Metric name Short description
Number of contributors
NumOwner The number of distinct engineers that own at least one test in the test suite.
NumOwnerLeftOrg
The number of distinct engineers that own at least one test in the test suite and that are no longer in
the organizational structure of Microsoft (left the company).
PCOwnerLeftOg
The relative number of distinct developers that own at least one test case in the test suite but are no
longer part of the organizational structure: 푁푢푚푂푛푤푒푟퐿푒푓푡퐶표푚푝표푛푦 푁푢푚푂푤푛푒푟.
Number of managers
NumL2
The number of level two (L2) managers whose corresponding sub-trees contain all engineers that
own at least one test case of the test suite.
NumL3
The number of level three (L3) managers whose corresponding sub-trees contain all engineers that
NumL4
The number of level four (L4) managers whose corresponding sub-trees contain all engineers that
NumL5
The number of level five (L5) managers whose corresponding sub-trees contain all engineers that
HighestCommonManager
The highest manager level (L3 > L2) whose sub-organization contains all engineers that own at least
one test case of the test suite.
Organizational communication paths
LongestDevDistance
The longest distance between two developers in the organizational sub-tree spanned by test
contributors. The distance between two developers is measured by the length of the shortest path
between the two corresponding organizational sub-tree vertices.
MedianDevDistance
The median of distances (length of shortest paths) between all pairs of developers contributing test
content to a test suite.
MeanDevDistance
The mean of distances (length of shortest paths) between all pairs of developers contributing test
MeanPathLength The mean length of all paths between test owners.
SizeOfLargestClique
Number of vertices of organizational sub-graph such that any two test owners are connected via a
single edge.
Number of aliases
NumAliasGroups
The number of distinct mailing lists associated with test cases of the test suite. Does not include the
number of system aliases (NumSystemAlias).
MedianAliasGroupSize The median number of mailing list members referenced by mailing lists registered as test owners.
NumSystemAlias
The number of distinct aliases that point to a system alias—not to a mail distribution list. These
aliases are real system accounts.

Mapping Ownership to Organization Tree
Metric name Short description
Number of contributors
NumOwner The number of distinct engineers that own at least one test in the test suite.
NumOwnerLeftOrg
The number of distinct engineers that own at least one test in the test suite and that are no longer in
the organizational structure of Microsoft (left the company).
PCOwnerLeftOg
The relative number of distinct developers that own at least one test case in the test suite but are no
longer part of the organizational structure: 푁푢푚푂푛푤푒푟퐿푒푓푡퐶표푚푝표푛푦 푁푢푚푂푤푛푒푟.
Number of managers
NumL2
The number of level two (L2) managers whose corresponding sub-trees contain all engineers that
NumL3
The number of level three (L3) managers whose corresponding sub-trees contain all engineers that
NumL4
The number of level four (L4) managers whose corresponding sub-trees contain all engineers that
NumL5
The number of level five (L5) managers whose corresponding sub-trees contain all engineers that
HighestCommonManager
The highest manager level (L3 > L2) whose sub-organization contains all engineers that own at least
one test case of the test suite.
Organizational communication paths
LongestDevDistance
The longest distance between two developers in the organizational sub-tree spanned by test
contributors. The distance between two developers is measured by the length of the shortest path
between the two corresponding organizational sub-tree vertices.
MedianDevDistance
The median of distances (length of shortest paths) between all pairs of developers contributing test
MeanDevDistance
The mean of distances (length of shortest paths) between all pairs of developers contributing test
MeanPathLength The mean length of all paths between test owners.
SizeOfLargestClique
Number of vertices of organizational sub-graph such that any two test owners are connected via a
single edge.
Number of aliases
NumAliasGroups
The number of distinct mailing lists associated with test cases of the test suite. Does not include the
number of system aliases (NumSystemAlias).
MedianAliasGroupSize The median number of mailing list members referenced by mailing lists registered as test owners.
NumSystemAlias
The number of distinct aliases that point to a system alias—not to a mail distribution list. These
aliases are real system accounts.

Test Ownership
Does organizational structure of test owners impact
the effectiveness or reliability of tests?
Similar: Distributed code ownership impacts code quality!
Results for 8.1

Correlations
Team ownership Scattered ownership
More effective
Less effective
Less reliable

Correlations
Test sui te
?
?
Owners that left the company
Less effective

Correlations
Test sui te
Few group owners
Test sui te
Multiple group owners
More effective Less effective

Classification Models
Classifying quality gates showing above-median effectiveness.
Classifying quality gates showing above-median reliability.
푝푟푒푐푖푠푖표푛 =
푟푒푐푎푙푙 =
100-cross-fold stratified random sampling.
푇푃
푇푃 + 퐹푃
푇푃
푇푃 + 퐹푁
Using multiple machine learners to rule out model over fitting.

Can we predict test effectiveness?
Predicting the effectiveness of test suites with
precision and recall values around 0.7.

Can we predict test reliability?
Organizational structure metrics are
excellent predictors for test suite reliability!

 Organizational metrics on test owners
good predictors for test effectiveness!
Organizational metrics on test owners
excellent predictors for test reliability!

52 - The Impact of Test Ownership and Team Structure on the Reliability and Effectiveness of Quality Test Runs

Recommended

Recommended

More Related Content

What's hot

What's hot (16)

Similar to 52 - The Impact of Test Ownership and Team Structure on the Reliability and Effectiveness of Quality Test Runs

Similar to 52 - The Impact of Test Ownership and Team Structure on the Reliability and Effectiveness of Quality Test Runs (20)

More from ESEM 2014

More from ESEM 2014 (20)

Recently uploaded

Recently uploaded (20)

52 - The Impact of Test Ownership and Team Structure on the Reliability and Effectiveness of Quality Test Runs

Editor's Notes