SBFT Tool Competition 2024 -- Python Test Case Generation Track

•

0 likes•11 views

Nicolas Erni, Al-Ameen, Mohammed, Christian Birchler, Pouria Derakhshanfar, Stephan Lukasczyk, Sebastiano Panichella: SBFT Tool Competition 2024 -- Python Test Case Generation Track 17th International Workshop on Search-Based and Fuzz Testing

Presentations & Public Speaking

Search-Based and Fuzz Testing
Tool Competition 2024
Nicolas Erni
Zurich University of Applied
Science (ZHAW)
Christian Birchler
Zurich University of Applied
Science (ZHAW)
Pouria Derakhshanfar
JetBrains
Stephan Lukasczyk
University of Passau
Mohammed Al-Ameen
Zurich University of Applied
Science (ZHAW)
Software Under test Generated Test Code
Sebastiano Panichella
Zurich University of Applied
Science (ZHAW)
Co-located with the 46th International Conference on Software Engineering (ICSE 2024)

History SBFT Python Tool Competition
Year Venue
Coverage
tool
Mutation Tool #CUTs #Projects
#Participants
(+ baseline)
Round 1 2024 SBST PyTest
MutPy /
Cosmic Ray
35 7 4

SBFT Tool Competition - 2024
Python tool competition: For the
fi
rst time ever, we are extending an invitation to researchers to participate in our
competition using their test generation tool for Python. Tools will be assessed based on a benchmark that evaluates code
coverage and mutation score.
What is New?
Figure 1: Example of test generation for simple Python functions.
New!!!
Software Under test Generated Test Code

Python tool competition Infrastructure
python-tool-competition-2024 Infrastructure
run run run
Klara …. Tooln
CUT
Time budget
generated
tests
generated
tests
generated
tests

Scoring Formula
T = Generated Test
B = Search Budget
C = Class under test
R = independent Run
Covi = statement coverage
Covb = branch coverage
Covm = Strong Mutation
getTime = generation time
covScore(T, B, C, R) = 1 × Covi + 2 × Covb + 4 × Covm
tScore(T, B, C, R) = covScore(T, B, C, R) × min
(
1,
2 × B
genTime)
Score(T, B, C, R) = tScore(T, B, C, R) + penalty(T, B, C, R)
Xavier Devroey, Alessio Gambi, Juan Pablo Galeotti, René Just, Fitsum Meshesha
Kifetew, Annibale Panichella, Sebastiano Panichella: JUGE: An infrastructure for
benchmarking Java unit test generators. Softw. Test. Verification Reliab. 33(3) (2023)

https://github.com/ThunderKey/python-tool-competition-2024
Software Under test Generated Test Code

Benchmark Projects
• Selection criteria
• GitHub repositories
• Open Source
• Simple files
• No system access (OS, process, network, disk)

Benchmark Projects
• Selection criteria
• GitHub repositories
• Open Source
• 3 projects selected
Klara
https://github.com/se2p/pynguin https://github.com/usagitoneko97/klara
Ghostwriter with Hypothesis
https://github.com/HypothesisWorks/hypothesis
Pynguin

Contest Methodology
Search budget
400
seconds
Files under test
35
Repetitions
4 repetitions
Execution environment
Linux VM

The Tools
Competitors
UtBot
Benchmark
Klara
Pynguin
Ghostwriter
V.S.

Results (1)
Average line coverage for each project per tool

Results (2)
Average branch coverage for each project per tool

Results (3)
Average mutation score for each project per tool

Final Ranking
Competitors
UtBot
Benchmark
Klara
Pynguin
Ghostwriter
V.S.
1
2

Lessons Learned
• Identified aspects to improve and bugs that could be fixed in the
infrastructure
• Docker will simplify the evaluation procedure
• More participants to the competition!
• From Academia & Industry

What’s Next?
• Contest Infrastructure
• https://github.com/ThunderKey/python-tool-competition-2024
• Improve usability
• Facilitate setup of an evaluation
• Facilitate evaluation in other contexts
• Update the user documentation
• For the next edition
• More tools
• More CUTs
• Time budgets
• Time penalty

Similar to SBFT Tool Competition 2024 -- Python Test Case Generation Track

Software testing: an introduction - 2017XavierDevroey

Academic Modular SeminarJason Reid

GPCE16 Poster: Automatic Non-functional Testing of Code Generators Families Mohamed BOUSSAA

2010 ICMIT - Software Support for the Fuzzy Front End Stage of the Innovation...HASE – Human Aspects in Software Engineering

CASCON 2023 Most Influential Paper Award TalkNikolaos Tsantalis

PhD public defense: A Measurement Framework for Analyzing Technical Lag in ...Ahmed Zerouali

Enhancing Your Test Automation Scenario Coverage with Selenium - QA or the Hi...Perfecto by Perforce

Primers or Reminders? The Effects of Existing Review Comments on Code ReviewDelft University of Technology

Implementation of GPU-based bioinformatic tools at the ENCODE DCCENCODE-DCC

Automated Developer Testing: Achievements and ChallengesTao Xie

Reproducible, Automated and Portable Computational and Data Science Experimen...Ivo Jimenez

Resume_Yilun Chong_ENYilun Chong

ErikBrayCVErik Bray

Keynote VST2020 (Workshop on Validation, Analysis and Evolution of Software ...University of Antwerp

Bulletproof PowerShellshchegrikovich

Reproducible Science with PythonAndreas Schreiber

Java Unit Testing Tool Competition — Fifth RoundAnnibale Panichella

Collective Mind: a collaborative curation tool for program optimizationGrigori Fursin

ResumeSailesh Sidhwani

Behold the Power of PythonSarah Dutkiewicz

Similar to SBFT Tool Competition 2024 -- Python Test Case Generation Track (20)

Software testing: an introduction - 2017

Academic Modular Seminar

GPCE16 Poster: Automatic Non-functional Testing of Code Generators Families

2010 ICMIT - Software Support for the Fuzzy Front End Stage of the Innovation...

CASCON 2023 Most Influential Paper Award Talk

PhD public defense: A Measurement Framework for Analyzing Technical Lag in ...

Enhancing Your Test Automation Scenario Coverage with Selenium - QA or the Hi...

Primers or Reminders? The Effects of Existing Review Comments on Code Review

Implementation of GPU-based bioinformatic tools at the ENCODE DCC

Automated Developer Testing: Achievements and Challenges

Reproducible, Automated and Portable Computational and Data Science Experimen...

Resume_Yilun Chong_EN

ErikBrayCV

Keynote VST2020 (Workshop on Validation, Analysis and Evolution of Software ...

Bulletproof PowerShell

Reproducible Science with Python

Java Unit Testing Tool Competition — Fifth Round

Collective Mind: a collaborative curation tool for program optimization

Resume

Behold the Power of Python

Recently uploaded

Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...Pooja Nehwal

ANCHORING SCRIPT FOR A CULTURAL EVENT.docxNikitaBankoti2

Presentation on Engagement in Book Clubssamaasim06

Call Girls in Sarojini Nagar Market Delhi 💯 Call Us 🔝8264348440🔝soniya singh

CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdfhenrik385807

Mohammad_Alnahdi_Oral_Presentation_Assignment.pptxmohammadalnahdi22

Microsoft Copilot AI for Everyone - created by AITatiana Gurgel

BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort ServiceDelhi Call girls

Russian Call Girls in Kolkata Vaishnavi 🤌 8250192130 🚀 Vip Call Girls Kolkataanamikaraghav4

Night 7k Call Girls Noida Sector 128 Call Me: 8448380779Delhi Call girls

George Lever - eCommerce Day Chile 2024eCommerce Institute

SaaStr Workshop Wednesday w: Jason Lemkin, SaaStrsaastr

Mathematics of Finance Presentation.pptxMoumonDas2

Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...Hasting Chen

Exploring protein-protein interactions by Weak Affinity Chromatography (WAC) ...Salam Al-Karadaghi

Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024eCommerce Institute

No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...Sheetaleventcompany

Thirunelveli call girls Tamil escorts 7877702510Vipesco

OSCamp Kubernetes 2024 | A Tester's Guide to CI_CD as an Automated Quality Co...NETWAYS

Introduction to Prompt Engineering (Focusing on ChatGPT)Chameera Dedduwage

Recently uploaded (20)

Navi Mumbai Call Girls Service Pooja 9892124323 Real Russian Girls Looking Mo...

ANCHORING SCRIPT FOR A CULTURAL EVENT.docx

Presentation on Engagement in Book Clubs

Call Girls in Sarojini Nagar Market Delhi 💯 Call Us 🔝8264348440🔝

CTAC 2024 Valencia - Henrik Hanke - Reduce to the max - slideshare.pdf

Mohammad_Alnahdi_Oral_Presentation_Assignment.pptx

Microsoft Copilot AI for Everyone - created by AI

BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort Service

Russian Call Girls in Kolkata Vaishnavi 🤌 8250192130 🚀 Vip Call Girls Kolkata

Night 7k Call Girls Noida Sector 128 Call Me: 8448380779

George Lever - eCommerce Day Chile 2024

SaaStr Workshop Wednesday w: Jason Lemkin, SaaStr

Mathematics of Finance Presentation.pptx

Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...

Exploring protein-protein interactions by Weak Affinity Chromatography (WAC) ...

Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024

No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...

Thirunelveli call girls Tamil escorts 7877702510

OSCamp Kubernetes 2024 | A Tester's Guide to CI_CD as an Automated Quality Co...

Introduction to Prompt Engineering (Focusing on ChatGPT)

SBFT Tool Competition 2024 -- Python Test Case Generation Track

1. Search-Based and Fuzz Testing Tool Competition 2024 Nicolas Erni Zurich University of Applied Science (ZHAW) Christian Birchler Zurich University of Applied Science (ZHAW) Pouria Derakhshanfar JetBrains Stephan Lukasczyk University of Passau Mohammed Al-Ameen Zurich University of Applied Science (ZHAW) Software Under test Generated Test Code Sebastiano Panichella Zurich University of Applied Science (ZHAW) Co-located with the 46th International Conference on Software Engineering (ICSE 2024)

2. History SBFT Python Tool Competition Year Venue Coverage tool Mutation Tool #CUTs #Projects #Participants (+ baseline) Round 1 2024 SBST PyTest MutPy / Cosmic Ray 35 7 4

3. SBFT Tool Competition - 2024 Python tool competition: For the fi rst time ever, we are extending an invitation to researchers to participate in our competition using their test generation tool for Python. Tools will be assessed based on a benchmark that evaluates code coverage and mutation score. What is New? Figure 1: Example of test generation for simple Python functions. New!!! Software Under test Generated Test Code

4. Python tool competition Infrastructure python-tool-competition-2024 Infrastructure run run run Klara …. Tooln CUT Time budget generated tests generated tests generated tests

5. Python tool competition Infrastructure python-tool-competition-2024 Infrastructure run run run Klara …. Tooln CUT Time budget Generated tests MutPy / Cosmic Ray Line and Branch coverage metrics Mutation metrics

6. Scoring Formula T = Generated Test B = Search Budget C = Class under test R = independent Run Covi = statement coverage Covb = branch coverage Covm = Strong Mutation getTime = generation time covScore(T, B, C, R) = 1 × Covi + 2 × Covb + 4 × Covm tScore(T, B, C, R) = covScore(T, B, C, R) × min ( 1, 2 × B genTime) Score(T, B, C, R) = tScore(T, B, C, R) + penalty(T, B, C, R) Xavier Devroey, Alessio Gambi, Juan Pablo Galeotti, René Just, Fitsum Meshesha Kifetew, Annibale Panichella, Sebastiano Panichella: JUGE: An infrastructure for benchmarking Java unit test generators. Softw. Test. Verification Reliab. 33(3) (2023)

7. https://github.com/ThunderKey/python-tool-competition-2024 Software Under test Generated Test Code

8. Benchmark Projects • Selection criteria • GitHub repositories • Open Source • Simple files • No system access (OS, process, network, disk)

9. Benchmark Projects • Selection criteria • GitHub repositories • Open Source • 3 projects selected Klara https://github.com/se2p/pynguin https://github.com/usagitoneko97/klara Ghostwriter with Hypothesis https://github.com/HypothesisWorks/hypothesis Pynguin

10. Contest Methodology Search budget 400 seconds Files under test 35 Repetitions 4 repetitions Execution environment Linux VM

11. The Tools Competitors UtBot Benchmark Klara Pynguin Ghostwriter V.S.

12. Results (1) Average line coverage for each project per tool

13. Results (2) Average branch coverage for each project per tool

14. Results (3) Average mutation score for each project per tool

15. Results (4)

16. Results (5)

17. Final Ranking Competitors UtBot Benchmark Klara Pynguin Ghostwriter V.S. 1 2

18. Lessons Learned • Identified aspects to improve and bugs that could be fixed in the infrastructure • Docker will simplify the evaluation procedure • More participants to the competition! • From Academia & Industry

19. What’s Next? • Contest Infrastructure • https://github.com/ThunderKey/python-tool-competition-2024 • Improve usability • Facilitate setup of an evaluation • Facilitate evaluation in other contexts • Update the user documentation • For the next edition • More tools • More CUTs • Time budgets • Time penalty

SBFT Tool Competition 2024 -- Python Test Case Generation Track

Recommended

Recommended

More Related Content

Similar to SBFT Tool Competition 2024 -- Python Test Case Generation Track

Similar to SBFT Tool Competition 2024 -- Python Test Case Generation Track (20)

More from Sebastiano Panichella

More from Sebastiano Panichella (20)

Recently uploaded

Recently uploaded (20)

SBFT Tool Competition 2024 -- Python Test Case Generation Track