SlideShare a Scribd company logo
1 of 41
Download to read offline
Mining Development Repositories
         to Study the Impact of
   Collaboration on Software Systems


                          Nicolas Bettenburg
                                nicbet@cs.queensu.ca
                                      SOFTWARE ANALYSIS
                                       & INTELLIGENCE LAB




Wednesday, 11 April, 12                                     1
Software Development is a Social Activity

              Source Code stands in direct relation to
              organizational structure. [Conway:Datamation:1968]


              Developers spent large part of work day
              communicating with fellow developers. [Begel:ICSE:2010]




Wednesday, 11 April, 12                                                 2
Communication is Critical for Success

                          Communication is the most referenced
                          problem in distributed development.
                                                   [Grinter:GROUP:1999]
                                                   [Bird:ACMComm:2009]




Wednesday, 11 April, 12                                                   3
Research Hypothesis

                 “The collaboration between stakeholders
               impacts the code quality and the development
                    community of a software system.”




Wednesday, 11 April, 12                                       4
Proposed Approach


                          I. Extraction of communication data



                          II. Study impact on software quality



           III. Study impact on development community


Wednesday, 11 April, 12                                          5
Proposed Approach


                          I. Extraction of communication data



                          II. Study impact on software quality



           III. Study impact on development community


Wednesday, 11 April, 12                                          6
Available Knowledge in Data



   Version Control Systems          Mailing Lists   Issue Tracking Systems




Wednesday, 11 April, 12                                                      7
Available Knowledge in Data



   Version Control Systems          Mailing Lists   Issue Tracking Systems




                              Communication Data




Wednesday, 11 April, 12                                                      7
Available Knowledge in Data



   Version Control Systems               Mailing Lists                Issue Tracking Systems




                              Communication Data
                                 •   Source Code Comments
                                 •   Change-Log Messages
                                 •   Developer Emails & Discussions
                                 •   Support Dialogues


Wednesday, 11 April, 12                                                                        7
Communication Data Exists
                          Mainly as Unstructured Data

                   In this report, you have defined a parameter named blocksize,
                   which is given a value of "7|D|1|D". In open script of data set,
                   there are below lines code:

                   <script begin>
                   token=Packages.java.util.StringTokenizer(params["blocksize"],"|");
                   vec=new Packages.java.util.Vector();
                   while(token.hasMoreTokens()){
                      vec.addElement(token.nextToken());   Eclipse #150222
                   }
                   params["DateRange"]=java.lang.Integer.parseInt(vec.elementAt(0));
                   </script end>

                   Since the value of params["blocksize"] is "7|D|1|D", vec.elementAt(0)
                   is "7", and then it can not be parsed to int value. In 1.0.1,
                   the value of params["blocksize"] might be 7|D|1|D, so it can be
                   parsed to int value of 7.



                     Extraction and processing of unstructured
                     data is challenging. [MUD:Workshop:2010]
Wednesday, 11 April, 12                                                                    8
Mining Collaboration Data

                                          [Bettenburg:ICPC:2011]

                                             chnical Information in Un                                                                           structured Data
       A Lightw eight Approach to Uncover Te

                                                                                                                             Michel Smidt
                                             ams, Ahmed E. Hassan
                                                                                  Build ID: M20070212-1330
                Nicolas Bettenburg, Bram Ad                                                                   Dept. of Computer Science S)
                                                     gence Lab
                       Software Analysis and Intelli
                                                                                  Steps To Reproduce:
                                                                                                                    Una des a keytyinof Bremen
                                                                                                                                        ng for "M1+S" (ie. Alt+
                                                                                  1. Create a plugin for eclipse that      iversi bindione of the top level
                                                                                                                       inclu

                                 Queen’s University
                                                                                                                                                                                  • Use Spellchecking
                                                                                                                        as mnem  onic
                                                                                                                      as Bremen, for Help > any
                                                                                    where S is any letter that is used
                                                                                                                         the mnemonic Germ &So
                                                                                                                                                     ftware Updates,
                                                                                    menus. Since eclipse uses "S"
                             Kingston, Ontario, Canada                                             Email: michelIDE nformatik.u
                                                                                     "S" is sufficient .                     @i                     ni-bremen.de

                                                                                                                                                                                  • Empirical validation
                                                    cs.queensu.ca
                    Email: {nicbet,bram,ahmed}@
                                                                                  2. Laun  ch the plugin as part of Eclipse                    our example in #1)
                                                                                                               the Help menu (to go along with
                                                                                  3. Press Alt+H to bring down
                                                                                                                 tes" is missing its mnemonic.
                                                                                    BUG: Notice "Software Upda




                                         nication through email, cha
                                                                       t, or
                                                                                   More information:
                                                                                   The code after "if (callback.is
                                                                                   Eclipse's MenuManager.
                                                                                                                   AcceleratorInUse(SWT
                                                                                                            java removes the mnemonic,
                                                                                                                                            .ALT | character))" inside
                                                                                                                                       but it seems like Eclipse
                                                                                                                                           level menumanagers like
                                                                                                                                                                                  • Improved on state of the art
           Abstract—Developer commu
                                                                                                                eratorInUse" only for top
                                                                                   should be checking "isAccel
                                        s mostly of largely uns  tructured
      issue report comments consist
                                                                                         ,Edit,...,Help, etc. :
                                                                      rma-
                                                                                   File
                                     text, mixed with technical info
      data, i.e., natural language                        ons, source code
                                      jargon, abbreviati
                                                                                     /* (non-Javadoc)                               onItem#update(java.l
                                                                                                                                                         ang.String)

      tion such as project-specific
                                                                                                               e.action.IContributi
                                                               cal artifacts           * @see org.eclipse.jfac

       patches, stack traces   and identifiers. These techni                            */
                                         of knowle  dge on the technical                                 tring property) {
       represent a valuable source
                                                                                    public void update(S
                                                         applications from
                                                                                                               = getItems();
                                                                                         tributionItem items[]
                        tem, with a wide range of
                                                                                    ICon
       part of the sys                                                    vo-
                                        s to creating project-specific                                    items.length; i++) {
       establishing traceability link                            en natural
                                                                                    for (int i = 0; i <
                                       e-style delimiters betwe                                      property);
       cabularies. However, the fre
                                                                                    items[i].update(
                                                                      hnical
                                      tent make the mining of tec                   }
       language and technical con                          general-purpose
                                         t step towards a
                                                                                     [...]
        artifacts challenging. As a firs                         information
                                                                                     }

        technique to extractin   g all kinds of technical
                                          present a  lightweight approach            Any status on this bug?
        from unstructured data, we                          guage text. Our
                            cal artifacts and natural lan
                                                                                                                                                   ) [...]
                                                                                                                       for M6 (API) or M7 (non-API               by a prototype
        to untangle techni                                                 are       I'd consider any contributions
                                                                                                                              nical information uncovered
                                         g spell checking tools, which              Figure 1. Examples of tech optionalposed Manager with API (Eclipse Platform
        approach is based on existin                                                                                                 in Menu in this paper.
                                                                                                                                                               and
                                                                     ms and          A 3.5 fix enta be to of the approach pro
                                           available across platfor
                                                                                                                that behaviour
                                                                                    implemwouldtion makeand to have the WorkbenchActionBuilder contributed
         well-understood, fast, readily                                  gh a
                                         of technical artifacts. Throu
                                                                                     off by#208626).in 3.5,
                                                                                            default early                                                     gers turn it on
                                                                                    Bug                                             ions contributed MenuMana
         impartial to different kinds
                                                                                                        and actionSets/editorAct
                                                              our approach
                                                                                      MenuManagers
                                          demonstrate that                                                        in the correct place).
         handcrafted benchmark, we
                                                                                      (if I can find MenuManagers
                                                                    technical
         is able to successfully    uncover a wide range of                                                                  team to make sure we understan
                                                                                                                                                   a
                                                                                                                                                               d what the

                                                                                                                    such, mining unstructured dat
                                                                                         I'd like us to work with the SWT
                                         data.
                                                                                                                                                                       way
                                                                                                                                    sure that we aren't getting in the
          information in unstructured                                                 or project-specific terms. As
                                                                                         correct  platform behavior is, and make
                                                                                                                                           ormation
                                                                                                                                                 onics) seems odd to me, in
                                             ge analysis, unstructured dat
                                                                             a,                                       the exchange of inf
                                                                                                            nt behavior (i.e. turning off mnem
                                                                                      is challenging: it is meant for
                                                                                         of that. The curre
             Keywords-text mining, langua
                                                                                                                               we should fix it properly.
                                                                                                                       automated processing using
                                                                                          general. If we're going to fix this,

          technical information.                                                      between humans, rather than
                                                                                                                       presents an example of tech-
                                                                                      computer machinery. Figure 1
                          I. I NT RO DU CT ION                                                                       found in unstructured data.
                                                                                       nical information commonly
                                      a unique history of design                                                         ering technical information
        Every software system has                                                         Recent approaches for discov
Wednesday, 11 April, 12 changes, as well as development and                                                               e focussed on recognizing                                                            9
           ions, software                                                                 unstructured data [3]–[5] hav
Proposed Approach


                          I. Extraction of communication data



                          II. Study impact on software quality



           III. Study impact on development community


Wednesday, 11 April, 12                                          10
Proposed Approach


                          I. Extraction of communication data



                          II. Study impact on software quality



           III. Study impact on development community


Wednesday, 11 April, 12                                          10
Proposed Approach


                          I. Extraction of communication data



                          II. Study impact on software quality



           III. Study impact on development community


Wednesday, 11 April, 12                                          11
Proposed Approach


                          I. Extraction of communication data



                          II. Study impact on software quality



           III. Study impact on development community


Wednesday, 11 April, 12                                          11
Quantify Impact on Quality: Idea


             Extracted Communication Data




Wednesday, 11 April, 12                                      12
Quantify Impact on Quality: Idea


             Extracted Communication Data
                                 compute


                             Social Metrics




Wednesday, 11 April, 12                                      12
Quantify Impact on Quality: Idea


             Extracted Communication Data
                                   compute


                             Social Metrics

                             measure relationships


                          Post-Release Defects



Wednesday, 11 April, 12                                      12
Discussion           Social
                   CONTENT           STRUCTURES

                            4 Dimensions
                            of Measures


               Measures of          Communication
               WORKFLOW               DYNAMICS

Wednesday, 11 April, 12                             13
Conceptual Approach


                            Measure         Measure
                            Discussion     Post-Release
                            Metrics           Bugs



                             6 months       6 months
                                                          time


                          LINK USING STATISTICAL MODELS
Wednesday, 11 April, 12                                          14
Findings of our work

               (1) Social metrics explain post-release defects
               as good as code metrics.




Wednesday, 11 April, 12                                          15
Findings of our work

               (1) Social metrics explain post-release defects
               as good as code metrics.

              (2) Combination of social metrics and code
              metrics is cumulative.




Wednesday, 11 April, 12                                          15
Findings of our work

               (1) Social metrics explain post-release defects
               as good as code metrics.

              (2) Combination of social metrics and code
              metrics is cumulative.

              (3) Identify factors that have positive and
              negative relationships with defects.


Wednesday, 11 April, 12                                          15
Findings of our work

               (1) Social metrics explain post-release defects
               as good as code metrics.

              (2) Combination of social metrics and code
              metrics is cumulative.

              (3) Identify factors that have positive and
              negative relationships with defects.

                                                    [ICPC‘2010] (Best Paper)
                                                    [JEMSE?]
Wednesday, 11 April, 12                                                   15
Proposed Approach


                          I. Extraction of communication data



                          II. Study impact on software quality



           III. Study impact on development community


Wednesday, 11 April, 12                                          16
Proposed Approach


                          I. Extraction of communication data



                          II. Study impact on software quality



           III. Study impact on development community


Wednesday, 11 April, 12                                          16
Proposed Approach


                          I. Extraction of communication data



                          II. Study impact on software quality



           III. Study impact on development community


Wednesday, 11 April, 12                                          16
Proposed Approach


                          I. Extraction of communication data



                          II. Study impact on software quality



           III. Study impact on development community


Wednesday, 11 April, 12                                          17
Proposed Approach


                          I. Extraction of communication data



                          II. Study impact on software quality



           III. Study impact on development community


Wednesday, 11 April, 12                                          17
Proposed Approach


                          I. Extraction of communication data



                          II. Study impact on software quality



           III. Study impact on development community


Wednesday, 11 April, 12                                          17
Available Knowledge in Data



     Code Review Systems            Mailing Lists    Issue Tracking Systems




                             Data on Management
                             of Code Contributions


Wednesday, 11 April, 12                                                       18
Contribution Management




                                   Patch
                                                                     Project




                                                   Feedback
                                                                    Repository
                  Feedback




                             Submission



                               Review      OK   Verification   OK
                                                                   Integration




Wednesday, 11 April, 12                                                          19
Studying Impact on Community through
                        Contribution Management



   Goal:
   Study how contributors, reviewers, verifiers and the
   software are impacted by communication (anomalies)
   through statistical models.

   Example:
   Reviewers leaving community due to lack of feedback


Wednesday, 11 April, 12                                  20
Available Knowledge in Data



   Version Control Systems          Mailing Lists   Issue Tracking Systems




                              Workflow Information
                                Social Networks


Wednesday, 11 April, 12                                                      21
Evolution of Code-Knowledge
                                  Communities
                                                                        Internet Explorer                                                                                          reed
                                                                                                                                                                                           masayuki




                              cjcypoi02                                                                                                                                                                                                            dietrich
                                          steve.england                                                                                                                               corevette
                                                                                                                                                                                                                                                           steffen.wilberg
                                                                                                                                                                                                  davemgarrett
                               mmortal03                                                                                  timeless                                                                                                                 mano
                                                 fittysix
                                                                                                                                                                           matspal
                                                                                                                                                   longsonr
                                                                                                                                                                                                                 zurtex
                                                                                                                  matti                                                                                                                                                   edilee
                                                                                                                                                                                                                                      mconnor
                                                                                                                                       cwwmozilla                                                                                                    beltzner
                                                                                                                             dveditz
                                       adelfino                                                                                                               zeniko
                                                                                                                                                                                                                          kliu
                                                    alice0775
                                                                                                                                                                                          sziadeh mark.finkle                                                  robert.bugzilla
                                                                                                                                                                                                                                   philringnalda


                                       sgautherie.bz                                                                                       kev
                                                                                                                                                                                                                    faaborg
                                                                                                                   johnath
                                                                                                                                                   martijn.martijn

                                               jmjeffery        jo.hermans          nrthomas gavin.sharp                                                                                                 polidobj

                                                                                                                                             m-wada
                                                                                                                                                                                                                                                   XML Parser
                                                                                                             jbecerra                                                                                      jdarmochwal
                                     john.p.baker           jruderman                                                                                                                           mak77
                                                                             ria.klaassen
                          VYV03354                                                               cbook                                                                           bomfog
                                                                                                                                                                                                                                                                                   dao
                                                                                                                  elmar.ludwig                         sdaugherty
                                                                                                                                                                                                         vseerror
                               nightstalkerz        l10n                 highmind63                                                                                                            twalker
                                                                                                                                                                                                                                                                     mh+mozilla
                                                                                                                                                                                                                                                                                   klaas1988
                                                                                                                 ehsan     stephen.donner
                                                                                            me.at.work
                                                                                                                                                       phiw
                                                                      hskupin
                                                                                                                  ctalbert
                                                                                       tchung                                                              tomer

                                                                                                     marcia                                                              timwi                                                                                    rotis
                                                                                                                                                                                                                                                   uliss

                                                                                                                                       sylvain.pasche
                                                                                                                                                         bugzilla
                                                                                                                             marco.zehe                                                                                                                 cl-bugs-new2



                          JavaScript
                                                                                                                  tonglebeak
                                                                                                     abillings                                                                                                                                                info                             UI
                            Engine
                                                                              deletesoftware                                                   anselm.meyer

                                                                                                                  eddy_nigg
                                                                                                                                                                                                                                                              matt
                                                                                                                                                   RainerStroebel
                                                                samuel.sidler+old                                                       alex
                                                                                                hasham8888

                                                                                                                                                                                                                                             aarobertxtr
                                                                                                                                                                                                                                 manujsabarwal           johnjbarton

                                                                                    myles7897
                                                                                                                                           paulc
                                                                                                                                                                                                                                                    shaver
                                                                                                                                                                                                                                      smichaud


                                                                                                                                 mozilla
                                                                                                                                            zhangchunlin                                                                                                      dtownsend
                                                                                                                                                                                                                                            jdaggett
                                                                                                                                                              kbrosnan

                                                                                                                                                                                                                                                       bzbarsky
                                                                                                                                                    sdwilsh




Wednesday, 11 April, 12                                                                                                                                                                                                                                                                             22
Thesis Progress


          Tools and techniques                    Empirical Validation
       for mining communication repositories    of presented tools and techniques




            Empirical Validation                  Empirical Validation
      of relationship between collaboration    of relationship between collaboration
                and software quality.                 and development teams.




Wednesday, 11 April, 12                                                                23
Thesis Progress


          Tools and techniques                    Empirical Validation
       for mining communication repositories    of presented tools and techniques




            Empirical Validation                  Empirical Validation
      of relationship between collaboration    of relationship between collaboration
                and software quality.                 and development teams.




Wednesday, 11 April, 12                                                                23
Thesis Progress


          Tools and techniques                    Empirical Validation
       for mining communication repositories    of presented tools and techniques




            Empirical Validation                  Empirical Validation
      of relationship between collaboration    of relationship between collaboration
                and software quality.                 and development teams.




Wednesday, 11 April, 12                                                                23
Thesis Progress


          Tools and techniques                    Empirical Validation
       for mining communication repositories    of presented tools and techniques




            Empirical Validation                  Empirical Validation
      of relationship between collaboration    of relationship between collaboration
                and software quality.                 and development teams.




Wednesday, 11 April, 12                                                                23
Thesis Progress


          Tools and techniques                    Empirical Validation
       for mining communication repositories    of presented tools and techniques




            Empirical Validation                  Empirical Validation
      of relationship between collaboration    of relationship between collaboration
                and software quality.                 and development teams.




Wednesday, 11 April, 12                                                                23
Points for Discussion


          • How to do evaluation of code-knowledge
                 communities? (ground truth)?
          • Applicability to industrial settings (almost no
                 communication data records available)?
          • Extend work to defect prediction?
          • Practical implications: management,
                 moderation, staffing, ... ?


Wednesday, 11 April, 12                                       24

More Related Content

Viewers also liked

WCRE2011
WCRE2011WCRE2011
WCRE2011swy351
 
MSR 2009
MSR 2009MSR 2009
MSR 2009swy351
 
ICSME2014
ICSME2014ICSME2014
ICSME2014swy351
 
ICSE2013
ICSE2013ICSE2013
ICSE2013swy351
 
ICPE2015
ICPE2015ICPE2015
ICPE2015swy351
 
Msr2016 tarek
Msr2016 tarek Msr2016 tarek
Msr2016 tarek swy351
 
ICSE2014
ICSE2014ICSE2014
ICSE2014swy351
 
Mining Sociotechnical Information From Software Repositories
Mining Sociotechnical Information From Software RepositoriesMining Sociotechnical Information From Software Repositories
Mining Sociotechnical Information From Software RepositoriesMarco Aurelio Gerosa
 
ASE2010
ASE2010ASE2010
ASE2010swy351
 
Towards the Social Programmer (MSR 2012 Keynote by M. Storey)
Towards the Social Programmer (MSR 2012 Keynote by M. Storey)Towards the Social Programmer (MSR 2012 Keynote by M. Storey)
Towards the Social Programmer (MSR 2012 Keynote by M. Storey)Margaret-Anne Storey
 
MSR End of Internship Talk
MSR End of Internship TalkMSR End of Internship Talk
MSR End of Internship TalkRay Buse
 
Empirical Software Engineering at Microsoft Research
Empirical Software Engineering at Microsoft ResearchEmpirical Software Engineering at Microsoft Research
Empirical Software Engineering at Microsoft ResearchThomas Zimmermann
 
A Metric for Code Readability
A Metric for Code ReadabilityA Metric for Code Readability
A Metric for Code ReadabilityRay Buse
 
Benevol 2012 Keynote: The Social Software (R)evolution
Benevol 2012 Keynote: The Social Software (R)evolutionBenevol 2012 Keynote: The Social Software (R)evolution
Benevol 2012 Keynote: The Social Software (R)evolutionMargaret-Anne Storey
 
The (R)evolution of Social Media in Software Engineering
The (R)evolution of Social Media in Software EngineeringThe (R)evolution of Social Media in Software Engineering
The (R)evolution of Social Media in Software EngineeringMargaret-Anne Storey
 
Software Mining and Software Datasets
Software Mining and Software DatasetsSoftware Mining and Software Datasets
Software Mining and Software DatasetsTao Xie
 
FSE 2016 Panel: The State of Software Engineering Research
FSE 2016 Panel: The State of Software Engineering ResearchFSE 2016 Panel: The State of Software Engineering Research
FSE 2016 Panel: The State of Software Engineering ResearchMargaret-Anne Storey
 
Crowdsourcing Documentation in Software Engineering
Crowdsourcing Documentation in Software EngineeringCrowdsourcing Documentation in Software Engineering
Crowdsourcing Documentation in Software EngineeringMargaret-Anne Storey
 
SLE 2012 Keynote: Cognitive and Social Challenges of Ontology Use in the Biom...
SLE 2012 Keynote: Cognitive and Social Challenges of Ontology Use in the Biom...SLE 2012 Keynote: Cognitive and Social Challenges of Ontology Use in the Biom...
SLE 2012 Keynote: Cognitive and Social Challenges of Ontology Use in the Biom...Margaret-Anne Storey
 

Viewers also liked (20)

WCRE2011
WCRE2011WCRE2011
WCRE2011
 
MSR 2009
MSR 2009MSR 2009
MSR 2009
 
ICSME2014
ICSME2014ICSME2014
ICSME2014
 
ICSE2013
ICSE2013ICSE2013
ICSE2013
 
ICPE2015
ICPE2015ICPE2015
ICPE2015
 
Msr2016 tarek
Msr2016 tarek Msr2016 tarek
Msr2016 tarek
 
ICSE2014
ICSE2014ICSE2014
ICSE2014
 
Mining Sociotechnical Information From Software Repositories
Mining Sociotechnical Information From Software RepositoriesMining Sociotechnical Information From Software Repositories
Mining Sociotechnical Information From Software Repositories
 
ASE2010
ASE2010ASE2010
ASE2010
 
Towards the Social Programmer (MSR 2012 Keynote by M. Storey)
Towards the Social Programmer (MSR 2012 Keynote by M. Storey)Towards the Social Programmer (MSR 2012 Keynote by M. Storey)
Towards the Social Programmer (MSR 2012 Keynote by M. Storey)
 
MSR End of Internship Talk
MSR End of Internship TalkMSR End of Internship Talk
MSR End of Internship Talk
 
Empirical Software Engineering at Microsoft Research
Empirical Software Engineering at Microsoft ResearchEmpirical Software Engineering at Microsoft Research
Empirical Software Engineering at Microsoft Research
 
A Metric for Code Readability
A Metric for Code ReadabilityA Metric for Code Readability
A Metric for Code Readability
 
Benevol 2012 Keynote: The Social Software (R)evolution
Benevol 2012 Keynote: The Social Software (R)evolutionBenevol 2012 Keynote: The Social Software (R)evolution
Benevol 2012 Keynote: The Social Software (R)evolution
 
The (R)evolution of Social Media in Software Engineering
The (R)evolution of Social Media in Software EngineeringThe (R)evolution of Social Media in Software Engineering
The (R)evolution of Social Media in Software Engineering
 
Software Mining and Software Datasets
Software Mining and Software DatasetsSoftware Mining and Software Datasets
Software Mining and Software Datasets
 
FSE 2016 Panel: The State of Software Engineering Research
FSE 2016 Panel: The State of Software Engineering ResearchFSE 2016 Panel: The State of Software Engineering Research
FSE 2016 Panel: The State of Software Engineering Research
 
Research industry panel review
Research industry panel reviewResearch industry panel review
Research industry panel review
 
Crowdsourcing Documentation in Software Engineering
Crowdsourcing Documentation in Software EngineeringCrowdsourcing Documentation in Software Engineering
Crowdsourcing Documentation in Software Engineering
 
SLE 2012 Keynote: Cognitive and Social Challenges of Ontology Use in the Biom...
SLE 2012 Keynote: Cognitive and Social Challenges of Ontology Use in the Biom...SLE 2012 Keynote: Cognitive and Social Challenges of Ontology Use in the Biom...
SLE 2012 Keynote: Cognitive and Social Challenges of Ontology Use in the Biom...
 

Similar to Mining Development Repositories to Study the Impact of Collaboration on Software Systems

Fse2011 bettenburg presentation
Fse2011 bettenburg presentationFse2011 bettenburg presentation
Fse2011 bettenburg presentationSAIL_QU
 
Managing Complexity Across Today’s Application Delivery Chain:Six key indicat...
Managing Complexity Across Today’s Application Delivery Chain:Six key indicat...Managing Complexity Across Today’s Application Delivery Chain:Six key indicat...
Managing Complexity Across Today’s Application Delivery Chain:Six key indicat...Compuware APM
 
FreeNEST concept introduction
FreeNEST concept introductionFreeNEST concept introduction
FreeNEST concept introductionnarsuman
 
Lessons and requirements from a decade of deployed Semantic Web apps
Lessons and requirements from a decade of deployed Semantic Web appsLessons and requirements from a decade of deployed Semantic Web apps
Lessons and requirements from a decade of deployed Semantic Web appsBenjamin Heitmann
 
SeCold - A Linked Data Platform for Mining Software Repositories
SeCold - A Linked Data Platform for  Mining Software RepositoriesSeCold - A Linked Data Platform for  Mining Software Repositories
SeCold - A Linked Data Platform for Mining Software Repositoriesimanmahsa
 
SP1: Exploratory Network Analysis with Gephi
SP1: Exploratory Network Analysis with GephiSP1: Exploratory Network Analysis with Gephi
SP1: Exploratory Network Analysis with GephiJohn Breslin
 
The Essentials Of Project Management
The Essentials Of Project ManagementThe Essentials Of Project Management
The Essentials Of Project ManagementLaura Arrigo
 
Introduction
IntroductionIntroduction
IntroductionKh Ravy
 
511 NY Social Media Presentation
511 NY Social Media Presentation511 NY Social Media Presentation
511 NY Social Media PresentationDeidre
 
IRJET- Windows Log Investigator System for Faster Root Cause Detection of a D...
IRJET- Windows Log Investigator System for Faster Root Cause Detection of a D...IRJET- Windows Log Investigator System for Faster Root Cause Detection of a D...
IRJET- Windows Log Investigator System for Faster Root Cause Detection of a D...IRJET Journal
 
Analysis of IT Monitoring Using Open Source Software Techniques: A Review
Analysis of IT Monitoring Using Open Source Software Techniques: A ReviewAnalysis of IT Monitoring Using Open Source Software Techniques: A Review
Analysis of IT Monitoring Using Open Source Software Techniques: A ReviewIJERD Editor
 
Gephi icwsm-tutorial
Gephi icwsm-tutorialGephi icwsm-tutorial
Gephi icwsm-tutorialcsedays
 
Presentation SIG, Green IT Amsterdam workshop Green Software 12 apr 2011, Gre...
Presentation SIG, Green IT Amsterdam workshop Green Software 12 apr 2011, Gre...Presentation SIG, Green IT Amsterdam workshop Green Software 12 apr 2011, Gre...
Presentation SIG, Green IT Amsterdam workshop Green Software 12 apr 2011, Gre...Jaak Vlasveld
 
Past, Present, and Future of Analyzing Software Data
Past, Present, and Future of Analyzing Software DataPast, Present, and Future of Analyzing Software Data
Past, Present, and Future of Analyzing Software DataJeongwhan Choi
 
Software Analytics - Achievements and Challenges
Software Analytics - Achievements and ChallengesSoftware Analytics - Achievements and Challenges
Software Analytics - Achievements and ChallengesTao Xie
 
Software Analytics: Towards Software Mining that Matters (2014)
Software Analytics:Towards Software Mining that Matters (2014)Software Analytics:Towards Software Mining that Matters (2014)
Software Analytics: Towards Software Mining that Matters (2014)Tao Xie
 
How temporal network analysis can help us to explore existing interrelationsh...
How temporal network analysis can help us to explore existing interrelationsh...How temporal network analysis can help us to explore existing interrelationsh...
How temporal network analysis can help us to explore existing interrelationsh...Müller-Birn Claudia
 

Similar to Mining Development Repositories to Study the Impact of Collaboration on Software Systems (20)

Fse2011 bettenburg presentation
Fse2011 bettenburg presentationFse2011 bettenburg presentation
Fse2011 bettenburg presentation
 
Managing Complexity Across Today’s Application Delivery Chain:Six key indicat...
Managing Complexity Across Today’s Application Delivery Chain:Six key indicat...Managing Complexity Across Today’s Application Delivery Chain:Six key indicat...
Managing Complexity Across Today’s Application Delivery Chain:Six key indicat...
 
FreeNEST concept introduction
FreeNEST concept introductionFreeNEST concept introduction
FreeNEST concept introduction
 
Data-Intensive Research
Data-Intensive ResearchData-Intensive Research
Data-Intensive Research
 
Lessons and requirements from a decade of deployed Semantic Web apps
Lessons and requirements from a decade of deployed Semantic Web appsLessons and requirements from a decade of deployed Semantic Web apps
Lessons and requirements from a decade of deployed Semantic Web apps
 
SeCold - A Linked Data Platform for Mining Software Repositories
SeCold - A Linked Data Platform for  Mining Software RepositoriesSeCold - A Linked Data Platform for  Mining Software Repositories
SeCold - A Linked Data Platform for Mining Software Repositories
 
SP1: Exploratory Network Analysis with Gephi
SP1: Exploratory Network Analysis with GephiSP1: Exploratory Network Analysis with Gephi
SP1: Exploratory Network Analysis with Gephi
 
2018 learning approach-digitaltrends
2018 learning approach-digitaltrends2018 learning approach-digitaltrends
2018 learning approach-digitaltrends
 
The Essentials Of Project Management
The Essentials Of Project ManagementThe Essentials Of Project Management
The Essentials Of Project Management
 
Introduction
IntroductionIntroduction
Introduction
 
Lopez
LopezLopez
Lopez
 
511 NY Social Media Presentation
511 NY Social Media Presentation511 NY Social Media Presentation
511 NY Social Media Presentation
 
IRJET- Windows Log Investigator System for Faster Root Cause Detection of a D...
IRJET- Windows Log Investigator System for Faster Root Cause Detection of a D...IRJET- Windows Log Investigator System for Faster Root Cause Detection of a D...
IRJET- Windows Log Investigator System for Faster Root Cause Detection of a D...
 
Analysis of IT Monitoring Using Open Source Software Techniques: A Review
Analysis of IT Monitoring Using Open Source Software Techniques: A ReviewAnalysis of IT Monitoring Using Open Source Software Techniques: A Review
Analysis of IT Monitoring Using Open Source Software Techniques: A Review
 
Gephi icwsm-tutorial
Gephi icwsm-tutorialGephi icwsm-tutorial
Gephi icwsm-tutorial
 
Presentation SIG, Green IT Amsterdam workshop Green Software 12 apr 2011, Gre...
Presentation SIG, Green IT Amsterdam workshop Green Software 12 apr 2011, Gre...Presentation SIG, Green IT Amsterdam workshop Green Software 12 apr 2011, Gre...
Presentation SIG, Green IT Amsterdam workshop Green Software 12 apr 2011, Gre...
 
Past, Present, and Future of Analyzing Software Data
Past, Present, and Future of Analyzing Software DataPast, Present, and Future of Analyzing Software Data
Past, Present, and Future of Analyzing Software Data
 
Software Analytics - Achievements and Challenges
Software Analytics - Achievements and ChallengesSoftware Analytics - Achievements and Challenges
Software Analytics - Achievements and Challenges
 
Software Analytics: Towards Software Mining that Matters (2014)
Software Analytics:Towards Software Mining that Matters (2014)Software Analytics:Towards Software Mining that Matters (2014)
Software Analytics: Towards Software Mining that Matters (2014)
 
How temporal network analysis can help us to explore existing interrelationsh...
How temporal network analysis can help us to explore existing interrelationsh...How temporal network analysis can help us to explore existing interrelationsh...
How temporal network analysis can help us to explore existing interrelationsh...
 

More from Nicolas Bettenburg

10 Year Impact Award Presentation - Duplicate Bug Reports Considered Harmful ...
10 Year Impact Award Presentation - Duplicate Bug Reports Considered Harmful ...10 Year Impact Award Presentation - Duplicate Bug Reports Considered Harmful ...
10 Year Impact Award Presentation - Duplicate Bug Reports Considered Harmful ...Nicolas Bettenburg
 
Ph.D. Dissertation - Studying the Impact of Developer Communication on the Qu...
Ph.D. Dissertation - Studying the Impact of Developer Communication on the Qu...Ph.D. Dissertation - Studying the Impact of Developer Communication on the Qu...
Ph.D. Dissertation - Studying the Impact of Developer Communication on the Qu...Nicolas Bettenburg
 
Think Locally, Act Gobally - Improving Defect and Effort Prediction Models
Think Locally, Act Gobally - Improving Defect and Effort Prediction ModelsThink Locally, Act Gobally - Improving Defect and Effort Prediction Models
Think Locally, Act Gobally - Improving Defect and Effort Prediction ModelsNicolas Bettenburg
 
Using Fuzzy Code Search to Link Code Fragments in Discussions to Source Code
Using Fuzzy Code Search to Link Code Fragments in Discussions to Source CodeUsing Fuzzy Code Search to Link Code Fragments in Discussions to Source Code
Using Fuzzy Code Search to Link Code Fragments in Discussions to Source CodeNicolas Bettenburg
 
A Lightweight Approach to Uncover Technical Information in Unstructured Data
A Lightweight Approach to Uncover Technical Information in Unstructured DataA Lightweight Approach to Uncover Technical Information in Unstructured Data
A Lightweight Approach to Uncover Technical Information in Unstructured DataNicolas Bettenburg
 
Managing Community Contributions: Lessons Learned from a Case Study on Andro...
Managing Community Contributions:  Lessons Learned from a Case Study on Andro...Managing Community Contributions:  Lessons Learned from a Case Study on Andro...
Managing Community Contributions: Lessons Learned from a Case Study on Andro...Nicolas Bettenburg
 
Studying the impact of Social Structures on Software Quality
Studying the impact of Social Structures on Software QualityStudying the impact of Social Structures on Software Quality
Studying the impact of Social Structures on Software QualityNicolas Bettenburg
 
An Empirical Study on Inconsistent Changes to Code Clones at Release Level
An Empirical Study on Inconsistent Changes to Code Clones at Release LevelAn Empirical Study on Inconsistent Changes to Code Clones at Release Level
An Empirical Study on Inconsistent Changes to Code Clones at Release LevelNicolas Bettenburg
 
An Empirical Study on the Risks of Using Off-the-Shelf Techniques for Process...
An Empirical Study on the Risks of Using Off-the-Shelf Techniques for Process...An Empirical Study on the Risks of Using Off-the-Shelf Techniques for Process...
An Empirical Study on the Risks of Using Off-the-Shelf Techniques for Process...Nicolas Bettenburg
 
Finding Paths in Large Spaces - A* and Hierarchical A*
Finding Paths in Large Spaces - A* and Hierarchical A*Finding Paths in Large Spaces - A* and Hierarchical A*
Finding Paths in Large Spaces - A* and Hierarchical A*Nicolas Bettenburg
 
Automatic Identification of Bug Introducing Changes
Automatic Identification of Bug Introducing ChangesAutomatic Identification of Bug Introducing Changes
Automatic Identification of Bug Introducing ChangesNicolas Bettenburg
 
Cloning Considered Harmful Considered Harmful
Cloning Considered Harmful Considered HarmfulCloning Considered Harmful Considered Harmful
Cloning Considered Harmful Considered HarmfulNicolas Bettenburg
 
Predictors of Customer Perceived Quality
Predictors of Customer Perceived QualityPredictors of Customer Perceived Quality
Predictors of Customer Perceived QualityNicolas Bettenburg
 
Extracting Structural Information from Bug Reports.
Extracting Structural Information from Bug Reports.Extracting Structural Information from Bug Reports.
Extracting Structural Information from Bug Reports.Nicolas Bettenburg
 
Computing Accuracy Precision And Recall
Computing Accuracy Precision And RecallComputing Accuracy Precision And Recall
Computing Accuracy Precision And RecallNicolas Bettenburg
 
Duplicate Bug Reports Considered Harmful ... Really?
Duplicate Bug Reports Considered Harmful ... Really?Duplicate Bug Reports Considered Harmful ... Really?
Duplicate Bug Reports Considered Harmful ... Really?Nicolas Bettenburg
 
The Quality of Bug Reports in Eclipse ETX'07
The Quality of Bug Reports in Eclipse ETX'07The Quality of Bug Reports in Eclipse ETX'07
The Quality of Bug Reports in Eclipse ETX'07Nicolas Bettenburg
 

More from Nicolas Bettenburg (20)

10 Year Impact Award Presentation - Duplicate Bug Reports Considered Harmful ...
10 Year Impact Award Presentation - Duplicate Bug Reports Considered Harmful ...10 Year Impact Award Presentation - Duplicate Bug Reports Considered Harmful ...
10 Year Impact Award Presentation - Duplicate Bug Reports Considered Harmful ...
 
Ph.D. Dissertation - Studying the Impact of Developer Communication on the Qu...
Ph.D. Dissertation - Studying the Impact of Developer Communication on the Qu...Ph.D. Dissertation - Studying the Impact of Developer Communication on the Qu...
Ph.D. Dissertation - Studying the Impact of Developer Communication on the Qu...
 
Think Locally, Act Gobally - Improving Defect and Effort Prediction Models
Think Locally, Act Gobally - Improving Defect and Effort Prediction ModelsThink Locally, Act Gobally - Improving Defect and Effort Prediction Models
Think Locally, Act Gobally - Improving Defect and Effort Prediction Models
 
Using Fuzzy Code Search to Link Code Fragments in Discussions to Source Code
Using Fuzzy Code Search to Link Code Fragments in Discussions to Source CodeUsing Fuzzy Code Search to Link Code Fragments in Discussions to Source Code
Using Fuzzy Code Search to Link Code Fragments in Discussions to Source Code
 
A Lightweight Approach to Uncover Technical Information in Unstructured Data
A Lightweight Approach to Uncover Technical Information in Unstructured DataA Lightweight Approach to Uncover Technical Information in Unstructured Data
A Lightweight Approach to Uncover Technical Information in Unstructured Data
 
Managing Community Contributions: Lessons Learned from a Case Study on Andro...
Managing Community Contributions:  Lessons Learned from a Case Study on Andro...Managing Community Contributions:  Lessons Learned from a Case Study on Andro...
Managing Community Contributions: Lessons Learned from a Case Study on Andro...
 
Mud flash
Mud flashMud flash
Mud flash
 
Studying the impact of Social Structures on Software Quality
Studying the impact of Social Structures on Software QualityStudying the impact of Social Structures on Software Quality
Studying the impact of Social Structures on Software Quality
 
An Empirical Study on Inconsistent Changes to Code Clones at Release Level
An Empirical Study on Inconsistent Changes to Code Clones at Release LevelAn Empirical Study on Inconsistent Changes to Code Clones at Release Level
An Empirical Study on Inconsistent Changes to Code Clones at Release Level
 
An Empirical Study on the Risks of Using Off-the-Shelf Techniques for Process...
An Empirical Study on the Risks of Using Off-the-Shelf Techniques for Process...An Empirical Study on the Risks of Using Off-the-Shelf Techniques for Process...
An Empirical Study on the Risks of Using Off-the-Shelf Techniques for Process...
 
Fuzzy Logic in Smart Homes
Fuzzy Logic in Smart HomesFuzzy Logic in Smart Homes
Fuzzy Logic in Smart Homes
 
Finding Paths in Large Spaces - A* and Hierarchical A*
Finding Paths in Large Spaces - A* and Hierarchical A*Finding Paths in Large Spaces - A* and Hierarchical A*
Finding Paths in Large Spaces - A* and Hierarchical A*
 
Automatic Identification of Bug Introducing Changes
Automatic Identification of Bug Introducing ChangesAutomatic Identification of Bug Introducing Changes
Automatic Identification of Bug Introducing Changes
 
Cloning Considered Harmful Considered Harmful
Cloning Considered Harmful Considered HarmfulCloning Considered Harmful Considered Harmful
Cloning Considered Harmful Considered Harmful
 
Approximation Algorithms
Approximation AlgorithmsApproximation Algorithms
Approximation Algorithms
 
Predictors of Customer Perceived Quality
Predictors of Customer Perceived QualityPredictors of Customer Perceived Quality
Predictors of Customer Perceived Quality
 
Extracting Structural Information from Bug Reports.
Extracting Structural Information from Bug Reports.Extracting Structural Information from Bug Reports.
Extracting Structural Information from Bug Reports.
 
Computing Accuracy Precision And Recall
Computing Accuracy Precision And RecallComputing Accuracy Precision And Recall
Computing Accuracy Precision And Recall
 
Duplicate Bug Reports Considered Harmful ... Really?
Duplicate Bug Reports Considered Harmful ... Really?Duplicate Bug Reports Considered Harmful ... Really?
Duplicate Bug Reports Considered Harmful ... Really?
 
The Quality of Bug Reports in Eclipse ETX'07
The Quality of Bug Reports in Eclipse ETX'07The Quality of Bug Reports in Eclipse ETX'07
The Quality of Bug Reports in Eclipse ETX'07
 

Recently uploaded

Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 

Recently uploaded (20)

Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 

Mining Development Repositories to Study the Impact of Collaboration on Software Systems

  • 1. Mining Development Repositories to Study the Impact of Collaboration on Software Systems Nicolas Bettenburg nicbet@cs.queensu.ca SOFTWARE ANALYSIS & INTELLIGENCE LAB Wednesday, 11 April, 12 1
  • 2. Software Development is a Social Activity Source Code stands in direct relation to organizational structure. [Conway:Datamation:1968] Developers spent large part of work day communicating with fellow developers. [Begel:ICSE:2010] Wednesday, 11 April, 12 2
  • 3. Communication is Critical for Success Communication is the most referenced problem in distributed development. [Grinter:GROUP:1999] [Bird:ACMComm:2009] Wednesday, 11 April, 12 3
  • 4. Research Hypothesis “The collaboration between stakeholders impacts the code quality and the development community of a software system.” Wednesday, 11 April, 12 4
  • 5. Proposed Approach I. Extraction of communication data II. Study impact on software quality III. Study impact on development community Wednesday, 11 April, 12 5
  • 6. Proposed Approach I. Extraction of communication data II. Study impact on software quality III. Study impact on development community Wednesday, 11 April, 12 6
  • 7. Available Knowledge in Data Version Control Systems Mailing Lists Issue Tracking Systems Wednesday, 11 April, 12 7
  • 8. Available Knowledge in Data Version Control Systems Mailing Lists Issue Tracking Systems Communication Data Wednesday, 11 April, 12 7
  • 9. Available Knowledge in Data Version Control Systems Mailing Lists Issue Tracking Systems Communication Data • Source Code Comments • Change-Log Messages • Developer Emails & Discussions • Support Dialogues Wednesday, 11 April, 12 7
  • 10. Communication Data Exists Mainly as Unstructured Data In this report, you have defined a parameter named blocksize, which is given a value of "7|D|1|D". In open script of data set, there are below lines code: <script begin> token=Packages.java.util.StringTokenizer(params["blocksize"],"|"); vec=new Packages.java.util.Vector(); while(token.hasMoreTokens()){ vec.addElement(token.nextToken()); Eclipse #150222 } params["DateRange"]=java.lang.Integer.parseInt(vec.elementAt(0)); </script end> Since the value of params["blocksize"] is "7|D|1|D", vec.elementAt(0) is "7", and then it can not be parsed to int value. In 1.0.1, the value of params["blocksize"] might be 7|D|1|D, so it can be parsed to int value of 7. Extraction and processing of unstructured data is challenging. [MUD:Workshop:2010] Wednesday, 11 April, 12 8
  • 11. Mining Collaboration Data [Bettenburg:ICPC:2011] chnical Information in Un structured Data A Lightw eight Approach to Uncover Te Michel Smidt ams, Ahmed E. Hassan Build ID: M20070212-1330 Nicolas Bettenburg, Bram Ad Dept. of Computer Science S) gence Lab Software Analysis and Intelli Steps To Reproduce: Una des a keytyinof Bremen ng for "M1+S" (ie. Alt+ 1. Create a plugin for eclipse that iversi bindione of the top level inclu Queen’s University • Use Spellchecking as mnem onic as Bremen, for Help > any where S is any letter that is used the mnemonic Germ &So ftware Updates, menus. Since eclipse uses "S" Kingston, Ontario, Canada Email: michelIDE nformatik.u "S" is sufficient . @i ni-bremen.de • Empirical validation cs.queensu.ca Email: {nicbet,bram,ahmed}@ 2. Laun ch the plugin as part of Eclipse our example in #1) the Help menu (to go along with 3. Press Alt+H to bring down tes" is missing its mnemonic. BUG: Notice "Software Upda nication through email, cha t, or More information: The code after "if (callback.is Eclipse's MenuManager. AcceleratorInUse(SWT java removes the mnemonic, .ALT | character))" inside but it seems like Eclipse level menumanagers like • Improved on state of the art Abstract—Developer commu eratorInUse" only for top should be checking "isAccel s mostly of largely uns tructured issue report comments consist ,Edit,...,Help, etc. : rma- File text, mixed with technical info data, i.e., natural language ons, source code jargon, abbreviati /* (non-Javadoc) onItem#update(java.l ang.String) tion such as project-specific e.action.IContributi cal artifacts * @see org.eclipse.jfac patches, stack traces and identifiers. These techni */ of knowle dge on the technical tring property) { represent a valuable source public void update(S applications from = getItems(); tributionItem items[] tem, with a wide range of ICon part of the sys vo- s to creating project-specific items.length; i++) { establishing traceability link en natural for (int i = 0; i < e-style delimiters betwe property); cabularies. However, the fre items[i].update( hnical tent make the mining of tec } language and technical con general-purpose t step towards a [...] artifacts challenging. As a firs information } technique to extractin g all kinds of technical present a lightweight approach Any status on this bug? from unstructured data, we guage text. Our cal artifacts and natural lan ) [...] for M6 (API) or M7 (non-API by a prototype to untangle techni are I'd consider any contributions nical information uncovered g spell checking tools, which Figure 1. Examples of tech optionalposed Manager with API (Eclipse Platform approach is based on existin in Menu in this paper. and ms and A 3.5 fix enta be to of the approach pro available across platfor that behaviour implemwouldtion makeand to have the WorkbenchActionBuilder contributed well-understood, fast, readily gh a of technical artifacts. Throu off by#208626).in 3.5, default early gers turn it on Bug ions contributed MenuMana impartial to different kinds and actionSets/editorAct our approach MenuManagers demonstrate that in the correct place). handcrafted benchmark, we (if I can find MenuManagers technical is able to successfully uncover a wide range of team to make sure we understan a d what the such, mining unstructured dat I'd like us to work with the SWT data. way sure that we aren't getting in the information in unstructured or project-specific terms. As correct platform behavior is, and make ormation onics) seems odd to me, in ge analysis, unstructured dat a, the exchange of inf nt behavior (i.e. turning off mnem is challenging: it is meant for of that. The curre Keywords-text mining, langua we should fix it properly. automated processing using general. If we're going to fix this, technical information. between humans, rather than presents an example of tech- computer machinery. Figure 1 I. I NT RO DU CT ION found in unstructured data. nical information commonly a unique history of design ering technical information Every software system has Recent approaches for discov Wednesday, 11 April, 12 changes, as well as development and e focussed on recognizing 9 ions, software unstructured data [3]–[5] hav
  • 12. Proposed Approach I. Extraction of communication data II. Study impact on software quality III. Study impact on development community Wednesday, 11 April, 12 10
  • 13. Proposed Approach I. Extraction of communication data II. Study impact on software quality III. Study impact on development community Wednesday, 11 April, 12 10
  • 14. Proposed Approach I. Extraction of communication data II. Study impact on software quality III. Study impact on development community Wednesday, 11 April, 12 11
  • 15. Proposed Approach I. Extraction of communication data II. Study impact on software quality III. Study impact on development community Wednesday, 11 April, 12 11
  • 16. Quantify Impact on Quality: Idea Extracted Communication Data Wednesday, 11 April, 12 12
  • 17. Quantify Impact on Quality: Idea Extracted Communication Data compute Social Metrics Wednesday, 11 April, 12 12
  • 18. Quantify Impact on Quality: Idea Extracted Communication Data compute Social Metrics measure relationships Post-Release Defects Wednesday, 11 April, 12 12
  • 19. Discussion Social CONTENT STRUCTURES 4 Dimensions of Measures Measures of Communication WORKFLOW DYNAMICS Wednesday, 11 April, 12 13
  • 20. Conceptual Approach Measure Measure Discussion Post-Release Metrics Bugs 6 months 6 months time LINK USING STATISTICAL MODELS Wednesday, 11 April, 12 14
  • 21. Findings of our work (1) Social metrics explain post-release defects as good as code metrics. Wednesday, 11 April, 12 15
  • 22. Findings of our work (1) Social metrics explain post-release defects as good as code metrics. (2) Combination of social metrics and code metrics is cumulative. Wednesday, 11 April, 12 15
  • 23. Findings of our work (1) Social metrics explain post-release defects as good as code metrics. (2) Combination of social metrics and code metrics is cumulative. (3) Identify factors that have positive and negative relationships with defects. Wednesday, 11 April, 12 15
  • 24. Findings of our work (1) Social metrics explain post-release defects as good as code metrics. (2) Combination of social metrics and code metrics is cumulative. (3) Identify factors that have positive and negative relationships with defects. [ICPC‘2010] (Best Paper) [JEMSE?] Wednesday, 11 April, 12 15
  • 25. Proposed Approach I. Extraction of communication data II. Study impact on software quality III. Study impact on development community Wednesday, 11 April, 12 16
  • 26. Proposed Approach I. Extraction of communication data II. Study impact on software quality III. Study impact on development community Wednesday, 11 April, 12 16
  • 27. Proposed Approach I. Extraction of communication data II. Study impact on software quality III. Study impact on development community Wednesday, 11 April, 12 16
  • 28. Proposed Approach I. Extraction of communication data II. Study impact on software quality III. Study impact on development community Wednesday, 11 April, 12 17
  • 29. Proposed Approach I. Extraction of communication data II. Study impact on software quality III. Study impact on development community Wednesday, 11 April, 12 17
  • 30. Proposed Approach I. Extraction of communication data II. Study impact on software quality III. Study impact on development community Wednesday, 11 April, 12 17
  • 31. Available Knowledge in Data Code Review Systems Mailing Lists Issue Tracking Systems Data on Management of Code Contributions Wednesday, 11 April, 12 18
  • 32. Contribution Management Patch Project Feedback Repository Feedback Submission Review OK Verification OK Integration Wednesday, 11 April, 12 19
  • 33. Studying Impact on Community through Contribution Management Goal: Study how contributors, reviewers, verifiers and the software are impacted by communication (anomalies) through statistical models. Example: Reviewers leaving community due to lack of feedback Wednesday, 11 April, 12 20
  • 34. Available Knowledge in Data Version Control Systems Mailing Lists Issue Tracking Systems Workflow Information Social Networks Wednesday, 11 April, 12 21
  • 35. Evolution of Code-Knowledge Communities Internet Explorer reed masayuki cjcypoi02 dietrich steve.england corevette steffen.wilberg davemgarrett mmortal03 timeless mano fittysix matspal longsonr zurtex matti edilee mconnor cwwmozilla beltzner dveditz adelfino zeniko kliu alice0775 sziadeh mark.finkle robert.bugzilla philringnalda sgautherie.bz kev faaborg johnath martijn.martijn jmjeffery jo.hermans nrthomas gavin.sharp polidobj m-wada XML Parser jbecerra jdarmochwal john.p.baker jruderman mak77 ria.klaassen VYV03354 cbook bomfog dao elmar.ludwig sdaugherty vseerror nightstalkerz l10n highmind63 twalker mh+mozilla klaas1988 ehsan stephen.donner me.at.work phiw hskupin ctalbert tchung tomer marcia timwi rotis uliss sylvain.pasche bugzilla marco.zehe cl-bugs-new2 JavaScript tonglebeak abillings info UI Engine deletesoftware anselm.meyer eddy_nigg matt RainerStroebel samuel.sidler+old alex hasham8888 aarobertxtr manujsabarwal johnjbarton myles7897 paulc shaver smichaud mozilla zhangchunlin dtownsend jdaggett kbrosnan bzbarsky sdwilsh Wednesday, 11 April, 12 22
  • 36. Thesis Progress Tools and techniques Empirical Validation for mining communication repositories of presented tools and techniques Empirical Validation Empirical Validation of relationship between collaboration of relationship between collaboration and software quality. and development teams. Wednesday, 11 April, 12 23
  • 37. Thesis Progress Tools and techniques Empirical Validation for mining communication repositories of presented tools and techniques Empirical Validation Empirical Validation of relationship between collaboration of relationship between collaboration and software quality. and development teams. Wednesday, 11 April, 12 23
  • 38. Thesis Progress Tools and techniques Empirical Validation for mining communication repositories of presented tools and techniques Empirical Validation Empirical Validation of relationship between collaboration of relationship between collaboration and software quality. and development teams. Wednesday, 11 April, 12 23
  • 39. Thesis Progress Tools and techniques Empirical Validation for mining communication repositories of presented tools and techniques Empirical Validation Empirical Validation of relationship between collaboration of relationship between collaboration and software quality. and development teams. Wednesday, 11 April, 12 23
  • 40. Thesis Progress Tools and techniques Empirical Validation for mining communication repositories of presented tools and techniques Empirical Validation Empirical Validation of relationship between collaboration of relationship between collaboration and software quality. and development teams. Wednesday, 11 April, 12 23
  • 41. Points for Discussion • How to do evaluation of code-knowledge communities? (ground truth)? • Applicability to industrial settings (almost no communication data records available)? • Extend work to defect prediction? • Practical implications: management, moderation, staffing, ... ? Wednesday, 11 April, 12 24