Dawn Foster, Guido Conaldi, Riccardo De Vita
University of Greenwich
Centre for Business Network Analysis
http://www.gre.ac.uk/business/research/centres/cbna/home
Presented at the Third European Conference on Social Networks (EUSN) Mainz, Germany on 27 September 2017
This study investigates collaboration in an open source software community using proximity theory as the theoretical lens with social network analysis and modeling of activities over time to predict collaboration.
Actors in this study are part of the Linux kernel community where they collaborate on one or more sub-projects using mailing lists as the primary method of collaboration. Collaboration occurs in real-time between actors that contribute to multiple sub-projects, work for firms that pay them to contribute to the Linux kernel, and are working virtually from locations across the globe. This complex setting can be better understood by using several dimensions of proximity: organizational, cognitive, institutional, social, and geographical. Collaboration is analysed using data from source code contributions and mailing list participation.
Open source software is developed in the open where anyone can view the source code and anyone with the knowledge to do so can contribute to the project. With no central group responsible for coordination of tasks, collaboration on the development of this software is emergent. Because people from around the world work on these projects together using online tools with publicly accessible interactions between people, it is a relevant setting for using social network analysis to understand and model network relationships.
Understanding Collaboration in Fluid Organizations Using Proximity Theory
1. Understanding Collaboration in Fluid
Organizations, a Proximity Approach
Presented at the Third European Conference on Social Networks (EUSN)
Mainz, Germany on 27 September 2017
Dawn Foster, Guido Conaldi, Riccardo De Vita
University of Greenwich
Centre for Business Network Analysis
http://www.gre.ac.uk/business/research/centres/cbna/home
2. Research Overview
How do participants who are employed by firms
collaborate within a fluid organization?
Proximity theory as a theoretical framework:
• to understand intraorganizational collaboration
• within fluid organizations
• using an open source software project, the
Linux kernel, as the empirical setting.
2
3. Contributions
• Contribute to literature on fluid organizations by determining the
impact of firm affiliation on intraorganizational collaboration
between individuals in fluid organizations.
• Existing studies on open source mostly individual motivations.
• Firms can influence collaboration of employees.
• Demonstrate that proximity theory can be used to better
understand collaboration within fluid organizations.
• Boschma’s (2005) five dimensions should further our understanding.
• Most proximity studies are inter; fluid boundaries blur distinction.
As fluid organizations become more common,
understanding collaboration within them is increasingly important.
3
4. Fluid Organizations
• In fluid organizations, the boundaries and structures allow fluid
movement within the organization as individuals collaborate to
coordinate activities (Ashkenas et al., 2002; Glance & Huberman,
1994).
• Some fluid organizations are based on global virtual work across
many time zones with people from different backgrounds (Nurmi
& Hinds, 2016) and may include individuals from different firms
and different types of institutions (O’Mahony & Bechky, 2008).
• Collaboration, especially within fluid organizations, crosses
dimensions of proximity, including cognitive, organizational,
social, institutional and geographical (Balland, 2012; Boschma,
2005; Cantner & Graf, 2006; Crescenzi, Nathan, & Rodríguez-
Pose, 2016; Knoben & Oerlemans, 2006).
4
5. Proximity Theory
• Social proximity: relations between actors with trust coming from
friendship and experience (Boschma 2005).
• Institutional proximity: whether individuals are in a similar
institutional setting, like corporation, non-profit, university, non-
affiliated, etc. (Balland 2012; Crescenzi et al. 2013).
• Organizational proximity: relationship within and between
organizations (Boschma 2005).
• Cognitive proximity: similarity of frames of reference and knowledge
(Knoben & Oerlemans 2006).
• Geographic Proximity: physical, spatial distance between actors
(Boschma 2005). Online, geographical proximity is often irrelevant, but
some scholars have used time zones (O’Leary & Cummings, 2007).
5
6. Empirical Setting: Open Source
• Open source frequently studied as a fluid organization (e.g. Chen &
O’Mahony, 2009; O'Mahony & Bechky, 2008; Puranam et al., 2014).
• Contributions by individuals, not firms (O’Mahony, 2007), but firms
increasingly have employees contribute as a way to participate
(Jensen & Scacchi, 2007; Roberts et al., 2006).
• Linux Kernel*:
• < 8% of contributions by
unaffiliated software developers
• Neutral project, competing
companies participate
• 22 million lines of code
• 14,000 developers
• 1,300 organisations
6
Linux Kernel
Computer Hardware (CPU, memory, disk)
Linux Operating System (Red Hat, Ubuntu)
Applications (web browser, office)
SystemonlyUserfacing
* Corbet & Kroah-Hartman, 2016
7. Collaboration Network
• Network ties: Mailing Lists – ego replies to
alter
• Collaboration for code review, patch feedback,
bugs & discussions are on mailing lists before
source code is accepted into repository.
7
8. Dataset
USB Mailing List (linux-usb)
• Dates: 2013-10-31 - 2015-10-31
• 60 day moving window
• Messages (Events): 8170 in 3492 threads
• Ties: Ego reply to a message from Alter
• Actors: 892 (Egos: 705, Alters: 712)
8
9. Relational Event Models
• Mailing list data with a time stamps for each message provides
useful data for relational event models when used to explain
likelihood of collaboration between 2 developers given influence
of dimensions of proximity and other effects.
• Predicting events in an ordinal sequence is product of
multinomial likelihoods (Butts, 2008).
• Ordinal model estimated using Multinomial Conditional Logistic
Regression.
• Using clogit in R, which is based on coxph.
• Realized event compared to 10 randomly sampled possible
events (Opsahl & Hogan, 2011).
9
10. Variable Operationalization• Proximity:
• Geographic: time zone similarity
• Organizational: both work for same firm
• Social: number of times dyad participated in same thread
• Cognitive: contribute to same source code subsystems
• Institutional: employed by same type of institution (corporate, non-profit, etc.)
• Dyadic-Level Covariates:
• Alter Maintainer: alter is in a leadership (maintainer) position
• Is Committer: one or both have made code contributions
• Network-Level Covariates:
• Transitive closure: num of x’s ego replied to where x has replied to alter
• Cyclic closure: num of x’s alter replied to where x has replied to ego
• Shared partnership in: same x replies to both ego and alter
• Shared partnership out: ego and alter reply to messages by same x
• Repeated events: number of times ego replied to messages by alter
• Recency effect: 1/n with n as # of people alter emailed before ego
• Participation shift: 1 if last person alter replied to was ego
10
xe a
e a
e a
a
1/3
1/2
1
xa e
xe a
xe a
12. Results Summary
• The results indicate that cognitive, organizational,
and social proximity can be used to predict
collaboration, while institutional and geographic
proximity are not significant.
• Implication: Proximity is relevant and can be used to
explain collaboration in a fluid organization.
• Implication: Firms have at least some influence on
how employees collaborate within fluid organizations.
12
13. Further Research
• Model the decision to send an original
message in addition to the replies (two-
step process).
• Multi-level approach using mailing lists as
levels to simultaneously model multiple
mailing lists as settings for collaboration.
13
14. Thank You and Questions
Authors:
• Dawn M. Foster D.M.Foster@greenwich.ac.uk
@geekygirldawn on Twitter
• Guido Conaldi G.Conaldi@greenwich.ac.uk
• Riccardo De Vita R.DeVita@greenwich.ac.uk
University of Greenwich, Centre for Business Network Analysis
http://www.gre.ac.uk/business/research/centres/cbna
14
15. SourcesAshkenas, R.N. et al., 2002. The Boundaryless Organization: Breaking the Chains of Organizational Structure. 2nd ed. San Francisco: Jossey-
Bass.
Balland, P.A., 2012. Proximity and the Evolution of Collaboration Networks: Evidence from Research and Development Projects within the Global
Navigation Satellite System (GNSS) Industry. Regional Studies, 46(6), pp.741–756.
Boschma, R., 2005. Proximity and Innovation: A Critical Assessment. Regional Studies, 39(1), pp. 61–74.
Butts, C.T., 2008. A relational event framework for social action. Sociological Methodology, 38(1), pp.155-200.
Cantner, U. & Graf, H., 2006. The network of innovators in Jena: An application of social network analysis. Research Policy, 35(4), pp.463–480.
Chen, K.K. & O’Mahony, S., 2009. Differentiating Organizational Boundaries. Research in the Sociology of Organizations, 26, pp. 183–220.
Crescenzi, R., Nathan, M. & Rodríguez-Pose, A., 2016. Do inventors talk to strangers? On proximity and collaborative knowledge creation.
Research Policy, 45(1), pp.177–194.
Corbet, J., Kroah-Hartman, G. & McPherson, A., 2015. Linux Kernel Development: How Fast is it Going, Who is Doing It, What Are They Doing
and Who is Sponsoring the Work, Available at: http://www.linuxfoundation.org/publications/linux-foundation/who-writes-linux-2015.
Dobusch, L. & Schoeneborn, D., 2015. Fluidity, Identity, and Organizationality: The Communicative Constitution of Anonymous. Journal of
Management Studies, 52(8), pp.1005–1035.
Glance, N.S. & Huberman, B.A., 1994. Social dilemmas and fluid organizations, Hillsdale, NJ: Lawrence Erlbaum.
Jensen, C. & Scacchi, W., 2007. Role Migration and Advancement Processes in OSSD Projects: A Comparative Case Study. 29th International
Conference on Software Engineering (ICSE’07), pp. 364–374.
Knoben, J. & Oerlemans, L. a G., 2006. Proximity and inter-organizational collaboration: A literature review. International Journal of Management
Reviews, 8(2), pp.71–89.
March, J.G. & Simon, H.A., 1993. Organizations Second Ed., Malden, MA: Blackwell.
Nurmi, N. & Hinds, P.J., 2016. Job Complexity & Learning Opportunities: A Silver Lining in the Design of Global Virtual Work. Journal of
International Business Studies, 47(6), pp. 1–24.
O'Leary, M.B. and Cummings, J.N., 2007. The spatial, temporal, and configurational characteristics of geographic dispersion in teams. MIS
Quarterly, 31(3), pp. 433-452.
O’Mahony, S., 2007. The governance of open source initiatives: What does it mean to be community managed? Journal of Management and
Governance, 11(2), pp. 139–150.
O’Mahony, S. & Bechky, B.A., 2008. Boundary Organizations: Enabling Collaboration among Unexpected Allies. Administrative Science Quarterly,
53(3), pp. 422–459.
Opsahl, T. and Hogan, B., 2011. Modeling the evolution of continuously-observed networks: Communication in a Facebook-like community. arXiv
preprint arXiv:1010.2141.
Puranam, P., Alexy, O. & Reitzig, M., 2014. What’s “New” About New Forms of Organizing? Academy of Management Review, 39(2), pp. 162–180.
Quintane, E., Pattison, P.E., Robins, G.L. and Mol, J.M., 2013. Short-and long-term stability in organizational networks: Temporal structures of
project teams. Social Networks, 35(4), pp.528-540.
Roberts, J., Hann, I. & Slaughter, S., 2006. Understanding the motivations, participation, and performance of open source software developers: A
longitudinal study of the Apache projects. Management Science, 52(7), pp. 984–999.