Vertical Interaction in Open Software Engineering Communities Patrick Wagstrom Ph.D. Thesis Defense March 9, 2009 Committee: James Herbsleb Kathleen Carley M. Granger Morgan Audris Mockus

2

3

http://www.flickr.com/photos/nixternal/3131672372/

Open Source is BIG Business Year 2008 2008 2007 2007 2006 2003 1999

4

Target MySQL Trolltech Zimbra XenSource JBoss SuSE Cygnus

Buyer Sun Nokia Yahoo! Citrix RedHat Novell RedHat

Amount $1 billion $153 million $350 million $500 million $350 million $210 million $675 million

Open Communities are Bigger

5

From March 2008 Eclipse Executive Director's Report: http://www.eclipse.org/org/foundation/membersminutes/20080317MembersMeeting/DirectorsReport.pdf

Central Players In Open Source Foundations

Commercial Firms

Developers

6

4 Empirical Studies

7



Firms and Foundations



Firms and Firms



Firms and Individuals



Individuals and Individuals

Firms and Foundations:

Guiding an Ecosystem to Promote Value 8

The Problem ●





9

Some research has been done about why individual focused OSS projects utilize foundations Little research has addressed why commercial firms would participate in foundations –

Large monetary cost



Giving up some control



Possibly increased work

What does the foundation do to drive value?

Data ●

Semi-structured interviews with Eclipse Foundation staff and employees of member companies –

10

38 interviews with 40 individuals



Face-to-face meetings at EclipseCon 2007 and 2008



Participation in Eclipse members meetings

Driving Value Creation

11



Non-market player



Introduction of process



Value of the Eclipse brand and marketing



Organizational structure driving value



Platform for innovation

Non-Market Player ●

Eclipse grew out of IBM's old VisualAge ecosystem



Small firms had to worry about being stepped on



Allows innovation without worry about “Gorillas”



12

Opens the door for distribution based business models

Platform for Innovation ●



Foundation actively recruits new members Encourages components to be as modular as possible –





13

Modularity == Independence from other components

Create projects outside of Eclipse and bring inside later Push usage outside traditional realms

Takeaways ●

14

Eclipse Foundation has taken concrete steps to build ecosystem



Governance structure ensures all can provide input



Non-market nature is very beneficial



Services provided for members are worth the cost

Firms and Firms:

Business Collaboration Through Open Source 15

The Problem

16



Much data about how individuals interact in OSS



Little data about how firms collaborate



Is there an overdependence on single firms?



How collaborative are OSS ecosystems?

Data ●

Projects from Eclipse Foundation



Two level project hierarchy



17



Top Level Projects (11)



Sub Projects (89)

Collected data from version control system and IP repository



Ties individuals to code changes and firms



Compared with data from GNOME

How Much Collaboration Really Exists?

18

eclipse.platform tools.cdt

IBM Leaves/QNX Lead

Collaboration in CDT

WindRiver Joins/IBM Lead 19

WindRiver Leads

Who Builds the Platform?

20

Community Network Structure IBM

GNOME May 2005

Eclipse.platform tools.cdt

gtk

Eclipse May 2008

21

Takeaways ●







22

Participation in an OSS ecosystem may require little collaboration with other firms Many key portions of Eclipse are centered on IBM Allows IBM to exert great influence, even though no longer at the center The organic community around GNOME shows much more collaboration

Firms and Individuals: 23

The Impact of Commercial Participation on Volunteer Participation

The Problem ●





24

Commercial firms have different interests than volunteer OSS developers Firms bring many resources to projects that benefit projects What impact do these firms have on volunteer participation?

Data ●

25

Source code version control, bug tracker, and email lists from GNOME project



Individuals are disambiguated and identities linked



Commercial affiliation for developers identified



Face to face interviews with 18 developers

Firm Classifications ●

9 major firms in community



Divided into two categories -





26



Product focused



Community focused

Validated through interviews Developers from community focused firms generally more active within the community

Do commercial developers drive away volunteers? ●

Designed a multilevel model to predict current volunteers based on previous participation

VolDevsi , t =01 VolDevs i , t−12 ComDevs i , t−13 Commitsi , t−1i  i , t

Variable Estimate Std Error P-Value Intercept VolDevs ComDevs Commits

0.5643 0.4562 0.0817 0.0601

0.1397 0.0442 0.0389 0.0242

0.0001 <0.001 0.0360 0.0130

No! They actually have a slight positive impact on the number of volunteers! 27

Do commercial developers drive away volunteers (by firm)? Variable Estimate Std Error P-Value Intercept VolDevs ComDevs(CF) ComDevs(PF) Commits

0.6032 0.4212 0.2050 -0.0433 0.0711

0.1381 0.0443 0.0432 0.0388 0.0234

<0.001 <0.001 <0.001 0.264 0.003

Developers at community focused firms have a significant attractive power while developers at product focused firms have no relation.

28

Takeaways ●



29

Commercial firms do increase volunteer participation in Open Source Community focused firms have a much greater attractive power than product focused firms

Individuals and Individuals: 30

Evolution of the SocioTechnical Congruence Metric

The Problem ●

STC hasn't been replicated in OSS



Difficult to distill to individual level





Typically done at network level



Ratio muddles effects of coordination requirements and actual coordination

Original analysis looked only at short term –

31

Most software projects are long term

Data ●







GNOME project Filtered for projects that had CVS, bug tracker, and mailing list archives Do not have as much developer information as Cataldo et. al. Examine time to resolve bugs –

32

Only include those bugs marked as defects

Individualized STC ∑  C A∧C R  ∑CR

Proportion of coordination requirements that are mirrored in the actual communication network.

[ ][ ][ ]

0 1 1 0 0 0 1 1 1 0 0 1 0 0 1 1 ∧ = 1 0 0 1 1 1 0 1 0 1 1 0  1 1 1 0  CA

33

CR

0 0 1 0

0 0 0 1

1 0 0 1

0 1 1 0

6 =0.6 10 2 =0.5 4

Individualized STC

34

Testing Individualized STC ●

Predict log2 of time to resolve defect



Independent variables –

Number of developers active on defect



Number of people changing defect status



Number of comments made



Individualized STC for developers

Variable Estimate Std Error P-Value Intercept 1.9707 0.0581 NumDevs 0.2846 0.0301 DeltaPeople 0.8074 0.0176 Comments -0.0142 0.0036 UIC -1.2140 0.0770 R^2=0.134, DF=26507, p < 0.0001 35

<0.0001 <0.0001 <0.0001 <0.0001 <0.0001

Disambiguating Results

[ ][ ][ ]

0 1 1 0 0 0 1 1 1 0 0 1 0 0 1 1 ∧ = 1 0 0 1 1 1 0 1 0 1 1 0  1 1 1 0  CA

0 0 1 0

0 0 0 1

1 0 0 1

0 1 1 0

CR

Extra Communication Coordination Requirements Matched Communication

Variable Estimate Std Error P-Value

Intercept 1.4590 0.0568 NumDevs 0.2500 0.0306 DeltaPeople 0.8020 0.0177 Comments -0.0125 0.0036 MatchedComm -0.0524 0.0056 0.0314 0.0032 CoordReq extraComm -0.0119 0.0035 R^2=0.132, DF=26505, p < 0.0001 36

<0.0001 <0.0001 <0.0001 0.0006 <0.0001 <0.0001 0.0006

Takeaways ●





37

Demonstrated a method to individualize STC Should break apart STC metric into it's constituent portions Extra communication, not related to coordination requirements, improves task performance

Conclusions

38

Building OSS Communities ●

Not a matter of just throwing code out there



Designating non-market player for head is helpful



39

Need to find way to drive additional value to members, beyond just software



Enable members to work independently



Watch the centralization of components



Invite firms to participate with volunteers



Encourage discussion in the community

Thank You! This work was supported in part by a National Science Foundation graduate research fellowship, the National Science Foundation (IIS-0414698), the IGERT Training Program in CASOS(NSF,DGE-9972762), the Office of Naval Research under Dynamic Network Analysis program (N00014-02-1-0973), the Air Force Office of Sponsored Research (MURI: Cultural Modeling of the Adversary, 600322), the Army Research Lab (CTA: 20002504), and the Army Research Institute (W91WAW07C0063) for research in the area of dynamic network analysis. Additional support was provided by CASOS - the center for Computational Analysis of Social and Organizational Systems at Carnegie Mellon University. The views and conclusions contained in this document are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of the the National Science Foundation, the Office of Naval Research, the Air Force Office of Sponsored Research, the Army Research Lab, or the Army Research Institute. And more folks than I can fit on a single slide.

Thanks! 40

Vertical Interaction in Open Software Engineering ... - Patrick Wagstrom's

Mar 9, 2009 - Cygnus. $675 million. MySQL. Trolltech. Nokia. Zimbra. XenSource. Citrix. JBoss. RedHat. SuSE ... Top Level Projects (11). – Sub Projects (89).

3MB Sizes 1 Downloads 127 Views

Recommend Documents

Vertical Interaction in Open Software Engineering ... - Patrick Wagstrom's
Ph.D. Thesis Defense. March 9, 2009 ... Data. ○ Semi-structured interviews with Eclipse Foundation staff and ... Community Network Structure. Eclipse. GNOME.

Vertical Interaction in Open Software Engineering ...
Carnegie Insitute of Technology and School of Computer Science. Carnegie Mellon ... for the degree of Doctor of Philosophy in ... Analysis program (N00014-02-1-0973, the Air Force Office of Sponsored Research (MURI: Cultural Mod- eling of the ..... 4

Vertical Interaction In Open Software Engineering ...
Mar 5, 2008 - Shipped Apache web server. – Utilized Open Source purely as a .... to get access to some info from Eclipse Foundation. – The web accessible ...

requirement engineering process in software engineering pdf ...
requirement engineering process in software engineering pdf. requirement engineering process in software engineering pdf. Open. Extract. Open with. Sign In.

Open Source Software for Routing
ISIS (IPv6) (and ISIS IPv4 is not yet useable). • Multiple branches of Quagga: -. Quagga.net (official “Master” branch), Euro-IX, Quagga-RE and more. 17.

Reenactment in Software Engineering Studies
Developers: • Rarely begin with a good query: hard to choose the right words. • Analyze very briefly list of results before reformulating query. • Even after ...

A Social Network Approach to Free/Open Source ... - Patrick Wagstrom
data from a social networking web site, Advogato.org, .... that most users, 99%, have users have a social network of fewer than 10 alters for the time period.

Open Source & Libre Software in medical practice.pdf
Open Source & Libre Software in medical practice.pdf. Open Source & Libre Software in medical practice.pdf. Open. Extract. Open with. Sign In. Main menu.

Software Engineering - GitHub
Sep 26, 2011 - into an application used by nearly a million people to store over two million code ... “Continuous Integration is a software development practice ...

Software Engineering
directed system for software engineering process improvement. Both products are used ... associated with software process improvement; and Software Shock (Dorset House), a treat- ment that focuses on ..... Security Testing 497. 18.6.3 ..... the Unive

Mining Software Engineering Data
Apr 9, 1993 - To Change. Consult. Guru for. Advice. New Req., Bug Fix. “How does a change in one source code entity propagate to other entities?” No More.

Software Engineering -
individual components? – How is function or data structure detail separated from .... (1) User interface classes define all abstractions that are necessary for Human ... enables data mining or knowledge discovery that can have an impact on the ...

Mobile Software Engineering - cs164
singletons, factories, observers, ... Page 23. unit testing. PHPUnit, Selenium, ... Page 24. UX. Page 25. performance latency, caching, ... Page 26. source control git, subversion. Page 27. IDEs. Xcode, ... Page 28. PHP frameworks. CodeIgniter. Page

Software Engineering
13.4.7. Data Structure 349. 13.4.8. Software Procedure 351. 13.4.9 ...... (e.g., Resisting the Virtual Life, edited by James Brook and Iain Boal and The Future ..... gan Kaufmann, 2000) suggest that the widespread impact of the PC will decline as.