The Open University

What does it take to develop a million lines of open source code?

Fernandez-Ramil, Juan and Izquierdo-Cortazar, Daniel (2009). What does it take to develop a million lines of open source code? In: 5th International Conference on Open Source Systems, 3-6 June 2009, Skovde, Sweden.

Full text available as:
Full text not publicly available
Due to copyright restrictions, this file is not available for public download

Abstract

This article presents a preliminary, exploratory study of the relationship between size, on the one hand, and effort, duration and team size, on the other, for 11 Free/Libre/Open Source Software (FLOSS) projects whose current size ranges between 0.6 and 5.3 million lines of code (MLOC). Effort was operationalised based on the number of active committers per month. The extracted data were a poor fit for an early version of COCOMO, a cost estimation model for proprietary (closed-source) software, overall suggesting that, at least to some extent, FLOSS communities are more productive than closed-source teams. This also motivated the need for FLOSS-specific effort models. As a first approximation, we evaluated 16 linear regression models involving different pairs of attributes. One of our experiments was to calculate the net size, that is, to remove any suspiciously large outliers or jumps in the growth trends. The best model we found involved effort against net size, accounting for 79 percent of the variance. This model was based on data excluding a possible outlier (Eclipse), the largest project in our sample. This suggests that different effort models may be needed for certain categories of FLOSS projects. Incidentally, for each of the 11 individual FLOSS projects we were able to model the net size trends with very high accuracy (R^2 ≥ 0.98). Of the 11 projects, 3 have grown superlinearly, 5 linearly and 3 sublinearly, suggesting that in the majority of cases accumulated complexity is either well controlled or does not constitute a growth-constraining factor.
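
The abstract reports linear regression models judged by the share of variance they explain. As a rough illustration of that kind of fit, and not the authors' actual procedure, the sketch below fits a straight line of effort against net size by ordinary least squares and reports R^2. The function name `fit_line`, the unit "committer-months", and every number in the sample lists are assumptions made up for illustration; none of them come from the study.

```python
def fit_line(xs, ys):
    """Ordinary least squares fit of y = a + b*x.

    Returns (a, b, r_squared), where r_squared is the fraction of the
    variance in ys explained by the fitted line.
    """
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of squared deviations of x, and cross-deviations of x and y.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    b = sxy / sxx
    a = mean_y - b * mean_x
    # Residual and total sums of squares give R^2 = 1 - SS_res / SS_tot.
    ss_res = sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return a, b, 1.0 - ss_res / ss_tot


# Placeholder values only: NOT data from the study.
net_size_mloc = [0.6, 1.0, 1.8, 2.5, 3.4]                          # hypothetical net sizes (MLOC)
effort_committer_months = [400.0, 700.0, 1100.0, 1800.0, 2200.0]   # hypothetical effort values

intercept, slope, r2 = fit_line(net_size_mloc, effort_committer_months)
print(f"effort = {intercept:.1f} + {slope:.1f} * net_size, R^2 = {r2:.3f}")
```

An R^2 of 0.79, as reported for the best model, would mean the fitted line accounts for 79 percent of the variance in effort across the projects included in the fit.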

Item Type: Conference Item
Copyright Holders: 2009 Springer-Verlag
Project Funding Details:
Funded Project Name: Not Set
Project ID: Not Set
Funding Body: Belgian FRS-RNRS - postdoctoral scholarship 2.4519.05
Keywords: open source; software economics; metrics; empirical studies; COCOMO model
Academic Unit/Department: Mathematics, Computing and Technology > Computing & Communications
Interdisciplinary Research Centre: Centre for Research in Computing (CRC)
Item ID: 27248
Depositing User: Juan Fernandez-Ramil
Date Deposited: 25 Jan 2011 11:04
Last Modified: 27 Oct 2012 11:44
URI: http://oro.open.ac.uk/id/eprint/27248