Comparing the Effectiveness of Using Design and Code Measures in Software Faultiness Estimation

IRIS - Institutional Research Information System
IRIS è il sistema di gestione integrata dei dati della ricerca (persone, progetti, pubblicazioni, attività) adottato dall'Università degli Studi dell’Insubria.

IRInSubria - Institutional Repository Insubria
IRInSubria raccoglie, conserva, documenta e dissemina le informazioni sulla produzione scientifica dell'Università degli Studi dell’Insubria anche ai fini della valutazione della ricerca.

Background. Early identification of software modules that are likely to be faulty helps practitioners take timely actions to improve these modules' quality and reduce development costs in the remainder of the development process. To this end, module faultiness estimation models can be built at any point during development by using measures collected up to that time. Models available in later phases are expected to be more accurate than those available in earlier phases. However, waiting until late in the development process may reduce the impact of the effectiveness and efficacy of any software quality improvement actions and increase their cost.Aims. Our goal is to investigate to what extent using software code measures along with software design measures helps improve the accuracy of module faultiness estimation with respect to using software design measures alone.Method. We built faultiness estimation models-by using Binary Logistic Regression, Naive Bayes, Support Vector Machines, and Decision Trees-for 54 datasets from the PROMISE repository. These datasets contain design and code measures and faultiness data of software modules of real-life projects. We compared the models built by using the code measures and design measures together against the models built by using design measures alone via a few accuracy indicators.Results. The results indicate that the models built by using code measures and design measures together are only slightly more accurate than the models built by using design measures alone.Conclusions. Our analysis shows that measures that can be obtained during design can provide models that are almost as accurate as models that can be achieved in later development phases. This is good news for practitioners, who can start early-hence cheaper and more effective-quality improvement initiatives based on fairly reliable models.

Comparing the Effectiveness of Using Design and Code Measures in Software Faultiness Estimation

Morasca, Sandro;Lavazza, Luigi

2019-01-01

Abstract

Background. Early identification of software modules that are likely to be faulty helps practitioners take timely actions to improve these modules' quality and reduce development costs in the remainder of the development process. To this end, module faultiness estimation models can be built at any point during development by using measures collected up to that time. Models available in later phases are expected to be more accurate than those available in earlier phases. However, waiting until late in the development process may reduce the impact of the effectiveness and efficacy of any software quality improvement actions and increase their cost.Aims. Our goal is to investigate to what extent using software code measures along with software design measures helps improve the accuracy of module faultiness estimation with respect to using software design measures alone.Method. We built faultiness estimation models-by using Binary Logistic Regression, Naive Bayes, Support Vector Machines, and Decision Trees-for 54 datasets from the PROMISE repository. These datasets contain design and code measures and faultiness data of software modules of real-life projects. We compared the models built by using the code measures and design measures together against the models built by using design measures alone via a few accuracy indicators.Results. The results indicate that the models built by using code measures and design measures together are only slightly more accurate than the models built by using design measures alone.Conclusions. Our analysis shows that measures that can be obtained during design can provide models that are almost as accurate as models that can be achieved in later development phases. This is good news for practitioners, who can start early-hence cheaper and more effective-quality improvement initiatives based on fairly reliable models.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2019
			
	Titolo del volume
	
				PROCEEDINGS of EASE 2019 - Evaluation and Assessment in Software Engineering
			
	ISBN
	
				9781450371452
			
	Titolo del congresso
	
				23rd Evaluation and Assessment in Software Engineering Conference, EASE 2019
			
	Luogo del Congresso
	
				IT University Copenhagen, dnk
			
	Data del Congresso
	
				2019
			
	Appare nelle tipologie:
	
				Relazione (in Volume)

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11383/2079009

Attenzione

L'Ateneo sottopone a validazione solo i file PDF allegati

Citazioni

ND

3

1

social impact