On Computing the Canonical Features of Software Systems

Bibliographic Details
Published in	2006 13th Working Conference on Reverse Engineering, pp. 93-102
Main Authors	Kothari, J., Denton, T., Mancoridis, S., Shokoufandeh, A.
Format	Conference Proceeding
Language	English
Published	IEEE 01.10.2006
Subjects	Application software; Computer science; Costs; Investments; Reverse engineering; Software measurement; Software systems; Systems engineering and theory

Summary:	Software applications typically have many features that vary in their similarity. We define a measurement of similarity between pairs of features based on their underlying implementations and use this measurement to compute a set of canonical features. The canonical features set (CFS) consists of a small number of features that are as dissimilar as possible to each other, yet are most representative of the features that are not in the CFS. The members of the CFS are distinguishing features, and understanding their implementation provides the engineer with an overview of the system under scrutiny. The members of the CFS can also be used as cluster centroids to partition the entire set of features. Partitioning the set of features can simplify the understanding of large and complex software systems. Additionally, when a specific feature must undergo maintenance, it is helpful to know which features are most closely related to it. We demonstrate the utility of our method through the analysis of the Jext, Firefox, and Gaim software systems.
ISBN:	9780769527192, 0769527191
ISSN:	1095-1350, 2375-5369
DOI:	10.1109/WCRE.2006.39
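
The summary describes three steps: measure similarity between features from their implementations, pick a small canonical set whose members are mutually dissimilar yet representative of the rest, and use that set as centroids to partition all features. The record does not say how the paper computes any of these, so the following is only a minimal sketch under stated assumptions: each feature is represented by the set of functions it exercises, similarity is Jaccard overlap, and the canonical set is chosen by exhaustive search over a simple representativeness-minus-redundancy score. The feature names, the `balance` weight, and all function names are hypothetical and not taken from the paper.

```python
from itertools import combinations


def jaccard(a, b):
    """Assumed similarity measure: overlap of two features' function sets."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def canonical_features(features, k, balance=1.0):
    """Pick k features that are dissimilar to each other but close to the rest.

    Exhaustive search, so only suitable for small inputs; this is an
    illustrative stand-in, not the optimization used in the paper.
    """
    names = sorted(features)
    sim = {(x, y): jaccard(features[x], features[y]) for x in names for y in names}

    def score(cfs):
        outside = [f for f in names if f not in cfs]
        # Representativeness: each outside feature's best match inside the set.
        rep = sum(max(sim[f, c] for c in cfs) for f in outside)
        # Redundancy: pairwise similarity among the chosen features.
        red = sum(sim[a, b] for a, b in combinations(cfs, 2))
        return rep - balance * red

    return max(combinations(names, k), key=score)


def partition(features, cfs):
    """Assign every feature to its most similar canonical feature (centroid)."""
    clusters = {c: [] for c in cfs}
    for f in sorted(features):
        nearest = max(cfs, key=lambda c: jaccard(features[f], features[c]))
        clusters[nearest].append(f)
    return clusters


# Hypothetical feature-to-function mapping, e.g. gathered from execution traces.
features = {
    "open_file": {"read", "parse", "render"},
    "reload": {"read", "parse", "render", "reset"},
    "print": {"render", "spool"},
    "save_file": {"serialize", "write"},
    "save_as": {"serialize", "write", "dialog"},
    "export": {"serialize", "write", "convert"},
}

cfs = canonical_features(features, k=2)
clusters = partition(features, cfs)
```

With this toy input the sketch selects `open_file` and `save_file`, and the partition groups the reading/rendering features around the first and the writing features around the second. The same similarity table also answers the maintenance question in the summary: the features most closely related to a given one are those with the highest similarity to it.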