Effort Estimation in Agile Software Development Using Autoencoders

Effort estimation is important to correctly plan the use of resources in a software project. When adopting Agile in IT, business value is raised in both performance and quality. A gap in agile effort estimation is the lack of research combining software engineering models and deep learning technique...

Full description

Saved in:
Bibliographic Details
Published in2023 12th International Conference On Software Process Improvement (CIMPS) pp. 1 - 7
Main Authors Sanchez, Eduardo Rodriguez, Santacruz, Eduardo Vazquez, Maceda, Humberto Cervantes
Format Conference Proceeding
LanguageEnglish
Published IEEE 18.10.2023
Subjects
Online AccessGet full text
DOI10.1109/CIMPS61323.2023.10528839

Cover

Loading…
Abstract Effort estimation is important to correctly plan the use of resources in a software project. When adopting Agile in IT, business value is raised in both performance and quality. A gap in agile effort estimation is the lack of research combining software engineering models and deep learning techniques. During the planning phase the team involved makes an approximate estimation of time and cost based on artifacts and requirements obtained from initial interviews with clients and stakeholders. This paper aims to contribute with a hybrid effort estimation model that uses story points which measure the amount of effort needed to accomplish the project, team velocity which measures how many units of effort the team completes in a typical Sprint, and category size labels of effort, time and cost in order to estimate completion time and total cost of a project developed with agile methods like Scrum, The machine learning techniques used to implement the project are neural networks such as autoencoders and different variations of it. The learning capabilities are assessed through 10-Fold cross validation and the estimates are compared with the original dataset and the results obtained from literature. This research uses 21 projects developed by six software houses, a set of 42 noisy data is used for training created using data augmentation technique. Each project has two dependent variables that the algorithm tries to estimate and they are completion time measured in days and total cost valued in Pakistan rupees. The proposed approach compares the use of the original data as input versus the original data with the addition of category size labels. The main idea is that every project has three main features that are scope, time and cost. Since the current work is based on historical data the scope is always fixed and a single project can be estimated according to a hypothetical time or cost which can be small, medium or large.
AbstractList Effort estimation is important to correctly plan the use of resources in a software project. When adopting Agile in IT, business value is raised in both performance and quality. A gap in agile effort estimation is the lack of research combining software engineering models and deep learning techniques. During the planning phase the team involved makes an approximate estimation of time and cost based on artifacts and requirements obtained from initial interviews with clients and stakeholders. This paper aims to contribute with a hybrid effort estimation model that uses story points which measure the amount of effort needed to accomplish the project, team velocity which measures how many units of effort the team completes in a typical Sprint, and category size labels of effort, time and cost in order to estimate completion time and total cost of a project developed with agile methods like Scrum, The machine learning techniques used to implement the project are neural networks such as autoencoders and different variations of it. The learning capabilities are assessed through 10-Fold cross validation and the estimates are compared with the original dataset and the results obtained from literature. This research uses 21 projects developed by six software houses, a set of 42 noisy data is used for training created using data augmentation technique. Each project has two dependent variables that the algorithm tries to estimate and they are completion time measured in days and total cost valued in Pakistan rupees. The proposed approach compares the use of the original data as input versus the original data with the addition of category size labels. The main idea is that every project has three main features that are scope, time and cost. Since the current work is based on historical data the scope is always fixed and a single project can be estimated according to a hypothetical time or cost which can be small, medium or large.
Author Maceda, Humberto Cervantes
Sanchez, Eduardo Rodriguez
Santacruz, Eduardo Vazquez
Author_xml – sequence: 1
  givenname: Eduardo Rodriguez
  surname: Sanchez
  fullname: Sanchez, Eduardo Rodriguez
  email: erodsmx@gmail.com
  organization: Universidad Autónoma Metropolitana,PCyTI,Departamento de Ing. Eléctrica,Ciudad de México
– sequence: 2
  givenname: Eduardo Vazquez
  surname: Santacruz
  fullname: Santacruz, Eduardo Vazquez
  email: evazquez.santacruz@izt.uam.mx
  organization: Universidad Autónoma Metropolitana,PCyTI,Departamento de Ing. Eléctrica,Ciudad de México
– sequence: 3
  givenname: Humberto Cervantes
  surname: Maceda
  fullname: Maceda, Humberto Cervantes
  email: hcm@xanum.uam.mx
  organization: Universidad Autónoma Metropolitana,PCyTI,Departamento de Ing. Eléctrica,Ciudad de México
BookMark eNo1j0FOwzAURI0ECyi9AQtfIMXfjh17GUKglYpAKl1XTvJdWUrtyjEgbk8kYDOjtxm9uSGXIQYkhAJbATBz32xe3nYKBBcrzuYAJrnWwlyQpamMFpIJqaXS1-ShdS6mTNsp-5PNPgbqA62PfkS6iy5_2YT0ET9xjOcThkz3kw9HWn_kiKGPA6bpllw5O064_OsF2T-178262L4-b5p6W3gQZS46UUoENK6UUCk7dM6ZGZzpu95KdGitYrxHBZaLoZIAFrlCwyrt3MCUWJC7312PiIdzmn3T9-H_mvgB0D5KZA
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/CIMPS61323.2023.10528839
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9798350358568
EndPage 7
ExternalDocumentID 10528839
Genre orig-research
GroupedDBID 6IE
6IL
CBEJK
RIE
RIL
ID FETCH-LOGICAL-i134t-b345e1e9f45176adbff99f4f9cbca5efeaa602ce61a23d7511ae26e9078ffd063
IEDL.DBID RIE
IngestDate Wed May 22 07:08:19 EDT 2024
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i134t-b345e1e9f45176adbff99f4f9cbca5efeaa602ce61a23d7511ae26e9078ffd063
PageCount 7
ParticipantIDs ieee_primary_10528839
PublicationCentury 2000
PublicationDate 2023-Oct.-18
PublicationDateYYYYMMDD 2023-10-18
PublicationDate_xml – month: 10
  year: 2023
  text: 2023-Oct.-18
  day: 18
PublicationDecade 2020
PublicationTitle 2023 12th International Conference On Software Process Improvement (CIMPS)
PublicationTitleAbbrev CIMPS
PublicationYear 2023
Publisher IEEE
Publisher_xml – name: IEEE
Score 1.858174
Snippet Effort estimation is important to correctly plan the use of resources in a software project. When adopting Agile in IT, business value is raised in both...
SourceID ieee
SourceType Publisher
StartPage 1
SubjectTerms Agile
Autoencoders
Cost
Costs
Effort
Estimation
Neural Network
Phase measurement
Size measurement
Software algorithms
Time
Time measurement
Training
Title Effort Estimation in Agile Software Development Using Autoencoders
URI https://ieeexplore.ieee.org/document/10528839
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3dS8MwEA9uTz6pOPGbPPja2nw0to9zbExhYzAHexv5uMgQOhktgn-9l3b1CwTf0qTQ5pLLXZLf746QG-0VqMzKSHKeR1IYFRlhTZSoxAvGAadYIApPpmq8kI_LdLkjq9dcGACowWcQh2J9l-82tgpHZajhaUiOm3dIB3duDVmrReck-e3gYTKbo3niIg5JweP29R-JU2q7MTog0_aLDVzkJa5KE9v3X8EY__1Lh6T3RdGjs0_jc0T2oDgm90OPPmhJh6i3DSWRrgvaf0bNp3Ncb9_0Fug3mBCtAQO0X5WbEM8yYJp7ZDEaPg3G0S5JQrRmQpYoWZkCg9zLlN0p7Yz3OT743BqrU_CgtUq4BcU0F-4O_SsNXAHuiTPvHTooJ6RbbAo4RTliixaOQwZeZiANZ9rIEPQsc9JpdUZ6QQCr1yYOxqrt-_kf9RdkP4xDWOlZdkm65baCKzThpbmuh-4D6y-eCw
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PS8MwFA46D3pSceJvc_Da2qZp1hzn2Nh0G4NtsNtImhcZQiejRfCv96Vd_QWCtzSlJH3t6_eSft97hNwpK0AkKfc4Y9LjkRaejlLtBSKwUcgAXzEnFB6NRX_OHxfxYitWL7UwAFCSz8B3zfJfvlmnhdsqQw-PXXFcuUv2EPi5rORaNT8nkPedwWgyRYBike_Kgvv1BT9Kp5TI0Tsk43rMijDy4he59tP3X-kY_z2pI9L8EunRySf8HJMdyE7IQ9diFJrTLnpuJUqkq4y2n9H36RS_uG9qA_QbUYiWlAHaLvK1y2jpWM1NMu91Z52-ty2T4K3CiOdoWx5DCNLyOGwJZbS1Eg-sTHWqYrCglAhYCiJULDItjLAUMAG4Kk6sNRiinJJGts7gDO2IZ1RkGCRgeQJcs1Bp7tKeJYYbJc5J0xlg-VplwljW937xR_8t2e_PRsPlcDB-uiQHzK1VHYWOXZFGvingGgE91zflY_wAZbmhUw
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2023+12th+International+Conference+On+Software+Process+Improvement+%28CIMPS%29&rft.atitle=Effort+Estimation+in+Agile+Software+Development+Using+Autoencoders&rft.au=Sanchez%2C+Eduardo+Rodriguez&rft.au=Santacruz%2C+Eduardo+Vazquez&rft.au=Maceda%2C+Humberto+Cervantes&rft.date=2023-10-18&rft.pub=IEEE&rft.spage=1&rft.epage=7&rft_id=info:doi/10.1109%2FCIMPS61323.2023.10528839&rft.externalDocID=10528839