Automatic Classification of Apps Reviews for Requirement Engineering: Exploring the Customers Need from Healthcare Applications

In one year, more than 6.5 million mobile applications have been listed for download on the application stores. That is, they are used by millions (or billions) of users across the world. Users express their daily experience of applications as reviews on those stores. This experience may include rep...

Full description

Saved in:
Bibliographic Details
Published in2019 Sixth International Conference on Social Networks Analysis, Management and Security (SNAMS) pp. 541 - 548
Main Authors Al Kilani, Nadeem, Tailakh, Rami, Hanani, Abualsoud
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.10.2019
Subjects
Online AccessGet full text
DOI10.1109/SNAMS.2019.8931820

Cover

Abstract In one year, more than 6.5 million mobile applications have been listed for download on the application stores. That is, they are used by millions (or billions) of users across the world. Users express their daily experience of applications as reviews on those stores. This experience may include reporting bugs, demanding new features, posting feedback with regards to performance, reporting security issues, demanding user interface enhancements, and other needs. Interestingly, reviews could contain valuable information for the interest of application vendors and developers. However, the volume of such data is as huge, that is, traditional searching algorithms may not be efficient in extracting such useful information. Machine learning and data mining techniques are one of the popularly used algorithms to efficiently extracting significant information for Software Requirement Engineering; a key phase in the Software Engineering Life Cycle. In this paper, we experience machine learning algorithms and natural language processing techniques to classify a set of reviews about healthcare-domain applications into multiple types of categories such as bug reports, new feature requests, application performance, and user interface. For this purpose, we could extract more than 7500 reviews of ten different health-related mobile applications. More importantly, those reviews were annotated manually by software experts. In our experiments, we use the Weka tool employing different machine learning algorithms. We will also show what algorithms and features will perform better; in terms of accuracy using different evaluation metrics, when classifying reviews about mobile apps into various classes; bugs, new features, sentimental, general bug, usability, security, and performance. Moreover, the conducted experiments show that the overall performance improves when we use the data subset with highly confident labeling; when two experts agree on the same class. For the imbalanced-data problem, this research will show the effect of applying resampling techniques on improving classification accuracy as well.
AbstractList In one year, more than 6.5 million mobile applications have been listed for download on the application stores. That is, they are used by millions (or billions) of users across the world. Users express their daily experience of applications as reviews on those stores. This experience may include reporting bugs, demanding new features, posting feedback with regards to performance, reporting security issues, demanding user interface enhancements, and other needs. Interestingly, reviews could contain valuable information for the interest of application vendors and developers. However, the volume of such data is as huge, that is, traditional searching algorithms may not be efficient in extracting such useful information. Machine learning and data mining techniques are one of the popularly used algorithms to efficiently extracting significant information for Software Requirement Engineering; a key phase in the Software Engineering Life Cycle. In this paper, we experience machine learning algorithms and natural language processing techniques to classify a set of reviews about healthcare-domain applications into multiple types of categories such as bug reports, new feature requests, application performance, and user interface. For this purpose, we could extract more than 7500 reviews of ten different health-related mobile applications. More importantly, those reviews were annotated manually by software experts. In our experiments, we use the Weka tool employing different machine learning algorithms. We will also show what algorithms and features will perform better; in terms of accuracy using different evaluation metrics, when classifying reviews about mobile apps into various classes; bugs, new features, sentimental, general bug, usability, security, and performance. Moreover, the conducted experiments show that the overall performance improves when we use the data subset with highly confident labeling; when two experts agree on the same class. For the imbalanced-data problem, this research will show the effect of applying resampling techniques on improving classification accuracy as well.
Author Tailakh, Rami
Hanani, Abualsoud
Al Kilani, Nadeem
Author_xml – sequence: 1
  givenname: Nadeem
  surname: Al Kilani
  fullname: Al Kilani, Nadeem
  organization: Birzeit University
– sequence: 2
  givenname: Rami
  surname: Tailakh
  fullname: Tailakh, Rami
  organization: University of Bedfordshire
– sequence: 3
  givenname: Abualsoud
  surname: Hanani
  fullname: Hanani, Abualsoud
  organization: Birzeit University
BookMark eNotkMFOwzAQRI0EB1r4Abj4BxJiu6ltblEUKFIpEgWJW2XCurWU2MF2oJz4dVzR085oV280O0Gn1llA6IoUOSGFvFmvqsd1TgsicyEZEbQ4QRPCqSBUzuZv5-i3GqPrVTQtrjsVgtGmTc5Z7DSuhiHgZ_gy8B2wdj7pz9F46MFG3NitsQDe2O0tbvZD5w4Sxx3gegwJCj7gFcAH1t71eAGqi7tWeThgu2NKuEBnWnUBLo9zil7vmpd6kS2f7h_qapkZWrCYlXMtgYmCUi7bUr-3LaNcKym1ACqUKstUDTQDrjRPOzFTPPXnSoh0qzmbout_rgGAzeBNr_zP5vgT9ge7UF6c
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/SNAMS.2019.8931820
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Xplore
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 172812946X
9781728129464
EndPage 548
ExternalDocumentID 8931820
Genre orig-research
GroupedDBID 6IE
6IL
CBEJK
RIE
RIL
ID FETCH-LOGICAL-i203t-56f9e3802279c5fbcc327fa99f8e28aa55820ef3e7af7c3284a71097a88bccf73
IEDL.DBID RIE
IngestDate Thu Jun 29 18:38:00 EDT 2023
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i203t-56f9e3802279c5fbcc327fa99f8e28aa55820ef3e7af7c3284a71097a88bccf73
PageCount 8
ParticipantIDs ieee_primary_8931820
PublicationCentury 2000
PublicationDate 2019-Oct.
PublicationDateYYYYMMDD 2019-10-01
PublicationDate_xml – month: 10
  year: 2019
  text: 2019-Oct.
PublicationDecade 2010
PublicationTitle 2019 Sixth International Conference on Social Networks Analysis, Management and Security (SNAMS)
PublicationTitleAbbrev SNAMS
PublicationYear 2019
Publisher IEEE
Publisher_xml – name: IEEE
Score 1.8014929
Snippet In one year, more than 6.5 million mobile applications have been listed for download on the application stores. That is, they are used by millions (or...
SourceID ieee
SourceType Publisher
StartPage 541
SubjectTerms Computer bugs
Data Annotation
Feature extraction
Machine learning algorithms
Medical services
Mobile applications
Requirements Engineering
Security
Software
Supervised Machine Learning
Text Classification
User's Reviews
Title Automatic Classification of Apps Reviews for Requirement Engineering: Exploring the Customers Need from Healthcare Applications
URI https://ieeexplore.ieee.org/document/8931820
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PS8MwFA7bTp5UNvE37-DRbrNJm8TbGI4hbAg62G2k6QuIsIq2Fy_-676kW53iwVtoQ9O-V3hf8r7vPcauCHVbwgE2svIGI8GliBRiFiVW5KhSjE3mtcOzeTpdiPtlsmyx60YLg4iBfIZ9Pwy5_LywlT8qG1Bs9fXG26xNv1mt1drqYIZ68Dinva8na5H364k_OqaEgDHZZ7PtUjVP5KVflVnffvyqwvjfdzlgvW9pHjw0QeeQtXDdZZ-jqixC7VUITS49_SdYHAoHhDPfoU4BvANBVBp79m84FoSdcoS30PDxgFAhjCvChf5UG-a0GHgdCkwbthiMdjLfPbaY3D2Np9Gms0L0HA95GSWp08hVKB9oE5dZy2PpjNZOYayMSRL6NHQcpXGS7ilhPGdTGqVorpP8iHXWxRqPGehUqxwzY1TYPQlND7DSamHyVNmEn7CuN97qtS6esdrY7fTvy2dszzuwZsuds075VuEFRf0yuwzu_gLrzLJZ
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV09T8MwELVKGWAC1CK-8cBI2pLEsc1WVVQB2giJVupWOc5ZQkgNosnCwl_n7LShIAY2K7Hi-G64Z9-7d4RcIerWiAO0p_kNeGHAQ08ApB7TYQYiAl-ltnZ4nETxNHyYsVmDXNe1MADgyGfQsUOXy89yXdqrsi7GVqs3vkW2Me6HrKrWWlfC9GT3OcHTr6Vrof-rqT96priQMdwj4_ViFVPktVMWaUd__NJh_O_f7JP2d3EefarDzgFpwKJFPvtlkTv1VeraXFoCkLM5zQ1FpLmkVRJgSRGk4tjyf93FIN0QJLylNSOPIi6kgxKRob3XpgkuRm0lCo1rvhjtb-S-22Q6vJsMYm_VW8F78XtB4bHISAiEExDUzKRaBz43SkojwBdKMYZbAxMAV4bjOxEqy9rkSgica3hwSJqLfAFHhMpIigxSpYQ7P4USP6C5lqHKIqFZcExa1njzt0o-Y76y28nfjy_JTjwZj-aj--TxlOxaZ1bcuTPSLN5LOEcMUKQXzvVfaky1pg
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2019+Sixth+International+Conference+on+Social+Networks+Analysis%2C+Management+and+Security+%28SNAMS%29&rft.atitle=Automatic+Classification+of+Apps+Reviews+for+Requirement+Engineering%3A+Exploring+the+Customers+Need+from+Healthcare+Applications&rft.au=Al+Kilani%2C+Nadeem&rft.au=Tailakh%2C+Rami&rft.au=Hanani%2C+Abualsoud&rft.date=2019-10-01&rft.pub=IEEE&rft.spage=541&rft.epage=548&rft_id=info:doi/10.1109%2FSNAMS.2019.8931820&rft.externalDocID=8931820