Post or Block? Advances in Automatically Filtering Undesired Comments

Currently, a great volume of the available information on several websites comes from the interaction with users, such as social networks, forums and blogs, where readers can post comments and sometimes develop habits of frequenting them. Some blogs specialized in certain subjects, gain the users cr...

Full description

Saved in:

Bibliographic Details
Published in	Journal of intelligent & robotic systems Vol. 80; no. Suppl 1; pp. 245 - 259
Main Authors	Alberto, Túlio C., Lochter, Johannes V., Almeida, Tiago A.
Format	Journal Article
Language	English
Published	Dordrecht Springer Netherlands 01.12.2015 Springer Nature B.V
Subjects	Artificial Intelligence Blocking Blogs Control Economic impact Economics Electrical Engineering Engineering Ensemble learning Filtering Machine learning Mechanical Engineering Mechatronics Messages Readers Robotics Social networks Stacking Support vector machines Texts User experience Websites Undesired messages Natural language processing Supervised learning Classification
Online Access	Get full text
ISSN	0921-0296 1573-0409
DOI	10.1007/s10846-014-0105-y

Cover

Loading…

More Information
Summary:	Currently, a great volume of the available information on several websites comes from the interaction with users, such as social networks, forums and blogs, where readers can post comments and sometimes develop habits of frequenting them. Some blogs specialized in certain subjects, gain the users credibility and become references in the field. Nevertheless, the ease of inserting content through text comments makes room for unwanted messages, which affect the user experience, reduce the quality of the information provided by the websites and indirectly cause personal and economic losses. In this scenario, this paper presents a comprehensive study of established machine learning techniques applied to automatically detect undesired comments posted on blogs. Furthermore, different sets of attributes were evaluated along with text normalization techniques. Experiments carried out with a real and public database indicate that support vector machines, logistic regression and stacking ensemble methods, trained with both attributes extracted from the text messages and posting information, are promising for the task of blocking undesired comments.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23
ISSN:	0921-0296 1573-0409
DOI:	10.1007/s10846-014-0105-y