Post or Block? Advances in Automatically Filtering Undesired Comments

Bibliographic Details
Published in: Journal of intelligent & robotic systems, Vol. 80, no. Suppl 1, pp. 245-259
Main Authors: Alberto, Túlio C.; Lochter, Johannes V.; Almeida, Tiago A.
Format: Journal Article
Language: English
Published: Dordrecht, Springer Netherlands, 01.12.2015
Springer Nature B.V.
ISSN: 0921-0296, 1573-0409
DOI: 10.1007/s10846-014-0105-y

Summary: Currently, a great volume of the information available on websites such as social networks, forums and blogs comes from interaction with users, who can post comments and sometimes become regular visitors. Some blogs that specialize in certain subjects gain credibility with users and become references in their field. Nevertheless, the ease of inserting content through text comments makes room for unwanted messages, which degrade the user experience, reduce the quality of the information provided by the websites and indirectly cause personal and economic losses. In this scenario, this paper presents a comprehensive study of established machine learning techniques applied to automatically detecting undesired comments posted on blogs. Furthermore, different sets of attributes were evaluated along with text normalization techniques. Experiments carried out with a real, public database indicate that support vector machines, logistic regression and stacking ensemble methods, trained with attributes extracted from both the text messages and the posting information, are promising for the task of blocking undesired comments.