On the Use of Minhash and Locality Sensitive Hashing for Detecting Similar Lyrics

In this paper, we propose a retrieval system based on similarities between songs. We consider the similarity of songs regarding their lyrics, emotions, genres, or a combination of these attributes. To detect similar lyrics, we applied both minhash and locality-sensitive hashing (LSH) methods to a se...

Full description

Saved in:
Bibliographic Details
Published inEngineering letters Vol. 30; no. 1; p. 227
Main Authors Arboleda, Francisco Javier Moreno, Norena, Felipe Cortes, Alvarez, Benjamin Cruz
Format Journal Article
LanguageEnglish
Published Hong Kong International Association of Engineers 24.02.2022
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:In this paper, we propose a retrieval system based on similarities between songs. We consider the similarity of songs regarding their lyrics, emotions, genres, or a combination of these attributes. To detect similar lyrics, we applied both minhash and locality-sensitive hashing (LSH) methods to a set of songs. We also applied the Watson Tone Analyzer service for detecting emotions. Although experiments with more songs are necessary, our results did not show, e.g., lyrics plagiarism. This finding suggests, at least from a textual point of view, that lyricists are careful on this matter. We also included some artificial similar songs in our set of songs to validate our proposal. Although there were false positives and true negatives, as expected in LSH, this experiment showed the fairness of our proposal.
ISSN:1816-093X
1816-0948