Using general-purpose compression algorithms for music analysis
Published in | Journal of new music research Vol. 45; no. 1; pp. 1 - 16 |
---|---|
Main Authors | , |
Format | Journal Article |
Language | English |
Published | Abingdon, Routledge, 02.01.2016, Taylor & Francis Ltd |
Summary: | General-purpose compression algorithms encode files as dictionaries of substrings with the positions of these strings' occurrences. We hypothesized that such algorithms could be used for pattern discovery in music. We compared LZ77, LZ78, Burrows-Wheeler and COSIATEC on classifying folk song melodies. A novel method was used, combining multiple viewpoints, the k-nearest-neighbour algorithm and a novel distance metric, corpus compression distance. Using single viewpoints, COSIATEC outperformed the general-purpose compressors, with a classification success rate of 85% on this task. However, by combining 8 of the 10 best-performing viewpoints, including seven that used LZ77, the classification success rate rose to over 94%. In a second experiment, we compared LZ77 with COSIATEC on the task of discovering subject and countersubject entries in fugues by J.S. Bach. When voice information was absent in the input data, COSIATEC outperformed LZ77 with a mean F1 score of 0.123, compared with 0.053 for LZ77. However, when the music was processed a voice at a time, the F1 score for LZ77 more than doubled to 0.124. We also discovered a significant correlation between compression factor and F1 score for all the algorithms, supporting the hypothesis that the best analyses are those represented by the shortest descriptions. |
---|---|
ISSN: | 0929-8215 1744-5027 |
DOI: | 10.1080/09298215.2015.1133656 |
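
The summary describes classifying melodies with a compression-based distance and the k-nearest-neighbour algorithm. The record does not define corpus compression distance, so the sketch below swaps in the well-known normalized compression distance (NCD) as a stand-in; it is not the authors' metric. It uses Python's `zlib`, whose DEFLATE format is LZ77-based. The function names, the toy melodies and the idea of encoding a melody as a string of pitch intervals (a single, hypothetical "viewpoint") are all assumptions made for illustration.

```python
import zlib
from collections import Counter


def compressed_size(data: bytes) -> int:
    """Length in bytes of the zlib (DEFLATE, LZ77-based) encoding of data."""
    return len(zlib.compress(data, 9))


def ncd(x: bytes, y: bytes) -> float:
    """Normalized compression distance: a stand-in for the paper's metric."""
    cx, cy = compressed_size(x), compressed_size(y)
    cxy = compressed_size(x + y)
    return (cxy - min(cx, cy)) / max(cx, cy)


def knn_classify(query: bytes, labelled, k: int = 1) -> str:
    """Return the majority label among the k training items nearest to query."""
    ranked = sorted(labelled, key=lambda item: ncd(query, item[0]))
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Toy data (assumption): each melody is a string of pitch intervals.
PATTERN_A = b"+2 +2 -4 +1 "
PATTERN_B = b"-3 -1 +5 -1 "
TRAINING = [
    (PATTERN_A * 8, "family A"),
    (PATTERN_A * 6 + b"+7 ", "family A"),
    (PATTERN_B * 8, "family B"),
    (PATTERN_B * 6 + b"-7 ", "family B"),
]
QUERY = PATTERN_A * 7 + b"+1 "

if __name__ == "__main__":
    for melody, label in TRAINING:
        print(label, round(ncd(QUERY, melody), 3))
    print("predicted:", knn_classify(QUERY, TRAINING, k=3))
```

A melody that shares repeated material with the query compresses well when the two are concatenated, so its distance is small. Combining several viewpoints, as the summary reports, would need one such distance per encoding and a rule for merging the votes, which this sketch does not attempt.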