A study of audio mixing methods for piano transcription in violin-piano ensembles
While piano music transcription models have shown high performance for solo piano recordings, their performance degrades when applied to ensemble recordings. This study aims to analyze the impact of different data augmentation methods on piano transcription performance, specifically focusing on mixi...
Saved in:
Main Authors | , , , , |
---|---|
Format | Journal Article |
Language | English |
Published |
23.05.2023
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | While piano music transcription models have shown high performance for solo
piano recordings, their performance degrades when applied to ensemble
recordings. This study aims to analyze the impact of different data
augmentation methods on piano transcription performance, specifically focusing
on mixing techniques applied to violin-piano ensembles. We apply mixing methods
that consider both harmonic and temporal characteristics of the audio. To
create datasets for this study, we generated the PFVN-synth dataset, which
contains 7 hours of violin-piano ensemble audio by rendering MIDI files and
corresponding labels, and also collected unaccompanied violin recordings and
mixed them with the MAESTRO dataset. We evaluated the transcription results on
both synthesized and real audio recordings datasets. |
---|---|
DOI: | 10.48550/arxiv.2305.13758 |