Automatic Music Transcription using Audio-Visual Fusion for Violin Practice in Home Environment

dc.contributor.authorZHANG, Bingjunen_US
dc.contributor.authorWANG, Yeen_US
dc.date.accessioned2009-07-03T09:02:40Zen_US
dc.date.accessioned2017-01-23T07:00:11Z
dc.date.available2009-07-03T09:02:40Zen_US
dc.date.available2017-01-23T07:00:11Z
dc.date.issued2009-07-03T09:02:40Zen_US
dc.description.abstractViolin practice in a home environment, where there is often no teacher available, can benefit from automatic music transcription to provide feedback to the student. This paper describes a high performance violin transcription system with three main contributions. First, as onset detection is an important but challenging task for automatic transcription of pitched non-percussive music, such as from the violin, we propose an effective audio-only onset detection approach based on supervised learning. The proposed approach outperforms the state-of-the-art methods substantially. Second, we introduce the visual modality, i.e., bowing and fingering of the violin playing, to infer onsets, thus enhancing the audio-only onset detection. We devise automatic and real-time video processing algorithms to extract indicative features of onsets from bowing and fingering videos. Third, we evaluate state-of-the-art multimodal fusion techniques to fuse audio and visual modalities and show this improves onset detection and transcription performance significantly. The audio-visual fusion based violin transcription system provides more accurate transcribed results as learning feedback even in acoustically inferior environments. With efficient and fully automatic audio-visual analysis components, the system can be easily deployed in a home environment.en_US
dc.format.extent929564 bytesen_US
dc.format.mimetypeapplication/pdfen_US
dc.identifier.urihttps://dl.comp.nus.edu.sg/xmlui/handle/1900.100/3056en_US
dc.language.isoenen_US
dc.relation.ispartofseriesTRA7/09en_US
dc.titleAutomatic Music Transcription using Audio-Visual Fusion for Violin Practice in Home Environmenten_US
dc.typeTechnical Reporten_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
TRA7-09.pdf
Size:
907.78 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.53 KB
Format:
Plain Text
Description: