M4Singer: A Multi-Style, Multi-Singer and Musical Score Provided Mandarin Singing Corpus

Published: 17 Sept 2022, Last Modified: 23 May 2023
NeurIPS 2022 Datasets and Benchmarks
Readers: Everyone
Keywords: singing voice corpus, singing voice synthesis, singing voice conversion, automatic music transcription
Abstract: The lack of publicly available, high-quality, accurately labeled datasets has long been a major bottleneck for singing voice synthesis (SVS). To tackle this problem, we present M4Singer, a free-to-use Multi-style, Multi-singer Mandarin singing collection with elaborately annotated Musical scores, together with its benchmarks. Specifically, 1) we construct and release a large, high-quality Chinese singing voice corpus recorded by 20 professional singers, covering 700 Chinese pop songs and all four SATB voice types (i.e., soprano, alto, tenor, and bass); 2) we make extensive efforts to manually compose the musical scores for each recorded song, which are necessary for studying prosody modeling in SVS; and 3) to facilitate the use and demonstrate the quality of M4Singer, we conduct four benchmark experiments: score-based SVS, controllable singing voice (CSV), singing voice conversion (SVC), and automatic music transcription (AMT).
Author Statement: Yes
URL: https://m4singer.github.io
Supplementary Material: zip
Dataset Url: https://github.com/M4Singer/M4Singer
License: CC BY-NC-SA 4.0
Contribution Process Agreement: Yes
In Person Attendance: Yes