{
  "metadata": {
    "forum_id": "HJgOl3AqY7",
    "review_id": "SJxnP8_eam",
    "rebuttal_id": "rklb2xgmAX",
    "title": "Modulated Variational Auto-Encoders for Many-to-Many Musical Timbre Transfer",
    "reviewer": "AnonReviewer3",
    "rating": 5,
    "conference": "ICLR2019",
    "permalink": "https://openreview.net/forum?id=HJgOl3AqY7&noteId=rklb2xgmAX",
    "annotator": "anno0"
  },
  "review_sentences": [
    {
      "review_id": "SJxnP8_eam",
      "sentence_index": 0,
      "text": "The authors proposed a Modulated Variational auto-Encoders (MoVE) to perform musical timbre transfer.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "SJxnP8_eam",
      "sentence_index": 1,
      "text": "The authors define timbre transfer as applying parts of the auditory properties of a musical instrument onto another.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "SJxnP8_eam",
      "sentence_index": 2,
      "text": "It replaces the usual adversarial translation criterion by a Maximum Mean Discrepancy (MMD) objective.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "SJxnP8_eam",
      "sentence_index": 3,
      "text": "By further conditioning our system on several different instruments, the proposed method can generalize to many-to-many transfer within a single variational architecture able to perform multi-domain transfers.",
      "suffix": "\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "SJxnP8_eam",
      "sentence_index": 4,
      "text": "Some detailed comments are listed as follow,",
      "suffix": "\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_heading",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "SJxnP8_eam",
      "sentence_index": 5,
      "text": "1 The implementation steps of the proposed method (MoVE) are not clear.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_replicability",
      "polarity": "pol_negative"
    },
    {
      "review_id": "SJxnP8_eam",
      "sentence_index": 6,
      "text": "Some details are missing, which is hardly reproduced by the other researchers.",
      "suffix": "\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_replicability",
      "polarity": "pol_negative"
    },
    {
      "review_id": "SJxnP8_eam",
      "sentence_index": 7,
      "text": "2 The experimental settings are not reasonable.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_substance",
      "polarity": "pol_negative"
    },
    {
      "review_id": "SJxnP8_eam",
      "sentence_index": 8,
      "text": "The current experimental settings are not matched with the practice environment.",
      "suffix": "\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_substance",
      "polarity": "pol_negative"
    },
    {
      "review_id": "SJxnP8_eam",
      "sentence_index": 9,
      "text": "3 The proposed method can transfer the positive knowledge.",
      "suffix": "",
      "review_action": "arg_fact",
      "fine_review_action": "none",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "SJxnP8_eam",
      "sentence_index": 10,
      "text": "However, some negative knowledge information can be also transferred.",
      "suffix": "",
      "review_action": "arg_fact",
      "fine_review_action": "none",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "SJxnP8_eam",
      "sentence_index": 11,
      "text": "So how to avoid the negative transferring?",
      "suffix": "\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_soundness-correctness",
      "polarity": "none"
    },
    {
      "review_id": "SJxnP8_eam",
      "sentence_index": 12,
      "text": "4 For the model, the optimization details or inferring details are missing, which are important for the proposed model.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_replicability",
      "polarity": "none"
    }
  ],
  "rebuttal_sentences": [
    {
      "review_id": "SJxnP8_eam",
      "rebuttal_id": "rklb2xgmAX",
      "sentence_index": 0,
      "text": "Thank you for your review, below we answer the points that were questioned.",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_social",
      "alignment": [
        "context_global",
        null
      ],
      "details": {}
    },
    {
      "review_id": "SJxnP8_eam",
      "rebuttal_id": "rklb2xgmAX",
      "sentence_index": 1,
      "text": "* Missing implementation steps and optimization details:",
      "suffix": "\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          5,
          6,
          12
        ]
      ],
      "details": {}
    },
    {
      "review_id": "SJxnP8_eam",
      "rebuttal_id": "rklb2xgmAX",
      "sentence_index": 2,
      "text": "In addition to implementation details, the appendix has a rather detailed table of the architecture parameters.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_done",
      "alignment": [
        "context_sentences",
        [
          5,
          6,
          12
        ]
      ],
      "details": {
        "request_out_of_scope": true
      }
    },
    {
      "review_id": "SJxnP8_eam",
      "rebuttal_id": "rklb2xgmAX",
      "sentence_index": 3,
      "text": "Moreover, we will ultimately release codes on Github.",
      "suffix": "\n\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_by-cr",
      "alignment": [
        "context_sentences",
        [
          5,
          6,
          12
        ]
      ],
      "details": {
        "manuscript_change": false
      }
    },
    {
      "review_id": "SJxnP8_eam",
      "rebuttal_id": "rklb2xgmAX",
      "sentence_index": 4,
      "text": "* Non-matched experiment to practice environment:",
      "suffix": "\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          7,
          8
        ]
      ],
      "details": {}
    },
    {
      "review_id": "SJxnP8_eam",
      "rebuttal_id": "rklb2xgmAX",
      "sentence_index": 5,
      "text": "The evaluation of generative models and unsupervised domain translations remains an open question, even less covered in the field of sound.",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_mitigate-criticism",
      "alignment": [
        "context_sentences",
        [
          7,
          8
        ]
      ],
      "details": {}
    },
    {
      "review_id": "SJxnP8_eam",
      "rebuttal_id": "rklb2xgmAX",
      "sentence_index": 6,
      "text": "We didn't apply our models yet to datasets previously covered in the related works, such as Nsynth, which is planned and would give some more direct comparisons.",
      "suffix": "\n\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_future",
      "alignment": [
        "context_sentences",
        [
          7,
          8
        ]
      ],
      "details": {}
    },
    {
      "review_id": "SJxnP8_eam",
      "rebuttal_id": "rklb2xgmAX",
      "sentence_index": 7,
      "text": "* How to avoid the negative knowledge transfer:",
      "suffix": "\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          9,
          10,
          11
        ]
      ],
      "details": {}
    },
    {
      "review_id": "SJxnP8_eam",
      "rebuttal_id": "rklb2xgmAX",
      "sentence_index": 8,
      "text": "As we defined our purpose, the resulting generation is a blending of both domains that renders a target timbre while retaining some of the input features.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          9,
          10,
          11
        ]
      ],
      "details": {}
    },
    {
      "review_id": "SJxnP8_eam",
      "rebuttal_id": "rklb2xgmAX",
      "sentence_index": 9,
      "text": "It amounts to note class (that is explicitly controlled for the note-conditional model states) together with timbre.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          9,
          10,
          11
        ]
      ],
      "details": {}
    },
    {
      "review_id": "SJxnP8_eam",
      "rebuttal_id": "rklb2xgmAX",
      "sentence_index": 10,
      "text": "We plan on experiments on controlling the amount of timbre transfer in between the input and target domains.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_future",
      "alignment": [
        "context_sentences",
        [
          9,
          10,
          11
        ]
      ],
      "details": {}
    }
  ]
}