{
  "metadata": {
    "forum_id": "ByxHJeBYDB",
    "review_id": "rye5LcLAYH",
    "rebuttal_id": "ByghS3dssB",
    "title": "Forecasting Deep Learning Dynamics with Applications to Hyperparameter Tuning",
    "reviewer": "AnonReviewer2",
    "rating": 1,
    "conference": "ICLR2020",
    "permalink": "https://openreview.net/forum?id=ByxHJeBYDB&noteId=ByghS3dssB",
    "annotator": "anno3"
  },
  "review_sentences": [
    {
      "review_id": "rye5LcLAYH",
      "sentence_index": 0,
      "text": "This paper proposed to train a network with training curves and corresponding parameters, and use policy search to find optimal parameter to replace hundreds or thousands of training in real case scenario, and it is clearly much faster using the trained network to infer parameters, instead of tuning the network manually.",
      "suffix": "\n\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rye5LcLAYH",
      "sentence_index": 1,
      "text": "The first point would be: what's the meaning of synthetically generating training curves other than proving that transformer achieves good performance in modeling discrete distribution? Most practical problems would not have the same distribution as the previously gathered public dataset, thus the data is not representative, and synthetic training curves just does not make sence.",
      "suffix": "\n\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_substance",
      "polarity": "pol_negative"
    },
    {
      "review_id": "rye5LcLAYH",
      "sentence_index": 2,
      "text": "The cited paper 'Learning an adaptive learning rate schedule' does not appear online.",
      "suffix": "",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_edit",
      "aspect": "arg_other",
      "polarity": "pol_negative"
    }
  ],
  "rebuttal_sentences": [
    {
      "review_id": "rye5LcLAYH",
      "rebuttal_id": "ByghS3dssB",
      "sentence_index": 0,
      "text": "We thank the reviewer for the effort, however we believe there is a mis-understanding.",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_social",
      "alignment": [
        "context_global",
        null
      ],
      "details": {}
    },
    {
      "review_id": "rye5LcLAYH",
      "rebuttal_id": "ByghS3dssB",
      "sentence_index": 1,
      "text": "As for the synthetic curves experiment, we updated the paper with a justification.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_done",
      "alignment": [
        "context_sentences",
        [
          1
        ]
      ],
      "details": {
        "request_out_of_scope": true
      }
    },
    {
      "review_id": "rye5LcLAYH",
      "rebuttal_id": "ByghS3dssB",
      "sentence_index": 2,
      "text": "This task, while simple, showcases the ability of Transformer to model a distribution over curves of similar shape to real training curves with varying speeds of convergence.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          1
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rye5LcLAYH",
      "rebuttal_id": "ByghS3dssB",
      "sentence_index": 3,
      "text": "It has been designed so it is easy to quantify the diversity of generated curves and the fit between the distribution generated by the model and the real one.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          1
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rye5LcLAYH",
      "rebuttal_id": "ByghS3dssB",
      "sentence_index": 4,
      "text": "Furthermore, we included two additional tasks, attesting to the ability of Transformer to model a wide range of distributions over training curves.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_done",
      "alignment": [
        "context_sentences",
        [
          1
        ]
      ],
      "details": {
        "request_out_of_scope": true
      }
    },
    {
      "review_id": "rye5LcLAYH",
      "rebuttal_id": "ByghS3dssB",
      "sentence_index": 5,
      "text": "We also updated the citation of the paper you mentioned with an arxiv URL.",
      "suffix": "\n\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_done",
      "alignment": [
        "context_sentences",
        [
          2
        ]
      ],
      "details": {
        "request_out_of_scope": true
      }
    },
    {
      "review_id": "rye5LcLAYH",
      "rebuttal_id": "ByghS3dssB",
      "sentence_index": 6,
      "text": "We still believe that while focusing on the synthetic task the reviewer might have missed the main point of the paper, namely that time-series forecasting with Transformer works really well, at least in the context of modeling deep learning dynamics.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_global",
        null
      ],
      "details": {}
    },
    {
      "review_id": "rye5LcLAYH",
      "rebuttal_id": "ByghS3dssB",
      "sentence_index": 7,
      "text": "The general problem has been studied in the community for many decades and we believe that we made significant progress, so we kindly encourage the reviewer to reconsider their assessment of our contributions.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_social",
      "alignment": [
        "context_global",
        null
      ],
      "details": {}
    }
  ]
}