{
  "metadata": {
    "forum_id": "rkxQ-nA9FX",
    "review_id": "rylog8i62m",
    "rebuttal_id": "HkgOWxN0T7",
    "title": "Theoretical Analysis of Auto Rate-Tuning by Batch Normalization",
    "reviewer": "AnonReviewer2",
    "rating": 7,
    "conference": "ICLR2019",
    "permalink": "https://openreview.net/forum?id=rkxQ-nA9FX&noteId=HkgOWxN0T7",
    "annotator": "anno9"
  },
  "review_sentences": [
    {
      "review_id": "rylog8i62m",
      "sentence_index": 0,
      "text": "The paper is well written and easy to follow. The topic is apt.",
      "suffix": "\n\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_clarity",
      "polarity": "pol_positive"
    },
    {
      "review_id": "rylog8i62m",
      "sentence_index": 1,
      "text": "I don\u2019t have any comments except the following ones.",
      "suffix": "\n\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_heading",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rylog8i62m",
      "sentence_index": 2,
      "text": "Lemma 2.4, Point 1: The proof is confusing.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_substance",
      "polarity": "pol_negative"
    },
    {
      "review_id": "rylog8i62m",
      "sentence_index": 3,
      "text": "Consider the one variable vector case.",
      "suffix": "",
      "review_action": "arg_fact",
      "fine_review_action": "none",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rylog8i62m",
      "sentence_index": 4,
      "text": "Assuming that there is only one variable w, then \\nabla L(w) is not perpendicular to w in general.",
      "suffix": "",
      "review_action": "arg_fact",
      "fine_review_action": "none",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rylog8i62m",
      "sentence_index": 5,
      "text": "The Rayleigh quotient example L(w)  = w\u2019*A*w/ (w\u2019*w) for a symmetric matrix A, then \\nabla L(w) = (2/w\u2019*w)(Aw - L(w)*w), which is not perpendicular to w.",
      "suffix": "\n",
      "review_action": "arg_fact",
      "fine_review_action": "none",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rylog8i62m",
      "sentence_index": 6,
      "text": "Even if we constrain ||w ||_2 = 1",
      "suffix": "",
      "review_action": "arg_fact",
      "fine_review_action": "none",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rylog8i62m",
      "sentence_index": 7,
      "text": ", then also  \\nabla L(w)  is not perpendicular to w.",
      "suffix": "\n",
      "review_action": "none",
      "fine_review_action": "none",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rylog8i62m",
      "sentence_index": 8,
      "text": "Am I missing something?",
      "suffix": "\n\n",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_clarification",
      "aspect": "asp_soundness-correctness",
      "polarity": "none"
    },
    {
      "review_id": "rylog8i62m",
      "sentence_index": 9,
      "text": "What is G_t in Theorem 2.5. It should be defined in the theorem itself.",
      "suffix": "",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_edit",
      "aspect": "asp_substance",
      "polarity": "pol_negative"
    },
    {
      "review_id": "rylog8i62m",
      "sentence_index": 10,
      "text": "There is another symbol G_g which is a constant.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_quote",
      "aspect": "none",
      "polarity": "none"
    }
  ],
  "rebuttal_sentences": [
    {
      "review_id": "rylog8i62m",
      "rebuttal_id": "HkgOWxN0T7",
      "sentence_index": 0,
      "text": "Thanks for your positive feedback.",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_social",
      "alignment": [
        "context_global",
        null
      ],
      "details": {}
    },
    {
      "review_id": "rylog8i62m",
      "rebuttal_id": "HkgOWxN0T7",
      "sentence_index": 1,
      "text": "(1).",
      "suffix": "",
      "rebuttal_stance": "other",
      "rebuttal_action": "rebuttal_none",
      "alignment": [
        "context_error",
        null
      ],
      "details": {}
    },
    {
      "review_id": "rylog8i62m",
      "rebuttal_id": "HkgOWxN0T7",
      "sentence_index": 2,
      "text": "Lemma 2.4, Point 1: The gradient in your example is indeed perpendicular to w which can be seen as follows.",
      "suffix": "\n\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          2,
          3,
          4,
          5,
          6,
          7,
          8
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rylog8i62m",
      "rebuttal_id": "HkgOWxN0T7",
      "sentence_index": 3,
      "text": "w\u2019 * \\nabla L(w) = w\u2019 * (2/w\u2019*w)(Aw - L(w)*w) =  (2/w\u2019*w)(w\u2019Aw - L(w)*(w\u2019*w)) =  (2/w\u2019*w)(w\u2019Aw - w\u2019Aw) = 0.",
      "suffix": "\n\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          2,
          3,
          4,
          5,
          6,
          7,
          8
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rylog8i62m",
      "rebuttal_id": "HkgOWxN0T7",
      "sentence_index": 4,
      "text": "In case of one variable vector, our proof is to take the derivative of c on both sides of F(w) = F(cw), which is the definition of scale-invariance.",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_refute-question",
      "alignment": [
        "context_sentences",
        [
          2,
          3,
          4,
          5,
          6,
          7,
          8
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rylog8i62m",
      "rebuttal_id": "HkgOWxN0T7",
      "sentence_index": 5,
      "text": "Then the left-hand side becomes 0 and the right-hand side becomes w\u2019 * \\nabla F(cw)",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_refute-question",
      "alignment": [
        "context_sentences",
        [
          2,
          3,
          4,
          5,
          6,
          7,
          8
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rylog8i62m",
      "rebuttal_id": "HkgOWxN0T7",
      "sentence_index": 6,
      "text": "by chain rule.",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_refute-question",
      "alignment": [
        "context_sentences",
        [
          2,
          3,
          4,
          5,
          6,
          7,
          8
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rylog8i62m",
      "rebuttal_id": "HkgOWxN0T7",
      "sentence_index": 7,
      "text": "Taking c = 1, we can conclude",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_refute-question",
      "alignment": [
        "context_sentences",
        [
          2,
          3,
          4,
          5,
          6,
          7,
          8
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rylog8i62m",
      "rebuttal_id": "HkgOWxN0T7",
      "sentence_index": 8,
      "text": "that w\u2019 * \\nabla F(w)  = 0.",
      "suffix": "\n\n",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_refute-question",
      "alignment": [
        "context_sentences",
        [
          2,
          3,
          4,
          5,
          6,
          7,
          8
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rylog8i62m",
      "rebuttal_id": "HkgOWxN0T7",
      "sentence_index": 9,
      "text": "(2)",
      "suffix": "",
      "rebuttal_stance": "other",
      "rebuttal_action": "rebuttal_none",
      "alignment": [
        "context_error",
        null
      ],
      "details": {}
    },
    {
      "review_id": "rylog8i62m",
      "rebuttal_id": "HkgOWxN0T7",
      "sentence_index": 10,
      "text": ".",
      "suffix": "",
      "rebuttal_stance": "other",
      "rebuttal_action": "rebuttal_none",
      "alignment": [
        "context_error",
        null
      ],
      "details": {}
    },
    {
      "review_id": "rylog8i62m",
      "rebuttal_id": "HkgOWxN0T7",
      "sentence_index": 11,
      "text": "Theorem 2.5: Sorry G_t should be G_t^{(i)}. We will correct this typo in the next revision of this paper.",
      "suffix": "\n\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_done",
      "alignment": [
        "context_sentences",
        [
          9,
          10
        ]
      ],
      "details": {
        "request_out_of_scope": true
      }
    },
    {
      "review_id": "rylog8i62m",
      "rebuttal_id": "HkgOWxN0T7",
      "sentence_index": 12,
      "text": "For t = 0, G_t^{(i)} are all initialized to some value.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_done",
      "alignment": [
        "context_sentences",
        [
          9,
          10
        ]
      ],
      "details": {
        "request_out_of_scope": true
      }
    },
    {
      "review_id": "rylog8i62m",
      "rebuttal_id": "HkgOWxN0T7",
      "sentence_index": 13,
      "text": "The recursion formula for G_t^{(i)} is shown in equation (9).",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_done",
      "alignment": [
        "context_sentences",
        [
          9,
          10
        ]
      ],
      "details": {
        "request_out_of_scope": true
      }
    }
  ]
}