{
  "metadata": {
    "forum_id": "HJfQrs0qt7",
    "review_id": "HJl5RsCF3m",
    "rebuttal_id": "ByesVG2V0Q",
    "title": "Convergence Properties of Deep Neural Networks on Separable Data",
    "reviewer": "AnonReviewer3",
    "rating": 5,
    "conference": "ICLR2019",
    "permalink": "https://openreview.net/forum?id=HJfQrs0qt7&noteId=ByesVG2V0Q",
    "annotator": "anno10"
  },
  "review_sentences": [
    {
      "review_id": "HJl5RsCF3m",
      "sentence_index": 0,
      "text": "The underlying motivation for the paper is really interesting and cuts straight to the heart of Deep Learning and strives to unravel the key understanding that we are still to a large extent missing.",
      "suffix": "\n\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_motivation-impact",
      "polarity": "pol_positive"
    },
    {
      "review_id": "HJl5RsCF3m",
      "sentence_index": 1,
      "text": "When it comes to clarity and organization I find the paper a bit \"messy\" in that it is a collection of quite a few findings on the very specific topic of binary classification with quite strong assumptions.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_clarity",
      "polarity": "pol_negative"
    },
    {
      "review_id": "HJl5RsCF3m",
      "sentence_index": 2,
      "text": "Especially given the very specific nature of the topic I miss a strong and clear path through the paper.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_clarity",
      "polarity": "pol_negative"
    },
    {
      "review_id": "HJl5RsCF3m",
      "sentence_index": 3,
      "text": "Unfortunately the paper leaves me with the distinct feeling that there are still a lot of work needed to be able to tell the story about the problem under study.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "arg_other",
      "polarity": "pol_negative"
    },
    {
      "review_id": "HJl5RsCF3m",
      "sentence_index": 4,
      "text": "Having said that the paper does contain several individual findings.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_originality",
      "polarity": "pol_positive"
    },
    {
      "review_id": "HJl5RsCF3m",
      "sentence_index": 5,
      "text": "Having said that I find the ideas leading up to what the authors refers to as \"gradient starvation\" to be really interesting and that would be a great clear idea to focus on.",
      "suffix": "\n\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_motivation-impact",
      "polarity": "pol_positive"
    },
    {
      "review_id": "HJl5RsCF3m",
      "sentence_index": 6,
      "text": "A few concrete questions/comments:",
      "suffix": "\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_heading",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "HJl5RsCF3m",
      "sentence_index": 7,
      "text": "Can you explain somewhere exactly what you mean when you say \"learning dynamics of deep learning\"? Given the specific nature of the results presented in the paper it would be nice to be precise also when it comes to the overall topic under study.",
      "suffix": "\n\n",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_clarification",
      "aspect": "asp_replicability",
      "polarity": "pol_negative"
    },
    {
      "review_id": "HJl5RsCF3m",
      "sentence_index": 8,
      "text": "Given the very specific nature of the topic treated in the paper I find the title of the paper largely misleading.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_soundness-correctness",
      "polarity": "pol_negative"
    },
    {
      "review_id": "HJl5RsCF3m",
      "sentence_index": 9,
      "text": "The title claims way more than what is actually delivered in the paper, despite the fact that the authors have put in an \"On\" in the beginning of the title.",
      "suffix": "\n\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_soundness-correctness",
      "polarity": "pol_negative"
    },
    {
      "review_id": "HJl5RsCF3m",
      "sentence_index": 10,
      "text": "In Corollary 3.3. you characterize the convergence speed in a nice way, but I am missing the link to the behaviors observed empirically in e.g. Fig. 2. What am I missing?",
      "suffix": "\n\n",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_explanation",
      "aspect": "asp_replicability",
      "polarity": "pol_negative"
    },
    {
      "review_id": "HJl5RsCF3m",
      "sentence_index": 11,
      "text": "The final sentence in Section 2 is highly speculative and I find this hard to believe without solid backing.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_soundness-correctness",
      "polarity": "pol_negative"
    },
    {
      "review_id": "HJl5RsCF3m",
      "sentence_index": 12,
      "text": "The sentence reads \"... and helps develop intuitions about behaviors observed in more general settings.\" Given the restrictive nature of your set-up I find it very hard to believe that this extends to more general settings.",
      "suffix": "\n\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_soundness-correctness",
      "polarity": "pol_negative"
    },
    {
      "review_id": "HJl5RsCF3m",
      "sentence_index": 13,
      "text": "Tiny detail: The axes of several of the plots given in the paper mis the lables which makes it hard to read. Straightforward to fix, but worth mentioning nevertheless.",
      "suffix": "",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_edit",
      "aspect": "asp_clarity",
      "polarity": "pol_negative"
    }
  ],
  "rebuttal_sentences": [
    {
      "review_id": "HJl5RsCF3m",
      "rebuttal_id": "ByesVG2V0Q",
      "sentence_index": 0,
      "text": "Thank you for your review.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_social",
      "alignment": [
        "context_global",
        null
      ],
      "details": {}
    },
    {
      "review_id": "HJl5RsCF3m",
      "rebuttal_id": "ByesVG2V0Q",
      "sentence_index": 1,
      "text": "Below we attempt to answer your concerns.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_in-rebuttal",
        null
      ],
      "details": {}
    },
    {
      "review_id": "HJl5RsCF3m",
      "rebuttal_id": "ByesVG2V0Q",
      "sentence_index": 2,
      "text": "We also want to point out that we have added some insights/results relaxing one of our main assumptions in Section 3.4 of the latest version of the paper.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_done",
      "alignment": [
        "context_none",
        null
      ],
      "details": {
        "request_out_of_scope": true
      }
    },
    {
      "review_id": "HJl5RsCF3m",
      "rebuttal_id": "ByesVG2V0Q",
      "sentence_index": 3,
      "text": "For more details, please see the comment above entitled: \u201cRelaxing Assumption (H2)\u201d.",
      "suffix": "\n\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_done",
      "alignment": [
        "context_none",
        null
      ],
      "details": {
        "request_out_of_scope": true
      }
    },
    {
      "review_id": "HJl5RsCF3m",
      "rebuttal_id": "ByesVG2V0Q",
      "sentence_index": 4,
      "text": "The path we attempted to draw through the paper aims at the evolution of a nonlinear neural network\u2019s classification performance throughout its training and at the factors that influence it: from the norm of the input to the type of loss used for learning or the frequency of features present in the training data.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_summary",
      "alignment": [
        "context_global",
        null
      ],
      "details": {}
    },
    {
      "review_id": "HJl5RsCF3m",
      "rebuttal_id": "ByesVG2V0Q",
      "sentence_index": 5,
      "text": "Our framework is able to establish properties on the behavior/convergence of certain classifiers during their training on separable data.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_summary",
      "alignment": [
        "context_global",
        null
      ],
      "details": {}
    },
    {
      "review_id": "HJl5RsCF3m",
      "rebuttal_id": "ByesVG2V0Q",
      "sentence_index": 6,
      "text": "Those insights match some observations made by machine learning practitioners, in particular about the sigmoidal shape of learning metrics or the efficiency of the hinge loss on certain tasks.",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_summary",
      "alignment": [
        "context_global",
        null
      ],
      "details": {}
    },
    {
      "review_id": "HJl5RsCF3m",
      "rebuttal_id": "ByesVG2V0Q",
      "sentence_index": 7,
      "text": "We have added an explanation of what we mean by \u201clearning dynamics of deep learning\u201d in the last paragraph of the first page.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          7
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HJl5RsCF3m",
      "rebuttal_id": "ByesVG2V0Q",
      "sentence_index": 8,
      "text": "It usually refers to the evolution of weights and outputs of neural networks throughout training.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          7
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HJl5RsCF3m",
      "rebuttal_id": "ByesVG2V0Q",
      "sentence_index": 9,
      "text": "For instance, the work by Saxe et al in 2013 is entitled \u201cExact solutions to the nonlinear dynamics of learning in deep linear neural networks\u201d.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          7
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HJl5RsCF3m",
      "rebuttal_id": "ByesVG2V0Q",
      "sentence_index": 10,
      "text": "We based our title on that paper since it extends some of its results to nonlinear neural networks.",
      "suffix": "\n\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          7
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HJl5RsCF3m",
      "rebuttal_id": "ByesVG2V0Q",
      "sentence_index": 11,
      "text": "We understand your concern and have made the title more specific.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_done",
      "alignment": [
        "context_sentences",
        [
          8,
          9
        ]
      ],
      "details": {
        "request_out_of_scope": true
      }
    },
    {
      "review_id": "HJl5RsCF3m",
      "rebuttal_id": "ByesVG2V0Q",
      "sentence_index": 12,
      "text": "Tentatively, we chose: \u201cConvergence Properties of Deep Neural Networks on Separable Data\u201d.",
      "suffix": "\n\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_done",
      "alignment": [
        "context_sentences",
        [
          8,
          9
        ]
      ],
      "details": {
        "request_out_of_scope": true
      }
    },
    {
      "review_id": "HJl5RsCF3m",
      "rebuttal_id": "ByesVG2V0Q",
      "sentence_index": 13,
      "text": "Let us assume for simplicity that in Corollary 3.3, p = 0.5 (ie that the classes are balanced) and that ||x_1|| = 1, ||x_2|| = 0.5.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          10
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HJl5RsCF3m",
      "rebuttal_id": "ByesVG2V0Q",
      "sentence_index": 14,
      "text": "Then the confidence of the network on those classes corresponds to the red and dashed purple curves of Fig. 2.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          10
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HJl5RsCF3m",
      "rebuttal_id": "ByesVG2V0Q",
      "sentence_index": 15,
      "text": "Right.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          10
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HJl5RsCF3m",
      "rebuttal_id": "ByesVG2V0Q",
      "sentence_index": 16,
      "text": "In particular, we see that reaching any level of confidence takes approximately twice as much time on class 2 (red curve) than on class 1 (dashed purple curve).",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          10
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HJl5RsCF3m",
      "rebuttal_id": "ByesVG2V0Q",
      "sentence_index": 17,
      "text": "That is effectively what the corollary is expressing.",
      "suffix": "\n\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          10
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HJl5RsCF3m",
      "rebuttal_id": "ByesVG2V0Q",
      "sentence_index": 18,
      "text": "We have edited the corresponding sentence to make it less assertive.",
      "suffix": "\n\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_done",
      "alignment": [
        "context_sentences",
        [
          11,
          12
        ]
      ],
      "details": {
        "request_out_of_scope": true
      }
    },
    {
      "review_id": "HJl5RsCF3m",
      "rebuttal_id": "ByesVG2V0Q",
      "sentence_index": 19,
      "text": "We have added the missing labels in the latest version of the paper. Thank you for pointing it out.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_done",
      "alignment": [
        "context_sentences",
        [
          13
        ]
      ],
      "details": {
        "request_out_of_scope": true
      }
    }
  ]
}