{
  "metadata": {
    "forum_id": "HJeQbnA5tm",
    "review_id": "BkgYfE5o3Q",
    "rebuttal_id": "rkx7sBROTX",
    "title": "Noisy Information Bottlenecks for Generalization",
    "reviewer": "AnonReviewer2",
    "rating": 3,
    "conference": "ICLR2019",
    "permalink": "https://openreview.net/forum?id=HJeQbnA5tm&noteId=rkx7sBROTX",
    "annotator": "anno10"
  },
  "review_sentences": [
    {
      "review_id": "BkgYfE5o3Q",
      "sentence_index": 0,
      "text": "This paper proposes a justification to one observation on VAE: \"restricting the family of variational approximations can, in fact, have a positive regularizing effect, leading to better generalization\".",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "BkgYfE5o3Q",
      "sentence_index": 1,
      "text": "The explanation given in this work is based on Gaussian mean-field approximation.",
      "suffix": "\n\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "BkgYfE5o3Q",
      "sentence_index": 2,
      "text": "I had trouble to understand some parts of this paper, since some of the sentences do not make sense to me. For example",
      "suffix": "\n\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_clarity",
      "polarity": "pol_negative"
    },
    {
      "review_id": "BkgYfE5o3Q",
      "sentence_index": 3,
      "text": "- the sentence under eq. (2)",
      "suffix": "\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_clarity",
      "polarity": "pol_negative"
    },
    {
      "review_id": "BkgYfE5o3Q",
      "sentence_index": 4,
      "text": "- the sentence \"Bacause the identity of the datapoint can never be learned by ...\" What is the identity of a dat point?",
      "suffix": "\n\n",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_clarification",
      "aspect": "asp_clarity",
      "polarity": "pol_negative"
    },
    {
      "review_id": "BkgYfE5o3Q",
      "sentence_index": 5,
      "text": "It looks like section 2.1 wants to show the connections between eq. (2) and other popularly used inference methods.",
      "suffix": "",
      "review_action": "arg_fact",
      "fine_review_action": "none",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "BkgYfE5o3Q",
      "sentence_index": 6,
      "text": "Somehow, those connections are not clear to me.",
      "suffix": "\n\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_clarity",
      "polarity": "pol_negative"
    },
    {
      "review_id": "BkgYfE5o3Q",
      "sentence_index": 7,
      "text": "Besides some issues in the technical details, the major problem of this paper is that it uses the data processing inequality (DPI) in a **wrong** way.",
      "suffix": "\n\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_soundness-correctness",
      "polarity": "pol_negative"
    },
    {
      "review_id": "BkgYfE5o3Q",
      "sentence_index": 8,
      "text": "As in (Cover and Thomas, 2012), which is also cited in this paper, DPI is defined on a Markov chain X -> Y -> Z and we have I(X,Y) >= I(X,Z).",
      "suffix": "\n\n",
      "review_action": "arg_fact",
      "fine_review_action": "none",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "BkgYfE5o3Q",
      "sentence_index": 9,
      "text": "However, based on the definition of \\theta and \\tilde{\\theta} given in the first sentence of section 2.3, the relation between \\theta, \\tilde{\\theta} and D",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_soundness-correctness",
      "polarity": "pol_negative"
    },
    {
      "review_id": "BkgYfE5o3Q",
      "sentence_index": 10,
      "text": "should be: D <- \\theta -> \\tilde{\\theta} (if it is a generative model) or D -> \\theta -> \\tilde{\\theta} (if a discriminative model).",
      "suffix": "",
      "review_action": "none",
      "fine_review_action": "none",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "BkgYfE5o3Q",
      "sentence_index": 11,
      "text": "Either case, I don't think we can have the inequality in eq. (5).",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_soundness-correctness",
      "polarity": "pol_negative"
    }
  ],
  "rebuttal_sentences": [
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 0,
      "text": "Thank you very much for the constructive review.",
      "suffix": "\n\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_social",
      "alignment": [
        "context_global",
        null
      ],
      "details": {}
    },
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 1,
      "text": "Summary of our response",
      "suffix": "\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_in-rebuttal",
        null
      ],
      "details": {}
    },
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 2,
      "text": "-------------------------------------",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_in-rebuttal",
        null
      ],
      "details": {}
    },
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 3,
      "text": "We are certain that the data processing inequality is used correctly.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_summary",
      "alignment": [
        "context_global",
        null
      ],
      "details": {}
    },
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 4,
      "text": "As you stated, the DPI implies for any Markov chain X -> Y -> Z that I(X,Y) >= I(X,Z).",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_summary",
      "alignment": [
        "context_global",
        null
      ],
      "details": {}
    },
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 5,
      "text": "Unlike suggested in the review, our model is defined in the form \\theta -> \\tilde{\\theta} -> D, as shown in Figure 1a.",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_summary",
      "alignment": [
        "context_global",
        null
      ],
      "details": {}
    },
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 6,
      "text": "Following your feedback, we updated section 2.1 and 2.3 for more clarity.",
      "suffix": "\n\n\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_done",
      "alignment": [
        "context_global",
        null
      ],
      "details": {
        "request_out_of_scope": true
      }
    },
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 7,
      "text": "Detailed response",
      "suffix": "\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_in-rebuttal",
        null
      ],
      "details": {}
    },
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 8,
      "text": "-------------------------------------",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_in-rebuttal",
        null
      ],
      "details": {}
    },
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 9,
      "text": "We interleave parts of the review with our detailed response for ease of reading.",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_in-rebuttal",
        null
      ],
      "details": {}
    },
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 10,
      "text": "> [...]",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_in-rebuttal",
        null
      ],
      "details": {}
    },
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 11,
      "text": "the major problem of this paper is that it uses the data processing inequality (DPI) in a **wrong** way.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          7
        ]
      ],
      "details": {}
    },
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 12,
      "text": "As in (Cover and Thomas, 2012), which is also cited in this paper, DPI is defined on a Markov chain X -> Y -> Z and we have I(X,Y) >= I(X,Z).",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          8
        ]
      ],
      "details": {}
    },
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 13,
      "text": "However, based on the definition of \\theta and \\tilde{\\theta} given in the first sentence of section 2.3, the relation between \\theta, \\tilde{\\theta} and D should be: D <- \\theta -> \\tilde{\\theta} (if it is a generative model) or D -> \\theta -> \\tilde{\\theta} (if a discriminative model).",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          9,
          10
        ]
      ],
      "details": {}
    },
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 14,
      "text": "Response: We are interested in limiting the mutual information I(\\theta, D) between our learned parameters \\theta and the dataset D.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_summary",
      "alignment": [
        "context_sentences",
        [
          8,
          9,
          10
        ]
      ],
      "details": {}
    },
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 15,
      "text": "However, this is hard to calculate for typical deep models.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_summary",
      "alignment": [
        "context_sentences",
        [
          8,
          9,
          10
        ]
      ],
      "details": {}
    },
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 16,
      "text": "Therefore we introduce a model that forms a Markov chain \\theta -> \\tilde{\\theta} -> D, as shown in Figure 1a.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_summary",
      "alignment": [
        "context_sentences",
        [
          8,
          9,
          10
        ]
      ],
      "details": {}
    },
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 17,
      "text": "Hereby, \\tilde{\\theta} is a noisy version of the parameters \\theta.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_summary",
      "alignment": [
        "context_sentences",
        [
          8,
          9,
          10
        ]
      ],
      "details": {}
    },
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 18,
      "text": "Crucially, the data D is defined to be dependent only on the noise-corrupted version \\tilde{\\theta}. By choosing a convenient noise process and prior for \\theta we can easily control I(\\tilde{\\theta}, \\theta).",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_summary",
      "alignment": [
        "context_sentences",
        [
          8,
          9,
          10
        ]
      ],
      "details": {}
    },
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 19,
      "text": "This gives us an upper bound on the mutual information I(D, \\theta) between data and parameters, according to the DPI.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_summary",
      "alignment": [
        "context_sentences",
        [
          8,
          9,
          10
        ]
      ],
      "details": {}
    },
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 20,
      "text": "We updated section 2.3 to reflect this more clearly.",
      "suffix": "\n\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_done",
      "alignment": [
        "context_sentences",
        [
          8,
          9,
          10
        ]
      ],
      "details": {
        "request_out_of_scope": true
      }
    },
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 21,
      "text": "> I had trouble to understand some parts of this paper, since some of the sentences do not make sense to me.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          2
        ]
      ],
      "details": {}
    },
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 22,
      "text": "For example",
      "suffix": "\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          2
        ]
      ],
      "details": {}
    },
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 23,
      "text": "- the sentence under eq. (2)",
      "suffix": "\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          3
        ]
      ],
      "details": {}
    },
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 24,
      "text": "- the sentence \"Because the identity of the datapoint can never be learned by ...\" What is the identity of a data point?",
      "suffix": "\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          4
        ]
      ],
      "details": {}
    },
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 25,
      "text": "It looks like section 2.1 wants to show the connections between eq. (2) and other popularly used inference methods.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          5
        ]
      ],
      "details": {}
    },
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 26,
      "text": "Somehow, those connections are not clear to me.",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          6
        ]
      ],
      "details": {}
    },
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 27,
      "text": "Response: The aim of section 2.1 is to motivate limiting mutual information for the purpose of generalization.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_summary",
      "alignment": [
        "context_sentences",
        [
          2,
          3,
          4,
          5,
          6
        ]
      ],
      "details": {}
    },
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 28,
      "text": "We link generalization problems reported in the literature to the introduced information measure.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_summary",
      "alignment": [
        "context_sentences",
        [
          2,
          3,
          4,
          5,
          6
        ]
      ],
      "details": {}
    },
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 29,
      "text": "The information necessary to identify or distinguish between training samples is quantified by the empirical entropy, and we called it the identity of the samples.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_summary",
      "alignment": [
        "context_sentences",
        [
          2,
          3,
          4,
          5,
          6
        ]
      ],
      "details": {}
    },
    {
      "review_id": "BkgYfE5o3Q",
      "rebuttal_id": "rkx7sBROTX",
      "sentence_index": 30,
      "text": "We updated the section to address all of your feedback.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_done",
      "alignment": [
        "context_sentences",
        [
          2,
          3,
          4,
          5,
          6
        ]
      ],
      "details": {
        "request_out_of_scope": true
      }
    }
  ]
}