{
  "metadata": {
    "forum_id": "Byx93sC9tm",
    "review_id": "BJgzh7Schm",
    "rebuttal_id": "B1xXG3110Q",
    "title": "Deep Ensemble Bayesian Active Learning : Adressing the Mode Collapse issue in Monte Carlo dropout via Ensembles",
    "reviewer": "AnonReviewer2",
    "rating": 4,
    "conference": "ICLR2019",
    "permalink": "https://openreview.net/forum?id=Byx93sC9tm&noteId=B1xXG3110Q",
    "annotator": "anno2"
  },
  "review_sentences": [
    {
      "review_id": "BJgzh7Schm",
      "sentence_index": 0,
      "text": "The authors propose to use the combination of model ensemble and MC dropout in Bayesian deep active learning.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "BJgzh7Schm",
      "sentence_index": 1,
      "text": "They empirically show that there exists the mode collapse problem due to the MC dropout which can be regarded as a variational approximation.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "BJgzh7Schm",
      "sentence_index": 2,
      "text": "The authors introduce an ensemble of MC-Dropout models with different initialization to remedy this mode collapse problem.",
      "suffix": "\n\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "BJgzh7Schm",
      "sentence_index": 3,
      "text": "The paper is clearly written and easy to follow.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_clarity",
      "polarity": "pol_positive"
    },
    {
      "review_id": "BJgzh7Schm",
      "sentence_index": 4,
      "text": "It is interesting to empirically show that the mode collapse problem of MC-Dropout is important in active learning.",
      "suffix": "\n\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_motivation-impact",
      "polarity": "pol_positive"
    },
    {
      "review_id": "BJgzh7Schm",
      "sentence_index": 5,
      "text": "The major concern I have is that the ensemble of MC-Dropout models is not an approximation of the posterior anymore.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_soundness-correctness",
      "polarity": "pol_negative"
    },
    {
      "review_id": "BJgzh7Schm",
      "sentence_index": 6,
      "text": "Each MC-Dropout model is an approximation of the posterior, but the ensemble of them may not.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_soundness-correctness",
      "polarity": "pol_negative"
    },
    {
      "review_id": "BJgzh7Schm",
      "sentence_index": 7,
      "text": "Therefore, it is a little misleading to still call it Bayesian active learning.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_soundness-correctness",
      "polarity": "pol_negative"
    },
    {
      "review_id": "BJgzh7Schm",
      "sentence_index": 8,
      "text": "Also, the ensemble of MC-Dropout models does not have the theoretic support from the Bayesian perspective.",
      "suffix": "\n\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_soundness-correctness",
      "polarity": "pol_negative"
    },
    {
      "review_id": "BJgzh7Schm",
      "sentence_index": 9,
      "text": "The motivation for the proposed method is to solve the mode collapse problem of MC-Dropout, but using ensemble loses the Bayesian support benefit of MC-Dropout.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_soundness-correctness",
      "polarity": "pol_negative"
    },
    {
      "review_id": "BJgzh7Schm",
      "sentence_index": 10,
      "text": "So it seems not a reasonable solution for the mode collapse problem of MC-Dropout.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_soundness-correctness",
      "polarity": "pol_negative"
    },
    {
      "review_id": "BJgzh7Schm",
      "sentence_index": 11,
      "text": "It is not clear to me why we need to add MC-Dropout to the ensemble.",
      "suffix": "",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_explanation",
      "aspect": "asp_soundness-correctness",
      "polarity": "none"
    },
    {
      "review_id": "BJgzh7Schm",
      "sentence_index": 12,
      "text": "What is the benefit of DEBAL over an ensemble method if both of them do not have Bayesian theoretic support?",
      "suffix": "\n\n",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_explanation",
      "aspect": "asp_soundness-correctness",
      "polarity": "none"
    },
    {
      "review_id": "BJgzh7Schm",
      "sentence_index": 13,
      "text": "In terms of the empirical results, the better performance of DEBAL compared to a single MC-Dropout model is not supervising as Beluch et al. (2018) already demonstrated that an ensemble is better than a single MC-Dropout.",
      "suffix": "",
      "review_action": "arg_fact",
      "fine_review_action": "none",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "BJgzh7Schm",
      "sentence_index": 14,
      "text": "While the improvement of DEBAL compared to an ensemble is marginal but is reasonable.",
      "suffix": "\n\n",
      "review_action": "arg_fact",
      "fine_review_action": "none",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "BJgzh7Schm",
      "sentence_index": 15,
      "text": "The labels of figures are hard to read.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_clarity",
      "polarity": "pol_negative"
    }
  ],
  "rebuttal_sentences": [
    {
      "review_id": "BJgzh7Schm",
      "rebuttal_id": "B1xXG3110Q",
      "sentence_index": 0,
      "text": "We thank the reviewer for its valuable and insightful comments.",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_social",
      "alignment": [
        "context_global",
        null
      ],
      "details": {}
    },
    {
      "review_id": "BJgzh7Schm",
      "rebuttal_id": "B1xXG3110Q",
      "sentence_index": 1,
      "text": "We are reviewing our work from a theoretical point of view and will update the paper very soon to reflect this.",
      "suffix": "\n\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_done",
      "alignment": [
        "context_global",
        null
      ],
      "details": {
        "request_out_of_scope": true
      }
    },
    {
      "review_id": "BJgzh7Schm",
      "rebuttal_id": "B1xXG3110Q",
      "sentence_index": 2,
      "text": "Even though we have not yet proved the above, we have empirically showed that the benefit of DEBAL over plain ensemble methods consists of a better representation of uncertainty, that is paramount in active learning.",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-criticism",
      "alignment": [
        "context_sentences",
        [
          5,
          6,
          7,
          8,
          9,
          10,
          11,
          12
        ]
      ],
      "details": {}
    },
    {
      "review_id": "BJgzh7Schm",
      "rebuttal_id": "B1xXG3110Q",
      "sentence_index": 3,
      "text": "By better we mean",
      "suffix": "\n",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-criticism",
      "alignment": [
        "context_sentences",
        [
          5,
          6,
          7,
          8,
          9,
          10,
          11,
          12
        ]
      ],
      "details": {}
    },
    {
      "review_id": "BJgzh7Schm",
      "rebuttal_id": "B1xXG3110Q",
      "sentence_index": 4,
      "text": "1) more meaningful and closer to what one would expect (Fig 4 & Fig 6 (right))",
      "suffix": "\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_summary",
      "alignment": [
        "context_sentences",
        [
          5,
          6,
          7,
          8,
          9,
          10,
          11,
          12
        ]
      ],
      "details": {}
    },
    {
      "review_id": "BJgzh7Schm",
      "rebuttal_id": "B1xXG3110Q",
      "sentence_index": 5,
      "text": "2) better calibrated (Fig 6 (left)).",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_summary",
      "alignment": [
        "context_sentences",
        [
          5,
          6,
          7,
          8,
          9,
          10,
          11,
          12
        ]
      ],
      "details": {}
    },
    {
      "review_id": "BJgzh7Schm",
      "rebuttal_id": "B1xXG3110Q",
      "sentence_index": 6,
      "text": "Our initial aim was not to compare stochastic ensembles with deterministic or single MC-dropout but to correct for the mode collapse issue in estimating posteriors with MC-dropout.",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-criticism",
      "alignment": [
        "context_sentences",
        [
          5,
          6,
          7,
          8,
          9,
          10,
          11,
          12
        ]
      ],
      "details": {}
    },
    {
      "review_id": "BJgzh7Schm",
      "rebuttal_id": "B1xXG3110Q",
      "sentence_index": 7,
      "text": "We have empirically shown that adding ensembles to this, greatly improves the MC-dropout technique and outperforms the deterministic ensembles as well.",
      "suffix": "\n",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-criticism",
      "alignment": [
        "context_sentences",
        [
          5,
          6,
          7,
          8,
          9,
          10,
          11,
          12
        ]
      ],
      "details": {}
    },
    {
      "review_id": "BJgzh7Schm",
      "rebuttal_id": "B1xXG3110Q",
      "sentence_index": 8,
      "text": "We had similar doubts about the benefit of adding MC-Dropout to an ensemble.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          5,
          6,
          7,
          8,
          9,
          10,
          11,
          12
        ]
      ],
      "details": {}
    },
    {
      "review_id": "BJgzh7Schm",
      "rebuttal_id": "B1xXG3110Q",
      "sentence_index": 9,
      "text": "Therefore, we contrasted the performance of DEBAL against the plain ensemble method and showed empirically that DEBAL gives rise to better measures of uncertainty.",
      "suffix": "\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          5,
          6,
          7,
          8,
          9,
          10,
          11,
          12
        ]
      ],
      "details": {}
    },
    {
      "review_id": "BJgzh7Schm",
      "rebuttal_id": "B1xXG3110Q",
      "sentence_index": 10,
      "text": "Finally, as we strive to make our assumptions hold theoretically, we agree that adding theoretical Bayesian support to our method is of great importance if we are to further improve the understanding of Bayesian deep learning.",
      "suffix": "\n\n",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-criticism",
      "alignment": [
        "context_sentences",
        [
          5,
          6,
          7,
          8,
          9,
          10,
          11,
          12
        ]
      ],
      "details": {}
    },
    {
      "review_id": "BJgzh7Schm",
      "rebuttal_id": "B1xXG3110Q",
      "sentence_index": 11,
      "text": "For your final point, although Beluch et al. (2018) showed better performance for ensembles, we have shown this in the context of a small dataset problem (i.e. the size of the final dataset acquired during AL is only a small fraction of the entire available unlabelled dataset), which we believe is more relevant to the real world cases if AL is to become a widely used method.",
      "suffix": "\n\n",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-criticism",
      "alignment": [
        "context_sentences",
        [
          13,
          14
        ]
      ],
      "details": {}
    },
    {
      "review_id": "BJgzh7Schm",
      "rebuttal_id": "B1xXG3110Q",
      "sentence_index": 12,
      "text": "As for the figures, we are aware of this and will try to make them more clear in a revised version.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_by-cr",
      "alignment": [
        "context_sentences",
        [
          15
        ]
      ],
      "details": {
        "manuscript_change": true
      }
    }
  ]
}