{
  "metadata": {
    "forum_id": "H1xwNhCcYm",
    "review_id": "r1eWc6qjnX",
    "rebuttal_id": "BygHqa2F0X",
    "title": "Do Deep Generative Models Know What They Don't Know? ",
    "reviewer": "AnonReviewer1",
    "rating": 7,
    "conference": "ICLR2019",
    "permalink": "https://openreview.net/forum?id=H1xwNhCcYm&noteId=BygHqa2F0X",
    "annotator": "anno10"
  },
  "review_sentences": [
    {
      "review_id": "r1eWc6qjnX",
      "sentence_index": 0,
      "text": "I really enjoyed reading the paper! The exposition is clear with interesting observations, and most importantly, the authors walk the extra mile in doing a theoretical analysis of the observed phenomena.",
      "suffix": "\n\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_originality",
      "polarity": "pol_positive"
    },
    {
      "review_id": "r1eWc6qjnX",
      "sentence_index": 1,
      "text": "Questions for the authors:",
      "suffix": "\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_heading",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "r1eWc6qjnX",
      "sentence_index": 2,
      "text": "1. (Also AREA CHAIR NOTE): Another parallel submission to ICLR titled \u201cGenerative Ensembles for Robust Anomaly Detection\u201d makes similar observations and seemed to suggest that ensembling can help counter the observed CIFAR/SVHN phenomena unlike what we see in Figure 10.",
      "suffix": "",
      "review_action": "arg_social",
      "fine_review_action": "none",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "r1eWc6qjnX",
      "sentence_index": 3,
      "text": "Their criteria also accounts for the variance in model log-likelihoods and is hence slightly different.",
      "suffix": "\n",
      "review_action": "arg_social",
      "fine_review_action": "none",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "r1eWc6qjnX",
      "sentence_index": 4,
      "text": "2. Even though Figure 2b shows that SVHN test likelihoods are higher than CIFAR test likelihoods, the overlap in the histograms of CIFAR-train and CIFAR-test is much higher than the overlap in CIFAR-train and SVHN-test.",
      "suffix": "",
      "review_action": "arg_fact",
      "fine_review_action": "none",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "r1eWc6qjnX",
      "sentence_index": 5,
      "text": "If we define both maximum and minimum thresholds based on the CIFAR-train histogram, it seems like one could detect most SVHN samples just by the virtue that there likelihoods are much higher than even the max threshold determined by the CIFAR-train histogram?",
      "suffix": "\n",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_experiment",
      "aspect": "asp_substance",
      "polarity": "pol_negative"
    },
    {
      "review_id": "r1eWc6qjnX",
      "sentence_index": 6,
      "text": "3. Why does the constant image (all zeros) in Figure 9 (appendix) have such a high likelihood? It\u2019s mean (=0 trivially) is clearly different from the means of the CIFAR-10 images (Figure 6a) so the second order analysis of Section 5 doesn\u2019t seem applicable.",
      "suffix": "\n",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_explanation",
      "aspect": "asp_replicability",
      "polarity": "pol_negative"
    },
    {
      "review_id": "r1eWc6qjnX",
      "sentence_index": 7,
      "text": "4. How much of this phenomena do you think is characteristic for images specifically? Would be interesting to test anomaly detection using deep generative models trained on modalities other than images.",
      "suffix": "\n",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_experiment",
      "aspect": "asp_substance",
      "polarity": "pol_negative"
    },
    {
      "review_id": "r1eWc6qjnX",
      "sentence_index": 8,
      "text": "5. One of the anonymous comments on OpenReview is very interesting: samples from a CIFAR model look nothing like SVHN.",
      "suffix": "",
      "review_action": "arg_fact",
      "fine_review_action": "none",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "r1eWc6qjnX",
      "sentence_index": 9,
      "text": "This seems to call the validity of the anomalous into question. Curious what the authors have to say about this.",
      "suffix": "\n\n",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_explanation",
      "aspect": "asp_soundness-correctness",
      "polarity": "pol_negative"
    },
    {
      "review_id": "r1eWc6qjnX",
      "sentence_index": 10,
      "text": "Minor nitpick: There seems to be some space crunching going on via Latex margin and spacing hacks that the authors should ideally avoid :)",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_clarity",
      "polarity": "pol_negative"
    }
  ],
  "rebuttal_sentences": [
    {
      "review_id": "r1eWc6qjnX",
      "rebuttal_id": "BygHqa2F0X",
      "sentence_index": 0,
      "text": "Thanks again, Reviewer #1, for your thoughtful comments.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_social",
      "alignment": [
        "context_global",
        null
      ],
      "details": {}
    },
    {
      "review_id": "r1eWc6qjnX",
      "rebuttal_id": "BygHqa2F0X",
      "sentence_index": 1,
      "text": "We respond to your other comments below.",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_in-rebuttal",
        null
      ],
      "details": {}
    },
    {
      "review_id": "r1eWc6qjnX",
      "rebuttal_id": "BygHqa2F0X",
      "sentence_index": 2,
      "text": "1.  \u201cIt seems like one could detect most SVHN samples just by the virtue that there likelihoods are much higher than even the max threshold determined by the CIFAR-train histogram?\u201d",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          4,
          5
        ]
      ],
      "details": {}
    },
    {
      "review_id": "r1eWc6qjnX",
      "rebuttal_id": "BygHqa2F0X",
      "sentence_index": 3,
      "text": "This is an interesting idea, but we are not sure it is applicable.",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-request",
      "alignment": [
        "context_sentences",
        [
          4,
          5
        ]
      ],
      "details": {
        "request_out_of_scope": false
      }
    },
    {
      "review_id": "r1eWc6qjnX",
      "rebuttal_id": "BygHqa2F0X",
      "sentence_index": 4,
      "text": "If one looks closely at Figure 2 (b), there are still blue and black histogram bars (denoting CIFAR-10 train and test instances) covering the entirety of SVHN\u2019s support (red bars).",
      "suffix": "\n\n\n",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-request",
      "alignment": [
        "context_sentences",
        [
          4,
          5
        ]
      ],
      "details": {
        "request_out_of_scope": false
      }
    },
    {
      "review_id": "r1eWc6qjnX",
      "rebuttal_id": "BygHqa2F0X",
      "sentence_index": 5,
      "text": "2.  \u201c[The constant input]\u2019s mean (=0 trivially) is clearly different from the means of the CIFAR-10 images (Figure 6a) so the second order analysis of Section 5 doesn\u2019t seem applicable.\u201d",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          6
        ]
      ],
      "details": {}
    },
    {
      "review_id": "r1eWc6qjnX",
      "rebuttal_id": "BygHqa2F0X",
      "sentence_index": 6,
      "text": "See general response #2.",
      "suffix": "\n\n\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          6
        ]
      ],
      "details": {}
    },
    {
      "review_id": "r1eWc6qjnX",
      "rebuttal_id": "BygHqa2F0X",
      "sentence_index": 7,
      "text": "3.  \u201cHow much of this phenomena do you think is characteristic for images specifically? Would be interesting to test anomaly detection using deep generative models trained on modalities other than images.\u201d",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          7
        ]
      ],
      "details": {}
    },
    {
      "review_id": "r1eWc6qjnX",
      "rebuttal_id": "BygHqa2F0X",
      "sentence_index": 8,
      "text": "We have not tested non-image data, since images are the primary focus of work on generative models, but this is an interesting area for future work.",
      "suffix": "\n\n\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_future",
      "alignment": [
        "context_sentences",
        [
          7
        ]
      ],
      "details": {}
    },
    {
      "review_id": "r1eWc6qjnX",
      "rebuttal_id": "BygHqa2F0X",
      "sentence_index": 9,
      "text": "4.  \u201cSamples from a CIFAR model look nothing like SVHN. This seems to call the validity of the anomalous into question. Curious what the authors have to say about this.\u201d",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          8,
          9
        ]
      ],
      "details": {}
    },
    {
      "review_id": "r1eWc6qjnX",
      "rebuttal_id": "BygHqa2F0X",
      "sentence_index": 10,
      "text": "This is a very good point.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          8,
          9
        ]
      ],
      "details": {}
    },
    {
      "review_id": "r1eWc6qjnX",
      "rebuttal_id": "BygHqa2F0X",
      "sentence_index": 11,
      "text": "See our response to Shengyang Sun\u2019s comment below.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          8,
          9
        ]
      ],
      "details": {}
    },
    {
      "review_id": "r1eWc6qjnX",
      "rebuttal_id": "BygHqa2F0X",
      "sentence_index": 12,
      "text": "We see think this phenomenon has to do with concentration of measure and typical sets, but we do not yet have a rigorous explanation.",
      "suffix": "\n\n\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          8,
          9
        ]
      ],
      "details": {}
    },
    {
      "review_id": "r1eWc6qjnX",
      "rebuttal_id": "BygHqa2F0X",
      "sentence_index": 13,
      "text": "5.  \u201cThere seems to be some space crunching going on via Latex margin and spacing hacks that the authors should ideally avoid :)\u201d",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          10
        ]
      ],
      "details": {}
    },
    {
      "review_id": "r1eWc6qjnX",
      "rebuttal_id": "BygHqa2F0X",
      "sentence_index": 14,
      "text": "We have fixed the spacing in the latest draft :)",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_done",
      "alignment": [
        "context_sentences",
        [
          10
        ]
      ],
      "details": {
        "request_out_of_scope": true
      }
    }
  ]
}