{
  "metadata": {
    "forum_id": "rket4i0qtX",
    "review_id": "SygP4mws3Q",
    "rebuttal_id": "S1eF1dL3TQ",
    "title": "The meaning of \"most\" for visual question answering models",
    "reviewer": "AnonReviewer2",
    "rating": 7,
    "conference": "ICLR2019",
    "permalink": "https://openreview.net/forum?id=rket4i0qtX&noteId=S1eF1dL3TQ",
    "annotator": "anno13"
  },
  "review_sentences": [
    {
      "review_id": "SygP4mws3Q",
      "sentence_index": 0,
      "text": "This paper studies how the FiLM visual question answering (VQA) model answer questions involving the quantifier \u2018most\u2019.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "SygP4mws3Q",
      "sentence_index": 1,
      "text": "This quantifier is chosen for study because it cannot be expressed in first order logic (i.e., high-order logic is required), and secondly because there are two different algorithmic approaches to answering questions involving \u2018most\u2019 (cardinality-based strategy and pairing-based strategy).",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "SygP4mws3Q",
      "sentence_index": 2,
      "text": "Experiments are performed by designing abstract visual scenes with controlled numerosity and spatial layouts, and applying methodologies from pyscholinguistics.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "SygP4mws3Q",
      "sentence_index": 3,
      "text": "The paper concludes that the model learns an approximate number system (ANS), consistent with the cardinality-based strategy, with implications for understanding the conditions under which existing VQA models should perform well or badly (and possibly for improving VQA models).",
      "suffix": "\n\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "SygP4mws3Q",
      "sentence_index": 4,
      "text": "Strengths:",
      "suffix": "\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_heading",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "SygP4mws3Q",
      "sentence_index": 5,
      "text": "- The research question is clear and well-conceived. In general, it seems there are significant opportunities for better collaboration between the experimental psychology and machine learning communities, and this is a good example of the benefits.",
      "suffix": "\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_motivation-impact",
      "polarity": "pol_positive"
    },
    {
      "review_id": "SygP4mws3Q",
      "sentence_index": 6,
      "text": "- The paper is clear, highly-focused, and well-written.",
      "suffix": "\n\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_clarity",
      "polarity": "pol_positive"
    },
    {
      "review_id": "SygP4mws3Q",
      "sentence_index": 7,
      "text": "Weaknesses:",
      "suffix": "\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_heading",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "SygP4mws3Q",
      "sentence_index": 8,
      "text": "- The arguments for why the experimental evidence actually supports the existance of an approximate number system (ANS) could be made more clear.",
      "suffix": "",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_result",
      "aspect": "asp_clarity",
      "polarity": "none"
    },
    {
      "review_id": "SygP4mws3Q",
      "sentence_index": 9,
      "text": "For example, the section on \u201cRatios andWeber fraction\u201d argues that \u201cthese curves align well with the trend predicted by Weber\u2019s law\u201d, but does not explain how the experimental data would present if the alternative hypothesis (pairing-based strategy) was being used.",
      "suffix": "",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_result",
      "aspect": "asp_clarity",
      "polarity": "none"
    },
    {
      "review_id": "SygP4mws3Q",
      "sentence_index": 10,
      "text": "What would the pairing-based strategy look like in Figure 6 right? Are there not significance tests that could be used to more carefully quantify the level of support for the two alternative strategies?",
      "suffix": "\n",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_clarification",
      "aspect": "asp_substance",
      "polarity": "none"
    },
    {
      "review_id": "SygP4mws3Q",
      "sentence_index": 11,
      "text": "- The experiments seem very similar to Wu et al. 2018, which is considered to be prior work under the ICLR guidelines.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_meaningful-comparison",
      "polarity": "none"
    },
    {
      "review_id": "SygP4mws3Q",
      "sentence_index": 12,
      "text": "While this paper is acknowledged in the related work, it would be helpful to expand further on the relationship between these works, so the originality and contribution of this paper can be better evaluated.",
      "suffix": "\n",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_explanation",
      "aspect": "asp_meaningful-comparison",
      "polarity": "none"
    },
    {
      "review_id": "SygP4mws3Q",
      "sentence_index": 13,
      "text": "- In some ways it is not that surprising that the CNN more easily learns an approximate number system rather than a pairing-based algorithm, as the later would presumably need to learn a different convolutional filter for every possible spatial arrangement of the pairs (which would be very sample inefficient).",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_substance",
      "polarity": "pol_negative"
    },
    {
      "review_id": "SygP4mws3Q",
      "sentence_index": 14,
      "text": "Therefore, it might be interesting to consider, are there any circumstances under which the CNN would learn a pairing based algorithm?",
      "suffix": "",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_edit",
      "aspect": "asp_substance",
      "polarity": "none"
    },
    {
      "review_id": "SygP4mws3Q",
      "sentence_index": 15,
      "text": "For example, what if the spatial configuration of the pairs was simplified, so they were always side-by-side at a fixed distance? If pairing-based algorithms emerged under simplified scenarios, this might have implications for the design of CNN filters (if we want models that are capable of learning these types of functions).",
      "suffix": "\n\n",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_edit",
      "aspect": "asp_substance",
      "polarity": "none"
    },
    {
      "review_id": "SygP4mws3Q",
      "sentence_index": 16,
      "text": "Summary:",
      "suffix": "\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_heading",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "SygP4mws3Q",
      "sentence_index": 17,
      "text": "I regard this as a good paper, with a couple of weakness that could be addressed as indicated.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_substance",
      "polarity": "pol_positive"
    }
  ],
  "rebuttal_sentences": [
    {
      "review_id": "SygP4mws3Q",
      "rebuttal_id": "S1eF1dL3TQ",
      "sentence_index": 0,
      "text": "Many thanks for the valuable feedback! We uploaded a revised version of the paper, and in the following address the weaknesses you pointed out:",
      "suffix": "\n\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_done",
      "alignment": [
        "context_global",
        null
      ],
      "details": {
        "request_out_of_scope": true
      }
    },
    {
      "review_id": "SygP4mws3Q",
      "rebuttal_id": "S1eF1dL3TQ",
      "sentence_index": 1,
      "text": "- Due to space constraints, we have to refer to Pietroski et al.'s work for more elaborate reasoning regarding the cognitive implications.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          8
        ]
      ],
      "details": {}
    },
    {
      "review_id": "SygP4mws3Q",
      "rebuttal_id": "S1eF1dL3TQ",
      "sentence_index": 2,
      "text": "We think their experiments are supposed to give strong indication for the ANS as a likely explanation of human behavior, and thus in our work for the FiLM model, without ultimately ruling out the pairing-based strategy (which is probably impossible via experiments evaluating extrinsic behavior only).",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          8
        ]
      ],
      "details": {}
    },
    {
      "review_id": "SygP4mws3Q",
      "rebuttal_id": "S1eF1dL3TQ",
      "sentence_index": 3,
      "text": "We are not aware of what the curves for the pairing-based strategy in figure 6 would look like, but there is definitely evidence for Weber's Law in other approximate systems (where pairing-based strategies are no alternative), thus suggesting that similar mechanisms are at work here.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          10
        ]
      ],
      "details": {}
    },
    {
      "review_id": "SygP4mws3Q",
      "rebuttal_id": "S1eF1dL3TQ",
      "sentence_index": 4,
      "text": "We added a footnote on this to section 2.4.",
      "suffix": "\n\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_done",
      "alignment": [
        "context_sentences",
        [
          10
        ]
      ],
      "details": {
        "request_out_of_scope": true
      }
    },
    {
      "review_id": "SygP4mws3Q",
      "rebuttal_id": "S1eF1dL3TQ",
      "sentence_index": 5,
      "text": "- We added a sentence to section 5 to clarify the differences (they focus on subitizing while we focus on ANS, and their experiments follow a different methodology with specifically designed data used for training and not just evaluation).",
      "suffix": "\n\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_done",
      "alignment": [
        "context_sentences",
        [
          8
        ]
      ],
      "details": {
        "request_out_of_scope": true
      }
    },
    {
      "review_id": "SygP4mws3Q",
      "rebuttal_id": "S1eF1dL3TQ",
      "sentence_index": 6,
      "text": "- We actually consider the pairing-based strategy as more likely to be learned.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          10
        ]
      ],
      "details": {}
    },
    {
      "review_id": "SygP4mws3Q",
      "rebuttal_id": "S1eF1dL3TQ",
      "sentence_index": 7,
      "text": "Why? You're right that the convolutions need to learn to handle all possible spatial arrangements, but we think that this is the case for both the pairing- and the cardinality-based strategy, while the latter in addition needs to learn a presumably (our intuition) more complex aggregation mechanism of the locally computed results.",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-criticism",
      "alignment": [
        "context_sentences",
        [
          13,
          14,
          15
        ]
      ],
      "details": {}
    },
    {
      "review_id": "SygP4mws3Q",
      "rebuttal_id": "S1eF1dL3TQ",
      "sentence_index": 8,
      "text": "Anyway, we added a few sentences to the end of section 2.4 discussing our intuition in some more detail to address this point.",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_mitigate-criticism",
      "alignment": [
        "context_sentences",
        [
          13,
          14,
          15
        ]
      ],
      "details": {}
    }
  ]
}