{
  "metadata": {
    "forum_id": "S1gd7nCcF7",
    "review_id": "rJl-0ufypm",
    "rebuttal_id": "HkesnzFKCQ",
    "title": "Self-Supervised Generalisation with Meta Auxiliary Learning",
    "reviewer": "AnonReviewer2",
    "rating": 4,
    "conference": "ICLR2019",
    "permalink": "https://openreview.net/forum?id=S1gd7nCcF7&noteId=HkesnzFKCQ",
    "annotator": "anno3"
  },
  "review_sentences": [
    {
      "review_id": "rJl-0ufypm",
      "sentence_index": 0,
      "text": "This paper proposes an algorithm for auxiliary learning.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rJl-0ufypm",
      "sentence_index": 1,
      "text": "Given a target prediction task to be learned on training data, the auxiliary learning utilizes external training data to improve learning.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rJl-0ufypm",
      "sentence_index": 2,
      "text": "The authors focus on a setup where both target and external training data come from the same distribution but differ in class labels, where each class in the target data is a set of finer-grained classes in the auxiliary data.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rJl-0ufypm",
      "sentence_index": 3,
      "text": "The authors propose a heuristic for learning from both data sets through minimization of a joint loss function.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rJl-0ufypm",
      "sentence_index": 4,
      "text": "The experimental results show that the proposed methods works well on this particular setup on CIFAR data set.",
      "suffix": "\n\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rJl-0ufypm",
      "sentence_index": 5,
      "text": "Strengths:",
      "suffix": "\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_heading",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rJl-0ufypm",
      "sentence_index": 6,
      "text": "+ a new auxiliary learning algorithm",
      "suffix": "\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_originality",
      "polarity": "pol_positive"
    },
    {
      "review_id": "rJl-0ufypm",
      "sentence_index": 7,
      "text": "+ positive results on CIFAR data set",
      "suffix": "\n\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_substance",
      "polarity": "pol_positive"
    },
    {
      "review_id": "rJl-0ufypm",
      "sentence_index": 8,
      "text": "Weaknesses:",
      "suffix": "\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_heading",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rJl-0ufypm",
      "sentence_index": 9,
      "text": "- novelty is low: the proposed algorithm is a heuristic similar to previously proposed algorithms in the transfer learning and auxiliary learning space",
      "suffix": "\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_originality",
      "polarity": "pol_negative"
    },
    {
      "review_id": "rJl-0ufypm",
      "sentence_index": 10,
      "text": "- there is no attempt to provide a theoretical insight into the performance of the algorithm",
      "suffix": "\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_substance",
      "polarity": "pol_negative"
    },
    {
      "review_id": "rJl-0ufypm",
      "sentence_index": 11,
      "text": "- the problem assumptions are too simplistic and unrealistic (feature distributions of target and auxiliary data are identical), so it is questionable if the proposed algorithm has practical importance",
      "suffix": "\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_replicability",
      "polarity": "pol_negative"
    },
    {
      "review_id": "rJl-0ufypm",
      "sentence_index": 12,
      "text": "- experiments are performed using a synthetic setup on a single data set, so it remains unclear if the algorithm would be successful in a real life scenario",
      "suffix": "\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_replicability",
      "polarity": "pol_negative"
    },
    {
      "review_id": "rJl-0ufypm",
      "sentence_index": 13,
      "text": "- the paper is poorly written and sentences are generally very hard to parse. For example, section 3.1 is opened by statements such as \"(we use) a multi-task evaluator which trains on the principal and auxiliary tasks, and evaluates the performance of the auxiliary tasks on a meta set\"??",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_clarity",
      "polarity": "pol_negative"
    }
  ],
  "rebuttal_sentences": [
    {
      "review_id": "rJl-0ufypm",
      "rebuttal_id": "HkesnzFKCQ",
      "sentence_index": 0,
      "text": "We thank for the reviewer for their comments on our work, and we share our responses below.",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_social",
      "alignment": [
        "context_none",
        null
      ],
      "details": {}
    },
    {
      "review_id": "rJl-0ufypm",
      "rebuttal_id": "HkesnzFKCQ",
      "sentence_index": 1,
      "text": "1. Novelty: To the best of our knowledge, this is the first paper presenting a simple solution to generating useful auxiliary tasks in a self-supervised manner.",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-criticism",
      "alignment": [
        "context_sentences",
        [
          9
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rJl-0ufypm",
      "rebuttal_id": "HkesnzFKCQ",
      "sentence_index": 2,
      "text": "The idea indeed was inspired by other works in auxiliary learning, but only to the extent that we also use auxiliary tasks to improve performance of a principal task.",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-criticism",
      "alignment": [
        "context_sentences",
        [
          9
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rJl-0ufypm",
      "rebuttal_id": "HkesnzFKCQ",
      "sentence_index": 3,
      "text": "The method is not a heuristic; it is theoretically motivated by use of the double gradient, and inspired by the success of this in meta learning (e.g. MAML [1]).",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          9
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rJl-0ufypm",
      "rebuttal_id": "HkesnzFKCQ",
      "sentence_index": 4,
      "text": "If the reviewer thinks our method is an incremental contribution or similar to previous algorithms, please list the specific references.",
      "suffix": "\n\n",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-criticism",
      "alignment": [
        "context_sentences",
        [
          9
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rJl-0ufypm",
      "rebuttal_id": "HkesnzFKCQ",
      "sentence_index": 5,
      "text": "2.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_in-rebuttal",
        null
      ],
      "details": {}
    },
    {
      "review_id": "rJl-0ufypm",
      "rebuttal_id": "HkesnzFKCQ",
      "sentence_index": 6,
      "text": "The theoretical insight in this paper comes from the recent advancements in using a double gradient, such as in MAML [1], or understanding what makes a good auxiliary data sampler [2].",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-criticism",
      "alignment": [
        "context_sentences",
        [
          10
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rJl-0ufypm",
      "rebuttal_id": "HkesnzFKCQ",
      "sentence_index": 7,
      "text": "The inner gradient is based on the standard auxiliary learning loss as proposed in other works, whereas the outer gradient uses this inner gradient to actually learn the auxiliary tasks.",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-criticism",
      "alignment": [
        "context_sentences",
        [
          10
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rJl-0ufypm",
      "rebuttal_id": "HkesnzFKCQ",
      "sentence_index": 8,
      "text": "The use of an outer gradient for auxiliary learning is our key novelty, and has not been used in any works before.",
      "suffix": "\n\n",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-criticism",
      "alignment": [
        "context_sentences",
        [
          10
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rJl-0ufypm",
      "rebuttal_id": "HkesnzFKCQ",
      "sentence_index": 9,
      "text": "3. Feature distributions of training and meta-training data (target and auxiliary data in your language) are actually not identical.",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-criticism",
      "alignment": [
        "context_sentences",
        [
          11
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rJl-0ufypm",
      "rebuttal_id": "HkesnzFKCQ",
      "sentence_index": 10,
      "text": "The \"learning to generalise\" success from our method is due to closing the *existing* distribution shift in these two datasets. If the distributions are identical, then we wouldn't have any improved generalisation from our method.",
      "suffix": "\n\n",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-criticism",
      "alignment": [
        "context_sentences",
        [
          11
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rJl-0ufypm",
      "rebuttal_id": "HkesnzFKCQ",
      "sentence_index": 11,
      "text": "4. Both CIFAR10 and CIFAR100 are the subsets from 80 million tiny images dataset [3].",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-criticism",
      "alignment": [
        "context_sentences",
        [
          12
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rJl-0ufypm",
      "rebuttal_id": "HkesnzFKCQ",
      "sentence_index": 12,
      "text": "As described in the website and paper, all images are collected from the internet and partially labelled by humans, and thus indeed present a real-world setup rather than a synthetic setup.",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-criticism",
      "alignment": [
        "context_sentences",
        [
          12
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rJl-0ufypm",
      "rebuttal_id": "HkesnzFKCQ",
      "sentence_index": 13,
      "text": "Further, we show that if a harder test set with a more variety exists (CIFAR10.1v6), out method could provide even better generalisation (Figure 4).",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-criticism",
      "alignment": [
        "context_sentences",
        [
          12
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rJl-0ufypm",
      "rebuttal_id": "HkesnzFKCQ",
      "sentence_index": 14,
      "text": "Thus, we hope the reviewer could better explain why you think our algorithm could fail in real-world scenarios.",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_followup",
      "alignment": [
        "context_sentences",
        [
          12
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rJl-0ufypm",
      "rebuttal_id": "HkesnzFKCQ",
      "sentence_index": 15,
      "text": "[1] Finn et al. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks ICML, 2017.",
      "suffix": "\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_other",
      "alignment": [
        "context_in-rebuttal",
        null
      ],
      "details": {}
    },
    {
      "review_id": "rJl-0ufypm",
      "rebuttal_id": "HkesnzFKCQ",
      "sentence_index": 16,
      "text": "[2] Zhang et al. Fine-Grained Visual Categorization using Meta-Learning Optimization with Sample Selection of Auxiliary Data, ECCV 2018.",
      "suffix": "\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_other",
      "alignment": [
        "context_in-rebuttal",
        null
      ],
      "details": {}
    },
    {
      "review_id": "rJl-0ufypm",
      "rebuttal_id": "HkesnzFKCQ",
      "sentence_index": 17,
      "text": "[3] http://people.csail.mit.edu/torralba/tinyimages/",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_other",
      "alignment": [
        "context_in-rebuttal",
        null
      ],
      "details": {}
    }
  ]
}