{
  "metadata": {
    "forum_id": "SyVuRiC5K7",
    "review_id": "S1x4ca-chQ",
    "rebuttal_id": "Hye0RIZ_RX",
    "title": "LEARNING TO PROPAGATE LABELS: TRANSDUCTIVE PROPAGATION NETWORK FOR FEW-SHOT LEARNING",
    "reviewer": "AnonReviewer2",
    "rating": 6,
    "conference": "ICLR2019",
    "permalink": "https://openreview.net/forum?id=SyVuRiC5K7&noteId=Hye0RIZ_RX",
    "annotator": "anno2"
  },
  "review_sentences": [
    {
      "review_id": "S1x4ca-chQ",
      "sentence_index": 0,
      "text": "Summary",
      "suffix": "\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_heading",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "S1x4ca-chQ",
      "sentence_index": 1,
      "text": "This paper proposes a meta-learning framework that leverages unlabeled data by learning the graph-based label propogation in an end-to-end manner.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "S1x4ca-chQ",
      "sentence_index": 2,
      "text": "The proposed approaches are evaluated on two few-shot datasets and achieves the state-of-the-art results.",
      "suffix": "\n\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "S1x4ca-chQ",
      "sentence_index": 3,
      "text": "Pros.",
      "suffix": "\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_heading",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "S1x4ca-chQ",
      "sentence_index": 4,
      "text": "-This paper is well-motivated.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_motivation-impact",
      "polarity": "pol_positive"
    },
    {
      "review_id": "S1x4ca-chQ",
      "sentence_index": 5,
      "text": "Studying label propagation in the meta-learning setting is interesting and novel.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_originality",
      "polarity": "pol_positive"
    },
    {
      "review_id": "S1x4ca-chQ",
      "sentence_index": 6,
      "text": "Intuitively, transductive label propagation should improve supervised learning when the number of labeled instances is low.",
      "suffix": "\n",
      "review_action": "arg_fact",
      "fine_review_action": "none",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "S1x4ca-chQ",
      "sentence_index": 7,
      "text": "-The empirical results show improvement over the baselines, which are expected.",
      "suffix": "\n\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_soundness-correctness",
      "polarity": "pol_positive"
    },
    {
      "review_id": "S1x4ca-chQ",
      "sentence_index": 8,
      "text": "Cons.",
      "suffix": "\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_heading",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "S1x4ca-chQ",
      "sentence_index": 9,
      "text": "-Some technical details  are missing.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_clarity",
      "polarity": "pol_negative"
    },
    {
      "review_id": "S1x4ca-chQ",
      "sentence_index": 10,
      "text": "In Section 3.2.2, the authors only explain how they learn example-based \\sigma, but details on how to make graph construction end-to-end trainable are missing.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_replicability",
      "polarity": "pol_negative"
    },
    {
      "review_id": "S1x4ca-chQ",
      "sentence_index": 11,
      "text": "Constructing the full weight matrix requires the whole dataset as input and selecting k-nearest neighbor is a non-differentiable operation. Can you give more explanations?",
      "suffix": "\n",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_explanation",
      "aspect": "asp_soundness-correctness",
      "polarity": "none"
    },
    {
      "review_id": "S1x4ca-chQ",
      "sentence_index": 12,
      "text": "-Does episode training help label propagation? How about the results of label propagation without the episode training?",
      "suffix": "",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_explanation",
      "aspect": "asp_clarity",
      "polarity": "none"
    }
  ],
  "rebuttal_sentences": [
    {
      "review_id": "S1x4ca-chQ",
      "rebuttal_id": "Hye0RIZ_RX",
      "sentence_index": 0,
      "text": "Please refer to our main response in an above comment that addresses the primary and common questions amongst all reviewers.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_none",
        null
      ],
      "details": {}
    },
    {
      "review_id": "S1x4ca-chQ",
      "rebuttal_id": "Hye0RIZ_RX",
      "sentence_index": 1,
      "text": "Here we respond to your specific comments.",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_in-rebuttal",
        null
      ],
      "details": {}
    },
    {
      "review_id": "S1x4ca-chQ",
      "rebuttal_id": "Hye0RIZ_RX",
      "sentence_index": 2,
      "text": "\"Some technical details are missing.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          9,
          10,
          11
        ]
      ],
      "details": {}
    },
    {
      "review_id": "S1x4ca-chQ",
      "rebuttal_id": "Hye0RIZ_RX",
      "sentence_index": 3,
      "text": "In Section 3.2.2, the authors only explain how they learn example-based \\sigma, but details on how to make graph construction end-to-end trainable are missing.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          9,
          10,
          11
        ]
      ],
      "details": {}
    },
    {
      "review_id": "S1x4ca-chQ",
      "rebuttal_id": "Hye0RIZ_RX",
      "sentence_index": 4,
      "text": "Constructing the full weight matrix requires the whole dataset as input and selecting k-nearest neighbor is a non-differentiable operation. Can you give more explanations?\"",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          9,
          10,
          11
        ]
      ],
      "details": {}
    },
    {
      "review_id": "S1x4ca-chQ",
      "rebuttal_id": "Hye0RIZ_RX",
      "sentence_index": 5,
      "text": ">>> Thanks for pointing out the details.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_social",
      "alignment": [
        "context_sentences",
        [
          9,
          10,
          11
        ]
      ],
      "details": {}
    },
    {
      "review_id": "S1x4ca-chQ",
      "rebuttal_id": "Hye0RIZ_RX",
      "sentence_index": 6,
      "text": "We want to clarify the few-shot setting.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          9,
          10,
          11
        ]
      ],
      "details": {}
    },
    {
      "review_id": "S1x4ca-chQ",
      "rebuttal_id": "Hye0RIZ_RX",
      "sentence_index": 7,
      "text": "We follow the widely-used episodic paradigm proposed by Matching Networks [1].",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          9,
          10,
          11
        ]
      ],
      "details": {}
    },
    {
      "review_id": "S1x4ca-chQ",
      "rebuttal_id": "Hye0RIZ_RX",
      "sentence_index": 8,
      "text": "In each episode (training batch), our algorithm solves a small classification problem which contains N classes each having K support and Q query examples (e.g., N=5, K=1, Q=15, totally 80 examples).",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          9,
          10,
          11
        ]
      ],
      "details": {}
    },
    {
      "review_id": "S1x4ca-chQ",
      "rebuttal_id": "Hye0RIZ_RX",
      "sentence_index": 9,
      "text": "The weight matrix is constructed on the support and query examples in each episode rather than the whole dataset.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          9,
          10,
          11
        ]
      ],
      "details": {}
    },
    {
      "review_id": "S1x4ca-chQ",
      "rebuttal_id": "Hye0RIZ_RX",
      "sentence_index": 10,
      "text": "This is very fast and efficient.",
      "suffix": "\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          9,
          10,
          11
        ]
      ],
      "details": {}
    },
    {
      "review_id": "S1x4ca-chQ",
      "rebuttal_id": "Hye0RIZ_RX",
      "sentence_index": 11,
      "text": "In deep neural networks, there is a common trick in computing the gradient of operations non-differentiable at some points, but differentiable elsewhere, such as Max-Pooling (top-1) and top-k.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          9,
          10,
          11
        ]
      ],
      "details": {}
    },
    {
      "review_id": "S1x4ca-chQ",
      "rebuttal_id": "Hye0RIZ_RX",
      "sentence_index": 12,
      "text": "In forward computation pass, the index position of the max (or top-k) values are stored.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          9,
          10,
          11
        ]
      ],
      "details": {}
    },
    {
      "review_id": "S1x4ca-chQ",
      "rebuttal_id": "Hye0RIZ_RX",
      "sentence_index": 13,
      "text": "While in the back propagation pass, the gradient is computed only with respect to these saved positions.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          9,
          10,
          11
        ]
      ],
      "details": {}
    },
    {
      "review_id": "S1x4ca-chQ",
      "rebuttal_id": "Hye0RIZ_RX",
      "sentence_index": 14,
      "text": "This trick is implemented in modern deep learning frameworks such as tensorflow and pytorch.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          9,
          10,
          11
        ]
      ],
      "details": {}
    },
    {
      "review_id": "S1x4ca-chQ",
      "rebuttal_id": "Hye0RIZ_RX",
      "sentence_index": 15,
      "text": "In our paper, we use the tensorflow function tf.nn.top_k() to compute k-nearest neighbor operation.",
      "suffix": "\n\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          9,
          10,
          11
        ]
      ],
      "details": {}
    },
    {
      "review_id": "S1x4ca-chQ",
      "rebuttal_id": "Hye0RIZ_RX",
      "sentence_index": 16,
      "text": "\"Does episode training help label propagation? How about the results of label propagation without the episode training? \"",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          12
        ]
      ],
      "details": {}
    },
    {
      "review_id": "S1x4ca-chQ",
      "rebuttal_id": "Hye0RIZ_RX",
      "sentence_index": 17,
      "text": ">>> In our paper, the length scale parameter \\sigma is trained in an example-wise and episodic-wise way, as described in section 3.2.2 and Figure 4 of Appendix A. In order to investigate the benefit of episodic training, we combine the heuristic-based label propagation methods [2] with meta-learning to serve as a transductive baseline.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          12
        ]
      ],
      "details": {}
    },
    {
      "review_id": "S1x4ca-chQ",
      "rebuttal_id": "Hye0RIZ_RX",
      "sentence_index": 18,
      "text": "Please refer to Table 1 and Table 2 line \"Label Propagation\".",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          12
        ]
      ],
      "details": {}
    },
    {
      "review_id": "S1x4ca-chQ",
      "rebuttal_id": "Hye0RIZ_RX",
      "sentence_index": 19,
      "text": "It can be seen that TPN outperforms naive label propagation with a large margin, thus verifying the effectiveness of episode training.",
      "suffix": "\n\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_summary",
      "alignment": [
        "context_sentences",
        [
          12
        ]
      ],
      "details": {}
    },
    {
      "review_id": "S1x4ca-chQ",
      "rebuttal_id": "Hye0RIZ_RX",
      "sentence_index": 20,
      "text": "[1] Vinyals, Oriol et al. \"Matching networks for one shot learning.\" NIPS. 2016.",
      "suffix": "\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_other",
      "alignment": [
        "context_in-rebuttal",
        null
      ],
      "details": {}
    },
    {
      "review_id": "S1x4ca-chQ",
      "rebuttal_id": "Hye0RIZ_RX",
      "sentence_index": 21,
      "text": "[2] Zhou, Denny et al. \"Learning with local and global consistency.\" NIPS. 2004.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_other",
      "alignment": [
        "context_in-rebuttal",
        null
      ],
      "details": {}
    }
  ]
}