{
  "metadata": {
    "forum_id": "Bkeeca4Kvr",
    "review_id": "ryxUOGLHcH",
    "rebuttal_id": "S1x3ALTQjS",
    "title": "FEW-SHOT LEARNING ON GRAPHS VIA SUPER-CLASSES BASED ON GRAPH SPECTRAL MEASURES",
    "reviewer": "AnonReviewer1",
    "rating": 3,
    "conference": "ICLR2020",
    "permalink": "https://openreview.net/forum?id=Bkeeca4Kvr&noteId=S1x3ALTQjS",
    "annotator": "anno2"
  },
  "review_sentences": [
    {
      "review_id": "ryxUOGLHcH",
      "sentence_index": 0,
      "text": "This paper proposed a few-shot graph classification algorithm based on graph neural networks.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "ryxUOGLHcH",
      "sentence_index": 1,
      "text": "The learning is based on a large set of base class labeled graphs and a small set of novel class labeled graphs.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "ryxUOGLHcH",
      "sentence_index": 2,
      "text": "The goal is to learn a classification algorithm over the novel class based on the sample from the base class and novel class.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "ryxUOGLHcH",
      "sentence_index": 3,
      "text": "The learning process consists of the following steps.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "ryxUOGLHcH",
      "sentence_index": 4,
      "text": "First, the base class is classified into K super classes based on the spectral embedding of the graph (onto distributions over the corresponding graph spectrum) and the k-means algorithm with the Wasserstein metric.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "ryxUOGLHcH",
      "sentence_index": 5,
      "text": "Second, for each super class, the classification is done through a feature extractor and a classifier.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "ryxUOGLHcH",
      "sentence_index": 6,
      "text": "In the training of the feature extractor and classifier, the author introduces a super-graph with each node representing a super class.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "ryxUOGLHcH",
      "sentence_index": 7,
      "text": "Finally, in the fine-tuning stage, the feature extractor is fixed, and the classifier is trained based on the novel class.",
      "suffix": "\n\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "ryxUOGLHcH",
      "sentence_index": 8,
      "text": "This work seems to be the first attempt to adopt few-shot learning in graph classification tasks.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "ryxUOGLHcH",
      "sentence_index": 9,
      "text": "The architecture is novel, and the classification of graphs based on spectral embedding together with the Wasserstein metric is novel to me.",
      "suffix": "\n\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_originality",
      "polarity": "pol_positive"
    },
    {
      "review_id": "ryxUOGLHcH",
      "sentence_index": 10,
      "text": "I vote for rejecting this submission because of the following concerns.",
      "suffix": "\n\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_heading",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "ryxUOGLHcH",
      "sentence_index": 11,
      "text": "1. The classification of base class into super classes seems questionable to me.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_soundness-correctness",
      "polarity": "pol_negative"
    },
    {
      "review_id": "ryxUOGLHcH",
      "sentence_index": 12,
      "text": "In the meta-learning language, the author attempts to learn a good representation of graphs based on different graph classification tasks generated by a task distribution.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "ryxUOGLHcH",
      "sentence_index": 13,
      "text": "In terms of graph classification, the task distribution is supported on the joint distributions (G, Y).",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "ryxUOGLHcH",
      "sentence_index": 14,
      "text": "Hence, to characterize different tasks, as far as I am concerned, the classification should take both the graph G and the label Y into consideration, instead of solely the graph.",
      "suffix": "\n\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_soundness-correctness",
      "polarity": "pol_negative"
    },
    {
      "review_id": "ryxUOGLHcH",
      "sentence_index": 15,
      "text": "2. Though seemingly very important to the architecture, the purpose of constructing the super-graph g^{sup} in the training of C^{GAT} seems to be unclear to me.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_soundness-correctness",
      "polarity": "pol_negative"
    },
    {
      "review_id": "ryxUOGLHcH",
      "sentence_index": 16,
      "text": "I would appreciate it if the author could provide more explanation on the introduction of the super-graph in training.",
      "suffix": "",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_edit",
      "aspect": "asp_clarity",
      "polarity": "none"
    }
  ],
  "rebuttal_sentences": [
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 0,
      "text": "We thank the reviewer for his comments and observations.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_social",
      "alignment": [
        "context_global",
        null
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 1,
      "text": "Following are the answers to each question you have raised.",
      "suffix": "\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_in-rebuttal",
        null
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 2,
      "text": "R1Q1(a): \u201cThe classification of base class into super classes seems questionable to me.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          11,
          12,
          13
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 3,
      "text": "In the meta-learning language, the author attempts to learn a good representation of graphs",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          11,
          12,
          13
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 4,
      "text": "based on different graph classification tasks generated by a task distribution",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          11,
          12,
          13
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 5,
      "text": ".",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          11,
          12,
          13
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 6,
      "text": "In terms of graph classification, the task distribution is supported on the joint distributions (G, Y).\u201d",
      "suffix": "\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          11,
          12,
          13
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 7,
      "text": "R1A1(a): Our model has two major components, $C^{sup}$ (for super-class prediction) and $C^{GAT}$ (for graph label prediction).",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          11,
          12,
          13
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 8,
      "text": "During the training phase of our classification, $C^{sup}$, which is an MLP layer, learns the super-class labels of the samples based on GIN\u2019s extracted feature vectors (which represent base class labeled graphs).",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          11,
          12,
          13
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 9,
      "text": "In contrast, $C^{GAT}$ takes as input the \u201cgraph of graphs\u201d (supergraph), which models the latent inter-class as well as intra-class information and is constructed in every training batch, along with base-class labels, to learn the associated class distribution.",
      "suffix": "\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          11,
          12,
          13
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 10,
      "text": "Then, during the fine-tuning phase on graphs with novel class labels, the feature extractor\u2019s (GIN) parameters are fixed and $C^{sup}$ is used to infer the super-class label of the novel class labeled graphs.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          11,
          12,
          13
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 11,
      "text": "Then, the parameters learned by $C^{GAT}$ get updated and further \u201cfine-tuned\u201d for better performance on the novel samples.",
      "suffix": "\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          11,
          12,
          13
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 12,
      "text": "In addition to our brief overview, you can also find a neat, detailed summary of our method in reviewer 2\u2019s comments (paragraphs 1-4).",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_summary",
      "alignment": [
        "context_sentences",
        [
          11,
          12,
          13
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 13,
      "text": "The meta-learning framework, where batches are sampled as \u201cepisodes\u201d with N-way K-shot setting, does not perform as well in our few-shot setting on graphs for the following reasons:",
      "suffix": "\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          11,
          12,
          13
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 14,
      "text": "1) We have a very limited total number of training classes (on the order of 10s), compared to the image domain (on the order of 100s and 1000s).",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          11,
          12,
          13
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 15,
      "text": "This limitation hampers learning across tasks and generalization to new unseen tasks.",
      "suffix": "\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          11,
          12,
          13
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 16,
      "text": "2) In each of our batches, we randomly sample a fixed number of training samples belonging to the set of N labels chosen.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          11,
          12,
          13
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 17,
      "text": "Therefore, when building our supergraph, we end up with k-NN graphs of \u201cvariable size\u201d per super-class, compared to the fixed-size (K nodes) k-NN graphs that we would have obtained using episodic learning.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          11,
          12,
          13
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 18,
      "text": "We suspect this further allows our GAT to learn and generalize better to unseen graphs.",
      "suffix": "\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          11,
          12,
          13
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 19,
      "text": "Furthermore, in [1], the authors use a similar strategy in their \u201cbaseline++\u201d method and produce good results.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          11,
          12,
          13
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 20,
      "text": "Their findings are also consistent with our empirical findings.",
      "suffix": "\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          11,
          12,
          13
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 21,
      "text": "[1] Chen et al. \u201cA closer look at Few-Shot Classification\u201d, ICLR 2019",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_other",
      "alignment": [
        "context_in-rebuttal",
        null
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 22,
      "text": "R1Q1(b): Hence, to characterize different tasks, as far as I am concerned, the classification should take both the graph G and the label Y into consideration, instead of solely the graph.\u201d",
      "suffix": "\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          14
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 23,
      "text": "R1A1(b): In every batch of graphs during both the training and fine-tuning phases, each graph is associated with its corresponding graph label.",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_refute-question",
      "alignment": [
        "context_sentences",
        [
          14
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 24,
      "text": "In the case of training, it is a base class, and in the case of fine-tuning, it is a novel class",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_refute-question",
      "alignment": [
        "context_sentences",
        [
          14
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 25,
      "text": ".",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_refute-question",
      "alignment": [
        "context_sentences",
        [
          14
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 26,
      "text": "In the case of $C^{GAT}$, the graph is accompanied by a regular class label, and in the case of $C^{sup}$, the graph is accompanied by a superclass label.",
      "suffix": "\n\n",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_refute-question",
      "alignment": [
        "context_sentences",
        [
          14
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 27,
      "text": "R1Q2: \u201cThough seemingly very important to the architecture, the purpose of constructing the super-graph $g^{sup}$ in the training of $C^{GAT}$ seems to be unclear to me.\u201d",
      "suffix": "\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          15
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 28,
      "text": "R1A2: What makes few-shot learning particularly difficult compared to common machine learning settings is the dearth of training examples, which results in a bad empirical risk approximation for the expected risk and therefore gives rise to an empirical risk minimizer that is sub-optimal.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          15
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 29,
      "text": "Reducing the required sample complexity can result in a better empirical risk minimizer.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          15
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 30,
      "text": "Therefore, given a very large space of hypotheses H, our goal is to further restrict and constrain H using some prior knowledge because a reduced H has reduced sample complexity and thus requires fewer training samples to be trained.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          15
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 31,
      "text": "We provide this \u201cprior knowledge\u201d in the form of a \u201cgraph of graphs\u201d, namely our super-graph $g^{sup}$, which captures both the latent inter-class and intra-class relationships between classes.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          15
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 32,
      "text": "Observe that in $g^{sup}$, we build a k-NN graph PER super-class, restricting any flow of information between super-classes, thus further restricting H. We force our model to jointly learn both the superclass and graph class labels.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          15
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 33,
      "text": "This way similar classes (grouped under a superclass) together contribute to learning a general prior representing the superclasses and each superclass also provides \u201cguidance\u201d to better train with the few samples assigned to that superclass.",
      "suffix": "\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          15
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 34,
      "text": "The introduction of this prior knowledge in the form of a supergraph in $C^{GAT}$ during training also helps generalize better to the novel samples that are presented to our model in the fine-tuning stage.",
      "suffix": "\n\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          15
        ]
      ],
      "details": {}
    },
    {
      "review_id": "ryxUOGLHcH",
      "rebuttal_id": "S1x3ALTQjS",
      "sentence_index": 35,
      "text": "Additionally, we would also like to draw attention to the supergraph usage summary provided by reviewer 2 in paragraph 3 of their comments.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          15
        ]
      ],
      "details": {}
    }
  ]
}