{
  "metadata": {
    "forum_id": "H1ERcs09KQ",
    "review_id": "HyedILfT2X",
    "rebuttal_id": "H1xXoJM_a7",
    "title": "Hierarchically Clustered Representation Learning",
    "reviewer": "AnonReviewer2",
    "rating": 5,
    "conference": "ICLR2019",
    "permalink": "https://openreview.net/forum?id=H1ERcs09KQ&noteId=H1xXoJM_a7",
    "annotator": "anno10"
  },
  "review_sentences": [
    {
      "review_id": "HyedILfT2X",
      "sentence_index": 0,
      "text": "The paper proposes using the nested CRP as a clustering model rather than a topic model.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "HyedILfT2X",
      "sentence_index": 1,
      "text": "The clustering is on the latent vector input into a neural network for generating the observation.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "HyedILfT2X",
      "sentence_index": 2,
      "text": "A variational approach is derived.",
      "suffix": "\n\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "HyedILfT2X",
      "sentence_index": 3,
      "text": "The proposed model seems like a straightforward extension of the nCRP with a deep model hanging off the end of it.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_originality",
      "polarity": "pol_negative"
    },
    {
      "review_id": "HyedILfT2X",
      "sentence_index": 4,
      "text": "A significant concern/confusion for me is that this doesn't seem to be a mixed membership model, and so I don't know how meaningful it is to generate a level distribution from a Dirichlet and then draw from that mixture one time.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_soundness-correctness",
      "polarity": "pol_negative"
    },
    {
      "review_id": "HyedILfT2X",
      "sentence_index": 5,
      "text": "From the generative model it seems every data point has its own Dirichlet vector on levels.",
      "suffix": "",
      "review_action": "arg_fact",
      "fine_review_action": "none",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "HyedILfT2X",
      "sentence_index": 6,
      "text": "For topic models this makes sense since that vector is then drawn from multiple times (once per word) from a Discrete, so there's a distribution to actually learn.",
      "suffix": "",
      "review_action": "arg_fact",
      "fine_review_action": "none",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "HyedILfT2X",
      "sentence_index": 7,
      "text": "My understanding is that this isn't being done here.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_soundness-correctness",
      "polarity": "pol_negative"
    }
  ],
  "rebuttal_sentences": [
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 0,
      "text": "[Q] The paper proposes using the nested CRP as a clustering model rather than a topic model.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_none",
        null
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 1,
      "text": "The clustering is on the latent vector input into a neural network for generating the observation.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          1
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 2,
      "text": "A variational approach is derived.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          2
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 3,
      "text": "The proposed model seems like a straightforward extension of the nCRP with a deep model hanging off the end of it.",
      "suffix": "\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          3
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 4,
      "text": "[A1] Dear Reviewer 2, thank you for the thoughtful review.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_social",
      "alignment": [
        "context_none",
        null
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 5,
      "text": "As the reviewer mentioned, we applied the nested CRP prior to the path selection process.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_summary",
      "alignment": [
        "context_none",
        null
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 6,
      "text": "To perform hierarchical density estimation in the embedding space, we additionally designed a hierarchical version of the Gaussian mixture model prior, combined with the nested CRP prior.",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_summary",
      "alignment": [
        "context_none",
        null
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 7,
      "text": "[Q] A significant concern/confusion for me is that this doesn't seem to be a mixed membership model, and so I don't know how meaningful it is to generate a level distribution from a Dirichlet and then draw from that mixture one time.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          4
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 8,
      "text": "From the generative model, it seems every data point has its own Dirichlet vector on levels.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          5
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 9,
      "text": "For topic models, this makes sense since that vector is then drawn from multiple times (once per word) from a Discrete, so there's a distribution to actually learn.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          6
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 10,
      "text": "My understanding is that this isn't being done here.",
      "suffix": "\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          7
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 11,
      "text": "[A1] Thank you for the very constructive comments.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_social",
      "alignment": [
        "context_sentences",
        [
          4,
          5,
          6,
          7
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 12,
      "text": "In fact, we intended to model the level proportion as shown in the third part of our generative process on page 4.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          4,
          5,
          6,
          7
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 13,
      "text": "Often, for grouped data, the level proportion (or topic proportion) is modeled as a group-specific variable.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          4,
          5,
          6,
          7
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 14,
      "text": "Under our non-grouped data setting, for example, the following two approaches are possible: 1) as the reviewer mentioned, globally define a level proportion once and draw multiple level samples for each data point, or 2) as in our model, locally define a data-specific level proportion and then sample the level (which is actually an auxiliary variable for specifying the Gaussian distribution).",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          4,
          5,
          6,
          7
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 15,
      "text": "We chose the latter approach in order to model a more flexible prior.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          4,
          5,
          6,
          7
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 16,
      "text": "The Gaussian mixture distributions exist separately for each level, and our generative process assumes that the mixing coefficient over the levels differs for each data point.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          4,
          5,
          6,
          7
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 17,
      "text": "Please consider that each data instance we handle is high-dimensional: a document or an image rather than a word or a pixel.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          4,
          5,
          6,
          7
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 18,
      "text": "The hierarchical Gaussian mixture distributions are learned for different levels, and assuming a common level proportion for all data here would forcibly limit the expressive power of the model.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          4,
          5,
          6,
          7
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 19,
      "text": "Also, to prevent overfitting, we placed a common prior, Dirichlet(\\alpha), on the data-specific level proportion, which can be regarded as a regularization term.",
      "suffix": "\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          4,
          5,
          6,
          7
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 20,
      "text": "[A2] Also, I would like to restate the reviewer\u2019s comment in terms of formulae.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          4,
          5,
          6,
          7
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 21,
      "text": "The prior we suggested is this: \\sum_{\\zeta, l} nCRP(\\zeta_n) * \\eta_{nl} * Normal(-) (\u2192 please refer to Figure 3(a)).",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          4,
          5,
          6,
          7
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 22,
      "text": "Moreover, the reviewer\u2019s point concerns \\eta_{nl}, i.e., \u2018why is \\eta designed as \\eta_{nl}; why is \\eta a data-specific variable?",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          4,
          5,
          6,
          7
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 23,
      "text": "\u2019",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          4,
          5,
          6,
          7
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 24,
      "text": ".",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          4,
          5,
          6,
          7
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 25,
      "text": "Similar works have been published previously [1-3].",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          4,
          5,
          6,
          7
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 26,
      "text": "Like ours, they designed data-specific mixing coefficients for Gaussian mixture models to improve flexibility.",
      "suffix": "\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          4,
          5,
          6,
          7
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 27,
      "text": "[1] Ban, Zhihua, Jianguo Liu, and Li Cao. \"Superpixel Segmentation Using Gaussian Mixture Model.\" IEEE Transactions on Image Processing 27.8 (2018): 4105-4117.",
      "suffix": "\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_other",
      "alignment": [
        "context_in-rebuttal",
        null
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 28,
      "text": "[2] Zhang, Hui, et al. \"Automatic Visual Detection System of Railway Surface Defects With Curvature Filter and Improved Gaussian Mixture Model.\" IEEE Transactions on Instrumentation and Measurement 67.7 (2018): 1593-1608.",
      "suffix": "\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_other",
      "alignment": [
        "context_in-rebuttal",
        null
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 29,
      "text": "[3] Ji, Zexuan, et al. \"A spatially constrained generative asymmetric Gaussian mixture model for image segmentation.\" Journal of Visual Communication and Image Representation 40 (2016): 611-626.",
      "suffix": "\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_other",
      "alignment": [
        "context_in-rebuttal",
        null
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 30,
      "text": "Under the Gaussian mixture models newly proposed in the above papers, the cluster assignment of each data point is sampled once from the data-specific mixing coefficient, which poses no theoretical problem as a fully Bayesian formulation.",
      "suffix": "\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          4,
          5,
          6,
          7
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 31,
      "text": "[A3] We were very impressed with the mathematical detail of the reviewer\u2019s comment and thank the reviewer again. If the reviewer agrees with our argument, we will reflect the argument in our paper.",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_social",
      "alignment": [
        "context_sentences",
        [
          4,
          5,
          6,
          7
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HyedILfT2X",
      "rebuttal_id": "H1xXoJM_a7",
      "sentence_index": 32,
      "text": "Best regards,",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_social",
      "alignment": [
        "context_global",
        null
      ],
      "details": {}
    }
  ]
}