{
  "metadata": {
    "forum_id": "r1e74a4twH",
    "review_id": "HkgXCTz-cB",
    "rebuttal_id": "rJxpWvPUor",
    "title": "CZ-GEM:  A  FRAMEWORK  FOR DISENTANGLED REPRESENTATION LEARNING",
    "reviewer": "AnonReviewer1",
    "rating": 1,
    "conference": "ICLR2020",
    "permalink": "https://openreview.net/forum?id=r1e74a4twH&noteId=rJxpWvPUor",
    "annotator": "anno3"
  },
  "review_sentences": [
    {
      "review_id": "HkgXCTz-cB",
      "sentence_index": 0,
      "text": "Summary:",
      "suffix": "\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_heading",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "HkgXCTz-cB",
      "sentence_index": 1,
      "text": "The paper proposes the use of a hierarchical model for a generative modeling task.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "HkgXCTz-cB",
      "sentence_index": 2,
      "text": "They propose a framework of introducing an intermediate latent variable to enforce the independence of the control and noise variable.",
      "suffix": "\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "HkgXCTz-cB",
      "sentence_index": 3,
      "text": "The paper report extensive experimental results to validate the proposed hierarchical model.",
      "suffix": "\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "HkgXCTz-cB",
      "sentence_index": 4,
      "text": "The authors also provide the anonymized code to observe the exact implementation in TensorFlow to visualize the latent variable traversals.",
      "suffix": "\n\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "HkgXCTz-cB",
      "sentence_index": 5,
      "text": "Comments:",
      "suffix": "\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_heading",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "HkgXCTz-cB",
      "sentence_index": 6,
      "text": "The paper proposes the use of a hierarchical model for a generative modeling task by introducing an intermediate latent variable to enforce the independence of the control and noise variable.",
      "suffix": "\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "HkgXCTz-cB",
      "sentence_index": 7,
      "text": "The paper report extensive experimental results to validate the proposed hierarchical model.",
      "suffix": "\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "HkgXCTz-cB",
      "sentence_index": 8,
      "text": "This type of framework of crude to fine hierarchical generative model has already been successfully introduced by StackGAN and it's recent variants.",
      "suffix": "\n",
      "review_action": "arg_fact",
      "fine_review_action": "none",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "HkgXCTz-cB",
      "sentence_index": 9,
      "text": "On the unsupervised disentangled feature learning, the framework provides incremental advancement by using beta-VAE in conjunction with GAN to use the best of both the worlds.",
      "suffix": "\n",
      "review_action": "arg_fact",
      "fine_review_action": "none",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "HkgXCTz-cB",
      "sentence_index": 10,
      "text": "Even though the proposed approach is similar to StackGAN, the experiments and the results mentioned in the paper are noteworthy.",
      "suffix": "\n\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_motivation-impact",
      "polarity": "pol_positive"
    },
    {
      "review_id": "HkgXCTz-cB",
      "sentence_index": 11,
      "text": "Questions to Authors:",
      "suffix": "\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_heading",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "HkgXCTz-cB",
      "sentence_index": 12,
      "text": "There are 2 main claims of novelty made in the paper.",
      "suffix": "\n",
      "review_action": "arg_fact",
      "fine_review_action": "none",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "HkgXCTz-cB",
      "sentence_index": 13,
      "text": "1. Architectural Biases:",
      "suffix": "\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_heading",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "HkgXCTz-cB",
      "sentence_index": 14,
      "text": "How is the approach different in comparison to the StackGAN and it's variable which also use multiple levels of crude to fine image generation?",
      "suffix": "\n",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_explanation",
      "aspect": "asp_substance",
      "polarity": "none"
    },
    {
      "review_id": "HkgXCTz-cB",
      "sentence_index": 15,
      "text": "2. Unsupervised control variable discovery:",
      "suffix": "\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_heading",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "HkgXCTz-cB",
      "sentence_index": 16,
      "text": "This part is just the use of existing disentanglement VAEs to extract the control variables.",
      "suffix": "",
      "review_action": "arg_fact",
      "fine_review_action": "none",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "HkgXCTz-cB",
      "sentence_index": 17,
      "text": "So how does the paper try to make contributions to improve the disentangled features with the proposed method?",
      "suffix": "\n",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_explanation",
      "aspect": "asp_substance",
      "polarity": "none"
    },
    {
      "review_id": "HkgXCTz-cB",
      "sentence_index": 18,
      "text": "Apart from combining these to existing ideas, what can be considered as an added novelty to improve the quality of the disentangled features?",
      "suffix": "\n\n",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_clarification",
      "aspect": "asp_substance",
      "polarity": "pol_negative"
    },
    {
      "review_id": "HkgXCTz-cB",
      "sentence_index": 19,
      "text": "In summary, I find there is no novelty involved apart from combining the already existing SOTA model in disentangled feature learning (beta-VAE) and image generation (StackGAN).",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_originality",
      "polarity": "pol_negative"
    }
  ],
  "rebuttal_sentences": [
    {
      "review_id": "HkgXCTz-cB",
      "rebuttal_id": "rJxpWvPUor",
      "sentence_index": 0,
      "text": "1. vs StackGAN: Our method introduces a learning method that allows for training generative models with disentangled latents without compromising on the generative quality (unlike all SOTA mutual information (MI) based disentangled representation learning methods such as beta-VAE or info-GAN).",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          8,
          9,
          10
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HkgXCTz-cB",
      "rebuttal_id": "rJxpWvPUor",
      "sentence_index": 1,
      "text": "More fundamentally, our method provably avoids issues posed by d-separation that theoretically prohibit disentanglement in the current SOTA methods.",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_mitigate-criticism",
      "alignment": [
        "context_sentences",
        [
          8,
          9,
          10
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HkgXCTz-cB",
      "rebuttal_id": "rJxpWvPUor",
      "sentence_index": 2,
      "text": "This is completely different from the motivation of StackGAN which aims to use iterative refinement (like several other generative models) to learn a generative map from image captions and does not care about disentanglement.",
      "suffix": "\n\n",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_mitigate-criticism",
      "alignment": [
        "context_sentences",
        [
          8,
          9,
          10
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HkgXCTz-cB",
      "rebuttal_id": "rJxpWvPUor",
      "sentence_index": 3,
      "text": "2. Unsupervised control variable discovery: Beta VAE (or other MI-based methods) disentangle the latents by compromising the generative quality.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          15,
          16,
          17
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HkgXCTz-cB",
      "rebuttal_id": "rJxpWvPUor",
      "sentence_index": 4,
      "text": "The more the model forces disentanglement, by giving more weight to a certain information-theoretic regularizer, the worse the generated images become.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          15,
          16,
          17
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HkgXCTz-cB",
      "rebuttal_id": "rJxpWvPUor",
      "sentence_index": 5,
      "text": "By decoupling the training into two steps, our method allows for far better disentanglement than beta-VAE like methods without compromising the generative quality.",
      "suffix": "\n\n",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-criticism",
      "alignment": [
        "context_sentences",
        [
          15,
          16,
          17
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HkgXCTz-cB",
      "rebuttal_id": "rJxpWvPUor",
      "sentence_index": 6,
      "text": "Novelty: Our method aims to solve the fundamental issue of d-separation in disentangled representation learning.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          18,
          19
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HkgXCTz-cB",
      "rebuttal_id": "rJxpWvPUor",
      "sentence_index": 7,
      "text": "It allows for a theoretically consistent way of obtaining factorisation in the posterior without any information-theoretic penalties.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          18,
          19
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HkgXCTz-cB",
      "rebuttal_id": "rJxpWvPUor",
      "sentence_index": 8,
      "text": "It is true, that one can describe the method as a (non-trivial) combination of beta-vae + GAN.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_concede-criticism",
      "alignment": [
        "context_sentences",
        [
          18,
          19
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HkgXCTz-cB",
      "rebuttal_id": "rJxpWvPUor",
      "sentence_index": 9,
      "text": "But this description mischaracterizes the fundamental problem that we have identified and proposed a solution for.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          18,
          19
        ]
      ],
      "details": {}
    },
    {
      "review_id": "HkgXCTz-cB",
      "rebuttal_id": "rJxpWvPUor",
      "sentence_index": 10,
      "text": "(Please refer to comments for Reviewer 2 under \u2018Scientific Contribution\u2019)",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_social",
      "alignment": [
        "context_global",
        null
      ],
      "details": {}
    }
  ]
}