{
  "metadata": {
    "forum_id": "HygYmJBKwH",
    "review_id": "rkeZqrk0tr",
    "rebuttal_id": "SylS_OoXsH",
    "title": "YaoGAN: Learning Worst-case Competitive Algorithms from Self-generated Inputs",
    "reviewer": "AnonReviewer3",
    "rating": 6,
    "conference": "ICLR2020",
    "permalink": "https://openreview.net/forum?id=HygYmJBKwH&noteId=SylS_OoXsH",
    "annotator": "anno2"
  },
  "review_sentences": [
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 0,
      "text": "Update to the Review after the rebuttal from the Authors:",
      "suffix": "\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_heading",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 1,
      "text": "After carefully reviewing the responses by the authors especially on my concerns about the significance of solving an instance of a given problem and the improvement in the exposition of the ideas I would like to amend my earlier decision and recommend to accept.",
      "suffix": "",
      "review_action": "arg_social",
      "fine_review_action": "none",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 2,
      "text": "For completeness below is the original review.",
      "suffix": "\n\n\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_quote",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 3,
      "text": "This paper introduces a framework to learn to generate solutions to online combinatorial optimization problems with worst case guarantees.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 4,
      "text": "The framework as the authors claim eliminates the need for manual hard to solve instance/data creation, which is necessary to teach the model to provide the aforementioned worst case guarantees.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 5,
      "text": "Therefore the main contribution of the paper can be said that this framework shows that it is possible to train a machine learning model, which can learn an algorithm to solve hard online combinatorial optimization problems and this training can be done without knowing much about the actual optimization problem domain.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 6,
      "text": "The only input required is the way to calculate the objective function of the actual problem.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 7,
      "text": "This contribution is demonstrated on two classes of problems: Ski-Rental and Fractional AdWords.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 8,
      "text": "The framework requires two neural networks one for solution generation agent and one for problem instance generation.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 9,
      "text": "These two networks are trained jointly from scratch and the underlying algorithm for the training is provided.",
      "suffix": "\n\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 10,
      "text": "Although a generic framework that learns to solve online combinatorial optimization problems without domain knowledge is by itself a very motivational goal neither the paper successfully demonstrates that the framework the authors propose achieves this goal nor it explains well enough why one would take the machine learning approach to find good algorithms to such problems.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_substance",
      "polarity": "pol_negative"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 11,
      "text": "Is it because the ML solution would be faster to compute with big instances? Is it because with the proposed approach one can curate sophisticated heuristic solutions when provable optimality is out of reach?",
      "suffix": "\n\n",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_explanation",
      "aspect": "asp_clarity",
      "polarity": "pol_negative"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 12,
      "text": "This paper should be rejected because proposed method demonstrates that an instance of one class of problems, Fractional Adwords, can be learned to solve without domain expertise, however fails to prove that the approach would be beneficial for any other instances of the same problem.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_substance",
      "polarity": "pol_negative"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 13,
      "text": "Although they show that the Ski Rental problem can also be learned to solve though it is trivial and does not even use the framework the authors propose in its full extent, ie. problem instances are not generated by use of a machine learning model, which is one of main claims the authors are making.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_soundness-correctness",
      "polarity": "pol_negative"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 14,
      "text": "Therefore I do not find being able to solve this problem as a supporting evidence for the contributions claimed.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_soundness-correctness",
      "polarity": "pol_negative"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 15,
      "text": "In particular there is not any theoretical not experimental evidence that the approach would scale to any instances where a pure optimization approach would be slow to provide any meaningful solutions.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_substance",
      "polarity": "pol_negative"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 16,
      "text": "I find this important because for combinatorial optimization usually scale matters a lot.",
      "suffix": "",
      "review_action": "arg_fact",
      "fine_review_action": "none",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 17,
      "text": "While a small instance of a problem can be solve by a general purpose solver quickly a small increase in the problem size can turn out to be intractable.",
      "suffix": "",
      "review_action": "arg_fact",
      "fine_review_action": "none",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 18,
      "text": "When proposing a machine learning approach to such problems I would expect the model to scale better than pure optimization approach so that there would be demonstrable benefit.",
      "suffix": "",
      "review_action": "arg_fact",
      "fine_review_action": "none",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 19,
      "text": "Although the paper proposes an interesting framework I would argue that it is a \u201cgreen apple\u201d in the sense that authors need to motivate the approach better and expand the contribution beyond solving a particular instance.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_motivation-impact",
      "polarity": "pol_negative"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 20,
      "text": "Authors acknowledge the fact that their experimental setup is rather limited in Appendix C.1, which I agree with and they also claim that there is a representation for a uniform algorithm for any number of advertisers for the AdWords problem, however they leave this as a future work, which I find unfortunate. I would recommend taking this direction rigorously and expand the contribution, which would prove to be a very sound contribution.",
      "suffix": "\n\n",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_experiment",
      "aspect": "asp_substance",
      "polarity": "pol_negative"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 21,
      "text": "In order to clarify the exposition the following are some questions:",
      "suffix": "\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_heading",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 22,
      "text": "1. Authors call the approach YaoGAN due to its structural similarity to GANs.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_quote",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 23,
      "text": "I understand the fact that they are training two neural networks in an alternating scheme, which is similar to the GAN training.",
      "suffix": "",
      "review_action": "arg_fact",
      "fine_review_action": "none",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 24,
      "text": "How can one evaluate the solutions generated by this framework similar to how GAN generators are evaluated?",
      "suffix": "",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_clarification",
      "aspect": "asp_replicability",
      "polarity": "none"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 25,
      "text": "Can one walk the latent distribution of the algorithm agent and draw insights, which might lead into tailoring some algorithms that would be appropriate for some input distribution although in general inferior in terms of worst case guarantees?",
      "suffix": "\n\n",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_explanation",
      "aspect": "asp_substance",
      "polarity": "pol_negative"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 26,
      "text": "2. The main technical contribution claim needs to be elaborated.",
      "suffix": "",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_experiment",
      "aspect": "asp_substance",
      "polarity": "pol_negative"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 27,
      "text": "I understand how the game theoretic framework is established but how does this manifest itself in the algorithm described in Section 3.1 needs more explanation.",
      "suffix": "\n\n",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_experiment",
      "aspect": "asp_substance",
      "polarity": "pol_negative"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 28,
      "text": "3. Authors claim there are two shortcomings of the previous method proposed in Kong et. al 2018.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_quote",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 29,
      "text": "They need to elaborate how their method overcomes these issues better.",
      "suffix": "\n\n",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_experiment",
      "aspect": "asp_substance",
      "polarity": "pol_negative"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 30,
      "text": "4. Authors state that fractional relaxation of combinatorial mainly integer optimization problems, which is accurate.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_quote",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 31,
      "text": "Yet their approach is only able to solve the fractional version of the AdWords problem.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_substance",
      "polarity": "pol_negative"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 32,
      "text": "In addition I agree with the fact that although continuous relaxations to integer optimization problems might provide insightful directions they usually employed to to prove bounds on the heuristic approaches.",
      "suffix": "",
      "review_action": "arg_fact",
      "fine_review_action": "none",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 33,
      "text": "Yet the authors stop at only solving this version with a machine learning approach, which does not hit the bar for me.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_substance",
      "polarity": "pol_negative"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 34,
      "text": "I would have expected the authors to at least elaborate on why the current framework is not suitable for the non-relaxed problem.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_substance",
      "polarity": "pol_negative"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 35,
      "text": "What are the shortcomings?",
      "suffix": "\n\n",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_experiment",
      "aspect": "asp_substance",
      "polarity": "none"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 36,
      "text": "5.In Appendix A authors talk about no-regret dynamics, which are relevant.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_quote",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 37,
      "text": "However, they state they loosely follow this approach. What does that entail? What kind of theoretical guarantees are given up due to not following this, a better exposition on this topic would help to support the claims.",
      "suffix": "\n\n",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_explanation",
      "aspect": "asp_substance",
      "polarity": "pol_negative"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 38,
      "text": "6. In appendix C.2 authors provide additional plots for the Fractional AdWords problem.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_quote",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 39,
      "text": "However, they retain from providing any intuition about them.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_substance",
      "polarity": "pol_negative"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 40,
      "text": "In particular what is the conclusion to be drawn from Figure 5.",
      "suffix": "",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_explanation",
      "aspect": "asp_clarity",
      "polarity": "none"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 41,
      "text": "This needs more elaboration. Is this way of training results expected? What is the lesson learned?",
      "suffix": "\n\n",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_explanation",
      "aspect": "asp_clarity",
      "polarity": "pol_negative"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 42,
      "text": "7.In Figure 8 they provide example data from experience array.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_quote",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 43,
      "text": "What are the significance of these examples? How they help us understand the problem instance generation was actually able to find interesting instances? What kind of dynamics are under covered?",
      "suffix": "",
      "review_action": "arg_request",
      "fine_review_action": "arg-request_explanation",
      "aspect": "asp_substance",
      "polarity": "pol_negative"
    },
    {
      "review_id": "rkeZqrk0tr",
      "sentence_index": 44,
      "text": "These are not directly revealed by only looking at the pictures one needs more explanation to support the claims.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_substance",
      "polarity": "pol_negative"
    }
  ],
  "rebuttal_sentences": [
    {
      "review_id": "rkeZqrk0tr",
      "rebuttal_id": "SylS_OoXsH",
      "sentence_index": 0,
      "text": "Thank you for your review.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_social",
      "alignment": [
        "context_global",
        null
      ],
      "details": {}
    },
    {
      "review_id": "rkeZqrk0tr",
      "rebuttal_id": "SylS_OoXsH",
      "sentence_index": 1,
      "text": "Please also see our high-level clarification above which we believe can help in better interpretation of our contribution.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_social",
      "alignment": [
        "context_global",
        null
      ],
      "details": {}
    },
    {
      "review_id": "rkeZqrk0tr",
      "rebuttal_id": "SylS_OoXsH",
      "sentence_index": 2,
      "text": "Some specific responses below:",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_global",
        null
      ],
      "details": {}
    },
    {
      "review_id": "rkeZqrk0tr",
      "rebuttal_id": "SylS_OoXsH",
      "sentence_index": 3,
      "text": "-- \u201cproposed method demonstrates that an instance of one class of problems, Fractional Adwords, can be learned to solve without domain expertise, however fails to prove that the approach would be beneficial for any other instances of the same problem.\u201d",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          12
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rkeZqrk0tr",
      "rebuttal_id": "SylS_OoXsH",
      "sentence_index": 4,
      "text": "Please refer to our overall comments on this question (and also a few more details in reply to Reviewer#1\u2019s similar question).",
      "suffix": "\n\n\n\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          12
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rkeZqrk0tr",
      "rebuttal_id": "SylS_OoXsH",
      "sentence_index": 5,
      "text": "-- Comment on scale / speed for large instances of combinatorial optimization:",
      "suffix": "\n\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          12,
          13,
          14,
          15,
          16,
          17,
          18,
          19,
          20
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rkeZqrk0tr",
      "rebuttal_id": "SylS_OoXsH",
      "sentence_index": 6,
      "text": "The point of this work is only to see if ML can find optimal algorithms, and not about doing it faster than the known theoretical algorithms.",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_refute-question",
      "alignment": [
        "context_sentences",
        [
          12,
          13,
          14,
          15,
          16,
          17,
          18,
          19,
          20
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rkeZqrk0tr",
      "rebuttal_id": "SylS_OoXsH",
      "sentence_index": 7,
      "text": "Note that this is not similar to the case of solving an offline combinatorial problem via integer programming or other solvers, since our problems are online, i.e., the instance is not known beforehand, so there is no comparison to such \u201cgeneral-purpose\u201d solvers.",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_refute-question",
      "alignment": [
        "context_sentences",
        [
          12,
          13,
          14,
          15,
          16,
          17,
          18,
          19,
          20
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rkeZqrk0tr",
      "rebuttal_id": "SylS_OoXsH",
      "sentence_index": 8,
      "text": "Thus we don't compare to the running time of offline solvers, but to the worst-case competitive ratio of the optimal online algorithms.",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_refute-question",
      "alignment": [
        "context_sentences",
        [
          12,
          13,
          14,
          15,
          16,
          17,
          18,
          19,
          20
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rkeZqrk0tr",
      "rebuttal_id": "SylS_OoXsH",
      "sentence_index": 9,
      "text": "As mentioned in the comment, this approach may eventually lead to finding optimal or near-optimal algorithms for a problem (not an instance of a problem) for which no algorithm is known -- but this is outside the scope of this work future work.",
      "suffix": "\n\n",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-request",
      "alignment": [
        "context_sentences",
        [
          12,
          13,
          14,
          15,
          16,
          17,
          18,
          19,
          20
        ]
      ],
      "details": {
        "request_out_of_scope": true
      }
    },
    {
      "review_id": "rkeZqrk0tr",
      "rebuttal_id": "SylS_OoXsH",
      "sentence_index": 10,
      "text": "Again drawing the analogy of playing Go, the objective is mostly on training an agent that can make competitive moves rather than very fast moves, and there is no known \u201cgeneral-purpose\u201d strategy to accomplish this.",
      "suffix": "\n\n",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_refute-question",
      "alignment": [
        "context_sentences",
        [
          12,
          13,
          14,
          15,
          16,
          17,
          18,
          19,
          20
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rkeZqrk0tr",
      "rebuttal_id": "SylS_OoXsH",
      "sentence_index": 11,
      "text": "*Please also see reply to reviewer #2 on a similar question of evaluating against other methods*",
      "suffix": "\n\n\n",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_refute-question",
      "alignment": [
        "context_sentences",
        [
          12,
          13,
          14,
          15,
          16,
          17,
          18,
          19,
          20
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rkeZqrk0tr",
      "rebuttal_id": "SylS_OoXsH",
      "sentence_index": 12,
      "text": "-- \u201cSki Rental problem can also be learned to solve though it is trivial and does not even use the framework the authors propose in its full extent, i.e. problem instances are not generated by use of a machine learning model, which is one of the main claims the authors are making.\u201d",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          13
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rkeZqrk0tr",
      "rebuttal_id": "SylS_OoXsH",
      "sentence_index": 13,
      "text": "Please see our high-level clarification on top.",
      "suffix": "\n\n\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          13
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rkeZqrk0tr",
      "rebuttal_id": "SylS_OoXsH",
      "sentence_index": 14,
      "text": "(4) We agree with the reviewer that in many cases there is a gap between solving the discrete problem and the fractional problem.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_concede-criticism",
      "alignment": [
        "context_sentences",
        [
          30,
          31,
          32,
          33,
          34,
          35
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rkeZqrk0tr",
      "rebuttal_id": "SylS_OoXsH",
      "sentence_index": 15,
      "text": "In general it is an established approach to solve the fractional problem and use additional techniques such as rounding to fill the gap.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_concede-criticism",
      "alignment": [
        "context_sentences",
        [
          30,
          31,
          32,
          33,
          34,
          35
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rkeZqrk0tr",
      "rebuttal_id": "SylS_OoXsH",
      "sentence_index": 16,
      "text": "As to AdWords, although the discrete problem naturally corresponds to the real world scenario, we do not consider fractional AdWords below the bar",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-criticism",
      "alignment": [
        "context_sentences",
        [
          30,
          31,
          32,
          33,
          34,
          35
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rkeZqrk0tr",
      "rebuttal_id": "SylS_OoXsH",
      "sentence_index": 17,
      "text": "compared to discrete AdWords in terms of difficulty",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-criticism",
      "alignment": [
        "context_sentences",
        [
          30,
          31,
          32,
          33,
          34,
          35
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rkeZqrk0tr",
      "rebuttal_id": "SylS_OoXsH",
      "sentence_index": 18,
      "text": ".",
      "suffix": "",
      "rebuttal_stance": "other",
      "rebuttal_action": "rebuttal_none",
      "alignment": [
        "context_error",
        null
      ],
      "details": {}
    },
    {
      "review_id": "rkeZqrk0tr",
      "rebuttal_id": "SylS_OoXsH",
      "sentence_index": 19,
      "text": "The optimal CR bound and the adversarial distribution are the same for both cases, and the optimal algorithms basically have the same structure.",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-criticism",
      "alignment": [
        "context_sentences",
        [
          30,
          31,
          32,
          33,
          34,
          35
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rkeZqrk0tr",
      "rebuttal_id": "SylS_OoXsH",
      "sentence_index": 20,
      "text": "One may arguably say that the optimal algorithm for the fractional problem has richer structure as in the fractional problem the action space is much larger as we can fractionally assign each ad to many advertisers.",
      "suffix": "\n",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-criticism",
      "alignment": [
        "context_sentences",
        [
          30,
          31,
          32,
          33,
          34,
          35
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rkeZqrk0tr",
      "rebuttal_id": "SylS_OoXsH",
      "sentence_index": 21,
      "text": "As to the shortcomings of our techniques and why we pick the fractional problem, note that the GAN framework needs the computation of the discriminator network (i.e. the algorithm agent in our context) to be differentiable in order to update the generator network (i.e. the adversary in our context) during training.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          30,
          31,
          32,
          33,
          34,
          35
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rkeZqrk0tr",
      "rebuttal_id": "SylS_OoXsH",
      "sentence_index": 22,
      "text": "This poses difficulties if we ask the algorithm agent to make discrete decisions via sampling or rounding since it will not be differentiable.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          30,
          31,
          32,
          33,
          34,
          35
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rkeZqrk0tr",
      "rebuttal_id": "SylS_OoXsH",
      "sentence_index": 23,
      "text": "This doesn\u2019t mean that our high-level framework (i.e. training the algorithm and adversary networks simultaneously) is doomed, since we can use other ML techniques (e.g. reinforcement learning) to implement our framework, but in general sampling and rounding will lead to much more work during training, so we pick the GAN structure in this work.",
      "suffix": "\n\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          30,
          31,
          32,
          33,
          34,
          35
        ]
      ],
      "details": {}
    },
    {
      "review_id": "rkeZqrk0tr",
      "rebuttal_id": "SylS_OoXsH",
      "sentence_index": 24,
      "text": "(5) We know from theory that if the algorithm player runs a no-regret dynamic (e.g. MWU) and the adversary player responds with the worst input for the algorithm in each round, then the algorithm player converges to the optimal algorithm, and the uniform distribution over the adversary player\u2019s responses gives the adversarial distribution.",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-request",
      "alignment": [
        "context_sentences",
        [
          36,
          37
        ]
      ],
      "details": {
        "request_out_of_scope": false
      }
    },
    {
      "review_id": "rkeZqrk0tr",
      "rebuttal_id": "SylS_OoXsH",
      "sentence_index": 25,
      "text": "However, we cannot really follow this approach as the space of algorithms is infinite and we cannot run a MWU on this space, and in general it is also hard or impossible to find the absolute worst input in each round.",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-request",
      "alignment": [
        "context_sentences",
        [
          36,
          37
        ]
      ],
      "details": {
        "request_out_of_scope": false
      }
    },
    {
      "review_id": "rkeZqrk0tr",
      "rebuttal_id": "SylS_OoXsH",
      "sentence_index": 26,
      "text": "In the practical framework, the algorithm player uses a neural network, and the adversary network tries its best to come up with a bad (but not necessarily worst) input each round.",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-request",
      "alignment": [
        "context_sentences",
        [
          36,
          37
        ]
      ],
      "details": {
        "request_out_of_scope": false
      }
    },
    {
      "review_id": "rkeZqrk0tr",
      "rebuttal_id": "SylS_OoXsH",
      "sentence_index": 27,
      "text": "Thus we don\u2019t have all the clean theoretical guarantees anymore, but the intuition should still largely hold (as our empirical result suggests).",
      "suffix": "\n\n\n",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-request",
      "alignment": [
        "context_sentences",
        [
          36,
          37
        ]
      ],
      "details": {
        "request_out_of_scope": false
      }
    },
    {
      "review_id": "rkeZqrk0tr",
      "rebuttal_id": "SylS_OoXsH",
      "sentence_index": 28,
      "text": "(6) We updated the appendix to address this. See \u201cTraining convergence\u201d in Appendix D.2",
      "suffix": "\n",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_done",
      "alignment": [
        "context_sentences",
        [
          38,
          39,
          40,
          41
        ]
      ],
      "details": {
        "request_out_of_scope": true
      }
    },
    {
      "review_id": "rkeZqrk0tr",
      "rebuttal_id": "SylS_OoXsH",
      "sentence_index": 29,
      "text": "(7) We updated the appendix to address this. See \u201cAdversarial distribution\u201d in Appendix D.2",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_done",
      "alignment": [
        "context_sentences",
        [
          42,
          43,
          44
        ]
      ],
      "details": {
        "request_out_of_scope": true
      }
    }
  ]
}