{
  "metadata": {
    "forum_id": "HygYmJBKwH",
    "review_id": "Skgxo_i6_S",
    "rebuttal_id": "H1xP2ojXjB",
    "title": "YaoGAN: Learning Worst-case Competitive Algorithms from Self-generated Inputs",
    "reviewer": "AnonReviewer1",
    "rating": 3,
    "conference": "ICLR2020",
    "permalink": "https://openreview.net/forum?id=HygYmJBKwH&noteId=H1xP2ojXjB",
    "annotator": "anno0"
  },
  "review_sentences": [
    {
      "review_id": "Skgxo_i6_S",
      "sentence_index": 0,
      "text": "This papers tackles the following question. Is it possible to learn the \"most\" complex instance of a class of (combinatorial) problem while finding (or recovering) algorithms with strong minimax rate.",
      "suffix": "\n\n",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "Skgxo_i6_S",
      "sentence_index": 1,
      "text": "This is very interesting and clearly a nice line of work (in theory though).",
      "suffix": "\n\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_motivation-impact",
      "polarity": "pol_positive"
    },
    {
      "review_id": "Skgxo_i6_S",
      "sentence_index": 2,
      "text": "The techniques used rely on GANs since it can be shown that finding the best (random) algorithm and the worst (deterministic) instance is equivalent to finding the worst random instance against the best deterministic algorithm.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "Skgxo_i6_S",
      "sentence_index": 3,
      "text": "This is actually a direct consequence of any minmax theorem in game theory; the authors decided to credit that result to Yao (I tend to *strongly* disagree with that point as, even if he stated this fact in CS, this result was quite standard several decades before him - anyway.).",
      "suffix": "\n\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_soundness-correctness",
      "polarity": "pol_negative"
    },
    {
      "review_id": "Skgxo_i6_S",
      "sentence_index": 4,
      "text": "Then this idea is evaluated in two examples.",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "Skgxo_i6_S",
      "sentence_index": 5,
      "text": "A toy problem (the ski rental) and a more or less concrete ones (adwords pb of Mehta).",
      "suffix": "",
      "review_action": "arg_structuring",
      "fine_review_action": "arg-structuring_summary",
      "aspect": "none",
      "polarity": "none"
    },
    {
      "review_id": "Skgxo_i6_S",
      "sentence_index": 6,
      "text": "This is the major disappointment in the paper.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_substance",
      "polarity": "pol_negative"
    },
    {
      "review_id": "Skgxo_i6_S",
      "sentence_index": 7,
      "text": "The basic idea is very interesting, but I would have expect more interesting use cases as teased by the first sentence of the abstract \"find algorithms with strong worst-case guarantees for online combinatorial optimization problems\".",
      "suffix": "\n\n",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_substance",
      "polarity": "pol_negative"
    },
    {
      "review_id": "Skgxo_i6_S",
      "sentence_index": 8,
      "text": "So at the end, I am a bit puzzled. I really like the idea, but I have the feeling that this technique should have been developed for more complicated setting. Or maybe it is actually not working on more difficult combinatorial problem (and this is hidden in the paper).",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_substance",
      "polarity": "pol_negative"
    },
    {
      "review_id": "Skgxo_i6_S",
      "sentence_index": 9,
      "text": "I believe that this paper is thus not in its final form and could be largely improved.",
      "suffix": "",
      "review_action": "arg_evaluative",
      "fine_review_action": "none",
      "aspect": "asp_substance",
      "polarity": "pol_negative"
    }
  ],
  "rebuttal_sentences": [
    {
      "review_id": "Skgxo_i6_S",
      "rebuttal_id": "H1xP2ojXjB",
      "sentence_index": 0,
      "text": "Thank you for your review.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_social",
      "alignment": [
        "context_global",
        null
      ],
      "details": {}
    },
    {
      "review_id": "Skgxo_i6_S",
      "rebuttal_id": "H1xP2ojXjB",
      "sentence_index": 1,
      "text": "Please also see our high-level clarification above which we believe can help in better interpretation of our contribution.",
      "suffix": "",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_in-rebuttal",
        null
      ],
      "details": {}
    },
    {
      "review_id": "Skgxo_i6_S",
      "rebuttal_id": "H1xP2ojXjB",
      "sentence_index": 2,
      "text": "Some specific responses below:",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_in-rebuttal",
        null
      ],
      "details": {}
    },
    {
      "review_id": "Skgxo_i6_S",
      "rebuttal_id": "H1xP2ojXjB",
      "sentence_index": 3,
      "text": "Reviewer #1 is absolutely right -- we don\u2019t know yet how to scale this to more difficult combinatorial problems.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_concede-criticism",
      "alignment": [
        "context_sentences",
        [
          5,
          6,
          7
        ]
      ],
      "details": {}
    },
    {
      "review_id": "Skgxo_i6_S",
      "rebuttal_id": "H1xP2ojXjB",
      "sentence_index": 4,
      "text": "But let\u2019s clarify that statement a bit more:",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_sentences",
        [
          5,
          6,
          7,
          8
        ]
      ],
      "details": {}
    },
    {
      "review_id": "Skgxo_i6_S",
      "rebuttal_id": "H1xP2ojXjB",
      "sentence_index": 5,
      "text": "The ski-rental problem is often the first problem studied when teaching online algorithms, but it is certainly far from a \u201ctoy problem\u201d when we wish to learn an algorithm from scratch.",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-criticism",
      "alignment": [
        "context_sentences",
        [
          5,
          6,
          7,
          8
        ]
      ],
      "details": {}
    },
    {
      "review_id": "Skgxo_i6_S",
      "rebuttal_id": "H1xP2ojXjB",
      "sentence_index": 6,
      "text": "We apologize if we painted an incorrect picture by calling it a \u201csimple example\u201d and a \u201cstaple introductory problem\u201d.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_concede-criticism",
      "alignment": [
        "context_sentences",
        [
          5,
          6,
          7,
          8
        ]
      ],
      "details": {}
    },
    {
      "review_id": "Skgxo_i6_S",
      "rebuttal_id": "H1xP2ojXjB",
      "sentence_index": 7,
      "text": "It is easy to describe in that it has a single hidden parameter (the length of the ski season) and a single revealed parameter (the cost of buying).",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-criticism",
      "alignment": [
        "context_sentences",
        [
          5,
          6,
          7,
          8
        ]
      ],
      "details": {}
    },
    {
      "review_id": "Skgxo_i6_S",
      "rebuttal_id": "H1xP2ojXjB",
      "sentence_index": 8,
      "text": "It is a staple introductory problem because it is elegant and illuminates the essential difficulties in designing online algorithms: there is a nearly-trivial factor-2 competitive algorithm (rent until you\u2019ve spent $B, then buy, so even if the ski season ends the next day, you\u2019ve not spent more than twice the least possible amount), but the 1-1/e competitive ratio algorithm is quite creative and subtle, and serves as an introduction to the richness of the field of online algorithms.",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-criticism",
      "alignment": [
        "context_sentences",
        [
          5,
          6,
          7,
          8
        ]
      ],
      "details": {}
    },
    {
      "review_id": "Skgxo_i6_S",
      "rebuttal_id": "H1xP2ojXjB",
      "sentence_index": 9,
      "text": "In fact, the Karlin et al. (1986) paper also introduced the notion of competitive analysis of online algorithms, and  is probably the most-cited paper in this field.",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-criticism",
      "alignment": [
        "context_sentences",
        [
          5,
          6,
          7,
          8
        ]
      ],
      "details": {}
    },
    {
      "review_id": "Skgxo_i6_S",
      "rebuttal_id": "H1xP2ojXjB",
      "sentence_index": 10,
      "text": "In some sense, this poses us the ideal challenge: can ML approaches discover creative and subtle \u201csolutions\u201d (in our case, an algorithm)?",
      "suffix": "\n\n",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-criticism",
      "alignment": [
        "context_sentences",
        [
          5,
          6,
          7,
          8
        ]
      ],
      "details": {}
    },
    {
      "review_id": "Skgxo_i6_S",
      "rebuttal_id": "H1xP2ojXjB",
      "sentence_index": 11,
      "text": "On a more technical note, please note that our \u201cmachinery\u201d of solving the two-player game is needed to discover an algorithm for the ski rental problem: if we don\u2019t allow the players to alternate and reach an equilibrium, for any fixed distribution on the ski rental instances (B, K), there is a deterministic algorithm that is optimal (among all online algorithms), and the worst-case performance of that (or any) deterministic algorithm is *provably* limited by a factor of 2 (i.e., there exists some distribution on instances where it will fail badly).",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          5,
          6,
          7,
          8
        ]
      ],
      "details": {}
    },
    {
      "review_id": "Skgxo_i6_S",
      "rebuttal_id": "H1xP2ojXjB",
      "sentence_index": 12,
      "text": "Also refer to our discussion on this in the high-level clarification at the top.",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_in-rebuttal",
        null
      ],
      "details": {}
    },
    {
      "review_id": "Skgxo_i6_S",
      "rebuttal_id": "H1xP2ojXjB",
      "sentence_index": 13,
      "text": "The AdWords problem considered is actually a difficult combinatorial problem, and is an archetypal online combinatorial optimization problem that captures the class of problems solvable by one of the most powerful techniques in this area -- primal-dual algorithms, which have led to the state-of-the-art approximation algorithms for numerous hard online (and offline) optimization problems.",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-criticism",
      "alignment": [
        "context_sentences",
        [
          5,
          6,
          7,
          8
        ]
      ],
      "details": {}
    },
    {
      "review_id": "Skgxo_i6_S",
      "rebuttal_id": "H1xP2ojXjB",
      "sentence_index": 14,
      "text": "In particular, it generalizes bipartite matching, historically one of the most significant combinatorial optimization problems (led to the development of the classic Hungarian method, see https://en.wikipedia.org/wiki/Hungarian_algorithm).",
      "suffix": "\n\n",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_reject-criticism",
      "alignment": [
        "context_sentences",
        [
          5,
          6,
          7,
          8
        ]
      ],
      "details": {}
    },
    {
      "review_id": "Skgxo_i6_S",
      "rebuttal_id": "H1xP2ojXjB",
      "sentence_index": 15,
      "text": "We did water down our ambition in a few ways:",
      "suffix": "\n\n",
      "rebuttal_stance": "nonarg",
      "rebuttal_action": "rebuttal_structuring",
      "alignment": [
        "context_in-rebuttal",
        null
      ],
      "details": {}
    },
    {
      "review_id": "Skgxo_i6_S",
      "rebuttal_id": "H1xP2ojXjB",
      "sentence_index": 16,
      "text": "Instead of producing an algorithm that works for inputs of all sizes, we focus on the case of 9x3 (three advertisers, nine slots) -- a fixed finite size!",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          5,
          6,
          7,
          8
        ]
      ],
      "details": {}
    },
    {
      "review_id": "Skgxo_i6_S",
      "rebuttal_id": "H1xP2ojXjB",
      "sentence_index": 17,
      "text": "This choice was arrived at based on the following criteria: what can we learn in a few hours of computation that\u2019s still *well beyond* what can be achieved through exhaustive search (for an algorithm).",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_answer",
      "alignment": [
        "context_sentences",
        [
          5,
          6,
          7,
          8
        ]
      ],
      "details": {}
    },
    {
      "review_id": "Skgxo_i6_S",
      "rebuttal_id": "H1xP2ojXjB",
      "sentence_index": 18,
      "text": "Think of our task roughly as learning to play a very hard game on a 9x3 board -- we would, of course, love to learn how to play the same game on arbitrary size boards, but the fact is that the game is mighty hard even at this \u201cboard size\u201d (since in each round, one player plays a 0-1 assignment to each cell in the board, and the other player picks a subset of the columns, in fact a weight vector on the columns).",
      "suffix": "\n",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_mitigate-criticism",
      "alignment": [
        "context_sentences",
        [
          5,
          6,
          7,
          8
        ]
      ],
      "details": {}
    },
    {
      "review_id": "Skgxo_i6_S",
      "rebuttal_id": "H1xP2ojXjB",
      "sentence_index": 19,
      "text": "Instead of producing an algorithm that works for the 0-1 version of the problem, we produce an algorithm that works for the fractional version of the problem.",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_mitigate-criticism",
      "alignment": [
        "context_sentences",
        [
          5,
          6,
          7,
          8
        ]
      ],
      "details": {}
    },
    {
      "review_id": "Skgxo_i6_S",
      "rebuttal_id": "H1xP2ojXjB",
      "sentence_index": 20,
      "text": "This is, once again, motivated by making something work with modest amount of computation.",
      "suffix": "",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_mitigate-criticism",
      "alignment": [
        "context_sentences",
        [
          5,
          6,
          7,
          8
        ]
      ],
      "details": {}
    },
    {
      "review_id": "Skgxo_i6_S",
      "rebuttal_id": "H1xP2ojXjB",
      "sentence_index": 21,
      "text": "Our explorations indicated that producing an algorithm for the 0-1 version needs reinforcement learning, and producing an algorithm that works on all 9x3 instances using this approach would still take several days of computation.",
      "suffix": "\n\n\n",
      "rebuttal_stance": "dispute",
      "rebuttal_action": "rebuttal_mitigate-criticism",
      "alignment": [
        "context_sentences",
        [
          5,
          6,
          7,
          8
        ]
      ],
      "details": {}
    },
    {
      "review_id": "Skgxo_i6_S",
      "rebuttal_id": "H1xP2ojXjB",
      "sentence_index": 22,
      "text": "On calling it Yao\u2019s Principle: as Reviewer #1 correctly noted, this is an application of the classic von Neumann minimax principle to the \u201cgame\u201d between an \u201calgorithm player\u201d and an \u201cinput player\u201d.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_concede-criticism",
      "alignment": [
        "context_sentences",
        [
          3
        ]
      ],
      "details": {}
    },
    {
      "review_id": "Skgxo_i6_S",
      "rebuttal_id": "H1xP2ojXjB",
      "sentence_index": 23,
      "text": "We call it Yao\u2019s principle primarily in accordance with tradition in theoretical CS (see https://en.wikipedia.org/wiki/Yao%27s_principle and also https://blog.computationalcomplexity.org/2006/10/favorite-theorems-yao-principle.html, where it is noted that \u201cYao observed [the result]\u201d and commentators note that it\u2019s called Yao\u2019s principle because this observation has significant consequences for many central problems in TCS).",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_concede-criticism",
      "alignment": [
        "context_sentences",
        [
          3
        ]
      ],
      "details": {}
    },
    {
      "review_id": "Skgxo_i6_S",
      "rebuttal_id": "H1xP2ojXjB",
      "sentence_index": 24,
      "text": "We are happy to add text to reflect this.",
      "suffix": "",
      "rebuttal_stance": "concur",
      "rebuttal_action": "rebuttal_done",
      "alignment": [
        "context_sentences",
        [
          3
        ]
      ],
      "details": {
        "request_out_of_scope": true
      }
    }
  ]
}