{
  "Selected_candidate": {
    "pr_number": 19964,
    "pr_title": "FIX: add subplot_mosaic axes in the order the user gave them to us",
    "pr_body": "## PR Summary\r\n\r\nThe order the Axes are added to the Figure (and hence the order they are in\r\nthe returned dictionary and fig.axes) is now based on the first time we see the\r\nkey if the layout were recursively unraveled as a c-style array.\r\n\r\nCloses #19736\r\n\r\n## PR Checklist\r\n\r\n<!-- Please mark any checkboxes that do not apply to this PR as [N/A]. -->\r\n\r\n- [x] Has pytest style unit tests (and `pytest` passes).\r\n- [x] Is [Flake 8](https://flake8.pycqa.org/en/latest/) compliant (run `flake8` on changed files to check).\r\n- [x] Conforms to Matplotlib style conventions (install `flake8-docstrings` and run `flake8 --docstring-convention=all`).\r\n- [ ] API changes documented in `doc/api/next_api_changes/` (follow instructions in README.rst there).\r\n\r\nI don't think this needs an API change note as the order was previously unreliable.",
    "issue_id": 19736,
    "issue_title": "subplot_mosaic axes are not added in consistent order",
    "issue_body": "<!--To help us understand and resolve your issue, please fill out the form to the best of your ability.-->\r\n<!--You can feel free to delete the sections that do not apply.-->\r\n\r\n### Bug report\r\n\r\n**Bug summary**\r\n\r\nThe axes of a subplot_mosaic show up in a random order in `fig.axes` (likely due to the use of a `set` for uniquification in `_identify_keys_and_nested`).\r\n\r\n**Code for reproduction**\r\n\r\n<!--A minimum code snippet required to reproduce the bug.\r\nPlease make sure to minimize the number of dependencies required, and provide\r\nany necessary plotted data.\r\nAvoid using threads, as Matplotlib is (explicitly) not thread-safe.-->\r\n\r\n```sh\r\nfor _ in $(seq 10); do python -c 'from pylab import *; fig, axs = subplot_mosaic(\"ab\"); print(fig.axes.index(axs[\"a\"]))'; done\r\n```\r\n\r\n**Actual outcome**\r\n\r\n<!--The output produced by the above code, which may be a screenshot, console output, etc.-->\r\n\r\n```\r\n1\r\n0\r\n1\r\n1\r\n1\r\n0\r\n0\r\n0\r\n0\r\n1\r\n```\r\n\r\n**Expected outcome**\r\n\r\nAxes should be added in a consistent order.  I guess a reasonable one would be as if iterating the spec in C order (dropping duplicates).\r\n\r\nNot release critical (especially as the order was not fixed, so fixing an order is not a backcompat break), but would be nice to get this sorted out before the API moves out of being experimental.\r\n\r\n**Matplotlib version**\r\n<!--Please specify your platform and versions of the relevant libraries you are using:-->\r\n  * Operating system: linux\r\n  * Matplotlib version (`import matplotlib; print(matplotlib.__version__)`): head\r\n  * Matplotlib backend (`print(matplotlib.get_backend())`): any\r\n  * Python version: 39\r\n  * Jupyter version (if applicable): \r\n  * Other libraries: \r\n\r\n<!--Please tell us how you installed matplotlib and python e.g., from source, pip, conda-->\r\n<!--If you installed from conda, please specify which channel you used if not the default-->\r\n\r\n---\r\n\r\nNote: the simple solution of replacing `unique_ids = set()` by `unique_ids = cbook._OrderedSet()` is good enough for the non-nested case, but doesn't handle nested layouts such as `subplot_mosaic([[\"a\", [[\"b1\", \"b2\"], [\"b3\", \"b4\"]]], [\"c\", \"d\"]])` because currently the nested submosaic is always added after all the non-nested axes.",
    "issue_closed_at": "2021-04-16T21:43:45Z",
    "base_commit": "a051169c1b51a36c270b7b696ec86fae8f2f23e8",
    "changes": [
      {
        "file": "lib/matplotlib/figure.py",
        "type": "function",
        "name": "_identify_keys_and_nested",
        "class_name": "FigureBase",
        "code": "def _identify_keys_and_nested(layout):\n            \"\"\"\n            Given a 2D object array, identify unique IDs and nested layouts\n\n            Parameters\n            ----------\n            layout : 2D numpy object array\n\n            Returns\n            -------\n            unique_ids : set\n                The unique non-sub layout entries in this layout\n            nested : dict[tuple[int, int]], 2D object array\n            \"\"\"\n            unique_ids = set()\n            nested = {}\n            for j, row in enumerate(layout):\n                for k, v in enumerate(row):\n                    if v == empty_sentinel:\n                        continue\n                    elif not cbook.is_scalar_or_string(v):\n                        nested[(j, k)] = _make_array(v)\n                    else:\n                        unique_ids.add(v)\n\n            return unique_ids, nested"
      },
      {
        "file": "lib/matplotlib/figure.py",
        "type": "function",
        "name": "_identify_keys_and_nested",
        "class_name": "FigureBase",
        "code": "def _identify_keys_and_nested(layout):\n            \"\"\"\n            Given a 2D object array, identify unique IDs and nested layouts\n\n            Parameters\n            ----------\n            layout : 2D numpy object array\n\n            Returns\n            -------\n            unique_ids : set\n                The unique non-sub layout entries in this layout\n            nested : dict[tuple[int, int]], 2D object array\n            \"\"\"\n            unique_ids = set()\n            nested = {}\n            for j, row in enumerate(layout):\n                for k, v in enumerate(row):\n                    if v == empty_sentinel:\n                        continue\n                    elif not cbook.is_scalar_or_string(v):\n                        nested[(j, k)] = _make_array(v)\n                    else:\n                        unique_ids.add(v)\n\n            return unique_ids, nested"
      },
      {
        "file": "lib/matplotlib/figure.py",
        "type": "function",
        "name": "_do_layout",
        "class_name": "FigureBase",
        "code": "def _do_layout(gs, layout, unique_ids, nested):\n            \"\"\"\n            Recursively do the layout.\n\n            Parameters\n            ----------\n            gs : GridSpec\n            layout : 2D object array\n                The input converted to a 2D numpy array for this level.\n            unique_ids : set\n                The identified scalar labels at this level of nesting.\n            nested : dict[tuple[int, int]], 2D object array\n                The identified nested layouts, if any.\n\n            Returns\n            -------\n            dict[label, Axes]\n                A flat dict of all of the Axes created.\n            \"\"\"\n            rows, cols = layout.shape\n            output = dict()\n\n            # create the Axes at this level of nesting\n            for name in unique_ids:\n                indx = np.argwhere(layout == name)\n                start_row, start_col = np.min(indx, axis=0)\n                end_row, end_col = np.max(indx, axis=0) + 1\n                slc = (slice(start_row, end_row), slice(start_col, end_col))\n\n                if (layout[slc] != name).any():\n                    raise ValueError(\n                        f\"While trying to layout\\n{layout!r}\\n\"\n                        f\"we found that the label {name!r} specifies a \"\n                        \"non-rectangular or non-contiguous area.\")\n\n                ax = self.add_subplot(\n                    gs[slc], **{'label': str(name), **subplot_kw}\n                )\n                output[name] = ax\n\n            # do any sub-layouts\n            for (j, k), nested_layout in nested.items():\n                rows, cols = nested_layout.shape\n                nested_output = _do_layout(\n                    gs[j, k].subgridspec(rows, cols, **gridspec_kw),\n                    nested_layout,\n                    *_identify_keys_and_nested(nested_layout)\n                )\n                overlap = set(output) & set(nested_output)\n                if overlap:\n                    raise ValueError(f\"There are duplicate keys {overlap} \"\n                                     f\"between the outer layout\\n{layout!r}\\n\"\n                                     f\"and the nested layout\\n{nested_layout}\")\n                output.update(nested_output)\n            return output"
      },
      {
        "file": "lib/matplotlib/figure.py",
        "type": "function",
        "name": "_do_layout",
        "class_name": "FigureBase",
        "code": "def _do_layout(gs, layout, unique_ids, nested):\n            \"\"\"\n            Recursively do the layout.\n\n            Parameters\n            ----------\n            gs : GridSpec\n            layout : 2D object array\n                The input converted to a 2D numpy array for this level.\n            unique_ids : set\n                The identified scalar labels at this level of nesting.\n            nested : dict[tuple[int, int]], 2D object array\n                The identified nested layouts, if any.\n\n            Returns\n            -------\n            dict[label, Axes]\n                A flat dict of all of the Axes created.\n            \"\"\"\n            rows, cols = layout.shape\n            output = dict()\n\n            # create the Axes at this level of nesting\n            for name in unique_ids:\n                indx = np.argwhere(layout == name)\n                start_row, start_col = np.min(indx, axis=0)\n                end_row, end_col = np.max(indx, axis=0) + 1\n                slc = (slice(start_row, end_row), slice(start_col, end_col))\n\n                if (layout[slc] != name).any():\n                    raise ValueError(\n                        f\"While trying to layout\\n{layout!r}\\n\"\n                        f\"we found that the label {name!r} specifies a \"\n                        \"non-rectangular or non-contiguous area.\")\n\n                ax = self.add_subplot(\n                    gs[slc], **{'label': str(name), **subplot_kw}\n                )\n                output[name] = ax\n\n            # do any sub-layouts\n            for (j, k), nested_layout in nested.items():\n                rows, cols = nested_layout.shape\n                nested_output = _do_layout(\n                    gs[j, k].subgridspec(rows, cols, **gridspec_kw),\n                    nested_layout,\n                    *_identify_keys_and_nested(nested_layout)\n                )\n                overlap = set(output) & set(nested_output)\n                if overlap:\n                    raise ValueError(f\"There are duplicate keys {overlap} \"\n                                     f\"between the outer layout\\n{layout!r}\\n\"\n                                     f\"and the nested layout\\n{nested_layout}\")\n                output.update(nested_output)\n            return output"
      }
    ]
  },
  "Justification": "Candidate C is the most helpful because it also involves issues related to the figure and axes in Matplotlib, which is directly relevant to the CURRENT bug report regarding cursor data formatting and the interactions with BoundaryNorm. The structural similarities in the handling of subplot axes are key to understanding potential bugs with figure elements. Furthermore, the candidate addresses a fundamental aspect of how axes are managed, which will provide insights into the current issue of cursor data representation during mouse interactions. This alignment makes it a crucial candidate for understanding and potentially rectifying the crash described in the CURRENT report.",
  "instance_id": "matplotlib__matplotlib-22835",
  "repo": "matplotlib/matplotlib",
  "created_at": "2022-04-12T23:13:58Z",
  "problem_statement": "[Bug]: scalar mappable format_cursor_data crashes on BoundarNorm\n### Bug summary\r\n\r\nIn 3.5.0 if you do:\r\n\r\n```python\r\nimport matplotlib.pyplot as plt\r\nimport numpy as np\r\nimport matplotlib as mpl\r\n\r\nfig, ax = plt.subplots()\r\nnorm = mpl.colors.BoundaryNorm(np.linspace(-4, 4, 5), 256)\r\nX = np.random.randn(10, 10)\r\npc = ax.imshow(X, cmap='RdBu_r', norm=norm)\r\n```\r\n\r\nand mouse over the image, it crashes with\r\n\r\n```\r\nFile \"/Users/jklymak/matplotlib/lib/matplotlib/artist.py\", line 1282, in format_cursor_data\r\n    neighbors = self.norm.inverse(\r\n  File \"/Users/jklymak/matplotlib/lib/matplotlib/colors.py\", line 1829, in inverse\r\n    raise ValueError(\"BoundaryNorm is not invertible\")\r\nValueError: BoundaryNorm is not invertible\r\n```\r\n\r\nand interaction stops.  \r\n\r\nNot sure if we should have a special check here, a try-except, or actually just make BoundaryNorm approximately invertible.  \r\n\r\n\r\n### Matplotlib Version\r\n\r\nmain 3.5.0\r\n\r\n\n[Bug]: scalar mappable format_cursor_data crashes on BoundarNorm\n### Bug summary\r\n\r\nIn 3.5.0 if you do:\r\n\r\n```python\r\nimport matplotlib.pyplot as plt\r\nimport numpy as np\r\nimport matplotlib as mpl\r\n\r\nfig, ax = plt.subplots()\r\nnorm = mpl.colors.BoundaryNorm(np.linspace(-4, 4, 5), 256)\r\nX = np.random.randn(10, 10)\r\npc = ax.imshow(X, cmap='RdBu_r', norm=norm)\r\n```\r\n\r\nand mouse over the image, it crashes with\r\n\r\n```\r\nFile \"/Users/jklymak/matplotlib/lib/matplotlib/artist.py\", line 1282, in format_cursor_data\r\n    neighbors = self.norm.inverse(\r\n  File \"/Users/jklymak/matplotlib/lib/matplotlib/colors.py\", line 1829, in inverse\r\n    raise ValueError(\"BoundaryNorm is not invertible\")\r\nValueError: BoundaryNorm is not invertible\r\n```\r\n\r\nand interaction stops.  \r\n\r\nNot sure if we should have a special check here, a try-except, or actually just make BoundaryNorm approximately invertible.  \r\n\r\n\r\n### Matplotlib Version\r\n\r\nmain 3.5.0\r\n\r\n\n",
  "patch": "diff --git a/lib/matplotlib/artist.py b/lib/matplotlib/artist.py\n--- a/lib/matplotlib/artist.py\n+++ b/lib/matplotlib/artist.py\n@@ -12,6 +12,7 @@\n \n import matplotlib as mpl\n from . import _api, cbook\n+from .colors import BoundaryNorm\n from .cm import ScalarMappable\n from .path import Path\n from .transforms import (Bbox, IdentityTransform, Transform, TransformedBbox,\n@@ -1303,10 +1304,20 @@ def format_cursor_data(self, data):\n                 return \"[]\"\n             normed = self.norm(data)\n             if np.isfinite(normed):\n-                # Midpoints of neighboring color intervals.\n-                neighbors = self.norm.inverse(\n-                    (int(self.norm(data) * n) + np.array([0, 1])) / n)\n-                delta = abs(neighbors - data).max()\n+                if isinstance(self.norm, BoundaryNorm):\n+                    # not an invertible normalization mapping\n+                    cur_idx = np.argmin(np.abs(self.norm.boundaries - data))\n+                    neigh_idx = max(0, cur_idx - 1)\n+                    # use max diff to prevent delta == 0\n+                    delta = np.diff(\n+                        self.norm.boundaries[neigh_idx:cur_idx + 2]\n+                    ).max()\n+\n+                else:\n+                    # Midpoints of neighboring color intervals.\n+                    neighbors = self.norm.inverse(\n+                        (int(normed * n) + np.array([0, 1])) / n)\n+                    delta = abs(neighbors - data).max()\n                 g_sig_digits = cbook._g_sig_digits(data, delta)\n             else:\n                 g_sig_digits = 3  # Consistent with default below.\n"
}