{
  "instance_id": "mwaskom__seaborn-2848",
  "repo": "mwaskom/seaborn",
  "created_at": "2022-06-11T18:21:32Z",
  "problem_statement": "pairplot fails with hue_order not containing all hue values in seaborn 0.11.1\nIn seaborn < 0.11, one could plot only a subset of the values in the hue column, by passing a hue_order list containing only the desired values. Points with hue values not in the list were simply not plotted.\n```python\niris = sns.load_dataset(\"iris\")`\n# The hue column contains three different species; here we want to plot two\nsns.pairplot(iris, hue=\"species\", hue_order=[\"setosa\", \"versicolor\"])\n```\n\nThis no longer works in 0.11.1. Passing a hue_order list that does not contain some of the values in the hue column raises a long, ugly error traceback. The first exception arises in seaborn/_core.py:\n```\nTypeError: ufunc 'isnan' not supported for the input types, and the inputs could not be safely coerced to any supported types according to the casting rule ''safe''\n```\nseaborn version: 0.11.1\nmatplotlib version: 3.3.2\nmatplotlib backends: MacOSX, Agg or jupyter notebook inline.",
  "patch": "diff --git a/seaborn/_oldcore.py b/seaborn/_oldcore.py\n--- a/seaborn/_oldcore.py\n+++ b/seaborn/_oldcore.py\n@@ -149,6 +149,13 @@ def _lookup_single(self, key):\n             # Use a value that's in the original data vector\n             value = self.lookup_table[key]\n         except KeyError:\n+\n+            if self.norm is None:\n+                # Currently we only get here in scatterplot with hue_order,\n+                # because scatterplot does not consider hue a grouping variable\n+                # So unused hue levels are in the data, but not the lookup table\n+                return (0, 0, 0, 0)\n+\n             # Use the colormap to interpolate between existing datapoints\n             # (e.g. in the context of making a continuous legend)\n             try:\n",
  "similar_bug_items": [
    {
      "pr_number": 2368,
      "pr_title": "Fix pairgrid off-diagonal plots with non-string column names",
      "pr_body": "Fixes #2307\r\n\r\nWorking reprex from original issue:\r\n\r\n![image](https://user-images.githubusercontent.com/315810/100743239-108c0200-33aa-11eb-8885-9c61d89b3acc.png)\r\n",
      "issue_id": 2307,
      "issue_title": "map_* methods of PairGrid broken in 0.11.0",
      "issue_body": "Hi,\r\n\r\nI discovered that the map_* methods of PairGrid seem to be broken in version 0.11.0 for user defined functions. See reproducible example below with a corrfunc defined to plot the pearson correlation value on the lower plots. The function doesn't seem to get evaluated in version 0.11.0. When I pip install seaborn==0.10.1, I get the desired result. Plots from both cases also attached.\r\n\r\n```import numpy as np\r\nfrom scipy import stats\r\nimport pandas as pd\r\nimport seaborn as sns\r\nimport matplotlib.pyplot as plt\r\nsns.set(style=\"white\")\r\n\r\nmean = np.zeros(3)\r\ncov = np.random.uniform(.2, .4, (3, 3))\r\ncov += cov.T\r\ncov[np.diag_indices(3)] = 1\r\ndata = np.random.multivariate_normal(mean, cov, 100)\r\ndf = pd.DataFrame(data, columns=[\"X\", \"Y\", \"Z\"])\r\n\r\ndef corrfunc(x, y,**kws):\r\n    r, _ = stats.pearsonr(x, y)\r\n    ax = plt.gca()\r\n    ax.annotate(\"r = {:.2f}\".format(r),\r\n                xy=(.1, .9), xycoords=ax.transAxes)\r\n\r\ng = sns.PairGrid(df, palette=[\"red\"])\r\ng.map_upper(plt.scatter, s=10)\r\ng.map_diag(sns.distplot, kde=False)\r\ng.map_lower(sns.kdeplot, cmap=\"Blues_d\")\r\ng.map_lower(corrfunc)\r\nplt.show()\r\n\r\n```\r\n\r\n![seaborn-0-11-0](https://user-images.githubusercontent.com/3239171/94969718-380d3e00-04d1-11eb-821b-9aad80ec696e.png)\r\n\r\n![seaborn-0-10-1](https://user-images.githubusercontent.com/3239171/94969722-3a6f9800-04d1-11eb-8ef1-861d9beb1f26.png)\r\n\r\n\r\n\r\n",
      "issue_closed_at": "2020-12-01T20:48:45Z",
      "base_commit": "2717408b564994002fe08f72ba2dd7e1acf359b6",
      "changes": [
        {
          "file": "seaborn/axisgrid.py",
          "type": "function",
          "name": "__init__",
          "class_name": "JointGrid",
          "code": "def __init__(\n        self, *,\n        x=None, y=None,\n        data=None,\n        height=6, ratio=5, space=.2,\n        dropna=False, xlim=None, ylim=None, size=None, marginal_ticks=False,\n        hue=None, palette=None, hue_order=None, hue_norm=None,\n    ):\n        # Handle deprecations\n        if size is not None:\n            height = size\n            msg = (\"The `size` parameter has been renamed to `height`; \"\n                   \"please update your code.\")\n            warnings.warn(msg, UserWarning)\n\n        # Set up the subplot grid\n        f = plt.figure(figsize=(height, height))\n        gs = plt.GridSpec(ratio + 1, ratio + 1)\n\n        ax_joint = f.add_subplot(gs[1:, :-1])\n        ax_marg_x = f.add_subplot(gs[0, :-1], sharex=ax_joint)\n        ax_marg_y = f.add_subplot(gs[1:, -1], sharey=ax_joint)\n\n        self.fig = f\n        self.ax_joint = ax_joint\n        self.ax_marg_x = ax_marg_x\n        self.ax_marg_y = ax_marg_y\n\n        # Turn off tick visibility for the measure axis on the marginal plots\n        plt.setp(ax_marg_x.get_xticklabels(), visible=False)\n        plt.setp(ax_marg_y.get_yticklabels(), visible=False)\n        plt.setp(ax_marg_x.get_xticklabels(minor=True), visible=False)\n        plt.setp(ax_marg_y.get_yticklabels(minor=True), visible=False)\n\n        # Turn off the ticks on the density axis for the marginal plots\n        if not marginal_ticks:\n            plt.setp(ax_marg_x.yaxis.get_majorticklines(), visible=False)\n            plt.setp(ax_marg_x.yaxis.get_minorticklines(), visible=False)\n            plt.setp(ax_marg_y.xaxis.get_majorticklines(), visible=False)\n            plt.setp(ax_marg_y.xaxis.get_minorticklines(), visible=False)\n            plt.setp(ax_marg_x.get_yticklabels(), visible=False)\n            plt.setp(ax_marg_y.get_xticklabels(), visible=False)\n            plt.setp(ax_marg_x.get_yticklabels(minor=True), visible=False)\n            plt.setp(ax_marg_y.get_xticklabels(minor=True), visible=False)\n            ax_marg_x.yaxis.grid(False)\n            ax_marg_y.xaxis.grid(False)\n\n        # Process the input variables\n        p = VectorPlotter(data=data, variables=dict(x=x, y=y, hue=hue))\n        plot_data = p.plot_data.loc[:, p.plot_data.notna().any()]\n\n        # Possibly drop NA\n        if dropna:\n            plot_data = plot_data.dropna()\n\n        def get_var(var):\n            vector = plot_data.get(var, None)\n            if vector is not None:\n                vector = vector.rename(p.variables.get(var, None))\n            return vector\n\n        self.x = get_var(\"x\")\n        self.y = get_var(\"y\")\n        self.hue = get_var(\"hue\")\n\n        for axis in \"xy\":\n            name = p.variables.get(axis, None)\n            if name is not None:\n                getattr(ax_joint, f\"set_{axis}label\")(name)\n\n        if xlim is not None:\n            ax_joint.set_xlim(xlim)\n        if ylim is not None:\n            ax_joint.set_ylim(ylim)\n\n        # Store the semantic mapping parameters for axes-level functions\n        self._hue_params = dict(palette=palette, hue_order=hue_order, hue_norm=hue_norm)\n\n        # Make the grid look nice\n        utils.despine(f)\n        if not marginal_ticks:\n            utils.despine(ax=ax_marg_x, left=True)\n            utils.despine(ax=ax_marg_y, bottom=True)\n        for axes in [ax_marg_x, ax_marg_y]:\n            for axis in [axes.xaxis, axes.yaxis]:\n                axis.label.set_visible(False)\n        f.tight_layout()\n        f.subplots_adjust(hspace=space, wspace=space)"
        },
        {
          "file": "seaborn/axisgrid.py",
          "type": "function",
          "name": "_map_diag_iter_hue",
          "class_name": "PairGrid",
          "code": "def _map_diag_iter_hue(self, func, **kwargs):\n        \"\"\"Put marginal plot on each diagonal axes, iterating over hue.\"\"\"\n        # Plot on each of the diagonal axes\n        fixed_color = kwargs.pop(\"color\", None)\n\n        for var, ax in zip(self.diag_vars, self.diag_axes):\n            hue_grouped = self.data[var].groupby(self.hue_vals)\n\n            plt.sca(ax)\n\n            for k, label_k in enumerate(self.hue_names):\n\n                # Attempt to get data for this level, allowing for empty\n                try:\n                    data_k = hue_grouped.get_group(label_k)\n                except KeyError:\n                    data_k = pd.Series([], dtype=float)\n\n                if fixed_color is None:\n                    color = self.palette[k]\n                else:\n                    color = fixed_color\n\n                if self._dropna:\n                    data_k = utils.remove_na(data_k)\n\n                if str(func.__module__).startswith(\"seaborn\"):\n                    func(x=data_k, label=label_k, color=color, **kwargs)\n                else:\n                    func(data_k, label=label_k, color=color, **kwargs)\n\n            self._clean_axis(ax)\n\n        self._add_axis_labels()\n\n        return self"
        },
        {
          "file": "seaborn/axisgrid.py",
          "type": "function",
          "name": "_plot_bivariate_iter_hue",
          "class_name": "PairGrid",
          "code": "def _plot_bivariate_iter_hue(self, x_var, y_var, ax, func, **kwargs):\n        \"\"\"Draw a bivariate plot while iterating over hue subsets.\"\"\"\n        plt.sca(ax)\n        if x_var == y_var:\n            axes_vars = [x_var]\n        else:\n            axes_vars = [x_var, y_var]\n\n        hue_grouped = self.data.groupby(self.hue_vals)\n        for k, label_k in enumerate(self.hue_names):\n\n            kws = kwargs.copy()\n\n            # Attempt to get data for this level, allowing for empty\n            try:\n                data_k = hue_grouped.get_group(label_k)\n            except KeyError:\n                data_k = pd.DataFrame(columns=axes_vars,\n                                      dtype=float)\n\n            if self._dropna:\n                data_k = data_k[axes_vars].dropna()\n\n            x = data_k[x_var]\n            y = data_k[y_var]\n\n            for kw, val_list in self.hue_kws.items():\n                kws[kw] = val_list[k]\n            kws.setdefault(\"color\", self.palette[k])\n            if self._hue_var is not None:\n                kws[\"label\"] = label_k\n\n            if str(func.__module__).startswith(\"seaborn\"):\n                func(x=x, y=y, **kws)\n            else:\n                func(x, y, **kws)\n\n        self._update_legend_data(ax)\n        self._clean_axis(ax)"
        }
      ]
    },
    {
      "pr_number": 2504,
      "pr_title": "Fix log scaling in distribution plots",
      "pr_body": "Fixes #2502 \r\n\r\nThis is a huge development footgun; see #2409 for thoughts on how this can be made automatic to reduce the risk of such bugs",
      "issue_id": 2502,
      "issue_title": "displot(kind='ecdf',..., log_scale=True) not working",
      "issue_body": "The following line of code gives an error:\r\n\r\n```\r\nsns.displot(kind='ecdf', data=df, x='col_1', log_scale=True)\r\n\r\nUserWarning: Data has no positive values, and therefore cannot be log-scaled.\r\n```\r\n\r\n\r\nMy data is all positive and kind='hist' or 'kde' works just fine.\r\n\r\n",
      "issue_closed_at": "2021-03-24T21:54:09Z",
      "base_commit": "ba4bd0fa0a90b2bd00cb62c2b4a5e38013a73ac6",
      "changes": [
        {
          "file": "seaborn/distributions.py",
          "type": "line",
          "name": "line 57",
          "code": "    \"\"\",\n    log_scale=\"\"\"\nlog_scale : bool or number, or pair of bools or numbers\n    Set a log scale on the data axis (or axes, with bivariate data) with the\n    given base (default 10), and evaluate the KDE in log space.\n    \"\"\",\n    legend=\"\"\"\nlegend : bool"
        },
        {
          "file": "seaborn/distributions.py",
          "type": "function",
          "name": "plot_univariate_ecdf",
          "class_name": "_DistributionPlotter",
          "code": "def plot_univariate_ecdf(self, estimate_kws, legend, **plot_kws):\n\n        estimator = ECDF(**estimate_kws)\n\n        # Set the draw style to step the right way for the data variable\n        drawstyles = dict(x=\"steps-post\", y=\"steps-pre\")\n        plot_kws[\"drawstyle\"] = drawstyles[self.data_variable]\n\n        # Loop through the subsets, transform and plot the data\n        for sub_vars, sub_data in self.iter_data(\n            \"hue\", reverse=True, from_comp_data=True,\n        ):\n\n            # Compute the ECDF\n            if sub_data.empty:\n                continue\n\n            observations = sub_data[self.data_variable]\n            weights = sub_data.get(\"weights\", None)\n            stat, vals = estimator(observations, weights=weights)\n\n            # Assign attributes based on semantic mapping\n            artist_kws = plot_kws.copy()\n            if \"hue\" in self.variables:\n                artist_kws[\"color\"] = self._hue_map(sub_vars[\"hue\"])\n\n            # Work out the orientation of the plot\n            if self.data_variable == \"x\":\n                plot_args = vals, stat\n                stat_variable = \"y\"\n            else:\n                plot_args = stat, vals\n                stat_variable = \"x\"\n\n            if estimator.stat == \"count\":\n                top_edge = len(observations)\n            else:\n                top_edge = 1\n\n            # Draw the line for this subset\n            ax = self._get_axes(sub_vars)\n            artist, = ax.plot(*plot_args, **artist_kws)\n            sticky_edges = getattr(artist.sticky_edges, stat_variable)\n            sticky_edges[:] = 0, top_edge\n\n        # --- Finalize the plot ----\n        ax = self.ax if self.ax is not None else self.facets.axes.flat[0]\n        stat = estimator.stat.capitalize()\n        default_x = default_y = \"\"\n        if self.data_variable == \"x\":\n            default_y = stat\n        if self.data_variable == \"y\":\n            default_x = stat\n        self._add_axis_labels(ax, default_x, default_y)\n\n        if \"hue\" in self.variables and legend:\n            artist = partial(mpl.lines.Line2D, [], [])\n            alpha = plot_kws.get(\"alpha\", 1)\n            ax_obj = self.ax if self.ax is not None else self.facets\n            self._add_legend(\n                ax_obj, artist, False, False, None, alpha, plot_kws, {},\n            )"
        },
        {
          "file": "seaborn/distributions.py",
          "type": "function",
          "name": "_plot_single_rug",
          "class_name": "_DistributionPlotter",
          "code": "def _plot_single_rug(self, sub_data, var, height, ax, kws):\n        \"\"\"Draw a rugplot along one axis of the plot.\"\"\"\n        vector = sub_data[var]\n        n = len(vector)\n\n        # We'll always add a single collection with varying colors\n        if \"hue\" in self.variables:\n            colors = self._hue_map(sub_data[\"hue\"])\n        else:\n            colors = None\n\n        # Build the array of values for the LineCollection\n        if var == \"x\":\n\n            trans = tx.blended_transform_factory(ax.transData, ax.transAxes)\n            xy_pairs = np.column_stack([\n                np.repeat(vector, 2), np.tile([0, height], n)\n            ])\n\n        if var == \"y\":\n\n            trans = tx.blended_transform_factory(ax.transAxes, ax.transData)\n            xy_pairs = np.column_stack([\n                np.tile([0, height], n), np.repeat(vector, 2)\n            ])\n\n        # Draw the lines on the plot\n        line_segs = xy_pairs.reshape([n, 2, 2])\n        ax.add_collection(LineCollection(\n            line_segs, transform=trans, colors=colors, **kws\n        ))\n\n        ax.autoscale_view(scalex=var == \"x\", scaley=var == \"y\")"
        }
      ]
    },
    {
      "pr_number": 2287,
      "pr_title": "Fix: use linewidth in boxenplot elements",
      "pr_body": "Fixes #2285 . Prior to this change, the `linewidth` argument is only used to frame the legend elements and is not applied to boxes and medians.",
      "issue_id": 2285,
      "issue_title": "boxenplot linewidth bug",
      "issue_body": "I tried changing the linewidth of the boxes' edges with the `linedwidth` parameter, but it doesn't seem to do anything. \r\n\r\nThis is actually visible in the documentation:\r\nhttps://seaborn.pydata.org/generated/seaborn.boxenplot.html#seaborn.boxenplot\r\n\r\nThe following example specifies a linewidth of 2.5:\r\n\r\n```python\r\nax = sns.boxenplot(x=\"day\", y=\"total_bill\", hue=\"time\", data=tips, linewidth=2.5)\r\n```\r\n\r\nwhich produces the following image:\r\n![image](https://user-images.githubusercontent.com/10333715/93465245-cdfa7380-f8ea-11ea-8051-ce591f4ead12.png)\r\n\r\nAs you can see the boxes linewidth is unchanged. Instead, the linewidth argument changes the legend boxes. ",
      "issue_closed_at": "2020-10-26T22:07:11Z",
      "base_commit": "211cabb09bb9228635bdd80f74f7591d36d7b2a7",
      "changes": [
        {
          "file": "seaborn/categorical.py",
          "type": "function",
          "name": "__init__",
          "class_name": "_LVPlotter",
          "code": "def __init__(self, x, y, hue, data, order, hue_order,\n                 orient, color, palette, saturation,\n                 width, dodge, k_depth, linewidth, scale, outlier_prop,\n                 trust_alpha, showfliers=True):\n\n        self.width = width\n        self.dodge = dodge\n        self.saturation = saturation\n\n        k_depth_methods = ['proportion', 'tukey', 'trustworthy', 'full']\n        if not (k_depth in k_depth_methods or isinstance(k_depth, Number)):\n            msg = (f'k_depth must be one of {k_depth_methods} or a number, '\n                   f'but {k_depth} was passed.')\n            raise ValueError(msg)\n        self.k_depth = k_depth\n\n        # TODO seems this member is only used to frame the legend artists\n        if linewidth is None:\n            linewidth = mpl.rcParams[\"lines.linewidth\"]\n        self.linewidth = linewidth\n\n        scales = ['linear', 'exponential', 'area']\n        if scale not in scales:\n            msg = f'scale must be one of {scales}, but {scale} was passed.'\n            raise ValueError(msg)\n        self.scale = scale\n\n        if ((outlier_prop > 1) or (outlier_prop <= 0)):\n            msg = f'outlier_prop {outlier_prop} not in range (0, 1]'\n            raise ValueError(msg)\n        self.outlier_prop = outlier_prop\n\n        if not 0 < trust_alpha < 1:\n            msg = f'trust_alpha {trust_alpha} not in range (0, 1)'\n            raise ValueError(msg)\n        self.trust_alpha = trust_alpha\n\n        self.showfliers = showfliers\n\n        self.establish_variables(x, y, hue, data, orient, order, hue_order)\n        self.establish_colors(color, palette, saturation)"
        },
        {
          "file": "seaborn/categorical.py",
          "type": "function",
          "name": "_lvplot",
          "class_name": "_LVPlotter",
          "code": "def _lvplot(self, box_data, positions,\n                color=[255. / 256., 185. / 256., 0.],\n                widths=1, ax=None, **kws):\n\n        vert = self.orient == \"v\"\n        x = positions[0]\n        box_data = np.asarray(box_data)\n\n        # If we only have one data point, plot a line\n        if len(box_data) == 1:\n            kws.update({'color': self.gray, 'linestyle': '-'})\n            ys = [box_data[0], box_data[0]]\n            xs = [x - widths / 2, x + widths / 2]\n            if vert:\n                xx, yy = xs, ys\n            else:\n                xx, yy = ys, xs\n            ax.plot(xx, yy, **kws)\n        else:\n            # Get the number of data points and calculate \"depth\" of\n            # letter-value plot\n            box_ends, k = self._lv_box_ends(box_data)\n\n            # Anonymous functions for calculating the width and height\n            # of the letter value boxes\n            width = self._width_functions(self.scale)\n\n            # Function to find height of boxes\n            def height(b):\n                return b[1] - b[0]\n\n            # Functions to construct the letter value boxes\n            def vert_perc_box(x, b, i, k, w):\n                rect = Patches.Rectangle((x - widths * w / 2, b[0]),\n                                         widths * w,\n                                         height(b), fill=True)\n                return rect\n\n            def horz_perc_box(x, b, i, k, w):\n                rect = Patches.Rectangle((b[0], x - widths * w / 2),\n                                         height(b), widths * w,\n                                         fill=True)\n                return rect\n\n            # Scale the width of the boxes so the biggest starts at 1\n            w_area = np.array([width(height(b), i, k)\n                               for i, b in enumerate(box_ends)])\n            w_area = w_area / np.max(w_area)\n\n            # Calculate the medians\n            y = np.median(box_data)\n\n            # Calculate the outliers and plot (only if showfliers == True)\n            outliers = []\n            if self.showfliers:\n                outliers = self._lv_outliers(box_data, k)\n            hex_color = mpl.colors.rgb2hex(color)\n\n            if vert:\n                box_func = vert_perc_box\n                xs_median = [x - widths / 2, x + widths / 2]\n                ys_median = [y, y]\n                xs_outliers = np.full(len(outliers), x)\n                ys_outliers = outliers\n\n            else:\n                box_func = horz_perc_box\n                xs_median = [y, y]\n                ys_median = [x - widths / 2, x + widths / 2]\n                xs_outliers = outliers\n                ys_outliers = np.full(len(outliers), x)\n\n            boxes = [box_func(x, b[0], i, k, b[1])\n                     for i, b in enumerate(zip(box_ends, w_area))]\n\n            # Plot the medians\n            ax.plot(xs_median, ys_median, c='.15', alpha=.45,\n                    solid_capstyle=\"butt\", **kws)\n\n            # Plot outliers (if any)\n            if len(outliers) > 0:\n                ax.scatter(xs_outliers, ys_outliers, marker='d',\n                           c=self.gray, **kws)\n\n            # Construct a color map from the input color\n            rgb = [hex_color, (1, 1, 1)]\n            cmap = mpl.colors.LinearSegmentedColormap.from_list('new_map', rgb)\n            # Make sure that the last boxes contain hue and are not pure white\n            rgb = [hex_color, cmap(.85)]\n            cmap = mpl.colors.LinearSegmentedColormap.from_list('new_map', rgb)\n            collection = PatchCollection(boxes, cmap=cmap, edgecolor=self.gray)\n\n            # Set the color gradation, first box will have color=hex_color\n            collection.set_array(np.array(np.linspace(1, 0, len(boxes))))\n\n            # Plot the boxes\n            ax.add_collection(collection)"
        },
        {
          "file": "seaborn/categorical.py",
          "type": "function",
          "name": "horz_perc_box",
          "class_name": "_LVPlotter",
          "code": "def horz_perc_box(x, b, i, k, w):\n                rect = Patches.Rectangle((b[0], x - widths * w / 2),\n                                         height(b), widths * w,\n                                         fill=True)\n                return rect"
        },
        {
          "file": "seaborn/categorical.py",
          "type": "function",
          "name": "horz_perc_box",
          "class_name": "_LVPlotter",
          "code": "def horz_perc_box(x, b, i, k, w):\n                rect = Patches.Rectangle((b[0], x - widths * w / 2),\n                                         height(b), widths * w,\n                                         fill=True)\n                return rect"
        }
      ]
    },
    {
      "pr_number": 2224,
      "pr_title": "Reverse direction of default distributions sequential colormap",
      "pr_body": "I think the contrast properties of a light-to-dark ramp are somewhat less desirable than dark-to-light, but it's a bit more intuitive and, decisively, is more internally consistent with the old defaults. So it wins out.\r\n\r\n![image](https://user-images.githubusercontent.com/315810/91326296-2c23b300-e792-11ea-8e09-3d47cc9d641d.png)\r\n\r\n\r\nFixes #2211 ",
      "issue_id": 2211,
      "issue_title": "Default color gradient directionality in displot",
      "issue_body": "The new `displot` function, when showing 2D histograms or KDE contours has its color gradient going from dark (few observations) to bright (many observations). To me this is confusing since many times the background is bright as well. In the absence of colorbar it is hard to figure out which of the bright squares has lots of observations and which has none. Consider this example (also note that the colorbar is wrong in the range of [0,1) ):\r\n```python\r\nimport seaborn as sns\r\nsns.set(style=\"dark\")\r\npenguins = sns.load_dataset(\"penguins\")\r\nsns.displot(data=penguins, x=\"flipper_length_mm\", y=\"culmen_length_mm\", cbar=True)\r\n```\r\nWhich results with:\r\n![image](https://user-images.githubusercontent.com/13831112/90917473-9d0e4800-e3eb-11ea-9f6a-39a9c32907fe.png)\r\nThe inner brighter squares contain lost of observations, as opposed to the bright bounding squares.\r\n\r\nFor a default I think that since `Axes` are filled by default with a bright color (either mpl default or seaborn-set color), it makes sense to vary colors by default from bright to dark. Also this is consistent with the way `jointplot(kind=\"hex\")` works and the 2D kdeplots from version 0.10. But if current coloring is desired, perhaps make `cbar=True` the default?",
      "issue_closed_at": "2020-08-27T13:32:07Z",
      "base_commit": "f62a4a51f3195be3860d5282e5f3afc4d438d0f3",
      "changes": [
        {
          "file": "examples/layered_bivariate_plot.py",
          "type": "line",
          "name": "line 18",
          "code": "\n# Draw a combo histogram and scatterplot with density contours\nf, ax = plt.subplots(figsize=(6, 6))\nsns.scatterplot(x=x, y=y, s=5, color=\".2\")\nsns.histplot(x=x, y=y, bins=50, pthresh=.1, color=\"seagreen\")\nsns.kdeplot(x=x, y=y, levels=5, color=\"w\", linewidths=1)"
        },
        {
          "file": "seaborn/distributions.py",
          "type": "function",
          "name": "_cmap_from_color",
          "class_name": "_DistributionPlotter",
          "code": "def _cmap_from_color(self, color):\n        \"\"\"Return a sequential colormap given a color seed.\"\"\"\n        # Like so much else here, this is broadly useful, but keeping it\n        # in this class to signify that I haven't thought overly hard about it...\n        r, g, b, _ = to_rgba(color)\n        h, s, _ = husl.rgb_to_husl(r, g, b)\n        xx = np.linspace(-1, 1, 256)\n        ramp = np.zeros((256, 3))\n        ramp[:, 0] = h\n        ramp[:, 1] = s * np.cos(xx)\n        ramp[:, 2] = np.linspace(35, 80, 256)\n        colors = np.clip([husl.husl_to_rgb(*hsl) for hsl in ramp], 0, 1)\n        return mpl.colors.ListedColormap(colors)"
        },
        {
          "file": "seaborn/distributions.py",
          "type": "function",
          "name": "_default_discrete",
          "class_name": "_DistributionPlotter",
          "code": "def _default_discrete(self):\n        \"\"\"Find default values for discrete hist esimation based on variable type.\"\"\"\n        if self.univariate:\n            discrete = self.var_types[self.data_variable] == \"categorical\"\n        else:\n            discrete_x = self.var_types[\"x\"] == \"categorical\"\n            discrete_y = self.var_types[\"y\"] == \"categorical\"\n            discrete = discrete_x, discrete_y\n        return discrete"
        }
      ]
    },
    {
      "pr_number": 2417,
      "pr_title": "Improve NA robustness in VectorPlotter.comp_data",
      "pr_body": "This PR avoids passing `nan` through the matplotlib converters used to obtain a numeric/computable representation of the data (i.e. `VectorPlotter.comp_data`).\r\n\r\nIt also\r\n- codifies that the converted columns in `comp_data` have a float dtype\r\n- converts `inf` to `nan`, in line with what matplotlib does\r\n\r\nFixes #2295 \r\n\r\nAdditionally this will implicitly address #1971 once the regression plots are refactored to use `comp_data` internally. (@mojones, funny that you opened both issues).",
      "issue_id": 2295,
      "issue_title": "histplot with categorical values crashes with missing data, though numerical values work fine",
      "issue_body": "Not sure if this is intended behaviour, but it caught me out due to the difference in handling numerical/categorical data. I note that drawing histograms of categorical data is labelled as experimental, so ignore/close if that explains it.\r\n\r\nWith numerical data `histplot` ignores NaN and plots the other values, this is the behaviour I would expect:\r\n\r\n```\r\nimport numpy as np\r\nimport seaborn as sns\r\n\r\nsns.histplot(\r\n    [1.1, 1.2, 1.3, 1.4, np.nan]\r\n)\r\n```\r\n\r\nbut with categorical data it crashes:\r\n```\r\nimport numpy as np\r\nimport seaborn as sns\r\n\r\nsns.histplot(\r\n    ['foo', 'foo', 'bar', np.nan]\r\n)\r\n\r\n# output\r\n---------------------------------------------------------------------------\r\nTypeError                                 Traceback (most recent call last)\r\n~/.virtualenvs/drawingfromdata/lib/python3.8/site-packages/matplotlib/axis.py in convert_units(self, x)\r\n   1519         try:\r\n-> 1520             ret = self.converter.convert(x, self.units, self)\r\n   1521         except Exception as e:\r\n\r\n~/.virtualenvs/drawingfromdata/lib/python3.8/site-packages/matplotlib/category.py in convert(value, unit, axis)\r\n     60         # force an update so it also does type checking\r\n---> 61         unit.update(values)\r\n     62         return np.vectorize(unit._mapping.__getitem__, otypes=[float])(values)\r\n\r\n~/.virtualenvs/drawingfromdata/lib/python3.8/site-packages/matplotlib/category.py in update(self, data)\r\n    210             # OrderedDict just iterates over unique values in data.\r\n--> 211             cbook._check_isinstance((str, bytes), value=val)\r\n    212             if convertible:\r\n\r\n~/.virtualenvs/drawingfromdata/lib/python3.8/site-packages/matplotlib/cbook/__init__.py in _check_isinstance(_types, **kwargs)\r\n   2234         if not isinstance(v, types):\r\n-> 2235             raise TypeError(\r\n   2236                 \"{!r} must be an instance of {}, not a {}\".format(\r\n\r\nTypeError: 'value' must be an instance of str or bytes, not a float\r\n\r\nThe above exception was the direct cause of the following exception:\r\n\r\nConversionError                           Traceback (most recent call last)\r\n<ipython-input-61-b132ea7dca6c> in <module>\r\n      2 import seaborn as sns\r\n      3 \r\n----> 4 sns.histplot(\r\n      5     ['foo', 'foo', 'bar', np.nan]\r\n      6 )\r\n\r\n~/.virtualenvs/drawingfromdata/lib/python3.8/site-packages/seaborn/distributions.py in histplot(data, x, y, hue, weights, stat, bins, binwidth, binrange, discrete, cumulative, common_bins, common_norm, multiple, element, fill, shrink, kde, kde_kws, line_kws, thresh, pthresh, pmax, cbar, cbar_ax, cbar_kws, palette, hue_order, hue_norm, color, log_scale, legend, ax, **kwargs)\r\n   1420     if p.univariate:\r\n   1421 \r\n-> 1422         p.plot_univariate_histogram(\r\n   1423             multiple=multiple,\r\n   1424             element=element,\r\n\r\n~/.virtualenvs/drawingfromdata/lib/python3.8/site-packages/seaborn/distributions.py in plot_univariate_histogram(self, multiple, element, fill, common_norm, common_bins, shrink, kde, kde_kws, color, legend, line_kws, estimate_kws, **plot_kws)\r\n    421 \r\n    422         # First pass through the data to compute the histograms\r\n--> 423         for sub_vars, sub_data in self.iter_data(\"hue\", from_comp_data=True):\r\n    424 \r\n    425             # Prepare the relevant data\r\n\r\n~/.virtualenvs/drawingfromdata/lib/python3.8/site-packages/seaborn/_core.py in iter_data(self, grouping_vars, reverse, from_comp_data)\r\n    965 \r\n    966         if from_comp_data:\r\n--> 967             data = self.comp_data\r\n    968         else:\r\n    969             data = self.plot_data\r\n\r\n~/.virtualenvs/drawingfromdata/lib/python3.8/site-packages/seaborn/_core.py in comp_data(self)\r\n   1034                 axis = getattr(ax, f\"{var}axis\")\r\n   1035 \r\n-> 1036                 comp_var = axis.convert_units(self.plot_data[var])\r\n   1037                 if axis.get_scale() == \"log\":\r\n   1038                     comp_var = np.log10(comp_var)\r\n\r\n~/.virtualenvs/drawingfromdata/lib/python3.8/site-packages/matplotlib/axis.py in convert_units(self, x)\r\n   1520             ret = self.converter.convert(x, self.units, self)\r\n   1521         except Exception as e:\r\n-> 1522             raise munits.ConversionError('Failed to convert value(s) to axis '\r\n   1523                                          f'units: {x!r}') from e\r\n   1524         return ret\r\n\r\nConversionError: Failed to convert value(s) to axis units: 0    foo\r\n1    foo\r\n2    bar\r\n3    NaN\r\nName: x, dtype: object\r\n```\r\n\r\n",
      "issue_closed_at": "2021-01-05T19:40:57Z",
      "base_commit": "aad96f8d2e36ceceb82a42b69aa3a8f47ef7210d",
      "changes": [
        {
          "file": "seaborn/_core.py",
          "type": "function",
          "name": "comp_data",
          "class_name": "VectorPlotter",
          "code": "def comp_data(self):\n        \"\"\"Dataframe with numeric x and y, after unit conversion and log scaling.\"\"\"\n        if not hasattr(self, \"ax\"):\n            # Probably a good idea, but will need a bunch of tests updated\n            # Most of these tests should just use the external interface\n            # Then this can be re-enabled.\n            # raise AttributeError(\"No Axes attached to plotter\")\n            return self.plot_data\n\n        if not hasattr(self, \"_comp_data\"):\n\n            comp_data = (\n                self.plot_data\n                .copy(deep=False)\n                .drop([\"x\", \"y\"], axis=1, errors=\"ignore\")\n            )\n            for var in \"yx\":\n                if var not in self.variables:\n                    continue\n\n                # Get a corresponding axis object so that we can convert the units\n                # to matplotlib's numeric representation, which we can compute on\n                # This is messy and it would probably be better for VectorPlotter\n                # to manage its own converters (using the matplotlib tools).\n                # XXX Currently does not support unshared categorical axes!\n                # (But see comment in _attach about how those don't exist)\n                if self.ax is None:\n                    ax = self.facets.axes.flat[0]\n                else:\n                    ax = self.ax\n                axis = getattr(ax, f\"{var}axis\")\n\n                comp_var = axis.convert_units(self.plot_data[var])\n                if axis.get_scale() == \"log\":\n                    comp_var = np.log10(comp_var)\n                comp_data.insert(0, var, comp_var)\n\n            self._comp_data = comp_data\n\n        return self._comp_data"
        }
      ]
    }
  ]
}