{
  "original_problem": {
    "instance_id": "scikit-learn__scikit-learn-25500",
    "repo": "scikit-learn/scikit-learn",
    "created_at": "2023-01-27T19:49:28Z",
    "problem_statement": "CalibratedClassifierCV doesn't work with `set_config(transform_output=\"pandas\")`\n### Describe the bug\r\n\r\nCalibratedClassifierCV with isotonic regression doesn't work when we previously set `set_config(transform_output=\"pandas\")`.\r\nThe IsotonicRegression seems to return a dataframe, which is a problem for `_CalibratedClassifier`  in `predict_proba` where it tries to put the dataframe in a numpy array row `proba[:, class_idx] = calibrator.predict(this_pred)`.\r\n\r\n### Steps/Code to Reproduce\r\n\r\n```python\r\nimport numpy as np\r\nfrom sklearn import set_config\r\nfrom sklearn.calibration import CalibratedClassifierCV\r\nfrom sklearn.linear_model import SGDClassifier\r\n\r\nset_config(transform_output=\"pandas\")\r\nmodel = CalibratedClassifierCV(SGDClassifier(), method='isotonic')\r\nmodel.fit(np.arange(90).reshape(30, -1), np.arange(30) % 2)\r\nmodel.predict(np.arange(90).reshape(30, -1))\r\n```\r\n\r\n### Expected Results\r\n\r\nIt should not crash.\r\n\r\n### Actual Results\r\n\r\n```\r\n../core/model_trainer.py:306: in train_model\r\n    cv_predictions = cross_val_predict(pipeline,\r\n../../../../.anaconda3/envs/strategy-training/lib/python3.9/site-packages/sklearn/model_selection/_validation.py:968: in cross_val_predict\r\n    predictions = parallel(\r\n../../../../.anaconda3/envs/strategy-training/lib/python3.9/site-packages/joblib/parallel.py:1085: in __call__\r\n    if self.dispatch_one_batch(iterator):\r\n../../../../.anaconda3/envs/strategy-training/lib/python3.9/site-packages/joblib/parallel.py:901: in dispatch_one_batch\r\n    self._dispatch(tasks)\r\n../../../../.anaconda3/envs/strategy-training/lib/python3.9/site-packages/joblib/parallel.py:819: in _dispatch\r\n    job = self._backend.apply_async(batch, callback=cb)\r\n../../../../.anaconda3/envs/strategy-training/lib/python3.9/site-packages/joblib/_parallel_backends.py:208: in apply_async\r\n    result = ImmediateResult(func)\r\n../../../../.anaconda3/envs/strategy-training/lib/python3.9/site-packages/joblib/_parallel_backends.py:597: in __init__\r\n    self.results = batch()\r\n../../../../.anaconda3/envs/strategy-training/lib/python3.9/site-packages/joblib/parallel.py:288: in __call__\r\n    return [func(*args, **kwargs)\r\n../../../../.anaconda3/envs/strategy-training/lib/python3.9/site-packages/joblib/parallel.py:288: in <listcomp>\r\n    return [func(*args, **kwargs)\r\n../../../../.anaconda3/envs/strategy-training/lib/python3.9/site-packages/sklearn/utils/fixes.py:117: in __call__\r\n    return self.function(*args, **kwargs)\r\n../../../../.anaconda3/envs/strategy-training/lib/python3.9/site-packages/sklearn/model_selection/_validation.py:1052: in _fit_and_predict\r\n    predictions = func(X_test)\r\n../../../../.anaconda3/envs/strategy-training/lib/python3.9/site-packages/sklearn/pipeline.py:548: in predict_proba\r\n    return self.steps[-1][1].predict_proba(Xt, **predict_proba_params)\r\n../../../../.anaconda3/envs/strategy-training/lib/python3.9/site-packages/sklearn/calibration.py:477: in predict_proba\r\n    proba = calibrated_classifier.predict_proba(X)\r\n../../../../.anaconda3/envs/strategy-training/lib/python3.9/site-packages/sklearn/calibration.py:764: in predict_proba\r\n    proba[:, class_idx] = calibrator.predict(this_pred)\r\nE   ValueError: could not broadcast input array from shape (20,1) into shape (20,)\r\n```\r\n\r\n### Versions\r\n\r\n```shell\r\nSystem:\r\n    python: 3.9.15 (main, Nov 24 2022, 14:31:59)  [GCC 11.2.0]\r\nexecutable: /home/philippe/.anaconda3/envs/strategy-training/bin/python\r\n   machine: Linux-5.15.0-57-generic-x86_64-with-glibc2.31\r\n\r\nPython dependencies:\r\n      sklearn: 1.2.0\r\n          pip: 22.2.2\r\n   setuptools: 62.3.2\r\n        numpy: 1.23.5\r\n        scipy: 1.9.3\r\n       Cython: None\r\n       pandas: 1.4.1\r\n   matplotlib: 3.6.3\r\n       joblib: 1.2.0\r\nthreadpoolctl: 3.1.0\r\n\r\nBuilt with OpenMP: True\r\n\r\nthreadpoolctl info:\r\n       user_api: openmp\r\n   internal_api: openmp\r\n         prefix: libgomp\r\n       filepath: /home/philippe/.anaconda3/envs/strategy-training/lib/python3.9/site-packages/scikit_learn.libs/libgomp-a34b3233.so.1.0.0\r\n        version: None\r\n    num_threads: 12\r\n\r\n       user_api: blas\r\n   internal_api: openblas\r\n         prefix: libopenblas\r\n       filepath: /home/philippe/.anaconda3/envs/strategy-training/lib/python3.9/site-packages/numpy.libs/libopenblas64_p-r0-742d56dc.3.20.so\r\n        version: 0.3.20\r\nthreading_layer: pthreads\r\n   architecture: Haswell\r\n    num_threads: 12\r\n\r\n       user_api: blas\r\n   internal_api: openblas\r\n         prefix: libopenblas\r\n       filepath: /home/philippe/.anaconda3/envs/strategy-training/lib/python3.9/site-packages/scipy.libs/libopenblasp-r0-41284840.3.18.so\r\n        version: 0.3.18\r\nthreading_layer: pthreads\r\n   architecture: Haswell\r\n    num_threads: 12\r\n```\r\n\n",
    "patch": "diff --git a/sklearn/isotonic.py b/sklearn/isotonic.py\n--- a/sklearn/isotonic.py\n+++ b/sklearn/isotonic.py\n@@ -360,23 +360,16 @@ def fit(self, X, y, sample_weight=None):\n         self._build_f(X, y)\n         return self\n \n-    def transform(self, T):\n-        \"\"\"Transform new data by linear interpolation.\n-\n-        Parameters\n-        ----------\n-        T : array-like of shape (n_samples,) or (n_samples, 1)\n-            Data to transform.\n+    def _transform(self, T):\n+        \"\"\"`_transform` is called by both `transform` and `predict` methods.\n \n-            .. versionchanged:: 0.24\n-               Also accepts 2d array with 1 feature.\n+        Since `transform` is wrapped to output arrays of specific types (e.g.\n+        NumPy arrays, pandas DataFrame), we cannot make `predict` call `transform`\n+        directly.\n \n-        Returns\n-        -------\n-        y_pred : ndarray of shape (n_samples,)\n-            The transformed data.\n+        The above behaviour could be changed in the future, if we decide to output\n+        other type of arrays when calling `predict`.\n         \"\"\"\n-\n         if hasattr(self, \"X_thresholds_\"):\n             dtype = self.X_thresholds_.dtype\n         else:\n@@ -397,6 +390,24 @@ def transform(self, T):\n \n         return res\n \n+    def transform(self, T):\n+        \"\"\"Transform new data by linear interpolation.\n+\n+        Parameters\n+        ----------\n+        T : array-like of shape (n_samples,) or (n_samples, 1)\n+            Data to transform.\n+\n+            .. versionchanged:: 0.24\n+               Also accepts 2d array with 1 feature.\n+\n+        Returns\n+        -------\n+        y_pred : ndarray of shape (n_samples,)\n+            The transformed data.\n+        \"\"\"\n+        return self._transform(T)\n+\n     def predict(self, T):\n         \"\"\"Predict new data by linear interpolation.\n \n@@ -410,7 +421,7 @@ def predict(self, T):\n         y_pred : ndarray of shape (n_samples,)\n             Transformed data.\n         \"\"\"\n-        return self.transform(T)\n+        return self._transform(T)\n \n     # We implement get_feature_names_out here instead of using\n     # `ClassNamePrefixFeaturesOutMixin`` because `input_features` are ignored.\n"
  },
  "candidates_evaluated": 5,
  "judgment_result": {
    "candidates": [
      {
        "idx": 1,
        "id": "similar_23255",
        "decision": "Not useful",
        "confidence": "Medium",
        "reason": "The issue involves validation data selection, which is unrelated to the data type handling problem in the current issue."
      },
      {
        "idx": 2,
        "id": "similar_6848",
        "decision": "Not useful",
        "confidence": "Medium",
        "reason": "The issue is about an incorrect constant addition in probability computation, not related to data type handling or configuration settings."
      },
      {
        "idx": 3,
        "id": "similar_19637",
        "decision": "Useful",
        "confidence": "High",
        "reason": "Both issues involve `CalibratedClassifierCV` and address integration problems with other components, suggesting similar debugging strategies."
      },
      {
        "idx": 4,
        "id": "similar_19878",
        "decision": "Not useful",
        "confidence": "Medium",
        "reason": "The issue is about method input type support, which is not directly related to the configuration-induced data type handling problem."
      },
      {
        "idx": 5,
        "id": "similar_8933",
        "decision": "Not useful",
        "confidence": "Medium",
        "reason": "The issue involves OOB score calculation affected by label values, unrelated to data type handling or configuration settings."
      }
    ]
  },
  "raw_summaries": [
    {
      "similar_issue": {
        "issue_title": "BUG SGDRegressor only using the first two samples for validation error calculation",
        "issue_body": "### Describe the bug\r\n\r\n`BaseSGD._make_validation_split` returns a 0-1 integer array as validation mask, which is used to calculate the validation score with:\r\n```python\r\n_ValidationScoreCallback(\r\n            self,\r\n            X[validation_mask],\r\n            y[validation_mask],\r\n            sample_weight[validation_mask],\r\n            classes=classes,\r\n```\r\nhere `X[validation_mask]` is just repeating the first two samples as shown in the following code. A quick fix would be convert `validation_mask` to a boolean array.\r\n\r\n### Steps/Code to Reproduce\r\n\r\nI added a print statement in `_ValidationScoreCallback` to print out the shape of validation data\r\n```python\r\nclass _ValidationScoreCallback:\r\n    \"\"\"Callback for early stopping based on validation score\"\"\"\r\n\r\n    def __init__(self, estimator, X_val, y_val, sample_weight_val, classes=None):\r\n        self.estimator = clone(estimator)\r\n        self.estimator.t_ = 1  # to pass check_is_fitted\r\n        if classes is not None:\r\n            self.estimator.classes_ = classes\r\n        self.X_val = X_val\r\n        self.y_val = y_val\r\n        self.sample_weight_val = sample_weight_val\r\n        print(X_val.shape, y_val.shape)\r\n```\r\nand afterwards running the following code\r\n```python\r\nimport numpy as np\r\nfrom sklearn.linear_model import SGDRegressor\r\n\r\nX = np.random.randn(1000, 5)\r\ny = np.random.randn(1000)\r\n\r\nsgd = SGDRegressor(early_stopping=True, max_iter=1, validation_fraction=0.1)\r\nsgd.fit(X, y)\r\n```\r\n\r\n### Expected Results\r\n\r\nThe size of validation data should be `(100,5), (100, )`.\r\n\r\n### Actual Results\r\n\r\nprints out `(1000, 5), (1000,)`\r\n\r\n### Versions\r\n\r\n```shell\r\nSystem:\r\n    python: 3.8.8 (default, Apr 13 2021, 12:59:45)  [Clang 10.0.0 ]\r\nexecutable: /Users/mac/Documents/GitHub/.ENV/bin/python\r\n   machine: macOS-10.16-x86_64-i386-64bit\r\n\r\nPython dependencies:\r\n      sklearn: 1.2.dev0\r\n          pip: 20.2.3\r\n   setuptools: 60.9.3\r\n        numpy: 1.21.2\r\n        scipy: 1.8.0\r\n       Cython: 0.29.28\r\n       pandas: 1.3.3\r\n   matplotlib: 3.5.1\r\n       joblib: 1.1.0\r\nthreadpoolctl: 3.0.0\r\n\r\nBuilt with OpenMP: False\r\n```\r\n",
        "issue_id": 23255,
        "pr_number": 23256,
        "pr_title": "FIX SGDRegreesor and SGDClassifier use correct number of validation data",
        "pr_body": "<!--\r\nThanks for contributing a pull request! Please ensure you have taken a look at\r\nthe contribution guidelines: https://github.com/scikit-learn/scikit-learn/blob/main/CONTRIBUTING.md\r\n-->\r\n\r\n#### Reference Issues/PRs\r\n<!--\r\nExample: Fixes #1234. See also #3456.\r\nPlease use keywords (e.g., Fixes) to create link to the issues or pull requests\r\nyou resolved, so that they will automatically be closed when your pull request\r\nis merged. See https://github.com/blog/1506-closing-issues-via-pull-requests\r\n-->\r\nFixes #23255.\r\n\r\n#### What does this implement/fix? Explain your changes.\r\n- changes the `validation_mask` to be a boolean array for correct indexing.\r\n- fixes a failing test case.\r\n\r\n#### Any other comments?\r\nI haven't added any test cases because I'm not quite sure how to test this behavior without directly using internal api like checking against `_ValidationScoreCallback.X_val.shape`.\r\n\r\n\r\n\r\n<!--\r\nPlease be aware that we are a loose team of volunteers so patience is\r\nnecessary; assistance handling other issues is very welcome. We value\r\nall user contributions, no matter how minor they are. If we are slow to\r\nreview, either the pull request needs some benchmarking, tinkering,\r\nconvincing, etc. or more likely the reviewers are simply busy. In either\r\ncase, we ask for your understanding during the review process.\r\nFor more information, see our FAQ on this topic:\r\nhttp://scikit-learn.org/dev/faq.html#why-is-my-pull-request-not-getting-any-attention.\r\n\r\nThanks for contributing!\r\n-->\r\n",
        "issue_closed_at": "2022-05-02T11:41:03Z",
        "base_commit": "920ab2508fe634ad32df68bb0ebd4f4512fbfb53"
      },
      "summary": "### Summary:\nThis issue pertains to a bug in the `SGDRegressor` model from the `sklearn` library, specifically related to the calculation of validation error during the early stopping process. The problem is rooted in the `_make_validation_split` method of the `BaseSGD` class, which incorrectly generates a validation mask as a 0-1 integer array, resulting in the misuse of validation data. Instead of selecting the appropriate subset for validation, the method mistakenly repeats the first two samples of the training data, leading to inaccurate validation scores.\n\nKey symptoms include the unexpected shapes of validation data, as observed through a debugging step that printed the shape of the validation dataset. The expected validation data size, based on a specified validation fraction, should comprise 10% of the total dataset, but the output indicates that the entire training set is being used instead.\n\nThe affected component is the `SGDRegressor` model in the `sklearn` package, particularly its early stopping functionality, which relies on correct validation data to decide if training should halt early for performance optimization.\n\nThe potential impact of this bug includes incorrect model evaluation during training, which can lead to suboptimal model performance and misleading results. The severity is notable as it affects the reliability of the early stopping mechanism, a critical feature for model efficiency and performance.\n\nRelevant technical details include the need to convert the validation mask into a boolean array for proper indexing of the validation samples. This adjustment ensures accurate validation data selection, aligning with the intended design of the early stopping functionality.",
      "prompt_used": "You are an expert in software issue reasoning analysis.\nGiven the following problem report and its fixed code elements, generate a comprehensive summary based on the entire document. Your goal is to abstract the information in the problem description into a more general description.\n\n## Original Issue Report:\nTitle: BUG SGDRegressor only using the first two samples for validation error calculation\n\nBody:\n### Describe the bug\r\n\r\n`BaseSGD._make_validation_split` returns a 0-1 integer array as validation mask, which is used to calculate the validation score with:\r\n```python\r\n_ValidationScoreCallback(\r\n            self,\r\n            X[validation_mask],\r\n            y[validation_mask],\r\n            sample_weight[validation_mask],\r\n            classes=classes,\r\n```\r\nhere `X[validation_mask]` is just repeating the first two samples as shown in the following code. A quick fix would be convert `validation_mask` to a boolean array.\r\n\r\n### Steps/Code to Reproduce\r\n\r\nI added a print statement in `_ValidationScoreCallback` to print out the shape of validation data\r\n```python\r\nclass _ValidationScoreCallback:\r\n    \"\"\"Callback for early stopping based on validation score\"\"\"\r\n\r\n    def __init__(self, estimator, X_val, y_val, sample_weight_val, classes=None):\r\n        self.estimator = clone(estimator)\r\n        self.estimator.t_ = 1  # to pass check_is_fitted\r\n        if classes is not None:\r\n            self.estimator.classes_ = classes\r\n        self.X_val = X_val\r\n        self.y_val = y_val\r\n        self.sample_weight_val = sample_weight_val\r\n        print(X_val.shape, y_val.shape)\r\n```\r\nand afterwards running the following code\r\n```python\r\nimport numpy as np\r\nfrom sklearn.linear_model import SGDRegressor\r\n\r\nX = np.random.randn(1000, 5)\r\ny = np.random.randn(1000)\r\n\r\nsgd = SGDRegressor(early_stopping=True, max_iter=1, validation_fraction=0.1)\r\nsgd.fit(X, y)\r\n```\r\n\r\n### Expected Results\r\n\r\nThe size of validation data should be `(100,5), (100, )`.\r\n\r\n### Actual Results\r\n\r\nprints out `(1000, 5), (1000,)`\r\n\r\n### Versions\r\n\r\n```shell\r\nSystem:\r\n    python: 3.8.8 (default, Apr 13 2021, 12:59:45)  [Clang 10.0.0 ]\r\nexecutable: /Users/mac/Documents/GitHub/.ENV/bin/python\r\n   machine: macOS-10.16-x86_64-i386-64bit\r\n\r\nPython dependencies:\r\n      sklearn: 1.2.dev0\r\n          pip: 20.2.3\r\n   setuptools: 60.9.3\r\n        numpy: 1.21.2\r\n        scipy: 1.8.0\r\n       Cython: 0.29.28\r\n       pandas: 1.3.3\r\n   matplotlib: 3.5.1\r\n       joblib: 1.1.0\r\nthreadpoolctl: 3.0.0\r\n\r\nBuilt with OpenMP: False\r\n```\r\n\n\n## Code elements fixed by the patch:\n{FIXED_CODE_ELEMENTS}\n\nPlease analyze the above issue report and provide a structured summary that includes:\n1. Problem description in general terms\n2. Key symptoms and behaviors observed\n3. Affected components or systems\n4. Potential impact or severity\n5. Any relevant technical details abstracted for broader understanding\n\nPlease return the summary with “### Summary:\", For example:\n### Summary: This issue is ...\n\nChanges Summary:\nsklearn/linear_model/_stochastic_gradient.py\n  function: BaseSGD._make_validation_split\n  function: BaseSGD._make_validation_split\n"
    },
    {
      "similar_issue": {
        "issue_title": "LinearDiscriminantAnalysis predict probability bug",
        "issue_body": "I am pretty confident there is a bug introduced in commit\n7c1101d7c26ba0b77184cce9c0b9be79adb526de\n\nConcretely, line 518 of the current version \nhttps://github.com/scikit-learn/scikit-learn/blob/master/sklearn/discriminant_analysis.py\nshould be removed as it yields wrong results. \n\nThere is no reason why constant 1 should be added to the computed probability after exponentiation and before inversion. \n\nTo verify this, I have run a one-to-one comparison between the outcome of the method and MATLAB's builtin LDA classifier on the Iris dataset. Only after removal of line 518, results match (up to a tolerance).\n\nIf everyone agrees on that, I am happy to submit a PR.\n",
        "issue_id": 6848,
        "pr_number": 11796,
        "pr_title": "[MRG+2] Fix LDA predict_proba() ",
        "pr_body": "<!--\r\nThanks for contributing a pull request! Please ensure you have taken a look at\r\nthe contribution guidelines: https://github.com/scikit-learn/scikit-learn/blob/master/CONTRIBUTING.md#pull-request-checklist\r\n-->\r\n\r\n#### Reference Issues/PRs\r\nFixes #6848\r\ncloses #11727\r\ncloses #5149\r\n<!--\r\nExample: Fixes #1234. See also #3456.\r\nPlease use keywords (e.g., Fixes) to create link to the issues or pull requests\r\nyou resolved, so that they will automatically be closed when your pull request\r\nis merged. See https://github.com/blog/1506-closing-issues-via-pull-requests\r\n-->\r\n\r\n\r\n#### What does this implement/fix? Explain your changes.\r\nFixes the `predict_proba()` method of LinearDiscriminantAnalysis.\r\nAn `if` statement is used to differentiate between the binary and multi-class case, due to the different output format of the `decision_function` method implemented in the `LinearClassifierMixin` class.\r\n\r\n#### Any other comments?\r\nCopying from #6848:\r\nDo we perhaps want to include additional tests checking the output of predict_proba for LDA and QDA both for the binary and multi-class cases?\r\n\r\n<!--\r\nPlease be aware that we are a loose team of volunteers so patience is\r\nnecessary; assistance handling other issues is very welcome. We value\r\nall user contributions, no matter how minor they are. If we are slow to\r\nreview, either the pull request needs some benchmarking, tinkering,\r\nconvincing, etc. or more likely the reviewers are simply busy. In either\r\ncase, we ask for your understanding during the review process.\r\nFor more information, see our FAQ on this topic:\r\nhttp://scikit-learn.org/dev/faq.html#why-is-my-pull-request-not-getting-any-attention.\r\n\r\nThanks for contributing!\r\n-->\r\n",
        "issue_closed_at": "2019-03-07T16:44:18Z",
        "base_commit": "b73a51bcda362d94d8907915a382a8eb403554c8"
      },
      "summary": "### Summary:\nThis issue pertains to a bug identified in the `LinearDiscriminantAnalysis` (LDA) implementation within the Scikit-learn library, specifically affecting the probability prediction functionality. The problem arises from an unnecessary addition of a constant value during the computation of probabilities, which results in inaccurate outcomes. This discrepancy was verified through empirical comparison with MATLAB's built-in LDA classifier, using the Iris dataset as a benchmark. The issue is localized to a specific line of code that, when removed, aligns the results of Scikit-learn's LDA with expected outcomes. The affected component is the `predict_proba` method within the `sklearn.discriminant_analysis` module. The bug impacts the reliability and accuracy of probability predictions made by the LDA classifier, potentially leading to incorrect data analysis interpretations. The report highlights a need for code correction to ensure proper functionality and to maintain consistency with established statistical methods.",
      "prompt_used": "You are an expert in software issue reasoning analysis.\nGiven the following problem report and its fixed code elements, generate a comprehensive summary based on the entire document. Your goal is to abstract the information in the problem description into a more general description.\n\n## Original Issue Report:\nTitle: LinearDiscriminantAnalysis predict probability bug\n\nBody:\nI am pretty confident there is a bug introduced in commit\n7c1101d7c26ba0b77184cce9c0b9be79adb526de\n\nConcretely, line 518 of the current version \nhttps://github.com/scikit-learn/scikit-learn/blob/master/sklearn/discriminant_analysis.py\nshould be removed as it yields wrong results. \n\nThere is no reason why constant 1 should be added to the computed probability after exponentiation and before inversion. \n\nTo verify this, I have run a one-to-one comparison between the outcome of the method and MATLAB's builtin LDA classifier on the Iris dataset. Only after removal of line 518, results match (up to a tolerance).\n\nIf everyone agrees on that, I am happy to submit a PR.\n\n\n## Code elements fixed by the patch:\n{FIXED_CODE_ELEMENTS}\n\nPlease analyze the above issue report and provide a structured summary that includes:\n1. Problem description in general terms\n2. Key symptoms and behaviors observed\n3. Affected components or systems\n4. Potential impact or severity\n5. Any relevant technical details abstracted for broader understanding\n\nPlease return the summary with “### Summary:\", For example:\n### Summary: This issue is ...\n\nChanges Summary:\nsklearn/discriminant_analysis.py\n  line: line 22\n  function: QuadraticDiscriminantAnalysis.predict_proba\n"
    },
    {
      "similar_issue": {
        "issue_title": "CalibratedClassifier on pipelines",
        "issue_body": "Looks like CalibratedClassifierCV doesn't support pipelines, see https://github.com/scikit-learn/scikit-learn/issues/19625 and https://github.com/scikit-learn/scikit-learn/discussions/19279\r\n\r\nWe should try to delegate the validation to the base estimator (https://github.com/scikit-learn/scikit-learn/discussions/19279#discussioncomment-313812)",
        "issue_id": 19637,
        "pr_number": 19641,
        "pr_title": "Fix Calibrated classifier cv predictions with pipeline",
        "pr_body": "<!--\r\nThanks for contributing a pull request! Please ensure you have taken a look at\r\nthe contribution guidelines: https://github.com/scikit-learn/scikit-learn/blob/main/CONTRIBUTING.md\r\n-->\r\n\r\n#### Reference Issues/PRs\r\n<!--\r\nExample: Fixes #1234. See also #3456.\r\nPlease use keywords (e.g., Fixes) to create link to the issues or pull requests\r\nyou resolved, so that they will automatically be closed when your pull request\r\nis merged. See https://github.com/blog/1506-closing-issues-via-pull-requests\r\n-->\r\nFixes #19637, #8710.\r\n\r\n#### What does this implement/fix? Explain your changes.\r\n\r\nAs suggested in #19641, this PR removed validation from the CalibratedClassifierCV predict_proba function and replaced X.shape[0] with _num_samples(X).\r\n\r\nRegression testing has also been added.\r\n\r\n#### Any other comments?\r\n\r\n\r\n<!--\r\nPlease be aware that we are a loose team of volunteers so patience is\r\nnecessary; assistance handling other issues is very welcome. We value\r\nall user contributions, no matter how minor they are. If we are slow to\r\nreview, either the pull request needs some benchmarking, tinkering,\r\nconvincing, etc. or more likely the reviewers are simply busy. In either\r\ncase, we ask for your understanding during the review process.\r\nFor more information, see our FAQ on this topic:\r\nhttp://scikit-learn.org/dev/faq.html#why-is-my-pull-request-not-getting-any-attention.\r\n\r\nThanks for contributing!\r\n-->\r\n",
        "issue_closed_at": "2021-03-10T10:27:24Z",
        "base_commit": "42e90e9ba28fb37c2c9bd3e8aed1ac2387f1d5d5"
      },
      "summary": "### Summary: This issue pertains to the lack of support for integrating the `CalibratedClassifierCV` class within scikit-learn's pipeline framework. The problem arises when attempting to incorporate `CalibratedClassifierCV` into a pipeline, which is a common practice for streamlining machine learning workflows by chaining multiple processing steps together. The key symptoms include an inability to perform validation steps effectively when the `CalibratedClassifierCV` is part of a pipeline, as detailed in ongoing discussions and issues in the scikit-learn repository.\n\nThe affected component is the `CalibratedClassifierCV` class, which is used to calibrate the predicted probabilities of a classifier by fitting it to a validation set. The severity of the issue lies in its potential to hinder users from effectively calibrating classifiers within pipelines, which could lead to suboptimal model performance and less reliable predictions in applied machine learning tasks.\n\nThe technical details involve the suggestion to delegate validation processes to the base estimator, which is part of the pipeline, to resolve the integration issue. This would enable the `CalibratedClassifierCV` to function seamlessly within a pipeline, ensuring that all components of the machine learning workflow operate cohesively. The code changes in `sklearn/calibration.py` specifically address the implementation of probability prediction methods to support this enhanced functionality.",
      "prompt_used": "You are an expert in software issue reasoning analysis.\nGiven the following problem report and its fixed code elements, generate a comprehensive summary based on the entire document. Your goal is to abstract the information in the problem description into a more general description.\n\n## Original Issue Report:\nTitle: CalibratedClassifier on pipelines\n\nBody:\nLooks like CalibratedClassifierCV doesn't support pipelines, see https://github.com/scikit-learn/scikit-learn/issues/19625 and https://github.com/scikit-learn/scikit-learn/discussions/19279\r\n\r\nWe should try to delegate the validation to the base estimator (https://github.com/scikit-learn/scikit-learn/discussions/19279#discussioncomment-313812)\n\n## Code elements fixed by the patch:\n{FIXED_CODE_ELEMENTS}\n\nPlease analyze the above issue report and provide a structured summary that includes:\n1. Problem description in general terms\n2. Key symptoms and behaviors observed\n3. Affected components or systems\n4. Potential impact or severity\n5. Any relevant technical details abstracted for broader understanding\n\nPlease return the summary with “### Summary:\", For example:\n### Summary: This issue is ...\n\nChanges Summary:\nsklearn/calibration.py\n  line: line 24\n  function: _CalibratedClassifier.predict_proba\n  function: _CalibratedClassifier.predict_proba\n  function: _CalibratedClassifier.predict_proba\n"
    },
    {
      "similar_issue": {
        "issue_title": "OrdinalEncoder accept and failed with sparse matrix in inverse_transform",
        "issue_body": "`OrdinalEncoder` was documented to accept sparse matrix in `inverse_transform`.\r\nA check is internally done to accept sparse matrix. However, the `inverse_transform` will fail with this type of data.\r\nWe should remove the support in the check and make sure that we issue the right error.\r\n",
        "issue_id": 19878,
        "pr_number": 19879,
        "pr_title": "FIX Error for sparse matrix in OrdinalEncoder.inverse_transform",
        "pr_body": "closes #19878\r\n\r\n`OrdinalEncoder.inverse_transform` should not support sparse matrix. It was already failing but with an obscure message.\r\nThis PR adds a non-regression test to check for the error message.",
        "issue_closed_at": "2021-04-13T11:59:30Z",
        "base_commit": "767fd63c9ddddc46e288fdec2cca36a129529a8e"
      },
      "summary": "### Summary:\nThis issue pertains to a discrepancy between the documented capabilities of the `OrdinalEncoder` class's `inverse_transform` method and its actual behavior when handling sparse matrices. The problem arises because, although the method was designed to accept sparse matrices, it fails to process them correctly, leading to execution errors. This indicates a misalignment between the code's functionality and its documentation, specifically regarding input data types for the `inverse_transform` method.\n\nKey symptoms observed include failure of the `inverse_transform` method when a sparse matrix is provided as input, despite internal checks suggesting that such data types are supported. This inconsistency necessitates a change in the method to eliminate the acceptance of sparse matrices and ensure that appropriate error messages are displayed to users when unsupported data types are used.\n\nThe affected component is the `OrdinalEncoder` class within the `sklearn.preprocessing` module, particularly its `inverse_transform` method. The potential impact is significant as it affects users relying on the documented ability to handle sparse matrices, potentially leading to runtime errors and incorrect data processing in machine learning pipelines that utilize this functionality.\n\nThe technical resolution involves modifying the method to reflect its actual capabilities by removing support for sparse matrix input and implementing robust error handling to guide users towards providing compatible data formats. This change improves the reliability of the method and aligns its operation with the documented specifications.",
      "prompt_used": "You are an expert in software issue reasoning analysis.\nGiven the following problem report and its fixed code elements, generate a comprehensive summary based on the entire document. Your goal is to abstract the information in the problem description into a more general description.\n\n## Original Issue Report:\nTitle: OrdinalEncoder accept and failed with sparse matrix in inverse_transform\n\nBody:\n`OrdinalEncoder` was documented to accept sparse matrix in `inverse_transform`.\r\nA check is internally done to accept sparse matrix. However, the `inverse_transform` will fail with this type of data.\r\nWe should remove the support in the check and make sure that we issue the right error.\r\n\n\n## Code elements fixed by the patch:\n{FIXED_CODE_ELEMENTS}\n\nPlease analyze the above issue report and provide a structured summary that includes:\n1. Problem description in general terms\n2. Key symptoms and behaviors observed\n3. Affected components or systems\n4. Potential impact or severity\n5. Any relevant technical details abstracted for broader understanding\n\nPlease return the summary with “### Summary:\", For example:\n### Summary: This issue is ...\n\nChanges Summary:\nsklearn/preprocessing/_encoders.py\n  function: OrdinalEncoder.inverse_transform\n  function: OrdinalEncoder.inverse_transform\n"
    },
    {
      "similar_issue": {
        "issue_title": "BUG: BaggingClassifier.oob_score_ should not change with class label",
        "issue_body": "Let us compute the oob score of a bagged classifier.\r\n\r\n```python\r\nimport numpy as np\r\nimport pandas as pd\r\nfrom sklearn.ensemble import BaggingClassifier\r\nfrom sklearn.neighbors import KNeighborsClassifier\r\n\r\nN = 50\r\nrandState = 5\r\nlabel = 'Label'\r\nfeatures = ['A','B','C']\r\n\r\nlabels = np.random.randint(3, size = N) - 1\r\ndf = pd.DataFrame( labels , index=range(N), columns=[label] )\r\nfor col in features:\r\n    df[col] = df[label] + 0.01 * np.random.rand( N )\r\n\r\nclf = BaggingClassifier(base_estimator = KNeighborsClassifier(), n_estimators = 10, oob_score = True, random_state = randState )\r\nclf.fit(df[features], df[label])\r\nprint clf.oob_score_\r\n```\r\n\r\nHere, clf.oob_score_=0.0.\r\n\r\nNow, you would not expect that the OOB accuracy is a function of the class labels...\r\n\r\n```python\r\ndf.loc[ df[label] == -1 , label ] = 2\r\nclf = BaggingClassifier(base_estimator = KNeighborsClassifier(), n_estimators = 10, oob_score = True, random_state = randState )\r\nclf.fit(df[features], df[label])\r\nprint clf.oob_score_\r\n```\r\n\r\nNow, clf.oob_score_=1.0.\r\n\r\nClearly, OOB score should not be a function of the labels arbitrarily chosen for the classes.\r\n\r\nsklearn.__version__: '0.18.1'\r\nnumpy.__version__: '1.11.3'",
        "issue_id": 8933,
        "pr_number": 8936,
        "pr_title": "[MRG+1] fixed OOB_Score bug for bagging classifiers.",
        "pr_body": "Fixes #8933\r\n\r\n<!--\r\nThanks for contributing a pull request! Please ensure you have taken a look at\r\nthe contribution guidelines: https://github.com/scikit-learn/scikit-learn/blob/master/CONTRIBUTING.md#Contributing-Pull-Requests\r\n-->\r\n#### Reference Issue\r\n<!-- Example: Fixes #1234 -->\r\n\r\n\r\n#### What does this implement/fix? Explain your changes.\r\n\r\n\r\n#### Any other comments?\r\n\r\n\r\n<!--\r\nPlease be aware that we are a loose team of volunteers so patience is\r\nnecessary; assistance handling other issues is very welcome. We value\r\nall user contributions, no matter how minor they are. If we are slow to\r\nreview, either the pull request needs some benchmarking, tinkering,\r\nconvincing, etc. or more likely the reviewers are simply busy. In either\r\ncase, we ask for your understanding during the review process.\r\nFor more information, see our FAQ on this topic:\r\nhttp://scikit-learn.org/dev/faq.html#why-is-my-pull-request-not-getting-any-attention.\r\n\r\nThanks for contributing!\r\n-->\r\n",
        "issue_closed_at": "2017-06-08T09:35:49Z",
        "base_commit": "9131f89e6c165fb27dadd37d3168c1ee5ea84f5a"
      },
      "summary": "### Summary:\nThis issue pertains to an unexpected behavior in the `BaggingClassifier` from the `scikit-learn` library, where the out-of-bag (OOB) score varies depending on the numerical labels assigned to the classes. The problem is observed when using the `BaggingClassifier` in combination with `KNeighborsClassifier` as the base estimator. Specifically, the OOB score, which is expected to objectively represent the model's accuracy irrespective of class label assignments, reflects different values when the numeric representation of class labels is altered, indicating a flaw in how the OOB score is calculated or interpreted.\n\nKey symptoms include the OOB score changing from 0.0 to 1.0 solely based on modifying class label values, which highlights a discrepancy in the model's evaluation that should ideally remain invariant under different label representations. This behavior suggests that the OOB score calculation is incorrectly influenced by the arbitrary numeric values assigned to class labels, rather than the underlying class distribution or model performance.\n\nThe primary component affected is the `BaggingClassifier` within the `scikit-learn` library, specifically in relation to its OOB score functionality. This issue may impact users relying on OOB scores for model evaluation, leading to inaccurate assessments of classifier performance. The severity of this issue could be significant for applications that depend on precise model evaluation metrics, potentially skewing results and decision-making processes.\n\nTechnical details indicate that the patch modifies the implementation in `sklearn/ensemble/bagging.py`, specifically the function `BaggingRegressor._set_oob_score`, to correct the OOB score calculation method, ensuring it remains consistent regardless of label representation. This highlights the necessity of ensuring that metric calculations are invariant to transformations of class labels in machine learning models.",
      "prompt_used": "You are an expert in software issue reasoning analysis.\nGiven the following problem report and its fixed code elements, generate a comprehensive summary based on the entire document. Your goal is to abstract the information in the problem description into a more general description.\n\n## Original Issue Report:\nTitle: BUG: BaggingClassifier.oob_score_ should not change with class label\n\nBody:\nLet us compute the oob score of a bagged classifier.\r\n\r\n```python\r\nimport numpy as np\r\nimport pandas as pd\r\nfrom sklearn.ensemble import BaggingClassifier\r\nfrom sklearn.neighbors import KNeighborsClassifier\r\n\r\nN = 50\r\nrandState = 5\r\nlabel = 'Label'\r\nfeatures = ['A','B','C']\r\n\r\nlabels = np.random.randint(3, size = N) - 1\r\ndf = pd.DataFrame( labels , index=range(N), columns=[label] )\r\nfor col in features:\r\n    df[col] = df[label] + 0.01 * np.random.rand( N )\r\n\r\nclf = BaggingClassifier(base_estimator = KNeighborsClassifier(), n_estimators = 10, oob_score = True, random_state = randState )\r\nclf.fit(df[features], df[label])\r\nprint clf.oob_score_\r\n```\r\n\r\nHere, clf.oob_score_=0.0.\r\n\r\nNow, you would not expect that the OOB accuracy is a function of the class labels...\r\n\r\n```python\r\ndf.loc[ df[label] == -1 , label ] = 2\r\nclf = BaggingClassifier(base_estimator = KNeighborsClassifier(), n_estimators = 10, oob_score = True, random_state = randState )\r\nclf.fit(df[features], df[label])\r\nprint clf.oob_score_\r\n```\r\n\r\nNow, clf.oob_score_=1.0.\r\n\r\nClearly, OOB score should not be a function of the labels arbitrarily chosen for the classes.\r\n\r\nsklearn.__version__: '0.18.1'\r\nnumpy.__version__: '1.11.3'\n\n## Code elements fixed by the patch:\n{FIXED_CODE_ELEMENTS}\n\nPlease analyze the above issue report and provide a structured summary that includes:\n1. Problem description in general terms\n2. Key symptoms and behaviors observed\n3. Affected components or systems\n4. Potential impact or severity\n5. Any relevant technical details abstracted for broader understanding\n\nPlease return the summary with “### Summary:\", For example:\n### Summary: This issue is ...\n\nChanges Summary:\nsklearn/ensemble/bagging.py\n  function: BaggingRegressor._set_oob_score\n"
    }
  ]
}