{
  "Selected_candidate": {
    "pr_number": 1339,
    "pr_title": "Rewrite CaseInsensitiveDict to work correctly/sanely",
    "pr_body": "Fixes #649 and #1329 by making Session.headers a CaseInsensitiveDict,\nand fixing the implementation of CID. Credit for the brilliant idea\nto map `lowercased_key -> (cased_key, mapped_value)` goes to\n@gazpachoking, thanks a bunch.\n\nChanges from original implementation of CaseInsensitiveDict:\n1.  CID is rewritten as a subclass of `collections.MutableMapping`.\n2.  CID remembers the case of the last-set key, but `__setitem__`\n   and `__delitem__` will handle keys without respect to case.\n3.  CID returns the key case as remembered for the `keys`, `items`,\n   and `__iter__` methods.\n4.  Query operations (`__getitem__` and `__contains__`) are done in\n   a case-insensitive manner: `cid['foo']` and `cid['FOO']` will\n   return the same value.\n5.  The constructor as well as `update` and `__eq__` have undefined\n   behavior when given multiple keys that have the same `lower()`.\n6.  The new method `lower_items` is like `iteritems`, but keys are\n   all lowercased.\n7.  CID raises `KeyError` for `__getitem__` as normal dicts do. The\n   old implementation returned `None`\n8.  The `__repr__` now makes it obvious that it's not a normal dict.\n\nSee PR #1333 for the discussions that lead up to this implementation\n",
    "issue_id": 649,
    "issue_title": "CaseInsensitiveDict __setitem__ faulty for differing-case keys",
    "issue_body": "I think I figured out the cause of the issue described here: https://github.com/kennethreitz/requests/pull/59\n\nIt appears that if one isn't careful to use a common case convention when assigning a header after it has been assigned before, the header will fail to work. The request will use the old header value... I think I know why, too. Here's an example:\n\ntest.py\n\n``` python\n\nfrom requests.structures import CaseInsensitiveDict\n\nprint \"Initializing CID = CaseInsensitiveDict()\"\nCID = CaseInsensitiveDict()\n\nprint \"Setting 'spam': 'eggs'\"\nCID['spam'] = 'eggs'\nprint \"Setting 'Spam': 'Eggs'\"\nCID['Spam'] = 'Eggs'\nprint \"Setting 'sPAM': 'eGGS'\"\nCID['sPAM'] = 'eGGS'\n\nprint \"Contents of CID:\", CID\nprint \"CID['spam']: '%s'\" % CID['spam']\nprint \"CID['Spam']: '%s'\" % CID['Spam']\nprint \"CID['sPAM']: '%s'\" % CID['sPAM']\n\nprint \"\\n\\n\\n\"\n\nprint \"Initializing CID = CaseInsensitiveDict()\"\nCID = CaseInsensitiveDict()\n\nprint \"Setting 'sPAM': 'eGGS'\"\nCID['sPAM'] = 'eGGS'\nprint \"Setting 'Spam': 'Eggs'\"\nCID['Spam'] = 'Eggs'\nprint \"Setting 'spam': 'eggs'\"\nCID['spam'] = 'eggs'\n\nprint \"Contents of CID:\", CID\nprint \"CID['spam']: '%s'\" % CID['spam']\nprint \"CID['Spam']: '%s'\" % CID['Spam']\nprint \"CID['sPAM']: '%s'\" % CID['sPAM']\n```\n\nThe output:\n\n```\nInitializing CID = CaseInsensitiveDict()\nSetting 'spam': 'eggs'\nSetting 'Spam': 'Eggs'\nSetting 'sPAM': 'eGGS'\nContents of CID: {'sPAM': 'eGGS', 'Spam': 'Eggs', 'spam': 'eggs'}\nCID['spam']: 'eggs'\nCID['Spam']: 'eggs'\nCID['sPAM']: 'eggs'\n\nInitializing CID = CaseInsensitiveDict()\nSetting 'sPAM': 'eGGS'\nSetting 'Spam': 'Eggs'\nSetting 'spam': 'eggs'\nContents of CID: {'spam': 'eggs', 'Spam': 'Eggs', 'sPAM': 'eGGS'}\nCID['spam']: 'eGGS'\nCID['Spam']: 'eGGS'\nCID['sPAM']: 'eGGS'\n```\n\nNotice how in both passes, only the first assignment actually \"works\". It stores the data for all three assignments, but will only return the value set by the initial key.\n\nI believe the only change necessary would be to the **setitem** method on CaseInsensitiveDict:\n\nBroken:\n\n``` python\n    def __setitem__(self, key, value):\n        dict.__setitem__(self, key, value)\n        self._clear_lower_keys()\n```\n\nFixed:\n\n``` python\n    def __setitem__(self, key, value):\n        dict.__setitem__(self, key.lower(), value)\n        self._clear_lower_keys()\n```\n\nWhen I use the corrected module, the CaseInsensitiveDict behaves as I believe it should:\n\n```\nInitializing CID = CaseInsensitiveDict()\nSetting 'spam': 'eggs'\nSetting 'Spam': 'Eggs'\nSetting 'sPAM': 'eGGS'\nContents of CID: {'spam': 'eGGS'}\nCID['spam']: 'eGGS'\nCID['Spam']: 'eGGS'\nCID['sPAM']: 'eGGS'\n\nInitializing CID = CaseInsensitiveDict()\nSetting 'sPAM': 'eGGS'\nSetting 'Spam': 'Eggs'\nSetting 'spam': 'eggs'\nContents of CID: {'spam': 'eggs'}\nCID['spam']: 'eggs'\nCID['Spam']: 'eggs'\nCID['sPAM']: 'eggs'\n```\n\nNote that the value for that key is the last one assigned, instead of the first.\n\nIf needed, I can put together a pull request for this, but it might take some time since I've never done it before :)\n",
    "issue_closed_at": "2012-07-27T05:54:43Z",
    "base_commit": "ab36f3cc6f91f1e4903a3f4b31501cb7fefe4555",
    "changes": [
      {
        "file": "requests/structures.py",
        "type": "line",
        "name": "line 9",
        "code": "\"\"\"\n\nimport os\nfrom itertools import islice\n\n"
      },
      {
        "file": "requests/structures.py",
        "type": "function",
        "name": "read",
        "class_name": "IteratorProxy",
        "code": "def read(self, n):\n        return \"\".join(islice(self.i, None, n))"
      },
      {
        "file": "requests/utils.py",
        "type": "line",
        "name": "line 23",
        "code": "from .compat import parse_http_list as _parse_list_header\nfrom .compat import quote, urlparse, bytes, str, OrderedDict, urlunparse\nfrom .cookies import RequestsCookieJar, cookiejar_from_dict\n\n_hush_pyflakes = (RequestsCookieJar,)\n"
      },
      {
        "file": "requests/utils.py",
        "type": "function",
        "name": "default_user_agent",
        "class_name": null,
        "code": "def default_user_agent():\n    \"\"\"Return a string representing the default user agent.\"\"\"\n    _implementation = platform.python_implementation()\n\n    if _implementation == 'CPython':\n        _implementation_version = platform.python_version()\n    elif _implementation == 'PyPy':\n        _implementation_version = '%s.%s.%s' % (sys.pypy_version_info.major,\n                                                sys.pypy_version_info.minor,\n                                                sys.pypy_version_info.micro)\n        if sys.pypy_version_info.releaselevel != 'final':\n            _implementation_version = ''.join([_implementation_version, sys.pypy_version_info.releaselevel])\n    elif _implementation == 'Jython':\n        _implementation_version = platform.python_version()  # Complete Guess\n    elif _implementation == 'IronPython':\n        _implementation_version = platform.python_version()  # Complete Guess\n    else:\n        _implementation_version = 'Unknown'\n\n    try:\n        p_system = platform.system()\n        p_release = platform.release()\n    except IOError:\n        p_system = 'Unknown'\n        p_release = 'Unknown'\n\n    return \" \".join(['python-requests/%s' % __version__,\n                     '%s/%s' % (_implementation, _implementation_version),\n                     '%s/%s' % (p_system, p_release)])"
      }
    ]
  },
  "Justification": "Candidate A is the most useful report due to its focus on the CaseInsensitiveDict, which shares similar structural and component characteristics with the hooks implementation in the CURRENT bug report. Both involve parsing input parameters and interacting with data structures, making it relevant for understanding how to modify the method behaviors to correctly handle multiple hooks. While it primarily addresses deviations in key handling, the insights into parameter processing and the potential error introduction through incorrect usage patterns can shed light on fixing the list handling in the hooks argument. Thus, the adjustments and understanding of dictionary behavior from Candidate A can guide improvements in the Request class's hook processing.",
  "instance_id": "psf__requests-863",
  "repo": "psf/requests",
  "created_at": "2012-09-20T15:48:00Z",
  "problem_statement": "Allow lists in the dict values of the hooks argument\nCurrently the Request class has a .register_hook() method but it parses the dictionary it expects from it's hooks argument weirdly: the argument can only specify one hook function per hook.  If you pass in a list of hook functions per hook the code in Request.**init**() will wrap the list in a list which then fails when the hooks are consumed (since a list is not callable).  This is especially annoying since you can not use multiple hooks from a session.  The only way to get multiple hooks now is to create the request object without sending it, then call .register_hook() multiple times and then finally call .send().\n\nThis would all be much easier if Request.**init**() parsed the hooks parameter in a way that it accepts lists as it's values.\n\n",
  "patch": "diff --git a/requests/models.py b/requests/models.py\n--- a/requests/models.py\n+++ b/requests/models.py\n@@ -462,8 +462,10 @@ def path_url(self):\n \n     def register_hook(self, event, hook):\n         \"\"\"Properly register a hook.\"\"\"\n-\n-        self.hooks[event].append(hook)\n+        if isinstance(hook, (list, tuple, set)):\n+            self.hooks[event].extend(hook)\n+        else:\n+            self.hooks[event].append(hook)\n \n     def deregister_hook(self, event, hook):\n         \"\"\"Deregister a previously registered hook.\n"
}