{
  "Selected_candidate": {
    "pr_number": 1361,
    "pr_title": "Always percent-encode location headers.",
    "pr_body": "If we don't, stupid servers that don't do it themselves can hurt us badly.\n\nResolves #1360.\n",
    "issue_id": 1360,
    "issue_title": "Problem with 301-redirect",
    "issue_body": "I have a problem with downloading a file, when the server returns the 301 code:\n\n``` python\nr = requests.get('http://servag.rupsy.ru/track.php?url=2013-04-11/Haatzumaniak_-_Dark_Sadhu_[2013]/03.%20Haatzumaniak_-_777_Dark%20Sadhu%202013.mp3&arj=Haatzumaniak_-_Dark_Sadhu_[2013].ZIP')\nprint(r)\n```\n\nIf you do redirect manually - it works fine:\n\n``` python\nr = requests.get('http://servag.rupsy.ru/track.php?url=2013-04-11/Haatzumaniak_-_Dark_Sadhu_[2013]/03.%20Haatzumaniak_-_777_Dark%20Sadhu%202013.mp3&arj=Haatzumaniak_-_Dark_Sadhu_[2013].ZIP', allow_redirects=False)\nr = requests.get(r.headers['Location'])\nprint(r)\n```\n\nThe problem appears to Python 3.3 and python-requests-1.2.0/python-requests-dev\n\n```\nTraceback (most recent call last):\n  File \"C:\\Dropbox\\projects\\Helpers\\ph\\.envwin32\\lib\\site-packages\\requests\\packages\\urllib3\\connectionpool.py\", line 428, in urlopen\n    body=body, headers=headers)\n  File \"C:\\Dropbox\\projects\\Helpers\\ph\\.envwin32\\lib\\site-packages\\requests\\packages\\urllib3\\connectionpool.py\", line 290, in _make_request\n    httplib_response = conn.getresponse()\n  File \"c:\\Python33\\lib\\http\\client.py\", line 1143, in getresponse\n    response.begin()\n  File \"c:\\Python33\\lib\\http\\client.py\", line 354, in begin\n    version, status, reason = self._read_status()\n  File \"c:\\Python33\\lib\\http\\client.py\", line 324, in _read_status\n    raise BadStatusLine(line)\nhttp.client.BadStatusLine: ''\n\nDuring handling of the above exception, another exception occurred:\n\nTraceback (most recent call last):\n  File \"C:\\Dropbox\\projects\\Helpers\\ph\\.envwin32\\lib\\site-packages\\requests\\adapters.py\", line 283, in send\n    timeout=timeout\n  File \"C:\\Dropbox\\projects\\Helpers\\ph\\.envwin32\\lib\\site-packages\\requests\\packages\\urllib3\\connectionpool.py\", line 474, in urlopen\n    raise MaxRetryError(self, url, e)\nrequests.packages.urllib3.exceptions.MaxRetryError: HTTPConnectionPool(host='servag.rupsy.ru', port=80): Max retries exceeded with url: /media/2/Haatzumaniak_-_Dark_Sadhu_[2013]/03. Haatzumaniak_-_777_Dark Sadhu 2013.mp3 (Caused by <class 'http.client.BadStatusLine'>: '')\n\nDuring handling of the above exception, another exception occurred:\n\nTraceback (most recent call last):\n  File \"C:/Dropbox/projects/InfoStreams/test.py\", line 25, in <module>\n    allow_redirects=True)\n  File \"C:\\Dropbox\\projects\\Helpers\\ph\\.envwin32\\lib\\site-packages\\requests\\api.py\", line 55, in get\n    return request('get', url, **kwargs)\n  File \"C:\\Dropbox\\projects\\Helpers\\ph\\.envwin32\\lib\\site-packages\\requests\\api.py\", line 44, in request\n    return session.request(method=method, url=url, **kwargs)\n  File \"C:\\Dropbox\\projects\\Helpers\\ph\\.envwin32\\lib\\site-packages\\requests\\sessions.py\", line 345, in request\n    resp = self.send(prep, **send_kwargs)\n  File \"C:\\Dropbox\\projects\\Helpers\\ph\\.envwin32\\lib\\site-packages\\requests\\sessions.py\", line 464, in send\n    history = [resp for resp in gen] if allow_redirects else []\n  File \"C:\\Dropbox\\projects\\Helpers\\ph\\.envwin32\\lib\\site-packages\\requests\\sessions.py\", line 464, in <listcomp>\n    history = [resp for resp in gen] if allow_redirects else []\n  File \"C:\\Dropbox\\projects\\Helpers\\ph\\.envwin32\\lib\\site-packages\\requests\\sessions.py\", line 154, in resolve_redirects\n    allow_redirects=False,\n  File \"C:\\Dropbox\\projects\\Helpers\\ph\\.envwin32\\lib\\site-packages\\requests\\sessions.py\", line 448, in send\n    r = adapter.send(request, **kwargs)\n  File \"C:\\Dropbox\\projects\\Helpers\\ph\\.envwin32\\lib\\site-packages\\requests\\adapters.py\", line 318, in send\n    raise ConnectionError(e)\nrequests.exceptions.ConnectionError: HTTPConnectionPool(host='servag.rupsy.ru', port=80): Max retries exceeded with url: /media/2/Haatzumaniak_-_Dark_Sadhu_[2013]/03. Haatzumaniak_-_777_Dark Sadhu 2013.mp3 (Caused by <class 'http.client.BadStatusLine'>: '')\n```\n",
    "issue_closed_at": "2013-05-20T20:04:08Z",
    "base_commit": "eacb91afb1c5dee8a6fcdadc9473463008e791c6",
    "changes": [
      {
        "file": "requests/sessions.py",
        "type": "function",
        "name": "resolve_redirects",
        "class_name": "SessionRedirectMixin",
        "code": "def resolve_redirects(self, resp, req, stream=False, timeout=None,\n                          verify=True, cert=None, proxies=None):\n        \"\"\"Receives a Response. Returns a generator of Responses.\"\"\"\n\n        i = 0\n        prepared_request = PreparedRequest()\n        prepared_request.body = req.body\n        prepared_request.headers = req.headers.copy()\n        prepared_request.hooks = req.hooks\n        prepared_request.method = req.method\n        prepared_request.url = req.url\n\n        # ((resp.status_code is codes.see_other))\n        while (('location' in resp.headers and resp.status_code in REDIRECT_STATI)):\n\n            resp.content  # Consume socket so it can be released\n\n            if i >= self.max_redirects:\n                raise TooManyRedirects('Exceeded %s redirects.' % self.max_redirects)\n\n            # Release the connection back into the pool.\n            resp.close()\n\n            url = resp.headers['location']\n            method = prepared_request.method\n\n            # Handle redirection without scheme (see: RFC 1808 Section 4)\n            if url.startswith('//'):\n                parsed_rurl = urlparse(resp.url)\n                url = '%s:%s' % (parsed_rurl.scheme, url)\n\n            # Facilitate non-RFC2616-compliant 'location' headers\n            # (e.g. '/path/to/resource' instead of 'http://domain.tld/path/to/resource')\n            if not urlparse(url).netloc:\n                # Compliant with RFC3986, we percent encode the url.\n                url = urljoin(resp.url, requote_uri(url))\n\n            prepared_request.url = url\n\n            # http://www.w3.org/Protocols/rfc2616/rfc2616-sec10.html#sec10.3.4\n            if (resp.status_code == codes.see_other and\n                    prepared_request.method != 'HEAD'):\n                method = 'GET'\n\n            # Do what the browsers do, despite standards...\n            if (resp.status_code in (codes.moved, codes.found) and\n                    prepared_request.method not in ('GET', 'HEAD')):\n                method = 'GET'\n\n            prepared_request.method = method\n\n            # https://github.com/kennethreitz/requests/issues/1084\n            if resp.status_code not in (codes.temporary, codes.resume):\n                if 'Content-Length' in prepared_request.headers:\n                    del prepared_request.headers['Content-Length']\n\n                prepared_request.body = None\n\n            headers = prepared_request.headers\n            try:\n                del headers['Cookie']\n            except KeyError:\n                pass\n\n            prepared_request.prepare_cookies(self.cookies)\n\n            resp = self.send(\n                prepared_request,\n                stream=stream,\n                timeout=timeout,\n                verify=verify,\n                cert=cert,\n                proxies=proxies,\n                allow_redirects=False,\n            )\n\n            extract_cookies_to_jar(self.cookies, prepared_request, resp.raw)\n\n            i += 1\n            yield resp"
      }
    ]
  },
  "Justification": "Candidate D discusses problems related to HTTP 301 redirects, which could provide insight into the unexpected behavior of content retrieval in the CURRENT bug report. Both involve issues with handling responses in the requests library, making the patch strategy relevant. Analyzing how Candidate D fixed redirect problems could be beneficial for resolving the confusion around `iter_content` and `text` responses in the CURRENT bug report.",
  "instance_id": "psf__requests-3362",
  "repo": "psf/requests",
  "created_at": "2016-06-24T13:31:31Z",
  "problem_statement": "Uncertain about content/text vs iter_content(decode_unicode=True/False)\nWhen requesting an application/json document, I'm seeing `next(r.iter_content(16*1024, decode_unicode=True))` returning bytes, whereas `r.text` returns unicode. My understanding was that both should return a unicode object. In essence, I thought \"iter_content\" was equivalent to \"iter_text\" when decode_unicode was True. Have I misunderstood something? I can provide an example if needed.\n\nFor reference, I'm using python 3.5.1 and requests 2.10.0.\n\nThanks!\n\n",
  "patch": "diff --git a/requests/utils.py b/requests/utils.py\n--- a/requests/utils.py\n+++ b/requests/utils.py\n@@ -358,13 +358,20 @@ def get_encoding_from_headers(headers):\n \n def stream_decode_response_unicode(iterator, r):\n     \"\"\"Stream decodes a iterator.\"\"\"\n+    encoding = r.encoding\n \n-    if r.encoding is None:\n-        for item in iterator:\n-            yield item\n-        return\n+    if encoding is None:\n+        encoding = r.apparent_encoding\n+\n+    try:\n+        decoder = codecs.getincrementaldecoder(encoding)(errors='replace')\n+    except (LookupError, TypeError):\n+        # A LookupError is raised if the encoding was not found which could\n+        # indicate a misspelling or similar mistake.\n+        #\n+        # A TypeError can be raised if encoding is None\n+        raise UnicodeError(\"Unable to decode contents with encoding %s.\" % encoding)\n \n-    decoder = codecs.getincrementaldecoder(r.encoding)(errors='replace')\n     for chunk in iterator:\n         rv = decoder.decode(chunk)\n         if rv:\n"
}