{
  "instance_id": "astropy__astropy-6938",
  "repo": "astropy/astropy",
  "created_at": "2017-12-07T00:01:14Z",
  "problem_statement": "Possible bug in io.fits related to D exponents\nI came across the following code in ``fitsrec.py``:\r\n\r\n```python\r\n        # Replace exponent separator in floating point numbers\r\n        if 'D' in format:\r\n            output_field.replace(encode_ascii('E'), encode_ascii('D'))\r\n```\r\n\r\nI think this may be incorrect because as far as I can tell ``replace`` is not an in-place operation for ``chararray`` (it returns a copy). Commenting out this code doesn't cause any tests to fail so I think this code isn't being tested anyway.\n",
  "patch": "diff --git a/astropy/io/fits/fitsrec.py b/astropy/io/fits/fitsrec.py\n--- a/astropy/io/fits/fitsrec.py\n+++ b/astropy/io/fits/fitsrec.py\n@@ -1261,7 +1261,7 @@ def _scale_back_ascii(self, col_idx, input_field, output_field):\n \n         # Replace exponent separator in floating point numbers\n         if 'D' in format:\n-            output_field.replace(encode_ascii('E'), encode_ascii('D'))\n+            output_field[:] = output_field.replace(b'E', b'D')\n \n \n def _get_recarray_field(array, key):\n", 
    "Selected_candidate": {
        "pr_number": 5589,
        "pr_title": "Fix up lazy loading issues",
        "pr_body": "Fix or address these issues which resulted from lazy loading. In each case the goal is to maintain the current (1.x) behavior of io.fits.\r\n\r\n- [x] Handling of invalid key in the `in` operator (closes #5583)\r\n- [x] Slices not working (closes #5585)\r\n- [x] Variable name incorrect (closes #5594)\r\n- ~~[ ] Address behavior of `close` #5582~~ Won't change unless someone requests it.\r\n\r\n@eteq -- I believe I've milestoned this correctly, but please modify if not.\r\n\r\nedit: changed text of second todo.",
        "issue_id": 5594,
        "issue_title": "Regression in io.fits due to undefined variable",
        "issue_body": "@mwcraig @embray  - I'm running into the following failure in glue due to the recent changes in io.fits:\r\n\r\n```python\r\n    def _read_next_hdu(self):\r\n        \"\"\"\r\n            Lazily load a single HDU from the fileobj or data string the `HDUList`\r\n            was opened from, unless no further HDUs are found.\r\n    \r\n            Returns True if a new HDU was loaded, or False otherwise.\r\n            \"\"\"\r\n    \r\n        if self._read_all:\r\n            return False\r\n    \r\n        saved_compression_enabled = compressed.COMPRESSION_ENABLED\r\n        fileobj, data, kwargs = self._file, self._data, self._open_kwargs\r\n    \r\n        try:\r\n            self._in_read_next_hdu = True\r\n    \r\n            if ('disable_image_compression' in kwargs and\r\n                kwargs['disable_image_compression']):\r\n                compressed.COMPRESSION_ENABLED = False\r\n    \r\n            # read all HDUs\r\n            try:\r\n                if fileobj is not None:\r\n                    try:\r\n                        # Make sure we're back to the end of the last read\r\n                        # HDU\r\n                        if len(self) > 0:\r\n                            last = self[len(self) - 1]\r\n                            if last._data_offset is not None:\r\n                                offset = last._data_offset + last._data_size\r\n                                fileobj.seek(offset, os.SEEK_SET)\r\n    \r\n                        hdu = _BaseHDU.readfrom(fileobj, **kwargs)\r\n                    except EOFError:\r\n                        self._read_all = True\r\n                        return False\r\n                    except ValueError:\r\n                        # A ValueError can occur when trying to perform I/O\r\n                        # on a closed file\r\n                        if fileobj.closed:\r\n                            self._read_all = True\r\n                            return False\r\n                        else:\r\n                            raise\r\n                    except IOError:\r\n                        if fileobj.writeonly:\r\n                            self._read_all = True\r\n                            return False\r\n                        else:\r\n                            raise\r\n                else:\r\n                    if not data:\r\n                        self._read_all = True\r\n                        return False\r\n                    hdu = _BaseHDU.fromstring(data, **kwargs)\r\n                    self._data = data[hdu._data_offset + hdu._data_size:]\r\n    \r\n                super(HDUList, self).append(hdu)\r\n                if len(self) == 1:\r\n                    # Check for an extension HDU and update the EXTEND\r\n                    # keyword of the primary HDU accordingly\r\n                    self.update_extend()\r\n    \r\n                hdu._new = False\r\n                if 'checksum' in kwargs:\r\n                    hdu._output_checksum = kwargs['checksum']\r\n            # check in the case there is extra space after the last HDU or\r\n            # corrupted HDU\r\n            except (VerifyError, ValueError) as exc:\r\n                warnings.warn(\r\n                    'Error validating header for HDU #{} (note: Astropy '\r\n                    'uses zero-based indexing).\\n{}\\n'\r\n                    'There may be extra bytes after the last HDU or the '\r\n                    'file is corrupted.'.format(\r\n>                       len(hdulist), indent(str(exc))), VerifyWarning)\r\nE                       NameError: global name 'hdulist' is not defined\r\n\r\n../../../miniconda/envs/test/lib/python2.7/site-packages/astropy/io/fits/hdu/hdulist.py:1150: NameError\r\n============================ pytest-warning summary ============================\r\n```",
        "issue_closed_at": "2016-12-13T01:48:53Z",
        "base_commit": "a72fdfb6b07356805182cb3a2fc20487404b073f",
        "changes": [
            {
                "file": "astropy/io/fits/hdu/hdulist.py",
                "type": "function",
                "name": "__getitem__",
                "class_name": "HDUList",
                "code": "def __getitem__(self, key):\n        \"\"\"\n        Get an HDU from the `HDUList`, indexed by number or name.\n        \"\"\"\n\n        if isinstance(key, slice):\n            hdus = super(HDUList, self).__getitem__(key)\n            return HDUList(hdus)\n\n        # Originally this used recursion, but hypothetically an HDU with\n        # a very large number of HDUs could blow the stack, so use a loop\n        # instead\n        return self._try_while_unread_hdus(super(HDUList, self).__getitem__,\n                                           self._positive_index_of(key))"
            },
            {
                "file": "astropy/io/fits/hdu/hdulist.py",
                "type": "function",
                "name": "fromstring",
                "class_name": "HDUList",
                "code": "def fromstring(cls, data, **kwargs):\n        \"\"\"\n        Creates an `HDUList` instance from a string or other in-memory data\n        buffer containing an entire FITS file.  Similar to\n        :meth:`HDUList.fromfile`, but does not accept the mode or memmap\n        arguments, as they are only relevant to reading from a file on disk.\n\n        This is useful for interfacing with other libraries such as CFITSIO,\n        and may also be useful for streaming applications.\n\n        Parameters\n        ----------\n        data : str, buffer, memoryview, etc.\n            A string or other memory buffer containing an entire FITS file.  It\n            should be noted that if that memory is read-only (such as a Python\n            string) the returned :class:`HDUList`'s data portions will also be\n            read-only.\n\n        kwargs : dict\n            Optional keyword arguments.  See\n            :func:`astropy.io.fits.open` for details.\n\n        Returns\n        -------\n        hdul : HDUList\n            An :class:`HDUList` object representing the in-memory FITS file.\n        \"\"\"\n\n        try:\n            # Test that the given object supports the buffer interface by\n            # ensuring an ndarray can be created from it\n            np.ndarray((), dtype='ubyte', buffer=data)\n        except TypeError:\n            raise TypeError(\n                'The provided object %r does not contain an underlying '\n                'memory buffer.  fromstring() requires an object that '\n                'supports the buffer interface such as bytes, str '\n                '(in Python 2.x but not in 3.x), buffer, memoryview, '\n                'ndarray, etc.  This restriction is to ensure that '\n                'efficient access to the array/table data is possible.'\n                % data)\n\n        return cls._readfrom(data=data, **kwargs)"
            },
            {
                "file": "astropy/io/fits/hdu/hdulist.py",
                "type": "function",
                "name": "index_of",
                "class_name": "HDUList",
                "code": "def index_of(self, key):\n        \"\"\"\n        Get the index of an HDU from the `HDUList`.\n\n        Parameters\n        ----------\n        key : int, str or tuple of (string, int)\n           The key identifying the HDU.  If ``key`` is a tuple, it is of the\n           form ``(key, ver)`` where ``ver`` is an ``EXTVER`` value that must\n           match the HDU being searched for.\n\n           If the key is ambiguous (e.g. there are multiple 'SCI' extensions)\n           the first match is returned.  For a more precise match use the\n           ``(name, ver)`` pair.\n\n           If even the ``(name, ver)`` pair is ambiguous (it shouldn't be\n           but it's not impossible) the numeric index must be used to index\n           the duplicate HDU.\n\n        Returns\n        -------\n        index : int\n           The index of the HDU in the `HDUList`.\n        \"\"\"\n\n        if _is_int(key):\n            return key\n        elif isinstance(key, tuple):\n            _key, _ver = key\n        else:\n            _key = key\n            _ver = None\n\n        if not isinstance(_key, string_types):\n            raise TypeError(\n                '%s indices must be integers, extension names as strings, '\n                'or (extname, version) tuples; got %r' %\n                (self.__class__.__name__, _key))\n\n        _key = (_key.strip()).upper()\n\n        found = None\n        for idx, hdu in enumerate(self):\n            name = hdu.name\n            if isinstance(name, string_types):\n                name = name.strip().upper()\n            # 'PRIMARY' should always work as a reference to the first HDU\n            if ((name == _key or (_key == 'PRIMARY' and idx == 0)) and\n                (_ver is None or _ver == hdu.ver)):\n                found = idx\n                break\n\n        if (found is None):\n            raise KeyError('Extension {} not found.'.format(repr(key)))\n        else:\n            return found"
            },
            {
                "file": "astropy/io/fits/hdu/hdulist.py",
                "type": "function",
                "name": "_positive_index_of",
                "class_name": "HDUList",
                "code": "def _positive_index_of(self, key):\n        \"\"\"\n        Same as index_of, but ensures always returning a positive index\n        or zero.\n\n        (Really this should be called non_negative_index_of but it felt\n        too long.)\n\n        This means that if the key is a negative integer, we have to\n        convert it to the corresponding positive index.  This means\n        knowing the length of the HDUList, which in turn means loading\n        all HDUs.  Therefore using negative indices on HDULists is inherently\n        inefficient.\n        \"\"\"\n\n        index = self.index_of(key)\n\n        if index >= 0:\n            return index\n\n        if abs(index) > len(self):\n            raise IndexError(\n                'Extension %s is out of bound or not found.' % index)\n\n        return len(self) + index"
            },
            {
                "file": "astropy/io/fits/hdu/hdulist.py",
                "type": "function",
                "name": "_read_next_hdu",
                "class_name": "HDUList",
                "code": "def _read_next_hdu(self):\n        \"\"\"\n        Lazily load a single HDU from the fileobj or data string the `HDUList`\n        was opened from, unless no further HDUs are found.\n\n        Returns True if a new HDU was loaded, or False otherwise.\n        \"\"\"\n\n        if self._read_all:\n            return False\n\n        saved_compression_enabled = compressed.COMPRESSION_ENABLED\n        fileobj, data, kwargs = self._file, self._data, self._open_kwargs\n\n        try:\n            self._in_read_next_hdu = True\n\n            if ('disable_image_compression' in kwargs and\n                kwargs['disable_image_compression']):\n                compressed.COMPRESSION_ENABLED = False\n\n            # read all HDUs\n            try:\n                if fileobj is not None:\n                    try:\n                        # Make sure we're back to the end of the last read\n                        # HDU\n                        if len(self) > 0:\n                            last = self[len(self) - 1]\n                            if last._data_offset is not None:\n                                offset = last._data_offset + last._data_size\n                                fileobj.seek(offset, os.SEEK_SET)\n\n                        hdu = _BaseHDU.readfrom(fileobj, **kwargs)\n                    except EOFError:\n                        self._read_all = True\n                        return False\n                    except ValueError:\n                        # A ValueError can occur when trying to perform I/O\n                        # on a closed file\n                        if fileobj.closed:\n                            self._read_all = True\n                            return False\n                        else:\n                            raise\n                    except IOError:\n                        if fileobj.writeonly:\n                            self._read_all = True\n                            return False\n                        else:\n                            raise\n                else:\n                    if not data:\n                        self._read_all = True\n                        return False\n                    hdu = _BaseHDU.fromstring(data, **kwargs)\n                    self._data = data[hdu._data_offset + hdu._data_size:]\n\n                super(HDUList, self).append(hdu)\n                if len(self) == 1:\n                    # Check for an extension HDU and update the EXTEND\n                    # keyword of the primary HDU accordingly\n                    self.update_extend()\n\n                hdu._new = False\n                if 'checksum' in kwargs:\n                    hdu._output_checksum = kwargs['checksum']\n            # check in the case there is extra space after the last HDU or\n            # corrupted HDU\n            except (VerifyError, ValueError) as exc:\n                warnings.warn(\n                    'Error validating header for HDU #{} (note: Astropy '\n                    'uses zero-based indexing).\\n{}\\n'\n                    'There may be extra bytes after the last HDU or the '\n                    'file is corrupted.'.format(\n                        len(hdulist), indent(str(exc))), VerifyWarning)\n                del exc\n                self._read_all = True\n                return False\n        finally:\n            compressed.COMPRESSION_ENABLED = saved_compression_enabled\n            self._in_read_next_hdu = False\n\n        return True"
            }
        ]
    },
    "Justification": "Candidate D is selected as the most relevant report because it addresses a regression specifically in the `io.fits` module, which is directly related to the CURRENT bug report. Both the CURRENT bug report and Candidate D deal with reading and handling HDUs in FITS files. Additionally, the error in Candidate D indicates issues with variable management and function behavior in `HDUList`, which could provide insights into the problematic piece of code regarding the manipulation of characters in the `fitsrec.py` file. The structural similarity, along with the direct relevance to the same modules involved in both reports, makes it very likely to assist in solving the CURRENT bug."
}