{
  "Selected_candidate": {
    "pr_number": 2711,
    "pr_title": "FITS: 'BLANK' keyword causes crash when reading data",
    "pr_body": "Hello,\nI already notice since a while an issue while trying to read, e.g., data fits cube (so simply (x,y,z))  see below for the full error which originate from 'hdu/image.py'.\nMy dummy, quickest fix was to add, right before the offending 'if':\n                `if blanks==False: blanks=np.array([False])`\n\nI don't know if it is really appropriate, but it does the trick for me.\n\nregards,\nGilles\n\n```\nIn [120]: array=py.getdata(cube_file)\n---------------------------------------------------------------------------\nAttributeError                            Traceback (most recent call last)\n<ipython-input-120-5d145d1683de> in <module>()\n----> 1 array=py.getdata(cube_file)\n\n/usr/local/lib/python2.7/dist-packages/astropy-0.3.2-py2.7-linux-x86_64.egg/astropy/io/fits/convenience.pyc in getdata(filename, *args, **kwargs)\n    186     hdulist, extidx = _getext(filename, mode, *args, **kwargs)\n    187     hdu = hdulist[extidx]\n--> 188     data = hdu.data\n    189     if data is None and extidx == 0:\n    190         try:\n\n/usr/local/lib/python2.7/dist-packages/astropy-0.3.2-py2.7-linux-x86_64.egg/astropy/utils/misc.pyc in __get__(self, obj, owner)\n    277         key = self._fget.__name__\n    278         if key not in obj.__dict__:\n--> 279             val = self._fget(obj)\n    280             obj.__dict__[key] = val\n    281             return val\n\n/usr/local/lib/python2.7/dist-packages/astropy-0.3.2-py2.7-linux-x86_64.egg/astropy/io/fits/hdu/image.pyc in data(self)\n    213             return\n    214 \n--> 215         data = self._get_scaled_image_data(self._data_offset, self.shape)\n    216         self._update_header_scale_info(data.dtype)\n    217 \n\n/usr/local/lib/python2.7/dist-packages/astropy-0.3.2-py2.7-linux-x86_64.egg/astropy/io/fits/hdu/image.pyc in _get_scaled_image_data(self, offset, shape)\n    582                 # So if the number of blank items is fewer than\n    583                 # len(raw_data.flat) / 8, using np.where will use less memory\n--> 584                 if blanks.sum() < len(blanks) / 8:\n    585                     blanks = np.where(blanks)\n    586 \n\nAttributeError: 'bool' object has no attribute 'sum'\n```\n",
    "issue_id": 2711,
    "issue_title": "FITS: 'BLANK' keyword causes crash when reading data",
    "issue_body": "Hello,\nI already notice since a while an issue while trying to read, e.g., data fits cube (so simply (x,y,z))  see below for the full error which originate from 'hdu/image.py'.\nMy dummy, quickest fix was to add, right before the offending 'if':\n                `if blanks==False: blanks=np.array([False])`\n\nI don't know if it is really appropriate, but it does the trick for me.\n\nregards,\nGilles\n\n```\nIn [120]: array=py.getdata(cube_file)\n---------------------------------------------------------------------------\nAttributeError                            Traceback (most recent call last)\n<ipython-input-120-5d145d1683de> in <module>()\n----> 1 array=py.getdata(cube_file)\n\n/usr/local/lib/python2.7/dist-packages/astropy-0.3.2-py2.7-linux-x86_64.egg/astropy/io/fits/convenience.pyc in getdata(filename, *args, **kwargs)\n    186     hdulist, extidx = _getext(filename, mode, *args, **kwargs)\n    187     hdu = hdulist[extidx]\n--> 188     data = hdu.data\n    189     if data is None and extidx == 0:\n    190         try:\n\n/usr/local/lib/python2.7/dist-packages/astropy-0.3.2-py2.7-linux-x86_64.egg/astropy/utils/misc.pyc in __get__(self, obj, owner)\n    277         key = self._fget.__name__\n    278         if key not in obj.__dict__:\n--> 279             val = self._fget(obj)\n    280             obj.__dict__[key] = val\n    281             return val\n\n/usr/local/lib/python2.7/dist-packages/astropy-0.3.2-py2.7-linux-x86_64.egg/astropy/io/fits/hdu/image.pyc in data(self)\n    213             return\n    214 \n--> 215         data = self._get_scaled_image_data(self._data_offset, self.shape)\n    216         self._update_header_scale_info(data.dtype)\n    217 \n\n/usr/local/lib/python2.7/dist-packages/astropy-0.3.2-py2.7-linux-x86_64.egg/astropy/io/fits/hdu/image.pyc in _get_scaled_image_data(self, offset, shape)\n    582                 # So if the number of blank items is fewer than\n    583                 # len(raw_data.flat) / 8, using np.where will use less memory\n--> 584                 if blanks.sum() < len(blanks) / 8:\n    585                     blanks = np.where(blanks)\n    586 \n\nAttributeError: 'bool' object has no attribute 'sum'\n```\n",
    "issue_closed_at": "2014-09-16T19:18:56Z",
    "base_commit": "0307f793cf700560673ff482d37de447958db437",
    "changes": [
      {
        "file": "astropy/io/fits/hdu/image.py",
        "type": "line",
        "name": "line 1",
        "code": "# Licensed under a 3-clause BSD style license - see PYFITS.rst\n\nimport sys\nimport numpy as np\n\nfrom .base import DELAYED, _ValidHDU, ExtensionHDU\nfrom ..header import Header\nfrom ..util import (_is_pseudo_unsigned, _unsigned_zero, _is_int,\n                    _normalize_slice)\n\nfrom ....extern.six import string_types\nfrom ....extern.six.moves import xrange"
      },
      {
        "file": "astropy/io/fits/hdu/image.py",
        "type": "function",
        "name": "__init__",
        "class_name": "_KeyType",
        "code": "def __init__(self, npts, offset):\n        self.npts = npts\n        self.offset = offset"
      },
      {
        "file": "astropy/io/fits/hdu/image.py",
        "type": "function",
        "name": "_get_scaled_image_data",
        "class_name": "_ImageBaseHDU",
        "code": "def _get_scaled_image_data(self, offset, shape):\n        \"\"\"\n        Internal function for reading image data from a file and apply scale\n        factors to it.  Normally this is used for the entire image, but it\n        supports alternate offset/shape for Section support.\n        \"\"\"\n\n        code = _ImageBaseHDU.NumCode[self._orig_bitpix]\n\n        raw_data = self._get_raw_data(shape, code, offset)\n        raw_data.dtype = raw_data.dtype.newbyteorder('>')\n\n        if (self._orig_bzero == 0 and self._orig_bscale == 1 and\n                self._blank is None):\n            # No further conversion of the data is necessary\n            return raw_data\n\n        data = None\n        if not (self._orig_bzero == 0 and self._orig_bscale == 1):\n            data = self._convert_pseudo_unsigned(raw_data)\n\n        if data is None:\n            # In these cases, we end up with floating-point arrays and have to\n            # apply bscale and bzero. We may have to handle BLANK and convert\n            # to NaN in the resulting floating-point arrays.\n            if self._blank is not None:\n                blanks = raw_data.flat == self._blank\n                # The size of blanks in bytes is the number of elements in\n                # raw_data.flat.  However, if we use np.where instead we will\n                # only use 8 bytes for each index where the condition is true.\n                # So if the number of blank items is fewer than\n                # len(raw_data.flat) / 8, using np.where will use less memory\n                if blanks.sum() < len(blanks) / 8:\n                    blanks = np.where(blanks)\n\n            new_dtype = self._dtype_for_bitpix()\n            if new_dtype is not None:\n                data = np.array(raw_data, dtype=new_dtype)\n            else:  # floating point cases\n                if self._file.memmap:\n                    data = raw_data.copy()\n                # if not memmap, use the space already in memory\n                else:\n                    data = raw_data\n\n            del raw_data\n\n            if self._orig_bscale != 1:\n                np.multiply(data, self._orig_bscale, data)\n            if self._orig_bzero != 0:\n                data += self._orig_bzero\n\n            if self._blank is not None:\n                data.flat[blanks] = np.nan\n\n        return data"
      }
    ]
  },
  "Justification": "Candidate B is the most helpful report because it deals with an error in reading data and provides a straightforward fix that modifies the handling of specific cases in the code, similar to the expectations outlined in the CURRENT bug report. Both reports emphasize issues related to reading files and handling unusual cases that lead to crashes or exceptions—specifically concerning data format expectations. The symptom of crashing is directly related to how the CURRENT bug also results in a ValueError when reading improperly formatted input. The modification approach in Candidate B could provide valuable insight into tackling how the CURRENT bug could improve error handling for the lowercase commands in QDP files, thus enhancing robustness and user experience.",
  "instance_id": "astropy__astropy-14365",
  "repo": "astropy/astropy",
  "created_at": "2023-02-06T19:20:34Z",
  "problem_statement": "ascii.qdp Table format assumes QDP commands are upper case\n### Description\n\nascii.qdp assumes that commands in a QDP file are upper case, for example, for errors they must be \"READ SERR 1 2\" whereas QDP itself is not case sensitive and case use \"read serr 1 2\". \r\n\r\nAs many QDP files are created by hand, the expectation that all commands be all-caps should be removed.\n\n### Expected behavior\n\nThe following qdp file should read into a `Table` with errors, rather than crashing.\r\n```\r\nread serr 1 2 \r\n1 0.5 1 0.5\r\n```\n\n### How to Reproduce\n\nCreate a QDP file:\r\n```\r\n> cat > test.qdp\r\nread serr 1 2 \r\n1 0.5 1 0.5\r\n<EOF>\r\n\r\n > python\r\nPython 3.10.9 (main, Dec  7 2022, 02:03:23) [Clang 13.0.0 (clang-1300.0.29.30)] on darwin\r\nType \"help\", \"copyright\", \"credits\" or \"license\" for more information.\r\n>>> from astropy.table import Table\r\n>>> Table.read('test.qdp',format='ascii.qdp')\r\nWARNING: table_id not specified. Reading the first available table [astropy.io.ascii.qdp]\r\nTraceback (most recent call last):\r\n...\r\n    raise ValueError(f'Unrecognized QDP line: {line}')\r\nValueError: Unrecognized QDP line: read serr 1 2\r\n```\r\n\r\nRunning \"qdp test.qdp\" works just fine.\r\n\n\n### Versions\n\nPython 3.10.9 (main, Dec  7 2022, 02:03:23) [Clang 13.0.0 (clang-1300.0.29.30)]\r\nastropy 5.1\r\nNumpy 1.24.1\r\npyerfa 2.0.0.1\r\nScipy 1.10.0\r\nMatplotlib 3.6.3\r\n\n",
  "patch": "diff --git a/astropy/io/ascii/qdp.py b/astropy/io/ascii/qdp.py\n--- a/astropy/io/ascii/qdp.py\n+++ b/astropy/io/ascii/qdp.py\n@@ -68,7 +68,7 @@ def _line_type(line, delimiter=None):\n     _new_re = rf\"NO({sep}NO)+\"\n     _data_re = rf\"({_decimal_re}|NO|[-+]?nan)({sep}({_decimal_re}|NO|[-+]?nan))*)\"\n     _type_re = rf\"^\\s*((?P<command>{_command_re})|(?P<new>{_new_re})|(?P<data>{_data_re})?\\s*(\\!(?P<comment>.*))?\\s*$\"\n-    _line_type_re = re.compile(_type_re)\n+    _line_type_re = re.compile(_type_re, re.IGNORECASE)\n     line = line.strip()\n     if not line:\n         return \"comment\"\n@@ -306,7 +306,7 @@ def _get_tables_from_qdp_file(qdp_file, input_colnames=None, delimiter=None):\n \n             values = []\n             for v in line.split(delimiter):\n-                if v == \"NO\":\n+                if v.upper() == \"NO\":\n                     values.append(np.ma.masked)\n                 else:\n                     # Understand if number is int or float\n"
}