{
  "id": "astropy__astropy-13033",
  "question": "TimeSeries: misleading exception when required column check fails.\n<!-- This comments are hidden when you submit the issue,\r\nso you do not need to remove them! -->\r\n\r\n<!-- Please be sure to check out our contributing guidelines,\r\nhttps://github.com/astropy/astropy/blob/main/CONTRIBUTING.md .\r\nPlease be sure to check out our code of conduct,\r\nhttps://github.com/astropy/astropy/blob/main/CODE_OF_CONDUCT.md . -->\r\n\r\n<!-- Please have a search on our GitHub repository to see if a similar\r\nissue has already been posted.\r\nIf a similar issue is closed, have a quick look to see if you are satisfied\r\nby the resolution.\r\nIf not please go ahead and open an issue! -->\r\n\r\n<!-- Please check that the development version still produces the same bug.\r\nYou can install development version with\r\npip install git+https://github.com/astropy/astropy\r\ncommand. -->\r\n\r\n### Description\r\n<!-- Provide a general description of the bug. -->\r\n\r\nFor a `TimeSeries` object that has additional required columns (in addition to `time`), when codes mistakenly try to remove a required column, the exception it produces is misleading.\r\n\r\n### Expected behavior\r\n<!-- What did you expect to happen. -->\r\nAn exception that informs the users required columns are missing.\r\n\r\n### Actual behavior\r\nThe actual exception message is confusing:\r\n`ValueError: TimeSeries object is invalid - expected 'time' as the first columns but found 'time'`\r\n\r\n### Steps to Reproduce\r\n<!-- Ideally a code example could be provided so we can run it ourselves. -->\r\n<!-- If you are pasting code, use triple backticks (```) around\r\nyour code snippet. -->\r\n<!-- If necessary, sanitize your screen output to be pasted so you do not\r\nreveal secrets like tokens and passwords. -->\r\n\r\n```python\r\nfrom astropy.time import Time\r\nfrom astropy.timeseries import TimeSeries\r\n\r\ntime=Time(np.arange(100000, 100003), format='jd')\r\nts = TimeSeries(time=time, data = {\"flux\": [99.9, 99.8, 99.7]})\r\nts._required_columns = [\"time\", \"flux\"]                                   \r\nts.remove_column(\"flux\")\r\n\r\n```\r\n\r\n### System Details\r\n<!-- Even if you do not think this is necessary, it is useful information for the maintainers.\r\nPlease run the following snippet and paste the output below:\r\nimport platform; print(platform.platform())\r\nimport sys; print(\"Python\", sys.version)\r\nimport numpy; print(\"Numpy\", numpy.__version__)\r\nimport erfa; print(\"pyerfa\", erfa.__version__)\r\nimport astropy; print(\"astropy\", astropy.__version__)\r\nimport scipy; print(\"Scipy\", scipy.__version__)\r\nimport matplotlib; print(\"Matplotlib\", matplotlib.__version__)\r\n-->\r\n```\r\nWindows-10-10.0.22000-SP0\r\nPython 3.9.10 | packaged by conda-forge | (main, Feb  1 2022, 21:21:54) [MSC v.1929 64 bit (AMD64)]\r\nNumpy 1.22.3\r\npyerfa 2.0.0.1\r\nastropy 5.0.3\r\nScipy 1.8.0\r\nMatplotlib 3.5.1\r\n```\n",
  "answer": "{\"task_id\": \"c140f6e2-7829-4300-bf21-49ff6e462d31\", \"workflow_run_id\": \"d6254f46-3b3f-4773-a42e-b9c64eb28c80\", \"data\": {\"id\": \"d6254f46-3b3f-4773-a42e-b9c64eb28c80\", \"workflow_id\": \"9a0a6efa-4d15-4876-b13f-4fb6a999caa2\", \"status\": \"succeeded\", \"outputs\": {\"final_patch\": [\"--- a/astropy/timeseries/sampled.py\\n+++ b/astropy/timeseries/sampled.py\\n@@ -97,9 +97,11 @@ class TimeSeries(BaseTimeSeries):\\n         Check that the required columns are present and in the right order\\n         \\\"\\\"\\\"\\n         if self._required_columns is not None:\\n-            if not self._required_columns <= set(self.colnames):\\n-                raise ValueError(\\\"TimeSeries object is invalid - expected \\\"\\n-                                 \\\"'time' as the first columns but found '{}'\\\".format(self.colnames[0]))\\n+            missing_columns = [col for col in self._required_columns if col not in self.colnames]\\n+            if missing_columns:\\n+                if len(missing_columns) == 1:\\n+                    raise ValueError(f\\\"TimeSeries object is missing required column: '{missing_columns[0]}'\\\")\\n+                else:\\n+                    raise ValueError(f\\\"TimeSeries object is missing required columns: {missing_columns}\\\")\\n \\n     def __init__(self, data=None, *, time=None, time_start=None, time_delta=None, n_samples=None, **kwargs):\", \"--- a/astropy/timeseries/sampled.py\\n+++ b/astropy/timeseries/sampled.py\\n@@ -97,9 +97,10 @@ class TimeSeries(BaseTimeSeries):\\n         Check that the required columns are present and in the right order\\n         \\\"\\\"\\\"\\n         if self._required_columns is not None:\\n-            if not self._required_columns <= set(self.colnames):\\n-                raise ValueError(\\\"TimeSeries object is invalid - expected \\\"\\n-                                 \\\"'time' as the first columns but found '{}'\\\".format(self.colnames[0]))\\n+            missing_columns = [col for col in self._required_columns if col not in self.colnames]\\n+            if missing_columns:\\n+                missing_str = \\\"', '\\\".join(missing_columns)\\n+                raise ValueError(f\\\"TimeSeries object is missing required column(s): '{missing_str}'\\\")\\n \\n     def __init__(self, data=None, *, time=None, time_start=None, time_delta=None, n_samples=None, **kwargs):\", \"--- a/astropy/timeseries/sampled.py\\n+++ b/astropy/timeseries/sampled.py\\n@@ -97,9 +97,12 @@ class TimeSeries(BaseTimeSeries):\\n         Check that the required columns are present and in the right order\\n         \\\"\\\"\\\"\\n         if self._required_columns is not None:\\n-            if not self._required_columns <= set(self.colnames):\\n-                raise ValueError(\\\"TimeSeries object is invalid - expected \\\"\\n-                                 \\\"'time' as the first columns but found '{}'\\\".format(self.colnames[0]))\\n+            missing_columns = [col for col in self._required_columns if col not in self.colnames]\\n+            if missing_columns:\\n+                missing_str = \\\"', '\\\".join(missing_columns)\\n+                present_str = \\\"', '\\\".join(self.colnames)\\n+                raise ValueError(f\\\"TimeSeries object is missing required column(s): '{missing_str}'. \\\"\\n+                               f\\\"Present columns are: '{present_str}'\\\")\\n \\n     def __init__(self, data=None, *, time=None, time_start=None, time_delta=None, n_samples=None, **kwargs):\", \"--- a/astropy/timeseries/sampled.py\\n+++ b/astropy/timeseries/sampled.py\\n@@ -97,9 +97,11 @@ class TimeSeries(BaseTimeSeries):\\n         Check that the required columns are present and in the right order\\n         \\\"\\\"\\\"\\n         if self._required_columns is not None:\\n-            if not self._required_columns <= set(self.colnames):\\n-                raise ValueError(\\\"TimeSeries object is invalid - expected \\\"\\n-                                 \\\"'time' as the first columns but found '{}'\\\".format(self.colnames[0]))\\n+            missing_columns = set(self._required_columns) - set(self.colnames)\\n+            if missing_columns:\\n+                required_str = \\\"', '\\\".join(sorted(self._required_columns))\\n+                raise ValueError(f\\\"TimeSeries object is invalid - required columns are '{required_str}' \\\"\\n+                               f\\\"but missing: '{\\\"', '\\\".join(sorted(missing_columns))}'\\\")\\n \\n     def __init__(self, data=None, *, time=None, time_start=None, time_delta=None, n_samples=None, **kwargs):\", \"--- a/astropy/timeseries/sampled.py\\n+++ b/astropy/timeseries/sampled.py\\n@@ -97,9 +97,10 @@ class TimeSeries(BaseTimeSeries):\\n         Check that the required columns are present and in the right order\\n         \\\"\\\"\\\"\\n         if self._required_columns is not None:\\n-            if not self._required_columns <= set(self.colnames):\\n-                raise ValueError(\\\"TimeSeries object is invalid - expected \\\"\\n-                                 \\\"'time' as the first columns but found '{}'\\\".format(self.colnames[0]))\\n+            missing_columns = set(self._required_columns) - set(self.colnames)\\n+            if missing_columns:\\n+                raise ValueError(f\\\"TimeSeries object is invalid - missing required column(s): \\\"\\n+                               f\\\"{sorted(missing_columns)}\\\")\\n \\n     def __init__(self, data=None, *, time=None, time_start=None, time_delta=None, n_samples=None, **kwargs):\", \"diff\\n--- a/astropy/timeseries/sampled.py\\n+++ b/astropy/timeseries/sampled.py\\n@@ -97,9 +97,12 @@ class TimeSeries(BaseTimeSeries):\\n         Check that the required columns are present and in the right order\\n         \\\"\\\"\\\"\\n         if self._required_columns is not None:\\n-            if not self._required_columns <= set(self.colnames):\\n-                raise ValueError(\\\"TimeSeries object is invalid - expected \\\"\\n-                                 \\\"'time' as the first columns but found '{}'\\\".format(self.colnames[0]))\\n+            missing_columns = [col for col in self._required_columns if col not in self.colnames]\\n+            if missing_columns:\\n+                missing_str = \\\"', '\\\".join(missing_columns)\\n+                present_str = \\\"', '\\\".join(self.colnames)\\n+                raise ValueError(f\\\"TimeSeries object is missing required column(s): '{missing_str}'. \\\"\\n+                               f\\\"Present columns are: '{present_str}'\\\")\\n \\n     def __init__(self, data=None, *, time=None, time_start=None, time_delta=None, n_samples=None, **kwargs):\\n\", \"diff\\n--- a/astropy/timeseries/sampled.py\\n+++ b/astropy/timeseries/sampled.py\\n@@ -97,9 +97,12 @@ class TimeSeries(BaseTimeSeries):\\n         Check that the required columns are present and in the right order\\n         \\\"\\\"\\\"\\n         if self._required_columns is not None:\\n-            if not self._required_columns <= set(self.colnames):\\n-                raise ValueError(\\\"TimeSeries object is invalid - expected \\\"\\n-                                 \\\"'time' as the first columns but found '{}'\\\".format(self.colnames[0]))\\n+            missing_columns = [col for col in self._required_columns if col not in self.colnames]\\n+            if missing_columns:\\n+                missing_str = \\\"', '\\\".join(missing_columns)\\n+                present_str = \\\"', '\\\".join(self.colnames)\\n+                raise ValueError(f\\\"TimeSeries object is missing required column(s): '{missing_str}'. \\\"\\n+                               f\\\"Present columns are: '{present_str}'\\\")\\n \\n     def __init__(self, data=None, *, time=None, time_start=None, time_delta=None, n_samples=None, **kwargs):\\n\", \"diff\\n--- a/astropy/timeseries/sampled.py\\n+++ b/astropy/timeseries/sampled.py\\n@@ -97,9 +97,12 @@ class TimeSeries(BaseTimeSeries):\\n         Check that the required columns are present and in the right order\\n         \\\"\\\"\\\"\\n         if self._required_columns is not None:\\n-            if not self._required_columns <= set(self.colnames):\\n-                raise ValueError(\\\"TimeSeries object is invalid - expected \\\"\\n-                                 \\\"'time' as the first columns but found '{}'\\\".format(self.colnames[0]))\\n+            missing_columns = [col for col in self._required_columns if col not in self.colnames]\\n+            if missing_columns:\\n+                missing_str = \\\"', '\\\".join(missing_columns)\\n+                present_str = \\\"', '\\\".join(self.colnames)\\n+                raise ValueError(f\\\"TimeSeries object is missing required column(s): '{missing_str}'. \\\"\\n+                               f\\\"Present columns are: '{present_str}'\\\")\\n \\n     def __init__(self, data=None, *, time=None, time_start=None, time_delta=None, n_samples=None, **kwargs):\\n\"]}, \"error\": \"\", \"elapsed_time\": 127.822912, \"total_tokens\": 44521, \"total_steps\": 30, \"created_at\": 1754643930, \"finished_at\": 1754644058}}"
}