{
  "Selected_candidate": {
    "pr_number": 16331,
    "pr_title": "Make var names in generated Cython code unique",
    "pr_body": "Previously, the following call to `unfuncify` would result in non-compiling\r\nCython code due to a variable clash:\r\n\r\n    >>> unfuncify((x, y), x + y, backend='Cython')\r\n\r\nThis was because the resulting Cython code looked something like:\r\n\r\n    def autofunc_c(np.ndarray[np.double_t, ndim=1] _x, np.ndarray[np.double_t, ndim=1] _y):\r\n        cdef int _m = _y.shape[0]\r\n        cdef np.ndarray[np.double_t, ndim=1] _y = np.empty((_m))\r\n\r\nwhere the argument `_y` is declared in the function body even though it is\r\nalready passed as an argument. This commit leverages `doprint` within the C code\r\ngenerator to make sure that generated variable names are unique.\r\n\r\n<!-- Your title above should be a short description of what\r\nwas changed. Do not include the issue number in the title. -->\r\n\r\n#### References to other Issues or PRs\r\n<!-- If this pull request fixes an issue, write \"Fixes #NNNN\" in that exact\r\nformat, e.g. \"Fixes #1234\". See\r\nhttps://github.com/blog/1506-closing-issues-via-pull-requests . Please also\r\nwrite a comment on that issue linking back to this pull request once it is\r\nopen. -->\r\nFixes #15628\r\n\r\n#### Brief description of what is fixed or changed\r\n\r\nAdd a new method to `CythonCodeWrapper` that converts unique dummy variables into distinct strings and pass every expression that emits variable names through this expression. \r\n\r\n#### Other comments\r\n\r\n\r\n#### Release Notes\r\n\r\n<!-- Write the release notes for this release below. See\r\nhttps://github.com/sympy/sympy/wiki/Writing-Release-Notes for more information\r\non how to write release notes. The bot will check your release notes\r\nautomatically to see if they are formatted correctly. -->\r\n\r\n<!-- BEGIN RELEASE NOTES -->\r\n* utilities\r\n  * make variable names in generated cython code unique\r\n<!-- END RELEASE NOTES -->\r\n",
    "issue_id": 15628,
    "issue_title": "Issue ufuncifying a two argument function with the Cython backend",
    "issue_body": "```\r\n>>> ufuncify((x, y), x + y, backend='Cython')\r\nTraceback (most recent call last):\r\n  File \"./sympy/utilities/autowrap.py\", line 168, in _process_files\r\n    retoutput = check_output(command, stderr=STDOUT)\r\n  File \"/Users/aaronmeurer/anaconda3/lib/python3.5/subprocess.py\", line 316, in check_output\r\n    **kwargs).stdout\r\n  File \"/Users/aaronmeurer/anaconda3/lib/python3.5/subprocess.py\", line 398, in run\r\n    output=stdout, stderr=stderr)\r\nsubprocess.CalledProcessError: Command '['/Users/aaronmeurer/anaconda3/bin/python', 'setup.py', 'build_ext', '--inplace']' returned non-zero exit status 1\r\n\r\nDuring handling of the above exception, another exception occurred:\r\n\r\nTraceback (most recent call last):\r\n  File \"<stdin>\", line 1, in <module>\r\n  File \"./sympy/utilities/autowrap.py\", line 1105, in ufuncify\r\n    tempdir, args, flags, verbose, helpers, **kwargs)\r\n  File \"./sympy/utilities/autowrap.py\", line 640, in autowrap\r\n    return code_wrapper.wrap_code(routine, helpers=helps)\r\n  File \"./sympy/utilities/autowrap.py\", line 149, in wrap_code\r\n    self._process_files(routine)\r\n  File \"./sympy/utilities/autowrap.py\", line 172, in _process_files\r\n    \" \".join(command), e.output.decode('utf-8')))\r\nsympy.utilities.autowrap.CodeWrapError: Error while executing command: /Users/aaronmeurer/anaconda3/bin/python setup.py build_ext --inplace. Command output is:\r\n\r\nError compiling Cython file:\r\n------------------------------------------------------------\r\n...\r\n    void autofunc(double *y_4785968, double *x_4785972, double *y_4785973, int m_4785969)\r\n\r\ndef autofunc_c(np.ndarray[np.double_t, ndim=1] _x, np.ndarray[np.double_t, ndim=1] _y):\r\n\r\n    cdef int _m = _y.shape[0]\r\n    cdef np.ndarray[np.double_t, ndim=1] _y = np.empty((_m))\r\n                                        ^\r\n------------------------------------------------------------\r\n\r\nwrapper_module_3.pyx:10:41: '_y' redeclared\r\n\r\nError compiling Cython file:\r\n------------------------------------------------------------\r\n...\r\ncimport numpy as np\r\n\r\ncdef extern from 'wrapped_code_3.h':\r\n    void autofunc(double *y_4785968, double *x_4785972, double *y_4785973, int m_4785969)\r\n\r\ndef autofunc_c(np.ndarray[np.double_t, ndim=1] _x, np.ndarray[np.double_t, ndim=1] _y):\r\n                                                  ^\r\n------------------------------------------------------------\r\n\r\nwrapper_module_3.pyx:7:51: Previous declaration is here\r\nwarning: wrapper_module_3.pyx:10:41: cdef variable '_y' declared after it is used\r\nCompiling wrapper_module_3.pyx because it changed.\r\n[1/1] Cythonizing wrapper_module_3.pyx\r\nTraceback (most recent call last):\r\n  File \"setup.py\", line 19, in <module>\r\n    setup(ext_modules=cythonize(ext_mods, **cy_opts))\r\n  File \"/Users/aaronmeurer/anaconda3/lib/python3.5/site-packages/Cython/Build/Dependencies.py\", line 1026, in cythonize\r\n    cythonize_one(*args)\r\n  File \"/Users/aaronmeurer/anaconda3/lib/python3.5/site-packages/Cython/Build/Dependencies.py\", line 1146, in cythonize_one\r\n    raise CompileError(None, pyx_file)\r\nCython.Compiler.Errors.CompileError: wrapper_module_3.pyx\r\n```\r\n\r\nIt works if the function just has one argument, or if you use a different backend. ",
    "issue_closed_at": "2019-03-20T15:46:53Z",
    "base_commit": "356a73cd676e0c3f1a1c3057a6895db0d82a1be7",
    "changes": [
      {
        "file": "sympy/utilities/autowrap.py",
        "type": "function",
        "name": "dump_pyx",
        "class_name": "CythonCodeWrapper",
        "code": "def dump_pyx(self, routines, f, prefix):\n        \"\"\"Write a Cython file with python wrappers\n\n        This file contains all the definitions of the routines in c code and\n        refers to the header file.\n\n        Arguments\n        ---------\n        routines\n            List of Routine instances\n        f\n            File-like object to write the file to\n        prefix\n            The filename prefix, used to refer to the proper header file.\n            Only the basename of the prefix is used.\n        \"\"\"\n        headers = []\n        functions = []\n        for routine in routines:\n            prototype = self.generator.get_prototype(routine)\n\n            # C Function Header Import\n            headers.append(self.pyx_header.format(header_file=prefix,\n                                                  prototype=prototype))\n\n            # Partition the C function arguments into categories\n            py_rets, py_args, py_loc, py_inf = self._partition_args(routine.arguments)\n\n            # Function prototype\n            name = routine.name\n            arg_string = \", \".join(self._prototype_arg(arg) for arg in py_args)\n\n            # Local Declarations\n            local_decs = []\n            for arg, val in py_inf.items():\n                proto = self._prototype_arg(arg)\n                mat, ind = val\n                local_decs.append(\"    cdef {0} = {1}.shape[{2}]\".format(proto, mat, ind))\n            local_decs.extend([\"    cdef {0}\".format(self._declare_arg(a)) for a in py_loc])\n            declarations = \"\\n\".join(local_decs)\n            if declarations:\n                declarations = declarations + \"\\n\"\n\n            # Function Body\n            args_c = \", \".join([self._call_arg(a) for a in routine.arguments])\n            rets = \", \".join([str(r.name) for r in py_rets])\n            if routine.results:\n                body = '    return %s(%s)' % (routine.name, args_c)\n                if rets:\n                    body = body + ', ' + rets\n            else:\n                body = '    %s(%s)\\n' % (routine.name, args_c)\n                body = body + '    return ' + rets\n\n            functions.append(self.pyx_func.format(name=name, arg_string=arg_string,\n                    declarations=declarations, body=body))\n\n        # Write text to file\n        if self._need_numpy:\n            # Only import numpy if required\n            f.write(self.pyx_imports)\n        f.write('\\n'.join(headers))\n        f.write('\\n'.join(functions))"
      },
      {
        "file": "sympy/utilities/autowrap.py",
        "type": "function",
        "name": "dump_pyx",
        "class_name": "CythonCodeWrapper",
        "code": "def dump_pyx(self, routines, f, prefix):\n        \"\"\"Write a Cython file with python wrappers\n\n        This file contains all the definitions of the routines in c code and\n        refers to the header file.\n\n        Arguments\n        ---------\n        routines\n            List of Routine instances\n        f\n            File-like object to write the file to\n        prefix\n            The filename prefix, used to refer to the proper header file.\n            Only the basename of the prefix is used.\n        \"\"\"\n        headers = []\n        functions = []\n        for routine in routines:\n            prototype = self.generator.get_prototype(routine)\n\n            # C Function Header Import\n            headers.append(self.pyx_header.format(header_file=prefix,\n                                                  prototype=prototype))\n\n            # Partition the C function arguments into categories\n            py_rets, py_args, py_loc, py_inf = self._partition_args(routine.arguments)\n\n            # Function prototype\n            name = routine.name\n            arg_string = \", \".join(self._prototype_arg(arg) for arg in py_args)\n\n            # Local Declarations\n            local_decs = []\n            for arg, val in py_inf.items():\n                proto = self._prototype_arg(arg)\n                mat, ind = val\n                local_decs.append(\"    cdef {0} = {1}.shape[{2}]\".format(proto, mat, ind))\n            local_decs.extend([\"    cdef {0}\".format(self._declare_arg(a)) for a in py_loc])\n            declarations = \"\\n\".join(local_decs)\n            if declarations:\n                declarations = declarations + \"\\n\"\n\n            # Function Body\n            args_c = \", \".join([self._call_arg(a) for a in routine.arguments])\n            rets = \", \".join([str(r.name) for r in py_rets])\n            if routine.results:\n                body = '    return %s(%s)' % (routine.name, args_c)\n                if rets:\n                    body = body + ', ' + rets\n            else:\n                body = '    %s(%s)\\n' % (routine.name, args_c)\n                body = body + '    return ' + rets\n\n            functions.append(self.pyx_func.format(name=name, arg_string=arg_string,\n                    declarations=declarations, body=body))\n\n        # Write text to file\n        if self._need_numpy:\n            # Only import numpy if required\n            f.write(self.pyx_imports)\n        f.write('\\n'.join(headers))\n        f.write('\\n'.join(functions))"
      },
      {
        "file": "sympy/utilities/autowrap.py",
        "type": "function",
        "name": "_prototype_arg",
        "class_name": "CythonCodeWrapper",
        "code": "def _prototype_arg(self, arg):\n        mat_dec = \"np.ndarray[{mtype}, ndim={ndim}] {name}\"\n        np_types = {'double': 'np.double_t',\n                    'int': 'np.int_t'}\n        t = arg.get_datatype('c')\n        if arg.dimensions:\n            self._need_numpy = True\n            ndim = len(arg.dimensions)\n            mtype = np_types[t]\n            return mat_dec.format(mtype=mtype, ndim=ndim, name=arg.name)\n        else:\n            return \"%s %s\" % (t, str(arg.name))"
      },
      {
        "file": "sympy/utilities/autowrap.py",
        "type": "function",
        "name": "_declare_arg",
        "class_name": "CythonCodeWrapper",
        "code": "def _declare_arg(self, arg):\n        proto = self._prototype_arg(arg)\n        if arg.dimensions:\n            shape = '(' + ','.join(str(i[1] + 1) for i in arg.dimensions) + ')'\n            return proto + \" = np.empty({shape})\".format(shape=shape)\n        else:\n            return proto + \" = 0\""
      }
    ]
  },
  "Justification": "Candidate D is the most relevant because it deals directly with the generation of Cython code, which is the same backend that the CURRENT bug report pertains to. Both reports involve issues related to how functions are wrapped and signatures generated in Cython. The structural and module/component similarities are high, as they both originate from the autowrap functionality. Furthermore, the patch for Candidate D addresses variable uniqueness, which could also relate to the signatures generated in the CURRENT bug report, providing insights into potential fixes for the bug regarding incorrect function signatures. Moreover, both bugs deal within the same subsystem of the program concerning code generation and backend compilation, making this candidate particularly effective for debugging the CURRENT issue.",
  "instance_id": "sympy__sympy-16792",
  "repo": "sympy/sympy",
  "created_at": "2019-05-09T03:40:54Z",
  "problem_statement": "autowrap with cython backend fails when array arguments do not appear in wrapped expr\nWhen using the cython backend for autowrap, it appears that the code is not correctly generated when the function in question has array arguments that do not appear in the final expression. A minimal counterexample is:\r\n\r\n```python\r\nfrom sympy.utilities.autowrap import autowrap\r\nfrom sympy import MatrixSymbol\r\nimport numpy as np\r\n\r\nx = MatrixSymbol('x', 2, 1)\r\nexpr = 1.0\r\nf = autowrap(expr, args=(x,), backend='cython')\r\n\r\nf(np.array([[1.0, 2.0]]))\r\n```\r\n\r\nThis should of course return `1.0` but instead fails with:\r\n```python\r\nTypeError: only size-1 arrays can be converted to Python scalars\r\n```\r\n\r\nA little inspection reveals that this is because the corresponding C function is generated with an incorrect signature:\r\n\r\n```C\r\ndouble autofunc(double x) {\r\n\r\n   double autofunc_result;\r\n   autofunc_result = 1.0;\r\n   return autofunc_result;\r\n\r\n}\r\n```\r\n\r\n(`x` should be `double *`, not `double` in this case)\r\n\r\nI've found that this error won't occur so long as `expr` depends at least in part on each argument. For example this slight modification of the above counterexample works perfectly:\r\n\r\n```python\r\nfrom sympy.utilities.autowrap import autowrap\r\nfrom sympy import MatrixSymbol\r\nimport numpy as np\r\n\r\nx = MatrixSymbol('x', 2, 1)\r\n# now output depends on x\r\nexpr = x[0,0]\r\nf = autowrap(expr, args=(x,), backend='cython')\r\n\r\n# returns 1.0 as expected, without failure\r\nf(np.array([[1.0, 2.0]]))\r\n```\r\n\r\nThis may seem like a silly issue (\"why even have `x` as an argument if it doesn't appear in the expression you're trying to evaluate?\"). But of course in interfacing with external libraries (e.g. for numerical integration), one often needs functions to have a pre-defined signature regardless of whether a given argument contributes to the output.\r\n\r\nI think I've identified the problem in `codegen` and will suggest a PR shortly.\n",
  "patch": "diff --git a/sympy/utilities/codegen.py b/sympy/utilities/codegen.py\n--- a/sympy/utilities/codegen.py\n+++ b/sympy/utilities/codegen.py\n@@ -695,6 +695,11 @@ def routine(self, name, expr, argument_sequence=None, global_vars=None):\n         arg_list = []\n \n         # setup input argument list\n+\n+        # helper to get dimensions for data for array-like args\n+        def dimensions(s):\n+            return [(S.Zero, dim - 1) for dim in s.shape]\n+\n         array_symbols = {}\n         for array in expressions.atoms(Indexed) | local_expressions.atoms(Indexed):\n             array_symbols[array.base.label] = array\n@@ -703,11 +708,8 @@ def routine(self, name, expr, argument_sequence=None, global_vars=None):\n \n         for symbol in sorted(symbols, key=str):\n             if symbol in array_symbols:\n-                dims = []\n                 array = array_symbols[symbol]\n-                for dim in array.shape:\n-                    dims.append((S.Zero, dim - 1))\n-                metadata = {'dimensions': dims}\n+                metadata = {'dimensions': dimensions(array)}\n             else:\n                 metadata = {}\n \n@@ -739,7 +741,11 @@ def routine(self, name, expr, argument_sequence=None, global_vars=None):\n                 try:\n                     new_args.append(name_arg_dict[symbol])\n                 except KeyError:\n-                    new_args.append(InputArgument(symbol))\n+                    if isinstance(symbol, (IndexedBase, MatrixSymbol)):\n+                        metadata = {'dimensions': dimensions(symbol)}\n+                    else:\n+                        metadata = {}\n+                    new_args.append(InputArgument(symbol, **metadata))\n             arg_list = new_args\n \n         return Routine(name, arg_list, return_val, local_vars, global_vars)\n"
}