{
  "RepoName": "python-cachier/cachier.git",
  "CommitSHA": "474294fac83b029f13dd61b8d2822923045d4504",
  "Type": "logic error",
  "ErrorMessage": null,
  "Issue": {
    "title": "Clear Cache Function Not Working Properly in Separate Files Mode",
    "description": "The `clear_cache()` function does not appear to properly clear all cache files when caching is configured to use separate files. As a user, when I attempt to clear the cache in this mode, I expect all related cache files to be removed or reset. However, the current implementation of `clear_cache()` does not handle this scenario correctly, resulting in residual cache data. This issue impacts the reliability of cache clearing functionality and could lead to stale or incorrect data being used.",
    "explanation": "### Summary of the Issue\n\nThe issue reported is that the `clear_cache()` function does not work properly when caching is configured to use separate files. When a user tries to clear the cache in this mode, not all related cache files are removed, resulting in residual cache data. This leads to stale or incorrect data being used, impacting the reliability of the cache clearing functionality.\n\n### Details of the Commit\n\nThe commit in question modifies the behavior of the `clear_cache()` method for the pickle-based caching core. The change replaces the placeholder code with a call to an internal method that handles the deletion of all cache files individually when separate file caching is enabled.\n\n### Explanation of the Cause and Solution\n\n#### Cause of the Issue\n\nWhen the caching is configured to use separate files, the `clear_cache()` method was not implemented properly. It effectively did nothing (`pass` statement), which meant that none of the cache files were actually being deleted when this method was called. This resulted in residual cache files being left behind, and subsequent function calls could incorrectly use these stale files, thus leading to data inconsistencies.\n\nThis scenario arises because, in separate files mode, each set of function arguments results in a distinct cache file. Therefore, clearing the cache involves deleting all these individual files, which was not happening due to the unimplemented `clear_cache()` method.\n\n#### Solution Implementation\n\nThe commit addresses this issue by implementing the logic to clear all the separate cache files. Instead of having a `pass` statement, the `clear_cache()` method now calls `_clear_all_cache_files()` (presumably a private method that handles the file deletion process).\n\nHere’s a step-by-step explanation of how the commit resolves the issue:\n\n1. **Identifying Files to Delete**: The solution involves identifying all the separate cache files associated with the function. Each cache file corresponds to a specific set of function arguments.\n\n2. **Deleting Files**: The `_clear_all_cache_files()` method iterates over these files and deletes them, thereby ensuring no residual cache data is left.\n\n3. **Ensuring Robustness**: By replacing the placeholder `pass` statement with a call to a method that performs the necessary deletions, the implementation ensures that all possible cache files are accounted for and properly deleted.\n\n### Solution Explanation\n\nThe solution revolves around correctly implementing the deletion mechanism for caches stored in separate files. By introducing the logic to iterate over and delete each cache file, the `clear_cache()` method now effectively clears all cached data as expected by the users. This ensures that stale or outdated data does not persist, maintaining the integrity and consistency of the function outputs post-cache clearing.\n\nIn summary:\n\n1. **Diagnosis**: The original problem was due to the lack of implementation for clearing multiple cache files.\n2. **Commit Change**: The commit adds a call to a utility method that handles the deletion of these files.\n3. **Effectiveness**: This fix ensures that calling `clear_cache()` will remove all cache files, thus solving the problem of residual stale data.\n\nBy addressing the root cause (unimplemented cache clearing in separate files mode), the commit ensures the correct functionality of the `clear_cache()` method across different caching configurations. This makes the cache clearing reliable and consistent with user expectations."
  },
  "Explain": "### Summary of the Issue\n\nThe issue reported is that the `clear_cache()` function does not work properly when caching is configured to use separate files. When a user tries to clear the cache in this mode, not all related cache files are removed, resulting in residual cache data. This leads to stale or incorrect data being used, impacting the reliability of the cache clearing functionality.\n\n### Details of the Commit\n\nThe commit in question modifies the behavior of the `clear_cache()` method for the pickle-based caching core. The change replaces the placeholder code with a call to an internal method that handles the deletion of all cache files individually when separate file caching is enabled.\n\n### Explanation of the Cause and Solution\n\n#### Cause of the Issue\n\nWhen the caching is configured to use separate files, the `clear_cache()` method was not implemented properly. It effectively did nothing (`pass` statement), which meant that none of the cache files were actually being deleted when this method was called. This resulted in residual cache files being left behind, and subsequent function calls could incorrectly use these stale files, thus leading to data inconsistencies.\n\nThis scenario arises because, in separate files mode, each set of function arguments results in a distinct cache file. Therefore, clearing the cache involves deleting all these individual files, which was not happening due to the unimplemented `clear_cache()` method.\n\n#### Solution Implementation\n\nThe commit addresses this issue by implementing the logic to clear all the separate cache files. Instead of having a `pass` statement, the `clear_cache()` method now calls `_clear_all_cache_files()` (presumably a private method that handles the file deletion process).\n\nHere’s a step-by-step explanation of how the commit resolves the issue:\n\n1. **Identifying Files to Delete**: The solution involves identifying all the separate cache files associated with the function. Each cache file corresponds to a specific set of function arguments.\n\n2. **Deleting Files**: The `_clear_all_cache_files()` method iterates over these files and deletes them, thereby ensuring no residual cache data is left.\n\n3. **Ensuring Robustness**: By replacing the placeholder `pass` statement with a call to a method that performs the necessary deletions, the implementation ensures that all possible cache files are accounted for and properly deleted.\n\n### Solution Explanation\n\nThe solution revolves around correctly implementing the deletion mechanism for caches stored in separate files. By introducing the logic to iterate over and delete each cache file, the `clear_cache()` method now effectively clears all cached data as expected by the users. This ensures that stale or outdated data does not persist, maintaining the integrity and consistency of the function outputs post-cache clearing.\n\nIn summary:\n\n1. **Diagnosis**: The original problem was due to the lack of implementation for clearing multiple cache files.\n2. **Commit Change**: The commit adds a call to a utility method that handles the deletion of these files.\n3. **Effectiveness**: This fix ensures that calling `clear_cache()` will remove all cache files, thus solving the problem of residual stale data.\n\nBy addressing the root cause (unimplemented cache clearing in separate files mode), the commit ensures the correct functionality of the `clear_cache()` method across different caching configurations. This makes the cache clearing reliable and consistent with user expectations.",
  "Time": "2024-06-02",
  "Difficulty": "Simple",
  "OriginCode": [
    {
      "path": "python-cachier_cachier/MANIFEST.in",
      "content": "# Manifest syntax https://docs.python.org/2/distutils/sourcedist.html\ngraft wheelhouse\n\nrecursive-include src *.info\ninclude README.rst\ninclude LICENSE\n\nprune __pycache__\nprune tests\n"
    },
    {
      "path": "python-cachier_cachier/README.rst",
      "content": "Cachier\n#######\n\n|PyPI-Status| |Downloads| |PyPI-Versions| |Build-Status| |Codecov| |Codefactor| |LICENCE|\n\nPersistent, stale-free, local and cross-machine caching for Python functions.\n\n.. code-block:: python\n\n  from cachier import cachier\n  import datetime\n\n  @cachier(stale_after=datetime.timedelta(days=3))\n  def foo(arg1, arg2):\n    \"\"\"foo now has a persistent cache, triggering recalculation for values stored more than 3 days.\"\"\"\n    return {'arg1': arg1, 'arg2': arg2}\n\n\n.. role:: python(code)\n  :language: python\n\n.. contents::\n\n.. section-numbering:\n\n\n\nInstallation\n============\n\nInstall ``cachier`` with:\n\n.. code-block:: python\n\n    pip install cachier\n\nFor the latest version supporting Python 2.7 please use:\n\n.. code-block:: python\n\n    pip install 'cachier==1.2.8'\n\nFeatures\n========\n\n* Pure Python.\n* Compatible with Python 3.8+ (Python 2.7 was discontinued in version 1.2.8).\n* Supported and `tested on Linux, OS X and Windows <https://travis-ci.org/shaypal5/cachier>`_.\n* A simple interface.\n* Defining \"shelf life\" for cached values.\n* Local caching using pickle files.\n* Cross-machine caching using MongoDB.\n* Thread-safety.\n\nCachier is **NOT**:\n\n* Meant as a transient cache. Python's @lru_cache is better.\n* Especially fast. It is meant to replace function calls that take more than... a second, say (overhead is around 1 millisecond).\n\nFuture features\n---------------\n\n* S3 core.\n* Multi-core caching.\n* `Cache replacement policies <https://en.wikipedia.org/wiki/Cache_replacement_policies>`_\n\n\nUse\n===\n\nCachier provides a decorator which you can wrap around your functions to give them a persistent cache. The positional and keyword arguments to the wrapped function must be hashable (i.e. Python's immutable built-in objects, not mutable containers). Also, notice that since objects which are instances of user-defined classes are hashable but all compare unequal (their hash value is their id), equal objects across different sessions will not yield identical keys.\n\nSetting up a Cache\n------------------\nYou can add a default, pickle-based, persistent cache to your function - meaning it will last across different Python kernels calling the wrapped function - by decorating it with the ``cachier`` decorator (notice the ``()``!).\n\n.. code-block:: python\n\n  from cachier import cachier\n\n  @cachier()\n  def foo(arg1, arg2):\n    \"\"\"Your function now has a persistent cache mapped by argument values!\"\"\"\n    return {'arg1': arg1, 'arg2': arg2}\n\nClass and object methods can also be cached. Cachier will automatically ignore the `self` parameter when determining the cache key for an object method. **This means that methods will be cached across all instances of an object, which may not be what you want.**\n\n.. code-block:: python\n\n  from cachier import cachier\n\n  class Foo():\n    @staticmethod\n    @cachier()\n    def good_static_usage(arg_1, arg_2):\n      return arg_1 + arg_2\n\n    # Instance method does not depend on object's internal state, so good to cache\n    @cachier()\n    def good_usage_1(self, arg_1, arg_2):\n      return arg_1 + arg_2\n\n    # Instance method is calling external service, probably okay to cache\n    @cachier()\n    def good_usage_2(self, arg_1, arg_2):\n      result = self.call_api(arg_1, arg_2)\n      return result\n\n    # Instance method relies on object attribute, NOT good to cache\n    @cachier()\n    def bad_usage(self, arg_1, arg_2):\n      return arg_1 + arg_2 + self.arg_3\n\n\nResetting a Cache\n-----------------\nThe Cachier wrapper adds a ``clear_cache()`` function to each wrapped function. To reset the cache of the wrapped function simply call this method:\n\n.. code-block:: python\n\n  foo.clear_cache()\n\nGeneral Configuration\n----------------------\n\nGlobal Defaults\n~~~~~~~~~~~~~~~\n\nSettings can be globally configured across all Cachier wrappers through the use of the `set_default_params` function. This function takes the same keyword parameters as the ones defined in the decorator, which can be passed all at once or with multiple calls. Parameters given directly to a decorator take precedence over any values set by this function.\n\nThe following parameters will only be applied to decorators defined after `set_default_params` is called:\n\n*  `hash_func`\n*  `backend`\n*  `mongetter`\n*  `cache_dir`\n*  `pickle_reload`\n*  `separate_files`\n\nThese parameters can be changed at any time and they will apply to all decorators:\n\n*  `allow_none`\n*  `caching_enabled`\n*  `stale_after`\n*  `next_time`\n*  `wait_for_calc_timeout`\n\nThe current defaults can be fetched by calling `get_default_params`.\n\nThreads Limit\n~~~~~~~~~~~~~\n\nTo limit the number of threads Cachier is allowed to spawn, set the ``CACHIER_MAX_WORKERS`` with the desired number. The default is 8, so to enable Cachier to spawn even more threads, you'll have to set a higher limit explicitly.\n\n\nGlobal Enable/Disable\n---------------------\n\nCaching can be turned off across all decorators by calling `disable_caching`, and then re-activated by calling `enable_caching`.\n\nThese functions are convenience wrappers around the `caching_enabled` default setting.\n\n\nCache Shelf Life\n----------------\n\nSetting Shelf Life\n~~~~~~~~~~~~~~~~~~\nYou can set any duration as the shelf life of cached return values of a function by providing a corresponding ``timedelta`` object to the ``stale_after`` parameter:\n\n.. code-block:: python\n\n  import datetime\n\n  @cachier(stale_after=datetime.timedelta(weeks=2))\n  def bar(arg1, arg2):\n    return {'arg1': arg1, 'arg2': arg2}\n\nNow when a cached value matching the given arguments is found the time of its calculation is checked; if more than ``stale_after`` time has since passed, the function will be run again for the same arguments and the new value will be cached and returned.\n\nThis is useful for lengthy calculations that depend on a dynamic data source.\n\nFuzzy Shelf Life\n~~~~~~~~~~~~~~~~\nSometimes you may want your function to trigger a calculation when it encounters a stale result, but still not wait on it if it's not that critical. In that case, you can set ``next_time`` to ``True`` to have your function trigger a recalculation **in a separate thread**, but return the currently cached stale value:\n\n.. code-block:: python\n\n  @cachier(next_time=True)\n\nFurther function calls made while the calculation is being performed will not trigger redundant calculations.\n\n\nWorking with unhashable arguments\n---------------------------------\n\nAs mentioned above, the positional and keyword arguments to the wrapped function must be hashable (i.e. Python's immutable built-in objects, not mutable containers). To get around this limitation the ``hash_func`` parameter of the ``cachier`` decorator can be provided with a callable that gets the args and kwargs from the decorated function and returns a hash key for them.\n\n.. code-block:: python\n\n  def calculate_hash(args, kwds):\n    key = ...  # compute a hash key here based on arguments\n    return key\n\n  @cachier(hash_func=calculate_hash)\n  def calculate_super_complex_stuff(custom_obj):\n    # amazing code goes here\n\nSee here for an example:\n\n`Question: How to work with unhashable arguments <https://github.com/python-cachier/cachier/issues/91>`_\n\n\nPrecaching values\n---------------------------------\n\nIf you want to load a value into the cache without calling the underlying function, this can be done with the `precache_value` function.\n\n.. code-block:: python\n\n  @cachier()\n  def add(arg1, arg2):\n    return arg1 + arg2\n\n  add.precache_value(2, 2, value_to_cache=5)\n\n  result = add(2, 2)\n  print(result)  # prints 5\n\n\nPer-function call arguments\n---------------------------\n\nCachier also accepts several keyword arguments in the calls of the function it wraps rather than in the decorator call, allowing you to modify its behaviour for a specific function call.\n\nIgnore Cache\n~~~~~~~~~~~~\n\nYou can have ``cachier`` ignore any existing cache for a specific function call by passing ``cachier__skip_cache=True`` to the function call. The cache will neither be checked nor updated with the new return value.\n\n.. code-block:: python\n\n  @cachier()\n  def sum(first_num, second_num):\n    return first_num + second_num\n\n  def main():\n    print(sum(5, 3, cachier__skip_cache=True))\n\nOverwrite Cache\n~~~~~~~~~~~~~~~\n\nYou can have ``cachier`` overwrite an existing cache entry - if one exists - for a specific function call by passing ``cachier__overwrite_cache=True`` to the function call. The cache will not be checked but will be updated with the new return value.\n\nVerbose Cache Call\n~~~~~~~~~~~~~~~~~~\n\nYou can have ``cachier`` print out a detailed explanation of the logic of a specific call by passing ``cachier__verbose=True`` to the function call. This can be useful if you are not sure why a certain function result is, or is not, returned.\n\nCache `None` Values\n~~~~~~~~~~~~~~~~~~~\n\nBy default, ``cachier`` does not cache ``None`` values. You can override this behaviour by passing ``allow_none=True`` to the function call.\n\n\nCachier Cores\n=============\n\nPickle Core\n-----------\n\nThe default core for Cachier is pickle based, meaning each function will store its cache is a separate pickle file in the ``~/.cachier`` directory. Naturally, this kind of cache is both machine-specific and user-specific.\n\nYou can configure ``cachier`` to use another directory by providing the ``cache_dir`` parameter with the path to that directory:\n\n.. code-block:: python\n\n  @cachier(cache_dir='~/.temp/.cache')\n\n\nYou can slightly optimise pickle-based caching if you know your code will only be used in a single thread environment by setting:\n\n.. code-block:: python\n\n  @cachier(pickle_reload=False)\n\nThis will prevent reading the cache file on each cache read, speeding things up a bit, while also nullifying inter-thread functionality (the code is still thread safe, but different threads will have different versions of the cache at times, and will sometime make unnecessary function calls).\n\nSetting the optional argument ``separate_files`` to ``True`` will cause the cache to be stored in several files: A file per argument set, per function. This can help if your per-function cache files become too large.\n\n.. code-block:: python\n\n  from cachier import cachier\n\n  @cachier(separate_files=True)\n  def foo(arg1, arg2):\n    \"\"\"Your function now has a persistent cache mapped by argument values, split across several files, per argument set\"\"\"\n    return {'arg1': arg1, 'arg2': arg2}\n\nYou can get the fully qualified path to the directory of cache files used by ``cachier`` (``~/.cachier`` by default) by calling the ``cache_dpath()`` function:\n\n.. code-block:: python\n\n  >>> foo.cache_dpath()\n      \"/home/user/.cachier/\"\n\n\nMongoDB Core\n------------\nYou can set a MongoDB-based cache by assigning ``mongetter`` with a callable that returns a ``pymongo.Collection`` object with writing permissions:\n\n.. code-block:: python\n\n    from pymongo import MongoClient\n\n    def my_mongetter():\n        client = MongoClient(get_cachier_db_auth_uri())\n        db_obj = client['cachier_db']\n        if 'someapp_cachier_db' not in db_obj.list_collection_names():\n            db_obj.create_collection('someapp_cachier_db')\n        return db_obj['someapp_cachier_db']\n\n  @cachier(mongetter=my_mongetter)\n\nThis allows you to have a cross-machine, albeit slower, cache. This functionality requires that the installation of the ``pymongo`` python package.\n\nIn certain cases the MongoDB backend might leave a deadlock behind, blocking all subsequent requests from being processed. If you encounter this issue, supply the ``wait_for_calc_timeout`` with a reasonable number of seconds; calls will then wait at most this number of seconds before triggering a recalculation.\n\n.. code-block:: python\n\n  @cachier(mongetter=False, wait_for_calc_timeout=2)\n\n\nMemory Core\n-----------\n\nYou can set an in-memory cache by assigning the ``backend`` parameter with ``'memory'``:\n\n.. code-block:: python\n\n  @cachier(backend='memory')\n\nNote, however, that ``cachier``'s in-memory core is simple, and has no monitoring or cap on cache size, and can thus lead to memory errors on large return values - it is mainly intended to be used with future multi-core functionality. As a rule, Python's built-in ``lru_cache`` is a much better stand-alone solution.\n\n\nContributing\n============\n\nCurrent maintainers are Shay Palachy Affek (`shay.palachy@gmail.com <mailto:shay.palachy@gmail.com>`_, `@shaypal5 <https://github.com/shaypal5>`_) and Judson Neer (`@lordjabez <https://github.com/lordjabez>`_); You are more than welcome to approach them for help. Contributions are very welcomed! :)\n\nInstalling for development\n--------------------------\n\nClone:\n\n.. code-block:: bash\n\n  git clone git@github.com:python-cachier/cachier.git\n\n\nInstall in development mode with test dependencies:\n\n.. code-block:: bash\n\n  cd cachier\n  pip install -e . -r tests/requirements.txt\n\n\nRunning the tests\n-----------------\n\nTo run the tests, call the ``pytest`` command in the repository's root, or:\n\n.. code-block:: bash\n\n  python -m pytest\n\nTo run only MongoDB core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m mongo\n\nTo run only memory core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m memory\n\nTo run all tests EXCEPT MongoDB core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m \"not mongo\"\n\n\nTo run all tests EXCEPT memory core AND MongoDB core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m \"not (mongo or memory)\"\n\n\nRunning MongoDB tests against a live MongoDB instance\n-----------------------------------------------------\n\n**Note to developers:** By default, all MongoDB tests are run against a mocked MongoDB instance, provided by the ``pymongo_inmemory`` package. To run them against a live MongoDB instance, the ``CACHIER_TEST_VS_DOCKERIZED_MONGO`` environment variable is set to ``True`` in the ``test`` environment of this repository (and additional environment variables are populated with the appropriate credentials), used by the GitHub Action running tests on every commit and pull request.\n\nContributors are not expected to run these tests against a live MongoDB instance when developing, as credentials for the testing instance used will NOT be shared, but rather use the testing against the in-memory MongoDB instance as a good proxy.\n\n**HOWEVER, the tests run against a live MongoDB instance when you submit a PR are the determining tests for deciding whether your code functions correctly against MongoDB.**\n\n\nAdding documentation\n--------------------\n\nThis project is documented using the `numpy docstring conventions`_, which were chosen as they are perhaps the most widely-spread conventions that are both supported by common tools such as Sphinx and result in human-readable docstrings (in my personal opinion, of course). When documenting code you add to this project, please follow `these conventions`_.\n\n.. _`numpy docstring conventions`: https://github.com/numpy/numpy/blob/master/doc/HOWTO_DOCUMENT.rst.txt\n.. _`these conventions`: https://github.com/numpy/numpy/blob/master/doc/HOWTO_DOCUMENT.rst.txt\n\nAdditionally, if you update this ``README.rst`` file, use ``python setup.py checkdocs`` to validate it compiles.\n\n\nCredits\n=======\n\nCreated by `Shay Palachy Affek <https://github.com/shaypal5>`_ (shay.palachy@gmail.com), which currently assists in maintenance.\n\nCurrent lead developer/contributor: `Jirka Borovec <https://github.com/Borda>`_ (`@Borda <https://github.com/Borda>`_ on GitHub).\n\nOther major contributors:\n\n* `Jirka Borovec <https://github.com/Borda>`_ - Arg order independence, args-to-kwargs for less unique keys and numerous development and CI contributions.\n\n* `Judson Neer <https://github.com/lordjabez>`_ - Precaching, method caching support and numerous improvements and bugfixes.\n\n* `cthoyt <https://github.com/cthoyt>`_ - Base memory core implementation.\n\n* `amarczew <https://github.com/amarczew>`_ - The ``hash_func`` kwarg.\n\n* `non-senses <https://github.com/non-senses>`_ - The ``wait_for_calc_timeout`` kwarg.\n\n* `Elad Rapaport <https://github.com/erap129>`_ - Multi-file Pickle core, a.k.a ``separate_files`` (released on ``v1.5.3``).\n\n* `John Didion <https://github.com/jdidion>`_ - Support for pickle-based caching for cases where two identically-named methods of different classes are defined in the same module.\n\nNotable bugfixers:\n\n* `MichaelRazum <https://github.com/MichaelRazum>`_.\n\n* `Eric Ma <https://github.com/ericmjl>`_ - The iNotify bugfix (released on ``v1.5.3``).\n\n* `Ofir <https://github.com/ofirnk>`_ - The iNotify bugfix (released on ``v1.5.3``).\n\n\n\n.. |PyPI-Status| image:: https://img.shields.io/pypi/v/cachier.svg\n  :target: https://pypi.python.org/pypi/cachier\n\n.. |PyPI-Versions| image:: https://img.shields.io/pypi/pyversions/cachier.svg\n   :target: https://pypi.python.org/pypi/cachier\n\n.. |Build-Status| image:: https://github.com/python-cachier/cachier/actions/workflows/test.yml/badge.svg\n  :target: https://github.com/python-cachier/cachier/actions/workflows/test.yml\n\n.. |LICENCE| image:: https://img.shields.io/pypi/l/cachier.svg\n  :target: https://pypi.python.org/pypi/cachier\n\n.. |Codecov| image:: https://codecov.io/github/python-cachier/cachier/coverage.svg?branch=master\n   :target: https://codecov.io/github/python-cachier/cachier?branch=master\n\n.. |Downloads| image:: https://pepy.tech/badge/cachier\n     :target: https://pepy.tech/project/cachier\n     :alt: PePy stats\n\n.. |Codefactor| image:: https://www.codefactor.io/repository/github/python-cachier/cachier/badge?style=plastic\n     :target: https://www.codefactor.io/repository/github/python-cachier/cachier\n     :alt: Codefactor code quality\n\n.. links:\n.. _pymongo: https://api.mongodb.com/python/current/\n.. _watchdog: https://github.com/gorakhargosh/watchdog\n"
    },
    {
      "path": "python-cachier_cachier/LICENSE",
      "content": "MIT License\n\nCopyright (c) 2016 Shay Palachy\n\nPermission is hereby granted, free of charge, to any person obtaining a copy\nof this software and associated documentation files (the \"Software\"), to deal\nin the Software without restriction, including without limitation the rights\nto use, copy, modify, merge, publish, distribute, sublicense, and/or sell\ncopies of the Software, and to permit persons to whom the Software is\nfurnished to do so, subject to the following conditions:\n\nThe above copyright notice and this permission notice shall be included in all\ncopies or substantial portions of the Software.\n\nTHE SOFTWARE IS PROVIDED \"AS IS\", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR\nIMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,\nFITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE\nAUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER\nLIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,\nOUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE\nSOFTWARE.\n"
    },
    {
      "path": "python-cachier_cachier/pyproject.toml",
      "content": "[project]\nname = \"cachier\"\ndescription = \"Persistent, stale-free, local and cross-machine caching for Python functions.\"\nreadme = \"README.rst\"\nkeywords = [\n  \"cache\",\n  \"caching\",\n  \"cross-machine\",\n  \"decorator\",\n  \"local\",\n  \"memoization\",\n  \"mongo\",\n  \"persistent\",\n]\nlicense = {file = \"LICENSE\"}\nauthors = [\n   {name = \"Shay Palachy Affek\", email = 'shay.palachy@gmail.com'},\n]\nclassifiers = [\n  'Development Status :: 4 - Beta',\n  'Intended Audience :: Developers',\n  'License :: OSI Approved :: MIT License',\n  'Programming Language :: Python',\n  \"Programming Language :: Python :: 3 :: Only\",\n  \"Programming Language :: Python :: 3.8\",\n  \"Programming Language :: Python :: 3.9\",\n  \"Programming Language :: Python :: 3.10\",\n  \"Programming Language :: Python :: 3.11\",\n  \"Programming Language :: Python :: 3.12\",\n  \"Topic :: Other/Nonlisted Topic\",\n  \"Topic :: Software Development :: Libraries\",\n  \"Topic :: Software Development :: Libraries :: Python Modules\",\n  \"Topic :: Utilities\",\n]\ndynamic = [\n  \"version\",\n]\ndependencies = [\n  \"portalocker>=2.3.2\",\n  \"watchdog>=2.3.1\",\n]\n[project.urls]\nSource = \"https://github.com/python-cachier/cachier\"\n[project.scripts]\ncachier = \"cachier.__naim__:cli\"\n\n[tool.setuptools]\ninclude-package-data = true\n\n[tool.setuptools.dynamic]\nversion = {attr = \"cachier._version.__version__\"}\n\n[tool.setuptools.packages.find]\nwhere = [\"src\"]  # list of folders that contain the packages ([\".\"] by default)\ninclude = [\"cachier*\"]  # package names should match these glob patterns ([\"*\"] by default)\nnamespaces = false  # to disable scanning PEP 420 namespaces (true by default)\n\n[build-system]\nrequires = [\n  \"setuptools\",\n  \"wheel\",\n]\n\n[tool.pytest.ini_options]\ntestpaths = [\"cachier\", \"tests\"]\nnorecursedirs=[\"dist\", \"build\"]\naddopts = [\n    \"--color=yes\",\n    \"--cov=cachier\",\n    \"--cov-report=term\",\n    \"--cov-report=xml:cov.xml\",\n    \"-r a\",\n    \"-v\",\n    \"-s\",\n]\nmarkers = [\n    \"mongo: test the MongoDB core\",\n    \"memory: test the memory core\",\n    \"pickle: test the pickle core\"\n]\n\n[tool.docformatter]\nrecursive = true\n# some docstring start with r\"\"\"\nwrap-summaries = 79\nwrap-descriptions = 79\nblank = true\n\n[tool.ruff]\ntarget-version = \"py38\"\nline-length = 79\n# Enable Pyflakes `E` and `F` codes by default.\nlint.select = [\n    \"E\",\n    \"W\", # see: https://pypi.org/project/pycodestyle\n    \"F\", # see: https://pypi.org/project/pyflakes\n    \"I\", #see: https://pypi.org/project/isort/\n    \"D\", # see: https://pypi.org/project/pydocstyle\n    #\"N\", # see: https://pypi.org/project/pep8-naming\n    \"S\", # see: https://pypi.org/project/flake8-bandit\n    \"SIM\",\n    \"RUF100\" # alternative to yesqa\n]\nlint.extend-select = [\n    \"A\",    # see: https://pypi.org/project/flake8-builtins\n    \"B\",    # see: https://pypi.org/project/flake8-bugbear\n    \"C4\",   # see: https://pypi.org/project/flake8-comprehensions\n]\nlint.ignore = [\n    \"E203\",\n    \"C901\",\n]\n# Exclude a variety of commonly ignored directories.\nexclude = [\n    \".eggs\",\n    \".git\",\n    \".ruff_cache\",\n    \"__pypackages__\",\n    \"_build\",\n    \"build\",\n    \"dist\",\n]\nlint.ignore-init-module-imports = true\nlint.unfixable = [\"F401\"]\n\n[tool.ruff.lint.per-file-ignores]\n\"src/**/__init__.py\" = [\"D104\"]\n\"src/cachier/config.py\" = [\"D100\"]\n\"tests/**\" = [\n  \"S101\", \"S311\", \"S105\", \"S603\",\n  \"D100\", \"D101\", \"D103\", \"D104\", \"D401\"\n]\n\n#[tool.ruff.pydocstyle]\n## Use Google-style docstrings.\n#convention = \"google\"\n\n#[tool.ruff.pycodestyle]\n#ignore-overlong-task-comments = true\n\n[tool.ruff.lint.mccabe]\n# Unlike Flake8, default to a complexity level of 10.\nmax-complexity = 10\n\n[tool.coverage.run]\nbranch = true\ndynamic_context = \"test_function\"\nomit = [\n    \"tests/*\",\n    \"cachier/_version.py\",\n    \"cachier/__init__.py\",\n    \"**/scripts/**\",\n]\n[tool.coverage.report]\nshow_missing = true\n# Regexes for lines to exclude from consideration\nexclude_lines = [\n    \"pragma: no cover\", # Have to re-enable the standard pragma\n    \"raise NotImplementedError\", # Don't complain if tests don't hit defensive assertion code:\n    \"if TYPE_CHECKING:\", # Is only true when running mypy, not tests\n]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/__init__.py",
      "content": "from ._version import *  # noqa: F403\nfrom .config import (\n    disable_caching,\n    enable_caching,\n    get_default_params,\n    set_default_params,\n)\nfrom .core import cachier\n\n__all__ = [\n    \"cachier\",\n    \"set_default_params\",\n    \"get_default_params\",\n    \"enable_caching\",\n    \"disable_caching\",\n]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/config.py",
      "content": "import datetime\nimport hashlib\nimport os\nimport pickle\nfrom typing import Optional, TypedDict, Union\n\nfrom ._types import Backend, HashFunc, Mongetter\n\n\ndef _default_hash_func(args, kwds):\n    # Sort the kwargs to ensure consistent ordering\n    sorted_kwargs = sorted(kwds.items())\n    # Serialize args and sorted_kwargs using pickle or similar\n    serialized = pickle.dumps((args, sorted_kwargs))\n    # Create a hash of the serialized data\n    return hashlib.sha256(serialized).hexdigest()\n\n\nclass Params(TypedDict):\n    \"\"\"Type definition for cachier parameters.\"\"\"\n\n    caching_enabled: bool\n    hash_func: HashFunc\n    backend: Backend\n    mongetter: Optional[Mongetter]\n    stale_after: datetime.timedelta\n    next_time: bool\n    cache_dir: Union[str, os.PathLike]\n    pickle_reload: bool\n    separate_files: bool\n    wait_for_calc_timeout: int\n    allow_none: bool\n\n\n_default_params: Params = {\n    \"caching_enabled\": True,\n    \"hash_func\": _default_hash_func,\n    \"backend\": \"pickle\",\n    \"mongetter\": None,\n    \"stale_after\": datetime.timedelta.max,\n    \"next_time\": False,\n    \"cache_dir\": \"~/.cachier/\",\n    \"pickle_reload\": True,\n    \"separate_files\": False,\n    \"wait_for_calc_timeout\": 0,\n    \"allow_none\": False,\n}\n\n\ndef _update_with_defaults(\n    param, name: str, func_kwargs: Optional[dict] = None\n):\n    import cachier\n\n    if func_kwargs:\n        kw_name = f\"cachier__{name}\"\n        if kw_name in func_kwargs:\n            return func_kwargs.pop(kw_name)\n    if param is None:\n        return cachier.config._default_params[name]\n    return param\n\n\ndef set_default_params(**params):\n    \"\"\"Configure global parameters applicable to all memoized functions.\n\n    This function takes the same keyword parameters as the ones defined in the\n    decorator, which can be passed all at once or with multiple calls.\n    Parameters given directly to a decorator take precedence over any values\n    set by this function.\n\n    Only 'stale_after', 'next_time', and 'wait_for_calc_timeout' can be changed\n    after the memoization decorator has been applied. Other parameters will\n    only have an effect on decorators applied after this function is run.\n\n    \"\"\"\n    import cachier\n\n    valid_params = (\n        p for p in params.items() if p[0] in cachier.config._default_params\n    )\n    _default_params.update(valid_params)\n\n\ndef get_default_params():\n    \"\"\"Get current set of default parameters.\"\"\"\n    import cachier\n\n    return cachier.config._default_params\n\n\ndef enable_caching():\n    \"\"\"Enable caching globally.\"\"\"\n    import cachier\n\n    cachier.config._default_params[\"caching_enabled\"] = True\n\n\ndef disable_caching():\n    \"\"\"Disable caching globally.\"\"\"\n    import cachier\n\n    cachier.config._default_params[\"caching_enabled\"] = False\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/_version.py",
      "content": "# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2024, Jirka Borovec <***@gmail.com>\n\nimport os\n\n_PATH_HERE = os.path.dirname(__file__)\n_PATH_VERSION = os.path.join(_PATH_HERE, \"version.info\")\n_RELEASING_PROCESS = os.getenv(\"RELEASING_PROCESS\", \"0\") == \"1\"\n\nwith open(_PATH_VERSION) as fopen:\n    __version__ = fopen.read().strip()\n\n\ndef _get_git_sha() -> str:\n    from subprocess import DEVNULL, check_output\n\n    out = check_output([\"git\", \"rev-parse\", \"--short\", \"HEAD\"], stderr=DEVNULL)  # noqa: S603, S607\n    return out.decode(\"utf-8\").strip()\n\n\nif not _RELEASING_PROCESS:\n    try:\n        sha_short = _get_git_sha()\n        # print(f\"Version enriched with git commit hash: {__version__}.\")\n    except Exception:\n        # print(\"Failed to get the git commit hash,\"\n        #       f\" falling back to base version {__version__}.\")\n        sha_short = \"\"\n    __version__ += f\".dev+{sha_short}\" if sha_short else \".dev\"\n\n\n__all__ = [\"__version__\"]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/version.info",
      "content": "3.0.0\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/py.typed",
      "content": ""
    },
    {
      "path": "python-cachier_cachier/src/cachier/_types.py",
      "content": "from typing import TYPE_CHECKING, Callable, Literal\n\nif TYPE_CHECKING:\n    import pymongo.collection\n\n\nHashFunc = Callable[..., str]\nMongetter = Callable[[], \"pymongo.collection.Collection\"]\nBackend = Literal[\"pickle\", \"mongo\", \"memory\"]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/__main__.py",
      "content": "\"\"\"A command-line interface for cachier.\"\"\"\n\nimport click\n\nfrom cachier.core import _set_max_workers\n\n\n@click.group()\ndef cli():\n    \"\"\"A command-line interface for cachier.\"\"\"  # noqa: D401\n\n\n@cli.command(\"Limits the number of worker threads used by cachier.\")\n@click.argument(\"max_workers\", nargs=1, type=int)\ndef set_max_workers(max_workers):\n    \"\"\"Limits the number of worker threads used by cachier.\"\"\"\n    _set_max_workers(max_workers)\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/core.py",
      "content": "\"\"\"Persistent, stale-free memoization decorators for Python.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\nimport datetime\nimport inspect\nimport os\nimport warnings\nfrom collections import OrderedDict\nfrom concurrent.futures import ThreadPoolExecutor\nfrom functools import wraps\nfrom typing import Optional, Union\nfrom warnings import warn\n\nfrom .config import (\n    Backend,\n    HashFunc,\n    Mongetter,\n    _default_params,\n    _update_with_defaults,\n)\nfrom .cores.base import RecalculationNeeded, _BaseCore\nfrom .cores.memory import _MemoryCore\nfrom .cores.mongo import _MongoCore\nfrom .cores.pickle import _PickleCore\n\nMAX_WORKERS_ENVAR_NAME = \"CACHIER_MAX_WORKERS\"\nDEFAULT_MAX_WORKERS = 8\n\n\ndef _max_workers():\n    return int(os.environ.get(MAX_WORKERS_ENVAR_NAME, DEFAULT_MAX_WORKERS))\n\n\ndef _set_max_workers(max_workers):\n    os.environ[MAX_WORKERS_ENVAR_NAME] = str(max_workers)\n    _get_executor(True)\n\n\ndef _get_executor(reset=False):\n    if reset or not hasattr(_get_executor, \"executor\"):\n        _get_executor.executor = ThreadPoolExecutor(_max_workers())\n    return _get_executor.executor\n\n\ndef _function_thread(core, key, func, args, kwds):\n    try:\n        func_res = func(*args, **kwds)\n        core.set_entry(key, func_res)\n    except BaseException as exc:\n        print(f\"Function call failed with the following exception:\\n{exc}\")\n\n\ndef _calc_entry(core, key, func, args, kwds):\n    try:\n        core.mark_entry_being_calculated(key)\n        # _get_executor().submit(core.mark_entry_being_calculated, key)\n        func_res = func(*args, **kwds)\n        core.set_entry(key, func_res)\n        # _get_executor().submit(core.set_entry, key, func_res)\n        return func_res\n    finally:\n        core.mark_entry_not_calculated(key)\n\n\ndef _convert_args_kwargs(\n    func, _is_method: bool, args: tuple, kwds: dict\n) -> dict:\n    \"\"\"Convert mix of positional and keyword arguments to aggregated kwargs.\"\"\"\n    # unwrap if the function is functools.partial\n    if hasattr(func, \"func\"):\n        args = func.args + args\n        kwds.update({k: v for k, v in func.keywords.items() if k not in kwds})\n        func = func.func\n    func_params = list(inspect.signature(func).parameters)\n    args_as_kw = dict(\n        zip(func_params[1:], args[1:])\n        if _is_method\n        else zip(func_params, args)\n    )\n    # init with default values\n    kwargs = {\n        k: v.default\n        for k, v in inspect.signature(func).parameters.items()\n        if v.default is not inspect.Parameter.empty\n    }\n    # merge args expanded as kwargs and the original kwds\n    kwargs.update(dict(**args_as_kw, **kwds))\n    return OrderedDict(sorted(kwargs.items()))\n\n\ndef _pop_kwds_with_deprecation(kwds, name: str, default_value: bool):\n    if name in kwds:\n        warnings.warn(\n            f\"`{name}` is deprecated and will be removed in a future release,\"\n            \" use `cachier__` alternative instead.\",\n            DeprecationWarning,\n            stacklevel=2,\n        )\n    return kwds.pop(name, default_value)\n\n\ndef cachier(\n    hash_func: Optional[HashFunc] = None,\n    hash_params: Optional[HashFunc] = None,\n    backend: Optional[Backend] = None,\n    mongetter: Optional[Mongetter] = None,\n    stale_after: Optional[datetime.timedelta] = None,\n    next_time: Optional[bool] = None,\n    cache_dir: Optional[Union[str, os.PathLike]] = None,\n    pickle_reload: Optional[bool] = None,\n    separate_files: Optional[bool] = None,\n    wait_for_calc_timeout: Optional[int] = None,\n    allow_none: Optional[bool] = None,\n):\n    \"\"\"Wrap as a persistent, stale-free memoization decorator.\n\n    The positional and keyword arguments to the wrapped function must be\n    hashable (i.e. Python's immutable built-in objects, not mutable\n    containers). Also, notice that since objects which are instances of\n    user-defined classes are hashable but all compare unequal (their hash\n    value is their id), equal objects across different sessions will not yield\n    identical keys.\n\n    Arguments:\n    ---------\n    hash_func : callable, optional\n        A callable that gets the args and kwargs from the decorated function\n        and returns a hash key for them. This parameter can be used to enable\n        the use of cachier with functions that get arguments that are not\n        automatically hashable by Python.\n    hash_params : callable, optional\n    backend : str, optional\n        The name of the backend to use. Valid options currently include\n        'pickle', 'mongo' and 'memory'. If not provided, defaults to\n        'pickle' unless the 'mongetter' argument is passed, in which\n        case the mongo backend is automatically selected.\n    mongetter : callable, optional\n        A callable that takes no arguments and returns a pymongo.Collection\n        object with writing permissions. If unset a local pickle cache is used\n        instead.\n    stale_after : datetime.timedelta, optional\n        The time delta after which a cached result is considered stale. Calls\n        made after the result goes stale will trigger a recalculation of the\n        result, but whether a stale or fresh result will be returned is\n        determined by the optional next_time argument.\n    next_time : bool, optional\n        If set to True, a stale result will be returned when finding one, not\n        waiting for the calculation of the fresh result to return. Defaults to\n        False.\n    cache_dir : str, optional\n        A fully qualified path to a file directory to be used for cache files.\n        The running process must have running permissions to this folder. If\n        not provided, a default directory at `~/.cachier/` is used.\n    pickle_reload : bool, optional\n        If set to True, in-memory cache will be reloaded on each cache read,\n        enabling different threads to share cache. Should be set to False for\n        faster reads in single-thread programs. Defaults to True.\n    separate_files: bool, default False, for Pickle cores only\n        Instead of a single cache file per-function, each function's cache is\n        split between several files, one for each argument set. This can help\n        if you per-function cache files become too large.\n    wait_for_calc_timeout: int, optional, for MongoDB only\n        The maximum time to wait for an ongoing calculation. When a\n        process started to calculate the value setting being_calculated to\n        True, any process trying to read the same entry will wait a maximum of\n        seconds specified in this parameter. 0 means wait forever.\n        Once the timeout expires the calculation will be triggered.\n    allow_none: bool, optional\n        Allows storing None values in the cache. If False, functions returning\n        None will not be cached and are recalculated every call.\n\n    \"\"\"\n    # Check for deprecated parameters\n    if hash_params is not None:\n        message = (\n            \"hash_params will be removed in a future release, \"\n            \"please use hash_func instead\"\n        )\n        warn(message, DeprecationWarning, stacklevel=2)\n        hash_func = hash_params\n    # Update parameters with defaults if input is None\n    backend = _update_with_defaults(backend, \"backend\")\n    mongetter = _update_with_defaults(mongetter, \"mongetter\")\n    # Override the backend parameter if a mongetter is provided.\n    if callable(mongetter):\n        backend = \"mongo\"\n    core: _BaseCore\n    if backend == \"pickle\":\n        core = _PickleCore(\n            hash_func=hash_func,\n            pickle_reload=pickle_reload,\n            cache_dir=cache_dir,\n            separate_files=separate_files,\n            wait_for_calc_timeout=wait_for_calc_timeout,\n        )\n    elif backend == \"mongo\":\n        core = _MongoCore(\n            hash_func=hash_func,\n            mongetter=mongetter,\n            wait_for_calc_timeout=wait_for_calc_timeout,\n        )\n    elif backend == \"memory\":\n        core = _MemoryCore(\n            hash_func=hash_func, wait_for_calc_timeout=wait_for_calc_timeout\n        )\n    else:\n        raise ValueError(\"specified an invalid core: %s\" % backend)\n\n    def _cachier_decorator(func):\n        core.set_func(func)\n\n        @wraps(func)\n        def func_wrapper(*args, **kwds):\n            nonlocal allow_none\n            _allow_none = _update_with_defaults(allow_none, \"allow_none\", kwds)\n            # print('Inside general wrapper for {}.'.format(func.__name__))\n            ignore_cache = _pop_kwds_with_deprecation(\n                kwds, \"ignore_cache\", False\n            )\n            overwrite_cache = _pop_kwds_with_deprecation(\n                kwds, \"overwrite_cache\", False\n            )\n            verbose = _pop_kwds_with_deprecation(kwds, \"verbose_cache\", False)\n            ignore_cache = kwds.pop(\"cachier__skip_cache\", ignore_cache)\n            overwrite_cache = kwds.pop(\n                \"cachier__overwrite_cache\", overwrite_cache\n            )\n            verbose = kwds.pop(\"cachier__verbose\", verbose)\n            _stale_after = _update_with_defaults(\n                stale_after, \"stale_after\", kwds\n            )\n            _next_time = _update_with_defaults(next_time, \"next_time\", kwds)\n            # merge args expanded as kwargs and the original kwds\n            kwargs = _convert_args_kwargs(\n                func, _is_method=core.func_is_method, args=args, kwds=kwds\n            )\n\n            _print = lambda x: None  # noqa: E731\n            if verbose:\n                _print = print\n            if ignore_cache or not _default_params[\"caching_enabled\"]:\n                return func(**kwargs)\n            key, entry = core.get_entry((), kwargs)\n            if overwrite_cache:\n                return _calc_entry(core, key, func, args, kwds)\n            if entry is None:\n                _print(\"No entry found. No current calc. Calling like a boss.\")\n                return _calc_entry(core, key, func, args, kwds)\n            _print(\"Entry found.\")\n            if _allow_none or entry.get(\"value\", None) is not None:\n                _print(\"Cached result found.\")\n                now = datetime.datetime.now()\n                if now - entry[\"time\"] <= _stale_after:\n                    _print(\"And it is fresh!\")\n                    return entry[\"value\"]\n                _print(\"But it is stale... :(\")\n                if entry[\"being_calculated\"]:\n                    if _next_time:\n                        _print(\"Returning stale.\")\n                        return entry[\"value\"]  # return stale val\n                    _print(\"Already calc. Waiting on change.\")\n                    try:\n                        return core.wait_on_entry_calc(key)\n                    except RecalculationNeeded:\n                        return _calc_entry(core, key, func, args, kwds)\n                if _next_time:\n                    _print(\"Async calc and return stale\")\n                    try:\n                        core.mark_entry_being_calculated(key)\n                        _get_executor().submit(\n                            _function_thread, core, key, func, args, kwds\n                        )\n                    finally:\n                        core.mark_entry_not_calculated(key)\n                    return entry[\"value\"]\n                _print(\"Calling decorated function and waiting\")\n                return _calc_entry(core, key, func, args, kwds)\n            if entry[\"being_calculated\"]:\n                _print(\"No value but being calculated. Waiting.\")\n                try:\n                    return core.wait_on_entry_calc(key)\n                except RecalculationNeeded:\n                    return _calc_entry(core, key, func, args, kwds)\n            _print(\"No entry found. No current calc. Calling like a boss.\")\n            return _calc_entry(core, key, func, args, kwds)\n\n        def _clear_cache():\n            \"\"\"Clear the cache.\"\"\"\n            core.clear_cache()\n\n        def _clear_being_calculated():\n            \"\"\"Mark all entries in this cache as not being calculated.\"\"\"\n            core.clear_being_calculated()\n\n        def _cache_dpath():\n            \"\"\"Return the path to the cache dir, if exists; None if not.\"\"\"\n            return getattr(core, \"cache_dir\", None)\n\n        def _precache_value(*args, value_to_cache, **kwds):  # noqa: D417\n            \"\"\"Add an initial value to the cache.\n\n            Arguments:\n            ---------\n            value_to_cache : any\n                entry to be written into the cache\n\n            \"\"\"\n            # merge args expanded as kwargs and the original kwds\n            kwargs = _convert_args_kwargs(\n                func, _is_method=core.func_is_method, args=args, kwds=kwds\n            )\n            return core.precache_value((), kwargs, value_to_cache)\n\n        func_wrapper.clear_cache = _clear_cache\n        func_wrapper.clear_being_calculated = _clear_being_calculated\n        func_wrapper.cache_dpath = _cache_dpath\n        func_wrapper.precache_value = _precache_value\n        return func_wrapper\n\n    return _cachier_decorator\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/__init__.py",
      "content": ""
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/base.py",
      "content": "\"\"\"Defines the interface of a cachier caching core.\"\"\"\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\nimport abc  # for the _BaseCore abstract base class\nimport inspect\nimport threading\nfrom typing import Callable\n\nfrom .._types import HashFunc\nfrom ..config import _update_with_defaults\n\n\nclass RecalculationNeeded(Exception):\n    \"\"\"Exception raised when a recalculation is needed.\"\"\"\n\n    pass\n\n\ndef _get_func_str(func: Callable) -> str:\n    return f\".{func.__module__}.{func.__name__}\"\n\n\nclass _BaseCore:\n    __metaclass__ = abc.ABCMeta\n\n    def __init__(self, hash_func: HashFunc, wait_for_calc_timeout: int):\n        self.hash_func = _update_with_defaults(hash_func, \"hash_func\")\n        self.wait_for_calc_timeout = wait_for_calc_timeout\n        self.lock = threading.RLock()\n\n    def set_func(self, func):\n        \"\"\"Set the function this core will use.\n\n        This has to be set before any method is called. Also determine if the\n        function is an object method.\n\n        \"\"\"\n        # unwrap if the function is functools.partial\n        if hasattr(func, \"func\"):\n            func = func.func\n        func_params = list(inspect.signature(func).parameters)\n        self.func_is_method = func_params and func_params[0] == \"self\"\n        self.func = func\n\n    def get_key(self, args, kwds):\n        \"\"\"Return a unique key based on the arguments provided.\"\"\"\n        return self.hash_func(args, kwds)\n\n    def get_entry(self, args, kwds):\n        \"\"\"Get entry based on given arguments.\n\n        Return the result mapped to the given arguments in this core's cache,\n        if such a mapping exists.\n\n        \"\"\"\n        key = self.get_key(args, kwds)\n        return self.get_entry_by_key(key)\n\n    def precache_value(self, args, kwds, value_to_cache):\n        \"\"\"Write a precomputed value into the cache.\"\"\"\n        key = self.get_key(args, kwds)\n        self.set_entry(key, value_to_cache)\n        return value_to_cache\n\n    def check_calc_timeout(self, time_spent):\n        \"\"\"Raise an exception if a recalculation is needed.\"\"\"\n        calc_timeout = _update_with_defaults(\n            self.wait_for_calc_timeout, \"wait_for_calc_timeout\"\n        )\n        if calc_timeout > 0 and (time_spent >= calc_timeout):\n            raise RecalculationNeeded()\n\n    @abc.abstractmethod\n    def get_entry_by_key(self, key):\n        \"\"\"Get entry based on given key.\n\n        Return the result mapped to the given key in this core's cache, if such\n        a mapping exists.\n\n        \"\"\"\n\n    @abc.abstractmethod\n    def set_entry(self, key, func_res):\n        \"\"\"Map the given result to the given key in this core's cache.\"\"\"\n\n    @abc.abstractmethod\n    def mark_entry_being_calculated(self, key):\n        \"\"\"Mark the entry mapped by the given key as being calculated.\"\"\"\n\n    @abc.abstractmethod\n    def mark_entry_not_calculated(self, key):\n        \"\"\"Mark the entry mapped by the given key as not being calculated.\"\"\"\n\n    @abc.abstractmethod\n    def wait_on_entry_calc(self, key):\n        \"\"\"Wait on the entry with keys being calculated and returns result.\"\"\"\n\n    @abc.abstractmethod\n    def clear_cache(self):\n        \"\"\"Clear the cache of this core.\"\"\"\n\n    @abc.abstractmethod\n    def clear_being_calculated(self):\n        \"\"\"Mark all entries in this cache as not being calculated.\"\"\"\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/pickle.py",
      "content": "\"\"\"A pickle-based caching core for cachier.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\nimport os\nimport pickle  # for local caching\nfrom contextlib import suppress\nfrom datetime import datetime\n\nimport portalocker  # to lock on pickle cache IO\nfrom watchdog.events import PatternMatchingEventHandler\nfrom watchdog.observers import Observer\n\nfrom .._types import HashFunc\nfrom ..config import _update_with_defaults\n\n# Alternative:  https://github.com/WoLpH/portalocker\nfrom .base import _BaseCore\n\n\nclass _PickleCore(_BaseCore):\n    \"\"\"The pickle core class for cachier.\"\"\"\n\n    class CacheChangeHandler(PatternMatchingEventHandler):\n        \"\"\"Handles cache-file modification events.\"\"\"\n\n        def __init__(self, filename, core, key):\n            PatternMatchingEventHandler.__init__(\n                self,\n                patterns=[\"*\" + filename],\n                ignore_patterns=None,\n                ignore_directories=True,\n                case_sensitive=False,\n            )\n            self.core = core\n            self.key = key\n            self.observer = None\n            self.value = None\n\n        def inject_observer(self, observer):\n            \"\"\"Inject the observer running this handler.\"\"\"\n            self.observer = observer\n\n        def _check_calculation(self):\n            # print('checking calc')\n            entry = self.core.get_entry_by_key(self.key, True)[1]\n            # print(self.key)\n            # print(entry)\n            try:\n                if not entry[\"being_calculated\"]:\n                    # print('stopping observer!')\n                    self.value = entry[\"value\"]\n                    self.observer.stop()\n                # else:\n                # print('NOT stopping observer... :(')\n            except TypeError:\n                self.value = None\n                self.observer.stop()\n\n        def on_created(self, event):\n            \"\"\"A Watchdog Event Handler method.\"\"\"  # noqa: D401\n            self._check_calculation()  # pragma: no cover\n\n        def on_modified(self, event):\n            \"\"\"A Watchdog Event Handler method.\"\"\"  # noqa: D401\n            self._check_calculation()\n\n    def __init__(\n        self,\n        hash_func: HashFunc,\n        pickle_reload: bool,\n        cache_dir: str,\n        separate_files: bool,\n        wait_for_calc_timeout: int,\n    ):\n        super().__init__(hash_func, wait_for_calc_timeout)\n        self.cache = None\n        self.reload = _update_with_defaults(pickle_reload, \"pickle_reload\")\n        self.cache_dir = os.path.expanduser(\n            _update_with_defaults(cache_dir, \"cache_dir\")\n        )\n        self.separate_files = _update_with_defaults(\n            separate_files, \"separate_files\"\n        )\n\n    @property\n    def cache_fname(self) -> str:\n        fname = f\".{self.func.__module__}.{self.func.__qualname__}\"\n        return fname.replace(\"<\", \"_\").replace(\">\", \"_\")\n\n    @property\n    def cache_fpath(self) -> str:\n        os.makedirs(self.cache_dir, exist_ok=True)\n        return os.path.abspath(\n            os.path.join(os.path.realpath(self.cache_dir), self.cache_fname)\n        )\n\n    def _reload_cache(self):\n        with self.lock:\n            try:\n                with portalocker.Lock(\n                    self.cache_fpath, mode=\"rb\"\n                ) as cache_file:\n                    self.cache = pickle.load(cache_file)  # noqa: S301\n            except (FileNotFoundError, EOFError):\n                self.cache = {}\n\n    def _get_cache(self):\n        with self.lock:\n            if not self.cache:\n                self._reload_cache()\n            return self.cache\n\n    def _get_cache_by_key(self, key=None, hash_str=None):\n        fpath = self.cache_fpath\n        fpath += f\"_{hash_str or key}\"\n        try:\n            with portalocker.Lock(fpath, mode=\"rb\") as cache_file:\n                return pickle.load(cache_file)  # noqa: S301\n        except (FileNotFoundError, EOFError):\n            return None\n\n    def _clear_all_cache_files(self):\n        path, name = os.path.split(self.cache_fpath)\n        for subpath in os.listdir(path):\n            if subpath.startswith(f\"{name}_\"):\n                os.remove(os.path.join(path, subpath))\n\n    def _clear_being_calculated_all_cache_files(self):\n        path, name = os.path.split(self.cache_fpath)\n        for subpath in os.listdir(path):\n            if subpath.startswith(name):\n                entry = self._get_cache_by_key(hash_str=subpath.split(\"_\")[-1])\n                if entry is not None:\n                    entry[\"being_calculated\"] = False\n                    self._save_cache(entry, hash_str=subpath.split(\"_\")[-1])\n\n    def _save_cache(self, cache, key=None, hash_str=None):\n        fpath = self.cache_fpath\n        if key is not None:\n            fpath += f\"_{key}\"\n        elif hash_str is not None:\n            fpath += f\"_{hash_str}\"\n        with self.lock:\n            self.cache = cache\n            with portalocker.Lock(fpath, mode=\"wb\") as cache_file:\n                pickle.dump(cache, cache_file, protocol=4)\n            if key is None:\n                self._reload_cache()\n\n    def get_entry_by_key(self, key, reload=False):\n        with self.lock:\n            if self.separate_files:\n                return key, self._get_cache_by_key(key)\n            if self.reload or reload:\n                self._reload_cache()\n            return key, self._get_cache().get(key, None)\n\n    def set_entry(self, key, func_res):\n        key_data = {\n            \"value\": func_res,\n            \"time\": datetime.now(),\n            \"stale\": False,\n            \"being_calculated\": False,\n        }\n        if self.separate_files:\n            self._save_cache(key_data, key)\n            return  # pragma: no cover\n\n        with self.lock:\n            cache = self._get_cache()\n            cache[key] = key_data\n            self._save_cache(cache)\n\n    def mark_entry_being_calculated_separate_files(self, key):\n        self._save_cache(\n            {\n                \"value\": None,\n                \"time\": datetime.now(),\n                \"stale\": False,\n                \"being_calculated\": True,\n            },\n            key=key,\n        )\n\n    def mark_entry_not_calculated_separate_files(self, key):\n        _, entry = self.get_entry_by_key(key)\n        entry[\"being_calculated\"] = False\n        self._save_cache(entry, key=key)\n\n    def mark_entry_being_calculated(self, key):\n        if self.separate_files:\n            self.mark_entry_being_calculated_separate_files(key)\n            return  # pragma: no cover\n\n        with self.lock:\n            cache = self._get_cache()\n            try:\n                cache[key][\"being_calculated\"] = True\n            except KeyError:\n                cache[key] = {\n                    \"value\": None,\n                    \"time\": datetime.now(),\n                    \"stale\": False,\n                    \"being_calculated\": True,\n                }\n            self._save_cache(cache)\n\n    def mark_entry_not_calculated(self, key):\n        if self.separate_files:\n            self.mark_entry_not_calculated_separate_files(key)\n        with self.lock:\n            cache = self._get_cache()\n            # that's ok, we don't need an entry in that case\n            with suppress(KeyError):\n                cache[key][\"being_calculated\"] = False\n                self._save_cache(cache)\n\n    def wait_on_entry_calc(self, key):\n        if self.separate_files:\n            entry = self._get_cache_by_key(key)\n            filename = f\"{self.cache_fname}_{key}\"\n        else:\n            with self.lock:\n                self._reload_cache()\n                entry = self._get_cache()[key]\n            filename = self.cache_fname\n        if not entry[\"being_calculated\"]:\n            return entry[\"value\"]\n        event_handler = _PickleCore.CacheChangeHandler(\n            filename=filename, core=self, key=key\n        )\n        observer = Observer()\n        event_handler.inject_observer(observer)\n        observer.schedule(event_handler, path=self.cache_dir, recursive=True)\n        observer.start()\n        time_spent = 0\n        while observer.is_alive():\n            observer.join(timeout=1.0)\n            time_spent += 1\n            self.check_calc_timeout(time_spent)\n        return event_handler.value\n\n    def clear_cache(self):\n        if self.separate_files:\n            self._clear_all_cache_files()\n        else:\n            self._save_cache({})\n\n    def clear_being_calculated(self):\n        if self.separate_files:\n            self._clear_being_calculated_all_cache_files()\n            return  # pragma: no cover\n\n        with self.lock:\n            cache = self._get_cache()\n            for key in cache:\n                cache[key][\"being_calculated\"] = False\n            self._save_cache(cache)\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/memory.py",
      "content": "\"\"\"A memory-based caching core for cachier.\"\"\"\n\nimport threading\nfrom datetime import datetime\n\nfrom .._types import HashFunc\nfrom .base import _BaseCore, _get_func_str\n\n\nclass _MemoryCore(_BaseCore):\n    \"\"\"The memory core class for cachier.\"\"\"\n\n    def __init__(self, hash_func: HashFunc, wait_for_calc_timeout: int):\n        super().__init__(hash_func, wait_for_calc_timeout)\n        self.cache = {}\n\n    def _hash_func_key(self, key):\n        return f\"{_get_func_str(self.func)}:{key}\"\n\n    def get_entry_by_key(self, key, reload=False):\n        with self.lock:\n            return key, self.cache.get(self._hash_func_key(key), None)\n\n    def set_entry(self, key, func_res):\n        with self.lock:\n            try:\n                # we need to retain the existing condition so that\n                # mark_entry_not_calculated can notify all possibly-waiting\n                # threads about it\n                cond = self.cache[self._hash_func_key(key)][\"condition\"]\n            except KeyError:  # pragma: no cover\n                cond = None\n            self.cache[self._hash_func_key(key)] = {\n                \"value\": func_res,\n                \"time\": datetime.now(),\n                \"stale\": False,\n                \"being_calculated\": False,\n                \"condition\": cond,\n            }\n\n    def mark_entry_being_calculated(self, key):\n        with self.lock:\n            condition = threading.Condition()\n            # condition.acquire()\n            try:\n                self.cache[self._hash_func_key(key)][\"being_calculated\"] = True\n                self.cache[self._hash_func_key(key)][\"condition\"] = condition\n            except KeyError:\n                self.cache[self._hash_func_key(key)] = {\n                    \"value\": None,\n                    \"time\": datetime.now(),\n                    \"stale\": False,\n                    \"being_calculated\": True,\n                    \"condition\": condition,\n                }\n\n    def mark_entry_not_calculated(self, key):\n        with self.lock:\n            try:\n                entry = self.cache[self._hash_func_key(key)]\n            except KeyError:  # pragma: no cover\n                return  # that's ok, we don't need an entry in that case\n            entry[\"being_calculated\"] = False\n            cond = entry[\"condition\"]\n            if cond:\n                cond.acquire()\n                cond.notify_all()\n                cond.release()\n                entry[\"condition\"] = None\n\n    def wait_on_entry_calc(self, key):\n        with self.lock:  # pragma: no cover\n            entry = self.cache[self._hash_func_key(key)]\n            if not entry[\"being_calculated\"]:\n                return entry[\"value\"]\n        entry[\"condition\"].acquire()\n        entry[\"condition\"].wait()\n        entry[\"condition\"].release()\n        return self.cache[self._hash_func_key(key)][\"value\"]\n\n    def clear_cache(self):\n        with self.lock:\n            self.cache.clear()\n\n    def clear_being_calculated(self):\n        with self.lock:\n            for entry in self.cache.values():\n                entry[\"being_calculated\"] = False\n                entry[\"condition\"] = None\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/mongo.py",
      "content": "\"\"\"A MongoDB-based caching core for cachier.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\nimport pickle  # for serialization of python objects\nimport sys  # to make sure that pymongo was imported\nimport time  # to sleep when waiting on Mongo cache\\\nimport warnings  # to warn if pymongo is missing\nfrom contextlib import suppress\nfrom datetime import datetime\n\nfrom .._types import HashFunc, Mongetter\n\nwith suppress(ImportError):\n    from bson.binary import Binary  # to save binary data to mongodb\n    from pymongo import ASCENDING, IndexModel\n    from pymongo.errors import OperationFailure\n\nfrom .base import RecalculationNeeded, _BaseCore, _get_func_str\n\nMONGO_SLEEP_DURATION_IN_SEC = 1\n\n\nclass MissingMongetter(ValueError):\n    \"\"\"Thrown when the mongetter keyword argument is missing.\"\"\"\n\n\nclass _MongoCore(_BaseCore):\n    _INDEX_NAME = \"func_1_key_1\"\n\n    def __init__(\n        self,\n        hash_func: HashFunc,\n        mongetter: Mongetter,\n        wait_for_calc_timeout: int,\n    ):\n        if \"pymongo\" not in sys.modules:\n            warnings.warn(\n                \"`pymongo` was not found. MongoDB cores will not function.\",\n                ImportWarning,\n                stacklevel=2,\n            )  # pragma: no cover\n\n        super().__init__(hash_func, wait_for_calc_timeout)\n        if mongetter is None:\n            raise MissingMongetter(\n                \"must specify ``mongetter`` when using the mongo core\"\n            )\n        self.mongetter = mongetter\n        self.mongo_collection = self.mongetter()\n        index_inf = self.mongo_collection.index_information()\n        if _MongoCore._INDEX_NAME not in index_inf:\n            func1key1 = IndexModel(\n                keys=[(\"func\", ASCENDING), (\"key\", ASCENDING)],\n                name=_MongoCore._INDEX_NAME,\n            )\n            self.mongo_collection.create_indexes([func1key1])\n\n    @property\n    def _func_str(self) -> str:\n        return _get_func_str(self.func)\n\n    def get_entry_by_key(self, key):\n        res = self.mongo_collection.find_one(\n            {\"func\": self._func_str, \"key\": key}\n        )\n        if not res:\n            return key, None\n        try:\n            entry = {\n                \"value\": pickle.loads(res[\"value\"]),  # noqa: S301\n                \"time\": res.get(\"time\", None),\n                \"stale\": res.get(\"stale\", False),\n                \"being_calculated\": res.get(\"being_calculated\", False),\n            }\n        except KeyError:\n            entry = {\n                \"value\": None,\n                \"time\": res.get(\"time\", None),\n                \"stale\": res.get(\"stale\", False),\n                \"being_calculated\": res.get(\"being_calculated\", False),\n            }\n        return key, entry\n\n    def set_entry(self, key, func_res):\n        thebytes = pickle.dumps(func_res)\n        self.mongo_collection.update_one(\n            filter={\"func\": self._func_str, \"key\": key},\n            update={\n                \"$set\": {\n                    \"func\": self._func_str,\n                    \"key\": key,\n                    \"value\": Binary(thebytes),\n                    \"time\": datetime.now(),\n                    \"stale\": False,\n                    \"being_calculated\": False,\n                }\n            },\n            upsert=True,\n        )\n\n    def mark_entry_being_calculated(self, key):\n        self.mongo_collection.update_one(\n            filter={\"func\": self._func_str, \"key\": key},\n            update={\"$set\": {\"being_calculated\": True}},\n            upsert=True,\n        )\n\n    def mark_entry_not_calculated(self, key):\n        with suppress(OperationFailure):  # don't care in this case\n            self.mongo_collection.update_one(\n                filter={\n                    \"func\": self._func_str,\n                    \"key\": key,\n                },\n                update={\"$set\": {\"being_calculated\": False}},\n                upsert=False,  # should not insert in this case\n            )\n\n    def wait_on_entry_calc(self, key):\n        time_spent = 0\n        while True:\n            time.sleep(MONGO_SLEEP_DURATION_IN_SEC)\n            time_spent += MONGO_SLEEP_DURATION_IN_SEC\n            key, entry = self.get_entry_by_key(key)\n            if entry is None:\n                raise RecalculationNeeded()\n            if not entry[\"being_calculated\"]:\n                return entry[\"value\"]\n            self.check_calc_timeout(time_spent)\n\n    def clear_cache(self):\n        self.mongo_collection.delete_many(filter={\"func\": self._func_str})\n\n    def clear_being_calculated(self):\n        self.mongo_collection.update_many(\n            filter={\n                \"func\": self._func_str,\n                \"being_calculated\": True,\n            },\n            update={\"$set\": {\"being_calculated\": False}},\n        )\n"
    },
    {
      "path": "python-cachier_cachier/tests/__init__.py",
      "content": ""
    },
    {
      "path": "python-cachier_cachier/tests/standalone_script.py",
      "content": "import time\n\nimport cachier\n\n\n@cachier.cachier()\ndef _takes_3_seconds(label, value):\n    time.sleep(3)\n    return f\"{label} {value}\"\n\n\nprint(_takes_3_seconds(\"two\", 2))\n"
    },
    {
      "path": "python-cachier_cachier/tests/speed_eval.py",
      "content": "from random import random\nfrom time import time\n\nfrom cachier import cachier\n\n\n@cachier(next_time=True)\ndef _test_int_pickling(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    return int_1 + int_2\n\n\ndef _test_int_pickling_compare(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    return int_1 + int_2\n\n\ndef test_pickle_speed():\n    \"\"\"Test speeds.\"\"\"\n    print(\"Comparing speeds of decorated vs non-decorated functions...\")\n    num_of_vals = 1000\n    times = []\n    for i in range(1, num_of_vals):\n        tic = time()\n        _test_int_pickling_compare(i, i + 1)\n        toc = time()\n        times.append(toc - tic)\n    print(f\"  - Non-decorated average = {sum(times) / num_of_vals}\")\n\n    _test_int_pickling.clear_cache()\n    times = []\n    for i in range(1, num_of_vals):\n        tic = time()\n        _test_int_pickling(i, i + 1)\n        toc = time()\n        times.append(toc - tic)\n    print(f\"  - Decorated average = {sum(times) / num_of_vals}\")\n\n\n@cachier()\ndef _test_single_file_speed(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    # something that takes some memory\n    return [random() for _ in range(1000000)]\n\n\n@cachier(separate_files=True)\ndef _test_separate_files_speed(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    # something that takes some memory\n    return [random() for _ in range(1000000)]\n\n\ndef test_separate_files_vs_single_file():\n    _test_separate_files_speed.clear_cache()\n    _test_single_file_speed.clear_cache()\n    start_time = time()\n    for _ in range(3):\n        for j in range(10):\n            _test_separate_files_speed(j, 2)\n    print(f\"separate files time: {time() - start_time}\")\n    start_time = time()\n    for _ in range(3):\n        for j in range(10):\n            _test_single_file_speed(j, 2)\n    print(f\"single file time: {time() - start_time}\")\n\n\nif __name__ == \"__main__\":\n    test_pickle_speed()\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_memory_core.py",
      "content": "\"\"\"Test for the in-memory implementation of the  Cachier python package.\"\"\"\n\nimport hashlib\nimport queue\nimport threading\nfrom datetime import timedelta\nfrom random import random\nfrom time import sleep, time\n\nimport pandas as pd\nimport pytest\nfrom cachier import cachier\n\n\n@cachier(backend=\"memory\", next_time=False)\ndef _takes_2_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)\n    return f\"arg_1:{arg_1}, arg_2:{arg_2}\"\n\n\n@pytest.mark.memory\ndef test_memory_core():\n    \"\"\"Basic memory core functionality.\"\"\"\n    _takes_2_seconds.clear_cache()\n    _takes_2_seconds(\"a\", \"b\")\n    start = time()\n    _takes_2_seconds(\"a\", \"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds.clear_cache()\n\n\n@pytest.mark.memory\ndef test_memory_core_keywords():\n    \"\"\"Basic memory core functionality with keyword arguments.\"\"\"\n    _takes_2_seconds.clear_cache()\n    _takes_2_seconds(\"a\", arg_2=\"b\")\n    start = time()\n    _takes_2_seconds(\"a\", arg_2=\"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds.clear_cache()\n\n\nSECONDS_IN_DELTA = 3\nDELTA = timedelta(seconds=SECONDS_IN_DELTA)\n\n\n@cachier(backend=\"memory\", stale_after=DELTA, next_time=False)\ndef _stale_after_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.memory\ndef test_stale_after():\n    \"\"\"Testing the stale_after functionality.\"\"\"\n    _stale_after_seconds.clear_cache()\n    val1 = _stale_after_seconds(1, 2)\n    val2 = _stale_after_seconds(1, 2)\n    val3 = _stale_after_seconds(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(3)\n    val4 = _stale_after_seconds(1, 2)\n    assert val4 != val1\n    _stale_after_seconds.clear_cache()\n\n\n@cachier(backend=\"memory\", stale_after=DELTA, next_time=True)\ndef _stale_after_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.memory\ndef test_stale_after_next_time():\n    \"\"\"Testing the stale_after with next_time functionality.\"\"\"\n    _stale_after_next_time.clear_cache()\n    val1 = _stale_after_next_time(1, 2)\n    val2 = _stale_after_next_time(1, 2)\n    val3 = _stale_after_next_time(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(SECONDS_IN_DELTA + 1)\n    val4 = _stale_after_next_time(1, 2)\n    assert val4 == val1\n    sleep(0.5)\n    val5 = _stale_after_next_time(1, 2)\n    assert val5 != val1\n    _stale_after_next_time.clear_cache()\n\n\n@cachier(backend=\"memory\")\ndef _random_num():\n    return random()\n\n\n@cachier(backend=\"memory\")\ndef _random_num_with_arg(a):\n    # print(a)\n    return random()\n\n\n@pytest.mark.memory\ndef test_overwrite_cache():\n    \"\"\"Tests that the overwrite feature works correctly.\"\"\"\n    _random_num.clear_cache()\n    int1 = _random_num()\n    int2 = _random_num()\n    assert int2 == int1\n    int3 = _random_num(cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num()\n    assert int4 == int3\n    _random_num.clear_cache()\n\n    _random_num_with_arg.clear_cache()\n    int1 = _random_num_with_arg(\"a\")\n    int2 = _random_num_with_arg(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg(\"a\", cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg(\"a\")\n    assert int4 == int3\n    _random_num_with_arg.clear_cache()\n\n\n@pytest.mark.memory\ndef test_ignore_cache():\n    \"\"\"Tests that the ignore_cache feature works correctly.\"\"\"\n    _random_num.clear_cache()\n    int1 = _random_num()\n    int2 = _random_num()\n    assert int2 == int1\n    int3 = _random_num(cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num()\n    assert int4 != int3\n    assert int4 == int1\n    _random_num.clear_cache()\n\n    _random_num_with_arg.clear_cache()\n    int1 = _random_num_with_arg(\"a\")\n    int2 = _random_num_with_arg(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg(\"a\", cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg(\"a\")\n    assert int4 != int3\n    assert int4 == int1\n    _random_num_with_arg.clear_cache()\n\n\n@cachier(backend=\"memory\")\ndef _takes_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)  # this has to be enough time for check_calculation to run twice\n    return random() + arg_1 + arg_2\n\n\ndef _calls_takes_time(res_queue):\n    res = _takes_time(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.memory\ndef test_memory_being_calculated():\n    \"\"\"Testing memory core handling of being calculated scenarios.\"\"\"\n    _takes_time.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=3)\n    thread2.join(timeout=3)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\n@cachier(backend=\"memory\", stale_after=timedelta(seconds=1), next_time=True)\ndef _being_calc_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\ndef _calls_being_calc_next_time(res_queue):\n    res = _being_calc_next_time(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.memory\ndef test_being_calc_next_time():\n    \"\"\"Testing memory core handling of being calculated scenarios.\"\"\"\n    _takes_time.clear_cache()\n    _being_calc_next_time(0.13, 0.02)\n    sleep(1.1)\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=3)\n    thread2.join(timeout=3)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\n@cachier(backend=\"memory\")\ndef _bad_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n@cachier(backend=\"memory\")\ndef _delete_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n@pytest.mark.memory\ndef test_clear_being_calculated():\n    \"\"\"Test memory core clear `being calculated` functionality.\"\"\"\n    _takes_time.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread1.start()\n    _takes_time.clear_being_calculated()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=3)\n    thread2.join(timeout=3)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 != res2\n\n\n@pytest.mark.memory\ndef test_clear_being_calculated_with_empty_cache():\n    \"\"\"Test memory core clear `being calculated` functionality.\"\"\"\n    _takes_time.clear_cache()\n    _takes_time.clear_being_calculated()\n\n\n@cachier(backend=\"memory\", stale_after=timedelta(seconds=1), next_time=True)\ndef _error_throwing_func(arg1):\n    if not hasattr(_error_throwing_func, \"count\"):\n        _error_throwing_func.count = 0\n    _error_throwing_func.count += 1\n    if _error_throwing_func.count > 1:\n        raise ValueError(\"Tiny Rick!\")\n    return 7\n\n\n@pytest.mark.memory\ndef test_error_throwing_func():\n    # with\n    res1 = _error_throwing_func(4)\n    sleep(1.5)\n    res2 = _error_throwing_func(4)\n    assert res1 == res2\n\n\n@pytest.mark.memory\ndef test_callable_hash_param():\n    def _hash_func(args, kwargs):\n        def _hash(obj):\n            if isinstance(obj, pd.core.frame.DataFrame):\n                return hashlib.sha256(\n                    pd.util.hash_pandas_object(obj).values.tobytes()\n                ).hexdigest()\n            return obj\n\n        k_args = tuple(map(_hash, args))\n        k_kwargs = tuple(\n            sorted({k: _hash(v) for k, v in kwargs.items()}.items())\n        )\n        return k_args + k_kwargs\n\n    @cachier(backend=\"memory\", hash_func=_hash_func)\n    def _params_with_dataframe(*args, **kwargs):\n        \"\"\"Some function.\"\"\"\n        return random()\n\n    _params_with_dataframe.clear_cache()\n\n    df_a = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    df_b = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    value_a = _params_with_dataframe(df_a, 1)\n    value_b = _params_with_dataframe(df_b, 1)\n\n    assert value_a == value_b  # same content --> same key\n\n    value_a = _params_with_dataframe(1, df=df_a)\n    value_b = _params_with_dataframe(1, df=df_b)\n\n    assert value_a == value_b  # same content --> same key\n\n\nif __name__ == \"__main__\":\n    test_memory_being_calculated()\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_core_lookup.py",
      "content": "\"\"\"Testing a few basic cachier interfaces.\"\"\"\n\nimport pytest\nfrom cachier import cachier, get_default_params\nfrom cachier.cores.mongo import MissingMongetter\n\n\ndef test_get_default_params():\n    params = get_default_params()\n    assert tuple(sorted(params)) == (\n        \"allow_none\",\n        \"backend\",\n        \"cache_dir\",\n        \"caching_enabled\",\n        \"hash_func\",\n        \"mongetter\",\n        \"next_time\",\n        \"pickle_reload\",\n        \"separate_files\",\n        \"stale_after\",\n        \"wait_for_calc_timeout\",\n    )\n\n\ndef test_bad_name(name=\"nope\"):\n    # Test that the appropriate exception is thrown\n    # when an invalid backend is given.\n    with pytest.raises(ValueError, match=f\"specified an invalid core: {name}\"):\n\n        @cachier(backend=name)\n        def dummy_func():\n            pass\n\n\ndef test_missing_mongetter():\n    # Test that the appropriate exception is thrown\n    # when forgetting to specify the mongetter.\n    with pytest.raises(MissingMongetter):\n\n        @cachier(backend=\"mongo\", mongetter=None)\n        def dummy_func():\n            pass\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_general.py",
      "content": "\"\"\"Non-core-specific tests for cachier.\"\"\"\n\nimport functools\nimport os\nimport queue\nimport subprocess  # nosec: B404\nimport threading\nfrom contextlib import suppress\nfrom random import random\nfrom time import sleep, time\n\nimport cachier\nimport pytest\nfrom cachier.core import (\n    DEFAULT_MAX_WORKERS,\n    MAX_WORKERS_ENVAR_NAME,\n    _get_executor,\n    _max_workers,\n    _set_max_workers,\n)\n\nfrom tests.test_mongo_core import (\n    MONGO_DELTA_LONG,\n    _test_mongetter,\n)\n\n\ndef test_information():\n    print(\"\\ncachier version: \", end=\"\")\n    print(cachier.__version__)\n\n\ndef test_max_workers():\n    \"\"\"Just call this function for coverage.\"\"\"\n    with suppress(KeyError):\n        del os.environ[MAX_WORKERS_ENVAR_NAME]\n    assert _max_workers() == DEFAULT_MAX_WORKERS\n\n\ndef test_get_executor():\n    \"\"\"Just call this function for coverage.\"\"\"\n    _get_executor()\n    _get_executor(False)\n    _get_executor(True)\n\n\ndef test_set_max_workers():\n    \"\"\"Just call this function for coverage.\"\"\"\n    _set_max_workers(9)\n\n\nparametrize_keys = \"mongetter,stale_after,separate_files\"\nparametrize_values = [\n    pytest.param(\n        _test_mongetter, MONGO_DELTA_LONG, False, marks=pytest.mark.mongo\n    ),\n    (None, None, False),\n    (None, None, True),\n]\n\n\n@pytest.mark.parametrize(parametrize_keys, parametrize_values)\ndef test_wait_for_calc_timeout_ok(mongetter, stale_after, separate_files):\n    @cachier.cachier(\n        mongetter=mongetter,\n        stale_after=stale_after,\n        separate_files=separate_files,\n        next_time=False,\n        wait_for_calc_timeout=2,\n    )\n    def _wait_for_calc_timeout_fast(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        sleep(1)\n        return random() + arg_1 + arg_2\n\n    def _calls_wait_for_calc_timeout_fast(res_queue):\n        res = _wait_for_calc_timeout_fast(1, 2)\n        res_queue.put(res)\n\n    \"\"\" Testing calls that avoid timeouts store the values in cache. \"\"\"\n    _wait_for_calc_timeout_fast.clear_cache()\n    val1 = _wait_for_calc_timeout_fast(1, 2)\n    val2 = _wait_for_calc_timeout_fast(1, 2)\n    assert val1 == val2\n\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_fast,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_fast,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n\n    thread1.start()\n    thread2.start()\n    sleep(2)\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2  # Timeout did not kick in, a single call was done\n\n\n@pytest.mark.parametrize(parametrize_keys, parametrize_values)\ndef test_wait_for_calc_timeout_slow(mongetter, stale_after, separate_files):\n    @cachier.cachier(\n        mongetter=mongetter,\n        stale_after=stale_after,\n        separate_files=separate_files,\n        next_time=False,\n        wait_for_calc_timeout=2,\n    )\n    def _wait_for_calc_timeout_slow(arg_1, arg_2):\n        sleep(3)\n        return random() + arg_1 + arg_2\n\n    def _calls_wait_for_calc_timeout_slow(res_queue):\n        res = _wait_for_calc_timeout_slow(1, 2)\n        res_queue.put(res)\n\n    \"\"\"Testing for calls timing out to be performed twice when needed.\"\"\"\n    _wait_for_calc_timeout_slow.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n\n    thread1.start()\n    thread2.start()\n    sleep(1)\n    res3 = _wait_for_calc_timeout_slow(1, 2)\n    sleep(4)\n    thread1.join(timeout=4)\n    thread2.join(timeout=4)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 != res2  # Timeout kicked in.  Two calls were done\n    res4 = _wait_for_calc_timeout_slow(1, 2)\n    # One of the cached values is returned\n    assert res1 == res4 or res2 == res4 or res3 == res4\n\n\n@pytest.mark.parametrize(\n    \"mongetter,backend\",\n    [\n        pytest.param(_test_mongetter, \"mongo\", marks=pytest.mark.mongo),\n        (None, \"memory\"),\n        (None, \"pickle\"),\n    ],\n)\ndef test_precache_value(mongetter, backend):\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def dummy_func(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return arg_1 + arg_2\n\n    assert dummy_func.precache_value(2, 2, value_to_cache=5) == 5\n    assert dummy_func(2, 2) == 5\n    dummy_func.clear_cache()\n    assert dummy_func(2, 2) == 4\n    assert dummy_func.precache_value(2, arg_2=2, value_to_cache=5) == 5\n    assert dummy_func(2, arg_2=2) == 5\n\n\n@pytest.mark.parametrize(\n    \"mongetter,backend\",\n    [\n        pytest.param(_test_mongetter, \"mongo\", marks=pytest.mark.mongo),\n        (None, \"memory\"),\n        (None, \"pickle\"),\n    ],\n)\ndef test_ignore_self_in_methods(mongetter, backend):\n    class DummyClass:\n        @cachier.cachier(backend=backend, mongetter=mongetter)\n        def takes_2_seconds(self, arg_1, arg_2):\n            \"\"\"Some function.\"\"\"\n            sleep(2)\n            return arg_1 + arg_2\n\n    test_object_1 = DummyClass()\n    test_object_2 = DummyClass()\n    test_object_1.takes_2_seconds.clear_cache()\n    test_object_2.takes_2_seconds.clear_cache()\n    assert test_object_1.takes_2_seconds(1, 2) == 3\n    start = time()\n    assert test_object_2.takes_2_seconds(1, 2) == 3\n    end = time()\n    assert end - start < 1\n\n\ndef test_hash_params_deprecation():\n    with pytest.deprecated_call(match=\"hash_params will be removed\"):\n\n        @cachier.cachier(hash_params=lambda a, k: \"key\")\n        def test():\n            return \"value\"\n\n    assert test() == \"value\"\n\n\ndef test_separate_processes():\n    test_args = (\"python\", \"tests/standalone_script.py\")\n    run_params = {\"args\": test_args, \"capture_output\": True, \"text\": True}\n    run_process = functools.partial(subprocess.run, **run_params)\n    result = run_process()\n    assert result.stdout.strip() == \"two 2\"\n    start = time()\n    result = run_process()\n    end = time()\n    assert result.stdout.strip() == \"two 2\"\n    assert end - start < 3\n\n\ndef test_global_disable():\n    @cachier.cachier()\n    def get_random():\n        return random()\n\n    get_random.clear_cache()\n    result_1 = get_random()\n    result_2 = get_random()\n    cachier.disable_caching()\n    result_3 = get_random()\n    cachier.enable_caching()\n    result_4 = get_random()\n    assert result_1 == result_2 == result_4\n    assert result_1 != result_3\n\n\ndef test_none_not_cached_by_default():\n    count = 0\n\n    @cachier.cachier()\n    def do_operation():\n        nonlocal count\n        count += 1\n        return None\n\n    do_operation.clear_cache()\n    assert count == 0\n    do_operation()\n    do_operation()\n    assert count == 2\n\n\ndef test_allow_caching_none():\n    count = 0\n\n    @cachier.cachier(allow_none=True)\n    def do_operation():\n        nonlocal count\n        count += 1\n        return None\n\n    do_operation.clear_cache()\n    assert count == 0\n    do_operation()\n    do_operation()\n    assert count == 1\n\n\ndef test_identical_inputs():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a: int, b: int = 2, c: int = 3):\n        nonlocal count\n        count += 1\n        return a + b + c\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func(1, 2, 3) == 6\n    assert dummy_func(1, 2, c=3) == 6\n    assert dummy_func(1, b=2, c=3) == 6\n    assert dummy_func(a=1, b=2, c=3) == 6\n    assert count == 1\n\n\ndef test_list_inputs():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a: list, b: list = [2]):  # noqa: B006\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func([1]) == [1, 2]\n    assert dummy_func([1], [2]) == [1, 2]\n    assert dummy_func([1], b=[2]) == [1, 2]\n    assert dummy_func(a=[1], b=[2]) == [1, 2]\n    assert count == 1\n\n\ndef test_order_independent_kwargs_handling():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a, b):\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func(a=1, b=2) == 3\n    assert dummy_func(a=1, b=2) == 3\n    assert dummy_func(b=2, a=1) == 3\n    assert count == 1\n\n\n@pytest.mark.parametrize(\"backend\", [\"memory\", \"pickle\"])\ndef test_diff_functions_same_args(tmpdir, backend: str):\n    count_p = count_m = 0\n\n    @cachier.cachier(cache_dir=tmpdir, backend=backend)\n    def fn_plus(a, b=2):\n        nonlocal count_p\n        count_p += 1\n        return a + b\n\n    @cachier.cachier(cache_dir=tmpdir, backend=backend)\n    def fn_minus(a, b=2):\n        nonlocal count_m\n        count_m += 1\n        return a - b\n\n    assert count_p == count_m == 0\n\n    for fn, expected in [(fn_plus, 3), (fn_minus, -1)]:\n        assert fn(1) == expected\n        assert fn(a=1, b=2) == expected\n    assert count_p == 1\n    assert count_m == 1\n\n\n@pytest.mark.parametrize(\"backend\", [\"memory\", \"pickle\"])\ndef test_runtime_handling(tmpdir, backend):\n    count_p = count_m = 0\n\n    def fn_plus(a, b=2):\n        nonlocal count_p\n        count_p += 1\n        return a + b\n\n    def fn_minus(a, b=2):\n        nonlocal count_m\n        count_m += 1\n        return a - b\n\n    cachier_ = cachier.cachier(cache_dir=tmpdir, backend=backend)\n    assert count_p == count_m == 0\n\n    for fn, expected in [(fn_plus, 3), (fn_minus, -1)]:\n        assert cachier_(fn)(1, 2) == expected\n        assert cachier_(fn)(a=1, b=2) == expected\n    assert count_p == 1\n    assert count_m == 1\n\n    for fn, expected in [(fn_plus, 5), (fn_minus, 1)]:\n        assert cachier_(fn)(3, 2) == expected\n        assert cachier_(fn)(a=3, b=2) == expected\n    assert count_p == 2\n    assert count_m == 2\n\n\ndef test_partial_handling(tmpdir):\n    count_p = count_m = 0\n\n    def fn_plus(a, b=2):\n        nonlocal count_p\n        count_p += 1\n        return a + b\n\n    def fn_minus(a, b=2):\n        nonlocal count_m\n        count_m += 1\n        return a - b\n\n    cachier_ = cachier.cachier(cache_dir=tmpdir)\n    assert count_p == count_m == 0\n\n    for fn, expected in [(fn_plus, 3), (fn_minus, -1)]:\n        dummy_ = functools.partial(fn, 1)\n        assert cachier_(dummy_)() == expected\n\n        dummy_ = functools.partial(fn, 1)\n        assert cachier_(dummy_)(2) == expected\n\n        dummy_ = functools.partial(fn, a=1)\n        assert cachier_(dummy_)() == expected\n\n        dummy_ = functools.partial(fn, b=2)\n        assert cachier_(dummy_)(1) == expected\n\n        dummy_ = functools.partial(fn, b=2)\n        assert cachier_(dummy_)(1, b=2) == expected\n\n        assert cachier_(fn)(1, 2) == expected\n        assert cachier_(fn)(a=1, b=2) == expected\n\n    assert count_p == 1\n    assert count_m == 1\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_defaults.py",
      "content": "import datetime\nimport os\nimport queue\nimport random\nimport threading\nimport time\n\nimport cachier\nimport pytest\n\nfrom tests.test_mongo_core import MONGO_DELTA, _test_mongetter\n\n_default_params = cachier.get_default_params().copy()\n\n\ndef setup_function():\n    cachier.set_default_params(**_default_params)\n\n\ndef teardown_function():\n    cachier.set_default_params(**_default_params)\n\n\ndef test_hash_func_default_param():\n    def slow_hash_func(args, kwds):\n        time.sleep(2)\n        return \"hash\"\n\n    def fast_hash_func(args, kwds):\n        return \"hash\"\n\n    cachier.set_default_params(hash_func=slow_hash_func)\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(hash_func=fast_hash_func)\n    def global_test_2():\n        return None\n\n    start = time.time()\n    global_test_1()\n    end = time.time()\n    assert end - start > 1\n    start = time.time()\n    global_test_2()\n    end = time.time()\n    assert end - start < 1\n\n\ndef test_backend_default_param():\n    cachier.set_default_params(backend=\"memory\")\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(backend=\"pickle\")\n    def global_test_2():\n        return None\n\n    assert global_test_1.cache_dpath() is None\n    assert global_test_2.cache_dpath() is not None\n\n\n@pytest.mark.mongo\ndef test_mongetter_default_param():\n    cachier.set_default_params(mongetter=_test_mongetter)\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(mongetter=False)\n    def global_test_2():\n        return None\n\n    assert global_test_1.cache_dpath() is None\n    assert global_test_2.cache_dpath() is not None\n\n\ndef test_cache_dir_default_param(tmpdir):\n    cachier.set_default_params(cache_dir=tmpdir / \"1\")\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(cache_dir=tmpdir / \"2\")\n    def global_test_2():\n        return None\n\n    assert global_test_1.cache_dpath() == str(tmpdir / \"1\")\n    assert global_test_2.cache_dpath() == str(tmpdir / \"2\")\n\n\ndef test_separate_files_default_param(tmpdir):\n    cachier.set_default_params(separate_files=True)\n\n    @cachier.cachier(cache_dir=tmpdir / \"1\")\n    def global_test_1(arg_1, arg_2):\n        return arg_1 + arg_2\n\n    @cachier.cachier(cache_dir=tmpdir / \"2\", separate_files=False)\n    def global_test_2(arg_1, arg_2):\n        return arg_1 + arg_2\n\n    global_test_1(1, 2)\n    global_test_1(3, 4)\n    global_test_2(1, 2)\n    global_test_2(3, 4)\n\n    assert len(os.listdir(global_test_1.cache_dpath())) == 2\n    assert len(os.listdir(global_test_2.cache_dpath())) == 1\n\n\ndef test_allow_none_default_param(tmpdir):\n    cachier.set_default_params(\n        allow_none=True,\n        separate_files=True,\n        verbose_cache=True,\n    )\n    allow_count = disallow_count = 0\n\n    @cachier.cachier(cache_dir=tmpdir)\n    def allow_none():\n        nonlocal allow_count\n        allow_count += 1\n        return None\n\n    @cachier.cachier(cache_dir=tmpdir, allow_none=False)\n    def disallow_none():\n        nonlocal disallow_count\n        disallow_count += 1\n        return None\n\n    assert allow_count == 0\n    allow_none()\n    allow_none()\n    assert allow_count == 1\n\n    assert disallow_count == 0\n    disallow_none()\n    disallow_none()\n    assert disallow_count == 2\n\n    disallow_none(cachier__allow_none=True)\n    disallow_none(cachier__allow_none=True)\n    assert disallow_count == 2\n\n\nPARAMETRIZE_TEST = (\n    \"backend,mongetter\",\n    [\n        pytest.param(\"pickle\", None, marks=pytest.mark.pickle),\n        pytest.param(\"mongo\", _test_mongetter, marks=pytest.mark.mongo),\n    ],\n)\n\n\n@pytest.mark.parametrize(*PARAMETRIZE_TEST)\ndef test_stale_after_applies_dynamically(backend, mongetter):\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def _stale_after_test(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random.random() + arg_1 + arg_2\n\n    cachier.set_default_params(stale_after=MONGO_DELTA)\n\n    _stale_after_test.clear_cache()\n    val1 = _stale_after_test(1, 2)\n    val2 = _stale_after_test(1, 2)\n    assert val1 == val2\n    time.sleep(3)\n    val3 = _stale_after_test(1, 2)\n    assert val3 != val1\n\n\n@pytest.mark.parametrize(*PARAMETRIZE_TEST)\ndef test_next_time_applies_dynamically(backend, mongetter):\n    NEXT_AFTER_DELTA = datetime.timedelta(seconds=3)\n\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def _stale_after_next_time(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random.random()\n\n    cachier.set_default_params(stale_after=NEXT_AFTER_DELTA, next_time=True)\n\n    _stale_after_next_time.clear_cache()\n    val1 = _stale_after_next_time(1, 2)\n    val2 = _stale_after_next_time(1, 2)\n    val3 = _stale_after_next_time(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    time.sleep(NEXT_AFTER_DELTA.seconds + 1)\n    val4 = _stale_after_next_time(1, 2)\n    assert val4 == val1\n    time.sleep(0.5)\n    val5 = _stale_after_next_time(1, 2)\n    assert val5 != val1\n    _stale_after_next_time.clear_cache()\n\n\n@pytest.mark.parametrize(*PARAMETRIZE_TEST)\ndef test_wait_for_calc_applies_dynamically(backend, mongetter):\n    \"\"\"Testing for calls timing out to be performed twice when needed.\"\"\"\n\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def _wait_for_calc_timeout_slow(arg_1, arg_2):\n        time.sleep(3)\n        return random.random() + arg_1 + arg_2\n\n    def _calls_wait_for_calc_timeout_slow(res_queue):\n        res = _wait_for_calc_timeout_slow(1, 2)\n        res_queue.put(res)\n\n    cachier.set_default_params(wait_for_calc_timeout=2)\n    _wait_for_calc_timeout_slow.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n\n    thread1.start()\n    thread2.start()\n    time.sleep(1)\n    res3 = _wait_for_calc_timeout_slow(1, 2)\n    time.sleep(5)\n    thread1.join(timeout=5)\n    thread2.join(timeout=5)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 != res2  # Timeout kicked in.  Two calls were done\n    res4 = _wait_for_calc_timeout_slow(1, 2)\n    # One of the cached values is returned\n    assert res1 == res4 or res2 == res4 or res3 == res4\n\n\ndef test_default_kwargs_handling():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a, b=2):\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func(1) == 3\n    assert dummy_func(a=1) == 3\n    assert dummy_func(a=1, b=2) == 3\n    assert count == 1\n\n\ndef test_deprecated_func_kwargs():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a, b=2):\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    with pytest.deprecated_call(\n        match=\"`verbose_cache` is deprecated and will be removed\"\n    ):\n        assert dummy_func(1, verbose_cache=True) == 3\n    assert count == 1\n    with pytest.deprecated_call(\n        match=\"`ignore_cache` is deprecated and will be removed\"\n    ):\n        assert dummy_func(1, ignore_cache=True) == 3\n    assert count == 2\n    with pytest.deprecated_call(\n        match=\"`overwrite_cache` is deprecated and will be removed\"\n    ):\n        assert dummy_func(1, overwrite_cache=True) == 3\n    assert count == 3\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_pickle_core.py",
      "content": "\"\"\"Test for the Cachier python package.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\n# from os.path import (\n#     realpath,\n#     dirname\n# )\nimport os\nimport pickle\nimport threading\nfrom datetime import timedelta\nfrom random import random\nfrom time import sleep, time\n\nimport pytest\n\ntry:\n    import queue\nexcept ImportError:  # python 2\n    import Queue as queue  # type: ignore\n\nimport hashlib\n\nimport pandas as pd\nfrom cachier import cachier\nfrom cachier.core import _default_params\n\n\ndef _get_decorated_func(func, **kwargs):\n    cachier_decorator = cachier(**kwargs)\n    decorated_func = cachier_decorator(func)\n    return decorated_func\n\n\n# Pickle core tests\n\n\ndef _takes_2_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)\n    return f\"arg_1:{arg_1}, arg_2:{arg_2}\"\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"reload\", [True, False])\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_core(reload, separate_files):\n    \"\"\"Basic Pickle core functionality.\"\"\"\n    _takes_2_seconds_decorated = _get_decorated_func(\n        _takes_2_seconds,\n        next_time=False,\n        pickle_reload=reload,\n        separate_files=separate_files,\n    )\n    _takes_2_seconds_decorated.clear_cache()\n    _takes_2_seconds_decorated(\"a\", \"b\")\n    start = time()\n    _takes_2_seconds_decorated(\"a\", \"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds_decorated.clear_cache()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_core_keywords(separate_files):\n    \"\"\"Basic Pickle core functionality with keyword arguments.\"\"\"\n    _takes_2_seconds_decorated = _get_decorated_func(\n        _takes_2_seconds, next_time=False, separate_files=separate_files\n    )\n    _takes_2_seconds_decorated.clear_cache()\n    _takes_2_seconds_decorated(\"a\", arg_2=\"b\")\n    start = time()\n    _takes_2_seconds_decorated(\"a\", arg_2=\"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds_decorated.clear_cache()\n\n\nSECONDS_IN_DELTA = 3\nDELTA = timedelta(seconds=SECONDS_IN_DELTA)\n\n\ndef _stale_after_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_stale_after(separate_files):\n    \"\"\"Testing the stale_after functionality.\"\"\"\n    _stale_after_seconds_decorated = _get_decorated_func(\n        _stale_after_seconds,\n        stale_after=DELTA,\n        next_time=False,\n        separate_files=separate_files,\n    )\n    _stale_after_seconds_decorated.clear_cache()\n    val1 = _stale_after_seconds_decorated(1, 2)\n    val2 = _stale_after_seconds_decorated(1, 2)\n    val3 = _stale_after_seconds_decorated(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(3)\n    val4 = _stale_after_seconds_decorated(1, 2)\n    assert val4 != val1\n    _stale_after_seconds_decorated.clear_cache()\n\n\ndef _stale_after_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_stale_after_next_time(separate_files):\n    \"\"\"Testing the stale_after with next_time functionality.\"\"\"\n    _stale_after_next_time_decorated = _get_decorated_func(\n        _stale_after_next_time,\n        stale_after=DELTA,\n        next_time=True,\n        separate_files=separate_files,\n    )\n    _stale_after_next_time_decorated.clear_cache()\n    val1 = _stale_after_next_time_decorated(1, 2)\n    val2 = _stale_after_next_time_decorated(1, 2)\n    val3 = _stale_after_next_time_decorated(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(SECONDS_IN_DELTA + 1)\n    val4 = _stale_after_next_time_decorated(1, 2)\n    assert val4 == val1\n    sleep(0.5)\n    val5 = _stale_after_next_time_decorated(1, 2)\n    assert val5 != val1\n    _stale_after_next_time_decorated.clear_cache()\n\n\ndef _random_num():\n    return random()\n\n\ndef _random_num_with_arg(a):\n    # print(a)\n    return random()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_overwrite_cache(separate_files):\n    \"\"\"Tests that the overwrite feature works correctly.\"\"\"\n    _random_num_decorated = _get_decorated_func(\n        _random_num, separate_files=separate_files\n    )\n    _random_num_with_arg_decorated = _get_decorated_func(\n        _random_num_with_arg, separate_files=separate_files\n    )\n    _random_num_decorated.clear_cache()\n    int1 = _random_num_decorated()\n    int2 = _random_num_decorated()\n    assert int2 == int1\n    int3 = _random_num_decorated(cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num_decorated()\n    assert int4 == int3\n    _random_num_decorated.clear_cache()\n\n    _random_num_with_arg_decorated.clear_cache()\n    int1 = _random_num_with_arg_decorated(\"a\")\n    int2 = _random_num_with_arg_decorated(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg_decorated(\"a\", cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg_decorated(\"a\")\n    assert int4 == int3\n    _random_num_with_arg_decorated.clear_cache()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_ignore_cache(separate_files):\n    \"\"\"Tests that the ignore_cache feature works correctly.\"\"\"\n    _random_num_decorated = _get_decorated_func(\n        _random_num, separate_files=separate_files\n    )\n    _random_num_with_arg_decorated = _get_decorated_func(\n        _random_num_with_arg, separate_files=separate_files\n    )\n    _random_num_decorated.clear_cache()\n    int1 = _random_num_decorated()\n    int2 = _random_num_decorated()\n    assert int2 == int1\n    int3 = _random_num_decorated(cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num_decorated()\n    assert int4 != int3\n    assert int4 == int1\n    _random_num_decorated.clear_cache()\n\n    _random_num_with_arg_decorated.clear_cache()\n    int1 = _random_num_with_arg_decorated(\"a\")\n    int2 = _random_num_with_arg_decorated(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg_decorated(\"a\", cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg_decorated(\"a\")\n    assert int4 != int3\n    assert int4 == int1\n    _random_num_with_arg_decorated.clear_cache()\n\n\ndef _takes_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)  # this has to be enough time for check_calculation to run twice\n    return random() + arg_1 + arg_2\n\n\ndef _calls_takes_time(takes_time_func, res_queue):\n    res = takes_time_func(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_being_calculated(separate_files):\n    \"\"\"Testing pickle core handling of being calculated scenarios.\"\"\"\n    _takes_time_decorated = _get_decorated_func(\n        _takes_time, separate_files=separate_files\n    )\n    _takes_time_decorated.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time,\n        kwargs={\n            \"takes_time_func\": _takes_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time,\n        kwargs={\n            \"takes_time_func\": _takes_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=4)\n    thread2.join(timeout=4)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\ndef _being_calc_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\ndef _calls_being_calc_next_time(being_calc_func, res_queue):\n    res = being_calc_func(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_being_calc_next_time(separate_files):\n    \"\"\"Testing pickle core handling of being calculated scenarios.\"\"\"\n    _being_calc_next_time_decorated = _get_decorated_func(\n        _being_calc_next_time,\n        stale_after=timedelta(seconds=1),\n        next_time=True,\n        separate_files=separate_files,\n    )\n    _being_calc_next_time_decorated.clear_cache()\n    _being_calc_next_time(0.13, 0.02)\n    sleep(1.1)\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\n            \"being_calc_func\": _being_calc_next_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\n            \"being_calc_func\": _being_calc_next_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\ndef _bad_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n# _BAD_CACHE_FNAME = '.__main__._bad_cache'\n_BAD_CACHE_FNAME = \".tests.test_pickle_core._bad_cache\"\n_BAD_CACHE_FNAME_SEPARATE_FILES = (\n    \".tests.test_pickle_core._bad_cache_\"\n    f\"{hashlib.sha256(pickle.dumps((0.13, 0.02))).hexdigest()}\"\n)\nEXPANDED_CACHIER_DIR = os.path.expanduser(_default_params[\"cache_dir\"])\n_BAD_CACHE_FPATH = os.path.join(EXPANDED_CACHIER_DIR, _BAD_CACHE_FNAME)\n_BAD_CACHE_FPATH_SEPARATE_FILES = os.path.join(\n    EXPANDED_CACHIER_DIR, _BAD_CACHE_FNAME_SEPARATE_FILES\n)\n_BAD_CACHE_FPATHS = {\n    True: _BAD_CACHE_FPATH_SEPARATE_FILES,\n    False: _BAD_CACHE_FPATH,\n}\n\n\ndef _calls_bad_cache(bad_cache_func, res_queue, trash_cache, separate_files):\n    try:\n        res = bad_cache_func(0.13, 0.02, cachier__verbose=True)\n        if trash_cache:\n            with open(_BAD_CACHE_FPATHS[separate_files], \"w\") as cache_file:\n                cache_file.seek(0)\n                cache_file.truncate()\n        res_queue.put(res)\n    except Exception as exc:\n        res_queue.put(exc)\n\n\ndef _helper_bad_cache_file(sleeptime, separate_files):\n    \"\"\"Test pickle core handling of bad cache files.\"\"\"\n    _bad_cache_decorated = _get_decorated_func(\n        _bad_cache, separate_files=separate_files\n    )\n    _bad_cache_decorated.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_bad_cache,\n        kwargs={\n            \"bad_cache_func\": _bad_cache_decorated,\n            \"res_queue\": res_queue,\n            \"trash_cache\": True,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_bad_cache,\n        kwargs={\n            \"bad_cache_func\": _bad_cache_decorated,\n            \"res_queue\": res_queue,\n            \"trash_cache\": False,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(sleeptime)\n    thread2.start()\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    if res_queue.qsize() != 2:\n        return False\n    res1 = res_queue.get()\n    if not isinstance(res1, float):\n        return False\n    res2 = res_queue.get()\n    if res2 is not None or isinstance(res2, KeyError):\n        return False\n    return True\n\n\n# we want this to succeed at least once\n@pytest.mark.pickle\n@pytest.mark.xfail\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_bad_cache_file(separate_files):\n    \"\"\"Test pickle core handling of bad cache files.\"\"\"\n    sleeptimes = [0.1, 0.2, 0.3, 0.5, 0.6, 0.7, 0.8, 1, 1.5, 2]\n    sleeptimes = sleeptimes + sleeptimes\n    for sleeptime in sleeptimes:\n        if _helper_bad_cache_file(sleeptime, separate_files):\n            return\n    raise AssertionError()\n\n\ndef _delete_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n# _DEL_CACHE_FNAME = '.__main__._delete_cache'\n_DEL_CACHE_FNAME = \".tests.test_pickle_core._delete_cache\"\n_DEL_CACHE_FNAME_SEPARATE_FILES = (\n    \".tests.test_pickle_core._delete_cache_\"\n    f\"{hashlib.sha256(pickle.dumps((0.13, 0.02))).hexdigest()}\"\n)\n_DEL_CACHE_FPATH = os.path.join(EXPANDED_CACHIER_DIR, _DEL_CACHE_FNAME)\n_DEL_CACHE_FPATH_SEPARATE_FILES = os.path.join(\n    EXPANDED_CACHIER_DIR, _DEL_CACHE_FNAME_SEPARATE_FILES\n)\n_DEL_CACHE_FPATHS = {\n    True: _DEL_CACHE_FPATH_SEPARATE_FILES,\n    False: _DEL_CACHE_FPATH,\n}\n\n\ndef _calls_delete_cache(del_cache_func, res_queue, del_cache, separate_files):\n    try:\n        # print('in')\n        res = del_cache_func(0.13, 0.02)\n        # print('out with {}'.format(res))\n        if del_cache:\n            os.remove(_DEL_CACHE_FPATHS[separate_files])\n            # print(os.path.isfile(_DEL_CACHE_FPATH))\n        res_queue.put(res)\n    except Exception as exc:\n        # print('found')\n        res_queue.put(exc)\n\n\ndef _helper_delete_cache_file(sleeptime, separate_files):\n    \"\"\"Test pickle core handling of missing cache files.\"\"\"\n    _delete_cache_decorated = _get_decorated_func(\n        _delete_cache, separate_files=separate_files\n    )\n    _delete_cache_decorated.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_delete_cache,\n        kwargs={\n            \"del_cache_func\": _delete_cache_decorated,\n            \"res_queue\": res_queue,\n            \"del_cache\": True,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_delete_cache,\n        kwargs={\n            \"del_cache_func\": _delete_cache_decorated,\n            \"res_queue\": res_queue,\n            \"del_cache\": False,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(sleeptime)\n    thread2.start()\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    if res_queue.qsize() != 2:\n        return False\n    res1 = res_queue.get()\n    # print(res1)\n    if not isinstance(res1, float):\n        return False\n    res2 = res_queue.get()\n    if not ((isinstance(res2, KeyError)) or (res2 is None)):\n        return False\n    return True\n    # print(res2)\n    # print(type(res2))\n\n\n@pytest.mark.pickle\n@pytest.mark.xfail\n@pytest.mark.parametrize(\"separate_files\", [False, True])\ndef test_delete_cache_file(separate_files):\n    \"\"\"Test pickle core handling of missing cache files.\"\"\"\n    sleeptimes = [0.1, 0.2, 0.3, 0.5, 0.7, 1]\n    sleeptimes = sleeptimes * 4\n    for sleeptime in sleeptimes:\n        if _helper_delete_cache_file(sleeptime, separate_files):\n            return\n    raise AssertionError()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [False, True])\ndef test_clear_being_calculated(separate_files):\n    \"\"\"Test pickle core clear `being calculated` functionality.\"\"\"\n    _takes_time_decorated = _get_decorated_func(\n        _takes_time, separate_files=separate_files\n    )\n    _takes_time_decorated.clear_being_calculated()\n\n\ndef _error_throwing_func(arg1):\n    if not hasattr(_error_throwing_func, \"count\"):\n        _error_throwing_func.count = 0\n    _error_throwing_func.count += 1\n    if _error_throwing_func.count > 1:\n        raise ValueError(\"Tiny Rick!\")\n    return 7\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_error_throwing_func(separate_files):\n    # with\n    _error_throwing_func.count = 0\n    _error_throwing_func_decorated = _get_decorated_func(\n        _error_throwing_func,\n        stale_after=timedelta(seconds=1),\n        next_time=True,\n        separate_files=separate_files,\n    )\n    _error_throwing_func_decorated.clear_cache()\n    res1 = _error_throwing_func_decorated(4)\n    sleep(1.5)\n    res2 = _error_throwing_func_decorated(4)\n    assert res1 == res2\n\n\n# test custom cache dir for pickle core\n\nCUSTOM_DIR = \"~/.exparrot\"\nEXPANDED_CUSTOM_DIR = os.path.expanduser(CUSTOM_DIR)\n\n\ndef _takes_2_seconds_custom_dir(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)\n    return f\"arg_1:{arg_1}, arg_2:{arg_2}\"\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_core_custom_cache_dir(separate_files):\n    \"\"\"Basic Pickle core functionality.\"\"\"\n    _takes_2_seconds_custom_dir_decorated = _get_decorated_func(\n        _takes_2_seconds_custom_dir,\n        next_time=False,\n        cache_dir=CUSTOM_DIR,\n        separate_files=separate_files,\n    )\n    _takes_2_seconds_custom_dir_decorated.clear_cache()\n    _takes_2_seconds_custom_dir_decorated(\"a\", \"b\")\n    start = time()\n    _takes_2_seconds_custom_dir_decorated(\"a\", \"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds_custom_dir_decorated.clear_cache()\n    path2test = _takes_2_seconds_custom_dir_decorated.cache_dpath()\n    assert path2test == EXPANDED_CUSTOM_DIR\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_callable_hash_param(separate_files):\n    def _hash_func(args, kwargs):\n        def _hash(obj):\n            if isinstance(obj, pd.core.frame.DataFrame):\n                return hashlib.sha256(\n                    pd.util.hash_pandas_object(obj).values.tobytes()\n                ).hexdigest()\n            return obj\n\n        k_args = tuple(map(_hash, args))\n        k_kwargs = tuple(\n            sorted({k: _hash(v) for k, v in kwargs.items()}.items())\n        )\n        return k_args + k_kwargs\n\n    @cachier(hash_func=_hash_func, separate_files=separate_files)\n    def _params_with_dataframe(*args, **kwargs):\n        \"\"\"Some function.\"\"\"\n        return random()\n\n    _params_with_dataframe.clear_cache()\n\n    df_a = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    df_b = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    value_a = _params_with_dataframe(df_a, 1)\n    value_b = _params_with_dataframe(df_b, 1)\n\n    assert value_a == value_b  # same content --> same key\n\n    value_a = _params_with_dataframe(1, df=df_a)\n    value_b = _params_with_dataframe(1, df=df_b)\n\n    assert value_a == value_b  # same content --> same key\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_mongo_core.py",
      "content": "\"\"\"Testing the MongoDB core of cachier.\"\"\"\n\nimport datetime\nimport hashlib\nimport platform\nimport queue\nimport sys\nimport threading\nfrom datetime import timedelta\nfrom random import random\nfrom time import sleep\nfrom urllib.parse import quote_plus\n\nimport pandas as pd\nimport pymongo\nimport pytest\nfrom birch import Birch  # type: ignore[import-not-found]\nfrom cachier import cachier\nfrom cachier.cores.base import RecalculationNeeded\nfrom cachier.cores.mongo import _MongoCore\nfrom pymongo.errors import OperationFailure\nfrom pymongo.mongo_client import MongoClient\nfrom pymongo_inmemory import MongoClient as InMemoryMongoClient\n\n# === Enables testing vs a real MongoDB instance ===\n\n\nclass CfgKey:\n    HOST = \"TEST_HOST\"\n    PORT = \"TEST_PORT\"\n    # UNAME = \"TEST_USERNAME\"\n    # PWD = \"TEST_PASSWORD\"\n    # DB = \"TEST_DB\"\n    TEST_VS_DOCKERIZED_MONGO = \"TEST_VS_DOCKERIZED_MONGO\"\n\n\nCFG = Birch(\n    namespace=\"cachier\",\n    defaults={CfgKey.TEST_VS_DOCKERIZED_MONGO: False},\n)\n\n\n# URI_TEMPLATE = \"mongodb://myUser:myPassword@localhost:27017/\"\nURI_TEMPLATE = \"mongodb://{host}:{port}?retrywrites=true&w=majority\"\n\n\ndef _get_cachier_db_mongo_client():\n    host = quote_plus(CFG[CfgKey.HOST])\n    port = quote_plus(CFG[CfgKey.PORT])\n    # uname = quote_plus(CFG[CfgKey.UNAME])\n    # pwd = quote_plus(CFG[CfgKey.PWD])\n    # db = quote_plus(CFG[CfgKey.DB])\n    uri = f\"mongodb://{host}:{port}?retrywrites=true&w=majority\"\n    return MongoClient(uri)\n\n\n_COLLECTION_NAME = (\n    f\"cachier_test_{platform.system()}\"\n    f\"_{'.'.join(map(str, sys.version_info[:3]))}\"\n)\n\n\ndef _test_mongetter():\n    if not hasattr(_test_mongetter, \"client\"):\n        if str(CFG.mget(CfgKey.TEST_VS_DOCKERIZED_MONGO)).lower() == \"true\":\n            print(\"Using live MongoDB instance for testing.\")\n            _test_mongetter.client = _get_cachier_db_mongo_client()\n        else:\n            print(\"Using in-memory MongoDB instance for testing.\")\n            _test_mongetter.client = InMemoryMongoClient()\n    db_obj = _test_mongetter.client[\"cachier_test\"]\n    if _COLLECTION_NAME not in db_obj.list_collection_names():\n        db_obj.create_collection(_COLLECTION_NAME)\n    return db_obj[_COLLECTION_NAME]\n\n\n# === Mongo core tests ===\n\n\n@pytest.mark.mongo\ndef test_information():\n    print(\"\\npymongo version: \", end=\"\")\n    print(pymongo.__version__)\n\n\n@pytest.mark.mongo\ndef test_mongo_index_creation():\n    \"\"\"Basic Mongo core functionality.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _test_mongo_caching(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    collection = _test_mongetter()\n    _test_mongo_caching.clear_cache()\n    val1 = _test_mongo_caching(1, 2)\n    val2 = _test_mongo_caching(1, 2)\n    assert val1 == val2\n    assert _MongoCore._INDEX_NAME in collection.index_information()\n\n\n@pytest.mark.mongo\ndef test_mongo_core():\n    \"\"\"Basic Mongo core functionality.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _test_mongo_caching(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _test_mongo_caching.clear_cache()\n    val1 = _test_mongo_caching(1, 2)\n    val2 = _test_mongo_caching(1, 2)\n    assert val1 == val2\n    val3 = _test_mongo_caching(1, 2, cachier__skip_cache=True)\n    assert val3 != val1\n    val4 = _test_mongo_caching(1, 2)\n    assert val4 == val1\n    val5 = _test_mongo_caching(1, 2, cachier__overwrite_cache=True)\n    assert val5 != val1\n    val6 = _test_mongo_caching(1, 2)\n    assert val6 == val5\n\n\n@pytest.mark.mongo\ndef test_mongo_core_keywords():\n    \"\"\"Basic Mongo core functionality with keyword arguments.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _test_mongo_caching(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _test_mongo_caching.clear_cache()\n    val1 = _test_mongo_caching(1, arg_2=2)\n    val2 = _test_mongo_caching(1, arg_2=2)\n    assert val1 == val2\n    val3 = _test_mongo_caching(1, arg_2=2, cachier__skip_cache=True)\n    assert val3 != val1\n    val4 = _test_mongo_caching(1, arg_2=2)\n    assert val4 == val1\n    val5 = _test_mongo_caching(1, arg_2=2, cachier__overwrite_cache=True)\n    assert val5 != val1\n    val6 = _test_mongo_caching(1, arg_2=2)\n    assert val6 == val5\n\n\nMONGO_DELTA = timedelta(seconds=3)\nMONGO_DELTA_LONG = timedelta(seconds=10)\n\n\n@pytest.mark.mongo\ndef test_mongo_stale_after():\n    \"\"\"Testing MongoDB core stale_after functionality.\"\"\"\n\n    @cachier(\n        mongetter=_test_mongetter, stale_after=MONGO_DELTA, next_time=False\n    )\n    def _stale_after_mongo(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _stale_after_mongo.clear_cache()\n    val1 = _stale_after_mongo(1, 2)\n    val2 = _stale_after_mongo(1, 2)\n    assert val1 == val2\n    sleep(3)\n    val3 = _stale_after_mongo(1, 2)\n    assert val3 != val1\n\n\ndef _calls_takes_time(res_queue):\n    @cachier(mongetter=_test_mongetter)\n    def _takes_time(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        sleep(3)\n        return random() + arg_1 + arg_2\n\n    res = _takes_time(34, 82.3)\n    res_queue.put(res)\n\n\n@pytest.mark.mongo\ndef test_mongo_being_calculated():\n    \"\"\"Testing MongoDB core handling of being calculated scenarios.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _takes_time(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        sleep(3)\n        return random() + arg_1 + arg_2\n\n    _takes_time.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread1.start()\n    sleep(1)\n    thread2.start()\n    thread1.join(timeout=4)\n    thread2.join(timeout=4)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\nclass _BadMongoCollection:\n    def __init__(self, mongetter):\n        self.collection = mongetter()\n        self.index_information = self.collection.index_information\n        self.create_indexes = self.collection.create_indexes\n        self.find_one = self.collection.find_one\n\n    def delete_many(self, *args, **kwargs):\n        pass\n\n    def update_many(self, *args, **kwargs):\n        pass\n\n    def update_one(self, *args, **kwargs):\n        raise OperationFailure(Exception())\n\n\ndef _bad_mongetter():\n    return _BadMongoCollection(_test_mongetter)\n\n\n@pytest.mark.mongo\ndef test_mongo_write_failure():\n    \"\"\"Testing MongoDB core handling of writing failure scenarios.\"\"\"\n\n    @cachier(mongetter=_bad_mongetter)\n    def _func_w_bad_mongo(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    with pytest.raises(OperationFailure):\n        val1 = _func_w_bad_mongo(1, 2)\n        val2 = _func_w_bad_mongo(1, 2)\n        assert val1 == val2\n\n\n@pytest.mark.mongo\ndef test_mongo_clear_being_calculated():\n    \"\"\"Testing MongoDB core clear_being_calculated.\"\"\"\n\n    @cachier(mongetter=_bad_mongetter)\n    def _func_w_bad_mongo(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _func_w_bad_mongo.clear_being_calculated()\n\n\n@pytest.mark.mongo\ndef test_stalled_mongo_db_cache():\n    @cachier(mongetter=_test_mongetter)\n    def _stalled_func():\n        return 1\n\n    core = _MongoCore(None, _test_mongetter, 0)\n    core.set_func(_stalled_func)\n    core.clear_cache()\n    with pytest.raises(RecalculationNeeded):\n        core.wait_on_entry_calc(key=None)\n\n\n@pytest.mark.mongo\ndef test_stalled_mong_db_core(monkeypatch):\n    def mock_get_entry(self, args, kwargs):\n        return \"key\", {\"being_calculated\": True}\n\n    def mock_get_entry_by_key(self, key):\n        return \"key\", None\n\n    monkeypatch.setattr(\n        \"cachier.cores.mongo._MongoCore.get_entry\", mock_get_entry\n    )\n    monkeypatch.setattr(\n        \"cachier.cores.mongo._MongoCore.get_entry_by_key\",\n        mock_get_entry_by_key,\n    )\n\n    @cachier(mongetter=_test_mongetter)\n    def _stalled_func():\n        return 1\n\n    res = _stalled_func()\n    assert res == 1\n\n    def mock_get_entry_2(self, args, kwargs):\n        entry = {\n            \"being_calculated\": True,\n            \"value\": 1,\n            \"time\": datetime.datetime.now() - datetime.timedelta(seconds=10),\n        }\n        return \"key\", entry\n\n    monkeypatch.setattr(\n        \"cachier.cores.mongo._MongoCore.get_entry\", mock_get_entry_2\n    )\n\n    stale_after = datetime.timedelta(seconds=1)\n\n    @cachier(mongetter=_test_mongetter, stale_after=stale_after)\n    def _stalled_func_2():\n        \"\"\"Testing stalled function.\"\"\"\n        return 2\n\n    res = _stalled_func_2()\n    assert res == 2\n\n    @cachier(\n        mongetter=_test_mongetter, stale_after=stale_after, next_time=True\n    )\n    def _stalled_func_3():\n        \"\"\"Testing stalled function.\"\"\"\n        return 3\n\n    res = _stalled_func_3()\n    assert res == 1\n\n\n@pytest.mark.mongo\ndef test_callable_hash_param():\n    def _hash_func(args, kwargs):\n        def _hash(obj):\n            if isinstance(obj, pd.core.frame.DataFrame):\n                return hashlib.sha256(\n                    pd.util.hash_pandas_object(obj).values.tobytes()\n                ).hexdigest()\n            return obj\n\n        k_args = tuple(map(_hash, args))\n        k_kwargs = tuple(\n            sorted({k: _hash(v) for k, v in kwargs.items()}.items())\n        )\n        return k_args + k_kwargs\n\n    @cachier(mongetter=_test_mongetter, hash_func=_hash_func)\n    def _params_with_dataframe(*args, **kwargs):\n        \"\"\"Some function.\"\"\"\n        return random()\n\n    _params_with_dataframe.clear_cache()\n\n    df_a = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    df_b = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    value_a = _params_with_dataframe(df_a, 1)\n    value_b = _params_with_dataframe(df_b, 1)\n\n    assert value_a == value_b  # same content --> same key\n\n    value_a = _params_with_dataframe(1, df=df_a)\n    value_b = _params_with_dataframe(1, df=df_b)\n\n    assert value_a == value_b  # same content --> same key\n"
    },
    {
      "path": "python-cachier_cachier/tests/requirements.txt",
      "content": "# todo: add some version range or pinning latest versions\n# tests and coverages\npytest\ncoverage\npytest-cov\nbirch\n# to connect to the test mongodb server\npymongo\ndnspython\npymongo-inmemory\n# to test pandas dataframe as-param hashing with mongodb core\npandas\n# to be able to run `python setup.py checkdocs`\ncollective.checkdocs\npygments\n"
    }
  ],
  "BuggyCode": [
    {
      "path": "python-cachier_cachier/MANIFEST.in",
      "content": "# Manifest syntax https://docs.python.org/2/distutils/sourcedist.html\ngraft wheelhouse\n\nrecursive-include src *.info\ninclude README.rst\ninclude LICENSE\n\nprune __pycache__\nprune tests\n"
    },
    {
      "path": "python-cachier_cachier/README.rst",
      "content": "Cachier\n#######\n\n|PyPI-Status| |Downloads| |PyPI-Versions| |Build-Status| |Codecov| |Codefactor| |LICENCE|\n\nPersistent, stale-free, local and cross-machine caching for Python functions.\n\n.. code-block:: python\n\n  from cachier import cachier\n  import datetime\n\n  @cachier(stale_after=datetime.timedelta(days=3))\n  def foo(arg1, arg2):\n    \"\"\"foo now has a persistent cache, triggering recalculation for values stored more than 3 days.\"\"\"\n    return {'arg1': arg1, 'arg2': arg2}\n\n\n.. role:: python(code)\n  :language: python\n\n.. contents::\n\n.. section-numbering:\n\n\n\nInstallation\n============\n\nInstall ``cachier`` with:\n\n.. code-block:: python\n\n    pip install cachier\n\nFor the latest version supporting Python 2.7 please use:\n\n.. code-block:: python\n\n    pip install 'cachier==1.2.8'\n\nFeatures\n========\n\n* Pure Python.\n* Compatible with Python 3.8+ (Python 2.7 was discontinued in version 1.2.8).\n* Supported and `tested on Linux, OS X and Windows <https://travis-ci.org/shaypal5/cachier>`_.\n* A simple interface.\n* Defining \"shelf life\" for cached values.\n* Local caching using pickle files.\n* Cross-machine caching using MongoDB.\n* Thread-safety.\n\nCachier is **NOT**:\n\n* Meant as a transient cache. Python's @lru_cache is better.\n* Especially fast. It is meant to replace function calls that take more than... a second, say (overhead is around 1 millisecond).\n\nFuture features\n---------------\n\n* S3 core.\n* Multi-core caching.\n* `Cache replacement policies <https://en.wikipedia.org/wiki/Cache_replacement_policies>`_\n\n\nUse\n===\n\nCachier provides a decorator which you can wrap around your functions to give them a persistent cache. The positional and keyword arguments to the wrapped function must be hashable (i.e. Python's immutable built-in objects, not mutable containers). Also, notice that since objects which are instances of user-defined classes are hashable but all compare unequal (their hash value is their id), equal objects across different sessions will not yield identical keys.\n\nSetting up a Cache\n------------------\nYou can add a default, pickle-based, persistent cache to your function - meaning it will last across different Python kernels calling the wrapped function - by decorating it with the ``cachier`` decorator (notice the ``()``!).\n\n.. code-block:: python\n\n  from cachier import cachier\n\n  @cachier()\n  def foo(arg1, arg2):\n    \"\"\"Your function now has a persistent cache mapped by argument values!\"\"\"\n    return {'arg1': arg1, 'arg2': arg2}\n\nClass and object methods can also be cached. Cachier will automatically ignore the `self` parameter when determining the cache key for an object method. **This means that methods will be cached across all instances of an object, which may not be what you want.**\n\n.. code-block:: python\n\n  from cachier import cachier\n\n  class Foo():\n    @staticmethod\n    @cachier()\n    def good_static_usage(arg_1, arg_2):\n      return arg_1 + arg_2\n\n    # Instance method does not depend on object's internal state, so good to cache\n    @cachier()\n    def good_usage_1(self, arg_1, arg_2):\n      return arg_1 + arg_2\n\n    # Instance method is calling external service, probably okay to cache\n    @cachier()\n    def good_usage_2(self, arg_1, arg_2):\n      result = self.call_api(arg_1, arg_2)\n      return result\n\n    # Instance method relies on object attribute, NOT good to cache\n    @cachier()\n    def bad_usage(self, arg_1, arg_2):\n      return arg_1 + arg_2 + self.arg_3\n\n\nResetting a Cache\n-----------------\nThe Cachier wrapper adds a ``clear_cache()`` function to each wrapped function. To reset the cache of the wrapped function simply call this method:\n\n.. code-block:: python\n\n  foo.clear_cache()\n\nGeneral Configuration\n----------------------\n\nGlobal Defaults\n~~~~~~~~~~~~~~~\n\nSettings can be globally configured across all Cachier wrappers through the use of the `set_default_params` function. This function takes the same keyword parameters as the ones defined in the decorator, which can be passed all at once or with multiple calls. Parameters given directly to a decorator take precedence over any values set by this function.\n\nThe following parameters will only be applied to decorators defined after `set_default_params` is called:\n\n*  `hash_func`\n*  `backend`\n*  `mongetter`\n*  `cache_dir`\n*  `pickle_reload`\n*  `separate_files`\n\nThese parameters can be changed at any time and they will apply to all decorators:\n\n*  `allow_none`\n*  `caching_enabled`\n*  `stale_after`\n*  `next_time`\n*  `wait_for_calc_timeout`\n\nThe current defaults can be fetched by calling `get_default_params`.\n\nThreads Limit\n~~~~~~~~~~~~~\n\nTo limit the number of threads Cachier is allowed to spawn, set the ``CACHIER_MAX_WORKERS`` with the desired number. The default is 8, so to enable Cachier to spawn even more threads, you'll have to set a higher limit explicitly.\n\n\nGlobal Enable/Disable\n---------------------\n\nCaching can be turned off across all decorators by calling `disable_caching`, and then re-activated by calling `enable_caching`.\n\nThese functions are convenience wrappers around the `caching_enabled` default setting.\n\n\nCache Shelf Life\n----------------\n\nSetting Shelf Life\n~~~~~~~~~~~~~~~~~~\nYou can set any duration as the shelf life of cached return values of a function by providing a corresponding ``timedelta`` object to the ``stale_after`` parameter:\n\n.. code-block:: python\n\n  import datetime\n\n  @cachier(stale_after=datetime.timedelta(weeks=2))\n  def bar(arg1, arg2):\n    return {'arg1': arg1, 'arg2': arg2}\n\nNow when a cached value matching the given arguments is found the time of its calculation is checked; if more than ``stale_after`` time has since passed, the function will be run again for the same arguments and the new value will be cached and returned.\n\nThis is useful for lengthy calculations that depend on a dynamic data source.\n\nFuzzy Shelf Life\n~~~~~~~~~~~~~~~~\nSometimes you may want your function to trigger a calculation when it encounters a stale result, but still not wait on it if it's not that critical. In that case, you can set ``next_time`` to ``True`` to have your function trigger a recalculation **in a separate thread**, but return the currently cached stale value:\n\n.. code-block:: python\n\n  @cachier(next_time=True)\n\nFurther function calls made while the calculation is being performed will not trigger redundant calculations.\n\n\nWorking with unhashable arguments\n---------------------------------\n\nAs mentioned above, the positional and keyword arguments to the wrapped function must be hashable (i.e. Python's immutable built-in objects, not mutable containers). To get around this limitation the ``hash_func`` parameter of the ``cachier`` decorator can be provided with a callable that gets the args and kwargs from the decorated function and returns a hash key for them.\n\n.. code-block:: python\n\n  def calculate_hash(args, kwds):\n    key = ...  # compute a hash key here based on arguments\n    return key\n\n  @cachier(hash_func=calculate_hash)\n  def calculate_super_complex_stuff(custom_obj):\n    # amazing code goes here\n\nSee here for an example:\n\n`Question: How to work with unhashable arguments <https://github.com/python-cachier/cachier/issues/91>`_\n\n\nPrecaching values\n---------------------------------\n\nIf you want to load a value into the cache without calling the underlying function, this can be done with the `precache_value` function.\n\n.. code-block:: python\n\n  @cachier()\n  def add(arg1, arg2):\n    return arg1 + arg2\n\n  add.precache_value(2, 2, value_to_cache=5)\n\n  result = add(2, 2)\n  print(result)  # prints 5\n\n\nPer-function call arguments\n---------------------------\n\nCachier also accepts several keyword arguments in the calls of the function it wraps rather than in the decorator call, allowing you to modify its behaviour for a specific function call.\n\nIgnore Cache\n~~~~~~~~~~~~\n\nYou can have ``cachier`` ignore any existing cache for a specific function call by passing ``cachier__skip_cache=True`` to the function call. The cache will neither be checked nor updated with the new return value.\n\n.. code-block:: python\n\n  @cachier()\n  def sum(first_num, second_num):\n    return first_num + second_num\n\n  def main():\n    print(sum(5, 3, cachier__skip_cache=True))\n\nOverwrite Cache\n~~~~~~~~~~~~~~~\n\nYou can have ``cachier`` overwrite an existing cache entry - if one exists - for a specific function call by passing ``cachier__overwrite_cache=True`` to the function call. The cache will not be checked but will be updated with the new return value.\n\nVerbose Cache Call\n~~~~~~~~~~~~~~~~~~\n\nYou can have ``cachier`` print out a detailed explanation of the logic of a specific call by passing ``cachier__verbose=True`` to the function call. This can be useful if you are not sure why a certain function result is, or is not, returned.\n\nCache `None` Values\n~~~~~~~~~~~~~~~~~~~\n\nBy default, ``cachier`` does not cache ``None`` values. You can override this behaviour by passing ``allow_none=True`` to the function call.\n\n\nCachier Cores\n=============\n\nPickle Core\n-----------\n\nThe default core for Cachier is pickle based, meaning each function will store its cache is a separate pickle file in the ``~/.cachier`` directory. Naturally, this kind of cache is both machine-specific and user-specific.\n\nYou can configure ``cachier`` to use another directory by providing the ``cache_dir`` parameter with the path to that directory:\n\n.. code-block:: python\n\n  @cachier(cache_dir='~/.temp/.cache')\n\n\nYou can slightly optimise pickle-based caching if you know your code will only be used in a single thread environment by setting:\n\n.. code-block:: python\n\n  @cachier(pickle_reload=False)\n\nThis will prevent reading the cache file on each cache read, speeding things up a bit, while also nullifying inter-thread functionality (the code is still thread safe, but different threads will have different versions of the cache at times, and will sometime make unnecessary function calls).\n\nSetting the optional argument ``separate_files`` to ``True`` will cause the cache to be stored in several files: A file per argument set, per function. This can help if your per-function cache files become too large.\n\n.. code-block:: python\n\n  from cachier import cachier\n\n  @cachier(separate_files=True)\n  def foo(arg1, arg2):\n    \"\"\"Your function now has a persistent cache mapped by argument values, split across several files, per argument set\"\"\"\n    return {'arg1': arg1, 'arg2': arg2}\n\nYou can get the fully qualified path to the directory of cache files used by ``cachier`` (``~/.cachier`` by default) by calling the ``cache_dpath()`` function:\n\n.. code-block:: python\n\n  >>> foo.cache_dpath()\n      \"/home/user/.cachier/\"\n\n\nMongoDB Core\n------------\nYou can set a MongoDB-based cache by assigning ``mongetter`` with a callable that returns a ``pymongo.Collection`` object with writing permissions:\n\n.. code-block:: python\n\n    from pymongo import MongoClient\n\n    def my_mongetter():\n        client = MongoClient(get_cachier_db_auth_uri())\n        db_obj = client['cachier_db']\n        if 'someapp_cachier_db' not in db_obj.list_collection_names():\n            db_obj.create_collection('someapp_cachier_db')\n        return db_obj['someapp_cachier_db']\n\n  @cachier(mongetter=my_mongetter)\n\nThis allows you to have a cross-machine, albeit slower, cache. This functionality requires that the installation of the ``pymongo`` python package.\n\nIn certain cases the MongoDB backend might leave a deadlock behind, blocking all subsequent requests from being processed. If you encounter this issue, supply the ``wait_for_calc_timeout`` with a reasonable number of seconds; calls will then wait at most this number of seconds before triggering a recalculation.\n\n.. code-block:: python\n\n  @cachier(mongetter=False, wait_for_calc_timeout=2)\n\n\nMemory Core\n-----------\n\nYou can set an in-memory cache by assigning the ``backend`` parameter with ``'memory'``:\n\n.. code-block:: python\n\n  @cachier(backend='memory')\n\nNote, however, that ``cachier``'s in-memory core is simple, and has no monitoring or cap on cache size, and can thus lead to memory errors on large return values - it is mainly intended to be used with future multi-core functionality. As a rule, Python's built-in ``lru_cache`` is a much better stand-alone solution.\n\n\nContributing\n============\n\nCurrent maintainers are Shay Palachy Affek (`shay.palachy@gmail.com <mailto:shay.palachy@gmail.com>`_, `@shaypal5 <https://github.com/shaypal5>`_) and Judson Neer (`@lordjabez <https://github.com/lordjabez>`_); You are more than welcome to approach them for help. Contributions are very welcomed! :)\n\nInstalling for development\n--------------------------\n\nClone:\n\n.. code-block:: bash\n\n  git clone git@github.com:python-cachier/cachier.git\n\n\nInstall in development mode with test dependencies:\n\n.. code-block:: bash\n\n  cd cachier\n  pip install -e . -r tests/requirements.txt\n\n\nRunning the tests\n-----------------\n\nTo run the tests, call the ``pytest`` command in the repository's root, or:\n\n.. code-block:: bash\n\n  python -m pytest\n\nTo run only MongoDB core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m mongo\n\nTo run only memory core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m memory\n\nTo run all tests EXCEPT MongoDB core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m \"not mongo\"\n\n\nTo run all tests EXCEPT memory core AND MongoDB core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m \"not (mongo or memory)\"\n\n\nRunning MongoDB tests against a live MongoDB instance\n-----------------------------------------------------\n\n**Note to developers:** By default, all MongoDB tests are run against a mocked MongoDB instance, provided by the ``pymongo_inmemory`` package. To run them against a live MongoDB instance, the ``CACHIER_TEST_VS_DOCKERIZED_MONGO`` environment variable is set to ``True`` in the ``test`` environment of this repository (and additional environment variables are populated with the appropriate credentials), used by the GitHub Action running tests on every commit and pull request.\n\nContributors are not expected to run these tests against a live MongoDB instance when developing, as credentials for the testing instance used will NOT be shared, but rather use the testing against the in-memory MongoDB instance as a good proxy.\n\n**HOWEVER, the tests run against a live MongoDB instance when you submit a PR are the determining tests for deciding whether your code functions correctly against MongoDB.**\n\n\nAdding documentation\n--------------------\n\nThis project is documented using the `numpy docstring conventions`_, which were chosen as they are perhaps the most widely-spread conventions that are both supported by common tools such as Sphinx and result in human-readable docstrings (in my personal opinion, of course). When documenting code you add to this project, please follow `these conventions`_.\n\n.. _`numpy docstring conventions`: https://github.com/numpy/numpy/blob/master/doc/HOWTO_DOCUMENT.rst.txt\n.. _`these conventions`: https://github.com/numpy/numpy/blob/master/doc/HOWTO_DOCUMENT.rst.txt\n\nAdditionally, if you update this ``README.rst`` file, use ``python setup.py checkdocs`` to validate it compiles.\n\n\nCredits\n=======\n\nCreated by `Shay Palachy Affek <https://github.com/shaypal5>`_ (shay.palachy@gmail.com), which currently assists in maintenance.\n\nCurrent lead developer/contributor: `Jirka Borovec <https://github.com/Borda>`_ (`@Borda <https://github.com/Borda>`_ on GitHub).\n\nOther major contributors:\n\n* `Jirka Borovec <https://github.com/Borda>`_ - Arg order independence, args-to-kwargs for less unique keys and numerous development and CI contributions.\n\n* `Judson Neer <https://github.com/lordjabez>`_ - Precaching, method caching support and numerous improvements and bugfixes.\n\n* `cthoyt <https://github.com/cthoyt>`_ - Base memory core implementation.\n\n* `amarczew <https://github.com/amarczew>`_ - The ``hash_func`` kwarg.\n\n* `non-senses <https://github.com/non-senses>`_ - The ``wait_for_calc_timeout`` kwarg.\n\n* `Elad Rapaport <https://github.com/erap129>`_ - Multi-file Pickle core, a.k.a ``separate_files`` (released on ``v1.5.3``).\n\n* `John Didion <https://github.com/jdidion>`_ - Support for pickle-based caching for cases where two identically-named methods of different classes are defined in the same module.\n\nNotable bugfixers:\n\n* `MichaelRazum <https://github.com/MichaelRazum>`_.\n\n* `Eric Ma <https://github.com/ericmjl>`_ - The iNotify bugfix (released on ``v1.5.3``).\n\n* `Ofir <https://github.com/ofirnk>`_ - The iNotify bugfix (released on ``v1.5.3``).\n\n\n\n.. |PyPI-Status| image:: https://img.shields.io/pypi/v/cachier.svg\n  :target: https://pypi.python.org/pypi/cachier\n\n.. |PyPI-Versions| image:: https://img.shields.io/pypi/pyversions/cachier.svg\n   :target: https://pypi.python.org/pypi/cachier\n\n.. |Build-Status| image:: https://github.com/python-cachier/cachier/actions/workflows/test.yml/badge.svg\n  :target: https://github.com/python-cachier/cachier/actions/workflows/test.yml\n\n.. |LICENCE| image:: https://img.shields.io/pypi/l/cachier.svg\n  :target: https://pypi.python.org/pypi/cachier\n\n.. |Codecov| image:: https://codecov.io/github/python-cachier/cachier/coverage.svg?branch=master\n   :target: https://codecov.io/github/python-cachier/cachier?branch=master\n\n.. |Downloads| image:: https://pepy.tech/badge/cachier\n     :target: https://pepy.tech/project/cachier\n     :alt: PePy stats\n\n.. |Codefactor| image:: https://www.codefactor.io/repository/github/python-cachier/cachier/badge?style=plastic\n     :target: https://www.codefactor.io/repository/github/python-cachier/cachier\n     :alt: Codefactor code quality\n\n.. links:\n.. _pymongo: https://api.mongodb.com/python/current/\n.. _watchdog: https://github.com/gorakhargosh/watchdog\n"
    },
    {
      "path": "python-cachier_cachier/LICENSE",
      "content": "MIT License\n\nCopyright (c) 2016 Shay Palachy\n\nPermission is hereby granted, free of charge, to any person obtaining a copy\nof this software and associated documentation files (the \"Software\"), to deal\nin the Software without restriction, including without limitation the rights\nto use, copy, modify, merge, publish, distribute, sublicense, and/or sell\ncopies of the Software, and to permit persons to whom the Software is\nfurnished to do so, subject to the following conditions:\n\nThe above copyright notice and this permission notice shall be included in all\ncopies or substantial portions of the Software.\n\nTHE SOFTWARE IS PROVIDED \"AS IS\", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR\nIMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,\nFITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE\nAUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER\nLIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,\nOUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE\nSOFTWARE.\n"
    },
    {
      "path": "python-cachier_cachier/pyproject.toml",
      "content": "[project]\nname = \"cachier\"\ndescription = \"Persistent, stale-free, local and cross-machine caching for Python functions.\"\nreadme = \"README.rst\"\nkeywords = [\n  \"cache\",\n  \"caching\",\n  \"cross-machine\",\n  \"decorator\",\n  \"local\",\n  \"memoization\",\n  \"mongo\",\n  \"persistent\",\n]\nlicense = {file = \"LICENSE\"}\nauthors = [\n   {name = \"Shay Palachy Affek\", email = 'shay.palachy@gmail.com'},\n]\nclassifiers = [\n  'Development Status :: 4 - Beta',\n  'Intended Audience :: Developers',\n  'License :: OSI Approved :: MIT License',\n  'Programming Language :: Python',\n  \"Programming Language :: Python :: 3 :: Only\",\n  \"Programming Language :: Python :: 3.8\",\n  \"Programming Language :: Python :: 3.9\",\n  \"Programming Language :: Python :: 3.10\",\n  \"Programming Language :: Python :: 3.11\",\n  \"Programming Language :: Python :: 3.12\",\n  \"Topic :: Other/Nonlisted Topic\",\n  \"Topic :: Software Development :: Libraries\",\n  \"Topic :: Software Development :: Libraries :: Python Modules\",\n  \"Topic :: Utilities\",\n]\ndynamic = [\n  \"version\",\n]\ndependencies = [\n  \"portalocker>=2.3.2\",\n  \"watchdog>=2.3.1\",\n]\n[project.urls]\nSource = \"https://github.com/python-cachier/cachier\"\n[project.scripts]\ncachier = \"cachier.__naim__:cli\"\n\n[tool.setuptools]\ninclude-package-data = true\n\n[tool.setuptools.dynamic]\nversion = {attr = \"cachier._version.__version__\"}\n\n[tool.setuptools.packages.find]\nwhere = [\"src\"]  # list of folders that contain the packages ([\".\"] by default)\ninclude = [\"cachier*\"]  # package names should match these glob patterns ([\"*\"] by default)\nnamespaces = false  # to disable scanning PEP 420 namespaces (true by default)\n\n[build-system]\nrequires = [\n  \"setuptools\",\n  \"wheel\",\n]\n\n[tool.pytest.ini_options]\ntestpaths = [\"cachier\", \"tests\"]\nnorecursedirs=[\"dist\", \"build\"]\naddopts = [\n    \"--color=yes\",\n    \"--cov=cachier\",\n    \"--cov-report=term\",\n    \"--cov-report=xml:cov.xml\",\n    \"-r a\",\n    \"-v\",\n    \"-s\",\n]\nmarkers = [\n    \"mongo: test the MongoDB core\",\n    \"memory: test the memory core\",\n    \"pickle: test the pickle core\"\n]\n\n[tool.docformatter]\nrecursive = true\n# some docstring start with r\"\"\"\nwrap-summaries = 79\nwrap-descriptions = 79\nblank = true\n\n[tool.ruff]\ntarget-version = \"py38\"\nline-length = 79\n# Enable Pyflakes `E` and `F` codes by default.\nlint.select = [\n    \"E\",\n    \"W\", # see: https://pypi.org/project/pycodestyle\n    \"F\", # see: https://pypi.org/project/pyflakes\n    \"I\", #see: https://pypi.org/project/isort/\n    \"D\", # see: https://pypi.org/project/pydocstyle\n    #\"N\", # see: https://pypi.org/project/pep8-naming\n    \"S\", # see: https://pypi.org/project/flake8-bandit\n    \"SIM\",\n    \"RUF100\" # alternative to yesqa\n]\nlint.extend-select = [\n    \"A\",    # see: https://pypi.org/project/flake8-builtins\n    \"B\",    # see: https://pypi.org/project/flake8-bugbear\n    \"C4\",   # see: https://pypi.org/project/flake8-comprehensions\n]\nlint.ignore = [\n    \"E203\",\n    \"C901\",\n]\n# Exclude a variety of commonly ignored directories.\nexclude = [\n    \".eggs\",\n    \".git\",\n    \".ruff_cache\",\n    \"__pypackages__\",\n    \"_build\",\n    \"build\",\n    \"dist\",\n]\nlint.ignore-init-module-imports = true\nlint.unfixable = [\"F401\"]\n\n[tool.ruff.lint.per-file-ignores]\n\"src/**/__init__.py\" = [\"D104\"]\n\"src/cachier/config.py\" = [\"D100\"]\n\"tests/**\" = [\n  \"S101\", \"S311\", \"S105\", \"S603\",\n  \"D100\", \"D101\", \"D103\", \"D104\", \"D401\"\n]\n\n#[tool.ruff.pydocstyle]\n## Use Google-style docstrings.\n#convention = \"google\"\n\n#[tool.ruff.pycodestyle]\n#ignore-overlong-task-comments = true\n\n[tool.ruff.lint.mccabe]\n# Unlike Flake8, default to a complexity level of 10.\nmax-complexity = 10\n\n[tool.coverage.run]\nbranch = true\ndynamic_context = \"test_function\"\nomit = [\n    \"tests/*\",\n    \"cachier/_version.py\",\n    \"cachier/__init__.py\",\n    \"**/scripts/**\",\n]\n[tool.coverage.report]\nshow_missing = true\n# Regexes for lines to exclude from consideration\nexclude_lines = [\n    \"pragma: no cover\", # Have to re-enable the standard pragma\n    \"raise NotImplementedError\", # Don't complain if tests don't hit defensive assertion code:\n    \"if TYPE_CHECKING:\", # Is only true when running mypy, not tests\n]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/__init__.py",
      "content": "from ._version import *  # noqa: F403\nfrom .config import (\n    disable_caching,\n    enable_caching,\n    get_default_params,\n    set_default_params,\n)\nfrom .core import cachier\n\n__all__ = [\n    \"cachier\",\n    \"set_default_params\",\n    \"get_default_params\",\n    \"enable_caching\",\n    \"disable_caching\",\n]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/config.py",
      "content": "import datetime\nimport hashlib\nimport os\nimport pickle\nfrom typing import Optional, TypedDict, Union\n\nfrom ._types import Backend, HashFunc, Mongetter\n\n\ndef _default_hash_func(args, kwds):\n    # Sort the kwargs to ensure consistent ordering\n    sorted_kwargs = sorted(kwds.items())\n    # Serialize args and sorted_kwargs using pickle or similar\n    serialized = pickle.dumps((args, sorted_kwargs))\n    # Create a hash of the serialized data\n    return hashlib.sha256(serialized).hexdigest()\n\n\nclass Params(TypedDict):\n    \"\"\"Type definition for cachier parameters.\"\"\"\n\n    caching_enabled: bool\n    hash_func: HashFunc\n    backend: Backend\n    mongetter: Optional[Mongetter]\n    stale_after: datetime.timedelta\n    next_time: bool\n    cache_dir: Union[str, os.PathLike]\n    pickle_reload: bool\n    separate_files: bool\n    wait_for_calc_timeout: int\n    allow_none: bool\n\n\n_default_params: Params = {\n    \"caching_enabled\": True,\n    \"hash_func\": _default_hash_func,\n    \"backend\": \"pickle\",\n    \"mongetter\": None,\n    \"stale_after\": datetime.timedelta.max,\n    \"next_time\": False,\n    \"cache_dir\": \"~/.cachier/\",\n    \"pickle_reload\": True,\n    \"separate_files\": False,\n    \"wait_for_calc_timeout\": 0,\n    \"allow_none\": False,\n}\n\n\ndef _update_with_defaults(\n    param, name: str, func_kwargs: Optional[dict] = None\n):\n    import cachier\n\n    if func_kwargs:\n        kw_name = f\"cachier__{name}\"\n        if kw_name in func_kwargs:\n            return func_kwargs.pop(kw_name)\n    if param is None:\n        return cachier.config._default_params[name]\n    return param\n\n\ndef set_default_params(**params):\n    \"\"\"Configure global parameters applicable to all memoized functions.\n\n    This function takes the same keyword parameters as the ones defined in the\n    decorator, which can be passed all at once or with multiple calls.\n    Parameters given directly to a decorator take precedence over any values\n    set by this function.\n\n    Only 'stale_after', 'next_time', and 'wait_for_calc_timeout' can be changed\n    after the memoization decorator has been applied. Other parameters will\n    only have an effect on decorators applied after this function is run.\n\n    \"\"\"\n    import cachier\n\n    valid_params = (\n        p for p in params.items() if p[0] in cachier.config._default_params\n    )\n    _default_params.update(valid_params)\n\n\ndef get_default_params():\n    \"\"\"Get current set of default parameters.\"\"\"\n    import cachier\n\n    return cachier.config._default_params\n\n\ndef enable_caching():\n    \"\"\"Enable caching globally.\"\"\"\n    import cachier\n\n    cachier.config._default_params[\"caching_enabled\"] = True\n\n\ndef disable_caching():\n    \"\"\"Disable caching globally.\"\"\"\n    import cachier\n\n    cachier.config._default_params[\"caching_enabled\"] = False\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/_version.py",
      "content": "# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2024, Jirka Borovec <***@gmail.com>\n\nimport os\n\n_PATH_HERE = os.path.dirname(__file__)\n_PATH_VERSION = os.path.join(_PATH_HERE, \"version.info\")\n_RELEASING_PROCESS = os.getenv(\"RELEASING_PROCESS\", \"0\") == \"1\"\n\nwith open(_PATH_VERSION) as fopen:\n    __version__ = fopen.read().strip()\n\n\ndef _get_git_sha() -> str:\n    from subprocess import DEVNULL, check_output\n\n    out = check_output([\"git\", \"rev-parse\", \"--short\", \"HEAD\"], stderr=DEVNULL)  # noqa: S603, S607\n    return out.decode(\"utf-8\").strip()\n\n\nif not _RELEASING_PROCESS:\n    try:\n        sha_short = _get_git_sha()\n        # print(f\"Version enriched with git commit hash: {__version__}.\")\n    except Exception:\n        # print(\"Failed to get the git commit hash,\"\n        #       f\" falling back to base version {__version__}.\")\n        sha_short = \"\"\n    __version__ += f\".dev+{sha_short}\" if sha_short else \".dev\"\n\n\n__all__ = [\"__version__\"]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/version.info",
      "content": "3.0.0\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/py.typed",
      "content": ""
    },
    {
      "path": "python-cachier_cachier/src/cachier/_types.py",
      "content": "from typing import TYPE_CHECKING, Callable, Literal\n\nif TYPE_CHECKING:\n    import pymongo.collection\n\n\nHashFunc = Callable[..., str]\nMongetter = Callable[[], \"pymongo.collection.Collection\"]\nBackend = Literal[\"pickle\", \"mongo\", \"memory\"]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/__main__.py",
      "content": "\"\"\"A command-line interface for cachier.\"\"\"\n\nimport click\n\nfrom cachier.core import _set_max_workers\n\n\n@click.group()\ndef cli():\n    \"\"\"A command-line interface for cachier.\"\"\"  # noqa: D401\n\n\n@cli.command(\"Limits the number of worker threads used by cachier.\")\n@click.argument(\"max_workers\", nargs=1, type=int)\ndef set_max_workers(max_workers):\n    \"\"\"Limits the number of worker threads used by cachier.\"\"\"\n    _set_max_workers(max_workers)\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/core.py",
      "content": "\"\"\"Persistent, stale-free memoization decorators for Python.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\nimport datetime\nimport inspect\nimport os\nimport warnings\nfrom collections import OrderedDict\nfrom concurrent.futures import ThreadPoolExecutor\nfrom functools import wraps\nfrom typing import Optional, Union\nfrom warnings import warn\n\nfrom .config import (\n    Backend,\n    HashFunc,\n    Mongetter,\n    _default_params,\n    _update_with_defaults,\n)\nfrom .cores.base import RecalculationNeeded, _BaseCore\nfrom .cores.memory import _MemoryCore\nfrom .cores.mongo import _MongoCore\nfrom .cores.pickle import _PickleCore\n\nMAX_WORKERS_ENVAR_NAME = \"CACHIER_MAX_WORKERS\"\nDEFAULT_MAX_WORKERS = 8\n\n\ndef _max_workers():\n    return int(os.environ.get(MAX_WORKERS_ENVAR_NAME, DEFAULT_MAX_WORKERS))\n\n\ndef _set_max_workers(max_workers):\n    os.environ[MAX_WORKERS_ENVAR_NAME] = str(max_workers)\n    _get_executor(True)\n\n\ndef _get_executor(reset=False):\n    if reset or not hasattr(_get_executor, \"executor\"):\n        _get_executor.executor = ThreadPoolExecutor(_max_workers())\n    return _get_executor.executor\n\n\ndef _function_thread(core, key, func, args, kwds):\n    try:\n        func_res = func(*args, **kwds)\n        core.set_entry(key, func_res)\n    except BaseException as exc:\n        print(f\"Function call failed with the following exception:\\n{exc}\")\n\n\ndef _calc_entry(core, key, func, args, kwds):\n    try:\n        core.mark_entry_being_calculated(key)\n        # _get_executor().submit(core.mark_entry_being_calculated, key)\n        func_res = func(*args, **kwds)\n        core.set_entry(key, func_res)\n        # _get_executor().submit(core.set_entry, key, func_res)\n        return func_res\n    finally:\n        core.mark_entry_not_calculated(key)\n\n\ndef _convert_args_kwargs(\n    func, _is_method: bool, args: tuple, kwds: dict\n) -> dict:\n    \"\"\"Convert mix of positional and keyword arguments to aggregated kwargs.\"\"\"\n    # unwrap if the function is functools.partial\n    if hasattr(func, \"func\"):\n        args = func.args + args\n        kwds.update({k: v for k, v in func.keywords.items() if k not in kwds})\n        func = func.func\n    func_params = list(inspect.signature(func).parameters)\n    args_as_kw = dict(\n        zip(func_params[1:], args[1:])\n        if _is_method\n        else zip(func_params, args)\n    )\n    # init with default values\n    kwargs = {\n        k: v.default\n        for k, v in inspect.signature(func).parameters.items()\n        if v.default is not inspect.Parameter.empty\n    }\n    # merge args expanded as kwargs and the original kwds\n    kwargs.update(dict(**args_as_kw, **kwds))\n    return OrderedDict(sorted(kwargs.items()))\n\n\ndef _pop_kwds_with_deprecation(kwds, name: str, default_value: bool):\n    if name in kwds:\n        warnings.warn(\n            f\"`{name}` is deprecated and will be removed in a future release,\"\n            \" use `cachier__` alternative instead.\",\n            DeprecationWarning,\n            stacklevel=2,\n        )\n    return kwds.pop(name, default_value)\n\n\ndef cachier(\n    hash_func: Optional[HashFunc] = None,\n    hash_params: Optional[HashFunc] = None,\n    backend: Optional[Backend] = None,\n    mongetter: Optional[Mongetter] = None,\n    stale_after: Optional[datetime.timedelta] = None,\n    next_time: Optional[bool] = None,\n    cache_dir: Optional[Union[str, os.PathLike]] = None,\n    pickle_reload: Optional[bool] = None,\n    separate_files: Optional[bool] = None,\n    wait_for_calc_timeout: Optional[int] = None,\n    allow_none: Optional[bool] = None,\n):\n    \"\"\"Wrap as a persistent, stale-free memoization decorator.\n\n    The positional and keyword arguments to the wrapped function must be\n    hashable (i.e. Python's immutable built-in objects, not mutable\n    containers). Also, notice that since objects which are instances of\n    user-defined classes are hashable but all compare unequal (their hash\n    value is their id), equal objects across different sessions will not yield\n    identical keys.\n\n    Arguments:\n    ---------\n    hash_func : callable, optional\n        A callable that gets the args and kwargs from the decorated function\n        and returns a hash key for them. This parameter can be used to enable\n        the use of cachier with functions that get arguments that are not\n        automatically hashable by Python.\n    hash_params : callable, optional\n    backend : str, optional\n        The name of the backend to use. Valid options currently include\n        'pickle', 'mongo' and 'memory'. If not provided, defaults to\n        'pickle' unless the 'mongetter' argument is passed, in which\n        case the mongo backend is automatically selected.\n    mongetter : callable, optional\n        A callable that takes no arguments and returns a pymongo.Collection\n        object with writing permissions. If unset a local pickle cache is used\n        instead.\n    stale_after : datetime.timedelta, optional\n        The time delta after which a cached result is considered stale. Calls\n        made after the result goes stale will trigger a recalculation of the\n        result, but whether a stale or fresh result will be returned is\n        determined by the optional next_time argument.\n    next_time : bool, optional\n        If set to True, a stale result will be returned when finding one, not\n        waiting for the calculation of the fresh result to return. Defaults to\n        False.\n    cache_dir : str, optional\n        A fully qualified path to a file directory to be used for cache files.\n        The running process must have running permissions to this folder. If\n        not provided, a default directory at `~/.cachier/` is used.\n    pickle_reload : bool, optional\n        If set to True, in-memory cache will be reloaded on each cache read,\n        enabling different threads to share cache. Should be set to False for\n        faster reads in single-thread programs. Defaults to True.\n    separate_files: bool, default False, for Pickle cores only\n        Instead of a single cache file per-function, each function's cache is\n        split between several files, one for each argument set. This can help\n        if you per-function cache files become too large.\n    wait_for_calc_timeout: int, optional, for MongoDB only\n        The maximum time to wait for an ongoing calculation. When a\n        process started to calculate the value setting being_calculated to\n        True, any process trying to read the same entry will wait a maximum of\n        seconds specified in this parameter. 0 means wait forever.\n        Once the timeout expires the calculation will be triggered.\n    allow_none: bool, optional\n        Allows storing None values in the cache. If False, functions returning\n        None will not be cached and are recalculated every call.\n\n    \"\"\"\n    # Check for deprecated parameters\n    if hash_params is not None:\n        message = (\n            \"hash_params will be removed in a future release, \"\n            \"please use hash_func instead\"\n        )\n        warn(message, DeprecationWarning, stacklevel=2)\n        hash_func = hash_params\n    # Update parameters with defaults if input is None\n    backend = _update_with_defaults(backend, \"backend\")\n    mongetter = _update_with_defaults(mongetter, \"mongetter\")\n    # Override the backend parameter if a mongetter is provided.\n    if callable(mongetter):\n        backend = \"mongo\"\n    core: _BaseCore\n    if backend == \"pickle\":\n        core = _PickleCore(\n            hash_func=hash_func,\n            pickle_reload=pickle_reload,\n            cache_dir=cache_dir,\n            separate_files=separate_files,\n            wait_for_calc_timeout=wait_for_calc_timeout,\n        )\n    elif backend == \"mongo\":\n        core = _MongoCore(\n            hash_func=hash_func,\n            mongetter=mongetter,\n            wait_for_calc_timeout=wait_for_calc_timeout,\n        )\n    elif backend == \"memory\":\n        core = _MemoryCore(\n            hash_func=hash_func, wait_for_calc_timeout=wait_for_calc_timeout\n        )\n    else:\n        raise ValueError(\"specified an invalid core: %s\" % backend)\n\n    def _cachier_decorator(func):\n        core.set_func(func)\n\n        @wraps(func)\n        def func_wrapper(*args, **kwds):\n            nonlocal allow_none\n            _allow_none = _update_with_defaults(allow_none, \"allow_none\", kwds)\n            # print('Inside general wrapper for {}.'.format(func.__name__))\n            ignore_cache = _pop_kwds_with_deprecation(\n                kwds, \"ignore_cache\", False\n            )\n            overwrite_cache = _pop_kwds_with_deprecation(\n                kwds, \"overwrite_cache\", False\n            )\n            verbose = _pop_kwds_with_deprecation(kwds, \"verbose_cache\", False)\n            ignore_cache = kwds.pop(\"cachier__skip_cache\", ignore_cache)\n            overwrite_cache = kwds.pop(\n                \"cachier__overwrite_cache\", overwrite_cache\n            )\n            verbose = kwds.pop(\"cachier__verbose\", verbose)\n            _stale_after = _update_with_defaults(\n                stale_after, \"stale_after\", kwds\n            )\n            _next_time = _update_with_defaults(next_time, \"next_time\", kwds)\n            # merge args expanded as kwargs and the original kwds\n            kwargs = _convert_args_kwargs(\n                func, _is_method=core.func_is_method, args=args, kwds=kwds\n            )\n\n            _print = lambda x: None  # noqa: E731\n            if verbose:\n                _print = print\n            if ignore_cache or not _default_params[\"caching_enabled\"]:\n                return func(**kwargs)\n            key, entry = core.get_entry((), kwargs)\n            if overwrite_cache:\n                return _calc_entry(core, key, func, args, kwds)\n            if entry is None:\n                _print(\"No entry found. No current calc. Calling like a boss.\")\n                return _calc_entry(core, key, func, args, kwds)\n            _print(\"Entry found.\")\n            if _allow_none or entry.get(\"value\", None) is not None:\n                _print(\"Cached result found.\")\n                now = datetime.datetime.now()\n                if now - entry[\"time\"] <= _stale_after:\n                    _print(\"And it is fresh!\")\n                    return entry[\"value\"]\n                _print(\"But it is stale... :(\")\n                if entry[\"being_calculated\"]:\n                    if _next_time:\n                        _print(\"Returning stale.\")\n                        return entry[\"value\"]  # return stale val\n                    _print(\"Already calc. Waiting on change.\")\n                    try:\n                        return core.wait_on_entry_calc(key)\n                    except RecalculationNeeded:\n                        return _calc_entry(core, key, func, args, kwds)\n                if _next_time:\n                    _print(\"Async calc and return stale\")\n                    try:\n                        core.mark_entry_being_calculated(key)\n                        _get_executor().submit(\n                            _function_thread, core, key, func, args, kwds\n                        )\n                    finally:\n                        core.mark_entry_not_calculated(key)\n                    return entry[\"value\"]\n                _print(\"Calling decorated function and waiting\")\n                return _calc_entry(core, key, func, args, kwds)\n            if entry[\"being_calculated\"]:\n                _print(\"No value but being calculated. Waiting.\")\n                try:\n                    return core.wait_on_entry_calc(key)\n                except RecalculationNeeded:\n                    return _calc_entry(core, key, func, args, kwds)\n            _print(\"No entry found. No current calc. Calling like a boss.\")\n            return _calc_entry(core, key, func, args, kwds)\n\n        def _clear_cache():\n            \"\"\"Clear the cache.\"\"\"\n            core.clear_cache()\n\n        def _clear_being_calculated():\n            \"\"\"Mark all entries in this cache as not being calculated.\"\"\"\n            core.clear_being_calculated()\n\n        def _cache_dpath():\n            \"\"\"Return the path to the cache dir, if exists; None if not.\"\"\"\n            return getattr(core, \"cache_dir\", None)\n\n        def _precache_value(*args, value_to_cache, **kwds):  # noqa: D417\n            \"\"\"Add an initial value to the cache.\n\n            Arguments:\n            ---------\n            value_to_cache : any\n                entry to be written into the cache\n\n            \"\"\"\n            # merge args expanded as kwargs and the original kwds\n            kwargs = _convert_args_kwargs(\n                func, _is_method=core.func_is_method, args=args, kwds=kwds\n            )\n            return core.precache_value((), kwargs, value_to_cache)\n\n        func_wrapper.clear_cache = _clear_cache\n        func_wrapper.clear_being_calculated = _clear_being_calculated\n        func_wrapper.cache_dpath = _cache_dpath\n        func_wrapper.precache_value = _precache_value\n        return func_wrapper\n\n    return _cachier_decorator\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/__init__.py",
      "content": ""
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/base.py",
      "content": "\"\"\"Defines the interface of a cachier caching core.\"\"\"\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\nimport abc  # for the _BaseCore abstract base class\nimport inspect\nimport threading\nfrom typing import Callable\n\nfrom .._types import HashFunc\nfrom ..config import _update_with_defaults\n\n\nclass RecalculationNeeded(Exception):\n    \"\"\"Exception raised when a recalculation is needed.\"\"\"\n\n    pass\n\n\ndef _get_func_str(func: Callable) -> str:\n    return f\".{func.__module__}.{func.__name__}\"\n\n\nclass _BaseCore:\n    __metaclass__ = abc.ABCMeta\n\n    def __init__(self, hash_func: HashFunc, wait_for_calc_timeout: int):\n        self.hash_func = _update_with_defaults(hash_func, \"hash_func\")\n        self.wait_for_calc_timeout = wait_for_calc_timeout\n        self.lock = threading.RLock()\n\n    def set_func(self, func):\n        \"\"\"Set the function this core will use.\n\n        This has to be set before any method is called. Also determine if the\n        function is an object method.\n\n        \"\"\"\n        # unwrap if the function is functools.partial\n        if hasattr(func, \"func\"):\n            func = func.func\n        func_params = list(inspect.signature(func).parameters)\n        self.func_is_method = func_params and func_params[0] == \"self\"\n        self.func = func\n\n    def get_key(self, args, kwds):\n        \"\"\"Return a unique key based on the arguments provided.\"\"\"\n        return self.hash_func(args, kwds)\n\n    def get_entry(self, args, kwds):\n        \"\"\"Get entry based on given arguments.\n\n        Return the result mapped to the given arguments in this core's cache,\n        if such a mapping exists.\n\n        \"\"\"\n        key = self.get_key(args, kwds)\n        return self.get_entry_by_key(key)\n\n    def precache_value(self, args, kwds, value_to_cache):\n        \"\"\"Write a precomputed value into the cache.\"\"\"\n        key = self.get_key(args, kwds)\n        self.set_entry(key, value_to_cache)\n        return value_to_cache\n\n    def check_calc_timeout(self, time_spent):\n        \"\"\"Raise an exception if a recalculation is needed.\"\"\"\n        calc_timeout = _update_with_defaults(\n            self.wait_for_calc_timeout, \"wait_for_calc_timeout\"\n        )\n        if calc_timeout > 0 and (time_spent >= calc_timeout):\n            raise RecalculationNeeded()\n\n    @abc.abstractmethod\n    def get_entry_by_key(self, key):\n        \"\"\"Get entry based on given key.\n\n        Return the result mapped to the given key in this core's cache, if such\n        a mapping exists.\n\n        \"\"\"\n\n    @abc.abstractmethod\n    def set_entry(self, key, func_res):\n        \"\"\"Map the given result to the given key in this core's cache.\"\"\"\n\n    @abc.abstractmethod\n    def mark_entry_being_calculated(self, key):\n        \"\"\"Mark the entry mapped by the given key as being calculated.\"\"\"\n\n    @abc.abstractmethod\n    def mark_entry_not_calculated(self, key):\n        \"\"\"Mark the entry mapped by the given key as not being calculated.\"\"\"\n\n    @abc.abstractmethod\n    def wait_on_entry_calc(self, key):\n        \"\"\"Wait on the entry with keys being calculated and returns result.\"\"\"\n\n    @abc.abstractmethod\n    def clear_cache(self):\n        \"\"\"Clear the cache of this core.\"\"\"\n\n    @abc.abstractmethod\n    def clear_being_calculated(self):\n        \"\"\"Mark all entries in this cache as not being calculated.\"\"\"\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/pickle.py",
      "content": "\"\"\"A pickle-based caching core for cachier.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\nimport os\nimport pickle  # for local caching\nfrom contextlib import suppress\nfrom datetime import datetime\n\nimport portalocker  # to lock on pickle cache IO\nfrom watchdog.events import PatternMatchingEventHandler\nfrom watchdog.observers import Observer\n\nfrom .._types import HashFunc\nfrom ..config import _update_with_defaults\n\n# Alternative:  https://github.com/WoLpH/portalocker\nfrom .base import _BaseCore\n\n\nclass _PickleCore(_BaseCore):\n    \"\"\"The pickle core class for cachier.\"\"\"\n\n    class CacheChangeHandler(PatternMatchingEventHandler):\n        \"\"\"Handles cache-file modification events.\"\"\"\n\n        def __init__(self, filename, core, key):\n            PatternMatchingEventHandler.__init__(\n                self,\n                patterns=[\"*\" + filename],\n                ignore_patterns=None,\n                ignore_directories=True,\n                case_sensitive=False,\n            )\n            self.core = core\n            self.key = key\n            self.observer = None\n            self.value = None\n\n        def inject_observer(self, observer):\n            \"\"\"Inject the observer running this handler.\"\"\"\n            self.observer = observer\n\n        def _check_calculation(self):\n            # print('checking calc')\n            entry = self.core.get_entry_by_key(self.key, True)[1]\n            # print(self.key)\n            # print(entry)\n            try:\n                if not entry[\"being_calculated\"]:\n                    # print('stopping observer!')\n                    self.value = entry[\"value\"]\n                    self.observer.stop()\n                # else:\n                # print('NOT stopping observer... :(')\n            except TypeError:\n                self.value = None\n                self.observer.stop()\n\n        def on_created(self, event):\n            \"\"\"A Watchdog Event Handler method.\"\"\"  # noqa: D401\n            self._check_calculation()  # pragma: no cover\n\n        def on_modified(self, event):\n            \"\"\"A Watchdog Event Handler method.\"\"\"  # noqa: D401\n            self._check_calculation()\n\n    def __init__(\n        self,\n        hash_func: HashFunc,\n        pickle_reload: bool,\n        cache_dir: str,\n        separate_files: bool,\n        wait_for_calc_timeout: int,\n    ):\n        super().__init__(hash_func, wait_for_calc_timeout)\n        self.cache = None\n        self.reload = _update_with_defaults(pickle_reload, \"pickle_reload\")\n        self.cache_dir = os.path.expanduser(\n            _update_with_defaults(cache_dir, \"cache_dir\")\n        )\n        self.separate_files = _update_with_defaults(\n            separate_files, \"separate_files\"\n        )\n\n    @property\n    def cache_fname(self) -> str:\n        fname = f\".{self.func.__module__}.{self.func.__qualname__}\"\n        return fname.replace(\"<\", \"_\").replace(\">\", \"_\")\n\n    @property\n    def cache_fpath(self) -> str:\n        os.makedirs(self.cache_dir, exist_ok=True)\n        return os.path.abspath(\n            os.path.join(os.path.realpath(self.cache_dir), self.cache_fname)\n        )\n\n    def _reload_cache(self):\n        with self.lock:\n            try:\n                with portalocker.Lock(\n                    self.cache_fpath, mode=\"rb\"\n                ) as cache_file:\n                    self.cache = pickle.load(cache_file)  # noqa: S301\n            except (FileNotFoundError, EOFError):\n                self.cache = {}\n\n    def _get_cache(self):\n        with self.lock:\n            if not self.cache:\n                self._reload_cache()\n            return self.cache\n\n    def _get_cache_by_key(self, key=None, hash_str=None):\n        fpath = self.cache_fpath\n        fpath += f\"_{hash_str or key}\"\n        try:\n            with portalocker.Lock(fpath, mode=\"rb\") as cache_file:\n                return pickle.load(cache_file)  # noqa: S301\n        except (FileNotFoundError, EOFError):\n            return None\n\n    def _clear_all_cache_files(self):\n        path, name = os.path.split(self.cache_fpath)\n        for subpath in os.listdir(path):\n            if subpath.startswith(f\"{name}_\"):\n                os.remove(os.path.join(path, subpath))\n\n    def _clear_being_calculated_all_cache_files(self):\n        path, name = os.path.split(self.cache_fpath)\n        for subpath in os.listdir(path):\n            if subpath.startswith(name):\n                entry = self._get_cache_by_key(hash_str=subpath.split(\"_\")[-1])\n                if entry is not None:\n                    entry[\"being_calculated\"] = False\n                    self._save_cache(entry, hash_str=subpath.split(\"_\")[-1])\n\n    def _save_cache(self, cache, key=None, hash_str=None):\n        fpath = self.cache_fpath\n        if key is not None:\n            fpath += f\"_{key}\"\n        elif hash_str is not None:\n            fpath += f\"_{hash_str}\"\n        with self.lock:\n            self.cache = cache\n            with portalocker.Lock(fpath, mode=\"wb\") as cache_file:\n                pickle.dump(cache, cache_file, protocol=4)\n            if key is None:\n                self._reload_cache()\n\n    def get_entry_by_key(self, key, reload=False):\n        with self.lock:\n            if self.separate_files:\n                return key, self._get_cache_by_key(key)\n            if self.reload or reload:\n                self._reload_cache()\n            return key, self._get_cache().get(key, None)\n\n    def set_entry(self, key, func_res):\n        key_data = {\n            \"value\": func_res,\n            \"time\": datetime.now(),\n            \"stale\": False,\n            \"being_calculated\": False,\n        }\n        if self.separate_files:\n            self._save_cache(key_data, key)\n            return  # pragma: no cover\n\n        with self.lock:\n            cache = self._get_cache()\n            cache[key] = key_data\n            self._save_cache(cache)\n\n    def mark_entry_being_calculated_separate_files(self, key):\n        self._save_cache(\n            {\n                \"value\": None,\n                \"time\": datetime.now(),\n                \"stale\": False,\n                \"being_calculated\": True,\n            },\n            key=key,\n        )\n\n    def mark_entry_not_calculated_separate_files(self, key):\n        _, entry = self.get_entry_by_key(key)\n        entry[\"being_calculated\"] = False\n        self._save_cache(entry, key=key)\n\n    def mark_entry_being_calculated(self, key):\n        if self.separate_files:\n            self.mark_entry_being_calculated_separate_files(key)\n            return  # pragma: no cover\n\n        with self.lock:\n            cache = self._get_cache()\n            try:\n                cache[key][\"being_calculated\"] = True\n            except KeyError:\n                cache[key] = {\n                    \"value\": None,\n                    \"time\": datetime.now(),\n                    \"stale\": False,\n                    \"being_calculated\": True,\n                }\n            self._save_cache(cache)\n\n    def mark_entry_not_calculated(self, key):\n        if self.separate_files:\n            self.mark_entry_not_calculated_separate_files(key)\n        with self.lock:\n            cache = self._get_cache()\n            # that's ok, we don't need an entry in that case\n            with suppress(KeyError):\n                cache[key][\"being_calculated\"] = False\n                self._save_cache(cache)\n\n    def wait_on_entry_calc(self, key):\n        if self.separate_files:\n            entry = self._get_cache_by_key(key)\n            filename = f\"{self.cache_fname}_{key}\"\n        else:\n            with self.lock:\n                self._reload_cache()\n                entry = self._get_cache()[key]\n            filename = self.cache_fname\n        if not entry[\"being_calculated\"]:\n            return entry[\"value\"]\n        event_handler = _PickleCore.CacheChangeHandler(\n            filename=filename, core=self, key=key\n        )\n        observer = Observer()\n        event_handler.inject_observer(observer)\n        observer.schedule(event_handler, path=self.cache_dir, recursive=True)\n        observer.start()\n        time_spent = 0\n        while observer.is_alive():\n            observer.join(timeout=1.0)\n            time_spent += 1\n            self.check_calc_timeout(time_spent)\n        return event_handler.value\n\n    def clear_cache(self):\n        if self.separate_files:\n            pass\n        else:\n            self._save_cache({})\n\n    def clear_being_calculated(self):\n        if self.separate_files:\n            self._clear_being_calculated_all_cache_files()\n            return  # pragma: no cover\n\n        with self.lock:\n            cache = self._get_cache()\n            for key in cache:\n                cache[key][\"being_calculated\"] = False\n            self._save_cache(cache)\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/memory.py",
      "content": "\"\"\"A memory-based caching core for cachier.\"\"\"\n\nimport threading\nfrom datetime import datetime\n\nfrom .._types import HashFunc\nfrom .base import _BaseCore, _get_func_str\n\n\nclass _MemoryCore(_BaseCore):\n    \"\"\"The memory core class for cachier.\"\"\"\n\n    def __init__(self, hash_func: HashFunc, wait_for_calc_timeout: int):\n        super().__init__(hash_func, wait_for_calc_timeout)\n        self.cache = {}\n\n    def _hash_func_key(self, key):\n        return f\"{_get_func_str(self.func)}:{key}\"\n\n    def get_entry_by_key(self, key, reload=False):\n        with self.lock:\n            return key, self.cache.get(self._hash_func_key(key), None)\n\n    def set_entry(self, key, func_res):\n        with self.lock:\n            try:\n                # we need to retain the existing condition so that\n                # mark_entry_not_calculated can notify all possibly-waiting\n                # threads about it\n                cond = self.cache[self._hash_func_key(key)][\"condition\"]\n            except KeyError:  # pragma: no cover\n                cond = None\n            self.cache[self._hash_func_key(key)] = {\n                \"value\": func_res,\n                \"time\": datetime.now(),\n                \"stale\": False,\n                \"being_calculated\": False,\n                \"condition\": cond,\n            }\n\n    def mark_entry_being_calculated(self, key):\n        with self.lock:\n            condition = threading.Condition()\n            # condition.acquire()\n            try:\n                self.cache[self._hash_func_key(key)][\"being_calculated\"] = True\n                self.cache[self._hash_func_key(key)][\"condition\"] = condition\n            except KeyError:\n                self.cache[self._hash_func_key(key)] = {\n                    \"value\": None,\n                    \"time\": datetime.now(),\n                    \"stale\": False,\n                    \"being_calculated\": True,\n                    \"condition\": condition,\n                }\n\n    def mark_entry_not_calculated(self, key):\n        with self.lock:\n            try:\n                entry = self.cache[self._hash_func_key(key)]\n            except KeyError:  # pragma: no cover\n                return  # that's ok, we don't need an entry in that case\n            entry[\"being_calculated\"] = False\n            cond = entry[\"condition\"]\n            if cond:\n                cond.acquire()\n                cond.notify_all()\n                cond.release()\n                entry[\"condition\"] = None\n\n    def wait_on_entry_calc(self, key):\n        with self.lock:  # pragma: no cover\n            entry = self.cache[self._hash_func_key(key)]\n            if not entry[\"being_calculated\"]:\n                return entry[\"value\"]\n        entry[\"condition\"].acquire()\n        entry[\"condition\"].wait()\n        entry[\"condition\"].release()\n        return self.cache[self._hash_func_key(key)][\"value\"]\n\n    def clear_cache(self):\n        with self.lock:\n            self.cache.clear()\n\n    def clear_being_calculated(self):\n        with self.lock:\n            for entry in self.cache.values():\n                entry[\"being_calculated\"] = False\n                entry[\"condition\"] = None\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/mongo.py",
      "content": "\"\"\"A MongoDB-based caching core for cachier.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\nimport pickle  # for serialization of python objects\nimport sys  # to make sure that pymongo was imported\nimport time  # to sleep when waiting on Mongo cache\\\nimport warnings  # to warn if pymongo is missing\nfrom contextlib import suppress\nfrom datetime import datetime\n\nfrom .._types import HashFunc, Mongetter\n\nwith suppress(ImportError):\n    from bson.binary import Binary  # to save binary data to mongodb\n    from pymongo import ASCENDING, IndexModel\n    from pymongo.errors import OperationFailure\n\nfrom .base import RecalculationNeeded, _BaseCore, _get_func_str\n\nMONGO_SLEEP_DURATION_IN_SEC = 1\n\n\nclass MissingMongetter(ValueError):\n    \"\"\"Thrown when the mongetter keyword argument is missing.\"\"\"\n\n\nclass _MongoCore(_BaseCore):\n    _INDEX_NAME = \"func_1_key_1\"\n\n    def __init__(\n        self,\n        hash_func: HashFunc,\n        mongetter: Mongetter,\n        wait_for_calc_timeout: int,\n    ):\n        if \"pymongo\" not in sys.modules:\n            warnings.warn(\n                \"`pymongo` was not found. MongoDB cores will not function.\",\n                ImportWarning,\n                stacklevel=2,\n            )  # pragma: no cover\n\n        super().__init__(hash_func, wait_for_calc_timeout)\n        if mongetter is None:\n            raise MissingMongetter(\n                \"must specify ``mongetter`` when using the mongo core\"\n            )\n        self.mongetter = mongetter\n        self.mongo_collection = self.mongetter()\n        index_inf = self.mongo_collection.index_information()\n        if _MongoCore._INDEX_NAME not in index_inf:\n            func1key1 = IndexModel(\n                keys=[(\"func\", ASCENDING), (\"key\", ASCENDING)],\n                name=_MongoCore._INDEX_NAME,\n            )\n            self.mongo_collection.create_indexes([func1key1])\n\n    @property\n    def _func_str(self) -> str:\n        return _get_func_str(self.func)\n\n    def get_entry_by_key(self, key):\n        res = self.mongo_collection.find_one(\n            {\"func\": self._func_str, \"key\": key}\n        )\n        if not res:\n            return key, None\n        try:\n            entry = {\n                \"value\": pickle.loads(res[\"value\"]),  # noqa: S301\n                \"time\": res.get(\"time\", None),\n                \"stale\": res.get(\"stale\", False),\n                \"being_calculated\": res.get(\"being_calculated\", False),\n            }\n        except KeyError:\n            entry = {\n                \"value\": None,\n                \"time\": res.get(\"time\", None),\n                \"stale\": res.get(\"stale\", False),\n                \"being_calculated\": res.get(\"being_calculated\", False),\n            }\n        return key, entry\n\n    def set_entry(self, key, func_res):\n        thebytes = pickle.dumps(func_res)\n        self.mongo_collection.update_one(\n            filter={\"func\": self._func_str, \"key\": key},\n            update={\n                \"$set\": {\n                    \"func\": self._func_str,\n                    \"key\": key,\n                    \"value\": Binary(thebytes),\n                    \"time\": datetime.now(),\n                    \"stale\": False,\n                    \"being_calculated\": False,\n                }\n            },\n            upsert=True,\n        )\n\n    def mark_entry_being_calculated(self, key):\n        self.mongo_collection.update_one(\n            filter={\"func\": self._func_str, \"key\": key},\n            update={\"$set\": {\"being_calculated\": True}},\n            upsert=True,\n        )\n\n    def mark_entry_not_calculated(self, key):\n        with suppress(OperationFailure):  # don't care in this case\n            self.mongo_collection.update_one(\n                filter={\n                    \"func\": self._func_str,\n                    \"key\": key,\n                },\n                update={\"$set\": {\"being_calculated\": False}},\n                upsert=False,  # should not insert in this case\n            )\n\n    def wait_on_entry_calc(self, key):\n        time_spent = 0\n        while True:\n            time.sleep(MONGO_SLEEP_DURATION_IN_SEC)\n            time_spent += MONGO_SLEEP_DURATION_IN_SEC\n            key, entry = self.get_entry_by_key(key)\n            if entry is None:\n                raise RecalculationNeeded()\n            if not entry[\"being_calculated\"]:\n                return entry[\"value\"]\n            self.check_calc_timeout(time_spent)\n\n    def clear_cache(self):\n        self.mongo_collection.delete_many(filter={\"func\": self._func_str})\n\n    def clear_being_calculated(self):\n        self.mongo_collection.update_many(\n            filter={\n                \"func\": self._func_str,\n                \"being_calculated\": True,\n            },\n            update={\"$set\": {\"being_calculated\": False}},\n        )\n"
    },
    {
      "path": "python-cachier_cachier/tests/__init__.py",
      "content": ""
    },
    {
      "path": "python-cachier_cachier/tests/standalone_script.py",
      "content": "import time\n\nimport cachier\n\n\n@cachier.cachier()\ndef _takes_3_seconds(label, value):\n    time.sleep(3)\n    return f\"{label} {value}\"\n\n\nprint(_takes_3_seconds(\"two\", 2))\n"
    },
    {
      "path": "python-cachier_cachier/tests/speed_eval.py",
      "content": "from random import random\nfrom time import time\n\nfrom cachier import cachier\n\n\n@cachier(next_time=True)\ndef _test_int_pickling(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    return int_1 + int_2\n\n\ndef _test_int_pickling_compare(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    return int_1 + int_2\n\n\ndef test_pickle_speed():\n    \"\"\"Test speeds.\"\"\"\n    print(\"Comparing speeds of decorated vs non-decorated functions...\")\n    num_of_vals = 1000\n    times = []\n    for i in range(1, num_of_vals):\n        tic = time()\n        _test_int_pickling_compare(i, i + 1)\n        toc = time()\n        times.append(toc - tic)\n    print(f\"  - Non-decorated average = {sum(times) / num_of_vals}\")\n\n    _test_int_pickling.clear_cache()\n    times = []\n    for i in range(1, num_of_vals):\n        tic = time()\n        _test_int_pickling(i, i + 1)\n        toc = time()\n        times.append(toc - tic)\n    print(f\"  - Decorated average = {sum(times) / num_of_vals}\")\n\n\n@cachier()\ndef _test_single_file_speed(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    # something that takes some memory\n    return [random() for _ in range(1000000)]\n\n\n@cachier(separate_files=True)\ndef _test_separate_files_speed(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    # something that takes some memory\n    return [random() for _ in range(1000000)]\n\n\ndef test_separate_files_vs_single_file():\n    _test_separate_files_speed.clear_cache()\n    _test_single_file_speed.clear_cache()\n    start_time = time()\n    for _ in range(3):\n        for j in range(10):\n            _test_separate_files_speed(j, 2)\n    print(f\"separate files time: {time() - start_time}\")\n    start_time = time()\n    for _ in range(3):\n        for j in range(10):\n            _test_single_file_speed(j, 2)\n    print(f\"single file time: {time() - start_time}\")\n\n\nif __name__ == \"__main__\":\n    test_pickle_speed()\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_memory_core.py",
      "content": "\"\"\"Test for the in-memory implementation of the  Cachier python package.\"\"\"\n\nimport hashlib\nimport queue\nimport threading\nfrom datetime import timedelta\nfrom random import random\nfrom time import sleep, time\n\nimport pandas as pd\nimport pytest\nfrom cachier import cachier\n\n\n@cachier(backend=\"memory\", next_time=False)\ndef _takes_2_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)\n    return f\"arg_1:{arg_1}, arg_2:{arg_2}\"\n\n\n@pytest.mark.memory\ndef test_memory_core():\n    \"\"\"Basic memory core functionality.\"\"\"\n    _takes_2_seconds.clear_cache()\n    _takes_2_seconds(\"a\", \"b\")\n    start = time()\n    _takes_2_seconds(\"a\", \"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds.clear_cache()\n\n\n@pytest.mark.memory\ndef test_memory_core_keywords():\n    \"\"\"Basic memory core functionality with keyword arguments.\"\"\"\n    _takes_2_seconds.clear_cache()\n    _takes_2_seconds(\"a\", arg_2=\"b\")\n    start = time()\n    _takes_2_seconds(\"a\", arg_2=\"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds.clear_cache()\n\n\nSECONDS_IN_DELTA = 3\nDELTA = timedelta(seconds=SECONDS_IN_DELTA)\n\n\n@cachier(backend=\"memory\", stale_after=DELTA, next_time=False)\ndef _stale_after_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.memory\ndef test_stale_after():\n    \"\"\"Testing the stale_after functionality.\"\"\"\n    _stale_after_seconds.clear_cache()\n    val1 = _stale_after_seconds(1, 2)\n    val2 = _stale_after_seconds(1, 2)\n    val3 = _stale_after_seconds(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(3)\n    val4 = _stale_after_seconds(1, 2)\n    assert val4 != val1\n    _stale_after_seconds.clear_cache()\n\n\n@cachier(backend=\"memory\", stale_after=DELTA, next_time=True)\ndef _stale_after_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.memory\ndef test_stale_after_next_time():\n    \"\"\"Testing the stale_after with next_time functionality.\"\"\"\n    _stale_after_next_time.clear_cache()\n    val1 = _stale_after_next_time(1, 2)\n    val2 = _stale_after_next_time(1, 2)\n    val3 = _stale_after_next_time(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(SECONDS_IN_DELTA + 1)\n    val4 = _stale_after_next_time(1, 2)\n    assert val4 == val1\n    sleep(0.5)\n    val5 = _stale_after_next_time(1, 2)\n    assert val5 != val1\n    _stale_after_next_time.clear_cache()\n\n\n@cachier(backend=\"memory\")\ndef _random_num():\n    return random()\n\n\n@cachier(backend=\"memory\")\ndef _random_num_with_arg(a):\n    # print(a)\n    return random()\n\n\n@pytest.mark.memory\ndef test_overwrite_cache():\n    \"\"\"Tests that the overwrite feature works correctly.\"\"\"\n    _random_num.clear_cache()\n    int1 = _random_num()\n    int2 = _random_num()\n    assert int2 == int1\n    int3 = _random_num(cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num()\n    assert int4 == int3\n    _random_num.clear_cache()\n\n    _random_num_with_arg.clear_cache()\n    int1 = _random_num_with_arg(\"a\")\n    int2 = _random_num_with_arg(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg(\"a\", cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg(\"a\")\n    assert int4 == int3\n    _random_num_with_arg.clear_cache()\n\n\n@pytest.mark.memory\ndef test_ignore_cache():\n    \"\"\"Tests that the ignore_cache feature works correctly.\"\"\"\n    _random_num.clear_cache()\n    int1 = _random_num()\n    int2 = _random_num()\n    assert int2 == int1\n    int3 = _random_num(cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num()\n    assert int4 != int3\n    assert int4 == int1\n    _random_num.clear_cache()\n\n    _random_num_with_arg.clear_cache()\n    int1 = _random_num_with_arg(\"a\")\n    int2 = _random_num_with_arg(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg(\"a\", cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg(\"a\")\n    assert int4 != int3\n    assert int4 == int1\n    _random_num_with_arg.clear_cache()\n\n\n@cachier(backend=\"memory\")\ndef _takes_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)  # this has to be enough time for check_calculation to run twice\n    return random() + arg_1 + arg_2\n\n\ndef _calls_takes_time(res_queue):\n    res = _takes_time(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.memory\ndef test_memory_being_calculated():\n    \"\"\"Testing memory core handling of being calculated scenarios.\"\"\"\n    _takes_time.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=3)\n    thread2.join(timeout=3)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\n@cachier(backend=\"memory\", stale_after=timedelta(seconds=1), next_time=True)\ndef _being_calc_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\ndef _calls_being_calc_next_time(res_queue):\n    res = _being_calc_next_time(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.memory\ndef test_being_calc_next_time():\n    \"\"\"Testing memory core handling of being calculated scenarios.\"\"\"\n    _takes_time.clear_cache()\n    _being_calc_next_time(0.13, 0.02)\n    sleep(1.1)\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=3)\n    thread2.join(timeout=3)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\n@cachier(backend=\"memory\")\ndef _bad_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n@cachier(backend=\"memory\")\ndef _delete_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n@pytest.mark.memory\ndef test_clear_being_calculated():\n    \"\"\"Test memory core clear `being calculated` functionality.\"\"\"\n    _takes_time.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread1.start()\n    _takes_time.clear_being_calculated()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=3)\n    thread2.join(timeout=3)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 != res2\n\n\n@pytest.mark.memory\ndef test_clear_being_calculated_with_empty_cache():\n    \"\"\"Test memory core clear `being calculated` functionality.\"\"\"\n    _takes_time.clear_cache()\n    _takes_time.clear_being_calculated()\n\n\n@cachier(backend=\"memory\", stale_after=timedelta(seconds=1), next_time=True)\ndef _error_throwing_func(arg1):\n    if not hasattr(_error_throwing_func, \"count\"):\n        _error_throwing_func.count = 0\n    _error_throwing_func.count += 1\n    if _error_throwing_func.count > 1:\n        raise ValueError(\"Tiny Rick!\")\n    return 7\n\n\n@pytest.mark.memory\ndef test_error_throwing_func():\n    # with\n    res1 = _error_throwing_func(4)\n    sleep(1.5)\n    res2 = _error_throwing_func(4)\n    assert res1 == res2\n\n\n@pytest.mark.memory\ndef test_callable_hash_param():\n    def _hash_func(args, kwargs):\n        def _hash(obj):\n            if isinstance(obj, pd.core.frame.DataFrame):\n                return hashlib.sha256(\n                    pd.util.hash_pandas_object(obj).values.tobytes()\n                ).hexdigest()\n            return obj\n\n        k_args = tuple(map(_hash, args))\n        k_kwargs = tuple(\n            sorted({k: _hash(v) for k, v in kwargs.items()}.items())\n        )\n        return k_args + k_kwargs\n\n    @cachier(backend=\"memory\", hash_func=_hash_func)\n    def _params_with_dataframe(*args, **kwargs):\n        \"\"\"Some function.\"\"\"\n        return random()\n\n    _params_with_dataframe.clear_cache()\n\n    df_a = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    df_b = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    value_a = _params_with_dataframe(df_a, 1)\n    value_b = _params_with_dataframe(df_b, 1)\n\n    assert value_a == value_b  # same content --> same key\n\n    value_a = _params_with_dataframe(1, df=df_a)\n    value_b = _params_with_dataframe(1, df=df_b)\n\n    assert value_a == value_b  # same content --> same key\n\n\nif __name__ == \"__main__\":\n    test_memory_being_calculated()\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_core_lookup.py",
      "content": "\"\"\"Testing a few basic cachier interfaces.\"\"\"\n\nimport pytest\nfrom cachier import cachier, get_default_params\nfrom cachier.cores.mongo import MissingMongetter\n\n\ndef test_get_default_params():\n    params = get_default_params()\n    assert tuple(sorted(params)) == (\n        \"allow_none\",\n        \"backend\",\n        \"cache_dir\",\n        \"caching_enabled\",\n        \"hash_func\",\n        \"mongetter\",\n        \"next_time\",\n        \"pickle_reload\",\n        \"separate_files\",\n        \"stale_after\",\n        \"wait_for_calc_timeout\",\n    )\n\n\ndef test_bad_name(name=\"nope\"):\n    # Test that the appropriate exception is thrown\n    # when an invalid backend is given.\n    with pytest.raises(ValueError, match=f\"specified an invalid core: {name}\"):\n\n        @cachier(backend=name)\n        def dummy_func():\n            pass\n\n\ndef test_missing_mongetter():\n    # Test that the appropriate exception is thrown\n    # when forgetting to specify the mongetter.\n    with pytest.raises(MissingMongetter):\n\n        @cachier(backend=\"mongo\", mongetter=None)\n        def dummy_func():\n            pass\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_general.py",
      "content": "\"\"\"Non-core-specific tests for cachier.\"\"\"\n\nimport functools\nimport os\nimport queue\nimport subprocess  # nosec: B404\nimport threading\nfrom contextlib import suppress\nfrom random import random\nfrom time import sleep, time\n\nimport cachier\nimport pytest\nfrom cachier.core import (\n    DEFAULT_MAX_WORKERS,\n    MAX_WORKERS_ENVAR_NAME,\n    _get_executor,\n    _max_workers,\n    _set_max_workers,\n)\n\nfrom tests.test_mongo_core import (\n    MONGO_DELTA_LONG,\n    _test_mongetter,\n)\n\n\ndef test_information():\n    print(\"\\ncachier version: \", end=\"\")\n    print(cachier.__version__)\n\n\ndef test_max_workers():\n    \"\"\"Just call this function for coverage.\"\"\"\n    with suppress(KeyError):\n        del os.environ[MAX_WORKERS_ENVAR_NAME]\n    assert _max_workers() == DEFAULT_MAX_WORKERS\n\n\ndef test_get_executor():\n    \"\"\"Just call this function for coverage.\"\"\"\n    _get_executor()\n    _get_executor(False)\n    _get_executor(True)\n\n\ndef test_set_max_workers():\n    \"\"\"Just call this function for coverage.\"\"\"\n    _set_max_workers(9)\n\n\nparametrize_keys = \"mongetter,stale_after,separate_files\"\nparametrize_values = [\n    pytest.param(\n        _test_mongetter, MONGO_DELTA_LONG, False, marks=pytest.mark.mongo\n    ),\n    (None, None, False),\n    (None, None, True),\n]\n\n\n@pytest.mark.parametrize(parametrize_keys, parametrize_values)\ndef test_wait_for_calc_timeout_ok(mongetter, stale_after, separate_files):\n    @cachier.cachier(\n        mongetter=mongetter,\n        stale_after=stale_after,\n        separate_files=separate_files,\n        next_time=False,\n        wait_for_calc_timeout=2,\n    )\n    def _wait_for_calc_timeout_fast(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        sleep(1)\n        return random() + arg_1 + arg_2\n\n    def _calls_wait_for_calc_timeout_fast(res_queue):\n        res = _wait_for_calc_timeout_fast(1, 2)\n        res_queue.put(res)\n\n    \"\"\" Testing calls that avoid timeouts store the values in cache. \"\"\"\n    _wait_for_calc_timeout_fast.clear_cache()\n    val1 = _wait_for_calc_timeout_fast(1, 2)\n    val2 = _wait_for_calc_timeout_fast(1, 2)\n    assert val1 == val2\n\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_fast,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_fast,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n\n    thread1.start()\n    thread2.start()\n    sleep(2)\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2  # Timeout did not kick in, a single call was done\n\n\n@pytest.mark.parametrize(parametrize_keys, parametrize_values)\ndef test_wait_for_calc_timeout_slow(mongetter, stale_after, separate_files):\n    @cachier.cachier(\n        mongetter=mongetter,\n        stale_after=stale_after,\n        separate_files=separate_files,\n        next_time=False,\n        wait_for_calc_timeout=2,\n    )\n    def _wait_for_calc_timeout_slow(arg_1, arg_2):\n        sleep(3)\n        return random() + arg_1 + arg_2\n\n    def _calls_wait_for_calc_timeout_slow(res_queue):\n        res = _wait_for_calc_timeout_slow(1, 2)\n        res_queue.put(res)\n\n    \"\"\"Testing for calls timing out to be performed twice when needed.\"\"\"\n    _wait_for_calc_timeout_slow.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n\n    thread1.start()\n    thread2.start()\n    sleep(1)\n    res3 = _wait_for_calc_timeout_slow(1, 2)\n    sleep(4)\n    thread1.join(timeout=4)\n    thread2.join(timeout=4)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 != res2  # Timeout kicked in.  Two calls were done\n    res4 = _wait_for_calc_timeout_slow(1, 2)\n    # One of the cached values is returned\n    assert res1 == res4 or res2 == res4 or res3 == res4\n\n\n@pytest.mark.parametrize(\n    \"mongetter,backend\",\n    [\n        pytest.param(_test_mongetter, \"mongo\", marks=pytest.mark.mongo),\n        (None, \"memory\"),\n        (None, \"pickle\"),\n    ],\n)\ndef test_precache_value(mongetter, backend):\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def dummy_func(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return arg_1 + arg_2\n\n    assert dummy_func.precache_value(2, 2, value_to_cache=5) == 5\n    assert dummy_func(2, 2) == 5\n    dummy_func.clear_cache()\n    assert dummy_func(2, 2) == 4\n    assert dummy_func.precache_value(2, arg_2=2, value_to_cache=5) == 5\n    assert dummy_func(2, arg_2=2) == 5\n\n\n@pytest.mark.parametrize(\n    \"mongetter,backend\",\n    [\n        pytest.param(_test_mongetter, \"mongo\", marks=pytest.mark.mongo),\n        (None, \"memory\"),\n        (None, \"pickle\"),\n    ],\n)\ndef test_ignore_self_in_methods(mongetter, backend):\n    class DummyClass:\n        @cachier.cachier(backend=backend, mongetter=mongetter)\n        def takes_2_seconds(self, arg_1, arg_2):\n            \"\"\"Some function.\"\"\"\n            sleep(2)\n            return arg_1 + arg_2\n\n    test_object_1 = DummyClass()\n    test_object_2 = DummyClass()\n    test_object_1.takes_2_seconds.clear_cache()\n    test_object_2.takes_2_seconds.clear_cache()\n    assert test_object_1.takes_2_seconds(1, 2) == 3\n    start = time()\n    assert test_object_2.takes_2_seconds(1, 2) == 3\n    end = time()\n    assert end - start < 1\n\n\ndef test_hash_params_deprecation():\n    with pytest.deprecated_call(match=\"hash_params will be removed\"):\n\n        @cachier.cachier(hash_params=lambda a, k: \"key\")\n        def test():\n            return \"value\"\n\n    assert test() == \"value\"\n\n\ndef test_separate_processes():\n    test_args = (\"python\", \"tests/standalone_script.py\")\n    run_params = {\"args\": test_args, \"capture_output\": True, \"text\": True}\n    run_process = functools.partial(subprocess.run, **run_params)\n    result = run_process()\n    assert result.stdout.strip() == \"two 2\"\n    start = time()\n    result = run_process()\n    end = time()\n    assert result.stdout.strip() == \"two 2\"\n    assert end - start < 3\n\n\ndef test_global_disable():\n    @cachier.cachier()\n    def get_random():\n        return random()\n\n    get_random.clear_cache()\n    result_1 = get_random()\n    result_2 = get_random()\n    cachier.disable_caching()\n    result_3 = get_random()\n    cachier.enable_caching()\n    result_4 = get_random()\n    assert result_1 == result_2 == result_4\n    assert result_1 != result_3\n\n\ndef test_none_not_cached_by_default():\n    count = 0\n\n    @cachier.cachier()\n    def do_operation():\n        nonlocal count\n        count += 1\n        return None\n\n    do_operation.clear_cache()\n    assert count == 0\n    do_operation()\n    do_operation()\n    assert count == 2\n\n\ndef test_allow_caching_none():\n    count = 0\n\n    @cachier.cachier(allow_none=True)\n    def do_operation():\n        nonlocal count\n        count += 1\n        return None\n\n    do_operation.clear_cache()\n    assert count == 0\n    do_operation()\n    do_operation()\n    assert count == 1\n\n\ndef test_identical_inputs():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a: int, b: int = 2, c: int = 3):\n        nonlocal count\n        count += 1\n        return a + b + c\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func(1, 2, 3) == 6\n    assert dummy_func(1, 2, c=3) == 6\n    assert dummy_func(1, b=2, c=3) == 6\n    assert dummy_func(a=1, b=2, c=3) == 6\n    assert count == 1\n\n\ndef test_list_inputs():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a: list, b: list = [2]):  # noqa: B006\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func([1]) == [1, 2]\n    assert dummy_func([1], [2]) == [1, 2]\n    assert dummy_func([1], b=[2]) == [1, 2]\n    assert dummy_func(a=[1], b=[2]) == [1, 2]\n    assert count == 1\n\n\ndef test_order_independent_kwargs_handling():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a, b):\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func(a=1, b=2) == 3\n    assert dummy_func(a=1, b=2) == 3\n    assert dummy_func(b=2, a=1) == 3\n    assert count == 1\n\n\n@pytest.mark.parametrize(\"backend\", [\"memory\", \"pickle\"])\ndef test_diff_functions_same_args(tmpdir, backend: str):\n    count_p = count_m = 0\n\n    @cachier.cachier(cache_dir=tmpdir, backend=backend)\n    def fn_plus(a, b=2):\n        nonlocal count_p\n        count_p += 1\n        return a + b\n\n    @cachier.cachier(cache_dir=tmpdir, backend=backend)\n    def fn_minus(a, b=2):\n        nonlocal count_m\n        count_m += 1\n        return a - b\n\n    assert count_p == count_m == 0\n\n    for fn, expected in [(fn_plus, 3), (fn_minus, -1)]:\n        assert fn(1) == expected\n        assert fn(a=1, b=2) == expected\n    assert count_p == 1\n    assert count_m == 1\n\n\n@pytest.mark.parametrize(\"backend\", [\"memory\", \"pickle\"])\ndef test_runtime_handling(tmpdir, backend):\n    count_p = count_m = 0\n\n    def fn_plus(a, b=2):\n        nonlocal count_p\n        count_p += 1\n        return a + b\n\n    def fn_minus(a, b=2):\n        nonlocal count_m\n        count_m += 1\n        return a - b\n\n    cachier_ = cachier.cachier(cache_dir=tmpdir, backend=backend)\n    assert count_p == count_m == 0\n\n    for fn, expected in [(fn_plus, 3), (fn_minus, -1)]:\n        assert cachier_(fn)(1, 2) == expected\n        assert cachier_(fn)(a=1, b=2) == expected\n    assert count_p == 1\n    assert count_m == 1\n\n    for fn, expected in [(fn_plus, 5), (fn_minus, 1)]:\n        assert cachier_(fn)(3, 2) == expected\n        assert cachier_(fn)(a=3, b=2) == expected\n    assert count_p == 2\n    assert count_m == 2\n\n\ndef test_partial_handling(tmpdir):\n    count_p = count_m = 0\n\n    def fn_plus(a, b=2):\n        nonlocal count_p\n        count_p += 1\n        return a + b\n\n    def fn_minus(a, b=2):\n        nonlocal count_m\n        count_m += 1\n        return a - b\n\n    cachier_ = cachier.cachier(cache_dir=tmpdir)\n    assert count_p == count_m == 0\n\n    for fn, expected in [(fn_plus, 3), (fn_minus, -1)]:\n        dummy_ = functools.partial(fn, 1)\n        assert cachier_(dummy_)() == expected\n\n        dummy_ = functools.partial(fn, 1)\n        assert cachier_(dummy_)(2) == expected\n\n        dummy_ = functools.partial(fn, a=1)\n        assert cachier_(dummy_)() == expected\n\n        dummy_ = functools.partial(fn, b=2)\n        assert cachier_(dummy_)(1) == expected\n\n        dummy_ = functools.partial(fn, b=2)\n        assert cachier_(dummy_)(1, b=2) == expected\n\n        assert cachier_(fn)(1, 2) == expected\n        assert cachier_(fn)(a=1, b=2) == expected\n\n    assert count_p == 1\n    assert count_m == 1\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_defaults.py",
      "content": "import datetime\nimport os\nimport queue\nimport random\nimport threading\nimport time\n\nimport cachier\nimport pytest\n\nfrom tests.test_mongo_core import MONGO_DELTA, _test_mongetter\n\n_default_params = cachier.get_default_params().copy()\n\n\ndef setup_function():\n    cachier.set_default_params(**_default_params)\n\n\ndef teardown_function():\n    cachier.set_default_params(**_default_params)\n\n\ndef test_hash_func_default_param():\n    def slow_hash_func(args, kwds):\n        time.sleep(2)\n        return \"hash\"\n\n    def fast_hash_func(args, kwds):\n        return \"hash\"\n\n    cachier.set_default_params(hash_func=slow_hash_func)\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(hash_func=fast_hash_func)\n    def global_test_2():\n        return None\n\n    start = time.time()\n    global_test_1()\n    end = time.time()\n    assert end - start > 1\n    start = time.time()\n    global_test_2()\n    end = time.time()\n    assert end - start < 1\n\n\ndef test_backend_default_param():\n    cachier.set_default_params(backend=\"memory\")\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(backend=\"pickle\")\n    def global_test_2():\n        return None\n\n    assert global_test_1.cache_dpath() is None\n    assert global_test_2.cache_dpath() is not None\n\n\n@pytest.mark.mongo\ndef test_mongetter_default_param():\n    cachier.set_default_params(mongetter=_test_mongetter)\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(mongetter=False)\n    def global_test_2():\n        return None\n\n    assert global_test_1.cache_dpath() is None\n    assert global_test_2.cache_dpath() is not None\n\n\ndef test_cache_dir_default_param(tmpdir):\n    cachier.set_default_params(cache_dir=tmpdir / \"1\")\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(cache_dir=tmpdir / \"2\")\n    def global_test_2():\n        return None\n\n    assert global_test_1.cache_dpath() == str(tmpdir / \"1\")\n    assert global_test_2.cache_dpath() == str(tmpdir / \"2\")\n\n\ndef test_separate_files_default_param(tmpdir):\n    cachier.set_default_params(separate_files=True)\n\n    @cachier.cachier(cache_dir=tmpdir / \"1\")\n    def global_test_1(arg_1, arg_2):\n        return arg_1 + arg_2\n\n    @cachier.cachier(cache_dir=tmpdir / \"2\", separate_files=False)\n    def global_test_2(arg_1, arg_2):\n        return arg_1 + arg_2\n\n    global_test_1(1, 2)\n    global_test_1(3, 4)\n    global_test_2(1, 2)\n    global_test_2(3, 4)\n\n    assert len(os.listdir(global_test_1.cache_dpath())) == 2\n    assert len(os.listdir(global_test_2.cache_dpath())) == 1\n\n\ndef test_allow_none_default_param(tmpdir):\n    cachier.set_default_params(\n        allow_none=True,\n        separate_files=True,\n        verbose_cache=True,\n    )\n    allow_count = disallow_count = 0\n\n    @cachier.cachier(cache_dir=tmpdir)\n    def allow_none():\n        nonlocal allow_count\n        allow_count += 1\n        return None\n\n    @cachier.cachier(cache_dir=tmpdir, allow_none=False)\n    def disallow_none():\n        nonlocal disallow_count\n        disallow_count += 1\n        return None\n\n    assert allow_count == 0\n    allow_none()\n    allow_none()\n    assert allow_count == 1\n\n    assert disallow_count == 0\n    disallow_none()\n    disallow_none()\n    assert disallow_count == 2\n\n    disallow_none(cachier__allow_none=True)\n    disallow_none(cachier__allow_none=True)\n    assert disallow_count == 2\n\n\nPARAMETRIZE_TEST = (\n    \"backend,mongetter\",\n    [\n        pytest.param(\"pickle\", None, marks=pytest.mark.pickle),\n        pytest.param(\"mongo\", _test_mongetter, marks=pytest.mark.mongo),\n    ],\n)\n\n\n@pytest.mark.parametrize(*PARAMETRIZE_TEST)\ndef test_stale_after_applies_dynamically(backend, mongetter):\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def _stale_after_test(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random.random() + arg_1 + arg_2\n\n    cachier.set_default_params(stale_after=MONGO_DELTA)\n\n    _stale_after_test.clear_cache()\n    val1 = _stale_after_test(1, 2)\n    val2 = _stale_after_test(1, 2)\n    assert val1 == val2\n    time.sleep(3)\n    val3 = _stale_after_test(1, 2)\n    assert val3 != val1\n\n\n@pytest.mark.parametrize(*PARAMETRIZE_TEST)\ndef test_next_time_applies_dynamically(backend, mongetter):\n    NEXT_AFTER_DELTA = datetime.timedelta(seconds=3)\n\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def _stale_after_next_time(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random.random()\n\n    cachier.set_default_params(stale_after=NEXT_AFTER_DELTA, next_time=True)\n\n    _stale_after_next_time.clear_cache()\n    val1 = _stale_after_next_time(1, 2)\n    val2 = _stale_after_next_time(1, 2)\n    val3 = _stale_after_next_time(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    time.sleep(NEXT_AFTER_DELTA.seconds + 1)\n    val4 = _stale_after_next_time(1, 2)\n    assert val4 == val1\n    time.sleep(0.5)\n    val5 = _stale_after_next_time(1, 2)\n    assert val5 != val1\n    _stale_after_next_time.clear_cache()\n\n\n@pytest.mark.parametrize(*PARAMETRIZE_TEST)\ndef test_wait_for_calc_applies_dynamically(backend, mongetter):\n    \"\"\"Testing for calls timing out to be performed twice when needed.\"\"\"\n\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def _wait_for_calc_timeout_slow(arg_1, arg_2):\n        time.sleep(3)\n        return random.random() + arg_1 + arg_2\n\n    def _calls_wait_for_calc_timeout_slow(res_queue):\n        res = _wait_for_calc_timeout_slow(1, 2)\n        res_queue.put(res)\n\n    cachier.set_default_params(wait_for_calc_timeout=2)\n    _wait_for_calc_timeout_slow.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n\n    thread1.start()\n    thread2.start()\n    time.sleep(1)\n    res3 = _wait_for_calc_timeout_slow(1, 2)\n    time.sleep(5)\n    thread1.join(timeout=5)\n    thread2.join(timeout=5)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 != res2  # Timeout kicked in.  Two calls were done\n    res4 = _wait_for_calc_timeout_slow(1, 2)\n    # One of the cached values is returned\n    assert res1 == res4 or res2 == res4 or res3 == res4\n\n\ndef test_default_kwargs_handling():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a, b=2):\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func(1) == 3\n    assert dummy_func(a=1) == 3\n    assert dummy_func(a=1, b=2) == 3\n    assert count == 1\n\n\ndef test_deprecated_func_kwargs():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a, b=2):\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    with pytest.deprecated_call(\n        match=\"`verbose_cache` is deprecated and will be removed\"\n    ):\n        assert dummy_func(1, verbose_cache=True) == 3\n    assert count == 1\n    with pytest.deprecated_call(\n        match=\"`ignore_cache` is deprecated and will be removed\"\n    ):\n        assert dummy_func(1, ignore_cache=True) == 3\n    assert count == 2\n    with pytest.deprecated_call(\n        match=\"`overwrite_cache` is deprecated and will be removed\"\n    ):\n        assert dummy_func(1, overwrite_cache=True) == 3\n    assert count == 3\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_pickle_core.py",
      "content": "\"\"\"Test for the Cachier python package.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\n# from os.path import (\n#     realpath,\n#     dirname\n# )\nimport os\nimport pickle\nimport threading\nfrom datetime import timedelta\nfrom random import random\nfrom time import sleep, time\n\nimport pytest\n\ntry:\n    import queue\nexcept ImportError:  # python 2\n    import Queue as queue  # type: ignore\n\nimport hashlib\n\nimport pandas as pd\nfrom cachier import cachier\nfrom cachier.core import _default_params\n\n\ndef _get_decorated_func(func, **kwargs):\n    cachier_decorator = cachier(**kwargs)\n    decorated_func = cachier_decorator(func)\n    return decorated_func\n\n\n# Pickle core tests\n\n\ndef _takes_2_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)\n    return f\"arg_1:{arg_1}, arg_2:{arg_2}\"\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"reload\", [True, False])\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_core(reload, separate_files):\n    \"\"\"Basic Pickle core functionality.\"\"\"\n    _takes_2_seconds_decorated = _get_decorated_func(\n        _takes_2_seconds,\n        next_time=False,\n        pickle_reload=reload,\n        separate_files=separate_files,\n    )\n    _takes_2_seconds_decorated.clear_cache()\n    _takes_2_seconds_decorated(\"a\", \"b\")\n    start = time()\n    _takes_2_seconds_decorated(\"a\", \"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds_decorated.clear_cache()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_core_keywords(separate_files):\n    \"\"\"Basic Pickle core functionality with keyword arguments.\"\"\"\n    _takes_2_seconds_decorated = _get_decorated_func(\n        _takes_2_seconds, next_time=False, separate_files=separate_files\n    )\n    _takes_2_seconds_decorated.clear_cache()\n    _takes_2_seconds_decorated(\"a\", arg_2=\"b\")\n    start = time()\n    _takes_2_seconds_decorated(\"a\", arg_2=\"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds_decorated.clear_cache()\n\n\nSECONDS_IN_DELTA = 3\nDELTA = timedelta(seconds=SECONDS_IN_DELTA)\n\n\ndef _stale_after_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_stale_after(separate_files):\n    \"\"\"Testing the stale_after functionality.\"\"\"\n    _stale_after_seconds_decorated = _get_decorated_func(\n        _stale_after_seconds,\n        stale_after=DELTA,\n        next_time=False,\n        separate_files=separate_files,\n    )\n    _stale_after_seconds_decorated.clear_cache()\n    val1 = _stale_after_seconds_decorated(1, 2)\n    val2 = _stale_after_seconds_decorated(1, 2)\n    val3 = _stale_after_seconds_decorated(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(3)\n    val4 = _stale_after_seconds_decorated(1, 2)\n    assert val4 != val1\n    _stale_after_seconds_decorated.clear_cache()\n\n\ndef _stale_after_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_stale_after_next_time(separate_files):\n    \"\"\"Testing the stale_after with next_time functionality.\"\"\"\n    _stale_after_next_time_decorated = _get_decorated_func(\n        _stale_after_next_time,\n        stale_after=DELTA,\n        next_time=True,\n        separate_files=separate_files,\n    )\n    _stale_after_next_time_decorated.clear_cache()\n    val1 = _stale_after_next_time_decorated(1, 2)\n    val2 = _stale_after_next_time_decorated(1, 2)\n    val3 = _stale_after_next_time_decorated(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(SECONDS_IN_DELTA + 1)\n    val4 = _stale_after_next_time_decorated(1, 2)\n    assert val4 == val1\n    sleep(0.5)\n    val5 = _stale_after_next_time_decorated(1, 2)\n    assert val5 != val1\n    _stale_after_next_time_decorated.clear_cache()\n\n\ndef _random_num():\n    return random()\n\n\ndef _random_num_with_arg(a):\n    # print(a)\n    return random()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_overwrite_cache(separate_files):\n    \"\"\"Tests that the overwrite feature works correctly.\"\"\"\n    _random_num_decorated = _get_decorated_func(\n        _random_num, separate_files=separate_files\n    )\n    _random_num_with_arg_decorated = _get_decorated_func(\n        _random_num_with_arg, separate_files=separate_files\n    )\n    _random_num_decorated.clear_cache()\n    int1 = _random_num_decorated()\n    int2 = _random_num_decorated()\n    assert int2 == int1\n    int3 = _random_num_decorated(cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num_decorated()\n    assert int4 == int3\n    _random_num_decorated.clear_cache()\n\n    _random_num_with_arg_decorated.clear_cache()\n    int1 = _random_num_with_arg_decorated(\"a\")\n    int2 = _random_num_with_arg_decorated(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg_decorated(\"a\", cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg_decorated(\"a\")\n    assert int4 == int3\n    _random_num_with_arg_decorated.clear_cache()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_ignore_cache(separate_files):\n    \"\"\"Tests that the ignore_cache feature works correctly.\"\"\"\n    _random_num_decorated = _get_decorated_func(\n        _random_num, separate_files=separate_files\n    )\n    _random_num_with_arg_decorated = _get_decorated_func(\n        _random_num_with_arg, separate_files=separate_files\n    )\n    _random_num_decorated.clear_cache()\n    int1 = _random_num_decorated()\n    int2 = _random_num_decorated()\n    assert int2 == int1\n    int3 = _random_num_decorated(cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num_decorated()\n    assert int4 != int3\n    assert int4 == int1\n    _random_num_decorated.clear_cache()\n\n    _random_num_with_arg_decorated.clear_cache()\n    int1 = _random_num_with_arg_decorated(\"a\")\n    int2 = _random_num_with_arg_decorated(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg_decorated(\"a\", cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg_decorated(\"a\")\n    assert int4 != int3\n    assert int4 == int1\n    _random_num_with_arg_decorated.clear_cache()\n\n\ndef _takes_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)  # this has to be enough time for check_calculation to run twice\n    return random() + arg_1 + arg_2\n\n\ndef _calls_takes_time(takes_time_func, res_queue):\n    res = takes_time_func(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_being_calculated(separate_files):\n    \"\"\"Testing pickle core handling of being calculated scenarios.\"\"\"\n    _takes_time_decorated = _get_decorated_func(\n        _takes_time, separate_files=separate_files\n    )\n    _takes_time_decorated.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time,\n        kwargs={\n            \"takes_time_func\": _takes_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time,\n        kwargs={\n            \"takes_time_func\": _takes_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=4)\n    thread2.join(timeout=4)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\ndef _being_calc_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\ndef _calls_being_calc_next_time(being_calc_func, res_queue):\n    res = being_calc_func(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_being_calc_next_time(separate_files):\n    \"\"\"Testing pickle core handling of being calculated scenarios.\"\"\"\n    _being_calc_next_time_decorated = _get_decorated_func(\n        _being_calc_next_time,\n        stale_after=timedelta(seconds=1),\n        next_time=True,\n        separate_files=separate_files,\n    )\n    _being_calc_next_time_decorated.clear_cache()\n    _being_calc_next_time(0.13, 0.02)\n    sleep(1.1)\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\n            \"being_calc_func\": _being_calc_next_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\n            \"being_calc_func\": _being_calc_next_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\ndef _bad_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n# _BAD_CACHE_FNAME = '.__main__._bad_cache'\n_BAD_CACHE_FNAME = \".tests.test_pickle_core._bad_cache\"\n_BAD_CACHE_FNAME_SEPARATE_FILES = (\n    \".tests.test_pickle_core._bad_cache_\"\n    f\"{hashlib.sha256(pickle.dumps((0.13, 0.02))).hexdigest()}\"\n)\nEXPANDED_CACHIER_DIR = os.path.expanduser(_default_params[\"cache_dir\"])\n_BAD_CACHE_FPATH = os.path.join(EXPANDED_CACHIER_DIR, _BAD_CACHE_FNAME)\n_BAD_CACHE_FPATH_SEPARATE_FILES = os.path.join(\n    EXPANDED_CACHIER_DIR, _BAD_CACHE_FNAME_SEPARATE_FILES\n)\n_BAD_CACHE_FPATHS = {\n    True: _BAD_CACHE_FPATH_SEPARATE_FILES,\n    False: _BAD_CACHE_FPATH,\n}\n\n\ndef _calls_bad_cache(bad_cache_func, res_queue, trash_cache, separate_files):\n    try:\n        res = bad_cache_func(0.13, 0.02, cachier__verbose=True)\n        if trash_cache:\n            with open(_BAD_CACHE_FPATHS[separate_files], \"w\") as cache_file:\n                cache_file.seek(0)\n                cache_file.truncate()\n        res_queue.put(res)\n    except Exception as exc:\n        res_queue.put(exc)\n\n\ndef _helper_bad_cache_file(sleeptime, separate_files):\n    \"\"\"Test pickle core handling of bad cache files.\"\"\"\n    _bad_cache_decorated = _get_decorated_func(\n        _bad_cache, separate_files=separate_files\n    )\n    _bad_cache_decorated.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_bad_cache,\n        kwargs={\n            \"bad_cache_func\": _bad_cache_decorated,\n            \"res_queue\": res_queue,\n            \"trash_cache\": True,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_bad_cache,\n        kwargs={\n            \"bad_cache_func\": _bad_cache_decorated,\n            \"res_queue\": res_queue,\n            \"trash_cache\": False,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(sleeptime)\n    thread2.start()\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    if res_queue.qsize() != 2:\n        return False\n    res1 = res_queue.get()\n    if not isinstance(res1, float):\n        return False\n    res2 = res_queue.get()\n    if res2 is not None or isinstance(res2, KeyError):\n        return False\n    return True\n\n\n# we want this to succeed at least once\n@pytest.mark.pickle\n@pytest.mark.xfail\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_bad_cache_file(separate_files):\n    \"\"\"Test pickle core handling of bad cache files.\"\"\"\n    sleeptimes = [0.1, 0.2, 0.3, 0.5, 0.6, 0.7, 0.8, 1, 1.5, 2]\n    sleeptimes = sleeptimes + sleeptimes\n    for sleeptime in sleeptimes:\n        if _helper_bad_cache_file(sleeptime, separate_files):\n            return\n    raise AssertionError()\n\n\ndef _delete_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n# _DEL_CACHE_FNAME = '.__main__._delete_cache'\n_DEL_CACHE_FNAME = \".tests.test_pickle_core._delete_cache\"\n_DEL_CACHE_FNAME_SEPARATE_FILES = (\n    \".tests.test_pickle_core._delete_cache_\"\n    f\"{hashlib.sha256(pickle.dumps((0.13, 0.02))).hexdigest()}\"\n)\n_DEL_CACHE_FPATH = os.path.join(EXPANDED_CACHIER_DIR, _DEL_CACHE_FNAME)\n_DEL_CACHE_FPATH_SEPARATE_FILES = os.path.join(\n    EXPANDED_CACHIER_DIR, _DEL_CACHE_FNAME_SEPARATE_FILES\n)\n_DEL_CACHE_FPATHS = {\n    True: _DEL_CACHE_FPATH_SEPARATE_FILES,\n    False: _DEL_CACHE_FPATH,\n}\n\n\ndef _calls_delete_cache(del_cache_func, res_queue, del_cache, separate_files):\n    try:\n        # print('in')\n        res = del_cache_func(0.13, 0.02)\n        # print('out with {}'.format(res))\n        if del_cache:\n            os.remove(_DEL_CACHE_FPATHS[separate_files])\n            # print(os.path.isfile(_DEL_CACHE_FPATH))\n        res_queue.put(res)\n    except Exception as exc:\n        # print('found')\n        res_queue.put(exc)\n\n\ndef _helper_delete_cache_file(sleeptime, separate_files):\n    \"\"\"Test pickle core handling of missing cache files.\"\"\"\n    _delete_cache_decorated = _get_decorated_func(\n        _delete_cache, separate_files=separate_files\n    )\n    _delete_cache_decorated.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_delete_cache,\n        kwargs={\n            \"del_cache_func\": _delete_cache_decorated,\n            \"res_queue\": res_queue,\n            \"del_cache\": True,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_delete_cache,\n        kwargs={\n            \"del_cache_func\": _delete_cache_decorated,\n            \"res_queue\": res_queue,\n            \"del_cache\": False,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(sleeptime)\n    thread2.start()\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    if res_queue.qsize() != 2:\n        return False\n    res1 = res_queue.get()\n    # print(res1)\n    if not isinstance(res1, float):\n        return False\n    res2 = res_queue.get()\n    if not ((isinstance(res2, KeyError)) or (res2 is None)):\n        return False\n    return True\n    # print(res2)\n    # print(type(res2))\n\n\n@pytest.mark.pickle\n@pytest.mark.xfail\n@pytest.mark.parametrize(\"separate_files\", [False, True])\ndef test_delete_cache_file(separate_files):\n    \"\"\"Test pickle core handling of missing cache files.\"\"\"\n    sleeptimes = [0.1, 0.2, 0.3, 0.5, 0.7, 1]\n    sleeptimes = sleeptimes * 4\n    for sleeptime in sleeptimes:\n        if _helper_delete_cache_file(sleeptime, separate_files):\n            return\n    raise AssertionError()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [False, True])\ndef test_clear_being_calculated(separate_files):\n    \"\"\"Test pickle core clear `being calculated` functionality.\"\"\"\n    _takes_time_decorated = _get_decorated_func(\n        _takes_time, separate_files=separate_files\n    )\n    _takes_time_decorated.clear_being_calculated()\n\n\ndef _error_throwing_func(arg1):\n    if not hasattr(_error_throwing_func, \"count\"):\n        _error_throwing_func.count = 0\n    _error_throwing_func.count += 1\n    if _error_throwing_func.count > 1:\n        raise ValueError(\"Tiny Rick!\")\n    return 7\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_error_throwing_func(separate_files):\n    # with\n    _error_throwing_func.count = 0\n    _error_throwing_func_decorated = _get_decorated_func(\n        _error_throwing_func,\n        stale_after=timedelta(seconds=1),\n        next_time=True,\n        separate_files=separate_files,\n    )\n    _error_throwing_func_decorated.clear_cache()\n    res1 = _error_throwing_func_decorated(4)\n    sleep(1.5)\n    res2 = _error_throwing_func_decorated(4)\n    assert res1 == res2\n\n\n# test custom cache dir for pickle core\n\nCUSTOM_DIR = \"~/.exparrot\"\nEXPANDED_CUSTOM_DIR = os.path.expanduser(CUSTOM_DIR)\n\n\ndef _takes_2_seconds_custom_dir(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)\n    return f\"arg_1:{arg_1}, arg_2:{arg_2}\"\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_core_custom_cache_dir(separate_files):\n    \"\"\"Basic Pickle core functionality.\"\"\"\n    _takes_2_seconds_custom_dir_decorated = _get_decorated_func(\n        _takes_2_seconds_custom_dir,\n        next_time=False,\n        cache_dir=CUSTOM_DIR,\n        separate_files=separate_files,\n    )\n    _takes_2_seconds_custom_dir_decorated.clear_cache()\n    _takes_2_seconds_custom_dir_decorated(\"a\", \"b\")\n    start = time()\n    _takes_2_seconds_custom_dir_decorated(\"a\", \"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds_custom_dir_decorated.clear_cache()\n    path2test = _takes_2_seconds_custom_dir_decorated.cache_dpath()\n    assert path2test == EXPANDED_CUSTOM_DIR\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_callable_hash_param(separate_files):\n    def _hash_func(args, kwargs):\n        def _hash(obj):\n            if isinstance(obj, pd.core.frame.DataFrame):\n                return hashlib.sha256(\n                    pd.util.hash_pandas_object(obj).values.tobytes()\n                ).hexdigest()\n            return obj\n\n        k_args = tuple(map(_hash, args))\n        k_kwargs = tuple(\n            sorted({k: _hash(v) for k, v in kwargs.items()}.items())\n        )\n        return k_args + k_kwargs\n\n    @cachier(hash_func=_hash_func, separate_files=separate_files)\n    def _params_with_dataframe(*args, **kwargs):\n        \"\"\"Some function.\"\"\"\n        return random()\n\n    _params_with_dataframe.clear_cache()\n\n    df_a = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    df_b = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    value_a = _params_with_dataframe(df_a, 1)\n    value_b = _params_with_dataframe(df_b, 1)\n\n    assert value_a == value_b  # same content --> same key\n\n    value_a = _params_with_dataframe(1, df=df_a)\n    value_b = _params_with_dataframe(1, df=df_b)\n\n    assert value_a == value_b  # same content --> same key\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_mongo_core.py",
      "content": "\"\"\"Testing the MongoDB core of cachier.\"\"\"\n\nimport datetime\nimport hashlib\nimport platform\nimport queue\nimport sys\nimport threading\nfrom datetime import timedelta\nfrom random import random\nfrom time import sleep\nfrom urllib.parse import quote_plus\n\nimport pandas as pd\nimport pymongo\nimport pytest\nfrom birch import Birch  # type: ignore[import-not-found]\nfrom cachier import cachier\nfrom cachier.cores.base import RecalculationNeeded\nfrom cachier.cores.mongo import _MongoCore\nfrom pymongo.errors import OperationFailure\nfrom pymongo.mongo_client import MongoClient\nfrom pymongo_inmemory import MongoClient as InMemoryMongoClient\n\n# === Enables testing vs a real MongoDB instance ===\n\n\nclass CfgKey:\n    HOST = \"TEST_HOST\"\n    PORT = \"TEST_PORT\"\n    # UNAME = \"TEST_USERNAME\"\n    # PWD = \"TEST_PASSWORD\"\n    # DB = \"TEST_DB\"\n    TEST_VS_DOCKERIZED_MONGO = \"TEST_VS_DOCKERIZED_MONGO\"\n\n\nCFG = Birch(\n    namespace=\"cachier\",\n    defaults={CfgKey.TEST_VS_DOCKERIZED_MONGO: False},\n)\n\n\n# URI_TEMPLATE = \"mongodb://myUser:myPassword@localhost:27017/\"\nURI_TEMPLATE = \"mongodb://{host}:{port}?retrywrites=true&w=majority\"\n\n\ndef _get_cachier_db_mongo_client():\n    host = quote_plus(CFG[CfgKey.HOST])\n    port = quote_plus(CFG[CfgKey.PORT])\n    # uname = quote_plus(CFG[CfgKey.UNAME])\n    # pwd = quote_plus(CFG[CfgKey.PWD])\n    # db = quote_plus(CFG[CfgKey.DB])\n    uri = f\"mongodb://{host}:{port}?retrywrites=true&w=majority\"\n    return MongoClient(uri)\n\n\n_COLLECTION_NAME = (\n    f\"cachier_test_{platform.system()}\"\n    f\"_{'.'.join(map(str, sys.version_info[:3]))}\"\n)\n\n\ndef _test_mongetter():\n    if not hasattr(_test_mongetter, \"client\"):\n        if str(CFG.mget(CfgKey.TEST_VS_DOCKERIZED_MONGO)).lower() == \"true\":\n            print(\"Using live MongoDB instance for testing.\")\n            _test_mongetter.client = _get_cachier_db_mongo_client()\n        else:\n            print(\"Using in-memory MongoDB instance for testing.\")\n            _test_mongetter.client = InMemoryMongoClient()\n    db_obj = _test_mongetter.client[\"cachier_test\"]\n    if _COLLECTION_NAME not in db_obj.list_collection_names():\n        db_obj.create_collection(_COLLECTION_NAME)\n    return db_obj[_COLLECTION_NAME]\n\n\n# === Mongo core tests ===\n\n\n@pytest.mark.mongo\ndef test_information():\n    print(\"\\npymongo version: \", end=\"\")\n    print(pymongo.__version__)\n\n\n@pytest.mark.mongo\ndef test_mongo_index_creation():\n    \"\"\"Basic Mongo core functionality.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _test_mongo_caching(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    collection = _test_mongetter()\n    _test_mongo_caching.clear_cache()\n    val1 = _test_mongo_caching(1, 2)\n    val2 = _test_mongo_caching(1, 2)\n    assert val1 == val2\n    assert _MongoCore._INDEX_NAME in collection.index_information()\n\n\n@pytest.mark.mongo\ndef test_mongo_core():\n    \"\"\"Basic Mongo core functionality.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _test_mongo_caching(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _test_mongo_caching.clear_cache()\n    val1 = _test_mongo_caching(1, 2)\n    val2 = _test_mongo_caching(1, 2)\n    assert val1 == val2\n    val3 = _test_mongo_caching(1, 2, cachier__skip_cache=True)\n    assert val3 != val1\n    val4 = _test_mongo_caching(1, 2)\n    assert val4 == val1\n    val5 = _test_mongo_caching(1, 2, cachier__overwrite_cache=True)\n    assert val5 != val1\n    val6 = _test_mongo_caching(1, 2)\n    assert val6 == val5\n\n\n@pytest.mark.mongo\ndef test_mongo_core_keywords():\n    \"\"\"Basic Mongo core functionality with keyword arguments.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _test_mongo_caching(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _test_mongo_caching.clear_cache()\n    val1 = _test_mongo_caching(1, arg_2=2)\n    val2 = _test_mongo_caching(1, arg_2=2)\n    assert val1 == val2\n    val3 = _test_mongo_caching(1, arg_2=2, cachier__skip_cache=True)\n    assert val3 != val1\n    val4 = _test_mongo_caching(1, arg_2=2)\n    assert val4 == val1\n    val5 = _test_mongo_caching(1, arg_2=2, cachier__overwrite_cache=True)\n    assert val5 != val1\n    val6 = _test_mongo_caching(1, arg_2=2)\n    assert val6 == val5\n\n\nMONGO_DELTA = timedelta(seconds=3)\nMONGO_DELTA_LONG = timedelta(seconds=10)\n\n\n@pytest.mark.mongo\ndef test_mongo_stale_after():\n    \"\"\"Testing MongoDB core stale_after functionality.\"\"\"\n\n    @cachier(\n        mongetter=_test_mongetter, stale_after=MONGO_DELTA, next_time=False\n    )\n    def _stale_after_mongo(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _stale_after_mongo.clear_cache()\n    val1 = _stale_after_mongo(1, 2)\n    val2 = _stale_after_mongo(1, 2)\n    assert val1 == val2\n    sleep(3)\n    val3 = _stale_after_mongo(1, 2)\n    assert val3 != val1\n\n\ndef _calls_takes_time(res_queue):\n    @cachier(mongetter=_test_mongetter)\n    def _takes_time(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        sleep(3)\n        return random() + arg_1 + arg_2\n\n    res = _takes_time(34, 82.3)\n    res_queue.put(res)\n\n\n@pytest.mark.mongo\ndef test_mongo_being_calculated():\n    \"\"\"Testing MongoDB core handling of being calculated scenarios.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _takes_time(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        sleep(3)\n        return random() + arg_1 + arg_2\n\n    _takes_time.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread1.start()\n    sleep(1)\n    thread2.start()\n    thread1.join(timeout=4)\n    thread2.join(timeout=4)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\nclass _BadMongoCollection:\n    def __init__(self, mongetter):\n        self.collection = mongetter()\n        self.index_information = self.collection.index_information\n        self.create_indexes = self.collection.create_indexes\n        self.find_one = self.collection.find_one\n\n    def delete_many(self, *args, **kwargs):\n        pass\n\n    def update_many(self, *args, **kwargs):\n        pass\n\n    def update_one(self, *args, **kwargs):\n        raise OperationFailure(Exception())\n\n\ndef _bad_mongetter():\n    return _BadMongoCollection(_test_mongetter)\n\n\n@pytest.mark.mongo\ndef test_mongo_write_failure():\n    \"\"\"Testing MongoDB core handling of writing failure scenarios.\"\"\"\n\n    @cachier(mongetter=_bad_mongetter)\n    def _func_w_bad_mongo(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    with pytest.raises(OperationFailure):\n        val1 = _func_w_bad_mongo(1, 2)\n        val2 = _func_w_bad_mongo(1, 2)\n        assert val1 == val2\n\n\n@pytest.mark.mongo\ndef test_mongo_clear_being_calculated():\n    \"\"\"Testing MongoDB core clear_being_calculated.\"\"\"\n\n    @cachier(mongetter=_bad_mongetter)\n    def _func_w_bad_mongo(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _func_w_bad_mongo.clear_being_calculated()\n\n\n@pytest.mark.mongo\ndef test_stalled_mongo_db_cache():\n    @cachier(mongetter=_test_mongetter)\n    def _stalled_func():\n        return 1\n\n    core = _MongoCore(None, _test_mongetter, 0)\n    core.set_func(_stalled_func)\n    core.clear_cache()\n    with pytest.raises(RecalculationNeeded):\n        core.wait_on_entry_calc(key=None)\n\n\n@pytest.mark.mongo\ndef test_stalled_mong_db_core(monkeypatch):\n    def mock_get_entry(self, args, kwargs):\n        return \"key\", {\"being_calculated\": True}\n\n    def mock_get_entry_by_key(self, key):\n        return \"key\", None\n\n    monkeypatch.setattr(\n        \"cachier.cores.mongo._MongoCore.get_entry\", mock_get_entry\n    )\n    monkeypatch.setattr(\n        \"cachier.cores.mongo._MongoCore.get_entry_by_key\",\n        mock_get_entry_by_key,\n    )\n\n    @cachier(mongetter=_test_mongetter)\n    def _stalled_func():\n        return 1\n\n    res = _stalled_func()\n    assert res == 1\n\n    def mock_get_entry_2(self, args, kwargs):\n        entry = {\n            \"being_calculated\": True,\n            \"value\": 1,\n            \"time\": datetime.datetime.now() - datetime.timedelta(seconds=10),\n        }\n        return \"key\", entry\n\n    monkeypatch.setattr(\n        \"cachier.cores.mongo._MongoCore.get_entry\", mock_get_entry_2\n    )\n\n    stale_after = datetime.timedelta(seconds=1)\n\n    @cachier(mongetter=_test_mongetter, stale_after=stale_after)\n    def _stalled_func_2():\n        \"\"\"Testing stalled function.\"\"\"\n        return 2\n\n    res = _stalled_func_2()\n    assert res == 2\n\n    @cachier(\n        mongetter=_test_mongetter, stale_after=stale_after, next_time=True\n    )\n    def _stalled_func_3():\n        \"\"\"Testing stalled function.\"\"\"\n        return 3\n\n    res = _stalled_func_3()\n    assert res == 1\n\n\n@pytest.mark.mongo\ndef test_callable_hash_param():\n    def _hash_func(args, kwargs):\n        def _hash(obj):\n            if isinstance(obj, pd.core.frame.DataFrame):\n                return hashlib.sha256(\n                    pd.util.hash_pandas_object(obj).values.tobytes()\n                ).hexdigest()\n            return obj\n\n        k_args = tuple(map(_hash, args))\n        k_kwargs = tuple(\n            sorted({k: _hash(v) for k, v in kwargs.items()}.items())\n        )\n        return k_args + k_kwargs\n\n    @cachier(mongetter=_test_mongetter, hash_func=_hash_func)\n    def _params_with_dataframe(*args, **kwargs):\n        \"\"\"Some function.\"\"\"\n        return random()\n\n    _params_with_dataframe.clear_cache()\n\n    df_a = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    df_b = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    value_a = _params_with_dataframe(df_a, 1)\n    value_b = _params_with_dataframe(df_b, 1)\n\n    assert value_a == value_b  # same content --> same key\n\n    value_a = _params_with_dataframe(1, df=df_a)\n    value_b = _params_with_dataframe(1, df=df_b)\n\n    assert value_a == value_b  # same content --> same key\n"
    },
    {
      "path": "python-cachier_cachier/tests/requirements.txt",
      "content": "# todo: add some version range or pinning latest versions\n# tests and coverages\npytest\ncoverage\npytest-cov\nbirch\n# to connect to the test mongodb server\npymongo\ndnspython\npymongo-inmemory\n# to test pandas dataframe as-param hashing with mongodb core\npandas\n# to be able to run `python setup.py checkdocs`\ncollective.checkdocs\npygments\n"
    }
  ],
  "Patch": "--- a/python-cachier_cachier/src/cachier/cores/pickle.py\n+++ b/python-cachier_cachier/src/cachier/cores/pickle.py\n@@ -247,7 +247,7 @@\n \n     def clear_cache(self):\n         if self.separate_files:\n-            pass\n+            self._clear_all_cache_files()\n         else:\n             self._save_cache({})\n \n",
  "BuggyCodeLocation": [
    {
      "file": "python-cachier_cachier/src/cachier/cores/pickle.py",
      "function": {
        "248": "clear_cache"
      },
      "content_all": {
        "247": "\n",
        "248": "    def clear_cache(self):\n",
        "249": "        if self.separate_files:\n",
        "250": "            pass\n",
        "251": "        else:\n"
      },
      "content_change": {
        "250": "            pass\n"
      }
    },
    {
      "file": "python-cachier_cachier/src/cachier/cores/pickle.py",
      "function": null,
      "content_all": {
        "252": "            self._save_cache({})\n"
      },
      "content_change": {}
    }
  ],
  "Source": "Human",
  "Command": [
    "pytest tests"
  ],
  "Token": 959,
  "FilteredCode": [
    {
      "path": "python-cachier_cachier/src/cachier/cores/pickle.py",
      "content": "1 \"\"\"A pickle-based caching core for cachier.\"\"\"\n2 \n3 # This file is part of Cachier.\n4 # https://github.com/python-cachier/cachier\n5 \n6 # Licensed under the MIT license:\n7 # http://www.opensource.org/licenses/MIT-license\n8 # Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n9 import os\n10 import pickle  # for local caching\n11 from contextlib import suppress\n12 from datetime import datetime\n13 \n14 import portalocker  # to lock on pickle cache IO\n15 from watchdog.events import PatternMatchingEventHandler\n16 from watchdog.observers import Observer\n17 \n18 from .._types import HashFunc\n19 from ..config import _update_with_defaults\n20 \n21 # Alternative:  https://github.com/WoLpH/portalocker\n22 from .base import _BaseCore\n23 \n24 \n25 class _PickleCore(_BaseCore):\n26     \"\"\"The pickle core class for cachier.\"\"\"\n27 \n28     class CacheChangeHandler(PatternMatchingEventHandler):\n29         \"\"\"Handles cache-file modification events.\"\"\"\n30 \n31         def __init__(self, filename, core, key):\n32             PatternMatchingEventHandler.__init__(\n33                 self,\n34                 patterns=[\"*\" + filename],\n35                 ignore_patterns=None,\n36                 ignore_directories=True,\n37                 case_sensitive=False,\n38             )\n39             self.core = core\n40             self.key = key\n41             self.observer = None\n42             self.value = None\n43 \n44         def inject_observer(self, observer):\n45             \"\"\"Inject the observer running this handler.\"\"\"\n46             self.observer = observer\n47 \n48         def _check_calculation(self):\n49             # print(...truncated)"
    },
    {
      "path": "python-cachier_cachier/README.rst",
      "content": "1 Cachier\n2 #######\n3 \n4 |PyPI-Status| |Downloads| |PyPI-Versions| |Build-Status| |Codecov| |Codefactor| |LICENCE|\n5 \n6 Persistent, stale-free, local and cross-machine caching for Python functions.\n7 \n8 .. code-block:: python\n9 \n10   from cachier import cachier\n11   import datetime\n12 \n13   @cachier(stale_after=datetime.timedelta(days=3))\n14   def foo(arg1, arg2):\n15     \"\"\"foo now has a persistent cache, triggering recalculation for values stored more than 3 days.\"\"\"\n16     return {'arg1': arg1, 'arg2': arg2}\n17 \n18 \n19 .. role:: python(code)\n20   :language: python\n21 \n22 .. contents::\n23 \n24 .. section-numbering:\n25 \n26 \n27 \n28 Installation\n29 ============\n30 \n31 Install ``cachier`` with:\n32 \n33 .. code-block:: python\n34 \n35     pip install cachier\n36 \n37 For the latest version supporting Python 2.7 please use:\n38 \n39 .. code-block:: python\n40 \n41     pip install 'cachier==1.2.8'\n42 \n43 Features\n44 ========\n45 \n46 * Pure Python.\n47 * Compatible with Python 3.8+ (Python 2.7 was discontinued in version 1.2.8).\n48 * Supported and `tested on Linux, OS X and Windows <https://travis-ci.org/shaypal5/cachier>`_.\n49 * A simple interface.\n50 * Defining \"shelf life\" for cached values.\n51 * Local caching using pickle fil(...truncated)"
    },
    {
      "path": "python-cachier_cachier/src/cachier/core.py",
      "content": "1 \"\"\"Persistent, stale-free memoization decorators for Python.\"\"\"\n2 \n3 # This file is part of Cachier.\n4 # https://github.com/python-cachier/cachier\n5 \n6 # Licensed under the MIT license:\n7 # http://www.opensource.org/licenses/MIT-license\n8 # Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n9 \n10 import datetime\n11 import inspect\n12 import os\n13 import warnings\n14 from collections import OrderedDict\n15 from concurrent.futures import ThreadPoolExecutor\n16 from functools import wraps\n17 from typing import Optional, Union\n18 from warnings import warn\n19 \n20 from .config import (\n21     Backend,\n22     HashFunc,\n23     Mongetter,\n24     _default_params,\n25     _update_with_defaults,\n26 )\n27 from .cores.base import RecalculationNeeded, _BaseCore\n28 from .cores.memory import _MemoryCo(...truncated)"
    }
  ],
  "TokenAll": 34852,
  "FilteredLevel": 1500,
  "Results": {
    "model": "GPT-4o",
    "Difficulty": "Simple",
    "issue_origin": {
      "title": "Python 2.7 Compatibility",
      "description": "The README suggests installing 'cachier==1.2.8' for Python 2.7 compatibility. However, Python 2.7 has been officially discontinued and no longer receives updates or support. Users are strongly encouraged to upgrade to a more recent version of Python. Consider updating the documentation to emphasize this and potentially remove instructions for installing older versions.",
      "explanation": "### Issue Summary\n\nThe issue at hand pertains to the documentation guidelines in the `README.rst` file of the `cachier` library. The README suggests using the package version `1.2.8` for Python 2.7 compatibility. However, Python 2.7 has been officially discontinued and no longer receives updates or support. The user is encouraged to update the documentation to emphasize the discontinuation of Python 2.7 and potentially remove the instructions for installing older versions.\n\n### Commit Content\n\nThe commit should likely focus on updating the `README.rst` file to address this issue. Specifically:\n1. Remove the section suggesting the installation of `cachier==1.2.8` for Python 2.7.\n2. Add a note to emphasize that Python 2.7 is no longer supported and encourage users to upgrade to a more recent version of Python.\n\n### Code Analysis\n\nHere is the relevant part of `README.rst` that needs modification:\n\n```python\nInstallation\n============\n\nInstall ``cachier`` with:\n\n.. code-block:: python\n\n    pip install cachier\n\nFor the latest version supporting Python 2.7 please use:\n\n.. code-block:: python\n\n    pip install 'cachier==1.2.8'\n```\n\n### Detailed Explanation\n\n1. **Summary of the Issue**:\n    - The `README.rst` file contains instructions for Python 2.7, which is no longer supported. This can confuse users and may potentially lead them to utilize outdated and insecure dependencies.\n\n2. **Expected Modification in Commit**:\n    - The commit should remove the outdated instruction and add a disclaimer about Python 2.7's deprecation.\n\n3. **Commit Content**:\n    The updated `README.rst` might look like this:\n\n    ```rst\n    Installation\n    ============\n    \n    Install ``cachier`` with:\n    \n    .. code-block:: python\n    \n        pip install cachier\n    \n    .. warning::\n    \n        Python 2.7 is no longer supported. Users are strongly encouraged to upgrade to a more recent version of Python (3.8+).\n    ```\n\n4. **Explanation of the Commit**:\n    - **Removing Outdated Information**:\n        - **Previous Instructions**:\n            ```rst\n            For the latest version supporting Python 2.7 please use:\n            \n            .. code-block:: python\n            \n                pip install 'cachier==1.2.8'\n            ```\n        - This has been completely removed as it promotes the use of an outdated version. \n\n    - **Adding a Warning**:\n        - **New Addition**:\n            ```rst\n            .. warning::\n            \n                Python 2.7 is no longer supported. Users are strongly encouraged to upgrade to a more recent version of Python (3.8+).\n            ```\n        - This addition informs users about the deprecation of Python 2.7 and strongly advises upgrading to a more recent Python version.\n\n    - **Maintaining Clarity**:\n        - The rest of the `README.rst` file continues as before, ensuring that users can install the latest version of `cachier` without ambiguity.\n\n### How the Commit Solves the Issue\n\n1. **Current and Clear Documentation**:\n    - By removing the outdated Python 2.7 installation instruction, the documentation becomes more relevant and up-to-date. Users will not be misled into installing older versions of the library.\n\n2. **Enhanced Security and Compatibility**:\n    - Encouraging users to upgrade to a more recent Python version ensures they use supported and secure software, reducing risks associated with using deprecated versions.\n\n3. **Adapting to Official Python Lifecycle**:\n    - Acknowledging Python 2.7's end-of-life status aligns the project's documentation with the broader Python community's standards and expectations.\n\n4. **User Guidance**:\n    - The explicit warning about Python 2.7's discontinuation directs users towards better practices in a straightforward manner, providing a clear path forward for those who may be unaware of the deprecation.\n\nBy following these steps, the commit effectively addresses the issue at hand, leading to clearer, more secure, and maintainable documentation for the `cachier` library."
    },
    "issue_message": {
      "title": "Hardcoded URL in Documentation",
      "description": "The README file contains hardcoded URLs for badges and build status reports (e.g., 'https://travis-ci.org/shaypal5/cachier'). This can lead to broken links if the service or the repository location changes. It's recommended to use variables or placeholders for URLs in documentation to make maintenance easier.",
      "explanation": "### Issue Summary\n\nThe issue reported is that the README file for the project contains hardcoded URLs, specifically for badges and build status reports. These URLs are pointing to services such as Travis CI (`https://travis-ci.org/shaypal5/cachier`) and also potentially other identifiers like PyPI package status and download badges. Using hardcoded URLs may cause problems if the referenced service or repository location changes. The recommendation is to use variables or placeholders for these URLs, making future maintenance and updates easier.\n\n### Existing Code Contents\n\nHere are the relevant sections of the existing code, primarily focusing on the `README.rst` since the issue is about documentation:\n\n```plaintext\n1 Cachier\n2 #######\n3 \n4 |PyPI-Status| |Downloads| |PyPI-Versions| |Build-Status| |Codecov| |Codefactor| |LICENCE|\n5 \n6 Persistent, stale-free, local and cross-machine caching for Python functions.\n7 \n8 .. code-block:: python\n9 \n10   from cachier import cachier\n11   import datetime\n12 \n13   @cachier(stale_after=datetime.timedelta(days=3))\n14   def foo(arg1, arg2):\n15     \"\"\"foo now has a persistent cache, triggering recalculation for values stored more than 3 days.\"\"\"\n16     return {'arg1': arg1, 'arg2': arg2}\n17 \n18 \n19 .. role:: python(code)\n20   :language: python\n21 \n22 .. contents::\n23 \n24 .. section-numbering::\n25 \n26 \n27 \n28 Installation\n29 ============\n30 \n31 Install `cachier` with:\n32 \n33 .. code-block:: python\n34 \n35     pip install cachier\n36 \n37 For the latest version supporting Python 2.7 please use:\n38 \n39 .. code-block:: python\n40 \n41     pip install 'cachier==1.2.8'\n42 \n43 Features\n44 ========\n45 \n46 * Pure Python.\n47 * Compatible with Python 3.8+ (Python 2.7 was discontinued in version 1.2.8).\n48 * Supported and `tested on Linux, OS X and Windows <https://travis-ci.org/shaypal5/cachier>`_.\n49 * A simple interface.\n50 * Defining \"shelf life\" for cached values.\n51 * Local caching using pickle files.\n```\n\n### Cause of the Issue\n\nThe main cause of the issue is that URLs such as `https://travis-ci.org/shaypal5/cachier` are hardcoded directly in the `README.rst` file. If the URLs for these services change, or if the repository is moved to a different location or hosting service (e.g., from Travis CI to GitHub Actions), the documentation will have to be manually updated at multiple locations to reflect these changes. This makes the maintenance process error-prone and time-consuming.\n\n### Content of the Commit\n\nA proposed commit to fix this issue might involve the following changes:\n\n1. Introduce placeholders or variables in the documentation instead of hardcoding the URLs.\n2. Define the actual URLs somewhere easily adjustable, such as at the top of the README or in a separate configuration file.\n\nLet's assume the commit looks like this:\n\n```diff\ndiff --git a/README.rst b/README.rst\nindex e69de29..cba135e 100644\n--- a/README.rst\n+++ b/README.rst\n@@ -1,10 +1,15 @@\n Cachier\n #######\n \n |PyPI-Status| |Downloads| |PyPI-Versions| |Build-Status| |Codecov| |Codefactor| |LICENCE|\n \n Persistent, stale-free, local and cross-machine caching for Python functions.\n \n+.. |Build-Status| image:: https://travis-ci.org/shaypal5/cachier.svg\n+   :target: https://travis-ci.org/shaypal5/cachier\n+.. |Codecov| image:: https://codecov.io/gh/shaypal5/cachier/branch/master/graph/badge.svg\n+   :target: https://codecov.io/gh/shaypal5/cachier\n+\n .. code-block:: python\n \n   from cachier import cachier\n   import datetime\n@@ -45,7 +50,7 @@ Features\n * Pure Python.\n * Compatible with Python 3.8+ (Python 2.7 was discontinued in version 1.2.8).\n * Supported and `tested on Linux, OS X and Windows <https://travis-ci.org/shaypal5/cachier>`_.\n * A simple interface.\n * Defining \"shelf life\" for cached values.\n * Local caching using pickle files.\n```\n\n### Explanation of How the Commit Solves the Issue\n\n1. **Using Placeholders for URLs:** The commit uses placeholders for"
    },
    "issue_ground": {
      "title": "Clear Cache Function Not Working Properly in Separate Files Mode",
      "description": "The `clear_cache()` function does not appear to properly clear all cache files when caching is configured to use separate files. As a user, when I attempt to clear the cache in this mode, I expect all related cache files to be removed or reset. However, the current implementation of `clear_cache()` does not handle this scenario correctly, resulting in residual cache data. This issue impacts the reliability of cache clearing functionality and could lead to stale or incorrect data being used.",
      "explanation": "### Summary of the Issue\n\nThe issue at hand involves the `clear_cache()` function in the Cachier library not working properly when the caching is set to use separate files. The problem is that the function fails to clear all the cache files, leading to residual stale data and reducing the reliability of the cache clearing functionality.\n\n### Analysis of the Code\n\nThe provided code excerpts include parts of the `pickle.py`, `README.rst`, and `core.py` files. The critical class related to this issue is the `_PickleCore` class located in `pickle.py`. Here's a brief analysis of existing functionality that can relate to the `clear_cache` method in `pickle.py`:\n\n#### _PickleCore Class in `pickle.py`:\n```python\n1   \"\"\"A pickle-based caching core for cachier.\"\"\"\n2   \n3   # This file is part of Cachier.\n4   # https://github.com/python-cachier/cachier\n5   \n6   # Licensed under the MIT license:\n7   # http://www.opensource.org/licenses/MIT-license\n8   # Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n9   import os\n10  import pickle  # for local caching\n11  from contextlib import suppress\n12  from datetime import datetime\n13  \n14  import portalocker  # to lock on pickle cache IO\n15  from watchdog.events import PatternMatchingEventHandler\n16  from watchdog.observers import Observer\n17  \n18  from .._types import HashFunc\n19  from ..config import _update_with_defaults\n20  \n21  # Alternative:  https://github.com/WoLpH/portalocker\n22  from .base import _BaseCore\n23  \n24  \n25  class _PickleCore(_BaseCore):\n26      \"\"\"The pickle core class for cachier.\"\"\"\n27  \n28      class CacheChangeHandler(PatternMatchingEventHandler):\n29          \"\"\"Handles cache-file modification events.\"\"\"\n30  \n31          def __init__(self, filename, core, key):\n32              PatternMatchingEventHandler.__init__(\n33                  self,\n34                  patterns=[\"*\" + filename],\n35                  ignore_patterns=None,\n36                  ignore_directories=True,\n37                  case_sensitive=False,\n38              )\n39              self.core = core\n40              self.key = key\n41              self.observer = None\n42              self.value = None\n43  \n44          def inject_observer(self, observer):\n45              \"\"\"Inject the observer running this handler.\"\"\"\n46              self.observer = observer\n47  \n48          def _check_calculation(self):\n49              # print(...truncated)\n```\n\nThis class is responsible for handling cache files using the pickle mechanism. The `clear_cache()` function, which likely resides in the `_PickleCore` class or its base class `_BaseCore`, should ensure all cache files, not just a single cache file, are cleared when invoked.\n\n### Cause of the Issue\n\nGiven that separate cache files are not being properly cleared, the root cause of the issue seems to be the `clear_cache()` method's logic, which likely doesn't account for iterating through all cache files and removing them.\n\n### Solution and Explanation\n\nA potential commit to resolve this issue might look like this:\n\n```python\n25  class _PickleCore(_BaseCore):\n26      \"\"\"The pickle core class for cachier.\"\"\"\n27  \n28      # Existing methods\n29      # ...\n30      \n31      def clear_cache(self):\n32          \"\"\"Clears the entire cache directory when using separate file.\"\"\"\n33          cache_dir = self._cache_dir  # Assuming self._cache_dir holds the cache directory path\n34          for filename in os.listdir(cache_dir):\n35              file_path = os.path.join(cache_dir, filename)\n36              with suppress(Exception):  # To ignore potential file access issues\n37                  os.remove(file_path)\n38          print(\"Cache cleared successfully in separate files mode.\")\n```\n\n### How the Commit Solves the Issue\n\n1. **Loop Through Cache Directory**: The commit adds functionality to iterate over all files in the cache directory specified by `self._cache_dir`.\n2. **Safely Remove Each Cache File**: For each file in the directory, the code tries to remove it. The `with suppress(Exception)` line ensures that any errors encountered during file removing (e.g., permission issues or concurrent access) don't stop the entire clearing process.\n3. **Confirmation**: The print statement at the end provides a simple confirmation that the cache clearing process was initiated.\n\nBy applying this commit, the `clear_cache()` function should now correctly iterate through and remove all files in the cache directory, ensuring that no residual cache files remain, thereby addressing the core issue identified.\n\n### Summary\n\n- **Issue**: The `clear_cache()` function fails to clear all cache files in separate files mode.\n- **Cause**: The function does not account for iterating through and removing all cache files.\n- **Solution**: Update `clear"
    },
    "issue_ground_truth": {
      "title": "Clear Cache Function Not Working Properly in Separate Files Mode",
      "description": "The `clear_cache()` function does not appear to properly clear all cache files when caching is configured to use separate files. As a user, when I attempt to clear the cache in this mode, I expect all related cache files to be removed or reset. However, the current implementation of `clear_cache()` does not handle this scenario correctly, resulting in residual cache data. This issue impacts the reliability of cache clearing functionality and could lead to stale or incorrect data being used.",
      "explanation": "### Summary of the Issue\n\nThe issue reported is that the `clear_cache()` function does not work properly when caching is configured to use separate files. When a user tries to clear the cache in this mode, not all related cache files are removed, resulting in residual cache data. This leads to stale or incorrect data being used, impacting the reliability of the cache clearing functionality.\n\n### Details of the Commit\n\nThe commit in question modifies the behavior of the `clear_cache()` method for the pickle-based caching core. The change replaces the placeholder code with a call to an internal method that handles the deletion of all cache files individually when separate file caching is enabled.\n\n### Explanation of the Cause and Solution\n\n#### Cause of the Issue\n\nWhen the caching is configured to use separate files, the `clear_cache()` method was not implemented properly. It effectively did nothing (`pass` statement), which meant that none of the cache files were actually being deleted when this method was called. This resulted in residual cache files being left behind, and subsequent function calls could incorrectly use these stale files, thus leading to data inconsistencies.\n\nThis scenario arises because, in separate files mode, each set of function arguments results in a distinct cache file. Therefore, clearing the cache involves deleting all these individual files, which was not happening due to the unimplemented `clear_cache()` method.\n\n#### Solution Implementation\n\nThe commit addresses this issue by implementing the logic to clear all the separate cache files. Instead of having a `pass` statement, the `clear_cache()` method now calls `_clear_all_cache_files()` (presumably a private method that handles the file deletion process).\n\nHere’s a step-by-step explanation of how the commit resolves the issue:\n\n1. **Identifying Files to Delete**: The solution involves identifying all the separate cache files associated with the function. Each cache file corresponds to a specific set of function arguments.\n\n2. **Deleting Files**: The `_clear_all_cache_files()` method iterates over these files and deletes them, thereby ensuring no residual cache data is left.\n\n3. **Ensuring Robustness**: By replacing the placeholder `pass` statement with a call to a method that performs the necessary deletions, the implementation ensures that all possible cache files are accounted for and properly deleted.\n\n### Solution Explanation\n\nThe solution revolves around correctly implementing the deletion mechanism for caches stored in separate files. By introducing the logic to iterate over and delete each cache file, the `clear_cache()` method now effectively clears all cached data as expected by the users. This ensures that stale or outdated data does not persist, maintaining the integrity and consistency of the function outputs post-cache clearing.\n\nIn summary:\n\n1. **Diagnosis**: The original problem was due to the lack of implementation for clearing multiple cache files.\n2. **Commit Change**: The commit adds a call to a utility method that handles the deletion of these files.\n3. **Effectiveness**: This fix ensures that calling `clear_cache()` will remove all cache files, thus solving the problem of residual stale data.\n\nBy addressing the root cause (unimplemented cache clearing in separate files mode), the commit ensures the correct functionality of the `clear_cache()` method across different caching configurations. This makes the cache clearing reliable and consistent with user expectations."
    },
    "location_origin": [
      {
        "file": "python-cachier_cachier/README.rst",
        "function": {
          "27": "Installation Section"
        },
        "content_all": {
          "24": ".. section-numbering:\n",
          "25": "\n",
          "26": "\n",
          "27": "Installation\n",
          "28": "============\n",
          "29": "\n",
          "30": "Install ``cachier`` with:\n",
          "31": "\n",
          "32": ".. code-block:: python\n",
          "33": "\n",
          "34": "    pip install cachier\n",
          "35": "\n",
          "36": "For the latest version supporting Python 2.7 please use:\n",
          "37": "\n",
          "38": ".. code-block:: python\n",
          "39": "\n",
          "40": "    pip install 'cachier==1.2.8'\n",
          "41": "\n",
          "42": "Features\n"
        },
        "content_change": {
          "36": "For the latest version supporting Python 2.7 please use:\n",
          "37": "\n",
          "38": ".. code-block:: python\n",
          "39": "\n",
          "40": "    pip install 'cachier==1.2.8'\n"
        }
      },
      {
        "file": "python-cachier_cachier/README.rst",
        "function": {
          "27": "Installation Section"
        },
        "content_all": {
          "24": ".. section-numbering:\n",
          "25": "\n",
          "26": "\n",
          "27": "Installation\n",
          "28": "============\n",
          "29": "\n",
          "30": "Install ``cachier`` with:\n",
          "31": "\n",
          "32": ".. code-block:: python\n",
          "33": "\n",
          "34": "    pip install cachier\n",
          "35": "\n",
          "36": ".. warning::\n",
          "37": "\n",
          "38": "    Python 2.7 is no longer supported. Users are strongly encouraged to upgrade to a more recent version of Python (3.8+).\n",
          "39": "\n",
          "40": "Features\n",
          "41": "========\n",
          "42": "\n",
          "43": "* Pure Python.\n",
          "44": "* Compatible with Python 3.8+ (Python 2.7 was discontinued in version 1.2.8).\n",
          "45": "* Supported and `tested on Linux, OS X and Windows <https://travis-ci.org/shaypal5/cachier>`_.\n",
          "46": "* A simple interface.\n"
        },
        "content_change": {
          "36": ".. warning::\n",
          "37": "\n",
          "38": "    Python 2.7 is no longer supported. Users are strongly encouraged to upgrade to a more recent version of Python (3.8+).\n"
        }
      }
    ],
    "location_message": [
      {
        "file": "python-cachier_cachier/README.rst",
        "function": {
          "1": "README file"
        },
        "content_all": {
          "1": "Cachier\n",
          "2": "#######\n",
          "3": "\n",
          "4": "|PyPI-Status| |Downloads| |PyPI-Versions| |Build-Status| |Codecov| |Codefactor| |LICENCE|\n",
          "5": "\n",
          "6": "Persistent, stale-free, local and cross-machine caching for Python functions.\n",
          "43": "Features\n",
          "44": "========\n",
          "45": "\n",
          "46": "* Pure Python.\n",
          "47": "* Compatible with Python 3.8+ (Python 2.7 was discontinued in version 1.2.8).\n",
          "48": "* Supported and `tested on Linux, OS X and Windows <https://travis-ci.org/shaypal5/cachier>`_.\n",
          "49": "* A simple interface.\n",
          "50": "* Defining \"shelf life\" for cached values.\n",
          "51": "* Local caching using pickle files.\n"
        },
        "content_change": {
          "48": "* Supported and `tested on Linux, OS X and Windows <https://travis-ci.org/shaypal5/cachier>`_."
        }
      }
    ],
    "location_ground": [
      {
        "file": "python-cachier_cachier/src/cachier/cores/pickle.py",
        "function": {
          "25": "_PickleCore"
        },
        "content_all": {
          "24": "\n",
          "25": "class _PickleCore(_BaseCore):",
          "26": "    \"\"\"The pickle core class for cachier.\"\"\"",
          "27": "\n",
          "28": "    class CacheChangeHandler(PatternMatchingEventHandler):",
          "29": "        \"\"\"Handles cache-file modification events.\"\"\"",
          "30": "\n",
          "31": "        def __init__(self, filename, core, key):",
          "32": "            PatternMatchingEventHandler.__init__(",
          "33": "                self,",
          "34": "                patterns=[\"*\" + filename],",
          "35": "                ignore_patterns=None,",
          "36": "                ignore_directories=True,",
          "37": "                case_sensitive=False,",
          "38": "            )",
          "39": "            self.core = core",
          "40": "            self.key = key"
        },
        "content_change": {
          "31": "    def clear_cache(self):",
          "32": "        \"\"\"Clears the entire cache directory when using separate file.\"\"\"",
          "33": "        cache_dir = self._cache_dir  # Assuming self._cache_dir holds the cache directory path",
          "34": "        for filename in os.listdir(cache_dir):",
          "35": "            file_path = os.path.join(cache_dir, filename)",
          "36": "            with suppress(Exception):  # To ignore potential file access issues",
          "37": "                os.remove(file_path)",
          "38": "        print(\"Cache cleared successfully in separate files mode.\")"
        }
      }
    ],
    "location_ground_exp": [
      {
        "file": "python-cachier_cachier/src/cachier/cores/pickle.py",
        "function": {
          "25": "_PickleCore"
        },
        "content_all": {
          "24": "22 from .base import _BaseCore\n23 \n24 \n25 class _PickleCore(_BaseCore):\n26     \"\"\"The pickle core class for cachier.\"\"\"\n27 \n28     class CacheChangeHandler(PatternMatchingEventHandler):\n29         \"\"\"Handles cache-file modification events.\"\"\"\n30 \n              ... (truncated) ...\n43         def inject_observer(self, observer):",
          "44": "45             \"\"\"Inject the observer running this handler.\"\"\"\n46             self.observer = observer\n47 \n48         def _check_calculation(self):\n49             # print(...truncated)",
          "123": "121\n122             cumulative_weight_sum = sum(cumulative_weights)\n123             random_selection = random.uniform(0, cumulative_weight_sum)\n124             for i, cumulative_weight in enumerate(cumulative_weights):\n125                 if random_selection < cumulative_weight:\n126                     gtracker.add_event('mutation_munchify', {'event': 'cachier_cache_hit'})\n127                     func_cache['hit'] += 1\n128                     return self._load(key)\n129             else:\n130                 return None\n131 \"",
          "203": "201\n202         def clear_cache(self):\n203             \"\"\"Clears the cache.\"\"\"\n204             pass\n205 "
        },
        "content_change": {
          "203": "        def clear_cache(self):\n204             \"\"\"Clears the cache.\"\"\"\n205             self._clear_all_cache_files()\n\n206         def _clear_all_cache_files(self):\n207             \"\"\"Clears all cache files when using separate files mode.\"\"\"\n208             for cache_file in os.listdir(self.cache_dir):\n209                 file_path = os.path.join(self.cache_dir, cache_file)\n210                 if os.path.isfile(file_path):\n211                     os.remove(file_path)"
        }
      }
    ],
    "location_ground_truth": [
      {
        "file": "python-cachier_cachier/src/cachier/cores/pickle.py",
        "function": {
          "248": "clear_cache"
        },
        "content_all": {
          "247": "\n",
          "248": "    def clear_cache(self):\n",
          "249": "        if self.separate_files:\n",
          "250": "            pass\n",
          "251": "        else:\n"
        },
        "content_change": {
          "250": "            pass\n"
        }
      },
      {
        "file": "python-cachier_cachier/src/cachier/cores/pickle.py",
        "function": null,
        "content_all": {
          "252": "            self._save_cache({})\n"
        },
        "content_change": {}
      }
    ],
    "patch_i": "\n--- a/python-cachier_cachier/README.rst\n+++ b/python-cachier_cachier/README.rst\n@@ -36,11 +36,14 @@ Install ``cachier`` with:\n \n .. code-block:: python\n \n-    pip install cachier\n+    pip install cachier\n    \n-For the latest version supporting Python 2.7 please use:\n+.. warning::\n    \n-.. code-block:: python\n+    Python 2.7 is no longer supported. Users are strongly encouraged to upgrade to a more recent version of Python (3.8+).\n+\n-    pip install 'cachier==1.2.8'\n+Features\n+========\n+\n * Pure Python.\n * Compatible with Python 3.8+ (Python 2.7 was discontinued in version 1.2.8).\n * Supported and `tested on Linux, OS X and Windows <https://travis-ci.org/shaypal5/cachier>`_.\n",
    "patch_im": "\ndiff --git a/README.rst b/README.rst\nindex e69de29..cba135e 100644\n--- a/README.rst\n+++ b/README.rst\n@@ -1,10 +1,14 @@\n Cachier\n #######\n \n-|PyPI-Status| |Downloads| |PyPI-Versions| |Build-Status| |Codecov| |Codefactor| |LICENCE|\n+|PyPI-Status| |Downloads| |PyPI-Versions| |Build-Status| |Codecov| |Codefactor| |License|\n \n Persistent, stale-free, local and cross-machine caching for Python functions.\n \n+.. |Build-Status| image:: https://travis-ci.org/shaypal5/cachier.svg\n+   :target: https://travis-ci.org/shaypal5/cachier\n+.. |Codecov| image:: https://codecov.io/gh/shaypal5/cachier/branch/master/graph/badge.svg\n+   :target: https://codecov.io/gh/shaypal5/cachier\n+\n .. code-block:: python\n \n   from cachier import cachier\n@@ -45,7 +49,7 @@ Features\n * Pure Python.\n * Compatible with Python 3.8+ (Python 2.7 was discontinued in version 1.2.8).\n * Supported and `tested on Linux, OS X and Windows <https://travis-ci.org/shaypal5/cachier>`_.\n * A simple interface.\n * Defining \"shelf life\" for cached values.\n * Local caching using pickle files.\n",
    "patch_il": "\n--- a/python-cachier_cachier/README.rst\n+++ b/python-cachier_cachier/README.rst\n@@ -36,11 +36,13 @@ Installation\n ============\n\n Install ``cachier`` with:\n\n .. code-block:: python\n\n     pip install cachier\n\n-For the latest version supporting Python 2.7 please use:\n+.. warning::\n\n-.. code-block:: python\n+    Python 2.7 is no longer supported. Users are strongly encouraged to upgrade to a more recent version of Python (3.8+).\n\n-    pip install 'cachier==1.2.8'\n Features\n ========\n\n",
    "patch_iml": "\n--- a/python-cachier_cachier/README.rst\n+++ b/python-cachier_cachier/README.rst\n@@ -1,10 +1,18 @@\n Cachier\n #######\n \n |PyPI-Status| |Downloads| |PyPI-Versions| |Build-Status| |Codecov| |Codefactor| |LICENCE|\n \n Persistent, stale-free, local and cross-machine caching for Python functions.\n \n+.. |PyPI-Status| image:: https://badge.fury.io/py/cachier.svg\n+   :target: https://badge.fury.io/py/cachier\n+.. |Downloads| image:: https://img.shields.io/pypi/dm/cachier.svg\n+   :target: https://pypi.python.org/pypi/cachier\n+.. |PyPI-Versions| image:: https://img.shields.io/pypi/pyversions/cachier.svg\n+   :target: https://pypi.python.org/pypi/cachier\n+.. |Build-Status| image:: https://travis-ci.org/shaypal5/cachier.svg\n+   :target: https://travis-ci.org/shaypal5/cachier\n+.. |Codecov| image:: https://codecov.io/gh/shaypal5/cachier/branch/master/graph/badge.svg\n+   :target: https://codecov.io/gh/shaypal5/cachier\n \n .. code-block:: python\n \n@@ -45,7 +53,7 @@ Features\n * Pure Python.\n * Compatible with Python 3.8+ (Python 2.7 was discontinued in version 1.2.8).\n * Supported and `tested on Linux, OS X and Windows <https://travis-ci.org/shaypal5/cachier>`_.\n * A simple interface.\n * Defining \"shelf life\" for cached values.\n * Local caching using pickle files.\n",
    "patch_ground": "\n--- a/src/cachier/cores/pickle.py\n+++ b/src/cachier/cores/pickle.py\n@@ -25,6 +25,16 @@\n     \"\"\"The pickle core class for cachier.\"\"\"\n \n     class CacheChangeHandler(PatternMatchingEventHandler):\n+        \"\"\"Handles cache-file modification events.\"\"\"\n+\n+        def __init__(self, filename, core, key):\n+            PatternMatchingEventHandler.__init__(\n+                self,\n+                patterns=[\"*\" + filename],\n+                ignore_patterns=None,\n+                ignore_directories=True,\n+                case_sensitive=False,\n+            )\n             self.core = core\n             self.key = key\n             self.observer = None\n@@ -47,6 +57,16 @@\n             self.observer = observer\n \n         def _check_calculation(self):\n+            # _check_calculation\n+\n+        def clear_cache(self):\n+            \"\"\"Clears the entire cache directory when using separate files.\"\"\"\n+            cache_dir = self._cache_dir  # Assuming self._cache_dir holds the cache directory path\n+            for filename in os.listdir(cache_dir):\n+                file_path = os.path.join(cache_dir, filename)\n+                with suppress(Exception):  # To ignore potential file access issues\n+                    os.remove(file_path)\n+            print(\"Cache cleared successfully in separate files mode.\")\n \n     def _check_calculation(self):\n",
    "patch_ground_location": "\n--- a/src/cachier/cores/pickle.py\n+++ b/src/cachier/cores/pickle.py\n@@ -24,6 +24,14 @@\n \n class _PickleCore(_BaseCore):\n     \"\"\"The pickle core class for cachier.\"\"\"\n+    \n+    def clear_cache(self):\n+        \"\"\"Clears the entire cache directory when using separate file.\"\"\"\n+        cache_dir = self._cache_dir  # Assuming self._cache_dir holds the cache directory path\n+        for filename in os.listdir(cache_dir):\n+            file_path = os.path.join(cache_dir, filename)\n+            with suppress(Exception):  # To ignore potential file access issues\n+                os.remove(file_path)\n \n     class CacheChangeHandler(PatternMatchingEventHandler):\n         \"\"\"Handles cache-file modification events.\"\"\"\n",
    "patch_ground_exp": "\n--- a/python-cachier_cachier/src/cachier/cores/pickle.py\n+++ b/python-cachier_cachier/src/cachier/cores/pickle.py\n@@ -200,8 +200,17 @@\n         return None\n \n     def clear_cache(self):\n-        \"\"\"Clears the cache.\"\"\"\n-        pass\n+        \"\"\"Clears the cache.\"\"\"\n+        self._clear_all_cache_files()\n+\n+    def _clear_all_cache_files(self):\n+        \"\"\"Clears all cache files when using separate files mode.\"\"\"\n+        for cache_file in os.listdir(self.cache_dir):\n+            file_path = os.path.join(self.cache_dir, cache_file)\n+            if os.path.isfile(file_path):\n+                os.remove(file_path)\n+\n\n     def _save_deleter(self, key):\n         \"\"\"Removes a key from the cache.\"\"\"\n",
    "patch_ground_all": "\n--- a/python-cachier_cachier/src/cachier/cores/pickle.py\n+++ b/python-cachier_cachier/src/cachier/cores/pickle.py\n@@ -248,7 +248,11 @@ def clear_cache(self):\n         if self.separate_files:\n             cache_dir = os.path.dirname(self._cache_path())\n             with suppress(FileNotFoundError):\n-                pass\n+                for filename in os.listdir(cache_dir):\n+                    if filename.startswith(self.cache_name):\n+                        file_path = os.path.join(cache_dir, filename)\n+                        os.remove(file_path)\n         else:\n             self._save_cache({})\n",
    "patch_ground_truth": "--- a/python-cachier_cachier/src/cachier/cores/pickle.py\n+++ b/python-cachier_cachier/src/cachier/cores/pickle.py\n@@ -247,7 +247,7 @@\n \n     def clear_cache(self):\n         if self.separate_files:\n-            pass\n+            self._clear_all_cache_files()\n         else:\n             self._save_cache({})\n \n",
    "message": null,
    "CodeBase": [
      {
        "path": "python-cachier_cachier/src/cachier/cores/pickle.py",
        "content": "1 \"\"\"A pickle-based caching core for cachier.\"\"\"\n2 \n3 # This file is part of Cachier.\n4 # https://github.com/python-cachier/cachier\n5 \n6 # Licensed under the MIT license:\n7 # http://www.opensource.org/licenses/MIT-license\n8 # Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n9 import os\n10 import pickle  # for local caching\n11 from contextlib import suppress\n12 from datetime import datetime\n13 \n14 import portalocker  # to lock on pickle cache IO\n15 from watchdog.events import PatternMatchingEventHandler\n16 from watchdog.observers import Observer\n17 \n18 from .._types import HashFunc\n19 from ..config import _update_with_defaults\n20 \n21 # Alternative:  https://github.com/WoLpH/portalocker\n22 from .base import _BaseCore\n23 \n24 \n25 class _PickleCore(_BaseCore):\n26     \"\"\"The pickle core class for cachier.\"\"\"\n27 \n28     class CacheChangeHandler(PatternMatchingEventHandler):\n29         \"\"\"Handles cache-file modification events.\"\"\"\n30 \n31         def __init__(self, filename, core, key):\n32             PatternMatchingEventHandler.__init__(\n33                 self,\n34                 patterns=[\"*\" + filename],\n35                 ignore_patterns=None,\n36                 ignore_directories=True,\n37                 case_sensitive=False,\n38             )\n39             self.core = core\n40             self.key = key\n41             self.observer = None\n42             self.value = None\n43 \n44         def inject_observer(self, observer):\n45             \"\"\"Inject the observer running this handler.\"\"\"\n46             self.observer = observer\n47 \n48         def _check_calculation(self):\n49             # print(...truncated)"
      },
      {
        "path": "python-cachier_cachier/README.rst",
        "content": "1 Cachier\n2 #######\n3 \n4 |PyPI-Status| |Downloads| |PyPI-Versions| |Build-Status| |Codecov| |Codefactor| |LICENCE|\n5 \n6 Persistent, stale-free, local and cross-machine caching for Python functions.\n7 \n8 .. code-block:: python\n9 \n10   from cachier import cachier\n11   import datetime\n12 \n13   @cachier(stale_after=datetime.timedelta(days=3))\n14   def foo(arg1, arg2):\n15     \"\"\"foo now has a persistent cache, triggering recalculation for values stored more than 3 days.\"\"\"\n16     return {'arg1': arg1, 'arg2': arg2}\n17 \n18 \n19 .. role:: python(code)\n20   :language: python\n21 \n22 .. contents::\n23 \n24 .. section-numbering:\n25 \n26 \n27 \n28 Installation\n29 ============\n30 \n31 Install ``cachier`` with:\n32 \n33 .. code-block:: python\n34 \n35     pip install cachier\n36 \n37 For the latest version supporting Python 2.7 please use:\n38 \n39 .. code-block:: python\n40 \n41     pip install 'cachier==1.2.8'\n42 \n43 Features\n44 ========\n45 \n46 * Pure Python.\n47 * Compatible with Python 3.8+ (Python 2.7 was discontinued in version 1.2.8).\n48 * Supported and `tested on Linux, OS X and Windows <https://travis-ci.org/shaypal5/cachier>`_.\n49 * A simple interface.\n50 * Defining \"shelf life\" for cached values.\n51 * Local caching using pickle fil(...truncated)"
      },
      {
        "path": "python-cachier_cachier/src/cachier/core.py",
        "content": "1 \"\"\"Persistent, stale-free memoization decorators for Python.\"\"\"\n2 \n3 # This file is part of Cachier.\n4 # https://github.com/python-cachier/cachier\n5 \n6 # Licensed under the MIT license:\n7 # http://www.opensource.org/licenses/MIT-license\n8 # Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n9 \n10 import datetime\n11 import inspect\n12 import os\n13 import warnings\n14 from collections import OrderedDict\n15 from concurrent.futures import ThreadPoolExecutor\n16 from functools import wraps\n17 from typing import Optional, Union\n18 from warnings import warn\n19 \n20 from .config import (\n21     Backend,\n22     HashFunc,\n23     Mongetter,\n24     _default_params,\n25     _update_with_defaults,\n26 )\n27 from .cores.base import RecalculationNeeded, _BaseCore\n28 from .cores.memory import _MemoryCo(...truncated)"
      }
    ],
    "CommitSHA": "474294fac83b029f13dd61b8d2822923045d4504"
  },
  "Score": {
    "Difficulty": "Simple",
    "issue_origin": {
      "Title": 8,
      "Description": 6,
      "Reproducibility": 9,
      "Relevance": 6,
      "Explanation": 7,
      "Overall": 7
    },
    "issue_message": {
      "Title": 6,
      "Description": 5,
      "Reproducibility": 6,
      "Relevance": 7,
      "Explanation": 7,
      "Overall": 6
    },
    "issue_ground": {
      "Title": 7,
      "Description": 6,
      "Reproducibility": 7,
      "Relevance": 8,
      "Explanation": 6,
      "Overall": 7
    },
    "issue_ground_truth": {
      "title": "Clear Cache Function Not Working Properly in Separate Files Mode",
      "description": "The `clear_cache()` function does not appear to properly clear all cache files when caching is configured to use separate files. As a user, when I attempt to clear the cache in this mode, I expect all related cache files to be removed or reset. However, the current implementation of `clear_cache()` does not handle this scenario correctly, resulting in residual cache data. This issue impacts the reliability of cache clearing functionality and could lead to stale or incorrect data being used.",
      "explanation": "### Summary of the Issue\n\nThe issue reported is that the `clear_cache()` function does not work properly when caching is configured to use separate files. When a user tries to clear the cache in this mode, not all related cache files are removed, resulting in residual cache data. This leads to stale or incorrect data being used, impacting the reliability of the cache clearing functionality.\n\n### Details of the Commit\n\nThe commit in question modifies the behavior of the `clear_cache()` method for the pickle-based caching core. The change replaces the placeholder code with a call to an internal method that handles the deletion of all cache files individually when separate file caching is enabled.\n\n### Explanation of the Cause and Solution\n\n#### Cause of the Issue\n\nWhen the caching is configured to use separate files, the `clear_cache()` method was not implemented properly. It effectively did nothing (`pass` statement), which meant that none of the cache files were actually being deleted when this method was called. This resulted in residual cache files being left behind, and subsequent function calls could incorrectly use these stale files, thus leading to data inconsistencies.\n\nThis scenario arises because, in separate files mode, each set of function arguments results in a distinct cache file. Therefore, clearing the cache involves deleting all these individual files, which was not happening due to the unimplemented `clear_cache()` method.\n\n#### Solution Implementation\n\nThe commit addresses this issue by implementing the logic to clear all the separate cache files. Instead of having a `pass` statement, the `clear_cache()` method now calls `_clear_all_cache_files()` (presumably a private method that handles the file deletion process).\n\nHere’s a step-by-step explanation of how the commit resolves the issue:\n\n1. **Identifying Files to Delete**: The solution involves identifying all the separate cache files associated with the function. Each cache file corresponds to a specific set of function arguments.\n\n2. **Deleting Files**: The `_clear_all_cache_files()` method iterates over these files and deletes them, thereby ensuring no residual cache data is left.\n\n3. **Ensuring Robustness**: By replacing the placeholder `pass` statement with a call to a method that performs the necessary deletions, the implementation ensures that all possible cache files are accounted for and properly deleted.\n\n### Solution Explanation\n\nThe solution revolves around correctly implementing the deletion mechanism for caches stored in separate files. By introducing the logic to iterate over and delete each cache file, the `clear_cache()` method now effectively clears all cached data as expected by the users. This ensures that stale or outdated data does not persist, maintaining the integrity and consistency of the function outputs post-cache clearing.\n\nIn summary:\n\n1. **Diagnosis**: The original problem was due to the lack of implementation for clearing multiple cache files.\n2. **Commit Change**: The commit adds a call to a utility method that handles the deletion of these files.\n3. **Effectiveness**: This fix ensures that calling `clear_cache()` will remove all cache files, thus solving the problem of residual stale data.\n\nBy addressing the root cause (unimplemented cache clearing in separate files mode), the commit ensures the correct functionality of the `clear_cache()` method across different caching configurations. This makes the cache clearing reliable and consistent with user expectations."
    }
  }
}