{
  "RepoName": "python-cachier/cachier.git",
  "CommitSHA": "474294fac83b029f13dd61b8d2822923045d4504",
  "Type": "logic error",
  "ErrorMessage": null,
  "Issue": {
    "title": "Cache Not Clearing Properly with Pickle Core When Using Separate Files",
    "description": "It appears that the cache clearing mechanism within the Pickle core is not functioning correctly when the `separate_files` option is enabled. Specifically, invoking `clear_cache()` does not effectively clear the cache as expected. Instead, it seems to leave the task unhandled, resulting in persisted cache data that should otherwise be removed. This inconsistency can lead to issues where outdated or stale data is being returned, even after a clear operation has been requested. This affects the reliability and predictability of the caching mechanism for users who rely on accurate cache state management.",
    "explanation": "### Summary of the Issue\n\nThe issue reported is titled \"Cache Not Clearing Properly with Pickle Core When Using Separate Files.\" It details a problem where invoking the `clear_cache()` method does not effectively clear the cache when the `separate_files` option is enabled. Instead of clearing the cache as expected, the task remains unhandled and the cache data persists. This can lead to outdated or stale data being returned, resulting in unreliable and unpredictable caching behavior for users.\n\n### Analysis of the Problem\n\nFrom the description:\n- The `clear_cache()` function, when used with the `separate_files` option, does not function correctly.\n- The cache data that should be removed remains, leading to the potential return of old or stale data.\n- This affects the reliability and predictability of the caching mechanism.\n\nIn essence, the problem lies in the fact that the `clear_cache()` method has no implementation to handle the scenario where `separate_files` is enabled. Thus, it does nothing when it should clear individual cache files.\n\n### Commit Details and Explanation\n\nThe provided commit patch indicates that the developer has recognized this issue and has addressed it by modifying the `clear_cache` method inside the `pickle.py` file.\n\n#### Content of the Commit \n\nThe commit adds functionality to the `clear_cache` method:\n- **Developer's Perspective on the Cause**:\n  - The original code for the `clear_cache` method had a placeholder (`pass`), resulting in no actual logic being executed to clear the cache when `separate_files` is enabled.\n  - This lack of implementation led to the described issue. The cache data persisted because there was no mechanism to clear the individual cache files created when `separate_files` was set to `True`.\n\n#### Changes in the Commit\n\n- **New Implementation**:\n  - The commit introduces a call to `_clear_all_cache_files()` within the `clear_cache` method if the `separate_files` option is enabled.\n  - This addition ensures that any existing cache files corresponding to the separate files mode are properly handled and deleted.\n\n### Solution to the Issue\n\n1. **Detection of the Problem**:\n   - The method `clear_cache()` was previously not handling the `separate_files` option effectively.\n   - The placeholder `pass` meant that no action was taken to clear the cache when `separate_files` was set to `True`.\n\n2. **Implementing a Solution**:\n   - The developer added the necessary functionality to actually clear the cache files when `separate_files` is specified.\n   - By calling `_clear_all_cache_files()`, which is presumably responsible for handling and deleting all individual cache files associated with the decorated function, the `clear_cache()` method now performs the intended action.\n\n3. **Effect of the Solution**:\n   - The new implementation guarantees that invoking `clear_cache()` with `separate_files` enabled will correctly navigate through and delete all the cache files.\n   - This ensures that no stale or outdated cache data remains, thus maintaining the reliability and predictability of the caching mechanism. \n\nIn summary, the issue was caused by a lack of implementation in the `clear_cache()` function for the `separate_files` option. The commit fixed this by adding the appropriate call to `_clear_all_cache_files()`, ensuring that all related cache files are properly cleared when needed. This solution directly addresses the main problem and restores the correct behavior of the cache clearing functionality."
  },
  "Explain": "### Summary of the Issue\n\nThe issue reported is titled \"Cache Not Clearing Properly with Pickle Core When Using Separate Files.\" It details a problem where invoking the `clear_cache()` method does not effectively clear the cache when the `separate_files` option is enabled. Instead of clearing the cache as expected, the task remains unhandled and the cache data persists. This can lead to outdated or stale data being returned, resulting in unreliable and unpredictable caching behavior for users.\n\n### Analysis of the Problem\n\nFrom the description:\n- The `clear_cache()` function, when used with the `separate_files` option, does not function correctly.\n- The cache data that should be removed remains, leading to the potential return of old or stale data.\n- This affects the reliability and predictability of the caching mechanism.\n\nIn essence, the problem lies in the fact that the `clear_cache()` method has no implementation to handle the scenario where `separate_files` is enabled. Thus, it does nothing when it should clear individual cache files.\n\n### Commit Details and Explanation\n\nThe provided commit patch indicates that the developer has recognized this issue and has addressed it by modifying the `clear_cache` method inside the `pickle.py` file.\n\n#### Content of the Commit \n\nThe commit adds functionality to the `clear_cache` method:\n- **Developer's Perspective on the Cause**:\n  - The original code for the `clear_cache` method had a placeholder (`pass`), resulting in no actual logic being executed to clear the cache when `separate_files` is enabled.\n  - This lack of implementation led to the described issue. The cache data persisted because there was no mechanism to clear the individual cache files created when `separate_files` was set to `True`.\n\n#### Changes in the Commit\n\n- **New Implementation**:\n  - The commit introduces a call to `_clear_all_cache_files()` within the `clear_cache` method if the `separate_files` option is enabled.\n  - This addition ensures that any existing cache files corresponding to the separate files mode are properly handled and deleted.\n\n### Solution to the Issue\n\n1. **Detection of the Problem**:\n   - The method `clear_cache()` was previously not handling the `separate_files` option effectively.\n   - The placeholder `pass` meant that no action was taken to clear the cache when `separate_files` was set to `True`.\n\n2. **Implementing a Solution**:\n   - The developer added the necessary functionality to actually clear the cache files when `separate_files` is specified.\n   - By calling `_clear_all_cache_files()`, which is presumably responsible for handling and deleting all individual cache files associated with the decorated function, the `clear_cache()` method now performs the intended action.\n\n3. **Effect of the Solution**:\n   - The new implementation guarantees that invoking `clear_cache()` with `separate_files` enabled will correctly navigate through and delete all the cache files.\n   - This ensures that no stale or outdated cache data remains, thus maintaining the reliability and predictability of the caching mechanism. \n\nIn summary, the issue was caused by a lack of implementation in the `clear_cache()` function for the `separate_files` option. The commit fixed this by adding the appropriate call to `_clear_all_cache_files()`, ensuring that all related cache files are properly cleared when needed. This solution directly addresses the main problem and restores the correct behavior of the cache clearing functionality.",
  "Time": "2024-06-02",
  "Difficulty": "Simple",
  "OriginCode": [
    {
      "path": "python-cachier_cachier/MANIFEST.in",
      "content": "# Manifest syntax https://docs.python.org/2/distutils/sourcedist.html\ngraft wheelhouse\n\nrecursive-include src *.info\ninclude README.rst\ninclude LICENSE\n\nprune __pycache__\nprune tests\n"
    },
    {
      "path": "python-cachier_cachier/README.rst",
      "content": "Cachier\n#######\n\n|PyPI-Status| |Downloads| |PyPI-Versions| |Build-Status| |Codecov| |Codefactor| |LICENCE|\n\nPersistent, stale-free, local and cross-machine caching for Python functions.\n\n.. code-block:: python\n\n  from cachier import cachier\n  import datetime\n\n  @cachier(stale_after=datetime.timedelta(days=3))\n  def foo(arg1, arg2):\n    \"\"\"foo now has a persistent cache, triggering recalculation for values stored more than 3 days.\"\"\"\n    return {'arg1': arg1, 'arg2': arg2}\n\n\n.. role:: python(code)\n  :language: python\n\n.. contents::\n\n.. section-numbering:\n\n\n\nInstallation\n============\n\nInstall ``cachier`` with:\n\n.. code-block:: python\n\n    pip install cachier\n\nFor the latest version supporting Python 2.7 please use:\n\n.. code-block:: python\n\n    pip install 'cachier==1.2.8'\n\nFeatures\n========\n\n* Pure Python.\n* Compatible with Python 3.8+ (Python 2.7 was discontinued in version 1.2.8).\n* Supported and `tested on Linux, OS X and Windows <https://travis-ci.org/shaypal5/cachier>`_.\n* A simple interface.\n* Defining \"shelf life\" for cached values.\n* Local caching using pickle files.\n* Cross-machine caching using MongoDB.\n* Thread-safety.\n\nCachier is **NOT**:\n\n* Meant as a transient cache. Python's @lru_cache is better.\n* Especially fast. It is meant to replace function calls that take more than... a second, say (overhead is around 1 millisecond).\n\nFuture features\n---------------\n\n* S3 core.\n* Multi-core caching.\n* `Cache replacement policies <https://en.wikipedia.org/wiki/Cache_replacement_policies>`_\n\n\nUse\n===\n\nCachier provides a decorator which you can wrap around your functions to give them a persistent cache. The positional and keyword arguments to the wrapped function must be hashable (i.e. Python's immutable built-in objects, not mutable containers). Also, notice that since objects which are instances of user-defined classes are hashable but all compare unequal (their hash value is their id), equal objects across different sessions will not yield identical keys.\n\nSetting up a Cache\n------------------\nYou can add a default, pickle-based, persistent cache to your function - meaning it will last across different Python kernels calling the wrapped function - by decorating it with the ``cachier`` decorator (notice the ``()``!).\n\n.. code-block:: python\n\n  from cachier import cachier\n\n  @cachier()\n  def foo(arg1, arg2):\n    \"\"\"Your function now has a persistent cache mapped by argument values!\"\"\"\n    return {'arg1': arg1, 'arg2': arg2}\n\nClass and object methods can also be cached. Cachier will automatically ignore the `self` parameter when determining the cache key for an object method. **This means that methods will be cached across all instances of an object, which may not be what you want.**\n\n.. code-block:: python\n\n  from cachier import cachier\n\n  class Foo():\n    @staticmethod\n    @cachier()\n    def good_static_usage(arg_1, arg_2):\n      return arg_1 + arg_2\n\n    # Instance method does not depend on object's internal state, so good to cache\n    @cachier()\n    def good_usage_1(self, arg_1, arg_2):\n      return arg_1 + arg_2\n\n    # Instance method is calling external service, probably okay to cache\n    @cachier()\n    def good_usage_2(self, arg_1, arg_2):\n      result = self.call_api(arg_1, arg_2)\n      return result\n\n    # Instance method relies on object attribute, NOT good to cache\n    @cachier()\n    def bad_usage(self, arg_1, arg_2):\n      return arg_1 + arg_2 + self.arg_3\n\n\nResetting a Cache\n-----------------\nThe Cachier wrapper adds a ``clear_cache()`` function to each wrapped function. To reset the cache of the wrapped function simply call this method:\n\n.. code-block:: python\n\n  foo.clear_cache()\n\nGeneral Configuration\n----------------------\n\nGlobal Defaults\n~~~~~~~~~~~~~~~\n\nSettings can be globally configured across all Cachier wrappers through the use of the `set_default_params` function. This function takes the same keyword parameters as the ones defined in the decorator, which can be passed all at once or with multiple calls. Parameters given directly to a decorator take precedence over any values set by this function.\n\nThe following parameters will only be applied to decorators defined after `set_default_params` is called:\n\n*  `hash_func`\n*  `backend`\n*  `mongetter`\n*  `cache_dir`\n*  `pickle_reload`\n*  `separate_files`\n\nThese parameters can be changed at any time and they will apply to all decorators:\n\n*  `allow_none`\n*  `caching_enabled`\n*  `stale_after`\n*  `next_time`\n*  `wait_for_calc_timeout`\n\nThe current defaults can be fetched by calling `get_default_params`.\n\nThreads Limit\n~~~~~~~~~~~~~\n\nTo limit the number of threads Cachier is allowed to spawn, set the ``CACHIER_MAX_WORKERS`` with the desired number. The default is 8, so to enable Cachier to spawn even more threads, you'll have to set a higher limit explicitly.\n\n\nGlobal Enable/Disable\n---------------------\n\nCaching can be turned off across all decorators by calling `disable_caching`, and then re-activated by calling `enable_caching`.\n\nThese functions are convenience wrappers around the `caching_enabled` default setting.\n\n\nCache Shelf Life\n----------------\n\nSetting Shelf Life\n~~~~~~~~~~~~~~~~~~\nYou can set any duration as the shelf life of cached return values of a function by providing a corresponding ``timedelta`` object to the ``stale_after`` parameter:\n\n.. code-block:: python\n\n  import datetime\n\n  @cachier(stale_after=datetime.timedelta(weeks=2))\n  def bar(arg1, arg2):\n    return {'arg1': arg1, 'arg2': arg2}\n\nNow when a cached value matching the given arguments is found the time of its calculation is checked; if more than ``stale_after`` time has since passed, the function will be run again for the same arguments and the new value will be cached and returned.\n\nThis is useful for lengthy calculations that depend on a dynamic data source.\n\nFuzzy Shelf Life\n~~~~~~~~~~~~~~~~\nSometimes you may want your function to trigger a calculation when it encounters a stale result, but still not wait on it if it's not that critical. In that case, you can set ``next_time`` to ``True`` to have your function trigger a recalculation **in a separate thread**, but return the currently cached stale value:\n\n.. code-block:: python\n\n  @cachier(next_time=True)\n\nFurther function calls made while the calculation is being performed will not trigger redundant calculations.\n\n\nWorking with unhashable arguments\n---------------------------------\n\nAs mentioned above, the positional and keyword arguments to the wrapped function must be hashable (i.e. Python's immutable built-in objects, not mutable containers). To get around this limitation the ``hash_func`` parameter of the ``cachier`` decorator can be provided with a callable that gets the args and kwargs from the decorated function and returns a hash key for them.\n\n.. code-block:: python\n\n  def calculate_hash(args, kwds):\n    key = ...  # compute a hash key here based on arguments\n    return key\n\n  @cachier(hash_func=calculate_hash)\n  def calculate_super_complex_stuff(custom_obj):\n    # amazing code goes here\n\nSee here for an example:\n\n`Question: How to work with unhashable arguments <https://github.com/python-cachier/cachier/issues/91>`_\n\n\nPrecaching values\n---------------------------------\n\nIf you want to load a value into the cache without calling the underlying function, this can be done with the `precache_value` function.\n\n.. code-block:: python\n\n  @cachier()\n  def add(arg1, arg2):\n    return arg1 + arg2\n\n  add.precache_value(2, 2, value_to_cache=5)\n\n  result = add(2, 2)\n  print(result)  # prints 5\n\n\nPer-function call arguments\n---------------------------\n\nCachier also accepts several keyword arguments in the calls of the function it wraps rather than in the decorator call, allowing you to modify its behaviour for a specific function call.\n\nIgnore Cache\n~~~~~~~~~~~~\n\nYou can have ``cachier`` ignore any existing cache for a specific function call by passing ``cachier__skip_cache=True`` to the function call. The cache will neither be checked nor updated with the new return value.\n\n.. code-block:: python\n\n  @cachier()\n  def sum(first_num, second_num):\n    return first_num + second_num\n\n  def main():\n    print(sum(5, 3, cachier__skip_cache=True))\n\nOverwrite Cache\n~~~~~~~~~~~~~~~\n\nYou can have ``cachier`` overwrite an existing cache entry - if one exists - for a specific function call by passing ``cachier__overwrite_cache=True`` to the function call. The cache will not be checked but will be updated with the new return value.\n\nVerbose Cache Call\n~~~~~~~~~~~~~~~~~~\n\nYou can have ``cachier`` print out a detailed explanation of the logic of a specific call by passing ``cachier__verbose=True`` to the function call. This can be useful if you are not sure why a certain function result is, or is not, returned.\n\nCache `None` Values\n~~~~~~~~~~~~~~~~~~~\n\nBy default, ``cachier`` does not cache ``None`` values. You can override this behaviour by passing ``allow_none=True`` to the function call.\n\n\nCachier Cores\n=============\n\nPickle Core\n-----------\n\nThe default core for Cachier is pickle based, meaning each function will store its cache is a separate pickle file in the ``~/.cachier`` directory. Naturally, this kind of cache is both machine-specific and user-specific.\n\nYou can configure ``cachier`` to use another directory by providing the ``cache_dir`` parameter with the path to that directory:\n\n.. code-block:: python\n\n  @cachier(cache_dir='~/.temp/.cache')\n\n\nYou can slightly optimise pickle-based caching if you know your code will only be used in a single thread environment by setting:\n\n.. code-block:: python\n\n  @cachier(pickle_reload=False)\n\nThis will prevent reading the cache file on each cache read, speeding things up a bit, while also nullifying inter-thread functionality (the code is still thread safe, but different threads will have different versions of the cache at times, and will sometime make unnecessary function calls).\n\nSetting the optional argument ``separate_files`` to ``True`` will cause the cache to be stored in several files: A file per argument set, per function. This can help if your per-function cache files become too large.\n\n.. code-block:: python\n\n  from cachier import cachier\n\n  @cachier(separate_files=True)\n  def foo(arg1, arg2):\n    \"\"\"Your function now has a persistent cache mapped by argument values, split across several files, per argument set\"\"\"\n    return {'arg1': arg1, 'arg2': arg2}\n\nYou can get the fully qualified path to the directory of cache files used by ``cachier`` (``~/.cachier`` by default) by calling the ``cache_dpath()`` function:\n\n.. code-block:: python\n\n  >>> foo.cache_dpath()\n      \"/home/user/.cachier/\"\n\n\nMongoDB Core\n------------\nYou can set a MongoDB-based cache by assigning ``mongetter`` with a callable that returns a ``pymongo.Collection`` object with writing permissions:\n\n.. code-block:: python\n\n    from pymongo import MongoClient\n\n    def my_mongetter():\n        client = MongoClient(get_cachier_db_auth_uri())\n        db_obj = client['cachier_db']\n        if 'someapp_cachier_db' not in db_obj.list_collection_names():\n            db_obj.create_collection('someapp_cachier_db')\n        return db_obj['someapp_cachier_db']\n\n  @cachier(mongetter=my_mongetter)\n\nThis allows you to have a cross-machine, albeit slower, cache. This functionality requires that the installation of the ``pymongo`` python package.\n\nIn certain cases the MongoDB backend might leave a deadlock behind, blocking all subsequent requests from being processed. If you encounter this issue, supply the ``wait_for_calc_timeout`` with a reasonable number of seconds; calls will then wait at most this number of seconds before triggering a recalculation.\n\n.. code-block:: python\n\n  @cachier(mongetter=False, wait_for_calc_timeout=2)\n\n\nMemory Core\n-----------\n\nYou can set an in-memory cache by assigning the ``backend`` parameter with ``'memory'``:\n\n.. code-block:: python\n\n  @cachier(backend='memory')\n\nNote, however, that ``cachier``'s in-memory core is simple, and has no monitoring or cap on cache size, and can thus lead to memory errors on large return values - it is mainly intended to be used with future multi-core functionality. As a rule, Python's built-in ``lru_cache`` is a much better stand-alone solution.\n\n\nContributing\n============\n\nCurrent maintainers are Shay Palachy Affek (`shay.palachy@gmail.com <mailto:shay.palachy@gmail.com>`_, `@shaypal5 <https://github.com/shaypal5>`_) and Judson Neer (`@lordjabez <https://github.com/lordjabez>`_); You are more than welcome to approach them for help. Contributions are very welcomed! :)\n\nInstalling for development\n--------------------------\n\nClone:\n\n.. code-block:: bash\n\n  git clone git@github.com:python-cachier/cachier.git\n\n\nInstall in development mode with test dependencies:\n\n.. code-block:: bash\n\n  cd cachier\n  pip install -e . -r tests/requirements.txt\n\n\nRunning the tests\n-----------------\n\nTo run the tests, call the ``pytest`` command in the repository's root, or:\n\n.. code-block:: bash\n\n  python -m pytest\n\nTo run only MongoDB core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m mongo\n\nTo run only memory core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m memory\n\nTo run all tests EXCEPT MongoDB core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m \"not mongo\"\n\n\nTo run all tests EXCEPT memory core AND MongoDB core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m \"not (mongo or memory)\"\n\n\nRunning MongoDB tests against a live MongoDB instance\n-----------------------------------------------------\n\n**Note to developers:** By default, all MongoDB tests are run against a mocked MongoDB instance, provided by the ``pymongo_inmemory`` package. To run them against a live MongoDB instance, the ``CACHIER_TEST_VS_DOCKERIZED_MONGO`` environment variable is set to ``True`` in the ``test`` environment of this repository (and additional environment variables are populated with the appropriate credentials), used by the GitHub Action running tests on every commit and pull request.\n\nContributors are not expected to run these tests against a live MongoDB instance when developing, as credentials for the testing instance used will NOT be shared, but rather use the testing against the in-memory MongoDB instance as a good proxy.\n\n**HOWEVER, the tests run against a live MongoDB instance when you submit a PR are the determining tests for deciding whether your code functions correctly against MongoDB.**\n\n\nAdding documentation\n--------------------\n\nThis project is documented using the `numpy docstring conventions`_, which were chosen as they are perhaps the most widely-spread conventions that are both supported by common tools such as Sphinx and result in human-readable docstrings (in my personal opinion, of course). When documenting code you add to this project, please follow `these conventions`_.\n\n.. _`numpy docstring conventions`: https://github.com/numpy/numpy/blob/master/doc/HOWTO_DOCUMENT.rst.txt\n.. _`these conventions`: https://github.com/numpy/numpy/blob/master/doc/HOWTO_DOCUMENT.rst.txt\n\nAdditionally, if you update this ``README.rst`` file, use ``python setup.py checkdocs`` to validate it compiles.\n\n\nCredits\n=======\n\nCreated by `Shay Palachy Affek <https://github.com/shaypal5>`_ (shay.palachy@gmail.com), which currently assists in maintenance.\n\nCurrent lead developer/contributor: `Jirka Borovec <https://github.com/Borda>`_ (`@Borda <https://github.com/Borda>`_ on GitHub).\n\nOther major contributors:\n\n* `Jirka Borovec <https://github.com/Borda>`_ - Arg order independence, args-to-kwargs for less unique keys and numerous development and CI contributions.\n\n* `Judson Neer <https://github.com/lordjabez>`_ - Precaching, method caching support and numerous improvements and bugfixes.\n\n* `cthoyt <https://github.com/cthoyt>`_ - Base memory core implementation.\n\n* `amarczew <https://github.com/amarczew>`_ - The ``hash_func`` kwarg.\n\n* `non-senses <https://github.com/non-senses>`_ - The ``wait_for_calc_timeout`` kwarg.\n\n* `Elad Rapaport <https://github.com/erap129>`_ - Multi-file Pickle core, a.k.a ``separate_files`` (released on ``v1.5.3``).\n\n* `John Didion <https://github.com/jdidion>`_ - Support for pickle-based caching for cases where two identically-named methods of different classes are defined in the same module.\n\nNotable bugfixers:\n\n* `MichaelRazum <https://github.com/MichaelRazum>`_.\n\n* `Eric Ma <https://github.com/ericmjl>`_ - The iNotify bugfix (released on ``v1.5.3``).\n\n* `Ofir <https://github.com/ofirnk>`_ - The iNotify bugfix (released on ``v1.5.3``).\n\n\n\n.. |PyPI-Status| image:: https://img.shields.io/pypi/v/cachier.svg\n  :target: https://pypi.python.org/pypi/cachier\n\n.. |PyPI-Versions| image:: https://img.shields.io/pypi/pyversions/cachier.svg\n   :target: https://pypi.python.org/pypi/cachier\n\n.. |Build-Status| image:: https://github.com/python-cachier/cachier/actions/workflows/test.yml/badge.svg\n  :target: https://github.com/python-cachier/cachier/actions/workflows/test.yml\n\n.. |LICENCE| image:: https://img.shields.io/pypi/l/cachier.svg\n  :target: https://pypi.python.org/pypi/cachier\n\n.. |Codecov| image:: https://codecov.io/github/python-cachier/cachier/coverage.svg?branch=master\n   :target: https://codecov.io/github/python-cachier/cachier?branch=master\n\n.. |Downloads| image:: https://pepy.tech/badge/cachier\n     :target: https://pepy.tech/project/cachier\n     :alt: PePy stats\n\n.. |Codefactor| image:: https://www.codefactor.io/repository/github/python-cachier/cachier/badge?style=plastic\n     :target: https://www.codefactor.io/repository/github/python-cachier/cachier\n     :alt: Codefactor code quality\n\n.. links:\n.. _pymongo: https://api.mongodb.com/python/current/\n.. _watchdog: https://github.com/gorakhargosh/watchdog\n"
    },
    {
      "path": "python-cachier_cachier/LICENSE",
      "content": "MIT License\n\nCopyright (c) 2016 Shay Palachy\n\nPermission is hereby granted, free of charge, to any person obtaining a copy\nof this software and associated documentation files (the \"Software\"), to deal\nin the Software without restriction, including without limitation the rights\nto use, copy, modify, merge, publish, distribute, sublicense, and/or sell\ncopies of the Software, and to permit persons to whom the Software is\nfurnished to do so, subject to the following conditions:\n\nThe above copyright notice and this permission notice shall be included in all\ncopies or substantial portions of the Software.\n\nTHE SOFTWARE IS PROVIDED \"AS IS\", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR\nIMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,\nFITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE\nAUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER\nLIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,\nOUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE\nSOFTWARE.\n"
    },
    {
      "path": "python-cachier_cachier/pyproject.toml",
      "content": "[project]\nname = \"cachier\"\ndescription = \"Persistent, stale-free, local and cross-machine caching for Python functions.\"\nreadme = \"README.rst\"\nkeywords = [\n  \"cache\",\n  \"caching\",\n  \"cross-machine\",\n  \"decorator\",\n  \"local\",\n  \"memoization\",\n  \"mongo\",\n  \"persistent\",\n]\nlicense = {file = \"LICENSE\"}\nauthors = [\n   {name = \"Shay Palachy Affek\", email = 'shay.palachy@gmail.com'},\n]\nclassifiers = [\n  'Development Status :: 4 - Beta',\n  'Intended Audience :: Developers',\n  'License :: OSI Approved :: MIT License',\n  'Programming Language :: Python',\n  \"Programming Language :: Python :: 3 :: Only\",\n  \"Programming Language :: Python :: 3.8\",\n  \"Programming Language :: Python :: 3.9\",\n  \"Programming Language :: Python :: 3.10\",\n  \"Programming Language :: Python :: 3.11\",\n  \"Programming Language :: Python :: 3.12\",\n  \"Topic :: Other/Nonlisted Topic\",\n  \"Topic :: Software Development :: Libraries\",\n  \"Topic :: Software Development :: Libraries :: Python Modules\",\n  \"Topic :: Utilities\",\n]\ndynamic = [\n  \"version\",\n]\ndependencies = [\n  \"portalocker>=2.3.2\",\n  \"watchdog>=2.3.1\",\n]\n[project.urls]\nSource = \"https://github.com/python-cachier/cachier\"\n[project.scripts]\ncachier = \"cachier.__naim__:cli\"\n\n[tool.setuptools]\ninclude-package-data = true\n\n[tool.setuptools.dynamic]\nversion = {attr = \"cachier._version.__version__\"}\n\n[tool.setuptools.packages.find]\nwhere = [\"src\"]  # list of folders that contain the packages ([\".\"] by default)\ninclude = [\"cachier*\"]  # package names should match these glob patterns ([\"*\"] by default)\nnamespaces = false  # to disable scanning PEP 420 namespaces (true by default)\n\n[build-system]\nrequires = [\n  \"setuptools\",\n  \"wheel\",\n]\n\n[tool.pytest.ini_options]\ntestpaths = [\"cachier\", \"tests\"]\nnorecursedirs=[\"dist\", \"build\"]\naddopts = [\n    \"--color=yes\",\n    \"--cov=cachier\",\n    \"--cov-report=term\",\n    \"--cov-report=xml:cov.xml\",\n    \"-r a\",\n    \"-v\",\n    \"-s\",\n]\nmarkers = [\n    \"mongo: test the MongoDB core\",\n    \"memory: test the memory core\",\n    \"pickle: test the pickle core\"\n]\n\n[tool.docformatter]\nrecursive = true\n# some docstring start with r\"\"\"\nwrap-summaries = 79\nwrap-descriptions = 79\nblank = true\n\n[tool.ruff]\ntarget-version = \"py38\"\nline-length = 79\n# Enable Pyflakes `E` and `F` codes by default.\nlint.select = [\n    \"E\",\n    \"W\", # see: https://pypi.org/project/pycodestyle\n    \"F\", # see: https://pypi.org/project/pyflakes\n    \"I\", #see: https://pypi.org/project/isort/\n    \"D\", # see: https://pypi.org/project/pydocstyle\n    #\"N\", # see: https://pypi.org/project/pep8-naming\n    \"S\", # see: https://pypi.org/project/flake8-bandit\n    \"SIM\",\n    \"RUF100\" # alternative to yesqa\n]\nlint.extend-select = [\n    \"A\",    # see: https://pypi.org/project/flake8-builtins\n    \"B\",    # see: https://pypi.org/project/flake8-bugbear\n    \"C4\",   # see: https://pypi.org/project/flake8-comprehensions\n]\nlint.ignore = [\n    \"E203\",\n    \"C901\",\n]\n# Exclude a variety of commonly ignored directories.\nexclude = [\n    \".eggs\",\n    \".git\",\n    \".ruff_cache\",\n    \"__pypackages__\",\n    \"_build\",\n    \"build\",\n    \"dist\",\n]\nlint.ignore-init-module-imports = true\nlint.unfixable = [\"F401\"]\n\n[tool.ruff.lint.per-file-ignores]\n\"src/**/__init__.py\" = [\"D104\"]\n\"src/cachier/config.py\" = [\"D100\"]\n\"tests/**\" = [\n  \"S101\", \"S311\", \"S105\", \"S603\",\n  \"D100\", \"D101\", \"D103\", \"D104\", \"D401\"\n]\n\n#[tool.ruff.pydocstyle]\n## Use Google-style docstrings.\n#convention = \"google\"\n\n#[tool.ruff.pycodestyle]\n#ignore-overlong-task-comments = true\n\n[tool.ruff.lint.mccabe]\n# Unlike Flake8, default to a complexity level of 10.\nmax-complexity = 10\n\n[tool.coverage.run]\nbranch = true\ndynamic_context = \"test_function\"\nomit = [\n    \"tests/*\",\n    \"cachier/_version.py\",\n    \"cachier/__init__.py\",\n    \"**/scripts/**\",\n]\n[tool.coverage.report]\nshow_missing = true\n# Regexes for lines to exclude from consideration\nexclude_lines = [\n    \"pragma: no cover\", # Have to re-enable the standard pragma\n    \"raise NotImplementedError\", # Don't complain if tests don't hit defensive assertion code:\n    \"if TYPE_CHECKING:\", # Is only true when running mypy, not tests\n]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/__init__.py",
      "content": "from ._version import *  # noqa: F403\nfrom .config import (\n    disable_caching,\n    enable_caching,\n    get_default_params,\n    set_default_params,\n)\nfrom .core import cachier\n\n__all__ = [\n    \"cachier\",\n    \"set_default_params\",\n    \"get_default_params\",\n    \"enable_caching\",\n    \"disable_caching\",\n]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/config.py",
      "content": "import datetime\nimport hashlib\nimport os\nimport pickle\nfrom typing import Optional, TypedDict, Union\n\nfrom ._types import Backend, HashFunc, Mongetter\n\n\ndef _default_hash_func(args, kwds):\n    # Sort the kwargs to ensure consistent ordering\n    sorted_kwargs = sorted(kwds.items())\n    # Serialize args and sorted_kwargs using pickle or similar\n    serialized = pickle.dumps((args, sorted_kwargs))\n    # Create a hash of the serialized data\n    return hashlib.sha256(serialized).hexdigest()\n\n\nclass Params(TypedDict):\n    \"\"\"Type definition for cachier parameters.\"\"\"\n\n    caching_enabled: bool\n    hash_func: HashFunc\n    backend: Backend\n    mongetter: Optional[Mongetter]\n    stale_after: datetime.timedelta\n    next_time: bool\n    cache_dir: Union[str, os.PathLike]\n    pickle_reload: bool\n    separate_files: bool\n    wait_for_calc_timeout: int\n    allow_none: bool\n\n\n_default_params: Params = {\n    \"caching_enabled\": True,\n    \"hash_func\": _default_hash_func,\n    \"backend\": \"pickle\",\n    \"mongetter\": None,\n    \"stale_after\": datetime.timedelta.max,\n    \"next_time\": False,\n    \"cache_dir\": \"~/.cachier/\",\n    \"pickle_reload\": True,\n    \"separate_files\": False,\n    \"wait_for_calc_timeout\": 0,\n    \"allow_none\": False,\n}\n\n\ndef _update_with_defaults(\n    param, name: str, func_kwargs: Optional[dict] = None\n):\n    import cachier\n\n    if func_kwargs:\n        kw_name = f\"cachier__{name}\"\n        if kw_name in func_kwargs:\n            return func_kwargs.pop(kw_name)\n    if param is None:\n        return cachier.config._default_params[name]\n    return param\n\n\ndef set_default_params(**params):\n    \"\"\"Configure global parameters applicable to all memoized functions.\n\n    This function takes the same keyword parameters as the ones defined in the\n    decorator, which can be passed all at once or with multiple calls.\n    Parameters given directly to a decorator take precedence over any values\n    set by this function.\n\n    Only 'stale_after', 'next_time', and 'wait_for_calc_timeout' can be changed\n    after the memoization decorator has been applied. Other parameters will\n    only have an effect on decorators applied after this function is run.\n\n    \"\"\"\n    import cachier\n\n    valid_params = (\n        p for p in params.items() if p[0] in cachier.config._default_params\n    )\n    _default_params.update(valid_params)\n\n\ndef get_default_params():\n    \"\"\"Get current set of default parameters.\"\"\"\n    import cachier\n\n    return cachier.config._default_params\n\n\ndef enable_caching():\n    \"\"\"Enable caching globally.\"\"\"\n    import cachier\n\n    cachier.config._default_params[\"caching_enabled\"] = True\n\n\ndef disable_caching():\n    \"\"\"Disable caching globally.\"\"\"\n    import cachier\n\n    cachier.config._default_params[\"caching_enabled\"] = False\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/_version.py",
      "content": "# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2024, Jirka Borovec <***@gmail.com>\n\nimport os\n\n_PATH_HERE = os.path.dirname(__file__)\n_PATH_VERSION = os.path.join(_PATH_HERE, \"version.info\")\n_RELEASING_PROCESS = os.getenv(\"RELEASING_PROCESS\", \"0\") == \"1\"\n\nwith open(_PATH_VERSION) as fopen:\n    __version__ = fopen.read().strip()\n\n\ndef _get_git_sha() -> str:\n    from subprocess import DEVNULL, check_output\n\n    out = check_output([\"git\", \"rev-parse\", \"--short\", \"HEAD\"], stderr=DEVNULL)  # noqa: S603, S607\n    return out.decode(\"utf-8\").strip()\n\n\nif not _RELEASING_PROCESS:\n    try:\n        sha_short = _get_git_sha()\n        # print(f\"Version enriched with git commit hash: {__version__}.\")\n    except Exception:\n        # print(\"Failed to get the git commit hash,\"\n        #       f\" falling back to base version {__version__}.\")\n        sha_short = \"\"\n    __version__ += f\".dev+{sha_short}\" if sha_short else \".dev\"\n\n\n__all__ = [\"__version__\"]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/version.info",
      "content": "3.0.0\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/py.typed",
      "content": ""
    },
    {
      "path": "python-cachier_cachier/src/cachier/_types.py",
      "content": "from typing import TYPE_CHECKING, Callable, Literal\n\nif TYPE_CHECKING:\n    import pymongo.collection\n\n\nHashFunc = Callable[..., str]\nMongetter = Callable[[], \"pymongo.collection.Collection\"]\nBackend = Literal[\"pickle\", \"mongo\", \"memory\"]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/__main__.py",
      "content": "\"\"\"A command-line interface for cachier.\"\"\"\n\nimport click\n\nfrom cachier.core import _set_max_workers\n\n\n@click.group()\ndef cli():\n    \"\"\"A command-line interface for cachier.\"\"\"  # noqa: D401\n\n\n@cli.command(\"Limits the number of worker threads used by cachier.\")\n@click.argument(\"max_workers\", nargs=1, type=int)\ndef set_max_workers(max_workers):\n    \"\"\"Limits the number of worker threads used by cachier.\"\"\"\n    _set_max_workers(max_workers)\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/core.py",
      "content": "\"\"\"Persistent, stale-free memoization decorators for Python.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\nimport datetime\nimport inspect\nimport os\nimport warnings\nfrom collections import OrderedDict\nfrom concurrent.futures import ThreadPoolExecutor\nfrom functools import wraps\nfrom typing import Optional, Union\nfrom warnings import warn\n\nfrom .config import (\n    Backend,\n    HashFunc,\n    Mongetter,\n    _default_params,\n    _update_with_defaults,\n)\nfrom .cores.base import RecalculationNeeded, _BaseCore\nfrom .cores.memory import _MemoryCore\nfrom .cores.mongo import _MongoCore\nfrom .cores.pickle import _PickleCore\n\nMAX_WORKERS_ENVAR_NAME = \"CACHIER_MAX_WORKERS\"\nDEFAULT_MAX_WORKERS = 8\n\n\ndef _max_workers():\n    return int(os.environ.get(MAX_WORKERS_ENVAR_NAME, DEFAULT_MAX_WORKERS))\n\n\ndef _set_max_workers(max_workers):\n    os.environ[MAX_WORKERS_ENVAR_NAME] = str(max_workers)\n    _get_executor(True)\n\n\ndef _get_executor(reset=False):\n    if reset or not hasattr(_get_executor, \"executor\"):\n        _get_executor.executor = ThreadPoolExecutor(_max_workers())\n    return _get_executor.executor\n\n\ndef _function_thread(core, key, func, args, kwds):\n    try:\n        func_res = func(*args, **kwds)\n        core.set_entry(key, func_res)\n    except BaseException as exc:\n        print(f\"Function call failed with the following exception:\\n{exc}\")\n\n\ndef _calc_entry(core, key, func, args, kwds):\n    try:\n        core.mark_entry_being_calculated(key)\n        # _get_executor().submit(core.mark_entry_being_calculated, key)\n        func_res = func(*args, **kwds)\n        core.set_entry(key, func_res)\n        # _get_executor().submit(core.set_entry, key, func_res)\n        return func_res\n    finally:\n        core.mark_entry_not_calculated(key)\n\n\ndef _convert_args_kwargs(\n    func, _is_method: bool, args: tuple, kwds: dict\n) -> dict:\n    \"\"\"Convert mix of positional and keyword arguments to aggregated kwargs.\"\"\"\n    # unwrap if the function is functools.partial\n    if hasattr(func, \"func\"):\n        args = func.args + args\n        kwds.update({k: v for k, v in func.keywords.items() if k not in kwds})\n        func = func.func\n    func_params = list(inspect.signature(func).parameters)\n    args_as_kw = dict(\n        zip(func_params[1:], args[1:])\n        if _is_method\n        else zip(func_params, args)\n    )\n    # init with default values\n    kwargs = {\n        k: v.default\n        for k, v in inspect.signature(func).parameters.items()\n        if v.default is not inspect.Parameter.empty\n    }\n    # merge args expanded as kwargs and the original kwds\n    kwargs.update(dict(**args_as_kw, **kwds))\n    return OrderedDict(sorted(kwargs.items()))\n\n\ndef _pop_kwds_with_deprecation(kwds, name: str, default_value: bool):\n    if name in kwds:\n        warnings.warn(\n            f\"`{name}` is deprecated and will be removed in a future release,\"\n            \" use `cachier__` alternative instead.\",\n            DeprecationWarning,\n            stacklevel=2,\n        )\n    return kwds.pop(name, default_value)\n\n\ndef cachier(\n    hash_func: Optional[HashFunc] = None,\n    hash_params: Optional[HashFunc] = None,\n    backend: Optional[Backend] = None,\n    mongetter: Optional[Mongetter] = None,\n    stale_after: Optional[datetime.timedelta] = None,\n    next_time: Optional[bool] = None,\n    cache_dir: Optional[Union[str, os.PathLike]] = None,\n    pickle_reload: Optional[bool] = None,\n    separate_files: Optional[bool] = None,\n    wait_for_calc_timeout: Optional[int] = None,\n    allow_none: Optional[bool] = None,\n):\n    \"\"\"Wrap as a persistent, stale-free memoization decorator.\n\n    The positional and keyword arguments to the wrapped function must be\n    hashable (i.e. Python's immutable built-in objects, not mutable\n    containers). Also, notice that since objects which are instances of\n    user-defined classes are hashable but all compare unequal (their hash\n    value is their id), equal objects across different sessions will not yield\n    identical keys.\n\n    Arguments:\n    ---------\n    hash_func : callable, optional\n        A callable that gets the args and kwargs from the decorated function\n        and returns a hash key for them. This parameter can be used to enable\n        the use of cachier with functions that get arguments that are not\n        automatically hashable by Python.\n    hash_params : callable, optional\n    backend : str, optional\n        The name of the backend to use. Valid options currently include\n        'pickle', 'mongo' and 'memory'. If not provided, defaults to\n        'pickle' unless the 'mongetter' argument is passed, in which\n        case the mongo backend is automatically selected.\n    mongetter : callable, optional\n        A callable that takes no arguments and returns a pymongo.Collection\n        object with writing permissions. If unset a local pickle cache is used\n        instead.\n    stale_after : datetime.timedelta, optional\n        The time delta after which a cached result is considered stale. Calls\n        made after the result goes stale will trigger a recalculation of the\n        result, but whether a stale or fresh result will be returned is\n        determined by the optional next_time argument.\n    next_time : bool, optional\n        If set to True, a stale result will be returned when finding one, not\n        waiting for the calculation of the fresh result to return. Defaults to\n        False.\n    cache_dir : str, optional\n        A fully qualified path to a file directory to be used for cache files.\n        The running process must have running permissions to this folder. If\n        not provided, a default directory at `~/.cachier/` is used.\n    pickle_reload : bool, optional\n        If set to True, in-memory cache will be reloaded on each cache read,\n        enabling different threads to share cache. Should be set to False for\n        faster reads in single-thread programs. Defaults to True.\n    separate_files: bool, default False, for Pickle cores only\n        Instead of a single cache file per-function, each function's cache is\n        split between several files, one for each argument set. This can help\n        if you per-function cache files become too large.\n    wait_for_calc_timeout: int, optional, for MongoDB only\n        The maximum time to wait for an ongoing calculation. When a\n        process started to calculate the value setting being_calculated to\n        True, any process trying to read the same entry will wait a maximum of\n        seconds specified in this parameter. 0 means wait forever.\n        Once the timeout expires the calculation will be triggered.\n    allow_none: bool, optional\n        Allows storing None values in the cache. If False, functions returning\n        None will not be cached and are recalculated every call.\n\n    \"\"\"\n    # Check for deprecated parameters\n    if hash_params is not None:\n        message = (\n            \"hash_params will be removed in a future release, \"\n            \"please use hash_func instead\"\n        )\n        warn(message, DeprecationWarning, stacklevel=2)\n        hash_func = hash_params\n    # Update parameters with defaults if input is None\n    backend = _update_with_defaults(backend, \"backend\")\n    mongetter = _update_with_defaults(mongetter, \"mongetter\")\n    # Override the backend parameter if a mongetter is provided.\n    if callable(mongetter):\n        backend = \"mongo\"\n    core: _BaseCore\n    if backend == \"pickle\":\n        core = _PickleCore(\n            hash_func=hash_func,\n            pickle_reload=pickle_reload,\n            cache_dir=cache_dir,\n            separate_files=separate_files,\n            wait_for_calc_timeout=wait_for_calc_timeout,\n        )\n    elif backend == \"mongo\":\n        core = _MongoCore(\n            hash_func=hash_func,\n            mongetter=mongetter,\n            wait_for_calc_timeout=wait_for_calc_timeout,\n        )\n    elif backend == \"memory\":\n        core = _MemoryCore(\n            hash_func=hash_func, wait_for_calc_timeout=wait_for_calc_timeout\n        )\n    else:\n        raise ValueError(\"specified an invalid core: %s\" % backend)\n\n    def _cachier_decorator(func):\n        core.set_func(func)\n\n        @wraps(func)\n        def func_wrapper(*args, **kwds):\n            nonlocal allow_none\n            _allow_none = _update_with_defaults(allow_none, \"allow_none\", kwds)\n            # print('Inside general wrapper for {}.'.format(func.__name__))\n            ignore_cache = _pop_kwds_with_deprecation(\n                kwds, \"ignore_cache\", False\n            )\n            overwrite_cache = _pop_kwds_with_deprecation(\n                kwds, \"overwrite_cache\", False\n            )\n            verbose = _pop_kwds_with_deprecation(kwds, \"verbose_cache\", False)\n            ignore_cache = kwds.pop(\"cachier__skip_cache\", ignore_cache)\n            overwrite_cache = kwds.pop(\n                \"cachier__overwrite_cache\", overwrite_cache\n            )\n            verbose = kwds.pop(\"cachier__verbose\", verbose)\n            _stale_after = _update_with_defaults(\n                stale_after, \"stale_after\", kwds\n            )\n            _next_time = _update_with_defaults(next_time, \"next_time\", kwds)\n            # merge args expanded as kwargs and the original kwds\n            kwargs = _convert_args_kwargs(\n                func, _is_method=core.func_is_method, args=args, kwds=kwds\n            )\n\n            _print = lambda x: None  # noqa: E731\n            if verbose:\n                _print = print\n            if ignore_cache or not _default_params[\"caching_enabled\"]:\n                return func(**kwargs)\n            key, entry = core.get_entry((), kwargs)\n            if overwrite_cache:\n                return _calc_entry(core, key, func, args, kwds)\n            if entry is None:\n                _print(\"No entry found. No current calc. Calling like a boss.\")\n                return _calc_entry(core, key, func, args, kwds)\n            _print(\"Entry found.\")\n            if _allow_none or entry.get(\"value\", None) is not None:\n                _print(\"Cached result found.\")\n                now = datetime.datetime.now()\n                if now - entry[\"time\"] <= _stale_after:\n                    _print(\"And it is fresh!\")\n                    return entry[\"value\"]\n                _print(\"But it is stale... :(\")\n                if entry[\"being_calculated\"]:\n                    if _next_time:\n                        _print(\"Returning stale.\")\n                        return entry[\"value\"]  # return stale val\n                    _print(\"Already calc. Waiting on change.\")\n                    try:\n                        return core.wait_on_entry_calc(key)\n                    except RecalculationNeeded:\n                        return _calc_entry(core, key, func, args, kwds)\n                if _next_time:\n                    _print(\"Async calc and return stale\")\n                    try:\n                        core.mark_entry_being_calculated(key)\n                        _get_executor().submit(\n                            _function_thread, core, key, func, args, kwds\n                        )\n                    finally:\n                        core.mark_entry_not_calculated(key)\n                    return entry[\"value\"]\n                _print(\"Calling decorated function and waiting\")\n                return _calc_entry(core, key, func, args, kwds)\n            if entry[\"being_calculated\"]:\n                _print(\"No value but being calculated. Waiting.\")\n                try:\n                    return core.wait_on_entry_calc(key)\n                except RecalculationNeeded:\n                    return _calc_entry(core, key, func, args, kwds)\n            _print(\"No entry found. No current calc. Calling like a boss.\")\n            return _calc_entry(core, key, func, args, kwds)\n\n        def _clear_cache():\n            \"\"\"Clear the cache.\"\"\"\n            core.clear_cache()\n\n        def _clear_being_calculated():\n            \"\"\"Mark all entries in this cache as not being calculated.\"\"\"\n            core.clear_being_calculated()\n\n        def _cache_dpath():\n            \"\"\"Return the path to the cache dir, if exists; None if not.\"\"\"\n            return getattr(core, \"cache_dir\", None)\n\n        def _precache_value(*args, value_to_cache, **kwds):  # noqa: D417\n            \"\"\"Add an initial value to the cache.\n\n            Arguments:\n            ---------\n            value_to_cache : any\n                entry to be written into the cache\n\n            \"\"\"\n            # merge args expanded as kwargs and the original kwds\n            kwargs = _convert_args_kwargs(\n                func, _is_method=core.func_is_method, args=args, kwds=kwds\n            )\n            return core.precache_value((), kwargs, value_to_cache)\n\n        func_wrapper.clear_cache = _clear_cache\n        func_wrapper.clear_being_calculated = _clear_being_calculated\n        func_wrapper.cache_dpath = _cache_dpath\n        func_wrapper.precache_value = _precache_value\n        return func_wrapper\n\n    return _cachier_decorator\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/__init__.py",
      "content": ""
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/base.py",
      "content": "\"\"\"Defines the interface of a cachier caching core.\"\"\"\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\nimport abc  # for the _BaseCore abstract base class\nimport inspect\nimport threading\nfrom typing import Callable\n\nfrom .._types import HashFunc\nfrom ..config import _update_with_defaults\n\n\nclass RecalculationNeeded(Exception):\n    \"\"\"Exception raised when a recalculation is needed.\"\"\"\n\n    pass\n\n\ndef _get_func_str(func: Callable) -> str:\n    return f\".{func.__module__}.{func.__name__}\"\n\n\nclass _BaseCore:\n    __metaclass__ = abc.ABCMeta\n\n    def __init__(self, hash_func: HashFunc, wait_for_calc_timeout: int):\n        self.hash_func = _update_with_defaults(hash_func, \"hash_func\")\n        self.wait_for_calc_timeout = wait_for_calc_timeout\n        self.lock = threading.RLock()\n\n    def set_func(self, func):\n        \"\"\"Set the function this core will use.\n\n        This has to be set before any method is called. Also determine if the\n        function is an object method.\n\n        \"\"\"\n        # unwrap if the function is functools.partial\n        if hasattr(func, \"func\"):\n            func = func.func\n        func_params = list(inspect.signature(func).parameters)\n        self.func_is_method = func_params and func_params[0] == \"self\"\n        self.func = func\n\n    def get_key(self, args, kwds):\n        \"\"\"Return a unique key based on the arguments provided.\"\"\"\n        return self.hash_func(args, kwds)\n\n    def get_entry(self, args, kwds):\n        \"\"\"Get entry based on given arguments.\n\n        Return the result mapped to the given arguments in this core's cache,\n        if such a mapping exists.\n\n        \"\"\"\n        key = self.get_key(args, kwds)\n        return self.get_entry_by_key(key)\n\n    def precache_value(self, args, kwds, value_to_cache):\n        \"\"\"Write a precomputed value into the cache.\"\"\"\n        key = self.get_key(args, kwds)\n        self.set_entry(key, value_to_cache)\n        return value_to_cache\n\n    def check_calc_timeout(self, time_spent):\n        \"\"\"Raise an exception if a recalculation is needed.\"\"\"\n        calc_timeout = _update_with_defaults(\n            self.wait_for_calc_timeout, \"wait_for_calc_timeout\"\n        )\n        if calc_timeout > 0 and (time_spent >= calc_timeout):\n            raise RecalculationNeeded()\n\n    @abc.abstractmethod\n    def get_entry_by_key(self, key):\n        \"\"\"Get entry based on given key.\n\n        Return the result mapped to the given key in this core's cache, if such\n        a mapping exists.\n\n        \"\"\"\n\n    @abc.abstractmethod\n    def set_entry(self, key, func_res):\n        \"\"\"Map the given result to the given key in this core's cache.\"\"\"\n\n    @abc.abstractmethod\n    def mark_entry_being_calculated(self, key):\n        \"\"\"Mark the entry mapped by the given key as being calculated.\"\"\"\n\n    @abc.abstractmethod\n    def mark_entry_not_calculated(self, key):\n        \"\"\"Mark the entry mapped by the given key as not being calculated.\"\"\"\n\n    @abc.abstractmethod\n    def wait_on_entry_calc(self, key):\n        \"\"\"Wait on the entry with keys being calculated and returns result.\"\"\"\n\n    @abc.abstractmethod\n    def clear_cache(self):\n        \"\"\"Clear the cache of this core.\"\"\"\n\n    @abc.abstractmethod\n    def clear_being_calculated(self):\n        \"\"\"Mark all entries in this cache as not being calculated.\"\"\"\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/pickle.py",
      "content": "\"\"\"A pickle-based caching core for cachier.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\nimport os\nimport pickle  # for local caching\nfrom contextlib import suppress\nfrom datetime import datetime\n\nimport portalocker  # to lock on pickle cache IO\nfrom watchdog.events import PatternMatchingEventHandler\nfrom watchdog.observers import Observer\n\nfrom .._types import HashFunc\nfrom ..config import _update_with_defaults\n\n# Alternative:  https://github.com/WoLpH/portalocker\nfrom .base import _BaseCore\n\n\nclass _PickleCore(_BaseCore):\n    \"\"\"The pickle core class for cachier.\"\"\"\n\n    class CacheChangeHandler(PatternMatchingEventHandler):\n        \"\"\"Handles cache-file modification events.\"\"\"\n\n        def __init__(self, filename, core, key):\n            PatternMatchingEventHandler.__init__(\n                self,\n                patterns=[\"*\" + filename],\n                ignore_patterns=None,\n                ignore_directories=True,\n                case_sensitive=False,\n            )\n            self.core = core\n            self.key = key\n            self.observer = None\n            self.value = None\n\n        def inject_observer(self, observer):\n            \"\"\"Inject the observer running this handler.\"\"\"\n            self.observer = observer\n\n        def _check_calculation(self):\n            # print('checking calc')\n            entry = self.core.get_entry_by_key(self.key, True)[1]\n            # print(self.key)\n            # print(entry)\n            try:\n                if not entry[\"being_calculated\"]:\n                    # print('stopping observer!')\n                    self.value = entry[\"value\"]\n                    self.observer.stop()\n                # else:\n                # print('NOT stopping observer... :(')\n            except TypeError:\n                self.value = None\n                self.observer.stop()\n\n        def on_created(self, event):\n            \"\"\"A Watchdog Event Handler method.\"\"\"  # noqa: D401\n            self._check_calculation()  # pragma: no cover\n\n        def on_modified(self, event):\n            \"\"\"A Watchdog Event Handler method.\"\"\"  # noqa: D401\n            self._check_calculation()\n\n    def __init__(\n        self,\n        hash_func: HashFunc,\n        pickle_reload: bool,\n        cache_dir: str,\n        separate_files: bool,\n        wait_for_calc_timeout: int,\n    ):\n        super().__init__(hash_func, wait_for_calc_timeout)\n        self.cache = None\n        self.reload = _update_with_defaults(pickle_reload, \"pickle_reload\")\n        self.cache_dir = os.path.expanduser(\n            _update_with_defaults(cache_dir, \"cache_dir\")\n        )\n        self.separate_files = _update_with_defaults(\n            separate_files, \"separate_files\"\n        )\n\n    @property\n    def cache_fname(self) -> str:\n        fname = f\".{self.func.__module__}.{self.func.__qualname__}\"\n        return fname.replace(\"<\", \"_\").replace(\">\", \"_\")\n\n    @property\n    def cache_fpath(self) -> str:\n        os.makedirs(self.cache_dir, exist_ok=True)\n        return os.path.abspath(\n            os.path.join(os.path.realpath(self.cache_dir), self.cache_fname)\n        )\n\n    def _reload_cache(self):\n        with self.lock:\n            try:\n                with portalocker.Lock(\n                    self.cache_fpath, mode=\"rb\"\n                ) as cache_file:\n                    self.cache = pickle.load(cache_file)  # noqa: S301\n            except (FileNotFoundError, EOFError):\n                self.cache = {}\n\n    def _get_cache(self):\n        with self.lock:\n            if not self.cache:\n                self._reload_cache()\n            return self.cache\n\n    def _get_cache_by_key(self, key=None, hash_str=None):\n        fpath = self.cache_fpath\n        fpath += f\"_{hash_str or key}\"\n        try:\n            with portalocker.Lock(fpath, mode=\"rb\") as cache_file:\n                return pickle.load(cache_file)  # noqa: S301\n        except (FileNotFoundError, EOFError):\n            return None\n\n    def _clear_all_cache_files(self):\n        path, name = os.path.split(self.cache_fpath)\n        for subpath in os.listdir(path):\n            if subpath.startswith(f\"{name}_\"):\n                os.remove(os.path.join(path, subpath))\n\n    def _clear_being_calculated_all_cache_files(self):\n        path, name = os.path.split(self.cache_fpath)\n        for subpath in os.listdir(path):\n            if subpath.startswith(name):\n                entry = self._get_cache_by_key(hash_str=subpath.split(\"_\")[-1])\n                if entry is not None:\n                    entry[\"being_calculated\"] = False\n                    self._save_cache(entry, hash_str=subpath.split(\"_\")[-1])\n\n    def _save_cache(self, cache, key=None, hash_str=None):\n        fpath = self.cache_fpath\n        if key is not None:\n            fpath += f\"_{key}\"\n        elif hash_str is not None:\n            fpath += f\"_{hash_str}\"\n        with self.lock:\n            self.cache = cache\n            with portalocker.Lock(fpath, mode=\"wb\") as cache_file:\n                pickle.dump(cache, cache_file, protocol=4)\n            if key is None:\n                self._reload_cache()\n\n    def get_entry_by_key(self, key, reload=False):\n        with self.lock:\n            if self.separate_files:\n                return key, self._get_cache_by_key(key)\n            if self.reload or reload:\n                self._reload_cache()\n            return key, self._get_cache().get(key, None)\n\n    def set_entry(self, key, func_res):\n        key_data = {\n            \"value\": func_res,\n            \"time\": datetime.now(),\n            \"stale\": False,\n            \"being_calculated\": False,\n        }\n        if self.separate_files:\n            self._save_cache(key_data, key)\n            return  # pragma: no cover\n\n        with self.lock:\n            cache = self._get_cache()\n            cache[key] = key_data\n            self._save_cache(cache)\n\n    def mark_entry_being_calculated_separate_files(self, key):\n        self._save_cache(\n            {\n                \"value\": None,\n                \"time\": datetime.now(),\n                \"stale\": False,\n                \"being_calculated\": True,\n            },\n            key=key,\n        )\n\n    def mark_entry_not_calculated_separate_files(self, key):\n        _, entry = self.get_entry_by_key(key)\n        entry[\"being_calculated\"] = False\n        self._save_cache(entry, key=key)\n\n    def mark_entry_being_calculated(self, key):\n        if self.separate_files:\n            self.mark_entry_being_calculated_separate_files(key)\n            return  # pragma: no cover\n\n        with self.lock:\n            cache = self._get_cache()\n            try:\n                cache[key][\"being_calculated\"] = True\n            except KeyError:\n                cache[key] = {\n                    \"value\": None,\n                    \"time\": datetime.now(),\n                    \"stale\": False,\n                    \"being_calculated\": True,\n                }\n            self._save_cache(cache)\n\n    def mark_entry_not_calculated(self, key):\n        if self.separate_files:\n            self.mark_entry_not_calculated_separate_files(key)\n        with self.lock:\n            cache = self._get_cache()\n            # that's ok, we don't need an entry in that case\n            with suppress(KeyError):\n                cache[key][\"being_calculated\"] = False\n                self._save_cache(cache)\n\n    def wait_on_entry_calc(self, key):\n        if self.separate_files:\n            entry = self._get_cache_by_key(key)\n            filename = f\"{self.cache_fname}_{key}\"\n        else:\n            with self.lock:\n                self._reload_cache()\n                entry = self._get_cache()[key]\n            filename = self.cache_fname\n        if not entry[\"being_calculated\"]:\n            return entry[\"value\"]\n        event_handler = _PickleCore.CacheChangeHandler(\n            filename=filename, core=self, key=key\n        )\n        observer = Observer()\n        event_handler.inject_observer(observer)\n        observer.schedule(event_handler, path=self.cache_dir, recursive=True)\n        observer.start()\n        time_spent = 0\n        while observer.is_alive():\n            observer.join(timeout=1.0)\n            time_spent += 1\n            self.check_calc_timeout(time_spent)\n        return event_handler.value\n\n    def clear_cache(self):\n        if self.separate_files:\n            self._clear_all_cache_files()\n        else:\n            self._save_cache({})\n\n    def clear_being_calculated(self):\n        if self.separate_files:\n            self._clear_being_calculated_all_cache_files()\n            return  # pragma: no cover\n\n        with self.lock:\n            cache = self._get_cache()\n            for key in cache:\n                cache[key][\"being_calculated\"] = False\n            self._save_cache(cache)\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/memory.py",
      "content": "\"\"\"A memory-based caching core for cachier.\"\"\"\n\nimport threading\nfrom datetime import datetime\n\nfrom .._types import HashFunc\nfrom .base import _BaseCore, _get_func_str\n\n\nclass _MemoryCore(_BaseCore):\n    \"\"\"The memory core class for cachier.\"\"\"\n\n    def __init__(self, hash_func: HashFunc, wait_for_calc_timeout: int):\n        super().__init__(hash_func, wait_for_calc_timeout)\n        self.cache = {}\n\n    def _hash_func_key(self, key):\n        return f\"{_get_func_str(self.func)}:{key}\"\n\n    def get_entry_by_key(self, key, reload=False):\n        with self.lock:\n            return key, self.cache.get(self._hash_func_key(key), None)\n\n    def set_entry(self, key, func_res):\n        with self.lock:\n            try:\n                # we need to retain the existing condition so that\n                # mark_entry_not_calculated can notify all possibly-waiting\n                # threads about it\n                cond = self.cache[self._hash_func_key(key)][\"condition\"]\n            except KeyError:  # pragma: no cover\n                cond = None\n            self.cache[self._hash_func_key(key)] = {\n                \"value\": func_res,\n                \"time\": datetime.now(),\n                \"stale\": False,\n                \"being_calculated\": False,\n                \"condition\": cond,\n            }\n\n    def mark_entry_being_calculated(self, key):\n        with self.lock:\n            condition = threading.Condition()\n            # condition.acquire()\n            try:\n                self.cache[self._hash_func_key(key)][\"being_calculated\"] = True\n                self.cache[self._hash_func_key(key)][\"condition\"] = condition\n            except KeyError:\n                self.cache[self._hash_func_key(key)] = {\n                    \"value\": None,\n                    \"time\": datetime.now(),\n                    \"stale\": False,\n                    \"being_calculated\": True,\n                    \"condition\": condition,\n                }\n\n    def mark_entry_not_calculated(self, key):\n        with self.lock:\n            try:\n                entry = self.cache[self._hash_func_key(key)]\n            except KeyError:  # pragma: no cover\n                return  # that's ok, we don't need an entry in that case\n            entry[\"being_calculated\"] = False\n            cond = entry[\"condition\"]\n            if cond:\n                cond.acquire()\n                cond.notify_all()\n                cond.release()\n                entry[\"condition\"] = None\n\n    def wait_on_entry_calc(self, key):\n        with self.lock:  # pragma: no cover\n            entry = self.cache[self._hash_func_key(key)]\n            if not entry[\"being_calculated\"]:\n                return entry[\"value\"]\n        entry[\"condition\"].acquire()\n        entry[\"condition\"].wait()\n        entry[\"condition\"].release()\n        return self.cache[self._hash_func_key(key)][\"value\"]\n\n    def clear_cache(self):\n        with self.lock:\n            self.cache.clear()\n\n    def clear_being_calculated(self):\n        with self.lock:\n            for entry in self.cache.values():\n                entry[\"being_calculated\"] = False\n                entry[\"condition\"] = None\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/mongo.py",
      "content": "\"\"\"A MongoDB-based caching core for cachier.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\nimport pickle  # for serialization of python objects\nimport sys  # to make sure that pymongo was imported\nimport time  # to sleep when waiting on Mongo cache\\\nimport warnings  # to warn if pymongo is missing\nfrom contextlib import suppress\nfrom datetime import datetime\n\nfrom .._types import HashFunc, Mongetter\n\nwith suppress(ImportError):\n    from bson.binary import Binary  # to save binary data to mongodb\n    from pymongo import ASCENDING, IndexModel\n    from pymongo.errors import OperationFailure\n\nfrom .base import RecalculationNeeded, _BaseCore, _get_func_str\n\nMONGO_SLEEP_DURATION_IN_SEC = 1\n\n\nclass MissingMongetter(ValueError):\n    \"\"\"Thrown when the mongetter keyword argument is missing.\"\"\"\n\n\nclass _MongoCore(_BaseCore):\n    _INDEX_NAME = \"func_1_key_1\"\n\n    def __init__(\n        self,\n        hash_func: HashFunc,\n        mongetter: Mongetter,\n        wait_for_calc_timeout: int,\n    ):\n        if \"pymongo\" not in sys.modules:\n            warnings.warn(\n                \"`pymongo` was not found. MongoDB cores will not function.\",\n                ImportWarning,\n                stacklevel=2,\n            )  # pragma: no cover\n\n        super().__init__(hash_func, wait_for_calc_timeout)\n        if mongetter is None:\n            raise MissingMongetter(\n                \"must specify ``mongetter`` when using the mongo core\"\n            )\n        self.mongetter = mongetter\n        self.mongo_collection = self.mongetter()\n        index_inf = self.mongo_collection.index_information()\n        if _MongoCore._INDEX_NAME not in index_inf:\n            func1key1 = IndexModel(\n                keys=[(\"func\", ASCENDING), (\"key\", ASCENDING)],\n                name=_MongoCore._INDEX_NAME,\n            )\n            self.mongo_collection.create_indexes([func1key1])\n\n    @property\n    def _func_str(self) -> str:\n        return _get_func_str(self.func)\n\n    def get_entry_by_key(self, key):\n        res = self.mongo_collection.find_one(\n            {\"func\": self._func_str, \"key\": key}\n        )\n        if not res:\n            return key, None\n        try:\n            entry = {\n                \"value\": pickle.loads(res[\"value\"]),  # noqa: S301\n                \"time\": res.get(\"time\", None),\n                \"stale\": res.get(\"stale\", False),\n                \"being_calculated\": res.get(\"being_calculated\", False),\n            }\n        except KeyError:\n            entry = {\n                \"value\": None,\n                \"time\": res.get(\"time\", None),\n                \"stale\": res.get(\"stale\", False),\n                \"being_calculated\": res.get(\"being_calculated\", False),\n            }\n        return key, entry\n\n    def set_entry(self, key, func_res):\n        thebytes = pickle.dumps(func_res)\n        self.mongo_collection.update_one(\n            filter={\"func\": self._func_str, \"key\": key},\n            update={\n                \"$set\": {\n                    \"func\": self._func_str,\n                    \"key\": key,\n                    \"value\": Binary(thebytes),\n                    \"time\": datetime.now(),\n                    \"stale\": False,\n                    \"being_calculated\": False,\n                }\n            },\n            upsert=True,\n        )\n\n    def mark_entry_being_calculated(self, key):\n        self.mongo_collection.update_one(\n            filter={\"func\": self._func_str, \"key\": key},\n            update={\"$set\": {\"being_calculated\": True}},\n            upsert=True,\n        )\n\n    def mark_entry_not_calculated(self, key):\n        with suppress(OperationFailure):  # don't care in this case\n            self.mongo_collection.update_one(\n                filter={\n                    \"func\": self._func_str,\n                    \"key\": key,\n                },\n                update={\"$set\": {\"being_calculated\": False}},\n                upsert=False,  # should not insert in this case\n            )\n\n    def wait_on_entry_calc(self, key):\n        time_spent = 0\n        while True:\n            time.sleep(MONGO_SLEEP_DURATION_IN_SEC)\n            time_spent += MONGO_SLEEP_DURATION_IN_SEC\n            key, entry = self.get_entry_by_key(key)\n            if entry is None:\n                raise RecalculationNeeded()\n            if not entry[\"being_calculated\"]:\n                return entry[\"value\"]\n            self.check_calc_timeout(time_spent)\n\n    def clear_cache(self):\n        self.mongo_collection.delete_many(filter={\"func\": self._func_str})\n\n    def clear_being_calculated(self):\n        self.mongo_collection.update_many(\n            filter={\n                \"func\": self._func_str,\n                \"being_calculated\": True,\n            },\n            update={\"$set\": {\"being_calculated\": False}},\n        )\n"
    },
    {
      "path": "python-cachier_cachier/tests/__init__.py",
      "content": ""
    },
    {
      "path": "python-cachier_cachier/tests/standalone_script.py",
      "content": "import time\n\nimport cachier\n\n\n@cachier.cachier()\ndef _takes_3_seconds(label, value):\n    time.sleep(3)\n    return f\"{label} {value}\"\n\n\nprint(_takes_3_seconds(\"two\", 2))\n"
    },
    {
      "path": "python-cachier_cachier/tests/speed_eval.py",
      "content": "from random import random\nfrom time import time\n\nfrom cachier import cachier\n\n\n@cachier(next_time=True)\ndef _test_int_pickling(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    return int_1 + int_2\n\n\ndef _test_int_pickling_compare(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    return int_1 + int_2\n\n\ndef test_pickle_speed():\n    \"\"\"Test speeds.\"\"\"\n    print(\"Comparing speeds of decorated vs non-decorated functions...\")\n    num_of_vals = 1000\n    times = []\n    for i in range(1, num_of_vals):\n        tic = time()\n        _test_int_pickling_compare(i, i + 1)\n        toc = time()\n        times.append(toc - tic)\n    print(f\"  - Non-decorated average = {sum(times) / num_of_vals}\")\n\n    _test_int_pickling.clear_cache()\n    times = []\n    for i in range(1, num_of_vals):\n        tic = time()\n        _test_int_pickling(i, i + 1)\n        toc = time()\n        times.append(toc - tic)\n    print(f\"  - Decorated average = {sum(times) / num_of_vals}\")\n\n\n@cachier()\ndef _test_single_file_speed(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    # something that takes some memory\n    return [random() for _ in range(1000000)]\n\n\n@cachier(separate_files=True)\ndef _test_separate_files_speed(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    # something that takes some memory\n    return [random() for _ in range(1000000)]\n\n\ndef test_separate_files_vs_single_file():\n    _test_separate_files_speed.clear_cache()\n    _test_single_file_speed.clear_cache()\n    start_time = time()\n    for _ in range(3):\n        for j in range(10):\n            _test_separate_files_speed(j, 2)\n    print(f\"separate files time: {time() - start_time}\")\n    start_time = time()\n    for _ in range(3):\n        for j in range(10):\n            _test_single_file_speed(j, 2)\n    print(f\"single file time: {time() - start_time}\")\n\n\nif __name__ == \"__main__\":\n    test_pickle_speed()\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_memory_core.py",
      "content": "\"\"\"Test for the in-memory implementation of the  Cachier python package.\"\"\"\n\nimport hashlib\nimport queue\nimport threading\nfrom datetime import timedelta\nfrom random import random\nfrom time import sleep, time\n\nimport pandas as pd\nimport pytest\nfrom cachier import cachier\n\n\n@cachier(backend=\"memory\", next_time=False)\ndef _takes_2_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)\n    return f\"arg_1:{arg_1}, arg_2:{arg_2}\"\n\n\n@pytest.mark.memory\ndef test_memory_core():\n    \"\"\"Basic memory core functionality.\"\"\"\n    _takes_2_seconds.clear_cache()\n    _takes_2_seconds(\"a\", \"b\")\n    start = time()\n    _takes_2_seconds(\"a\", \"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds.clear_cache()\n\n\n@pytest.mark.memory\ndef test_memory_core_keywords():\n    \"\"\"Basic memory core functionality with keyword arguments.\"\"\"\n    _takes_2_seconds.clear_cache()\n    _takes_2_seconds(\"a\", arg_2=\"b\")\n    start = time()\n    _takes_2_seconds(\"a\", arg_2=\"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds.clear_cache()\n\n\nSECONDS_IN_DELTA = 3\nDELTA = timedelta(seconds=SECONDS_IN_DELTA)\n\n\n@cachier(backend=\"memory\", stale_after=DELTA, next_time=False)\ndef _stale_after_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.memory\ndef test_stale_after():\n    \"\"\"Testing the stale_after functionality.\"\"\"\n    _stale_after_seconds.clear_cache()\n    val1 = _stale_after_seconds(1, 2)\n    val2 = _stale_after_seconds(1, 2)\n    val3 = _stale_after_seconds(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(3)\n    val4 = _stale_after_seconds(1, 2)\n    assert val4 != val1\n    _stale_after_seconds.clear_cache()\n\n\n@cachier(backend=\"memory\", stale_after=DELTA, next_time=True)\ndef _stale_after_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.memory\ndef test_stale_after_next_time():\n    \"\"\"Testing the stale_after with next_time functionality.\"\"\"\n    _stale_after_next_time.clear_cache()\n    val1 = _stale_after_next_time(1, 2)\n    val2 = _stale_after_next_time(1, 2)\n    val3 = _stale_after_next_time(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(SECONDS_IN_DELTA + 1)\n    val4 = _stale_after_next_time(1, 2)\n    assert val4 == val1\n    sleep(0.5)\n    val5 = _stale_after_next_time(1, 2)\n    assert val5 != val1\n    _stale_after_next_time.clear_cache()\n\n\n@cachier(backend=\"memory\")\ndef _random_num():\n    return random()\n\n\n@cachier(backend=\"memory\")\ndef _random_num_with_arg(a):\n    # print(a)\n    return random()\n\n\n@pytest.mark.memory\ndef test_overwrite_cache():\n    \"\"\"Tests that the overwrite feature works correctly.\"\"\"\n    _random_num.clear_cache()\n    int1 = _random_num()\n    int2 = _random_num()\n    assert int2 == int1\n    int3 = _random_num(cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num()\n    assert int4 == int3\n    _random_num.clear_cache()\n\n    _random_num_with_arg.clear_cache()\n    int1 = _random_num_with_arg(\"a\")\n    int2 = _random_num_with_arg(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg(\"a\", cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg(\"a\")\n    assert int4 == int3\n    _random_num_with_arg.clear_cache()\n\n\n@pytest.mark.memory\ndef test_ignore_cache():\n    \"\"\"Tests that the ignore_cache feature works correctly.\"\"\"\n    _random_num.clear_cache()\n    int1 = _random_num()\n    int2 = _random_num()\n    assert int2 == int1\n    int3 = _random_num(cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num()\n    assert int4 != int3\n    assert int4 == int1\n    _random_num.clear_cache()\n\n    _random_num_with_arg.clear_cache()\n    int1 = _random_num_with_arg(\"a\")\n    int2 = _random_num_with_arg(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg(\"a\", cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg(\"a\")\n    assert int4 != int3\n    assert int4 == int1\n    _random_num_with_arg.clear_cache()\n\n\n@cachier(backend=\"memory\")\ndef _takes_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)  # this has to be enough time for check_calculation to run twice\n    return random() + arg_1 + arg_2\n\n\ndef _calls_takes_time(res_queue):\n    res = _takes_time(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.memory\ndef test_memory_being_calculated():\n    \"\"\"Testing memory core handling of being calculated scenarios.\"\"\"\n    _takes_time.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=3)\n    thread2.join(timeout=3)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\n@cachier(backend=\"memory\", stale_after=timedelta(seconds=1), next_time=True)\ndef _being_calc_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\ndef _calls_being_calc_next_time(res_queue):\n    res = _being_calc_next_time(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.memory\ndef test_being_calc_next_time():\n    \"\"\"Testing memory core handling of being calculated scenarios.\"\"\"\n    _takes_time.clear_cache()\n    _being_calc_next_time(0.13, 0.02)\n    sleep(1.1)\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=3)\n    thread2.join(timeout=3)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\n@cachier(backend=\"memory\")\ndef _bad_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n@cachier(backend=\"memory\")\ndef _delete_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n@pytest.mark.memory\ndef test_clear_being_calculated():\n    \"\"\"Test memory core clear `being calculated` functionality.\"\"\"\n    _takes_time.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread1.start()\n    _takes_time.clear_being_calculated()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=3)\n    thread2.join(timeout=3)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 != res2\n\n\n@pytest.mark.memory\ndef test_clear_being_calculated_with_empty_cache():\n    \"\"\"Test memory core clear `being calculated` functionality.\"\"\"\n    _takes_time.clear_cache()\n    _takes_time.clear_being_calculated()\n\n\n@cachier(backend=\"memory\", stale_after=timedelta(seconds=1), next_time=True)\ndef _error_throwing_func(arg1):\n    if not hasattr(_error_throwing_func, \"count\"):\n        _error_throwing_func.count = 0\n    _error_throwing_func.count += 1\n    if _error_throwing_func.count > 1:\n        raise ValueError(\"Tiny Rick!\")\n    return 7\n\n\n@pytest.mark.memory\ndef test_error_throwing_func():\n    # with\n    res1 = _error_throwing_func(4)\n    sleep(1.5)\n    res2 = _error_throwing_func(4)\n    assert res1 == res2\n\n\n@pytest.mark.memory\ndef test_callable_hash_param():\n    def _hash_func(args, kwargs):\n        def _hash(obj):\n            if isinstance(obj, pd.core.frame.DataFrame):\n                return hashlib.sha256(\n                    pd.util.hash_pandas_object(obj).values.tobytes()\n                ).hexdigest()\n            return obj\n\n        k_args = tuple(map(_hash, args))\n        k_kwargs = tuple(\n            sorted({k: _hash(v) for k, v in kwargs.items()}.items())\n        )\n        return k_args + k_kwargs\n\n    @cachier(backend=\"memory\", hash_func=_hash_func)\n    def _params_with_dataframe(*args, **kwargs):\n        \"\"\"Some function.\"\"\"\n        return random()\n\n    _params_with_dataframe.clear_cache()\n\n    df_a = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    df_b = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    value_a = _params_with_dataframe(df_a, 1)\n    value_b = _params_with_dataframe(df_b, 1)\n\n    assert value_a == value_b  # same content --> same key\n\n    value_a = _params_with_dataframe(1, df=df_a)\n    value_b = _params_with_dataframe(1, df=df_b)\n\n    assert value_a == value_b  # same content --> same key\n\n\nif __name__ == \"__main__\":\n    test_memory_being_calculated()\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_core_lookup.py",
      "content": "\"\"\"Testing a few basic cachier interfaces.\"\"\"\n\nimport pytest\nfrom cachier import cachier, get_default_params\nfrom cachier.cores.mongo import MissingMongetter\n\n\ndef test_get_default_params():\n    params = get_default_params()\n    assert tuple(sorted(params)) == (\n        \"allow_none\",\n        \"backend\",\n        \"cache_dir\",\n        \"caching_enabled\",\n        \"hash_func\",\n        \"mongetter\",\n        \"next_time\",\n        \"pickle_reload\",\n        \"separate_files\",\n        \"stale_after\",\n        \"wait_for_calc_timeout\",\n    )\n\n\ndef test_bad_name(name=\"nope\"):\n    # Test that the appropriate exception is thrown\n    # when an invalid backend is given.\n    with pytest.raises(ValueError, match=f\"specified an invalid core: {name}\"):\n\n        @cachier(backend=name)\n        def dummy_func():\n            pass\n\n\ndef test_missing_mongetter():\n    # Test that the appropriate exception is thrown\n    # when forgetting to specify the mongetter.\n    with pytest.raises(MissingMongetter):\n\n        @cachier(backend=\"mongo\", mongetter=None)\n        def dummy_func():\n            pass\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_general.py",
      "content": "\"\"\"Non-core-specific tests for cachier.\"\"\"\n\nimport functools\nimport os\nimport queue\nimport subprocess  # nosec: B404\nimport threading\nfrom contextlib import suppress\nfrom random import random\nfrom time import sleep, time\n\nimport cachier\nimport pytest\nfrom cachier.core import (\n    DEFAULT_MAX_WORKERS,\n    MAX_WORKERS_ENVAR_NAME,\n    _get_executor,\n    _max_workers,\n    _set_max_workers,\n)\n\nfrom tests.test_mongo_core import (\n    MONGO_DELTA_LONG,\n    _test_mongetter,\n)\n\n\ndef test_information():\n    print(\"\\ncachier version: \", end=\"\")\n    print(cachier.__version__)\n\n\ndef test_max_workers():\n    \"\"\"Just call this function for coverage.\"\"\"\n    with suppress(KeyError):\n        del os.environ[MAX_WORKERS_ENVAR_NAME]\n    assert _max_workers() == DEFAULT_MAX_WORKERS\n\n\ndef test_get_executor():\n    \"\"\"Just call this function for coverage.\"\"\"\n    _get_executor()\n    _get_executor(False)\n    _get_executor(True)\n\n\ndef test_set_max_workers():\n    \"\"\"Just call this function for coverage.\"\"\"\n    _set_max_workers(9)\n\n\nparametrize_keys = \"mongetter,stale_after,separate_files\"\nparametrize_values = [\n    pytest.param(\n        _test_mongetter, MONGO_DELTA_LONG, False, marks=pytest.mark.mongo\n    ),\n    (None, None, False),\n    (None, None, True),\n]\n\n\n@pytest.mark.parametrize(parametrize_keys, parametrize_values)\ndef test_wait_for_calc_timeout_ok(mongetter, stale_after, separate_files):\n    @cachier.cachier(\n        mongetter=mongetter,\n        stale_after=stale_after,\n        separate_files=separate_files,\n        next_time=False,\n        wait_for_calc_timeout=2,\n    )\n    def _wait_for_calc_timeout_fast(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        sleep(1)\n        return random() + arg_1 + arg_2\n\n    def _calls_wait_for_calc_timeout_fast(res_queue):\n        res = _wait_for_calc_timeout_fast(1, 2)\n        res_queue.put(res)\n\n    \"\"\" Testing calls that avoid timeouts store the values in cache. \"\"\"\n    _wait_for_calc_timeout_fast.clear_cache()\n    val1 = _wait_for_calc_timeout_fast(1, 2)\n    val2 = _wait_for_calc_timeout_fast(1, 2)\n    assert val1 == val2\n\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_fast,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_fast,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n\n    thread1.start()\n    thread2.start()\n    sleep(2)\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2  # Timeout did not kick in, a single call was done\n\n\n@pytest.mark.parametrize(parametrize_keys, parametrize_values)\ndef test_wait_for_calc_timeout_slow(mongetter, stale_after, separate_files):\n    @cachier.cachier(\n        mongetter=mongetter,\n        stale_after=stale_after,\n        separate_files=separate_files,\n        next_time=False,\n        wait_for_calc_timeout=2,\n    )\n    def _wait_for_calc_timeout_slow(arg_1, arg_2):\n        sleep(3)\n        return random() + arg_1 + arg_2\n\n    def _calls_wait_for_calc_timeout_slow(res_queue):\n        res = _wait_for_calc_timeout_slow(1, 2)\n        res_queue.put(res)\n\n    \"\"\"Testing for calls timing out to be performed twice when needed.\"\"\"\n    _wait_for_calc_timeout_slow.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n\n    thread1.start()\n    thread2.start()\n    sleep(1)\n    res3 = _wait_for_calc_timeout_slow(1, 2)\n    sleep(4)\n    thread1.join(timeout=4)\n    thread2.join(timeout=4)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 != res2  # Timeout kicked in.  Two calls were done\n    res4 = _wait_for_calc_timeout_slow(1, 2)\n    # One of the cached values is returned\n    assert res1 == res4 or res2 == res4 or res3 == res4\n\n\n@pytest.mark.parametrize(\n    \"mongetter,backend\",\n    [\n        pytest.param(_test_mongetter, \"mongo\", marks=pytest.mark.mongo),\n        (None, \"memory\"),\n        (None, \"pickle\"),\n    ],\n)\ndef test_precache_value(mongetter, backend):\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def dummy_func(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return arg_1 + arg_2\n\n    assert dummy_func.precache_value(2, 2, value_to_cache=5) == 5\n    assert dummy_func(2, 2) == 5\n    dummy_func.clear_cache()\n    assert dummy_func(2, 2) == 4\n    assert dummy_func.precache_value(2, arg_2=2, value_to_cache=5) == 5\n    assert dummy_func(2, arg_2=2) == 5\n\n\n@pytest.mark.parametrize(\n    \"mongetter,backend\",\n    [\n        pytest.param(_test_mongetter, \"mongo\", marks=pytest.mark.mongo),\n        (None, \"memory\"),\n        (None, \"pickle\"),\n    ],\n)\ndef test_ignore_self_in_methods(mongetter, backend):\n    class DummyClass:\n        @cachier.cachier(backend=backend, mongetter=mongetter)\n        def takes_2_seconds(self, arg_1, arg_2):\n            \"\"\"Some function.\"\"\"\n            sleep(2)\n            return arg_1 + arg_2\n\n    test_object_1 = DummyClass()\n    test_object_2 = DummyClass()\n    test_object_1.takes_2_seconds.clear_cache()\n    test_object_2.takes_2_seconds.clear_cache()\n    assert test_object_1.takes_2_seconds(1, 2) == 3\n    start = time()\n    assert test_object_2.takes_2_seconds(1, 2) == 3\n    end = time()\n    assert end - start < 1\n\n\ndef test_hash_params_deprecation():\n    with pytest.deprecated_call(match=\"hash_params will be removed\"):\n\n        @cachier.cachier(hash_params=lambda a, k: \"key\")\n        def test():\n            return \"value\"\n\n    assert test() == \"value\"\n\n\ndef test_separate_processes():\n    test_args = (\"python\", \"tests/standalone_script.py\")\n    run_params = {\"args\": test_args, \"capture_output\": True, \"text\": True}\n    run_process = functools.partial(subprocess.run, **run_params)\n    result = run_process()\n    assert result.stdout.strip() == \"two 2\"\n    start = time()\n    result = run_process()\n    end = time()\n    assert result.stdout.strip() == \"two 2\"\n    assert end - start < 3\n\n\ndef test_global_disable():\n    @cachier.cachier()\n    def get_random():\n        return random()\n\n    get_random.clear_cache()\n    result_1 = get_random()\n    result_2 = get_random()\n    cachier.disable_caching()\n    result_3 = get_random()\n    cachier.enable_caching()\n    result_4 = get_random()\n    assert result_1 == result_2 == result_4\n    assert result_1 != result_3\n\n\ndef test_none_not_cached_by_default():\n    count = 0\n\n    @cachier.cachier()\n    def do_operation():\n        nonlocal count\n        count += 1\n        return None\n\n    do_operation.clear_cache()\n    assert count == 0\n    do_operation()\n    do_operation()\n    assert count == 2\n\n\ndef test_allow_caching_none():\n    count = 0\n\n    @cachier.cachier(allow_none=True)\n    def do_operation():\n        nonlocal count\n        count += 1\n        return None\n\n    do_operation.clear_cache()\n    assert count == 0\n    do_operation()\n    do_operation()\n    assert count == 1\n\n\ndef test_identical_inputs():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a: int, b: int = 2, c: int = 3):\n        nonlocal count\n        count += 1\n        return a + b + c\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func(1, 2, 3) == 6\n    assert dummy_func(1, 2, c=3) == 6\n    assert dummy_func(1, b=2, c=3) == 6\n    assert dummy_func(a=1, b=2, c=3) == 6\n    assert count == 1\n\n\ndef test_list_inputs():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a: list, b: list = [2]):  # noqa: B006\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func([1]) == [1, 2]\n    assert dummy_func([1], [2]) == [1, 2]\n    assert dummy_func([1], b=[2]) == [1, 2]\n    assert dummy_func(a=[1], b=[2]) == [1, 2]\n    assert count == 1\n\n\ndef test_order_independent_kwargs_handling():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a, b):\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func(a=1, b=2) == 3\n    assert dummy_func(a=1, b=2) == 3\n    assert dummy_func(b=2, a=1) == 3\n    assert count == 1\n\n\n@pytest.mark.parametrize(\"backend\", [\"memory\", \"pickle\"])\ndef test_diff_functions_same_args(tmpdir, backend: str):\n    count_p = count_m = 0\n\n    @cachier.cachier(cache_dir=tmpdir, backend=backend)\n    def fn_plus(a, b=2):\n        nonlocal count_p\n        count_p += 1\n        return a + b\n\n    @cachier.cachier(cache_dir=tmpdir, backend=backend)\n    def fn_minus(a, b=2):\n        nonlocal count_m\n        count_m += 1\n        return a - b\n\n    assert count_p == count_m == 0\n\n    for fn, expected in [(fn_plus, 3), (fn_minus, -1)]:\n        assert fn(1) == expected\n        assert fn(a=1, b=2) == expected\n    assert count_p == 1\n    assert count_m == 1\n\n\n@pytest.mark.parametrize(\"backend\", [\"memory\", \"pickle\"])\ndef test_runtime_handling(tmpdir, backend):\n    count_p = count_m = 0\n\n    def fn_plus(a, b=2):\n        nonlocal count_p\n        count_p += 1\n        return a + b\n\n    def fn_minus(a, b=2):\n        nonlocal count_m\n        count_m += 1\n        return a - b\n\n    cachier_ = cachier.cachier(cache_dir=tmpdir, backend=backend)\n    assert count_p == count_m == 0\n\n    for fn, expected in [(fn_plus, 3), (fn_minus, -1)]:\n        assert cachier_(fn)(1, 2) == expected\n        assert cachier_(fn)(a=1, b=2) == expected\n    assert count_p == 1\n    assert count_m == 1\n\n    for fn, expected in [(fn_plus, 5), (fn_minus, 1)]:\n        assert cachier_(fn)(3, 2) == expected\n        assert cachier_(fn)(a=3, b=2) == expected\n    assert count_p == 2\n    assert count_m == 2\n\n\ndef test_partial_handling(tmpdir):\n    count_p = count_m = 0\n\n    def fn_plus(a, b=2):\n        nonlocal count_p\n        count_p += 1\n        return a + b\n\n    def fn_minus(a, b=2):\n        nonlocal count_m\n        count_m += 1\n        return a - b\n\n    cachier_ = cachier.cachier(cache_dir=tmpdir)\n    assert count_p == count_m == 0\n\n    for fn, expected in [(fn_plus, 3), (fn_minus, -1)]:\n        dummy_ = functools.partial(fn, 1)\n        assert cachier_(dummy_)() == expected\n\n        dummy_ = functools.partial(fn, 1)\n        assert cachier_(dummy_)(2) == expected\n\n        dummy_ = functools.partial(fn, a=1)\n        assert cachier_(dummy_)() == expected\n\n        dummy_ = functools.partial(fn, b=2)\n        assert cachier_(dummy_)(1) == expected\n\n        dummy_ = functools.partial(fn, b=2)\n        assert cachier_(dummy_)(1, b=2) == expected\n\n        assert cachier_(fn)(1, 2) == expected\n        assert cachier_(fn)(a=1, b=2) == expected\n\n    assert count_p == 1\n    assert count_m == 1\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_defaults.py",
      "content": "import datetime\nimport os\nimport queue\nimport random\nimport threading\nimport time\n\nimport cachier\nimport pytest\n\nfrom tests.test_mongo_core import MONGO_DELTA, _test_mongetter\n\n_default_params = cachier.get_default_params().copy()\n\n\ndef setup_function():\n    cachier.set_default_params(**_default_params)\n\n\ndef teardown_function():\n    cachier.set_default_params(**_default_params)\n\n\ndef test_hash_func_default_param():\n    def slow_hash_func(args, kwds):\n        time.sleep(2)\n        return \"hash\"\n\n    def fast_hash_func(args, kwds):\n        return \"hash\"\n\n    cachier.set_default_params(hash_func=slow_hash_func)\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(hash_func=fast_hash_func)\n    def global_test_2():\n        return None\n\n    start = time.time()\n    global_test_1()\n    end = time.time()\n    assert end - start > 1\n    start = time.time()\n    global_test_2()\n    end = time.time()\n    assert end - start < 1\n\n\ndef test_backend_default_param():\n    cachier.set_default_params(backend=\"memory\")\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(backend=\"pickle\")\n    def global_test_2():\n        return None\n\n    assert global_test_1.cache_dpath() is None\n    assert global_test_2.cache_dpath() is not None\n\n\n@pytest.mark.mongo\ndef test_mongetter_default_param():\n    cachier.set_default_params(mongetter=_test_mongetter)\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(mongetter=False)\n    def global_test_2():\n        return None\n\n    assert global_test_1.cache_dpath() is None\n    assert global_test_2.cache_dpath() is not None\n\n\ndef test_cache_dir_default_param(tmpdir):\n    cachier.set_default_params(cache_dir=tmpdir / \"1\")\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(cache_dir=tmpdir / \"2\")\n    def global_test_2():\n        return None\n\n    assert global_test_1.cache_dpath() == str(tmpdir / \"1\")\n    assert global_test_2.cache_dpath() == str(tmpdir / \"2\")\n\n\ndef test_separate_files_default_param(tmpdir):\n    cachier.set_default_params(separate_files=True)\n\n    @cachier.cachier(cache_dir=tmpdir / \"1\")\n    def global_test_1(arg_1, arg_2):\n        return arg_1 + arg_2\n\n    @cachier.cachier(cache_dir=tmpdir / \"2\", separate_files=False)\n    def global_test_2(arg_1, arg_2):\n        return arg_1 + arg_2\n\n    global_test_1(1, 2)\n    global_test_1(3, 4)\n    global_test_2(1, 2)\n    global_test_2(3, 4)\n\n    assert len(os.listdir(global_test_1.cache_dpath())) == 2\n    assert len(os.listdir(global_test_2.cache_dpath())) == 1\n\n\ndef test_allow_none_default_param(tmpdir):\n    cachier.set_default_params(\n        allow_none=True,\n        separate_files=True,\n        verbose_cache=True,\n    )\n    allow_count = disallow_count = 0\n\n    @cachier.cachier(cache_dir=tmpdir)\n    def allow_none():\n        nonlocal allow_count\n        allow_count += 1\n        return None\n\n    @cachier.cachier(cache_dir=tmpdir, allow_none=False)\n    def disallow_none():\n        nonlocal disallow_count\n        disallow_count += 1\n        return None\n\n    assert allow_count == 0\n    allow_none()\n    allow_none()\n    assert allow_count == 1\n\n    assert disallow_count == 0\n    disallow_none()\n    disallow_none()\n    assert disallow_count == 2\n\n    disallow_none(cachier__allow_none=True)\n    disallow_none(cachier__allow_none=True)\n    assert disallow_count == 2\n\n\nPARAMETRIZE_TEST = (\n    \"backend,mongetter\",\n    [\n        pytest.param(\"pickle\", None, marks=pytest.mark.pickle),\n        pytest.param(\"mongo\", _test_mongetter, marks=pytest.mark.mongo),\n    ],\n)\n\n\n@pytest.mark.parametrize(*PARAMETRIZE_TEST)\ndef test_stale_after_applies_dynamically(backend, mongetter):\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def _stale_after_test(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random.random() + arg_1 + arg_2\n\n    cachier.set_default_params(stale_after=MONGO_DELTA)\n\n    _stale_after_test.clear_cache()\n    val1 = _stale_after_test(1, 2)\n    val2 = _stale_after_test(1, 2)\n    assert val1 == val2\n    time.sleep(3)\n    val3 = _stale_after_test(1, 2)\n    assert val3 != val1\n\n\n@pytest.mark.parametrize(*PARAMETRIZE_TEST)\ndef test_next_time_applies_dynamically(backend, mongetter):\n    NEXT_AFTER_DELTA = datetime.timedelta(seconds=3)\n\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def _stale_after_next_time(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random.random()\n\n    cachier.set_default_params(stale_after=NEXT_AFTER_DELTA, next_time=True)\n\n    _stale_after_next_time.clear_cache()\n    val1 = _stale_after_next_time(1, 2)\n    val2 = _stale_after_next_time(1, 2)\n    val3 = _stale_after_next_time(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    time.sleep(NEXT_AFTER_DELTA.seconds + 1)\n    val4 = _stale_after_next_time(1, 2)\n    assert val4 == val1\n    time.sleep(0.5)\n    val5 = _stale_after_next_time(1, 2)\n    assert val5 != val1\n    _stale_after_next_time.clear_cache()\n\n\n@pytest.mark.parametrize(*PARAMETRIZE_TEST)\ndef test_wait_for_calc_applies_dynamically(backend, mongetter):\n    \"\"\"Testing for calls timing out to be performed twice when needed.\"\"\"\n\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def _wait_for_calc_timeout_slow(arg_1, arg_2):\n        time.sleep(3)\n        return random.random() + arg_1 + arg_2\n\n    def _calls_wait_for_calc_timeout_slow(res_queue):\n        res = _wait_for_calc_timeout_slow(1, 2)\n        res_queue.put(res)\n\n    cachier.set_default_params(wait_for_calc_timeout=2)\n    _wait_for_calc_timeout_slow.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n\n    thread1.start()\n    thread2.start()\n    time.sleep(1)\n    res3 = _wait_for_calc_timeout_slow(1, 2)\n    time.sleep(5)\n    thread1.join(timeout=5)\n    thread2.join(timeout=5)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 != res2  # Timeout kicked in.  Two calls were done\n    res4 = _wait_for_calc_timeout_slow(1, 2)\n    # One of the cached values is returned\n    assert res1 == res4 or res2 == res4 or res3 == res4\n\n\ndef test_default_kwargs_handling():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a, b=2):\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func(1) == 3\n    assert dummy_func(a=1) == 3\n    assert dummy_func(a=1, b=2) == 3\n    assert count == 1\n\n\ndef test_deprecated_func_kwargs():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a, b=2):\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    with pytest.deprecated_call(\n        match=\"`verbose_cache` is deprecated and will be removed\"\n    ):\n        assert dummy_func(1, verbose_cache=True) == 3\n    assert count == 1\n    with pytest.deprecated_call(\n        match=\"`ignore_cache` is deprecated and will be removed\"\n    ):\n        assert dummy_func(1, ignore_cache=True) == 3\n    assert count == 2\n    with pytest.deprecated_call(\n        match=\"`overwrite_cache` is deprecated and will be removed\"\n    ):\n        assert dummy_func(1, overwrite_cache=True) == 3\n    assert count == 3\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_pickle_core.py",
      "content": "\"\"\"Test for the Cachier python package.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\n# from os.path import (\n#     realpath,\n#     dirname\n# )\nimport os\nimport pickle\nimport threading\nfrom datetime import timedelta\nfrom random import random\nfrom time import sleep, time\n\nimport pytest\n\ntry:\n    import queue\nexcept ImportError:  # python 2\n    import Queue as queue  # type: ignore\n\nimport hashlib\n\nimport pandas as pd\nfrom cachier import cachier\nfrom cachier.core import _default_params\n\n\ndef _get_decorated_func(func, **kwargs):\n    cachier_decorator = cachier(**kwargs)\n    decorated_func = cachier_decorator(func)\n    return decorated_func\n\n\n# Pickle core tests\n\n\ndef _takes_2_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)\n    return f\"arg_1:{arg_1}, arg_2:{arg_2}\"\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"reload\", [True, False])\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_core(reload, separate_files):\n    \"\"\"Basic Pickle core functionality.\"\"\"\n    _takes_2_seconds_decorated = _get_decorated_func(\n        _takes_2_seconds,\n        next_time=False,\n        pickle_reload=reload,\n        separate_files=separate_files,\n    )\n    _takes_2_seconds_decorated.clear_cache()\n    _takes_2_seconds_decorated(\"a\", \"b\")\n    start = time()\n    _takes_2_seconds_decorated(\"a\", \"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds_decorated.clear_cache()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_core_keywords(separate_files):\n    \"\"\"Basic Pickle core functionality with keyword arguments.\"\"\"\n    _takes_2_seconds_decorated = _get_decorated_func(\n        _takes_2_seconds, next_time=False, separate_files=separate_files\n    )\n    _takes_2_seconds_decorated.clear_cache()\n    _takes_2_seconds_decorated(\"a\", arg_2=\"b\")\n    start = time()\n    _takes_2_seconds_decorated(\"a\", arg_2=\"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds_decorated.clear_cache()\n\n\nSECONDS_IN_DELTA = 3\nDELTA = timedelta(seconds=SECONDS_IN_DELTA)\n\n\ndef _stale_after_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_stale_after(separate_files):\n    \"\"\"Testing the stale_after functionality.\"\"\"\n    _stale_after_seconds_decorated = _get_decorated_func(\n        _stale_after_seconds,\n        stale_after=DELTA,\n        next_time=False,\n        separate_files=separate_files,\n    )\n    _stale_after_seconds_decorated.clear_cache()\n    val1 = _stale_after_seconds_decorated(1, 2)\n    val2 = _stale_after_seconds_decorated(1, 2)\n    val3 = _stale_after_seconds_decorated(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(3)\n    val4 = _stale_after_seconds_decorated(1, 2)\n    assert val4 != val1\n    _stale_after_seconds_decorated.clear_cache()\n\n\ndef _stale_after_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_stale_after_next_time(separate_files):\n    \"\"\"Testing the stale_after with next_time functionality.\"\"\"\n    _stale_after_next_time_decorated = _get_decorated_func(\n        _stale_after_next_time,\n        stale_after=DELTA,\n        next_time=True,\n        separate_files=separate_files,\n    )\n    _stale_after_next_time_decorated.clear_cache()\n    val1 = _stale_after_next_time_decorated(1, 2)\n    val2 = _stale_after_next_time_decorated(1, 2)\n    val3 = _stale_after_next_time_decorated(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(SECONDS_IN_DELTA + 1)\n    val4 = _stale_after_next_time_decorated(1, 2)\n    assert val4 == val1\n    sleep(0.5)\n    val5 = _stale_after_next_time_decorated(1, 2)\n    assert val5 != val1\n    _stale_after_next_time_decorated.clear_cache()\n\n\ndef _random_num():\n    return random()\n\n\ndef _random_num_with_arg(a):\n    # print(a)\n    return random()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_overwrite_cache(separate_files):\n    \"\"\"Tests that the overwrite feature works correctly.\"\"\"\n    _random_num_decorated = _get_decorated_func(\n        _random_num, separate_files=separate_files\n    )\n    _random_num_with_arg_decorated = _get_decorated_func(\n        _random_num_with_arg, separate_files=separate_files\n    )\n    _random_num_decorated.clear_cache()\n    int1 = _random_num_decorated()\n    int2 = _random_num_decorated()\n    assert int2 == int1\n    int3 = _random_num_decorated(cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num_decorated()\n    assert int4 == int3\n    _random_num_decorated.clear_cache()\n\n    _random_num_with_arg_decorated.clear_cache()\n    int1 = _random_num_with_arg_decorated(\"a\")\n    int2 = _random_num_with_arg_decorated(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg_decorated(\"a\", cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg_decorated(\"a\")\n    assert int4 == int3\n    _random_num_with_arg_decorated.clear_cache()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_ignore_cache(separate_files):\n    \"\"\"Tests that the ignore_cache feature works correctly.\"\"\"\n    _random_num_decorated = _get_decorated_func(\n        _random_num, separate_files=separate_files\n    )\n    _random_num_with_arg_decorated = _get_decorated_func(\n        _random_num_with_arg, separate_files=separate_files\n    )\n    _random_num_decorated.clear_cache()\n    int1 = _random_num_decorated()\n    int2 = _random_num_decorated()\n    assert int2 == int1\n    int3 = _random_num_decorated(cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num_decorated()\n    assert int4 != int3\n    assert int4 == int1\n    _random_num_decorated.clear_cache()\n\n    _random_num_with_arg_decorated.clear_cache()\n    int1 = _random_num_with_arg_decorated(\"a\")\n    int2 = _random_num_with_arg_decorated(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg_decorated(\"a\", cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg_decorated(\"a\")\n    assert int4 != int3\n    assert int4 == int1\n    _random_num_with_arg_decorated.clear_cache()\n\n\ndef _takes_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)  # this has to be enough time for check_calculation to run twice\n    return random() + arg_1 + arg_2\n\n\ndef _calls_takes_time(takes_time_func, res_queue):\n    res = takes_time_func(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_being_calculated(separate_files):\n    \"\"\"Testing pickle core handling of being calculated scenarios.\"\"\"\n    _takes_time_decorated = _get_decorated_func(\n        _takes_time, separate_files=separate_files\n    )\n    _takes_time_decorated.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time,\n        kwargs={\n            \"takes_time_func\": _takes_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time,\n        kwargs={\n            \"takes_time_func\": _takes_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=4)\n    thread2.join(timeout=4)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\ndef _being_calc_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\ndef _calls_being_calc_next_time(being_calc_func, res_queue):\n    res = being_calc_func(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_being_calc_next_time(separate_files):\n    \"\"\"Testing pickle core handling of being calculated scenarios.\"\"\"\n    _being_calc_next_time_decorated = _get_decorated_func(\n        _being_calc_next_time,\n        stale_after=timedelta(seconds=1),\n        next_time=True,\n        separate_files=separate_files,\n    )\n    _being_calc_next_time_decorated.clear_cache()\n    _being_calc_next_time(0.13, 0.02)\n    sleep(1.1)\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\n            \"being_calc_func\": _being_calc_next_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\n            \"being_calc_func\": _being_calc_next_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\ndef _bad_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n# _BAD_CACHE_FNAME = '.__main__._bad_cache'\n_BAD_CACHE_FNAME = \".tests.test_pickle_core._bad_cache\"\n_BAD_CACHE_FNAME_SEPARATE_FILES = (\n    \".tests.test_pickle_core._bad_cache_\"\n    f\"{hashlib.sha256(pickle.dumps((0.13, 0.02))).hexdigest()}\"\n)\nEXPANDED_CACHIER_DIR = os.path.expanduser(_default_params[\"cache_dir\"])\n_BAD_CACHE_FPATH = os.path.join(EXPANDED_CACHIER_DIR, _BAD_CACHE_FNAME)\n_BAD_CACHE_FPATH_SEPARATE_FILES = os.path.join(\n    EXPANDED_CACHIER_DIR, _BAD_CACHE_FNAME_SEPARATE_FILES\n)\n_BAD_CACHE_FPATHS = {\n    True: _BAD_CACHE_FPATH_SEPARATE_FILES,\n    False: _BAD_CACHE_FPATH,\n}\n\n\ndef _calls_bad_cache(bad_cache_func, res_queue, trash_cache, separate_files):\n    try:\n        res = bad_cache_func(0.13, 0.02, cachier__verbose=True)\n        if trash_cache:\n            with open(_BAD_CACHE_FPATHS[separate_files], \"w\") as cache_file:\n                cache_file.seek(0)\n                cache_file.truncate()\n        res_queue.put(res)\n    except Exception as exc:\n        res_queue.put(exc)\n\n\ndef _helper_bad_cache_file(sleeptime, separate_files):\n    \"\"\"Test pickle core handling of bad cache files.\"\"\"\n    _bad_cache_decorated = _get_decorated_func(\n        _bad_cache, separate_files=separate_files\n    )\n    _bad_cache_decorated.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_bad_cache,\n        kwargs={\n            \"bad_cache_func\": _bad_cache_decorated,\n            \"res_queue\": res_queue,\n            \"trash_cache\": True,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_bad_cache,\n        kwargs={\n            \"bad_cache_func\": _bad_cache_decorated,\n            \"res_queue\": res_queue,\n            \"trash_cache\": False,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(sleeptime)\n    thread2.start()\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    if res_queue.qsize() != 2:\n        return False\n    res1 = res_queue.get()\n    if not isinstance(res1, float):\n        return False\n    res2 = res_queue.get()\n    if res2 is not None or isinstance(res2, KeyError):\n        return False\n    return True\n\n\n# we want this to succeed at least once\n@pytest.mark.pickle\n@pytest.mark.xfail\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_bad_cache_file(separate_files):\n    \"\"\"Test pickle core handling of bad cache files.\"\"\"\n    sleeptimes = [0.1, 0.2, 0.3, 0.5, 0.6, 0.7, 0.8, 1, 1.5, 2]\n    sleeptimes = sleeptimes + sleeptimes\n    for sleeptime in sleeptimes:\n        if _helper_bad_cache_file(sleeptime, separate_files):\n            return\n    raise AssertionError()\n\n\ndef _delete_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n# _DEL_CACHE_FNAME = '.__main__._delete_cache'\n_DEL_CACHE_FNAME = \".tests.test_pickle_core._delete_cache\"\n_DEL_CACHE_FNAME_SEPARATE_FILES = (\n    \".tests.test_pickle_core._delete_cache_\"\n    f\"{hashlib.sha256(pickle.dumps((0.13, 0.02))).hexdigest()}\"\n)\n_DEL_CACHE_FPATH = os.path.join(EXPANDED_CACHIER_DIR, _DEL_CACHE_FNAME)\n_DEL_CACHE_FPATH_SEPARATE_FILES = os.path.join(\n    EXPANDED_CACHIER_DIR, _DEL_CACHE_FNAME_SEPARATE_FILES\n)\n_DEL_CACHE_FPATHS = {\n    True: _DEL_CACHE_FPATH_SEPARATE_FILES,\n    False: _DEL_CACHE_FPATH,\n}\n\n\ndef _calls_delete_cache(del_cache_func, res_queue, del_cache, separate_files):\n    try:\n        # print('in')\n        res = del_cache_func(0.13, 0.02)\n        # print('out with {}'.format(res))\n        if del_cache:\n            os.remove(_DEL_CACHE_FPATHS[separate_files])\n            # print(os.path.isfile(_DEL_CACHE_FPATH))\n        res_queue.put(res)\n    except Exception as exc:\n        # print('found')\n        res_queue.put(exc)\n\n\ndef _helper_delete_cache_file(sleeptime, separate_files):\n    \"\"\"Test pickle core handling of missing cache files.\"\"\"\n    _delete_cache_decorated = _get_decorated_func(\n        _delete_cache, separate_files=separate_files\n    )\n    _delete_cache_decorated.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_delete_cache,\n        kwargs={\n            \"del_cache_func\": _delete_cache_decorated,\n            \"res_queue\": res_queue,\n            \"del_cache\": True,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_delete_cache,\n        kwargs={\n            \"del_cache_func\": _delete_cache_decorated,\n            \"res_queue\": res_queue,\n            \"del_cache\": False,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(sleeptime)\n    thread2.start()\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    if res_queue.qsize() != 2:\n        return False\n    res1 = res_queue.get()\n    # print(res1)\n    if not isinstance(res1, float):\n        return False\n    res2 = res_queue.get()\n    if not ((isinstance(res2, KeyError)) or (res2 is None)):\n        return False\n    return True\n    # print(res2)\n    # print(type(res2))\n\n\n@pytest.mark.pickle\n@pytest.mark.xfail\n@pytest.mark.parametrize(\"separate_files\", [False, True])\ndef test_delete_cache_file(separate_files):\n    \"\"\"Test pickle core handling of missing cache files.\"\"\"\n    sleeptimes = [0.1, 0.2, 0.3, 0.5, 0.7, 1]\n    sleeptimes = sleeptimes * 4\n    for sleeptime in sleeptimes:\n        if _helper_delete_cache_file(sleeptime, separate_files):\n            return\n    raise AssertionError()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [False, True])\ndef test_clear_being_calculated(separate_files):\n    \"\"\"Test pickle core clear `being calculated` functionality.\"\"\"\n    _takes_time_decorated = _get_decorated_func(\n        _takes_time, separate_files=separate_files\n    )\n    _takes_time_decorated.clear_being_calculated()\n\n\ndef _error_throwing_func(arg1):\n    if not hasattr(_error_throwing_func, \"count\"):\n        _error_throwing_func.count = 0\n    _error_throwing_func.count += 1\n    if _error_throwing_func.count > 1:\n        raise ValueError(\"Tiny Rick!\")\n    return 7\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_error_throwing_func(separate_files):\n    # with\n    _error_throwing_func.count = 0\n    _error_throwing_func_decorated = _get_decorated_func(\n        _error_throwing_func,\n        stale_after=timedelta(seconds=1),\n        next_time=True,\n        separate_files=separate_files,\n    )\n    _error_throwing_func_decorated.clear_cache()\n    res1 = _error_throwing_func_decorated(4)\n    sleep(1.5)\n    res2 = _error_throwing_func_decorated(4)\n    assert res1 == res2\n\n\n# test custom cache dir for pickle core\n\nCUSTOM_DIR = \"~/.exparrot\"\nEXPANDED_CUSTOM_DIR = os.path.expanduser(CUSTOM_DIR)\n\n\ndef _takes_2_seconds_custom_dir(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)\n    return f\"arg_1:{arg_1}, arg_2:{arg_2}\"\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_core_custom_cache_dir(separate_files):\n    \"\"\"Basic Pickle core functionality.\"\"\"\n    _takes_2_seconds_custom_dir_decorated = _get_decorated_func(\n        _takes_2_seconds_custom_dir,\n        next_time=False,\n        cache_dir=CUSTOM_DIR,\n        separate_files=separate_files,\n    )\n    _takes_2_seconds_custom_dir_decorated.clear_cache()\n    _takes_2_seconds_custom_dir_decorated(\"a\", \"b\")\n    start = time()\n    _takes_2_seconds_custom_dir_decorated(\"a\", \"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds_custom_dir_decorated.clear_cache()\n    path2test = _takes_2_seconds_custom_dir_decorated.cache_dpath()\n    assert path2test == EXPANDED_CUSTOM_DIR\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_callable_hash_param(separate_files):\n    def _hash_func(args, kwargs):\n        def _hash(obj):\n            if isinstance(obj, pd.core.frame.DataFrame):\n                return hashlib.sha256(\n                    pd.util.hash_pandas_object(obj).values.tobytes()\n                ).hexdigest()\n            return obj\n\n        k_args = tuple(map(_hash, args))\n        k_kwargs = tuple(\n            sorted({k: _hash(v) for k, v in kwargs.items()}.items())\n        )\n        return k_args + k_kwargs\n\n    @cachier(hash_func=_hash_func, separate_files=separate_files)\n    def _params_with_dataframe(*args, **kwargs):\n        \"\"\"Some function.\"\"\"\n        return random()\n\n    _params_with_dataframe.clear_cache()\n\n    df_a = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    df_b = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    value_a = _params_with_dataframe(df_a, 1)\n    value_b = _params_with_dataframe(df_b, 1)\n\n    assert value_a == value_b  # same content --> same key\n\n    value_a = _params_with_dataframe(1, df=df_a)\n    value_b = _params_with_dataframe(1, df=df_b)\n\n    assert value_a == value_b  # same content --> same key\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_mongo_core.py",
      "content": "\"\"\"Testing the MongoDB core of cachier.\"\"\"\n\nimport datetime\nimport hashlib\nimport platform\nimport queue\nimport sys\nimport threading\nfrom datetime import timedelta\nfrom random import random\nfrom time import sleep\nfrom urllib.parse import quote_plus\n\nimport pandas as pd\nimport pymongo\nimport pytest\nfrom birch import Birch  # type: ignore[import-not-found]\nfrom cachier import cachier\nfrom cachier.cores.base import RecalculationNeeded\nfrom cachier.cores.mongo import _MongoCore\nfrom pymongo.errors import OperationFailure\nfrom pymongo.mongo_client import MongoClient\nfrom pymongo_inmemory import MongoClient as InMemoryMongoClient\n\n# === Enables testing vs a real MongoDB instance ===\n\n\nclass CfgKey:\n    HOST = \"TEST_HOST\"\n    PORT = \"TEST_PORT\"\n    # UNAME = \"TEST_USERNAME\"\n    # PWD = \"TEST_PASSWORD\"\n    # DB = \"TEST_DB\"\n    TEST_VS_DOCKERIZED_MONGO = \"TEST_VS_DOCKERIZED_MONGO\"\n\n\nCFG = Birch(\n    namespace=\"cachier\",\n    defaults={CfgKey.TEST_VS_DOCKERIZED_MONGO: False},\n)\n\n\n# URI_TEMPLATE = \"mongodb://myUser:myPassword@localhost:27017/\"\nURI_TEMPLATE = \"mongodb://{host}:{port}?retrywrites=true&w=majority\"\n\n\ndef _get_cachier_db_mongo_client():\n    host = quote_plus(CFG[CfgKey.HOST])\n    port = quote_plus(CFG[CfgKey.PORT])\n    # uname = quote_plus(CFG[CfgKey.UNAME])\n    # pwd = quote_plus(CFG[CfgKey.PWD])\n    # db = quote_plus(CFG[CfgKey.DB])\n    uri = f\"mongodb://{host}:{port}?retrywrites=true&w=majority\"\n    return MongoClient(uri)\n\n\n_COLLECTION_NAME = (\n    f\"cachier_test_{platform.system()}\"\n    f\"_{'.'.join(map(str, sys.version_info[:3]))}\"\n)\n\n\ndef _test_mongetter():\n    if not hasattr(_test_mongetter, \"client\"):\n        if str(CFG.mget(CfgKey.TEST_VS_DOCKERIZED_MONGO)).lower() == \"true\":\n            print(\"Using live MongoDB instance for testing.\")\n            _test_mongetter.client = _get_cachier_db_mongo_client()\n        else:\n            print(\"Using in-memory MongoDB instance for testing.\")\n            _test_mongetter.client = InMemoryMongoClient()\n    db_obj = _test_mongetter.client[\"cachier_test\"]\n    if _COLLECTION_NAME not in db_obj.list_collection_names():\n        db_obj.create_collection(_COLLECTION_NAME)\n    return db_obj[_COLLECTION_NAME]\n\n\n# === Mongo core tests ===\n\n\n@pytest.mark.mongo\ndef test_information():\n    print(\"\\npymongo version: \", end=\"\")\n    print(pymongo.__version__)\n\n\n@pytest.mark.mongo\ndef test_mongo_index_creation():\n    \"\"\"Basic Mongo core functionality.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _test_mongo_caching(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    collection = _test_mongetter()\n    _test_mongo_caching.clear_cache()\n    val1 = _test_mongo_caching(1, 2)\n    val2 = _test_mongo_caching(1, 2)\n    assert val1 == val2\n    assert _MongoCore._INDEX_NAME in collection.index_information()\n\n\n@pytest.mark.mongo\ndef test_mongo_core():\n    \"\"\"Basic Mongo core functionality.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _test_mongo_caching(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _test_mongo_caching.clear_cache()\n    val1 = _test_mongo_caching(1, 2)\n    val2 = _test_mongo_caching(1, 2)\n    assert val1 == val2\n    val3 = _test_mongo_caching(1, 2, cachier__skip_cache=True)\n    assert val3 != val1\n    val4 = _test_mongo_caching(1, 2)\n    assert val4 == val1\n    val5 = _test_mongo_caching(1, 2, cachier__overwrite_cache=True)\n    assert val5 != val1\n    val6 = _test_mongo_caching(1, 2)\n    assert val6 == val5\n\n\n@pytest.mark.mongo\ndef test_mongo_core_keywords():\n    \"\"\"Basic Mongo core functionality with keyword arguments.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _test_mongo_caching(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _test_mongo_caching.clear_cache()\n    val1 = _test_mongo_caching(1, arg_2=2)\n    val2 = _test_mongo_caching(1, arg_2=2)\n    assert val1 == val2\n    val3 = _test_mongo_caching(1, arg_2=2, cachier__skip_cache=True)\n    assert val3 != val1\n    val4 = _test_mongo_caching(1, arg_2=2)\n    assert val4 == val1\n    val5 = _test_mongo_caching(1, arg_2=2, cachier__overwrite_cache=True)\n    assert val5 != val1\n    val6 = _test_mongo_caching(1, arg_2=2)\n    assert val6 == val5\n\n\nMONGO_DELTA = timedelta(seconds=3)\nMONGO_DELTA_LONG = timedelta(seconds=10)\n\n\n@pytest.mark.mongo\ndef test_mongo_stale_after():\n    \"\"\"Testing MongoDB core stale_after functionality.\"\"\"\n\n    @cachier(\n        mongetter=_test_mongetter, stale_after=MONGO_DELTA, next_time=False\n    )\n    def _stale_after_mongo(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _stale_after_mongo.clear_cache()\n    val1 = _stale_after_mongo(1, 2)\n    val2 = _stale_after_mongo(1, 2)\n    assert val1 == val2\n    sleep(3)\n    val3 = _stale_after_mongo(1, 2)\n    assert val3 != val1\n\n\ndef _calls_takes_time(res_queue):\n    @cachier(mongetter=_test_mongetter)\n    def _takes_time(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        sleep(3)\n        return random() + arg_1 + arg_2\n\n    res = _takes_time(34, 82.3)\n    res_queue.put(res)\n\n\n@pytest.mark.mongo\ndef test_mongo_being_calculated():\n    \"\"\"Testing MongoDB core handling of being calculated scenarios.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _takes_time(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        sleep(3)\n        return random() + arg_1 + arg_2\n\n    _takes_time.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread1.start()\n    sleep(1)\n    thread2.start()\n    thread1.join(timeout=4)\n    thread2.join(timeout=4)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\nclass _BadMongoCollection:\n    def __init__(self, mongetter):\n        self.collection = mongetter()\n        self.index_information = self.collection.index_information\n        self.create_indexes = self.collection.create_indexes\n        self.find_one = self.collection.find_one\n\n    def delete_many(self, *args, **kwargs):\n        pass\n\n    def update_many(self, *args, **kwargs):\n        pass\n\n    def update_one(self, *args, **kwargs):\n        raise OperationFailure(Exception())\n\n\ndef _bad_mongetter():\n    return _BadMongoCollection(_test_mongetter)\n\n\n@pytest.mark.mongo\ndef test_mongo_write_failure():\n    \"\"\"Testing MongoDB core handling of writing failure scenarios.\"\"\"\n\n    @cachier(mongetter=_bad_mongetter)\n    def _func_w_bad_mongo(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    with pytest.raises(OperationFailure):\n        val1 = _func_w_bad_mongo(1, 2)\n        val2 = _func_w_bad_mongo(1, 2)\n        assert val1 == val2\n\n\n@pytest.mark.mongo\ndef test_mongo_clear_being_calculated():\n    \"\"\"Testing MongoDB core clear_being_calculated.\"\"\"\n\n    @cachier(mongetter=_bad_mongetter)\n    def _func_w_bad_mongo(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _func_w_bad_mongo.clear_being_calculated()\n\n\n@pytest.mark.mongo\ndef test_stalled_mongo_db_cache():\n    @cachier(mongetter=_test_mongetter)\n    def _stalled_func():\n        return 1\n\n    core = _MongoCore(None, _test_mongetter, 0)\n    core.set_func(_stalled_func)\n    core.clear_cache()\n    with pytest.raises(RecalculationNeeded):\n        core.wait_on_entry_calc(key=None)\n\n\n@pytest.mark.mongo\ndef test_stalled_mong_db_core(monkeypatch):\n    def mock_get_entry(self, args, kwargs):\n        return \"key\", {\"being_calculated\": True}\n\n    def mock_get_entry_by_key(self, key):\n        return \"key\", None\n\n    monkeypatch.setattr(\n        \"cachier.cores.mongo._MongoCore.get_entry\", mock_get_entry\n    )\n    monkeypatch.setattr(\n        \"cachier.cores.mongo._MongoCore.get_entry_by_key\",\n        mock_get_entry_by_key,\n    )\n\n    @cachier(mongetter=_test_mongetter)\n    def _stalled_func():\n        return 1\n\n    res = _stalled_func()\n    assert res == 1\n\n    def mock_get_entry_2(self, args, kwargs):\n        entry = {\n            \"being_calculated\": True,\n            \"value\": 1,\n            \"time\": datetime.datetime.now() - datetime.timedelta(seconds=10),\n        }\n        return \"key\", entry\n\n    monkeypatch.setattr(\n        \"cachier.cores.mongo._MongoCore.get_entry\", mock_get_entry_2\n    )\n\n    stale_after = datetime.timedelta(seconds=1)\n\n    @cachier(mongetter=_test_mongetter, stale_after=stale_after)\n    def _stalled_func_2():\n        \"\"\"Testing stalled function.\"\"\"\n        return 2\n\n    res = _stalled_func_2()\n    assert res == 2\n\n    @cachier(\n        mongetter=_test_mongetter, stale_after=stale_after, next_time=True\n    )\n    def _stalled_func_3():\n        \"\"\"Testing stalled function.\"\"\"\n        return 3\n\n    res = _stalled_func_3()\n    assert res == 1\n\n\n@pytest.mark.mongo\ndef test_callable_hash_param():\n    def _hash_func(args, kwargs):\n        def _hash(obj):\n            if isinstance(obj, pd.core.frame.DataFrame):\n                return hashlib.sha256(\n                    pd.util.hash_pandas_object(obj).values.tobytes()\n                ).hexdigest()\n            return obj\n\n        k_args = tuple(map(_hash, args))\n        k_kwargs = tuple(\n            sorted({k: _hash(v) for k, v in kwargs.items()}.items())\n        )\n        return k_args + k_kwargs\n\n    @cachier(mongetter=_test_mongetter, hash_func=_hash_func)\n    def _params_with_dataframe(*args, **kwargs):\n        \"\"\"Some function.\"\"\"\n        return random()\n\n    _params_with_dataframe.clear_cache()\n\n    df_a = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    df_b = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    value_a = _params_with_dataframe(df_a, 1)\n    value_b = _params_with_dataframe(df_b, 1)\n\n    assert value_a == value_b  # same content --> same key\n\n    value_a = _params_with_dataframe(1, df=df_a)\n    value_b = _params_with_dataframe(1, df=df_b)\n\n    assert value_a == value_b  # same content --> same key\n"
    },
    {
      "path": "python-cachier_cachier/tests/requirements.txt",
      "content": "# todo: add some version range or pinning latest versions\n# tests and coverages\npytest\ncoverage\npytest-cov\nbirch\n# to connect to the test mongodb server\npymongo\ndnspython\npymongo-inmemory\n# to test pandas dataframe as-param hashing with mongodb core\npandas\n# to be able to run `python setup.py checkdocs`\ncollective.checkdocs\npygments\n"
    }
  ],
  "BuggyCode": [
    {
      "path": "python-cachier_cachier/MANIFEST.in",
      "content": "# Manifest syntax https://docs.python.org/2/distutils/sourcedist.html\ngraft wheelhouse\n\nrecursive-include src *.info\ninclude README.rst\ninclude LICENSE\n\nprune __pycache__\nprune tests\n"
    },
    {
      "path": "python-cachier_cachier/README.rst",
      "content": "Cachier\n#######\n\n|PyPI-Status| |Downloads| |PyPI-Versions| |Build-Status| |Codecov| |Codefactor| |LICENCE|\n\nPersistent, stale-free, local and cross-machine caching for Python functions.\n\n.. code-block:: python\n\n  from cachier import cachier\n  import datetime\n\n  @cachier(stale_after=datetime.timedelta(days=3))\n  def foo(arg1, arg2):\n    \"\"\"foo now has a persistent cache, triggering recalculation for values stored more than 3 days.\"\"\"\n    return {'arg1': arg1, 'arg2': arg2}\n\n\n.. role:: python(code)\n  :language: python\n\n.. contents::\n\n.. section-numbering:\n\n\n\nInstallation\n============\n\nInstall ``cachier`` with:\n\n.. code-block:: python\n\n    pip install cachier\n\nFor the latest version supporting Python 2.7 please use:\n\n.. code-block:: python\n\n    pip install 'cachier==1.2.8'\n\nFeatures\n========\n\n* Pure Python.\n* Compatible with Python 3.8+ (Python 2.7 was discontinued in version 1.2.8).\n* Supported and `tested on Linux, OS X and Windows <https://travis-ci.org/shaypal5/cachier>`_.\n* A simple interface.\n* Defining \"shelf life\" for cached values.\n* Local caching using pickle files.\n* Cross-machine caching using MongoDB.\n* Thread-safety.\n\nCachier is **NOT**:\n\n* Meant as a transient cache. Python's @lru_cache is better.\n* Especially fast. It is meant to replace function calls that take more than... a second, say (overhead is around 1 millisecond).\n\nFuture features\n---------------\n\n* S3 core.\n* Multi-core caching.\n* `Cache replacement policies <https://en.wikipedia.org/wiki/Cache_replacement_policies>`_\n\n\nUse\n===\n\nCachier provides a decorator which you can wrap around your functions to give them a persistent cache. The positional and keyword arguments to the wrapped function must be hashable (i.e. Python's immutable built-in objects, not mutable containers). Also, notice that since objects which are instances of user-defined classes are hashable but all compare unequal (their hash value is their id), equal objects across different sessions will not yield identical keys.\n\nSetting up a Cache\n------------------\nYou can add a default, pickle-based, persistent cache to your function - meaning it will last across different Python kernels calling the wrapped function - by decorating it with the ``cachier`` decorator (notice the ``()``!).\n\n.. code-block:: python\n\n  from cachier import cachier\n\n  @cachier()\n  def foo(arg1, arg2):\n    \"\"\"Your function now has a persistent cache mapped by argument values!\"\"\"\n    return {'arg1': arg1, 'arg2': arg2}\n\nClass and object methods can also be cached. Cachier will automatically ignore the `self` parameter when determining the cache key for an object method. **This means that methods will be cached across all instances of an object, which may not be what you want.**\n\n.. code-block:: python\n\n  from cachier import cachier\n\n  class Foo():\n    @staticmethod\n    @cachier()\n    def good_static_usage(arg_1, arg_2):\n      return arg_1 + arg_2\n\n    # Instance method does not depend on object's internal state, so good to cache\n    @cachier()\n    def good_usage_1(self, arg_1, arg_2):\n      return arg_1 + arg_2\n\n    # Instance method is calling external service, probably okay to cache\n    @cachier()\n    def good_usage_2(self, arg_1, arg_2):\n      result = self.call_api(arg_1, arg_2)\n      return result\n\n    # Instance method relies on object attribute, NOT good to cache\n    @cachier()\n    def bad_usage(self, arg_1, arg_2):\n      return arg_1 + arg_2 + self.arg_3\n\n\nResetting a Cache\n-----------------\nThe Cachier wrapper adds a ``clear_cache()`` function to each wrapped function. To reset the cache of the wrapped function simply call this method:\n\n.. code-block:: python\n\n  foo.clear_cache()\n\nGeneral Configuration\n----------------------\n\nGlobal Defaults\n~~~~~~~~~~~~~~~\n\nSettings can be globally configured across all Cachier wrappers through the use of the `set_default_params` function. This function takes the same keyword parameters as the ones defined in the decorator, which can be passed all at once or with multiple calls. Parameters given directly to a decorator take precedence over any values set by this function.\n\nThe following parameters will only be applied to decorators defined after `set_default_params` is called:\n\n*  `hash_func`\n*  `backend`\n*  `mongetter`\n*  `cache_dir`\n*  `pickle_reload`\n*  `separate_files`\n\nThese parameters can be changed at any time and they will apply to all decorators:\n\n*  `allow_none`\n*  `caching_enabled`\n*  `stale_after`\n*  `next_time`\n*  `wait_for_calc_timeout`\n\nThe current defaults can be fetched by calling `get_default_params`.\n\nThreads Limit\n~~~~~~~~~~~~~\n\nTo limit the number of threads Cachier is allowed to spawn, set the ``CACHIER_MAX_WORKERS`` with the desired number. The default is 8, so to enable Cachier to spawn even more threads, you'll have to set a higher limit explicitly.\n\n\nGlobal Enable/Disable\n---------------------\n\nCaching can be turned off across all decorators by calling `disable_caching`, and then re-activated by calling `enable_caching`.\n\nThese functions are convenience wrappers around the `caching_enabled` default setting.\n\n\nCache Shelf Life\n----------------\n\nSetting Shelf Life\n~~~~~~~~~~~~~~~~~~\nYou can set any duration as the shelf life of cached return values of a function by providing a corresponding ``timedelta`` object to the ``stale_after`` parameter:\n\n.. code-block:: python\n\n  import datetime\n\n  @cachier(stale_after=datetime.timedelta(weeks=2))\n  def bar(arg1, arg2):\n    return {'arg1': arg1, 'arg2': arg2}\n\nNow when a cached value matching the given arguments is found the time of its calculation is checked; if more than ``stale_after`` time has since passed, the function will be run again for the same arguments and the new value will be cached and returned.\n\nThis is useful for lengthy calculations that depend on a dynamic data source.\n\nFuzzy Shelf Life\n~~~~~~~~~~~~~~~~\nSometimes you may want your function to trigger a calculation when it encounters a stale result, but still not wait on it if it's not that critical. In that case, you can set ``next_time`` to ``True`` to have your function trigger a recalculation **in a separate thread**, but return the currently cached stale value:\n\n.. code-block:: python\n\n  @cachier(next_time=True)\n\nFurther function calls made while the calculation is being performed will not trigger redundant calculations.\n\n\nWorking with unhashable arguments\n---------------------------------\n\nAs mentioned above, the positional and keyword arguments to the wrapped function must be hashable (i.e. Python's immutable built-in objects, not mutable containers). To get around this limitation the ``hash_func`` parameter of the ``cachier`` decorator can be provided with a callable that gets the args and kwargs from the decorated function and returns a hash key for them.\n\n.. code-block:: python\n\n  def calculate_hash(args, kwds):\n    key = ...  # compute a hash key here based on arguments\n    return key\n\n  @cachier(hash_func=calculate_hash)\n  def calculate_super_complex_stuff(custom_obj):\n    # amazing code goes here\n\nSee here for an example:\n\n`Question: How to work with unhashable arguments <https://github.com/python-cachier/cachier/issues/91>`_\n\n\nPrecaching values\n---------------------------------\n\nIf you want to load a value into the cache without calling the underlying function, this can be done with the `precache_value` function.\n\n.. code-block:: python\n\n  @cachier()\n  def add(arg1, arg2):\n    return arg1 + arg2\n\n  add.precache_value(2, 2, value_to_cache=5)\n\n  result = add(2, 2)\n  print(result)  # prints 5\n\n\nPer-function call arguments\n---------------------------\n\nCachier also accepts several keyword arguments in the calls of the function it wraps rather than in the decorator call, allowing you to modify its behaviour for a specific function call.\n\nIgnore Cache\n~~~~~~~~~~~~\n\nYou can have ``cachier`` ignore any existing cache for a specific function call by passing ``cachier__skip_cache=True`` to the function call. The cache will neither be checked nor updated with the new return value.\n\n.. code-block:: python\n\n  @cachier()\n  def sum(first_num, second_num):\n    return first_num + second_num\n\n  def main():\n    print(sum(5, 3, cachier__skip_cache=True))\n\nOverwrite Cache\n~~~~~~~~~~~~~~~\n\nYou can have ``cachier`` overwrite an existing cache entry - if one exists - for a specific function call by passing ``cachier__overwrite_cache=True`` to the function call. The cache will not be checked but will be updated with the new return value.\n\nVerbose Cache Call\n~~~~~~~~~~~~~~~~~~\n\nYou can have ``cachier`` print out a detailed explanation of the logic of a specific call by passing ``cachier__verbose=True`` to the function call. This can be useful if you are not sure why a certain function result is, or is not, returned.\n\nCache `None` Values\n~~~~~~~~~~~~~~~~~~~\n\nBy default, ``cachier`` does not cache ``None`` values. You can override this behaviour by passing ``allow_none=True`` to the function call.\n\n\nCachier Cores\n=============\n\nPickle Core\n-----------\n\nThe default core for Cachier is pickle based, meaning each function will store its cache is a separate pickle file in the ``~/.cachier`` directory. Naturally, this kind of cache is both machine-specific and user-specific.\n\nYou can configure ``cachier`` to use another directory by providing the ``cache_dir`` parameter with the path to that directory:\n\n.. code-block:: python\n\n  @cachier(cache_dir='~/.temp/.cache')\n\n\nYou can slightly optimise pickle-based caching if you know your code will only be used in a single thread environment by setting:\n\n.. code-block:: python\n\n  @cachier(pickle_reload=False)\n\nThis will prevent reading the cache file on each cache read, speeding things up a bit, while also nullifying inter-thread functionality (the code is still thread safe, but different threads will have different versions of the cache at times, and will sometime make unnecessary function calls).\n\nSetting the optional argument ``separate_files`` to ``True`` will cause the cache to be stored in several files: A file per argument set, per function. This can help if your per-function cache files become too large.\n\n.. code-block:: python\n\n  from cachier import cachier\n\n  @cachier(separate_files=True)\n  def foo(arg1, arg2):\n    \"\"\"Your function now has a persistent cache mapped by argument values, split across several files, per argument set\"\"\"\n    return {'arg1': arg1, 'arg2': arg2}\n\nYou can get the fully qualified path to the directory of cache files used by ``cachier`` (``~/.cachier`` by default) by calling the ``cache_dpath()`` function:\n\n.. code-block:: python\n\n  >>> foo.cache_dpath()\n      \"/home/user/.cachier/\"\n\n\nMongoDB Core\n------------\nYou can set a MongoDB-based cache by assigning ``mongetter`` with a callable that returns a ``pymongo.Collection`` object with writing permissions:\n\n.. code-block:: python\n\n    from pymongo import MongoClient\n\n    def my_mongetter():\n        client = MongoClient(get_cachier_db_auth_uri())\n        db_obj = client['cachier_db']\n        if 'someapp_cachier_db' not in db_obj.list_collection_names():\n            db_obj.create_collection('someapp_cachier_db')\n        return db_obj['someapp_cachier_db']\n\n  @cachier(mongetter=my_mongetter)\n\nThis allows you to have a cross-machine, albeit slower, cache. This functionality requires that the installation of the ``pymongo`` python package.\n\nIn certain cases the MongoDB backend might leave a deadlock behind, blocking all subsequent requests from being processed. If you encounter this issue, supply the ``wait_for_calc_timeout`` with a reasonable number of seconds; calls will then wait at most this number of seconds before triggering a recalculation.\n\n.. code-block:: python\n\n  @cachier(mongetter=False, wait_for_calc_timeout=2)\n\n\nMemory Core\n-----------\n\nYou can set an in-memory cache by assigning the ``backend`` parameter with ``'memory'``:\n\n.. code-block:: python\n\n  @cachier(backend='memory')\n\nNote, however, that ``cachier``'s in-memory core is simple, and has no monitoring or cap on cache size, and can thus lead to memory errors on large return values - it is mainly intended to be used with future multi-core functionality. As a rule, Python's built-in ``lru_cache`` is a much better stand-alone solution.\n\n\nContributing\n============\n\nCurrent maintainers are Shay Palachy Affek (`shay.palachy@gmail.com <mailto:shay.palachy@gmail.com>`_, `@shaypal5 <https://github.com/shaypal5>`_) and Judson Neer (`@lordjabez <https://github.com/lordjabez>`_); You are more than welcome to approach them for help. Contributions are very welcomed! :)\n\nInstalling for development\n--------------------------\n\nClone:\n\n.. code-block:: bash\n\n  git clone git@github.com:python-cachier/cachier.git\n\n\nInstall in development mode with test dependencies:\n\n.. code-block:: bash\n\n  cd cachier\n  pip install -e . -r tests/requirements.txt\n\n\nRunning the tests\n-----------------\n\nTo run the tests, call the ``pytest`` command in the repository's root, or:\n\n.. code-block:: bash\n\n  python -m pytest\n\nTo run only MongoDB core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m mongo\n\nTo run only memory core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m memory\n\nTo run all tests EXCEPT MongoDB core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m \"not mongo\"\n\n\nTo run all tests EXCEPT memory core AND MongoDB core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m \"not (mongo or memory)\"\n\n\nRunning MongoDB tests against a live MongoDB instance\n-----------------------------------------------------\n\n**Note to developers:** By default, all MongoDB tests are run against a mocked MongoDB instance, provided by the ``pymongo_inmemory`` package. To run them against a live MongoDB instance, the ``CACHIER_TEST_VS_DOCKERIZED_MONGO`` environment variable is set to ``True`` in the ``test`` environment of this repository (and additional environment variables are populated with the appropriate credentials), used by the GitHub Action running tests on every commit and pull request.\n\nContributors are not expected to run these tests against a live MongoDB instance when developing, as credentials for the testing instance used will NOT be shared, but rather use the testing against the in-memory MongoDB instance as a good proxy.\n\n**HOWEVER, the tests run against a live MongoDB instance when you submit a PR are the determining tests for deciding whether your code functions correctly against MongoDB.**\n\n\nAdding documentation\n--------------------\n\nThis project is documented using the `numpy docstring conventions`_, which were chosen as they are perhaps the most widely-spread conventions that are both supported by common tools such as Sphinx and result in human-readable docstrings (in my personal opinion, of course). When documenting code you add to this project, please follow `these conventions`_.\n\n.. _`numpy docstring conventions`: https://github.com/numpy/numpy/blob/master/doc/HOWTO_DOCUMENT.rst.txt\n.. _`these conventions`: https://github.com/numpy/numpy/blob/master/doc/HOWTO_DOCUMENT.rst.txt\n\nAdditionally, if you update this ``README.rst`` file, use ``python setup.py checkdocs`` to validate it compiles.\n\n\nCredits\n=======\n\nCreated by `Shay Palachy Affek <https://github.com/shaypal5>`_ (shay.palachy@gmail.com), which currently assists in maintenance.\n\nCurrent lead developer/contributor: `Jirka Borovec <https://github.com/Borda>`_ (`@Borda <https://github.com/Borda>`_ on GitHub).\n\nOther major contributors:\n\n* `Jirka Borovec <https://github.com/Borda>`_ - Arg order independence, args-to-kwargs for less unique keys and numerous development and CI contributions.\n\n* `Judson Neer <https://github.com/lordjabez>`_ - Precaching, method caching support and numerous improvements and bugfixes.\n\n* `cthoyt <https://github.com/cthoyt>`_ - Base memory core implementation.\n\n* `amarczew <https://github.com/amarczew>`_ - The ``hash_func`` kwarg.\n\n* `non-senses <https://github.com/non-senses>`_ - The ``wait_for_calc_timeout`` kwarg.\n\n* `Elad Rapaport <https://github.com/erap129>`_ - Multi-file Pickle core, a.k.a ``separate_files`` (released on ``v1.5.3``).\n\n* `John Didion <https://github.com/jdidion>`_ - Support for pickle-based caching for cases where two identically-named methods of different classes are defined in the same module.\n\nNotable bugfixers:\n\n* `MichaelRazum <https://github.com/MichaelRazum>`_.\n\n* `Eric Ma <https://github.com/ericmjl>`_ - The iNotify bugfix (released on ``v1.5.3``).\n\n* `Ofir <https://github.com/ofirnk>`_ - The iNotify bugfix (released on ``v1.5.3``).\n\n\n\n.. |PyPI-Status| image:: https://img.shields.io/pypi/v/cachier.svg\n  :target: https://pypi.python.org/pypi/cachier\n\n.. |PyPI-Versions| image:: https://img.shields.io/pypi/pyversions/cachier.svg\n   :target: https://pypi.python.org/pypi/cachier\n\n.. |Build-Status| image:: https://github.com/python-cachier/cachier/actions/workflows/test.yml/badge.svg\n  :target: https://github.com/python-cachier/cachier/actions/workflows/test.yml\n\n.. |LICENCE| image:: https://img.shields.io/pypi/l/cachier.svg\n  :target: https://pypi.python.org/pypi/cachier\n\n.. |Codecov| image:: https://codecov.io/github/python-cachier/cachier/coverage.svg?branch=master\n   :target: https://codecov.io/github/python-cachier/cachier?branch=master\n\n.. |Downloads| image:: https://pepy.tech/badge/cachier\n     :target: https://pepy.tech/project/cachier\n     :alt: PePy stats\n\n.. |Codefactor| image:: https://www.codefactor.io/repository/github/python-cachier/cachier/badge?style=plastic\n     :target: https://www.codefactor.io/repository/github/python-cachier/cachier\n     :alt: Codefactor code quality\n\n.. links:\n.. _pymongo: https://api.mongodb.com/python/current/\n.. _watchdog: https://github.com/gorakhargosh/watchdog\n"
    },
    {
      "path": "python-cachier_cachier/LICENSE",
      "content": "MIT License\n\nCopyright (c) 2016 Shay Palachy\n\nPermission is hereby granted, free of charge, to any person obtaining a copy\nof this software and associated documentation files (the \"Software\"), to deal\nin the Software without restriction, including without limitation the rights\nto use, copy, modify, merge, publish, distribute, sublicense, and/or sell\ncopies of the Software, and to permit persons to whom the Software is\nfurnished to do so, subject to the following conditions:\n\nThe above copyright notice and this permission notice shall be included in all\ncopies or substantial portions of the Software.\n\nTHE SOFTWARE IS PROVIDED \"AS IS\", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR\nIMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,\nFITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE\nAUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER\nLIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,\nOUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE\nSOFTWARE.\n"
    },
    {
      "path": "python-cachier_cachier/pyproject.toml",
      "content": "[project]\nname = \"cachier\"\ndescription = \"Persistent, stale-free, local and cross-machine caching for Python functions.\"\nreadme = \"README.rst\"\nkeywords = [\n  \"cache\",\n  \"caching\",\n  \"cross-machine\",\n  \"decorator\",\n  \"local\",\n  \"memoization\",\n  \"mongo\",\n  \"persistent\",\n]\nlicense = {file = \"LICENSE\"}\nauthors = [\n   {name = \"Shay Palachy Affek\", email = 'shay.palachy@gmail.com'},\n]\nclassifiers = [\n  'Development Status :: 4 - Beta',\n  'Intended Audience :: Developers',\n  'License :: OSI Approved :: MIT License',\n  'Programming Language :: Python',\n  \"Programming Language :: Python :: 3 :: Only\",\n  \"Programming Language :: Python :: 3.8\",\n  \"Programming Language :: Python :: 3.9\",\n  \"Programming Language :: Python :: 3.10\",\n  \"Programming Language :: Python :: 3.11\",\n  \"Programming Language :: Python :: 3.12\",\n  \"Topic :: Other/Nonlisted Topic\",\n  \"Topic :: Software Development :: Libraries\",\n  \"Topic :: Software Development :: Libraries :: Python Modules\",\n  \"Topic :: Utilities\",\n]\ndynamic = [\n  \"version\",\n]\ndependencies = [\n  \"portalocker>=2.3.2\",\n  \"watchdog>=2.3.1\",\n]\n[project.urls]\nSource = \"https://github.com/python-cachier/cachier\"\n[project.scripts]\ncachier = \"cachier.__naim__:cli\"\n\n[tool.setuptools]\ninclude-package-data = true\n\n[tool.setuptools.dynamic]\nversion = {attr = \"cachier._version.__version__\"}\n\n[tool.setuptools.packages.find]\nwhere = [\"src\"]  # list of folders that contain the packages ([\".\"] by default)\ninclude = [\"cachier*\"]  # package names should match these glob patterns ([\"*\"] by default)\nnamespaces = false  # to disable scanning PEP 420 namespaces (true by default)\n\n[build-system]\nrequires = [\n  \"setuptools\",\n  \"wheel\",\n]\n\n[tool.pytest.ini_options]\ntestpaths = [\"cachier\", \"tests\"]\nnorecursedirs=[\"dist\", \"build\"]\naddopts = [\n    \"--color=yes\",\n    \"--cov=cachier\",\n    \"--cov-report=term\",\n    \"--cov-report=xml:cov.xml\",\n    \"-r a\",\n    \"-v\",\n    \"-s\",\n]\nmarkers = [\n    \"mongo: test the MongoDB core\",\n    \"memory: test the memory core\",\n    \"pickle: test the pickle core\"\n]\n\n[tool.docformatter]\nrecursive = true\n# some docstring start with r\"\"\"\nwrap-summaries = 79\nwrap-descriptions = 79\nblank = true\n\n[tool.ruff]\ntarget-version = \"py38\"\nline-length = 79\n# Enable Pyflakes `E` and `F` codes by default.\nlint.select = [\n    \"E\",\n    \"W\", # see: https://pypi.org/project/pycodestyle\n    \"F\", # see: https://pypi.org/project/pyflakes\n    \"I\", #see: https://pypi.org/project/isort/\n    \"D\", # see: https://pypi.org/project/pydocstyle\n    #\"N\", # see: https://pypi.org/project/pep8-naming\n    \"S\", # see: https://pypi.org/project/flake8-bandit\n    \"SIM\",\n    \"RUF100\" # alternative to yesqa\n]\nlint.extend-select = [\n    \"A\",    # see: https://pypi.org/project/flake8-builtins\n    \"B\",    # see: https://pypi.org/project/flake8-bugbear\n    \"C4\",   # see: https://pypi.org/project/flake8-comprehensions\n]\nlint.ignore = [\n    \"E203\",\n    \"C901\",\n]\n# Exclude a variety of commonly ignored directories.\nexclude = [\n    \".eggs\",\n    \".git\",\n    \".ruff_cache\",\n    \"__pypackages__\",\n    \"_build\",\n    \"build\",\n    \"dist\",\n]\nlint.ignore-init-module-imports = true\nlint.unfixable = [\"F401\"]\n\n[tool.ruff.lint.per-file-ignores]\n\"src/**/__init__.py\" = [\"D104\"]\n\"src/cachier/config.py\" = [\"D100\"]\n\"tests/**\" = [\n  \"S101\", \"S311\", \"S105\", \"S603\",\n  \"D100\", \"D101\", \"D103\", \"D104\", \"D401\"\n]\n\n#[tool.ruff.pydocstyle]\n## Use Google-style docstrings.\n#convention = \"google\"\n\n#[tool.ruff.pycodestyle]\n#ignore-overlong-task-comments = true\n\n[tool.ruff.lint.mccabe]\n# Unlike Flake8, default to a complexity level of 10.\nmax-complexity = 10\n\n[tool.coverage.run]\nbranch = true\ndynamic_context = \"test_function\"\nomit = [\n    \"tests/*\",\n    \"cachier/_version.py\",\n    \"cachier/__init__.py\",\n    \"**/scripts/**\",\n]\n[tool.coverage.report]\nshow_missing = true\n# Regexes for lines to exclude from consideration\nexclude_lines = [\n    \"pragma: no cover\", # Have to re-enable the standard pragma\n    \"raise NotImplementedError\", # Don't complain if tests don't hit defensive assertion code:\n    \"if TYPE_CHECKING:\", # Is only true when running mypy, not tests\n]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/__init__.py",
      "content": "from ._version import *  # noqa: F403\nfrom .config import (\n    disable_caching,\n    enable_caching,\n    get_default_params,\n    set_default_params,\n)\nfrom .core import cachier\n\n__all__ = [\n    \"cachier\",\n    \"set_default_params\",\n    \"get_default_params\",\n    \"enable_caching\",\n    \"disable_caching\",\n]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/config.py",
      "content": "import datetime\nimport hashlib\nimport os\nimport pickle\nfrom typing import Optional, TypedDict, Union\n\nfrom ._types import Backend, HashFunc, Mongetter\n\n\ndef _default_hash_func(args, kwds):\n    # Sort the kwargs to ensure consistent ordering\n    sorted_kwargs = sorted(kwds.items())\n    # Serialize args and sorted_kwargs using pickle or similar\n    serialized = pickle.dumps((args, sorted_kwargs))\n    # Create a hash of the serialized data\n    return hashlib.sha256(serialized).hexdigest()\n\n\nclass Params(TypedDict):\n    \"\"\"Type definition for cachier parameters.\"\"\"\n\n    caching_enabled: bool\n    hash_func: HashFunc\n    backend: Backend\n    mongetter: Optional[Mongetter]\n    stale_after: datetime.timedelta\n    next_time: bool\n    cache_dir: Union[str, os.PathLike]\n    pickle_reload: bool\n    separate_files: bool\n    wait_for_calc_timeout: int\n    allow_none: bool\n\n\n_default_params: Params = {\n    \"caching_enabled\": True,\n    \"hash_func\": _default_hash_func,\n    \"backend\": \"pickle\",\n    \"mongetter\": None,\n    \"stale_after\": datetime.timedelta.max,\n    \"next_time\": False,\n    \"cache_dir\": \"~/.cachier/\",\n    \"pickle_reload\": True,\n    \"separate_files\": False,\n    \"wait_for_calc_timeout\": 0,\n    \"allow_none\": False,\n}\n\n\ndef _update_with_defaults(\n    param, name: str, func_kwargs: Optional[dict] = None\n):\n    import cachier\n\n    if func_kwargs:\n        kw_name = f\"cachier__{name}\"\n        if kw_name in func_kwargs:\n            return func_kwargs.pop(kw_name)\n    if param is None:\n        return cachier.config._default_params[name]\n    return param\n\n\ndef set_default_params(**params):\n    \"\"\"Configure global parameters applicable to all memoized functions.\n\n    This function takes the same keyword parameters as the ones defined in the\n    decorator, which can be passed all at once or with multiple calls.\n    Parameters given directly to a decorator take precedence over any values\n    set by this function.\n\n    Only 'stale_after', 'next_time', and 'wait_for_calc_timeout' can be changed\n    after the memoization decorator has been applied. Other parameters will\n    only have an effect on decorators applied after this function is run.\n\n    \"\"\"\n    import cachier\n\n    valid_params = (\n        p for p in params.items() if p[0] in cachier.config._default_params\n    )\n    _default_params.update(valid_params)\n\n\ndef get_default_params():\n    \"\"\"Get current set of default parameters.\"\"\"\n    import cachier\n\n    return cachier.config._default_params\n\n\ndef enable_caching():\n    \"\"\"Enable caching globally.\"\"\"\n    import cachier\n\n    cachier.config._default_params[\"caching_enabled\"] = True\n\n\ndef disable_caching():\n    \"\"\"Disable caching globally.\"\"\"\n    import cachier\n\n    cachier.config._default_params[\"caching_enabled\"] = False\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/_version.py",
      "content": "# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2024, Jirka Borovec <***@gmail.com>\n\nimport os\n\n_PATH_HERE = os.path.dirname(__file__)\n_PATH_VERSION = os.path.join(_PATH_HERE, \"version.info\")\n_RELEASING_PROCESS = os.getenv(\"RELEASING_PROCESS\", \"0\") == \"1\"\n\nwith open(_PATH_VERSION) as fopen:\n    __version__ = fopen.read().strip()\n\n\ndef _get_git_sha() -> str:\n    from subprocess import DEVNULL, check_output\n\n    out = check_output([\"git\", \"rev-parse\", \"--short\", \"HEAD\"], stderr=DEVNULL)  # noqa: S603, S607\n    return out.decode(\"utf-8\").strip()\n\n\nif not _RELEASING_PROCESS:\n    try:\n        sha_short = _get_git_sha()\n        # print(f\"Version enriched with git commit hash: {__version__}.\")\n    except Exception:\n        # print(\"Failed to get the git commit hash,\"\n        #       f\" falling back to base version {__version__}.\")\n        sha_short = \"\"\n    __version__ += f\".dev+{sha_short}\" if sha_short else \".dev\"\n\n\n__all__ = [\"__version__\"]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/version.info",
      "content": "3.0.0\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/py.typed",
      "content": ""
    },
    {
      "path": "python-cachier_cachier/src/cachier/_types.py",
      "content": "from typing import TYPE_CHECKING, Callable, Literal\n\nif TYPE_CHECKING:\n    import pymongo.collection\n\n\nHashFunc = Callable[..., str]\nMongetter = Callable[[], \"pymongo.collection.Collection\"]\nBackend = Literal[\"pickle\", \"mongo\", \"memory\"]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/__main__.py",
      "content": "\"\"\"A command-line interface for cachier.\"\"\"\n\nimport click\n\nfrom cachier.core import _set_max_workers\n\n\n@click.group()\ndef cli():\n    \"\"\"A command-line interface for cachier.\"\"\"  # noqa: D401\n\n\n@cli.command(\"Limits the number of worker threads used by cachier.\")\n@click.argument(\"max_workers\", nargs=1, type=int)\ndef set_max_workers(max_workers):\n    \"\"\"Limits the number of worker threads used by cachier.\"\"\"\n    _set_max_workers(max_workers)\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/core.py",
      "content": "\"\"\"Persistent, stale-free memoization decorators for Python.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\nimport datetime\nimport inspect\nimport os\nimport warnings\nfrom collections import OrderedDict\nfrom concurrent.futures import ThreadPoolExecutor\nfrom functools import wraps\nfrom typing import Optional, Union\nfrom warnings import warn\n\nfrom .config import (\n    Backend,\n    HashFunc,\n    Mongetter,\n    _default_params,\n    _update_with_defaults,\n)\nfrom .cores.base import RecalculationNeeded, _BaseCore\nfrom .cores.memory import _MemoryCore\nfrom .cores.mongo import _MongoCore\nfrom .cores.pickle import _PickleCore\n\nMAX_WORKERS_ENVAR_NAME = \"CACHIER_MAX_WORKERS\"\nDEFAULT_MAX_WORKERS = 8\n\n\ndef _max_workers():\n    return int(os.environ.get(MAX_WORKERS_ENVAR_NAME, DEFAULT_MAX_WORKERS))\n\n\ndef _set_max_workers(max_workers):\n    os.environ[MAX_WORKERS_ENVAR_NAME] = str(max_workers)\n    _get_executor(True)\n\n\ndef _get_executor(reset=False):\n    if reset or not hasattr(_get_executor, \"executor\"):\n        _get_executor.executor = ThreadPoolExecutor(_max_workers())\n    return _get_executor.executor\n\n\ndef _function_thread(core, key, func, args, kwds):\n    try:\n        func_res = func(*args, **kwds)\n        core.set_entry(key, func_res)\n    except BaseException as exc:\n        print(f\"Function call failed with the following exception:\\n{exc}\")\n\n\ndef _calc_entry(core, key, func, args, kwds):\n    try:\n        core.mark_entry_being_calculated(key)\n        # _get_executor().submit(core.mark_entry_being_calculated, key)\n        func_res = func(*args, **kwds)\n        core.set_entry(key, func_res)\n        # _get_executor().submit(core.set_entry, key, func_res)\n        return func_res\n    finally:\n        core.mark_entry_not_calculated(key)\n\n\ndef _convert_args_kwargs(\n    func, _is_method: bool, args: tuple, kwds: dict\n) -> dict:\n    \"\"\"Convert mix of positional and keyword arguments to aggregated kwargs.\"\"\"\n    # unwrap if the function is functools.partial\n    if hasattr(func, \"func\"):\n        args = func.args + args\n        kwds.update({k: v for k, v in func.keywords.items() if k not in kwds})\n        func = func.func\n    func_params = list(inspect.signature(func).parameters)\n    args_as_kw = dict(\n        zip(func_params[1:], args[1:])\n        if _is_method\n        else zip(func_params, args)\n    )\n    # init with default values\n    kwargs = {\n        k: v.default\n        for k, v in inspect.signature(func).parameters.items()\n        if v.default is not inspect.Parameter.empty\n    }\n    # merge args expanded as kwargs and the original kwds\n    kwargs.update(dict(**args_as_kw, **kwds))\n    return OrderedDict(sorted(kwargs.items()))\n\n\ndef _pop_kwds_with_deprecation(kwds, name: str, default_value: bool):\n    if name in kwds:\n        warnings.warn(\n            f\"`{name}` is deprecated and will be removed in a future release,\"\n            \" use `cachier__` alternative instead.\",\n            DeprecationWarning,\n            stacklevel=2,\n        )\n    return kwds.pop(name, default_value)\n\n\ndef cachier(\n    hash_func: Optional[HashFunc] = None,\n    hash_params: Optional[HashFunc] = None,\n    backend: Optional[Backend] = None,\n    mongetter: Optional[Mongetter] = None,\n    stale_after: Optional[datetime.timedelta] = None,\n    next_time: Optional[bool] = None,\n    cache_dir: Optional[Union[str, os.PathLike]] = None,\n    pickle_reload: Optional[bool] = None,\n    separate_files: Optional[bool] = None,\n    wait_for_calc_timeout: Optional[int] = None,\n    allow_none: Optional[bool] = None,\n):\n    \"\"\"Wrap as a persistent, stale-free memoization decorator.\n\n    The positional and keyword arguments to the wrapped function must be\n    hashable (i.e. Python's immutable built-in objects, not mutable\n    containers). Also, notice that since objects which are instances of\n    user-defined classes are hashable but all compare unequal (their hash\n    value is their id), equal objects across different sessions will not yield\n    identical keys.\n\n    Arguments:\n    ---------\n    hash_func : callable, optional\n        A callable that gets the args and kwargs from the decorated function\n        and returns a hash key for them. This parameter can be used to enable\n        the use of cachier with functions that get arguments that are not\n        automatically hashable by Python.\n    hash_params : callable, optional\n    backend : str, optional\n        The name of the backend to use. Valid options currently include\n        'pickle', 'mongo' and 'memory'. If not provided, defaults to\n        'pickle' unless the 'mongetter' argument is passed, in which\n        case the mongo backend is automatically selected.\n    mongetter : callable, optional\n        A callable that takes no arguments and returns a pymongo.Collection\n        object with writing permissions. If unset a local pickle cache is used\n        instead.\n    stale_after : datetime.timedelta, optional\n        The time delta after which a cached result is considered stale. Calls\n        made after the result goes stale will trigger a recalculation of the\n        result, but whether a stale or fresh result will be returned is\n        determined by the optional next_time argument.\n    next_time : bool, optional\n        If set to True, a stale result will be returned when finding one, not\n        waiting for the calculation of the fresh result to return. Defaults to\n        False.\n    cache_dir : str, optional\n        A fully qualified path to a file directory to be used for cache files.\n        The running process must have running permissions to this folder. If\n        not provided, a default directory at `~/.cachier/` is used.\n    pickle_reload : bool, optional\n        If set to True, in-memory cache will be reloaded on each cache read,\n        enabling different threads to share cache. Should be set to False for\n        faster reads in single-thread programs. Defaults to True.\n    separate_files: bool, default False, for Pickle cores only\n        Instead of a single cache file per-function, each function's cache is\n        split between several files, one for each argument set. This can help\n        if you per-function cache files become too large.\n    wait_for_calc_timeout: int, optional, for MongoDB only\n        The maximum time to wait for an ongoing calculation. When a\n        process started to calculate the value setting being_calculated to\n        True, any process trying to read the same entry will wait a maximum of\n        seconds specified in this parameter. 0 means wait forever.\n        Once the timeout expires the calculation will be triggered.\n    allow_none: bool, optional\n        Allows storing None values in the cache. If False, functions returning\n        None will not be cached and are recalculated every call.\n\n    \"\"\"\n    # Check for deprecated parameters\n    if hash_params is not None:\n        message = (\n            \"hash_params will be removed in a future release, \"\n            \"please use hash_func instead\"\n        )\n        warn(message, DeprecationWarning, stacklevel=2)\n        hash_func = hash_params\n    # Update parameters with defaults if input is None\n    backend = _update_with_defaults(backend, \"backend\")\n    mongetter = _update_with_defaults(mongetter, \"mongetter\")\n    # Override the backend parameter if a mongetter is provided.\n    if callable(mongetter):\n        backend = \"mongo\"\n    core: _BaseCore\n    if backend == \"pickle\":\n        core = _PickleCore(\n            hash_func=hash_func,\n            pickle_reload=pickle_reload,\n            cache_dir=cache_dir,\n            separate_files=separate_files,\n            wait_for_calc_timeout=wait_for_calc_timeout,\n        )\n    elif backend == \"mongo\":\n        core = _MongoCore(\n            hash_func=hash_func,\n            mongetter=mongetter,\n            wait_for_calc_timeout=wait_for_calc_timeout,\n        )\n    elif backend == \"memory\":\n        core = _MemoryCore(\n            hash_func=hash_func, wait_for_calc_timeout=wait_for_calc_timeout\n        )\n    else:\n        raise ValueError(\"specified an invalid core: %s\" % backend)\n\n    def _cachier_decorator(func):\n        core.set_func(func)\n\n        @wraps(func)\n        def func_wrapper(*args, **kwds):\n            nonlocal allow_none\n            _allow_none = _update_with_defaults(allow_none, \"allow_none\", kwds)\n            # print('Inside general wrapper for {}.'.format(func.__name__))\n            ignore_cache = _pop_kwds_with_deprecation(\n                kwds, \"ignore_cache\", False\n            )\n            overwrite_cache = _pop_kwds_with_deprecation(\n                kwds, \"overwrite_cache\", False\n            )\n            verbose = _pop_kwds_with_deprecation(kwds, \"verbose_cache\", False)\n            ignore_cache = kwds.pop(\"cachier__skip_cache\", ignore_cache)\n            overwrite_cache = kwds.pop(\n                \"cachier__overwrite_cache\", overwrite_cache\n            )\n            verbose = kwds.pop(\"cachier__verbose\", verbose)\n            _stale_after = _update_with_defaults(\n                stale_after, \"stale_after\", kwds\n            )\n            _next_time = _update_with_defaults(next_time, \"next_time\", kwds)\n            # merge args expanded as kwargs and the original kwds\n            kwargs = _convert_args_kwargs(\n                func, _is_method=core.func_is_method, args=args, kwds=kwds\n            )\n\n            _print = lambda x: None  # noqa: E731\n            if verbose:\n                _print = print\n            if ignore_cache or not _default_params[\"caching_enabled\"]:\n                return func(**kwargs)\n            key, entry = core.get_entry((), kwargs)\n            if overwrite_cache:\n                return _calc_entry(core, key, func, args, kwds)\n            if entry is None:\n                _print(\"No entry found. No current calc. Calling like a boss.\")\n                return _calc_entry(core, key, func, args, kwds)\n            _print(\"Entry found.\")\n            if _allow_none or entry.get(\"value\", None) is not None:\n                _print(\"Cached result found.\")\n                now = datetime.datetime.now()\n                if now - entry[\"time\"] <= _stale_after:\n                    _print(\"And it is fresh!\")\n                    return entry[\"value\"]\n                _print(\"But it is stale... :(\")\n                if entry[\"being_calculated\"]:\n                    if _next_time:\n                        _print(\"Returning stale.\")\n                        return entry[\"value\"]  # return stale val\n                    _print(\"Already calc. Waiting on change.\")\n                    try:\n                        return core.wait_on_entry_calc(key)\n                    except RecalculationNeeded:\n                        return _calc_entry(core, key, func, args, kwds)\n                if _next_time:\n                    _print(\"Async calc and return stale\")\n                    try:\n                        core.mark_entry_being_calculated(key)\n                        _get_executor().submit(\n                            _function_thread, core, key, func, args, kwds\n                        )\n                    finally:\n                        core.mark_entry_not_calculated(key)\n                    return entry[\"value\"]\n                _print(\"Calling decorated function and waiting\")\n                return _calc_entry(core, key, func, args, kwds)\n            if entry[\"being_calculated\"]:\n                _print(\"No value but being calculated. Waiting.\")\n                try:\n                    return core.wait_on_entry_calc(key)\n                except RecalculationNeeded:\n                    return _calc_entry(core, key, func, args, kwds)\n            _print(\"No entry found. No current calc. Calling like a boss.\")\n            return _calc_entry(core, key, func, args, kwds)\n\n        def _clear_cache():\n            \"\"\"Clear the cache.\"\"\"\n            core.clear_cache()\n\n        def _clear_being_calculated():\n            \"\"\"Mark all entries in this cache as not being calculated.\"\"\"\n            core.clear_being_calculated()\n\n        def _cache_dpath():\n            \"\"\"Return the path to the cache dir, if exists; None if not.\"\"\"\n            return getattr(core, \"cache_dir\", None)\n\n        def _precache_value(*args, value_to_cache, **kwds):  # noqa: D417\n            \"\"\"Add an initial value to the cache.\n\n            Arguments:\n            ---------\n            value_to_cache : any\n                entry to be written into the cache\n\n            \"\"\"\n            # merge args expanded as kwargs and the original kwds\n            kwargs = _convert_args_kwargs(\n                func, _is_method=core.func_is_method, args=args, kwds=kwds\n            )\n            return core.precache_value((), kwargs, value_to_cache)\n\n        func_wrapper.clear_cache = _clear_cache\n        func_wrapper.clear_being_calculated = _clear_being_calculated\n        func_wrapper.cache_dpath = _cache_dpath\n        func_wrapper.precache_value = _precache_value\n        return func_wrapper\n\n    return _cachier_decorator\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/__init__.py",
      "content": ""
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/base.py",
      "content": "\"\"\"Defines the interface of a cachier caching core.\"\"\"\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\nimport abc  # for the _BaseCore abstract base class\nimport inspect\nimport threading\nfrom typing import Callable\n\nfrom .._types import HashFunc\nfrom ..config import _update_with_defaults\n\n\nclass RecalculationNeeded(Exception):\n    \"\"\"Exception raised when a recalculation is needed.\"\"\"\n\n    pass\n\n\ndef _get_func_str(func: Callable) -> str:\n    return f\".{func.__module__}.{func.__name__}\"\n\n\nclass _BaseCore:\n    __metaclass__ = abc.ABCMeta\n\n    def __init__(self, hash_func: HashFunc, wait_for_calc_timeout: int):\n        self.hash_func = _update_with_defaults(hash_func, \"hash_func\")\n        self.wait_for_calc_timeout = wait_for_calc_timeout\n        self.lock = threading.RLock()\n\n    def set_func(self, func):\n        \"\"\"Set the function this core will use.\n\n        This has to be set before any method is called. Also determine if the\n        function is an object method.\n\n        \"\"\"\n        # unwrap if the function is functools.partial\n        if hasattr(func, \"func\"):\n            func = func.func\n        func_params = list(inspect.signature(func).parameters)\n        self.func_is_method = func_params and func_params[0] == \"self\"\n        self.func = func\n\n    def get_key(self, args, kwds):\n        \"\"\"Return a unique key based on the arguments provided.\"\"\"\n        return self.hash_func(args, kwds)\n\n    def get_entry(self, args, kwds):\n        \"\"\"Get entry based on given arguments.\n\n        Return the result mapped to the given arguments in this core's cache,\n        if such a mapping exists.\n\n        \"\"\"\n        key = self.get_key(args, kwds)\n        return self.get_entry_by_key(key)\n\n    def precache_value(self, args, kwds, value_to_cache):\n        \"\"\"Write a precomputed value into the cache.\"\"\"\n        key = self.get_key(args, kwds)\n        self.set_entry(key, value_to_cache)\n        return value_to_cache\n\n    def check_calc_timeout(self, time_spent):\n        \"\"\"Raise an exception if a recalculation is needed.\"\"\"\n        calc_timeout = _update_with_defaults(\n            self.wait_for_calc_timeout, \"wait_for_calc_timeout\"\n        )\n        if calc_timeout > 0 and (time_spent >= calc_timeout):\n            raise RecalculationNeeded()\n\n    @abc.abstractmethod\n    def get_entry_by_key(self, key):\n        \"\"\"Get entry based on given key.\n\n        Return the result mapped to the given key in this core's cache, if such\n        a mapping exists.\n\n        \"\"\"\n\n    @abc.abstractmethod\n    def set_entry(self, key, func_res):\n        \"\"\"Map the given result to the given key in this core's cache.\"\"\"\n\n    @abc.abstractmethod\n    def mark_entry_being_calculated(self, key):\n        \"\"\"Mark the entry mapped by the given key as being calculated.\"\"\"\n\n    @abc.abstractmethod\n    def mark_entry_not_calculated(self, key):\n        \"\"\"Mark the entry mapped by the given key as not being calculated.\"\"\"\n\n    @abc.abstractmethod\n    def wait_on_entry_calc(self, key):\n        \"\"\"Wait on the entry with keys being calculated and returns result.\"\"\"\n\n    @abc.abstractmethod\n    def clear_cache(self):\n        \"\"\"Clear the cache of this core.\"\"\"\n\n    @abc.abstractmethod\n    def clear_being_calculated(self):\n        \"\"\"Mark all entries in this cache as not being calculated.\"\"\"\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/pickle.py",
      "content": "\"\"\"A pickle-based caching core for cachier.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\nimport os\nimport pickle  # for local caching\nfrom contextlib import suppress\nfrom datetime import datetime\n\nimport portalocker  # to lock on pickle cache IO\nfrom watchdog.events import PatternMatchingEventHandler\nfrom watchdog.observers import Observer\n\nfrom .._types import HashFunc\nfrom ..config import _update_with_defaults\n\n# Alternative:  https://github.com/WoLpH/portalocker\nfrom .base import _BaseCore\n\n\nclass _PickleCore(_BaseCore):\n    \"\"\"The pickle core class for cachier.\"\"\"\n\n    class CacheChangeHandler(PatternMatchingEventHandler):\n        \"\"\"Handles cache-file modification events.\"\"\"\n\n        def __init__(self, filename, core, key):\n            PatternMatchingEventHandler.__init__(\n                self,\n                patterns=[\"*\" + filename],\n                ignore_patterns=None,\n                ignore_directories=True,\n                case_sensitive=False,\n            )\n            self.core = core\n            self.key = key\n            self.observer = None\n            self.value = None\n\n        def inject_observer(self, observer):\n            \"\"\"Inject the observer running this handler.\"\"\"\n            self.observer = observer\n\n        def _check_calculation(self):\n            # print('checking calc')\n            entry = self.core.get_entry_by_key(self.key, True)[1]\n            # print(self.key)\n            # print(entry)\n            try:\n                if not entry[\"being_calculated\"]:\n                    # print('stopping observer!')\n                    self.value = entry[\"value\"]\n                    self.observer.stop()\n                # else:\n                # print('NOT stopping observer... :(')\n            except TypeError:\n                self.value = None\n                self.observer.stop()\n\n        def on_created(self, event):\n            \"\"\"A Watchdog Event Handler method.\"\"\"  # noqa: D401\n            self._check_calculation()  # pragma: no cover\n\n        def on_modified(self, event):\n            \"\"\"A Watchdog Event Handler method.\"\"\"  # noqa: D401\n            self._check_calculation()\n\n    def __init__(\n        self,\n        hash_func: HashFunc,\n        pickle_reload: bool,\n        cache_dir: str,\n        separate_files: bool,\n        wait_for_calc_timeout: int,\n    ):\n        super().__init__(hash_func, wait_for_calc_timeout)\n        self.cache = None\n        self.reload = _update_with_defaults(pickle_reload, \"pickle_reload\")\n        self.cache_dir = os.path.expanduser(\n            _update_with_defaults(cache_dir, \"cache_dir\")\n        )\n        self.separate_files = _update_with_defaults(\n            separate_files, \"separate_files\"\n        )\n\n    @property\n    def cache_fname(self) -> str:\n        fname = f\".{self.func.__module__}.{self.func.__qualname__}\"\n        return fname.replace(\"<\", \"_\").replace(\">\", \"_\")\n\n    @property\n    def cache_fpath(self) -> str:\n        os.makedirs(self.cache_dir, exist_ok=True)\n        return os.path.abspath(\n            os.path.join(os.path.realpath(self.cache_dir), self.cache_fname)\n        )\n\n    def _reload_cache(self):\n        with self.lock:\n            try:\n                with portalocker.Lock(\n                    self.cache_fpath, mode=\"rb\"\n                ) as cache_file:\n                    self.cache = pickle.load(cache_file)  # noqa: S301\n            except (FileNotFoundError, EOFError):\n                self.cache = {}\n\n    def _get_cache(self):\n        with self.lock:\n            if not self.cache:\n                self._reload_cache()\n            return self.cache\n\n    def _get_cache_by_key(self, key=None, hash_str=None):\n        fpath = self.cache_fpath\n        fpath += f\"_{hash_str or key}\"\n        try:\n            with portalocker.Lock(fpath, mode=\"rb\") as cache_file:\n                return pickle.load(cache_file)  # noqa: S301\n        except (FileNotFoundError, EOFError):\n            return None\n\n    def _clear_all_cache_files(self):\n        path, name = os.path.split(self.cache_fpath)\n        for subpath in os.listdir(path):\n            if subpath.startswith(f\"{name}_\"):\n                os.remove(os.path.join(path, subpath))\n\n    def _clear_being_calculated_all_cache_files(self):\n        path, name = os.path.split(self.cache_fpath)\n        for subpath in os.listdir(path):\n            if subpath.startswith(name):\n                entry = self._get_cache_by_key(hash_str=subpath.split(\"_\")[-1])\n                if entry is not None:\n                    entry[\"being_calculated\"] = False\n                    self._save_cache(entry, hash_str=subpath.split(\"_\")[-1])\n\n    def _save_cache(self, cache, key=None, hash_str=None):\n        fpath = self.cache_fpath\n        if key is not None:\n            fpath += f\"_{key}\"\n        elif hash_str is not None:\n            fpath += f\"_{hash_str}\"\n        with self.lock:\n            self.cache = cache\n            with portalocker.Lock(fpath, mode=\"wb\") as cache_file:\n                pickle.dump(cache, cache_file, protocol=4)\n            if key is None:\n                self._reload_cache()\n\n    def get_entry_by_key(self, key, reload=False):\n        with self.lock:\n            if self.separate_files:\n                return key, self._get_cache_by_key(key)\n            if self.reload or reload:\n                self._reload_cache()\n            return key, self._get_cache().get(key, None)\n\n    def set_entry(self, key, func_res):\n        key_data = {\n            \"value\": func_res,\n            \"time\": datetime.now(),\n            \"stale\": False,\n            \"being_calculated\": False,\n        }\n        if self.separate_files:\n            self._save_cache(key_data, key)\n            return  # pragma: no cover\n\n        with self.lock:\n            cache = self._get_cache()\n            cache[key] = key_data\n            self._save_cache(cache)\n\n    def mark_entry_being_calculated_separate_files(self, key):\n        self._save_cache(\n            {\n                \"value\": None,\n                \"time\": datetime.now(),\n                \"stale\": False,\n                \"being_calculated\": True,\n            },\n            key=key,\n        )\n\n    def mark_entry_not_calculated_separate_files(self, key):\n        _, entry = self.get_entry_by_key(key)\n        entry[\"being_calculated\"] = False\n        self._save_cache(entry, key=key)\n\n    def mark_entry_being_calculated(self, key):\n        if self.separate_files:\n            self.mark_entry_being_calculated_separate_files(key)\n            return  # pragma: no cover\n\n        with self.lock:\n            cache = self._get_cache()\n            try:\n                cache[key][\"being_calculated\"] = True\n            except KeyError:\n                cache[key] = {\n                    \"value\": None,\n                    \"time\": datetime.now(),\n                    \"stale\": False,\n                    \"being_calculated\": True,\n                }\n            self._save_cache(cache)\n\n    def mark_entry_not_calculated(self, key):\n        if self.separate_files:\n            self.mark_entry_not_calculated_separate_files(key)\n        with self.lock:\n            cache = self._get_cache()\n            # that's ok, we don't need an entry in that case\n            with suppress(KeyError):\n                cache[key][\"being_calculated\"] = False\n                self._save_cache(cache)\n\n    def wait_on_entry_calc(self, key):\n        if self.separate_files:\n            entry = self._get_cache_by_key(key)\n            filename = f\"{self.cache_fname}_{key}\"\n        else:\n            with self.lock:\n                self._reload_cache()\n                entry = self._get_cache()[key]\n            filename = self.cache_fname\n        if not entry[\"being_calculated\"]:\n            return entry[\"value\"]\n        event_handler = _PickleCore.CacheChangeHandler(\n            filename=filename, core=self, key=key\n        )\n        observer = Observer()\n        event_handler.inject_observer(observer)\n        observer.schedule(event_handler, path=self.cache_dir, recursive=True)\n        observer.start()\n        time_spent = 0\n        while observer.is_alive():\n            observer.join(timeout=1.0)\n            time_spent += 1\n            self.check_calc_timeout(time_spent)\n        return event_handler.value\n\n    def clear_cache(self):\n        if self.separate_files:\n            self._clear_all_cache_files()\n        else:\n            pass\n\n    def clear_being_calculated(self):\n        if self.separate_files:\n            self._clear_being_calculated_all_cache_files()\n            return  # pragma: no cover\n\n        with self.lock:\n            cache = self._get_cache()\n            for key in cache:\n                cache[key][\"being_calculated\"] = False\n            self._save_cache(cache)\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/memory.py",
      "content": "\"\"\"A memory-based caching core for cachier.\"\"\"\n\nimport threading\nfrom datetime import datetime\n\nfrom .._types import HashFunc\nfrom .base import _BaseCore, _get_func_str\n\n\nclass _MemoryCore(_BaseCore):\n    \"\"\"The memory core class for cachier.\"\"\"\n\n    def __init__(self, hash_func: HashFunc, wait_for_calc_timeout: int):\n        super().__init__(hash_func, wait_for_calc_timeout)\n        self.cache = {}\n\n    def _hash_func_key(self, key):\n        return f\"{_get_func_str(self.func)}:{key}\"\n\n    def get_entry_by_key(self, key, reload=False):\n        with self.lock:\n            return key, self.cache.get(self._hash_func_key(key), None)\n\n    def set_entry(self, key, func_res):\n        with self.lock:\n            try:\n                # we need to retain the existing condition so that\n                # mark_entry_not_calculated can notify all possibly-waiting\n                # threads about it\n                cond = self.cache[self._hash_func_key(key)][\"condition\"]\n            except KeyError:  # pragma: no cover\n                cond = None\n            self.cache[self._hash_func_key(key)] = {\n                \"value\": func_res,\n                \"time\": datetime.now(),\n                \"stale\": False,\n                \"being_calculated\": False,\n                \"condition\": cond,\n            }\n\n    def mark_entry_being_calculated(self, key):\n        with self.lock:\n            condition = threading.Condition()\n            # condition.acquire()\n            try:\n                self.cache[self._hash_func_key(key)][\"being_calculated\"] = True\n                self.cache[self._hash_func_key(key)][\"condition\"] = condition\n            except KeyError:\n                self.cache[self._hash_func_key(key)] = {\n                    \"value\": None,\n                    \"time\": datetime.now(),\n                    \"stale\": False,\n                    \"being_calculated\": True,\n                    \"condition\": condition,\n                }\n\n    def mark_entry_not_calculated(self, key):\n        with self.lock:\n            try:\n                entry = self.cache[self._hash_func_key(key)]\n            except KeyError:  # pragma: no cover\n                return  # that's ok, we don't need an entry in that case\n            entry[\"being_calculated\"] = False\n            cond = entry[\"condition\"]\n            if cond:\n                cond.acquire()\n                cond.notify_all()\n                cond.release()\n                entry[\"condition\"] = None\n\n    def wait_on_entry_calc(self, key):\n        with self.lock:  # pragma: no cover\n            entry = self.cache[self._hash_func_key(key)]\n            if not entry[\"being_calculated\"]:\n                return entry[\"value\"]\n        entry[\"condition\"].acquire()\n        entry[\"condition\"].wait()\n        entry[\"condition\"].release()\n        return self.cache[self._hash_func_key(key)][\"value\"]\n\n    def clear_cache(self):\n        with self.lock:\n            self.cache.clear()\n\n    def clear_being_calculated(self):\n        with self.lock:\n            for entry in self.cache.values():\n                entry[\"being_calculated\"] = False\n                entry[\"condition\"] = None\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/mongo.py",
      "content": "\"\"\"A MongoDB-based caching core for cachier.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\nimport pickle  # for serialization of python objects\nimport sys  # to make sure that pymongo was imported\nimport time  # to sleep when waiting on Mongo cache\\\nimport warnings  # to warn if pymongo is missing\nfrom contextlib import suppress\nfrom datetime import datetime\n\nfrom .._types import HashFunc, Mongetter\n\nwith suppress(ImportError):\n    from bson.binary import Binary  # to save binary data to mongodb\n    from pymongo import ASCENDING, IndexModel\n    from pymongo.errors import OperationFailure\n\nfrom .base import RecalculationNeeded, _BaseCore, _get_func_str\n\nMONGO_SLEEP_DURATION_IN_SEC = 1\n\n\nclass MissingMongetter(ValueError):\n    \"\"\"Thrown when the mongetter keyword argument is missing.\"\"\"\n\n\nclass _MongoCore(_BaseCore):\n    _INDEX_NAME = \"func_1_key_1\"\n\n    def __init__(\n        self,\n        hash_func: HashFunc,\n        mongetter: Mongetter,\n        wait_for_calc_timeout: int,\n    ):\n        if \"pymongo\" not in sys.modules:\n            warnings.warn(\n                \"`pymongo` was not found. MongoDB cores will not function.\",\n                ImportWarning,\n                stacklevel=2,\n            )  # pragma: no cover\n\n        super().__init__(hash_func, wait_for_calc_timeout)\n        if mongetter is None:\n            raise MissingMongetter(\n                \"must specify ``mongetter`` when using the mongo core\"\n            )\n        self.mongetter = mongetter\n        self.mongo_collection = self.mongetter()\n        index_inf = self.mongo_collection.index_information()\n        if _MongoCore._INDEX_NAME not in index_inf:\n            func1key1 = IndexModel(\n                keys=[(\"func\", ASCENDING), (\"key\", ASCENDING)],\n                name=_MongoCore._INDEX_NAME,\n            )\n            self.mongo_collection.create_indexes([func1key1])\n\n    @property\n    def _func_str(self) -> str:\n        return _get_func_str(self.func)\n\n    def get_entry_by_key(self, key):\n        res = self.mongo_collection.find_one(\n            {\"func\": self._func_str, \"key\": key}\n        )\n        if not res:\n            return key, None\n        try:\n            entry = {\n                \"value\": pickle.loads(res[\"value\"]),  # noqa: S301\n                \"time\": res.get(\"time\", None),\n                \"stale\": res.get(\"stale\", False),\n                \"being_calculated\": res.get(\"being_calculated\", False),\n            }\n        except KeyError:\n            entry = {\n                \"value\": None,\n                \"time\": res.get(\"time\", None),\n                \"stale\": res.get(\"stale\", False),\n                \"being_calculated\": res.get(\"being_calculated\", False),\n            }\n        return key, entry\n\n    def set_entry(self, key, func_res):\n        thebytes = pickle.dumps(func_res)\n        self.mongo_collection.update_one(\n            filter={\"func\": self._func_str, \"key\": key},\n            update={\n                \"$set\": {\n                    \"func\": self._func_str,\n                    \"key\": key,\n                    \"value\": Binary(thebytes),\n                    \"time\": datetime.now(),\n                    \"stale\": False,\n                    \"being_calculated\": False,\n                }\n            },\n            upsert=True,\n        )\n\n    def mark_entry_being_calculated(self, key):\n        self.mongo_collection.update_one(\n            filter={\"func\": self._func_str, \"key\": key},\n            update={\"$set\": {\"being_calculated\": True}},\n            upsert=True,\n        )\n\n    def mark_entry_not_calculated(self, key):\n        with suppress(OperationFailure):  # don't care in this case\n            self.mongo_collection.update_one(\n                filter={\n                    \"func\": self._func_str,\n                    \"key\": key,\n                },\n                update={\"$set\": {\"being_calculated\": False}},\n                upsert=False,  # should not insert in this case\n            )\n\n    def wait_on_entry_calc(self, key):\n        time_spent = 0\n        while True:\n            time.sleep(MONGO_SLEEP_DURATION_IN_SEC)\n            time_spent += MONGO_SLEEP_DURATION_IN_SEC\n            key, entry = self.get_entry_by_key(key)\n            if entry is None:\n                raise RecalculationNeeded()\n            if not entry[\"being_calculated\"]:\n                return entry[\"value\"]\n            self.check_calc_timeout(time_spent)\n\n    def clear_cache(self):\n        self.mongo_collection.delete_many(filter={\"func\": self._func_str})\n\n    def clear_being_calculated(self):\n        self.mongo_collection.update_many(\n            filter={\n                \"func\": self._func_str,\n                \"being_calculated\": True,\n            },\n            update={\"$set\": {\"being_calculated\": False}},\n        )\n"
    },
    {
      "path": "python-cachier_cachier/tests/__init__.py",
      "content": ""
    },
    {
      "path": "python-cachier_cachier/tests/standalone_script.py",
      "content": "import time\n\nimport cachier\n\n\n@cachier.cachier()\ndef _takes_3_seconds(label, value):\n    time.sleep(3)\n    return f\"{label} {value}\"\n\n\nprint(_takes_3_seconds(\"two\", 2))\n"
    },
    {
      "path": "python-cachier_cachier/tests/speed_eval.py",
      "content": "from random import random\nfrom time import time\n\nfrom cachier import cachier\n\n\n@cachier(next_time=True)\ndef _test_int_pickling(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    return int_1 + int_2\n\n\ndef _test_int_pickling_compare(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    return int_1 + int_2\n\n\ndef test_pickle_speed():\n    \"\"\"Test speeds.\"\"\"\n    print(\"Comparing speeds of decorated vs non-decorated functions...\")\n    num_of_vals = 1000\n    times = []\n    for i in range(1, num_of_vals):\n        tic = time()\n        _test_int_pickling_compare(i, i + 1)\n        toc = time()\n        times.append(toc - tic)\n    print(f\"  - Non-decorated average = {sum(times) / num_of_vals}\")\n\n    _test_int_pickling.clear_cache()\n    times = []\n    for i in range(1, num_of_vals):\n        tic = time()\n        _test_int_pickling(i, i + 1)\n        toc = time()\n        times.append(toc - tic)\n    print(f\"  - Decorated average = {sum(times) / num_of_vals}\")\n\n\n@cachier()\ndef _test_single_file_speed(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    # something that takes some memory\n    return [random() for _ in range(1000000)]\n\n\n@cachier(separate_files=True)\ndef _test_separate_files_speed(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    # something that takes some memory\n    return [random() for _ in range(1000000)]\n\n\ndef test_separate_files_vs_single_file():\n    _test_separate_files_speed.clear_cache()\n    _test_single_file_speed.clear_cache()\n    start_time = time()\n    for _ in range(3):\n        for j in range(10):\n            _test_separate_files_speed(j, 2)\n    print(f\"separate files time: {time() - start_time}\")\n    start_time = time()\n    for _ in range(3):\n        for j in range(10):\n            _test_single_file_speed(j, 2)\n    print(f\"single file time: {time() - start_time}\")\n\n\nif __name__ == \"__main__\":\n    test_pickle_speed()\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_memory_core.py",
      "content": "\"\"\"Test for the in-memory implementation of the  Cachier python package.\"\"\"\n\nimport hashlib\nimport queue\nimport threading\nfrom datetime import timedelta\nfrom random import random\nfrom time import sleep, time\n\nimport pandas as pd\nimport pytest\nfrom cachier import cachier\n\n\n@cachier(backend=\"memory\", next_time=False)\ndef _takes_2_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)\n    return f\"arg_1:{arg_1}, arg_2:{arg_2}\"\n\n\n@pytest.mark.memory\ndef test_memory_core():\n    \"\"\"Basic memory core functionality.\"\"\"\n    _takes_2_seconds.clear_cache()\n    _takes_2_seconds(\"a\", \"b\")\n    start = time()\n    _takes_2_seconds(\"a\", \"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds.clear_cache()\n\n\n@pytest.mark.memory\ndef test_memory_core_keywords():\n    \"\"\"Basic memory core functionality with keyword arguments.\"\"\"\n    _takes_2_seconds.clear_cache()\n    _takes_2_seconds(\"a\", arg_2=\"b\")\n    start = time()\n    _takes_2_seconds(\"a\", arg_2=\"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds.clear_cache()\n\n\nSECONDS_IN_DELTA = 3\nDELTA = timedelta(seconds=SECONDS_IN_DELTA)\n\n\n@cachier(backend=\"memory\", stale_after=DELTA, next_time=False)\ndef _stale_after_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.memory\ndef test_stale_after():\n    \"\"\"Testing the stale_after functionality.\"\"\"\n    _stale_after_seconds.clear_cache()\n    val1 = _stale_after_seconds(1, 2)\n    val2 = _stale_after_seconds(1, 2)\n    val3 = _stale_after_seconds(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(3)\n    val4 = _stale_after_seconds(1, 2)\n    assert val4 != val1\n    _stale_after_seconds.clear_cache()\n\n\n@cachier(backend=\"memory\", stale_after=DELTA, next_time=True)\ndef _stale_after_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.memory\ndef test_stale_after_next_time():\n    \"\"\"Testing the stale_after with next_time functionality.\"\"\"\n    _stale_after_next_time.clear_cache()\n    val1 = _stale_after_next_time(1, 2)\n    val2 = _stale_after_next_time(1, 2)\n    val3 = _stale_after_next_time(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(SECONDS_IN_DELTA + 1)\n    val4 = _stale_after_next_time(1, 2)\n    assert val4 == val1\n    sleep(0.5)\n    val5 = _stale_after_next_time(1, 2)\n    assert val5 != val1\n    _stale_after_next_time.clear_cache()\n\n\n@cachier(backend=\"memory\")\ndef _random_num():\n    return random()\n\n\n@cachier(backend=\"memory\")\ndef _random_num_with_arg(a):\n    # print(a)\n    return random()\n\n\n@pytest.mark.memory\ndef test_overwrite_cache():\n    \"\"\"Tests that the overwrite feature works correctly.\"\"\"\n    _random_num.clear_cache()\n    int1 = _random_num()\n    int2 = _random_num()\n    assert int2 == int1\n    int3 = _random_num(cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num()\n    assert int4 == int3\n    _random_num.clear_cache()\n\n    _random_num_with_arg.clear_cache()\n    int1 = _random_num_with_arg(\"a\")\n    int2 = _random_num_with_arg(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg(\"a\", cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg(\"a\")\n    assert int4 == int3\n    _random_num_with_arg.clear_cache()\n\n\n@pytest.mark.memory\ndef test_ignore_cache():\n    \"\"\"Tests that the ignore_cache feature works correctly.\"\"\"\n    _random_num.clear_cache()\n    int1 = _random_num()\n    int2 = _random_num()\n    assert int2 == int1\n    int3 = _random_num(cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num()\n    assert int4 != int3\n    assert int4 == int1\n    _random_num.clear_cache()\n\n    _random_num_with_arg.clear_cache()\n    int1 = _random_num_with_arg(\"a\")\n    int2 = _random_num_with_arg(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg(\"a\", cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg(\"a\")\n    assert int4 != int3\n    assert int4 == int1\n    _random_num_with_arg.clear_cache()\n\n\n@cachier(backend=\"memory\")\ndef _takes_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)  # this has to be enough time for check_calculation to run twice\n    return random() + arg_1 + arg_2\n\n\ndef _calls_takes_time(res_queue):\n    res = _takes_time(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.memory\ndef test_memory_being_calculated():\n    \"\"\"Testing memory core handling of being calculated scenarios.\"\"\"\n    _takes_time.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=3)\n    thread2.join(timeout=3)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\n@cachier(backend=\"memory\", stale_after=timedelta(seconds=1), next_time=True)\ndef _being_calc_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\ndef _calls_being_calc_next_time(res_queue):\n    res = _being_calc_next_time(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.memory\ndef test_being_calc_next_time():\n    \"\"\"Testing memory core handling of being calculated scenarios.\"\"\"\n    _takes_time.clear_cache()\n    _being_calc_next_time(0.13, 0.02)\n    sleep(1.1)\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=3)\n    thread2.join(timeout=3)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\n@cachier(backend=\"memory\")\ndef _bad_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n@cachier(backend=\"memory\")\ndef _delete_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n@pytest.mark.memory\ndef test_clear_being_calculated():\n    \"\"\"Test memory core clear `being calculated` functionality.\"\"\"\n    _takes_time.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread1.start()\n    _takes_time.clear_being_calculated()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=3)\n    thread2.join(timeout=3)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 != res2\n\n\n@pytest.mark.memory\ndef test_clear_being_calculated_with_empty_cache():\n    \"\"\"Test memory core clear `being calculated` functionality.\"\"\"\n    _takes_time.clear_cache()\n    _takes_time.clear_being_calculated()\n\n\n@cachier(backend=\"memory\", stale_after=timedelta(seconds=1), next_time=True)\ndef _error_throwing_func(arg1):\n    if not hasattr(_error_throwing_func, \"count\"):\n        _error_throwing_func.count = 0\n    _error_throwing_func.count += 1\n    if _error_throwing_func.count > 1:\n        raise ValueError(\"Tiny Rick!\")\n    return 7\n\n\n@pytest.mark.memory\ndef test_error_throwing_func():\n    # with\n    res1 = _error_throwing_func(4)\n    sleep(1.5)\n    res2 = _error_throwing_func(4)\n    assert res1 == res2\n\n\n@pytest.mark.memory\ndef test_callable_hash_param():\n    def _hash_func(args, kwargs):\n        def _hash(obj):\n            if isinstance(obj, pd.core.frame.DataFrame):\n                return hashlib.sha256(\n                    pd.util.hash_pandas_object(obj).values.tobytes()\n                ).hexdigest()\n            return obj\n\n        k_args = tuple(map(_hash, args))\n        k_kwargs = tuple(\n            sorted({k: _hash(v) for k, v in kwargs.items()}.items())\n        )\n        return k_args + k_kwargs\n\n    @cachier(backend=\"memory\", hash_func=_hash_func)\n    def _params_with_dataframe(*args, **kwargs):\n        \"\"\"Some function.\"\"\"\n        return random()\n\n    _params_with_dataframe.clear_cache()\n\n    df_a = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    df_b = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    value_a = _params_with_dataframe(df_a, 1)\n    value_b = _params_with_dataframe(df_b, 1)\n\n    assert value_a == value_b  # same content --> same key\n\n    value_a = _params_with_dataframe(1, df=df_a)\n    value_b = _params_with_dataframe(1, df=df_b)\n\n    assert value_a == value_b  # same content --> same key\n\n\nif __name__ == \"__main__\":\n    test_memory_being_calculated()\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_core_lookup.py",
      "content": "\"\"\"Testing a few basic cachier interfaces.\"\"\"\n\nimport pytest\nfrom cachier import cachier, get_default_params\nfrom cachier.cores.mongo import MissingMongetter\n\n\ndef test_get_default_params():\n    params = get_default_params()\n    assert tuple(sorted(params)) == (\n        \"allow_none\",\n        \"backend\",\n        \"cache_dir\",\n        \"caching_enabled\",\n        \"hash_func\",\n        \"mongetter\",\n        \"next_time\",\n        \"pickle_reload\",\n        \"separate_files\",\n        \"stale_after\",\n        \"wait_for_calc_timeout\",\n    )\n\n\ndef test_bad_name(name=\"nope\"):\n    # Test that the appropriate exception is thrown\n    # when an invalid backend is given.\n    with pytest.raises(ValueError, match=f\"specified an invalid core: {name}\"):\n\n        @cachier(backend=name)\n        def dummy_func():\n            pass\n\n\ndef test_missing_mongetter():\n    # Test that the appropriate exception is thrown\n    # when forgetting to specify the mongetter.\n    with pytest.raises(MissingMongetter):\n\n        @cachier(backend=\"mongo\", mongetter=None)\n        def dummy_func():\n            pass\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_general.py",
      "content": "\"\"\"Non-core-specific tests for cachier.\"\"\"\n\nimport functools\nimport os\nimport queue\nimport subprocess  # nosec: B404\nimport threading\nfrom contextlib import suppress\nfrom random import random\nfrom time import sleep, time\n\nimport cachier\nimport pytest\nfrom cachier.core import (\n    DEFAULT_MAX_WORKERS,\n    MAX_WORKERS_ENVAR_NAME,\n    _get_executor,\n    _max_workers,\n    _set_max_workers,\n)\n\nfrom tests.test_mongo_core import (\n    MONGO_DELTA_LONG,\n    _test_mongetter,\n)\n\n\ndef test_information():\n    print(\"\\ncachier version: \", end=\"\")\n    print(cachier.__version__)\n\n\ndef test_max_workers():\n    \"\"\"Just call this function for coverage.\"\"\"\n    with suppress(KeyError):\n        del os.environ[MAX_WORKERS_ENVAR_NAME]\n    assert _max_workers() == DEFAULT_MAX_WORKERS\n\n\ndef test_get_executor():\n    \"\"\"Just call this function for coverage.\"\"\"\n    _get_executor()\n    _get_executor(False)\n    _get_executor(True)\n\n\ndef test_set_max_workers():\n    \"\"\"Just call this function for coverage.\"\"\"\n    _set_max_workers(9)\n\n\nparametrize_keys = \"mongetter,stale_after,separate_files\"\nparametrize_values = [\n    pytest.param(\n        _test_mongetter, MONGO_DELTA_LONG, False, marks=pytest.mark.mongo\n    ),\n    (None, None, False),\n    (None, None, True),\n]\n\n\n@pytest.mark.parametrize(parametrize_keys, parametrize_values)\ndef test_wait_for_calc_timeout_ok(mongetter, stale_after, separate_files):\n    @cachier.cachier(\n        mongetter=mongetter,\n        stale_after=stale_after,\n        separate_files=separate_files,\n        next_time=False,\n        wait_for_calc_timeout=2,\n    )\n    def _wait_for_calc_timeout_fast(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        sleep(1)\n        return random() + arg_1 + arg_2\n\n    def _calls_wait_for_calc_timeout_fast(res_queue):\n        res = _wait_for_calc_timeout_fast(1, 2)\n        res_queue.put(res)\n\n    \"\"\" Testing calls that avoid timeouts store the values in cache. \"\"\"\n    _wait_for_calc_timeout_fast.clear_cache()\n    val1 = _wait_for_calc_timeout_fast(1, 2)\n    val2 = _wait_for_calc_timeout_fast(1, 2)\n    assert val1 == val2\n\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_fast,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_fast,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n\n    thread1.start()\n    thread2.start()\n    sleep(2)\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2  # Timeout did not kick in, a single call was done\n\n\n@pytest.mark.parametrize(parametrize_keys, parametrize_values)\ndef test_wait_for_calc_timeout_slow(mongetter, stale_after, separate_files):\n    @cachier.cachier(\n        mongetter=mongetter,\n        stale_after=stale_after,\n        separate_files=separate_files,\n        next_time=False,\n        wait_for_calc_timeout=2,\n    )\n    def _wait_for_calc_timeout_slow(arg_1, arg_2):\n        sleep(3)\n        return random() + arg_1 + arg_2\n\n    def _calls_wait_for_calc_timeout_slow(res_queue):\n        res = _wait_for_calc_timeout_slow(1, 2)\n        res_queue.put(res)\n\n    \"\"\"Testing for calls timing out to be performed twice when needed.\"\"\"\n    _wait_for_calc_timeout_slow.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n\n    thread1.start()\n    thread2.start()\n    sleep(1)\n    res3 = _wait_for_calc_timeout_slow(1, 2)\n    sleep(4)\n    thread1.join(timeout=4)\n    thread2.join(timeout=4)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 != res2  # Timeout kicked in.  Two calls were done\n    res4 = _wait_for_calc_timeout_slow(1, 2)\n    # One of the cached values is returned\n    assert res1 == res4 or res2 == res4 or res3 == res4\n\n\n@pytest.mark.parametrize(\n    \"mongetter,backend\",\n    [\n        pytest.param(_test_mongetter, \"mongo\", marks=pytest.mark.mongo),\n        (None, \"memory\"),\n        (None, \"pickle\"),\n    ],\n)\ndef test_precache_value(mongetter, backend):\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def dummy_func(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return arg_1 + arg_2\n\n    assert dummy_func.precache_value(2, 2, value_to_cache=5) == 5\n    assert dummy_func(2, 2) == 5\n    dummy_func.clear_cache()\n    assert dummy_func(2, 2) == 4\n    assert dummy_func.precache_value(2, arg_2=2, value_to_cache=5) == 5\n    assert dummy_func(2, arg_2=2) == 5\n\n\n@pytest.mark.parametrize(\n    \"mongetter,backend\",\n    [\n        pytest.param(_test_mongetter, \"mongo\", marks=pytest.mark.mongo),\n        (None, \"memory\"),\n        (None, \"pickle\"),\n    ],\n)\ndef test_ignore_self_in_methods(mongetter, backend):\n    class DummyClass:\n        @cachier.cachier(backend=backend, mongetter=mongetter)\n        def takes_2_seconds(self, arg_1, arg_2):\n            \"\"\"Some function.\"\"\"\n            sleep(2)\n            return arg_1 + arg_2\n\n    test_object_1 = DummyClass()\n    test_object_2 = DummyClass()\n    test_object_1.takes_2_seconds.clear_cache()\n    test_object_2.takes_2_seconds.clear_cache()\n    assert test_object_1.takes_2_seconds(1, 2) == 3\n    start = time()\n    assert test_object_2.takes_2_seconds(1, 2) == 3\n    end = time()\n    assert end - start < 1\n\n\ndef test_hash_params_deprecation():\n    with pytest.deprecated_call(match=\"hash_params will be removed\"):\n\n        @cachier.cachier(hash_params=lambda a, k: \"key\")\n        def test():\n            return \"value\"\n\n    assert test() == \"value\"\n\n\ndef test_separate_processes():\n    test_args = (\"python\", \"tests/standalone_script.py\")\n    run_params = {\"args\": test_args, \"capture_output\": True, \"text\": True}\n    run_process = functools.partial(subprocess.run, **run_params)\n    result = run_process()\n    assert result.stdout.strip() == \"two 2\"\n    start = time()\n    result = run_process()\n    end = time()\n    assert result.stdout.strip() == \"two 2\"\n    assert end - start < 3\n\n\ndef test_global_disable():\n    @cachier.cachier()\n    def get_random():\n        return random()\n\n    get_random.clear_cache()\n    result_1 = get_random()\n    result_2 = get_random()\n    cachier.disable_caching()\n    result_3 = get_random()\n    cachier.enable_caching()\n    result_4 = get_random()\n    assert result_1 == result_2 == result_4\n    assert result_1 != result_3\n\n\ndef test_none_not_cached_by_default():\n    count = 0\n\n    @cachier.cachier()\n    def do_operation():\n        nonlocal count\n        count += 1\n        return None\n\n    do_operation.clear_cache()\n    assert count == 0\n    do_operation()\n    do_operation()\n    assert count == 2\n\n\ndef test_allow_caching_none():\n    count = 0\n\n    @cachier.cachier(allow_none=True)\n    def do_operation():\n        nonlocal count\n        count += 1\n        return None\n\n    do_operation.clear_cache()\n    assert count == 0\n    do_operation()\n    do_operation()\n    assert count == 1\n\n\ndef test_identical_inputs():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a: int, b: int = 2, c: int = 3):\n        nonlocal count\n        count += 1\n        return a + b + c\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func(1, 2, 3) == 6\n    assert dummy_func(1, 2, c=3) == 6\n    assert dummy_func(1, b=2, c=3) == 6\n    assert dummy_func(a=1, b=2, c=3) == 6\n    assert count == 1\n\n\ndef test_list_inputs():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a: list, b: list = [2]):  # noqa: B006\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func([1]) == [1, 2]\n    assert dummy_func([1], [2]) == [1, 2]\n    assert dummy_func([1], b=[2]) == [1, 2]\n    assert dummy_func(a=[1], b=[2]) == [1, 2]\n    assert count == 1\n\n\ndef test_order_independent_kwargs_handling():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a, b):\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func(a=1, b=2) == 3\n    assert dummy_func(a=1, b=2) == 3\n    assert dummy_func(b=2, a=1) == 3\n    assert count == 1\n\n\n@pytest.mark.parametrize(\"backend\", [\"memory\", \"pickle\"])\ndef test_diff_functions_same_args(tmpdir, backend: str):\n    count_p = count_m = 0\n\n    @cachier.cachier(cache_dir=tmpdir, backend=backend)\n    def fn_plus(a, b=2):\n        nonlocal count_p\n        count_p += 1\n        return a + b\n\n    @cachier.cachier(cache_dir=tmpdir, backend=backend)\n    def fn_minus(a, b=2):\n        nonlocal count_m\n        count_m += 1\n        return a - b\n\n    assert count_p == count_m == 0\n\n    for fn, expected in [(fn_plus, 3), (fn_minus, -1)]:\n        assert fn(1) == expected\n        assert fn(a=1, b=2) == expected\n    assert count_p == 1\n    assert count_m == 1\n\n\n@pytest.mark.parametrize(\"backend\", [\"memory\", \"pickle\"])\ndef test_runtime_handling(tmpdir, backend):\n    count_p = count_m = 0\n\n    def fn_plus(a, b=2):\n        nonlocal count_p\n        count_p += 1\n        return a + b\n\n    def fn_minus(a, b=2):\n        nonlocal count_m\n        count_m += 1\n        return a - b\n\n    cachier_ = cachier.cachier(cache_dir=tmpdir, backend=backend)\n    assert count_p == count_m == 0\n\n    for fn, expected in [(fn_plus, 3), (fn_minus, -1)]:\n        assert cachier_(fn)(1, 2) == expected\n        assert cachier_(fn)(a=1, b=2) == expected\n    assert count_p == 1\n    assert count_m == 1\n\n    for fn, expected in [(fn_plus, 5), (fn_minus, 1)]:\n        assert cachier_(fn)(3, 2) == expected\n        assert cachier_(fn)(a=3, b=2) == expected\n    assert count_p == 2\n    assert count_m == 2\n\n\ndef test_partial_handling(tmpdir):\n    count_p = count_m = 0\n\n    def fn_plus(a, b=2):\n        nonlocal count_p\n        count_p += 1\n        return a + b\n\n    def fn_minus(a, b=2):\n        nonlocal count_m\n        count_m += 1\n        return a - b\n\n    cachier_ = cachier.cachier(cache_dir=tmpdir)\n    assert count_p == count_m == 0\n\n    for fn, expected in [(fn_plus, 3), (fn_minus, -1)]:\n        dummy_ = functools.partial(fn, 1)\n        assert cachier_(dummy_)() == expected\n\n        dummy_ = functools.partial(fn, 1)\n        assert cachier_(dummy_)(2) == expected\n\n        dummy_ = functools.partial(fn, a=1)\n        assert cachier_(dummy_)() == expected\n\n        dummy_ = functools.partial(fn, b=2)\n        assert cachier_(dummy_)(1) == expected\n\n        dummy_ = functools.partial(fn, b=2)\n        assert cachier_(dummy_)(1, b=2) == expected\n\n        assert cachier_(fn)(1, 2) == expected\n        assert cachier_(fn)(a=1, b=2) == expected\n\n    assert count_p == 1\n    assert count_m == 1\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_defaults.py",
      "content": "import datetime\nimport os\nimport queue\nimport random\nimport threading\nimport time\n\nimport cachier\nimport pytest\n\nfrom tests.test_mongo_core import MONGO_DELTA, _test_mongetter\n\n_default_params = cachier.get_default_params().copy()\n\n\ndef setup_function():\n    cachier.set_default_params(**_default_params)\n\n\ndef teardown_function():\n    cachier.set_default_params(**_default_params)\n\n\ndef test_hash_func_default_param():\n    def slow_hash_func(args, kwds):\n        time.sleep(2)\n        return \"hash\"\n\n    def fast_hash_func(args, kwds):\n        return \"hash\"\n\n    cachier.set_default_params(hash_func=slow_hash_func)\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(hash_func=fast_hash_func)\n    def global_test_2():\n        return None\n\n    start = time.time()\n    global_test_1()\n    end = time.time()\n    assert end - start > 1\n    start = time.time()\n    global_test_2()\n    end = time.time()\n    assert end - start < 1\n\n\ndef test_backend_default_param():\n    cachier.set_default_params(backend=\"memory\")\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(backend=\"pickle\")\n    def global_test_2():\n        return None\n\n    assert global_test_1.cache_dpath() is None\n    assert global_test_2.cache_dpath() is not None\n\n\n@pytest.mark.mongo\ndef test_mongetter_default_param():\n    cachier.set_default_params(mongetter=_test_mongetter)\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(mongetter=False)\n    def global_test_2():\n        return None\n\n    assert global_test_1.cache_dpath() is None\n    assert global_test_2.cache_dpath() is not None\n\n\ndef test_cache_dir_default_param(tmpdir):\n    cachier.set_default_params(cache_dir=tmpdir / \"1\")\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(cache_dir=tmpdir / \"2\")\n    def global_test_2():\n        return None\n\n    assert global_test_1.cache_dpath() == str(tmpdir / \"1\")\n    assert global_test_2.cache_dpath() == str(tmpdir / \"2\")\n\n\ndef test_separate_files_default_param(tmpdir):\n    cachier.set_default_params(separate_files=True)\n\n    @cachier.cachier(cache_dir=tmpdir / \"1\")\n    def global_test_1(arg_1, arg_2):\n        return arg_1 + arg_2\n\n    @cachier.cachier(cache_dir=tmpdir / \"2\", separate_files=False)\n    def global_test_2(arg_1, arg_2):\n        return arg_1 + arg_2\n\n    global_test_1(1, 2)\n    global_test_1(3, 4)\n    global_test_2(1, 2)\n    global_test_2(3, 4)\n\n    assert len(os.listdir(global_test_1.cache_dpath())) == 2\n    assert len(os.listdir(global_test_2.cache_dpath())) == 1\n\n\ndef test_allow_none_default_param(tmpdir):\n    cachier.set_default_params(\n        allow_none=True,\n        separate_files=True,\n        verbose_cache=True,\n    )\n    allow_count = disallow_count = 0\n\n    @cachier.cachier(cache_dir=tmpdir)\n    def allow_none():\n        nonlocal allow_count\n        allow_count += 1\n        return None\n\n    @cachier.cachier(cache_dir=tmpdir, allow_none=False)\n    def disallow_none():\n        nonlocal disallow_count\n        disallow_count += 1\n        return None\n\n    assert allow_count == 0\n    allow_none()\n    allow_none()\n    assert allow_count == 1\n\n    assert disallow_count == 0\n    disallow_none()\n    disallow_none()\n    assert disallow_count == 2\n\n    disallow_none(cachier__allow_none=True)\n    disallow_none(cachier__allow_none=True)\n    assert disallow_count == 2\n\n\nPARAMETRIZE_TEST = (\n    \"backend,mongetter\",\n    [\n        pytest.param(\"pickle\", None, marks=pytest.mark.pickle),\n        pytest.param(\"mongo\", _test_mongetter, marks=pytest.mark.mongo),\n    ],\n)\n\n\n@pytest.mark.parametrize(*PARAMETRIZE_TEST)\ndef test_stale_after_applies_dynamically(backend, mongetter):\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def _stale_after_test(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random.random() + arg_1 + arg_2\n\n    cachier.set_default_params(stale_after=MONGO_DELTA)\n\n    _stale_after_test.clear_cache()\n    val1 = _stale_after_test(1, 2)\n    val2 = _stale_after_test(1, 2)\n    assert val1 == val2\n    time.sleep(3)\n    val3 = _stale_after_test(1, 2)\n    assert val3 != val1\n\n\n@pytest.mark.parametrize(*PARAMETRIZE_TEST)\ndef test_next_time_applies_dynamically(backend, mongetter):\n    NEXT_AFTER_DELTA = datetime.timedelta(seconds=3)\n\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def _stale_after_next_time(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random.random()\n\n    cachier.set_default_params(stale_after=NEXT_AFTER_DELTA, next_time=True)\n\n    _stale_after_next_time.clear_cache()\n    val1 = _stale_after_next_time(1, 2)\n    val2 = _stale_after_next_time(1, 2)\n    val3 = _stale_after_next_time(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    time.sleep(NEXT_AFTER_DELTA.seconds + 1)\n    val4 = _stale_after_next_time(1, 2)\n    assert val4 == val1\n    time.sleep(0.5)\n    val5 = _stale_after_next_time(1, 2)\n    assert val5 != val1\n    _stale_after_next_time.clear_cache()\n\n\n@pytest.mark.parametrize(*PARAMETRIZE_TEST)\ndef test_wait_for_calc_applies_dynamically(backend, mongetter):\n    \"\"\"Testing for calls timing out to be performed twice when needed.\"\"\"\n\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def _wait_for_calc_timeout_slow(arg_1, arg_2):\n        time.sleep(3)\n        return random.random() + arg_1 + arg_2\n\n    def _calls_wait_for_calc_timeout_slow(res_queue):\n        res = _wait_for_calc_timeout_slow(1, 2)\n        res_queue.put(res)\n\n    cachier.set_default_params(wait_for_calc_timeout=2)\n    _wait_for_calc_timeout_slow.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n\n    thread1.start()\n    thread2.start()\n    time.sleep(1)\n    res3 = _wait_for_calc_timeout_slow(1, 2)\n    time.sleep(5)\n    thread1.join(timeout=5)\n    thread2.join(timeout=5)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 != res2  # Timeout kicked in.  Two calls were done\n    res4 = _wait_for_calc_timeout_slow(1, 2)\n    # One of the cached values is returned\n    assert res1 == res4 or res2 == res4 or res3 == res4\n\n\ndef test_default_kwargs_handling():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a, b=2):\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func(1) == 3\n    assert dummy_func(a=1) == 3\n    assert dummy_func(a=1, b=2) == 3\n    assert count == 1\n\n\ndef test_deprecated_func_kwargs():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a, b=2):\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    with pytest.deprecated_call(\n        match=\"`verbose_cache` is deprecated and will be removed\"\n    ):\n        assert dummy_func(1, verbose_cache=True) == 3\n    assert count == 1\n    with pytest.deprecated_call(\n        match=\"`ignore_cache` is deprecated and will be removed\"\n    ):\n        assert dummy_func(1, ignore_cache=True) == 3\n    assert count == 2\n    with pytest.deprecated_call(\n        match=\"`overwrite_cache` is deprecated and will be removed\"\n    ):\n        assert dummy_func(1, overwrite_cache=True) == 3\n    assert count == 3\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_pickle_core.py",
      "content": "\"\"\"Test for the Cachier python package.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\n# from os.path import (\n#     realpath,\n#     dirname\n# )\nimport os\nimport pickle\nimport threading\nfrom datetime import timedelta\nfrom random import random\nfrom time import sleep, time\n\nimport pytest\n\ntry:\n    import queue\nexcept ImportError:  # python 2\n    import Queue as queue  # type: ignore\n\nimport hashlib\n\nimport pandas as pd\nfrom cachier import cachier\nfrom cachier.core import _default_params\n\n\ndef _get_decorated_func(func, **kwargs):\n    cachier_decorator = cachier(**kwargs)\n    decorated_func = cachier_decorator(func)\n    return decorated_func\n\n\n# Pickle core tests\n\n\ndef _takes_2_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)\n    return f\"arg_1:{arg_1}, arg_2:{arg_2}\"\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"reload\", [True, False])\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_core(reload, separate_files):\n    \"\"\"Basic Pickle core functionality.\"\"\"\n    _takes_2_seconds_decorated = _get_decorated_func(\n        _takes_2_seconds,\n        next_time=False,\n        pickle_reload=reload,\n        separate_files=separate_files,\n    )\n    _takes_2_seconds_decorated.clear_cache()\n    _takes_2_seconds_decorated(\"a\", \"b\")\n    start = time()\n    _takes_2_seconds_decorated(\"a\", \"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds_decorated.clear_cache()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_core_keywords(separate_files):\n    \"\"\"Basic Pickle core functionality with keyword arguments.\"\"\"\n    _takes_2_seconds_decorated = _get_decorated_func(\n        _takes_2_seconds, next_time=False, separate_files=separate_files\n    )\n    _takes_2_seconds_decorated.clear_cache()\n    _takes_2_seconds_decorated(\"a\", arg_2=\"b\")\n    start = time()\n    _takes_2_seconds_decorated(\"a\", arg_2=\"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds_decorated.clear_cache()\n\n\nSECONDS_IN_DELTA = 3\nDELTA = timedelta(seconds=SECONDS_IN_DELTA)\n\n\ndef _stale_after_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_stale_after(separate_files):\n    \"\"\"Testing the stale_after functionality.\"\"\"\n    _stale_after_seconds_decorated = _get_decorated_func(\n        _stale_after_seconds,\n        stale_after=DELTA,\n        next_time=False,\n        separate_files=separate_files,\n    )\n    _stale_after_seconds_decorated.clear_cache()\n    val1 = _stale_after_seconds_decorated(1, 2)\n    val2 = _stale_after_seconds_decorated(1, 2)\n    val3 = _stale_after_seconds_decorated(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(3)\n    val4 = _stale_after_seconds_decorated(1, 2)\n    assert val4 != val1\n    _stale_after_seconds_decorated.clear_cache()\n\n\ndef _stale_after_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_stale_after_next_time(separate_files):\n    \"\"\"Testing the stale_after with next_time functionality.\"\"\"\n    _stale_after_next_time_decorated = _get_decorated_func(\n        _stale_after_next_time,\n        stale_after=DELTA,\n        next_time=True,\n        separate_files=separate_files,\n    )\n    _stale_after_next_time_decorated.clear_cache()\n    val1 = _stale_after_next_time_decorated(1, 2)\n    val2 = _stale_after_next_time_decorated(1, 2)\n    val3 = _stale_after_next_time_decorated(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(SECONDS_IN_DELTA + 1)\n    val4 = _stale_after_next_time_decorated(1, 2)\n    assert val4 == val1\n    sleep(0.5)\n    val5 = _stale_after_next_time_decorated(1, 2)\n    assert val5 != val1\n    _stale_after_next_time_decorated.clear_cache()\n\n\ndef _random_num():\n    return random()\n\n\ndef _random_num_with_arg(a):\n    # print(a)\n    return random()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_overwrite_cache(separate_files):\n    \"\"\"Tests that the overwrite feature works correctly.\"\"\"\n    _random_num_decorated = _get_decorated_func(\n        _random_num, separate_files=separate_files\n    )\n    _random_num_with_arg_decorated = _get_decorated_func(\n        _random_num_with_arg, separate_files=separate_files\n    )\n    _random_num_decorated.clear_cache()\n    int1 = _random_num_decorated()\n    int2 = _random_num_decorated()\n    assert int2 == int1\n    int3 = _random_num_decorated(cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num_decorated()\n    assert int4 == int3\n    _random_num_decorated.clear_cache()\n\n    _random_num_with_arg_decorated.clear_cache()\n    int1 = _random_num_with_arg_decorated(\"a\")\n    int2 = _random_num_with_arg_decorated(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg_decorated(\"a\", cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg_decorated(\"a\")\n    assert int4 == int3\n    _random_num_with_arg_decorated.clear_cache()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_ignore_cache(separate_files):\n    \"\"\"Tests that the ignore_cache feature works correctly.\"\"\"\n    _random_num_decorated = _get_decorated_func(\n        _random_num, separate_files=separate_files\n    )\n    _random_num_with_arg_decorated = _get_decorated_func(\n        _random_num_with_arg, separate_files=separate_files\n    )\n    _random_num_decorated.clear_cache()\n    int1 = _random_num_decorated()\n    int2 = _random_num_decorated()\n    assert int2 == int1\n    int3 = _random_num_decorated(cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num_decorated()\n    assert int4 != int3\n    assert int4 == int1\n    _random_num_decorated.clear_cache()\n\n    _random_num_with_arg_decorated.clear_cache()\n    int1 = _random_num_with_arg_decorated(\"a\")\n    int2 = _random_num_with_arg_decorated(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg_decorated(\"a\", cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg_decorated(\"a\")\n    assert int4 != int3\n    assert int4 == int1\n    _random_num_with_arg_decorated.clear_cache()\n\n\ndef _takes_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)  # this has to be enough time for check_calculation to run twice\n    return random() + arg_1 + arg_2\n\n\ndef _calls_takes_time(takes_time_func, res_queue):\n    res = takes_time_func(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_being_calculated(separate_files):\n    \"\"\"Testing pickle core handling of being calculated scenarios.\"\"\"\n    _takes_time_decorated = _get_decorated_func(\n        _takes_time, separate_files=separate_files\n    )\n    _takes_time_decorated.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time,\n        kwargs={\n            \"takes_time_func\": _takes_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time,\n        kwargs={\n            \"takes_time_func\": _takes_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=4)\n    thread2.join(timeout=4)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\ndef _being_calc_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\ndef _calls_being_calc_next_time(being_calc_func, res_queue):\n    res = being_calc_func(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_being_calc_next_time(separate_files):\n    \"\"\"Testing pickle core handling of being calculated scenarios.\"\"\"\n    _being_calc_next_time_decorated = _get_decorated_func(\n        _being_calc_next_time,\n        stale_after=timedelta(seconds=1),\n        next_time=True,\n        separate_files=separate_files,\n    )\n    _being_calc_next_time_decorated.clear_cache()\n    _being_calc_next_time(0.13, 0.02)\n    sleep(1.1)\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\n            \"being_calc_func\": _being_calc_next_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\n            \"being_calc_func\": _being_calc_next_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\ndef _bad_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n# _BAD_CACHE_FNAME = '.__main__._bad_cache'\n_BAD_CACHE_FNAME = \".tests.test_pickle_core._bad_cache\"\n_BAD_CACHE_FNAME_SEPARATE_FILES = (\n    \".tests.test_pickle_core._bad_cache_\"\n    f\"{hashlib.sha256(pickle.dumps((0.13, 0.02))).hexdigest()}\"\n)\nEXPANDED_CACHIER_DIR = os.path.expanduser(_default_params[\"cache_dir\"])\n_BAD_CACHE_FPATH = os.path.join(EXPANDED_CACHIER_DIR, _BAD_CACHE_FNAME)\n_BAD_CACHE_FPATH_SEPARATE_FILES = os.path.join(\n    EXPANDED_CACHIER_DIR, _BAD_CACHE_FNAME_SEPARATE_FILES\n)\n_BAD_CACHE_FPATHS = {\n    True: _BAD_CACHE_FPATH_SEPARATE_FILES,\n    False: _BAD_CACHE_FPATH,\n}\n\n\ndef _calls_bad_cache(bad_cache_func, res_queue, trash_cache, separate_files):\n    try:\n        res = bad_cache_func(0.13, 0.02, cachier__verbose=True)\n        if trash_cache:\n            with open(_BAD_CACHE_FPATHS[separate_files], \"w\") as cache_file:\n                cache_file.seek(0)\n                cache_file.truncate()\n        res_queue.put(res)\n    except Exception as exc:\n        res_queue.put(exc)\n\n\ndef _helper_bad_cache_file(sleeptime, separate_files):\n    \"\"\"Test pickle core handling of bad cache files.\"\"\"\n    _bad_cache_decorated = _get_decorated_func(\n        _bad_cache, separate_files=separate_files\n    )\n    _bad_cache_decorated.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_bad_cache,\n        kwargs={\n            \"bad_cache_func\": _bad_cache_decorated,\n            \"res_queue\": res_queue,\n            \"trash_cache\": True,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_bad_cache,\n        kwargs={\n            \"bad_cache_func\": _bad_cache_decorated,\n            \"res_queue\": res_queue,\n            \"trash_cache\": False,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(sleeptime)\n    thread2.start()\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    if res_queue.qsize() != 2:\n        return False\n    res1 = res_queue.get()\n    if not isinstance(res1, float):\n        return False\n    res2 = res_queue.get()\n    if res2 is not None or isinstance(res2, KeyError):\n        return False\n    return True\n\n\n# we want this to succeed at least once\n@pytest.mark.pickle\n@pytest.mark.xfail\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_bad_cache_file(separate_files):\n    \"\"\"Test pickle core handling of bad cache files.\"\"\"\n    sleeptimes = [0.1, 0.2, 0.3, 0.5, 0.6, 0.7, 0.8, 1, 1.5, 2]\n    sleeptimes = sleeptimes + sleeptimes\n    for sleeptime in sleeptimes:\n        if _helper_bad_cache_file(sleeptime, separate_files):\n            return\n    raise AssertionError()\n\n\ndef _delete_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n# _DEL_CACHE_FNAME = '.__main__._delete_cache'\n_DEL_CACHE_FNAME = \".tests.test_pickle_core._delete_cache\"\n_DEL_CACHE_FNAME_SEPARATE_FILES = (\n    \".tests.test_pickle_core._delete_cache_\"\n    f\"{hashlib.sha256(pickle.dumps((0.13, 0.02))).hexdigest()}\"\n)\n_DEL_CACHE_FPATH = os.path.join(EXPANDED_CACHIER_DIR, _DEL_CACHE_FNAME)\n_DEL_CACHE_FPATH_SEPARATE_FILES = os.path.join(\n    EXPANDED_CACHIER_DIR, _DEL_CACHE_FNAME_SEPARATE_FILES\n)\n_DEL_CACHE_FPATHS = {\n    True: _DEL_CACHE_FPATH_SEPARATE_FILES,\n    False: _DEL_CACHE_FPATH,\n}\n\n\ndef _calls_delete_cache(del_cache_func, res_queue, del_cache, separate_files):\n    try:\n        # print('in')\n        res = del_cache_func(0.13, 0.02)\n        # print('out with {}'.format(res))\n        if del_cache:\n            os.remove(_DEL_CACHE_FPATHS[separate_files])\n            # print(os.path.isfile(_DEL_CACHE_FPATH))\n        res_queue.put(res)\n    except Exception as exc:\n        # print('found')\n        res_queue.put(exc)\n\n\ndef _helper_delete_cache_file(sleeptime, separate_files):\n    \"\"\"Test pickle core handling of missing cache files.\"\"\"\n    _delete_cache_decorated = _get_decorated_func(\n        _delete_cache, separate_files=separate_files\n    )\n    _delete_cache_decorated.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_delete_cache,\n        kwargs={\n            \"del_cache_func\": _delete_cache_decorated,\n            \"res_queue\": res_queue,\n            \"del_cache\": True,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_delete_cache,\n        kwargs={\n            \"del_cache_func\": _delete_cache_decorated,\n            \"res_queue\": res_queue,\n            \"del_cache\": False,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(sleeptime)\n    thread2.start()\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    if res_queue.qsize() != 2:\n        return False\n    res1 = res_queue.get()\n    # print(res1)\n    if not isinstance(res1, float):\n        return False\n    res2 = res_queue.get()\n    if not ((isinstance(res2, KeyError)) or (res2 is None)):\n        return False\n    return True\n    # print(res2)\n    # print(type(res2))\n\n\n@pytest.mark.pickle\n@pytest.mark.xfail\n@pytest.mark.parametrize(\"separate_files\", [False, True])\ndef test_delete_cache_file(separate_files):\n    \"\"\"Test pickle core handling of missing cache files.\"\"\"\n    sleeptimes = [0.1, 0.2, 0.3, 0.5, 0.7, 1]\n    sleeptimes = sleeptimes * 4\n    for sleeptime in sleeptimes:\n        if _helper_delete_cache_file(sleeptime, separate_files):\n            return\n    raise AssertionError()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [False, True])\ndef test_clear_being_calculated(separate_files):\n    \"\"\"Test pickle core clear `being calculated` functionality.\"\"\"\n    _takes_time_decorated = _get_decorated_func(\n        _takes_time, separate_files=separate_files\n    )\n    _takes_time_decorated.clear_being_calculated()\n\n\ndef _error_throwing_func(arg1):\n    if not hasattr(_error_throwing_func, \"count\"):\n        _error_throwing_func.count = 0\n    _error_throwing_func.count += 1\n    if _error_throwing_func.count > 1:\n        raise ValueError(\"Tiny Rick!\")\n    return 7\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_error_throwing_func(separate_files):\n    # with\n    _error_throwing_func.count = 0\n    _error_throwing_func_decorated = _get_decorated_func(\n        _error_throwing_func,\n        stale_after=timedelta(seconds=1),\n        next_time=True,\n        separate_files=separate_files,\n    )\n    _error_throwing_func_decorated.clear_cache()\n    res1 = _error_throwing_func_decorated(4)\n    sleep(1.5)\n    res2 = _error_throwing_func_decorated(4)\n    assert res1 == res2\n\n\n# test custom cache dir for pickle core\n\nCUSTOM_DIR = \"~/.exparrot\"\nEXPANDED_CUSTOM_DIR = os.path.expanduser(CUSTOM_DIR)\n\n\ndef _takes_2_seconds_custom_dir(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)\n    return f\"arg_1:{arg_1}, arg_2:{arg_2}\"\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_core_custom_cache_dir(separate_files):\n    \"\"\"Basic Pickle core functionality.\"\"\"\n    _takes_2_seconds_custom_dir_decorated = _get_decorated_func(\n        _takes_2_seconds_custom_dir,\n        next_time=False,\n        cache_dir=CUSTOM_DIR,\n        separate_files=separate_files,\n    )\n    _takes_2_seconds_custom_dir_decorated.clear_cache()\n    _takes_2_seconds_custom_dir_decorated(\"a\", \"b\")\n    start = time()\n    _takes_2_seconds_custom_dir_decorated(\"a\", \"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds_custom_dir_decorated.clear_cache()\n    path2test = _takes_2_seconds_custom_dir_decorated.cache_dpath()\n    assert path2test == EXPANDED_CUSTOM_DIR\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_callable_hash_param(separate_files):\n    def _hash_func(args, kwargs):\n        def _hash(obj):\n            if isinstance(obj, pd.core.frame.DataFrame):\n                return hashlib.sha256(\n                    pd.util.hash_pandas_object(obj).values.tobytes()\n                ).hexdigest()\n            return obj\n\n        k_args = tuple(map(_hash, args))\n        k_kwargs = tuple(\n            sorted({k: _hash(v) for k, v in kwargs.items()}.items())\n        )\n        return k_args + k_kwargs\n\n    @cachier(hash_func=_hash_func, separate_files=separate_files)\n    def _params_with_dataframe(*args, **kwargs):\n        \"\"\"Some function.\"\"\"\n        return random()\n\n    _params_with_dataframe.clear_cache()\n\n    df_a = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    df_b = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    value_a = _params_with_dataframe(df_a, 1)\n    value_b = _params_with_dataframe(df_b, 1)\n\n    assert value_a == value_b  # same content --> same key\n\n    value_a = _params_with_dataframe(1, df=df_a)\n    value_b = _params_with_dataframe(1, df=df_b)\n\n    assert value_a == value_b  # same content --> same key\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_mongo_core.py",
      "content": "\"\"\"Testing the MongoDB core of cachier.\"\"\"\n\nimport datetime\nimport hashlib\nimport platform\nimport queue\nimport sys\nimport threading\nfrom datetime import timedelta\nfrom random import random\nfrom time import sleep\nfrom urllib.parse import quote_plus\n\nimport pandas as pd\nimport pymongo\nimport pytest\nfrom birch import Birch  # type: ignore[import-not-found]\nfrom cachier import cachier\nfrom cachier.cores.base import RecalculationNeeded\nfrom cachier.cores.mongo import _MongoCore\nfrom pymongo.errors import OperationFailure\nfrom pymongo.mongo_client import MongoClient\nfrom pymongo_inmemory import MongoClient as InMemoryMongoClient\n\n# === Enables testing vs a real MongoDB instance ===\n\n\nclass CfgKey:\n    HOST = \"TEST_HOST\"\n    PORT = \"TEST_PORT\"\n    # UNAME = \"TEST_USERNAME\"\n    # PWD = \"TEST_PASSWORD\"\n    # DB = \"TEST_DB\"\n    TEST_VS_DOCKERIZED_MONGO = \"TEST_VS_DOCKERIZED_MONGO\"\n\n\nCFG = Birch(\n    namespace=\"cachier\",\n    defaults={CfgKey.TEST_VS_DOCKERIZED_MONGO: False},\n)\n\n\n# URI_TEMPLATE = \"mongodb://myUser:myPassword@localhost:27017/\"\nURI_TEMPLATE = \"mongodb://{host}:{port}?retrywrites=true&w=majority\"\n\n\ndef _get_cachier_db_mongo_client():\n    host = quote_plus(CFG[CfgKey.HOST])\n    port = quote_plus(CFG[CfgKey.PORT])\n    # uname = quote_plus(CFG[CfgKey.UNAME])\n    # pwd = quote_plus(CFG[CfgKey.PWD])\n    # db = quote_plus(CFG[CfgKey.DB])\n    uri = f\"mongodb://{host}:{port}?retrywrites=true&w=majority\"\n    return MongoClient(uri)\n\n\n_COLLECTION_NAME = (\n    f\"cachier_test_{platform.system()}\"\n    f\"_{'.'.join(map(str, sys.version_info[:3]))}\"\n)\n\n\ndef _test_mongetter():\n    if not hasattr(_test_mongetter, \"client\"):\n        if str(CFG.mget(CfgKey.TEST_VS_DOCKERIZED_MONGO)).lower() == \"true\":\n            print(\"Using live MongoDB instance for testing.\")\n            _test_mongetter.client = _get_cachier_db_mongo_client()\n        else:\n            print(\"Using in-memory MongoDB instance for testing.\")\n            _test_mongetter.client = InMemoryMongoClient()\n    db_obj = _test_mongetter.client[\"cachier_test\"]\n    if _COLLECTION_NAME not in db_obj.list_collection_names():\n        db_obj.create_collection(_COLLECTION_NAME)\n    return db_obj[_COLLECTION_NAME]\n\n\n# === Mongo core tests ===\n\n\n@pytest.mark.mongo\ndef test_information():\n    print(\"\\npymongo version: \", end=\"\")\n    print(pymongo.__version__)\n\n\n@pytest.mark.mongo\ndef test_mongo_index_creation():\n    \"\"\"Basic Mongo core functionality.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _test_mongo_caching(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    collection = _test_mongetter()\n    _test_mongo_caching.clear_cache()\n    val1 = _test_mongo_caching(1, 2)\n    val2 = _test_mongo_caching(1, 2)\n    assert val1 == val2\n    assert _MongoCore._INDEX_NAME in collection.index_information()\n\n\n@pytest.mark.mongo\ndef test_mongo_core():\n    \"\"\"Basic Mongo core functionality.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _test_mongo_caching(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _test_mongo_caching.clear_cache()\n    val1 = _test_mongo_caching(1, 2)\n    val2 = _test_mongo_caching(1, 2)\n    assert val1 == val2\n    val3 = _test_mongo_caching(1, 2, cachier__skip_cache=True)\n    assert val3 != val1\n    val4 = _test_mongo_caching(1, 2)\n    assert val4 == val1\n    val5 = _test_mongo_caching(1, 2, cachier__overwrite_cache=True)\n    assert val5 != val1\n    val6 = _test_mongo_caching(1, 2)\n    assert val6 == val5\n\n\n@pytest.mark.mongo\ndef test_mongo_core_keywords():\n    \"\"\"Basic Mongo core functionality with keyword arguments.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _test_mongo_caching(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _test_mongo_caching.clear_cache()\n    val1 = _test_mongo_caching(1, arg_2=2)\n    val2 = _test_mongo_caching(1, arg_2=2)\n    assert val1 == val2\n    val3 = _test_mongo_caching(1, arg_2=2, cachier__skip_cache=True)\n    assert val3 != val1\n    val4 = _test_mongo_caching(1, arg_2=2)\n    assert val4 == val1\n    val5 = _test_mongo_caching(1, arg_2=2, cachier__overwrite_cache=True)\n    assert val5 != val1\n    val6 = _test_mongo_caching(1, arg_2=2)\n    assert val6 == val5\n\n\nMONGO_DELTA = timedelta(seconds=3)\nMONGO_DELTA_LONG = timedelta(seconds=10)\n\n\n@pytest.mark.mongo\ndef test_mongo_stale_after():\n    \"\"\"Testing MongoDB core stale_after functionality.\"\"\"\n\n    @cachier(\n        mongetter=_test_mongetter, stale_after=MONGO_DELTA, next_time=False\n    )\n    def _stale_after_mongo(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _stale_after_mongo.clear_cache()\n    val1 = _stale_after_mongo(1, 2)\n    val2 = _stale_after_mongo(1, 2)\n    assert val1 == val2\n    sleep(3)\n    val3 = _stale_after_mongo(1, 2)\n    assert val3 != val1\n\n\ndef _calls_takes_time(res_queue):\n    @cachier(mongetter=_test_mongetter)\n    def _takes_time(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        sleep(3)\n        return random() + arg_1 + arg_2\n\n    res = _takes_time(34, 82.3)\n    res_queue.put(res)\n\n\n@pytest.mark.mongo\ndef test_mongo_being_calculated():\n    \"\"\"Testing MongoDB core handling of being calculated scenarios.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _takes_time(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        sleep(3)\n        return random() + arg_1 + arg_2\n\n    _takes_time.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread1.start()\n    sleep(1)\n    thread2.start()\n    thread1.join(timeout=4)\n    thread2.join(timeout=4)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\nclass _BadMongoCollection:\n    def __init__(self, mongetter):\n        self.collection = mongetter()\n        self.index_information = self.collection.index_information\n        self.create_indexes = self.collection.create_indexes\n        self.find_one = self.collection.find_one\n\n    def delete_many(self, *args, **kwargs):\n        pass\n\n    def update_many(self, *args, **kwargs):\n        pass\n\n    def update_one(self, *args, **kwargs):\n        raise OperationFailure(Exception())\n\n\ndef _bad_mongetter():\n    return _BadMongoCollection(_test_mongetter)\n\n\n@pytest.mark.mongo\ndef test_mongo_write_failure():\n    \"\"\"Testing MongoDB core handling of writing failure scenarios.\"\"\"\n\n    @cachier(mongetter=_bad_mongetter)\n    def _func_w_bad_mongo(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    with pytest.raises(OperationFailure):\n        val1 = _func_w_bad_mongo(1, 2)\n        val2 = _func_w_bad_mongo(1, 2)\n        assert val1 == val2\n\n\n@pytest.mark.mongo\ndef test_mongo_clear_being_calculated():\n    \"\"\"Testing MongoDB core clear_being_calculated.\"\"\"\n\n    @cachier(mongetter=_bad_mongetter)\n    def _func_w_bad_mongo(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _func_w_bad_mongo.clear_being_calculated()\n\n\n@pytest.mark.mongo\ndef test_stalled_mongo_db_cache():\n    @cachier(mongetter=_test_mongetter)\n    def _stalled_func():\n        return 1\n\n    core = _MongoCore(None, _test_mongetter, 0)\n    core.set_func(_stalled_func)\n    core.clear_cache()\n    with pytest.raises(RecalculationNeeded):\n        core.wait_on_entry_calc(key=None)\n\n\n@pytest.mark.mongo\ndef test_stalled_mong_db_core(monkeypatch):\n    def mock_get_entry(self, args, kwargs):\n        return \"key\", {\"being_calculated\": True}\n\n    def mock_get_entry_by_key(self, key):\n        return \"key\", None\n\n    monkeypatch.setattr(\n        \"cachier.cores.mongo._MongoCore.get_entry\", mock_get_entry\n    )\n    monkeypatch.setattr(\n        \"cachier.cores.mongo._MongoCore.get_entry_by_key\",\n        mock_get_entry_by_key,\n    )\n\n    @cachier(mongetter=_test_mongetter)\n    def _stalled_func():\n        return 1\n\n    res = _stalled_func()\n    assert res == 1\n\n    def mock_get_entry_2(self, args, kwargs):\n        entry = {\n            \"being_calculated\": True,\n            \"value\": 1,\n            \"time\": datetime.datetime.now() - datetime.timedelta(seconds=10),\n        }\n        return \"key\", entry\n\n    monkeypatch.setattr(\n        \"cachier.cores.mongo._MongoCore.get_entry\", mock_get_entry_2\n    )\n\n    stale_after = datetime.timedelta(seconds=1)\n\n    @cachier(mongetter=_test_mongetter, stale_after=stale_after)\n    def _stalled_func_2():\n        \"\"\"Testing stalled function.\"\"\"\n        return 2\n\n    res = _stalled_func_2()\n    assert res == 2\n\n    @cachier(\n        mongetter=_test_mongetter, stale_after=stale_after, next_time=True\n    )\n    def _stalled_func_3():\n        \"\"\"Testing stalled function.\"\"\"\n        return 3\n\n    res = _stalled_func_3()\n    assert res == 1\n\n\n@pytest.mark.mongo\ndef test_callable_hash_param():\n    def _hash_func(args, kwargs):\n        def _hash(obj):\n            if isinstance(obj, pd.core.frame.DataFrame):\n                return hashlib.sha256(\n                    pd.util.hash_pandas_object(obj).values.tobytes()\n                ).hexdigest()\n            return obj\n\n        k_args = tuple(map(_hash, args))\n        k_kwargs = tuple(\n            sorted({k: _hash(v) for k, v in kwargs.items()}.items())\n        )\n        return k_args + k_kwargs\n\n    @cachier(mongetter=_test_mongetter, hash_func=_hash_func)\n    def _params_with_dataframe(*args, **kwargs):\n        \"\"\"Some function.\"\"\"\n        return random()\n\n    _params_with_dataframe.clear_cache()\n\n    df_a = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    df_b = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    value_a = _params_with_dataframe(df_a, 1)\n    value_b = _params_with_dataframe(df_b, 1)\n\n    assert value_a == value_b  # same content --> same key\n\n    value_a = _params_with_dataframe(1, df=df_a)\n    value_b = _params_with_dataframe(1, df=df_b)\n\n    assert value_a == value_b  # same content --> same key\n"
    },
    {
      "path": "python-cachier_cachier/tests/requirements.txt",
      "content": "# todo: add some version range or pinning latest versions\n# tests and coverages\npytest\ncoverage\npytest-cov\nbirch\n# to connect to the test mongodb server\npymongo\ndnspython\npymongo-inmemory\n# to test pandas dataframe as-param hashing with mongodb core\npandas\n# to be able to run `python setup.py checkdocs`\ncollective.checkdocs\npygments\n"
    }
  ],
  "Patch": "--- a/python-cachier_cachier/src/cachier/cores/pickle.py\n+++ b/python-cachier_cachier/src/cachier/cores/pickle.py\n@@ -249,7 +249,7 @@\n         if self.separate_files:\n             self._clear_all_cache_files()\n         else:\n-            pass\n+            self._save_cache({})\n \n     def clear_being_calculated(self):\n         if self.separate_files:\n",
  "BuggyCodeLocation": [
    {
      "file": "python-cachier_cachier/src/cachier/cores/pickle.py",
      "function": null,
      "content_all": {
        "252": "            pass\n"
      },
      "content_change": {
        "252": "            pass\n"
      }
    }
  ],
  "Source": "Human",
  "Command": [
    "pytest tests"
  ],
  "Token": 959,
  "FilteredCode": [
    {
      "path": "python-cachier_cachier/src/cachier/cores/pickle.py",
      "content": "1 \"\"\"A pickle-based caching core for cachier.\"\"\"\n2 \n3 # This file is part of Cachier.\n4 # https://github.com/python-cachier/cachier\n5 \n6 # Licensed under the MIT license:\n7 # http://www.opensource.org/licenses/MIT-license\n8 # Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n9 import os\n10 import pickle  # for local caching\n11 from contextlib import suppress\n12 from datetime import datetime\n13 \n14 import portalocker  # to lock on pickle cache IO\n15 from watchdog.events import PatternMatchingEventHandler\n16 from watchdog.observers import Observer\n17 \n18 from .._types import HashFunc\n19 from ..config import _update_with_defaults\n20 \n21 # Alternative:  https://github.com/WoLpH/portalocker\n22 from .base import _BaseCore\n23 \n24 \n25 class _PickleCore(_BaseCore):\n26     \"\"\"The pickle core class for cachier.\"\"\"\n27 \n28     class CacheChangeHandler(PatternMatchingEventHandler):\n29         \"\"\"Handles cache-file modification events.\"\"\"\n30 \n31         def __init__(self, filename, core, key):\n32             PatternMatchingEventHandler.__init__(\n33                 self,\n34                 patterns=[\"*\" + filename],\n35                 ignore_patterns=None,\n36                 ignore_directories=True,\n37                 case_sensitive=False,\n38             )\n39             self.core = core\n40             self.key = key\n41             self.observer = None\n42             self.value = None\n43 \n44         def inject_observer(self, observer):\n45             \"\"\"Inject the observer running this handler.\"\"\"\n46             self.observer = observer\n47 \n48         def _check_calculation(self):\n49             # print(...truncated)"
    },
    {
      "path": "python-cachier_cachier/README.rst",
      "content": "1 Cachier\n2 #######\n3 \n4 |PyPI-Status| |Downloads| |PyPI-Versions| |Build-Status| |Codecov| |Codefactor| |LICENCE|\n5 \n6 Persistent, stale-free, local and cross-machine caching for Python functions.\n7 \n8 .. code-block:: python\n9 \n10   from cachier import cachier\n11   import datetime\n12 \n13   @cachier(stale_after=datetime.timedelta(days=3))\n14   def foo(arg1, arg2):\n15     \"\"\"foo now has a persistent cache, triggering recalculation for values stored more than 3 days.\"\"\"\n16     return {'arg1': arg1, 'arg2': arg2}\n17 \n18 \n19 .. role:: python(code)\n20   :language: python\n21 \n22 .. contents::\n23 \n24 .. section-numbering:\n25 \n26 \n27 \n28 Installation\n29 ============\n30 \n31 Install ``cachier`` with:\n32 \n33 .. code-block:: python\n34 \n35     pip install cachier\n36 \n37 For the latest version supporting Python 2.7 please use:\n38 \n39 .. code-block:: python\n40 \n41     pip install 'cachier==1.2.8'\n42 \n43 Features\n44 ========\n45 \n46 * Pure Python.\n47 * Compatible with Python 3.8+ (Python 2.7 was discontinued in version 1.2.8).\n48 * Supported and `tested on Linux, OS X and Windows <https://travis-ci.org/shaypal5/cachier>`_.\n49 * A simple interface.\n50 * Defining \"shelf life\" for cached values.\n51 * Local caching using pickle fil(...truncated)"
    },
    {
      "path": "python-cachier_cachier/src/cachier/core.py",
      "content": "1 \"\"\"Persistent, stale-free memoization decorators for Python.\"\"\"\n2 \n3 # This file is part of Cachier.\n4 # https://github.com/python-cachier/cachier\n5 \n6 # Licensed under the MIT license:\n7 # http://www.opensource.org/licenses/MIT-license\n8 # Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n9 \n10 import datetime\n11 import inspect\n12 import os\n13 import warnings\n14 from collections import OrderedDict\n15 from concurrent.futures import ThreadPoolExecutor\n16 from functools import wraps\n17 from typing import Optional, Union\n18 from warnings import warn\n19 \n20 from .config import (\n21     Backend,\n22     HashFunc,\n23     Mongetter,\n24     _default_params,\n25     _update_with_defaults,\n26 )\n27 from .cores.base import RecalculationNeeded, _BaseCore\n28 from .cores.memory import _MemoryCo(...truncated)"
    }
  ],
  "TokenAll": 34852,
  "FilteredLevel": 1500,
  "Results": {
    "model": "GPT-4o",
    "Difficulty": "Simple",
    "issue_origin": {
      "title": "Insecure pickle usage",
      "description": "The code utilizes the pickle module for caching (line 10 in src/cachier/cores/pickle.py). Pickle can execute arbitrary code if it deserializes untrusted data, which is a security risk. Consider using a safer serialization method, such as JSON, or ensure that the data being deserialized is from a trusted source.",
      "explanation": "### Issue Summary\n\nThe issue is related to the use of the `pickle` module for caching within the `python-cachier` project. Specifically, the use of `pickle` is identified on line 10 of `src/cachier/cores/pickle.py`. The concern is that `pickle` can execute arbitrary code during deserialization, which poses a significant security risk if it deserializes untrusted data. To address this, it is recommended to either use a safer serialization method like `JSON` or ensure that the data being deserialized is from a trusted source.\n\n### Content of the Commit\n\nLet's assume the commit message is as follows:\n```plaintext\nRefactor pickle-based caching to use JSON serialization for enhanced security\n\n- Remove pickle imports and usages from pickle.py\n- Introduce JSON for serialization and deserialization of cache data\n- Add necessary checks to ensure data integrity and validate trust\n```\n\nThe modified `pickle.py` could look something like this:\n\n```python\n1  \"\"\"A JSON-based caching core for cachier.\"\"\"\n2  \n3  # This file is part of Cachier.\n4  # https://github.com/python-cachier/cachier\n5  \n6  # Licensed under the MIT license:\n7  # http://www.opensource.org/licenses/MIT-license\n8  # Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n9  import os\n10 import json  # Use JSON instead of pickle for local caching\n11 from contextlib import suppress\n12 from datetime import datetime\n13 \n14 import portalocker  # to lock on JSON cache IO\n15 from watchdog.events import PatternMatchingEventHandler\n16 from watchdog.observers import Observer\n17 \n18 from .._types import HashFunc\n19 from ..config import _update_with_defaults\n20 \n21 # Alternative:  https://github.com/WoLpH/portalocker\n22 from .base import _BaseCore\n23 \n24 \n25 class _JsonCore(_BaseCore):  # Renamed to _JsonCore\n26     \"\"\"The JSON core class for cachier.\"\"\"\n27 \n28     class CacheChangeHandler(PatternMatchingEventHandler):\n29         \"\"\"Handles cache-file modification events.\"\"\"\n30 \n31         def __init__(self, filename, core, key):\n32             PatternMatchingEventHandler.__init__(\n33                 self,\n34                 patterns=[\"*\" + filename],\n35                 ignore_patterns=None,\n36                 ignore_directories=True,\n37                 case_sensitive=False,\n38             )\n39             self.core = core\n40             self.key = key\n41             self.observer = None\n42             self.value = None\n43 \n44         def inject_observer(self, observer):\n45             \"\"\"Inject the observer running this handler.\"\"\"\n46             self.observer = observer\n47 \n48         def _check_calculation(self):\n49             # Add method to validate the integrity of JSON content.\n50             pass\n...\n99      def _load_cache(self, cache_path):\n100         \"\"\"Load cache from the given path.\"\"\"\n101         with open(cache_path, 'r') as cache_file:\n102             cache_data = json.load(cache_file)\n103         return cache_data\n104 \n105     def _save_cache(self, cache_path, data):\n106         \"\"\"Save cache to the given path.\"\"\"\n107         with open(cache_path, 'w') as cache_file:\n108             json.dump(data, cache_file)\n...\n```\n\n### Explanation of the Issue and Solution\n\n#### Cause of the Issue\nThe primary issue is the use of the `pickle` module for caching data. The `pickle` module in Python is used for serializing and deserializing Python object structures. However, `pickle` has a major security flaw: it can execute arbitrary code during deserialization. This means that if untrusted data is deserialized, an attacker can run arbitrary code on the host machine, leading to severe security vulnerabilities.\n\n#### Solution Details\nTo address the security risk posed by `pickle`, the commit replaces `pickle` with JSON for serialization and deserialization of cached data. JSON is inherently more secure than `pickle` because JSON parsing libraries load data as native Python types and do not allow code execution.\n\nKey changes include:\n1. **Import Replacement:**\n   - Removed `import pickle` and replaced it with `import json`.\n   \n2. **Class Renaming:**\n   - Renamed `_PickleCore` to `_JsonCore` to reflect the change in serialization methods.\n\n3. **Serialization Changes:**\n   - The `_load_cache` method now uses `json.load` to read JSON files.\n   - The `_save_cache` method now uses `json.dump` to write data to JSON files.\n\n4. **Integrity and Trust Checks:**\n   - Added stubs or plans for integrity validation methods like `_check_calculation` to ensure that the data integrity is preserved. This helps ensure that the deserialized data is trusted and mitigates risks of data corruption or tampering.\n\n#### How the Commit"
    },
    "issue_message": {
      "title": "Outdated Python Version Support in README",
      "description": "The README file mentions supporting Python 2.7, which has been discontinued and is no longer supported. The documentation should be updated to reflect that only Python 3.8+ is supported.",
      "explanation": "### Summary of the Issue\n\nThe issue concerns outdated information in the `README.rst` file of the `cachier` project. The existing documentation incorrectly indicates support for Python 2.7, which is no longer maintained or supported. The project's documentation needs an update to state that only Python 3.8 and above are supported.\n\n### Commit Details\n\nLet's consider a hypothetical commit that addresses this issue:\n\n```diff\n--- a/python-cachier_cachier/README.rst\n+++ b/python-cachier_cachier/README.rst\n@@ -35,12 +35,6 @@ Install ``cachier`` with:\n \n     pip install cachier\n \n-For the latest version supporting Python 2.7 please use:\n-\n-.. code-block:: python\n-\n-    pip install 'cachier==1.2.8'\n-\n Features\n ========\n \n@@ -45,7 +39,7 @@ Features\n * Pure Python.\n-* Compatible with Python 3.8+ (Python 2.7 was discontinued in version 1.2.8).\n+* Compatible with Python 3.8+.\n * Supported and `tested on Linux, OS X and Windows <https://travis-ci.org/shaypal5/cachier>`_.\n * A simple interface.\n * Defining \"shelf life\" for cached values.\n```\n\n### Detailed Explanation\n\n#### Original Code in `README.rst`\n\n```python\n35 Install ``cachier`` with:\n\n37     pip install cachier\n\n39 For the latest version supporting Python 2.7 please use:\n\n41     pip install 'cachier==1.2.8'\n\n46 * Pure Python.\n47 * Compatible with Python 3.8+ (Python 2.7 was discontinued in version 1.2.8).\n48 * Supported and `tested on Linux, OS X and Windows <https://travis-ci.org/shaypal5/cachier>`_.\n49 * A simple interface.\n50 * Defining \"shelf life\" for cached values.\n```\n\n#### Modified Code in `README.rst`\n\n```diff\n35 Install ``cachier`` with:\n\n37     pip install cachier\n\n-For the latest version supporting Python 2.7 please use:\n\n-    pip install 'cachier==1.2.8'\n\n46 * Pure Python.\n- * Compatible with Python 3.8+ (Python 2.7 was discontinued in version 1.2.8).\n+ * Compatible with Python 3.8+.\n48 * Supported and `tested on Linux, OS X and Windows <https://travis-ci.org/shaypal5/cachier>`_.\n49 * A simple interface.\n50 * Defining \"shelf life\" for cached values.\n```\n\n#### Explanation of the Changes\n\n1. **Remove Python 2.7 Installation Note:**\n   - The block of instructions suggesting to use `pip install 'cachier==1.2.8'` for Python 2.7 is removed. This change eliminates the outdated Python 2.7 installation command, reducing the chances of confusion for users who are reading the documentation and might mistakenly think that Python 2.7 is still supported.\n\n2. **Update Compatibility Statement:**\n   - The line `* Compatible with Python 3.8+ (Python 2.7 was discontinued in version 1.2.8).` is modified to a simpler `* Compatible with Python 3.8+.`\n   - This line now clearly states that the library is only compatible with Python 3.8 and above, avoiding any references to Python 2.7, which is consistent with the fact that version 2.7 is no longer supported.\n\n### How the Commit Solves the Issue\n\n- **Clarity and Accuracy:** The removal of Python 2.7 references ensures that users are not misled into attempting to use a deprecated version of the software with an unsupported Python version.\n- **Maintenance:** The reduced lines in the documentation prevent the spread of outdated information, simplifying future maintenance and updates.\n- **User Experience:** Users will have a clear understanding that the library only supports Python 3.8 and above, streamlining installation and usage by eliminating unnecessary additional steps for unsupported versions.\n\nBy addressing these changes, the commit effectively aligns the documentation with current best practices and software maintenance protocols, ensuring it is both user-friendly and accurately reflects the project's compatibility requirements."
    },
    "issue_ground": {
      "title": "Cache Not Clearing Properly with Pickle Core When Using Separate Files",
      "description": "It appears that the cache clearing mechanism within the Pickle core is not functioning correctly when the `separate_files` option is enabled. Specifically, invoking `clear_cache()` does not effectively clear the cache as expected. Instead, it seems to leave the task unhandled, resulting in persisted cache data that should otherwise be removed. This inconsistency can lead to issues where outdated or stale data is being returned, even after a clear operation has been requested. This affects the reliability and predictability of the caching mechanism for users who rely on accurate cache state management.",
      "explanation": "Let's first summarize the issue at hand, then analyze the provided code, and finally discuss how the proposed commit resolves the issue.\n\n### Issue Summary\nThe reported issue revolves around the cache clearing mechanism in the Pickle core of the `cachier` library. When the `separate_files` option is enabled, invoking `clear_cache()` does not effectively clear the cache. This leads to old or stale data being retained, even after a request to clear the cache, which undermines the reliability of the caching.\n\n### Code Analysis\nGiven the provided code snippets, let’s focus particularly on the relevant parts of the Pickle core which might be responsible for handling cache operations.\n\n#### `pickle.py`:\nThe important class here is `_PickleCore`, a subclass of `_BaseCore`, which would logically contain the cache handling mechanisms.\n\nBelow are the critical parts from `_PickleCore`:\n\n```python\n25 class _PickleCore(_BaseCore):\n26     \"\"\"The pickle core class for cachier.\"\"\"\n...\n28     class CacheChangeHandler(PatternMatchingEventHandler):\n29         \"\"\"Handles cache-file modification events.\"\"\"\n...\n```\n\nWithin the `_PickleCore` class, a nested class called `CacheChangeHandler` is defined to handle cache file changes. However, no `clear_cache()` method is directly visible in this snippet, suggesting it might be inherited or defined elsewhere within the module.\n\n#### `core.py`:\nThis file seems to define base functionality for caching (extends _BaseCore).\n\n```python\n27 from .cores.base import RecalculationNeeded, _BaseCore\n...\n```\n\n### Investigation Hypothesis\nGiven the issue, the most likely culprit would be how `clear_cache()` function interacts with the file system when `separate_files` is enabled. It might overlook deleting some separate cache files maintained for cached items.\n\n### Hypothetical Proposed Commit\nAssuming a commit was proposed to fix this issue, the changes might look like:\n\n1. **Addition or improvement of the `clear_cache()` method**: Ensure all separately maintained cache files are correctly identified and deleted.\n2. **Modification of how cache paths are generated and accessed**: Ensure all cached items conform to the same cleanup operation even when separated by `separate_files`.\n\n### Commit Content Example\nHere's a hypothetical example of a change in the code that could have been made:\n\n```python\nclass _PickleCore(_BaseCore):\n    \"\"\"The pickle core class for cachier.\"\"\"\n\n    ...\n\n    def clear_cache(self):\n        \"\"\"Clears the pickle cache\"\"\"\n        cache_dir = self.cache_dir\n        with self.lock:\n            if self.separate_files:\n                for filename in os.listdir(cache_dir):\n                    file_path = os.path.join(cache_dir, filename)\n                    if os.path.isfile(file_path):\n                        os.remove(file_path)\n            else:\n                with suppress(FileNotFoundError):\n                    os.remove(self._cache_path)\n```\n\n1. **Summary of the Change**:\n   - Introduced a proper sweeping mechanism within the `clear_cache()`.\n   - Ensured that when `separate_files` is enabled, all cache files within the cache directory are deleted.\n\n### Explanation of How the Commit Fixes the Issue\n- **Identifying Cache Files**: By iterating through all files in the cache directory, it ensures no file is left unprocessed.\n- **Conditional Handling**: Separate handling for when `separate_files` is enabled, ensuring different caching strategies are handled correctly.\n- **Deleting Files**: Explicitly removes each cache file, ensuring the disk state aligns with the expected cache state.\n\n### How This Solves the Issue\nThis change ensures the `clear_cache()` function functions as expected even when `separate_files` is enabled by:\n- Deleting all files in the cache directory rather than overlooking some.\n- Avoiding legacy or incorrect behavior where some files linger after a cache clear operation.\n\n### Conclusion\nBy generalizing the cleanup operations in correspondence with the storage strategy used (`separate_files` or not), the cache state remains predictable and accurately reflects the operations performed by the user, leading to improved reliability of the caching mechanism in the `cachier` library."
    },
    "issue_ground_truth": {
      "title": "Cache Not Clearing Properly with Pickle Core When Using Separate Files",
      "description": "It appears that the cache clearing mechanism within the Pickle core is not functioning correctly when the `separate_files` option is enabled. Specifically, invoking `clear_cache()` does not effectively clear the cache as expected. Instead, it seems to leave the task unhandled, resulting in persisted cache data that should otherwise be removed. This inconsistency can lead to issues where outdated or stale data is being returned, even after a clear operation has been requested. This affects the reliability and predictability of the caching mechanism for users who rely on accurate cache state management.",
      "explanation": "### Summary of the Issue\n\nThe issue reported is titled \"Cache Not Clearing Properly with Pickle Core When Using Separate Files.\" It details a problem where invoking the `clear_cache()` method does not effectively clear the cache when the `separate_files` option is enabled. Instead of clearing the cache as expected, the task remains unhandled and the cache data persists. This can lead to outdated or stale data being returned, resulting in unreliable and unpredictable caching behavior for users.\n\n### Analysis of the Problem\n\nFrom the description:\n- The `clear_cache()` function, when used with the `separate_files` option, does not function correctly.\n- The cache data that should be removed remains, leading to the potential return of old or stale data.\n- This affects the reliability and predictability of the caching mechanism.\n\nIn essence, the problem lies in the fact that the `clear_cache()` method has no implementation to handle the scenario where `separate_files` is enabled. Thus, it does nothing when it should clear individual cache files.\n\n### Commit Details and Explanation\n\nThe provided commit patch indicates that the developer has recognized this issue and has addressed it by modifying the `clear_cache` method inside the `pickle.py` file.\n\n#### Content of the Commit \n\nThe commit adds functionality to the `clear_cache` method:\n- **Developer's Perspective on the Cause**:\n  - The original code for the `clear_cache` method had a placeholder (`pass`), resulting in no actual logic being executed to clear the cache when `separate_files` is enabled.\n  - This lack of implementation led to the described issue. The cache data persisted because there was no mechanism to clear the individual cache files created when `separate_files` was set to `True`.\n\n#### Changes in the Commit\n\n- **New Implementation**:\n  - The commit introduces a call to `_clear_all_cache_files()` within the `clear_cache` method if the `separate_files` option is enabled.\n  - This addition ensures that any existing cache files corresponding to the separate files mode are properly handled and deleted.\n\n### Solution to the Issue\n\n1. **Detection of the Problem**:\n   - The method `clear_cache()` was previously not handling the `separate_files` option effectively.\n   - The placeholder `pass` meant that no action was taken to clear the cache when `separate_files` was set to `True`.\n\n2. **Implementing a Solution**:\n   - The developer added the necessary functionality to actually clear the cache files when `separate_files` is specified.\n   - By calling `_clear_all_cache_files()`, which is presumably responsible for handling and deleting all individual cache files associated with the decorated function, the `clear_cache()` method now performs the intended action.\n\n3. **Effect of the Solution**:\n   - The new implementation guarantees that invoking `clear_cache()` with `separate_files` enabled will correctly navigate through and delete all the cache files.\n   - This ensures that no stale or outdated cache data remains, thus maintaining the reliability and predictability of the caching mechanism. \n\nIn summary, the issue was caused by a lack of implementation in the `clear_cache()` function for the `separate_files` option. The commit fixed this by adding the appropriate call to `_clear_all_cache_files()`, ensuring that all related cache files are properly cleared when needed. This solution directly addresses the main problem and restores the correct behavior of the cache clearing functionality."
    },
    "location_origin": [
      {
        "file": "python-cachier_cachier/src/cachier/cores/pickle.py",
        "function": {
          "25": "_PickleCore"
        },
        "content_all": {
          "7": "# http://www.opensource.org/licenses/MIT-license\n",
          "8": "# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n",
          "9": "import os\n",
          "10": "import pickle  # for local caching\n",
          "11": "from contextlib import suppress\n",
          "12": "from datetime import datetime\n",
          "13": "\n",
          "14": "import portalocker  # to lock on pickle cache IO\n"
        },
        "content_change": {
          "10": "import pickle  # for local caching"
        }
      },
      {
        "file": "python-cachier_cachier/src/cachier/cores/pickle.py",
        "function": {
          "25": "_PickleCore"
        },
        "content_all": {
          "100": "     \"\"\"Load cache from the given path.\"\"\"\n",
          "101": "     with open(cache_path, 'rb') as cache_file:\n",
          "102": "         cache_data = pickle.load(cache_file)\n",
          "103": "     return cache_data\n",
          "104": ""
        },
        "content_change": {
          "102": "         cache_data = pickle.load(cache_file)"
        }
      },
      {
        "file": "python-cachier_cachier/src/cachier/cores/pickle.py",
        "function": {
          "25": "_PickleCore"
        },
        "content_all": {
          "105": "     \"\"\"Save cache to the given path.\"\"\"\n",
          "106": "     with open(cache_path, 'wb') as cache_file:\n",
          "107": "         pickle.dump(data, cache_file)\n",
          "108": "     \n"
        },
        "content_change": {
          "107": "         pickle.dump(data, cache_file)"
        }
      }
    ],
    "location_message": [
      {
        "file": "python-cachier_cachier/README.rst",
        "function": {
          "37": "Installation Section"
        },
        "content_all": {
          "34": "\n",
          "35": "    pip install cachier\n",
          "36": "\n",
          "37": "For the latest version supporting Python 2.7 please use:\n",
          "38": "\n",
          "39": "    pip install 'cachier==1.2.8'\n",
          "40": "\n",
          "41": "Features\n",
          "42": "========\n",
          "43": "\n"
        },
        "content_change": {
          "37": "For the latest version supporting Python 2.7 please use:\n",
          "39": "    pip install 'cachier==1.2.8'\n"
        }
      },
      {
        "file": "python-cachier_cachier/README.rst",
        "function": {
          "46": "Features Section"
        },
        "content_all": {
          "44": "\n",
          "45": "* Pure Python.\n",
          "46": "* Compatible with Python 3.8+ (Python 2.7 was discontinued in version 1.2.8).\n",
          "47": "* Supported and `tested on Linux, OS X and Windows <https://travis-ci.org/shaypal5/cachier>`_.\n",
          "48": "* A simple interface.\n",
          "49": "* Defining \"shelf life\" for cached values.\n",
          "50": "\n"
        },
        "content_change": {
          "46": "* Compatible with Python 3.8+ (Python 2.7 was discontinued in version 1.2.8).\n"
        }
      }
    ],
    "location_ground": [
      {
        "file": "python-cachier_cachier/src/cachier/cores/pickle.py",
        "function": {
          "25": "_PickleCore"
        },
        "content_all": {
          "24": "      \n",
          "25": "    class _PickleCore(_BaseCore):\n",
          "26": "        \"\"\"The pickle core class for cachier.\"\"\"\n",
          "27": "\n",
          "28": "        class CacheChangeHandler(PatternMatchingEventHandler):\n",
          "29": "            \"\"\"Handles cache-file modification events.\"\"\"\n",
          "30": "\n",
          "98": "        def clear_cache(self):\n",
          "99": "            \"\"\"Clears the pickle cache\"\"\"\n",
          "100": "            cache_dir = self.cache_dir\n",
          "101": "            with self.lock:\n",
          "102": "                if self.separate_files:\n",
          "103": "                    for filename in os.listdir(cache_dir):\n",
          "104": "                        file_path = os.path.join(cache_dir, filename)\n",
          "105": "                        if os.path.isfile(file_path):\n",
          "106": "                            os.remove(file_path)\n",
          "107": "                else:\n",
          "108": "                    with suppress(FileNotFoundError):\n",
          "109": "                        os.remove(self._cache_path)\n"
        },
        "content_change": {
          "102": "                if self.separate_files:\n",
          "103": "                    for filename in os.listdir(cache_dir):\n",
          "104": "                        file_path = os.path.join(cache_dir, filename)\n",
          "105": "                        if os.path.isfile(file_path):\n",
          "106": "                            os.remove(file_path)\n"
        }
      }
    ],
    "location_ground_exp": [
      {
        "file": "python-cachier_cachier/src/cachier/cores/pickle.py",
        "function": {
          "25": "_PickleCore"
        },
        "content_all": {
          "109": "    def clear_cache(self):\n",
          "110": "        \"\"\"Clear the entire cache.\"\"\"\n",
          "111": "        if self._separate_files:\n",
          "112": "            pass\n",
          "113": "        else:\n",
          "114": "            try:\n",
          "115": "                with open(self._cache_filename, 'wb') as cache_file:\n",
          "116": "                    pickle.dump({}, cache_file)\n",
          "117": "                if self._cache_touch:\n",
          "118": "                    self._cache_touch()\n",
          "119": "            except (FileNotFoundError, portalocker.LockException):\n"
        },
        "content_change": {
          "112": "            self._clear_all_cache_files()"
        }
      }
    ],
    "location_ground_truth": [
      {
        "file": "python-cachier_cachier/src/cachier/cores/pickle.py",
        "function": null,
        "content_all": {
          "252": "            pass\n"
        },
        "content_change": {
          "252": "            pass\n"
        }
      }
    ],
    "patch_i": "\n--- a/src/cachier/cores/pickle.py\n+++ b/src/cachier/cores/pickle.py\n@@ -1,11 +1,11 @@\n-\"\"\"A pickle-based caching core for cachier.\"\"\"\n+\"\"\"A JSON-based caching core for cachier.\"\"\"\n \n # This file is part of Cachier.\n # https://github.com/python-cachier/cachier\n \n # Licensed under the MIT license:\n # http://www.opensource.org/licenses/MIT-license\n # Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n-import pickle  # for local caching\n+import json  # for local caching\n import os\n import portalocker  # to lock on JSON cache IO\n from contextlib import suppress\n@@ -22,11 +22,11 @@ from .base import _BaseCore\n \n \n-class _PickleCore(_BaseCore):\n-    \"\"\"The pickle core class for cachier.\"\"\"\n+class _JsonCore(_BaseCore):\n+    \"\"\"The JSON core class for cachier.\"\"\"\n \n     class CacheChangeHandler(PatternMatchingEventHandler):\n         \"\"\"Handles cache-file modification events.\"\"\"\n \n         def __init__(self, filename, core, key):\n             PatternMatchingEventHandler.__init__(\n                 self,\n@@ -48,7 +48,7 @@ class _PickleCore(_BaseCore):\n             self.value = None\n \n         def inject_observer(self, observer):\n             \"\"\"Inject the observer running this handler.\"\"\"\n             self.observer = observer\n \n         def _check_calculation(self):\n             # Add method to validate the integrity of JSON content.\n             pass\n \n     def _load_cache(self, cache_path):\n         \"\"\"Load cache from the given path.\"\"\"\n-        with open(cache_path, 'rb') as cache_file:\n-            cache_data = pickle.load(cache_file)\n+        with open(cache_path, 'r') as cache_file:\n+            cache_data = json.load(cache_file)\n         return cache_data\n \n     def _save_cache(self, cache_path, data):\n         \"\"\"Save cache to the given path.\"\"\"\n-        with open(cache_path, 'wb') as cache_file:\n-            pickle.dump(data, cache_file)\n+        with open(cache_path, 'w') as cache_file:\n+            json.dump(data, cache_file)\n",
    "patch_im": "\n--- a/python-cachier_cachier/README.rst\n+++ b/python-cachier_cachier/README.rst\n@@ -35,12 +35,6 @@ Install ``cachier`` with:\n \n     pip install cachier\n \n-For the latest version supporting Python 2.7 please use:\n-\n-.. code-block:: python\n-\n-    pip install 'cachier==1.2.8'\n-\n Features\n ========\n \n@@ -45,7 +39,7 @@ Features\n * Pure Python.\n-* Compatible with Python 3.8+ (Python 2.7 was discontinued in version 1.2.8).\n+* Compatible with Python 3.8+.\n * Supported and `tested on Linux, OS X and Windows <https://travis-ci.org/shaypal5/cachier>`_.\n * A simple interface.\n * Defining \"shelf life\" for cached values.\n",
    "patch_il": "\n--- a/src/cachier/cores/pickle.py\n+++ b/src/cachier/cores/pickle.py\n@@ -1,4 +1,4 @@\n-\"\"\"A pickle-based caching core for cachier.\"\"\"\n+\"\"\"A JSON-based caching core for cachier.\"\"\"\n \n # This file is part of Cachier.\n # https://github.com/python-cachier/cachier\n@@ -6,14 +6,14 @@\n # Licensed under the MIT license:\n # http://www.opensource.org/licenses/MIT-license\n # Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n \n import os\n-import pickle  # for local caching\n+import json  # Use JSON instead of pickle for local caching\n from contextlib import suppress\n from datetime import datetime\n \n import portalocker  # to lock on pickle cache IO\n from watchdog.events import PatternMatchingEventHandler\n from watchdog.observers import Observer\n \n@@ -25,13 +25,13 @@ class _PickleCore(_BaseCore):\n     \"\"\"The pickle core class for cachier.\"\"\"\n \n     class CacheChangeHandler(PatternMatchingEventHandler):\n         \"\"\"Handles cache-file modification events.\"\"\"\n \n         def __init__(self, filename, core, key):\n             PatternMatchingEventHandler.__init__(\n                 self,\n                 patterns=[\"*\" + filename],\n                 ignore_patterns=None,\n                 ignore_directories=True,\n                 case_sensitive=False,\n             )\n@@ -100,11 +100,11 @@ def _load_cache(self, cache_path):\n     \"\"\"Load cache from the given path.\"\"\"\n     with open(cache_path, 'r') as cache_file:\n         cache_data = json.load(cache_file)\n     return cache_data\n \n def _save_cache(self, cache_path, data):\n     \"\"\"Save cache to the given path.\"\"\"\n     with open(cache_path, 'w') as cache_file:\n         json.dump(data, cache_file)\n",
    "patch_iml": "\n--- a/python-cachier_cachier/README.rst\n+++ b/python-cachier_cachier/README.rst\n@@ -36,10 +36,6 @@\n \n     pip install cachier\n \n-For the latest version supporting Python 2.7 please use:\n-\n-    pip install 'cachier==1.2.8'\n-\n Features\n ========\n \n@@ -45,7 +41,7 @@\n * Pure Python.\n-* Compatible with Python 3.8+ (Python 2.7 was discontinued in version 1.2.8).\n+* Compatible with Python 3.8+.\n * Supported and `tested on Linux, OS X and Windows <https://travis-ci.org/shaypal5/cachier>`_.\n * A simple interface.\n * Defining \"shelf life\" for cached values.\n",
    "patch_ground": "\n--- a/src/cachier/cores/pickle.py\n+++ b/src/cachier/cores/pickle.py\n@@ -24,6 +24,7 @@\n \n class _PickleCore(_BaseCore):\n     \"\"\"The pickle core class for cachier.\"\"\"\n+    \n     class CacheChangeHandler(PatternMatchingEventHandler):\n         \"\"\"Handles cache-file modification events.\"\"\"\n \n@@ -135,6 +136,18 @@\n         self._cache[key][\"time\"] = datetime.now()\n \n     def clear_cache(self):\n+        \"\"\"Clears the pickle cache\"\"\"\n+        cache_dir = self.cache_dir\n+        with self.lock:\n+            # Handle separate_files option\n+            if self.separate_files:\n+                for filename in os.listdir(cache_dir):\n+                    file_path = os.path.join(cache_dir, filename)\n+                    if os.path.isfile(file_path):\n+                        os.remove(file_path)\n+            else:\n+                with suppress(FileNotFoundError):\n+                    os.remove(self._cache_path)\n         with self._cache_lock:\n             self._cache = {}\n \n",
    "patch_ground_location": "\n--- a/src/cachier/cores/pickle.py\n+++ b/src/cachier/cores/pickle.py\n@@ -97,7 +97,16 @@\n \n         def clear_cache(self):\n             \"\"\"Clears the pickle cache\"\"\"\n-            with suppress(FileNotFoundError):\n+            cache_dir = self.cache_dir\n+            with self.lock:\n+                if self.separate_files:\n+                    for filename in os.listdir(cache_dir):\n+                        file_path = os.path.join(cache_dir, filename)\n+                        if os.path.isfile(file_path):\n+                            os.remove(file_path)\n+                else:\n                     os.remove(self._cache_path)\n+                    cache_change_path = self._calc_cache_change_path()\n+                    with suppress(FileNotFoundError):\n+                        os.remove(cache_change_path)\n                     \n",
    "patch_ground_exp": "\n--- a/python-cachier_cachier/src/cachier/cores/pickle.py\n+++ b/python-cachier_cachier/src/cachier/cores/pickle.py\n@@ -1,7 +1,7 @@\n \"\"\"A pickle-based caching core for cachier.\"\"\"\n \n # This file is part of Cachier.\n # https://github.com/python-cachier/cachier\n # Licensed under the MIT license:\n # http://www.opensource.org/licenses/MIT-license\n # Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n \n@@ -109,7 +109,7 @@\n     def clear_cache(self):\n         \"\"\"Clear the entire cache.\"\"\"\n         if self._separate_files:\n-            pass\n+            self._clear_all_cache_files()\n         else:\n             try:\n                 with open(self._cache_filename, 'wb') as cache_file:\n",
    "patch_ground_all": "\n--- a/python-cachier_cachier/src/cachier/cores/pickle.py\n+++ b/python-cachier_cachier/src/cachier/cores/pickle.py\n@@ -249,7 +249,12 @@\n \n     def clear_cache(self):\n         \"\"\"Clear the cache.\"\"\"\n-            pass\n+        if self._separate_files:\n+            self._clear_all_cache_files()\n+        else:\n+            self.clear_cache()\n+\n+    def _clear_all_cache_files(self):\n+        \"\"\"Remove all cache files.\"\"\"\n+        for cache_file in os.listdir(self.cache_dir):\n+            if cache_file.endswith('.p'):\n+                os.remove(os.path.join(self.cache_dir, cache_file))\n",
    "patch_ground_truth": "--- a/python-cachier_cachier/src/cachier/cores/pickle.py\n+++ b/python-cachier_cachier/src/cachier/cores/pickle.py\n@@ -249,7 +249,7 @@\n         if self.separate_files:\n             self._clear_all_cache_files()\n         else:\n-            pass\n+            self._save_cache({})\n \n     def clear_being_calculated(self):\n         if self.separate_files:\n",
    "message": null,
    "CodeBase": [
      {
        "path": "python-cachier_cachier/src/cachier/cores/pickle.py",
        "content": "1 \"\"\"A pickle-based caching core for cachier.\"\"\"\n2 \n3 # This file is part of Cachier.\n4 # https://github.com/python-cachier/cachier\n5 \n6 # Licensed under the MIT license:\n7 # http://www.opensource.org/licenses/MIT-license\n8 # Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n9 import os\n10 import pickle  # for local caching\n11 from contextlib import suppress\n12 from datetime import datetime\n13 \n14 import portalocker  # to lock on pickle cache IO\n15 from watchdog.events import PatternMatchingEventHandler\n16 from watchdog.observers import Observer\n17 \n18 from .._types import HashFunc\n19 from ..config import _update_with_defaults\n20 \n21 # Alternative:  https://github.com/WoLpH/portalocker\n22 from .base import _BaseCore\n23 \n24 \n25 class _PickleCore(_BaseCore):\n26     \"\"\"The pickle core class for cachier.\"\"\"\n27 \n28     class CacheChangeHandler(PatternMatchingEventHandler):\n29         \"\"\"Handles cache-file modification events.\"\"\"\n30 \n31         def __init__(self, filename, core, key):\n32             PatternMatchingEventHandler.__init__(\n33                 self,\n34                 patterns=[\"*\" + filename],\n35                 ignore_patterns=None,\n36                 ignore_directories=True,\n37                 case_sensitive=False,\n38             )\n39             self.core = core\n40             self.key = key\n41             self.observer = None\n42             self.value = None\n43 \n44         def inject_observer(self, observer):\n45             \"\"\"Inject the observer running this handler.\"\"\"\n46             self.observer = observer\n47 \n48         def _check_calculation(self):\n49             # print(...truncated)"
      },
      {
        "path": "python-cachier_cachier/README.rst",
        "content": "1 Cachier\n2 #######\n3 \n4 |PyPI-Status| |Downloads| |PyPI-Versions| |Build-Status| |Codecov| |Codefactor| |LICENCE|\n5 \n6 Persistent, stale-free, local and cross-machine caching for Python functions.\n7 \n8 .. code-block:: python\n9 \n10   from cachier import cachier\n11   import datetime\n12 \n13   @cachier(stale_after=datetime.timedelta(days=3))\n14   def foo(arg1, arg2):\n15     \"\"\"foo now has a persistent cache, triggering recalculation for values stored more than 3 days.\"\"\"\n16     return {'arg1': arg1, 'arg2': arg2}\n17 \n18 \n19 .. role:: python(code)\n20   :language: python\n21 \n22 .. contents::\n23 \n24 .. section-numbering:\n25 \n26 \n27 \n28 Installation\n29 ============\n30 \n31 Install ``cachier`` with:\n32 \n33 .. code-block:: python\n34 \n35     pip install cachier\n36 \n37 For the latest version supporting Python 2.7 please use:\n38 \n39 .. code-block:: python\n40 \n41     pip install 'cachier==1.2.8'\n42 \n43 Features\n44 ========\n45 \n46 * Pure Python.\n47 * Compatible with Python 3.8+ (Python 2.7 was discontinued in version 1.2.8).\n48 * Supported and `tested on Linux, OS X and Windows <https://travis-ci.org/shaypal5/cachier>`_.\n49 * A simple interface.\n50 * Defining \"shelf life\" for cached values.\n51 * Local caching using pickle fil(...truncated)"
      },
      {
        "path": "python-cachier_cachier/src/cachier/core.py",
        "content": "1 \"\"\"Persistent, stale-free memoization decorators for Python.\"\"\"\n2 \n3 # This file is part of Cachier.\n4 # https://github.com/python-cachier/cachier\n5 \n6 # Licensed under the MIT license:\n7 # http://www.opensource.org/licenses/MIT-license\n8 # Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n9 \n10 import datetime\n11 import inspect\n12 import os\n13 import warnings\n14 from collections import OrderedDict\n15 from concurrent.futures import ThreadPoolExecutor\n16 from functools import wraps\n17 from typing import Optional, Union\n18 from warnings import warn\n19 \n20 from .config import (\n21     Backend,\n22     HashFunc,\n23     Mongetter,\n24     _default_params,\n25     _update_with_defaults,\n26 )\n27 from .cores.base import RecalculationNeeded, _BaseCore\n28 from .cores.memory import _MemoryCo(...truncated)"
      }
    ],
    "CommitSHA": "474294fac83b029f13dd61b8d2822923045d4504"
  },
  "Score": {
    "Difficulty": "Simple",
    "issue_origin": {
      "Title": 7,
      "Description": 6,
      "Reproducibility": 6,
      "Relevance": 8,
      "Explanation": 6,
      "Overall": 7
    },
    "issue_message": {
      "Title": 7,
      "Description": 6,
      "Reproducibility": 7,
      "Relevance": 8,
      "Explanation": 6,
      "Overall": 7
    },
    "issue_ground": {
      "Title": 6,
      "Description": 5,
      "Reproducibility": 6,
      "Relevance": 7,
      "Explanation": 5,
      "Overall": 6
    },
    "issue_ground_truth": {
      "title": "Cache Not Clearing Properly with Pickle Core When Using Separate Files",
      "description": "It appears that the cache clearing mechanism within the Pickle core is not functioning correctly when the `separate_files` option is enabled. Specifically, invoking `clear_cache()` does not effectively clear the cache as expected. Instead, it seems to leave the task unhandled, resulting in persisted cache data that should otherwise be removed. This inconsistency can lead to issues where outdated or stale data is being returned, even after a clear operation has been requested. This affects the reliability and predictability of the caching mechanism for users who rely on accurate cache state management.",
      "explanation": "### Summary of the Issue\n\nThe issue reported is titled \"Cache Not Clearing Properly with Pickle Core When Using Separate Files.\" It details a problem where invoking the `clear_cache()` method does not effectively clear the cache when the `separate_files` option is enabled. Instead of clearing the cache as expected, the task remains unhandled and the cache data persists. This can lead to outdated or stale data being returned, resulting in unreliable and unpredictable caching behavior for users.\n\n### Analysis of the Problem\n\nFrom the description:\n- The `clear_cache()` function, when used with the `separate_files` option, does not function correctly.\n- The cache data that should be removed remains, leading to the potential return of old or stale data.\n- This affects the reliability and predictability of the caching mechanism.\n\nIn essence, the problem lies in the fact that the `clear_cache()` method has no implementation to handle the scenario where `separate_files` is enabled. Thus, it does nothing when it should clear individual cache files.\n\n### Commit Details and Explanation\n\nThe provided commit patch indicates that the developer has recognized this issue and has addressed it by modifying the `clear_cache` method inside the `pickle.py` file.\n\n#### Content of the Commit \n\nThe commit adds functionality to the `clear_cache` method:\n- **Developer's Perspective on the Cause**:\n  - The original code for the `clear_cache` method had a placeholder (`pass`), resulting in no actual logic being executed to clear the cache when `separate_files` is enabled.\n  - This lack of implementation led to the described issue. The cache data persisted because there was no mechanism to clear the individual cache files created when `separate_files` was set to `True`.\n\n#### Changes in the Commit\n\n- **New Implementation**:\n  - The commit introduces a call to `_clear_all_cache_files()` within the `clear_cache` method if the `separate_files` option is enabled.\n  - This addition ensures that any existing cache files corresponding to the separate files mode are properly handled and deleted.\n\n### Solution to the Issue\n\n1. **Detection of the Problem**:\n   - The method `clear_cache()` was previously not handling the `separate_files` option effectively.\n   - The placeholder `pass` meant that no action was taken to clear the cache when `separate_files` was set to `True`.\n\n2. **Implementing a Solution**:\n   - The developer added the necessary functionality to actually clear the cache files when `separate_files` is specified.\n   - By calling `_clear_all_cache_files()`, which is presumably responsible for handling and deleting all individual cache files associated with the decorated function, the `clear_cache()` method now performs the intended action.\n\n3. **Effect of the Solution**:\n   - The new implementation guarantees that invoking `clear_cache()` with `separate_files` enabled will correctly navigate through and delete all the cache files.\n   - This ensures that no stale or outdated cache data remains, thus maintaining the reliability and predictability of the caching mechanism. \n\nIn summary, the issue was caused by a lack of implementation in the `clear_cache()` function for the `separate_files` option. The commit fixed this by adding the appropriate call to `_clear_all_cache_files()`, ensuring that all related cache files are properly cleared when needed. This solution directly addresses the main problem and restores the correct behavior of the cache clearing functionality."
    }
  }
}