{
  "RepoName": "python-cachier/cachier.git",
  "CommitSHA": "474294fac83b029f13dd61b8d2822923045d4504",
  "Type": "logic error",
  "ErrorMessage": null,
  "Issue": {
    "title": "Cache Clearing Functionality Not Working as Expected",
    "description": "It has been observed that the `clear_cache` function in the `cachier` library does not perform its expected behavior of clearing the in-memory cache. This issue could lead to stale or outdated data being served from the cache, as the clear_cache function currently does nothing (`pass` statement). This can cause significant problems in scenarios where the cache needs to be invalidated and refreshed. Users relying on the clear_cache method to manually invalidate cache entries are currently unable to do so, resulting in potential data inconsistency. The function should be updated to properly clear the cache.",
    "explanation": "### Issue Summary\n\nThe issue at hand pertains to the `clear_cache` function within the `cachier` library. This function is intended to clear in-memory caches to prevent the use of stale or outdated data. However, the current implementation of `clear_cache` is non-functional as it contains merely a `pass` statement, which essentially means no operation is performed. This malfunction can lead to scenarios where users are unable to invalidate and refresh cache entries manually, resulting in potential data inconsistencies.\n\n### Detailed Explanation of the Issue\n\n1. **Cause of the Issue**:\n    - The `clear_cache` function is designed to ensure that the cache can be reset or cleared, allowing for the relevant data to be recalculated and updated. However, in the current implementation, the function does nothing because it contains just a `pass` statement.\n    - This issue becomes critical because if data within the cache becomes outdated or invalid, there's no mechanism to clear the cache, which means that stale data will continue to be served to users and applications relying on this cache.\n\n2. **Effects**:\n    - **Data Consistency Problems**: Users and applications may receive stale data from the cache, leading to inconsistencies and potential errors in data processing or user experience.\n    - **Manual Invalidations**: Users who attempt to clear the cache manually via `clear_cache` will find that their requests to invalidate the cache have no effect, leading to frustration and potential workaround hacks that may not be optimal.\n\n### Detailed Explanation of the Commit\n\n1. **Commit Changes**:\n    - The developer updated the `clear_cache` method within the `cachier/cores/memory.py` file.\n    - Instead of the method containing a `pass` statement, it now includes functionality to actually clear the cache by invoking the `clear` method on the cache dictionary.\n\n2. **How the Commit Solves the Issue**:\n    - **Implementing Cache Clearing**: By replacing the `pass` statement with a call to clear the cache (`self.cache.clear()`), the commit ensures that invoking `clear_cache` will now effectively reset the in-memory cache.\n    - **Enabling Manual Cache Invalidations**: Users can now call `clear_cache` to invalidate the cache entries. This is crucial for scenarios where the cache data needs to be purged to prevent serving outdated information.\n    - **Improving Data Reliability**: With the ability to clear the cache, the application can maintain more reliable and up-to-date data, thereby improving overall data consistency and reliability.\n\n### Solution Explanation\n\nThe solution to the issue was to modify the `clear_cache` function within the memory core of the `cachier` library to perform its intended function, which is to clear the in-memory cache. The developer did this by replacing the no-op `pass` statement with an actual call to clear the cache.\n\n- **Summary of the Change**: The `clear_cache` function now properly clears the in-memory cache.\n- **Effectiveness of the Change**:\n    - This ensures that the cache can be invalidated as needed.\n    - Users and applications relying on this functionality can now ensure they are not using outdated data.\n    - It enhances the flexibility and reliability of cache management within the `cachier` framework.\n\nBy implementing this change, the developer resolved the issue of non-functional cache clearing, allowing for more robust cache management and ensuring that the caching behavior aligns with user expectations and requirements for data consistency."
  },
  "Explain": "### Issue Summary\n\nThe issue at hand pertains to the `clear_cache` function within the `cachier` library. This function is intended to clear in-memory caches to prevent the use of stale or outdated data. However, the current implementation of `clear_cache` is non-functional as it contains merely a `pass` statement, which essentially means no operation is performed. This malfunction can lead to scenarios where users are unable to invalidate and refresh cache entries manually, resulting in potential data inconsistencies.\n\n### Detailed Explanation of the Issue\n\n1. **Cause of the Issue**:\n    - The `clear_cache` function is designed to ensure that the cache can be reset or cleared, allowing for the relevant data to be recalculated and updated. However, in the current implementation, the function does nothing because it contains just a `pass` statement.\n    - This issue becomes critical because if data within the cache becomes outdated or invalid, there's no mechanism to clear the cache, which means that stale data will continue to be served to users and applications relying on this cache.\n\n2. **Effects**:\n    - **Data Consistency Problems**: Users and applications may receive stale data from the cache, leading to inconsistencies and potential errors in data processing or user experience.\n    - **Manual Invalidations**: Users who attempt to clear the cache manually via `clear_cache` will find that their requests to invalidate the cache have no effect, leading to frustration and potential workaround hacks that may not be optimal.\n\n### Detailed Explanation of the Commit\n\n1. **Commit Changes**:\n    - The developer updated the `clear_cache` method within the `cachier/cores/memory.py` file.\n    - Instead of the method containing a `pass` statement, it now includes functionality to actually clear the cache by invoking the `clear` method on the cache dictionary.\n\n2. **How the Commit Solves the Issue**:\n    - **Implementing Cache Clearing**: By replacing the `pass` statement with a call to clear the cache (`self.cache.clear()`), the commit ensures that invoking `clear_cache` will now effectively reset the in-memory cache.\n    - **Enabling Manual Cache Invalidations**: Users can now call `clear_cache` to invalidate the cache entries. This is crucial for scenarios where the cache data needs to be purged to prevent serving outdated information.\n    - **Improving Data Reliability**: With the ability to clear the cache, the application can maintain more reliable and up-to-date data, thereby improving overall data consistency and reliability.\n\n### Solution Explanation\n\nThe solution to the issue was to modify the `clear_cache` function within the memory core of the `cachier` library to perform its intended function, which is to clear the in-memory cache. The developer did this by replacing the no-op `pass` statement with an actual call to clear the cache.\n\n- **Summary of the Change**: The `clear_cache` function now properly clears the in-memory cache.\n- **Effectiveness of the Change**:\n    - This ensures that the cache can be invalidated as needed.\n    - Users and applications relying on this functionality can now ensure they are not using outdated data.\n    - It enhances the flexibility and reliability of cache management within the `cachier` framework.\n\nBy implementing this change, the developer resolved the issue of non-functional cache clearing, allowing for more robust cache management and ensuring that the caching behavior aligns with user expectations and requirements for data consistency.",
  "Time": "2024-06-02",
  "Difficulty": "Simple",
  "OriginCode": [
    {
      "path": "python-cachier_cachier/MANIFEST.in",
      "content": "# Manifest syntax https://docs.python.org/2/distutils/sourcedist.html\ngraft wheelhouse\n\nrecursive-include src *.info\ninclude README.rst\ninclude LICENSE\n\nprune __pycache__\nprune tests\n"
    },
    {
      "path": "python-cachier_cachier/README.rst",
      "content": "Cachier\n#######\n\n|PyPI-Status| |Downloads| |PyPI-Versions| |Build-Status| |Codecov| |Codefactor| |LICENCE|\n\nPersistent, stale-free, local and cross-machine caching for Python functions.\n\n.. code-block:: python\n\n  from cachier import cachier\n  import datetime\n\n  @cachier(stale_after=datetime.timedelta(days=3))\n  def foo(arg1, arg2):\n    \"\"\"foo now has a persistent cache, triggering recalculation for values stored more than 3 days.\"\"\"\n    return {'arg1': arg1, 'arg2': arg2}\n\n\n.. role:: python(code)\n  :language: python\n\n.. contents::\n\n.. section-numbering:\n\n\n\nInstallation\n============\n\nInstall ``cachier`` with:\n\n.. code-block:: python\n\n    pip install cachier\n\nFor the latest version supporting Python 2.7 please use:\n\n.. code-block:: python\n\n    pip install 'cachier==1.2.8'\n\nFeatures\n========\n\n* Pure Python.\n* Compatible with Python 3.8+ (Python 2.7 was discontinued in version 1.2.8).\n* Supported and `tested on Linux, OS X and Windows <https://travis-ci.org/shaypal5/cachier>`_.\n* A simple interface.\n* Defining \"shelf life\" for cached values.\n* Local caching using pickle files.\n* Cross-machine caching using MongoDB.\n* Thread-safety.\n\nCachier is **NOT**:\n\n* Meant as a transient cache. Python's @lru_cache is better.\n* Especially fast. It is meant to replace function calls that take more than... a second, say (overhead is around 1 millisecond).\n\nFuture features\n---------------\n\n* S3 core.\n* Multi-core caching.\n* `Cache replacement policies <https://en.wikipedia.org/wiki/Cache_replacement_policies>`_\n\n\nUse\n===\n\nCachier provides a decorator which you can wrap around your functions to give them a persistent cache. The positional and keyword arguments to the wrapped function must be hashable (i.e. Python's immutable built-in objects, not mutable containers). Also, notice that since objects which are instances of user-defined classes are hashable but all compare unequal (their hash value is their id), equal objects across different sessions will not yield identical keys.\n\nSetting up a Cache\n------------------\nYou can add a default, pickle-based, persistent cache to your function - meaning it will last across different Python kernels calling the wrapped function - by decorating it with the ``cachier`` decorator (notice the ``()``!).\n\n.. code-block:: python\n\n  from cachier import cachier\n\n  @cachier()\n  def foo(arg1, arg2):\n    \"\"\"Your function now has a persistent cache mapped by argument values!\"\"\"\n    return {'arg1': arg1, 'arg2': arg2}\n\nClass and object methods can also be cached. Cachier will automatically ignore the `self` parameter when determining the cache key for an object method. **This means that methods will be cached across all instances of an object, which may not be what you want.**\n\n.. code-block:: python\n\n  from cachier import cachier\n\n  class Foo():\n    @staticmethod\n    @cachier()\n    def good_static_usage(arg_1, arg_2):\n      return arg_1 + arg_2\n\n    # Instance method does not depend on object's internal state, so good to cache\n    @cachier()\n    def good_usage_1(self, arg_1, arg_2):\n      return arg_1 + arg_2\n\n    # Instance method is calling external service, probably okay to cache\n    @cachier()\n    def good_usage_2(self, arg_1, arg_2):\n      result = self.call_api(arg_1, arg_2)\n      return result\n\n    # Instance method relies on object attribute, NOT good to cache\n    @cachier()\n    def bad_usage(self, arg_1, arg_2):\n      return arg_1 + arg_2 + self.arg_3\n\n\nResetting a Cache\n-----------------\nThe Cachier wrapper adds a ``clear_cache()`` function to each wrapped function. To reset the cache of the wrapped function simply call this method:\n\n.. code-block:: python\n\n  foo.clear_cache()\n\nGeneral Configuration\n----------------------\n\nGlobal Defaults\n~~~~~~~~~~~~~~~\n\nSettings can be globally configured across all Cachier wrappers through the use of the `set_default_params` function. This function takes the same keyword parameters as the ones defined in the decorator, which can be passed all at once or with multiple calls. Parameters given directly to a decorator take precedence over any values set by this function.\n\nThe following parameters will only be applied to decorators defined after `set_default_params` is called:\n\n*  `hash_func`\n*  `backend`\n*  `mongetter`\n*  `cache_dir`\n*  `pickle_reload`\n*  `separate_files`\n\nThese parameters can be changed at any time and they will apply to all decorators:\n\n*  `allow_none`\n*  `caching_enabled`\n*  `stale_after`\n*  `next_time`\n*  `wait_for_calc_timeout`\n\nThe current defaults can be fetched by calling `get_default_params`.\n\nThreads Limit\n~~~~~~~~~~~~~\n\nTo limit the number of threads Cachier is allowed to spawn, set the ``CACHIER_MAX_WORKERS`` with the desired number. The default is 8, so to enable Cachier to spawn even more threads, you'll have to set a higher limit explicitly.\n\n\nGlobal Enable/Disable\n---------------------\n\nCaching can be turned off across all decorators by calling `disable_caching`, and then re-activated by calling `enable_caching`.\n\nThese functions are convenience wrappers around the `caching_enabled` default setting.\n\n\nCache Shelf Life\n----------------\n\nSetting Shelf Life\n~~~~~~~~~~~~~~~~~~\nYou can set any duration as the shelf life of cached return values of a function by providing a corresponding ``timedelta`` object to the ``stale_after`` parameter:\n\n.. code-block:: python\n\n  import datetime\n\n  @cachier(stale_after=datetime.timedelta(weeks=2))\n  def bar(arg1, arg2):\n    return {'arg1': arg1, 'arg2': arg2}\n\nNow when a cached value matching the given arguments is found the time of its calculation is checked; if more than ``stale_after`` time has since passed, the function will be run again for the same arguments and the new value will be cached and returned.\n\nThis is useful for lengthy calculations that depend on a dynamic data source.\n\nFuzzy Shelf Life\n~~~~~~~~~~~~~~~~\nSometimes you may want your function to trigger a calculation when it encounters a stale result, but still not wait on it if it's not that critical. In that case, you can set ``next_time`` to ``True`` to have your function trigger a recalculation **in a separate thread**, but return the currently cached stale value:\n\n.. code-block:: python\n\n  @cachier(next_time=True)\n\nFurther function calls made while the calculation is being performed will not trigger redundant calculations.\n\n\nWorking with unhashable arguments\n---------------------------------\n\nAs mentioned above, the positional and keyword arguments to the wrapped function must be hashable (i.e. Python's immutable built-in objects, not mutable containers). To get around this limitation the ``hash_func`` parameter of the ``cachier`` decorator can be provided with a callable that gets the args and kwargs from the decorated function and returns a hash key for them.\n\n.. code-block:: python\n\n  def calculate_hash(args, kwds):\n    key = ...  # compute a hash key here based on arguments\n    return key\n\n  @cachier(hash_func=calculate_hash)\n  def calculate_super_complex_stuff(custom_obj):\n    # amazing code goes here\n\nSee here for an example:\n\n`Question: How to work with unhashable arguments <https://github.com/python-cachier/cachier/issues/91>`_\n\n\nPrecaching values\n---------------------------------\n\nIf you want to load a value into the cache without calling the underlying function, this can be done with the `precache_value` function.\n\n.. code-block:: python\n\n  @cachier()\n  def add(arg1, arg2):\n    return arg1 + arg2\n\n  add.precache_value(2, 2, value_to_cache=5)\n\n  result = add(2, 2)\n  print(result)  # prints 5\n\n\nPer-function call arguments\n---------------------------\n\nCachier also accepts several keyword arguments in the calls of the function it wraps rather than in the decorator call, allowing you to modify its behaviour for a specific function call.\n\nIgnore Cache\n~~~~~~~~~~~~\n\nYou can have ``cachier`` ignore any existing cache for a specific function call by passing ``cachier__skip_cache=True`` to the function call. The cache will neither be checked nor updated with the new return value.\n\n.. code-block:: python\n\n  @cachier()\n  def sum(first_num, second_num):\n    return first_num + second_num\n\n  def main():\n    print(sum(5, 3, cachier__skip_cache=True))\n\nOverwrite Cache\n~~~~~~~~~~~~~~~\n\nYou can have ``cachier`` overwrite an existing cache entry - if one exists - for a specific function call by passing ``cachier__overwrite_cache=True`` to the function call. The cache will not be checked but will be updated with the new return value.\n\nVerbose Cache Call\n~~~~~~~~~~~~~~~~~~\n\nYou can have ``cachier`` print out a detailed explanation of the logic of a specific call by passing ``cachier__verbose=True`` to the function call. This can be useful if you are not sure why a certain function result is, or is not, returned.\n\nCache `None` Values\n~~~~~~~~~~~~~~~~~~~\n\nBy default, ``cachier`` does not cache ``None`` values. You can override this behaviour by passing ``allow_none=True`` to the function call.\n\n\nCachier Cores\n=============\n\nPickle Core\n-----------\n\nThe default core for Cachier is pickle based, meaning each function will store its cache is a separate pickle file in the ``~/.cachier`` directory. Naturally, this kind of cache is both machine-specific and user-specific.\n\nYou can configure ``cachier`` to use another directory by providing the ``cache_dir`` parameter with the path to that directory:\n\n.. code-block:: python\n\n  @cachier(cache_dir='~/.temp/.cache')\n\n\nYou can slightly optimise pickle-based caching if you know your code will only be used in a single thread environment by setting:\n\n.. code-block:: python\n\n  @cachier(pickle_reload=False)\n\nThis will prevent reading the cache file on each cache read, speeding things up a bit, while also nullifying inter-thread functionality (the code is still thread safe, but different threads will have different versions of the cache at times, and will sometime make unnecessary function calls).\n\nSetting the optional argument ``separate_files`` to ``True`` will cause the cache to be stored in several files: A file per argument set, per function. This can help if your per-function cache files become too large.\n\n.. code-block:: python\n\n  from cachier import cachier\n\n  @cachier(separate_files=True)\n  def foo(arg1, arg2):\n    \"\"\"Your function now has a persistent cache mapped by argument values, split across several files, per argument set\"\"\"\n    return {'arg1': arg1, 'arg2': arg2}\n\nYou can get the fully qualified path to the directory of cache files used by ``cachier`` (``~/.cachier`` by default) by calling the ``cache_dpath()`` function:\n\n.. code-block:: python\n\n  >>> foo.cache_dpath()\n      \"/home/user/.cachier/\"\n\n\nMongoDB Core\n------------\nYou can set a MongoDB-based cache by assigning ``mongetter`` with a callable that returns a ``pymongo.Collection`` object with writing permissions:\n\n.. code-block:: python\n\n    from pymongo import MongoClient\n\n    def my_mongetter():\n        client = MongoClient(get_cachier_db_auth_uri())\n        db_obj = client['cachier_db']\n        if 'someapp_cachier_db' not in db_obj.list_collection_names():\n            db_obj.create_collection('someapp_cachier_db')\n        return db_obj['someapp_cachier_db']\n\n  @cachier(mongetter=my_mongetter)\n\nThis allows you to have a cross-machine, albeit slower, cache. This functionality requires that the installation of the ``pymongo`` python package.\n\nIn certain cases the MongoDB backend might leave a deadlock behind, blocking all subsequent requests from being processed. If you encounter this issue, supply the ``wait_for_calc_timeout`` with a reasonable number of seconds; calls will then wait at most this number of seconds before triggering a recalculation.\n\n.. code-block:: python\n\n  @cachier(mongetter=False, wait_for_calc_timeout=2)\n\n\nMemory Core\n-----------\n\nYou can set an in-memory cache by assigning the ``backend`` parameter with ``'memory'``:\n\n.. code-block:: python\n\n  @cachier(backend='memory')\n\nNote, however, that ``cachier``'s in-memory core is simple, and has no monitoring or cap on cache size, and can thus lead to memory errors on large return values - it is mainly intended to be used with future multi-core functionality. As a rule, Python's built-in ``lru_cache`` is a much better stand-alone solution.\n\n\nContributing\n============\n\nCurrent maintainers are Shay Palachy Affek (`shay.palachy@gmail.com <mailto:shay.palachy@gmail.com>`_, `@shaypal5 <https://github.com/shaypal5>`_) and Judson Neer (`@lordjabez <https://github.com/lordjabez>`_); You are more than welcome to approach them for help. Contributions are very welcomed! :)\n\nInstalling for development\n--------------------------\n\nClone:\n\n.. code-block:: bash\n\n  git clone git@github.com:python-cachier/cachier.git\n\n\nInstall in development mode with test dependencies:\n\n.. code-block:: bash\n\n  cd cachier\n  pip install -e . -r tests/requirements.txt\n\n\nRunning the tests\n-----------------\n\nTo run the tests, call the ``pytest`` command in the repository's root, or:\n\n.. code-block:: bash\n\n  python -m pytest\n\nTo run only MongoDB core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m mongo\n\nTo run only memory core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m memory\n\nTo run all tests EXCEPT MongoDB core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m \"not mongo\"\n\n\nTo run all tests EXCEPT memory core AND MongoDB core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m \"not (mongo or memory)\"\n\n\nRunning MongoDB tests against a live MongoDB instance\n-----------------------------------------------------\n\n**Note to developers:** By default, all MongoDB tests are run against a mocked MongoDB instance, provided by the ``pymongo_inmemory`` package. To run them against a live MongoDB instance, the ``CACHIER_TEST_VS_DOCKERIZED_MONGO`` environment variable is set to ``True`` in the ``test`` environment of this repository (and additional environment variables are populated with the appropriate credentials), used by the GitHub Action running tests on every commit and pull request.\n\nContributors are not expected to run these tests against a live MongoDB instance when developing, as credentials for the testing instance used will NOT be shared, but rather use the testing against the in-memory MongoDB instance as a good proxy.\n\n**HOWEVER, the tests run against a live MongoDB instance when you submit a PR are the determining tests for deciding whether your code functions correctly against MongoDB.**\n\n\nAdding documentation\n--------------------\n\nThis project is documented using the `numpy docstring conventions`_, which were chosen as they are perhaps the most widely-spread conventions that are both supported by common tools such as Sphinx and result in human-readable docstrings (in my personal opinion, of course). When documenting code you add to this project, please follow `these conventions`_.\n\n.. _`numpy docstring conventions`: https://github.com/numpy/numpy/blob/master/doc/HOWTO_DOCUMENT.rst.txt\n.. _`these conventions`: https://github.com/numpy/numpy/blob/master/doc/HOWTO_DOCUMENT.rst.txt\n\nAdditionally, if you update this ``README.rst`` file, use ``python setup.py checkdocs`` to validate it compiles.\n\n\nCredits\n=======\n\nCreated by `Shay Palachy Affek <https://github.com/shaypal5>`_ (shay.palachy@gmail.com), which currently assists in maintenance.\n\nCurrent lead developer/contributor: `Jirka Borovec <https://github.com/Borda>`_ (`@Borda <https://github.com/Borda>`_ on GitHub).\n\nOther major contributors:\n\n* `Jirka Borovec <https://github.com/Borda>`_ - Arg order independence, args-to-kwargs for less unique keys and numerous development and CI contributions.\n\n* `Judson Neer <https://github.com/lordjabez>`_ - Precaching, method caching support and numerous improvements and bugfixes.\n\n* `cthoyt <https://github.com/cthoyt>`_ - Base memory core implementation.\n\n* `amarczew <https://github.com/amarczew>`_ - The ``hash_func`` kwarg.\n\n* `non-senses <https://github.com/non-senses>`_ - The ``wait_for_calc_timeout`` kwarg.\n\n* `Elad Rapaport <https://github.com/erap129>`_ - Multi-file Pickle core, a.k.a ``separate_files`` (released on ``v1.5.3``).\n\n* `John Didion <https://github.com/jdidion>`_ - Support for pickle-based caching for cases where two identically-named methods of different classes are defined in the same module.\n\nNotable bugfixers:\n\n* `MichaelRazum <https://github.com/MichaelRazum>`_.\n\n* `Eric Ma <https://github.com/ericmjl>`_ - The iNotify bugfix (released on ``v1.5.3``).\n\n* `Ofir <https://github.com/ofirnk>`_ - The iNotify bugfix (released on ``v1.5.3``).\n\n\n\n.. |PyPI-Status| image:: https://img.shields.io/pypi/v/cachier.svg\n  :target: https://pypi.python.org/pypi/cachier\n\n.. |PyPI-Versions| image:: https://img.shields.io/pypi/pyversions/cachier.svg\n   :target: https://pypi.python.org/pypi/cachier\n\n.. |Build-Status| image:: https://github.com/python-cachier/cachier/actions/workflows/test.yml/badge.svg\n  :target: https://github.com/python-cachier/cachier/actions/workflows/test.yml\n\n.. |LICENCE| image:: https://img.shields.io/pypi/l/cachier.svg\n  :target: https://pypi.python.org/pypi/cachier\n\n.. |Codecov| image:: https://codecov.io/github/python-cachier/cachier/coverage.svg?branch=master\n   :target: https://codecov.io/github/python-cachier/cachier?branch=master\n\n.. |Downloads| image:: https://pepy.tech/badge/cachier\n     :target: https://pepy.tech/project/cachier\n     :alt: PePy stats\n\n.. |Codefactor| image:: https://www.codefactor.io/repository/github/python-cachier/cachier/badge?style=plastic\n     :target: https://www.codefactor.io/repository/github/python-cachier/cachier\n     :alt: Codefactor code quality\n\n.. links:\n.. _pymongo: https://api.mongodb.com/python/current/\n.. _watchdog: https://github.com/gorakhargosh/watchdog\n"
    },
    {
      "path": "python-cachier_cachier/LICENSE",
      "content": "MIT License\n\nCopyright (c) 2016 Shay Palachy\n\nPermission is hereby granted, free of charge, to any person obtaining a copy\nof this software and associated documentation files (the \"Software\"), to deal\nin the Software without restriction, including without limitation the rights\nto use, copy, modify, merge, publish, distribute, sublicense, and/or sell\ncopies of the Software, and to permit persons to whom the Software is\nfurnished to do so, subject to the following conditions:\n\nThe above copyright notice and this permission notice shall be included in all\ncopies or substantial portions of the Software.\n\nTHE SOFTWARE IS PROVIDED \"AS IS\", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR\nIMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,\nFITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE\nAUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER\nLIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,\nOUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE\nSOFTWARE.\n"
    },
    {
      "path": "python-cachier_cachier/pyproject.toml",
      "content": "[project]\nname = \"cachier\"\ndescription = \"Persistent, stale-free, local and cross-machine caching for Python functions.\"\nreadme = \"README.rst\"\nkeywords = [\n  \"cache\",\n  \"caching\",\n  \"cross-machine\",\n  \"decorator\",\n  \"local\",\n  \"memoization\",\n  \"mongo\",\n  \"persistent\",\n]\nlicense = {file = \"LICENSE\"}\nauthors = [\n   {name = \"Shay Palachy Affek\", email = 'shay.palachy@gmail.com'},\n]\nclassifiers = [\n  'Development Status :: 4 - Beta',\n  'Intended Audience :: Developers',\n  'License :: OSI Approved :: MIT License',\n  'Programming Language :: Python',\n  \"Programming Language :: Python :: 3 :: Only\",\n  \"Programming Language :: Python :: 3.8\",\n  \"Programming Language :: Python :: 3.9\",\n  \"Programming Language :: Python :: 3.10\",\n  \"Programming Language :: Python :: 3.11\",\n  \"Programming Language :: Python :: 3.12\",\n  \"Topic :: Other/Nonlisted Topic\",\n  \"Topic :: Software Development :: Libraries\",\n  \"Topic :: Software Development :: Libraries :: Python Modules\",\n  \"Topic :: Utilities\",\n]\ndynamic = [\n  \"version\",\n]\ndependencies = [\n  \"portalocker>=2.3.2\",\n  \"watchdog>=2.3.1\",\n]\n[project.urls]\nSource = \"https://github.com/python-cachier/cachier\"\n[project.scripts]\ncachier = \"cachier.__naim__:cli\"\n\n[tool.setuptools]\ninclude-package-data = true\n\n[tool.setuptools.dynamic]\nversion = {attr = \"cachier._version.__version__\"}\n\n[tool.setuptools.packages.find]\nwhere = [\"src\"]  # list of folders that contain the packages ([\".\"] by default)\ninclude = [\"cachier*\"]  # package names should match these glob patterns ([\"*\"] by default)\nnamespaces = false  # to disable scanning PEP 420 namespaces (true by default)\n\n[build-system]\nrequires = [\n  \"setuptools\",\n  \"wheel\",\n]\n\n[tool.pytest.ini_options]\ntestpaths = [\"cachier\", \"tests\"]\nnorecursedirs=[\"dist\", \"build\"]\naddopts = [\n    \"--color=yes\",\n    \"--cov=cachier\",\n    \"--cov-report=term\",\n    \"--cov-report=xml:cov.xml\",\n    \"-r a\",\n    \"-v\",\n    \"-s\",\n]\nmarkers = [\n    \"mongo: test the MongoDB core\",\n    \"memory: test the memory core\",\n    \"pickle: test the pickle core\"\n]\n\n[tool.docformatter]\nrecursive = true\n# some docstring start with r\"\"\"\nwrap-summaries = 79\nwrap-descriptions = 79\nblank = true\n\n[tool.ruff]\ntarget-version = \"py38\"\nline-length = 79\n# Enable Pyflakes `E` and `F` codes by default.\nlint.select = [\n    \"E\",\n    \"W\", # see: https://pypi.org/project/pycodestyle\n    \"F\", # see: https://pypi.org/project/pyflakes\n    \"I\", #see: https://pypi.org/project/isort/\n    \"D\", # see: https://pypi.org/project/pydocstyle\n    #\"N\", # see: https://pypi.org/project/pep8-naming\n    \"S\", # see: https://pypi.org/project/flake8-bandit\n    \"SIM\",\n    \"RUF100\" # alternative to yesqa\n]\nlint.extend-select = [\n    \"A\",    # see: https://pypi.org/project/flake8-builtins\n    \"B\",    # see: https://pypi.org/project/flake8-bugbear\n    \"C4\",   # see: https://pypi.org/project/flake8-comprehensions\n]\nlint.ignore = [\n    \"E203\",\n    \"C901\",\n]\n# Exclude a variety of commonly ignored directories.\nexclude = [\n    \".eggs\",\n    \".git\",\n    \".ruff_cache\",\n    \"__pypackages__\",\n    \"_build\",\n    \"build\",\n    \"dist\",\n]\nlint.ignore-init-module-imports = true\nlint.unfixable = [\"F401\"]\n\n[tool.ruff.lint.per-file-ignores]\n\"src/**/__init__.py\" = [\"D104\"]\n\"src/cachier/config.py\" = [\"D100\"]\n\"tests/**\" = [\n  \"S101\", \"S311\", \"S105\", \"S603\",\n  \"D100\", \"D101\", \"D103\", \"D104\", \"D401\"\n]\n\n#[tool.ruff.pydocstyle]\n## Use Google-style docstrings.\n#convention = \"google\"\n\n#[tool.ruff.pycodestyle]\n#ignore-overlong-task-comments = true\n\n[tool.ruff.lint.mccabe]\n# Unlike Flake8, default to a complexity level of 10.\nmax-complexity = 10\n\n[tool.coverage.run]\nbranch = true\ndynamic_context = \"test_function\"\nomit = [\n    \"tests/*\",\n    \"cachier/_version.py\",\n    \"cachier/__init__.py\",\n    \"**/scripts/**\",\n]\n[tool.coverage.report]\nshow_missing = true\n# Regexes for lines to exclude from consideration\nexclude_lines = [\n    \"pragma: no cover\", # Have to re-enable the standard pragma\n    \"raise NotImplementedError\", # Don't complain if tests don't hit defensive assertion code:\n    \"if TYPE_CHECKING:\", # Is only true when running mypy, not tests\n]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/__init__.py",
      "content": "from ._version import *  # noqa: F403\nfrom .config import (\n    disable_caching,\n    enable_caching,\n    get_default_params,\n    set_default_params,\n)\nfrom .core import cachier\n\n__all__ = [\n    \"cachier\",\n    \"set_default_params\",\n    \"get_default_params\",\n    \"enable_caching\",\n    \"disable_caching\",\n]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/config.py",
      "content": "import datetime\nimport hashlib\nimport os\nimport pickle\nfrom typing import Optional, TypedDict, Union\n\nfrom ._types import Backend, HashFunc, Mongetter\n\n\ndef _default_hash_func(args, kwds):\n    # Sort the kwargs to ensure consistent ordering\n    sorted_kwargs = sorted(kwds.items())\n    # Serialize args and sorted_kwargs using pickle or similar\n    serialized = pickle.dumps((args, sorted_kwargs))\n    # Create a hash of the serialized data\n    return hashlib.sha256(serialized).hexdigest()\n\n\nclass Params(TypedDict):\n    \"\"\"Type definition for cachier parameters.\"\"\"\n\n    caching_enabled: bool\n    hash_func: HashFunc\n    backend: Backend\n    mongetter: Optional[Mongetter]\n    stale_after: datetime.timedelta\n    next_time: bool\n    cache_dir: Union[str, os.PathLike]\n    pickle_reload: bool\n    separate_files: bool\n    wait_for_calc_timeout: int\n    allow_none: bool\n\n\n_default_params: Params = {\n    \"caching_enabled\": True,\n    \"hash_func\": _default_hash_func,\n    \"backend\": \"pickle\",\n    \"mongetter\": None,\n    \"stale_after\": datetime.timedelta.max,\n    \"next_time\": False,\n    \"cache_dir\": \"~/.cachier/\",\n    \"pickle_reload\": True,\n    \"separate_files\": False,\n    \"wait_for_calc_timeout\": 0,\n    \"allow_none\": False,\n}\n\n\ndef _update_with_defaults(\n    param, name: str, func_kwargs: Optional[dict] = None\n):\n    import cachier\n\n    if func_kwargs:\n        kw_name = f\"cachier__{name}\"\n        if kw_name in func_kwargs:\n            return func_kwargs.pop(kw_name)\n    if param is None:\n        return cachier.config._default_params[name]\n    return param\n\n\ndef set_default_params(**params):\n    \"\"\"Configure global parameters applicable to all memoized functions.\n\n    This function takes the same keyword parameters as the ones defined in the\n    decorator, which can be passed all at once or with multiple calls.\n    Parameters given directly to a decorator take precedence over any values\n    set by this function.\n\n    Only 'stale_after', 'next_time', and 'wait_for_calc_timeout' can be changed\n    after the memoization decorator has been applied. Other parameters will\n    only have an effect on decorators applied after this function is run.\n\n    \"\"\"\n    import cachier\n\n    valid_params = (\n        p for p in params.items() if p[0] in cachier.config._default_params\n    )\n    _default_params.update(valid_params)\n\n\ndef get_default_params():\n    \"\"\"Get current set of default parameters.\"\"\"\n    import cachier\n\n    return cachier.config._default_params\n\n\ndef enable_caching():\n    \"\"\"Enable caching globally.\"\"\"\n    import cachier\n\n    cachier.config._default_params[\"caching_enabled\"] = True\n\n\ndef disable_caching():\n    \"\"\"Disable caching globally.\"\"\"\n    import cachier\n\n    cachier.config._default_params[\"caching_enabled\"] = False\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/_version.py",
      "content": "# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2024, Jirka Borovec <***@gmail.com>\n\nimport os\n\n_PATH_HERE = os.path.dirname(__file__)\n_PATH_VERSION = os.path.join(_PATH_HERE, \"version.info\")\n_RELEASING_PROCESS = os.getenv(\"RELEASING_PROCESS\", \"0\") == \"1\"\n\nwith open(_PATH_VERSION) as fopen:\n    __version__ = fopen.read().strip()\n\n\ndef _get_git_sha() -> str:\n    from subprocess import DEVNULL, check_output\n\n    out = check_output([\"git\", \"rev-parse\", \"--short\", \"HEAD\"], stderr=DEVNULL)  # noqa: S603, S607\n    return out.decode(\"utf-8\").strip()\n\n\nif not _RELEASING_PROCESS:\n    try:\n        sha_short = _get_git_sha()\n        # print(f\"Version enriched with git commit hash: {__version__}.\")\n    except Exception:\n        # print(\"Failed to get the git commit hash,\"\n        #       f\" falling back to base version {__version__}.\")\n        sha_short = \"\"\n    __version__ += f\".dev+{sha_short}\" if sha_short else \".dev\"\n\n\n__all__ = [\"__version__\"]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/version.info",
      "content": "3.0.0\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/py.typed",
      "content": ""
    },
    {
      "path": "python-cachier_cachier/src/cachier/_types.py",
      "content": "from typing import TYPE_CHECKING, Callable, Literal\n\nif TYPE_CHECKING:\n    import pymongo.collection\n\n\nHashFunc = Callable[..., str]\nMongetter = Callable[[], \"pymongo.collection.Collection\"]\nBackend = Literal[\"pickle\", \"mongo\", \"memory\"]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/__main__.py",
      "content": "\"\"\"A command-line interface for cachier.\"\"\"\n\nimport click\n\nfrom cachier.core import _set_max_workers\n\n\n@click.group()\ndef cli():\n    \"\"\"A command-line interface for cachier.\"\"\"  # noqa: D401\n\n\n@cli.command(\"Limits the number of worker threads used by cachier.\")\n@click.argument(\"max_workers\", nargs=1, type=int)\ndef set_max_workers(max_workers):\n    \"\"\"Limits the number of worker threads used by cachier.\"\"\"\n    _set_max_workers(max_workers)\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/core.py",
      "content": "\"\"\"Persistent, stale-free memoization decorators for Python.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\nimport datetime\nimport inspect\nimport os\nimport warnings\nfrom collections import OrderedDict\nfrom concurrent.futures import ThreadPoolExecutor\nfrom functools import wraps\nfrom typing import Optional, Union\nfrom warnings import warn\n\nfrom .config import (\n    Backend,\n    HashFunc,\n    Mongetter,\n    _default_params,\n    _update_with_defaults,\n)\nfrom .cores.base import RecalculationNeeded, _BaseCore\nfrom .cores.memory import _MemoryCore\nfrom .cores.mongo import _MongoCore\nfrom .cores.pickle import _PickleCore\n\nMAX_WORKERS_ENVAR_NAME = \"CACHIER_MAX_WORKERS\"\nDEFAULT_MAX_WORKERS = 8\n\n\ndef _max_workers():\n    return int(os.environ.get(MAX_WORKERS_ENVAR_NAME, DEFAULT_MAX_WORKERS))\n\n\ndef _set_max_workers(max_workers):\n    os.environ[MAX_WORKERS_ENVAR_NAME] = str(max_workers)\n    _get_executor(True)\n\n\ndef _get_executor(reset=False):\n    if reset or not hasattr(_get_executor, \"executor\"):\n        _get_executor.executor = ThreadPoolExecutor(_max_workers())\n    return _get_executor.executor\n\n\ndef _function_thread(core, key, func, args, kwds):\n    try:\n        func_res = func(*args, **kwds)\n        core.set_entry(key, func_res)\n    except BaseException as exc:\n        print(f\"Function call failed with the following exception:\\n{exc}\")\n\n\ndef _calc_entry(core, key, func, args, kwds):\n    try:\n        core.mark_entry_being_calculated(key)\n        # _get_executor().submit(core.mark_entry_being_calculated, key)\n        func_res = func(*args, **kwds)\n        core.set_entry(key, func_res)\n        # _get_executor().submit(core.set_entry, key, func_res)\n        return func_res\n    finally:\n        core.mark_entry_not_calculated(key)\n\n\ndef _convert_args_kwargs(\n    func, _is_method: bool, args: tuple, kwds: dict\n) -> dict:\n    \"\"\"Convert mix of positional and keyword arguments to aggregated kwargs.\"\"\"\n    # unwrap if the function is functools.partial\n    if hasattr(func, \"func\"):\n        args = func.args + args\n        kwds.update({k: v for k, v in func.keywords.items() if k not in kwds})\n        func = func.func\n    func_params = list(inspect.signature(func).parameters)\n    args_as_kw = dict(\n        zip(func_params[1:], args[1:])\n        if _is_method\n        else zip(func_params, args)\n    )\n    # init with default values\n    kwargs = {\n        k: v.default\n        for k, v in inspect.signature(func).parameters.items()\n        if v.default is not inspect.Parameter.empty\n    }\n    # merge args expanded as kwargs and the original kwds\n    kwargs.update(dict(**args_as_kw, **kwds))\n    return OrderedDict(sorted(kwargs.items()))\n\n\ndef _pop_kwds_with_deprecation(kwds, name: str, default_value: bool):\n    if name in kwds:\n        warnings.warn(\n            f\"`{name}` is deprecated and will be removed in a future release,\"\n            \" use `cachier__` alternative instead.\",\n            DeprecationWarning,\n            stacklevel=2,\n        )\n    return kwds.pop(name, default_value)\n\n\ndef cachier(\n    hash_func: Optional[HashFunc] = None,\n    hash_params: Optional[HashFunc] = None,\n    backend: Optional[Backend] = None,\n    mongetter: Optional[Mongetter] = None,\n    stale_after: Optional[datetime.timedelta] = None,\n    next_time: Optional[bool] = None,\n    cache_dir: Optional[Union[str, os.PathLike]] = None,\n    pickle_reload: Optional[bool] = None,\n    separate_files: Optional[bool] = None,\n    wait_for_calc_timeout: Optional[int] = None,\n    allow_none: Optional[bool] = None,\n):\n    \"\"\"Wrap as a persistent, stale-free memoization decorator.\n\n    The positional and keyword arguments to the wrapped function must be\n    hashable (i.e. Python's immutable built-in objects, not mutable\n    containers). Also, notice that since objects which are instances of\n    user-defined classes are hashable but all compare unequal (their hash\n    value is their id), equal objects across different sessions will not yield\n    identical keys.\n\n    Arguments:\n    ---------\n    hash_func : callable, optional\n        A callable that gets the args and kwargs from the decorated function\n        and returns a hash key for them. This parameter can be used to enable\n        the use of cachier with functions that get arguments that are not\n        automatically hashable by Python.\n    hash_params : callable, optional\n    backend : str, optional\n        The name of the backend to use. Valid options currently include\n        'pickle', 'mongo' and 'memory'. If not provided, defaults to\n        'pickle' unless the 'mongetter' argument is passed, in which\n        case the mongo backend is automatically selected.\n    mongetter : callable, optional\n        A callable that takes no arguments and returns a pymongo.Collection\n        object with writing permissions. If unset a local pickle cache is used\n        instead.\n    stale_after : datetime.timedelta, optional\n        The time delta after which a cached result is considered stale. Calls\n        made after the result goes stale will trigger a recalculation of the\n        result, but whether a stale or fresh result will be returned is\n        determined by the optional next_time argument.\n    next_time : bool, optional\n        If set to True, a stale result will be returned when finding one, not\n        waiting for the calculation of the fresh result to return. Defaults to\n        False.\n    cache_dir : str, optional\n        A fully qualified path to a file directory to be used for cache files.\n        The running process must have running permissions to this folder. If\n        not provided, a default directory at `~/.cachier/` is used.\n    pickle_reload : bool, optional\n        If set to True, in-memory cache will be reloaded on each cache read,\n        enabling different threads to share cache. Should be set to False for\n        faster reads in single-thread programs. Defaults to True.\n    separate_files: bool, default False, for Pickle cores only\n        Instead of a single cache file per-function, each function's cache is\n        split between several files, one for each argument set. This can help\n        if you per-function cache files become too large.\n    wait_for_calc_timeout: int, optional, for MongoDB only\n        The maximum time to wait for an ongoing calculation. When a\n        process started to calculate the value setting being_calculated to\n        True, any process trying to read the same entry will wait a maximum of\n        seconds specified in this parameter. 0 means wait forever.\n        Once the timeout expires the calculation will be triggered.\n    allow_none: bool, optional\n        Allows storing None values in the cache. If False, functions returning\n        None will not be cached and are recalculated every call.\n\n    \"\"\"\n    # Check for deprecated parameters\n    if hash_params is not None:\n        message = (\n            \"hash_params will be removed in a future release, \"\n            \"please use hash_func instead\"\n        )\n        warn(message, DeprecationWarning, stacklevel=2)\n        hash_func = hash_params\n    # Update parameters with defaults if input is None\n    backend = _update_with_defaults(backend, \"backend\")\n    mongetter = _update_with_defaults(mongetter, \"mongetter\")\n    # Override the backend parameter if a mongetter is provided.\n    if callable(mongetter):\n        backend = \"mongo\"\n    core: _BaseCore\n    if backend == \"pickle\":\n        core = _PickleCore(\n            hash_func=hash_func,\n            pickle_reload=pickle_reload,\n            cache_dir=cache_dir,\n            separate_files=separate_files,\n            wait_for_calc_timeout=wait_for_calc_timeout,\n        )\n    elif backend == \"mongo\":\n        core = _MongoCore(\n            hash_func=hash_func,\n            mongetter=mongetter,\n            wait_for_calc_timeout=wait_for_calc_timeout,\n        )\n    elif backend == \"memory\":\n        core = _MemoryCore(\n            hash_func=hash_func, wait_for_calc_timeout=wait_for_calc_timeout\n        )\n    else:\n        raise ValueError(\"specified an invalid core: %s\" % backend)\n\n    def _cachier_decorator(func):\n        core.set_func(func)\n\n        @wraps(func)\n        def func_wrapper(*args, **kwds):\n            nonlocal allow_none\n            _allow_none = _update_with_defaults(allow_none, \"allow_none\", kwds)\n            # print('Inside general wrapper for {}.'.format(func.__name__))\n            ignore_cache = _pop_kwds_with_deprecation(\n                kwds, \"ignore_cache\", False\n            )\n            overwrite_cache = _pop_kwds_with_deprecation(\n                kwds, \"overwrite_cache\", False\n            )\n            verbose = _pop_kwds_with_deprecation(kwds, \"verbose_cache\", False)\n            ignore_cache = kwds.pop(\"cachier__skip_cache\", ignore_cache)\n            overwrite_cache = kwds.pop(\n                \"cachier__overwrite_cache\", overwrite_cache\n            )\n            verbose = kwds.pop(\"cachier__verbose\", verbose)\n            _stale_after = _update_with_defaults(\n                stale_after, \"stale_after\", kwds\n            )\n            _next_time = _update_with_defaults(next_time, \"next_time\", kwds)\n            # merge args expanded as kwargs and the original kwds\n            kwargs = _convert_args_kwargs(\n                func, _is_method=core.func_is_method, args=args, kwds=kwds\n            )\n\n            _print = lambda x: None  # noqa: E731\n            if verbose:\n                _print = print\n            if ignore_cache or not _default_params[\"caching_enabled\"]:\n                return func(**kwargs)\n            key, entry = core.get_entry((), kwargs)\n            if overwrite_cache:\n                return _calc_entry(core, key, func, args, kwds)\n            if entry is None:\n                _print(\"No entry found. No current calc. Calling like a boss.\")\n                return _calc_entry(core, key, func, args, kwds)\n            _print(\"Entry found.\")\n            if _allow_none or entry.get(\"value\", None) is not None:\n                _print(\"Cached result found.\")\n                now = datetime.datetime.now()\n                if now - entry[\"time\"] <= _stale_after:\n                    _print(\"And it is fresh!\")\n                    return entry[\"value\"]\n                _print(\"But it is stale... :(\")\n                if entry[\"being_calculated\"]:\n                    if _next_time:\n                        _print(\"Returning stale.\")\n                        return entry[\"value\"]  # return stale val\n                    _print(\"Already calc. Waiting on change.\")\n                    try:\n                        return core.wait_on_entry_calc(key)\n                    except RecalculationNeeded:\n                        return _calc_entry(core, key, func, args, kwds)\n                if _next_time:\n                    _print(\"Async calc and return stale\")\n                    try:\n                        core.mark_entry_being_calculated(key)\n                        _get_executor().submit(\n                            _function_thread, core, key, func, args, kwds\n                        )\n                    finally:\n                        core.mark_entry_not_calculated(key)\n                    return entry[\"value\"]\n                _print(\"Calling decorated function and waiting\")\n                return _calc_entry(core, key, func, args, kwds)\n            if entry[\"being_calculated\"]:\n                _print(\"No value but being calculated. Waiting.\")\n                try:\n                    return core.wait_on_entry_calc(key)\n                except RecalculationNeeded:\n                    return _calc_entry(core, key, func, args, kwds)\n            _print(\"No entry found. No current calc. Calling like a boss.\")\n            return _calc_entry(core, key, func, args, kwds)\n\n        def _clear_cache():\n            \"\"\"Clear the cache.\"\"\"\n            core.clear_cache()\n\n        def _clear_being_calculated():\n            \"\"\"Mark all entries in this cache as not being calculated.\"\"\"\n            core.clear_being_calculated()\n\n        def _cache_dpath():\n            \"\"\"Return the path to the cache dir, if exists; None if not.\"\"\"\n            return getattr(core, \"cache_dir\", None)\n\n        def _precache_value(*args, value_to_cache, **kwds):  # noqa: D417\n            \"\"\"Add an initial value to the cache.\n\n            Arguments:\n            ---------\n            value_to_cache : any\n                entry to be written into the cache\n\n            \"\"\"\n            # merge args expanded as kwargs and the original kwds\n            kwargs = _convert_args_kwargs(\n                func, _is_method=core.func_is_method, args=args, kwds=kwds\n            )\n            return core.precache_value((), kwargs, value_to_cache)\n\n        func_wrapper.clear_cache = _clear_cache\n        func_wrapper.clear_being_calculated = _clear_being_calculated\n        func_wrapper.cache_dpath = _cache_dpath\n        func_wrapper.precache_value = _precache_value\n        return func_wrapper\n\n    return _cachier_decorator\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/__init__.py",
      "content": ""
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/base.py",
      "content": "\"\"\"Defines the interface of a cachier caching core.\"\"\"\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\nimport abc  # for the _BaseCore abstract base class\nimport inspect\nimport threading\nfrom typing import Callable\n\nfrom .._types import HashFunc\nfrom ..config import _update_with_defaults\n\n\nclass RecalculationNeeded(Exception):\n    \"\"\"Exception raised when a recalculation is needed.\"\"\"\n\n    pass\n\n\ndef _get_func_str(func: Callable) -> str:\n    return f\".{func.__module__}.{func.__name__}\"\n\n\nclass _BaseCore:\n    __metaclass__ = abc.ABCMeta\n\n    def __init__(self, hash_func: HashFunc, wait_for_calc_timeout: int):\n        self.hash_func = _update_with_defaults(hash_func, \"hash_func\")\n        self.wait_for_calc_timeout = wait_for_calc_timeout\n        self.lock = threading.RLock()\n\n    def set_func(self, func):\n        \"\"\"Set the function this core will use.\n\n        This has to be set before any method is called. Also determine if the\n        function is an object method.\n\n        \"\"\"\n        # unwrap if the function is functools.partial\n        if hasattr(func, \"func\"):\n            func = func.func\n        func_params = list(inspect.signature(func).parameters)\n        self.func_is_method = func_params and func_params[0] == \"self\"\n        self.func = func\n\n    def get_key(self, args, kwds):\n        \"\"\"Return a unique key based on the arguments provided.\"\"\"\n        return self.hash_func(args, kwds)\n\n    def get_entry(self, args, kwds):\n        \"\"\"Get entry based on given arguments.\n\n        Return the result mapped to the given arguments in this core's cache,\n        if such a mapping exists.\n\n        \"\"\"\n        key = self.get_key(args, kwds)\n        return self.get_entry_by_key(key)\n\n    def precache_value(self, args, kwds, value_to_cache):\n        \"\"\"Write a precomputed value into the cache.\"\"\"\n        key = self.get_key(args, kwds)\n        self.set_entry(key, value_to_cache)\n        return value_to_cache\n\n    def check_calc_timeout(self, time_spent):\n        \"\"\"Raise an exception if a recalculation is needed.\"\"\"\n        calc_timeout = _update_with_defaults(\n            self.wait_for_calc_timeout, \"wait_for_calc_timeout\"\n        )\n        if calc_timeout > 0 and (time_spent >= calc_timeout):\n            raise RecalculationNeeded()\n\n    @abc.abstractmethod\n    def get_entry_by_key(self, key):\n        \"\"\"Get entry based on given key.\n\n        Return the result mapped to the given key in this core's cache, if such\n        a mapping exists.\n\n        \"\"\"\n\n    @abc.abstractmethod\n    def set_entry(self, key, func_res):\n        \"\"\"Map the given result to the given key in this core's cache.\"\"\"\n\n    @abc.abstractmethod\n    def mark_entry_being_calculated(self, key):\n        \"\"\"Mark the entry mapped by the given key as being calculated.\"\"\"\n\n    @abc.abstractmethod\n    def mark_entry_not_calculated(self, key):\n        \"\"\"Mark the entry mapped by the given key as not being calculated.\"\"\"\n\n    @abc.abstractmethod\n    def wait_on_entry_calc(self, key):\n        \"\"\"Wait on the entry with keys being calculated and returns result.\"\"\"\n\n    @abc.abstractmethod\n    def clear_cache(self):\n        \"\"\"Clear the cache of this core.\"\"\"\n\n    @abc.abstractmethod\n    def clear_being_calculated(self):\n        \"\"\"Mark all entries in this cache as not being calculated.\"\"\"\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/pickle.py",
      "content": "\"\"\"A pickle-based caching core for cachier.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\nimport os\nimport pickle  # for local caching\nfrom contextlib import suppress\nfrom datetime import datetime\n\nimport portalocker  # to lock on pickle cache IO\nfrom watchdog.events import PatternMatchingEventHandler\nfrom watchdog.observers import Observer\n\nfrom .._types import HashFunc\nfrom ..config import _update_with_defaults\n\n# Alternative:  https://github.com/WoLpH/portalocker\nfrom .base import _BaseCore\n\n\nclass _PickleCore(_BaseCore):\n    \"\"\"The pickle core class for cachier.\"\"\"\n\n    class CacheChangeHandler(PatternMatchingEventHandler):\n        \"\"\"Handles cache-file modification events.\"\"\"\n\n        def __init__(self, filename, core, key):\n            PatternMatchingEventHandler.__init__(\n                self,\n                patterns=[\"*\" + filename],\n                ignore_patterns=None,\n                ignore_directories=True,\n                case_sensitive=False,\n            )\n            self.core = core\n            self.key = key\n            self.observer = None\n            self.value = None\n\n        def inject_observer(self, observer):\n            \"\"\"Inject the observer running this handler.\"\"\"\n            self.observer = observer\n\n        def _check_calculation(self):\n            # print('checking calc')\n            entry = self.core.get_entry_by_key(self.key, True)[1]\n            # print(self.key)\n            # print(entry)\n            try:\n                if not entry[\"being_calculated\"]:\n                    # print('stopping observer!')\n                    self.value = entry[\"value\"]\n                    self.observer.stop()\n                # else:\n                # print('NOT stopping observer... :(')\n            except TypeError:\n                self.value = None\n                self.observer.stop()\n\n        def on_created(self, event):\n            \"\"\"A Watchdog Event Handler method.\"\"\"  # noqa: D401\n            self._check_calculation()  # pragma: no cover\n\n        def on_modified(self, event):\n            \"\"\"A Watchdog Event Handler method.\"\"\"  # noqa: D401\n            self._check_calculation()\n\n    def __init__(\n        self,\n        hash_func: HashFunc,\n        pickle_reload: bool,\n        cache_dir: str,\n        separate_files: bool,\n        wait_for_calc_timeout: int,\n    ):\n        super().__init__(hash_func, wait_for_calc_timeout)\n        self.cache = None\n        self.reload = _update_with_defaults(pickle_reload, \"pickle_reload\")\n        self.cache_dir = os.path.expanduser(\n            _update_with_defaults(cache_dir, \"cache_dir\")\n        )\n        self.separate_files = _update_with_defaults(\n            separate_files, \"separate_files\"\n        )\n\n    @property\n    def cache_fname(self) -> str:\n        fname = f\".{self.func.__module__}.{self.func.__qualname__}\"\n        return fname.replace(\"<\", \"_\").replace(\">\", \"_\")\n\n    @property\n    def cache_fpath(self) -> str:\n        os.makedirs(self.cache_dir, exist_ok=True)\n        return os.path.abspath(\n            os.path.join(os.path.realpath(self.cache_dir), self.cache_fname)\n        )\n\n    def _reload_cache(self):\n        with self.lock:\n            try:\n                with portalocker.Lock(\n                    self.cache_fpath, mode=\"rb\"\n                ) as cache_file:\n                    self.cache = pickle.load(cache_file)  # noqa: S301\n            except (FileNotFoundError, EOFError):\n                self.cache = {}\n\n    def _get_cache(self):\n        with self.lock:\n            if not self.cache:\n                self._reload_cache()\n            return self.cache\n\n    def _get_cache_by_key(self, key=None, hash_str=None):\n        fpath = self.cache_fpath\n        fpath += f\"_{hash_str or key}\"\n        try:\n            with portalocker.Lock(fpath, mode=\"rb\") as cache_file:\n                return pickle.load(cache_file)  # noqa: S301\n        except (FileNotFoundError, EOFError):\n            return None\n\n    def _clear_all_cache_files(self):\n        path, name = os.path.split(self.cache_fpath)\n        for subpath in os.listdir(path):\n            if subpath.startswith(f\"{name}_\"):\n                os.remove(os.path.join(path, subpath))\n\n    def _clear_being_calculated_all_cache_files(self):\n        path, name = os.path.split(self.cache_fpath)\n        for subpath in os.listdir(path):\n            if subpath.startswith(name):\n                entry = self._get_cache_by_key(hash_str=subpath.split(\"_\")[-1])\n                if entry is not None:\n                    entry[\"being_calculated\"] = False\n                    self._save_cache(entry, hash_str=subpath.split(\"_\")[-1])\n\n    def _save_cache(self, cache, key=None, hash_str=None):\n        fpath = self.cache_fpath\n        if key is not None:\n            fpath += f\"_{key}\"\n        elif hash_str is not None:\n            fpath += f\"_{hash_str}\"\n        with self.lock:\n            self.cache = cache\n            with portalocker.Lock(fpath, mode=\"wb\") as cache_file:\n                pickle.dump(cache, cache_file, protocol=4)\n            if key is None:\n                self._reload_cache()\n\n    def get_entry_by_key(self, key, reload=False):\n        with self.lock:\n            if self.separate_files:\n                return key, self._get_cache_by_key(key)\n            if self.reload or reload:\n                self._reload_cache()\n            return key, self._get_cache().get(key, None)\n\n    def set_entry(self, key, func_res):\n        key_data = {\n            \"value\": func_res,\n            \"time\": datetime.now(),\n            \"stale\": False,\n            \"being_calculated\": False,\n        }\n        if self.separate_files:\n            self._save_cache(key_data, key)\n            return  # pragma: no cover\n\n        with self.lock:\n            cache = self._get_cache()\n            cache[key] = key_data\n            self._save_cache(cache)\n\n    def mark_entry_being_calculated_separate_files(self, key):\n        self._save_cache(\n            {\n                \"value\": None,\n                \"time\": datetime.now(),\n                \"stale\": False,\n                \"being_calculated\": True,\n            },\n            key=key,\n        )\n\n    def mark_entry_not_calculated_separate_files(self, key):\n        _, entry = self.get_entry_by_key(key)\n        entry[\"being_calculated\"] = False\n        self._save_cache(entry, key=key)\n\n    def mark_entry_being_calculated(self, key):\n        if self.separate_files:\n            self.mark_entry_being_calculated_separate_files(key)\n            return  # pragma: no cover\n\n        with self.lock:\n            cache = self._get_cache()\n            try:\n                cache[key][\"being_calculated\"] = True\n            except KeyError:\n                cache[key] = {\n                    \"value\": None,\n                    \"time\": datetime.now(),\n                    \"stale\": False,\n                    \"being_calculated\": True,\n                }\n            self._save_cache(cache)\n\n    def mark_entry_not_calculated(self, key):\n        if self.separate_files:\n            self.mark_entry_not_calculated_separate_files(key)\n        with self.lock:\n            cache = self._get_cache()\n            # that's ok, we don't need an entry in that case\n            with suppress(KeyError):\n                cache[key][\"being_calculated\"] = False\n                self._save_cache(cache)\n\n    def wait_on_entry_calc(self, key):\n        if self.separate_files:\n            entry = self._get_cache_by_key(key)\n            filename = f\"{self.cache_fname}_{key}\"\n        else:\n            with self.lock:\n                self._reload_cache()\n                entry = self._get_cache()[key]\n            filename = self.cache_fname\n        if not entry[\"being_calculated\"]:\n            return entry[\"value\"]\n        event_handler = _PickleCore.CacheChangeHandler(\n            filename=filename, core=self, key=key\n        )\n        observer = Observer()\n        event_handler.inject_observer(observer)\n        observer.schedule(event_handler, path=self.cache_dir, recursive=True)\n        observer.start()\n        time_spent = 0\n        while observer.is_alive():\n            observer.join(timeout=1.0)\n            time_spent += 1\n            self.check_calc_timeout(time_spent)\n        return event_handler.value\n\n    def clear_cache(self):\n        if self.separate_files:\n            self._clear_all_cache_files()\n        else:\n            self._save_cache({})\n\n    def clear_being_calculated(self):\n        if self.separate_files:\n            self._clear_being_calculated_all_cache_files()\n            return  # pragma: no cover\n\n        with self.lock:\n            cache = self._get_cache()\n            for key in cache:\n                cache[key][\"being_calculated\"] = False\n            self._save_cache(cache)\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/memory.py",
      "content": "\"\"\"A memory-based caching core for cachier.\"\"\"\n\nimport threading\nfrom datetime import datetime\n\nfrom .._types import HashFunc\nfrom .base import _BaseCore, _get_func_str\n\n\nclass _MemoryCore(_BaseCore):\n    \"\"\"The memory core class for cachier.\"\"\"\n\n    def __init__(self, hash_func: HashFunc, wait_for_calc_timeout: int):\n        super().__init__(hash_func, wait_for_calc_timeout)\n        self.cache = {}\n\n    def _hash_func_key(self, key):\n        return f\"{_get_func_str(self.func)}:{key}\"\n\n    def get_entry_by_key(self, key, reload=False):\n        with self.lock:\n            return key, self.cache.get(self._hash_func_key(key), None)\n\n    def set_entry(self, key, func_res):\n        with self.lock:\n            try:\n                # we need to retain the existing condition so that\n                # mark_entry_not_calculated can notify all possibly-waiting\n                # threads about it\n                cond = self.cache[self._hash_func_key(key)][\"condition\"]\n            except KeyError:  # pragma: no cover\n                cond = None\n            self.cache[self._hash_func_key(key)] = {\n                \"value\": func_res,\n                \"time\": datetime.now(),\n                \"stale\": False,\n                \"being_calculated\": False,\n                \"condition\": cond,\n            }\n\n    def mark_entry_being_calculated(self, key):\n        with self.lock:\n            condition = threading.Condition()\n            # condition.acquire()\n            try:\n                self.cache[self._hash_func_key(key)][\"being_calculated\"] = True\n                self.cache[self._hash_func_key(key)][\"condition\"] = condition\n            except KeyError:\n                self.cache[self._hash_func_key(key)] = {\n                    \"value\": None,\n                    \"time\": datetime.now(),\n                    \"stale\": False,\n                    \"being_calculated\": True,\n                    \"condition\": condition,\n                }\n\n    def mark_entry_not_calculated(self, key):\n        with self.lock:\n            try:\n                entry = self.cache[self._hash_func_key(key)]\n            except KeyError:  # pragma: no cover\n                return  # that's ok, we don't need an entry in that case\n            entry[\"being_calculated\"] = False\n            cond = entry[\"condition\"]\n            if cond:\n                cond.acquire()\n                cond.notify_all()\n                cond.release()\n                entry[\"condition\"] = None\n\n    def wait_on_entry_calc(self, key):\n        with self.lock:  # pragma: no cover\n            entry = self.cache[self._hash_func_key(key)]\n            if not entry[\"being_calculated\"]:\n                return entry[\"value\"]\n        entry[\"condition\"].acquire()\n        entry[\"condition\"].wait()\n        entry[\"condition\"].release()\n        return self.cache[self._hash_func_key(key)][\"value\"]\n\n    def clear_cache(self):\n        with self.lock:\n            self.cache.clear()\n\n    def clear_being_calculated(self):\n        with self.lock:\n            for entry in self.cache.values():\n                entry[\"being_calculated\"] = False\n                entry[\"condition\"] = None\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/mongo.py",
      "content": "\"\"\"A MongoDB-based caching core for cachier.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\nimport pickle  # for serialization of python objects\nimport sys  # to make sure that pymongo was imported\nimport time  # to sleep when waiting on Mongo cache\\\nimport warnings  # to warn if pymongo is missing\nfrom contextlib import suppress\nfrom datetime import datetime\n\nfrom .._types import HashFunc, Mongetter\n\nwith suppress(ImportError):\n    from bson.binary import Binary  # to save binary data to mongodb\n    from pymongo import ASCENDING, IndexModel\n    from pymongo.errors import OperationFailure\n\nfrom .base import RecalculationNeeded, _BaseCore, _get_func_str\n\nMONGO_SLEEP_DURATION_IN_SEC = 1\n\n\nclass MissingMongetter(ValueError):\n    \"\"\"Thrown when the mongetter keyword argument is missing.\"\"\"\n\n\nclass _MongoCore(_BaseCore):\n    _INDEX_NAME = \"func_1_key_1\"\n\n    def __init__(\n        self,\n        hash_func: HashFunc,\n        mongetter: Mongetter,\n        wait_for_calc_timeout: int,\n    ):\n        if \"pymongo\" not in sys.modules:\n            warnings.warn(\n                \"`pymongo` was not found. MongoDB cores will not function.\",\n                ImportWarning,\n                stacklevel=2,\n            )  # pragma: no cover\n\n        super().__init__(hash_func, wait_for_calc_timeout)\n        if mongetter is None:\n            raise MissingMongetter(\n                \"must specify ``mongetter`` when using the mongo core\"\n            )\n        self.mongetter = mongetter\n        self.mongo_collection = self.mongetter()\n        index_inf = self.mongo_collection.index_information()\n        if _MongoCore._INDEX_NAME not in index_inf:\n            func1key1 = IndexModel(\n                keys=[(\"func\", ASCENDING), (\"key\", ASCENDING)],\n                name=_MongoCore._INDEX_NAME,\n            )\n            self.mongo_collection.create_indexes([func1key1])\n\n    @property\n    def _func_str(self) -> str:\n        return _get_func_str(self.func)\n\n    def get_entry_by_key(self, key):\n        res = self.mongo_collection.find_one(\n            {\"func\": self._func_str, \"key\": key}\n        )\n        if not res:\n            return key, None\n        try:\n            entry = {\n                \"value\": pickle.loads(res[\"value\"]),  # noqa: S301\n                \"time\": res.get(\"time\", None),\n                \"stale\": res.get(\"stale\", False),\n                \"being_calculated\": res.get(\"being_calculated\", False),\n            }\n        except KeyError:\n            entry = {\n                \"value\": None,\n                \"time\": res.get(\"time\", None),\n                \"stale\": res.get(\"stale\", False),\n                \"being_calculated\": res.get(\"being_calculated\", False),\n            }\n        return key, entry\n\n    def set_entry(self, key, func_res):\n        thebytes = pickle.dumps(func_res)\n        self.mongo_collection.update_one(\n            filter={\"func\": self._func_str, \"key\": key},\n            update={\n                \"$set\": {\n                    \"func\": self._func_str,\n                    \"key\": key,\n                    \"value\": Binary(thebytes),\n                    \"time\": datetime.now(),\n                    \"stale\": False,\n                    \"being_calculated\": False,\n                }\n            },\n            upsert=True,\n        )\n\n    def mark_entry_being_calculated(self, key):\n        self.mongo_collection.update_one(\n            filter={\"func\": self._func_str, \"key\": key},\n            update={\"$set\": {\"being_calculated\": True}},\n            upsert=True,\n        )\n\n    def mark_entry_not_calculated(self, key):\n        with suppress(OperationFailure):  # don't care in this case\n            self.mongo_collection.update_one(\n                filter={\n                    \"func\": self._func_str,\n                    \"key\": key,\n                },\n                update={\"$set\": {\"being_calculated\": False}},\n                upsert=False,  # should not insert in this case\n            )\n\n    def wait_on_entry_calc(self, key):\n        time_spent = 0\n        while True:\n            time.sleep(MONGO_SLEEP_DURATION_IN_SEC)\n            time_spent += MONGO_SLEEP_DURATION_IN_SEC\n            key, entry = self.get_entry_by_key(key)\n            if entry is None:\n                raise RecalculationNeeded()\n            if not entry[\"being_calculated\"]:\n                return entry[\"value\"]\n            self.check_calc_timeout(time_spent)\n\n    def clear_cache(self):\n        self.mongo_collection.delete_many(filter={\"func\": self._func_str})\n\n    def clear_being_calculated(self):\n        self.mongo_collection.update_many(\n            filter={\n                \"func\": self._func_str,\n                \"being_calculated\": True,\n            },\n            update={\"$set\": {\"being_calculated\": False}},\n        )\n"
    },
    {
      "path": "python-cachier_cachier/tests/__init__.py",
      "content": ""
    },
    {
      "path": "python-cachier_cachier/tests/standalone_script.py",
      "content": "import time\n\nimport cachier\n\n\n@cachier.cachier()\ndef _takes_3_seconds(label, value):\n    time.sleep(3)\n    return f\"{label} {value}\"\n\n\nprint(_takes_3_seconds(\"two\", 2))\n"
    },
    {
      "path": "python-cachier_cachier/tests/speed_eval.py",
      "content": "from random import random\nfrom time import time\n\nfrom cachier import cachier\n\n\n@cachier(next_time=True)\ndef _test_int_pickling(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    return int_1 + int_2\n\n\ndef _test_int_pickling_compare(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    return int_1 + int_2\n\n\ndef test_pickle_speed():\n    \"\"\"Test speeds.\"\"\"\n    print(\"Comparing speeds of decorated vs non-decorated functions...\")\n    num_of_vals = 1000\n    times = []\n    for i in range(1, num_of_vals):\n        tic = time()\n        _test_int_pickling_compare(i, i + 1)\n        toc = time()\n        times.append(toc - tic)\n    print(f\"  - Non-decorated average = {sum(times) / num_of_vals}\")\n\n    _test_int_pickling.clear_cache()\n    times = []\n    for i in range(1, num_of_vals):\n        tic = time()\n        _test_int_pickling(i, i + 1)\n        toc = time()\n        times.append(toc - tic)\n    print(f\"  - Decorated average = {sum(times) / num_of_vals}\")\n\n\n@cachier()\ndef _test_single_file_speed(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    # something that takes some memory\n    return [random() for _ in range(1000000)]\n\n\n@cachier(separate_files=True)\ndef _test_separate_files_speed(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    # something that takes some memory\n    return [random() for _ in range(1000000)]\n\n\ndef test_separate_files_vs_single_file():\n    _test_separate_files_speed.clear_cache()\n    _test_single_file_speed.clear_cache()\n    start_time = time()\n    for _ in range(3):\n        for j in range(10):\n            _test_separate_files_speed(j, 2)\n    print(f\"separate files time: {time() - start_time}\")\n    start_time = time()\n    for _ in range(3):\n        for j in range(10):\n            _test_single_file_speed(j, 2)\n    print(f\"single file time: {time() - start_time}\")\n\n\nif __name__ == \"__main__\":\n    test_pickle_speed()\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_memory_core.py",
      "content": "\"\"\"Test for the in-memory implementation of the  Cachier python package.\"\"\"\n\nimport hashlib\nimport queue\nimport threading\nfrom datetime import timedelta\nfrom random import random\nfrom time import sleep, time\n\nimport pandas as pd\nimport pytest\nfrom cachier import cachier\n\n\n@cachier(backend=\"memory\", next_time=False)\ndef _takes_2_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)\n    return f\"arg_1:{arg_1}, arg_2:{arg_2}\"\n\n\n@pytest.mark.memory\ndef test_memory_core():\n    \"\"\"Basic memory core functionality.\"\"\"\n    _takes_2_seconds.clear_cache()\n    _takes_2_seconds(\"a\", \"b\")\n    start = time()\n    _takes_2_seconds(\"a\", \"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds.clear_cache()\n\n\n@pytest.mark.memory\ndef test_memory_core_keywords():\n    \"\"\"Basic memory core functionality with keyword arguments.\"\"\"\n    _takes_2_seconds.clear_cache()\n    _takes_2_seconds(\"a\", arg_2=\"b\")\n    start = time()\n    _takes_2_seconds(\"a\", arg_2=\"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds.clear_cache()\n\n\nSECONDS_IN_DELTA = 3\nDELTA = timedelta(seconds=SECONDS_IN_DELTA)\n\n\n@cachier(backend=\"memory\", stale_after=DELTA, next_time=False)\ndef _stale_after_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.memory\ndef test_stale_after():\n    \"\"\"Testing the stale_after functionality.\"\"\"\n    _stale_after_seconds.clear_cache()\n    val1 = _stale_after_seconds(1, 2)\n    val2 = _stale_after_seconds(1, 2)\n    val3 = _stale_after_seconds(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(3)\n    val4 = _stale_after_seconds(1, 2)\n    assert val4 != val1\n    _stale_after_seconds.clear_cache()\n\n\n@cachier(backend=\"memory\", stale_after=DELTA, next_time=True)\ndef _stale_after_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.memory\ndef test_stale_after_next_time():\n    \"\"\"Testing the stale_after with next_time functionality.\"\"\"\n    _stale_after_next_time.clear_cache()\n    val1 = _stale_after_next_time(1, 2)\n    val2 = _stale_after_next_time(1, 2)\n    val3 = _stale_after_next_time(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(SECONDS_IN_DELTA + 1)\n    val4 = _stale_after_next_time(1, 2)\n    assert val4 == val1\n    sleep(0.5)\n    val5 = _stale_after_next_time(1, 2)\n    assert val5 != val1\n    _stale_after_next_time.clear_cache()\n\n\n@cachier(backend=\"memory\")\ndef _random_num():\n    return random()\n\n\n@cachier(backend=\"memory\")\ndef _random_num_with_arg(a):\n    # print(a)\n    return random()\n\n\n@pytest.mark.memory\ndef test_overwrite_cache():\n    \"\"\"Tests that the overwrite feature works correctly.\"\"\"\n    _random_num.clear_cache()\n    int1 = _random_num()\n    int2 = _random_num()\n    assert int2 == int1\n    int3 = _random_num(cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num()\n    assert int4 == int3\n    _random_num.clear_cache()\n\n    _random_num_with_arg.clear_cache()\n    int1 = _random_num_with_arg(\"a\")\n    int2 = _random_num_with_arg(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg(\"a\", cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg(\"a\")\n    assert int4 == int3\n    _random_num_with_arg.clear_cache()\n\n\n@pytest.mark.memory\ndef test_ignore_cache():\n    \"\"\"Tests that the ignore_cache feature works correctly.\"\"\"\n    _random_num.clear_cache()\n    int1 = _random_num()\n    int2 = _random_num()\n    assert int2 == int1\n    int3 = _random_num(cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num()\n    assert int4 != int3\n    assert int4 == int1\n    _random_num.clear_cache()\n\n    _random_num_with_arg.clear_cache()\n    int1 = _random_num_with_arg(\"a\")\n    int2 = _random_num_with_arg(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg(\"a\", cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg(\"a\")\n    assert int4 != int3\n    assert int4 == int1\n    _random_num_with_arg.clear_cache()\n\n\n@cachier(backend=\"memory\")\ndef _takes_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)  # this has to be enough time for check_calculation to run twice\n    return random() + arg_1 + arg_2\n\n\ndef _calls_takes_time(res_queue):\n    res = _takes_time(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.memory\ndef test_memory_being_calculated():\n    \"\"\"Testing memory core handling of being calculated scenarios.\"\"\"\n    _takes_time.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=3)\n    thread2.join(timeout=3)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\n@cachier(backend=\"memory\", stale_after=timedelta(seconds=1), next_time=True)\ndef _being_calc_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\ndef _calls_being_calc_next_time(res_queue):\n    res = _being_calc_next_time(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.memory\ndef test_being_calc_next_time():\n    \"\"\"Testing memory core handling of being calculated scenarios.\"\"\"\n    _takes_time.clear_cache()\n    _being_calc_next_time(0.13, 0.02)\n    sleep(1.1)\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=3)\n    thread2.join(timeout=3)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\n@cachier(backend=\"memory\")\ndef _bad_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n@cachier(backend=\"memory\")\ndef _delete_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n@pytest.mark.memory\ndef test_clear_being_calculated():\n    \"\"\"Test memory core clear `being calculated` functionality.\"\"\"\n    _takes_time.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread1.start()\n    _takes_time.clear_being_calculated()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=3)\n    thread2.join(timeout=3)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 != res2\n\n\n@pytest.mark.memory\ndef test_clear_being_calculated_with_empty_cache():\n    \"\"\"Test memory core clear `being calculated` functionality.\"\"\"\n    _takes_time.clear_cache()\n    _takes_time.clear_being_calculated()\n\n\n@cachier(backend=\"memory\", stale_after=timedelta(seconds=1), next_time=True)\ndef _error_throwing_func(arg1):\n    if not hasattr(_error_throwing_func, \"count\"):\n        _error_throwing_func.count = 0\n    _error_throwing_func.count += 1\n    if _error_throwing_func.count > 1:\n        raise ValueError(\"Tiny Rick!\")\n    return 7\n\n\n@pytest.mark.memory\ndef test_error_throwing_func():\n    # with\n    res1 = _error_throwing_func(4)\n    sleep(1.5)\n    res2 = _error_throwing_func(4)\n    assert res1 == res2\n\n\n@pytest.mark.memory\ndef test_callable_hash_param():\n    def _hash_func(args, kwargs):\n        def _hash(obj):\n            if isinstance(obj, pd.core.frame.DataFrame):\n                return hashlib.sha256(\n                    pd.util.hash_pandas_object(obj).values.tobytes()\n                ).hexdigest()\n            return obj\n\n        k_args = tuple(map(_hash, args))\n        k_kwargs = tuple(\n            sorted({k: _hash(v) for k, v in kwargs.items()}.items())\n        )\n        return k_args + k_kwargs\n\n    @cachier(backend=\"memory\", hash_func=_hash_func)\n    def _params_with_dataframe(*args, **kwargs):\n        \"\"\"Some function.\"\"\"\n        return random()\n\n    _params_with_dataframe.clear_cache()\n\n    df_a = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    df_b = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    value_a = _params_with_dataframe(df_a, 1)\n    value_b = _params_with_dataframe(df_b, 1)\n\n    assert value_a == value_b  # same content --> same key\n\n    value_a = _params_with_dataframe(1, df=df_a)\n    value_b = _params_with_dataframe(1, df=df_b)\n\n    assert value_a == value_b  # same content --> same key\n\n\nif __name__ == \"__main__\":\n    test_memory_being_calculated()\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_core_lookup.py",
      "content": "\"\"\"Testing a few basic cachier interfaces.\"\"\"\n\nimport pytest\nfrom cachier import cachier, get_default_params\nfrom cachier.cores.mongo import MissingMongetter\n\n\ndef test_get_default_params():\n    params = get_default_params()\n    assert tuple(sorted(params)) == (\n        \"allow_none\",\n        \"backend\",\n        \"cache_dir\",\n        \"caching_enabled\",\n        \"hash_func\",\n        \"mongetter\",\n        \"next_time\",\n        \"pickle_reload\",\n        \"separate_files\",\n        \"stale_after\",\n        \"wait_for_calc_timeout\",\n    )\n\n\ndef test_bad_name(name=\"nope\"):\n    # Test that the appropriate exception is thrown\n    # when an invalid backend is given.\n    with pytest.raises(ValueError, match=f\"specified an invalid core: {name}\"):\n\n        @cachier(backend=name)\n        def dummy_func():\n            pass\n\n\ndef test_missing_mongetter():\n    # Test that the appropriate exception is thrown\n    # when forgetting to specify the mongetter.\n    with pytest.raises(MissingMongetter):\n\n        @cachier(backend=\"mongo\", mongetter=None)\n        def dummy_func():\n            pass\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_general.py",
      "content": "\"\"\"Non-core-specific tests for cachier.\"\"\"\n\nimport functools\nimport os\nimport queue\nimport subprocess  # nosec: B404\nimport threading\nfrom contextlib import suppress\nfrom random import random\nfrom time import sleep, time\n\nimport cachier\nimport pytest\nfrom cachier.core import (\n    DEFAULT_MAX_WORKERS,\n    MAX_WORKERS_ENVAR_NAME,\n    _get_executor,\n    _max_workers,\n    _set_max_workers,\n)\n\nfrom tests.test_mongo_core import (\n    MONGO_DELTA_LONG,\n    _test_mongetter,\n)\n\n\ndef test_information():\n    print(\"\\ncachier version: \", end=\"\")\n    print(cachier.__version__)\n\n\ndef test_max_workers():\n    \"\"\"Just call this function for coverage.\"\"\"\n    with suppress(KeyError):\n        del os.environ[MAX_WORKERS_ENVAR_NAME]\n    assert _max_workers() == DEFAULT_MAX_WORKERS\n\n\ndef test_get_executor():\n    \"\"\"Just call this function for coverage.\"\"\"\n    _get_executor()\n    _get_executor(False)\n    _get_executor(True)\n\n\ndef test_set_max_workers():\n    \"\"\"Just call this function for coverage.\"\"\"\n    _set_max_workers(9)\n\n\nparametrize_keys = \"mongetter,stale_after,separate_files\"\nparametrize_values = [\n    pytest.param(\n        _test_mongetter, MONGO_DELTA_LONG, False, marks=pytest.mark.mongo\n    ),\n    (None, None, False),\n    (None, None, True),\n]\n\n\n@pytest.mark.parametrize(parametrize_keys, parametrize_values)\ndef test_wait_for_calc_timeout_ok(mongetter, stale_after, separate_files):\n    @cachier.cachier(\n        mongetter=mongetter,\n        stale_after=stale_after,\n        separate_files=separate_files,\n        next_time=False,\n        wait_for_calc_timeout=2,\n    )\n    def _wait_for_calc_timeout_fast(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        sleep(1)\n        return random() + arg_1 + arg_2\n\n    def _calls_wait_for_calc_timeout_fast(res_queue):\n        res = _wait_for_calc_timeout_fast(1, 2)\n        res_queue.put(res)\n\n    \"\"\" Testing calls that avoid timeouts store the values in cache. \"\"\"\n    _wait_for_calc_timeout_fast.clear_cache()\n    val1 = _wait_for_calc_timeout_fast(1, 2)\n    val2 = _wait_for_calc_timeout_fast(1, 2)\n    assert val1 == val2\n\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_fast,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_fast,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n\n    thread1.start()\n    thread2.start()\n    sleep(2)\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2  # Timeout did not kick in, a single call was done\n\n\n@pytest.mark.parametrize(parametrize_keys, parametrize_values)\ndef test_wait_for_calc_timeout_slow(mongetter, stale_after, separate_files):\n    @cachier.cachier(\n        mongetter=mongetter,\n        stale_after=stale_after,\n        separate_files=separate_files,\n        next_time=False,\n        wait_for_calc_timeout=2,\n    )\n    def _wait_for_calc_timeout_slow(arg_1, arg_2):\n        sleep(3)\n        return random() + arg_1 + arg_2\n\n    def _calls_wait_for_calc_timeout_slow(res_queue):\n        res = _wait_for_calc_timeout_slow(1, 2)\n        res_queue.put(res)\n\n    \"\"\"Testing for calls timing out to be performed twice when needed.\"\"\"\n    _wait_for_calc_timeout_slow.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n\n    thread1.start()\n    thread2.start()\n    sleep(1)\n    res3 = _wait_for_calc_timeout_slow(1, 2)\n    sleep(4)\n    thread1.join(timeout=4)\n    thread2.join(timeout=4)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 != res2  # Timeout kicked in.  Two calls were done\n    res4 = _wait_for_calc_timeout_slow(1, 2)\n    # One of the cached values is returned\n    assert res1 == res4 or res2 == res4 or res3 == res4\n\n\n@pytest.mark.parametrize(\n    \"mongetter,backend\",\n    [\n        pytest.param(_test_mongetter, \"mongo\", marks=pytest.mark.mongo),\n        (None, \"memory\"),\n        (None, \"pickle\"),\n    ],\n)\ndef test_precache_value(mongetter, backend):\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def dummy_func(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return arg_1 + arg_2\n\n    assert dummy_func.precache_value(2, 2, value_to_cache=5) == 5\n    assert dummy_func(2, 2) == 5\n    dummy_func.clear_cache()\n    assert dummy_func(2, 2) == 4\n    assert dummy_func.precache_value(2, arg_2=2, value_to_cache=5) == 5\n    assert dummy_func(2, arg_2=2) == 5\n\n\n@pytest.mark.parametrize(\n    \"mongetter,backend\",\n    [\n        pytest.param(_test_mongetter, \"mongo\", marks=pytest.mark.mongo),\n        (None, \"memory\"),\n        (None, \"pickle\"),\n    ],\n)\ndef test_ignore_self_in_methods(mongetter, backend):\n    class DummyClass:\n        @cachier.cachier(backend=backend, mongetter=mongetter)\n        def takes_2_seconds(self, arg_1, arg_2):\n            \"\"\"Some function.\"\"\"\n            sleep(2)\n            return arg_1 + arg_2\n\n    test_object_1 = DummyClass()\n    test_object_2 = DummyClass()\n    test_object_1.takes_2_seconds.clear_cache()\n    test_object_2.takes_2_seconds.clear_cache()\n    assert test_object_1.takes_2_seconds(1, 2) == 3\n    start = time()\n    assert test_object_2.takes_2_seconds(1, 2) == 3\n    end = time()\n    assert end - start < 1\n\n\ndef test_hash_params_deprecation():\n    with pytest.deprecated_call(match=\"hash_params will be removed\"):\n\n        @cachier.cachier(hash_params=lambda a, k: \"key\")\n        def test():\n            return \"value\"\n\n    assert test() == \"value\"\n\n\ndef test_separate_processes():\n    test_args = (\"python\", \"tests/standalone_script.py\")\n    run_params = {\"args\": test_args, \"capture_output\": True, \"text\": True}\n    run_process = functools.partial(subprocess.run, **run_params)\n    result = run_process()\n    assert result.stdout.strip() == \"two 2\"\n    start = time()\n    result = run_process()\n    end = time()\n    assert result.stdout.strip() == \"two 2\"\n    assert end - start < 3\n\n\ndef test_global_disable():\n    @cachier.cachier()\n    def get_random():\n        return random()\n\n    get_random.clear_cache()\n    result_1 = get_random()\n    result_2 = get_random()\n    cachier.disable_caching()\n    result_3 = get_random()\n    cachier.enable_caching()\n    result_4 = get_random()\n    assert result_1 == result_2 == result_4\n    assert result_1 != result_3\n\n\ndef test_none_not_cached_by_default():\n    count = 0\n\n    @cachier.cachier()\n    def do_operation():\n        nonlocal count\n        count += 1\n        return None\n\n    do_operation.clear_cache()\n    assert count == 0\n    do_operation()\n    do_operation()\n    assert count == 2\n\n\ndef test_allow_caching_none():\n    count = 0\n\n    @cachier.cachier(allow_none=True)\n    def do_operation():\n        nonlocal count\n        count += 1\n        return None\n\n    do_operation.clear_cache()\n    assert count == 0\n    do_operation()\n    do_operation()\n    assert count == 1\n\n\ndef test_identical_inputs():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a: int, b: int = 2, c: int = 3):\n        nonlocal count\n        count += 1\n        return a + b + c\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func(1, 2, 3) == 6\n    assert dummy_func(1, 2, c=3) == 6\n    assert dummy_func(1, b=2, c=3) == 6\n    assert dummy_func(a=1, b=2, c=3) == 6\n    assert count == 1\n\n\ndef test_list_inputs():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a: list, b: list = [2]):  # noqa: B006\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func([1]) == [1, 2]\n    assert dummy_func([1], [2]) == [1, 2]\n    assert dummy_func([1], b=[2]) == [1, 2]\n    assert dummy_func(a=[1], b=[2]) == [1, 2]\n    assert count == 1\n\n\ndef test_order_independent_kwargs_handling():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a, b):\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func(a=1, b=2) == 3\n    assert dummy_func(a=1, b=2) == 3\n    assert dummy_func(b=2, a=1) == 3\n    assert count == 1\n\n\n@pytest.mark.parametrize(\"backend\", [\"memory\", \"pickle\"])\ndef test_diff_functions_same_args(tmpdir, backend: str):\n    count_p = count_m = 0\n\n    @cachier.cachier(cache_dir=tmpdir, backend=backend)\n    def fn_plus(a, b=2):\n        nonlocal count_p\n        count_p += 1\n        return a + b\n\n    @cachier.cachier(cache_dir=tmpdir, backend=backend)\n    def fn_minus(a, b=2):\n        nonlocal count_m\n        count_m += 1\n        return a - b\n\n    assert count_p == count_m == 0\n\n    for fn, expected in [(fn_plus, 3), (fn_minus, -1)]:\n        assert fn(1) == expected\n        assert fn(a=1, b=2) == expected\n    assert count_p == 1\n    assert count_m == 1\n\n\n@pytest.mark.parametrize(\"backend\", [\"memory\", \"pickle\"])\ndef test_runtime_handling(tmpdir, backend):\n    count_p = count_m = 0\n\n    def fn_plus(a, b=2):\n        nonlocal count_p\n        count_p += 1\n        return a + b\n\n    def fn_minus(a, b=2):\n        nonlocal count_m\n        count_m += 1\n        return a - b\n\n    cachier_ = cachier.cachier(cache_dir=tmpdir, backend=backend)\n    assert count_p == count_m == 0\n\n    for fn, expected in [(fn_plus, 3), (fn_minus, -1)]:\n        assert cachier_(fn)(1, 2) == expected\n        assert cachier_(fn)(a=1, b=2) == expected\n    assert count_p == 1\n    assert count_m == 1\n\n    for fn, expected in [(fn_plus, 5), (fn_minus, 1)]:\n        assert cachier_(fn)(3, 2) == expected\n        assert cachier_(fn)(a=3, b=2) == expected\n    assert count_p == 2\n    assert count_m == 2\n\n\ndef test_partial_handling(tmpdir):\n    count_p = count_m = 0\n\n    def fn_plus(a, b=2):\n        nonlocal count_p\n        count_p += 1\n        return a + b\n\n    def fn_minus(a, b=2):\n        nonlocal count_m\n        count_m += 1\n        return a - b\n\n    cachier_ = cachier.cachier(cache_dir=tmpdir)\n    assert count_p == count_m == 0\n\n    for fn, expected in [(fn_plus, 3), (fn_minus, -1)]:\n        dummy_ = functools.partial(fn, 1)\n        assert cachier_(dummy_)() == expected\n\n        dummy_ = functools.partial(fn, 1)\n        assert cachier_(dummy_)(2) == expected\n\n        dummy_ = functools.partial(fn, a=1)\n        assert cachier_(dummy_)() == expected\n\n        dummy_ = functools.partial(fn, b=2)\n        assert cachier_(dummy_)(1) == expected\n\n        dummy_ = functools.partial(fn, b=2)\n        assert cachier_(dummy_)(1, b=2) == expected\n\n        assert cachier_(fn)(1, 2) == expected\n        assert cachier_(fn)(a=1, b=2) == expected\n\n    assert count_p == 1\n    assert count_m == 1\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_defaults.py",
      "content": "import datetime\nimport os\nimport queue\nimport random\nimport threading\nimport time\n\nimport cachier\nimport pytest\n\nfrom tests.test_mongo_core import MONGO_DELTA, _test_mongetter\n\n_default_params = cachier.get_default_params().copy()\n\n\ndef setup_function():\n    cachier.set_default_params(**_default_params)\n\n\ndef teardown_function():\n    cachier.set_default_params(**_default_params)\n\n\ndef test_hash_func_default_param():\n    def slow_hash_func(args, kwds):\n        time.sleep(2)\n        return \"hash\"\n\n    def fast_hash_func(args, kwds):\n        return \"hash\"\n\n    cachier.set_default_params(hash_func=slow_hash_func)\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(hash_func=fast_hash_func)\n    def global_test_2():\n        return None\n\n    start = time.time()\n    global_test_1()\n    end = time.time()\n    assert end - start > 1\n    start = time.time()\n    global_test_2()\n    end = time.time()\n    assert end - start < 1\n\n\ndef test_backend_default_param():\n    cachier.set_default_params(backend=\"memory\")\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(backend=\"pickle\")\n    def global_test_2():\n        return None\n\n    assert global_test_1.cache_dpath() is None\n    assert global_test_2.cache_dpath() is not None\n\n\n@pytest.mark.mongo\ndef test_mongetter_default_param():\n    cachier.set_default_params(mongetter=_test_mongetter)\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(mongetter=False)\n    def global_test_2():\n        return None\n\n    assert global_test_1.cache_dpath() is None\n    assert global_test_2.cache_dpath() is not None\n\n\ndef test_cache_dir_default_param(tmpdir):\n    cachier.set_default_params(cache_dir=tmpdir / \"1\")\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(cache_dir=tmpdir / \"2\")\n    def global_test_2():\n        return None\n\n    assert global_test_1.cache_dpath() == str(tmpdir / \"1\")\n    assert global_test_2.cache_dpath() == str(tmpdir / \"2\")\n\n\ndef test_separate_files_default_param(tmpdir):\n    cachier.set_default_params(separate_files=True)\n\n    @cachier.cachier(cache_dir=tmpdir / \"1\")\n    def global_test_1(arg_1, arg_2):\n        return arg_1 + arg_2\n\n    @cachier.cachier(cache_dir=tmpdir / \"2\", separate_files=False)\n    def global_test_2(arg_1, arg_2):\n        return arg_1 + arg_2\n\n    global_test_1(1, 2)\n    global_test_1(3, 4)\n    global_test_2(1, 2)\n    global_test_2(3, 4)\n\n    assert len(os.listdir(global_test_1.cache_dpath())) == 2\n    assert len(os.listdir(global_test_2.cache_dpath())) == 1\n\n\ndef test_allow_none_default_param(tmpdir):\n    cachier.set_default_params(\n        allow_none=True,\n        separate_files=True,\n        verbose_cache=True,\n    )\n    allow_count = disallow_count = 0\n\n    @cachier.cachier(cache_dir=tmpdir)\n    def allow_none():\n        nonlocal allow_count\n        allow_count += 1\n        return None\n\n    @cachier.cachier(cache_dir=tmpdir, allow_none=False)\n    def disallow_none():\n        nonlocal disallow_count\n        disallow_count += 1\n        return None\n\n    assert allow_count == 0\n    allow_none()\n    allow_none()\n    assert allow_count == 1\n\n    assert disallow_count == 0\n    disallow_none()\n    disallow_none()\n    assert disallow_count == 2\n\n    disallow_none(cachier__allow_none=True)\n    disallow_none(cachier__allow_none=True)\n    assert disallow_count == 2\n\n\nPARAMETRIZE_TEST = (\n    \"backend,mongetter\",\n    [\n        pytest.param(\"pickle\", None, marks=pytest.mark.pickle),\n        pytest.param(\"mongo\", _test_mongetter, marks=pytest.mark.mongo),\n    ],\n)\n\n\n@pytest.mark.parametrize(*PARAMETRIZE_TEST)\ndef test_stale_after_applies_dynamically(backend, mongetter):\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def _stale_after_test(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random.random() + arg_1 + arg_2\n\n    cachier.set_default_params(stale_after=MONGO_DELTA)\n\n    _stale_after_test.clear_cache()\n    val1 = _stale_after_test(1, 2)\n    val2 = _stale_after_test(1, 2)\n    assert val1 == val2\n    time.sleep(3)\n    val3 = _stale_after_test(1, 2)\n    assert val3 != val1\n\n\n@pytest.mark.parametrize(*PARAMETRIZE_TEST)\ndef test_next_time_applies_dynamically(backend, mongetter):\n    NEXT_AFTER_DELTA = datetime.timedelta(seconds=3)\n\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def _stale_after_next_time(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random.random()\n\n    cachier.set_default_params(stale_after=NEXT_AFTER_DELTA, next_time=True)\n\n    _stale_after_next_time.clear_cache()\n    val1 = _stale_after_next_time(1, 2)\n    val2 = _stale_after_next_time(1, 2)\n    val3 = _stale_after_next_time(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    time.sleep(NEXT_AFTER_DELTA.seconds + 1)\n    val4 = _stale_after_next_time(1, 2)\n    assert val4 == val1\n    time.sleep(0.5)\n    val5 = _stale_after_next_time(1, 2)\n    assert val5 != val1\n    _stale_after_next_time.clear_cache()\n\n\n@pytest.mark.parametrize(*PARAMETRIZE_TEST)\ndef test_wait_for_calc_applies_dynamically(backend, mongetter):\n    \"\"\"Testing for calls timing out to be performed twice when needed.\"\"\"\n\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def _wait_for_calc_timeout_slow(arg_1, arg_2):\n        time.sleep(3)\n        return random.random() + arg_1 + arg_2\n\n    def _calls_wait_for_calc_timeout_slow(res_queue):\n        res = _wait_for_calc_timeout_slow(1, 2)\n        res_queue.put(res)\n\n    cachier.set_default_params(wait_for_calc_timeout=2)\n    _wait_for_calc_timeout_slow.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n\n    thread1.start()\n    thread2.start()\n    time.sleep(1)\n    res3 = _wait_for_calc_timeout_slow(1, 2)\n    time.sleep(5)\n    thread1.join(timeout=5)\n    thread2.join(timeout=5)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 != res2  # Timeout kicked in.  Two calls were done\n    res4 = _wait_for_calc_timeout_slow(1, 2)\n    # One of the cached values is returned\n    assert res1 == res4 or res2 == res4 or res3 == res4\n\n\ndef test_default_kwargs_handling():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a, b=2):\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func(1) == 3\n    assert dummy_func(a=1) == 3\n    assert dummy_func(a=1, b=2) == 3\n    assert count == 1\n\n\ndef test_deprecated_func_kwargs():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a, b=2):\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    with pytest.deprecated_call(\n        match=\"`verbose_cache` is deprecated and will be removed\"\n    ):\n        assert dummy_func(1, verbose_cache=True) == 3\n    assert count == 1\n    with pytest.deprecated_call(\n        match=\"`ignore_cache` is deprecated and will be removed\"\n    ):\n        assert dummy_func(1, ignore_cache=True) == 3\n    assert count == 2\n    with pytest.deprecated_call(\n        match=\"`overwrite_cache` is deprecated and will be removed\"\n    ):\n        assert dummy_func(1, overwrite_cache=True) == 3\n    assert count == 3\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_pickle_core.py",
      "content": "\"\"\"Test for the Cachier python package.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\n# from os.path import (\n#     realpath,\n#     dirname\n# )\nimport os\nimport pickle\nimport threading\nfrom datetime import timedelta\nfrom random import random\nfrom time import sleep, time\n\nimport pytest\n\ntry:\n    import queue\nexcept ImportError:  # python 2\n    import Queue as queue  # type: ignore\n\nimport hashlib\n\nimport pandas as pd\nfrom cachier import cachier\nfrom cachier.core import _default_params\n\n\ndef _get_decorated_func(func, **kwargs):\n    cachier_decorator = cachier(**kwargs)\n    decorated_func = cachier_decorator(func)\n    return decorated_func\n\n\n# Pickle core tests\n\n\ndef _takes_2_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)\n    return f\"arg_1:{arg_1}, arg_2:{arg_2}\"\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"reload\", [True, False])\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_core(reload, separate_files):\n    \"\"\"Basic Pickle core functionality.\"\"\"\n    _takes_2_seconds_decorated = _get_decorated_func(\n        _takes_2_seconds,\n        next_time=False,\n        pickle_reload=reload,\n        separate_files=separate_files,\n    )\n    _takes_2_seconds_decorated.clear_cache()\n    _takes_2_seconds_decorated(\"a\", \"b\")\n    start = time()\n    _takes_2_seconds_decorated(\"a\", \"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds_decorated.clear_cache()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_core_keywords(separate_files):\n    \"\"\"Basic Pickle core functionality with keyword arguments.\"\"\"\n    _takes_2_seconds_decorated = _get_decorated_func(\n        _takes_2_seconds, next_time=False, separate_files=separate_files\n    )\n    _takes_2_seconds_decorated.clear_cache()\n    _takes_2_seconds_decorated(\"a\", arg_2=\"b\")\n    start = time()\n    _takes_2_seconds_decorated(\"a\", arg_2=\"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds_decorated.clear_cache()\n\n\nSECONDS_IN_DELTA = 3\nDELTA = timedelta(seconds=SECONDS_IN_DELTA)\n\n\ndef _stale_after_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_stale_after(separate_files):\n    \"\"\"Testing the stale_after functionality.\"\"\"\n    _stale_after_seconds_decorated = _get_decorated_func(\n        _stale_after_seconds,\n        stale_after=DELTA,\n        next_time=False,\n        separate_files=separate_files,\n    )\n    _stale_after_seconds_decorated.clear_cache()\n    val1 = _stale_after_seconds_decorated(1, 2)\n    val2 = _stale_after_seconds_decorated(1, 2)\n    val3 = _stale_after_seconds_decorated(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(3)\n    val4 = _stale_after_seconds_decorated(1, 2)\n    assert val4 != val1\n    _stale_after_seconds_decorated.clear_cache()\n\n\ndef _stale_after_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_stale_after_next_time(separate_files):\n    \"\"\"Testing the stale_after with next_time functionality.\"\"\"\n    _stale_after_next_time_decorated = _get_decorated_func(\n        _stale_after_next_time,\n        stale_after=DELTA,\n        next_time=True,\n        separate_files=separate_files,\n    )\n    _stale_after_next_time_decorated.clear_cache()\n    val1 = _stale_after_next_time_decorated(1, 2)\n    val2 = _stale_after_next_time_decorated(1, 2)\n    val3 = _stale_after_next_time_decorated(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(SECONDS_IN_DELTA + 1)\n    val4 = _stale_after_next_time_decorated(1, 2)\n    assert val4 == val1\n    sleep(0.5)\n    val5 = _stale_after_next_time_decorated(1, 2)\n    assert val5 != val1\n    _stale_after_next_time_decorated.clear_cache()\n\n\ndef _random_num():\n    return random()\n\n\ndef _random_num_with_arg(a):\n    # print(a)\n    return random()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_overwrite_cache(separate_files):\n    \"\"\"Tests that the overwrite feature works correctly.\"\"\"\n    _random_num_decorated = _get_decorated_func(\n        _random_num, separate_files=separate_files\n    )\n    _random_num_with_arg_decorated = _get_decorated_func(\n        _random_num_with_arg, separate_files=separate_files\n    )\n    _random_num_decorated.clear_cache()\n    int1 = _random_num_decorated()\n    int2 = _random_num_decorated()\n    assert int2 == int1\n    int3 = _random_num_decorated(cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num_decorated()\n    assert int4 == int3\n    _random_num_decorated.clear_cache()\n\n    _random_num_with_arg_decorated.clear_cache()\n    int1 = _random_num_with_arg_decorated(\"a\")\n    int2 = _random_num_with_arg_decorated(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg_decorated(\"a\", cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg_decorated(\"a\")\n    assert int4 == int3\n    _random_num_with_arg_decorated.clear_cache()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_ignore_cache(separate_files):\n    \"\"\"Tests that the ignore_cache feature works correctly.\"\"\"\n    _random_num_decorated = _get_decorated_func(\n        _random_num, separate_files=separate_files\n    )\n    _random_num_with_arg_decorated = _get_decorated_func(\n        _random_num_with_arg, separate_files=separate_files\n    )\n    _random_num_decorated.clear_cache()\n    int1 = _random_num_decorated()\n    int2 = _random_num_decorated()\n    assert int2 == int1\n    int3 = _random_num_decorated(cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num_decorated()\n    assert int4 != int3\n    assert int4 == int1\n    _random_num_decorated.clear_cache()\n\n    _random_num_with_arg_decorated.clear_cache()\n    int1 = _random_num_with_arg_decorated(\"a\")\n    int2 = _random_num_with_arg_decorated(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg_decorated(\"a\", cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg_decorated(\"a\")\n    assert int4 != int3\n    assert int4 == int1\n    _random_num_with_arg_decorated.clear_cache()\n\n\ndef _takes_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)  # this has to be enough time for check_calculation to run twice\n    return random() + arg_1 + arg_2\n\n\ndef _calls_takes_time(takes_time_func, res_queue):\n    res = takes_time_func(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_being_calculated(separate_files):\n    \"\"\"Testing pickle core handling of being calculated scenarios.\"\"\"\n    _takes_time_decorated = _get_decorated_func(\n        _takes_time, separate_files=separate_files\n    )\n    _takes_time_decorated.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time,\n        kwargs={\n            \"takes_time_func\": _takes_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time,\n        kwargs={\n            \"takes_time_func\": _takes_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=4)\n    thread2.join(timeout=4)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\ndef _being_calc_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\ndef _calls_being_calc_next_time(being_calc_func, res_queue):\n    res = being_calc_func(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_being_calc_next_time(separate_files):\n    \"\"\"Testing pickle core handling of being calculated scenarios.\"\"\"\n    _being_calc_next_time_decorated = _get_decorated_func(\n        _being_calc_next_time,\n        stale_after=timedelta(seconds=1),\n        next_time=True,\n        separate_files=separate_files,\n    )\n    _being_calc_next_time_decorated.clear_cache()\n    _being_calc_next_time(0.13, 0.02)\n    sleep(1.1)\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\n            \"being_calc_func\": _being_calc_next_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\n            \"being_calc_func\": _being_calc_next_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\ndef _bad_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n# _BAD_CACHE_FNAME = '.__main__._bad_cache'\n_BAD_CACHE_FNAME = \".tests.test_pickle_core._bad_cache\"\n_BAD_CACHE_FNAME_SEPARATE_FILES = (\n    \".tests.test_pickle_core._bad_cache_\"\n    f\"{hashlib.sha256(pickle.dumps((0.13, 0.02))).hexdigest()}\"\n)\nEXPANDED_CACHIER_DIR = os.path.expanduser(_default_params[\"cache_dir\"])\n_BAD_CACHE_FPATH = os.path.join(EXPANDED_CACHIER_DIR, _BAD_CACHE_FNAME)\n_BAD_CACHE_FPATH_SEPARATE_FILES = os.path.join(\n    EXPANDED_CACHIER_DIR, _BAD_CACHE_FNAME_SEPARATE_FILES\n)\n_BAD_CACHE_FPATHS = {\n    True: _BAD_CACHE_FPATH_SEPARATE_FILES,\n    False: _BAD_CACHE_FPATH,\n}\n\n\ndef _calls_bad_cache(bad_cache_func, res_queue, trash_cache, separate_files):\n    try:\n        res = bad_cache_func(0.13, 0.02, cachier__verbose=True)\n        if trash_cache:\n            with open(_BAD_CACHE_FPATHS[separate_files], \"w\") as cache_file:\n                cache_file.seek(0)\n                cache_file.truncate()\n        res_queue.put(res)\n    except Exception as exc:\n        res_queue.put(exc)\n\n\ndef _helper_bad_cache_file(sleeptime, separate_files):\n    \"\"\"Test pickle core handling of bad cache files.\"\"\"\n    _bad_cache_decorated = _get_decorated_func(\n        _bad_cache, separate_files=separate_files\n    )\n    _bad_cache_decorated.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_bad_cache,\n        kwargs={\n            \"bad_cache_func\": _bad_cache_decorated,\n            \"res_queue\": res_queue,\n            \"trash_cache\": True,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_bad_cache,\n        kwargs={\n            \"bad_cache_func\": _bad_cache_decorated,\n            \"res_queue\": res_queue,\n            \"trash_cache\": False,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(sleeptime)\n    thread2.start()\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    if res_queue.qsize() != 2:\n        return False\n    res1 = res_queue.get()\n    if not isinstance(res1, float):\n        return False\n    res2 = res_queue.get()\n    if res2 is not None or isinstance(res2, KeyError):\n        return False\n    return True\n\n\n# we want this to succeed at least once\n@pytest.mark.pickle\n@pytest.mark.xfail\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_bad_cache_file(separate_files):\n    \"\"\"Test pickle core handling of bad cache files.\"\"\"\n    sleeptimes = [0.1, 0.2, 0.3, 0.5, 0.6, 0.7, 0.8, 1, 1.5, 2]\n    sleeptimes = sleeptimes + sleeptimes\n    for sleeptime in sleeptimes:\n        if _helper_bad_cache_file(sleeptime, separate_files):\n            return\n    raise AssertionError()\n\n\ndef _delete_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n# _DEL_CACHE_FNAME = '.__main__._delete_cache'\n_DEL_CACHE_FNAME = \".tests.test_pickle_core._delete_cache\"\n_DEL_CACHE_FNAME_SEPARATE_FILES = (\n    \".tests.test_pickle_core._delete_cache_\"\n    f\"{hashlib.sha256(pickle.dumps((0.13, 0.02))).hexdigest()}\"\n)\n_DEL_CACHE_FPATH = os.path.join(EXPANDED_CACHIER_DIR, _DEL_CACHE_FNAME)\n_DEL_CACHE_FPATH_SEPARATE_FILES = os.path.join(\n    EXPANDED_CACHIER_DIR, _DEL_CACHE_FNAME_SEPARATE_FILES\n)\n_DEL_CACHE_FPATHS = {\n    True: _DEL_CACHE_FPATH_SEPARATE_FILES,\n    False: _DEL_CACHE_FPATH,\n}\n\n\ndef _calls_delete_cache(del_cache_func, res_queue, del_cache, separate_files):\n    try:\n        # print('in')\n        res = del_cache_func(0.13, 0.02)\n        # print('out with {}'.format(res))\n        if del_cache:\n            os.remove(_DEL_CACHE_FPATHS[separate_files])\n            # print(os.path.isfile(_DEL_CACHE_FPATH))\n        res_queue.put(res)\n    except Exception as exc:\n        # print('found')\n        res_queue.put(exc)\n\n\ndef _helper_delete_cache_file(sleeptime, separate_files):\n    \"\"\"Test pickle core handling of missing cache files.\"\"\"\n    _delete_cache_decorated = _get_decorated_func(\n        _delete_cache, separate_files=separate_files\n    )\n    _delete_cache_decorated.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_delete_cache,\n        kwargs={\n            \"del_cache_func\": _delete_cache_decorated,\n            \"res_queue\": res_queue,\n            \"del_cache\": True,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_delete_cache,\n        kwargs={\n            \"del_cache_func\": _delete_cache_decorated,\n            \"res_queue\": res_queue,\n            \"del_cache\": False,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(sleeptime)\n    thread2.start()\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    if res_queue.qsize() != 2:\n        return False\n    res1 = res_queue.get()\n    # print(res1)\n    if not isinstance(res1, float):\n        return False\n    res2 = res_queue.get()\n    if not ((isinstance(res2, KeyError)) or (res2 is None)):\n        return False\n    return True\n    # print(res2)\n    # print(type(res2))\n\n\n@pytest.mark.pickle\n@pytest.mark.xfail\n@pytest.mark.parametrize(\"separate_files\", [False, True])\ndef test_delete_cache_file(separate_files):\n    \"\"\"Test pickle core handling of missing cache files.\"\"\"\n    sleeptimes = [0.1, 0.2, 0.3, 0.5, 0.7, 1]\n    sleeptimes = sleeptimes * 4\n    for sleeptime in sleeptimes:\n        if _helper_delete_cache_file(sleeptime, separate_files):\n            return\n    raise AssertionError()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [False, True])\ndef test_clear_being_calculated(separate_files):\n    \"\"\"Test pickle core clear `being calculated` functionality.\"\"\"\n    _takes_time_decorated = _get_decorated_func(\n        _takes_time, separate_files=separate_files\n    )\n    _takes_time_decorated.clear_being_calculated()\n\n\ndef _error_throwing_func(arg1):\n    if not hasattr(_error_throwing_func, \"count\"):\n        _error_throwing_func.count = 0\n    _error_throwing_func.count += 1\n    if _error_throwing_func.count > 1:\n        raise ValueError(\"Tiny Rick!\")\n    return 7\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_error_throwing_func(separate_files):\n    # with\n    _error_throwing_func.count = 0\n    _error_throwing_func_decorated = _get_decorated_func(\n        _error_throwing_func,\n        stale_after=timedelta(seconds=1),\n        next_time=True,\n        separate_files=separate_files,\n    )\n    _error_throwing_func_decorated.clear_cache()\n    res1 = _error_throwing_func_decorated(4)\n    sleep(1.5)\n    res2 = _error_throwing_func_decorated(4)\n    assert res1 == res2\n\n\n# test custom cache dir for pickle core\n\nCUSTOM_DIR = \"~/.exparrot\"\nEXPANDED_CUSTOM_DIR = os.path.expanduser(CUSTOM_DIR)\n\n\ndef _takes_2_seconds_custom_dir(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)\n    return f\"arg_1:{arg_1}, arg_2:{arg_2}\"\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_core_custom_cache_dir(separate_files):\n    \"\"\"Basic Pickle core functionality.\"\"\"\n    _takes_2_seconds_custom_dir_decorated = _get_decorated_func(\n        _takes_2_seconds_custom_dir,\n        next_time=False,\n        cache_dir=CUSTOM_DIR,\n        separate_files=separate_files,\n    )\n    _takes_2_seconds_custom_dir_decorated.clear_cache()\n    _takes_2_seconds_custom_dir_decorated(\"a\", \"b\")\n    start = time()\n    _takes_2_seconds_custom_dir_decorated(\"a\", \"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds_custom_dir_decorated.clear_cache()\n    path2test = _takes_2_seconds_custom_dir_decorated.cache_dpath()\n    assert path2test == EXPANDED_CUSTOM_DIR\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_callable_hash_param(separate_files):\n    def _hash_func(args, kwargs):\n        def _hash(obj):\n            if isinstance(obj, pd.core.frame.DataFrame):\n                return hashlib.sha256(\n                    pd.util.hash_pandas_object(obj).values.tobytes()\n                ).hexdigest()\n            return obj\n\n        k_args = tuple(map(_hash, args))\n        k_kwargs = tuple(\n            sorted({k: _hash(v) for k, v in kwargs.items()}.items())\n        )\n        return k_args + k_kwargs\n\n    @cachier(hash_func=_hash_func, separate_files=separate_files)\n    def _params_with_dataframe(*args, **kwargs):\n        \"\"\"Some function.\"\"\"\n        return random()\n\n    _params_with_dataframe.clear_cache()\n\n    df_a = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    df_b = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    value_a = _params_with_dataframe(df_a, 1)\n    value_b = _params_with_dataframe(df_b, 1)\n\n    assert value_a == value_b  # same content --> same key\n\n    value_a = _params_with_dataframe(1, df=df_a)\n    value_b = _params_with_dataframe(1, df=df_b)\n\n    assert value_a == value_b  # same content --> same key\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_mongo_core.py",
      "content": "\"\"\"Testing the MongoDB core of cachier.\"\"\"\n\nimport datetime\nimport hashlib\nimport platform\nimport queue\nimport sys\nimport threading\nfrom datetime import timedelta\nfrom random import random\nfrom time import sleep\nfrom urllib.parse import quote_plus\n\nimport pandas as pd\nimport pymongo\nimport pytest\nfrom birch import Birch  # type: ignore[import-not-found]\nfrom cachier import cachier\nfrom cachier.cores.base import RecalculationNeeded\nfrom cachier.cores.mongo import _MongoCore\nfrom pymongo.errors import OperationFailure\nfrom pymongo.mongo_client import MongoClient\nfrom pymongo_inmemory import MongoClient as InMemoryMongoClient\n\n# === Enables testing vs a real MongoDB instance ===\n\n\nclass CfgKey:\n    HOST = \"TEST_HOST\"\n    PORT = \"TEST_PORT\"\n    # UNAME = \"TEST_USERNAME\"\n    # PWD = \"TEST_PASSWORD\"\n    # DB = \"TEST_DB\"\n    TEST_VS_DOCKERIZED_MONGO = \"TEST_VS_DOCKERIZED_MONGO\"\n\n\nCFG = Birch(\n    namespace=\"cachier\",\n    defaults={CfgKey.TEST_VS_DOCKERIZED_MONGO: False},\n)\n\n\n# URI_TEMPLATE = \"mongodb://myUser:myPassword@localhost:27017/\"\nURI_TEMPLATE = \"mongodb://{host}:{port}?retrywrites=true&w=majority\"\n\n\ndef _get_cachier_db_mongo_client():\n    host = quote_plus(CFG[CfgKey.HOST])\n    port = quote_plus(CFG[CfgKey.PORT])\n    # uname = quote_plus(CFG[CfgKey.UNAME])\n    # pwd = quote_plus(CFG[CfgKey.PWD])\n    # db = quote_plus(CFG[CfgKey.DB])\n    uri = f\"mongodb://{host}:{port}?retrywrites=true&w=majority\"\n    return MongoClient(uri)\n\n\n_COLLECTION_NAME = (\n    f\"cachier_test_{platform.system()}\"\n    f\"_{'.'.join(map(str, sys.version_info[:3]))}\"\n)\n\n\ndef _test_mongetter():\n    if not hasattr(_test_mongetter, \"client\"):\n        if str(CFG.mget(CfgKey.TEST_VS_DOCKERIZED_MONGO)).lower() == \"true\":\n            print(\"Using live MongoDB instance for testing.\")\n            _test_mongetter.client = _get_cachier_db_mongo_client()\n        else:\n            print(\"Using in-memory MongoDB instance for testing.\")\n            _test_mongetter.client = InMemoryMongoClient()\n    db_obj = _test_mongetter.client[\"cachier_test\"]\n    if _COLLECTION_NAME not in db_obj.list_collection_names():\n        db_obj.create_collection(_COLLECTION_NAME)\n    return db_obj[_COLLECTION_NAME]\n\n\n# === Mongo core tests ===\n\n\n@pytest.mark.mongo\ndef test_information():\n    print(\"\\npymongo version: \", end=\"\")\n    print(pymongo.__version__)\n\n\n@pytest.mark.mongo\ndef test_mongo_index_creation():\n    \"\"\"Basic Mongo core functionality.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _test_mongo_caching(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    collection = _test_mongetter()\n    _test_mongo_caching.clear_cache()\n    val1 = _test_mongo_caching(1, 2)\n    val2 = _test_mongo_caching(1, 2)\n    assert val1 == val2\n    assert _MongoCore._INDEX_NAME in collection.index_information()\n\n\n@pytest.mark.mongo\ndef test_mongo_core():\n    \"\"\"Basic Mongo core functionality.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _test_mongo_caching(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _test_mongo_caching.clear_cache()\n    val1 = _test_mongo_caching(1, 2)\n    val2 = _test_mongo_caching(1, 2)\n    assert val1 == val2\n    val3 = _test_mongo_caching(1, 2, cachier__skip_cache=True)\n    assert val3 != val1\n    val4 = _test_mongo_caching(1, 2)\n    assert val4 == val1\n    val5 = _test_mongo_caching(1, 2, cachier__overwrite_cache=True)\n    assert val5 != val1\n    val6 = _test_mongo_caching(1, 2)\n    assert val6 == val5\n\n\n@pytest.mark.mongo\ndef test_mongo_core_keywords():\n    \"\"\"Basic Mongo core functionality with keyword arguments.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _test_mongo_caching(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _test_mongo_caching.clear_cache()\n    val1 = _test_mongo_caching(1, arg_2=2)\n    val2 = _test_mongo_caching(1, arg_2=2)\n    assert val1 == val2\n    val3 = _test_mongo_caching(1, arg_2=2, cachier__skip_cache=True)\n    assert val3 != val1\n    val4 = _test_mongo_caching(1, arg_2=2)\n    assert val4 == val1\n    val5 = _test_mongo_caching(1, arg_2=2, cachier__overwrite_cache=True)\n    assert val5 != val1\n    val6 = _test_mongo_caching(1, arg_2=2)\n    assert val6 == val5\n\n\nMONGO_DELTA = timedelta(seconds=3)\nMONGO_DELTA_LONG = timedelta(seconds=10)\n\n\n@pytest.mark.mongo\ndef test_mongo_stale_after():\n    \"\"\"Testing MongoDB core stale_after functionality.\"\"\"\n\n    @cachier(\n        mongetter=_test_mongetter, stale_after=MONGO_DELTA, next_time=False\n    )\n    def _stale_after_mongo(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _stale_after_mongo.clear_cache()\n    val1 = _stale_after_mongo(1, 2)\n    val2 = _stale_after_mongo(1, 2)\n    assert val1 == val2\n    sleep(3)\n    val3 = _stale_after_mongo(1, 2)\n    assert val3 != val1\n\n\ndef _calls_takes_time(res_queue):\n    @cachier(mongetter=_test_mongetter)\n    def _takes_time(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        sleep(3)\n        return random() + arg_1 + arg_2\n\n    res = _takes_time(34, 82.3)\n    res_queue.put(res)\n\n\n@pytest.mark.mongo\ndef test_mongo_being_calculated():\n    \"\"\"Testing MongoDB core handling of being calculated scenarios.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _takes_time(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        sleep(3)\n        return random() + arg_1 + arg_2\n\n    _takes_time.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread1.start()\n    sleep(1)\n    thread2.start()\n    thread1.join(timeout=4)\n    thread2.join(timeout=4)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\nclass _BadMongoCollection:\n    def __init__(self, mongetter):\n        self.collection = mongetter()\n        self.index_information = self.collection.index_information\n        self.create_indexes = self.collection.create_indexes\n        self.find_one = self.collection.find_one\n\n    def delete_many(self, *args, **kwargs):\n        pass\n\n    def update_many(self, *args, **kwargs):\n        pass\n\n    def update_one(self, *args, **kwargs):\n        raise OperationFailure(Exception())\n\n\ndef _bad_mongetter():\n    return _BadMongoCollection(_test_mongetter)\n\n\n@pytest.mark.mongo\ndef test_mongo_write_failure():\n    \"\"\"Testing MongoDB core handling of writing failure scenarios.\"\"\"\n\n    @cachier(mongetter=_bad_mongetter)\n    def _func_w_bad_mongo(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    with pytest.raises(OperationFailure):\n        val1 = _func_w_bad_mongo(1, 2)\n        val2 = _func_w_bad_mongo(1, 2)\n        assert val1 == val2\n\n\n@pytest.mark.mongo\ndef test_mongo_clear_being_calculated():\n    \"\"\"Testing MongoDB core clear_being_calculated.\"\"\"\n\n    @cachier(mongetter=_bad_mongetter)\n    def _func_w_bad_mongo(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _func_w_bad_mongo.clear_being_calculated()\n\n\n@pytest.mark.mongo\ndef test_stalled_mongo_db_cache():\n    @cachier(mongetter=_test_mongetter)\n    def _stalled_func():\n        return 1\n\n    core = _MongoCore(None, _test_mongetter, 0)\n    core.set_func(_stalled_func)\n    core.clear_cache()\n    with pytest.raises(RecalculationNeeded):\n        core.wait_on_entry_calc(key=None)\n\n\n@pytest.mark.mongo\ndef test_stalled_mong_db_core(monkeypatch):\n    def mock_get_entry(self, args, kwargs):\n        return \"key\", {\"being_calculated\": True}\n\n    def mock_get_entry_by_key(self, key):\n        return \"key\", None\n\n    monkeypatch.setattr(\n        \"cachier.cores.mongo._MongoCore.get_entry\", mock_get_entry\n    )\n    monkeypatch.setattr(\n        \"cachier.cores.mongo._MongoCore.get_entry_by_key\",\n        mock_get_entry_by_key,\n    )\n\n    @cachier(mongetter=_test_mongetter)\n    def _stalled_func():\n        return 1\n\n    res = _stalled_func()\n    assert res == 1\n\n    def mock_get_entry_2(self, args, kwargs):\n        entry = {\n            \"being_calculated\": True,\n            \"value\": 1,\n            \"time\": datetime.datetime.now() - datetime.timedelta(seconds=10),\n        }\n        return \"key\", entry\n\n    monkeypatch.setattr(\n        \"cachier.cores.mongo._MongoCore.get_entry\", mock_get_entry_2\n    )\n\n    stale_after = datetime.timedelta(seconds=1)\n\n    @cachier(mongetter=_test_mongetter, stale_after=stale_after)\n    def _stalled_func_2():\n        \"\"\"Testing stalled function.\"\"\"\n        return 2\n\n    res = _stalled_func_2()\n    assert res == 2\n\n    @cachier(\n        mongetter=_test_mongetter, stale_after=stale_after, next_time=True\n    )\n    def _stalled_func_3():\n        \"\"\"Testing stalled function.\"\"\"\n        return 3\n\n    res = _stalled_func_3()\n    assert res == 1\n\n\n@pytest.mark.mongo\ndef test_callable_hash_param():\n    def _hash_func(args, kwargs):\n        def _hash(obj):\n            if isinstance(obj, pd.core.frame.DataFrame):\n                return hashlib.sha256(\n                    pd.util.hash_pandas_object(obj).values.tobytes()\n                ).hexdigest()\n            return obj\n\n        k_args = tuple(map(_hash, args))\n        k_kwargs = tuple(\n            sorted({k: _hash(v) for k, v in kwargs.items()}.items())\n        )\n        return k_args + k_kwargs\n\n    @cachier(mongetter=_test_mongetter, hash_func=_hash_func)\n    def _params_with_dataframe(*args, **kwargs):\n        \"\"\"Some function.\"\"\"\n        return random()\n\n    _params_with_dataframe.clear_cache()\n\n    df_a = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    df_b = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    value_a = _params_with_dataframe(df_a, 1)\n    value_b = _params_with_dataframe(df_b, 1)\n\n    assert value_a == value_b  # same content --> same key\n\n    value_a = _params_with_dataframe(1, df=df_a)\n    value_b = _params_with_dataframe(1, df=df_b)\n\n    assert value_a == value_b  # same content --> same key\n"
    },
    {
      "path": "python-cachier_cachier/tests/requirements.txt",
      "content": "# todo: add some version range or pinning latest versions\n# tests and coverages\npytest\ncoverage\npytest-cov\nbirch\n# to connect to the test mongodb server\npymongo\ndnspython\npymongo-inmemory\n# to test pandas dataframe as-param hashing with mongodb core\npandas\n# to be able to run `python setup.py checkdocs`\ncollective.checkdocs\npygments\n"
    }
  ],
  "BuggyCode": [
    {
      "path": "python-cachier_cachier/MANIFEST.in",
      "content": "# Manifest syntax https://docs.python.org/2/distutils/sourcedist.html\ngraft wheelhouse\n\nrecursive-include src *.info\ninclude README.rst\ninclude LICENSE\n\nprune __pycache__\nprune tests\n"
    },
    {
      "path": "python-cachier_cachier/README.rst",
      "content": "Cachier\n#######\n\n|PyPI-Status| |Downloads| |PyPI-Versions| |Build-Status| |Codecov| |Codefactor| |LICENCE|\n\nPersistent, stale-free, local and cross-machine caching for Python functions.\n\n.. code-block:: python\n\n  from cachier import cachier\n  import datetime\n\n  @cachier(stale_after=datetime.timedelta(days=3))\n  def foo(arg1, arg2):\n    \"\"\"foo now has a persistent cache, triggering recalculation for values stored more than 3 days.\"\"\"\n    return {'arg1': arg1, 'arg2': arg2}\n\n\n.. role:: python(code)\n  :language: python\n\n.. contents::\n\n.. section-numbering:\n\n\n\nInstallation\n============\n\nInstall ``cachier`` with:\n\n.. code-block:: python\n\n    pip install cachier\n\nFor the latest version supporting Python 2.7 please use:\n\n.. code-block:: python\n\n    pip install 'cachier==1.2.8'\n\nFeatures\n========\n\n* Pure Python.\n* Compatible with Python 3.8+ (Python 2.7 was discontinued in version 1.2.8).\n* Supported and `tested on Linux, OS X and Windows <https://travis-ci.org/shaypal5/cachier>`_.\n* A simple interface.\n* Defining \"shelf life\" for cached values.\n* Local caching using pickle files.\n* Cross-machine caching using MongoDB.\n* Thread-safety.\n\nCachier is **NOT**:\n\n* Meant as a transient cache. Python's @lru_cache is better.\n* Especially fast. It is meant to replace function calls that take more than... a second, say (overhead is around 1 millisecond).\n\nFuture features\n---------------\n\n* S3 core.\n* Multi-core caching.\n* `Cache replacement policies <https://en.wikipedia.org/wiki/Cache_replacement_policies>`_\n\n\nUse\n===\n\nCachier provides a decorator which you can wrap around your functions to give them a persistent cache. The positional and keyword arguments to the wrapped function must be hashable (i.e. Python's immutable built-in objects, not mutable containers). Also, notice that since objects which are instances of user-defined classes are hashable but all compare unequal (their hash value is their id), equal objects across different sessions will not yield identical keys.\n\nSetting up a Cache\n------------------\nYou can add a default, pickle-based, persistent cache to your function - meaning it will last across different Python kernels calling the wrapped function - by decorating it with the ``cachier`` decorator (notice the ``()``!).\n\n.. code-block:: python\n\n  from cachier import cachier\n\n  @cachier()\n  def foo(arg1, arg2):\n    \"\"\"Your function now has a persistent cache mapped by argument values!\"\"\"\n    return {'arg1': arg1, 'arg2': arg2}\n\nClass and object methods can also be cached. Cachier will automatically ignore the `self` parameter when determining the cache key for an object method. **This means that methods will be cached across all instances of an object, which may not be what you want.**\n\n.. code-block:: python\n\n  from cachier import cachier\n\n  class Foo():\n    @staticmethod\n    @cachier()\n    def good_static_usage(arg_1, arg_2):\n      return arg_1 + arg_2\n\n    # Instance method does not depend on object's internal state, so good to cache\n    @cachier()\n    def good_usage_1(self, arg_1, arg_2):\n      return arg_1 + arg_2\n\n    # Instance method is calling external service, probably okay to cache\n    @cachier()\n    def good_usage_2(self, arg_1, arg_2):\n      result = self.call_api(arg_1, arg_2)\n      return result\n\n    # Instance method relies on object attribute, NOT good to cache\n    @cachier()\n    def bad_usage(self, arg_1, arg_2):\n      return arg_1 + arg_2 + self.arg_3\n\n\nResetting a Cache\n-----------------\nThe Cachier wrapper adds a ``clear_cache()`` function to each wrapped function. To reset the cache of the wrapped function simply call this method:\n\n.. code-block:: python\n\n  foo.clear_cache()\n\nGeneral Configuration\n----------------------\n\nGlobal Defaults\n~~~~~~~~~~~~~~~\n\nSettings can be globally configured across all Cachier wrappers through the use of the `set_default_params` function. This function takes the same keyword parameters as the ones defined in the decorator, which can be passed all at once or with multiple calls. Parameters given directly to a decorator take precedence over any values set by this function.\n\nThe following parameters will only be applied to decorators defined after `set_default_params` is called:\n\n*  `hash_func`\n*  `backend`\n*  `mongetter`\n*  `cache_dir`\n*  `pickle_reload`\n*  `separate_files`\n\nThese parameters can be changed at any time and they will apply to all decorators:\n\n*  `allow_none`\n*  `caching_enabled`\n*  `stale_after`\n*  `next_time`\n*  `wait_for_calc_timeout`\n\nThe current defaults can be fetched by calling `get_default_params`.\n\nThreads Limit\n~~~~~~~~~~~~~\n\nTo limit the number of threads Cachier is allowed to spawn, set the ``CACHIER_MAX_WORKERS`` with the desired number. The default is 8, so to enable Cachier to spawn even more threads, you'll have to set a higher limit explicitly.\n\n\nGlobal Enable/Disable\n---------------------\n\nCaching can be turned off across all decorators by calling `disable_caching`, and then re-activated by calling `enable_caching`.\n\nThese functions are convenience wrappers around the `caching_enabled` default setting.\n\n\nCache Shelf Life\n----------------\n\nSetting Shelf Life\n~~~~~~~~~~~~~~~~~~\nYou can set any duration as the shelf life of cached return values of a function by providing a corresponding ``timedelta`` object to the ``stale_after`` parameter:\n\n.. code-block:: python\n\n  import datetime\n\n  @cachier(stale_after=datetime.timedelta(weeks=2))\n  def bar(arg1, arg2):\n    return {'arg1': arg1, 'arg2': arg2}\n\nNow when a cached value matching the given arguments is found the time of its calculation is checked; if more than ``stale_after`` time has since passed, the function will be run again for the same arguments and the new value will be cached and returned.\n\nThis is useful for lengthy calculations that depend on a dynamic data source.\n\nFuzzy Shelf Life\n~~~~~~~~~~~~~~~~\nSometimes you may want your function to trigger a calculation when it encounters a stale result, but still not wait on it if it's not that critical. In that case, you can set ``next_time`` to ``True`` to have your function trigger a recalculation **in a separate thread**, but return the currently cached stale value:\n\n.. code-block:: python\n\n  @cachier(next_time=True)\n\nFurther function calls made while the calculation is being performed will not trigger redundant calculations.\n\n\nWorking with unhashable arguments\n---------------------------------\n\nAs mentioned above, the positional and keyword arguments to the wrapped function must be hashable (i.e. Python's immutable built-in objects, not mutable containers). To get around this limitation the ``hash_func`` parameter of the ``cachier`` decorator can be provided with a callable that gets the args and kwargs from the decorated function and returns a hash key for them.\n\n.. code-block:: python\n\n  def calculate_hash(args, kwds):\n    key = ...  # compute a hash key here based on arguments\n    return key\n\n  @cachier(hash_func=calculate_hash)\n  def calculate_super_complex_stuff(custom_obj):\n    # amazing code goes here\n\nSee here for an example:\n\n`Question: How to work with unhashable arguments <https://github.com/python-cachier/cachier/issues/91>`_\n\n\nPrecaching values\n---------------------------------\n\nIf you want to load a value into the cache without calling the underlying function, this can be done with the `precache_value` function.\n\n.. code-block:: python\n\n  @cachier()\n  def add(arg1, arg2):\n    return arg1 + arg2\n\n  add.precache_value(2, 2, value_to_cache=5)\n\n  result = add(2, 2)\n  print(result)  # prints 5\n\n\nPer-function call arguments\n---------------------------\n\nCachier also accepts several keyword arguments in the calls of the function it wraps rather than in the decorator call, allowing you to modify its behaviour for a specific function call.\n\nIgnore Cache\n~~~~~~~~~~~~\n\nYou can have ``cachier`` ignore any existing cache for a specific function call by passing ``cachier__skip_cache=True`` to the function call. The cache will neither be checked nor updated with the new return value.\n\n.. code-block:: python\n\n  @cachier()\n  def sum(first_num, second_num):\n    return first_num + second_num\n\n  def main():\n    print(sum(5, 3, cachier__skip_cache=True))\n\nOverwrite Cache\n~~~~~~~~~~~~~~~\n\nYou can have ``cachier`` overwrite an existing cache entry - if one exists - for a specific function call by passing ``cachier__overwrite_cache=True`` to the function call. The cache will not be checked but will be updated with the new return value.\n\nVerbose Cache Call\n~~~~~~~~~~~~~~~~~~\n\nYou can have ``cachier`` print out a detailed explanation of the logic of a specific call by passing ``cachier__verbose=True`` to the function call. This can be useful if you are not sure why a certain function result is, or is not, returned.\n\nCache `None` Values\n~~~~~~~~~~~~~~~~~~~\n\nBy default, ``cachier`` does not cache ``None`` values. You can override this behaviour by passing ``allow_none=True`` to the function call.\n\n\nCachier Cores\n=============\n\nPickle Core\n-----------\n\nThe default core for Cachier is pickle based, meaning each function will store its cache is a separate pickle file in the ``~/.cachier`` directory. Naturally, this kind of cache is both machine-specific and user-specific.\n\nYou can configure ``cachier`` to use another directory by providing the ``cache_dir`` parameter with the path to that directory:\n\n.. code-block:: python\n\n  @cachier(cache_dir='~/.temp/.cache')\n\n\nYou can slightly optimise pickle-based caching if you know your code will only be used in a single thread environment by setting:\n\n.. code-block:: python\n\n  @cachier(pickle_reload=False)\n\nThis will prevent reading the cache file on each cache read, speeding things up a bit, while also nullifying inter-thread functionality (the code is still thread safe, but different threads will have different versions of the cache at times, and will sometime make unnecessary function calls).\n\nSetting the optional argument ``separate_files`` to ``True`` will cause the cache to be stored in several files: A file per argument set, per function. This can help if your per-function cache files become too large.\n\n.. code-block:: python\n\n  from cachier import cachier\n\n  @cachier(separate_files=True)\n  def foo(arg1, arg2):\n    \"\"\"Your function now has a persistent cache mapped by argument values, split across several files, per argument set\"\"\"\n    return {'arg1': arg1, 'arg2': arg2}\n\nYou can get the fully qualified path to the directory of cache files used by ``cachier`` (``~/.cachier`` by default) by calling the ``cache_dpath()`` function:\n\n.. code-block:: python\n\n  >>> foo.cache_dpath()\n      \"/home/user/.cachier/\"\n\n\nMongoDB Core\n------------\nYou can set a MongoDB-based cache by assigning ``mongetter`` with a callable that returns a ``pymongo.Collection`` object with writing permissions:\n\n.. code-block:: python\n\n    from pymongo import MongoClient\n\n    def my_mongetter():\n        client = MongoClient(get_cachier_db_auth_uri())\n        db_obj = client['cachier_db']\n        if 'someapp_cachier_db' not in db_obj.list_collection_names():\n            db_obj.create_collection('someapp_cachier_db')\n        return db_obj['someapp_cachier_db']\n\n  @cachier(mongetter=my_mongetter)\n\nThis allows you to have a cross-machine, albeit slower, cache. This functionality requires that the installation of the ``pymongo`` python package.\n\nIn certain cases the MongoDB backend might leave a deadlock behind, blocking all subsequent requests from being processed. If you encounter this issue, supply the ``wait_for_calc_timeout`` with a reasonable number of seconds; calls will then wait at most this number of seconds before triggering a recalculation.\n\n.. code-block:: python\n\n  @cachier(mongetter=False, wait_for_calc_timeout=2)\n\n\nMemory Core\n-----------\n\nYou can set an in-memory cache by assigning the ``backend`` parameter with ``'memory'``:\n\n.. code-block:: python\n\n  @cachier(backend='memory')\n\nNote, however, that ``cachier``'s in-memory core is simple, and has no monitoring or cap on cache size, and can thus lead to memory errors on large return values - it is mainly intended to be used with future multi-core functionality. As a rule, Python's built-in ``lru_cache`` is a much better stand-alone solution.\n\n\nContributing\n============\n\nCurrent maintainers are Shay Palachy Affek (`shay.palachy@gmail.com <mailto:shay.palachy@gmail.com>`_, `@shaypal5 <https://github.com/shaypal5>`_) and Judson Neer (`@lordjabez <https://github.com/lordjabez>`_); You are more than welcome to approach them for help. Contributions are very welcomed! :)\n\nInstalling for development\n--------------------------\n\nClone:\n\n.. code-block:: bash\n\n  git clone git@github.com:python-cachier/cachier.git\n\n\nInstall in development mode with test dependencies:\n\n.. code-block:: bash\n\n  cd cachier\n  pip install -e . -r tests/requirements.txt\n\n\nRunning the tests\n-----------------\n\nTo run the tests, call the ``pytest`` command in the repository's root, or:\n\n.. code-block:: bash\n\n  python -m pytest\n\nTo run only MongoDB core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m mongo\n\nTo run only memory core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m memory\n\nTo run all tests EXCEPT MongoDB core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m \"not mongo\"\n\n\nTo run all tests EXCEPT memory core AND MongoDB core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m \"not (mongo or memory)\"\n\n\nRunning MongoDB tests against a live MongoDB instance\n-----------------------------------------------------\n\n**Note to developers:** By default, all MongoDB tests are run against a mocked MongoDB instance, provided by the ``pymongo_inmemory`` package. To run them against a live MongoDB instance, the ``CACHIER_TEST_VS_DOCKERIZED_MONGO`` environment variable is set to ``True`` in the ``test`` environment of this repository (and additional environment variables are populated with the appropriate credentials), used by the GitHub Action running tests on every commit and pull request.\n\nContributors are not expected to run these tests against a live MongoDB instance when developing, as credentials for the testing instance used will NOT be shared, but rather use the testing against the in-memory MongoDB instance as a good proxy.\n\n**HOWEVER, the tests run against a live MongoDB instance when you submit a PR are the determining tests for deciding whether your code functions correctly against MongoDB.**\n\n\nAdding documentation\n--------------------\n\nThis project is documented using the `numpy docstring conventions`_, which were chosen as they are perhaps the most widely-spread conventions that are both supported by common tools such as Sphinx and result in human-readable docstrings (in my personal opinion, of course). When documenting code you add to this project, please follow `these conventions`_.\n\n.. _`numpy docstring conventions`: https://github.com/numpy/numpy/blob/master/doc/HOWTO_DOCUMENT.rst.txt\n.. _`these conventions`: https://github.com/numpy/numpy/blob/master/doc/HOWTO_DOCUMENT.rst.txt\n\nAdditionally, if you update this ``README.rst`` file, use ``python setup.py checkdocs`` to validate it compiles.\n\n\nCredits\n=======\n\nCreated by `Shay Palachy Affek <https://github.com/shaypal5>`_ (shay.palachy@gmail.com), which currently assists in maintenance.\n\nCurrent lead developer/contributor: `Jirka Borovec <https://github.com/Borda>`_ (`@Borda <https://github.com/Borda>`_ on GitHub).\n\nOther major contributors:\n\n* `Jirka Borovec <https://github.com/Borda>`_ - Arg order independence, args-to-kwargs for less unique keys and numerous development and CI contributions.\n\n* `Judson Neer <https://github.com/lordjabez>`_ - Precaching, method caching support and numerous improvements and bugfixes.\n\n* `cthoyt <https://github.com/cthoyt>`_ - Base memory core implementation.\n\n* `amarczew <https://github.com/amarczew>`_ - The ``hash_func`` kwarg.\n\n* `non-senses <https://github.com/non-senses>`_ - The ``wait_for_calc_timeout`` kwarg.\n\n* `Elad Rapaport <https://github.com/erap129>`_ - Multi-file Pickle core, a.k.a ``separate_files`` (released on ``v1.5.3``).\n\n* `John Didion <https://github.com/jdidion>`_ - Support for pickle-based caching for cases where two identically-named methods of different classes are defined in the same module.\n\nNotable bugfixers:\n\n* `MichaelRazum <https://github.com/MichaelRazum>`_.\n\n* `Eric Ma <https://github.com/ericmjl>`_ - The iNotify bugfix (released on ``v1.5.3``).\n\n* `Ofir <https://github.com/ofirnk>`_ - The iNotify bugfix (released on ``v1.5.3``).\n\n\n\n.. |PyPI-Status| image:: https://img.shields.io/pypi/v/cachier.svg\n  :target: https://pypi.python.org/pypi/cachier\n\n.. |PyPI-Versions| image:: https://img.shields.io/pypi/pyversions/cachier.svg\n   :target: https://pypi.python.org/pypi/cachier\n\n.. |Build-Status| image:: https://github.com/python-cachier/cachier/actions/workflows/test.yml/badge.svg\n  :target: https://github.com/python-cachier/cachier/actions/workflows/test.yml\n\n.. |LICENCE| image:: https://img.shields.io/pypi/l/cachier.svg\n  :target: https://pypi.python.org/pypi/cachier\n\n.. |Codecov| image:: https://codecov.io/github/python-cachier/cachier/coverage.svg?branch=master\n   :target: https://codecov.io/github/python-cachier/cachier?branch=master\n\n.. |Downloads| image:: https://pepy.tech/badge/cachier\n     :target: https://pepy.tech/project/cachier\n     :alt: PePy stats\n\n.. |Codefactor| image:: https://www.codefactor.io/repository/github/python-cachier/cachier/badge?style=plastic\n     :target: https://www.codefactor.io/repository/github/python-cachier/cachier\n     :alt: Codefactor code quality\n\n.. links:\n.. _pymongo: https://api.mongodb.com/python/current/\n.. _watchdog: https://github.com/gorakhargosh/watchdog\n"
    },
    {
      "path": "python-cachier_cachier/LICENSE",
      "content": "MIT License\n\nCopyright (c) 2016 Shay Palachy\n\nPermission is hereby granted, free of charge, to any person obtaining a copy\nof this software and associated documentation files (the \"Software\"), to deal\nin the Software without restriction, including without limitation the rights\nto use, copy, modify, merge, publish, distribute, sublicense, and/or sell\ncopies of the Software, and to permit persons to whom the Software is\nfurnished to do so, subject to the following conditions:\n\nThe above copyright notice and this permission notice shall be included in all\ncopies or substantial portions of the Software.\n\nTHE SOFTWARE IS PROVIDED \"AS IS\", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR\nIMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,\nFITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE\nAUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER\nLIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,\nOUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE\nSOFTWARE.\n"
    },
    {
      "path": "python-cachier_cachier/pyproject.toml",
      "content": "[project]\nname = \"cachier\"\ndescription = \"Persistent, stale-free, local and cross-machine caching for Python functions.\"\nreadme = \"README.rst\"\nkeywords = [\n  \"cache\",\n  \"caching\",\n  \"cross-machine\",\n  \"decorator\",\n  \"local\",\n  \"memoization\",\n  \"mongo\",\n  \"persistent\",\n]\nlicense = {file = \"LICENSE\"}\nauthors = [\n   {name = \"Shay Palachy Affek\", email = 'shay.palachy@gmail.com'},\n]\nclassifiers = [\n  'Development Status :: 4 - Beta',\n  'Intended Audience :: Developers',\n  'License :: OSI Approved :: MIT License',\n  'Programming Language :: Python',\n  \"Programming Language :: Python :: 3 :: Only\",\n  \"Programming Language :: Python :: 3.8\",\n  \"Programming Language :: Python :: 3.9\",\n  \"Programming Language :: Python :: 3.10\",\n  \"Programming Language :: Python :: 3.11\",\n  \"Programming Language :: Python :: 3.12\",\n  \"Topic :: Other/Nonlisted Topic\",\n  \"Topic :: Software Development :: Libraries\",\n  \"Topic :: Software Development :: Libraries :: Python Modules\",\n  \"Topic :: Utilities\",\n]\ndynamic = [\n  \"version\",\n]\ndependencies = [\n  \"portalocker>=2.3.2\",\n  \"watchdog>=2.3.1\",\n]\n[project.urls]\nSource = \"https://github.com/python-cachier/cachier\"\n[project.scripts]\ncachier = \"cachier.__naim__:cli\"\n\n[tool.setuptools]\ninclude-package-data = true\n\n[tool.setuptools.dynamic]\nversion = {attr = \"cachier._version.__version__\"}\n\n[tool.setuptools.packages.find]\nwhere = [\"src\"]  # list of folders that contain the packages ([\".\"] by default)\ninclude = [\"cachier*\"]  # package names should match these glob patterns ([\"*\"] by default)\nnamespaces = false  # to disable scanning PEP 420 namespaces (true by default)\n\n[build-system]\nrequires = [\n  \"setuptools\",\n  \"wheel\",\n]\n\n[tool.pytest.ini_options]\ntestpaths = [\"cachier\", \"tests\"]\nnorecursedirs=[\"dist\", \"build\"]\naddopts = [\n    \"--color=yes\",\n    \"--cov=cachier\",\n    \"--cov-report=term\",\n    \"--cov-report=xml:cov.xml\",\n    \"-r a\",\n    \"-v\",\n    \"-s\",\n]\nmarkers = [\n    \"mongo: test the MongoDB core\",\n    \"memory: test the memory core\",\n    \"pickle: test the pickle core\"\n]\n\n[tool.docformatter]\nrecursive = true\n# some docstring start with r\"\"\"\nwrap-summaries = 79\nwrap-descriptions = 79\nblank = true\n\n[tool.ruff]\ntarget-version = \"py38\"\nline-length = 79\n# Enable Pyflakes `E` and `F` codes by default.\nlint.select = [\n    \"E\",\n    \"W\", # see: https://pypi.org/project/pycodestyle\n    \"F\", # see: https://pypi.org/project/pyflakes\n    \"I\", #see: https://pypi.org/project/isort/\n    \"D\", # see: https://pypi.org/project/pydocstyle\n    #\"N\", # see: https://pypi.org/project/pep8-naming\n    \"S\", # see: https://pypi.org/project/flake8-bandit\n    \"SIM\",\n    \"RUF100\" # alternative to yesqa\n]\nlint.extend-select = [\n    \"A\",    # see: https://pypi.org/project/flake8-builtins\n    \"B\",    # see: https://pypi.org/project/flake8-bugbear\n    \"C4\",   # see: https://pypi.org/project/flake8-comprehensions\n]\nlint.ignore = [\n    \"E203\",\n    \"C901\",\n]\n# Exclude a variety of commonly ignored directories.\nexclude = [\n    \".eggs\",\n    \".git\",\n    \".ruff_cache\",\n    \"__pypackages__\",\n    \"_build\",\n    \"build\",\n    \"dist\",\n]\nlint.ignore-init-module-imports = true\nlint.unfixable = [\"F401\"]\n\n[tool.ruff.lint.per-file-ignores]\n\"src/**/__init__.py\" = [\"D104\"]\n\"src/cachier/config.py\" = [\"D100\"]\n\"tests/**\" = [\n  \"S101\", \"S311\", \"S105\", \"S603\",\n  \"D100\", \"D101\", \"D103\", \"D104\", \"D401\"\n]\n\n#[tool.ruff.pydocstyle]\n## Use Google-style docstrings.\n#convention = \"google\"\n\n#[tool.ruff.pycodestyle]\n#ignore-overlong-task-comments = true\n\n[tool.ruff.lint.mccabe]\n# Unlike Flake8, default to a complexity level of 10.\nmax-complexity = 10\n\n[tool.coverage.run]\nbranch = true\ndynamic_context = \"test_function\"\nomit = [\n    \"tests/*\",\n    \"cachier/_version.py\",\n    \"cachier/__init__.py\",\n    \"**/scripts/**\",\n]\n[tool.coverage.report]\nshow_missing = true\n# Regexes for lines to exclude from consideration\nexclude_lines = [\n    \"pragma: no cover\", # Have to re-enable the standard pragma\n    \"raise NotImplementedError\", # Don't complain if tests don't hit defensive assertion code:\n    \"if TYPE_CHECKING:\", # Is only true when running mypy, not tests\n]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/__init__.py",
      "content": "from ._version import *  # noqa: F403\nfrom .config import (\n    disable_caching,\n    enable_caching,\n    get_default_params,\n    set_default_params,\n)\nfrom .core import cachier\n\n__all__ = [\n    \"cachier\",\n    \"set_default_params\",\n    \"get_default_params\",\n    \"enable_caching\",\n    \"disable_caching\",\n]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/config.py",
      "content": "import datetime\nimport hashlib\nimport os\nimport pickle\nfrom typing import Optional, TypedDict, Union\n\nfrom ._types import Backend, HashFunc, Mongetter\n\n\ndef _default_hash_func(args, kwds):\n    # Sort the kwargs to ensure consistent ordering\n    sorted_kwargs = sorted(kwds.items())\n    # Serialize args and sorted_kwargs using pickle or similar\n    serialized = pickle.dumps((args, sorted_kwargs))\n    # Create a hash of the serialized data\n    return hashlib.sha256(serialized).hexdigest()\n\n\nclass Params(TypedDict):\n    \"\"\"Type definition for cachier parameters.\"\"\"\n\n    caching_enabled: bool\n    hash_func: HashFunc\n    backend: Backend\n    mongetter: Optional[Mongetter]\n    stale_after: datetime.timedelta\n    next_time: bool\n    cache_dir: Union[str, os.PathLike]\n    pickle_reload: bool\n    separate_files: bool\n    wait_for_calc_timeout: int\n    allow_none: bool\n\n\n_default_params: Params = {\n    \"caching_enabled\": True,\n    \"hash_func\": _default_hash_func,\n    \"backend\": \"pickle\",\n    \"mongetter\": None,\n    \"stale_after\": datetime.timedelta.max,\n    \"next_time\": False,\n    \"cache_dir\": \"~/.cachier/\",\n    \"pickle_reload\": True,\n    \"separate_files\": False,\n    \"wait_for_calc_timeout\": 0,\n    \"allow_none\": False,\n}\n\n\ndef _update_with_defaults(\n    param, name: str, func_kwargs: Optional[dict] = None\n):\n    import cachier\n\n    if func_kwargs:\n        kw_name = f\"cachier__{name}\"\n        if kw_name in func_kwargs:\n            return func_kwargs.pop(kw_name)\n    if param is None:\n        return cachier.config._default_params[name]\n    return param\n\n\ndef set_default_params(**params):\n    \"\"\"Configure global parameters applicable to all memoized functions.\n\n    This function takes the same keyword parameters as the ones defined in the\n    decorator, which can be passed all at once or with multiple calls.\n    Parameters given directly to a decorator take precedence over any values\n    set by this function.\n\n    Only 'stale_after', 'next_time', and 'wait_for_calc_timeout' can be changed\n    after the memoization decorator has been applied. Other parameters will\n    only have an effect on decorators applied after this function is run.\n\n    \"\"\"\n    import cachier\n\n    valid_params = (\n        p for p in params.items() if p[0] in cachier.config._default_params\n    )\n    _default_params.update(valid_params)\n\n\ndef get_default_params():\n    \"\"\"Get current set of default parameters.\"\"\"\n    import cachier\n\n    return cachier.config._default_params\n\n\ndef enable_caching():\n    \"\"\"Enable caching globally.\"\"\"\n    import cachier\n\n    cachier.config._default_params[\"caching_enabled\"] = True\n\n\ndef disable_caching():\n    \"\"\"Disable caching globally.\"\"\"\n    import cachier\n\n    cachier.config._default_params[\"caching_enabled\"] = False\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/_version.py",
      "content": "# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2024, Jirka Borovec <***@gmail.com>\n\nimport os\n\n_PATH_HERE = os.path.dirname(__file__)\n_PATH_VERSION = os.path.join(_PATH_HERE, \"version.info\")\n_RELEASING_PROCESS = os.getenv(\"RELEASING_PROCESS\", \"0\") == \"1\"\n\nwith open(_PATH_VERSION) as fopen:\n    __version__ = fopen.read().strip()\n\n\ndef _get_git_sha() -> str:\n    from subprocess import DEVNULL, check_output\n\n    out = check_output([\"git\", \"rev-parse\", \"--short\", \"HEAD\"], stderr=DEVNULL)  # noqa: S603, S607\n    return out.decode(\"utf-8\").strip()\n\n\nif not _RELEASING_PROCESS:\n    try:\n        sha_short = _get_git_sha()\n        # print(f\"Version enriched with git commit hash: {__version__}.\")\n    except Exception:\n        # print(\"Failed to get the git commit hash,\"\n        #       f\" falling back to base version {__version__}.\")\n        sha_short = \"\"\n    __version__ += f\".dev+{sha_short}\" if sha_short else \".dev\"\n\n\n__all__ = [\"__version__\"]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/version.info",
      "content": "3.0.0\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/py.typed",
      "content": ""
    },
    {
      "path": "python-cachier_cachier/src/cachier/_types.py",
      "content": "from typing import TYPE_CHECKING, Callable, Literal\n\nif TYPE_CHECKING:\n    import pymongo.collection\n\n\nHashFunc = Callable[..., str]\nMongetter = Callable[[], \"pymongo.collection.Collection\"]\nBackend = Literal[\"pickle\", \"mongo\", \"memory\"]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/__main__.py",
      "content": "\"\"\"A command-line interface for cachier.\"\"\"\n\nimport click\n\nfrom cachier.core import _set_max_workers\n\n\n@click.group()\ndef cli():\n    \"\"\"A command-line interface for cachier.\"\"\"  # noqa: D401\n\n\n@cli.command(\"Limits the number of worker threads used by cachier.\")\n@click.argument(\"max_workers\", nargs=1, type=int)\ndef set_max_workers(max_workers):\n    \"\"\"Limits the number of worker threads used by cachier.\"\"\"\n    _set_max_workers(max_workers)\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/core.py",
      "content": "\"\"\"Persistent, stale-free memoization decorators for Python.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\nimport datetime\nimport inspect\nimport os\nimport warnings\nfrom collections import OrderedDict\nfrom concurrent.futures import ThreadPoolExecutor\nfrom functools import wraps\nfrom typing import Optional, Union\nfrom warnings import warn\n\nfrom .config import (\n    Backend,\n    HashFunc,\n    Mongetter,\n    _default_params,\n    _update_with_defaults,\n)\nfrom .cores.base import RecalculationNeeded, _BaseCore\nfrom .cores.memory import _MemoryCore\nfrom .cores.mongo import _MongoCore\nfrom .cores.pickle import _PickleCore\n\nMAX_WORKERS_ENVAR_NAME = \"CACHIER_MAX_WORKERS\"\nDEFAULT_MAX_WORKERS = 8\n\n\ndef _max_workers():\n    return int(os.environ.get(MAX_WORKERS_ENVAR_NAME, DEFAULT_MAX_WORKERS))\n\n\ndef _set_max_workers(max_workers):\n    os.environ[MAX_WORKERS_ENVAR_NAME] = str(max_workers)\n    _get_executor(True)\n\n\ndef _get_executor(reset=False):\n    if reset or not hasattr(_get_executor, \"executor\"):\n        _get_executor.executor = ThreadPoolExecutor(_max_workers())\n    return _get_executor.executor\n\n\ndef _function_thread(core, key, func, args, kwds):\n    try:\n        func_res = func(*args, **kwds)\n        core.set_entry(key, func_res)\n    except BaseException as exc:\n        print(f\"Function call failed with the following exception:\\n{exc}\")\n\n\ndef _calc_entry(core, key, func, args, kwds):\n    try:\n        core.mark_entry_being_calculated(key)\n        # _get_executor().submit(core.mark_entry_being_calculated, key)\n        func_res = func(*args, **kwds)\n        core.set_entry(key, func_res)\n        # _get_executor().submit(core.set_entry, key, func_res)\n        return func_res\n    finally:\n        core.mark_entry_not_calculated(key)\n\n\ndef _convert_args_kwargs(\n    func, _is_method: bool, args: tuple, kwds: dict\n) -> dict:\n    \"\"\"Convert mix of positional and keyword arguments to aggregated kwargs.\"\"\"\n    # unwrap if the function is functools.partial\n    if hasattr(func, \"func\"):\n        args = func.args + args\n        kwds.update({k: v for k, v in func.keywords.items() if k not in kwds})\n        func = func.func\n    func_params = list(inspect.signature(func).parameters)\n    args_as_kw = dict(\n        zip(func_params[1:], args[1:])\n        if _is_method\n        else zip(func_params, args)\n    )\n    # init with default values\n    kwargs = {\n        k: v.default\n        for k, v in inspect.signature(func).parameters.items()\n        if v.default is not inspect.Parameter.empty\n    }\n    # merge args expanded as kwargs and the original kwds\n    kwargs.update(dict(**args_as_kw, **kwds))\n    return OrderedDict(sorted(kwargs.items()))\n\n\ndef _pop_kwds_with_deprecation(kwds, name: str, default_value: bool):\n    if name in kwds:\n        warnings.warn(\n            f\"`{name}` is deprecated and will be removed in a future release,\"\n            \" use `cachier__` alternative instead.\",\n            DeprecationWarning,\n            stacklevel=2,\n        )\n    return kwds.pop(name, default_value)\n\n\ndef cachier(\n    hash_func: Optional[HashFunc] = None,\n    hash_params: Optional[HashFunc] = None,\n    backend: Optional[Backend] = None,\n    mongetter: Optional[Mongetter] = None,\n    stale_after: Optional[datetime.timedelta] = None,\n    next_time: Optional[bool] = None,\n    cache_dir: Optional[Union[str, os.PathLike]] = None,\n    pickle_reload: Optional[bool] = None,\n    separate_files: Optional[bool] = None,\n    wait_for_calc_timeout: Optional[int] = None,\n    allow_none: Optional[bool] = None,\n):\n    \"\"\"Wrap as a persistent, stale-free memoization decorator.\n\n    The positional and keyword arguments to the wrapped function must be\n    hashable (i.e. Python's immutable built-in objects, not mutable\n    containers). Also, notice that since objects which are instances of\n    user-defined classes are hashable but all compare unequal (their hash\n    value is their id), equal objects across different sessions will not yield\n    identical keys.\n\n    Arguments:\n    ---------\n    hash_func : callable, optional\n        A callable that gets the args and kwargs from the decorated function\n        and returns a hash key for them. This parameter can be used to enable\n        the use of cachier with functions that get arguments that are not\n        automatically hashable by Python.\n    hash_params : callable, optional\n    backend : str, optional\n        The name of the backend to use. Valid options currently include\n        'pickle', 'mongo' and 'memory'. If not provided, defaults to\n        'pickle' unless the 'mongetter' argument is passed, in which\n        case the mongo backend is automatically selected.\n    mongetter : callable, optional\n        A callable that takes no arguments and returns a pymongo.Collection\n        object with writing permissions. If unset a local pickle cache is used\n        instead.\n    stale_after : datetime.timedelta, optional\n        The time delta after which a cached result is considered stale. Calls\n        made after the result goes stale will trigger a recalculation of the\n        result, but whether a stale or fresh result will be returned is\n        determined by the optional next_time argument.\n    next_time : bool, optional\n        If set to True, a stale result will be returned when finding one, not\n        waiting for the calculation of the fresh result to return. Defaults to\n        False.\n    cache_dir : str, optional\n        A fully qualified path to a file directory to be used for cache files.\n        The running process must have running permissions to this folder. If\n        not provided, a default directory at `~/.cachier/` is used.\n    pickle_reload : bool, optional\n        If set to True, in-memory cache will be reloaded on each cache read,\n        enabling different threads to share cache. Should be set to False for\n        faster reads in single-thread programs. Defaults to True.\n    separate_files: bool, default False, for Pickle cores only\n        Instead of a single cache file per-function, each function's cache is\n        split between several files, one for each argument set. This can help\n        if you per-function cache files become too large.\n    wait_for_calc_timeout: int, optional, for MongoDB only\n        The maximum time to wait for an ongoing calculation. When a\n        process started to calculate the value setting being_calculated to\n        True, any process trying to read the same entry will wait a maximum of\n        seconds specified in this parameter. 0 means wait forever.\n        Once the timeout expires the calculation will be triggered.\n    allow_none: bool, optional\n        Allows storing None values in the cache. If False, functions returning\n        None will not be cached and are recalculated every call.\n\n    \"\"\"\n    # Check for deprecated parameters\n    if hash_params is not None:\n        message = (\n            \"hash_params will be removed in a future release, \"\n            \"please use hash_func instead\"\n        )\n        warn(message, DeprecationWarning, stacklevel=2)\n        hash_func = hash_params\n    # Update parameters with defaults if input is None\n    backend = _update_with_defaults(backend, \"backend\")\n    mongetter = _update_with_defaults(mongetter, \"mongetter\")\n    # Override the backend parameter if a mongetter is provided.\n    if callable(mongetter):\n        backend = \"mongo\"\n    core: _BaseCore\n    if backend == \"pickle\":\n        core = _PickleCore(\n            hash_func=hash_func,\n            pickle_reload=pickle_reload,\n            cache_dir=cache_dir,\n            separate_files=separate_files,\n            wait_for_calc_timeout=wait_for_calc_timeout,\n        )\n    elif backend == \"mongo\":\n        core = _MongoCore(\n            hash_func=hash_func,\n            mongetter=mongetter,\n            wait_for_calc_timeout=wait_for_calc_timeout,\n        )\n    elif backend == \"memory\":\n        core = _MemoryCore(\n            hash_func=hash_func, wait_for_calc_timeout=wait_for_calc_timeout\n        )\n    else:\n        raise ValueError(\"specified an invalid core: %s\" % backend)\n\n    def _cachier_decorator(func):\n        core.set_func(func)\n\n        @wraps(func)\n        def func_wrapper(*args, **kwds):\n            nonlocal allow_none\n            _allow_none = _update_with_defaults(allow_none, \"allow_none\", kwds)\n            # print('Inside general wrapper for {}.'.format(func.__name__))\n            ignore_cache = _pop_kwds_with_deprecation(\n                kwds, \"ignore_cache\", False\n            )\n            overwrite_cache = _pop_kwds_with_deprecation(\n                kwds, \"overwrite_cache\", False\n            )\n            verbose = _pop_kwds_with_deprecation(kwds, \"verbose_cache\", False)\n            ignore_cache = kwds.pop(\"cachier__skip_cache\", ignore_cache)\n            overwrite_cache = kwds.pop(\n                \"cachier__overwrite_cache\", overwrite_cache\n            )\n            verbose = kwds.pop(\"cachier__verbose\", verbose)\n            _stale_after = _update_with_defaults(\n                stale_after, \"stale_after\", kwds\n            )\n            _next_time = _update_with_defaults(next_time, \"next_time\", kwds)\n            # merge args expanded as kwargs and the original kwds\n            kwargs = _convert_args_kwargs(\n                func, _is_method=core.func_is_method, args=args, kwds=kwds\n            )\n\n            _print = lambda x: None  # noqa: E731\n            if verbose:\n                _print = print\n            if ignore_cache or not _default_params[\"caching_enabled\"]:\n                return func(**kwargs)\n            key, entry = core.get_entry((), kwargs)\n            if overwrite_cache:\n                return _calc_entry(core, key, func, args, kwds)\n            if entry is None:\n                _print(\"No entry found. No current calc. Calling like a boss.\")\n                return _calc_entry(core, key, func, args, kwds)\n            _print(\"Entry found.\")\n            if _allow_none or entry.get(\"value\", None) is not None:\n                _print(\"Cached result found.\")\n                now = datetime.datetime.now()\n                if now - entry[\"time\"] <= _stale_after:\n                    _print(\"And it is fresh!\")\n                    return entry[\"value\"]\n                _print(\"But it is stale... :(\")\n                if entry[\"being_calculated\"]:\n                    if _next_time:\n                        _print(\"Returning stale.\")\n                        return entry[\"value\"]  # return stale val\n                    _print(\"Already calc. Waiting on change.\")\n                    try:\n                        return core.wait_on_entry_calc(key)\n                    except RecalculationNeeded:\n                        return _calc_entry(core, key, func, args, kwds)\n                if _next_time:\n                    _print(\"Async calc and return stale\")\n                    try:\n                        core.mark_entry_being_calculated(key)\n                        _get_executor().submit(\n                            _function_thread, core, key, func, args, kwds\n                        )\n                    finally:\n                        core.mark_entry_not_calculated(key)\n                    return entry[\"value\"]\n                _print(\"Calling decorated function and waiting\")\n                return _calc_entry(core, key, func, args, kwds)\n            if entry[\"being_calculated\"]:\n                _print(\"No value but being calculated. Waiting.\")\n                try:\n                    return core.wait_on_entry_calc(key)\n                except RecalculationNeeded:\n                    return _calc_entry(core, key, func, args, kwds)\n            _print(\"No entry found. No current calc. Calling like a boss.\")\n            return _calc_entry(core, key, func, args, kwds)\n\n        def _clear_cache():\n            \"\"\"Clear the cache.\"\"\"\n            core.clear_cache()\n\n        def _clear_being_calculated():\n            \"\"\"Mark all entries in this cache as not being calculated.\"\"\"\n            core.clear_being_calculated()\n\n        def _cache_dpath():\n            \"\"\"Return the path to the cache dir, if exists; None if not.\"\"\"\n            return getattr(core, \"cache_dir\", None)\n\n        def _precache_value(*args, value_to_cache, **kwds):  # noqa: D417\n            \"\"\"Add an initial value to the cache.\n\n            Arguments:\n            ---------\n            value_to_cache : any\n                entry to be written into the cache\n\n            \"\"\"\n            # merge args expanded as kwargs and the original kwds\n            kwargs = _convert_args_kwargs(\n                func, _is_method=core.func_is_method, args=args, kwds=kwds\n            )\n            return core.precache_value((), kwargs, value_to_cache)\n\n        func_wrapper.clear_cache = _clear_cache\n        func_wrapper.clear_being_calculated = _clear_being_calculated\n        func_wrapper.cache_dpath = _cache_dpath\n        func_wrapper.precache_value = _precache_value\n        return func_wrapper\n\n    return _cachier_decorator\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/__init__.py",
      "content": ""
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/base.py",
      "content": "\"\"\"Defines the interface of a cachier caching core.\"\"\"\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\nimport abc  # for the _BaseCore abstract base class\nimport inspect\nimport threading\nfrom typing import Callable\n\nfrom .._types import HashFunc\nfrom ..config import _update_with_defaults\n\n\nclass RecalculationNeeded(Exception):\n    \"\"\"Exception raised when a recalculation is needed.\"\"\"\n\n    pass\n\n\ndef _get_func_str(func: Callable) -> str:\n    return f\".{func.__module__}.{func.__name__}\"\n\n\nclass _BaseCore:\n    __metaclass__ = abc.ABCMeta\n\n    def __init__(self, hash_func: HashFunc, wait_for_calc_timeout: int):\n        self.hash_func = _update_with_defaults(hash_func, \"hash_func\")\n        self.wait_for_calc_timeout = wait_for_calc_timeout\n        self.lock = threading.RLock()\n\n    def set_func(self, func):\n        \"\"\"Set the function this core will use.\n\n        This has to be set before any method is called. Also determine if the\n        function is an object method.\n\n        \"\"\"\n        # unwrap if the function is functools.partial\n        if hasattr(func, \"func\"):\n            func = func.func\n        func_params = list(inspect.signature(func).parameters)\n        self.func_is_method = func_params and func_params[0] == \"self\"\n        self.func = func\n\n    def get_key(self, args, kwds):\n        \"\"\"Return a unique key based on the arguments provided.\"\"\"\n        return self.hash_func(args, kwds)\n\n    def get_entry(self, args, kwds):\n        \"\"\"Get entry based on given arguments.\n\n        Return the result mapped to the given arguments in this core's cache,\n        if such a mapping exists.\n\n        \"\"\"\n        key = self.get_key(args, kwds)\n        return self.get_entry_by_key(key)\n\n    def precache_value(self, args, kwds, value_to_cache):\n        \"\"\"Write a precomputed value into the cache.\"\"\"\n        key = self.get_key(args, kwds)\n        self.set_entry(key, value_to_cache)\n        return value_to_cache\n\n    def check_calc_timeout(self, time_spent):\n        \"\"\"Raise an exception if a recalculation is needed.\"\"\"\n        calc_timeout = _update_with_defaults(\n            self.wait_for_calc_timeout, \"wait_for_calc_timeout\"\n        )\n        if calc_timeout > 0 and (time_spent >= calc_timeout):\n            raise RecalculationNeeded()\n\n    @abc.abstractmethod\n    def get_entry_by_key(self, key):\n        \"\"\"Get entry based on given key.\n\n        Return the result mapped to the given key in this core's cache, if such\n        a mapping exists.\n\n        \"\"\"\n\n    @abc.abstractmethod\n    def set_entry(self, key, func_res):\n        \"\"\"Map the given result to the given key in this core's cache.\"\"\"\n\n    @abc.abstractmethod\n    def mark_entry_being_calculated(self, key):\n        \"\"\"Mark the entry mapped by the given key as being calculated.\"\"\"\n\n    @abc.abstractmethod\n    def mark_entry_not_calculated(self, key):\n        \"\"\"Mark the entry mapped by the given key as not being calculated.\"\"\"\n\n    @abc.abstractmethod\n    def wait_on_entry_calc(self, key):\n        \"\"\"Wait on the entry with keys being calculated and returns result.\"\"\"\n\n    @abc.abstractmethod\n    def clear_cache(self):\n        \"\"\"Clear the cache of this core.\"\"\"\n\n    @abc.abstractmethod\n    def clear_being_calculated(self):\n        \"\"\"Mark all entries in this cache as not being calculated.\"\"\"\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/pickle.py",
      "content": "\"\"\"A pickle-based caching core for cachier.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\nimport os\nimport pickle  # for local caching\nfrom contextlib import suppress\nfrom datetime import datetime\n\nimport portalocker  # to lock on pickle cache IO\nfrom watchdog.events import PatternMatchingEventHandler\nfrom watchdog.observers import Observer\n\nfrom .._types import HashFunc\nfrom ..config import _update_with_defaults\n\n# Alternative:  https://github.com/WoLpH/portalocker\nfrom .base import _BaseCore\n\n\nclass _PickleCore(_BaseCore):\n    \"\"\"The pickle core class for cachier.\"\"\"\n\n    class CacheChangeHandler(PatternMatchingEventHandler):\n        \"\"\"Handles cache-file modification events.\"\"\"\n\n        def __init__(self, filename, core, key):\n            PatternMatchingEventHandler.__init__(\n                self,\n                patterns=[\"*\" + filename],\n                ignore_patterns=None,\n                ignore_directories=True,\n                case_sensitive=False,\n            )\n            self.core = core\n            self.key = key\n            self.observer = None\n            self.value = None\n\n        def inject_observer(self, observer):\n            \"\"\"Inject the observer running this handler.\"\"\"\n            self.observer = observer\n\n        def _check_calculation(self):\n            # print('checking calc')\n            entry = self.core.get_entry_by_key(self.key, True)[1]\n            # print(self.key)\n            # print(entry)\n            try:\n                if not entry[\"being_calculated\"]:\n                    # print('stopping observer!')\n                    self.value = entry[\"value\"]\n                    self.observer.stop()\n                # else:\n                # print('NOT stopping observer... :(')\n            except TypeError:\n                self.value = None\n                self.observer.stop()\n\n        def on_created(self, event):\n            \"\"\"A Watchdog Event Handler method.\"\"\"  # noqa: D401\n            self._check_calculation()  # pragma: no cover\n\n        def on_modified(self, event):\n            \"\"\"A Watchdog Event Handler method.\"\"\"  # noqa: D401\n            self._check_calculation()\n\n    def __init__(\n        self,\n        hash_func: HashFunc,\n        pickle_reload: bool,\n        cache_dir: str,\n        separate_files: bool,\n        wait_for_calc_timeout: int,\n    ):\n        super().__init__(hash_func, wait_for_calc_timeout)\n        self.cache = None\n        self.reload = _update_with_defaults(pickle_reload, \"pickle_reload\")\n        self.cache_dir = os.path.expanduser(\n            _update_with_defaults(cache_dir, \"cache_dir\")\n        )\n        self.separate_files = _update_with_defaults(\n            separate_files, \"separate_files\"\n        )\n\n    @property\n    def cache_fname(self) -> str:\n        fname = f\".{self.func.__module__}.{self.func.__qualname__}\"\n        return fname.replace(\"<\", \"_\").replace(\">\", \"_\")\n\n    @property\n    def cache_fpath(self) -> str:\n        os.makedirs(self.cache_dir, exist_ok=True)\n        return os.path.abspath(\n            os.path.join(os.path.realpath(self.cache_dir), self.cache_fname)\n        )\n\n    def _reload_cache(self):\n        with self.lock:\n            try:\n                with portalocker.Lock(\n                    self.cache_fpath, mode=\"rb\"\n                ) as cache_file:\n                    self.cache = pickle.load(cache_file)  # noqa: S301\n            except (FileNotFoundError, EOFError):\n                self.cache = {}\n\n    def _get_cache(self):\n        with self.lock:\n            if not self.cache:\n                self._reload_cache()\n            return self.cache\n\n    def _get_cache_by_key(self, key=None, hash_str=None):\n        fpath = self.cache_fpath\n        fpath += f\"_{hash_str or key}\"\n        try:\n            with portalocker.Lock(fpath, mode=\"rb\") as cache_file:\n                return pickle.load(cache_file)  # noqa: S301\n        except (FileNotFoundError, EOFError):\n            return None\n\n    def _clear_all_cache_files(self):\n        path, name = os.path.split(self.cache_fpath)\n        for subpath in os.listdir(path):\n            if subpath.startswith(f\"{name}_\"):\n                os.remove(os.path.join(path, subpath))\n\n    def _clear_being_calculated_all_cache_files(self):\n        path, name = os.path.split(self.cache_fpath)\n        for subpath in os.listdir(path):\n            if subpath.startswith(name):\n                entry = self._get_cache_by_key(hash_str=subpath.split(\"_\")[-1])\n                if entry is not None:\n                    entry[\"being_calculated\"] = False\n                    self._save_cache(entry, hash_str=subpath.split(\"_\")[-1])\n\n    def _save_cache(self, cache, key=None, hash_str=None):\n        fpath = self.cache_fpath\n        if key is not None:\n            fpath += f\"_{key}\"\n        elif hash_str is not None:\n            fpath += f\"_{hash_str}\"\n        with self.lock:\n            self.cache = cache\n            with portalocker.Lock(fpath, mode=\"wb\") as cache_file:\n                pickle.dump(cache, cache_file, protocol=4)\n            if key is None:\n                self._reload_cache()\n\n    def get_entry_by_key(self, key, reload=False):\n        with self.lock:\n            if self.separate_files:\n                return key, self._get_cache_by_key(key)\n            if self.reload or reload:\n                self._reload_cache()\n            return key, self._get_cache().get(key, None)\n\n    def set_entry(self, key, func_res):\n        key_data = {\n            \"value\": func_res,\n            \"time\": datetime.now(),\n            \"stale\": False,\n            \"being_calculated\": False,\n        }\n        if self.separate_files:\n            self._save_cache(key_data, key)\n            return  # pragma: no cover\n\n        with self.lock:\n            cache = self._get_cache()\n            cache[key] = key_data\n            self._save_cache(cache)\n\n    def mark_entry_being_calculated_separate_files(self, key):\n        self._save_cache(\n            {\n                \"value\": None,\n                \"time\": datetime.now(),\n                \"stale\": False,\n                \"being_calculated\": True,\n            },\n            key=key,\n        )\n\n    def mark_entry_not_calculated_separate_files(self, key):\n        _, entry = self.get_entry_by_key(key)\n        entry[\"being_calculated\"] = False\n        self._save_cache(entry, key=key)\n\n    def mark_entry_being_calculated(self, key):\n        if self.separate_files:\n            self.mark_entry_being_calculated_separate_files(key)\n            return  # pragma: no cover\n\n        with self.lock:\n            cache = self._get_cache()\n            try:\n                cache[key][\"being_calculated\"] = True\n            except KeyError:\n                cache[key] = {\n                    \"value\": None,\n                    \"time\": datetime.now(),\n                    \"stale\": False,\n                    \"being_calculated\": True,\n                }\n            self._save_cache(cache)\n\n    def mark_entry_not_calculated(self, key):\n        if self.separate_files:\n            self.mark_entry_not_calculated_separate_files(key)\n        with self.lock:\n            cache = self._get_cache()\n            # that's ok, we don't need an entry in that case\n            with suppress(KeyError):\n                cache[key][\"being_calculated\"] = False\n                self._save_cache(cache)\n\n    def wait_on_entry_calc(self, key):\n        if self.separate_files:\n            entry = self._get_cache_by_key(key)\n            filename = f\"{self.cache_fname}_{key}\"\n        else:\n            with self.lock:\n                self._reload_cache()\n                entry = self._get_cache()[key]\n            filename = self.cache_fname\n        if not entry[\"being_calculated\"]:\n            return entry[\"value\"]\n        event_handler = _PickleCore.CacheChangeHandler(\n            filename=filename, core=self, key=key\n        )\n        observer = Observer()\n        event_handler.inject_observer(observer)\n        observer.schedule(event_handler, path=self.cache_dir, recursive=True)\n        observer.start()\n        time_spent = 0\n        while observer.is_alive():\n            observer.join(timeout=1.0)\n            time_spent += 1\n            self.check_calc_timeout(time_spent)\n        return event_handler.value\n\n    def clear_cache(self):\n        if self.separate_files:\n            self._clear_all_cache_files()\n        else:\n            self._save_cache({})\n\n    def clear_being_calculated(self):\n        if self.separate_files:\n            self._clear_being_calculated_all_cache_files()\n            return  # pragma: no cover\n\n        with self.lock:\n            cache = self._get_cache()\n            for key in cache:\n                cache[key][\"being_calculated\"] = False\n            self._save_cache(cache)\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/memory.py",
      "content": "\"\"\"A memory-based caching core for cachier.\"\"\"\n\nimport threading\nfrom datetime import datetime\n\nfrom .._types import HashFunc\nfrom .base import _BaseCore, _get_func_str\n\n\nclass _MemoryCore(_BaseCore):\n    \"\"\"The memory core class for cachier.\"\"\"\n\n    def __init__(self, hash_func: HashFunc, wait_for_calc_timeout: int):\n        super().__init__(hash_func, wait_for_calc_timeout)\n        self.cache = {}\n\n    def _hash_func_key(self, key):\n        return f\"{_get_func_str(self.func)}:{key}\"\n\n    def get_entry_by_key(self, key, reload=False):\n        with self.lock:\n            return key, self.cache.get(self._hash_func_key(key), None)\n\n    def set_entry(self, key, func_res):\n        with self.lock:\n            try:\n                # we need to retain the existing condition so that\n                # mark_entry_not_calculated can notify all possibly-waiting\n                # threads about it\n                cond = self.cache[self._hash_func_key(key)][\"condition\"]\n            except KeyError:  # pragma: no cover\n                cond = None\n            self.cache[self._hash_func_key(key)] = {\n                \"value\": func_res,\n                \"time\": datetime.now(),\n                \"stale\": False,\n                \"being_calculated\": False,\n                \"condition\": cond,\n            }\n\n    def mark_entry_being_calculated(self, key):\n        with self.lock:\n            condition = threading.Condition()\n            # condition.acquire()\n            try:\n                self.cache[self._hash_func_key(key)][\"being_calculated\"] = True\n                self.cache[self._hash_func_key(key)][\"condition\"] = condition\n            except KeyError:\n                self.cache[self._hash_func_key(key)] = {\n                    \"value\": None,\n                    \"time\": datetime.now(),\n                    \"stale\": False,\n                    \"being_calculated\": True,\n                    \"condition\": condition,\n                }\n\n    def mark_entry_not_calculated(self, key):\n        with self.lock:\n            try:\n                entry = self.cache[self._hash_func_key(key)]\n            except KeyError:  # pragma: no cover\n                return  # that's ok, we don't need an entry in that case\n            entry[\"being_calculated\"] = False\n            cond = entry[\"condition\"]\n            if cond:\n                cond.acquire()\n                cond.notify_all()\n                cond.release()\n                entry[\"condition\"] = None\n\n    def wait_on_entry_calc(self, key):\n        with self.lock:  # pragma: no cover\n            entry = self.cache[self._hash_func_key(key)]\n            if not entry[\"being_calculated\"]:\n                return entry[\"value\"]\n        entry[\"condition\"].acquire()\n        entry[\"condition\"].wait()\n        entry[\"condition\"].release()\n        return self.cache[self._hash_func_key(key)][\"value\"]\n\n    def clear_cache(self):\n        with self.lock:\n            pass\n\n    def clear_being_calculated(self):\n        with self.lock:\n            for entry in self.cache.values():\n                entry[\"being_calculated\"] = False\n                entry[\"condition\"] = None\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/mongo.py",
      "content": "\"\"\"A MongoDB-based caching core for cachier.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\nimport pickle  # for serialization of python objects\nimport sys  # to make sure that pymongo was imported\nimport time  # to sleep when waiting on Mongo cache\\\nimport warnings  # to warn if pymongo is missing\nfrom contextlib import suppress\nfrom datetime import datetime\n\nfrom .._types import HashFunc, Mongetter\n\nwith suppress(ImportError):\n    from bson.binary import Binary  # to save binary data to mongodb\n    from pymongo import ASCENDING, IndexModel\n    from pymongo.errors import OperationFailure\n\nfrom .base import RecalculationNeeded, _BaseCore, _get_func_str\n\nMONGO_SLEEP_DURATION_IN_SEC = 1\n\n\nclass MissingMongetter(ValueError):\n    \"\"\"Thrown when the mongetter keyword argument is missing.\"\"\"\n\n\nclass _MongoCore(_BaseCore):\n    _INDEX_NAME = \"func_1_key_1\"\n\n    def __init__(\n        self,\n        hash_func: HashFunc,\n        mongetter: Mongetter,\n        wait_for_calc_timeout: int,\n    ):\n        if \"pymongo\" not in sys.modules:\n            warnings.warn(\n                \"`pymongo` was not found. MongoDB cores will not function.\",\n                ImportWarning,\n                stacklevel=2,\n            )  # pragma: no cover\n\n        super().__init__(hash_func, wait_for_calc_timeout)\n        if mongetter is None:\n            raise MissingMongetter(\n                \"must specify ``mongetter`` when using the mongo core\"\n            )\n        self.mongetter = mongetter\n        self.mongo_collection = self.mongetter()\n        index_inf = self.mongo_collection.index_information()\n        if _MongoCore._INDEX_NAME not in index_inf:\n            func1key1 = IndexModel(\n                keys=[(\"func\", ASCENDING), (\"key\", ASCENDING)],\n                name=_MongoCore._INDEX_NAME,\n            )\n            self.mongo_collection.create_indexes([func1key1])\n\n    @property\n    def _func_str(self) -> str:\n        return _get_func_str(self.func)\n\n    def get_entry_by_key(self, key):\n        res = self.mongo_collection.find_one(\n            {\"func\": self._func_str, \"key\": key}\n        )\n        if not res:\n            return key, None\n        try:\n            entry = {\n                \"value\": pickle.loads(res[\"value\"]),  # noqa: S301\n                \"time\": res.get(\"time\", None),\n                \"stale\": res.get(\"stale\", False),\n                \"being_calculated\": res.get(\"being_calculated\", False),\n            }\n        except KeyError:\n            entry = {\n                \"value\": None,\n                \"time\": res.get(\"time\", None),\n                \"stale\": res.get(\"stale\", False),\n                \"being_calculated\": res.get(\"being_calculated\", False),\n            }\n        return key, entry\n\n    def set_entry(self, key, func_res):\n        thebytes = pickle.dumps(func_res)\n        self.mongo_collection.update_one(\n            filter={\"func\": self._func_str, \"key\": key},\n            update={\n                \"$set\": {\n                    \"func\": self._func_str,\n                    \"key\": key,\n                    \"value\": Binary(thebytes),\n                    \"time\": datetime.now(),\n                    \"stale\": False,\n                    \"being_calculated\": False,\n                }\n            },\n            upsert=True,\n        )\n\n    def mark_entry_being_calculated(self, key):\n        self.mongo_collection.update_one(\n            filter={\"func\": self._func_str, \"key\": key},\n            update={\"$set\": {\"being_calculated\": True}},\n            upsert=True,\n        )\n\n    def mark_entry_not_calculated(self, key):\n        with suppress(OperationFailure):  # don't care in this case\n            self.mongo_collection.update_one(\n                filter={\n                    \"func\": self._func_str,\n                    \"key\": key,\n                },\n                update={\"$set\": {\"being_calculated\": False}},\n                upsert=False,  # should not insert in this case\n            )\n\n    def wait_on_entry_calc(self, key):\n        time_spent = 0\n        while True:\n            time.sleep(MONGO_SLEEP_DURATION_IN_SEC)\n            time_spent += MONGO_SLEEP_DURATION_IN_SEC\n            key, entry = self.get_entry_by_key(key)\n            if entry is None:\n                raise RecalculationNeeded()\n            if not entry[\"being_calculated\"]:\n                return entry[\"value\"]\n            self.check_calc_timeout(time_spent)\n\n    def clear_cache(self):\n        self.mongo_collection.delete_many(filter={\"func\": self._func_str})\n\n    def clear_being_calculated(self):\n        self.mongo_collection.update_many(\n            filter={\n                \"func\": self._func_str,\n                \"being_calculated\": True,\n            },\n            update={\"$set\": {\"being_calculated\": False}},\n        )\n"
    },
    {
      "path": "python-cachier_cachier/tests/__init__.py",
      "content": ""
    },
    {
      "path": "python-cachier_cachier/tests/standalone_script.py",
      "content": "import time\n\nimport cachier\n\n\n@cachier.cachier()\ndef _takes_3_seconds(label, value):\n    time.sleep(3)\n    return f\"{label} {value}\"\n\n\nprint(_takes_3_seconds(\"two\", 2))\n"
    },
    {
      "path": "python-cachier_cachier/tests/speed_eval.py",
      "content": "from random import random\nfrom time import time\n\nfrom cachier import cachier\n\n\n@cachier(next_time=True)\ndef _test_int_pickling(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    return int_1 + int_2\n\n\ndef _test_int_pickling_compare(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    return int_1 + int_2\n\n\ndef test_pickle_speed():\n    \"\"\"Test speeds.\"\"\"\n    print(\"Comparing speeds of decorated vs non-decorated functions...\")\n    num_of_vals = 1000\n    times = []\n    for i in range(1, num_of_vals):\n        tic = time()\n        _test_int_pickling_compare(i, i + 1)\n        toc = time()\n        times.append(toc - tic)\n    print(f\"  - Non-decorated average = {sum(times) / num_of_vals}\")\n\n    _test_int_pickling.clear_cache()\n    times = []\n    for i in range(1, num_of_vals):\n        tic = time()\n        _test_int_pickling(i, i + 1)\n        toc = time()\n        times.append(toc - tic)\n    print(f\"  - Decorated average = {sum(times) / num_of_vals}\")\n\n\n@cachier()\ndef _test_single_file_speed(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    # something that takes some memory\n    return [random() for _ in range(1000000)]\n\n\n@cachier(separate_files=True)\ndef _test_separate_files_speed(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    # something that takes some memory\n    return [random() for _ in range(1000000)]\n\n\ndef test_separate_files_vs_single_file():\n    _test_separate_files_speed.clear_cache()\n    _test_single_file_speed.clear_cache()\n    start_time = time()\n    for _ in range(3):\n        for j in range(10):\n            _test_separate_files_speed(j, 2)\n    print(f\"separate files time: {time() - start_time}\")\n    start_time = time()\n    for _ in range(3):\n        for j in range(10):\n            _test_single_file_speed(j, 2)\n    print(f\"single file time: {time() - start_time}\")\n\n\nif __name__ == \"__main__\":\n    test_pickle_speed()\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_memory_core.py",
      "content": "\"\"\"Test for the in-memory implementation of the  Cachier python package.\"\"\"\n\nimport hashlib\nimport queue\nimport threading\nfrom datetime import timedelta\nfrom random import random\nfrom time import sleep, time\n\nimport pandas as pd\nimport pytest\nfrom cachier import cachier\n\n\n@cachier(backend=\"memory\", next_time=False)\ndef _takes_2_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)\n    return f\"arg_1:{arg_1}, arg_2:{arg_2}\"\n\n\n@pytest.mark.memory\ndef test_memory_core():\n    \"\"\"Basic memory core functionality.\"\"\"\n    _takes_2_seconds.clear_cache()\n    _takes_2_seconds(\"a\", \"b\")\n    start = time()\n    _takes_2_seconds(\"a\", \"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds.clear_cache()\n\n\n@pytest.mark.memory\ndef test_memory_core_keywords():\n    \"\"\"Basic memory core functionality with keyword arguments.\"\"\"\n    _takes_2_seconds.clear_cache()\n    _takes_2_seconds(\"a\", arg_2=\"b\")\n    start = time()\n    _takes_2_seconds(\"a\", arg_2=\"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds.clear_cache()\n\n\nSECONDS_IN_DELTA = 3\nDELTA = timedelta(seconds=SECONDS_IN_DELTA)\n\n\n@cachier(backend=\"memory\", stale_after=DELTA, next_time=False)\ndef _stale_after_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.memory\ndef test_stale_after():\n    \"\"\"Testing the stale_after functionality.\"\"\"\n    _stale_after_seconds.clear_cache()\n    val1 = _stale_after_seconds(1, 2)\n    val2 = _stale_after_seconds(1, 2)\n    val3 = _stale_after_seconds(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(3)\n    val4 = _stale_after_seconds(1, 2)\n    assert val4 != val1\n    _stale_after_seconds.clear_cache()\n\n\n@cachier(backend=\"memory\", stale_after=DELTA, next_time=True)\ndef _stale_after_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.memory\ndef test_stale_after_next_time():\n    \"\"\"Testing the stale_after with next_time functionality.\"\"\"\n    _stale_after_next_time.clear_cache()\n    val1 = _stale_after_next_time(1, 2)\n    val2 = _stale_after_next_time(1, 2)\n    val3 = _stale_after_next_time(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(SECONDS_IN_DELTA + 1)\n    val4 = _stale_after_next_time(1, 2)\n    assert val4 == val1\n    sleep(0.5)\n    val5 = _stale_after_next_time(1, 2)\n    assert val5 != val1\n    _stale_after_next_time.clear_cache()\n\n\n@cachier(backend=\"memory\")\ndef _random_num():\n    return random()\n\n\n@cachier(backend=\"memory\")\ndef _random_num_with_arg(a):\n    # print(a)\n    return random()\n\n\n@pytest.mark.memory\ndef test_overwrite_cache():\n    \"\"\"Tests that the overwrite feature works correctly.\"\"\"\n    _random_num.clear_cache()\n    int1 = _random_num()\n    int2 = _random_num()\n    assert int2 == int1\n    int3 = _random_num(cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num()\n    assert int4 == int3\n    _random_num.clear_cache()\n\n    _random_num_with_arg.clear_cache()\n    int1 = _random_num_with_arg(\"a\")\n    int2 = _random_num_with_arg(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg(\"a\", cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg(\"a\")\n    assert int4 == int3\n    _random_num_with_arg.clear_cache()\n\n\n@pytest.mark.memory\ndef test_ignore_cache():\n    \"\"\"Tests that the ignore_cache feature works correctly.\"\"\"\n    _random_num.clear_cache()\n    int1 = _random_num()\n    int2 = _random_num()\n    assert int2 == int1\n    int3 = _random_num(cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num()\n    assert int4 != int3\n    assert int4 == int1\n    _random_num.clear_cache()\n\n    _random_num_with_arg.clear_cache()\n    int1 = _random_num_with_arg(\"a\")\n    int2 = _random_num_with_arg(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg(\"a\", cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg(\"a\")\n    assert int4 != int3\n    assert int4 == int1\n    _random_num_with_arg.clear_cache()\n\n\n@cachier(backend=\"memory\")\ndef _takes_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)  # this has to be enough time for check_calculation to run twice\n    return random() + arg_1 + arg_2\n\n\ndef _calls_takes_time(res_queue):\n    res = _takes_time(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.memory\ndef test_memory_being_calculated():\n    \"\"\"Testing memory core handling of being calculated scenarios.\"\"\"\n    _takes_time.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=3)\n    thread2.join(timeout=3)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\n@cachier(backend=\"memory\", stale_after=timedelta(seconds=1), next_time=True)\ndef _being_calc_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\ndef _calls_being_calc_next_time(res_queue):\n    res = _being_calc_next_time(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.memory\ndef test_being_calc_next_time():\n    \"\"\"Testing memory core handling of being calculated scenarios.\"\"\"\n    _takes_time.clear_cache()\n    _being_calc_next_time(0.13, 0.02)\n    sleep(1.1)\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=3)\n    thread2.join(timeout=3)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\n@cachier(backend=\"memory\")\ndef _bad_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n@cachier(backend=\"memory\")\ndef _delete_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n@pytest.mark.memory\ndef test_clear_being_calculated():\n    \"\"\"Test memory core clear `being calculated` functionality.\"\"\"\n    _takes_time.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread1.start()\n    _takes_time.clear_being_calculated()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=3)\n    thread2.join(timeout=3)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 != res2\n\n\n@pytest.mark.memory\ndef test_clear_being_calculated_with_empty_cache():\n    \"\"\"Test memory core clear `being calculated` functionality.\"\"\"\n    _takes_time.clear_cache()\n    _takes_time.clear_being_calculated()\n\n\n@cachier(backend=\"memory\", stale_after=timedelta(seconds=1), next_time=True)\ndef _error_throwing_func(arg1):\n    if not hasattr(_error_throwing_func, \"count\"):\n        _error_throwing_func.count = 0\n    _error_throwing_func.count += 1\n    if _error_throwing_func.count > 1:\n        raise ValueError(\"Tiny Rick!\")\n    return 7\n\n\n@pytest.mark.memory\ndef test_error_throwing_func():\n    # with\n    res1 = _error_throwing_func(4)\n    sleep(1.5)\n    res2 = _error_throwing_func(4)\n    assert res1 == res2\n\n\n@pytest.mark.memory\ndef test_callable_hash_param():\n    def _hash_func(args, kwargs):\n        def _hash(obj):\n            if isinstance(obj, pd.core.frame.DataFrame):\n                return hashlib.sha256(\n                    pd.util.hash_pandas_object(obj).values.tobytes()\n                ).hexdigest()\n            return obj\n\n        k_args = tuple(map(_hash, args))\n        k_kwargs = tuple(\n            sorted({k: _hash(v) for k, v in kwargs.items()}.items())\n        )\n        return k_args + k_kwargs\n\n    @cachier(backend=\"memory\", hash_func=_hash_func)\n    def _params_with_dataframe(*args, **kwargs):\n        \"\"\"Some function.\"\"\"\n        return random()\n\n    _params_with_dataframe.clear_cache()\n\n    df_a = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    df_b = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    value_a = _params_with_dataframe(df_a, 1)\n    value_b = _params_with_dataframe(df_b, 1)\n\n    assert value_a == value_b  # same content --> same key\n\n    value_a = _params_with_dataframe(1, df=df_a)\n    value_b = _params_with_dataframe(1, df=df_b)\n\n    assert value_a == value_b  # same content --> same key\n\n\nif __name__ == \"__main__\":\n    test_memory_being_calculated()\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_core_lookup.py",
      "content": "\"\"\"Testing a few basic cachier interfaces.\"\"\"\n\nimport pytest\nfrom cachier import cachier, get_default_params\nfrom cachier.cores.mongo import MissingMongetter\n\n\ndef test_get_default_params():\n    params = get_default_params()\n    assert tuple(sorted(params)) == (\n        \"allow_none\",\n        \"backend\",\n        \"cache_dir\",\n        \"caching_enabled\",\n        \"hash_func\",\n        \"mongetter\",\n        \"next_time\",\n        \"pickle_reload\",\n        \"separate_files\",\n        \"stale_after\",\n        \"wait_for_calc_timeout\",\n    )\n\n\ndef test_bad_name(name=\"nope\"):\n    # Test that the appropriate exception is thrown\n    # when an invalid backend is given.\n    with pytest.raises(ValueError, match=f\"specified an invalid core: {name}\"):\n\n        @cachier(backend=name)\n        def dummy_func():\n            pass\n\n\ndef test_missing_mongetter():\n    # Test that the appropriate exception is thrown\n    # when forgetting to specify the mongetter.\n    with pytest.raises(MissingMongetter):\n\n        @cachier(backend=\"mongo\", mongetter=None)\n        def dummy_func():\n            pass\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_general.py",
      "content": "\"\"\"Non-core-specific tests for cachier.\"\"\"\n\nimport functools\nimport os\nimport queue\nimport subprocess  # nosec: B404\nimport threading\nfrom contextlib import suppress\nfrom random import random\nfrom time import sleep, time\n\nimport cachier\nimport pytest\nfrom cachier.core import (\n    DEFAULT_MAX_WORKERS,\n    MAX_WORKERS_ENVAR_NAME,\n    _get_executor,\n    _max_workers,\n    _set_max_workers,\n)\n\nfrom tests.test_mongo_core import (\n    MONGO_DELTA_LONG,\n    _test_mongetter,\n)\n\n\ndef test_information():\n    print(\"\\ncachier version: \", end=\"\")\n    print(cachier.__version__)\n\n\ndef test_max_workers():\n    \"\"\"Just call this function for coverage.\"\"\"\n    with suppress(KeyError):\n        del os.environ[MAX_WORKERS_ENVAR_NAME]\n    assert _max_workers() == DEFAULT_MAX_WORKERS\n\n\ndef test_get_executor():\n    \"\"\"Just call this function for coverage.\"\"\"\n    _get_executor()\n    _get_executor(False)\n    _get_executor(True)\n\n\ndef test_set_max_workers():\n    \"\"\"Just call this function for coverage.\"\"\"\n    _set_max_workers(9)\n\n\nparametrize_keys = \"mongetter,stale_after,separate_files\"\nparametrize_values = [\n    pytest.param(\n        _test_mongetter, MONGO_DELTA_LONG, False, marks=pytest.mark.mongo\n    ),\n    (None, None, False),\n    (None, None, True),\n]\n\n\n@pytest.mark.parametrize(parametrize_keys, parametrize_values)\ndef test_wait_for_calc_timeout_ok(mongetter, stale_after, separate_files):\n    @cachier.cachier(\n        mongetter=mongetter,\n        stale_after=stale_after,\n        separate_files=separate_files,\n        next_time=False,\n        wait_for_calc_timeout=2,\n    )\n    def _wait_for_calc_timeout_fast(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        sleep(1)\n        return random() + arg_1 + arg_2\n\n    def _calls_wait_for_calc_timeout_fast(res_queue):\n        res = _wait_for_calc_timeout_fast(1, 2)\n        res_queue.put(res)\n\n    \"\"\" Testing calls that avoid timeouts store the values in cache. \"\"\"\n    _wait_for_calc_timeout_fast.clear_cache()\n    val1 = _wait_for_calc_timeout_fast(1, 2)\n    val2 = _wait_for_calc_timeout_fast(1, 2)\n    assert val1 == val2\n\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_fast,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_fast,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n\n    thread1.start()\n    thread2.start()\n    sleep(2)\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2  # Timeout did not kick in, a single call was done\n\n\n@pytest.mark.parametrize(parametrize_keys, parametrize_values)\ndef test_wait_for_calc_timeout_slow(mongetter, stale_after, separate_files):\n    @cachier.cachier(\n        mongetter=mongetter,\n        stale_after=stale_after,\n        separate_files=separate_files,\n        next_time=False,\n        wait_for_calc_timeout=2,\n    )\n    def _wait_for_calc_timeout_slow(arg_1, arg_2):\n        sleep(3)\n        return random() + arg_1 + arg_2\n\n    def _calls_wait_for_calc_timeout_slow(res_queue):\n        res = _wait_for_calc_timeout_slow(1, 2)\n        res_queue.put(res)\n\n    \"\"\"Testing for calls timing out to be performed twice when needed.\"\"\"\n    _wait_for_calc_timeout_slow.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n\n    thread1.start()\n    thread2.start()\n    sleep(1)\n    res3 = _wait_for_calc_timeout_slow(1, 2)\n    sleep(4)\n    thread1.join(timeout=4)\n    thread2.join(timeout=4)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 != res2  # Timeout kicked in.  Two calls were done\n    res4 = _wait_for_calc_timeout_slow(1, 2)\n    # One of the cached values is returned\n    assert res1 == res4 or res2 == res4 or res3 == res4\n\n\n@pytest.mark.parametrize(\n    \"mongetter,backend\",\n    [\n        pytest.param(_test_mongetter, \"mongo\", marks=pytest.mark.mongo),\n        (None, \"memory\"),\n        (None, \"pickle\"),\n    ],\n)\ndef test_precache_value(mongetter, backend):\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def dummy_func(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return arg_1 + arg_2\n\n    assert dummy_func.precache_value(2, 2, value_to_cache=5) == 5\n    assert dummy_func(2, 2) == 5\n    dummy_func.clear_cache()\n    assert dummy_func(2, 2) == 4\n    assert dummy_func.precache_value(2, arg_2=2, value_to_cache=5) == 5\n    assert dummy_func(2, arg_2=2) == 5\n\n\n@pytest.mark.parametrize(\n    \"mongetter,backend\",\n    [\n        pytest.param(_test_mongetter, \"mongo\", marks=pytest.mark.mongo),\n        (None, \"memory\"),\n        (None, \"pickle\"),\n    ],\n)\ndef test_ignore_self_in_methods(mongetter, backend):\n    class DummyClass:\n        @cachier.cachier(backend=backend, mongetter=mongetter)\n        def takes_2_seconds(self, arg_1, arg_2):\n            \"\"\"Some function.\"\"\"\n            sleep(2)\n            return arg_1 + arg_2\n\n    test_object_1 = DummyClass()\n    test_object_2 = DummyClass()\n    test_object_1.takes_2_seconds.clear_cache()\n    test_object_2.takes_2_seconds.clear_cache()\n    assert test_object_1.takes_2_seconds(1, 2) == 3\n    start = time()\n    assert test_object_2.takes_2_seconds(1, 2) == 3\n    end = time()\n    assert end - start < 1\n\n\ndef test_hash_params_deprecation():\n    with pytest.deprecated_call(match=\"hash_params will be removed\"):\n\n        @cachier.cachier(hash_params=lambda a, k: \"key\")\n        def test():\n            return \"value\"\n\n    assert test() == \"value\"\n\n\ndef test_separate_processes():\n    test_args = (\"python\", \"tests/standalone_script.py\")\n    run_params = {\"args\": test_args, \"capture_output\": True, \"text\": True}\n    run_process = functools.partial(subprocess.run, **run_params)\n    result = run_process()\n    assert result.stdout.strip() == \"two 2\"\n    start = time()\n    result = run_process()\n    end = time()\n    assert result.stdout.strip() == \"two 2\"\n    assert end - start < 3\n\n\ndef test_global_disable():\n    @cachier.cachier()\n    def get_random():\n        return random()\n\n    get_random.clear_cache()\n    result_1 = get_random()\n    result_2 = get_random()\n    cachier.disable_caching()\n    result_3 = get_random()\n    cachier.enable_caching()\n    result_4 = get_random()\n    assert result_1 == result_2 == result_4\n    assert result_1 != result_3\n\n\ndef test_none_not_cached_by_default():\n    count = 0\n\n    @cachier.cachier()\n    def do_operation():\n        nonlocal count\n        count += 1\n        return None\n\n    do_operation.clear_cache()\n    assert count == 0\n    do_operation()\n    do_operation()\n    assert count == 2\n\n\ndef test_allow_caching_none():\n    count = 0\n\n    @cachier.cachier(allow_none=True)\n    def do_operation():\n        nonlocal count\n        count += 1\n        return None\n\n    do_operation.clear_cache()\n    assert count == 0\n    do_operation()\n    do_operation()\n    assert count == 1\n\n\ndef test_identical_inputs():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a: int, b: int = 2, c: int = 3):\n        nonlocal count\n        count += 1\n        return a + b + c\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func(1, 2, 3) == 6\n    assert dummy_func(1, 2, c=3) == 6\n    assert dummy_func(1, b=2, c=3) == 6\n    assert dummy_func(a=1, b=2, c=3) == 6\n    assert count == 1\n\n\ndef test_list_inputs():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a: list, b: list = [2]):  # noqa: B006\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func([1]) == [1, 2]\n    assert dummy_func([1], [2]) == [1, 2]\n    assert dummy_func([1], b=[2]) == [1, 2]\n    assert dummy_func(a=[1], b=[2]) == [1, 2]\n    assert count == 1\n\n\ndef test_order_independent_kwargs_handling():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a, b):\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func(a=1, b=2) == 3\n    assert dummy_func(a=1, b=2) == 3\n    assert dummy_func(b=2, a=1) == 3\n    assert count == 1\n\n\n@pytest.mark.parametrize(\"backend\", [\"memory\", \"pickle\"])\ndef test_diff_functions_same_args(tmpdir, backend: str):\n    count_p = count_m = 0\n\n    @cachier.cachier(cache_dir=tmpdir, backend=backend)\n    def fn_plus(a, b=2):\n        nonlocal count_p\n        count_p += 1\n        return a + b\n\n    @cachier.cachier(cache_dir=tmpdir, backend=backend)\n    def fn_minus(a, b=2):\n        nonlocal count_m\n        count_m += 1\n        return a - b\n\n    assert count_p == count_m == 0\n\n    for fn, expected in [(fn_plus, 3), (fn_minus, -1)]:\n        assert fn(1) == expected\n        assert fn(a=1, b=2) == expected\n    assert count_p == 1\n    assert count_m == 1\n\n\n@pytest.mark.parametrize(\"backend\", [\"memory\", \"pickle\"])\ndef test_runtime_handling(tmpdir, backend):\n    count_p = count_m = 0\n\n    def fn_plus(a, b=2):\n        nonlocal count_p\n        count_p += 1\n        return a + b\n\n    def fn_minus(a, b=2):\n        nonlocal count_m\n        count_m += 1\n        return a - b\n\n    cachier_ = cachier.cachier(cache_dir=tmpdir, backend=backend)\n    assert count_p == count_m == 0\n\n    for fn, expected in [(fn_plus, 3), (fn_minus, -1)]:\n        assert cachier_(fn)(1, 2) == expected\n        assert cachier_(fn)(a=1, b=2) == expected\n    assert count_p == 1\n    assert count_m == 1\n\n    for fn, expected in [(fn_plus, 5), (fn_minus, 1)]:\n        assert cachier_(fn)(3, 2) == expected\n        assert cachier_(fn)(a=3, b=2) == expected\n    assert count_p == 2\n    assert count_m == 2\n\n\ndef test_partial_handling(tmpdir):\n    count_p = count_m = 0\n\n    def fn_plus(a, b=2):\n        nonlocal count_p\n        count_p += 1\n        return a + b\n\n    def fn_minus(a, b=2):\n        nonlocal count_m\n        count_m += 1\n        return a - b\n\n    cachier_ = cachier.cachier(cache_dir=tmpdir)\n    assert count_p == count_m == 0\n\n    for fn, expected in [(fn_plus, 3), (fn_minus, -1)]:\n        dummy_ = functools.partial(fn, 1)\n        assert cachier_(dummy_)() == expected\n\n        dummy_ = functools.partial(fn, 1)\n        assert cachier_(dummy_)(2) == expected\n\n        dummy_ = functools.partial(fn, a=1)\n        assert cachier_(dummy_)() == expected\n\n        dummy_ = functools.partial(fn, b=2)\n        assert cachier_(dummy_)(1) == expected\n\n        dummy_ = functools.partial(fn, b=2)\n        assert cachier_(dummy_)(1, b=2) == expected\n\n        assert cachier_(fn)(1, 2) == expected\n        assert cachier_(fn)(a=1, b=2) == expected\n\n    assert count_p == 1\n    assert count_m == 1\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_defaults.py",
      "content": "import datetime\nimport os\nimport queue\nimport random\nimport threading\nimport time\n\nimport cachier\nimport pytest\n\nfrom tests.test_mongo_core import MONGO_DELTA, _test_mongetter\n\n_default_params = cachier.get_default_params().copy()\n\n\ndef setup_function():\n    cachier.set_default_params(**_default_params)\n\n\ndef teardown_function():\n    cachier.set_default_params(**_default_params)\n\n\ndef test_hash_func_default_param():\n    def slow_hash_func(args, kwds):\n        time.sleep(2)\n        return \"hash\"\n\n    def fast_hash_func(args, kwds):\n        return \"hash\"\n\n    cachier.set_default_params(hash_func=slow_hash_func)\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(hash_func=fast_hash_func)\n    def global_test_2():\n        return None\n\n    start = time.time()\n    global_test_1()\n    end = time.time()\n    assert end - start > 1\n    start = time.time()\n    global_test_2()\n    end = time.time()\n    assert end - start < 1\n\n\ndef test_backend_default_param():\n    cachier.set_default_params(backend=\"memory\")\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(backend=\"pickle\")\n    def global_test_2():\n        return None\n\n    assert global_test_1.cache_dpath() is None\n    assert global_test_2.cache_dpath() is not None\n\n\n@pytest.mark.mongo\ndef test_mongetter_default_param():\n    cachier.set_default_params(mongetter=_test_mongetter)\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(mongetter=False)\n    def global_test_2():\n        return None\n\n    assert global_test_1.cache_dpath() is None\n    assert global_test_2.cache_dpath() is not None\n\n\ndef test_cache_dir_default_param(tmpdir):\n    cachier.set_default_params(cache_dir=tmpdir / \"1\")\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(cache_dir=tmpdir / \"2\")\n    def global_test_2():\n        return None\n\n    assert global_test_1.cache_dpath() == str(tmpdir / \"1\")\n    assert global_test_2.cache_dpath() == str(tmpdir / \"2\")\n\n\ndef test_separate_files_default_param(tmpdir):\n    cachier.set_default_params(separate_files=True)\n\n    @cachier.cachier(cache_dir=tmpdir / \"1\")\n    def global_test_1(arg_1, arg_2):\n        return arg_1 + arg_2\n\n    @cachier.cachier(cache_dir=tmpdir / \"2\", separate_files=False)\n    def global_test_2(arg_1, arg_2):\n        return arg_1 + arg_2\n\n    global_test_1(1, 2)\n    global_test_1(3, 4)\n    global_test_2(1, 2)\n    global_test_2(3, 4)\n\n    assert len(os.listdir(global_test_1.cache_dpath())) == 2\n    assert len(os.listdir(global_test_2.cache_dpath())) == 1\n\n\ndef test_allow_none_default_param(tmpdir):\n    cachier.set_default_params(\n        allow_none=True,\n        separate_files=True,\n        verbose_cache=True,\n    )\n    allow_count = disallow_count = 0\n\n    @cachier.cachier(cache_dir=tmpdir)\n    def allow_none():\n        nonlocal allow_count\n        allow_count += 1\n        return None\n\n    @cachier.cachier(cache_dir=tmpdir, allow_none=False)\n    def disallow_none():\n        nonlocal disallow_count\n        disallow_count += 1\n        return None\n\n    assert allow_count == 0\n    allow_none()\n    allow_none()\n    assert allow_count == 1\n\n    assert disallow_count == 0\n    disallow_none()\n    disallow_none()\n    assert disallow_count == 2\n\n    disallow_none(cachier__allow_none=True)\n    disallow_none(cachier__allow_none=True)\n    assert disallow_count == 2\n\n\nPARAMETRIZE_TEST = (\n    \"backend,mongetter\",\n    [\n        pytest.param(\"pickle\", None, marks=pytest.mark.pickle),\n        pytest.param(\"mongo\", _test_mongetter, marks=pytest.mark.mongo),\n    ],\n)\n\n\n@pytest.mark.parametrize(*PARAMETRIZE_TEST)\ndef test_stale_after_applies_dynamically(backend, mongetter):\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def _stale_after_test(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random.random() + arg_1 + arg_2\n\n    cachier.set_default_params(stale_after=MONGO_DELTA)\n\n    _stale_after_test.clear_cache()\n    val1 = _stale_after_test(1, 2)\n    val2 = _stale_after_test(1, 2)\n    assert val1 == val2\n    time.sleep(3)\n    val3 = _stale_after_test(1, 2)\n    assert val3 != val1\n\n\n@pytest.mark.parametrize(*PARAMETRIZE_TEST)\ndef test_next_time_applies_dynamically(backend, mongetter):\n    NEXT_AFTER_DELTA = datetime.timedelta(seconds=3)\n\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def _stale_after_next_time(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random.random()\n\n    cachier.set_default_params(stale_after=NEXT_AFTER_DELTA, next_time=True)\n\n    _stale_after_next_time.clear_cache()\n    val1 = _stale_after_next_time(1, 2)\n    val2 = _stale_after_next_time(1, 2)\n    val3 = _stale_after_next_time(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    time.sleep(NEXT_AFTER_DELTA.seconds + 1)\n    val4 = _stale_after_next_time(1, 2)\n    assert val4 == val1\n    time.sleep(0.5)\n    val5 = _stale_after_next_time(1, 2)\n    assert val5 != val1\n    _stale_after_next_time.clear_cache()\n\n\n@pytest.mark.parametrize(*PARAMETRIZE_TEST)\ndef test_wait_for_calc_applies_dynamically(backend, mongetter):\n    \"\"\"Testing for calls timing out to be performed twice when needed.\"\"\"\n\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def _wait_for_calc_timeout_slow(arg_1, arg_2):\n        time.sleep(3)\n        return random.random() + arg_1 + arg_2\n\n    def _calls_wait_for_calc_timeout_slow(res_queue):\n        res = _wait_for_calc_timeout_slow(1, 2)\n        res_queue.put(res)\n\n    cachier.set_default_params(wait_for_calc_timeout=2)\n    _wait_for_calc_timeout_slow.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n\n    thread1.start()\n    thread2.start()\n    time.sleep(1)\n    res3 = _wait_for_calc_timeout_slow(1, 2)\n    time.sleep(5)\n    thread1.join(timeout=5)\n    thread2.join(timeout=5)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 != res2  # Timeout kicked in.  Two calls were done\n    res4 = _wait_for_calc_timeout_slow(1, 2)\n    # One of the cached values is returned\n    assert res1 == res4 or res2 == res4 or res3 == res4\n\n\ndef test_default_kwargs_handling():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a, b=2):\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func(1) == 3\n    assert dummy_func(a=1) == 3\n    assert dummy_func(a=1, b=2) == 3\n    assert count == 1\n\n\ndef test_deprecated_func_kwargs():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a, b=2):\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    with pytest.deprecated_call(\n        match=\"`verbose_cache` is deprecated and will be removed\"\n    ):\n        assert dummy_func(1, verbose_cache=True) == 3\n    assert count == 1\n    with pytest.deprecated_call(\n        match=\"`ignore_cache` is deprecated and will be removed\"\n    ):\n        assert dummy_func(1, ignore_cache=True) == 3\n    assert count == 2\n    with pytest.deprecated_call(\n        match=\"`overwrite_cache` is deprecated and will be removed\"\n    ):\n        assert dummy_func(1, overwrite_cache=True) == 3\n    assert count == 3\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_pickle_core.py",
      "content": "\"\"\"Test for the Cachier python package.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\n# from os.path import (\n#     realpath,\n#     dirname\n# )\nimport os\nimport pickle\nimport threading\nfrom datetime import timedelta\nfrom random import random\nfrom time import sleep, time\n\nimport pytest\n\ntry:\n    import queue\nexcept ImportError:  # python 2\n    import Queue as queue  # type: ignore\n\nimport hashlib\n\nimport pandas as pd\nfrom cachier import cachier\nfrom cachier.core import _default_params\n\n\ndef _get_decorated_func(func, **kwargs):\n    cachier_decorator = cachier(**kwargs)\n    decorated_func = cachier_decorator(func)\n    return decorated_func\n\n\n# Pickle core tests\n\n\ndef _takes_2_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)\n    return f\"arg_1:{arg_1}, arg_2:{arg_2}\"\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"reload\", [True, False])\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_core(reload, separate_files):\n    \"\"\"Basic Pickle core functionality.\"\"\"\n    _takes_2_seconds_decorated = _get_decorated_func(\n        _takes_2_seconds,\n        next_time=False,\n        pickle_reload=reload,\n        separate_files=separate_files,\n    )\n    _takes_2_seconds_decorated.clear_cache()\n    _takes_2_seconds_decorated(\"a\", \"b\")\n    start = time()\n    _takes_2_seconds_decorated(\"a\", \"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds_decorated.clear_cache()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_core_keywords(separate_files):\n    \"\"\"Basic Pickle core functionality with keyword arguments.\"\"\"\n    _takes_2_seconds_decorated = _get_decorated_func(\n        _takes_2_seconds, next_time=False, separate_files=separate_files\n    )\n    _takes_2_seconds_decorated.clear_cache()\n    _takes_2_seconds_decorated(\"a\", arg_2=\"b\")\n    start = time()\n    _takes_2_seconds_decorated(\"a\", arg_2=\"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds_decorated.clear_cache()\n\n\nSECONDS_IN_DELTA = 3\nDELTA = timedelta(seconds=SECONDS_IN_DELTA)\n\n\ndef _stale_after_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_stale_after(separate_files):\n    \"\"\"Testing the stale_after functionality.\"\"\"\n    _stale_after_seconds_decorated = _get_decorated_func(\n        _stale_after_seconds,\n        stale_after=DELTA,\n        next_time=False,\n        separate_files=separate_files,\n    )\n    _stale_after_seconds_decorated.clear_cache()\n    val1 = _stale_after_seconds_decorated(1, 2)\n    val2 = _stale_after_seconds_decorated(1, 2)\n    val3 = _stale_after_seconds_decorated(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(3)\n    val4 = _stale_after_seconds_decorated(1, 2)\n    assert val4 != val1\n    _stale_after_seconds_decorated.clear_cache()\n\n\ndef _stale_after_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_stale_after_next_time(separate_files):\n    \"\"\"Testing the stale_after with next_time functionality.\"\"\"\n    _stale_after_next_time_decorated = _get_decorated_func(\n        _stale_after_next_time,\n        stale_after=DELTA,\n        next_time=True,\n        separate_files=separate_files,\n    )\n    _stale_after_next_time_decorated.clear_cache()\n    val1 = _stale_after_next_time_decorated(1, 2)\n    val2 = _stale_after_next_time_decorated(1, 2)\n    val3 = _stale_after_next_time_decorated(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(SECONDS_IN_DELTA + 1)\n    val4 = _stale_after_next_time_decorated(1, 2)\n    assert val4 == val1\n    sleep(0.5)\n    val5 = _stale_after_next_time_decorated(1, 2)\n    assert val5 != val1\n    _stale_after_next_time_decorated.clear_cache()\n\n\ndef _random_num():\n    return random()\n\n\ndef _random_num_with_arg(a):\n    # print(a)\n    return random()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_overwrite_cache(separate_files):\n    \"\"\"Tests that the overwrite feature works correctly.\"\"\"\n    _random_num_decorated = _get_decorated_func(\n        _random_num, separate_files=separate_files\n    )\n    _random_num_with_arg_decorated = _get_decorated_func(\n        _random_num_with_arg, separate_files=separate_files\n    )\n    _random_num_decorated.clear_cache()\n    int1 = _random_num_decorated()\n    int2 = _random_num_decorated()\n    assert int2 == int1\n    int3 = _random_num_decorated(cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num_decorated()\n    assert int4 == int3\n    _random_num_decorated.clear_cache()\n\n    _random_num_with_arg_decorated.clear_cache()\n    int1 = _random_num_with_arg_decorated(\"a\")\n    int2 = _random_num_with_arg_decorated(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg_decorated(\"a\", cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg_decorated(\"a\")\n    assert int4 == int3\n    _random_num_with_arg_decorated.clear_cache()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_ignore_cache(separate_files):\n    \"\"\"Tests that the ignore_cache feature works correctly.\"\"\"\n    _random_num_decorated = _get_decorated_func(\n        _random_num, separate_files=separate_files\n    )\n    _random_num_with_arg_decorated = _get_decorated_func(\n        _random_num_with_arg, separate_files=separate_files\n    )\n    _random_num_decorated.clear_cache()\n    int1 = _random_num_decorated()\n    int2 = _random_num_decorated()\n    assert int2 == int1\n    int3 = _random_num_decorated(cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num_decorated()\n    assert int4 != int3\n    assert int4 == int1\n    _random_num_decorated.clear_cache()\n\n    _random_num_with_arg_decorated.clear_cache()\n    int1 = _random_num_with_arg_decorated(\"a\")\n    int2 = _random_num_with_arg_decorated(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg_decorated(\"a\", cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg_decorated(\"a\")\n    assert int4 != int3\n    assert int4 == int1\n    _random_num_with_arg_decorated.clear_cache()\n\n\ndef _takes_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)  # this has to be enough time for check_calculation to run twice\n    return random() + arg_1 + arg_2\n\n\ndef _calls_takes_time(takes_time_func, res_queue):\n    res = takes_time_func(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_being_calculated(separate_files):\n    \"\"\"Testing pickle core handling of being calculated scenarios.\"\"\"\n    _takes_time_decorated = _get_decorated_func(\n        _takes_time, separate_files=separate_files\n    )\n    _takes_time_decorated.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time,\n        kwargs={\n            \"takes_time_func\": _takes_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time,\n        kwargs={\n            \"takes_time_func\": _takes_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=4)\n    thread2.join(timeout=4)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\ndef _being_calc_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\ndef _calls_being_calc_next_time(being_calc_func, res_queue):\n    res = being_calc_func(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_being_calc_next_time(separate_files):\n    \"\"\"Testing pickle core handling of being calculated scenarios.\"\"\"\n    _being_calc_next_time_decorated = _get_decorated_func(\n        _being_calc_next_time,\n        stale_after=timedelta(seconds=1),\n        next_time=True,\n        separate_files=separate_files,\n    )\n    _being_calc_next_time_decorated.clear_cache()\n    _being_calc_next_time(0.13, 0.02)\n    sleep(1.1)\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\n            \"being_calc_func\": _being_calc_next_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\n            \"being_calc_func\": _being_calc_next_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\ndef _bad_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n# _BAD_CACHE_FNAME = '.__main__._bad_cache'\n_BAD_CACHE_FNAME = \".tests.test_pickle_core._bad_cache\"\n_BAD_CACHE_FNAME_SEPARATE_FILES = (\n    \".tests.test_pickle_core._bad_cache_\"\n    f\"{hashlib.sha256(pickle.dumps((0.13, 0.02))).hexdigest()}\"\n)\nEXPANDED_CACHIER_DIR = os.path.expanduser(_default_params[\"cache_dir\"])\n_BAD_CACHE_FPATH = os.path.join(EXPANDED_CACHIER_DIR, _BAD_CACHE_FNAME)\n_BAD_CACHE_FPATH_SEPARATE_FILES = os.path.join(\n    EXPANDED_CACHIER_DIR, _BAD_CACHE_FNAME_SEPARATE_FILES\n)\n_BAD_CACHE_FPATHS = {\n    True: _BAD_CACHE_FPATH_SEPARATE_FILES,\n    False: _BAD_CACHE_FPATH,\n}\n\n\ndef _calls_bad_cache(bad_cache_func, res_queue, trash_cache, separate_files):\n    try:\n        res = bad_cache_func(0.13, 0.02, cachier__verbose=True)\n        if trash_cache:\n            with open(_BAD_CACHE_FPATHS[separate_files], \"w\") as cache_file:\n                cache_file.seek(0)\n                cache_file.truncate()\n        res_queue.put(res)\n    except Exception as exc:\n        res_queue.put(exc)\n\n\ndef _helper_bad_cache_file(sleeptime, separate_files):\n    \"\"\"Test pickle core handling of bad cache files.\"\"\"\n    _bad_cache_decorated = _get_decorated_func(\n        _bad_cache, separate_files=separate_files\n    )\n    _bad_cache_decorated.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_bad_cache,\n        kwargs={\n            \"bad_cache_func\": _bad_cache_decorated,\n            \"res_queue\": res_queue,\n            \"trash_cache\": True,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_bad_cache,\n        kwargs={\n            \"bad_cache_func\": _bad_cache_decorated,\n            \"res_queue\": res_queue,\n            \"trash_cache\": False,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(sleeptime)\n    thread2.start()\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    if res_queue.qsize() != 2:\n        return False\n    res1 = res_queue.get()\n    if not isinstance(res1, float):\n        return False\n    res2 = res_queue.get()\n    if res2 is not None or isinstance(res2, KeyError):\n        return False\n    return True\n\n\n# we want this to succeed at least once\n@pytest.mark.pickle\n@pytest.mark.xfail\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_bad_cache_file(separate_files):\n    \"\"\"Test pickle core handling of bad cache files.\"\"\"\n    sleeptimes = [0.1, 0.2, 0.3, 0.5, 0.6, 0.7, 0.8, 1, 1.5, 2]\n    sleeptimes = sleeptimes + sleeptimes\n    for sleeptime in sleeptimes:\n        if _helper_bad_cache_file(sleeptime, separate_files):\n            return\n    raise AssertionError()\n\n\ndef _delete_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n# _DEL_CACHE_FNAME = '.__main__._delete_cache'\n_DEL_CACHE_FNAME = \".tests.test_pickle_core._delete_cache\"\n_DEL_CACHE_FNAME_SEPARATE_FILES = (\n    \".tests.test_pickle_core._delete_cache_\"\n    f\"{hashlib.sha256(pickle.dumps((0.13, 0.02))).hexdigest()}\"\n)\n_DEL_CACHE_FPATH = os.path.join(EXPANDED_CACHIER_DIR, _DEL_CACHE_FNAME)\n_DEL_CACHE_FPATH_SEPARATE_FILES = os.path.join(\n    EXPANDED_CACHIER_DIR, _DEL_CACHE_FNAME_SEPARATE_FILES\n)\n_DEL_CACHE_FPATHS = {\n    True: _DEL_CACHE_FPATH_SEPARATE_FILES,\n    False: _DEL_CACHE_FPATH,\n}\n\n\ndef _calls_delete_cache(del_cache_func, res_queue, del_cache, separate_files):\n    try:\n        # print('in')\n        res = del_cache_func(0.13, 0.02)\n        # print('out with {}'.format(res))\n        if del_cache:\n            os.remove(_DEL_CACHE_FPATHS[separate_files])\n            # print(os.path.isfile(_DEL_CACHE_FPATH))\n        res_queue.put(res)\n    except Exception as exc:\n        # print('found')\n        res_queue.put(exc)\n\n\ndef _helper_delete_cache_file(sleeptime, separate_files):\n    \"\"\"Test pickle core handling of missing cache files.\"\"\"\n    _delete_cache_decorated = _get_decorated_func(\n        _delete_cache, separate_files=separate_files\n    )\n    _delete_cache_decorated.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_delete_cache,\n        kwargs={\n            \"del_cache_func\": _delete_cache_decorated,\n            \"res_queue\": res_queue,\n            \"del_cache\": True,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_delete_cache,\n        kwargs={\n            \"del_cache_func\": _delete_cache_decorated,\n            \"res_queue\": res_queue,\n            \"del_cache\": False,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(sleeptime)\n    thread2.start()\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    if res_queue.qsize() != 2:\n        return False\n    res1 = res_queue.get()\n    # print(res1)\n    if not isinstance(res1, float):\n        return False\n    res2 = res_queue.get()\n    if not ((isinstance(res2, KeyError)) or (res2 is None)):\n        return False\n    return True\n    # print(res2)\n    # print(type(res2))\n\n\n@pytest.mark.pickle\n@pytest.mark.xfail\n@pytest.mark.parametrize(\"separate_files\", [False, True])\ndef test_delete_cache_file(separate_files):\n    \"\"\"Test pickle core handling of missing cache files.\"\"\"\n    sleeptimes = [0.1, 0.2, 0.3, 0.5, 0.7, 1]\n    sleeptimes = sleeptimes * 4\n    for sleeptime in sleeptimes:\n        if _helper_delete_cache_file(sleeptime, separate_files):\n            return\n    raise AssertionError()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [False, True])\ndef test_clear_being_calculated(separate_files):\n    \"\"\"Test pickle core clear `being calculated` functionality.\"\"\"\n    _takes_time_decorated = _get_decorated_func(\n        _takes_time, separate_files=separate_files\n    )\n    _takes_time_decorated.clear_being_calculated()\n\n\ndef _error_throwing_func(arg1):\n    if not hasattr(_error_throwing_func, \"count\"):\n        _error_throwing_func.count = 0\n    _error_throwing_func.count += 1\n    if _error_throwing_func.count > 1:\n        raise ValueError(\"Tiny Rick!\")\n    return 7\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_error_throwing_func(separate_files):\n    # with\n    _error_throwing_func.count = 0\n    _error_throwing_func_decorated = _get_decorated_func(\n        _error_throwing_func,\n        stale_after=timedelta(seconds=1),\n        next_time=True,\n        separate_files=separate_files,\n    )\n    _error_throwing_func_decorated.clear_cache()\n    res1 = _error_throwing_func_decorated(4)\n    sleep(1.5)\n    res2 = _error_throwing_func_decorated(4)\n    assert res1 == res2\n\n\n# test custom cache dir for pickle core\n\nCUSTOM_DIR = \"~/.exparrot\"\nEXPANDED_CUSTOM_DIR = os.path.expanduser(CUSTOM_DIR)\n\n\ndef _takes_2_seconds_custom_dir(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)\n    return f\"arg_1:{arg_1}, arg_2:{arg_2}\"\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_core_custom_cache_dir(separate_files):\n    \"\"\"Basic Pickle core functionality.\"\"\"\n    _takes_2_seconds_custom_dir_decorated = _get_decorated_func(\n        _takes_2_seconds_custom_dir,\n        next_time=False,\n        cache_dir=CUSTOM_DIR,\n        separate_files=separate_files,\n    )\n    _takes_2_seconds_custom_dir_decorated.clear_cache()\n    _takes_2_seconds_custom_dir_decorated(\"a\", \"b\")\n    start = time()\n    _takes_2_seconds_custom_dir_decorated(\"a\", \"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds_custom_dir_decorated.clear_cache()\n    path2test = _takes_2_seconds_custom_dir_decorated.cache_dpath()\n    assert path2test == EXPANDED_CUSTOM_DIR\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_callable_hash_param(separate_files):\n    def _hash_func(args, kwargs):\n        def _hash(obj):\n            if isinstance(obj, pd.core.frame.DataFrame):\n                return hashlib.sha256(\n                    pd.util.hash_pandas_object(obj).values.tobytes()\n                ).hexdigest()\n            return obj\n\n        k_args = tuple(map(_hash, args))\n        k_kwargs = tuple(\n            sorted({k: _hash(v) for k, v in kwargs.items()}.items())\n        )\n        return k_args + k_kwargs\n\n    @cachier(hash_func=_hash_func, separate_files=separate_files)\n    def _params_with_dataframe(*args, **kwargs):\n        \"\"\"Some function.\"\"\"\n        return random()\n\n    _params_with_dataframe.clear_cache()\n\n    df_a = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    df_b = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    value_a = _params_with_dataframe(df_a, 1)\n    value_b = _params_with_dataframe(df_b, 1)\n\n    assert value_a == value_b  # same content --> same key\n\n    value_a = _params_with_dataframe(1, df=df_a)\n    value_b = _params_with_dataframe(1, df=df_b)\n\n    assert value_a == value_b  # same content --> same key\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_mongo_core.py",
      "content": "\"\"\"Testing the MongoDB core of cachier.\"\"\"\n\nimport datetime\nimport hashlib\nimport platform\nimport queue\nimport sys\nimport threading\nfrom datetime import timedelta\nfrom random import random\nfrom time import sleep\nfrom urllib.parse import quote_plus\n\nimport pandas as pd\nimport pymongo\nimport pytest\nfrom birch import Birch  # type: ignore[import-not-found]\nfrom cachier import cachier\nfrom cachier.cores.base import RecalculationNeeded\nfrom cachier.cores.mongo import _MongoCore\nfrom pymongo.errors import OperationFailure\nfrom pymongo.mongo_client import MongoClient\nfrom pymongo_inmemory import MongoClient as InMemoryMongoClient\n\n# === Enables testing vs a real MongoDB instance ===\n\n\nclass CfgKey:\n    HOST = \"TEST_HOST\"\n    PORT = \"TEST_PORT\"\n    # UNAME = \"TEST_USERNAME\"\n    # PWD = \"TEST_PASSWORD\"\n    # DB = \"TEST_DB\"\n    TEST_VS_DOCKERIZED_MONGO = \"TEST_VS_DOCKERIZED_MONGO\"\n\n\nCFG = Birch(\n    namespace=\"cachier\",\n    defaults={CfgKey.TEST_VS_DOCKERIZED_MONGO: False},\n)\n\n\n# URI_TEMPLATE = \"mongodb://myUser:myPassword@localhost:27017/\"\nURI_TEMPLATE = \"mongodb://{host}:{port}?retrywrites=true&w=majority\"\n\n\ndef _get_cachier_db_mongo_client():\n    host = quote_plus(CFG[CfgKey.HOST])\n    port = quote_plus(CFG[CfgKey.PORT])\n    # uname = quote_plus(CFG[CfgKey.UNAME])\n    # pwd = quote_plus(CFG[CfgKey.PWD])\n    # db = quote_plus(CFG[CfgKey.DB])\n    uri = f\"mongodb://{host}:{port}?retrywrites=true&w=majority\"\n    return MongoClient(uri)\n\n\n_COLLECTION_NAME = (\n    f\"cachier_test_{platform.system()}\"\n    f\"_{'.'.join(map(str, sys.version_info[:3]))}\"\n)\n\n\ndef _test_mongetter():\n    if not hasattr(_test_mongetter, \"client\"):\n        if str(CFG.mget(CfgKey.TEST_VS_DOCKERIZED_MONGO)).lower() == \"true\":\n            print(\"Using live MongoDB instance for testing.\")\n            _test_mongetter.client = _get_cachier_db_mongo_client()\n        else:\n            print(\"Using in-memory MongoDB instance for testing.\")\n            _test_mongetter.client = InMemoryMongoClient()\n    db_obj = _test_mongetter.client[\"cachier_test\"]\n    if _COLLECTION_NAME not in db_obj.list_collection_names():\n        db_obj.create_collection(_COLLECTION_NAME)\n    return db_obj[_COLLECTION_NAME]\n\n\n# === Mongo core tests ===\n\n\n@pytest.mark.mongo\ndef test_information():\n    print(\"\\npymongo version: \", end=\"\")\n    print(pymongo.__version__)\n\n\n@pytest.mark.mongo\ndef test_mongo_index_creation():\n    \"\"\"Basic Mongo core functionality.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _test_mongo_caching(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    collection = _test_mongetter()\n    _test_mongo_caching.clear_cache()\n    val1 = _test_mongo_caching(1, 2)\n    val2 = _test_mongo_caching(1, 2)\n    assert val1 == val2\n    assert _MongoCore._INDEX_NAME in collection.index_information()\n\n\n@pytest.mark.mongo\ndef test_mongo_core():\n    \"\"\"Basic Mongo core functionality.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _test_mongo_caching(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _test_mongo_caching.clear_cache()\n    val1 = _test_mongo_caching(1, 2)\n    val2 = _test_mongo_caching(1, 2)\n    assert val1 == val2\n    val3 = _test_mongo_caching(1, 2, cachier__skip_cache=True)\n    assert val3 != val1\n    val4 = _test_mongo_caching(1, 2)\n    assert val4 == val1\n    val5 = _test_mongo_caching(1, 2, cachier__overwrite_cache=True)\n    assert val5 != val1\n    val6 = _test_mongo_caching(1, 2)\n    assert val6 == val5\n\n\n@pytest.mark.mongo\ndef test_mongo_core_keywords():\n    \"\"\"Basic Mongo core functionality with keyword arguments.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _test_mongo_caching(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _test_mongo_caching.clear_cache()\n    val1 = _test_mongo_caching(1, arg_2=2)\n    val2 = _test_mongo_caching(1, arg_2=2)\n    assert val1 == val2\n    val3 = _test_mongo_caching(1, arg_2=2, cachier__skip_cache=True)\n    assert val3 != val1\n    val4 = _test_mongo_caching(1, arg_2=2)\n    assert val4 == val1\n    val5 = _test_mongo_caching(1, arg_2=2, cachier__overwrite_cache=True)\n    assert val5 != val1\n    val6 = _test_mongo_caching(1, arg_2=2)\n    assert val6 == val5\n\n\nMONGO_DELTA = timedelta(seconds=3)\nMONGO_DELTA_LONG = timedelta(seconds=10)\n\n\n@pytest.mark.mongo\ndef test_mongo_stale_after():\n    \"\"\"Testing MongoDB core stale_after functionality.\"\"\"\n\n    @cachier(\n        mongetter=_test_mongetter, stale_after=MONGO_DELTA, next_time=False\n    )\n    def _stale_after_mongo(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _stale_after_mongo.clear_cache()\n    val1 = _stale_after_mongo(1, 2)\n    val2 = _stale_after_mongo(1, 2)\n    assert val1 == val2\n    sleep(3)\n    val3 = _stale_after_mongo(1, 2)\n    assert val3 != val1\n\n\ndef _calls_takes_time(res_queue):\n    @cachier(mongetter=_test_mongetter)\n    def _takes_time(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        sleep(3)\n        return random() + arg_1 + arg_2\n\n    res = _takes_time(34, 82.3)\n    res_queue.put(res)\n\n\n@pytest.mark.mongo\ndef test_mongo_being_calculated():\n    \"\"\"Testing MongoDB core handling of being calculated scenarios.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _takes_time(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        sleep(3)\n        return random() + arg_1 + arg_2\n\n    _takes_time.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread1.start()\n    sleep(1)\n    thread2.start()\n    thread1.join(timeout=4)\n    thread2.join(timeout=4)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\nclass _BadMongoCollection:\n    def __init__(self, mongetter):\n        self.collection = mongetter()\n        self.index_information = self.collection.index_information\n        self.create_indexes = self.collection.create_indexes\n        self.find_one = self.collection.find_one\n\n    def delete_many(self, *args, **kwargs):\n        pass\n\n    def update_many(self, *args, **kwargs):\n        pass\n\n    def update_one(self, *args, **kwargs):\n        raise OperationFailure(Exception())\n\n\ndef _bad_mongetter():\n    return _BadMongoCollection(_test_mongetter)\n\n\n@pytest.mark.mongo\ndef test_mongo_write_failure():\n    \"\"\"Testing MongoDB core handling of writing failure scenarios.\"\"\"\n\n    @cachier(mongetter=_bad_mongetter)\n    def _func_w_bad_mongo(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    with pytest.raises(OperationFailure):\n        val1 = _func_w_bad_mongo(1, 2)\n        val2 = _func_w_bad_mongo(1, 2)\n        assert val1 == val2\n\n\n@pytest.mark.mongo\ndef test_mongo_clear_being_calculated():\n    \"\"\"Testing MongoDB core clear_being_calculated.\"\"\"\n\n    @cachier(mongetter=_bad_mongetter)\n    def _func_w_bad_mongo(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _func_w_bad_mongo.clear_being_calculated()\n\n\n@pytest.mark.mongo\ndef test_stalled_mongo_db_cache():\n    @cachier(mongetter=_test_mongetter)\n    def _stalled_func():\n        return 1\n\n    core = _MongoCore(None, _test_mongetter, 0)\n    core.set_func(_stalled_func)\n    core.clear_cache()\n    with pytest.raises(RecalculationNeeded):\n        core.wait_on_entry_calc(key=None)\n\n\n@pytest.mark.mongo\ndef test_stalled_mong_db_core(monkeypatch):\n    def mock_get_entry(self, args, kwargs):\n        return \"key\", {\"being_calculated\": True}\n\n    def mock_get_entry_by_key(self, key):\n        return \"key\", None\n\n    monkeypatch.setattr(\n        \"cachier.cores.mongo._MongoCore.get_entry\", mock_get_entry\n    )\n    monkeypatch.setattr(\n        \"cachier.cores.mongo._MongoCore.get_entry_by_key\",\n        mock_get_entry_by_key,\n    )\n\n    @cachier(mongetter=_test_mongetter)\n    def _stalled_func():\n        return 1\n\n    res = _stalled_func()\n    assert res == 1\n\n    def mock_get_entry_2(self, args, kwargs):\n        entry = {\n            \"being_calculated\": True,\n            \"value\": 1,\n            \"time\": datetime.datetime.now() - datetime.timedelta(seconds=10),\n        }\n        return \"key\", entry\n\n    monkeypatch.setattr(\n        \"cachier.cores.mongo._MongoCore.get_entry\", mock_get_entry_2\n    )\n\n    stale_after = datetime.timedelta(seconds=1)\n\n    @cachier(mongetter=_test_mongetter, stale_after=stale_after)\n    def _stalled_func_2():\n        \"\"\"Testing stalled function.\"\"\"\n        return 2\n\n    res = _stalled_func_2()\n    assert res == 2\n\n    @cachier(\n        mongetter=_test_mongetter, stale_after=stale_after, next_time=True\n    )\n    def _stalled_func_3():\n        \"\"\"Testing stalled function.\"\"\"\n        return 3\n\n    res = _stalled_func_3()\n    assert res == 1\n\n\n@pytest.mark.mongo\ndef test_callable_hash_param():\n    def _hash_func(args, kwargs):\n        def _hash(obj):\n            if isinstance(obj, pd.core.frame.DataFrame):\n                return hashlib.sha256(\n                    pd.util.hash_pandas_object(obj).values.tobytes()\n                ).hexdigest()\n            return obj\n\n        k_args = tuple(map(_hash, args))\n        k_kwargs = tuple(\n            sorted({k: _hash(v) for k, v in kwargs.items()}.items())\n        )\n        return k_args + k_kwargs\n\n    @cachier(mongetter=_test_mongetter, hash_func=_hash_func)\n    def _params_with_dataframe(*args, **kwargs):\n        \"\"\"Some function.\"\"\"\n        return random()\n\n    _params_with_dataframe.clear_cache()\n\n    df_a = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    df_b = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    value_a = _params_with_dataframe(df_a, 1)\n    value_b = _params_with_dataframe(df_b, 1)\n\n    assert value_a == value_b  # same content --> same key\n\n    value_a = _params_with_dataframe(1, df=df_a)\n    value_b = _params_with_dataframe(1, df=df_b)\n\n    assert value_a == value_b  # same content --> same key\n"
    },
    {
      "path": "python-cachier_cachier/tests/requirements.txt",
      "content": "# todo: add some version range or pinning latest versions\n# tests and coverages\npytest\ncoverage\npytest-cov\nbirch\n# to connect to the test mongodb server\npymongo\ndnspython\npymongo-inmemory\n# to test pandas dataframe as-param hashing with mongodb core\npandas\n# to be able to run `python setup.py checkdocs`\ncollective.checkdocs\npygments\n"
    }
  ],
  "Patch": "--- a/python-cachier_cachier/src/cachier/cores/memory.py\n+++ b/python-cachier_cachier/src/cachier/cores/memory.py\n@@ -80,7 +80,7 @@\n \n     def clear_cache(self):\n         with self.lock:\n-            pass\n+            self.cache.clear()\n \n     def clear_being_calculated(self):\n         with self.lock:\n",
  "BuggyCodeLocation": [
    {
      "file": "python-cachier_cachier/src/cachier/cores/memory.py",
      "function": null,
      "content_all": {
        "83": "            pass\n"
      },
      "content_change": {
        "83": "            pass\n"
      }
    }
  ],
  "Source": "Human",
  "Command": [
    "pytest tests"
  ],
  "Token": 1269,
  "FilteredCode": [
    {
      "path": "python-cachier_cachier/src/cachier/cores/memory.py",
      "content": "1 \"\"\"A memory-based caching core for cachier.\"\"\"\n2 \n3 import threading\n4 from datetime import datetime\n5 \n6 from .._types import HashFunc\n7 from .base import _BaseCore, _get_func_str\n8 \n9 \n10 class _MemoryCore(_BaseCore):\n11     \"\"\"The memory core class for cachier.\"\"\"\n12 \n13     def __init__(self, hash_func: HashFunc, wait_for_calc_timeout: int):\n14         super().__init__(hash_func, wait_for_calc_timeout)\n15         self.cache = {}\n16 \n17     def _hash_func_key(self, key):\n18         return f\"{_get_func_str(self.func)}:{key}\"\n19 \n20     def get_entry_by_key(self, key, reload=False):\n21         with self.lock:\n22             return key, self.cache.get(self._hash_func_key(key), None)\n23 \n24     def set_entry(self, key, func_res):\n25         with self.lock:\n26             try:\n27                 # we need to retain the existing condition so that\n28                 # mark_entry_not_calculated can notify all possibly-waiting\n29                 # threads about it\n30                 cond = self.cache[self._hash_func_key(key)][\"condition\"]\n31             except KeyError:  # pragma: no cover\n32                 cond = None\n33             self.cache[self._hash_func_key(key)] = {\n34                 \"value\": func_res,\n35                 \"time\": datetime.now(),\n36                 \"stale\": False,\n37                 \"being_calculated\": False,\n38                 \"condition\": cond,\n39             }\n40 \n41     def mark_entry_being_calculated(self, key):\n42         with self.lock:\n43             condition = threading.Condition()\n44             # condition.acquire()\n45             try:\n46                 self.cache[self._hash_func_key(key)][\"being_calculated\"] = True\n47                 self.cache[self._hash_func_key(key)][\"condition\"] = condition\n48             except KeyError:\n49                 self.cache[self._hash_func_key(key)] = {\n50                     \"value\": None,\n51                     \"time\": datetime.now(),\n52                     \"stale\": False,\n53                     \"being_calculated\": True,\n54                     \"condition\": condition,\n55                 }\n56 \n57     def mark_entry_not_calculated(self, key):\n58         with self.lock:\n59             try:\n60                 entry = self.cache[self._hash_func_key(key)]\n61             except KeyError:  # pragma: no cover\n62                 return  # that's ok, we don't need an entry in that case\n63             entry[\"being_calculated\"] = False\n64             cond = entry[\"condition\"]\n65             if cond:\n66                 cond.acquire()\n67                 cond.notify_all()\n68                 cond.release()\n69                 entry[\"condition\"] = None\n70 \n71     def wait_on_entry_calc(self, key):\n72         with self.lock:  # pragma: no cover\n73             entry = self.cache[self._hash_func_key(key)]\n74             if not entry[\"being_calculated\"]:\n75                 return entry[\"value\"]\n76         entry[\"condition\"].acquire()\n77         entry[\"condition\"].wait()\n78         entry[\"condition\"].release()\n79         return self.cache[self._hash_func_key(key)][\"value\"]\n80 \n81     def clear_cache(self):\n82         with self.lock:\n83             pass\n84 \n85     def clear_being_calculated(self):\n86         with self.lock:\n87             for entry in self.cache.values():\n88                 entry[\"being_calculated\"] = False\n89                 entry[\"condition\"] = None"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/pickle.py",
      "content": "1 \"\"\"A pickle-based caching core for cachier.\"\"\"\n2 \n3 # This file is part of Cachier.\n4 # https://github.com/python-cachier/cachier\n5 \n6 # Licensed under the MIT license:\n7 # http://www.opensource.org/licenses/MIT-license\n8 # Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n9 import os\n10 import pickle  # for local caching\n11 from contextlib import suppress\n12 from datetime import datetime\n13 \n14 import portalocker  # to lock on pickle cache IO\n15 from watchdog.events import PatternMatchingEventHandler\n16 from watchdog.observers import Observer\n17 \n18 from .._types import HashFunc\n19 from ..config import _update_with_defaults\n20 \n21 # Alternative:  https://github.com/WoLpH/portalocker\n22 from .base import _BaseCore\n23 \n24 \n25 class _PickleCore(_Base(...truncated)"
    },
    {
      "path": "python-cachier_cachier/README.rst",
      "content": "1 Cachier\n2 #######\n3 \n4 |PyPI-Status| |Downloads| |PyPI-Versions| |Build-Status| |Codecov| |Codefactor| |LICENCE|\n5 \n6 Persistent, stale-free, local and cross-machine caching for Python functions.\n7 \n8 .. code-block:: python\n9 \n10   from cachier import cachier\n11   import datetime\n12 \n13   @cachier(stale_after=datetime.timedelta(days=3))\n14   def foo(arg1, arg2):\n15     \"\"\"foo now has a persistent cache, triggering recalculation for values stored more than 3 days.\"\"\"\n16     return {'arg1': arg1, 'arg2': arg2}\n17 \n18 \n19 .. role(...truncated)"
    },
    {
      "path": "python-cachier_cachier/src/cachier/core.py",
      "content": "1 \"\"\"Persistent, stale-free memoization decorators for Python.\"\"\"\n2 \n3 # This file is part of Cachier.\n4 # https://github.com/python-cachier/cachier\n5 \n6 # Licensed under the MIT license:\n7 # http://www.opensource.org/licenses/MIT-license\n8 # Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n9 \n10 import datetime\n11 import in(...truncated)"
    }
  ],
  "TokenAll": 34855,
  "FilteredLevel": 1500,
  "Results": {
    "model": "GPT-4o",
    "Difficulty": "Simple",
    "issue_origin": {
      "title": "Potential race condition due to missing lock initialization",
      "description": "The _MemoryCore class uses self.lock in several methods (e.g., get_entry_by_key, set_entry) but the lock is never initialized in the __init__ method. This can lead to potential race conditions. Please initialize self.lock in the __init__ method.",
      "explanation": "### Summary of the Issue\n\nThe issue reported is a potential race condition within a class named `_MemoryCore`, which is part of the memory-based cache implementation in the \"python-cachier\" library. The class makes use of a lock (`self.lock`) in several methods to ensure thread safety, but this lock is never initialized in the `__init__` method. This oversight could lead to race conditions, where multiple threads might access or modify shared resources simultaneously, producing unpredictable results.\n\n### Code Analysis and Issue Location\n\nThe specific portion of the code where the issue occurs is detailed below:\n\n```python\n10 class _MemoryCore(_BaseCore):\n11     \"\"\"The memory core class for cachier.\"\"\"\n12 \n13     def __init__(self, hash_func: HashFunc, wait_for_calc_timeout: int):\n14         super().__init__(hash_func, wait_for_calc_timeout)\n15         self.cache = {}\n16 \n17     def _hash_func_key(self, key):\n18         return f\"{_get_func_str(self.func)}:{key}\"\n19 \n20     def get_entry_by_key(self, key, reload=False):\n21         with self.lock:\n22             return key, self.cache.get(self._hash_func_key(key), None)\n23 \n24     def set_entry(self, key, func_res):\n25         with self.lock:\n26             try:\n27                 # retaining the existing condition so that\n28                 # mark_entry_not_calculated can notify all possibly-waiting\n29                 # threads about it\n30                 cond = self.cache[self._hash_func_key(key)][\"condition\"]\n31             except KeyError:\n32                 cond = None\n33             self.cache[self._hash_func_key(key)] = {\n34                 \"value\": func_res,\n35                 \"time\": datetime.now(),\n36                 \"stale\": False,\n37                 \"being_calculated\": False,\n38                 \"condition\": cond,\n39             }\n```\n\nIn the lines above, we observe that `self.lock` is used in the `get_entry_by_key`, `set_entry`, `mark_entry_being_calculated`, `mark_entry_not_calculated`, `wait_on_entry_calc`, `clear_cache`, and `clear_being_calculated` methods. However, no line in the `__init__` method initializes this lock.\n\n### Commit to Solve the Issue\n\nTo resolve the issue, a commit that initializes `self.lock` in the `__init__` method is necessary. The revised code would look like this:\n\n```python\n10 class _MemoryCore(_BaseCore):\n11     \"\"\"The memory core class for cachier.\"\"\"\n12 \n13     def __init__(self, hash_func: HashFunc, wait_for_calc_timeout: int):\n14         super().__init__(hash_func, wait_for_calc_timeout)\n15         self.cache = {}\n16         self.lock = threading.Lock()  # Initialization of the lock\n\n# Rest of the class remains the same\n```\n\n### Explanation of the Solution\n\n- **Initialization of `self.lock`:** By adding `self.lock = threading.Lock()` in the `__init__` method, we ensure that the lock is instantiated when an object of `_MemoryCore` class is created. This lock will be used to synchronize access to the `cache` between multiple threads.\n\n- **Ensuring Thread Safety:**\n  - **`with self.lock:`** statements ensure that a thread acquires the lock before entering the critical section (code that interacts with `cache`) and releases it afterwards. This way, only one thread can access the critical section at a time, preventing race conditions.\n  - Every method that operates on shared resources in the `_MemoryCore` class (like `cache`) acquires this lock to guarantee that the resource's state is consistent and not concurrently modified by another thread.\n\n### Summary\n\nThe cause of the issue was the missing initialization of `self.lock` in the `_MemoryCore` class, which led to the possibility of race conditions due to unsynchronized access to shared resources. The commit introduced the initialization of `self.lock` within the `__init__` method, thus ensuring the thread safety of methods that manipulate shared resources by ensuring only one thread can operate on the resources at a given time. This fix is crucial for the reliable functioning of the caching mechanism in multi-threaded environments."
    },
    "issue_message": {
      "title": "Undefined `lock` attribute in _MemoryCore class",
      "description": "In the _MemoryCore class defined in 'python-cachier_cachier/src/cachier/cores/memory.py', the `lock` attribute is used in multiple methods (e.g., get_entry_by_key, set_entry, etc.), but it is never defined or initialized in the class. Please initialize a threading.Lock() object as `self.lock` in the __init__ method.",
      "explanation": "### Issue Summary\nThe issue reported pertains to the `_MemoryCore` class in the `memory.py` file, where the code uses a `lock` attribute for thread synchronization (using the `with self.lock` statement) but the `lock` attribute is never defined or initialized in the class. This leads to an `AttributeError` when any method attempts to use `self.lock`.\n\n### Content of the Commit\nTo fix this issue, we need to:\n\n1. Import the `threading` module (if not already imported).\n2. Initialize a `threading.Lock()` object as `self.lock` within the `__init__` method of the `_MemoryCore` class.\n\n### Solution Implementation\nHere's how the code should be modified:\n\n#### Step 1: Add the Correct Import\nEnsure that the `threading` module is imported (already present in the code).\n\n#### Step 2: Initialize `self.lock` in the `__init__` Method\nAdd the initialization of `self.lock` within the constructor (`__init__` method) of the `_MemoryCore` class.\n\nHere is the modified code showing the necessary changes:\n\n```python\n1  \"\"\"A memory-based caching core for cachier.\"\"\"\n2  \n3  import threading\n4  from datetime import datetime\n5  \n6  from .._types import HashFunc\n7  from .base import _BaseCore, _get_func_str\n8  \n9  \n10 class _MemoryCore(_BaseCore):\n11     \"\"\"The memory core class for cachier.\"\"\"\n12  \n13     def __init__(self, hash_func: HashFunc, wait_for_calc_timeout: int):\n14         super().__init__(hash_func, wait_for_calc_timeout)\n15         self.cache = {}\n16         self.lock = threading.Lock()  # Initialize the lock here\n17  \n18     def _hash_func_key(self, key):\n19         return f\"{_get_func_str(self.func)}:{key}\"\n20  \n21     def get_entry_by_key(self, key, reload=False):\n22         with self.lock:\n23             return key, self.cache.get(self._hash_func_key(key), None)\n24  \n25     def set_entry(self, key, func_res):\n26         with self.lock:\n27             try:\n28                 # we need to retain the existing condition so that\n29                 # mark_entry_not_calculated can notify all possibly-waiting\n30                 # threads about it\n31                 cond = self.cache[self._hash_func_key(key)][\"condition\"]\n32             except KeyError:  # pragma: no cover\n33                 cond = None\n34             self.cache[self._hash_func_key(key)] = {\n35                 \"value\": func_res,\n36                 \"time\": datetime.now(),\n37                 \"stale\": False,\n38                 \"being_calculated\": False,\n39                 \"condition\": cond,\n40             }\n41  \n42     def mark_entry_being_calculated(self, key):\n43         with self.lock:\n44             condition = threading.Condition()\n45             # condition.acquire()\n46             try:\n47                 self.cache[self._hash_func_key(key)][\"being_calculated\"] = True\n48                 self.cache[self._hash_func_key(key)][\"condition\"] = condition\n49             except KeyError:\n50                 self.cache[self._hash_func_key(key)] = {\n51                     \"value\": None,\n52                     \"time\": datetime.now(),\n53                     \"stale\": False,\n54                     \"being_calculated\": True,\n55                     \"condition\": condition,\n56                 }\n57  \n58     def mark_entry_not_calculated(self, key):\n59         with self.lock:\n60             try:\n61                 entry = self.cache[self._hash_func_key(key)]\n62             except KeyError:  # pragma: no cover\n63                 return  # that's ok, we don't need an entry in that case\n64             entry[\"being_calculated\"] = False\n65             cond = entry[\"condition\"]\n66             if cond:\n67                 cond.acquire()\n68                 cond.notify_all()\n69                 cond.release()\n70                 entry[\"condition\"] = None\n71  \n72     def wait_on_entry_calc(self, key):\n73         with self.lock:  # pragma: no cover\n74             entry = self.cache[self._hash_func_key(key)]\n75             if not entry[\"being_calculated\"]:\n76                 return entry[\"value\"]\n77         entry[\"condition\"].acquire()\n78         entry[\"condition\"].wait()\n79         entry[\"condition\"].release()\n80         return self.cache[self._hash_func_key(key)][\"value\"]\n81  \n82     def clear_cache(self):\n83         with self.lock:\n84             pass\n85  \n86     def clear_being_calculated(self):\n87         with self.lock:\n88             for entry in self.cache.values():\n89                 entry[\"being_calculated\"] = False\n90                 entry[\"condition\"] = None\n```\n\n### How the Commit Solves the Issue\n- **Initialization of `self.lock`**: The"
    },
    "issue_ground": {
      "title": "Cache Clearing Functionality Not Working as Expected",
      "description": "It has been observed that the `clear_cache` function in the `cachier` library does not perform its expected behavior of clearing the in-memory cache. This issue could lead to stale or outdated data being served from the cache, as the clear_cache function currently does nothing (`pass` statement). This can cause significant problems in scenarios where the cache needs to be invalidated and refreshed. Users relying on the clear_cache method to manually invalidate cache entries are currently unable to do so, resulting in potential data inconsistency. The function should be updated to properly clear the cache.",
      "explanation": "### Summary of the Issue\n\nThe issue reported pertains to the `clear_cache` function in the `cachier` library's memory-based caching core (`_MemoryCore`). The function is intended to clear the in-memory cache, but it currently does nothing, as it only contains a `pass` statement. Consequently, this means the function fails to invalidate and refresh the cache, potentially leading to stale or outdated data being served. Users may experience data inconsistency when they rely on this method to manually clear the cache entries.\n\n### Explanation of the Issue\n\nThe `clear_cache` function is defined on line 81 in the `memory.py` file, but instead of performing its intended function, it merely contains a `pass` statement:\n\n```python\n81 def clear_cache(self):\n82     with self.lock:\n83         pass\n```\n\n### Detailed Content of the Commit to Address the Issue\n\nTo fix the issue, the commit should replace the `pass` statement with logic to actually clear the cache. The modified `clear_cache` function might look like this:\n\n```python\n81 def clear_cache(self):\n82     with self.lock:\n83         self.cache.clear()  # Clear the cache dictionary\n```\n\nHere's a breakdown of these changes:\n\n1. **Line 83**: `self.cache.clear()` – This simple command effectively removes all entries from the dictionary that represents the in-memory cache.\n\n### How the Commit Solves the Issue\n\nBy changing the implementation of `clear_cache` to use `self.cache.clear()`, the function now efficiently performs its intended task of clearing the cache:\n\n- **Acquiring the Lock**: The function begins by acquiring a lock (`with self.lock:`) to ensure thread-safety when clearing the cache. This aligns with the usage of locks in other methods like `set_entry`.\n  \n- **Clearing the Cache**: The `self.cache.clear()` command empties the dictionary, effectively removing all stored cache entries.\n\nThis solution ensures users can manually clear out stale or outdated data from the cache, thereby preventing potential data inconsistencies or the serving of outdated information. The fix makes the `clear_cache` method functional and reliable, adhering to its designed purpose of cache invalidation."
    },
    "issue_ground_truth": {
      "title": "Cache Clearing Functionality Not Working as Expected",
      "description": "It has been observed that the `clear_cache` function in the `cachier` library does not perform its expected behavior of clearing the in-memory cache. This issue could lead to stale or outdated data being served from the cache, as the clear_cache function currently does nothing (`pass` statement). This can cause significant problems in scenarios where the cache needs to be invalidated and refreshed. Users relying on the clear_cache method to manually invalidate cache entries are currently unable to do so, resulting in potential data inconsistency. The function should be updated to properly clear the cache.",
      "explanation": "### Issue Summary\n\nThe issue at hand pertains to the `clear_cache` function within the `cachier` library. This function is intended to clear in-memory caches to prevent the use of stale or outdated data. However, the current implementation of `clear_cache` is non-functional as it contains merely a `pass` statement, which essentially means no operation is performed. This malfunction can lead to scenarios where users are unable to invalidate and refresh cache entries manually, resulting in potential data inconsistencies.\n\n### Detailed Explanation of the Issue\n\n1. **Cause of the Issue**:\n    - The `clear_cache` function is designed to ensure that the cache can be reset or cleared, allowing for the relevant data to be recalculated and updated. However, in the current implementation, the function does nothing because it contains just a `pass` statement.\n    - This issue becomes critical because if data within the cache becomes outdated or invalid, there's no mechanism to clear the cache, which means that stale data will continue to be served to users and applications relying on this cache.\n\n2. **Effects**:\n    - **Data Consistency Problems**: Users and applications may receive stale data from the cache, leading to inconsistencies and potential errors in data processing or user experience.\n    - **Manual Invalidations**: Users who attempt to clear the cache manually via `clear_cache` will find that their requests to invalidate the cache have no effect, leading to frustration and potential workaround hacks that may not be optimal.\n\n### Detailed Explanation of the Commit\n\n1. **Commit Changes**:\n    - The developer updated the `clear_cache` method within the `cachier/cores/memory.py` file.\n    - Instead of the method containing a `pass` statement, it now includes functionality to actually clear the cache by invoking the `clear` method on the cache dictionary.\n\n2. **How the Commit Solves the Issue**:\n    - **Implementing Cache Clearing**: By replacing the `pass` statement with a call to clear the cache (`self.cache.clear()`), the commit ensures that invoking `clear_cache` will now effectively reset the in-memory cache.\n    - **Enabling Manual Cache Invalidations**: Users can now call `clear_cache` to invalidate the cache entries. This is crucial for scenarios where the cache data needs to be purged to prevent serving outdated information.\n    - **Improving Data Reliability**: With the ability to clear the cache, the application can maintain more reliable and up-to-date data, thereby improving overall data consistency and reliability.\n\n### Solution Explanation\n\nThe solution to the issue was to modify the `clear_cache` function within the memory core of the `cachier` library to perform its intended function, which is to clear the in-memory cache. The developer did this by replacing the no-op `pass` statement with an actual call to clear the cache.\n\n- **Summary of the Change**: The `clear_cache` function now properly clears the in-memory cache.\n- **Effectiveness of the Change**:\n    - This ensures that the cache can be invalidated as needed.\n    - Users and applications relying on this functionality can now ensure they are not using outdated data.\n    - It enhances the flexibility and reliability of cache management within the `cachier` framework.\n\nBy implementing this change, the developer resolved the issue of non-functional cache clearing, allowing for more robust cache management and ensuring that the caching behavior aligns with user expectations and requirements for data consistency."
    },
    "location_origin": [
      {
        "file": "python-cachier_cachier/src/cachier/cores/memory.py",
        "function": {
          "13": "__init__"
        },
        "content_all": {
          "10": "class _MemoryCore(_BaseCore):",
          "11": "    \"\"\"The memory core class for cachier.\"\"\"",
          "12": "    ",
          "13": "    def __init__(self, hash_func: HashFunc, wait_for_calc_timeout: int):",
          "14": "        super().__init__(hash_func, wait_for_calc_timeout)",
          "15": "        self.cache = {}",
          "16": "        self.lock = threading.Lock()  # Initialization of the lock",
          "17": "    def _hash_func_key(self, key):",
          "18": "        return f\"{_get_func_str(self.func)}:{key}\""
        },
        "content_change": {
          "16": "        self.lock = threading.Lock()  # Initialization of the lock"
        }
      }
    ],
    "location_message": [
      {
        "location": "```json\n[\n    {\n        \"file\": \"python-cachier_cachier/src/cachier/cores/memory.py\",\n        \"function\": {\n            \"13\": \"__init__\"\n        },\n        \"content_all\": {\n            \"10\": \"class _MemoryCore(_BaseCore):\",\n            \"11\": \"    \\\"\\\"\\\"The memory core class for cachier.\\\"\\\"\\\"\",\n            \"12\": \"    \",\n            \"13\": \"    def __init__(self, hash_func: HashFunc, wait_for_calc_timeout: int):\",\n            \"14\": \"        super().__init__(hash_func, wait_for_calc_timeout)\",\n            \"15\": \"        self.cache = {}\",\n            \"16\": \"        self.lock = threading.Lock()  # Initialize the lock here\",\n            \"17\": \"    \",\n            \"18\": \"    def _hash_func_key(self, key):\",\n            \"19\": \"        return f\\\"{_get_func_str(self.func)}:{key}\\\"\",\n            \"20\": \"    \"\n        },\n        \"content_change\": {\n            \"16\": \"        self.lock = threading.Lock()  # Initialize the lock here\"\n        }\n    },\n    {\n        \"file\": \"python-cachier_cachier/src/cachier/cores/memory.py\",\n        \"function\": {\n            \"20\": \"get_entry_by_key\"\n        },\n        \"content_all\": {\n            \"17\": \"    def _hash_func_key(self, key):\",\n            \"18\": \"        return f\\\"{_get_func_str(self.func)}:{key}\\\"\",\n            \"19\": \"    \",\n            \"20\": \"    def get_entry_by_key(self, key, reload=False):\",\n            \"21\": \"        with self.lock:\",\n            \"22\": \"            return key, self.cache.get(self._hash_func_key(key), None)\",\n            \"23\": \"    \",\n            \"24\": \"    def set_entry(self, key, func_res):\",\n            \"25\": \"        with self.lock:\",\n            \"26\": \"            try:\",\n            \"27\": \"                # we need to retain the existing condition so that\"\n        },\n        \"content_change\": {\n            \"21\": \"        with self.lock:\"\n        }\n    },\n    {\n        \"file\": \"python-cachier_cachier/src/cachier/cores/memory.py\",\n        \"function\": {\n            \"24\": \"set_entry\"\n        },\n        \"content_all\": {\n            \"21\": \"        with self.lock:\",\n            \"22\": \"            return key, self.cache.get(self._hash_func_key(key), None)\",\n            \"23\": \"    \",\n            \"24\": \"    def set_entry(self, key, func_res):\",\n            \"25\": \"        with self.lock:\",\n            \"26\": \"            try:\",\n            \"27\": \"                # we need to retain the existing condition so that\",\n            \"28\": \"                # mark_entry_not_calculated can notify all possibly-waiting\",\n            \"29\": \"                # threads about it\"\n        },\n        \"content_change\": {\n            \"25\": \"        with self.lock:\"\n        }\n    },\n    {\n        \"file\": \"python-cachier_cachier/src/cachier/cores/memory.py\",\n        \"function\": {\n            \"41\": \"mark_entry_being_calculated\"\n        },\n        \"content_all\": {\n            \"40\": \"            }\",\n            \"41\": \"    def mark_entry_being_calculated(self, key):\",\n            \"42\": \"        with self.lock:\",\n            \"43\": \"            condition = threading.Condition()\",\n            \"44\": \"            # condition.acquire()\",\n            \"45\": \"            try:\",\n            \"46\": \"                self.cache[self._hash_func_key(key)][\\\"being_calculated\\\"] = True\",\n            \"47\": \"                self.cache[self._hash_func_key(key)][\\\"condition\\\"] = condition\",\n            \"48\": \"            except KeyError:\"\n        },\n        \"content_change\": {\n            \"42\": \"        with self.lock:\"\n        }\n    },\n    {\n        \"file\": \"python-cachier_cachier/src/cachier/cores/memory.py\",\n        \"function\": {\n            \"57\": \"mark_entry_not_calculated\"\n        },\n        \"content_all\": {\n            \"56\": \"            }\",\n            \"57\": \"    def mark_entry_not_calculated(self, key):\",\n            \"58\": \"        with self.lock:\",\n            \"59\": \"            try:\",\n            \"60\": \"                entry = self.cache[self._hash_func_key(key)]\",\n            \"61\": \"            except KeyError:  # pragma: no cover\",\n            \"62\": \"                return  # that's ok, we don't need an entry in that case\",\n            \"63\": \"            entry[\\\"being_calculated\\\"] = False\",\n            \"64\": \"            cond = entry[\\\"condition\\\"]"
      }
    ],
    "location_ground": [
      {
        "file": "python-cachier_cachier/src/cachier/cores/memory.py",
        "function": {
          "81": "clear_cache"
        },
        "content_all": {
          "78": "         entry[\"condition\"].release()\n",
          "79": "         return self.cache[self._hash_func_key(key)][\"value\"]\n",
          "80": " \n",
          "81": "     def clear_cache(self):\n",
          "82": "         with self.lock:\n",
          "83": "             pass\n",
          "84": " \n",
          "85": "     def clear_being_calculated(self):\n",
          "86": "         with self.lock:\n",
          "87": "             for entry in self.cache.values():\n",
          "88": "                 entry[\"being_calculated\"] = False\n"
        },
        "content_change": {
          "83": "             self.cache.clear()  # Clear the cache dictionary\n"
        }
      }
    ],
    "location_ground_exp": [
      {
        "file": "python-cachier_cachier/src/cachier/cores/memory.py",
        "function": {
          "81": "clear_cache"
        },
        "content_all": {
          "78": "        entry[\"condition\"].release()\n",
          "79": "        return self.cache[self._hash_func_key(key)][\"value\"]\n",
          "80": "\n",
          "81": "    def clear_cache(self):\n",
          "82": "        with self.lock:\n",
          "83": "            pass\n",
          "84": "\n",
          "85": "    def clear_being_calculated(self):\n",
          "86": "        with self.lock:\n"
        },
        "content_change": {
          "83": "            self.cache.clear()\n"
        }
      }
    ],
    "location_ground_truth": [
      {
        "file": "python-cachier_cachier/src/cachier/cores/memory.py",
        "function": null,
        "content_all": {
          "83": "            pass\n"
        },
        "content_change": {
          "83": "            pass\n"
        }
      }
    ],
    "patch_i": "\n--- a/python-cachier_cachier/src/cachier/cores/memory.py\n+++ b/python-cachier_cachier/src/cachier/cores/memory.py\n@@ -1,6 +1,7 @@\n \"\"\"A memory-based caching core for cachier.\"\"\"\n \n import threading\n+from datetime import datetime\n \n from .._types import HashFunc\n from .base import _BaseCore, _get_func_str\n@@ -15,6 +16,7 @@\n         super().__init__(hash_func, wait_for_calc_timeout)\n         self.cache = {}\n+        self.lock = threading.Lock()  # Initialization of the lock\n \n     def _hash_func_key(self, key):\n         return f\"{_get_func_str(self.func)}:{key}\"\n",
    "patch_im": "\n--- a/python-cachier_cachier/src/cachier/cores/memory.py\n+++ b/python-cachier_cachier/src/cachier/cores/memory.py\n@@ -1,17 +1,18 @@\n \"\"\"A memory-based caching core for cachier.\"\"\"\n \n import threading\n+from datetime import datetime\n \n from .._types import HashFunc\n from .base import _BaseCore, _get_func_str\n \n \n class _MemoryCore(_BaseCore):\n     \"\"\"The memory core class for cachier.\"\"\"\n \n-    def __init__(self, hash_func: HashFunc, wait_for_calc_timeout: int):\n+   def __init__(self, hash_func: HashFunc, wait_for_calc_timeout: int):\n         super().__init__(hash_func, wait_for_calc_timeout)\n         self.cache = {}\n+        self.lock = threading.Lock()\n \n     def _hash_func_key(self, key):\n         return f\"{_get_func_str(self.func)}:{key}\"\n",
    "patch_il": "\n--- a/python-cachier_cachier/src/cachier/cores/memory.py\n+++ b/python-cachier_cachier/src/cachier/cores/memory.py\n@@ -12,6 +12,7 @@ class _MemoryCore(_BaseCore):\n \n     def __init__(self, hash_func: HashFunc, wait_for_calc_timeout: int):\n         super().__init__(hash_func, wait_for_calc_timeout)\n         self.cache = {}\n         self.lock = threading.Lock()  # Initialization of the lock\n \n     def _hash_func_key(self, key):\n",
    "patch_iml": "\n--- a/python-cachier_cachier/src/cachier/cores/memory.py\n+++ b/python-cachier_cachier/src/cachier/cores/memory.py\n@@ -1,15 +1,16 @@\n \"\"\"A memory-based caching core for cachier.\"\"\"\n \n+import threading\n from datetime import datetime\n \n from .._types import HashFunc\n from .base import _BaseCore, _get_func_str\n \n \n class _MemoryCore(_BaseCore):\n     \"\"\"The memory core class for cachier.\"\"\"\n \n     def __init__(self, hash_func: HashFunc, wait_for_calc_timeout: int):\n         super().__init__(hash_func, wait_for_calc_timeout)\n         self.cache = {}\n+        self.lock = threading.Lock()  # Initialize the lock here\n \n     def _hash_func_key(self, key):\n         return f\"{_get_func_str(self.func)}:{key}\"\n",
    "patch_ground": "\n--- a/python-cachier_cachier/src/cachier/cores/memory.py\n+++ b/python-cachier_cachier/src/cachier/cores/memory.py\n@@ -80,7 +80,7 @@\n         with self.lock:  # pragma: no cover\n             entry = self.cache[self._hash_func_key(key)]\n             if not entry[\"being_calculated\"]:\n                 return entry[\"value\"]\n         entry[\"condition\"].acquire()\n         entry[\"condition\"].wait()\n         entry[\"condition\"].release()\n-        return self.cache[self._hash_func_key(key)][\"value\"]\n+        return self.cache[self._hash_func_key(key)][\"value\"]\n\n     def clear_cache(self):\n         with self.lock:\n-            pass\n+            self.cache.clear()  # Clear the cache dictionary\n\n     def clear_being_calculated(self):\n         with self.lock:\n             for entry in self.cache.values():\n",
    "patch_ground_location": "\n--- a/python-cachier_cachier/src/cachier/cores/memory.py\n+++ b/python-cachier_cachier/src/cachier/cores/memory.py\n@@ -80,7 +80,7 @@\n \n     def clear_cache(self):\n         with self.lock:\n-            pass\n+            self.cache.clear()  # Clear the cache dictionary\n \n     def clear_being_calculated(self):\n         with self.lock:\n",
    "patch_ground_exp": "\n--- python-cachier_cachier/src/cachier/cores/memory.py\n+++ python-cachier_cachier/src/cachier/cores/memory.py\n@@ -81,7 +81,7 @@\n     def clear_cache(self):\n         with self.lock:\n-            pass\n+            self.cache.clear()\n \n     def clear_being_calculated(self):\n         with self.lock:\n",
    "patch_ground_all": "\n--- a/python-cachier_cachier/src/cachier/cores/memory.py\n+++ b/python-cachier_cachier/src/cachier/cores/memory.py\n@@ -81,7 +81,7 @@ def wait_on_entry_calc(self, key):\n \n     def clear_cache(self):\n         with self.lock:\n-            pass\n+            self.cache.clear()\n \n     def clear_being_calculated(self):\n         with self.lock:\n",
    "patch_ground_truth": "--- a/python-cachier_cachier/src/cachier/cores/memory.py\n+++ b/python-cachier_cachier/src/cachier/cores/memory.py\n@@ -80,7 +80,7 @@\n \n     def clear_cache(self):\n         with self.lock:\n-            pass\n+            self.cache.clear()\n \n     def clear_being_calculated(self):\n         with self.lock:\n",
    "message": null,
    "CodeBase": [
      {
        "path": "python-cachier_cachier/src/cachier/cores/memory.py",
        "content": "1 \"\"\"A memory-based caching core for cachier.\"\"\"\n2 \n3 import threading\n4 from datetime import datetime\n5 \n6 from .._types import HashFunc\n7 from .base import _BaseCore, _get_func_str\n8 \n9 \n10 class _MemoryCore(_BaseCore):\n11     \"\"\"The memory core class for cachier.\"\"\"\n12 \n13     def __init__(self, hash_func: HashFunc, wait_for_calc_timeout: int):\n14         super().__init__(hash_func, wait_for_calc_timeout)\n15         self.cache = {}\n16 \n17     def _hash_func_key(self, key):\n18         return f\"{_get_func_str(self.func)}:{key}\"\n19 \n20     def get_entry_by_key(self, key, reload=False):\n21         with self.lock:\n22             return key, self.cache.get(self._hash_func_key(key), None)\n23 \n24     def set_entry(self, key, func_res):\n25         with self.lock:\n26             try:\n27                 # we need to retain the existing condition so that\n28                 # mark_entry_not_calculated can notify all possibly-waiting\n29                 # threads about it\n30                 cond = self.cache[self._hash_func_key(key)][\"condition\"]\n31             except KeyError:  # pragma: no cover\n32                 cond = None\n33             self.cache[self._hash_func_key(key)] = {\n34                 \"value\": func_res,\n35                 \"time\": datetime.now(),\n36                 \"stale\": False,\n37                 \"being_calculated\": False,\n38                 \"condition\": cond,\n39             }\n40 \n41     def mark_entry_being_calculated(self, key):\n42         with self.lock:\n43             condition = threading.Condition()\n44             # condition.acquire()\n45             try:\n46                 self.cache[self._hash_func_key(key)][\"being_calculated\"] = True\n47                 self.cache[self._hash_func_key(key)][\"condition\"] = condition\n48             except KeyError:\n49                 self.cache[self._hash_func_key(key)] = {\n50                     \"value\": None,\n51                     \"time\": datetime.now(),\n52                     \"stale\": False,\n53                     \"being_calculated\": True,\n54                     \"condition\": condition,\n55                 }\n56 \n57     def mark_entry_not_calculated(self, key):\n58         with self.lock:\n59             try:\n60                 entry = self.cache[self._hash_func_key(key)]\n61             except KeyError:  # pragma: no cover\n62                 return  # that's ok, we don't need an entry in that case\n63             entry[\"being_calculated\"] = False\n64             cond = entry[\"condition\"]\n65             if cond:\n66                 cond.acquire()\n67                 cond.notify_all()\n68                 cond.release()\n69                 entry[\"condition\"] = None\n70 \n71     def wait_on_entry_calc(self, key):\n72         with self.lock:  # pragma: no cover\n73             entry = self.cache[self._hash_func_key(key)]\n74             if not entry[\"being_calculated\"]:\n75                 return entry[\"value\"]\n76         entry[\"condition\"].acquire()\n77         entry[\"condition\"].wait()\n78         entry[\"condition\"].release()\n79         return self.cache[self._hash_func_key(key)][\"value\"]\n80 \n81     def clear_cache(self):\n82         with self.lock:\n83             pass\n84 \n85     def clear_being_calculated(self):\n86         with self.lock:\n87             for entry in self.cache.values():\n88                 entry[\"being_calculated\"] = False\n89                 entry[\"condition\"] = None"
      },
      {
        "path": "python-cachier_cachier/src/cachier/cores/pickle.py",
        "content": "1 \"\"\"A pickle-based caching core for cachier.\"\"\"\n2 \n3 # This file is part of Cachier.\n4 # https://github.com/python-cachier/cachier\n5 \n6 # Licensed under the MIT license:\n7 # http://www.opensource.org/licenses/MIT-license\n8 # Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n9 import os\n10 import pickle  # for local caching\n11 from contextlib import suppress\n12 from datetime import datetime\n13 \n14 import portalocker  # to lock on pickle cache IO\n15 from watchdog.events import PatternMatchingEventHandler\n16 from watchdog.observers import Observer\n17 \n18 from .._types import HashFunc\n19 from ..config import _update_with_defaults\n20 \n21 # Alternative:  https://github.com/WoLpH/portalocker\n22 from .base import _BaseCore\n23 \n24 \n25 class _PickleCore(_Base(...truncated)"
      },
      {
        "path": "python-cachier_cachier/README.rst",
        "content": "1 Cachier\n2 #######\n3 \n4 |PyPI-Status| |Downloads| |PyPI-Versions| |Build-Status| |Codecov| |Codefactor| |LICENCE|\n5 \n6 Persistent, stale-free, local and cross-machine caching for Python functions.\n7 \n8 .. code-block:: python\n9 \n10   from cachier import cachier\n11   import datetime\n12 \n13   @cachier(stale_after=datetime.timedelta(days=3))\n14   def foo(arg1, arg2):\n15     \"\"\"foo now has a persistent cache, triggering recalculation for values stored more than 3 days.\"\"\"\n16     return {'arg1': arg1, 'arg2': arg2}\n17 \n18 \n19 .. role(...truncated)"
      },
      {
        "path": "python-cachier_cachier/src/cachier/core.py",
        "content": "1 \"\"\"Persistent, stale-free memoization decorators for Python.\"\"\"\n2 \n3 # This file is part of Cachier.\n4 # https://github.com/python-cachier/cachier\n5 \n6 # Licensed under the MIT license:\n7 # http://www.opensource.org/licenses/MIT-license\n8 # Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n9 \n10 import datetime\n11 import in(...truncated)"
      }
    ],
    "CommitSHA": "474294fac83b029f13dd61b8d2822923045d4504"
  },
  "Score": {
    "Difficulty": "Simple",
    "issue_origin": {
      "Title": 6,
      "Description": 7,
      "Reproducibility": 7,
      "Relevance": 7,
      "Explanation": 6,
      "Overall": 7
    },
    "issue_message": {
      "Title": 7,
      "Description": 6,
      "Reproducibility": 6,
      "Relevance": 7,
      "Explanation": 6,
      "Overall": 6
    },
    "issue_ground": {
      "Title": 6,
      "Description": 7,
      "Reproducibility": 6,
      "Relevance": 7,
      "Explanation": 5,
      "Overall": 6
    },
    "issue_ground_truth": {
      "title": "Cache Clearing Functionality Not Working as Expected",
      "description": "It has been observed that the `clear_cache` function in the `cachier` library does not perform its expected behavior of clearing the in-memory cache. This issue could lead to stale or outdated data being served from the cache, as the clear_cache function currently does nothing (`pass` statement). This can cause significant problems in scenarios where the cache needs to be invalidated and refreshed. Users relying on the clear_cache method to manually invalidate cache entries are currently unable to do so, resulting in potential data inconsistency. The function should be updated to properly clear the cache.",
      "explanation": "### Issue Summary\n\nThe issue at hand pertains to the `clear_cache` function within the `cachier` library. This function is intended to clear in-memory caches to prevent the use of stale or outdated data. However, the current implementation of `clear_cache` is non-functional as it contains merely a `pass` statement, which essentially means no operation is performed. This malfunction can lead to scenarios where users are unable to invalidate and refresh cache entries manually, resulting in potential data inconsistencies.\n\n### Detailed Explanation of the Issue\n\n1. **Cause of the Issue**:\n    - The `clear_cache` function is designed to ensure that the cache can be reset or cleared, allowing for the relevant data to be recalculated and updated. However, in the current implementation, the function does nothing because it contains just a `pass` statement.\n    - This issue becomes critical because if data within the cache becomes outdated or invalid, there's no mechanism to clear the cache, which means that stale data will continue to be served to users and applications relying on this cache.\n\n2. **Effects**:\n    - **Data Consistency Problems**: Users and applications may receive stale data from the cache, leading to inconsistencies and potential errors in data processing or user experience.\n    - **Manual Invalidations**: Users who attempt to clear the cache manually via `clear_cache` will find that their requests to invalidate the cache have no effect, leading to frustration and potential workaround hacks that may not be optimal.\n\n### Detailed Explanation of the Commit\n\n1. **Commit Changes**:\n    - The developer updated the `clear_cache` method within the `cachier/cores/memory.py` file.\n    - Instead of the method containing a `pass` statement, it now includes functionality to actually clear the cache by invoking the `clear` method on the cache dictionary.\n\n2. **How the Commit Solves the Issue**:\n    - **Implementing Cache Clearing**: By replacing the `pass` statement with a call to clear the cache (`self.cache.clear()`), the commit ensures that invoking `clear_cache` will now effectively reset the in-memory cache.\n    - **Enabling Manual Cache Invalidations**: Users can now call `clear_cache` to invalidate the cache entries. This is crucial for scenarios where the cache data needs to be purged to prevent serving outdated information.\n    - **Improving Data Reliability**: With the ability to clear the cache, the application can maintain more reliable and up-to-date data, thereby improving overall data consistency and reliability.\n\n### Solution Explanation\n\nThe solution to the issue was to modify the `clear_cache` function within the memory core of the `cachier` library to perform its intended function, which is to clear the in-memory cache. The developer did this by replacing the no-op `pass` statement with an actual call to clear the cache.\n\n- **Summary of the Change**: The `clear_cache` function now properly clears the in-memory cache.\n- **Effectiveness of the Change**:\n    - This ensures that the cache can be invalidated as needed.\n    - Users and applications relying on this functionality can now ensure they are not using outdated data.\n    - It enhances the flexibility and reliability of cache management within the `cachier` framework.\n\nBy implementing this change, the developer resolved the issue of non-functional cache clearing, allowing for more robust cache management and ensuring that the caching behavior aligns with user expectations and requirements for data consistency."
    }
  }
}