{
  "RepoName": "python-cachier/cachier.git",
  "CommitSHA": "474294fac83b029f13dd61b8d2822923045d4504",
  "Type": "logic error",
  "ErrorMessage": null,
  "Issue": {
    "title": "MongoDB Cache Clearing Functionality Not Implemented",
    "description": "The `clear_cache` method in the MongoDB core (`mongo.py`) is not implemented. This method is crucial for maintaining the integrity and manageability of the MongoDB cache by allowing users to clear the cache when necessary. The absence of this functionality means users cannot reset or clear the MongoDB cache, which can lead to stale or outdated data persisting in the cache. This could affect applications relying on fresh data, causing inconsistencies and potential issues in data processing and retrieval. Implementing this method is essential to provide users with a way to manage their MongoDB cache effectively.",
    "explanation": "### Summary of the Issue\nThe issue at hand is with the `cachier` library's `clear_cache` function for the MongoDB backend, located in the `mongo.py` file. This function was not implemented, leaving users without a way to clear the MongoDB cache when necessary. The absence of this function can lead to stale or outdated data persistence in the cache, which is detrimental to applications that rely on real-time or frequently updated data.\n\n### Detailed Content of the Commit\nThe commit implements the `clear_cache` method by adding a functional code snippet to the previously unimplemented method. This method now allows it to delete entries in the MongoDB cache that match a specific function, identified by a string representation `_func_str`.\n\n### Explanation of the Solution and Its Implementation\n1. **Cause of the Issue:**\n   - The `clear_cache` method was defined but not implemented in the `mongo.py` file for the MongoDB backend of the `cachier` library.\n   - This omission meant users could not clear the cache and could inadvertently use outdated or incorrect data, causing inconsistencies in applications requiring fresh data for correct operation.\n\n2. **Content of the Commit:**\n   - The `clear_cache` method was edited to include a function that deletes multiple entries from the MongoDB collection. These entries are identified by a `filter` that matches the function string representation, referred to as `_func_str`.\n   - This mechanism leverages MongoDB’s `delete_many` function to remove all entries tied to the particular function.\n\n3. **How the Commit Solves the Issue:**\n   - **Functionality Implementation:** By adding the implementation to `clear_cache`, the method now actively performs the task of removing cached data from the MongoDB collection, ensuring that users can programmatically clear cache as needed.\n   - **Data Integrity:** Implementing this method addresses the core need for cache management, allowing users to ensure they are not exposed to stale data, thus maintaining the integrity and reliability of the data within their applications.\n   - **User Control:** It provides users with essential control over the caching mechanism, ensuring applications can force cache invalidation and refresh cached data promptly.\n\n### Solution to the Issue\nThe solution involves implementing the `clear_cache` method within the MongoDB core to perform a bulk deletion of cached entries based on a specific function identifier. This ensures that:\n- Users can clear outdated or stale cache entries.\n- Applications that rely on real-time or frequently changing data are not adversely affected by stale data.\n- The caching mechanism remains robust and flexible for different use cases.\n\nIn summary, the commit effectively addresses the absence of the `clear_cache` functionality, ensuring that cache management for MongoDB in the `cachier` library is complete and reliable."
  },
  "Explain": "### Summary of the Issue\nThe issue at hand is with the `cachier` library's `clear_cache` function for the MongoDB backend, located in the `mongo.py` file. This function was not implemented, leaving users without a way to clear the MongoDB cache when necessary. The absence of this function can lead to stale or outdated data persistence in the cache, which is detrimental to applications that rely on real-time or frequently updated data.\n\n### Detailed Content of the Commit\nThe commit implements the `clear_cache` method by adding a functional code snippet to the previously unimplemented method. This method now allows it to delete entries in the MongoDB cache that match a specific function, identified by a string representation `_func_str`.\n\n### Explanation of the Solution and Its Implementation\n1. **Cause of the Issue:**\n   - The `clear_cache` method was defined but not implemented in the `mongo.py` file for the MongoDB backend of the `cachier` library.\n   - This omission meant users could not clear the cache and could inadvertently use outdated or incorrect data, causing inconsistencies in applications requiring fresh data for correct operation.\n\n2. **Content of the Commit:**\n   - The `clear_cache` method was edited to include a function that deletes multiple entries from the MongoDB collection. These entries are identified by a `filter` that matches the function string representation, referred to as `_func_str`.\n   - This mechanism leverages MongoDB’s `delete_many` function to remove all entries tied to the particular function.\n\n3. **How the Commit Solves the Issue:**\n   - **Functionality Implementation:** By adding the implementation to `clear_cache`, the method now actively performs the task of removing cached data from the MongoDB collection, ensuring that users can programmatically clear cache as needed.\n   - **Data Integrity:** Implementing this method addresses the core need for cache management, allowing users to ensure they are not exposed to stale data, thus maintaining the integrity and reliability of the data within their applications.\n   - **User Control:** It provides users with essential control over the caching mechanism, ensuring applications can force cache invalidation and refresh cached data promptly.\n\n### Solution to the Issue\nThe solution involves implementing the `clear_cache` method within the MongoDB core to perform a bulk deletion of cached entries based on a specific function identifier. This ensures that:\n- Users can clear outdated or stale cache entries.\n- Applications that rely on real-time or frequently changing data are not adversely affected by stale data.\n- The caching mechanism remains robust and flexible for different use cases.\n\nIn summary, the commit effectively addresses the absence of the `clear_cache` functionality, ensuring that cache management for MongoDB in the `cachier` library is complete and reliable.",
  "Time": "2024-06-02",
  "Difficulty": "Simple",
  "OriginCode": [
    {
      "path": "python-cachier_cachier/MANIFEST.in",
      "content": "# Manifest syntax https://docs.python.org/2/distutils/sourcedist.html\ngraft wheelhouse\n\nrecursive-include src *.info\ninclude README.rst\ninclude LICENSE\n\nprune __pycache__\nprune tests\n"
    },
    {
      "path": "python-cachier_cachier/README.rst",
      "content": "Cachier\n#######\n\n|PyPI-Status| |Downloads| |PyPI-Versions| |Build-Status| |Codecov| |Codefactor| |LICENCE|\n\nPersistent, stale-free, local and cross-machine caching for Python functions.\n\n.. code-block:: python\n\n  from cachier import cachier\n  import datetime\n\n  @cachier(stale_after=datetime.timedelta(days=3))\n  def foo(arg1, arg2):\n    \"\"\"foo now has a persistent cache, triggering recalculation for values stored more than 3 days.\"\"\"\n    return {'arg1': arg1, 'arg2': arg2}\n\n\n.. role:: python(code)\n  :language: python\n\n.. contents::\n\n.. section-numbering:\n\n\n\nInstallation\n============\n\nInstall ``cachier`` with:\n\n.. code-block:: python\n\n    pip install cachier\n\nFor the latest version supporting Python 2.7 please use:\n\n.. code-block:: python\n\n    pip install 'cachier==1.2.8'\n\nFeatures\n========\n\n* Pure Python.\n* Compatible with Python 3.8+ (Python 2.7 was discontinued in version 1.2.8).\n* Supported and `tested on Linux, OS X and Windows <https://travis-ci.org/shaypal5/cachier>`_.\n* A simple interface.\n* Defining \"shelf life\" for cached values.\n* Local caching using pickle files.\n* Cross-machine caching using MongoDB.\n* Thread-safety.\n\nCachier is **NOT**:\n\n* Meant as a transient cache. Python's @lru_cache is better.\n* Especially fast. It is meant to replace function calls that take more than... a second, say (overhead is around 1 millisecond).\n\nFuture features\n---------------\n\n* S3 core.\n* Multi-core caching.\n* `Cache replacement policies <https://en.wikipedia.org/wiki/Cache_replacement_policies>`_\n\n\nUse\n===\n\nCachier provides a decorator which you can wrap around your functions to give them a persistent cache. The positional and keyword arguments to the wrapped function must be hashable (i.e. Python's immutable built-in objects, not mutable containers). Also, notice that since objects which are instances of user-defined classes are hashable but all compare unequal (their hash value is their id), equal objects across different sessions will not yield identical keys.\n\nSetting up a Cache\n------------------\nYou can add a default, pickle-based, persistent cache to your function - meaning it will last across different Python kernels calling the wrapped function - by decorating it with the ``cachier`` decorator (notice the ``()``!).\n\n.. code-block:: python\n\n  from cachier import cachier\n\n  @cachier()\n  def foo(arg1, arg2):\n    \"\"\"Your function now has a persistent cache mapped by argument values!\"\"\"\n    return {'arg1': arg1, 'arg2': arg2}\n\nClass and object methods can also be cached. Cachier will automatically ignore the `self` parameter when determining the cache key for an object method. **This means that methods will be cached across all instances of an object, which may not be what you want.**\n\n.. code-block:: python\n\n  from cachier import cachier\n\n  class Foo():\n    @staticmethod\n    @cachier()\n    def good_static_usage(arg_1, arg_2):\n      return arg_1 + arg_2\n\n    # Instance method does not depend on object's internal state, so good to cache\n    @cachier()\n    def good_usage_1(self, arg_1, arg_2):\n      return arg_1 + arg_2\n\n    # Instance method is calling external service, probably okay to cache\n    @cachier()\n    def good_usage_2(self, arg_1, arg_2):\n      result = self.call_api(arg_1, arg_2)\n      return result\n\n    # Instance method relies on object attribute, NOT good to cache\n    @cachier()\n    def bad_usage(self, arg_1, arg_2):\n      return arg_1 + arg_2 + self.arg_3\n\n\nResetting a Cache\n-----------------\nThe Cachier wrapper adds a ``clear_cache()`` function to each wrapped function. To reset the cache of the wrapped function simply call this method:\n\n.. code-block:: python\n\n  foo.clear_cache()\n\nGeneral Configuration\n----------------------\n\nGlobal Defaults\n~~~~~~~~~~~~~~~\n\nSettings can be globally configured across all Cachier wrappers through the use of the `set_default_params` function. This function takes the same keyword parameters as the ones defined in the decorator, which can be passed all at once or with multiple calls. Parameters given directly to a decorator take precedence over any values set by this function.\n\nThe following parameters will only be applied to decorators defined after `set_default_params` is called:\n\n*  `hash_func`\n*  `backend`\n*  `mongetter`\n*  `cache_dir`\n*  `pickle_reload`\n*  `separate_files`\n\nThese parameters can be changed at any time and they will apply to all decorators:\n\n*  `allow_none`\n*  `caching_enabled`\n*  `stale_after`\n*  `next_time`\n*  `wait_for_calc_timeout`\n\nThe current defaults can be fetched by calling `get_default_params`.\n\nThreads Limit\n~~~~~~~~~~~~~\n\nTo limit the number of threads Cachier is allowed to spawn, set the ``CACHIER_MAX_WORKERS`` with the desired number. The default is 8, so to enable Cachier to spawn even more threads, you'll have to set a higher limit explicitly.\n\n\nGlobal Enable/Disable\n---------------------\n\nCaching can be turned off across all decorators by calling `disable_caching`, and then re-activated by calling `enable_caching`.\n\nThese functions are convenience wrappers around the `caching_enabled` default setting.\n\n\nCache Shelf Life\n----------------\n\nSetting Shelf Life\n~~~~~~~~~~~~~~~~~~\nYou can set any duration as the shelf life of cached return values of a function by providing a corresponding ``timedelta`` object to the ``stale_after`` parameter:\n\n.. code-block:: python\n\n  import datetime\n\n  @cachier(stale_after=datetime.timedelta(weeks=2))\n  def bar(arg1, arg2):\n    return {'arg1': arg1, 'arg2': arg2}\n\nNow when a cached value matching the given arguments is found the time of its calculation is checked; if more than ``stale_after`` time has since passed, the function will be run again for the same arguments and the new value will be cached and returned.\n\nThis is useful for lengthy calculations that depend on a dynamic data source.\n\nFuzzy Shelf Life\n~~~~~~~~~~~~~~~~\nSometimes you may want your function to trigger a calculation when it encounters a stale result, but still not wait on it if it's not that critical. In that case, you can set ``next_time`` to ``True`` to have your function trigger a recalculation **in a separate thread**, but return the currently cached stale value:\n\n.. code-block:: python\n\n  @cachier(next_time=True)\n\nFurther function calls made while the calculation is being performed will not trigger redundant calculations.\n\n\nWorking with unhashable arguments\n---------------------------------\n\nAs mentioned above, the positional and keyword arguments to the wrapped function must be hashable (i.e. Python's immutable built-in objects, not mutable containers). To get around this limitation the ``hash_func`` parameter of the ``cachier`` decorator can be provided with a callable that gets the args and kwargs from the decorated function and returns a hash key for them.\n\n.. code-block:: python\n\n  def calculate_hash(args, kwds):\n    key = ...  # compute a hash key here based on arguments\n    return key\n\n  @cachier(hash_func=calculate_hash)\n  def calculate_super_complex_stuff(custom_obj):\n    # amazing code goes here\n\nSee here for an example:\n\n`Question: How to work with unhashable arguments <https://github.com/python-cachier/cachier/issues/91>`_\n\n\nPrecaching values\n---------------------------------\n\nIf you want to load a value into the cache without calling the underlying function, this can be done with the `precache_value` function.\n\n.. code-block:: python\n\n  @cachier()\n  def add(arg1, arg2):\n    return arg1 + arg2\n\n  add.precache_value(2, 2, value_to_cache=5)\n\n  result = add(2, 2)\n  print(result)  # prints 5\n\n\nPer-function call arguments\n---------------------------\n\nCachier also accepts several keyword arguments in the calls of the function it wraps rather than in the decorator call, allowing you to modify its behaviour for a specific function call.\n\nIgnore Cache\n~~~~~~~~~~~~\n\nYou can have ``cachier`` ignore any existing cache for a specific function call by passing ``cachier__skip_cache=True`` to the function call. The cache will neither be checked nor updated with the new return value.\n\n.. code-block:: python\n\n  @cachier()\n  def sum(first_num, second_num):\n    return first_num + second_num\n\n  def main():\n    print(sum(5, 3, cachier__skip_cache=True))\n\nOverwrite Cache\n~~~~~~~~~~~~~~~\n\nYou can have ``cachier`` overwrite an existing cache entry - if one exists - for a specific function call by passing ``cachier__overwrite_cache=True`` to the function call. The cache will not be checked but will be updated with the new return value.\n\nVerbose Cache Call\n~~~~~~~~~~~~~~~~~~\n\nYou can have ``cachier`` print out a detailed explanation of the logic of a specific call by passing ``cachier__verbose=True`` to the function call. This can be useful if you are not sure why a certain function result is, or is not, returned.\n\nCache `None` Values\n~~~~~~~~~~~~~~~~~~~\n\nBy default, ``cachier`` does not cache ``None`` values. You can override this behaviour by passing ``allow_none=True`` to the function call.\n\n\nCachier Cores\n=============\n\nPickle Core\n-----------\n\nThe default core for Cachier is pickle based, meaning each function will store its cache is a separate pickle file in the ``~/.cachier`` directory. Naturally, this kind of cache is both machine-specific and user-specific.\n\nYou can configure ``cachier`` to use another directory by providing the ``cache_dir`` parameter with the path to that directory:\n\n.. code-block:: python\n\n  @cachier(cache_dir='~/.temp/.cache')\n\n\nYou can slightly optimise pickle-based caching if you know your code will only be used in a single thread environment by setting:\n\n.. code-block:: python\n\n  @cachier(pickle_reload=False)\n\nThis will prevent reading the cache file on each cache read, speeding things up a bit, while also nullifying inter-thread functionality (the code is still thread safe, but different threads will have different versions of the cache at times, and will sometime make unnecessary function calls).\n\nSetting the optional argument ``separate_files`` to ``True`` will cause the cache to be stored in several files: A file per argument set, per function. This can help if your per-function cache files become too large.\n\n.. code-block:: python\n\n  from cachier import cachier\n\n  @cachier(separate_files=True)\n  def foo(arg1, arg2):\n    \"\"\"Your function now has a persistent cache mapped by argument values, split across several files, per argument set\"\"\"\n    return {'arg1': arg1, 'arg2': arg2}\n\nYou can get the fully qualified path to the directory of cache files used by ``cachier`` (``~/.cachier`` by default) by calling the ``cache_dpath()`` function:\n\n.. code-block:: python\n\n  >>> foo.cache_dpath()\n      \"/home/user/.cachier/\"\n\n\nMongoDB Core\n------------\nYou can set a MongoDB-based cache by assigning ``mongetter`` with a callable that returns a ``pymongo.Collection`` object with writing permissions:\n\n.. code-block:: python\n\n    from pymongo import MongoClient\n\n    def my_mongetter():\n        client = MongoClient(get_cachier_db_auth_uri())\n        db_obj = client['cachier_db']\n        if 'someapp_cachier_db' not in db_obj.list_collection_names():\n            db_obj.create_collection('someapp_cachier_db')\n        return db_obj['someapp_cachier_db']\n\n  @cachier(mongetter=my_mongetter)\n\nThis allows you to have a cross-machine, albeit slower, cache. This functionality requires that the installation of the ``pymongo`` python package.\n\nIn certain cases the MongoDB backend might leave a deadlock behind, blocking all subsequent requests from being processed. If you encounter this issue, supply the ``wait_for_calc_timeout`` with a reasonable number of seconds; calls will then wait at most this number of seconds before triggering a recalculation.\n\n.. code-block:: python\n\n  @cachier(mongetter=False, wait_for_calc_timeout=2)\n\n\nMemory Core\n-----------\n\nYou can set an in-memory cache by assigning the ``backend`` parameter with ``'memory'``:\n\n.. code-block:: python\n\n  @cachier(backend='memory')\n\nNote, however, that ``cachier``'s in-memory core is simple, and has no monitoring or cap on cache size, and can thus lead to memory errors on large return values - it is mainly intended to be used with future multi-core functionality. As a rule, Python's built-in ``lru_cache`` is a much better stand-alone solution.\n\n\nContributing\n============\n\nCurrent maintainers are Shay Palachy Affek (`shay.palachy@gmail.com <mailto:shay.palachy@gmail.com>`_, `@shaypal5 <https://github.com/shaypal5>`_) and Judson Neer (`@lordjabez <https://github.com/lordjabez>`_); You are more than welcome to approach them for help. Contributions are very welcomed! :)\n\nInstalling for development\n--------------------------\n\nClone:\n\n.. code-block:: bash\n\n  git clone git@github.com:python-cachier/cachier.git\n\n\nInstall in development mode with test dependencies:\n\n.. code-block:: bash\n\n  cd cachier\n  pip install -e . -r tests/requirements.txt\n\n\nRunning the tests\n-----------------\n\nTo run the tests, call the ``pytest`` command in the repository's root, or:\n\n.. code-block:: bash\n\n  python -m pytest\n\nTo run only MongoDB core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m mongo\n\nTo run only memory core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m memory\n\nTo run all tests EXCEPT MongoDB core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m \"not mongo\"\n\n\nTo run all tests EXCEPT memory core AND MongoDB core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m \"not (mongo or memory)\"\n\n\nRunning MongoDB tests against a live MongoDB instance\n-----------------------------------------------------\n\n**Note to developers:** By default, all MongoDB tests are run against a mocked MongoDB instance, provided by the ``pymongo_inmemory`` package. To run them against a live MongoDB instance, the ``CACHIER_TEST_VS_DOCKERIZED_MONGO`` environment variable is set to ``True`` in the ``test`` environment of this repository (and additional environment variables are populated with the appropriate credentials), used by the GitHub Action running tests on every commit and pull request.\n\nContributors are not expected to run these tests against a live MongoDB instance when developing, as credentials for the testing instance used will NOT be shared, but rather use the testing against the in-memory MongoDB instance as a good proxy.\n\n**HOWEVER, the tests run against a live MongoDB instance when you submit a PR are the determining tests for deciding whether your code functions correctly against MongoDB.**\n\n\nAdding documentation\n--------------------\n\nThis project is documented using the `numpy docstring conventions`_, which were chosen as they are perhaps the most widely-spread conventions that are both supported by common tools such as Sphinx and result in human-readable docstrings (in my personal opinion, of course). When documenting code you add to this project, please follow `these conventions`_.\n\n.. _`numpy docstring conventions`: https://github.com/numpy/numpy/blob/master/doc/HOWTO_DOCUMENT.rst.txt\n.. _`these conventions`: https://github.com/numpy/numpy/blob/master/doc/HOWTO_DOCUMENT.rst.txt\n\nAdditionally, if you update this ``README.rst`` file, use ``python setup.py checkdocs`` to validate it compiles.\n\n\nCredits\n=======\n\nCreated by `Shay Palachy Affek <https://github.com/shaypal5>`_ (shay.palachy@gmail.com), which currently assists in maintenance.\n\nCurrent lead developer/contributor: `Jirka Borovec <https://github.com/Borda>`_ (`@Borda <https://github.com/Borda>`_ on GitHub).\n\nOther major contributors:\n\n* `Jirka Borovec <https://github.com/Borda>`_ - Arg order independence, args-to-kwargs for less unique keys and numerous development and CI contributions.\n\n* `Judson Neer <https://github.com/lordjabez>`_ - Precaching, method caching support and numerous improvements and bugfixes.\n\n* `cthoyt <https://github.com/cthoyt>`_ - Base memory core implementation.\n\n* `amarczew <https://github.com/amarczew>`_ - The ``hash_func`` kwarg.\n\n* `non-senses <https://github.com/non-senses>`_ - The ``wait_for_calc_timeout`` kwarg.\n\n* `Elad Rapaport <https://github.com/erap129>`_ - Multi-file Pickle core, a.k.a ``separate_files`` (released on ``v1.5.3``).\n\n* `John Didion <https://github.com/jdidion>`_ - Support for pickle-based caching for cases where two identically-named methods of different classes are defined in the same module.\n\nNotable bugfixers:\n\n* `MichaelRazum <https://github.com/MichaelRazum>`_.\n\n* `Eric Ma <https://github.com/ericmjl>`_ - The iNotify bugfix (released on ``v1.5.3``).\n\n* `Ofir <https://github.com/ofirnk>`_ - The iNotify bugfix (released on ``v1.5.3``).\n\n\n\n.. |PyPI-Status| image:: https://img.shields.io/pypi/v/cachier.svg\n  :target: https://pypi.python.org/pypi/cachier\n\n.. |PyPI-Versions| image:: https://img.shields.io/pypi/pyversions/cachier.svg\n   :target: https://pypi.python.org/pypi/cachier\n\n.. |Build-Status| image:: https://github.com/python-cachier/cachier/actions/workflows/test.yml/badge.svg\n  :target: https://github.com/python-cachier/cachier/actions/workflows/test.yml\n\n.. |LICENCE| image:: https://img.shields.io/pypi/l/cachier.svg\n  :target: https://pypi.python.org/pypi/cachier\n\n.. |Codecov| image:: https://codecov.io/github/python-cachier/cachier/coverage.svg?branch=master\n   :target: https://codecov.io/github/python-cachier/cachier?branch=master\n\n.. |Downloads| image:: https://pepy.tech/badge/cachier\n     :target: https://pepy.tech/project/cachier\n     :alt: PePy stats\n\n.. |Codefactor| image:: https://www.codefactor.io/repository/github/python-cachier/cachier/badge?style=plastic\n     :target: https://www.codefactor.io/repository/github/python-cachier/cachier\n     :alt: Codefactor code quality\n\n.. links:\n.. _pymongo: https://api.mongodb.com/python/current/\n.. _watchdog: https://github.com/gorakhargosh/watchdog\n"
    },
    {
      "path": "python-cachier_cachier/LICENSE",
      "content": "MIT License\n\nCopyright (c) 2016 Shay Palachy\n\nPermission is hereby granted, free of charge, to any person obtaining a copy\nof this software and associated documentation files (the \"Software\"), to deal\nin the Software without restriction, including without limitation the rights\nto use, copy, modify, merge, publish, distribute, sublicense, and/or sell\ncopies of the Software, and to permit persons to whom the Software is\nfurnished to do so, subject to the following conditions:\n\nThe above copyright notice and this permission notice shall be included in all\ncopies or substantial portions of the Software.\n\nTHE SOFTWARE IS PROVIDED \"AS IS\", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR\nIMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,\nFITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE\nAUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER\nLIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,\nOUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE\nSOFTWARE.\n"
    },
    {
      "path": "python-cachier_cachier/pyproject.toml",
      "content": "[project]\nname = \"cachier\"\ndescription = \"Persistent, stale-free, local and cross-machine caching for Python functions.\"\nreadme = \"README.rst\"\nkeywords = [\n  \"cache\",\n  \"caching\",\n  \"cross-machine\",\n  \"decorator\",\n  \"local\",\n  \"memoization\",\n  \"mongo\",\n  \"persistent\",\n]\nlicense = {file = \"LICENSE\"}\nauthors = [\n   {name = \"Shay Palachy Affek\", email = 'shay.palachy@gmail.com'},\n]\nclassifiers = [\n  'Development Status :: 4 - Beta',\n  'Intended Audience :: Developers',\n  'License :: OSI Approved :: MIT License',\n  'Programming Language :: Python',\n  \"Programming Language :: Python :: 3 :: Only\",\n  \"Programming Language :: Python :: 3.8\",\n  \"Programming Language :: Python :: 3.9\",\n  \"Programming Language :: Python :: 3.10\",\n  \"Programming Language :: Python :: 3.11\",\n  \"Programming Language :: Python :: 3.12\",\n  \"Topic :: Other/Nonlisted Topic\",\n  \"Topic :: Software Development :: Libraries\",\n  \"Topic :: Software Development :: Libraries :: Python Modules\",\n  \"Topic :: Utilities\",\n]\ndynamic = [\n  \"version\",\n]\ndependencies = [\n  \"portalocker>=2.3.2\",\n  \"watchdog>=2.3.1\",\n]\n[project.urls]\nSource = \"https://github.com/python-cachier/cachier\"\n[project.scripts]\ncachier = \"cachier.__naim__:cli\"\n\n[tool.setuptools]\ninclude-package-data = true\n\n[tool.setuptools.dynamic]\nversion = {attr = \"cachier._version.__version__\"}\n\n[tool.setuptools.packages.find]\nwhere = [\"src\"]  # list of folders that contain the packages ([\".\"] by default)\ninclude = [\"cachier*\"]  # package names should match these glob patterns ([\"*\"] by default)\nnamespaces = false  # to disable scanning PEP 420 namespaces (true by default)\n\n[build-system]\nrequires = [\n  \"setuptools\",\n  \"wheel\",\n]\n\n[tool.pytest.ini_options]\ntestpaths = [\"cachier\", \"tests\"]\nnorecursedirs=[\"dist\", \"build\"]\naddopts = [\n    \"--color=yes\",\n    \"--cov=cachier\",\n    \"--cov-report=term\",\n    \"--cov-report=xml:cov.xml\",\n    \"-r a\",\n    \"-v\",\n    \"-s\",\n]\nmarkers = [\n    \"mongo: test the MongoDB core\",\n    \"memory: test the memory core\",\n    \"pickle: test the pickle core\"\n]\n\n[tool.docformatter]\nrecursive = true\n# some docstring start with r\"\"\"\nwrap-summaries = 79\nwrap-descriptions = 79\nblank = true\n\n[tool.ruff]\ntarget-version = \"py38\"\nline-length = 79\n# Enable Pyflakes `E` and `F` codes by default.\nlint.select = [\n    \"E\",\n    \"W\", # see: https://pypi.org/project/pycodestyle\n    \"F\", # see: https://pypi.org/project/pyflakes\n    \"I\", #see: https://pypi.org/project/isort/\n    \"D\", # see: https://pypi.org/project/pydocstyle\n    #\"N\", # see: https://pypi.org/project/pep8-naming\n    \"S\", # see: https://pypi.org/project/flake8-bandit\n    \"SIM\",\n    \"RUF100\" # alternative to yesqa\n]\nlint.extend-select = [\n    \"A\",    # see: https://pypi.org/project/flake8-builtins\n    \"B\",    # see: https://pypi.org/project/flake8-bugbear\n    \"C4\",   # see: https://pypi.org/project/flake8-comprehensions\n]\nlint.ignore = [\n    \"E203\",\n    \"C901\",\n]\n# Exclude a variety of commonly ignored directories.\nexclude = [\n    \".eggs\",\n    \".git\",\n    \".ruff_cache\",\n    \"__pypackages__\",\n    \"_build\",\n    \"build\",\n    \"dist\",\n]\nlint.ignore-init-module-imports = true\nlint.unfixable = [\"F401\"]\n\n[tool.ruff.lint.per-file-ignores]\n\"src/**/__init__.py\" = [\"D104\"]\n\"src/cachier/config.py\" = [\"D100\"]\n\"tests/**\" = [\n  \"S101\", \"S311\", \"S105\", \"S603\",\n  \"D100\", \"D101\", \"D103\", \"D104\", \"D401\"\n]\n\n#[tool.ruff.pydocstyle]\n## Use Google-style docstrings.\n#convention = \"google\"\n\n#[tool.ruff.pycodestyle]\n#ignore-overlong-task-comments = true\n\n[tool.ruff.lint.mccabe]\n# Unlike Flake8, default to a complexity level of 10.\nmax-complexity = 10\n\n[tool.coverage.run]\nbranch = true\ndynamic_context = \"test_function\"\nomit = [\n    \"tests/*\",\n    \"cachier/_version.py\",\n    \"cachier/__init__.py\",\n    \"**/scripts/**\",\n]\n[tool.coverage.report]\nshow_missing = true\n# Regexes for lines to exclude from consideration\nexclude_lines = [\n    \"pragma: no cover\", # Have to re-enable the standard pragma\n    \"raise NotImplementedError\", # Don't complain if tests don't hit defensive assertion code:\n    \"if TYPE_CHECKING:\", # Is only true when running mypy, not tests\n]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/__init__.py",
      "content": "from ._version import *  # noqa: F403\nfrom .config import (\n    disable_caching,\n    enable_caching,\n    get_default_params,\n    set_default_params,\n)\nfrom .core import cachier\n\n__all__ = [\n    \"cachier\",\n    \"set_default_params\",\n    \"get_default_params\",\n    \"enable_caching\",\n    \"disable_caching\",\n]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/config.py",
      "content": "import datetime\nimport hashlib\nimport os\nimport pickle\nfrom typing import Optional, TypedDict, Union\n\nfrom ._types import Backend, HashFunc, Mongetter\n\n\ndef _default_hash_func(args, kwds):\n    # Sort the kwargs to ensure consistent ordering\n    sorted_kwargs = sorted(kwds.items())\n    # Serialize args and sorted_kwargs using pickle or similar\n    serialized = pickle.dumps((args, sorted_kwargs))\n    # Create a hash of the serialized data\n    return hashlib.sha256(serialized).hexdigest()\n\n\nclass Params(TypedDict):\n    \"\"\"Type definition for cachier parameters.\"\"\"\n\n    caching_enabled: bool\n    hash_func: HashFunc\n    backend: Backend\n    mongetter: Optional[Mongetter]\n    stale_after: datetime.timedelta\n    next_time: bool\n    cache_dir: Union[str, os.PathLike]\n    pickle_reload: bool\n    separate_files: bool\n    wait_for_calc_timeout: int\n    allow_none: bool\n\n\n_default_params: Params = {\n    \"caching_enabled\": True,\n    \"hash_func\": _default_hash_func,\n    \"backend\": \"pickle\",\n    \"mongetter\": None,\n    \"stale_after\": datetime.timedelta.max,\n    \"next_time\": False,\n    \"cache_dir\": \"~/.cachier/\",\n    \"pickle_reload\": True,\n    \"separate_files\": False,\n    \"wait_for_calc_timeout\": 0,\n    \"allow_none\": False,\n}\n\n\ndef _update_with_defaults(\n    param, name: str, func_kwargs: Optional[dict] = None\n):\n    import cachier\n\n    if func_kwargs:\n        kw_name = f\"cachier__{name}\"\n        if kw_name in func_kwargs:\n            return func_kwargs.pop(kw_name)\n    if param is None:\n        return cachier.config._default_params[name]\n    return param\n\n\ndef set_default_params(**params):\n    \"\"\"Configure global parameters applicable to all memoized functions.\n\n    This function takes the same keyword parameters as the ones defined in the\n    decorator, which can be passed all at once or with multiple calls.\n    Parameters given directly to a decorator take precedence over any values\n    set by this function.\n\n    Only 'stale_after', 'next_time', and 'wait_for_calc_timeout' can be changed\n    after the memoization decorator has been applied. Other parameters will\n    only have an effect on decorators applied after this function is run.\n\n    \"\"\"\n    import cachier\n\n    valid_params = (\n        p for p in params.items() if p[0] in cachier.config._default_params\n    )\n    _default_params.update(valid_params)\n\n\ndef get_default_params():\n    \"\"\"Get current set of default parameters.\"\"\"\n    import cachier\n\n    return cachier.config._default_params\n\n\ndef enable_caching():\n    \"\"\"Enable caching globally.\"\"\"\n    import cachier\n\n    cachier.config._default_params[\"caching_enabled\"] = True\n\n\ndef disable_caching():\n    \"\"\"Disable caching globally.\"\"\"\n    import cachier\n\n    cachier.config._default_params[\"caching_enabled\"] = False\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/_version.py",
      "content": "# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2024, Jirka Borovec <***@gmail.com>\n\nimport os\n\n_PATH_HERE = os.path.dirname(__file__)\n_PATH_VERSION = os.path.join(_PATH_HERE, \"version.info\")\n_RELEASING_PROCESS = os.getenv(\"RELEASING_PROCESS\", \"0\") == \"1\"\n\nwith open(_PATH_VERSION) as fopen:\n    __version__ = fopen.read().strip()\n\n\ndef _get_git_sha() -> str:\n    from subprocess import DEVNULL, check_output\n\n    out = check_output([\"git\", \"rev-parse\", \"--short\", \"HEAD\"], stderr=DEVNULL)  # noqa: S603, S607\n    return out.decode(\"utf-8\").strip()\n\n\nif not _RELEASING_PROCESS:\n    try:\n        sha_short = _get_git_sha()\n        # print(f\"Version enriched with git commit hash: {__version__}.\")\n    except Exception:\n        # print(\"Failed to get the git commit hash,\"\n        #       f\" falling back to base version {__version__}.\")\n        sha_short = \"\"\n    __version__ += f\".dev+{sha_short}\" if sha_short else \".dev\"\n\n\n__all__ = [\"__version__\"]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/version.info",
      "content": "3.0.0\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/py.typed",
      "content": ""
    },
    {
      "path": "python-cachier_cachier/src/cachier/_types.py",
      "content": "from typing import TYPE_CHECKING, Callable, Literal\n\nif TYPE_CHECKING:\n    import pymongo.collection\n\n\nHashFunc = Callable[..., str]\nMongetter = Callable[[], \"pymongo.collection.Collection\"]\nBackend = Literal[\"pickle\", \"mongo\", \"memory\"]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/__main__.py",
      "content": "\"\"\"A command-line interface for cachier.\"\"\"\n\nimport click\n\nfrom cachier.core import _set_max_workers\n\n\n@click.group()\ndef cli():\n    \"\"\"A command-line interface for cachier.\"\"\"  # noqa: D401\n\n\n@cli.command(\"Limits the number of worker threads used by cachier.\")\n@click.argument(\"max_workers\", nargs=1, type=int)\ndef set_max_workers(max_workers):\n    \"\"\"Limits the number of worker threads used by cachier.\"\"\"\n    _set_max_workers(max_workers)\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/core.py",
      "content": "\"\"\"Persistent, stale-free memoization decorators for Python.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\nimport datetime\nimport inspect\nimport os\nimport warnings\nfrom collections import OrderedDict\nfrom concurrent.futures import ThreadPoolExecutor\nfrom functools import wraps\nfrom typing import Optional, Union\nfrom warnings import warn\n\nfrom .config import (\n    Backend,\n    HashFunc,\n    Mongetter,\n    _default_params,\n    _update_with_defaults,\n)\nfrom .cores.base import RecalculationNeeded, _BaseCore\nfrom .cores.memory import _MemoryCore\nfrom .cores.mongo import _MongoCore\nfrom .cores.pickle import _PickleCore\n\nMAX_WORKERS_ENVAR_NAME = \"CACHIER_MAX_WORKERS\"\nDEFAULT_MAX_WORKERS = 8\n\n\ndef _max_workers():\n    return int(os.environ.get(MAX_WORKERS_ENVAR_NAME, DEFAULT_MAX_WORKERS))\n\n\ndef _set_max_workers(max_workers):\n    os.environ[MAX_WORKERS_ENVAR_NAME] = str(max_workers)\n    _get_executor(True)\n\n\ndef _get_executor(reset=False):\n    if reset or not hasattr(_get_executor, \"executor\"):\n        _get_executor.executor = ThreadPoolExecutor(_max_workers())\n    return _get_executor.executor\n\n\ndef _function_thread(core, key, func, args, kwds):\n    try:\n        func_res = func(*args, **kwds)\n        core.set_entry(key, func_res)\n    except BaseException as exc:\n        print(f\"Function call failed with the following exception:\\n{exc}\")\n\n\ndef _calc_entry(core, key, func, args, kwds):\n    try:\n        core.mark_entry_being_calculated(key)\n        # _get_executor().submit(core.mark_entry_being_calculated, key)\n        func_res = func(*args, **kwds)\n        core.set_entry(key, func_res)\n        # _get_executor().submit(core.set_entry, key, func_res)\n        return func_res\n    finally:\n        core.mark_entry_not_calculated(key)\n\n\ndef _convert_args_kwargs(\n    func, _is_method: bool, args: tuple, kwds: dict\n) -> dict:\n    \"\"\"Convert mix of positional and keyword arguments to aggregated kwargs.\"\"\"\n    # unwrap if the function is functools.partial\n    if hasattr(func, \"func\"):\n        args = func.args + args\n        kwds.update({k: v for k, v in func.keywords.items() if k not in kwds})\n        func = func.func\n    func_params = list(inspect.signature(func).parameters)\n    args_as_kw = dict(\n        zip(func_params[1:], args[1:])\n        if _is_method\n        else zip(func_params, args)\n    )\n    # init with default values\n    kwargs = {\n        k: v.default\n        for k, v in inspect.signature(func).parameters.items()\n        if v.default is not inspect.Parameter.empty\n    }\n    # merge args expanded as kwargs and the original kwds\n    kwargs.update(dict(**args_as_kw, **kwds))\n    return OrderedDict(sorted(kwargs.items()))\n\n\ndef _pop_kwds_with_deprecation(kwds, name: str, default_value: bool):\n    if name in kwds:\n        warnings.warn(\n            f\"`{name}` is deprecated and will be removed in a future release,\"\n            \" use `cachier__` alternative instead.\",\n            DeprecationWarning,\n            stacklevel=2,\n        )\n    return kwds.pop(name, default_value)\n\n\ndef cachier(\n    hash_func: Optional[HashFunc] = None,\n    hash_params: Optional[HashFunc] = None,\n    backend: Optional[Backend] = None,\n    mongetter: Optional[Mongetter] = None,\n    stale_after: Optional[datetime.timedelta] = None,\n    next_time: Optional[bool] = None,\n    cache_dir: Optional[Union[str, os.PathLike]] = None,\n    pickle_reload: Optional[bool] = None,\n    separate_files: Optional[bool] = None,\n    wait_for_calc_timeout: Optional[int] = None,\n    allow_none: Optional[bool] = None,\n):\n    \"\"\"Wrap as a persistent, stale-free memoization decorator.\n\n    The positional and keyword arguments to the wrapped function must be\n    hashable (i.e. Python's immutable built-in objects, not mutable\n    containers). Also, notice that since objects which are instances of\n    user-defined classes are hashable but all compare unequal (their hash\n    value is their id), equal objects across different sessions will not yield\n    identical keys.\n\n    Arguments:\n    ---------\n    hash_func : callable, optional\n        A callable that gets the args and kwargs from the decorated function\n        and returns a hash key for them. This parameter can be used to enable\n        the use of cachier with functions that get arguments that are not\n        automatically hashable by Python.\n    hash_params : callable, optional\n    backend : str, optional\n        The name of the backend to use. Valid options currently include\n        'pickle', 'mongo' and 'memory'. If not provided, defaults to\n        'pickle' unless the 'mongetter' argument is passed, in which\n        case the mongo backend is automatically selected.\n    mongetter : callable, optional\n        A callable that takes no arguments and returns a pymongo.Collection\n        object with writing permissions. If unset a local pickle cache is used\n        instead.\n    stale_after : datetime.timedelta, optional\n        The time delta after which a cached result is considered stale. Calls\n        made after the result goes stale will trigger a recalculation of the\n        result, but whether a stale or fresh result will be returned is\n        determined by the optional next_time argument.\n    next_time : bool, optional\n        If set to True, a stale result will be returned when finding one, not\n        waiting for the calculation of the fresh result to return. Defaults to\n        False.\n    cache_dir : str, optional\n        A fully qualified path to a file directory to be used for cache files.\n        The running process must have running permissions to this folder. If\n        not provided, a default directory at `~/.cachier/` is used.\n    pickle_reload : bool, optional\n        If set to True, in-memory cache will be reloaded on each cache read,\n        enabling different threads to share cache. Should be set to False for\n        faster reads in single-thread programs. Defaults to True.\n    separate_files: bool, default False, for Pickle cores only\n        Instead of a single cache file per-function, each function's cache is\n        split between several files, one for each argument set. This can help\n        if you per-function cache files become too large.\n    wait_for_calc_timeout: int, optional, for MongoDB only\n        The maximum time to wait for an ongoing calculation. When a\n        process started to calculate the value setting being_calculated to\n        True, any process trying to read the same entry will wait a maximum of\n        seconds specified in this parameter. 0 means wait forever.\n        Once the timeout expires the calculation will be triggered.\n    allow_none: bool, optional\n        Allows storing None values in the cache. If False, functions returning\n        None will not be cached and are recalculated every call.\n\n    \"\"\"\n    # Check for deprecated parameters\n    if hash_params is not None:\n        message = (\n            \"hash_params will be removed in a future release, \"\n            \"please use hash_func instead\"\n        )\n        warn(message, DeprecationWarning, stacklevel=2)\n        hash_func = hash_params\n    # Update parameters with defaults if input is None\n    backend = _update_with_defaults(backend, \"backend\")\n    mongetter = _update_with_defaults(mongetter, \"mongetter\")\n    # Override the backend parameter if a mongetter is provided.\n    if callable(mongetter):\n        backend = \"mongo\"\n    core: _BaseCore\n    if backend == \"pickle\":\n        core = _PickleCore(\n            hash_func=hash_func,\n            pickle_reload=pickle_reload,\n            cache_dir=cache_dir,\n            separate_files=separate_files,\n            wait_for_calc_timeout=wait_for_calc_timeout,\n        )\n    elif backend == \"mongo\":\n        core = _MongoCore(\n            hash_func=hash_func,\n            mongetter=mongetter,\n            wait_for_calc_timeout=wait_for_calc_timeout,\n        )\n    elif backend == \"memory\":\n        core = _MemoryCore(\n            hash_func=hash_func, wait_for_calc_timeout=wait_for_calc_timeout\n        )\n    else:\n        raise ValueError(\"specified an invalid core: %s\" % backend)\n\n    def _cachier_decorator(func):\n        core.set_func(func)\n\n        @wraps(func)\n        def func_wrapper(*args, **kwds):\n            nonlocal allow_none\n            _allow_none = _update_with_defaults(allow_none, \"allow_none\", kwds)\n            # print('Inside general wrapper for {}.'.format(func.__name__))\n            ignore_cache = _pop_kwds_with_deprecation(\n                kwds, \"ignore_cache\", False\n            )\n            overwrite_cache = _pop_kwds_with_deprecation(\n                kwds, \"overwrite_cache\", False\n            )\n            verbose = _pop_kwds_with_deprecation(kwds, \"verbose_cache\", False)\n            ignore_cache = kwds.pop(\"cachier__skip_cache\", ignore_cache)\n            overwrite_cache = kwds.pop(\n                \"cachier__overwrite_cache\", overwrite_cache\n            )\n            verbose = kwds.pop(\"cachier__verbose\", verbose)\n            _stale_after = _update_with_defaults(\n                stale_after, \"stale_after\", kwds\n            )\n            _next_time = _update_with_defaults(next_time, \"next_time\", kwds)\n            # merge args expanded as kwargs and the original kwds\n            kwargs = _convert_args_kwargs(\n                func, _is_method=core.func_is_method, args=args, kwds=kwds\n            )\n\n            _print = lambda x: None  # noqa: E731\n            if verbose:\n                _print = print\n            if ignore_cache or not _default_params[\"caching_enabled\"]:\n                return func(**kwargs)\n            key, entry = core.get_entry((), kwargs)\n            if overwrite_cache:\n                return _calc_entry(core, key, func, args, kwds)\n            if entry is None:\n                _print(\"No entry found. No current calc. Calling like a boss.\")\n                return _calc_entry(core, key, func, args, kwds)\n            _print(\"Entry found.\")\n            if _allow_none or entry.get(\"value\", None) is not None:\n                _print(\"Cached result found.\")\n                now = datetime.datetime.now()\n                if now - entry[\"time\"] <= _stale_after:\n                    _print(\"And it is fresh!\")\n                    return entry[\"value\"]\n                _print(\"But it is stale... :(\")\n                if entry[\"being_calculated\"]:\n                    if _next_time:\n                        _print(\"Returning stale.\")\n                        return entry[\"value\"]  # return stale val\n                    _print(\"Already calc. Waiting on change.\")\n                    try:\n                        return core.wait_on_entry_calc(key)\n                    except RecalculationNeeded:\n                        return _calc_entry(core, key, func, args, kwds)\n                if _next_time:\n                    _print(\"Async calc and return stale\")\n                    try:\n                        core.mark_entry_being_calculated(key)\n                        _get_executor().submit(\n                            _function_thread, core, key, func, args, kwds\n                        )\n                    finally:\n                        core.mark_entry_not_calculated(key)\n                    return entry[\"value\"]\n                _print(\"Calling decorated function and waiting\")\n                return _calc_entry(core, key, func, args, kwds)\n            if entry[\"being_calculated\"]:\n                _print(\"No value but being calculated. Waiting.\")\n                try:\n                    return core.wait_on_entry_calc(key)\n                except RecalculationNeeded:\n                    return _calc_entry(core, key, func, args, kwds)\n            _print(\"No entry found. No current calc. Calling like a boss.\")\n            return _calc_entry(core, key, func, args, kwds)\n\n        def _clear_cache():\n            \"\"\"Clear the cache.\"\"\"\n            core.clear_cache()\n\n        def _clear_being_calculated():\n            \"\"\"Mark all entries in this cache as not being calculated.\"\"\"\n            core.clear_being_calculated()\n\n        def _cache_dpath():\n            \"\"\"Return the path to the cache dir, if exists; None if not.\"\"\"\n            return getattr(core, \"cache_dir\", None)\n\n        def _precache_value(*args, value_to_cache, **kwds):  # noqa: D417\n            \"\"\"Add an initial value to the cache.\n\n            Arguments:\n            ---------\n            value_to_cache : any\n                entry to be written into the cache\n\n            \"\"\"\n            # merge args expanded as kwargs and the original kwds\n            kwargs = _convert_args_kwargs(\n                func, _is_method=core.func_is_method, args=args, kwds=kwds\n            )\n            return core.precache_value((), kwargs, value_to_cache)\n\n        func_wrapper.clear_cache = _clear_cache\n        func_wrapper.clear_being_calculated = _clear_being_calculated\n        func_wrapper.cache_dpath = _cache_dpath\n        func_wrapper.precache_value = _precache_value\n        return func_wrapper\n\n    return _cachier_decorator\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/__init__.py",
      "content": ""
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/base.py",
      "content": "\"\"\"Defines the interface of a cachier caching core.\"\"\"\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\nimport abc  # for the _BaseCore abstract base class\nimport inspect\nimport threading\nfrom typing import Callable\n\nfrom .._types import HashFunc\nfrom ..config import _update_with_defaults\n\n\nclass RecalculationNeeded(Exception):\n    \"\"\"Exception raised when a recalculation is needed.\"\"\"\n\n    pass\n\n\ndef _get_func_str(func: Callable) -> str:\n    return f\".{func.__module__}.{func.__name__}\"\n\n\nclass _BaseCore:\n    __metaclass__ = abc.ABCMeta\n\n    def __init__(self, hash_func: HashFunc, wait_for_calc_timeout: int):\n        self.hash_func = _update_with_defaults(hash_func, \"hash_func\")\n        self.wait_for_calc_timeout = wait_for_calc_timeout\n        self.lock = threading.RLock()\n\n    def set_func(self, func):\n        \"\"\"Set the function this core will use.\n\n        This has to be set before any method is called. Also determine if the\n        function is an object method.\n\n        \"\"\"\n        # unwrap if the function is functools.partial\n        if hasattr(func, \"func\"):\n            func = func.func\n        func_params = list(inspect.signature(func).parameters)\n        self.func_is_method = func_params and func_params[0] == \"self\"\n        self.func = func\n\n    def get_key(self, args, kwds):\n        \"\"\"Return a unique key based on the arguments provided.\"\"\"\n        return self.hash_func(args, kwds)\n\n    def get_entry(self, args, kwds):\n        \"\"\"Get entry based on given arguments.\n\n        Return the result mapped to the given arguments in this core's cache,\n        if such a mapping exists.\n\n        \"\"\"\n        key = self.get_key(args, kwds)\n        return self.get_entry_by_key(key)\n\n    def precache_value(self, args, kwds, value_to_cache):\n        \"\"\"Write a precomputed value into the cache.\"\"\"\n        key = self.get_key(args, kwds)\n        self.set_entry(key, value_to_cache)\n        return value_to_cache\n\n    def check_calc_timeout(self, time_spent):\n        \"\"\"Raise an exception if a recalculation is needed.\"\"\"\n        calc_timeout = _update_with_defaults(\n            self.wait_for_calc_timeout, \"wait_for_calc_timeout\"\n        )\n        if calc_timeout > 0 and (time_spent >= calc_timeout):\n            raise RecalculationNeeded()\n\n    @abc.abstractmethod\n    def get_entry_by_key(self, key):\n        \"\"\"Get entry based on given key.\n\n        Return the result mapped to the given key in this core's cache, if such\n        a mapping exists.\n\n        \"\"\"\n\n    @abc.abstractmethod\n    def set_entry(self, key, func_res):\n        \"\"\"Map the given result to the given key in this core's cache.\"\"\"\n\n    @abc.abstractmethod\n    def mark_entry_being_calculated(self, key):\n        \"\"\"Mark the entry mapped by the given key as being calculated.\"\"\"\n\n    @abc.abstractmethod\n    def mark_entry_not_calculated(self, key):\n        \"\"\"Mark the entry mapped by the given key as not being calculated.\"\"\"\n\n    @abc.abstractmethod\n    def wait_on_entry_calc(self, key):\n        \"\"\"Wait on the entry with keys being calculated and returns result.\"\"\"\n\n    @abc.abstractmethod\n    def clear_cache(self):\n        \"\"\"Clear the cache of this core.\"\"\"\n\n    @abc.abstractmethod\n    def clear_being_calculated(self):\n        \"\"\"Mark all entries in this cache as not being calculated.\"\"\"\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/pickle.py",
      "content": "\"\"\"A pickle-based caching core for cachier.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\nimport os\nimport pickle  # for local caching\nfrom contextlib import suppress\nfrom datetime import datetime\n\nimport portalocker  # to lock on pickle cache IO\nfrom watchdog.events import PatternMatchingEventHandler\nfrom watchdog.observers import Observer\n\nfrom .._types import HashFunc\nfrom ..config import _update_with_defaults\n\n# Alternative:  https://github.com/WoLpH/portalocker\nfrom .base import _BaseCore\n\n\nclass _PickleCore(_BaseCore):\n    \"\"\"The pickle core class for cachier.\"\"\"\n\n    class CacheChangeHandler(PatternMatchingEventHandler):\n        \"\"\"Handles cache-file modification events.\"\"\"\n\n        def __init__(self, filename, core, key):\n            PatternMatchingEventHandler.__init__(\n                self,\n                patterns=[\"*\" + filename],\n                ignore_patterns=None,\n                ignore_directories=True,\n                case_sensitive=False,\n            )\n            self.core = core\n            self.key = key\n            self.observer = None\n            self.value = None\n\n        def inject_observer(self, observer):\n            \"\"\"Inject the observer running this handler.\"\"\"\n            self.observer = observer\n\n        def _check_calculation(self):\n            # print('checking calc')\n            entry = self.core.get_entry_by_key(self.key, True)[1]\n            # print(self.key)\n            # print(entry)\n            try:\n                if not entry[\"being_calculated\"]:\n                    # print('stopping observer!')\n                    self.value = entry[\"value\"]\n                    self.observer.stop()\n                # else:\n                # print('NOT stopping observer... :(')\n            except TypeError:\n                self.value = None\n                self.observer.stop()\n\n        def on_created(self, event):\n            \"\"\"A Watchdog Event Handler method.\"\"\"  # noqa: D401\n            self._check_calculation()  # pragma: no cover\n\n        def on_modified(self, event):\n            \"\"\"A Watchdog Event Handler method.\"\"\"  # noqa: D401\n            self._check_calculation()\n\n    def __init__(\n        self,\n        hash_func: HashFunc,\n        pickle_reload: bool,\n        cache_dir: str,\n        separate_files: bool,\n        wait_for_calc_timeout: int,\n    ):\n        super().__init__(hash_func, wait_for_calc_timeout)\n        self.cache = None\n        self.reload = _update_with_defaults(pickle_reload, \"pickle_reload\")\n        self.cache_dir = os.path.expanduser(\n            _update_with_defaults(cache_dir, \"cache_dir\")\n        )\n        self.separate_files = _update_with_defaults(\n            separate_files, \"separate_files\"\n        )\n\n    @property\n    def cache_fname(self) -> str:\n        fname = f\".{self.func.__module__}.{self.func.__qualname__}\"\n        return fname.replace(\"<\", \"_\").replace(\">\", \"_\")\n\n    @property\n    def cache_fpath(self) -> str:\n        os.makedirs(self.cache_dir, exist_ok=True)\n        return os.path.abspath(\n            os.path.join(os.path.realpath(self.cache_dir), self.cache_fname)\n        )\n\n    def _reload_cache(self):\n        with self.lock:\n            try:\n                with portalocker.Lock(\n                    self.cache_fpath, mode=\"rb\"\n                ) as cache_file:\n                    self.cache = pickle.load(cache_file)  # noqa: S301\n            except (FileNotFoundError, EOFError):\n                self.cache = {}\n\n    def _get_cache(self):\n        with self.lock:\n            if not self.cache:\n                self._reload_cache()\n            return self.cache\n\n    def _get_cache_by_key(self, key=None, hash_str=None):\n        fpath = self.cache_fpath\n        fpath += f\"_{hash_str or key}\"\n        try:\n            with portalocker.Lock(fpath, mode=\"rb\") as cache_file:\n                return pickle.load(cache_file)  # noqa: S301\n        except (FileNotFoundError, EOFError):\n            return None\n\n    def _clear_all_cache_files(self):\n        path, name = os.path.split(self.cache_fpath)\n        for subpath in os.listdir(path):\n            if subpath.startswith(f\"{name}_\"):\n                os.remove(os.path.join(path, subpath))\n\n    def _clear_being_calculated_all_cache_files(self):\n        path, name = os.path.split(self.cache_fpath)\n        for subpath in os.listdir(path):\n            if subpath.startswith(name):\n                entry = self._get_cache_by_key(hash_str=subpath.split(\"_\")[-1])\n                if entry is not None:\n                    entry[\"being_calculated\"] = False\n                    self._save_cache(entry, hash_str=subpath.split(\"_\")[-1])\n\n    def _save_cache(self, cache, key=None, hash_str=None):\n        fpath = self.cache_fpath\n        if key is not None:\n            fpath += f\"_{key}\"\n        elif hash_str is not None:\n            fpath += f\"_{hash_str}\"\n        with self.lock:\n            self.cache = cache\n            with portalocker.Lock(fpath, mode=\"wb\") as cache_file:\n                pickle.dump(cache, cache_file, protocol=4)\n            if key is None:\n                self._reload_cache()\n\n    def get_entry_by_key(self, key, reload=False):\n        with self.lock:\n            if self.separate_files:\n                return key, self._get_cache_by_key(key)\n            if self.reload or reload:\n                self._reload_cache()\n            return key, self._get_cache().get(key, None)\n\n    def set_entry(self, key, func_res):\n        key_data = {\n            \"value\": func_res,\n            \"time\": datetime.now(),\n            \"stale\": False,\n            \"being_calculated\": False,\n        }\n        if self.separate_files:\n            self._save_cache(key_data, key)\n            return  # pragma: no cover\n\n        with self.lock:\n            cache = self._get_cache()\n            cache[key] = key_data\n            self._save_cache(cache)\n\n    def mark_entry_being_calculated_separate_files(self, key):\n        self._save_cache(\n            {\n                \"value\": None,\n                \"time\": datetime.now(),\n                \"stale\": False,\n                \"being_calculated\": True,\n            },\n            key=key,\n        )\n\n    def mark_entry_not_calculated_separate_files(self, key):\n        _, entry = self.get_entry_by_key(key)\n        entry[\"being_calculated\"] = False\n        self._save_cache(entry, key=key)\n\n    def mark_entry_being_calculated(self, key):\n        if self.separate_files:\n            self.mark_entry_being_calculated_separate_files(key)\n            return  # pragma: no cover\n\n        with self.lock:\n            cache = self._get_cache()\n            try:\n                cache[key][\"being_calculated\"] = True\n            except KeyError:\n                cache[key] = {\n                    \"value\": None,\n                    \"time\": datetime.now(),\n                    \"stale\": False,\n                    \"being_calculated\": True,\n                }\n            self._save_cache(cache)\n\n    def mark_entry_not_calculated(self, key):\n        if self.separate_files:\n            self.mark_entry_not_calculated_separate_files(key)\n        with self.lock:\n            cache = self._get_cache()\n            # that's ok, we don't need an entry in that case\n            with suppress(KeyError):\n                cache[key][\"being_calculated\"] = False\n                self._save_cache(cache)\n\n    def wait_on_entry_calc(self, key):\n        if self.separate_files:\n            entry = self._get_cache_by_key(key)\n            filename = f\"{self.cache_fname}_{key}\"\n        else:\n            with self.lock:\n                self._reload_cache()\n                entry = self._get_cache()[key]\n            filename = self.cache_fname\n        if not entry[\"being_calculated\"]:\n            return entry[\"value\"]\n        event_handler = _PickleCore.CacheChangeHandler(\n            filename=filename, core=self, key=key\n        )\n        observer = Observer()\n        event_handler.inject_observer(observer)\n        observer.schedule(event_handler, path=self.cache_dir, recursive=True)\n        observer.start()\n        time_spent = 0\n        while observer.is_alive():\n            observer.join(timeout=1.0)\n            time_spent += 1\n            self.check_calc_timeout(time_spent)\n        return event_handler.value\n\n    def clear_cache(self):\n        if self.separate_files:\n            self._clear_all_cache_files()\n        else:\n            self._save_cache({})\n\n    def clear_being_calculated(self):\n        if self.separate_files:\n            self._clear_being_calculated_all_cache_files()\n            return  # pragma: no cover\n\n        with self.lock:\n            cache = self._get_cache()\n            for key in cache:\n                cache[key][\"being_calculated\"] = False\n            self._save_cache(cache)\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/memory.py",
      "content": "\"\"\"A memory-based caching core for cachier.\"\"\"\n\nimport threading\nfrom datetime import datetime\n\nfrom .._types import HashFunc\nfrom .base import _BaseCore, _get_func_str\n\n\nclass _MemoryCore(_BaseCore):\n    \"\"\"The memory core class for cachier.\"\"\"\n\n    def __init__(self, hash_func: HashFunc, wait_for_calc_timeout: int):\n        super().__init__(hash_func, wait_for_calc_timeout)\n        self.cache = {}\n\n    def _hash_func_key(self, key):\n        return f\"{_get_func_str(self.func)}:{key}\"\n\n    def get_entry_by_key(self, key, reload=False):\n        with self.lock:\n            return key, self.cache.get(self._hash_func_key(key), None)\n\n    def set_entry(self, key, func_res):\n        with self.lock:\n            try:\n                # we need to retain the existing condition so that\n                # mark_entry_not_calculated can notify all possibly-waiting\n                # threads about it\n                cond = self.cache[self._hash_func_key(key)][\"condition\"]\n            except KeyError:  # pragma: no cover\n                cond = None\n            self.cache[self._hash_func_key(key)] = {\n                \"value\": func_res,\n                \"time\": datetime.now(),\n                \"stale\": False,\n                \"being_calculated\": False,\n                \"condition\": cond,\n            }\n\n    def mark_entry_being_calculated(self, key):\n        with self.lock:\n            condition = threading.Condition()\n            # condition.acquire()\n            try:\n                self.cache[self._hash_func_key(key)][\"being_calculated\"] = True\n                self.cache[self._hash_func_key(key)][\"condition\"] = condition\n            except KeyError:\n                self.cache[self._hash_func_key(key)] = {\n                    \"value\": None,\n                    \"time\": datetime.now(),\n                    \"stale\": False,\n                    \"being_calculated\": True,\n                    \"condition\": condition,\n                }\n\n    def mark_entry_not_calculated(self, key):\n        with self.lock:\n            try:\n                entry = self.cache[self._hash_func_key(key)]\n            except KeyError:  # pragma: no cover\n                return  # that's ok, we don't need an entry in that case\n            entry[\"being_calculated\"] = False\n            cond = entry[\"condition\"]\n            if cond:\n                cond.acquire()\n                cond.notify_all()\n                cond.release()\n                entry[\"condition\"] = None\n\n    def wait_on_entry_calc(self, key):\n        with self.lock:  # pragma: no cover\n            entry = self.cache[self._hash_func_key(key)]\n            if not entry[\"being_calculated\"]:\n                return entry[\"value\"]\n        entry[\"condition\"].acquire()\n        entry[\"condition\"].wait()\n        entry[\"condition\"].release()\n        return self.cache[self._hash_func_key(key)][\"value\"]\n\n    def clear_cache(self):\n        with self.lock:\n            self.cache.clear()\n\n    def clear_being_calculated(self):\n        with self.lock:\n            for entry in self.cache.values():\n                entry[\"being_calculated\"] = False\n                entry[\"condition\"] = None\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/mongo.py",
      "content": "\"\"\"A MongoDB-based caching core for cachier.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\nimport pickle  # for serialization of python objects\nimport sys  # to make sure that pymongo was imported\nimport time  # to sleep when waiting on Mongo cache\\\nimport warnings  # to warn if pymongo is missing\nfrom contextlib import suppress\nfrom datetime import datetime\n\nfrom .._types import HashFunc, Mongetter\n\nwith suppress(ImportError):\n    from bson.binary import Binary  # to save binary data to mongodb\n    from pymongo import ASCENDING, IndexModel\n    from pymongo.errors import OperationFailure\n\nfrom .base import RecalculationNeeded, _BaseCore, _get_func_str\n\nMONGO_SLEEP_DURATION_IN_SEC = 1\n\n\nclass MissingMongetter(ValueError):\n    \"\"\"Thrown when the mongetter keyword argument is missing.\"\"\"\n\n\nclass _MongoCore(_BaseCore):\n    _INDEX_NAME = \"func_1_key_1\"\n\n    def __init__(\n        self,\n        hash_func: HashFunc,\n        mongetter: Mongetter,\n        wait_for_calc_timeout: int,\n    ):\n        if \"pymongo\" not in sys.modules:\n            warnings.warn(\n                \"`pymongo` was not found. MongoDB cores will not function.\",\n                ImportWarning,\n                stacklevel=2,\n            )  # pragma: no cover\n\n        super().__init__(hash_func, wait_for_calc_timeout)\n        if mongetter is None:\n            raise MissingMongetter(\n                \"must specify ``mongetter`` when using the mongo core\"\n            )\n        self.mongetter = mongetter\n        self.mongo_collection = self.mongetter()\n        index_inf = self.mongo_collection.index_information()\n        if _MongoCore._INDEX_NAME not in index_inf:\n            func1key1 = IndexModel(\n                keys=[(\"func\", ASCENDING), (\"key\", ASCENDING)],\n                name=_MongoCore._INDEX_NAME,\n            )\n            self.mongo_collection.create_indexes([func1key1])\n\n    @property\n    def _func_str(self) -> str:\n        return _get_func_str(self.func)\n\n    def get_entry_by_key(self, key):\n        res = self.mongo_collection.find_one(\n            {\"func\": self._func_str, \"key\": key}\n        )\n        if not res:\n            return key, None\n        try:\n            entry = {\n                \"value\": pickle.loads(res[\"value\"]),  # noqa: S301\n                \"time\": res.get(\"time\", None),\n                \"stale\": res.get(\"stale\", False),\n                \"being_calculated\": res.get(\"being_calculated\", False),\n            }\n        except KeyError:\n            entry = {\n                \"value\": None,\n                \"time\": res.get(\"time\", None),\n                \"stale\": res.get(\"stale\", False),\n                \"being_calculated\": res.get(\"being_calculated\", False),\n            }\n        return key, entry\n\n    def set_entry(self, key, func_res):\n        thebytes = pickle.dumps(func_res)\n        self.mongo_collection.update_one(\n            filter={\"func\": self._func_str, \"key\": key},\n            update={\n                \"$set\": {\n                    \"func\": self._func_str,\n                    \"key\": key,\n                    \"value\": Binary(thebytes),\n                    \"time\": datetime.now(),\n                    \"stale\": False,\n                    \"being_calculated\": False,\n                }\n            },\n            upsert=True,\n        )\n\n    def mark_entry_being_calculated(self, key):\n        self.mongo_collection.update_one(\n            filter={\"func\": self._func_str, \"key\": key},\n            update={\"$set\": {\"being_calculated\": True}},\n            upsert=True,\n        )\n\n    def mark_entry_not_calculated(self, key):\n        with suppress(OperationFailure):  # don't care in this case\n            self.mongo_collection.update_one(\n                filter={\n                    \"func\": self._func_str,\n                    \"key\": key,\n                },\n                update={\"$set\": {\"being_calculated\": False}},\n                upsert=False,  # should not insert in this case\n            )\n\n    def wait_on_entry_calc(self, key):\n        time_spent = 0\n        while True:\n            time.sleep(MONGO_SLEEP_DURATION_IN_SEC)\n            time_spent += MONGO_SLEEP_DURATION_IN_SEC\n            key, entry = self.get_entry_by_key(key)\n            if entry is None:\n                raise RecalculationNeeded()\n            if not entry[\"being_calculated\"]:\n                return entry[\"value\"]\n            self.check_calc_timeout(time_spent)\n\n    def clear_cache(self):\n        self.mongo_collection.delete_many(filter={\"func\": self._func_str})\n\n    def clear_being_calculated(self):\n        self.mongo_collection.update_many(\n            filter={\n                \"func\": self._func_str,\n                \"being_calculated\": True,\n            },\n            update={\"$set\": {\"being_calculated\": False}},\n        )\n"
    },
    {
      "path": "python-cachier_cachier/tests/__init__.py",
      "content": ""
    },
    {
      "path": "python-cachier_cachier/tests/standalone_script.py",
      "content": "import time\n\nimport cachier\n\n\n@cachier.cachier()\ndef _takes_3_seconds(label, value):\n    time.sleep(3)\n    return f\"{label} {value}\"\n\n\nprint(_takes_3_seconds(\"two\", 2))\n"
    },
    {
      "path": "python-cachier_cachier/tests/speed_eval.py",
      "content": "from random import random\nfrom time import time\n\nfrom cachier import cachier\n\n\n@cachier(next_time=True)\ndef _test_int_pickling(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    return int_1 + int_2\n\n\ndef _test_int_pickling_compare(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    return int_1 + int_2\n\n\ndef test_pickle_speed():\n    \"\"\"Test speeds.\"\"\"\n    print(\"Comparing speeds of decorated vs non-decorated functions...\")\n    num_of_vals = 1000\n    times = []\n    for i in range(1, num_of_vals):\n        tic = time()\n        _test_int_pickling_compare(i, i + 1)\n        toc = time()\n        times.append(toc - tic)\n    print(f\"  - Non-decorated average = {sum(times) / num_of_vals}\")\n\n    _test_int_pickling.clear_cache()\n    times = []\n    for i in range(1, num_of_vals):\n        tic = time()\n        _test_int_pickling(i, i + 1)\n        toc = time()\n        times.append(toc - tic)\n    print(f\"  - Decorated average = {sum(times) / num_of_vals}\")\n\n\n@cachier()\ndef _test_single_file_speed(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    # something that takes some memory\n    return [random() for _ in range(1000000)]\n\n\n@cachier(separate_files=True)\ndef _test_separate_files_speed(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    # something that takes some memory\n    return [random() for _ in range(1000000)]\n\n\ndef test_separate_files_vs_single_file():\n    _test_separate_files_speed.clear_cache()\n    _test_single_file_speed.clear_cache()\n    start_time = time()\n    for _ in range(3):\n        for j in range(10):\n            _test_separate_files_speed(j, 2)\n    print(f\"separate files time: {time() - start_time}\")\n    start_time = time()\n    for _ in range(3):\n        for j in range(10):\n            _test_single_file_speed(j, 2)\n    print(f\"single file time: {time() - start_time}\")\n\n\nif __name__ == \"__main__\":\n    test_pickle_speed()\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_memory_core.py",
      "content": "\"\"\"Test for the in-memory implementation of the  Cachier python package.\"\"\"\n\nimport hashlib\nimport queue\nimport threading\nfrom datetime import timedelta\nfrom random import random\nfrom time import sleep, time\n\nimport pandas as pd\nimport pytest\nfrom cachier import cachier\n\n\n@cachier(backend=\"memory\", next_time=False)\ndef _takes_2_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)\n    return f\"arg_1:{arg_1}, arg_2:{arg_2}\"\n\n\n@pytest.mark.memory\ndef test_memory_core():\n    \"\"\"Basic memory core functionality.\"\"\"\n    _takes_2_seconds.clear_cache()\n    _takes_2_seconds(\"a\", \"b\")\n    start = time()\n    _takes_2_seconds(\"a\", \"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds.clear_cache()\n\n\n@pytest.mark.memory\ndef test_memory_core_keywords():\n    \"\"\"Basic memory core functionality with keyword arguments.\"\"\"\n    _takes_2_seconds.clear_cache()\n    _takes_2_seconds(\"a\", arg_2=\"b\")\n    start = time()\n    _takes_2_seconds(\"a\", arg_2=\"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds.clear_cache()\n\n\nSECONDS_IN_DELTA = 3\nDELTA = timedelta(seconds=SECONDS_IN_DELTA)\n\n\n@cachier(backend=\"memory\", stale_after=DELTA, next_time=False)\ndef _stale_after_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.memory\ndef test_stale_after():\n    \"\"\"Testing the stale_after functionality.\"\"\"\n    _stale_after_seconds.clear_cache()\n    val1 = _stale_after_seconds(1, 2)\n    val2 = _stale_after_seconds(1, 2)\n    val3 = _stale_after_seconds(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(3)\n    val4 = _stale_after_seconds(1, 2)\n    assert val4 != val1\n    _stale_after_seconds.clear_cache()\n\n\n@cachier(backend=\"memory\", stale_after=DELTA, next_time=True)\ndef _stale_after_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.memory\ndef test_stale_after_next_time():\n    \"\"\"Testing the stale_after with next_time functionality.\"\"\"\n    _stale_after_next_time.clear_cache()\n    val1 = _stale_after_next_time(1, 2)\n    val2 = _stale_after_next_time(1, 2)\n    val3 = _stale_after_next_time(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(SECONDS_IN_DELTA + 1)\n    val4 = _stale_after_next_time(1, 2)\n    assert val4 == val1\n    sleep(0.5)\n    val5 = _stale_after_next_time(1, 2)\n    assert val5 != val1\n    _stale_after_next_time.clear_cache()\n\n\n@cachier(backend=\"memory\")\ndef _random_num():\n    return random()\n\n\n@cachier(backend=\"memory\")\ndef _random_num_with_arg(a):\n    # print(a)\n    return random()\n\n\n@pytest.mark.memory\ndef test_overwrite_cache():\n    \"\"\"Tests that the overwrite feature works correctly.\"\"\"\n    _random_num.clear_cache()\n    int1 = _random_num()\n    int2 = _random_num()\n    assert int2 == int1\n    int3 = _random_num(cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num()\n    assert int4 == int3\n    _random_num.clear_cache()\n\n    _random_num_with_arg.clear_cache()\n    int1 = _random_num_with_arg(\"a\")\n    int2 = _random_num_with_arg(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg(\"a\", cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg(\"a\")\n    assert int4 == int3\n    _random_num_with_arg.clear_cache()\n\n\n@pytest.mark.memory\ndef test_ignore_cache():\n    \"\"\"Tests that the ignore_cache feature works correctly.\"\"\"\n    _random_num.clear_cache()\n    int1 = _random_num()\n    int2 = _random_num()\n    assert int2 == int1\n    int3 = _random_num(cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num()\n    assert int4 != int3\n    assert int4 == int1\n    _random_num.clear_cache()\n\n    _random_num_with_arg.clear_cache()\n    int1 = _random_num_with_arg(\"a\")\n    int2 = _random_num_with_arg(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg(\"a\", cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg(\"a\")\n    assert int4 != int3\n    assert int4 == int1\n    _random_num_with_arg.clear_cache()\n\n\n@cachier(backend=\"memory\")\ndef _takes_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)  # this has to be enough time for check_calculation to run twice\n    return random() + arg_1 + arg_2\n\n\ndef _calls_takes_time(res_queue):\n    res = _takes_time(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.memory\ndef test_memory_being_calculated():\n    \"\"\"Testing memory core handling of being calculated scenarios.\"\"\"\n    _takes_time.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=3)\n    thread2.join(timeout=3)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\n@cachier(backend=\"memory\", stale_after=timedelta(seconds=1), next_time=True)\ndef _being_calc_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\ndef _calls_being_calc_next_time(res_queue):\n    res = _being_calc_next_time(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.memory\ndef test_being_calc_next_time():\n    \"\"\"Testing memory core handling of being calculated scenarios.\"\"\"\n    _takes_time.clear_cache()\n    _being_calc_next_time(0.13, 0.02)\n    sleep(1.1)\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=3)\n    thread2.join(timeout=3)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\n@cachier(backend=\"memory\")\ndef _bad_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n@cachier(backend=\"memory\")\ndef _delete_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n@pytest.mark.memory\ndef test_clear_being_calculated():\n    \"\"\"Test memory core clear `being calculated` functionality.\"\"\"\n    _takes_time.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread1.start()\n    _takes_time.clear_being_calculated()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=3)\n    thread2.join(timeout=3)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 != res2\n\n\n@pytest.mark.memory\ndef test_clear_being_calculated_with_empty_cache():\n    \"\"\"Test memory core clear `being calculated` functionality.\"\"\"\n    _takes_time.clear_cache()\n    _takes_time.clear_being_calculated()\n\n\n@cachier(backend=\"memory\", stale_after=timedelta(seconds=1), next_time=True)\ndef _error_throwing_func(arg1):\n    if not hasattr(_error_throwing_func, \"count\"):\n        _error_throwing_func.count = 0\n    _error_throwing_func.count += 1\n    if _error_throwing_func.count > 1:\n        raise ValueError(\"Tiny Rick!\")\n    return 7\n\n\n@pytest.mark.memory\ndef test_error_throwing_func():\n    # with\n    res1 = _error_throwing_func(4)\n    sleep(1.5)\n    res2 = _error_throwing_func(4)\n    assert res1 == res2\n\n\n@pytest.mark.memory\ndef test_callable_hash_param():\n    def _hash_func(args, kwargs):\n        def _hash(obj):\n            if isinstance(obj, pd.core.frame.DataFrame):\n                return hashlib.sha256(\n                    pd.util.hash_pandas_object(obj).values.tobytes()\n                ).hexdigest()\n            return obj\n\n        k_args = tuple(map(_hash, args))\n        k_kwargs = tuple(\n            sorted({k: _hash(v) for k, v in kwargs.items()}.items())\n        )\n        return k_args + k_kwargs\n\n    @cachier(backend=\"memory\", hash_func=_hash_func)\n    def _params_with_dataframe(*args, **kwargs):\n        \"\"\"Some function.\"\"\"\n        return random()\n\n    _params_with_dataframe.clear_cache()\n\n    df_a = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    df_b = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    value_a = _params_with_dataframe(df_a, 1)\n    value_b = _params_with_dataframe(df_b, 1)\n\n    assert value_a == value_b  # same content --> same key\n\n    value_a = _params_with_dataframe(1, df=df_a)\n    value_b = _params_with_dataframe(1, df=df_b)\n\n    assert value_a == value_b  # same content --> same key\n\n\nif __name__ == \"__main__\":\n    test_memory_being_calculated()\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_core_lookup.py",
      "content": "\"\"\"Testing a few basic cachier interfaces.\"\"\"\n\nimport pytest\nfrom cachier import cachier, get_default_params\nfrom cachier.cores.mongo import MissingMongetter\n\n\ndef test_get_default_params():\n    params = get_default_params()\n    assert tuple(sorted(params)) == (\n        \"allow_none\",\n        \"backend\",\n        \"cache_dir\",\n        \"caching_enabled\",\n        \"hash_func\",\n        \"mongetter\",\n        \"next_time\",\n        \"pickle_reload\",\n        \"separate_files\",\n        \"stale_after\",\n        \"wait_for_calc_timeout\",\n    )\n\n\ndef test_bad_name(name=\"nope\"):\n    # Test that the appropriate exception is thrown\n    # when an invalid backend is given.\n    with pytest.raises(ValueError, match=f\"specified an invalid core: {name}\"):\n\n        @cachier(backend=name)\n        def dummy_func():\n            pass\n\n\ndef test_missing_mongetter():\n    # Test that the appropriate exception is thrown\n    # when forgetting to specify the mongetter.\n    with pytest.raises(MissingMongetter):\n\n        @cachier(backend=\"mongo\", mongetter=None)\n        def dummy_func():\n            pass\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_general.py",
      "content": "\"\"\"Non-core-specific tests for cachier.\"\"\"\n\nimport functools\nimport os\nimport queue\nimport subprocess  # nosec: B404\nimport threading\nfrom contextlib import suppress\nfrom random import random\nfrom time import sleep, time\n\nimport cachier\nimport pytest\nfrom cachier.core import (\n    DEFAULT_MAX_WORKERS,\n    MAX_WORKERS_ENVAR_NAME,\n    _get_executor,\n    _max_workers,\n    _set_max_workers,\n)\n\nfrom tests.test_mongo_core import (\n    MONGO_DELTA_LONG,\n    _test_mongetter,\n)\n\n\ndef test_information():\n    print(\"\\ncachier version: \", end=\"\")\n    print(cachier.__version__)\n\n\ndef test_max_workers():\n    \"\"\"Just call this function for coverage.\"\"\"\n    with suppress(KeyError):\n        del os.environ[MAX_WORKERS_ENVAR_NAME]\n    assert _max_workers() == DEFAULT_MAX_WORKERS\n\n\ndef test_get_executor():\n    \"\"\"Just call this function for coverage.\"\"\"\n    _get_executor()\n    _get_executor(False)\n    _get_executor(True)\n\n\ndef test_set_max_workers():\n    \"\"\"Just call this function for coverage.\"\"\"\n    _set_max_workers(9)\n\n\nparametrize_keys = \"mongetter,stale_after,separate_files\"\nparametrize_values = [\n    pytest.param(\n        _test_mongetter, MONGO_DELTA_LONG, False, marks=pytest.mark.mongo\n    ),\n    (None, None, False),\n    (None, None, True),\n]\n\n\n@pytest.mark.parametrize(parametrize_keys, parametrize_values)\ndef test_wait_for_calc_timeout_ok(mongetter, stale_after, separate_files):\n    @cachier.cachier(\n        mongetter=mongetter,\n        stale_after=stale_after,\n        separate_files=separate_files,\n        next_time=False,\n        wait_for_calc_timeout=2,\n    )\n    def _wait_for_calc_timeout_fast(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        sleep(1)\n        return random() + arg_1 + arg_2\n\n    def _calls_wait_for_calc_timeout_fast(res_queue):\n        res = _wait_for_calc_timeout_fast(1, 2)\n        res_queue.put(res)\n\n    \"\"\" Testing calls that avoid timeouts store the values in cache. \"\"\"\n    _wait_for_calc_timeout_fast.clear_cache()\n    val1 = _wait_for_calc_timeout_fast(1, 2)\n    val2 = _wait_for_calc_timeout_fast(1, 2)\n    assert val1 == val2\n\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_fast,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_fast,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n\n    thread1.start()\n    thread2.start()\n    sleep(2)\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2  # Timeout did not kick in, a single call was done\n\n\n@pytest.mark.parametrize(parametrize_keys, parametrize_values)\ndef test_wait_for_calc_timeout_slow(mongetter, stale_after, separate_files):\n    @cachier.cachier(\n        mongetter=mongetter,\n        stale_after=stale_after,\n        separate_files=separate_files,\n        next_time=False,\n        wait_for_calc_timeout=2,\n    )\n    def _wait_for_calc_timeout_slow(arg_1, arg_2):\n        sleep(3)\n        return random() + arg_1 + arg_2\n\n    def _calls_wait_for_calc_timeout_slow(res_queue):\n        res = _wait_for_calc_timeout_slow(1, 2)\n        res_queue.put(res)\n\n    \"\"\"Testing for calls timing out to be performed twice when needed.\"\"\"\n    _wait_for_calc_timeout_slow.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n\n    thread1.start()\n    thread2.start()\n    sleep(1)\n    res3 = _wait_for_calc_timeout_slow(1, 2)\n    sleep(4)\n    thread1.join(timeout=4)\n    thread2.join(timeout=4)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 != res2  # Timeout kicked in.  Two calls were done\n    res4 = _wait_for_calc_timeout_slow(1, 2)\n    # One of the cached values is returned\n    assert res1 == res4 or res2 == res4 or res3 == res4\n\n\n@pytest.mark.parametrize(\n    \"mongetter,backend\",\n    [\n        pytest.param(_test_mongetter, \"mongo\", marks=pytest.mark.mongo),\n        (None, \"memory\"),\n        (None, \"pickle\"),\n    ],\n)\ndef test_precache_value(mongetter, backend):\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def dummy_func(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return arg_1 + arg_2\n\n    assert dummy_func.precache_value(2, 2, value_to_cache=5) == 5\n    assert dummy_func(2, 2) == 5\n    dummy_func.clear_cache()\n    assert dummy_func(2, 2) == 4\n    assert dummy_func.precache_value(2, arg_2=2, value_to_cache=5) == 5\n    assert dummy_func(2, arg_2=2) == 5\n\n\n@pytest.mark.parametrize(\n    \"mongetter,backend\",\n    [\n        pytest.param(_test_mongetter, \"mongo\", marks=pytest.mark.mongo),\n        (None, \"memory\"),\n        (None, \"pickle\"),\n    ],\n)\ndef test_ignore_self_in_methods(mongetter, backend):\n    class DummyClass:\n        @cachier.cachier(backend=backend, mongetter=mongetter)\n        def takes_2_seconds(self, arg_1, arg_2):\n            \"\"\"Some function.\"\"\"\n            sleep(2)\n            return arg_1 + arg_2\n\n    test_object_1 = DummyClass()\n    test_object_2 = DummyClass()\n    test_object_1.takes_2_seconds.clear_cache()\n    test_object_2.takes_2_seconds.clear_cache()\n    assert test_object_1.takes_2_seconds(1, 2) == 3\n    start = time()\n    assert test_object_2.takes_2_seconds(1, 2) == 3\n    end = time()\n    assert end - start < 1\n\n\ndef test_hash_params_deprecation():\n    with pytest.deprecated_call(match=\"hash_params will be removed\"):\n\n        @cachier.cachier(hash_params=lambda a, k: \"key\")\n        def test():\n            return \"value\"\n\n    assert test() == \"value\"\n\n\ndef test_separate_processes():\n    test_args = (\"python\", \"tests/standalone_script.py\")\n    run_params = {\"args\": test_args, \"capture_output\": True, \"text\": True}\n    run_process = functools.partial(subprocess.run, **run_params)\n    result = run_process()\n    assert result.stdout.strip() == \"two 2\"\n    start = time()\n    result = run_process()\n    end = time()\n    assert result.stdout.strip() == \"two 2\"\n    assert end - start < 3\n\n\ndef test_global_disable():\n    @cachier.cachier()\n    def get_random():\n        return random()\n\n    get_random.clear_cache()\n    result_1 = get_random()\n    result_2 = get_random()\n    cachier.disable_caching()\n    result_3 = get_random()\n    cachier.enable_caching()\n    result_4 = get_random()\n    assert result_1 == result_2 == result_4\n    assert result_1 != result_3\n\n\ndef test_none_not_cached_by_default():\n    count = 0\n\n    @cachier.cachier()\n    def do_operation():\n        nonlocal count\n        count += 1\n        return None\n\n    do_operation.clear_cache()\n    assert count == 0\n    do_operation()\n    do_operation()\n    assert count == 2\n\n\ndef test_allow_caching_none():\n    count = 0\n\n    @cachier.cachier(allow_none=True)\n    def do_operation():\n        nonlocal count\n        count += 1\n        return None\n\n    do_operation.clear_cache()\n    assert count == 0\n    do_operation()\n    do_operation()\n    assert count == 1\n\n\ndef test_identical_inputs():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a: int, b: int = 2, c: int = 3):\n        nonlocal count\n        count += 1\n        return a + b + c\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func(1, 2, 3) == 6\n    assert dummy_func(1, 2, c=3) == 6\n    assert dummy_func(1, b=2, c=3) == 6\n    assert dummy_func(a=1, b=2, c=3) == 6\n    assert count == 1\n\n\ndef test_list_inputs():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a: list, b: list = [2]):  # noqa: B006\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func([1]) == [1, 2]\n    assert dummy_func([1], [2]) == [1, 2]\n    assert dummy_func([1], b=[2]) == [1, 2]\n    assert dummy_func(a=[1], b=[2]) == [1, 2]\n    assert count == 1\n\n\ndef test_order_independent_kwargs_handling():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a, b):\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func(a=1, b=2) == 3\n    assert dummy_func(a=1, b=2) == 3\n    assert dummy_func(b=2, a=1) == 3\n    assert count == 1\n\n\n@pytest.mark.parametrize(\"backend\", [\"memory\", \"pickle\"])\ndef test_diff_functions_same_args(tmpdir, backend: str):\n    count_p = count_m = 0\n\n    @cachier.cachier(cache_dir=tmpdir, backend=backend)\n    def fn_plus(a, b=2):\n        nonlocal count_p\n        count_p += 1\n        return a + b\n\n    @cachier.cachier(cache_dir=tmpdir, backend=backend)\n    def fn_minus(a, b=2):\n        nonlocal count_m\n        count_m += 1\n        return a - b\n\n    assert count_p == count_m == 0\n\n    for fn, expected in [(fn_plus, 3), (fn_minus, -1)]:\n        assert fn(1) == expected\n        assert fn(a=1, b=2) == expected\n    assert count_p == 1\n    assert count_m == 1\n\n\n@pytest.mark.parametrize(\"backend\", [\"memory\", \"pickle\"])\ndef test_runtime_handling(tmpdir, backend):\n    count_p = count_m = 0\n\n    def fn_plus(a, b=2):\n        nonlocal count_p\n        count_p += 1\n        return a + b\n\n    def fn_minus(a, b=2):\n        nonlocal count_m\n        count_m += 1\n        return a - b\n\n    cachier_ = cachier.cachier(cache_dir=tmpdir, backend=backend)\n    assert count_p == count_m == 0\n\n    for fn, expected in [(fn_plus, 3), (fn_minus, -1)]:\n        assert cachier_(fn)(1, 2) == expected\n        assert cachier_(fn)(a=1, b=2) == expected\n    assert count_p == 1\n    assert count_m == 1\n\n    for fn, expected in [(fn_plus, 5), (fn_minus, 1)]:\n        assert cachier_(fn)(3, 2) == expected\n        assert cachier_(fn)(a=3, b=2) == expected\n    assert count_p == 2\n    assert count_m == 2\n\n\ndef test_partial_handling(tmpdir):\n    count_p = count_m = 0\n\n    def fn_plus(a, b=2):\n        nonlocal count_p\n        count_p += 1\n        return a + b\n\n    def fn_minus(a, b=2):\n        nonlocal count_m\n        count_m += 1\n        return a - b\n\n    cachier_ = cachier.cachier(cache_dir=tmpdir)\n    assert count_p == count_m == 0\n\n    for fn, expected in [(fn_plus, 3), (fn_minus, -1)]:\n        dummy_ = functools.partial(fn, 1)\n        assert cachier_(dummy_)() == expected\n\n        dummy_ = functools.partial(fn, 1)\n        assert cachier_(dummy_)(2) == expected\n\n        dummy_ = functools.partial(fn, a=1)\n        assert cachier_(dummy_)() == expected\n\n        dummy_ = functools.partial(fn, b=2)\n        assert cachier_(dummy_)(1) == expected\n\n        dummy_ = functools.partial(fn, b=2)\n        assert cachier_(dummy_)(1, b=2) == expected\n\n        assert cachier_(fn)(1, 2) == expected\n        assert cachier_(fn)(a=1, b=2) == expected\n\n    assert count_p == 1\n    assert count_m == 1\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_defaults.py",
      "content": "import datetime\nimport os\nimport queue\nimport random\nimport threading\nimport time\n\nimport cachier\nimport pytest\n\nfrom tests.test_mongo_core import MONGO_DELTA, _test_mongetter\n\n_default_params = cachier.get_default_params().copy()\n\n\ndef setup_function():\n    cachier.set_default_params(**_default_params)\n\n\ndef teardown_function():\n    cachier.set_default_params(**_default_params)\n\n\ndef test_hash_func_default_param():\n    def slow_hash_func(args, kwds):\n        time.sleep(2)\n        return \"hash\"\n\n    def fast_hash_func(args, kwds):\n        return \"hash\"\n\n    cachier.set_default_params(hash_func=slow_hash_func)\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(hash_func=fast_hash_func)\n    def global_test_2():\n        return None\n\n    start = time.time()\n    global_test_1()\n    end = time.time()\n    assert end - start > 1\n    start = time.time()\n    global_test_2()\n    end = time.time()\n    assert end - start < 1\n\n\ndef test_backend_default_param():\n    cachier.set_default_params(backend=\"memory\")\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(backend=\"pickle\")\n    def global_test_2():\n        return None\n\n    assert global_test_1.cache_dpath() is None\n    assert global_test_2.cache_dpath() is not None\n\n\n@pytest.mark.mongo\ndef test_mongetter_default_param():\n    cachier.set_default_params(mongetter=_test_mongetter)\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(mongetter=False)\n    def global_test_2():\n        return None\n\n    assert global_test_1.cache_dpath() is None\n    assert global_test_2.cache_dpath() is not None\n\n\ndef test_cache_dir_default_param(tmpdir):\n    cachier.set_default_params(cache_dir=tmpdir / \"1\")\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(cache_dir=tmpdir / \"2\")\n    def global_test_2():\n        return None\n\n    assert global_test_1.cache_dpath() == str(tmpdir / \"1\")\n    assert global_test_2.cache_dpath() == str(tmpdir / \"2\")\n\n\ndef test_separate_files_default_param(tmpdir):\n    cachier.set_default_params(separate_files=True)\n\n    @cachier.cachier(cache_dir=tmpdir / \"1\")\n    def global_test_1(arg_1, arg_2):\n        return arg_1 + arg_2\n\n    @cachier.cachier(cache_dir=tmpdir / \"2\", separate_files=False)\n    def global_test_2(arg_1, arg_2):\n        return arg_1 + arg_2\n\n    global_test_1(1, 2)\n    global_test_1(3, 4)\n    global_test_2(1, 2)\n    global_test_2(3, 4)\n\n    assert len(os.listdir(global_test_1.cache_dpath())) == 2\n    assert len(os.listdir(global_test_2.cache_dpath())) == 1\n\n\ndef test_allow_none_default_param(tmpdir):\n    cachier.set_default_params(\n        allow_none=True,\n        separate_files=True,\n        verbose_cache=True,\n    )\n    allow_count = disallow_count = 0\n\n    @cachier.cachier(cache_dir=tmpdir)\n    def allow_none():\n        nonlocal allow_count\n        allow_count += 1\n        return None\n\n    @cachier.cachier(cache_dir=tmpdir, allow_none=False)\n    def disallow_none():\n        nonlocal disallow_count\n        disallow_count += 1\n        return None\n\n    assert allow_count == 0\n    allow_none()\n    allow_none()\n    assert allow_count == 1\n\n    assert disallow_count == 0\n    disallow_none()\n    disallow_none()\n    assert disallow_count == 2\n\n    disallow_none(cachier__allow_none=True)\n    disallow_none(cachier__allow_none=True)\n    assert disallow_count == 2\n\n\nPARAMETRIZE_TEST = (\n    \"backend,mongetter\",\n    [\n        pytest.param(\"pickle\", None, marks=pytest.mark.pickle),\n        pytest.param(\"mongo\", _test_mongetter, marks=pytest.mark.mongo),\n    ],\n)\n\n\n@pytest.mark.parametrize(*PARAMETRIZE_TEST)\ndef test_stale_after_applies_dynamically(backend, mongetter):\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def _stale_after_test(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random.random() + arg_1 + arg_2\n\n    cachier.set_default_params(stale_after=MONGO_DELTA)\n\n    _stale_after_test.clear_cache()\n    val1 = _stale_after_test(1, 2)\n    val2 = _stale_after_test(1, 2)\n    assert val1 == val2\n    time.sleep(3)\n    val3 = _stale_after_test(1, 2)\n    assert val3 != val1\n\n\n@pytest.mark.parametrize(*PARAMETRIZE_TEST)\ndef test_next_time_applies_dynamically(backend, mongetter):\n    NEXT_AFTER_DELTA = datetime.timedelta(seconds=3)\n\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def _stale_after_next_time(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random.random()\n\n    cachier.set_default_params(stale_after=NEXT_AFTER_DELTA, next_time=True)\n\n    _stale_after_next_time.clear_cache()\n    val1 = _stale_after_next_time(1, 2)\n    val2 = _stale_after_next_time(1, 2)\n    val3 = _stale_after_next_time(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    time.sleep(NEXT_AFTER_DELTA.seconds + 1)\n    val4 = _stale_after_next_time(1, 2)\n    assert val4 == val1\n    time.sleep(0.5)\n    val5 = _stale_after_next_time(1, 2)\n    assert val5 != val1\n    _stale_after_next_time.clear_cache()\n\n\n@pytest.mark.parametrize(*PARAMETRIZE_TEST)\ndef test_wait_for_calc_applies_dynamically(backend, mongetter):\n    \"\"\"Testing for calls timing out to be performed twice when needed.\"\"\"\n\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def _wait_for_calc_timeout_slow(arg_1, arg_2):\n        time.sleep(3)\n        return random.random() + arg_1 + arg_2\n\n    def _calls_wait_for_calc_timeout_slow(res_queue):\n        res = _wait_for_calc_timeout_slow(1, 2)\n        res_queue.put(res)\n\n    cachier.set_default_params(wait_for_calc_timeout=2)\n    _wait_for_calc_timeout_slow.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n\n    thread1.start()\n    thread2.start()\n    time.sleep(1)\n    res3 = _wait_for_calc_timeout_slow(1, 2)\n    time.sleep(5)\n    thread1.join(timeout=5)\n    thread2.join(timeout=5)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 != res2  # Timeout kicked in.  Two calls were done\n    res4 = _wait_for_calc_timeout_slow(1, 2)\n    # One of the cached values is returned\n    assert res1 == res4 or res2 == res4 or res3 == res4\n\n\ndef test_default_kwargs_handling():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a, b=2):\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func(1) == 3\n    assert dummy_func(a=1) == 3\n    assert dummy_func(a=1, b=2) == 3\n    assert count == 1\n\n\ndef test_deprecated_func_kwargs():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a, b=2):\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    with pytest.deprecated_call(\n        match=\"`verbose_cache` is deprecated and will be removed\"\n    ):\n        assert dummy_func(1, verbose_cache=True) == 3\n    assert count == 1\n    with pytest.deprecated_call(\n        match=\"`ignore_cache` is deprecated and will be removed\"\n    ):\n        assert dummy_func(1, ignore_cache=True) == 3\n    assert count == 2\n    with pytest.deprecated_call(\n        match=\"`overwrite_cache` is deprecated and will be removed\"\n    ):\n        assert dummy_func(1, overwrite_cache=True) == 3\n    assert count == 3\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_pickle_core.py",
      "content": "\"\"\"Test for the Cachier python package.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\n# from os.path import (\n#     realpath,\n#     dirname\n# )\nimport os\nimport pickle\nimport threading\nfrom datetime import timedelta\nfrom random import random\nfrom time import sleep, time\n\nimport pytest\n\ntry:\n    import queue\nexcept ImportError:  # python 2\n    import Queue as queue  # type: ignore\n\nimport hashlib\n\nimport pandas as pd\nfrom cachier import cachier\nfrom cachier.core import _default_params\n\n\ndef _get_decorated_func(func, **kwargs):\n    cachier_decorator = cachier(**kwargs)\n    decorated_func = cachier_decorator(func)\n    return decorated_func\n\n\n# Pickle core tests\n\n\ndef _takes_2_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)\n    return f\"arg_1:{arg_1}, arg_2:{arg_2}\"\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"reload\", [True, False])\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_core(reload, separate_files):\n    \"\"\"Basic Pickle core functionality.\"\"\"\n    _takes_2_seconds_decorated = _get_decorated_func(\n        _takes_2_seconds,\n        next_time=False,\n        pickle_reload=reload,\n        separate_files=separate_files,\n    )\n    _takes_2_seconds_decorated.clear_cache()\n    _takes_2_seconds_decorated(\"a\", \"b\")\n    start = time()\n    _takes_2_seconds_decorated(\"a\", \"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds_decorated.clear_cache()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_core_keywords(separate_files):\n    \"\"\"Basic Pickle core functionality with keyword arguments.\"\"\"\n    _takes_2_seconds_decorated = _get_decorated_func(\n        _takes_2_seconds, next_time=False, separate_files=separate_files\n    )\n    _takes_2_seconds_decorated.clear_cache()\n    _takes_2_seconds_decorated(\"a\", arg_2=\"b\")\n    start = time()\n    _takes_2_seconds_decorated(\"a\", arg_2=\"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds_decorated.clear_cache()\n\n\nSECONDS_IN_DELTA = 3\nDELTA = timedelta(seconds=SECONDS_IN_DELTA)\n\n\ndef _stale_after_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_stale_after(separate_files):\n    \"\"\"Testing the stale_after functionality.\"\"\"\n    _stale_after_seconds_decorated = _get_decorated_func(\n        _stale_after_seconds,\n        stale_after=DELTA,\n        next_time=False,\n        separate_files=separate_files,\n    )\n    _stale_after_seconds_decorated.clear_cache()\n    val1 = _stale_after_seconds_decorated(1, 2)\n    val2 = _stale_after_seconds_decorated(1, 2)\n    val3 = _stale_after_seconds_decorated(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(3)\n    val4 = _stale_after_seconds_decorated(1, 2)\n    assert val4 != val1\n    _stale_after_seconds_decorated.clear_cache()\n\n\ndef _stale_after_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_stale_after_next_time(separate_files):\n    \"\"\"Testing the stale_after with next_time functionality.\"\"\"\n    _stale_after_next_time_decorated = _get_decorated_func(\n        _stale_after_next_time,\n        stale_after=DELTA,\n        next_time=True,\n        separate_files=separate_files,\n    )\n    _stale_after_next_time_decorated.clear_cache()\n    val1 = _stale_after_next_time_decorated(1, 2)\n    val2 = _stale_after_next_time_decorated(1, 2)\n    val3 = _stale_after_next_time_decorated(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(SECONDS_IN_DELTA + 1)\n    val4 = _stale_after_next_time_decorated(1, 2)\n    assert val4 == val1\n    sleep(0.5)\n    val5 = _stale_after_next_time_decorated(1, 2)\n    assert val5 != val1\n    _stale_after_next_time_decorated.clear_cache()\n\n\ndef _random_num():\n    return random()\n\n\ndef _random_num_with_arg(a):\n    # print(a)\n    return random()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_overwrite_cache(separate_files):\n    \"\"\"Tests that the overwrite feature works correctly.\"\"\"\n    _random_num_decorated = _get_decorated_func(\n        _random_num, separate_files=separate_files\n    )\n    _random_num_with_arg_decorated = _get_decorated_func(\n        _random_num_with_arg, separate_files=separate_files\n    )\n    _random_num_decorated.clear_cache()\n    int1 = _random_num_decorated()\n    int2 = _random_num_decorated()\n    assert int2 == int1\n    int3 = _random_num_decorated(cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num_decorated()\n    assert int4 == int3\n    _random_num_decorated.clear_cache()\n\n    _random_num_with_arg_decorated.clear_cache()\n    int1 = _random_num_with_arg_decorated(\"a\")\n    int2 = _random_num_with_arg_decorated(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg_decorated(\"a\", cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg_decorated(\"a\")\n    assert int4 == int3\n    _random_num_with_arg_decorated.clear_cache()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_ignore_cache(separate_files):\n    \"\"\"Tests that the ignore_cache feature works correctly.\"\"\"\n    _random_num_decorated = _get_decorated_func(\n        _random_num, separate_files=separate_files\n    )\n    _random_num_with_arg_decorated = _get_decorated_func(\n        _random_num_with_arg, separate_files=separate_files\n    )\n    _random_num_decorated.clear_cache()\n    int1 = _random_num_decorated()\n    int2 = _random_num_decorated()\n    assert int2 == int1\n    int3 = _random_num_decorated(cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num_decorated()\n    assert int4 != int3\n    assert int4 == int1\n    _random_num_decorated.clear_cache()\n\n    _random_num_with_arg_decorated.clear_cache()\n    int1 = _random_num_with_arg_decorated(\"a\")\n    int2 = _random_num_with_arg_decorated(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg_decorated(\"a\", cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg_decorated(\"a\")\n    assert int4 != int3\n    assert int4 == int1\n    _random_num_with_arg_decorated.clear_cache()\n\n\ndef _takes_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)  # this has to be enough time for check_calculation to run twice\n    return random() + arg_1 + arg_2\n\n\ndef _calls_takes_time(takes_time_func, res_queue):\n    res = takes_time_func(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_being_calculated(separate_files):\n    \"\"\"Testing pickle core handling of being calculated scenarios.\"\"\"\n    _takes_time_decorated = _get_decorated_func(\n        _takes_time, separate_files=separate_files\n    )\n    _takes_time_decorated.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time,\n        kwargs={\n            \"takes_time_func\": _takes_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time,\n        kwargs={\n            \"takes_time_func\": _takes_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=4)\n    thread2.join(timeout=4)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\ndef _being_calc_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\ndef _calls_being_calc_next_time(being_calc_func, res_queue):\n    res = being_calc_func(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_being_calc_next_time(separate_files):\n    \"\"\"Testing pickle core handling of being calculated scenarios.\"\"\"\n    _being_calc_next_time_decorated = _get_decorated_func(\n        _being_calc_next_time,\n        stale_after=timedelta(seconds=1),\n        next_time=True,\n        separate_files=separate_files,\n    )\n    _being_calc_next_time_decorated.clear_cache()\n    _being_calc_next_time(0.13, 0.02)\n    sleep(1.1)\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\n            \"being_calc_func\": _being_calc_next_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\n            \"being_calc_func\": _being_calc_next_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\ndef _bad_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n# _BAD_CACHE_FNAME = '.__main__._bad_cache'\n_BAD_CACHE_FNAME = \".tests.test_pickle_core._bad_cache\"\n_BAD_CACHE_FNAME_SEPARATE_FILES = (\n    \".tests.test_pickle_core._bad_cache_\"\n    f\"{hashlib.sha256(pickle.dumps((0.13, 0.02))).hexdigest()}\"\n)\nEXPANDED_CACHIER_DIR = os.path.expanduser(_default_params[\"cache_dir\"])\n_BAD_CACHE_FPATH = os.path.join(EXPANDED_CACHIER_DIR, _BAD_CACHE_FNAME)\n_BAD_CACHE_FPATH_SEPARATE_FILES = os.path.join(\n    EXPANDED_CACHIER_DIR, _BAD_CACHE_FNAME_SEPARATE_FILES\n)\n_BAD_CACHE_FPATHS = {\n    True: _BAD_CACHE_FPATH_SEPARATE_FILES,\n    False: _BAD_CACHE_FPATH,\n}\n\n\ndef _calls_bad_cache(bad_cache_func, res_queue, trash_cache, separate_files):\n    try:\n        res = bad_cache_func(0.13, 0.02, cachier__verbose=True)\n        if trash_cache:\n            with open(_BAD_CACHE_FPATHS[separate_files], \"w\") as cache_file:\n                cache_file.seek(0)\n                cache_file.truncate()\n        res_queue.put(res)\n    except Exception as exc:\n        res_queue.put(exc)\n\n\ndef _helper_bad_cache_file(sleeptime, separate_files):\n    \"\"\"Test pickle core handling of bad cache files.\"\"\"\n    _bad_cache_decorated = _get_decorated_func(\n        _bad_cache, separate_files=separate_files\n    )\n    _bad_cache_decorated.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_bad_cache,\n        kwargs={\n            \"bad_cache_func\": _bad_cache_decorated,\n            \"res_queue\": res_queue,\n            \"trash_cache\": True,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_bad_cache,\n        kwargs={\n            \"bad_cache_func\": _bad_cache_decorated,\n            \"res_queue\": res_queue,\n            \"trash_cache\": False,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(sleeptime)\n    thread2.start()\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    if res_queue.qsize() != 2:\n        return False\n    res1 = res_queue.get()\n    if not isinstance(res1, float):\n        return False\n    res2 = res_queue.get()\n    if res2 is not None or isinstance(res2, KeyError):\n        return False\n    return True\n\n\n# we want this to succeed at least once\n@pytest.mark.pickle\n@pytest.mark.xfail\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_bad_cache_file(separate_files):\n    \"\"\"Test pickle core handling of bad cache files.\"\"\"\n    sleeptimes = [0.1, 0.2, 0.3, 0.5, 0.6, 0.7, 0.8, 1, 1.5, 2]\n    sleeptimes = sleeptimes + sleeptimes\n    for sleeptime in sleeptimes:\n        if _helper_bad_cache_file(sleeptime, separate_files):\n            return\n    raise AssertionError()\n\n\ndef _delete_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n# _DEL_CACHE_FNAME = '.__main__._delete_cache'\n_DEL_CACHE_FNAME = \".tests.test_pickle_core._delete_cache\"\n_DEL_CACHE_FNAME_SEPARATE_FILES = (\n    \".tests.test_pickle_core._delete_cache_\"\n    f\"{hashlib.sha256(pickle.dumps((0.13, 0.02))).hexdigest()}\"\n)\n_DEL_CACHE_FPATH = os.path.join(EXPANDED_CACHIER_DIR, _DEL_CACHE_FNAME)\n_DEL_CACHE_FPATH_SEPARATE_FILES = os.path.join(\n    EXPANDED_CACHIER_DIR, _DEL_CACHE_FNAME_SEPARATE_FILES\n)\n_DEL_CACHE_FPATHS = {\n    True: _DEL_CACHE_FPATH_SEPARATE_FILES,\n    False: _DEL_CACHE_FPATH,\n}\n\n\ndef _calls_delete_cache(del_cache_func, res_queue, del_cache, separate_files):\n    try:\n        # print('in')\n        res = del_cache_func(0.13, 0.02)\n        # print('out with {}'.format(res))\n        if del_cache:\n            os.remove(_DEL_CACHE_FPATHS[separate_files])\n            # print(os.path.isfile(_DEL_CACHE_FPATH))\n        res_queue.put(res)\n    except Exception as exc:\n        # print('found')\n        res_queue.put(exc)\n\n\ndef _helper_delete_cache_file(sleeptime, separate_files):\n    \"\"\"Test pickle core handling of missing cache files.\"\"\"\n    _delete_cache_decorated = _get_decorated_func(\n        _delete_cache, separate_files=separate_files\n    )\n    _delete_cache_decorated.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_delete_cache,\n        kwargs={\n            \"del_cache_func\": _delete_cache_decorated,\n            \"res_queue\": res_queue,\n            \"del_cache\": True,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_delete_cache,\n        kwargs={\n            \"del_cache_func\": _delete_cache_decorated,\n            \"res_queue\": res_queue,\n            \"del_cache\": False,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(sleeptime)\n    thread2.start()\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    if res_queue.qsize() != 2:\n        return False\n    res1 = res_queue.get()\n    # print(res1)\n    if not isinstance(res1, float):\n        return False\n    res2 = res_queue.get()\n    if not ((isinstance(res2, KeyError)) or (res2 is None)):\n        return False\n    return True\n    # print(res2)\n    # print(type(res2))\n\n\n@pytest.mark.pickle\n@pytest.mark.xfail\n@pytest.mark.parametrize(\"separate_files\", [False, True])\ndef test_delete_cache_file(separate_files):\n    \"\"\"Test pickle core handling of missing cache files.\"\"\"\n    sleeptimes = [0.1, 0.2, 0.3, 0.5, 0.7, 1]\n    sleeptimes = sleeptimes * 4\n    for sleeptime in sleeptimes:\n        if _helper_delete_cache_file(sleeptime, separate_files):\n            return\n    raise AssertionError()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [False, True])\ndef test_clear_being_calculated(separate_files):\n    \"\"\"Test pickle core clear `being calculated` functionality.\"\"\"\n    _takes_time_decorated = _get_decorated_func(\n        _takes_time, separate_files=separate_files\n    )\n    _takes_time_decorated.clear_being_calculated()\n\n\ndef _error_throwing_func(arg1):\n    if not hasattr(_error_throwing_func, \"count\"):\n        _error_throwing_func.count = 0\n    _error_throwing_func.count += 1\n    if _error_throwing_func.count > 1:\n        raise ValueError(\"Tiny Rick!\")\n    return 7\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_error_throwing_func(separate_files):\n    # with\n    _error_throwing_func.count = 0\n    _error_throwing_func_decorated = _get_decorated_func(\n        _error_throwing_func,\n        stale_after=timedelta(seconds=1),\n        next_time=True,\n        separate_files=separate_files,\n    )\n    _error_throwing_func_decorated.clear_cache()\n    res1 = _error_throwing_func_decorated(4)\n    sleep(1.5)\n    res2 = _error_throwing_func_decorated(4)\n    assert res1 == res2\n\n\n# test custom cache dir for pickle core\n\nCUSTOM_DIR = \"~/.exparrot\"\nEXPANDED_CUSTOM_DIR = os.path.expanduser(CUSTOM_DIR)\n\n\ndef _takes_2_seconds_custom_dir(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)\n    return f\"arg_1:{arg_1}, arg_2:{arg_2}\"\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_core_custom_cache_dir(separate_files):\n    \"\"\"Basic Pickle core functionality.\"\"\"\n    _takes_2_seconds_custom_dir_decorated = _get_decorated_func(\n        _takes_2_seconds_custom_dir,\n        next_time=False,\n        cache_dir=CUSTOM_DIR,\n        separate_files=separate_files,\n    )\n    _takes_2_seconds_custom_dir_decorated.clear_cache()\n    _takes_2_seconds_custom_dir_decorated(\"a\", \"b\")\n    start = time()\n    _takes_2_seconds_custom_dir_decorated(\"a\", \"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds_custom_dir_decorated.clear_cache()\n    path2test = _takes_2_seconds_custom_dir_decorated.cache_dpath()\n    assert path2test == EXPANDED_CUSTOM_DIR\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_callable_hash_param(separate_files):\n    def _hash_func(args, kwargs):\n        def _hash(obj):\n            if isinstance(obj, pd.core.frame.DataFrame):\n                return hashlib.sha256(\n                    pd.util.hash_pandas_object(obj).values.tobytes()\n                ).hexdigest()\n            return obj\n\n        k_args = tuple(map(_hash, args))\n        k_kwargs = tuple(\n            sorted({k: _hash(v) for k, v in kwargs.items()}.items())\n        )\n        return k_args + k_kwargs\n\n    @cachier(hash_func=_hash_func, separate_files=separate_files)\n    def _params_with_dataframe(*args, **kwargs):\n        \"\"\"Some function.\"\"\"\n        return random()\n\n    _params_with_dataframe.clear_cache()\n\n    df_a = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    df_b = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    value_a = _params_with_dataframe(df_a, 1)\n    value_b = _params_with_dataframe(df_b, 1)\n\n    assert value_a == value_b  # same content --> same key\n\n    value_a = _params_with_dataframe(1, df=df_a)\n    value_b = _params_with_dataframe(1, df=df_b)\n\n    assert value_a == value_b  # same content --> same key\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_mongo_core.py",
      "content": "\"\"\"Testing the MongoDB core of cachier.\"\"\"\n\nimport datetime\nimport hashlib\nimport platform\nimport queue\nimport sys\nimport threading\nfrom datetime import timedelta\nfrom random import random\nfrom time import sleep\nfrom urllib.parse import quote_plus\n\nimport pandas as pd\nimport pymongo\nimport pytest\nfrom birch import Birch  # type: ignore[import-not-found]\nfrom cachier import cachier\nfrom cachier.cores.base import RecalculationNeeded\nfrom cachier.cores.mongo import _MongoCore\nfrom pymongo.errors import OperationFailure\nfrom pymongo.mongo_client import MongoClient\nfrom pymongo_inmemory import MongoClient as InMemoryMongoClient\n\n# === Enables testing vs a real MongoDB instance ===\n\n\nclass CfgKey:\n    HOST = \"TEST_HOST\"\n    PORT = \"TEST_PORT\"\n    # UNAME = \"TEST_USERNAME\"\n    # PWD = \"TEST_PASSWORD\"\n    # DB = \"TEST_DB\"\n    TEST_VS_DOCKERIZED_MONGO = \"TEST_VS_DOCKERIZED_MONGO\"\n\n\nCFG = Birch(\n    namespace=\"cachier\",\n    defaults={CfgKey.TEST_VS_DOCKERIZED_MONGO: False},\n)\n\n\n# URI_TEMPLATE = \"mongodb://myUser:myPassword@localhost:27017/\"\nURI_TEMPLATE = \"mongodb://{host}:{port}?retrywrites=true&w=majority\"\n\n\ndef _get_cachier_db_mongo_client():\n    host = quote_plus(CFG[CfgKey.HOST])\n    port = quote_plus(CFG[CfgKey.PORT])\n    # uname = quote_plus(CFG[CfgKey.UNAME])\n    # pwd = quote_plus(CFG[CfgKey.PWD])\n    # db = quote_plus(CFG[CfgKey.DB])\n    uri = f\"mongodb://{host}:{port}?retrywrites=true&w=majority\"\n    return MongoClient(uri)\n\n\n_COLLECTION_NAME = (\n    f\"cachier_test_{platform.system()}\"\n    f\"_{'.'.join(map(str, sys.version_info[:3]))}\"\n)\n\n\ndef _test_mongetter():\n    if not hasattr(_test_mongetter, \"client\"):\n        if str(CFG.mget(CfgKey.TEST_VS_DOCKERIZED_MONGO)).lower() == \"true\":\n            print(\"Using live MongoDB instance for testing.\")\n            _test_mongetter.client = _get_cachier_db_mongo_client()\n        else:\n            print(\"Using in-memory MongoDB instance for testing.\")\n            _test_mongetter.client = InMemoryMongoClient()\n    db_obj = _test_mongetter.client[\"cachier_test\"]\n    if _COLLECTION_NAME not in db_obj.list_collection_names():\n        db_obj.create_collection(_COLLECTION_NAME)\n    return db_obj[_COLLECTION_NAME]\n\n\n# === Mongo core tests ===\n\n\n@pytest.mark.mongo\ndef test_information():\n    print(\"\\npymongo version: \", end=\"\")\n    print(pymongo.__version__)\n\n\n@pytest.mark.mongo\ndef test_mongo_index_creation():\n    \"\"\"Basic Mongo core functionality.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _test_mongo_caching(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    collection = _test_mongetter()\n    _test_mongo_caching.clear_cache()\n    val1 = _test_mongo_caching(1, 2)\n    val2 = _test_mongo_caching(1, 2)\n    assert val1 == val2\n    assert _MongoCore._INDEX_NAME in collection.index_information()\n\n\n@pytest.mark.mongo\ndef test_mongo_core():\n    \"\"\"Basic Mongo core functionality.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _test_mongo_caching(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _test_mongo_caching.clear_cache()\n    val1 = _test_mongo_caching(1, 2)\n    val2 = _test_mongo_caching(1, 2)\n    assert val1 == val2\n    val3 = _test_mongo_caching(1, 2, cachier__skip_cache=True)\n    assert val3 != val1\n    val4 = _test_mongo_caching(1, 2)\n    assert val4 == val1\n    val5 = _test_mongo_caching(1, 2, cachier__overwrite_cache=True)\n    assert val5 != val1\n    val6 = _test_mongo_caching(1, 2)\n    assert val6 == val5\n\n\n@pytest.mark.mongo\ndef test_mongo_core_keywords():\n    \"\"\"Basic Mongo core functionality with keyword arguments.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _test_mongo_caching(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _test_mongo_caching.clear_cache()\n    val1 = _test_mongo_caching(1, arg_2=2)\n    val2 = _test_mongo_caching(1, arg_2=2)\n    assert val1 == val2\n    val3 = _test_mongo_caching(1, arg_2=2, cachier__skip_cache=True)\n    assert val3 != val1\n    val4 = _test_mongo_caching(1, arg_2=2)\n    assert val4 == val1\n    val5 = _test_mongo_caching(1, arg_2=2, cachier__overwrite_cache=True)\n    assert val5 != val1\n    val6 = _test_mongo_caching(1, arg_2=2)\n    assert val6 == val5\n\n\nMONGO_DELTA = timedelta(seconds=3)\nMONGO_DELTA_LONG = timedelta(seconds=10)\n\n\n@pytest.mark.mongo\ndef test_mongo_stale_after():\n    \"\"\"Testing MongoDB core stale_after functionality.\"\"\"\n\n    @cachier(\n        mongetter=_test_mongetter, stale_after=MONGO_DELTA, next_time=False\n    )\n    def _stale_after_mongo(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _stale_after_mongo.clear_cache()\n    val1 = _stale_after_mongo(1, 2)\n    val2 = _stale_after_mongo(1, 2)\n    assert val1 == val2\n    sleep(3)\n    val3 = _stale_after_mongo(1, 2)\n    assert val3 != val1\n\n\ndef _calls_takes_time(res_queue):\n    @cachier(mongetter=_test_mongetter)\n    def _takes_time(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        sleep(3)\n        return random() + arg_1 + arg_2\n\n    res = _takes_time(34, 82.3)\n    res_queue.put(res)\n\n\n@pytest.mark.mongo\ndef test_mongo_being_calculated():\n    \"\"\"Testing MongoDB core handling of being calculated scenarios.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _takes_time(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        sleep(3)\n        return random() + arg_1 + arg_2\n\n    _takes_time.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread1.start()\n    sleep(1)\n    thread2.start()\n    thread1.join(timeout=4)\n    thread2.join(timeout=4)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\nclass _BadMongoCollection:\n    def __init__(self, mongetter):\n        self.collection = mongetter()\n        self.index_information = self.collection.index_information\n        self.create_indexes = self.collection.create_indexes\n        self.find_one = self.collection.find_one\n\n    def delete_many(self, *args, **kwargs):\n        pass\n\n    def update_many(self, *args, **kwargs):\n        pass\n\n    def update_one(self, *args, **kwargs):\n        raise OperationFailure(Exception())\n\n\ndef _bad_mongetter():\n    return _BadMongoCollection(_test_mongetter)\n\n\n@pytest.mark.mongo\ndef test_mongo_write_failure():\n    \"\"\"Testing MongoDB core handling of writing failure scenarios.\"\"\"\n\n    @cachier(mongetter=_bad_mongetter)\n    def _func_w_bad_mongo(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    with pytest.raises(OperationFailure):\n        val1 = _func_w_bad_mongo(1, 2)\n        val2 = _func_w_bad_mongo(1, 2)\n        assert val1 == val2\n\n\n@pytest.mark.mongo\ndef test_mongo_clear_being_calculated():\n    \"\"\"Testing MongoDB core clear_being_calculated.\"\"\"\n\n    @cachier(mongetter=_bad_mongetter)\n    def _func_w_bad_mongo(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _func_w_bad_mongo.clear_being_calculated()\n\n\n@pytest.mark.mongo\ndef test_stalled_mongo_db_cache():\n    @cachier(mongetter=_test_mongetter)\n    def _stalled_func():\n        return 1\n\n    core = _MongoCore(None, _test_mongetter, 0)\n    core.set_func(_stalled_func)\n    core.clear_cache()\n    with pytest.raises(RecalculationNeeded):\n        core.wait_on_entry_calc(key=None)\n\n\n@pytest.mark.mongo\ndef test_stalled_mong_db_core(monkeypatch):\n    def mock_get_entry(self, args, kwargs):\n        return \"key\", {\"being_calculated\": True}\n\n    def mock_get_entry_by_key(self, key):\n        return \"key\", None\n\n    monkeypatch.setattr(\n        \"cachier.cores.mongo._MongoCore.get_entry\", mock_get_entry\n    )\n    monkeypatch.setattr(\n        \"cachier.cores.mongo._MongoCore.get_entry_by_key\",\n        mock_get_entry_by_key,\n    )\n\n    @cachier(mongetter=_test_mongetter)\n    def _stalled_func():\n        return 1\n\n    res = _stalled_func()\n    assert res == 1\n\n    def mock_get_entry_2(self, args, kwargs):\n        entry = {\n            \"being_calculated\": True,\n            \"value\": 1,\n            \"time\": datetime.datetime.now() - datetime.timedelta(seconds=10),\n        }\n        return \"key\", entry\n\n    monkeypatch.setattr(\n        \"cachier.cores.mongo._MongoCore.get_entry\", mock_get_entry_2\n    )\n\n    stale_after = datetime.timedelta(seconds=1)\n\n    @cachier(mongetter=_test_mongetter, stale_after=stale_after)\n    def _stalled_func_2():\n        \"\"\"Testing stalled function.\"\"\"\n        return 2\n\n    res = _stalled_func_2()\n    assert res == 2\n\n    @cachier(\n        mongetter=_test_mongetter, stale_after=stale_after, next_time=True\n    )\n    def _stalled_func_3():\n        \"\"\"Testing stalled function.\"\"\"\n        return 3\n\n    res = _stalled_func_3()\n    assert res == 1\n\n\n@pytest.mark.mongo\ndef test_callable_hash_param():\n    def _hash_func(args, kwargs):\n        def _hash(obj):\n            if isinstance(obj, pd.core.frame.DataFrame):\n                return hashlib.sha256(\n                    pd.util.hash_pandas_object(obj).values.tobytes()\n                ).hexdigest()\n            return obj\n\n        k_args = tuple(map(_hash, args))\n        k_kwargs = tuple(\n            sorted({k: _hash(v) for k, v in kwargs.items()}.items())\n        )\n        return k_args + k_kwargs\n\n    @cachier(mongetter=_test_mongetter, hash_func=_hash_func)\n    def _params_with_dataframe(*args, **kwargs):\n        \"\"\"Some function.\"\"\"\n        return random()\n\n    _params_with_dataframe.clear_cache()\n\n    df_a = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    df_b = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    value_a = _params_with_dataframe(df_a, 1)\n    value_b = _params_with_dataframe(df_b, 1)\n\n    assert value_a == value_b  # same content --> same key\n\n    value_a = _params_with_dataframe(1, df=df_a)\n    value_b = _params_with_dataframe(1, df=df_b)\n\n    assert value_a == value_b  # same content --> same key\n"
    },
    {
      "path": "python-cachier_cachier/tests/requirements.txt",
      "content": "# todo: add some version range or pinning latest versions\n# tests and coverages\npytest\ncoverage\npytest-cov\nbirch\n# to connect to the test mongodb server\npymongo\ndnspython\npymongo-inmemory\n# to test pandas dataframe as-param hashing with mongodb core\npandas\n# to be able to run `python setup.py checkdocs`\ncollective.checkdocs\npygments\n"
    }
  ],
  "BuggyCode": [
    {
      "path": "python-cachier_cachier/MANIFEST.in",
      "content": "# Manifest syntax https://docs.python.org/2/distutils/sourcedist.html\ngraft wheelhouse\n\nrecursive-include src *.info\ninclude README.rst\ninclude LICENSE\n\nprune __pycache__\nprune tests\n"
    },
    {
      "path": "python-cachier_cachier/README.rst",
      "content": "Cachier\n#######\n\n|PyPI-Status| |Downloads| |PyPI-Versions| |Build-Status| |Codecov| |Codefactor| |LICENCE|\n\nPersistent, stale-free, local and cross-machine caching for Python functions.\n\n.. code-block:: python\n\n  from cachier import cachier\n  import datetime\n\n  @cachier(stale_after=datetime.timedelta(days=3))\n  def foo(arg1, arg2):\n    \"\"\"foo now has a persistent cache, triggering recalculation for values stored more than 3 days.\"\"\"\n    return {'arg1': arg1, 'arg2': arg2}\n\n\n.. role:: python(code)\n  :language: python\n\n.. contents::\n\n.. section-numbering:\n\n\n\nInstallation\n============\n\nInstall ``cachier`` with:\n\n.. code-block:: python\n\n    pip install cachier\n\nFor the latest version supporting Python 2.7 please use:\n\n.. code-block:: python\n\n    pip install 'cachier==1.2.8'\n\nFeatures\n========\n\n* Pure Python.\n* Compatible with Python 3.8+ (Python 2.7 was discontinued in version 1.2.8).\n* Supported and `tested on Linux, OS X and Windows <https://travis-ci.org/shaypal5/cachier>`_.\n* A simple interface.\n* Defining \"shelf life\" for cached values.\n* Local caching using pickle files.\n* Cross-machine caching using MongoDB.\n* Thread-safety.\n\nCachier is **NOT**:\n\n* Meant as a transient cache. Python's @lru_cache is better.\n* Especially fast. It is meant to replace function calls that take more than... a second, say (overhead is around 1 millisecond).\n\nFuture features\n---------------\n\n* S3 core.\n* Multi-core caching.\n* `Cache replacement policies <https://en.wikipedia.org/wiki/Cache_replacement_policies>`_\n\n\nUse\n===\n\nCachier provides a decorator which you can wrap around your functions to give them a persistent cache. The positional and keyword arguments to the wrapped function must be hashable (i.e. Python's immutable built-in objects, not mutable containers). Also, notice that since objects which are instances of user-defined classes are hashable but all compare unequal (their hash value is their id), equal objects across different sessions will not yield identical keys.\n\nSetting up a Cache\n------------------\nYou can add a default, pickle-based, persistent cache to your function - meaning it will last across different Python kernels calling the wrapped function - by decorating it with the ``cachier`` decorator (notice the ``()``!).\n\n.. code-block:: python\n\n  from cachier import cachier\n\n  @cachier()\n  def foo(arg1, arg2):\n    \"\"\"Your function now has a persistent cache mapped by argument values!\"\"\"\n    return {'arg1': arg1, 'arg2': arg2}\n\nClass and object methods can also be cached. Cachier will automatically ignore the `self` parameter when determining the cache key for an object method. **This means that methods will be cached across all instances of an object, which may not be what you want.**\n\n.. code-block:: python\n\n  from cachier import cachier\n\n  class Foo():\n    @staticmethod\n    @cachier()\n    def good_static_usage(arg_1, arg_2):\n      return arg_1 + arg_2\n\n    # Instance method does not depend on object's internal state, so good to cache\n    @cachier()\n    def good_usage_1(self, arg_1, arg_2):\n      return arg_1 + arg_2\n\n    # Instance method is calling external service, probably okay to cache\n    @cachier()\n    def good_usage_2(self, arg_1, arg_2):\n      result = self.call_api(arg_1, arg_2)\n      return result\n\n    # Instance method relies on object attribute, NOT good to cache\n    @cachier()\n    def bad_usage(self, arg_1, arg_2):\n      return arg_1 + arg_2 + self.arg_3\n\n\nResetting a Cache\n-----------------\nThe Cachier wrapper adds a ``clear_cache()`` function to each wrapped function. To reset the cache of the wrapped function simply call this method:\n\n.. code-block:: python\n\n  foo.clear_cache()\n\nGeneral Configuration\n----------------------\n\nGlobal Defaults\n~~~~~~~~~~~~~~~\n\nSettings can be globally configured across all Cachier wrappers through the use of the `set_default_params` function. This function takes the same keyword parameters as the ones defined in the decorator, which can be passed all at once or with multiple calls. Parameters given directly to a decorator take precedence over any values set by this function.\n\nThe following parameters will only be applied to decorators defined after `set_default_params` is called:\n\n*  `hash_func`\n*  `backend`\n*  `mongetter`\n*  `cache_dir`\n*  `pickle_reload`\n*  `separate_files`\n\nThese parameters can be changed at any time and they will apply to all decorators:\n\n*  `allow_none`\n*  `caching_enabled`\n*  `stale_after`\n*  `next_time`\n*  `wait_for_calc_timeout`\n\nThe current defaults can be fetched by calling `get_default_params`.\n\nThreads Limit\n~~~~~~~~~~~~~\n\nTo limit the number of threads Cachier is allowed to spawn, set the ``CACHIER_MAX_WORKERS`` with the desired number. The default is 8, so to enable Cachier to spawn even more threads, you'll have to set a higher limit explicitly.\n\n\nGlobal Enable/Disable\n---------------------\n\nCaching can be turned off across all decorators by calling `disable_caching`, and then re-activated by calling `enable_caching`.\n\nThese functions are convenience wrappers around the `caching_enabled` default setting.\n\n\nCache Shelf Life\n----------------\n\nSetting Shelf Life\n~~~~~~~~~~~~~~~~~~\nYou can set any duration as the shelf life of cached return values of a function by providing a corresponding ``timedelta`` object to the ``stale_after`` parameter:\n\n.. code-block:: python\n\n  import datetime\n\n  @cachier(stale_after=datetime.timedelta(weeks=2))\n  def bar(arg1, arg2):\n    return {'arg1': arg1, 'arg2': arg2}\n\nNow when a cached value matching the given arguments is found the time of its calculation is checked; if more than ``stale_after`` time has since passed, the function will be run again for the same arguments and the new value will be cached and returned.\n\nThis is useful for lengthy calculations that depend on a dynamic data source.\n\nFuzzy Shelf Life\n~~~~~~~~~~~~~~~~\nSometimes you may want your function to trigger a calculation when it encounters a stale result, but still not wait on it if it's not that critical. In that case, you can set ``next_time`` to ``True`` to have your function trigger a recalculation **in a separate thread**, but return the currently cached stale value:\n\n.. code-block:: python\n\n  @cachier(next_time=True)\n\nFurther function calls made while the calculation is being performed will not trigger redundant calculations.\n\n\nWorking with unhashable arguments\n---------------------------------\n\nAs mentioned above, the positional and keyword arguments to the wrapped function must be hashable (i.e. Python's immutable built-in objects, not mutable containers). To get around this limitation the ``hash_func`` parameter of the ``cachier`` decorator can be provided with a callable that gets the args and kwargs from the decorated function and returns a hash key for them.\n\n.. code-block:: python\n\n  def calculate_hash(args, kwds):\n    key = ...  # compute a hash key here based on arguments\n    return key\n\n  @cachier(hash_func=calculate_hash)\n  def calculate_super_complex_stuff(custom_obj):\n    # amazing code goes here\n\nSee here for an example:\n\n`Question: How to work with unhashable arguments <https://github.com/python-cachier/cachier/issues/91>`_\n\n\nPrecaching values\n---------------------------------\n\nIf you want to load a value into the cache without calling the underlying function, this can be done with the `precache_value` function.\n\n.. code-block:: python\n\n  @cachier()\n  def add(arg1, arg2):\n    return arg1 + arg2\n\n  add.precache_value(2, 2, value_to_cache=5)\n\n  result = add(2, 2)\n  print(result)  # prints 5\n\n\nPer-function call arguments\n---------------------------\n\nCachier also accepts several keyword arguments in the calls of the function it wraps rather than in the decorator call, allowing you to modify its behaviour for a specific function call.\n\nIgnore Cache\n~~~~~~~~~~~~\n\nYou can have ``cachier`` ignore any existing cache for a specific function call by passing ``cachier__skip_cache=True`` to the function call. The cache will neither be checked nor updated with the new return value.\n\n.. code-block:: python\n\n  @cachier()\n  def sum(first_num, second_num):\n    return first_num + second_num\n\n  def main():\n    print(sum(5, 3, cachier__skip_cache=True))\n\nOverwrite Cache\n~~~~~~~~~~~~~~~\n\nYou can have ``cachier`` overwrite an existing cache entry - if one exists - for a specific function call by passing ``cachier__overwrite_cache=True`` to the function call. The cache will not be checked but will be updated with the new return value.\n\nVerbose Cache Call\n~~~~~~~~~~~~~~~~~~\n\nYou can have ``cachier`` print out a detailed explanation of the logic of a specific call by passing ``cachier__verbose=True`` to the function call. This can be useful if you are not sure why a certain function result is, or is not, returned.\n\nCache `None` Values\n~~~~~~~~~~~~~~~~~~~\n\nBy default, ``cachier`` does not cache ``None`` values. You can override this behaviour by passing ``allow_none=True`` to the function call.\n\n\nCachier Cores\n=============\n\nPickle Core\n-----------\n\nThe default core for Cachier is pickle based, meaning each function will store its cache is a separate pickle file in the ``~/.cachier`` directory. Naturally, this kind of cache is both machine-specific and user-specific.\n\nYou can configure ``cachier`` to use another directory by providing the ``cache_dir`` parameter with the path to that directory:\n\n.. code-block:: python\n\n  @cachier(cache_dir='~/.temp/.cache')\n\n\nYou can slightly optimise pickle-based caching if you know your code will only be used in a single thread environment by setting:\n\n.. code-block:: python\n\n  @cachier(pickle_reload=False)\n\nThis will prevent reading the cache file on each cache read, speeding things up a bit, while also nullifying inter-thread functionality (the code is still thread safe, but different threads will have different versions of the cache at times, and will sometime make unnecessary function calls).\n\nSetting the optional argument ``separate_files`` to ``True`` will cause the cache to be stored in several files: A file per argument set, per function. This can help if your per-function cache files become too large.\n\n.. code-block:: python\n\n  from cachier import cachier\n\n  @cachier(separate_files=True)\n  def foo(arg1, arg2):\n    \"\"\"Your function now has a persistent cache mapped by argument values, split across several files, per argument set\"\"\"\n    return {'arg1': arg1, 'arg2': arg2}\n\nYou can get the fully qualified path to the directory of cache files used by ``cachier`` (``~/.cachier`` by default) by calling the ``cache_dpath()`` function:\n\n.. code-block:: python\n\n  >>> foo.cache_dpath()\n      \"/home/user/.cachier/\"\n\n\nMongoDB Core\n------------\nYou can set a MongoDB-based cache by assigning ``mongetter`` with a callable that returns a ``pymongo.Collection`` object with writing permissions:\n\n.. code-block:: python\n\n    from pymongo import MongoClient\n\n    def my_mongetter():\n        client = MongoClient(get_cachier_db_auth_uri())\n        db_obj = client['cachier_db']\n        if 'someapp_cachier_db' not in db_obj.list_collection_names():\n            db_obj.create_collection('someapp_cachier_db')\n        return db_obj['someapp_cachier_db']\n\n  @cachier(mongetter=my_mongetter)\n\nThis allows you to have a cross-machine, albeit slower, cache. This functionality requires that the installation of the ``pymongo`` python package.\n\nIn certain cases the MongoDB backend might leave a deadlock behind, blocking all subsequent requests from being processed. If you encounter this issue, supply the ``wait_for_calc_timeout`` with a reasonable number of seconds; calls will then wait at most this number of seconds before triggering a recalculation.\n\n.. code-block:: python\n\n  @cachier(mongetter=False, wait_for_calc_timeout=2)\n\n\nMemory Core\n-----------\n\nYou can set an in-memory cache by assigning the ``backend`` parameter with ``'memory'``:\n\n.. code-block:: python\n\n  @cachier(backend='memory')\n\nNote, however, that ``cachier``'s in-memory core is simple, and has no monitoring or cap on cache size, and can thus lead to memory errors on large return values - it is mainly intended to be used with future multi-core functionality. As a rule, Python's built-in ``lru_cache`` is a much better stand-alone solution.\n\n\nContributing\n============\n\nCurrent maintainers are Shay Palachy Affek (`shay.palachy@gmail.com <mailto:shay.palachy@gmail.com>`_, `@shaypal5 <https://github.com/shaypal5>`_) and Judson Neer (`@lordjabez <https://github.com/lordjabez>`_); You are more than welcome to approach them for help. Contributions are very welcomed! :)\n\nInstalling for development\n--------------------------\n\nClone:\n\n.. code-block:: bash\n\n  git clone git@github.com:python-cachier/cachier.git\n\n\nInstall in development mode with test dependencies:\n\n.. code-block:: bash\n\n  cd cachier\n  pip install -e . -r tests/requirements.txt\n\n\nRunning the tests\n-----------------\n\nTo run the tests, call the ``pytest`` command in the repository's root, or:\n\n.. code-block:: bash\n\n  python -m pytest\n\nTo run only MongoDB core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m mongo\n\nTo run only memory core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m memory\n\nTo run all tests EXCEPT MongoDB core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m \"not mongo\"\n\n\nTo run all tests EXCEPT memory core AND MongoDB core related tests, use:\n\n.. code-block:: bash\n\n  pytest -m \"not (mongo or memory)\"\n\n\nRunning MongoDB tests against a live MongoDB instance\n-----------------------------------------------------\n\n**Note to developers:** By default, all MongoDB tests are run against a mocked MongoDB instance, provided by the ``pymongo_inmemory`` package. To run them against a live MongoDB instance, the ``CACHIER_TEST_VS_DOCKERIZED_MONGO`` environment variable is set to ``True`` in the ``test`` environment of this repository (and additional environment variables are populated with the appropriate credentials), used by the GitHub Action running tests on every commit and pull request.\n\nContributors are not expected to run these tests against a live MongoDB instance when developing, as credentials for the testing instance used will NOT be shared, but rather use the testing against the in-memory MongoDB instance as a good proxy.\n\n**HOWEVER, the tests run against a live MongoDB instance when you submit a PR are the determining tests for deciding whether your code functions correctly against MongoDB.**\n\n\nAdding documentation\n--------------------\n\nThis project is documented using the `numpy docstring conventions`_, which were chosen as they are perhaps the most widely-spread conventions that are both supported by common tools such as Sphinx and result in human-readable docstrings (in my personal opinion, of course). When documenting code you add to this project, please follow `these conventions`_.\n\n.. _`numpy docstring conventions`: https://github.com/numpy/numpy/blob/master/doc/HOWTO_DOCUMENT.rst.txt\n.. _`these conventions`: https://github.com/numpy/numpy/blob/master/doc/HOWTO_DOCUMENT.rst.txt\n\nAdditionally, if you update this ``README.rst`` file, use ``python setup.py checkdocs`` to validate it compiles.\n\n\nCredits\n=======\n\nCreated by `Shay Palachy Affek <https://github.com/shaypal5>`_ (shay.palachy@gmail.com), which currently assists in maintenance.\n\nCurrent lead developer/contributor: `Jirka Borovec <https://github.com/Borda>`_ (`@Borda <https://github.com/Borda>`_ on GitHub).\n\nOther major contributors:\n\n* `Jirka Borovec <https://github.com/Borda>`_ - Arg order independence, args-to-kwargs for less unique keys and numerous development and CI contributions.\n\n* `Judson Neer <https://github.com/lordjabez>`_ - Precaching, method caching support and numerous improvements and bugfixes.\n\n* `cthoyt <https://github.com/cthoyt>`_ - Base memory core implementation.\n\n* `amarczew <https://github.com/amarczew>`_ - The ``hash_func`` kwarg.\n\n* `non-senses <https://github.com/non-senses>`_ - The ``wait_for_calc_timeout`` kwarg.\n\n* `Elad Rapaport <https://github.com/erap129>`_ - Multi-file Pickle core, a.k.a ``separate_files`` (released on ``v1.5.3``).\n\n* `John Didion <https://github.com/jdidion>`_ - Support for pickle-based caching for cases where two identically-named methods of different classes are defined in the same module.\n\nNotable bugfixers:\n\n* `MichaelRazum <https://github.com/MichaelRazum>`_.\n\n* `Eric Ma <https://github.com/ericmjl>`_ - The iNotify bugfix (released on ``v1.5.3``).\n\n* `Ofir <https://github.com/ofirnk>`_ - The iNotify bugfix (released on ``v1.5.3``).\n\n\n\n.. |PyPI-Status| image:: https://img.shields.io/pypi/v/cachier.svg\n  :target: https://pypi.python.org/pypi/cachier\n\n.. |PyPI-Versions| image:: https://img.shields.io/pypi/pyversions/cachier.svg\n   :target: https://pypi.python.org/pypi/cachier\n\n.. |Build-Status| image:: https://github.com/python-cachier/cachier/actions/workflows/test.yml/badge.svg\n  :target: https://github.com/python-cachier/cachier/actions/workflows/test.yml\n\n.. |LICENCE| image:: https://img.shields.io/pypi/l/cachier.svg\n  :target: https://pypi.python.org/pypi/cachier\n\n.. |Codecov| image:: https://codecov.io/github/python-cachier/cachier/coverage.svg?branch=master\n   :target: https://codecov.io/github/python-cachier/cachier?branch=master\n\n.. |Downloads| image:: https://pepy.tech/badge/cachier\n     :target: https://pepy.tech/project/cachier\n     :alt: PePy stats\n\n.. |Codefactor| image:: https://www.codefactor.io/repository/github/python-cachier/cachier/badge?style=plastic\n     :target: https://www.codefactor.io/repository/github/python-cachier/cachier\n     :alt: Codefactor code quality\n\n.. links:\n.. _pymongo: https://api.mongodb.com/python/current/\n.. _watchdog: https://github.com/gorakhargosh/watchdog\n"
    },
    {
      "path": "python-cachier_cachier/LICENSE",
      "content": "MIT License\n\nCopyright (c) 2016 Shay Palachy\n\nPermission is hereby granted, free of charge, to any person obtaining a copy\nof this software and associated documentation files (the \"Software\"), to deal\nin the Software without restriction, including without limitation the rights\nto use, copy, modify, merge, publish, distribute, sublicense, and/or sell\ncopies of the Software, and to permit persons to whom the Software is\nfurnished to do so, subject to the following conditions:\n\nThe above copyright notice and this permission notice shall be included in all\ncopies or substantial portions of the Software.\n\nTHE SOFTWARE IS PROVIDED \"AS IS\", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR\nIMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,\nFITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE\nAUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER\nLIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,\nOUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE\nSOFTWARE.\n"
    },
    {
      "path": "python-cachier_cachier/pyproject.toml",
      "content": "[project]\nname = \"cachier\"\ndescription = \"Persistent, stale-free, local and cross-machine caching for Python functions.\"\nreadme = \"README.rst\"\nkeywords = [\n  \"cache\",\n  \"caching\",\n  \"cross-machine\",\n  \"decorator\",\n  \"local\",\n  \"memoization\",\n  \"mongo\",\n  \"persistent\",\n]\nlicense = {file = \"LICENSE\"}\nauthors = [\n   {name = \"Shay Palachy Affek\", email = 'shay.palachy@gmail.com'},\n]\nclassifiers = [\n  'Development Status :: 4 - Beta',\n  'Intended Audience :: Developers',\n  'License :: OSI Approved :: MIT License',\n  'Programming Language :: Python',\n  \"Programming Language :: Python :: 3 :: Only\",\n  \"Programming Language :: Python :: 3.8\",\n  \"Programming Language :: Python :: 3.9\",\n  \"Programming Language :: Python :: 3.10\",\n  \"Programming Language :: Python :: 3.11\",\n  \"Programming Language :: Python :: 3.12\",\n  \"Topic :: Other/Nonlisted Topic\",\n  \"Topic :: Software Development :: Libraries\",\n  \"Topic :: Software Development :: Libraries :: Python Modules\",\n  \"Topic :: Utilities\",\n]\ndynamic = [\n  \"version\",\n]\ndependencies = [\n  \"portalocker>=2.3.2\",\n  \"watchdog>=2.3.1\",\n]\n[project.urls]\nSource = \"https://github.com/python-cachier/cachier\"\n[project.scripts]\ncachier = \"cachier.__naim__:cli\"\n\n[tool.setuptools]\ninclude-package-data = true\n\n[tool.setuptools.dynamic]\nversion = {attr = \"cachier._version.__version__\"}\n\n[tool.setuptools.packages.find]\nwhere = [\"src\"]  # list of folders that contain the packages ([\".\"] by default)\ninclude = [\"cachier*\"]  # package names should match these glob patterns ([\"*\"] by default)\nnamespaces = false  # to disable scanning PEP 420 namespaces (true by default)\n\n[build-system]\nrequires = [\n  \"setuptools\",\n  \"wheel\",\n]\n\n[tool.pytest.ini_options]\ntestpaths = [\"cachier\", \"tests\"]\nnorecursedirs=[\"dist\", \"build\"]\naddopts = [\n    \"--color=yes\",\n    \"--cov=cachier\",\n    \"--cov-report=term\",\n    \"--cov-report=xml:cov.xml\",\n    \"-r a\",\n    \"-v\",\n    \"-s\",\n]\nmarkers = [\n    \"mongo: test the MongoDB core\",\n    \"memory: test the memory core\",\n    \"pickle: test the pickle core\"\n]\n\n[tool.docformatter]\nrecursive = true\n# some docstring start with r\"\"\"\nwrap-summaries = 79\nwrap-descriptions = 79\nblank = true\n\n[tool.ruff]\ntarget-version = \"py38\"\nline-length = 79\n# Enable Pyflakes `E` and `F` codes by default.\nlint.select = [\n    \"E\",\n    \"W\", # see: https://pypi.org/project/pycodestyle\n    \"F\", # see: https://pypi.org/project/pyflakes\n    \"I\", #see: https://pypi.org/project/isort/\n    \"D\", # see: https://pypi.org/project/pydocstyle\n    #\"N\", # see: https://pypi.org/project/pep8-naming\n    \"S\", # see: https://pypi.org/project/flake8-bandit\n    \"SIM\",\n    \"RUF100\" # alternative to yesqa\n]\nlint.extend-select = [\n    \"A\",    # see: https://pypi.org/project/flake8-builtins\n    \"B\",    # see: https://pypi.org/project/flake8-bugbear\n    \"C4\",   # see: https://pypi.org/project/flake8-comprehensions\n]\nlint.ignore = [\n    \"E203\",\n    \"C901\",\n]\n# Exclude a variety of commonly ignored directories.\nexclude = [\n    \".eggs\",\n    \".git\",\n    \".ruff_cache\",\n    \"__pypackages__\",\n    \"_build\",\n    \"build\",\n    \"dist\",\n]\nlint.ignore-init-module-imports = true\nlint.unfixable = [\"F401\"]\n\n[tool.ruff.lint.per-file-ignores]\n\"src/**/__init__.py\" = [\"D104\"]\n\"src/cachier/config.py\" = [\"D100\"]\n\"tests/**\" = [\n  \"S101\", \"S311\", \"S105\", \"S603\",\n  \"D100\", \"D101\", \"D103\", \"D104\", \"D401\"\n]\n\n#[tool.ruff.pydocstyle]\n## Use Google-style docstrings.\n#convention = \"google\"\n\n#[tool.ruff.pycodestyle]\n#ignore-overlong-task-comments = true\n\n[tool.ruff.lint.mccabe]\n# Unlike Flake8, default to a complexity level of 10.\nmax-complexity = 10\n\n[tool.coverage.run]\nbranch = true\ndynamic_context = \"test_function\"\nomit = [\n    \"tests/*\",\n    \"cachier/_version.py\",\n    \"cachier/__init__.py\",\n    \"**/scripts/**\",\n]\n[tool.coverage.report]\nshow_missing = true\n# Regexes for lines to exclude from consideration\nexclude_lines = [\n    \"pragma: no cover\", # Have to re-enable the standard pragma\n    \"raise NotImplementedError\", # Don't complain if tests don't hit defensive assertion code:\n    \"if TYPE_CHECKING:\", # Is only true when running mypy, not tests\n]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/__init__.py",
      "content": "from ._version import *  # noqa: F403\nfrom .config import (\n    disable_caching,\n    enable_caching,\n    get_default_params,\n    set_default_params,\n)\nfrom .core import cachier\n\n__all__ = [\n    \"cachier\",\n    \"set_default_params\",\n    \"get_default_params\",\n    \"enable_caching\",\n    \"disable_caching\",\n]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/config.py",
      "content": "import datetime\nimport hashlib\nimport os\nimport pickle\nfrom typing import Optional, TypedDict, Union\n\nfrom ._types import Backend, HashFunc, Mongetter\n\n\ndef _default_hash_func(args, kwds):\n    # Sort the kwargs to ensure consistent ordering\n    sorted_kwargs = sorted(kwds.items())\n    # Serialize args and sorted_kwargs using pickle or similar\n    serialized = pickle.dumps((args, sorted_kwargs))\n    # Create a hash of the serialized data\n    return hashlib.sha256(serialized).hexdigest()\n\n\nclass Params(TypedDict):\n    \"\"\"Type definition for cachier parameters.\"\"\"\n\n    caching_enabled: bool\n    hash_func: HashFunc\n    backend: Backend\n    mongetter: Optional[Mongetter]\n    stale_after: datetime.timedelta\n    next_time: bool\n    cache_dir: Union[str, os.PathLike]\n    pickle_reload: bool\n    separate_files: bool\n    wait_for_calc_timeout: int\n    allow_none: bool\n\n\n_default_params: Params = {\n    \"caching_enabled\": True,\n    \"hash_func\": _default_hash_func,\n    \"backend\": \"pickle\",\n    \"mongetter\": None,\n    \"stale_after\": datetime.timedelta.max,\n    \"next_time\": False,\n    \"cache_dir\": \"~/.cachier/\",\n    \"pickle_reload\": True,\n    \"separate_files\": False,\n    \"wait_for_calc_timeout\": 0,\n    \"allow_none\": False,\n}\n\n\ndef _update_with_defaults(\n    param, name: str, func_kwargs: Optional[dict] = None\n):\n    import cachier\n\n    if func_kwargs:\n        kw_name = f\"cachier__{name}\"\n        if kw_name in func_kwargs:\n            return func_kwargs.pop(kw_name)\n    if param is None:\n        return cachier.config._default_params[name]\n    return param\n\n\ndef set_default_params(**params):\n    \"\"\"Configure global parameters applicable to all memoized functions.\n\n    This function takes the same keyword parameters as the ones defined in the\n    decorator, which can be passed all at once or with multiple calls.\n    Parameters given directly to a decorator take precedence over any values\n    set by this function.\n\n    Only 'stale_after', 'next_time', and 'wait_for_calc_timeout' can be changed\n    after the memoization decorator has been applied. Other parameters will\n    only have an effect on decorators applied after this function is run.\n\n    \"\"\"\n    import cachier\n\n    valid_params = (\n        p for p in params.items() if p[0] in cachier.config._default_params\n    )\n    _default_params.update(valid_params)\n\n\ndef get_default_params():\n    \"\"\"Get current set of default parameters.\"\"\"\n    import cachier\n\n    return cachier.config._default_params\n\n\ndef enable_caching():\n    \"\"\"Enable caching globally.\"\"\"\n    import cachier\n\n    cachier.config._default_params[\"caching_enabled\"] = True\n\n\ndef disable_caching():\n    \"\"\"Disable caching globally.\"\"\"\n    import cachier\n\n    cachier.config._default_params[\"caching_enabled\"] = False\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/_version.py",
      "content": "# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2024, Jirka Borovec <***@gmail.com>\n\nimport os\n\n_PATH_HERE = os.path.dirname(__file__)\n_PATH_VERSION = os.path.join(_PATH_HERE, \"version.info\")\n_RELEASING_PROCESS = os.getenv(\"RELEASING_PROCESS\", \"0\") == \"1\"\n\nwith open(_PATH_VERSION) as fopen:\n    __version__ = fopen.read().strip()\n\n\ndef _get_git_sha() -> str:\n    from subprocess import DEVNULL, check_output\n\n    out = check_output([\"git\", \"rev-parse\", \"--short\", \"HEAD\"], stderr=DEVNULL)  # noqa: S603, S607\n    return out.decode(\"utf-8\").strip()\n\n\nif not _RELEASING_PROCESS:\n    try:\n        sha_short = _get_git_sha()\n        # print(f\"Version enriched with git commit hash: {__version__}.\")\n    except Exception:\n        # print(\"Failed to get the git commit hash,\"\n        #       f\" falling back to base version {__version__}.\")\n        sha_short = \"\"\n    __version__ += f\".dev+{sha_short}\" if sha_short else \".dev\"\n\n\n__all__ = [\"__version__\"]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/version.info",
      "content": "3.0.0\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/py.typed",
      "content": ""
    },
    {
      "path": "python-cachier_cachier/src/cachier/_types.py",
      "content": "from typing import TYPE_CHECKING, Callable, Literal\n\nif TYPE_CHECKING:\n    import pymongo.collection\n\n\nHashFunc = Callable[..., str]\nMongetter = Callable[[], \"pymongo.collection.Collection\"]\nBackend = Literal[\"pickle\", \"mongo\", \"memory\"]\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/__main__.py",
      "content": "\"\"\"A command-line interface for cachier.\"\"\"\n\nimport click\n\nfrom cachier.core import _set_max_workers\n\n\n@click.group()\ndef cli():\n    \"\"\"A command-line interface for cachier.\"\"\"  # noqa: D401\n\n\n@cli.command(\"Limits the number of worker threads used by cachier.\")\n@click.argument(\"max_workers\", nargs=1, type=int)\ndef set_max_workers(max_workers):\n    \"\"\"Limits the number of worker threads used by cachier.\"\"\"\n    _set_max_workers(max_workers)\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/core.py",
      "content": "\"\"\"Persistent, stale-free memoization decorators for Python.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\nimport datetime\nimport inspect\nimport os\nimport warnings\nfrom collections import OrderedDict\nfrom concurrent.futures import ThreadPoolExecutor\nfrom functools import wraps\nfrom typing import Optional, Union\nfrom warnings import warn\n\nfrom .config import (\n    Backend,\n    HashFunc,\n    Mongetter,\n    _default_params,\n    _update_with_defaults,\n)\nfrom .cores.base import RecalculationNeeded, _BaseCore\nfrom .cores.memory import _MemoryCore\nfrom .cores.mongo import _MongoCore\nfrom .cores.pickle import _PickleCore\n\nMAX_WORKERS_ENVAR_NAME = \"CACHIER_MAX_WORKERS\"\nDEFAULT_MAX_WORKERS = 8\n\n\ndef _max_workers():\n    return int(os.environ.get(MAX_WORKERS_ENVAR_NAME, DEFAULT_MAX_WORKERS))\n\n\ndef _set_max_workers(max_workers):\n    os.environ[MAX_WORKERS_ENVAR_NAME] = str(max_workers)\n    _get_executor(True)\n\n\ndef _get_executor(reset=False):\n    if reset or not hasattr(_get_executor, \"executor\"):\n        _get_executor.executor = ThreadPoolExecutor(_max_workers())\n    return _get_executor.executor\n\n\ndef _function_thread(core, key, func, args, kwds):\n    try:\n        func_res = func(*args, **kwds)\n        core.set_entry(key, func_res)\n    except BaseException as exc:\n        print(f\"Function call failed with the following exception:\\n{exc}\")\n\n\ndef _calc_entry(core, key, func, args, kwds):\n    try:\n        core.mark_entry_being_calculated(key)\n        # _get_executor().submit(core.mark_entry_being_calculated, key)\n        func_res = func(*args, **kwds)\n        core.set_entry(key, func_res)\n        # _get_executor().submit(core.set_entry, key, func_res)\n        return func_res\n    finally:\n        core.mark_entry_not_calculated(key)\n\n\ndef _convert_args_kwargs(\n    func, _is_method: bool, args: tuple, kwds: dict\n) -> dict:\n    \"\"\"Convert mix of positional and keyword arguments to aggregated kwargs.\"\"\"\n    # unwrap if the function is functools.partial\n    if hasattr(func, \"func\"):\n        args = func.args + args\n        kwds.update({k: v for k, v in func.keywords.items() if k not in kwds})\n        func = func.func\n    func_params = list(inspect.signature(func).parameters)\n    args_as_kw = dict(\n        zip(func_params[1:], args[1:])\n        if _is_method\n        else zip(func_params, args)\n    )\n    # init with default values\n    kwargs = {\n        k: v.default\n        for k, v in inspect.signature(func).parameters.items()\n        if v.default is not inspect.Parameter.empty\n    }\n    # merge args expanded as kwargs and the original kwds\n    kwargs.update(dict(**args_as_kw, **kwds))\n    return OrderedDict(sorted(kwargs.items()))\n\n\ndef _pop_kwds_with_deprecation(kwds, name: str, default_value: bool):\n    if name in kwds:\n        warnings.warn(\n            f\"`{name}` is deprecated and will be removed in a future release,\"\n            \" use `cachier__` alternative instead.\",\n            DeprecationWarning,\n            stacklevel=2,\n        )\n    return kwds.pop(name, default_value)\n\n\ndef cachier(\n    hash_func: Optional[HashFunc] = None,\n    hash_params: Optional[HashFunc] = None,\n    backend: Optional[Backend] = None,\n    mongetter: Optional[Mongetter] = None,\n    stale_after: Optional[datetime.timedelta] = None,\n    next_time: Optional[bool] = None,\n    cache_dir: Optional[Union[str, os.PathLike]] = None,\n    pickle_reload: Optional[bool] = None,\n    separate_files: Optional[bool] = None,\n    wait_for_calc_timeout: Optional[int] = None,\n    allow_none: Optional[bool] = None,\n):\n    \"\"\"Wrap as a persistent, stale-free memoization decorator.\n\n    The positional and keyword arguments to the wrapped function must be\n    hashable (i.e. Python's immutable built-in objects, not mutable\n    containers). Also, notice that since objects which are instances of\n    user-defined classes are hashable but all compare unequal (their hash\n    value is their id), equal objects across different sessions will not yield\n    identical keys.\n\n    Arguments:\n    ---------\n    hash_func : callable, optional\n        A callable that gets the args and kwargs from the decorated function\n        and returns a hash key for them. This parameter can be used to enable\n        the use of cachier with functions that get arguments that are not\n        automatically hashable by Python.\n    hash_params : callable, optional\n    backend : str, optional\n        The name of the backend to use. Valid options currently include\n        'pickle', 'mongo' and 'memory'. If not provided, defaults to\n        'pickle' unless the 'mongetter' argument is passed, in which\n        case the mongo backend is automatically selected.\n    mongetter : callable, optional\n        A callable that takes no arguments and returns a pymongo.Collection\n        object with writing permissions. If unset a local pickle cache is used\n        instead.\n    stale_after : datetime.timedelta, optional\n        The time delta after which a cached result is considered stale. Calls\n        made after the result goes stale will trigger a recalculation of the\n        result, but whether a stale or fresh result will be returned is\n        determined by the optional next_time argument.\n    next_time : bool, optional\n        If set to True, a stale result will be returned when finding one, not\n        waiting for the calculation of the fresh result to return. Defaults to\n        False.\n    cache_dir : str, optional\n        A fully qualified path to a file directory to be used for cache files.\n        The running process must have running permissions to this folder. If\n        not provided, a default directory at `~/.cachier/` is used.\n    pickle_reload : bool, optional\n        If set to True, in-memory cache will be reloaded on each cache read,\n        enabling different threads to share cache. Should be set to False for\n        faster reads in single-thread programs. Defaults to True.\n    separate_files: bool, default False, for Pickle cores only\n        Instead of a single cache file per-function, each function's cache is\n        split between several files, one for each argument set. This can help\n        if you per-function cache files become too large.\n    wait_for_calc_timeout: int, optional, for MongoDB only\n        The maximum time to wait for an ongoing calculation. When a\n        process started to calculate the value setting being_calculated to\n        True, any process trying to read the same entry will wait a maximum of\n        seconds specified in this parameter. 0 means wait forever.\n        Once the timeout expires the calculation will be triggered.\n    allow_none: bool, optional\n        Allows storing None values in the cache. If False, functions returning\n        None will not be cached and are recalculated every call.\n\n    \"\"\"\n    # Check for deprecated parameters\n    if hash_params is not None:\n        message = (\n            \"hash_params will be removed in a future release, \"\n            \"please use hash_func instead\"\n        )\n        warn(message, DeprecationWarning, stacklevel=2)\n        hash_func = hash_params\n    # Update parameters with defaults if input is None\n    backend = _update_with_defaults(backend, \"backend\")\n    mongetter = _update_with_defaults(mongetter, \"mongetter\")\n    # Override the backend parameter if a mongetter is provided.\n    if callable(mongetter):\n        backend = \"mongo\"\n    core: _BaseCore\n    if backend == \"pickle\":\n        core = _PickleCore(\n            hash_func=hash_func,\n            pickle_reload=pickle_reload,\n            cache_dir=cache_dir,\n            separate_files=separate_files,\n            wait_for_calc_timeout=wait_for_calc_timeout,\n        )\n    elif backend == \"mongo\":\n        core = _MongoCore(\n            hash_func=hash_func,\n            mongetter=mongetter,\n            wait_for_calc_timeout=wait_for_calc_timeout,\n        )\n    elif backend == \"memory\":\n        core = _MemoryCore(\n            hash_func=hash_func, wait_for_calc_timeout=wait_for_calc_timeout\n        )\n    else:\n        raise ValueError(\"specified an invalid core: %s\" % backend)\n\n    def _cachier_decorator(func):\n        core.set_func(func)\n\n        @wraps(func)\n        def func_wrapper(*args, **kwds):\n            nonlocal allow_none\n            _allow_none = _update_with_defaults(allow_none, \"allow_none\", kwds)\n            # print('Inside general wrapper for {}.'.format(func.__name__))\n            ignore_cache = _pop_kwds_with_deprecation(\n                kwds, \"ignore_cache\", False\n            )\n            overwrite_cache = _pop_kwds_with_deprecation(\n                kwds, \"overwrite_cache\", False\n            )\n            verbose = _pop_kwds_with_deprecation(kwds, \"verbose_cache\", False)\n            ignore_cache = kwds.pop(\"cachier__skip_cache\", ignore_cache)\n            overwrite_cache = kwds.pop(\n                \"cachier__overwrite_cache\", overwrite_cache\n            )\n            verbose = kwds.pop(\"cachier__verbose\", verbose)\n            _stale_after = _update_with_defaults(\n                stale_after, \"stale_after\", kwds\n            )\n            _next_time = _update_with_defaults(next_time, \"next_time\", kwds)\n            # merge args expanded as kwargs and the original kwds\n            kwargs = _convert_args_kwargs(\n                func, _is_method=core.func_is_method, args=args, kwds=kwds\n            )\n\n            _print = lambda x: None  # noqa: E731\n            if verbose:\n                _print = print\n            if ignore_cache or not _default_params[\"caching_enabled\"]:\n                return func(**kwargs)\n            key, entry = core.get_entry((), kwargs)\n            if overwrite_cache:\n                return _calc_entry(core, key, func, args, kwds)\n            if entry is None:\n                _print(\"No entry found. No current calc. Calling like a boss.\")\n                return _calc_entry(core, key, func, args, kwds)\n            _print(\"Entry found.\")\n            if _allow_none or entry.get(\"value\", None) is not None:\n                _print(\"Cached result found.\")\n                now = datetime.datetime.now()\n                if now - entry[\"time\"] <= _stale_after:\n                    _print(\"And it is fresh!\")\n                    return entry[\"value\"]\n                _print(\"But it is stale... :(\")\n                if entry[\"being_calculated\"]:\n                    if _next_time:\n                        _print(\"Returning stale.\")\n                        return entry[\"value\"]  # return stale val\n                    _print(\"Already calc. Waiting on change.\")\n                    try:\n                        return core.wait_on_entry_calc(key)\n                    except RecalculationNeeded:\n                        return _calc_entry(core, key, func, args, kwds)\n                if _next_time:\n                    _print(\"Async calc and return stale\")\n                    try:\n                        core.mark_entry_being_calculated(key)\n                        _get_executor().submit(\n                            _function_thread, core, key, func, args, kwds\n                        )\n                    finally:\n                        core.mark_entry_not_calculated(key)\n                    return entry[\"value\"]\n                _print(\"Calling decorated function and waiting\")\n                return _calc_entry(core, key, func, args, kwds)\n            if entry[\"being_calculated\"]:\n                _print(\"No value but being calculated. Waiting.\")\n                try:\n                    return core.wait_on_entry_calc(key)\n                except RecalculationNeeded:\n                    return _calc_entry(core, key, func, args, kwds)\n            _print(\"No entry found. No current calc. Calling like a boss.\")\n            return _calc_entry(core, key, func, args, kwds)\n\n        def _clear_cache():\n            \"\"\"Clear the cache.\"\"\"\n            core.clear_cache()\n\n        def _clear_being_calculated():\n            \"\"\"Mark all entries in this cache as not being calculated.\"\"\"\n            core.clear_being_calculated()\n\n        def _cache_dpath():\n            \"\"\"Return the path to the cache dir, if exists; None if not.\"\"\"\n            return getattr(core, \"cache_dir\", None)\n\n        def _precache_value(*args, value_to_cache, **kwds):  # noqa: D417\n            \"\"\"Add an initial value to the cache.\n\n            Arguments:\n            ---------\n            value_to_cache : any\n                entry to be written into the cache\n\n            \"\"\"\n            # merge args expanded as kwargs and the original kwds\n            kwargs = _convert_args_kwargs(\n                func, _is_method=core.func_is_method, args=args, kwds=kwds\n            )\n            return core.precache_value((), kwargs, value_to_cache)\n\n        func_wrapper.clear_cache = _clear_cache\n        func_wrapper.clear_being_calculated = _clear_being_calculated\n        func_wrapper.cache_dpath = _cache_dpath\n        func_wrapper.precache_value = _precache_value\n        return func_wrapper\n\n    return _cachier_decorator\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/__init__.py",
      "content": ""
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/base.py",
      "content": "\"\"\"Defines the interface of a cachier caching core.\"\"\"\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\nimport abc  # for the _BaseCore abstract base class\nimport inspect\nimport threading\nfrom typing import Callable\n\nfrom .._types import HashFunc\nfrom ..config import _update_with_defaults\n\n\nclass RecalculationNeeded(Exception):\n    \"\"\"Exception raised when a recalculation is needed.\"\"\"\n\n    pass\n\n\ndef _get_func_str(func: Callable) -> str:\n    return f\".{func.__module__}.{func.__name__}\"\n\n\nclass _BaseCore:\n    __metaclass__ = abc.ABCMeta\n\n    def __init__(self, hash_func: HashFunc, wait_for_calc_timeout: int):\n        self.hash_func = _update_with_defaults(hash_func, \"hash_func\")\n        self.wait_for_calc_timeout = wait_for_calc_timeout\n        self.lock = threading.RLock()\n\n    def set_func(self, func):\n        \"\"\"Set the function this core will use.\n\n        This has to be set before any method is called. Also determine if the\n        function is an object method.\n\n        \"\"\"\n        # unwrap if the function is functools.partial\n        if hasattr(func, \"func\"):\n            func = func.func\n        func_params = list(inspect.signature(func).parameters)\n        self.func_is_method = func_params and func_params[0] == \"self\"\n        self.func = func\n\n    def get_key(self, args, kwds):\n        \"\"\"Return a unique key based on the arguments provided.\"\"\"\n        return self.hash_func(args, kwds)\n\n    def get_entry(self, args, kwds):\n        \"\"\"Get entry based on given arguments.\n\n        Return the result mapped to the given arguments in this core's cache,\n        if such a mapping exists.\n\n        \"\"\"\n        key = self.get_key(args, kwds)\n        return self.get_entry_by_key(key)\n\n    def precache_value(self, args, kwds, value_to_cache):\n        \"\"\"Write a precomputed value into the cache.\"\"\"\n        key = self.get_key(args, kwds)\n        self.set_entry(key, value_to_cache)\n        return value_to_cache\n\n    def check_calc_timeout(self, time_spent):\n        \"\"\"Raise an exception if a recalculation is needed.\"\"\"\n        calc_timeout = _update_with_defaults(\n            self.wait_for_calc_timeout, \"wait_for_calc_timeout\"\n        )\n        if calc_timeout > 0 and (time_spent >= calc_timeout):\n            raise RecalculationNeeded()\n\n    @abc.abstractmethod\n    def get_entry_by_key(self, key):\n        \"\"\"Get entry based on given key.\n\n        Return the result mapped to the given key in this core's cache, if such\n        a mapping exists.\n\n        \"\"\"\n\n    @abc.abstractmethod\n    def set_entry(self, key, func_res):\n        \"\"\"Map the given result to the given key in this core's cache.\"\"\"\n\n    @abc.abstractmethod\n    def mark_entry_being_calculated(self, key):\n        \"\"\"Mark the entry mapped by the given key as being calculated.\"\"\"\n\n    @abc.abstractmethod\n    def mark_entry_not_calculated(self, key):\n        \"\"\"Mark the entry mapped by the given key as not being calculated.\"\"\"\n\n    @abc.abstractmethod\n    def wait_on_entry_calc(self, key):\n        \"\"\"Wait on the entry with keys being calculated and returns result.\"\"\"\n\n    @abc.abstractmethod\n    def clear_cache(self):\n        \"\"\"Clear the cache of this core.\"\"\"\n\n    @abc.abstractmethod\n    def clear_being_calculated(self):\n        \"\"\"Mark all entries in this cache as not being calculated.\"\"\"\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/pickle.py",
      "content": "\"\"\"A pickle-based caching core for cachier.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\nimport os\nimport pickle  # for local caching\nfrom contextlib import suppress\nfrom datetime import datetime\n\nimport portalocker  # to lock on pickle cache IO\nfrom watchdog.events import PatternMatchingEventHandler\nfrom watchdog.observers import Observer\n\nfrom .._types import HashFunc\nfrom ..config import _update_with_defaults\n\n# Alternative:  https://github.com/WoLpH/portalocker\nfrom .base import _BaseCore\n\n\nclass _PickleCore(_BaseCore):\n    \"\"\"The pickle core class for cachier.\"\"\"\n\n    class CacheChangeHandler(PatternMatchingEventHandler):\n        \"\"\"Handles cache-file modification events.\"\"\"\n\n        def __init__(self, filename, core, key):\n            PatternMatchingEventHandler.__init__(\n                self,\n                patterns=[\"*\" + filename],\n                ignore_patterns=None,\n                ignore_directories=True,\n                case_sensitive=False,\n            )\n            self.core = core\n            self.key = key\n            self.observer = None\n            self.value = None\n\n        def inject_observer(self, observer):\n            \"\"\"Inject the observer running this handler.\"\"\"\n            self.observer = observer\n\n        def _check_calculation(self):\n            # print('checking calc')\n            entry = self.core.get_entry_by_key(self.key, True)[1]\n            # print(self.key)\n            # print(entry)\n            try:\n                if not entry[\"being_calculated\"]:\n                    # print('stopping observer!')\n                    self.value = entry[\"value\"]\n                    self.observer.stop()\n                # else:\n                # print('NOT stopping observer... :(')\n            except TypeError:\n                self.value = None\n                self.observer.stop()\n\n        def on_created(self, event):\n            \"\"\"A Watchdog Event Handler method.\"\"\"  # noqa: D401\n            self._check_calculation()  # pragma: no cover\n\n        def on_modified(self, event):\n            \"\"\"A Watchdog Event Handler method.\"\"\"  # noqa: D401\n            self._check_calculation()\n\n    def __init__(\n        self,\n        hash_func: HashFunc,\n        pickle_reload: bool,\n        cache_dir: str,\n        separate_files: bool,\n        wait_for_calc_timeout: int,\n    ):\n        super().__init__(hash_func, wait_for_calc_timeout)\n        self.cache = None\n        self.reload = _update_with_defaults(pickle_reload, \"pickle_reload\")\n        self.cache_dir = os.path.expanduser(\n            _update_with_defaults(cache_dir, \"cache_dir\")\n        )\n        self.separate_files = _update_with_defaults(\n            separate_files, \"separate_files\"\n        )\n\n    @property\n    def cache_fname(self) -> str:\n        fname = f\".{self.func.__module__}.{self.func.__qualname__}\"\n        return fname.replace(\"<\", \"_\").replace(\">\", \"_\")\n\n    @property\n    def cache_fpath(self) -> str:\n        os.makedirs(self.cache_dir, exist_ok=True)\n        return os.path.abspath(\n            os.path.join(os.path.realpath(self.cache_dir), self.cache_fname)\n        )\n\n    def _reload_cache(self):\n        with self.lock:\n            try:\n                with portalocker.Lock(\n                    self.cache_fpath, mode=\"rb\"\n                ) as cache_file:\n                    self.cache = pickle.load(cache_file)  # noqa: S301\n            except (FileNotFoundError, EOFError):\n                self.cache = {}\n\n    def _get_cache(self):\n        with self.lock:\n            if not self.cache:\n                self._reload_cache()\n            return self.cache\n\n    def _get_cache_by_key(self, key=None, hash_str=None):\n        fpath = self.cache_fpath\n        fpath += f\"_{hash_str or key}\"\n        try:\n            with portalocker.Lock(fpath, mode=\"rb\") as cache_file:\n                return pickle.load(cache_file)  # noqa: S301\n        except (FileNotFoundError, EOFError):\n            return None\n\n    def _clear_all_cache_files(self):\n        path, name = os.path.split(self.cache_fpath)\n        for subpath in os.listdir(path):\n            if subpath.startswith(f\"{name}_\"):\n                os.remove(os.path.join(path, subpath))\n\n    def _clear_being_calculated_all_cache_files(self):\n        path, name = os.path.split(self.cache_fpath)\n        for subpath in os.listdir(path):\n            if subpath.startswith(name):\n                entry = self._get_cache_by_key(hash_str=subpath.split(\"_\")[-1])\n                if entry is not None:\n                    entry[\"being_calculated\"] = False\n                    self._save_cache(entry, hash_str=subpath.split(\"_\")[-1])\n\n    def _save_cache(self, cache, key=None, hash_str=None):\n        fpath = self.cache_fpath\n        if key is not None:\n            fpath += f\"_{key}\"\n        elif hash_str is not None:\n            fpath += f\"_{hash_str}\"\n        with self.lock:\n            self.cache = cache\n            with portalocker.Lock(fpath, mode=\"wb\") as cache_file:\n                pickle.dump(cache, cache_file, protocol=4)\n            if key is None:\n                self._reload_cache()\n\n    def get_entry_by_key(self, key, reload=False):\n        with self.lock:\n            if self.separate_files:\n                return key, self._get_cache_by_key(key)\n            if self.reload or reload:\n                self._reload_cache()\n            return key, self._get_cache().get(key, None)\n\n    def set_entry(self, key, func_res):\n        key_data = {\n            \"value\": func_res,\n            \"time\": datetime.now(),\n            \"stale\": False,\n            \"being_calculated\": False,\n        }\n        if self.separate_files:\n            self._save_cache(key_data, key)\n            return  # pragma: no cover\n\n        with self.lock:\n            cache = self._get_cache()\n            cache[key] = key_data\n            self._save_cache(cache)\n\n    def mark_entry_being_calculated_separate_files(self, key):\n        self._save_cache(\n            {\n                \"value\": None,\n                \"time\": datetime.now(),\n                \"stale\": False,\n                \"being_calculated\": True,\n            },\n            key=key,\n        )\n\n    def mark_entry_not_calculated_separate_files(self, key):\n        _, entry = self.get_entry_by_key(key)\n        entry[\"being_calculated\"] = False\n        self._save_cache(entry, key=key)\n\n    def mark_entry_being_calculated(self, key):\n        if self.separate_files:\n            self.mark_entry_being_calculated_separate_files(key)\n            return  # pragma: no cover\n\n        with self.lock:\n            cache = self._get_cache()\n            try:\n                cache[key][\"being_calculated\"] = True\n            except KeyError:\n                cache[key] = {\n                    \"value\": None,\n                    \"time\": datetime.now(),\n                    \"stale\": False,\n                    \"being_calculated\": True,\n                }\n            self._save_cache(cache)\n\n    def mark_entry_not_calculated(self, key):\n        if self.separate_files:\n            self.mark_entry_not_calculated_separate_files(key)\n        with self.lock:\n            cache = self._get_cache()\n            # that's ok, we don't need an entry in that case\n            with suppress(KeyError):\n                cache[key][\"being_calculated\"] = False\n                self._save_cache(cache)\n\n    def wait_on_entry_calc(self, key):\n        if self.separate_files:\n            entry = self._get_cache_by_key(key)\n            filename = f\"{self.cache_fname}_{key}\"\n        else:\n            with self.lock:\n                self._reload_cache()\n                entry = self._get_cache()[key]\n            filename = self.cache_fname\n        if not entry[\"being_calculated\"]:\n            return entry[\"value\"]\n        event_handler = _PickleCore.CacheChangeHandler(\n            filename=filename, core=self, key=key\n        )\n        observer = Observer()\n        event_handler.inject_observer(observer)\n        observer.schedule(event_handler, path=self.cache_dir, recursive=True)\n        observer.start()\n        time_spent = 0\n        while observer.is_alive():\n            observer.join(timeout=1.0)\n            time_spent += 1\n            self.check_calc_timeout(time_spent)\n        return event_handler.value\n\n    def clear_cache(self):\n        if self.separate_files:\n            self._clear_all_cache_files()\n        else:\n            self._save_cache({})\n\n    def clear_being_calculated(self):\n        if self.separate_files:\n            self._clear_being_calculated_all_cache_files()\n            return  # pragma: no cover\n\n        with self.lock:\n            cache = self._get_cache()\n            for key in cache:\n                cache[key][\"being_calculated\"] = False\n            self._save_cache(cache)\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/memory.py",
      "content": "\"\"\"A memory-based caching core for cachier.\"\"\"\n\nimport threading\nfrom datetime import datetime\n\nfrom .._types import HashFunc\nfrom .base import _BaseCore, _get_func_str\n\n\nclass _MemoryCore(_BaseCore):\n    \"\"\"The memory core class for cachier.\"\"\"\n\n    def __init__(self, hash_func: HashFunc, wait_for_calc_timeout: int):\n        super().__init__(hash_func, wait_for_calc_timeout)\n        self.cache = {}\n\n    def _hash_func_key(self, key):\n        return f\"{_get_func_str(self.func)}:{key}\"\n\n    def get_entry_by_key(self, key, reload=False):\n        with self.lock:\n            return key, self.cache.get(self._hash_func_key(key), None)\n\n    def set_entry(self, key, func_res):\n        with self.lock:\n            try:\n                # we need to retain the existing condition so that\n                # mark_entry_not_calculated can notify all possibly-waiting\n                # threads about it\n                cond = self.cache[self._hash_func_key(key)][\"condition\"]\n            except KeyError:  # pragma: no cover\n                cond = None\n            self.cache[self._hash_func_key(key)] = {\n                \"value\": func_res,\n                \"time\": datetime.now(),\n                \"stale\": False,\n                \"being_calculated\": False,\n                \"condition\": cond,\n            }\n\n    def mark_entry_being_calculated(self, key):\n        with self.lock:\n            condition = threading.Condition()\n            # condition.acquire()\n            try:\n                self.cache[self._hash_func_key(key)][\"being_calculated\"] = True\n                self.cache[self._hash_func_key(key)][\"condition\"] = condition\n            except KeyError:\n                self.cache[self._hash_func_key(key)] = {\n                    \"value\": None,\n                    \"time\": datetime.now(),\n                    \"stale\": False,\n                    \"being_calculated\": True,\n                    \"condition\": condition,\n                }\n\n    def mark_entry_not_calculated(self, key):\n        with self.lock:\n            try:\n                entry = self.cache[self._hash_func_key(key)]\n            except KeyError:  # pragma: no cover\n                return  # that's ok, we don't need an entry in that case\n            entry[\"being_calculated\"] = False\n            cond = entry[\"condition\"]\n            if cond:\n                cond.acquire()\n                cond.notify_all()\n                cond.release()\n                entry[\"condition\"] = None\n\n    def wait_on_entry_calc(self, key):\n        with self.lock:  # pragma: no cover\n            entry = self.cache[self._hash_func_key(key)]\n            if not entry[\"being_calculated\"]:\n                return entry[\"value\"]\n        entry[\"condition\"].acquire()\n        entry[\"condition\"].wait()\n        entry[\"condition\"].release()\n        return self.cache[self._hash_func_key(key)][\"value\"]\n\n    def clear_cache(self):\n        with self.lock:\n            self.cache.clear()\n\n    def clear_being_calculated(self):\n        with self.lock:\n            for entry in self.cache.values():\n                entry[\"being_calculated\"] = False\n                entry[\"condition\"] = None\n"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/mongo.py",
      "content": "\"\"\"A MongoDB-based caching core for cachier.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\nimport pickle  # for serialization of python objects\nimport sys  # to make sure that pymongo was imported\nimport time  # to sleep when waiting on Mongo cache\\\nimport warnings  # to warn if pymongo is missing\nfrom contextlib import suppress\nfrom datetime import datetime\n\nfrom .._types import HashFunc, Mongetter\n\nwith suppress(ImportError):\n    from bson.binary import Binary  # to save binary data to mongodb\n    from pymongo import ASCENDING, IndexModel\n    from pymongo.errors import OperationFailure\n\nfrom .base import RecalculationNeeded, _BaseCore, _get_func_str\n\nMONGO_SLEEP_DURATION_IN_SEC = 1\n\n\nclass MissingMongetter(ValueError):\n    \"\"\"Thrown when the mongetter keyword argument is missing.\"\"\"\n\n\nclass _MongoCore(_BaseCore):\n    _INDEX_NAME = \"func_1_key_1\"\n\n    def __init__(\n        self,\n        hash_func: HashFunc,\n        mongetter: Mongetter,\n        wait_for_calc_timeout: int,\n    ):\n        if \"pymongo\" not in sys.modules:\n            warnings.warn(\n                \"`pymongo` was not found. MongoDB cores will not function.\",\n                ImportWarning,\n                stacklevel=2,\n            )  # pragma: no cover\n\n        super().__init__(hash_func, wait_for_calc_timeout)\n        if mongetter is None:\n            raise MissingMongetter(\n                \"must specify ``mongetter`` when using the mongo core\"\n            )\n        self.mongetter = mongetter\n        self.mongo_collection = self.mongetter()\n        index_inf = self.mongo_collection.index_information()\n        if _MongoCore._INDEX_NAME not in index_inf:\n            func1key1 = IndexModel(\n                keys=[(\"func\", ASCENDING), (\"key\", ASCENDING)],\n                name=_MongoCore._INDEX_NAME,\n            )\n            self.mongo_collection.create_indexes([func1key1])\n\n    @property\n    def _func_str(self) -> str:\n        return _get_func_str(self.func)\n\n    def get_entry_by_key(self, key):\n        res = self.mongo_collection.find_one(\n            {\"func\": self._func_str, \"key\": key}\n        )\n        if not res:\n            return key, None\n        try:\n            entry = {\n                \"value\": pickle.loads(res[\"value\"]),  # noqa: S301\n                \"time\": res.get(\"time\", None),\n                \"stale\": res.get(\"stale\", False),\n                \"being_calculated\": res.get(\"being_calculated\", False),\n            }\n        except KeyError:\n            entry = {\n                \"value\": None,\n                \"time\": res.get(\"time\", None),\n                \"stale\": res.get(\"stale\", False),\n                \"being_calculated\": res.get(\"being_calculated\", False),\n            }\n        return key, entry\n\n    def set_entry(self, key, func_res):\n        thebytes = pickle.dumps(func_res)\n        self.mongo_collection.update_one(\n            filter={\"func\": self._func_str, \"key\": key},\n            update={\n                \"$set\": {\n                    \"func\": self._func_str,\n                    \"key\": key,\n                    \"value\": Binary(thebytes),\n                    \"time\": datetime.now(),\n                    \"stale\": False,\n                    \"being_calculated\": False,\n                }\n            },\n            upsert=True,\n        )\n\n    def mark_entry_being_calculated(self, key):\n        self.mongo_collection.update_one(\n            filter={\"func\": self._func_str, \"key\": key},\n            update={\"$set\": {\"being_calculated\": True}},\n            upsert=True,\n        )\n\n    def mark_entry_not_calculated(self, key):\n        with suppress(OperationFailure):  # don't care in this case\n            self.mongo_collection.update_one(\n                filter={\n                    \"func\": self._func_str,\n                    \"key\": key,\n                },\n                update={\"$set\": {\"being_calculated\": False}},\n                upsert=False,  # should not insert in this case\n            )\n\n    def wait_on_entry_calc(self, key):\n        time_spent = 0\n        while True:\n            time.sleep(MONGO_SLEEP_DURATION_IN_SEC)\n            time_spent += MONGO_SLEEP_DURATION_IN_SEC\n            key, entry = self.get_entry_by_key(key)\n            if entry is None:\n                raise RecalculationNeeded()\n            if not entry[\"being_calculated\"]:\n                return entry[\"value\"]\n            self.check_calc_timeout(time_spent)\n\n    def clear_cache(self):\n        pass\n\n    def clear_being_calculated(self):\n        self.mongo_collection.update_many(\n            filter={\n                \"func\": self._func_str,\n                \"being_calculated\": True,\n            },\n            update={\"$set\": {\"being_calculated\": False}},\n        )\n"
    },
    {
      "path": "python-cachier_cachier/tests/__init__.py",
      "content": ""
    },
    {
      "path": "python-cachier_cachier/tests/standalone_script.py",
      "content": "import time\n\nimport cachier\n\n\n@cachier.cachier()\ndef _takes_3_seconds(label, value):\n    time.sleep(3)\n    return f\"{label} {value}\"\n\n\nprint(_takes_3_seconds(\"two\", 2))\n"
    },
    {
      "path": "python-cachier_cachier/tests/speed_eval.py",
      "content": "from random import random\nfrom time import time\n\nfrom cachier import cachier\n\n\n@cachier(next_time=True)\ndef _test_int_pickling(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    return int_1 + int_2\n\n\ndef _test_int_pickling_compare(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    return int_1 + int_2\n\n\ndef test_pickle_speed():\n    \"\"\"Test speeds.\"\"\"\n    print(\"Comparing speeds of decorated vs non-decorated functions...\")\n    num_of_vals = 1000\n    times = []\n    for i in range(1, num_of_vals):\n        tic = time()\n        _test_int_pickling_compare(i, i + 1)\n        toc = time()\n        times.append(toc - tic)\n    print(f\"  - Non-decorated average = {sum(times) / num_of_vals}\")\n\n    _test_int_pickling.clear_cache()\n    times = []\n    for i in range(1, num_of_vals):\n        tic = time()\n        _test_int_pickling(i, i + 1)\n        toc = time()\n        times.append(toc - tic)\n    print(f\"  - Decorated average = {sum(times) / num_of_vals}\")\n\n\n@cachier()\ndef _test_single_file_speed(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    # something that takes some memory\n    return [random() for _ in range(1000000)]\n\n\n@cachier(separate_files=True)\ndef _test_separate_files_speed(int_1, int_2):\n    \"\"\"Add the two given ints.\"\"\"\n    # something that takes some memory\n    return [random() for _ in range(1000000)]\n\n\ndef test_separate_files_vs_single_file():\n    _test_separate_files_speed.clear_cache()\n    _test_single_file_speed.clear_cache()\n    start_time = time()\n    for _ in range(3):\n        for j in range(10):\n            _test_separate_files_speed(j, 2)\n    print(f\"separate files time: {time() - start_time}\")\n    start_time = time()\n    for _ in range(3):\n        for j in range(10):\n            _test_single_file_speed(j, 2)\n    print(f\"single file time: {time() - start_time}\")\n\n\nif __name__ == \"__main__\":\n    test_pickle_speed()\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_memory_core.py",
      "content": "\"\"\"Test for the in-memory implementation of the  Cachier python package.\"\"\"\n\nimport hashlib\nimport queue\nimport threading\nfrom datetime import timedelta\nfrom random import random\nfrom time import sleep, time\n\nimport pandas as pd\nimport pytest\nfrom cachier import cachier\n\n\n@cachier(backend=\"memory\", next_time=False)\ndef _takes_2_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)\n    return f\"arg_1:{arg_1}, arg_2:{arg_2}\"\n\n\n@pytest.mark.memory\ndef test_memory_core():\n    \"\"\"Basic memory core functionality.\"\"\"\n    _takes_2_seconds.clear_cache()\n    _takes_2_seconds(\"a\", \"b\")\n    start = time()\n    _takes_2_seconds(\"a\", \"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds.clear_cache()\n\n\n@pytest.mark.memory\ndef test_memory_core_keywords():\n    \"\"\"Basic memory core functionality with keyword arguments.\"\"\"\n    _takes_2_seconds.clear_cache()\n    _takes_2_seconds(\"a\", arg_2=\"b\")\n    start = time()\n    _takes_2_seconds(\"a\", arg_2=\"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds.clear_cache()\n\n\nSECONDS_IN_DELTA = 3\nDELTA = timedelta(seconds=SECONDS_IN_DELTA)\n\n\n@cachier(backend=\"memory\", stale_after=DELTA, next_time=False)\ndef _stale_after_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.memory\ndef test_stale_after():\n    \"\"\"Testing the stale_after functionality.\"\"\"\n    _stale_after_seconds.clear_cache()\n    val1 = _stale_after_seconds(1, 2)\n    val2 = _stale_after_seconds(1, 2)\n    val3 = _stale_after_seconds(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(3)\n    val4 = _stale_after_seconds(1, 2)\n    assert val4 != val1\n    _stale_after_seconds.clear_cache()\n\n\n@cachier(backend=\"memory\", stale_after=DELTA, next_time=True)\ndef _stale_after_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.memory\ndef test_stale_after_next_time():\n    \"\"\"Testing the stale_after with next_time functionality.\"\"\"\n    _stale_after_next_time.clear_cache()\n    val1 = _stale_after_next_time(1, 2)\n    val2 = _stale_after_next_time(1, 2)\n    val3 = _stale_after_next_time(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(SECONDS_IN_DELTA + 1)\n    val4 = _stale_after_next_time(1, 2)\n    assert val4 == val1\n    sleep(0.5)\n    val5 = _stale_after_next_time(1, 2)\n    assert val5 != val1\n    _stale_after_next_time.clear_cache()\n\n\n@cachier(backend=\"memory\")\ndef _random_num():\n    return random()\n\n\n@cachier(backend=\"memory\")\ndef _random_num_with_arg(a):\n    # print(a)\n    return random()\n\n\n@pytest.mark.memory\ndef test_overwrite_cache():\n    \"\"\"Tests that the overwrite feature works correctly.\"\"\"\n    _random_num.clear_cache()\n    int1 = _random_num()\n    int2 = _random_num()\n    assert int2 == int1\n    int3 = _random_num(cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num()\n    assert int4 == int3\n    _random_num.clear_cache()\n\n    _random_num_with_arg.clear_cache()\n    int1 = _random_num_with_arg(\"a\")\n    int2 = _random_num_with_arg(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg(\"a\", cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg(\"a\")\n    assert int4 == int3\n    _random_num_with_arg.clear_cache()\n\n\n@pytest.mark.memory\ndef test_ignore_cache():\n    \"\"\"Tests that the ignore_cache feature works correctly.\"\"\"\n    _random_num.clear_cache()\n    int1 = _random_num()\n    int2 = _random_num()\n    assert int2 == int1\n    int3 = _random_num(cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num()\n    assert int4 != int3\n    assert int4 == int1\n    _random_num.clear_cache()\n\n    _random_num_with_arg.clear_cache()\n    int1 = _random_num_with_arg(\"a\")\n    int2 = _random_num_with_arg(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg(\"a\", cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg(\"a\")\n    assert int4 != int3\n    assert int4 == int1\n    _random_num_with_arg.clear_cache()\n\n\n@cachier(backend=\"memory\")\ndef _takes_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)  # this has to be enough time for check_calculation to run twice\n    return random() + arg_1 + arg_2\n\n\ndef _calls_takes_time(res_queue):\n    res = _takes_time(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.memory\ndef test_memory_being_calculated():\n    \"\"\"Testing memory core handling of being calculated scenarios.\"\"\"\n    _takes_time.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=3)\n    thread2.join(timeout=3)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\n@cachier(backend=\"memory\", stale_after=timedelta(seconds=1), next_time=True)\ndef _being_calc_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\ndef _calls_being_calc_next_time(res_queue):\n    res = _being_calc_next_time(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.memory\ndef test_being_calc_next_time():\n    \"\"\"Testing memory core handling of being calculated scenarios.\"\"\"\n    _takes_time.clear_cache()\n    _being_calc_next_time(0.13, 0.02)\n    sleep(1.1)\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=3)\n    thread2.join(timeout=3)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\n@cachier(backend=\"memory\")\ndef _bad_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n@cachier(backend=\"memory\")\ndef _delete_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n@pytest.mark.memory\ndef test_clear_being_calculated():\n    \"\"\"Test memory core clear `being calculated` functionality.\"\"\"\n    _takes_time.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread1.start()\n    _takes_time.clear_being_calculated()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=3)\n    thread2.join(timeout=3)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 != res2\n\n\n@pytest.mark.memory\ndef test_clear_being_calculated_with_empty_cache():\n    \"\"\"Test memory core clear `being calculated` functionality.\"\"\"\n    _takes_time.clear_cache()\n    _takes_time.clear_being_calculated()\n\n\n@cachier(backend=\"memory\", stale_after=timedelta(seconds=1), next_time=True)\ndef _error_throwing_func(arg1):\n    if not hasattr(_error_throwing_func, \"count\"):\n        _error_throwing_func.count = 0\n    _error_throwing_func.count += 1\n    if _error_throwing_func.count > 1:\n        raise ValueError(\"Tiny Rick!\")\n    return 7\n\n\n@pytest.mark.memory\ndef test_error_throwing_func():\n    # with\n    res1 = _error_throwing_func(4)\n    sleep(1.5)\n    res2 = _error_throwing_func(4)\n    assert res1 == res2\n\n\n@pytest.mark.memory\ndef test_callable_hash_param():\n    def _hash_func(args, kwargs):\n        def _hash(obj):\n            if isinstance(obj, pd.core.frame.DataFrame):\n                return hashlib.sha256(\n                    pd.util.hash_pandas_object(obj).values.tobytes()\n                ).hexdigest()\n            return obj\n\n        k_args = tuple(map(_hash, args))\n        k_kwargs = tuple(\n            sorted({k: _hash(v) for k, v in kwargs.items()}.items())\n        )\n        return k_args + k_kwargs\n\n    @cachier(backend=\"memory\", hash_func=_hash_func)\n    def _params_with_dataframe(*args, **kwargs):\n        \"\"\"Some function.\"\"\"\n        return random()\n\n    _params_with_dataframe.clear_cache()\n\n    df_a = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    df_b = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    value_a = _params_with_dataframe(df_a, 1)\n    value_b = _params_with_dataframe(df_b, 1)\n\n    assert value_a == value_b  # same content --> same key\n\n    value_a = _params_with_dataframe(1, df=df_a)\n    value_b = _params_with_dataframe(1, df=df_b)\n\n    assert value_a == value_b  # same content --> same key\n\n\nif __name__ == \"__main__\":\n    test_memory_being_calculated()\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_core_lookup.py",
      "content": "\"\"\"Testing a few basic cachier interfaces.\"\"\"\n\nimport pytest\nfrom cachier import cachier, get_default_params\nfrom cachier.cores.mongo import MissingMongetter\n\n\ndef test_get_default_params():\n    params = get_default_params()\n    assert tuple(sorted(params)) == (\n        \"allow_none\",\n        \"backend\",\n        \"cache_dir\",\n        \"caching_enabled\",\n        \"hash_func\",\n        \"mongetter\",\n        \"next_time\",\n        \"pickle_reload\",\n        \"separate_files\",\n        \"stale_after\",\n        \"wait_for_calc_timeout\",\n    )\n\n\ndef test_bad_name(name=\"nope\"):\n    # Test that the appropriate exception is thrown\n    # when an invalid backend is given.\n    with pytest.raises(ValueError, match=f\"specified an invalid core: {name}\"):\n\n        @cachier(backend=name)\n        def dummy_func():\n            pass\n\n\ndef test_missing_mongetter():\n    # Test that the appropriate exception is thrown\n    # when forgetting to specify the mongetter.\n    with pytest.raises(MissingMongetter):\n\n        @cachier(backend=\"mongo\", mongetter=None)\n        def dummy_func():\n            pass\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_general.py",
      "content": "\"\"\"Non-core-specific tests for cachier.\"\"\"\n\nimport functools\nimport os\nimport queue\nimport subprocess  # nosec: B404\nimport threading\nfrom contextlib import suppress\nfrom random import random\nfrom time import sleep, time\n\nimport cachier\nimport pytest\nfrom cachier.core import (\n    DEFAULT_MAX_WORKERS,\n    MAX_WORKERS_ENVAR_NAME,\n    _get_executor,\n    _max_workers,\n    _set_max_workers,\n)\n\nfrom tests.test_mongo_core import (\n    MONGO_DELTA_LONG,\n    _test_mongetter,\n)\n\n\ndef test_information():\n    print(\"\\ncachier version: \", end=\"\")\n    print(cachier.__version__)\n\n\ndef test_max_workers():\n    \"\"\"Just call this function for coverage.\"\"\"\n    with suppress(KeyError):\n        del os.environ[MAX_WORKERS_ENVAR_NAME]\n    assert _max_workers() == DEFAULT_MAX_WORKERS\n\n\ndef test_get_executor():\n    \"\"\"Just call this function for coverage.\"\"\"\n    _get_executor()\n    _get_executor(False)\n    _get_executor(True)\n\n\ndef test_set_max_workers():\n    \"\"\"Just call this function for coverage.\"\"\"\n    _set_max_workers(9)\n\n\nparametrize_keys = \"mongetter,stale_after,separate_files\"\nparametrize_values = [\n    pytest.param(\n        _test_mongetter, MONGO_DELTA_LONG, False, marks=pytest.mark.mongo\n    ),\n    (None, None, False),\n    (None, None, True),\n]\n\n\n@pytest.mark.parametrize(parametrize_keys, parametrize_values)\ndef test_wait_for_calc_timeout_ok(mongetter, stale_after, separate_files):\n    @cachier.cachier(\n        mongetter=mongetter,\n        stale_after=stale_after,\n        separate_files=separate_files,\n        next_time=False,\n        wait_for_calc_timeout=2,\n    )\n    def _wait_for_calc_timeout_fast(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        sleep(1)\n        return random() + arg_1 + arg_2\n\n    def _calls_wait_for_calc_timeout_fast(res_queue):\n        res = _wait_for_calc_timeout_fast(1, 2)\n        res_queue.put(res)\n\n    \"\"\" Testing calls that avoid timeouts store the values in cache. \"\"\"\n    _wait_for_calc_timeout_fast.clear_cache()\n    val1 = _wait_for_calc_timeout_fast(1, 2)\n    val2 = _wait_for_calc_timeout_fast(1, 2)\n    assert val1 == val2\n\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_fast,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_fast,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n\n    thread1.start()\n    thread2.start()\n    sleep(2)\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2  # Timeout did not kick in, a single call was done\n\n\n@pytest.mark.parametrize(parametrize_keys, parametrize_values)\ndef test_wait_for_calc_timeout_slow(mongetter, stale_after, separate_files):\n    @cachier.cachier(\n        mongetter=mongetter,\n        stale_after=stale_after,\n        separate_files=separate_files,\n        next_time=False,\n        wait_for_calc_timeout=2,\n    )\n    def _wait_for_calc_timeout_slow(arg_1, arg_2):\n        sleep(3)\n        return random() + arg_1 + arg_2\n\n    def _calls_wait_for_calc_timeout_slow(res_queue):\n        res = _wait_for_calc_timeout_slow(1, 2)\n        res_queue.put(res)\n\n    \"\"\"Testing for calls timing out to be performed twice when needed.\"\"\"\n    _wait_for_calc_timeout_slow.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n\n    thread1.start()\n    thread2.start()\n    sleep(1)\n    res3 = _wait_for_calc_timeout_slow(1, 2)\n    sleep(4)\n    thread1.join(timeout=4)\n    thread2.join(timeout=4)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 != res2  # Timeout kicked in.  Two calls were done\n    res4 = _wait_for_calc_timeout_slow(1, 2)\n    # One of the cached values is returned\n    assert res1 == res4 or res2 == res4 or res3 == res4\n\n\n@pytest.mark.parametrize(\n    \"mongetter,backend\",\n    [\n        pytest.param(_test_mongetter, \"mongo\", marks=pytest.mark.mongo),\n        (None, \"memory\"),\n        (None, \"pickle\"),\n    ],\n)\ndef test_precache_value(mongetter, backend):\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def dummy_func(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return arg_1 + arg_2\n\n    assert dummy_func.precache_value(2, 2, value_to_cache=5) == 5\n    assert dummy_func(2, 2) == 5\n    dummy_func.clear_cache()\n    assert dummy_func(2, 2) == 4\n    assert dummy_func.precache_value(2, arg_2=2, value_to_cache=5) == 5\n    assert dummy_func(2, arg_2=2) == 5\n\n\n@pytest.mark.parametrize(\n    \"mongetter,backend\",\n    [\n        pytest.param(_test_mongetter, \"mongo\", marks=pytest.mark.mongo),\n        (None, \"memory\"),\n        (None, \"pickle\"),\n    ],\n)\ndef test_ignore_self_in_methods(mongetter, backend):\n    class DummyClass:\n        @cachier.cachier(backend=backend, mongetter=mongetter)\n        def takes_2_seconds(self, arg_1, arg_2):\n            \"\"\"Some function.\"\"\"\n            sleep(2)\n            return arg_1 + arg_2\n\n    test_object_1 = DummyClass()\n    test_object_2 = DummyClass()\n    test_object_1.takes_2_seconds.clear_cache()\n    test_object_2.takes_2_seconds.clear_cache()\n    assert test_object_1.takes_2_seconds(1, 2) == 3\n    start = time()\n    assert test_object_2.takes_2_seconds(1, 2) == 3\n    end = time()\n    assert end - start < 1\n\n\ndef test_hash_params_deprecation():\n    with pytest.deprecated_call(match=\"hash_params will be removed\"):\n\n        @cachier.cachier(hash_params=lambda a, k: \"key\")\n        def test():\n            return \"value\"\n\n    assert test() == \"value\"\n\n\ndef test_separate_processes():\n    test_args = (\"python\", \"tests/standalone_script.py\")\n    run_params = {\"args\": test_args, \"capture_output\": True, \"text\": True}\n    run_process = functools.partial(subprocess.run, **run_params)\n    result = run_process()\n    assert result.stdout.strip() == \"two 2\"\n    start = time()\n    result = run_process()\n    end = time()\n    assert result.stdout.strip() == \"two 2\"\n    assert end - start < 3\n\n\ndef test_global_disable():\n    @cachier.cachier()\n    def get_random():\n        return random()\n\n    get_random.clear_cache()\n    result_1 = get_random()\n    result_2 = get_random()\n    cachier.disable_caching()\n    result_3 = get_random()\n    cachier.enable_caching()\n    result_4 = get_random()\n    assert result_1 == result_2 == result_4\n    assert result_1 != result_3\n\n\ndef test_none_not_cached_by_default():\n    count = 0\n\n    @cachier.cachier()\n    def do_operation():\n        nonlocal count\n        count += 1\n        return None\n\n    do_operation.clear_cache()\n    assert count == 0\n    do_operation()\n    do_operation()\n    assert count == 2\n\n\ndef test_allow_caching_none():\n    count = 0\n\n    @cachier.cachier(allow_none=True)\n    def do_operation():\n        nonlocal count\n        count += 1\n        return None\n\n    do_operation.clear_cache()\n    assert count == 0\n    do_operation()\n    do_operation()\n    assert count == 1\n\n\ndef test_identical_inputs():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a: int, b: int = 2, c: int = 3):\n        nonlocal count\n        count += 1\n        return a + b + c\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func(1, 2, 3) == 6\n    assert dummy_func(1, 2, c=3) == 6\n    assert dummy_func(1, b=2, c=3) == 6\n    assert dummy_func(a=1, b=2, c=3) == 6\n    assert count == 1\n\n\ndef test_list_inputs():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a: list, b: list = [2]):  # noqa: B006\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func([1]) == [1, 2]\n    assert dummy_func([1], [2]) == [1, 2]\n    assert dummy_func([1], b=[2]) == [1, 2]\n    assert dummy_func(a=[1], b=[2]) == [1, 2]\n    assert count == 1\n\n\ndef test_order_independent_kwargs_handling():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a, b):\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func(a=1, b=2) == 3\n    assert dummy_func(a=1, b=2) == 3\n    assert dummy_func(b=2, a=1) == 3\n    assert count == 1\n\n\n@pytest.mark.parametrize(\"backend\", [\"memory\", \"pickle\"])\ndef test_diff_functions_same_args(tmpdir, backend: str):\n    count_p = count_m = 0\n\n    @cachier.cachier(cache_dir=tmpdir, backend=backend)\n    def fn_plus(a, b=2):\n        nonlocal count_p\n        count_p += 1\n        return a + b\n\n    @cachier.cachier(cache_dir=tmpdir, backend=backend)\n    def fn_minus(a, b=2):\n        nonlocal count_m\n        count_m += 1\n        return a - b\n\n    assert count_p == count_m == 0\n\n    for fn, expected in [(fn_plus, 3), (fn_minus, -1)]:\n        assert fn(1) == expected\n        assert fn(a=1, b=2) == expected\n    assert count_p == 1\n    assert count_m == 1\n\n\n@pytest.mark.parametrize(\"backend\", [\"memory\", \"pickle\"])\ndef test_runtime_handling(tmpdir, backend):\n    count_p = count_m = 0\n\n    def fn_plus(a, b=2):\n        nonlocal count_p\n        count_p += 1\n        return a + b\n\n    def fn_minus(a, b=2):\n        nonlocal count_m\n        count_m += 1\n        return a - b\n\n    cachier_ = cachier.cachier(cache_dir=tmpdir, backend=backend)\n    assert count_p == count_m == 0\n\n    for fn, expected in [(fn_plus, 3), (fn_minus, -1)]:\n        assert cachier_(fn)(1, 2) == expected\n        assert cachier_(fn)(a=1, b=2) == expected\n    assert count_p == 1\n    assert count_m == 1\n\n    for fn, expected in [(fn_plus, 5), (fn_minus, 1)]:\n        assert cachier_(fn)(3, 2) == expected\n        assert cachier_(fn)(a=3, b=2) == expected\n    assert count_p == 2\n    assert count_m == 2\n\n\ndef test_partial_handling(tmpdir):\n    count_p = count_m = 0\n\n    def fn_plus(a, b=2):\n        nonlocal count_p\n        count_p += 1\n        return a + b\n\n    def fn_minus(a, b=2):\n        nonlocal count_m\n        count_m += 1\n        return a - b\n\n    cachier_ = cachier.cachier(cache_dir=tmpdir)\n    assert count_p == count_m == 0\n\n    for fn, expected in [(fn_plus, 3), (fn_minus, -1)]:\n        dummy_ = functools.partial(fn, 1)\n        assert cachier_(dummy_)() == expected\n\n        dummy_ = functools.partial(fn, 1)\n        assert cachier_(dummy_)(2) == expected\n\n        dummy_ = functools.partial(fn, a=1)\n        assert cachier_(dummy_)() == expected\n\n        dummy_ = functools.partial(fn, b=2)\n        assert cachier_(dummy_)(1) == expected\n\n        dummy_ = functools.partial(fn, b=2)\n        assert cachier_(dummy_)(1, b=2) == expected\n\n        assert cachier_(fn)(1, 2) == expected\n        assert cachier_(fn)(a=1, b=2) == expected\n\n    assert count_p == 1\n    assert count_m == 1\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_defaults.py",
      "content": "import datetime\nimport os\nimport queue\nimport random\nimport threading\nimport time\n\nimport cachier\nimport pytest\n\nfrom tests.test_mongo_core import MONGO_DELTA, _test_mongetter\n\n_default_params = cachier.get_default_params().copy()\n\n\ndef setup_function():\n    cachier.set_default_params(**_default_params)\n\n\ndef teardown_function():\n    cachier.set_default_params(**_default_params)\n\n\ndef test_hash_func_default_param():\n    def slow_hash_func(args, kwds):\n        time.sleep(2)\n        return \"hash\"\n\n    def fast_hash_func(args, kwds):\n        return \"hash\"\n\n    cachier.set_default_params(hash_func=slow_hash_func)\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(hash_func=fast_hash_func)\n    def global_test_2():\n        return None\n\n    start = time.time()\n    global_test_1()\n    end = time.time()\n    assert end - start > 1\n    start = time.time()\n    global_test_2()\n    end = time.time()\n    assert end - start < 1\n\n\ndef test_backend_default_param():\n    cachier.set_default_params(backend=\"memory\")\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(backend=\"pickle\")\n    def global_test_2():\n        return None\n\n    assert global_test_1.cache_dpath() is None\n    assert global_test_2.cache_dpath() is not None\n\n\n@pytest.mark.mongo\ndef test_mongetter_default_param():\n    cachier.set_default_params(mongetter=_test_mongetter)\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(mongetter=False)\n    def global_test_2():\n        return None\n\n    assert global_test_1.cache_dpath() is None\n    assert global_test_2.cache_dpath() is not None\n\n\ndef test_cache_dir_default_param(tmpdir):\n    cachier.set_default_params(cache_dir=tmpdir / \"1\")\n\n    @cachier.cachier()\n    def global_test_1():\n        return None\n\n    @cachier.cachier(cache_dir=tmpdir / \"2\")\n    def global_test_2():\n        return None\n\n    assert global_test_1.cache_dpath() == str(tmpdir / \"1\")\n    assert global_test_2.cache_dpath() == str(tmpdir / \"2\")\n\n\ndef test_separate_files_default_param(tmpdir):\n    cachier.set_default_params(separate_files=True)\n\n    @cachier.cachier(cache_dir=tmpdir / \"1\")\n    def global_test_1(arg_1, arg_2):\n        return arg_1 + arg_2\n\n    @cachier.cachier(cache_dir=tmpdir / \"2\", separate_files=False)\n    def global_test_2(arg_1, arg_2):\n        return arg_1 + arg_2\n\n    global_test_1(1, 2)\n    global_test_1(3, 4)\n    global_test_2(1, 2)\n    global_test_2(3, 4)\n\n    assert len(os.listdir(global_test_1.cache_dpath())) == 2\n    assert len(os.listdir(global_test_2.cache_dpath())) == 1\n\n\ndef test_allow_none_default_param(tmpdir):\n    cachier.set_default_params(\n        allow_none=True,\n        separate_files=True,\n        verbose_cache=True,\n    )\n    allow_count = disallow_count = 0\n\n    @cachier.cachier(cache_dir=tmpdir)\n    def allow_none():\n        nonlocal allow_count\n        allow_count += 1\n        return None\n\n    @cachier.cachier(cache_dir=tmpdir, allow_none=False)\n    def disallow_none():\n        nonlocal disallow_count\n        disallow_count += 1\n        return None\n\n    assert allow_count == 0\n    allow_none()\n    allow_none()\n    assert allow_count == 1\n\n    assert disallow_count == 0\n    disallow_none()\n    disallow_none()\n    assert disallow_count == 2\n\n    disallow_none(cachier__allow_none=True)\n    disallow_none(cachier__allow_none=True)\n    assert disallow_count == 2\n\n\nPARAMETRIZE_TEST = (\n    \"backend,mongetter\",\n    [\n        pytest.param(\"pickle\", None, marks=pytest.mark.pickle),\n        pytest.param(\"mongo\", _test_mongetter, marks=pytest.mark.mongo),\n    ],\n)\n\n\n@pytest.mark.parametrize(*PARAMETRIZE_TEST)\ndef test_stale_after_applies_dynamically(backend, mongetter):\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def _stale_after_test(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random.random() + arg_1 + arg_2\n\n    cachier.set_default_params(stale_after=MONGO_DELTA)\n\n    _stale_after_test.clear_cache()\n    val1 = _stale_after_test(1, 2)\n    val2 = _stale_after_test(1, 2)\n    assert val1 == val2\n    time.sleep(3)\n    val3 = _stale_after_test(1, 2)\n    assert val3 != val1\n\n\n@pytest.mark.parametrize(*PARAMETRIZE_TEST)\ndef test_next_time_applies_dynamically(backend, mongetter):\n    NEXT_AFTER_DELTA = datetime.timedelta(seconds=3)\n\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def _stale_after_next_time(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random.random()\n\n    cachier.set_default_params(stale_after=NEXT_AFTER_DELTA, next_time=True)\n\n    _stale_after_next_time.clear_cache()\n    val1 = _stale_after_next_time(1, 2)\n    val2 = _stale_after_next_time(1, 2)\n    val3 = _stale_after_next_time(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    time.sleep(NEXT_AFTER_DELTA.seconds + 1)\n    val4 = _stale_after_next_time(1, 2)\n    assert val4 == val1\n    time.sleep(0.5)\n    val5 = _stale_after_next_time(1, 2)\n    assert val5 != val1\n    _stale_after_next_time.clear_cache()\n\n\n@pytest.mark.parametrize(*PARAMETRIZE_TEST)\ndef test_wait_for_calc_applies_dynamically(backend, mongetter):\n    \"\"\"Testing for calls timing out to be performed twice when needed.\"\"\"\n\n    @cachier.cachier(backend=backend, mongetter=mongetter)\n    def _wait_for_calc_timeout_slow(arg_1, arg_2):\n        time.sleep(3)\n        return random.random() + arg_1 + arg_2\n\n    def _calls_wait_for_calc_timeout_slow(res_queue):\n        res = _wait_for_calc_timeout_slow(1, 2)\n        res_queue.put(res)\n\n    cachier.set_default_params(wait_for_calc_timeout=2)\n    _wait_for_calc_timeout_slow.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_wait_for_calc_timeout_slow,\n        kwargs={\"res_queue\": res_queue},\n        daemon=True,\n    )\n\n    thread1.start()\n    thread2.start()\n    time.sleep(1)\n    res3 = _wait_for_calc_timeout_slow(1, 2)\n    time.sleep(5)\n    thread1.join(timeout=5)\n    thread2.join(timeout=5)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 != res2  # Timeout kicked in.  Two calls were done\n    res4 = _wait_for_calc_timeout_slow(1, 2)\n    # One of the cached values is returned\n    assert res1 == res4 or res2 == res4 or res3 == res4\n\n\ndef test_default_kwargs_handling():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a, b=2):\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    assert dummy_func(1) == 3\n    assert dummy_func(a=1) == 3\n    assert dummy_func(a=1, b=2) == 3\n    assert count == 1\n\n\ndef test_deprecated_func_kwargs():\n    count = 0\n\n    @cachier.cachier()\n    def dummy_func(a, b=2):\n        nonlocal count\n        count += 1\n        return a + b\n\n    dummy_func.clear_cache()\n    assert count == 0\n    with pytest.deprecated_call(\n        match=\"`verbose_cache` is deprecated and will be removed\"\n    ):\n        assert dummy_func(1, verbose_cache=True) == 3\n    assert count == 1\n    with pytest.deprecated_call(\n        match=\"`ignore_cache` is deprecated and will be removed\"\n    ):\n        assert dummy_func(1, ignore_cache=True) == 3\n    assert count == 2\n    with pytest.deprecated_call(\n        match=\"`overwrite_cache` is deprecated and will be removed\"\n    ):\n        assert dummy_func(1, overwrite_cache=True) == 3\n    assert count == 3\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_pickle_core.py",
      "content": "\"\"\"Test for the Cachier python package.\"\"\"\n\n# This file is part of Cachier.\n# https://github.com/python-cachier/cachier\n\n# Licensed under the MIT license:\n# http://www.opensource.org/licenses/MIT-license\n# Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n\n# from os.path import (\n#     realpath,\n#     dirname\n# )\nimport os\nimport pickle\nimport threading\nfrom datetime import timedelta\nfrom random import random\nfrom time import sleep, time\n\nimport pytest\n\ntry:\n    import queue\nexcept ImportError:  # python 2\n    import Queue as queue  # type: ignore\n\nimport hashlib\n\nimport pandas as pd\nfrom cachier import cachier\nfrom cachier.core import _default_params\n\n\ndef _get_decorated_func(func, **kwargs):\n    cachier_decorator = cachier(**kwargs)\n    decorated_func = cachier_decorator(func)\n    return decorated_func\n\n\n# Pickle core tests\n\n\ndef _takes_2_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)\n    return f\"arg_1:{arg_1}, arg_2:{arg_2}\"\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"reload\", [True, False])\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_core(reload, separate_files):\n    \"\"\"Basic Pickle core functionality.\"\"\"\n    _takes_2_seconds_decorated = _get_decorated_func(\n        _takes_2_seconds,\n        next_time=False,\n        pickle_reload=reload,\n        separate_files=separate_files,\n    )\n    _takes_2_seconds_decorated.clear_cache()\n    _takes_2_seconds_decorated(\"a\", \"b\")\n    start = time()\n    _takes_2_seconds_decorated(\"a\", \"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds_decorated.clear_cache()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_core_keywords(separate_files):\n    \"\"\"Basic Pickle core functionality with keyword arguments.\"\"\"\n    _takes_2_seconds_decorated = _get_decorated_func(\n        _takes_2_seconds, next_time=False, separate_files=separate_files\n    )\n    _takes_2_seconds_decorated.clear_cache()\n    _takes_2_seconds_decorated(\"a\", arg_2=\"b\")\n    start = time()\n    _takes_2_seconds_decorated(\"a\", arg_2=\"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds_decorated.clear_cache()\n\n\nSECONDS_IN_DELTA = 3\nDELTA = timedelta(seconds=SECONDS_IN_DELTA)\n\n\ndef _stale_after_seconds(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_stale_after(separate_files):\n    \"\"\"Testing the stale_after functionality.\"\"\"\n    _stale_after_seconds_decorated = _get_decorated_func(\n        _stale_after_seconds,\n        stale_after=DELTA,\n        next_time=False,\n        separate_files=separate_files,\n    )\n    _stale_after_seconds_decorated.clear_cache()\n    val1 = _stale_after_seconds_decorated(1, 2)\n    val2 = _stale_after_seconds_decorated(1, 2)\n    val3 = _stale_after_seconds_decorated(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(3)\n    val4 = _stale_after_seconds_decorated(1, 2)\n    assert val4 != val1\n    _stale_after_seconds_decorated.clear_cache()\n\n\ndef _stale_after_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    return random()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_stale_after_next_time(separate_files):\n    \"\"\"Testing the stale_after with next_time functionality.\"\"\"\n    _stale_after_next_time_decorated = _get_decorated_func(\n        _stale_after_next_time,\n        stale_after=DELTA,\n        next_time=True,\n        separate_files=separate_files,\n    )\n    _stale_after_next_time_decorated.clear_cache()\n    val1 = _stale_after_next_time_decorated(1, 2)\n    val2 = _stale_after_next_time_decorated(1, 2)\n    val3 = _stale_after_next_time_decorated(1, 3)\n    assert val1 == val2\n    assert val1 != val3\n    sleep(SECONDS_IN_DELTA + 1)\n    val4 = _stale_after_next_time_decorated(1, 2)\n    assert val4 == val1\n    sleep(0.5)\n    val5 = _stale_after_next_time_decorated(1, 2)\n    assert val5 != val1\n    _stale_after_next_time_decorated.clear_cache()\n\n\ndef _random_num():\n    return random()\n\n\ndef _random_num_with_arg(a):\n    # print(a)\n    return random()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_overwrite_cache(separate_files):\n    \"\"\"Tests that the overwrite feature works correctly.\"\"\"\n    _random_num_decorated = _get_decorated_func(\n        _random_num, separate_files=separate_files\n    )\n    _random_num_with_arg_decorated = _get_decorated_func(\n        _random_num_with_arg, separate_files=separate_files\n    )\n    _random_num_decorated.clear_cache()\n    int1 = _random_num_decorated()\n    int2 = _random_num_decorated()\n    assert int2 == int1\n    int3 = _random_num_decorated(cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num_decorated()\n    assert int4 == int3\n    _random_num_decorated.clear_cache()\n\n    _random_num_with_arg_decorated.clear_cache()\n    int1 = _random_num_with_arg_decorated(\"a\")\n    int2 = _random_num_with_arg_decorated(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg_decorated(\"a\", cachier__overwrite_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg_decorated(\"a\")\n    assert int4 == int3\n    _random_num_with_arg_decorated.clear_cache()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_ignore_cache(separate_files):\n    \"\"\"Tests that the ignore_cache feature works correctly.\"\"\"\n    _random_num_decorated = _get_decorated_func(\n        _random_num, separate_files=separate_files\n    )\n    _random_num_with_arg_decorated = _get_decorated_func(\n        _random_num_with_arg, separate_files=separate_files\n    )\n    _random_num_decorated.clear_cache()\n    int1 = _random_num_decorated()\n    int2 = _random_num_decorated()\n    assert int2 == int1\n    int3 = _random_num_decorated(cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num_decorated()\n    assert int4 != int3\n    assert int4 == int1\n    _random_num_decorated.clear_cache()\n\n    _random_num_with_arg_decorated.clear_cache()\n    int1 = _random_num_with_arg_decorated(\"a\")\n    int2 = _random_num_with_arg_decorated(\"a\")\n    assert int2 == int1\n    int3 = _random_num_with_arg_decorated(\"a\", cachier__skip_cache=True)\n    assert int3 != int1\n    int4 = _random_num_with_arg_decorated(\"a\")\n    assert int4 != int3\n    assert int4 == int1\n    _random_num_with_arg_decorated.clear_cache()\n\n\ndef _takes_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)  # this has to be enough time for check_calculation to run twice\n    return random() + arg_1 + arg_2\n\n\ndef _calls_takes_time(takes_time_func, res_queue):\n    res = takes_time_func(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_being_calculated(separate_files):\n    \"\"\"Testing pickle core handling of being calculated scenarios.\"\"\"\n    _takes_time_decorated = _get_decorated_func(\n        _takes_time, separate_files=separate_files\n    )\n    _takes_time_decorated.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time,\n        kwargs={\n            \"takes_time_func\": _takes_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time,\n        kwargs={\n            \"takes_time_func\": _takes_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=4)\n    thread2.join(timeout=4)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\ndef _being_calc_next_time(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\ndef _calls_being_calc_next_time(being_calc_func, res_queue):\n    res = being_calc_func(0.13, 0.02)\n    res_queue.put(res)\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_being_calc_next_time(separate_files):\n    \"\"\"Testing pickle core handling of being calculated scenarios.\"\"\"\n    _being_calc_next_time_decorated = _get_decorated_func(\n        _being_calc_next_time,\n        stale_after=timedelta(seconds=1),\n        next_time=True,\n        separate_files=separate_files,\n    )\n    _being_calc_next_time_decorated.clear_cache()\n    _being_calc_next_time(0.13, 0.02)\n    sleep(1.1)\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\n            \"being_calc_func\": _being_calc_next_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_being_calc_next_time,\n        kwargs={\n            \"being_calc_func\": _being_calc_next_time_decorated,\n            \"res_queue\": res_queue,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(0.5)\n    thread2.start()\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\ndef _bad_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n# _BAD_CACHE_FNAME = '.__main__._bad_cache'\n_BAD_CACHE_FNAME = \".tests.test_pickle_core._bad_cache\"\n_BAD_CACHE_FNAME_SEPARATE_FILES = (\n    \".tests.test_pickle_core._bad_cache_\"\n    f\"{hashlib.sha256(pickle.dumps((0.13, 0.02))).hexdigest()}\"\n)\nEXPANDED_CACHIER_DIR = os.path.expanduser(_default_params[\"cache_dir\"])\n_BAD_CACHE_FPATH = os.path.join(EXPANDED_CACHIER_DIR, _BAD_CACHE_FNAME)\n_BAD_CACHE_FPATH_SEPARATE_FILES = os.path.join(\n    EXPANDED_CACHIER_DIR, _BAD_CACHE_FNAME_SEPARATE_FILES\n)\n_BAD_CACHE_FPATHS = {\n    True: _BAD_CACHE_FPATH_SEPARATE_FILES,\n    False: _BAD_CACHE_FPATH,\n}\n\n\ndef _calls_bad_cache(bad_cache_func, res_queue, trash_cache, separate_files):\n    try:\n        res = bad_cache_func(0.13, 0.02, cachier__verbose=True)\n        if trash_cache:\n            with open(_BAD_CACHE_FPATHS[separate_files], \"w\") as cache_file:\n                cache_file.seek(0)\n                cache_file.truncate()\n        res_queue.put(res)\n    except Exception as exc:\n        res_queue.put(exc)\n\n\ndef _helper_bad_cache_file(sleeptime, separate_files):\n    \"\"\"Test pickle core handling of bad cache files.\"\"\"\n    _bad_cache_decorated = _get_decorated_func(\n        _bad_cache, separate_files=separate_files\n    )\n    _bad_cache_decorated.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_bad_cache,\n        kwargs={\n            \"bad_cache_func\": _bad_cache_decorated,\n            \"res_queue\": res_queue,\n            \"trash_cache\": True,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_bad_cache,\n        kwargs={\n            \"bad_cache_func\": _bad_cache_decorated,\n            \"res_queue\": res_queue,\n            \"trash_cache\": False,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(sleeptime)\n    thread2.start()\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    if res_queue.qsize() != 2:\n        return False\n    res1 = res_queue.get()\n    if not isinstance(res1, float):\n        return False\n    res2 = res_queue.get()\n    if res2 is not None or isinstance(res2, KeyError):\n        return False\n    return True\n\n\n# we want this to succeed at least once\n@pytest.mark.pickle\n@pytest.mark.xfail\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_bad_cache_file(separate_files):\n    \"\"\"Test pickle core handling of bad cache files.\"\"\"\n    sleeptimes = [0.1, 0.2, 0.3, 0.5, 0.6, 0.7, 0.8, 1, 1.5, 2]\n    sleeptimes = sleeptimes + sleeptimes\n    for sleeptime in sleeptimes:\n        if _helper_bad_cache_file(sleeptime, separate_files):\n            return\n    raise AssertionError()\n\n\ndef _delete_cache(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(1)\n    return random() + arg_1 + arg_2\n\n\n# _DEL_CACHE_FNAME = '.__main__._delete_cache'\n_DEL_CACHE_FNAME = \".tests.test_pickle_core._delete_cache\"\n_DEL_CACHE_FNAME_SEPARATE_FILES = (\n    \".tests.test_pickle_core._delete_cache_\"\n    f\"{hashlib.sha256(pickle.dumps((0.13, 0.02))).hexdigest()}\"\n)\n_DEL_CACHE_FPATH = os.path.join(EXPANDED_CACHIER_DIR, _DEL_CACHE_FNAME)\n_DEL_CACHE_FPATH_SEPARATE_FILES = os.path.join(\n    EXPANDED_CACHIER_DIR, _DEL_CACHE_FNAME_SEPARATE_FILES\n)\n_DEL_CACHE_FPATHS = {\n    True: _DEL_CACHE_FPATH_SEPARATE_FILES,\n    False: _DEL_CACHE_FPATH,\n}\n\n\ndef _calls_delete_cache(del_cache_func, res_queue, del_cache, separate_files):\n    try:\n        # print('in')\n        res = del_cache_func(0.13, 0.02)\n        # print('out with {}'.format(res))\n        if del_cache:\n            os.remove(_DEL_CACHE_FPATHS[separate_files])\n            # print(os.path.isfile(_DEL_CACHE_FPATH))\n        res_queue.put(res)\n    except Exception as exc:\n        # print('found')\n        res_queue.put(exc)\n\n\ndef _helper_delete_cache_file(sleeptime, separate_files):\n    \"\"\"Test pickle core handling of missing cache files.\"\"\"\n    _delete_cache_decorated = _get_decorated_func(\n        _delete_cache, separate_files=separate_files\n    )\n    _delete_cache_decorated.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_delete_cache,\n        kwargs={\n            \"del_cache_func\": _delete_cache_decorated,\n            \"res_queue\": res_queue,\n            \"del_cache\": True,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread2 = threading.Thread(\n        target=_calls_delete_cache,\n        kwargs={\n            \"del_cache_func\": _delete_cache_decorated,\n            \"res_queue\": res_queue,\n            \"del_cache\": False,\n            \"separate_files\": separate_files,\n        },\n        daemon=True,\n    )\n    thread1.start()\n    sleep(sleeptime)\n    thread2.start()\n    thread1.join(timeout=2)\n    thread2.join(timeout=2)\n    if res_queue.qsize() != 2:\n        return False\n    res1 = res_queue.get()\n    # print(res1)\n    if not isinstance(res1, float):\n        return False\n    res2 = res_queue.get()\n    if not ((isinstance(res2, KeyError)) or (res2 is None)):\n        return False\n    return True\n    # print(res2)\n    # print(type(res2))\n\n\n@pytest.mark.pickle\n@pytest.mark.xfail\n@pytest.mark.parametrize(\"separate_files\", [False, True])\ndef test_delete_cache_file(separate_files):\n    \"\"\"Test pickle core handling of missing cache files.\"\"\"\n    sleeptimes = [0.1, 0.2, 0.3, 0.5, 0.7, 1]\n    sleeptimes = sleeptimes * 4\n    for sleeptime in sleeptimes:\n        if _helper_delete_cache_file(sleeptime, separate_files):\n            return\n    raise AssertionError()\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [False, True])\ndef test_clear_being_calculated(separate_files):\n    \"\"\"Test pickle core clear `being calculated` functionality.\"\"\"\n    _takes_time_decorated = _get_decorated_func(\n        _takes_time, separate_files=separate_files\n    )\n    _takes_time_decorated.clear_being_calculated()\n\n\ndef _error_throwing_func(arg1):\n    if not hasattr(_error_throwing_func, \"count\"):\n        _error_throwing_func.count = 0\n    _error_throwing_func.count += 1\n    if _error_throwing_func.count > 1:\n        raise ValueError(\"Tiny Rick!\")\n    return 7\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_error_throwing_func(separate_files):\n    # with\n    _error_throwing_func.count = 0\n    _error_throwing_func_decorated = _get_decorated_func(\n        _error_throwing_func,\n        stale_after=timedelta(seconds=1),\n        next_time=True,\n        separate_files=separate_files,\n    )\n    _error_throwing_func_decorated.clear_cache()\n    res1 = _error_throwing_func_decorated(4)\n    sleep(1.5)\n    res2 = _error_throwing_func_decorated(4)\n    assert res1 == res2\n\n\n# test custom cache dir for pickle core\n\nCUSTOM_DIR = \"~/.exparrot\"\nEXPANDED_CUSTOM_DIR = os.path.expanduser(CUSTOM_DIR)\n\n\ndef _takes_2_seconds_custom_dir(arg_1, arg_2):\n    \"\"\"Some function.\"\"\"\n    sleep(2)\n    return f\"arg_1:{arg_1}, arg_2:{arg_2}\"\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_pickle_core_custom_cache_dir(separate_files):\n    \"\"\"Basic Pickle core functionality.\"\"\"\n    _takes_2_seconds_custom_dir_decorated = _get_decorated_func(\n        _takes_2_seconds_custom_dir,\n        next_time=False,\n        cache_dir=CUSTOM_DIR,\n        separate_files=separate_files,\n    )\n    _takes_2_seconds_custom_dir_decorated.clear_cache()\n    _takes_2_seconds_custom_dir_decorated(\"a\", \"b\")\n    start = time()\n    _takes_2_seconds_custom_dir_decorated(\"a\", \"b\", cachier__verbose=True)\n    end = time()\n    assert end - start < 1\n    _takes_2_seconds_custom_dir_decorated.clear_cache()\n    path2test = _takes_2_seconds_custom_dir_decorated.cache_dpath()\n    assert path2test == EXPANDED_CUSTOM_DIR\n\n\n@pytest.mark.pickle\n@pytest.mark.parametrize(\"separate_files\", [True, False])\ndef test_callable_hash_param(separate_files):\n    def _hash_func(args, kwargs):\n        def _hash(obj):\n            if isinstance(obj, pd.core.frame.DataFrame):\n                return hashlib.sha256(\n                    pd.util.hash_pandas_object(obj).values.tobytes()\n                ).hexdigest()\n            return obj\n\n        k_args = tuple(map(_hash, args))\n        k_kwargs = tuple(\n            sorted({k: _hash(v) for k, v in kwargs.items()}.items())\n        )\n        return k_args + k_kwargs\n\n    @cachier(hash_func=_hash_func, separate_files=separate_files)\n    def _params_with_dataframe(*args, **kwargs):\n        \"\"\"Some function.\"\"\"\n        return random()\n\n    _params_with_dataframe.clear_cache()\n\n    df_a = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    df_b = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    value_a = _params_with_dataframe(df_a, 1)\n    value_b = _params_with_dataframe(df_b, 1)\n\n    assert value_a == value_b  # same content --> same key\n\n    value_a = _params_with_dataframe(1, df=df_a)\n    value_b = _params_with_dataframe(1, df=df_b)\n\n    assert value_a == value_b  # same content --> same key\n"
    },
    {
      "path": "python-cachier_cachier/tests/test_mongo_core.py",
      "content": "\"\"\"Testing the MongoDB core of cachier.\"\"\"\n\nimport datetime\nimport hashlib\nimport platform\nimport queue\nimport sys\nimport threading\nfrom datetime import timedelta\nfrom random import random\nfrom time import sleep\nfrom urllib.parse import quote_plus\n\nimport pandas as pd\nimport pymongo\nimport pytest\nfrom birch import Birch  # type: ignore[import-not-found]\nfrom cachier import cachier\nfrom cachier.cores.base import RecalculationNeeded\nfrom cachier.cores.mongo import _MongoCore\nfrom pymongo.errors import OperationFailure\nfrom pymongo.mongo_client import MongoClient\nfrom pymongo_inmemory import MongoClient as InMemoryMongoClient\n\n# === Enables testing vs a real MongoDB instance ===\n\n\nclass CfgKey:\n    HOST = \"TEST_HOST\"\n    PORT = \"TEST_PORT\"\n    # UNAME = \"TEST_USERNAME\"\n    # PWD = \"TEST_PASSWORD\"\n    # DB = \"TEST_DB\"\n    TEST_VS_DOCKERIZED_MONGO = \"TEST_VS_DOCKERIZED_MONGO\"\n\n\nCFG = Birch(\n    namespace=\"cachier\",\n    defaults={CfgKey.TEST_VS_DOCKERIZED_MONGO: False},\n)\n\n\n# URI_TEMPLATE = \"mongodb://myUser:myPassword@localhost:27017/\"\nURI_TEMPLATE = \"mongodb://{host}:{port}?retrywrites=true&w=majority\"\n\n\ndef _get_cachier_db_mongo_client():\n    host = quote_plus(CFG[CfgKey.HOST])\n    port = quote_plus(CFG[CfgKey.PORT])\n    # uname = quote_plus(CFG[CfgKey.UNAME])\n    # pwd = quote_plus(CFG[CfgKey.PWD])\n    # db = quote_plus(CFG[CfgKey.DB])\n    uri = f\"mongodb://{host}:{port}?retrywrites=true&w=majority\"\n    return MongoClient(uri)\n\n\n_COLLECTION_NAME = (\n    f\"cachier_test_{platform.system()}\"\n    f\"_{'.'.join(map(str, sys.version_info[:3]))}\"\n)\n\n\ndef _test_mongetter():\n    if not hasattr(_test_mongetter, \"client\"):\n        if str(CFG.mget(CfgKey.TEST_VS_DOCKERIZED_MONGO)).lower() == \"true\":\n            print(\"Using live MongoDB instance for testing.\")\n            _test_mongetter.client = _get_cachier_db_mongo_client()\n        else:\n            print(\"Using in-memory MongoDB instance for testing.\")\n            _test_mongetter.client = InMemoryMongoClient()\n    db_obj = _test_mongetter.client[\"cachier_test\"]\n    if _COLLECTION_NAME not in db_obj.list_collection_names():\n        db_obj.create_collection(_COLLECTION_NAME)\n    return db_obj[_COLLECTION_NAME]\n\n\n# === Mongo core tests ===\n\n\n@pytest.mark.mongo\ndef test_information():\n    print(\"\\npymongo version: \", end=\"\")\n    print(pymongo.__version__)\n\n\n@pytest.mark.mongo\ndef test_mongo_index_creation():\n    \"\"\"Basic Mongo core functionality.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _test_mongo_caching(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    collection = _test_mongetter()\n    _test_mongo_caching.clear_cache()\n    val1 = _test_mongo_caching(1, 2)\n    val2 = _test_mongo_caching(1, 2)\n    assert val1 == val2\n    assert _MongoCore._INDEX_NAME in collection.index_information()\n\n\n@pytest.mark.mongo\ndef test_mongo_core():\n    \"\"\"Basic Mongo core functionality.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _test_mongo_caching(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _test_mongo_caching.clear_cache()\n    val1 = _test_mongo_caching(1, 2)\n    val2 = _test_mongo_caching(1, 2)\n    assert val1 == val2\n    val3 = _test_mongo_caching(1, 2, cachier__skip_cache=True)\n    assert val3 != val1\n    val4 = _test_mongo_caching(1, 2)\n    assert val4 == val1\n    val5 = _test_mongo_caching(1, 2, cachier__overwrite_cache=True)\n    assert val5 != val1\n    val6 = _test_mongo_caching(1, 2)\n    assert val6 == val5\n\n\n@pytest.mark.mongo\ndef test_mongo_core_keywords():\n    \"\"\"Basic Mongo core functionality with keyword arguments.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _test_mongo_caching(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _test_mongo_caching.clear_cache()\n    val1 = _test_mongo_caching(1, arg_2=2)\n    val2 = _test_mongo_caching(1, arg_2=2)\n    assert val1 == val2\n    val3 = _test_mongo_caching(1, arg_2=2, cachier__skip_cache=True)\n    assert val3 != val1\n    val4 = _test_mongo_caching(1, arg_2=2)\n    assert val4 == val1\n    val5 = _test_mongo_caching(1, arg_2=2, cachier__overwrite_cache=True)\n    assert val5 != val1\n    val6 = _test_mongo_caching(1, arg_2=2)\n    assert val6 == val5\n\n\nMONGO_DELTA = timedelta(seconds=3)\nMONGO_DELTA_LONG = timedelta(seconds=10)\n\n\n@pytest.mark.mongo\ndef test_mongo_stale_after():\n    \"\"\"Testing MongoDB core stale_after functionality.\"\"\"\n\n    @cachier(\n        mongetter=_test_mongetter, stale_after=MONGO_DELTA, next_time=False\n    )\n    def _stale_after_mongo(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _stale_after_mongo.clear_cache()\n    val1 = _stale_after_mongo(1, 2)\n    val2 = _stale_after_mongo(1, 2)\n    assert val1 == val2\n    sleep(3)\n    val3 = _stale_after_mongo(1, 2)\n    assert val3 != val1\n\n\ndef _calls_takes_time(res_queue):\n    @cachier(mongetter=_test_mongetter)\n    def _takes_time(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        sleep(3)\n        return random() + arg_1 + arg_2\n\n    res = _takes_time(34, 82.3)\n    res_queue.put(res)\n\n\n@pytest.mark.mongo\ndef test_mongo_being_calculated():\n    \"\"\"Testing MongoDB core handling of being calculated scenarios.\"\"\"\n\n    @cachier(mongetter=_test_mongetter)\n    def _takes_time(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        sleep(3)\n        return random() + arg_1 + arg_2\n\n    _takes_time.clear_cache()\n    res_queue = queue.Queue()\n    thread1 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread2 = threading.Thread(\n        target=_calls_takes_time, kwargs={\"res_queue\": res_queue}, daemon=True\n    )\n    thread1.start()\n    sleep(1)\n    thread2.start()\n    thread1.join(timeout=4)\n    thread2.join(timeout=4)\n    assert res_queue.qsize() == 2\n    res1 = res_queue.get()\n    res2 = res_queue.get()\n    assert res1 == res2\n\n\nclass _BadMongoCollection:\n    def __init__(self, mongetter):\n        self.collection = mongetter()\n        self.index_information = self.collection.index_information\n        self.create_indexes = self.collection.create_indexes\n        self.find_one = self.collection.find_one\n\n    def delete_many(self, *args, **kwargs):\n        pass\n\n    def update_many(self, *args, **kwargs):\n        pass\n\n    def update_one(self, *args, **kwargs):\n        raise OperationFailure(Exception())\n\n\ndef _bad_mongetter():\n    return _BadMongoCollection(_test_mongetter)\n\n\n@pytest.mark.mongo\ndef test_mongo_write_failure():\n    \"\"\"Testing MongoDB core handling of writing failure scenarios.\"\"\"\n\n    @cachier(mongetter=_bad_mongetter)\n    def _func_w_bad_mongo(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    with pytest.raises(OperationFailure):\n        val1 = _func_w_bad_mongo(1, 2)\n        val2 = _func_w_bad_mongo(1, 2)\n        assert val1 == val2\n\n\n@pytest.mark.mongo\ndef test_mongo_clear_being_calculated():\n    \"\"\"Testing MongoDB core clear_being_calculated.\"\"\"\n\n    @cachier(mongetter=_bad_mongetter)\n    def _func_w_bad_mongo(arg_1, arg_2):\n        \"\"\"Some function.\"\"\"\n        return random() + arg_1 + arg_2\n\n    _func_w_bad_mongo.clear_being_calculated()\n\n\n@pytest.mark.mongo\ndef test_stalled_mongo_db_cache():\n    @cachier(mongetter=_test_mongetter)\n    def _stalled_func():\n        return 1\n\n    core = _MongoCore(None, _test_mongetter, 0)\n    core.set_func(_stalled_func)\n    core.clear_cache()\n    with pytest.raises(RecalculationNeeded):\n        core.wait_on_entry_calc(key=None)\n\n\n@pytest.mark.mongo\ndef test_stalled_mong_db_core(monkeypatch):\n    def mock_get_entry(self, args, kwargs):\n        return \"key\", {\"being_calculated\": True}\n\n    def mock_get_entry_by_key(self, key):\n        return \"key\", None\n\n    monkeypatch.setattr(\n        \"cachier.cores.mongo._MongoCore.get_entry\", mock_get_entry\n    )\n    monkeypatch.setattr(\n        \"cachier.cores.mongo._MongoCore.get_entry_by_key\",\n        mock_get_entry_by_key,\n    )\n\n    @cachier(mongetter=_test_mongetter)\n    def _stalled_func():\n        return 1\n\n    res = _stalled_func()\n    assert res == 1\n\n    def mock_get_entry_2(self, args, kwargs):\n        entry = {\n            \"being_calculated\": True,\n            \"value\": 1,\n            \"time\": datetime.datetime.now() - datetime.timedelta(seconds=10),\n        }\n        return \"key\", entry\n\n    monkeypatch.setattr(\n        \"cachier.cores.mongo._MongoCore.get_entry\", mock_get_entry_2\n    )\n\n    stale_after = datetime.timedelta(seconds=1)\n\n    @cachier(mongetter=_test_mongetter, stale_after=stale_after)\n    def _stalled_func_2():\n        \"\"\"Testing stalled function.\"\"\"\n        return 2\n\n    res = _stalled_func_2()\n    assert res == 2\n\n    @cachier(\n        mongetter=_test_mongetter, stale_after=stale_after, next_time=True\n    )\n    def _stalled_func_3():\n        \"\"\"Testing stalled function.\"\"\"\n        return 3\n\n    res = _stalled_func_3()\n    assert res == 1\n\n\n@pytest.mark.mongo\ndef test_callable_hash_param():\n    def _hash_func(args, kwargs):\n        def _hash(obj):\n            if isinstance(obj, pd.core.frame.DataFrame):\n                return hashlib.sha256(\n                    pd.util.hash_pandas_object(obj).values.tobytes()\n                ).hexdigest()\n            return obj\n\n        k_args = tuple(map(_hash, args))\n        k_kwargs = tuple(\n            sorted({k: _hash(v) for k, v in kwargs.items()}.items())\n        )\n        return k_args + k_kwargs\n\n    @cachier(mongetter=_test_mongetter, hash_func=_hash_func)\n    def _params_with_dataframe(*args, **kwargs):\n        \"\"\"Some function.\"\"\"\n        return random()\n\n    _params_with_dataframe.clear_cache()\n\n    df_a = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    df_b = pd.DataFrame.from_dict({\"a\": [0], \"b\": [2], \"c\": [3]})\n    value_a = _params_with_dataframe(df_a, 1)\n    value_b = _params_with_dataframe(df_b, 1)\n\n    assert value_a == value_b  # same content --> same key\n\n    value_a = _params_with_dataframe(1, df=df_a)\n    value_b = _params_with_dataframe(1, df=df_b)\n\n    assert value_a == value_b  # same content --> same key\n"
    },
    {
      "path": "python-cachier_cachier/tests/requirements.txt",
      "content": "# todo: add some version range or pinning latest versions\n# tests and coverages\npytest\ncoverage\npytest-cov\nbirch\n# to connect to the test mongodb server\npymongo\ndnspython\npymongo-inmemory\n# to test pandas dataframe as-param hashing with mongodb core\npandas\n# to be able to run `python setup.py checkdocs`\ncollective.checkdocs\npygments\n"
    }
  ],
  "Patch": "--- a/python-cachier_cachier/src/cachier/cores/mongo.py\n+++ b/python-cachier_cachier/src/cachier/cores/mongo.py\n@@ -135,7 +135,7 @@\n             self.check_calc_timeout(time_spent)\n \n     def clear_cache(self):\n-        pass\n+        self.mongo_collection.delete_many(filter={\"func\": self._func_str})\n \n     def clear_being_calculated(self):\n         self.mongo_collection.update_many(\n",
  "BuggyCodeLocation": [
    {
      "file": "python-cachier_cachier/src/cachier/cores/mongo.py",
      "function": null,
      "content_all": {
        "135": "            self.check_calc_timeout(time_spent)\n",
        "138": "        pass\n"
      },
      "content_change": {
        "138": "        pass\n"
      }
    }
  ],
  "Source": "Human",
  "Command": [
    "pytest tests"
  ],
  "Token": 1446,
  "FilteredCode": [
    {
      "path": "python-cachier_cachier/src/cachier/cores/mongo.py",
      "content": "1 \"\"\"A MongoDB-based caching core for cachier.\"\"\"\n2 \n3 # This file is part of Cachier.\n4 # https://github.com/python-cachier/cachier\n5 \n6 # Licensed under the MIT license:\n7 # http://www.opensource.org/licenses/MIT-license\n8 # Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n9 \n10 import pickle  # for serialization of python objects\n11 import sys  # to make sure that pymongo was imported\n12 import time  # to sleep when waiting on Mongo cache\\\n13 import warnings  # to warn if pymongo is missing\n14 from contextlib import suppress\n15 from datetime import datetime\n16 \n17 from .._types import HashFunc, Mongetter\n18 \n19 with suppress(ImportError):\n20     from bson.binary import Binary  # to save binary data to mongodb\n21     from pymongo import ASCENDING, IndexModel\n22     from pymongo.errors import OperationFailure\n23 \n24 from .base import RecalculationNeeded, _BaseCore, _get_func_str\n25 \n26 MONGO_SLEEP_DURATION_IN_SEC = 1\n27 \n28 \n29 class MissingMongetter(ValueError):\n30     \"\"\"Thrown when the mongetter keyword argument is missing.\"\"\"\n31 \n32 \n33 class _MongoCore(_BaseCore):\n34     _INDEX_NAME = \"func_1_key_1\"\n35 \n36     def __init__(\n37         self,\n38         hash_func: HashFunc,\n39         mongetter: Mongetter,\n40         wait_for_calc_timeout: int,\n41     ):\n42         if \"pymongo\" not in sys.modules:\n43             warnings.warn(\n44                 \"`pymongo` was not found. MongoDB cores will not function.\",\n45                 ImportWarning,\n46                 stacklevel=2,\n47             )  # pragma: no cover\n48 \n49         super().__init__(hash_func, wait_for_calc_timeout)\n50         if mongetter is None:\n51             raise MissingMongetter(\n52                 \"must specify ``mongetter`` when using the mongo core\"\n53             )\n54         self.mongetter = mongetter\n55         self.mongo_collection = self.mongetter()\n56         index_inf = self.mongo_collection.index_information()\n57         if _MongoCore._INDEX_NAME not in index_inf:\n58             func1key1 = IndexModel(\n59                 keys=[(\"func\", ASCENDING), (\"key\", ASCENDING)],\n60                 name=_MongoCore._INDEX_NAME,\n61             )\n62             self.mongo_collection.create_indexes([func1key1])\n63 \n64     @property\n65     def _func_str(self) -> str:\n66         return _get_func_str(self.func)\n67 \n68     def get_entry_by_key(self, key):\n69         res = self.mongo_collection.find_one(\n70             {\"func\": self._func_str, \"key\": key}\n71         )\n72         if not res:\n73             return key, None\n74         try:\n75             entry = {\n76                 \"value\": pickle.loads(res[\"value\"]),  # noqa: S301\n77                 \"time\": res.get(\"time\", None),\n78                 \"stale\": res.get(\"stale\", False),\n79                 \"being_calculated\": res.get(\"being_calculated\", False),\n80             }\n81         except KeyError:\n82             entry = {\n83                 \"value\": None,\n84                 \"time\": res.get(\"time\", None),\n85                 \"stale\": res.get(\"stale\", False),\n86                 \"being_calculated\": res.get(\"being_calculated\", False),\n87             }\n88         return key, entry\n89 \n90     def set_entry(self, key, func_res):\n91         thebytes = pickle.dumps(func_res)\n92         self.mongo_collection.update_one(\n93             filter={\"func\": self._func_str, \"key\": key},\n94             update={\n95                 \"$set\": {\n96                     \"func\": self._func_str,\n97                     \"key\": key,\n98                     \"value\": Binary(thebytes),\n99                     \"time\": datetime.now(),\n100                     \"stale\": False,\n101                     \"being_calculated\": False,\n102                 }\n103             },\n104             upsert=True,\n105         )\n106 \n107     def mark_entry_being_calculated(self, key):\n108         self.mongo_collection.update_one(\n109             filter={\"func\": self._func_str, \"key\": key},\n110             update={\"$set\": {\"being_calculated\": True}},\n111             upsert=True,\n112         )\n113 \n114     def mark_entry_not_calculated(self, key):\n115         with suppress(OperationFailure):  # don't care in this case\n116             self.mongo_collection.update_one(\n117                 filter={\n118                     \"func\": self._func_str,\n119                     \"key\": key,\n120                 },\n121                 update={\"$set\": {\"being_calculated\": False}},\n122                 upsert=False,  # should not insert in this case\n123             )\n124 \n125     def wait_on_entry_calc(self, key):\n126         time_spent = 0\n127         while True:\n128             time.sleep(MONGO_SLEEP_DURATION_IN_SEC)\n129             time_spent += MONGO_SLEEP_DURATION_IN_SEC\n130             key, entry = self.get_entry_by_key(key)\n131             if entry is None:\n132                 raise RecalculationNeeded()\n133             if not entry[\"being_calculated\"]:\n134                 return entry[\"value\"]\n135             self.check_calc_timeout(time_spent)\n136 \n137     def clear_cache(self):\n138         pass\n139 \n140     def clear_being_calculated(self):\n141         self.mongo_collection.update_many(\n142             filter={\n143                 \"func\": self._func_str,\n144                 \"being_calculated\": True,\n145             },\n146             update={\"$set\": {\"being_calculated\": False}},\n147         )"
    },
    {
      "path": "python-cachier_cachier/src/cachier/cores/base.py",
      "content": "1 \"\"\"Defines the interface of a cachier caching core.\"\"\"\n2 # This file is part of Cachier.\n3 # https://github.com/python-cachier/cachier\n4 \n5 # Licensed under the MIT license:\n6 # http://www.opensource.org/licenses/MIT-lic(...truncated)"
    },
    {
      "path": "python-cachier_cachier/README.rst",
      "content": "1 Cachier\n2 #######\n3 \n4 |PyPI-Status| |Downloads| |PyPI-Versions| |Build-Status| |Codecov| |Codefactor| |LICENCE|\n5 \n6 Persistent, sta(...truncated)"
    },
    {
      "path": "python-cachier_cachier/src/cachier/core.py",
      "content": "1 \"\"\"Persistent, stale-free memoization decorators fo(...truncated)"
    }
  ],
  "TokenAll": 34844,
  "FilteredLevel": 1500,
  "Results": {
    "model": "GPT-4o",
    "Difficulty": "Simple",
    "issue_origin": {
      "title": "Unused import statement",
      "description": "The import statement 'from datetime import datetime' on line 15 is not used in the code. Consider removing this unused import to improve code readability and maintainability.",
      "explanation": "### Issue Summary\n\nThe issue at hand is an unused import statement 'from datetime import datetime' on line 15 of the file `python-cachier_cachier/src/cachier/cores/mongo.py`. It has been flagged because it is not utilized anywhere in the codebase, and its presence could be misleading or cluttering.\n\n### Content of the Commit\n\nHere's what the commit might look like to address this issue:\n\n```patch\ndiff --git a/python-cachier_cachier/src/cachier/cores/mongo.py b/python-cachier_cachier/src/cachier/cores/mongo.py\nindex abcdef1..1234567 100644\n--- a/python-cachier_cachier/src/cachier/cores/mongo.py\n+++ b/python-cachier_cachier/src/cachier/cores/mongo.py\n@@ -12,7 +12,6 @@ import warnings  # to warn if pymongo is missing\n from contextlib import suppress\n from datetime import datetime\n \nimport pickle  # for serialization of python objects\n import sys  # to make sure that pymongo was imported\n import time  # to sleep when waiting on Mongo cache\n import warnings  # to warn if pymongo is missing\n@@ -14,7 +13,6 @@ import warnings  # to warn if pymongo is missing\n from contextlib import suppress\n\n-from datetime import datetime\n\n from .._types import HashFunc, Mongetter\n```\n\n### Explanation of How the Commit Solves the Issue\n\n1. **Identifying the Unused Import**:\n   The problematic import statement is `from datetime import datetime` located on line 15. Despite being imported, `datetime` is not utilized in any part of the `mongo.py` file.\n   \n2. **Purpose of the Commit**:\n   The commit removes the import statement since it is redundant. By doing so, the code becomes cleaner and easier to maintain. Removing unused imports improves readability, helps in maintaining a clean codebase, and may even slightly reduce the memory overhead.\n\n3. **Steps of the Commit**:\n   - The commit retains the comments and the purpose of previous imports while solely removing the line containing `from datetime import datetime`.\n\n   Here’s how the code looks after the removal:\n   ```python\n   14 from contextlib import suppress\n   15 \n   16 import pickle  # for serialization of python objects\n   17 import sys  # to make sure that pymongo was imported\n   18 import time  # to sleep when waiting on Mongo cache\n   19 import warnings  # to warn if pymongo is missing\n   ```\n\n4. **Verifying the Impact**:\n   - The code continues to operate as expected since the functionality has not been altered—only an unused import has been removed.\n   - The readability and maintainability are enhanced. Future developers or maintainers won’t be confused by the presence of an unused import statement.\n\n### Conclusion\n\nBy removing the unused `from datetime import datetime` import statement, the commit addresses the issue of enhancing code cleanliness and maintainability. It ensures that only necessary components are loaded, thereby promoting a more streamlined and efficient codebase. This change refines the module without impacting the existing functionality."
    },
    "issue_message": {
      "title": "Unused Method clear_cache",
      "description": "The method 'clear_cache' in the class '_MongoCore' is defined but not implemented. It's currently a no-op (no operation) function. Consider implementing this method or removing it if it's not needed.",
      "explanation": "### Issue Summary\n\nThe issue at hand involves an unused method named `clear_cache` in the `_MongoCore` class. This method is defined but lacks an implementation—it is a no-op (no operation) function. The suggestion is to either implement this method or remove it if it's not necessary.\n\n### Content of the Commit\n\nThe commit addresses the issue identified in the `_MongoCore` class within the MongoDB caching core for Cachier. The specific change involves replacing the no-op `clear_cache` method with an actual implementation. \n\nHere's the updated portion of the code with the implemented `clear_cache` method:\n\n```python\n137     def clear_cache(self):\n138         self.mongo_collection.delete_many({\"func\": self._func_str})\n```\n\n### Explanation of How the Commit Solves the Issue\n\n1. **Identification of an Empty Method**: Initially, the `clear_cache` method simply had a `pass` statement, effectively making it a placeholder with no logic behind it. This contributes no functionality to the class and could lead to confusion or maintenance issues.\n\n2. **Purpose of `clear_cache` Method**: The intended purpose of `clear_cache` in a caching mechanism is typically to remove existing cached data, thereby freeing up storage and ensuring that future cache queries do not return stale or outdated data.\n\n3. **Implementation Details**: \n   - The updated method, `clear_cache`, uses the MongoDB collection's `delete_many` method.\n   - The method deletes all documents within the collection where the `func` field matches `self._func_str`, which is a string representation of the function being cached.\n   - This ensures that the cache is cleared only for the current function, allowing other cached functions to remain unaffected.\n\n4. **Integration with Existing Code**:\n   - This new implementation aligns well with the existing structure of the `_MongoCore` class, which already utilizes MongoDB operations for cache management (e.g., `get_entry_by_key`, `set_entry`, `mark_entry_being_calculated`).\n   - By using `self._func_str`, the method ensures scoped cache clearing, maintaining consistency with other class methods that use `self._func_str` to identify cache entries.\n\n### Final Thoughts\n\nThe commit effectively resolves the issue by providing a meaningful implementation for the `clear_cache` method. This change contributes positively to the maintainability and functionality of the `_MongoCore` class by enabling developers and users to clear the cache for specific functions, reducing potential issues related to stale cache data. With this implementation, the class now adheres to expected caching behaviors, reinforcing its utility within the Cachier framework."
    },
    "issue_ground": {
      "title": "MongoDB Cache Clearing Functionality Not Implemented",
      "description": "The `clear_cache` method in the MongoDB core (`mongo.py`) is not implemented. This method is crucial for maintaining the integrity and manageability of the MongoDB cache by allowing users to clear the cache when necessary. The absence of this functionality means users cannot reset or clear the MongoDB cache, which can lead to stale or outdated data persisting in the cache. This could affect applications relying on fresh data, causing inconsistencies and potential issues in data processing and retrieval. Implementing this method is essential to provide users with a way to manage their MongoDB cache effectively.",
      "explanation": "## Issue Summary\nThe issue, titled \"MongoDB Cache Clearing Functionality Not Implemented,\" highlights that the `clear_cache` method within the `mongo.py` file is currently not implemented. This method is important for maintaining the cache integrity by allowing users to clear stale or outdated data. Without this feature, users cannot reset the MongoDB cache, which might lead to inconsistencies and outdated data affecting applications that rely on real-time data. The goal is to implement this method to provide users with the capability to manage their MongoDB cache effectively.\n\n## Commit Implementation\nLet's install the hypothetical commit to solve the issue. The commit will define and implement the `clear_cache` method within the `_MongoCore` class. To do this, it will:\n\n1. Define the `clear_cache` method if not already present.\n2. Implement logic to remove entries from the MongoDB cache.\n\nHere's a possible implementation for the commit based on the standard logic to clear a MongoDB collection:\n\n```python\n137    def clear_cache(self):\n138        \"\"\"\n139        Clears the entire cache by removing all entries\n140        from the MongoDB collection associated with this cache.\n141        \"\"\"\n142        self.mongo_collection.delete_many({})\n```\n\n## Explanation of the Changes\n\n- **Line 137-140:** Define the `clear_cache` method. The triple-quoted string is a docstring that provides a brief description of what the method does.\n- **Line 142:** Executes a `delete_many` operation with an empty filter (`{}`), which means it will delete all documents in the `mongo_collection`.\n\n## How the Commit Solves the Issue\n\n1. **Method Definition**: The `clear_cache` method is now defined in the `_MongoCore` class, giving the method a formal presence in the class.\n2. **Method Implementation**: The method is implemented using the `delete_many` operation with an empty filter. This MongoDB command will remove all documents in the collection, effectively clearing the cache.\n3. **Usage**: Users can now call the `clear_cache` method to remove all cached data, ensuring that no stale or outdated information remains, thereby maintaining data integrity and consistency.\n4. **Effectiveness**: By implementing this method, users are provided with an essential tool for cache management. Applications requiring fresh data can now reset the cache as necessary, avoiding inconsistencies in data retrieval and processing.\n\nOverall, implementing the `clear_cache` method resolves the core issue by giving developers the necessary tool to manage their cache effectively in MongoDB, thus avoiding potential problems related to stale data."
    },
    "issue_ground_truth": {
      "title": "MongoDB Cache Clearing Functionality Not Implemented",
      "description": "The `clear_cache` method in the MongoDB core (`mongo.py`) is not implemented. This method is crucial for maintaining the integrity and manageability of the MongoDB cache by allowing users to clear the cache when necessary. The absence of this functionality means users cannot reset or clear the MongoDB cache, which can lead to stale or outdated data persisting in the cache. This could affect applications relying on fresh data, causing inconsistencies and potential issues in data processing and retrieval. Implementing this method is essential to provide users with a way to manage their MongoDB cache effectively.",
      "explanation": "### Summary of the Issue\nThe issue at hand is with the `cachier` library's `clear_cache` function for the MongoDB backend, located in the `mongo.py` file. This function was not implemented, leaving users without a way to clear the MongoDB cache when necessary. The absence of this function can lead to stale or outdated data persistence in the cache, which is detrimental to applications that rely on real-time or frequently updated data.\n\n### Detailed Content of the Commit\nThe commit implements the `clear_cache` method by adding a functional code snippet to the previously unimplemented method. This method now allows it to delete entries in the MongoDB cache that match a specific function, identified by a string representation `_func_str`.\n\n### Explanation of the Solution and Its Implementation\n1. **Cause of the Issue:**\n   - The `clear_cache` method was defined but not implemented in the `mongo.py` file for the MongoDB backend of the `cachier` library.\n   - This omission meant users could not clear the cache and could inadvertently use outdated or incorrect data, causing inconsistencies in applications requiring fresh data for correct operation.\n\n2. **Content of the Commit:**\n   - The `clear_cache` method was edited to include a function that deletes multiple entries from the MongoDB collection. These entries are identified by a `filter` that matches the function string representation, referred to as `_func_str`.\n   - This mechanism leverages MongoDB’s `delete_many` function to remove all entries tied to the particular function.\n\n3. **How the Commit Solves the Issue:**\n   - **Functionality Implementation:** By adding the implementation to `clear_cache`, the method now actively performs the task of removing cached data from the MongoDB collection, ensuring that users can programmatically clear cache as needed.\n   - **Data Integrity:** Implementing this method addresses the core need for cache management, allowing users to ensure they are not exposed to stale data, thus maintaining the integrity and reliability of the data within their applications.\n   - **User Control:** It provides users with essential control over the caching mechanism, ensuring applications can force cache invalidation and refresh cached data promptly.\n\n### Solution to the Issue\nThe solution involves implementing the `clear_cache` method within the MongoDB core to perform a bulk deletion of cached entries based on a specific function identifier. This ensures that:\n- Users can clear outdated or stale cache entries.\n- Applications that rely on real-time or frequently changing data are not adversely affected by stale data.\n- The caching mechanism remains robust and flexible for different use cases.\n\nIn summary, the commit effectively addresses the absence of the `clear_cache` functionality, ensuring that cache management for MongoDB in the `cachier` library is complete and reliable."
    },
    "location_origin": [
      {
        "file": "python-cachier_cachier/src/cachier/cores/mongo.py",
        "function": {
          "0": "<module>"
        },
        "content_all": {
          "12": "import time  # to sleep when waiting on Mongo cache",
          "13": "import warnings  # to warn if pymongo is missing",
          "14": "from contextlib import suppress",
          "15": "from datetime import datetime",
          "16": "",
          "17": "from .._types import HashFunc, Mongetter",
          "18": ""
        },
        "content_change": {
          "15": "from datetime import datetime"
        }
      }
    ],
    "location_message": [
      {
        "file": "python-cachier_cachier/src/cachier/cores/mongo.py",
        "function": {
          "137": "clear_cache"
        },
        "content_all": {
          "134": "                 return entry[\"value\"]\n",
          "135": "             self.check_calc_timeout(time_spent)\n",
          "136": " \n",
          "137": "     def clear_cache(self):\n",
          "138": "         pass\n",
          "139": " \n",
          "140": "     def clear_being_calculated(self):\n"
        },
        "content_change": {
          "138": "         pass\n"
        }
      }
    ],
    "location_ground": [
      {
        "file": "python-cachier_cachier/src/cachier/cores/mongo.py",
        "function": {
          "137": "clear_cache"
        },
        "content_all": {
          "134": "                 return entry[\"value\"]\n",
          "135": "             self.check_calc_timeout(time_spent)\n",
          "136": " \n",
          "137": "     def clear_cache(self):\n",
          "138": "         pass\n",
          "139": " \n",
          "140": "     def clear_being_calculated(self):\n",
          "141": "         self.mongo_collection.update_many(\n",
          "142": "             filter={\n"
        },
        "content_change": {
          "137": "     def clear_cache(self):\n",
          "138": "         self.mongo_collection.delete_many({})\n"
        }
      }
    ],
    "location_ground_exp": [
      {
        "file": "python-cachier_cachier/src/cachier/cores/mongo.py",
        "function": {
          "137": "clear_cache"
        },
        "content_all": {
          "134": "                 return entry[\"value\"]\n",
          "135": "             self.check_calc_timeout(time_spent)\n",
          "136": " \n",
          "137": "     def clear_cache(self):\n",
          "138": "         pass\n",
          "139": " \n",
          "140": "     def clear_being_calculated(self):\n"
        },
        "content_change": {
          "137": "     def clear_cache(self):\n",
          "138": "         self.mongo_collection.delete_many({'func': self._func_str})\n"
        }
      }
    ],
    "location_ground_truth": [
      {
        "file": "python-cachier_cachier/src/cachier/cores/mongo.py",
        "function": null,
        "content_all": {
          "135": "            self.check_calc_timeout(time_spent)\n",
          "138": "        pass\n"
        },
        "content_change": {
          "138": "        pass\n"
        }
      }
    ],
    "patch_i": "\n--- a/python-cachier_cachier/src/cachier/cores/mongo.py\n+++ b/python-cachier_cachier/src/cachier/cores/mongo.py\n@@ -12,7 +12,6 @@\n import warnings  # to warn if pymongo is missing\n from contextlib import suppress\n \n-from datetime import datetime\n \n import pickle  # for serialization of python objects\n import sys  # to make sure that pymongo was imported\n",
    "patch_im": "\n--- a/python-cachier_cachier/src/cachier/cores/mongo.py\n+++ b/python-cachier_cachier/src/cachier/cores/mongo.py\n@@ -135,9 +135,9 @@\n             self.check_calc_timeout(time_spent)\n \n     def clear_cache(self):\n-        pass\n+        self.mongo_collection.delete_many({\"func\": self._func_str})\n \n     def clear_being_calculated(self):\n         self.mongo_collection.update_many(\n             filter={\n                 \"func\": self._func_str,\n                 \"being_calculated\": True,\n",
    "patch_il": "\n--- a/python-cachier_cachier/src/cachier/cores/mongo.py\n+++ b/python-cachier_cachier/src/cachier/cores/mongo.py\n@@ -12,7 +12,6 @@ import warnings  # to warn if pymongo is missing\n from contextlib import suppress\n from datetime import datetime\n \n-import pickle  # for serialization of python objects\n import sys  # to make sure that pymongo was imported\n import time  # to sleep when waiting on Mongo cache\n \n@@ -100,7 +99,6 @@ def set_entry(self, key, func_res):\n                     \"value\": Binary(thebytes),\n                     \"time\": datetime.now(),\n                     \"stale\": False,\n                     \"being_calculated\": False,\n                 }\n             },\n             upsert=True,\n         )\n",
    "patch_iml": "\n--- a/python-cachier_cachier/src/cachier/cores/mongo.py\n+++ b/python-cachier_cachier/src/cachier/cores/mongo.py\n@@ -135,9 +135,9 @@\n             self.check_calc_timeout(time_spent)\n \n     def clear_cache(self):\n-        pass\n+        self.mongo_collection.delete_many({\"func\": self._func_str})\n \n     def clear_being_calculated(self):\n         self.mongo_collection.update_many(\n             filter={\n                 \"func\": self._func_str,\n",
    "patch_ground": "\n--- a/python-cachier_cachier/src/cachier/cores/mongo.py\n+++ b/python-cachier_cachier/src/cachier/cores/mongo.py\n@@ -135,9 +135,14 @@\n             self.check_calc_timeout(time_spent)\n \n     def clear_cache(self):\n-        pass\n+        \"\"\"\n+        Clears the entire cache by removing all entries\n+        from the MongoDB collection associated with this cache.\n+        \"\"\"\n+        self.mongo_collection.delete_many({})\n \n     def clear_being_calculated(self):\n         self.mongo_collection.update_many(\n             filter={\n                 \"func\": self._func_str,\n                 \"being_calculated\": True,\n",
    "patch_ground_location": "\n--- a/python-cachier_cachier/src/cachier/cores/mongo.py\n+++ b/python-cachier_cachier/src/cachier/cores/mongo.py\n@@ -138,7 +138,8 @@\n         pass\n \n+    def clear_cache(self):\n+        \"\"\"\n+        Clears the entire cache by removing all entries\n+        from the MongoDB collection associated with this cache.\n+        \"\"\"\n+        self.mongo_collection.delete_many({})\n+\n     def clear_being_calculated(self):\n         self.mongo_collection.update_many(\n             filter={\n",
    "patch_ground_exp": "\n--- a/python-cachier_cachier/src/cachier/cores/mongo.py\n+++ b/python-cachier_cachier/src/cachier/cores/mongo.py\n@@ -134,7 +134,8 @@\n             self.check_calc_timeout(time_spent)\n \n     def clear_cache(self):\n-        pass\n+        self.mongo_collection.delete_many({'func': self._func_str})\n\n     def clear_being_calculated(self):\n         self.mongo_collection.update_many(\n",
    "patch_ground_all": "\n--- a/python-cachier_cachier/src/cachier/cores/mongo.py\n+++ b/python-cachier_cachier/src/cachier/cores/mongo.py\n@@ -135,1 +135,7 @@\n             self.check_calc_timeout(time_spent)\n-        pass\n+        \n+    def clear_cache(self):\n+        self.mongo_collection.delete_many(\n+            filter={\n+                \"func\": self._func_str\n+            }\n+        )\n",
    "patch_ground_truth": "--- a/python-cachier_cachier/src/cachier/cores/mongo.py\n+++ b/python-cachier_cachier/src/cachier/cores/mongo.py\n@@ -135,7 +135,7 @@\n             self.check_calc_timeout(time_spent)\n \n     def clear_cache(self):\n-        pass\n+        self.mongo_collection.delete_many(filter={\"func\": self._func_str})\n \n     def clear_being_calculated(self):\n         self.mongo_collection.update_many(\n",
    "message": null,
    "CodeBase": [
      {
        "path": "python-cachier_cachier/src/cachier/cores/mongo.py",
        "content": "1 \"\"\"A MongoDB-based caching core for cachier.\"\"\"\n2 \n3 # This file is part of Cachier.\n4 # https://github.com/python-cachier/cachier\n5 \n6 # Licensed under the MIT license:\n7 # http://www.opensource.org/licenses/MIT-license\n8 # Copyright (c) 2016, Shay Palachy <shaypal5@gmail.com>\n9 \n10 import pickle  # for serialization of python objects\n11 import sys  # to make sure that pymongo was imported\n12 import time  # to sleep when waiting on Mongo cache\\\n13 import warnings  # to warn if pymongo is missing\n14 from contextlib import suppress\n15 from datetime import datetime\n16 \n17 from .._types import HashFunc, Mongetter\n18 \n19 with suppress(ImportError):\n20     from bson.binary import Binary  # to save binary data to mongodb\n21     from pymongo import ASCENDING, IndexModel\n22     from pymongo.errors import OperationFailure\n23 \n24 from .base import RecalculationNeeded, _BaseCore, _get_func_str\n25 \n26 MONGO_SLEEP_DURATION_IN_SEC = 1\n27 \n28 \n29 class MissingMongetter(ValueError):\n30     \"\"\"Thrown when the mongetter keyword argument is missing.\"\"\"\n31 \n32 \n33 class _MongoCore(_BaseCore):\n34     _INDEX_NAME = \"func_1_key_1\"\n35 \n36     def __init__(\n37         self,\n38         hash_func: HashFunc,\n39         mongetter: Mongetter,\n40         wait_for_calc_timeout: int,\n41     ):\n42         if \"pymongo\" not in sys.modules:\n43             warnings.warn(\n44                 \"`pymongo` was not found. MongoDB cores will not function.\",\n45                 ImportWarning,\n46                 stacklevel=2,\n47             )  # pragma: no cover\n48 \n49         super().__init__(hash_func, wait_for_calc_timeout)\n50         if mongetter is None:\n51             raise MissingMongetter(\n52                 \"must specify ``mongetter`` when using the mongo core\"\n53             )\n54         self.mongetter = mongetter\n55         self.mongo_collection = self.mongetter()\n56         index_inf = self.mongo_collection.index_information()\n57         if _MongoCore._INDEX_NAME not in index_inf:\n58             func1key1 = IndexModel(\n59                 keys=[(\"func\", ASCENDING), (\"key\", ASCENDING)],\n60                 name=_MongoCore._INDEX_NAME,\n61             )\n62             self.mongo_collection.create_indexes([func1key1])\n63 \n64     @property\n65     def _func_str(self) -> str:\n66         return _get_func_str(self.func)\n67 \n68     def get_entry_by_key(self, key):\n69         res = self.mongo_collection.find_one(\n70             {\"func\": self._func_str, \"key\": key}\n71         )\n72         if not res:\n73             return key, None\n74         try:\n75             entry = {\n76                 \"value\": pickle.loads(res[\"value\"]),  # noqa: S301\n77                 \"time\": res.get(\"time\", None),\n78                 \"stale\": res.get(\"stale\", False),\n79                 \"being_calculated\": res.get(\"being_calculated\", False),\n80             }\n81         except KeyError:\n82             entry = {\n83                 \"value\": None,\n84                 \"time\": res.get(\"time\", None),\n85                 \"stale\": res.get(\"stale\", False),\n86                 \"being_calculated\": res.get(\"being_calculated\", False),\n87             }\n88         return key, entry\n89 \n90     def set_entry(self, key, func_res):\n91         thebytes = pickle.dumps(func_res)\n92         self.mongo_collection.update_one(\n93             filter={\"func\": self._func_str, \"key\": key},\n94             update={\n95                 \"$set\": {\n96                     \"func\": self._func_str,\n97                     \"key\": key,\n98                     \"value\": Binary(thebytes),\n99                     \"time\": datetime.now(),\n100                     \"stale\": False,\n101                     \"being_calculated\": False,\n102                 }\n103             },\n104             upsert=True,\n105         )\n106 \n107     def mark_entry_being_calculated(self, key):\n108         self.mongo_collection.update_one(\n109             filter={\"func\": self._func_str, \"key\": key},\n110             update={\"$set\": {\"being_calculated\": True}},\n111             upsert=True,\n112         )\n113 \n114     def mark_entry_not_calculated(self, key):\n115         with suppress(OperationFailure):  # don't care in this case\n116             self.mongo_collection.update_one(\n117                 filter={\n118                     \"func\": self._func_str,\n119                     \"key\": key,\n120                 },\n121                 update={\"$set\": {\"being_calculated\": False}},\n122                 upsert=False,  # should not insert in this case\n123             )\n124 \n125     def wait_on_entry_calc(self, key):\n126         time_spent = 0\n127         while True:\n128             time.sleep(MONGO_SLEEP_DURATION_IN_SEC)\n129             time_spent += MONGO_SLEEP_DURATION_IN_SEC\n130             key, entry = self.get_entry_by_key(key)\n131             if entry is None:\n132                 raise RecalculationNeeded()\n133             if not entry[\"being_calculated\"]:\n134                 return entry[\"value\"]\n135             self.check_calc_timeout(time_spent)\n136 \n137     def clear_cache(self):\n138         pass\n139 \n140     def clear_being_calculated(self):\n141         self.mongo_collection.update_many(\n142             filter={\n143                 \"func\": self._func_str,\n144                 \"being_calculated\": True,\n145             },\n146             update={\"$set\": {\"being_calculated\": False}},\n147         )"
      },
      {
        "path": "python-cachier_cachier/src/cachier/cores/base.py",
        "content": "1 \"\"\"Defines the interface of a cachier caching core.\"\"\"\n2 # This file is part of Cachier.\n3 # https://github.com/python-cachier/cachier\n4 \n5 # Licensed under the MIT license:\n6 # http://www.opensource.org/licenses/MIT-lic(...truncated)"
      },
      {
        "path": "python-cachier_cachier/README.rst",
        "content": "1 Cachier\n2 #######\n3 \n4 |PyPI-Status| |Downloads| |PyPI-Versions| |Build-Status| |Codecov| |Codefactor| |LICENCE|\n5 \n6 Persistent, sta(...truncated)"
      },
      {
        "path": "python-cachier_cachier/src/cachier/core.py",
        "content": "1 \"\"\"Persistent, stale-free memoization decorators fo(...truncated)"
      }
    ],
    "CommitSHA": "474294fac83b029f13dd61b8d2822923045d4504"
  },
  "Score": {
    "Difficulty": "Simple",
    "issue_origin": null,
    "issue_message": {
      "Title": 7,
      "Description": 6,
      "Reproducibility": 6,
      "Relevance": 7,
      "Explanation": 7,
      "Overall": 7
    },
    "issue_ground": {
      "Title": 7,
      "Description": 8,
      "Reproducibility": 9,
      "Relevance": 8,
      "Explanation": 8,
      "Overall": 8
    },
    "issue_ground_truth": {
      "title": "MongoDB Cache Clearing Functionality Not Implemented",
      "description": "The `clear_cache` method in the MongoDB core (`mongo.py`) is not implemented. This method is crucial for maintaining the integrity and manageability of the MongoDB cache by allowing users to clear the cache when necessary. The absence of this functionality means users cannot reset or clear the MongoDB cache, which can lead to stale or outdated data persisting in the cache. This could affect applications relying on fresh data, causing inconsistencies and potential issues in data processing and retrieval. Implementing this method is essential to provide users with a way to manage their MongoDB cache effectively.",
      "explanation": "### Summary of the Issue\nThe issue at hand is with the `cachier` library's `clear_cache` function for the MongoDB backend, located in the `mongo.py` file. This function was not implemented, leaving users without a way to clear the MongoDB cache when necessary. The absence of this function can lead to stale or outdated data persistence in the cache, which is detrimental to applications that rely on real-time or frequently updated data.\n\n### Detailed Content of the Commit\nThe commit implements the `clear_cache` method by adding a functional code snippet to the previously unimplemented method. This method now allows it to delete entries in the MongoDB cache that match a specific function, identified by a string representation `_func_str`.\n\n### Explanation of the Solution and Its Implementation\n1. **Cause of the Issue:**\n   - The `clear_cache` method was defined but not implemented in the `mongo.py` file for the MongoDB backend of the `cachier` library.\n   - This omission meant users could not clear the cache and could inadvertently use outdated or incorrect data, causing inconsistencies in applications requiring fresh data for correct operation.\n\n2. **Content of the Commit:**\n   - The `clear_cache` method was edited to include a function that deletes multiple entries from the MongoDB collection. These entries are identified by a `filter` that matches the function string representation, referred to as `_func_str`.\n   - This mechanism leverages MongoDB’s `delete_many` function to remove all entries tied to the particular function.\n\n3. **How the Commit Solves the Issue:**\n   - **Functionality Implementation:** By adding the implementation to `clear_cache`, the method now actively performs the task of removing cached data from the MongoDB collection, ensuring that users can programmatically clear cache as needed.\n   - **Data Integrity:** Implementing this method addresses the core need for cache management, allowing users to ensure they are not exposed to stale data, thus maintaining the integrity and reliability of the data within their applications.\n   - **User Control:** It provides users with essential control over the caching mechanism, ensuring applications can force cache invalidation and refresh cached data promptly.\n\n### Solution to the Issue\nThe solution involves implementing the `clear_cache` method within the MongoDB core to perform a bulk deletion of cached entries based on a specific function identifier. This ensures that:\n- Users can clear outdated or stale cache entries.\n- Applications that rely on real-time or frequently changing data are not adversely affected by stale data.\n- The caching mechanism remains robust and flexible for different use cases.\n\nIn summary, the commit effectively addresses the absence of the `clear_cache` functionality, ensuring that cache management for MongoDB in the `cachier` library is complete and reliable."
    }
  }
}