{
    "Selected_candidate": {
        "pr_number": 4738,
        "pr_title": "Fixed #24628 -- Fixed applied status for squashed migrations.",
        "pr_body": "",
        "issue_id": 24628,
        "issue_title": "Squash migration is not marked as applied when the migrations it replaces are",
        "issue_body": "(I am attempting a shorter and clearer description of this bug, but leaving the original description intact below - carljm).\nIn Django 1.8, consider an app\nA\nwith migrations\n1\nand\n2\nand a squashed migration\n1_squashed_2\nthat replaces both\n1\nand\n2\n. Consider a deployment of this app which has only\n1\napplied. When it receives the update including\n2\nand\n1_squashed_2\nand is migrated,\n2\nis marked as applied in the database but\n1_squashed_2\nis not.\nThis does not immediately appear to be a problem, because as long as migrations\n1\nand\n2\nexist and\n1_squashed_2\nis marked as replacing them, Django automatically considers\n1_squashed_2\nto be applied (and shows it as such in\nshowmigrations\n). But Django never actually records\n1_squashed_2\nitself as applied in the database.\nAt some point, once all deployments have migrated through\n2\n, the idea is that\n1\nand\n2\ncan be removed, and the\nreplaces\ntag removed from\n1_squashed_2\n. At this point we have a problem, because now Django considers\n1_squashed_2\nto not be applied, and tries to apply it again, causing errors because tables already exist, etc. The only solution is to manually\n--fake\nsquashed migrations on all deployments when their\nreplaces\ntag is removed, which is a pain and eliminates much of the convenience of the migration system;\n--fake\nshould not be necessary in the normal course of things.\nThe solution appears simple: anytime a migration is marked as having been applied, Django should check if it is part of a replaced set, and if that replaced set is now fully applied, the replacing migration should also be marked as applied.\nIt may be that this check should be performed by\nmigrate\neven when it hasn't applied any migrations in the current run, as that would help to correct the corruption already caused by this bug in many existing databases, but not yet noticed because the replaced migrations haven't yet been removed. (This correction should probably be mentioned to the user so it doesn't happen silently.)\nOriginal report follows:\nAfter some time in the chat with MarkusH, we decided that this should be a bug report.\nSome context: I have quite some migrations (33) of which the first already was a squash of several migrations. It did get it's 'replaced' field removed though, as the original migrations it replaced are long gone already. The goal now was to get that long list of migrations down to one again, for peace of mind.\nThis led to the following symptoms: When I created the squashed migration everything seemed fine. ./manage.py migrate did say that nothing was to be done (but then again, all the migrations to be squashed where already applied).\nBut adding another migration after that ended in tears - ./manage migrate didn't want to apply it and told me so in no uncertain terms.\n(pycess)dwt@atlan ~/Code/Projekte/pycess/pycess (git)-[master] % ./manage.py migrate\nTraceback (most recent call last):\n  File \"./manage.py\", line 10, in <module>\n    execute_from_command_line(sys.argv)\n  File \"/Users/dwt/Code/Projekte/pycess/django/django/core/management/__init__.py\", line 330, in execute_from_command_line\n    utility.execute()\n  File \"/Users/dwt/Code/Projekte/pycess/django/django/core/management/__init__.py\", line 322, in execute\n    self.fetch_command(subcommand).run_from_argv(self.argv)\n  File \"/Users/dwt/Code/Projekte/pycess/django/django/core/management/base.py\", line 347, in run_from_argv\n    self.execute(*args, **cmd_options)\n  File \"/Users/dwt/Code/Projekte/pycess/django/django/core/management/base.py\", line 398, in execute\n    output = self.handle(*args, **options)\n  File \"/Users/dwt/Code/Projekte/pycess/django/django/core/management/commands/migrate.py\", line 86, in handle\n    executor = MigrationExecutor(connection, self.migration_progress_callback)\n  File \"/Users/dwt/Code/Projekte/pycess/django/django/db/migrations/executor.py\", line 19, in __init__\n    self.loader = MigrationLoader(self.connection)\n  File \"/Users/dwt/Code/Projekte/pycess/django/django/db/migrations/loader.py\", line 47, in __init__\n    self.build_graph()\n  File \"/Users/dwt/Code/Projekte/pycess/django/django/db/migrations/loader.py\", line 281, in build_graph\n    _reraise_missing_dependency(migration, parent, e)\n  File \"/Users/dwt/Code/Projekte/pycess/django/django/db/migrations/loader.py\", line 264, in _reraise_missing_dependency\n    raise exc\n  File \"/Users/dwt/Code/Projekte/pycess/django/django/db/migrations/loader.py\", line 274, in build_graph\n    self.graph.add_dependency(migration, key, parent)\n  File \"/Users/dwt/Code/Projekte/pycess/django/django/db/migrations/graph.py\", line 124, in add_dependency\n    parent\ndjango.db.migrations.graph.NodeNotFoundError: Migration process.0002_auto_20150411_1005 dependencies reference nonexistent parent node ('process', '0001_squashed_initial_2')\nHere we had quite some discussion in #django-dev with MarkusH, of which the result to me was that a) django doesn't seem to ever record that a squashed migration is applied, at least as long as it is recognizable by django as a squashed migration (i.e. it has a .replaced property). b) there seems to be no way to tell django that the replacing squashed migration really is already applied for this database.\nThe last one turned out to be achievable if you remove the old migrations and the .replaces property from the squashed migration and then apply it with --fake\nMarkusH might want to say more here that he can describe better.\nFor ease of reproduction I'm attaching the project where this occurred for me.",
        "issue_closed_at": "2015-06-02T17:18:16",
        "base_commit": "23048d186ce0041654a9f547fe3e7177efce3076",
        "changes": [
            {
                "file": "django/db/migrations/executor.py",
                "type": "function",
                "name": "migrate",
                "class_name": "MigrationExecutor",
                "code": "def migrate(self, targets, plan=None, fake=False, fake_initial=False):\n        \"\"\"\n        Migrates the database up to the given targets.\n\n        Django first needs to create all project states before a migration is\n        (un)applied and in a second step run all the database operations.\n        \"\"\"\n        if plan is None:\n            plan = self.migration_plan(targets)\n        migrations_to_run = {m[0] for m in plan}\n        # Create the forwards plan Django would follow on an empty database\n        full_plan = self.migration_plan(self.loader.graph.leaf_nodes(), clean_start=True)\n        # Holds all states right before a migration is applied\n        # if the migration is being run.\n        states = {}\n        state = ProjectState(real_apps=list(self.loader.unmigrated_apps))\n        if self.progress_callback:\n            self.progress_callback(\"render_start\")\n        # Phase 1 -- Store all project states of migrations right before they\n        # are applied. The first migration that will be applied in phase 2 will\n        # trigger the rendering of the initial project state. From this time on\n        # models will be recursively reloaded as explained in\n        # `django.db.migrations.state.get_related_models_recursive()`.\n        for migration, _ in full_plan:\n            if not migrations_to_run:\n                # We remove every migration whose state was already computed\n                # from the set below (`migrations_to_run.remove(migration)`).\n                # If no states for migrations must be computed, we can exit\n                # this loop. Migrations that occur after the latest migration\n                # that is about to be applied would only trigger unneeded\n                # mutate_state() calls.\n                break\n            do_run = migration in migrations_to_run\n            if do_run:\n                if 'apps' not in state.__dict__:\n                    state.apps  # Render all real_apps -- performance critical\n                states[migration] = state.clone()\n                migrations_to_run.remove(migration)\n            # Only preserve the state if the migration is being run later\n            state = migration.mutate_state(state, preserve=do_run)\n        if self.progress_callback:\n            self.progress_callback(\"render_success\")\n        # Phase 2 -- Run the migrations\n        for migration, backwards in plan:\n            if not backwards:\n                self.apply_migration(states[migration], migration, fake=fake, fake_initial=fake_initial)\n            else:\n                self.unapply_migration(states[migration], migration, fake=fake)"
            },
            {
                "file": "django/db/migrations/executor.py",
                "type": "function",
                "name": "unapply_migration",
                "class_name": "MigrationExecutor",
                "code": "def unapply_migration(self, state, migration, fake=False):\n        \"\"\"\n        Runs a migration backwards.\n        \"\"\"\n        if self.progress_callback:\n            self.progress_callback(\"unapply_start\", migration, fake)\n        if not fake:\n            with self.connection.schema_editor() as schema_editor:\n                state = migration.unapply(state, schema_editor)\n        # For replacement migrations, record individual statuses\n        if migration.replaces:\n            for app_label, name in migration.replaces:\n                self.recorder.record_unapplied(app_label, name)\n        else:\n            self.recorder.record_unapplied(migration.app_label, migration.name)\n        # Report progress\n        if self.progress_callback:\n            self.progress_callback(\"unapply_success\", migration, fake)\n        return state"
            },
            {
                "file": "django/db/migrations/loader.py",
                "type": "function",
                "name": "build_graph",
                "class_name": "MigrationLoader",
                "code": "def build_graph(self):\n        \"\"\"\n        Builds a migration dependency graph using both the disk and database.\n        You'll need to rebuild the graph if you apply migrations. This isn't\n        usually a problem as generally migration stuff runs in a one-shot process.\n        \"\"\"\n        # Load disk data\n        self.load_disk()\n        # Load database data\n        if self.connection is None:\n            self.applied_migrations = set()\n        else:\n            recorder = MigrationRecorder(self.connection)\n            self.applied_migrations = recorder.applied_migrations()\n        # Do a first pass to separate out replacing and non-replacing migrations\n        normal = {}\n        replacing = {}\n        for key, migration in self.disk_migrations.items():\n            if migration.replaces:\n                replacing[key] = migration\n            else:\n                normal[key] = migration\n        # Calculate reverse dependencies - i.e., for each migration, what depends on it?\n        # This is just for dependency re-pointing when applying replacements,\n        # so we ignore run_before here.\n        reverse_dependencies = {}\n        for key, migration in normal.items():\n            for parent in migration.dependencies:\n                reverse_dependencies.setdefault(parent, set()).add(key)\n        # Remember the possible replacements to generate more meaningful error\n        # messages\n        reverse_replacements = {}\n        for key, migration in replacing.items():\n            for replaced in migration.replaces:\n                reverse_replacements.setdefault(replaced, set()).add(key)\n        # Carry out replacements if we can - that is, if all replaced migrations\n        # are either unapplied or missing.\n        for key, migration in replacing.items():\n            # Ensure this replacement migration is not in applied_migrations\n            self.applied_migrations.discard(key)\n            # Do the check. We can replace if all our replace targets are\n            # applied, or if all of them are unapplied.\n            applied_statuses = [(target in self.applied_migrations) for target in migration.replaces]\n            can_replace = all(applied_statuses) or (not any(applied_statuses))\n            if not can_replace:\n                continue\n            # Alright, time to replace. Step through the replaced migrations\n            # and remove, repointing dependencies if needs be.\n            for replaced in migration.replaces:\n                if replaced in normal:\n                    # We don't care if the replaced migration doesn't exist;\n                    # the usage pattern here is to delete things after a while.\n                    del normal[replaced]\n                for child_key in reverse_dependencies.get(replaced, set()):\n                    if child_key in migration.replaces:\n                        continue\n                    # List of migrations whose dependency on `replaced` needs\n                    # to be updated to a dependency on `key`.\n                    to_update = []\n                    # Child key may itself be replaced, in which case it might\n                    # not be in `normal` anymore (depending on whether we've\n                    # processed its replacement yet). If it's present, we go\n                    # ahead and update it; it may be deleted later on if it is\n                    # replaced, but there's no harm in updating it regardless.\n                    if child_key in normal:\n                        to_update.append(normal[child_key])\n                    # If the child key is replaced, we update its replacement's\n                    # dependencies too, if necessary. (We don't know if this\n                    # replacement will actually take effect or not, but either\n                    # way it's OK to update the replacing migration).\n                    if child_key in reverse_replacements:\n                        for replaces_child_key in reverse_replacements[child_key]:\n                            if replaced in replacing[replaces_child_key].dependencies:\n                                to_update.append(replacing[replaces_child_key])\n                    # Actually perform the dependency update on all migrations\n                    # that require it.\n                    for migration_needing_update in to_update:\n                        migration_needing_update.dependencies.remove(replaced)\n                        migration_needing_update.dependencies.append(key)\n            normal[key] = migration\n            # Mark the replacement as applied if all its replaced ones are\n            if all(applied_statuses):\n                self.applied_migrations.add(key)\n        # Finally, make a graph and load everything into it\n        self.graph = MigrationGraph()\n        for key, migration in normal.items():\n            self.graph.add_node(key, migration)\n\n        def _reraise_missing_dependency(migration, missing, exc):\n            \"\"\"\n            Checks if ``missing`` could have been replaced by any squash\n            migration but wasn't because the the squash migration was partially\n            applied before. In that case raise a more understandable exception.\n\n            #23556\n            \"\"\"\n            if missing in reverse_replacements:\n                candidates = reverse_replacements.get(missing, set())\n                is_replaced = any(candidate in self.graph.nodes for candidate in candidates)\n                if not is_replaced:\n                    tries = ', '.join('%s.%s' % c for c in candidates)\n                    exc_value = NodeNotFoundError(\n                        \"Migration {0} depends on nonexistent node ('{1}', '{2}'). \"\n                        \"Django tried to replace migration {1}.{2} with any of [{3}] \"\n                        \"but wasn't able to because some of the replaced migrations \"\n                        \"are already applied.\".format(\n                            migration, missing[0], missing[1], tries\n                        ),\n                        missing)\n                    exc_value.__cause__ = exc\n                    six.reraise(NodeNotFoundError, exc_value, sys.exc_info()[2])\n            raise exc\n\n        # Add all internal dependencies first to ensure __first__ dependencies\n        # find the correct root node.\n        for key, migration in normal.items():\n            for parent in migration.dependencies:\n                if parent[0] != key[0] or parent[1] == '__first__':\n                    # Ignore __first__ references to the same app (#22325)\n                    continue\n                try:\n                    self.graph.add_dependency(migration, key, parent)\n                except NodeNotFoundError as e:\n                    # Since we added \"key\" to the nodes before this implies\n                    # \"parent\" is not in there. To make the raised exception\n                    # more understandable we check if parent could have been\n                    # replaced but hasn't (eg partially applied squashed\n                    # migration)\n                    _reraise_missing_dependency(migration, parent, e)\n        for key, migration in normal.items():\n            for parent in migration.dependencies:\n                if parent[0] == key[0]:\n                    # Internal dependencies already added.\n                    continue\n                parent = self.check_key(parent, key[0])\n                if parent is not None:\n                    try:\n                        self.graph.add_dependency(migration, key, parent)\n                    except NodeNotFoundError as e:\n                        # Since we added \"key\" to the nodes before this implies\n                        # \"parent\" is not in there.\n                        _reraise_missing_dependency(migration, parent, e)\n            for child in migration.run_before:\n                child = self.check_key(child, key[0])\n                if child is not None:\n                    try:\n                        self.graph.add_dependency(migration, child, key)\n                    except NodeNotFoundError as e:\n                        # Since we added \"key\" to the nodes before this implies\n                        # \"child\" is not in there.\n                        _reraise_missing_dependency(migration, child, e)"
            }
        ]
    },
    "Justification": "Candidate B is the most helpful because it directly addresses issues related to squashed migrations, which is the core of the CURRENT bug report. It highlights a similar problem of migrations not being marked as applied when they should be, underscoring challenges in managing squashed migrations. The candidate's proposed fix involves changes that may provide crucial insights into how to eliminate deprecation warnings when transitioning from index_together to indexes, aligning closely with the deprecation issues mentioned in the CURRENT bug report. Its focus on migration status and the relevance of applying certain migrations collectively make it particularly applicable for debugging the current issue."
}