% \documentclass{uai2025} % for initial submission
\documentclass[accepted]{uai2025} % after acceptance, for a revised version; 
% also before submission to see how the non-anonymous paper would look like 
                        
%% There is a class option to choose the math font
% \documentclass[mathfont=ptmx]{uai2025} % ptmx math instead of Computer
                                         % Modern (has noticeable issues)
% \documentclass[mathfont=newtx]{uai2025} % newtx fonts (improves upon
                                          % ptmx; less tested, no support)
% NOTE: Only keep *one* line above as appropriate, as it will be replaced
%       automatically for papers to be published. Do not make any other
%       change above this note for an accepted version.

%% Choose your variant of English; be consistent
\usepackage[american]{babel}
% \usepackage[british]{babel}

%% Some suggested packages, as needed:
\usepackage{natbib} % has a nice set of citation styles and commands
    \bibliographystyle{plainnat}
    \renewcommand{\bibsection}{\subsubsection*{References}}
\usepackage{mathtools} % amsmath with fixes and additions
% \usepackage{siunitx} % for proper typesetting of numbers and units
\usepackage{booktabs} % commands to create good-looking tables
\usepackage{tikz} % nice language for creating drawings and diagrams
\usepackage{algorithm}
\usepackage{algorithmic}
\usepackage{multirow}
\usepackage{amsmath}
\usepackage{amsthm}
\usepackage{graphicx}
\usepackage{subcaption}
\usepackage{wrapfig}
\usepackage[table]{xcolor}
\usepackage{amssymb}


\newtheorem{theorem}{Theorem}
\newtheorem{assumption}{Assumption}

\definecolor{grey}{HTML}{D9D9D9}

%% Provided macros
% \smaller: Because the class footnote size is essentially LaTeX's \small,
%           redefining \footnotesize, we provide the original \footnotesize
%           using this macro.
%           (Use only sparingly, e.g., in drawings, as it is quite small.)

%% Self-defined macros
\newcommand{\swap}[3][-]{#3#1#2} % just an example

\title{Improving Adversarial Transferability via Decision Boundary Adaptation}

\author[1]{Jiayu Zhang}
\author[2]{Zhiyu Zhu}
\author[2]{Zhibo Jin}
\author[3]{Xinyi Wang}
\author[4]{Huaming Chen\thanks{Corresponding author: huaming.chen@sydney.edu.au. This work was done while the authors were with The University of Sydney.}}
\author[5]{Kim-Kwang Raymond Choo}

% Affiliations
\affil[1]{Suzhou University of Technology}
\affil[2]{University of Technology Sydney}
\affil[3]{Universiti Malaya}
\affil[4]{The University of Sydney}
\affil[5]{The University of Texas at San Antonio}

% The standard author block has changed for UAI 2025 to provide
% more space for long author lists and allow for complex affiliations
%
% All author information is authomatically removed by the class for the
% anonymous submission version of your paper, so you can already add your
% information below.
%
% Add authors
  \begin{document}
\maketitle

\begin{abstract}
Black-box attacks play a pivotal role in adversarial attacks. However, existing approaches often focus predominantly on attacking from a data-centric perspective, neglecting crucial aspects of the models. To address this issue, we propose a novel approach in this paper, coined Decision Boundary Adaptation (DBA). Our approach innovatively adopts a model-centric viewpoint, leveraging operations on the model to attain properties that enhance transferability. We observe that a flatter curvature of the statistical manifold, influenced by both samples and model parameters, leads to stronger transferability of the adversarial attacks. To leverage this, we introduce the concept of local flatness, providing an evaluation method for local flatness property along with a detailed mathematical proof. Additionally, we demonstrate a consistent relationship between local flatness, the model's decision boundary, and the gradient descent process, showing how flatness can be achieved through gradient descent at the model parameter level. Through extensive evaluation using state-of-the-art adversarial attack techniques, our DBA approach significantly enhances the black-box attack capabilities of all the tested adversarial attack methods. The implementation of our method is available at \url{https://github.com/LMBTough/DBA}.
\end{abstract}

\section{Introduction}\label{sec:intro}
Artificial intelligence (AI), especially Deep neural networks (DNNs), has showcased impressive success in computer vision tasks, such as image classification~\citep{He_2016_CVPR, Huang_2017_CVPR, Sandler_2018_CVPR, li2022research, gulzar2023fruit}. However, the vulnerability of DNNs to adversarial attacks poses critical concerns regarding model safety. Even minor, imperceptible perturbations to input data can lead to erroneous predictions~\citep{goodfellow2014explaining}, significantly undermining the model integrity. To enhance the trustworthiness of DNNs, it is important to proactively identify the potential vulnerabilities in the models. Therefore, developing novel adversarial attack methods against DNNs becomes a critical approach to provide holistic and novel insights of the vulnerabilities in the models.

In particular, the growing prevalence of AI-empowered web applications and online services has introduced both new opportunities and challenges for adversarial attacks. Web services, such as cloud-based machine learning platforms and online APIs, often expose machine learning models to external parties, creating potential attack surfaces. Attackers can exploit the web interfaces by launching black-box or query-based attacks, probing for vulnerabilities without direct access to the model. Moreover, with the widespread image recognition and natural language processing techniques on online platforms, the threat of adversarial attacks on the web has become a major security concern~\citep{apruzzese2022spacephish, omara2024ai}.

Adversarial attacks are broadly categorized into white-box and black-box approaches~\citep{papernot2017practical}. Initially, most adversarial attacks are white-box, where attackers have full knowledge of the target model, such as its architecture, parameters, etc~\citep{kurakin2018adversarial,madry2017towards}. However, white-box attacks are often impractical in real-world scenarios. As a result, black-box attacks that do not require any knowledge of the target model have become popular. In black-box attacks, query-based methods simulate the model behaviour by querying the target model for its outputs. While effective, these methods often require frequent and extensive querying to the target model, potentially reducing the stealthiness of the attack. In contrast, transferable attacks do not require any access to the target model, providing better stealth and greater development potential~\citep{dong2018boosting, dong2019evading, lin2019nesterov, zhang2022improving, long2022frequency}. Our work focuses on enhancing transferable black-box attacks.

Figure~\ref{fig:flowchart} shows a simplified model training process with different emphasis of the attack techniques. Recent methods, such as DI-FGSM~\citep{xie2019improving} and MI-FGSM~\citep{dong2018boosting}, enhance the transferability of attacks through data transformations, while others like NAA~\citep{zhang2022improving}, DANAA~\citep{jin2023danaa}, and MIG~\citep{ma2023transferable} improve attack transferability by modifying the loss function. SSA optimizes the Backpropagation steps to enhance the attack transferability~\citep{long2022frequency}. While most approaches optimize based on data or gradients, we note the techniques like LGV~\citep{gubri2022lgv} and DBA, proposed in this work, enhance attack transferability from the model-centric perspective. More discussion is included in Section.~\ref{relatedwork}.

We observe that, the effectiveness of transferable attacks is closely related to gradient information. Smaller gradient values can enhance the attack transferability~\citep{ge2023boosting}, as they indicate a flatter curvature of the statistical manifold under the combined influence of samples and current parameters, contributing to the generalization~\citep{zhao2022penalizing}. However, relying solely on the magnitude of gradients calculated by $L_p$ norms, within a Euclidean space loss function, may not be optimal. In particular, using such information to assess flatness only considers the first-order property of functions. We further investigate the function curvature with second-order information, such as leveraging the Hessian matrix.
%a more accurate observation of function curvature requires a second-order approach (calculating the Hessian matrix).

In this paper, we introduce a more general definition of local flatness in loss function curvature from an adversarial attack perspective, providing an approximate calculation method and a thorough proof. We also investigate how the gradient descent process of model parameters helps find a flatter curvature of the loss function during adversarial attack. Our experiments demonstrate a remarkable consistency between local flatness and decision boundaries. With these findings, we discuss the effectiveness of our method from multiple perspectives: local flatness, decision boundaries, and adversarial defense. We summarise the contributions as follows:
\begin{itemize}
  \item We provide a definition of local flatness during the adversarial attack process, along with a detailed proof, as well as proposing a novel method to achieve flatness.
  \item We discuss and validate the correlations between local flatness, gradient descent, adversarial defense training, and transferability, and propose DBA method to effectively leverage the gradient descent process to enhance the transferability of adversarial attacks.
  \item We conduct extensive experiments to compare DBA method against a broad spectrum of transferable attack methods. Significantly, DBA enhances the transferable capability of all methods in the experiments, demonstrating that DBA is a universal and effective approach for improving the transferability of adversarial attacks.% We open-source our implementation framework for future research.
\end{itemize}

\section{Related Work}\label{relatedwork}
\subsection{White-box Attacks}
White-box attacks are classical methods that grant the adversary access to the target model for the information of structure and parameters. Several methods include Fast Gradient Sign Method (FGSM)~\citep{goodfellow2014explaining}, Basic Iterative Method (BIM)~\citep{kurakin2018adversarial}, PGD~\citep{madry2017towards}, and Carlini\&Wagner method (C\&W)~\citep{carlini2017towards}.

FGSM leverages gradient information of the input data to generate adversarial examples by introducing small perturbations in the direction indicated by the sign function. BIM enhances adversarial attacks by iteratively applying subtle perturbations, thereby increasing the attack effectiveness. In parallel, PGD algorithm extended BIM by incorporating a projection step in each iteration to ensure that adversarial examples remain within a predefined perturbation range, thus enhancing the controllability and efficacy of the attack. Conversely, C\&W employs a distinct strategy by optimizing a tailored objective function to craft adversarial examples. It focuses on minimizing the size of the perturbations required, aiming to produce high-quality adversarial examples that are imperceptible.

\subsection{Black-box Attacks}
Black-box attacks aim to address the limited access to the target model's internals. While there are different approaches, they typically share a common goal of improving the effectiveness and transferability of adversarial examples. 

Traditional method, such as gradient-based attacks, serves as the cornerstone of attack techniques. Several exemplar methods include DI-FGSM~\citep{xie2019improving}, MI-FGSM~\citep{dong2018boosting}, TI-FGSM~\citep{dong2019evading}, and SINI-FGSM~\citep{lin2019nesterov}. DI-FGSM employs random transformations to improving the transferability. MI-FGSM and TI-FGSM add momentum and translation invariance, respectively, to refine attack robustness and success rate. SINI-FGSM further uses Nesterov accelerated gradient with scale-invariance, boosting the robustness and transferability.

\begin{figure}
    \centering
    \includegraphics[width=\linewidth]{images/DBA_flowchart.pdf}
    \caption{Comparison of adversarial attacks. Solid right-pointing arrows represent forward propagation; Dashed reverse arrows indicate backward propagation}
    \label{fig:flowchart}
\end{figure}

In addition to traditional gradient-based methods, Structure Invariant Attack (SIA)~\citep{wang2023structure} and Momentum Integrated Gradients (MIG)~\citep{ma2023transferable} introduce innovative strategies to augment attack transferability. SIA preserves the structural integrity of images through region-specific transformations (i.e., rotation and scaling), while MIG uses integrated gradients and a momentum term to enhance the transferability across models.
%to guide the generation of adversarial perturbations, incorporating a momentum term to ensure the cumulative aggregation of gradients, thus enhancing the transferability and efficacy across different models.
Other methods include Penalizing Gradient Norm (PGN)~\citep{ge2023boosting}, Neural Attribution Attack (NAA)~\citep{zhang2022improving}, and their variants. PGN enhances transferability by exploring the flat regions of a model's loss surface, leveraging flat minima characteristics through repeated gradient ascent on samples, aiming to find a flat direction. For NAA and its advanced form, the Dual Adversarial Neural Attribution Attack~\citep{jin2023danaa}, they target intermediate neurons to create more transferable adversarial examples. These methods assess the significance of neurons, attributing the model's output comprehensively and employing complex formulas to enhance attack efficiency.% through approximation approaches.

Complementary techniques like Gradient Relevance Attack (GRA)~\citep{zhu2023boosting} and frequency domain methods such as Spectrum Simulation Attack (SSA)~\citep{long2022frequency} and Frequency-based Stationary Point Search (FSPS)~\citep{zhu2023improving} represent novel directions. GRA reduces perturbation fluctuations via a gradient relevance framework. SSA manipulates spectral components for better transferability, while FSPS uses frequency information to find effective adversarial directions.
 %addresses the challenge of perturbation fluctuation during adversarial example generation. By introducing gradient relevance frameworks that adaptively correct the update direction based on neighborhood information, GRA mitigates fluctuations and enhances the transferability of adversarial examples. SSA manipulates spectral components to simulate target models, enhancing transferability through Discrete Cosine Transform and spectral saliency maps. FSPS, on the other hand, employs a frequency-centric approach to identify efficacious adversarial directions near stationary points on the loss curve, leveraging frequency information to explore the adversarial attack transferability.

In Figure~\ref{fig:flowchart}, we outline the model training process where each method targets improving attack transferability, primarily through data or gradient-based strategies. LGV~\citep{gubri2022lgv} leverages model parameter to simulate multiple models, akin to DI-FGSM's strategy of utilising similar semantic information. However, LGV requires extensive training data, and the data quality could significantly impact the attack effectiveness. Thus, in this work, we focus on model parameters through the lens of local flatness, which results from the interaction between a single sample and the model. This approach is independent of data quality, providing a more robust framework. Unlike LGV, which does not specifically address decision boundaries or flatness properties, our proposed DBA method directly targets these aspects. By optimizing for local flatness, DBA enhances the transferability of attacks across different methods, making it a universal and effective optimization framework.

Furthermore, unlike PGN~\citep{ge2023boosting}, which explores flat regions of the loss surface by manipulating gradients, our method works by adjusting model parameters to identify attack directions that better promote flatness. In the next section, we provide a more detailed discussion of our DBA method and its advantages in enhancing attack transferability.

\section{Our Proposed Method}

In this section, we first provide definitions for adversarial attacks and transferability in the attacks. We then introduce the local flatness and decision boundary adaptation of models and elucidate their impacts on transferability. We finally discuss the consistency between parameter gradient descent and transferability, decision boundary adaptation, and local flatness from multiple perspectives.

\subsection{Problem Definition}

For adversarial attacks, an adversarial example for a given input sample $x$ and a source model with parameters $\theta_s$ is defined as $x_{adv} = x + \delta$, where $x \in \mathbb{R}^m $, $\theta_s \in \mathbb{R}^n $, and $\delta$ represents a small perturbation crafted to mislead the model. The objective can be mathematically formulated as:
\begin{equation}
\max_{\delta} L(y^t|x+\delta,\theta_s) \quad \text{s.t.} \quad ||\delta||_p \leq \epsilon
\end{equation}
where $L$ is the loss function, $y$ is the true label for $x$, $\epsilon$ is the perturbation constraint ensuring the perturbation is small, and $||\cdot||_p$ denotes the $L_p$ norm, typically $p=\infty$ for image data, imposing a maximum change $\epsilon$ to any element of $x$.

We define the transferability $T$ of an adversarial example $x_{adv}$ from a source model with parameters $\theta_s$ to a well-trained target model with parameters $\theta_t$ as:
\begin{equation}
T(x_{\text{adv}}, \theta_s, \theta_t) = \frac{1}{k} \sum_{i=1}^k \mathbb{I}\left[f_t(x_i^{\text{adv}}) \neq f_t(x_i)\right]
\end{equation}
where $f_t(x)$ represents the classification function of the target model, $k$ denotes the number of adversarial samples, and $\mathbb{I}[\cdot]$ is the indicator function, yielding 1 if the condition holds true otherwise 0. High transferability implies that $T$ must be higher value, signifying that a greater number of adversarial examples generated on the source model effectively manipulate the prediction of target model.

\subsection{Local Flatness}

Smaller gradient values of the loss function indicate a flatter loss function landscape at the gradient computation location~\citep{zhao2022penalizing}. The flat local maxima induced by smaller gradients can enhance the transferability of adversarial attacks~\citep{ge2023boosting}. However, assessing flatness values based solely on gradient magnitudes constitutes an evaluation utilizing only first-order properties in Euclidean space. More generally, we aim to observe the curvature of the statistical manifold, which is influenced by both the current sample and model parameters. This notion, termed \textbf{local flatness}, refers to the flatness of the loss function locally around a single sample $x$ and model parameters $\theta$. Evaluating local flatness typically involves the Fisher information matrix~\citep{lehmann2006theory}. Moreover, starting from the definition of local flatness, we find that transferability arises not only from the gradient of the current sample and its variants but also from the combined effects of the sample and model parameters, an important aspect overlooked by nearly all current transferable adversarial attack methods. 

Inspired by the methods of computing Fisher information in~\citep{martin2020inspecting}, Fisher information inherently serves to assess the amount of information about the sample collected by the model, as discussed in~\citep{ly2017tutorial}. Equation~\ref{eq:fisher} illustrates that as training progresses, the amount of information gathered from the sample decreases, leading to a flatter Fisher information matrix. This flattening effect due to parameter updates (training) is the reason for the increase in flatness. Therefore, we extend this to the sample space to assess its association with local flatness.
\begin{theorem}[Directional Finite Difference Estimation]
\label{theorem1}
Given a function $f$ with independent variable $x$, we can estimate $\Delta x \cdot \frac{\partial f(x)}{\partial x}$ by applying a fixed-direction perturbation $\Delta x$ to $x$, as follows:
\begin{equation}
\resizebox{.7\linewidth}{!}{$
\begin{aligned}
\Delta x \cdot \frac{\partial f(x)}{\partial x} &\approx \frac{f(x + \epsilon \cdot \Delta x) - f(x)}{\epsilon} \\
&\approx \frac{f(x + \epsilon \cdot \Delta x) - f(x-\epsilon \cdot \Delta x)}{2\epsilon}
\end{aligned}
$}
\end{equation}
\end{theorem}
where $\epsilon$ is a scalar. Theorem~\ref{theorem1} will be used later to reduce the complexity of computing Fisher information. The \textit{proof} is provided in the appendix.
\begin{theorem}[Estimation of Curvature on Statistical Manifold]
\label{Theorem2}
Given an adversarial example $x$ and the current model parameters $\theta$, we can employ the Fisher information matrix to estimate the curvature of the statistical manifold.
\begin{equation}
\label{eq:fisher}
\resizebox{.9\linewidth}{!}{$
\begin{aligned}
\mathcal{I}_\theta (x) &= E_{y^c \sim P(y|x,\theta )} \left[ \frac{\partial \log P_{y^c} (x,\theta )^{\top }}{\partial x} \cdot \frac{\partial \log P_{y^c} (x,\theta )}{\partial x} \right] \\
&= - E_{y^c \sim P(y|x,\theta )} \left[ \mathrm{H}^c \right] \quad \text{s.t.} \quad \mathrm{H}_{ij}^c = \frac{\partial^2 \log P(y^c| x, \theta )}{\partial x_i \partial x_j}
\end{aligned}
$}
\end{equation}
\end{theorem}



where $E_{y^c \sim P(y|x,\theta )}[\cdot]$ denotes the expectation over the conditional distribution of class $y^c$ given input $x$ and parameters $\theta$, $\frac{\partial \log P_{y^c} (x,\theta )}{\partial x}$ represents the gradient of the logarithm of the conditional probability of class $y^c$ with respect to input $x$, and $\mathrm{H}^c$ is the Hessian matrix of the logarithm of the conditional probability with respect to $x$, with elements $\mathrm{H}_{ij}^c = \frac{\partial^2 \log P(y^c| x, \theta )}{\partial x_i \partial x_j}$.

The Hessian matrix $\mathrm{H}^c$ represents the curvature of the statistical manifold and serves as an effective tool for assessing the flatness of the manifold. As illustrated in Theorem~\ref{Theorem2}, if we consider the curvature of all neural network output class functions, the Fisher information matrix can be employed. Furthermore, the Fisher information matrix can also be regarded as a crucial tool for evaluating the changes in the KL divergence of the model's output distribution due to variations in $x$, as analyzed in the appendix. It is important to note that the magnitude of gradients considers only the output of a single class, $-log\ P_{y^t}(x,\theta)$, where $t$ represents the target label (the computation method of cross-entropy loss function under hard label conditions)~\citep{ge2023boosting}. However, in reality, for model outputs transformed into a probability distribution, enhancing the output values of other classes can also be an important means to diminish the probability of the current class.

Considering the high computational complexity of $\mathcal{I}_\theta (x) \in \mathbb{R}^{m \times m}$, we can estimate the impact of $\mathcal{I}_\theta (x)$ using the trace of $\mathcal{I}_\theta (x)$, $tr(\mathcal{I}_\theta (x))$. However, using the trace neglects the contribution of many off-diagonal dimensions to curvature. We use the quadratic form of $tr(\mathcal{I}_\theta (x))$ for analysis:
\begin{equation}
tr(\mathcal{I}_\theta (x)) = \sum_{i=1}^{m} e^i \mathcal{I}_\theta (x) e^i
\end{equation}
where $e^i \in \mathbb{R}^m$ and the $i$-th dimension of $e^i$ is 1, with the rest being 0. Changes in the dimension that $e^i$ is 0 of $x$ are not considered in the curvature calculation. More generally, we define:
\begin{equation}
flat (\mathcal{I}_\theta (x)) = \Delta x^\top \mathcal{I}_\theta (x) \Delta x
\end{equation}
as a measure to evaluate the flatness score of $x$'s curvature. With \textbf{Theorem~\ref{theorem1}}, we can approximate and derive \textbf{Theorem~\ref{theorem3}}.

\begin{theorem}[Assessment of Local Flatness]
\label{theorem3}
This allows us to assess local flatness using $flat (\mathcal{I}_\theta (x))$ with the following approximation.
\begin{equation}
\resizebox{.9\linewidth}{!}{$
\begin{aligned}
flat (\mathcal{I}_\theta (x)) &\approx \frac{1}{4 \varepsilon^2} \left \langle \frac{\partial P_y(x+\varepsilon \cdot \Delta x, \theta)}{\partial x} - \frac{\partial P_y(x-\varepsilon \cdot \Delta x, \theta)}{\partial x}, \right. \\
&\quad \left. \frac{\partial \log P_y(x+\varepsilon \cdot \Delta x, \theta)}{\partial x} - \frac{\partial \log P_y(x-\varepsilon \cdot \Delta x, \theta)}{\partial x} \right \rangle \\
&\propto \left \langle \frac{\partial P_y(x+\varepsilon \cdot \Delta x, \theta)}{\partial x} - \frac{\partial P_y(x-\varepsilon \cdot \Delta x, \theta)}{\partial x}, \right. \\
&\quad \left. \frac{\partial \log P_y(x+\varepsilon \cdot \Delta x, \theta)}{\partial x} - \frac{\partial \log P_y(x-\varepsilon \cdot \Delta x, \theta)}{\partial x} \right \rangle
\end{aligned}
$}
\end{equation}
\end{theorem}

where $\langle \cdot, \cdot \rangle$ denotes the dot product, we choose $\Delta x$ as the iterative update vector in adversarial attacks, allowing us to observe the level of flatness during the adversarial attack process. Additionally, we compute the flatness with respect to model parameters $\theta$ as follows:

\begin{equation}
\resizebox{.9\linewidth}{!}{$
\begin{aligned}
flat (\mathcal{I}_x (\theta)) &\propto \left \langle \frac{\partial P_y(x, \theta+\varepsilon \cdot \Delta \theta)}{\partial \theta} - \frac{\partial P_y(x, \theta-\varepsilon \cdot \Delta \theta)}{\partial \theta}, \right. \\
&\quad \left. \frac{\partial \log P_y(x, \theta+\varepsilon \cdot \Delta \theta)}{\partial \theta} - \frac{\partial \log P_y(x, \theta-\varepsilon \cdot \Delta \theta)}{\partial \theta} \right \rangle
\end{aligned}
$}
\end{equation}

The \textit{proof} of \textbf{Theorem~\ref{theorem3}} is provided in the appendix. Fisher information can be employed to assess the compatibility between parameters and samples~\citep{martin2020inspecting}, where a higher $flat (\mathcal{I}_\theta (x))$ indicates that the sample is extracting more information from the parameters $\theta$ (potentially leading to overfitting in attacks). From the perspective of $flat (\mathcal{I}_x (\theta))$, a larger value suggests that the sample is unusual for the parameters $\theta$ and is near the decision boundary, characterizing it as an Out-of-Distribution (OOD) sample. Therefore, it is crucial to minimize $flat (\mathcal{I}_\theta (x))$ and $flat (\mathcal{I}_x (\theta))$ during the attack process, for which gradient descent will be employed. Furthermore, we will discuss why samples close to the decision boundary are non-flat and detrimental to the transferability of adversarial attacks, highlighting a consistency between the two.



\begin{assumption}[Attack Transferability]
\label{assumption1}
Given an input sample $x$, and model parameters $\theta$, a smaller $flat (\mathcal{I}_\theta (x))$ leads to stronger attack transferability.
\end{assumption}


\begin{figure}[t]
    \centering
    \includegraphics[width=\linewidth]{images/combined_fisher_information_train.pdf}
    \caption{(a) and (b) depict the flatness analysis of $flat(\mathcal{I}_\theta (x))$ and $flat(\mathcal{I}_x (\theta))$ during training. The results are obtained using the Inception-v3 model on the ImageNet dataset.}
    \label{fig:train}
\end{figure}
\begin{figure}[t]
    \centering
    \includegraphics[width=\linewidth]{images/combined_fisher_information_comparison_with_labels.pdf}
    \caption{(a) and (b) are the flatness analysis of $flat(\mathcal{I}_\theta (x))$ and $flat(\mathcal{I}_x (\theta))$ during the attack process. The experiments are conducted using Inception-v3 on the ImageNet dataset.}
    \label{fig:ficompres}
\end{figure}



Assumption~\ref{assumption1} constitutes our core assumption, which is an optimized form of the hypothesis regarding the relationship between flatness and transferability proposed in~\citep{ge2023boosting}. As demonstrated in Figure~\ref{fig:train}, our DBA method effectively reduces $flat(\mathcal{I}_\theta (x))$ and $flat(\mathcal{I}_x (\theta))$ during the training process.



In Figure~\ref{fig:ficompres}, taking BIM as an example, when integrated with our DBA method, it consistently sustains lower $flat(\mathcal{I}_\theta (x))$ and $flat(\mathcal{I}_x (\theta))$ throughout the attack process compared to when used without DBA. Lower values of these metrics indicate a flatter curvature of the statistical manifold.


\subsection{Decision Boundary}

Neural networks operate as continuous mappings, where the transition of samples across the decision boundary (resulting in class changes) is a gradual process without abrupt, large-scale shifts in outcomes. This implies that samples near the decision boundary are of lower confidence and more prone to class changes. Training data exhibit high confidence on a well-trained model and are distant from decision boundary. Data close to the training data distribution are considered in-distribution (ID) data, whereas those near the decision boundary are deemed out-of-distribution (OOD) data. 

\begin{figure}
    \centering
    \includegraphics[width=\linewidth]{images/fixed trigger.pdf}
    \caption{Gradient direction information near the decision boundary is divergent and unstable. With DBA, the sample will be converted from Divergent Attack Direction to Correct Attack Direction. (Here we take samples close to the decision boundary as an example, as once a small perturbation is added to these samples, the classification results of the model may change)}
    \label{fig:boundary}
\end{figure}

In Figure~\ref{fig:boundary}, once an adversarial example approaches the vicinity of the decision boundary, gradient direction information becomes more divergent and unstable. To ensure data remains within the ID range, gradient descent on individual samples can be applied to push them from the decision boundary. The distance from the decision boundary can be inferred from the variation in loss function values during the adversarial attack; the greater the distance, the harder it is to increase the loss function value. Figure~\ref{fig:lossvalue} illustrates that the difficulty of attacking increases with gradient descent (Samples Distant from the Decision Boundary). It's important to emphasize that the methodologies for calculating the distance to the decision boundary and local flatness are solely dependent on the current model, the sample, and the task itself, and are not influenced by the method used to compute the gradient of adversarial samples during the attack process. Therefore, the approach remains valid even in scenarios involving complex loss functions.
% \textit{Figure 4: The x-axis represents the attack steps, and the y-axis represents the loss function value (the goal of the adversarial attack is to maximize the loss function value).}
\begin{equation}
\label{eq:gdae}
\min_{\theta } E_{(x, y^t) \sim D} [\max_{\delta } L (\theta ,x + \delta ,y)] 
\end{equation}
(From the perspective of adversarial training)
Furthermore, applying gradient descent to adversarial examples can be seen as simulating the process of adversarial defense training, as shown in Equation~\ref{eq:gdae}, where adversarial defense training of $\theta$ employs gradient descent~\citep{shaham2018understanding}. Utilizing gradients obtained from adversarially trained parameters and adversarial examples can facilitate attack effectiveness on adversarially trained models. We also experiment the transferability tests on such models.

\begin{figure}
    \centering
    \includegraphics[width=.4\linewidth]{images/loss_value.pdf}
    \caption{Relationship between the number of attack steps and the loss function value. We define `Original` as the baseline where no DBA is applied. `Descent' as the rounds of gradient descent. The smaller the value of the loss function, the more challenging it becomes to execute a successful attack, indicating greater distance from the decision boundary. This experiment uses the Inception-v3 model on the ImageNet dataset.}
    \label{fig:lossvalue}
\end{figure}

\subsection{Decision Boundary Adaptation (DBA)} \label{sec:methodDBA}

We have discussed how gradient descent operations can distance samples from the decision boundary, obtaining more general adversarial gradient information, and effectively increase the flatness of the curvature of the statistical manifold. In this context, we simply introduce gradient descent operations during the process of enhancing attack transferability.
\begin{equation}
\theta^t = \theta^{t-1} - \eta \cdot \frac{\partial L(y^t|x_i^{t-1},\theta ^{t-1})}{\partial \theta^{t-1}} 
\end{equation}
where $\theta^0$ represents the model's initial parameters, and $x^{t-1}$ denotes the adversarial sample iterated $t-1$ times. In each iteration of the adversarial sample, $\theta^t$ replaces the original model parameters. Furthermore, as adversarial attack algorithms often iterate over multiple samples simultaneously, the DBA algorithm can perform batch gradient descent operations during parameter updates:
\begin{equation}
\label{eq:batch_size}
\theta^t = \theta^{t-1} - \eta \frac{1}{k} \sum_{i=1}^{k} \frac{\partial L(y^t|x_i^{t-1},\theta ^{t-1})}{\partial \theta^{t-1}} 
\end{equation}
The update to $\theta$ affecting $x_i^{t-1}$ is by $\frac{\partial L(y^t|x_i^{t-1},\theta ^{t-1})}{\partial \theta^{t-1}}$, which also facilitates distancing from the decision boundary. This approach significantly enhances the efficiency of DBA, serving as an approximation of the single-step DBA, which introduces only one additional forward and backward gradient propagation, and it is negligible compared to the numerous propagations in methods like SSA and NAA. Therefore, there is some increase in computational cost but the added computational cost is manageable and acceptable. Detailed experimental results and pseudocode are in the Appendix~\ref{apx:impl}.

% \begin{algorithm}[htbp]

%     \renewcommand{\algorithmicrequire}{\textbf{Input:}} 
%     \renewcommand{\algorithmicensure}{\textbf{Output:}} 
%     \caption{Decision Boundary Adaptation Algorithm}
%     \label{alg:cap}
%     \label{code}
%     % \label{alg1}
%     \begin{algorithmic}[1] 
%         \REQUIRE parameter of the source model $\theta$, input $x$, target $y$, learning rate of model boundaries $\alpha_1$, learning rate of perturbations $\alpha_2$, warm up step $w$

%         \ENSURE $x_I$
%         \STATE $Init$ $ x_0 \leftarrow x$, $\theta_0 \leftarrow \theta$
%     \FOR{$ i = 1 \cdots I $}   
%     \small
%         \STATE $\theta^i = \theta^{i-1} + \left[ \left[ i \geq {w} \right] \right] \cdot \alpha_1 \left( \frac{\partial L\left(y \mid x_{i-1}, \theta^{i-1}\right)}{\partial \theta^{i-1}} \right) $
%         \STATE $\eta_i=\alpha_2 \cdot \operatorname{sign}\left( \frac{\partial L\left(y \mid x_{i-1}, \theta^{i-1}\right)}{\partial\left(x_{i-1}+\eta^{i-1}\right)}\right)$ 
%     \ENDFOR
    
%     \STATE $x_I=x_{i-1} + \eta_i = x_0 + \sum\limits_{j=1}^i \eta_j = x_0 + \eta^i$
%     \end{algorithmic} 
% \end{algorithm}

\section{Experiments}
\subsection{Experimental Setup} \label{sec:setup}
\paragraph{Dataset}To ensure a rigorous experiment design, we use the same dataset in the NAA\cite{zhang2022improving} and SSA~\citep{long2022frequency}, thus ensuring consistency in the experimental conditions across all baseline methods, including our algorithm. Accordingly, we conduct the evaluations using a standard image classification dataset that encompasses 1000 randomly chosen images from the ILSVRC 2012 validation set~\citep{ILSVRC15}.

\paragraph{Models}
% We evaluated the robustness against adversarial attacks across eleven models. Among these, six models were not subjected to any defense training, including traditional CNN architectures such as Inception-v3 (Inc-v3)~\citep{szegedy2016rethinking}, Inception-v4 (Inc-v4)~\citep{szegedy2017inception}, Inception-ResNet-v2 (IncRes-v2)~\citep{szegedy2017inception}, ResNet-50 (Res-50), ResNet-101 (Res-101), and ResNet-152 (Res-152)~\citep{he2016deep}. Additionally, two Vision Transformer (ViT) models, ViT-B/16~\citep{dosovitskiy2020image} and MaxViT-T~\citep{tu2022maxvit}, were also considered. Six of these models served as surrogate models for the attack generation, specifically Inc-v3, Inc-v4, IncRes-v2, Res-152, ViT-B/16, and MaxViT-T.

% Additionally, we include three models with defensive training: Inception-v3 Ensemble 3 (Inc-v3-ens3), Inception-v3 Ensemble 4 (Inc-v3-ens4), and Inception-ResNet-v2 Ensemble (IncRes-v2-ens)~\citep{tramer2017ensemble}. They are trained using ensemble techniques and specific defense mechanisms to improve their robustness against adversarial attacks. Inc-v3-ens3 and Inc-v3-ens4 are ensemble models based on Inception-v3, combining multiple independently trained versions of the same model. IncRes-v2-ens is an ensemble model based on Inception-ResNet-v2, utilizing ensemble techniques to improve its resistance to adversarial attacks.
We evaluated adversarial robustness across eleven models, including six without defense training: Inception-v3 (Inc-v3)~\citep{szegedy2016rethinking}, Inception-v4 (Inc-v4)~\citep{szegedy2017inception}, Inception-ResNet-v2 (IncRes-v2)~\citep{szegedy2017inception}, ResNet-50 (Res-50), ResNet-101 (Res-101), and ResNet-152 (Res-152)~\citep{he2016deep}, along with two Vision Transformer models, ViT-B/16~\citep{dosovitskiy2020image} and MaxViT-T~\citep{tu2022maxvit}. Among them, four models (Inc-v3, Inc-v4, IncRes-v2, Res-152) served as surrogates for attack generation.

We also evaluated three defense-trained models: Inception-v3 Ensemble 3 (Inc-v3-ens3), Inception-v3 Ensemble 4 (Inc-v3-ens4), and Inception-ResNet-v2 Ensemble (IncRes-v2-ens)~\citep{tramer2017ensemble}, which employ ensemble techniques to enhance adversarial robustness. Inc-v3-ens3 and Inc-v3-ens4 aggregate multiple Inception-v3 models, while IncRes-v2-ens extends this approach to Inception-ResNet-v2.


% include DI-FGSM~\citep{xie2019improving}, MI-FGSM~\citep{dong2018boosting}, TI-FGSM~\citep{dong2019evading}, SINI-FGSM~\citep{lin2019nesterov}, NAA~\citep{zhang2022improving}, and SSA~\citep{long2022frequency}, BIM~\citep{kurakin2018adversarial}, PGD~\citep{madry2017towards}



\paragraph{Baseline Methods}
To evaluate the performance of the models under adversarial attacks, we apply ten baseline adversarial attack methods representing a diverse range of common attacks: BIM~\citep{kurakin2018adversarial}, PGD~\citep{madry2017towards}, DI-FGSM~\citep{xie2019improving}, TI-FGSM~\citep{dong2019evading}, MI-FGSM~\citep{dong2018boosting}, SINI-FGSM~\citep{lin2019nesterov}, SSA~\citep{long2022frequency}, FSPS~\citep{zhu2023improving}, SIA~\citep{wang2023structure} and MIG~\citep{ma2023transferable}. %These methods  strategies to assess the robustness of deep learning models.

\begin{table}[t]
\centering
\caption{ASR on Defensive Training Models. Each data point represents Baseline/Baseline+DBA(Gap), \colorbox{green}{Green} indicating improvement by DBA over the original method, and \colorbox{red}{Red} indicates a decrease.}
\label{tab:defense}
\resizebox{\linewidth}{!}{%
\begin{tabular}{@{}c|c|ccc|c@{}}
\toprule
Model                        & Method    & Inc-v3-ens3                             & Inc-v3-ens4                             & IncRes-ens                              & Average                                   \\ \midrule
                             & BIM       & \cellcolor[HTML]{D9EAD3}12.5/12.6(0.1)  & \cellcolor[HTML]{D9EAD3}13.1/14.0(0.9)  & \cellcolor[HTML]{D9EAD3}4.7/6.7(2.0)    & \cellcolor[HTML]{D9EAD3}10.1/11.1(1.0)    \\
                             & DI-FGSM   & \cellcolor[HTML]{D9EAD3}17.3/17.7(0.4)  & \cellcolor[HTML]{D9EAD3}17.4/19.2(1.8)  & \cellcolor[HTML]{D9EAD3}9.0/9.1(0.1)    & \cellcolor[HTML]{D9EAD3}14.57/15.33(0.77) \\
                             & MIG       & \cellcolor[HTML]{D9EAD3}40.3/46.1(5.8)  & \cellcolor[HTML]{D9EAD3}39.6/43.8(4.2)  & \cellcolor[HTML]{D9EAD3}21.4/27.4(6.0)  & \cellcolor[HTML]{D9EAD3}33.77/39.1(5.33)  \\
                             & MI-FGSM   & \cellcolor[HTML]{F4CCCC}22.5/21.4(-1.1) & \cellcolor[HTML]{F4CCCC}22.5/21.5(-1.0) & \cellcolor[HTML]{D9EAD3}10.5/11.0(0.5)  & \cellcolor[HTML]{F4CCCC}18.5/17.97(-0.53) \\
                             & PGD       & \cellcolor[HTML]{D9EAD3}12.1/12.1(0.0)  & \cellcolor[HTML]{D9EAD3}12.6/12.9(0.3)  & \cellcolor[HTML]{D9EAD3}6.5/6.8(0.3)    & \cellcolor[HTML]{D9EAD3}10.4/10.6(0.2)    \\
                             & PGN       & \cellcolor[HTML]{D9EAD3}20.8/23.9(3.1)  & \cellcolor[HTML]{D9EAD3}20.8/24.9(4.1)  & \cellcolor[HTML]{D9EAD3}9.7/12.8(3.1)   & \cellcolor[HTML]{D9EAD3}17.1/20.53(3.43)  \\
                             & SIA       & \cellcolor[HTML]{F4CCCC}63.6/63.5(-0.1) & \cellcolor[HTML]{F4CCCC}61.9/60.8(-1.1) & \cellcolor[HTML]{D9EAD3}36.5/38.6(2.1)  & \cellcolor[HTML]{D9EAD3}54.0/54.3(0.3)    \\
                             & SINI-FGSM & \cellcolor[HTML]{D9EAD3}39.6/47.3(7.7)  & \cellcolor[HTML]{D9EAD3}36.7/45.6(8.9)  & \cellcolor[HTML]{D9EAD3}23.0/26.6(3.6)  & \cellcolor[HTML]{D9EAD3}33.1/39.83(6.73)  \\
                             & SSA       & \cellcolor[HTML]{D9EAD3}74.3/77.2(2.9)  & \cellcolor[HTML]{D9EAD3}75.0/77.1(2.1)  & \cellcolor[HTML]{D9EAD3}60.0/64.1(4.1)  & \cellcolor[HTML]{D9EAD3}69.77/72.8(3.03)  \\
\multirow{-10}{*}{Inc-v3}    & TI-FGSM   & \cellcolor[HTML]{D9EAD3}21.8/24.7(2.9)  & \cellcolor[HTML]{D9EAD3}24.1/27.2(3.1)  & \cellcolor[HTML]{D9EAD3}12.5/15.9(3.4)  & \cellcolor[HTML]{D9EAD3}19.47/22.6(3.13)  \\ \midrule
                             & BIM       & \cellcolor[HTML]{D9EAD3}10.5/12.5(2.0)  & \cellcolor[HTML]{D9EAD3}11.5/12.6(1.1)  & \cellcolor[HTML]{D9EAD3}7.1/7.3(0.2)    & \cellcolor[HTML]{D9EAD3}9.7/10.8(1.1)     \\
                             & DI-FGSM   & \cellcolor[HTML]{D9EAD3}16.0/20.7(4.7)  & \cellcolor[HTML]{D9EAD3}17.3/20.1(2.8)  & \cellcolor[HTML]{D9EAD3}10.8/13.1(2.3)  & \cellcolor[HTML]{D9EAD3}14.7/17.97(3.27)  \\
                             & MIG       & \cellcolor[HTML]{D9EAD3}61.8/65.2(3.4)  & \cellcolor[HTML]{D9EAD3}56.4/58.1(1.7)  & \cellcolor[HTML]{D9EAD3}44.0/47.7(3.7)  & \cellcolor[HTML]{D9EAD3}54.07/57.0(2.93)  \\
                             & MI-FGSM   & \cellcolor[HTML]{D9EAD3}21.2/24.4(3.2)  & \cellcolor[HTML]{D9EAD3}21.9/22.6(0.7)  & \cellcolor[HTML]{D9EAD3}12.8/15.0(2.2)  & \cellcolor[HTML]{D9EAD3}18.63/20.67(2.03) \\
                             & PGD       & \cellcolor[HTML]{D9EAD3}11.9/12.1(0.2)  & \cellcolor[HTML]{D9EAD3}12.4/12.4(0.0)  & \cellcolor[HTML]{D9EAD3}6.7/7.8(1.1)    & \cellcolor[HTML]{D9EAD3}10.33/10.77(0.43) \\
                             & PGN       & \cellcolor[HTML]{D9EAD3}21.9/23.4(1.5)  & \cellcolor[HTML]{D9EAD3}22.1/24.2(2.1)  & \cellcolor[HTML]{D9EAD3}14.7/16.6(1.9)  & \cellcolor[HTML]{D9EAD3}19.57/21.4(1.83)  \\
                             & SIA       & \cellcolor[HTML]{F4CCCC}72.4/72.1(-0.3) & \cellcolor[HTML]{D9EAD3}63.6/65.2(1.6)  & \cellcolor[HTML]{D9EAD3}51.7/51.9(0.2)  & \cellcolor[HTML]{D9EAD3}62.57/63.07(0.5)  \\
                             & SINI-FGSM & \cellcolor[HTML]{D9EAD3}54.7/62.2(7.5)  & \cellcolor[HTML]{D9EAD3}48.9/53.8(4.9)  & \cellcolor[HTML]{D9EAD3}38.7/44.0(5.3)  & \cellcolor[HTML]{D9EAD3}47.43/53.33(5.9)  \\
                             & SSA       & \cellcolor[HTML]{D9EAD3}80.3/82.2(1.9)  & \cellcolor[HTML]{D9EAD3}76.7/77.4(0.7)  & \cellcolor[HTML]{D9EAD3}76.8/77.0(0.2)  & \cellcolor[HTML]{D9EAD3}77.93/78.87(0.93) \\
\multirow{-10}{*}{IncRes-v2} & TI-FGSM   & \cellcolor[HTML]{D9EAD3}27.0/34.8(7.8)  & \cellcolor[HTML]{D9EAD3}28.2/32.9(4.7)  & \cellcolor[HTML]{D9EAD3}24.8/31.6(6.8)  & \cellcolor[HTML]{D9EAD3}26.67/33.1(6.43)  \\ \midrule
                             & BIM       & \cellcolor[HTML]{D9EAD3}11.3/15.6(4.3)  & \cellcolor[HTML]{D9EAD3}11.7/15.2(3.5)  & \cellcolor[HTML]{D9EAD3}4.8/7.7(2.9)    & \cellcolor[HTML]{D9EAD3}9.27/12.83(3.57)  \\
                             & DI-FGSM   & \cellcolor[HTML]{D9EAD3}17.1/21.5(4.4)  & \cellcolor[HTML]{D9EAD3}16.8/22.7(5.9)  & \cellcolor[HTML]{D9EAD3}9.3/14.7(5.4)   & \cellcolor[HTML]{D9EAD3}14.4/19.63(5.23)  \\
                             & MIG       & \cellcolor[HTML]{D9EAD3}28.0/35.3(7.3)  & \cellcolor[HTML]{D9EAD3}26.5/32.2(5.7)  & \cellcolor[HTML]{D9EAD3}15.1/18.8(3.7)  & \cellcolor[HTML]{D9EAD3}23.2/28.77(5.57)  \\
                             & MI-FGSM   & \cellcolor[HTML]{D9EAD3}19.9/23.5(3.6)  & \cellcolor[HTML]{D9EAD3}19.5/24.0(4.5)  & \cellcolor[HTML]{D9EAD3}10.3/13.2(2.9)  & \cellcolor[HTML]{D9EAD3}16.57/20.23(3.67) \\
                             & PGD       & \cellcolor[HTML]{D9EAD3}10.9/13.4(2.5)  & \cellcolor[HTML]{D9EAD3}12.5/13.5(1.0)  & \cellcolor[HTML]{D9EAD3}6.1/7.3(1.2)    & \cellcolor[HTML]{D9EAD3}9.83/11.4(1.57)   \\
                             & PGN       & \cellcolor[HTML]{D9EAD3}18.8/21.9(3.1)  & \cellcolor[HTML]{D9EAD3}21.2/24.5(3.3)  & \cellcolor[HTML]{D9EAD3}12.3/13.3(1.0)  & \cellcolor[HTML]{D9EAD3}17.43/19.9(2.47)  \\
                             & SIA       & \cellcolor[HTML]{D9EAD3}53.4/54.9(1.5)  & \cellcolor[HTML]{D9EAD3}49.0/50.5(1.5)  & \cellcolor[HTML]{D9EAD3}31.9/32.4(0.5)  & \cellcolor[HTML]{D9EAD3}44.77/45.93(1.17) \\
                             & SINI-FGSM & \cellcolor[HTML]{D9EAD3}26.3/36.8(10.5) & \cellcolor[HTML]{D9EAD3}26.0/32.9(6.9)  & \cellcolor[HTML]{D9EAD3}13.6/18.2(4.6)  & \cellcolor[HTML]{D9EAD3}21.97/29.3(7.33)  \\
                             & SSA       & \cellcolor[HTML]{D9EAD3}77.9/82.7(4.8)  & \cellcolor[HTML]{D9EAD3}77.2/80.4(3.2)  & \cellcolor[HTML]{D9EAD3}68.5/73.4(4.9)  & \cellcolor[HTML]{D9EAD3}74.53/78.83(4.3)  \\
\multirow{-10}{*}{Res-152}   & TI-FGSM   & \cellcolor[HTML]{D9EAD3}27.2/35.1(7.9)  & \cellcolor[HTML]{D9EAD3}30.5/34.8(4.3)  & \cellcolor[HTML]{D9EAD3}21.3/28.1(6.8)  & \cellcolor[HTML]{D9EAD3}26.33/32.67(6.33) \\ \midrule
                             & BIM       & \cellcolor[HTML]{D9EAD3}21.2/24.1(2.9)  & \cellcolor[HTML]{D9EAD3}24.4/28.0(3.6)  & \cellcolor[HTML]{D9EAD3}14.8/16.8(2.0)  & \cellcolor[HTML]{D9EAD3}20.13/22.97(2.83) \\
                             & DI-FGSM   & \cellcolor[HTML]{D9EAD3}40.7/43.9(3.2)  & \cellcolor[HTML]{D9EAD3}43.1/45.9(2.8)  & \cellcolor[HTML]{D9EAD3}33.1/35.6(2.5)  & \cellcolor[HTML]{D9EAD3}38.97/41.8(2.83)  \\
                             & MIG       & \cellcolor[HTML]{D9EAD3}56.7/58.5(1.8)  & \cellcolor[HTML]{D9EAD3}56.8/58.7(1.9)  & \cellcolor[HTML]{D9EAD3}50.3/52.2(1.9)  & \cellcolor[HTML]{D9EAD3}54.6/56.47(1.87)  \\
                             & MI-FGSM   & \cellcolor[HTML]{D9EAD3}45.9/46.2(0.3)  & \cellcolor[HTML]{D9EAD3}46.4/47.5(1.1)  & \cellcolor[HTML]{D9EAD3}38.5/40.7(2.2)  & \cellcolor[HTML]{D9EAD3}43.6/44.8(1.2)    \\
                             & PGD       & \cellcolor[HTML]{D9EAD3}18.7/20.2(1.5)  & \cellcolor[HTML]{D9EAD3}20.6/22.0(1.4)  & \cellcolor[HTML]{D9EAD3}12.8/12.8(0.0)  & \cellcolor[HTML]{D9EAD3}17.37/18.33(0.97) \\
                             & PGN       & \cellcolor[HTML]{D9EAD3}29.8/35.3(5.5)  & \cellcolor[HTML]{D9EAD3}32.9/38.0(5.1)  & \cellcolor[HTML]{D9EAD3}21.7/27.5(5.8)  & \cellcolor[HTML]{D9EAD3}28.13/33.6(5.47)  \\
                             & SIA       & \cellcolor[HTML]{D9EAD3}82.7/85.0(2.3)  & \cellcolor[HTML]{D9EAD3}83.2/84.2(1.0)  & \cellcolor[HTML]{D9EAD3}76.8/77.6(0.8)  & \cellcolor[HTML]{D9EAD3}80.9/82.27(1.37)  \\
                             & SINI-FGSM & \cellcolor[HTML]{D9EAD3}56.6/57.5(0.9)  & \cellcolor[HTML]{D9EAD3}56.3/58.5(2.2)  & \cellcolor[HTML]{F4CCCC}51.1/50.2(-0.9) & \cellcolor[HTML]{D9EAD3}54.67/55.4(0.73)  \\
                             & SSA       & \cellcolor[HTML]{D9EAD3}71.0/71.4(0.4)  & \cellcolor[HTML]{D9EAD3}71.8/72.7(0.9)  & \cellcolor[HTML]{D9EAD3}66.3/68.9(2.6)  & \cellcolor[HTML]{D9EAD3}69.7/71.0(1.3)    \\
\multirow{-10}{*}{ViT-B/16}  & TI-FGSM   & \cellcolor[HTML]{D9EAD3}30.3/35.2(4.9)  & \cellcolor[HTML]{D9EAD3}34.6/38.0(3.4)  & \cellcolor[HTML]{D9EAD3}25.4/29.0(3.6)  & \cellcolor[HTML]{D9EAD3}30.1/34.07(3.97)  \\ \bottomrule
\end{tabular}%
}
\end{table}

% \subsubsection{Parameters Setting} \label{apx:setup}
% All our experiments are conducted on one NVIDIA RTX 6000 Ada graphics card. In our experiments across all models, we set the maximum perturbation of all algorithms to $\frac{16}{255}$, with an attack step size of 10. Notably, we set the batch size to 1; the appendix discusses other batch sizes, highlighting the optimal performance enhancement of DBA at a batch size of 1. Additionally, specific parameters were allocated to each baseline attack method to ensure evaluation consistency and fairness. For the DI-FGSM method, we set the decay parameter to 0, the resize rate to 0.9, and the diversity probability to 0.5. Similarly, for the TI-FGSM method, the decay parameter was set to 0, employing a Gaussian kernel with a length of 15 and a standard deviation of 3. The resize rate and diversity probability were set to 0.9 and 0.5, respectively. For the MI-FGSM method, the decay parameter was set to 1. For the SINI-FGSM method, the decay parameter was set to 1, with parameter $m$ set to 5. For the SSA method, we used a kernel length of 7, standard deviation of 3, momentum of 1, $N$ of 20, $\sigma$ of 16, and $\rho$ of 0.5.

\paragraph{Metrics}The main evaluation metric employed to gauge the performance of the models under adversarial attacks is Attack Success Rate (ASR). It quantifies the percentage of adversarial examples that successfully induce misclassifications. A higher attack success rate indicates that the attack method performs better on a specific model. The primary parameter adjusted in our DBA method is the learning rate, with the batch size set to 1 and the training strategy encompassing all steps. Full details on the parameters can be found in the Appendix~\ref{apx:setup} and our code repository.

\subsection{Result}

\begin{table*}[h!]
\centering
\caption{ASR on models without defense training. Each data group follows the pattern: Baseline/Baseline+DBA (Gap), where \colorbox{green}{Green} indicates an improvement by DBA relative to the original method, \colorbox{red}{Red} denotes a decrease, and \colorbox{grey}{Grey} represents white-box attacks, which are not our focus.}
\label{tab:nodefense}
\resizebox{\textwidth}{!}{%
\begin{tabular}{@{}c|c|cccccccc|l@{}}
\toprule
Model                        & Method    & Inc-v3                                  & Inc-v4                                  & IncRes-v2                               & Res-50                                  & Res-101                                 & Res-152                                 & ViT-B/16                               & MaxViT-T                                & Average                                    \\ \midrule
                             & BIM       & \cellcolor[HTML]{D9D9D9}-               & \cellcolor[HTML]{D9EAD3}27.9/36.8(8.9)  & \cellcolor[HTML]{D9EAD3}19.7/29.9(10.2) & \cellcolor[HTML]{D9EAD3}24.1/30.5(6.4)  & \cellcolor[HTML]{D9EAD3}21.9/27.7(5.8)  & \cellcolor[HTML]{D9EAD3}18.7/25.1(6.4)  & \cellcolor[HTML]{D9EAD3}9.2/11(1.8)    & \cellcolor[HTML]{D9EAD3}9.8/11.4(1.6)   & \cellcolor[HTML]{D9EAD3}18.76/24.63(5.87)  \\
                             & DI-FGSM   & \cellcolor[HTML]{D9D9D9}-               & \cellcolor[HTML]{D9EAD3}48.2/52.1(3.9)  & \cellcolor[HTML]{D9EAD3}40.1/41.4(1.3)  & \cellcolor[HTML]{D9EAD3}38.4/42.1(3.7)  & \cellcolor[HTML]{D9EAD3}35.9/39.1(3.2)  & \cellcolor[HTML]{D9EAD3}32.5/36.3(3.8)  & \cellcolor[HTML]{D9EAD3}12.6/13.5(0.9) & \cellcolor[HTML]{D9EAD3}14.8/16.5(1.7)  & \cellcolor[HTML]{D9EAD3}31.79/34.43(2.64)  \\
                             & MIG       & \cellcolor[HTML]{D9D9D9}-               & \cellcolor[HTML]{D9EAD3}70.8/81.8(11)   & \cellcolor[HTML]{D9EAD3}68.5/81.8(13.3) & \cellcolor[HTML]{D9EAD3}69.8/79.8(10.0) & \cellcolor[HTML]{D9EAD3}64.7/77.1(12.4) & \cellcolor[HTML]{D9EAD3}63.9/75.7(11.8) & \cellcolor[HTML]{D9EAD3}32.8/37(4.2)   & \cellcolor[HTML]{D9EAD3}32.4/42.7(10.3) & \cellcolor[HTML]{D9EAD3}57.56/67.99(10.43) \\
                             & MI-FGSM   & \cellcolor[HTML]{D9D9D9}-               & \cellcolor[HTML]{D9EAD3}50.2/56.3(6.1)  & \cellcolor[HTML]{D9EAD3}46.5/54.3(7.8)  & \cellcolor[HTML]{D9EAD3}47.0/51.4(4.4)  & \cellcolor[HTML]{D9EAD3}41.3/46.4(5.1)  & \cellcolor[HTML]{D9EAD3}41.6/45.9(4.3)  & \cellcolor[HTML]{D9EAD3}20.8/21.5(0.7) & \cellcolor[HTML]{D9EAD3}20.9/23(2.1)    & \cellcolor[HTML]{D9EAD3}38.33/42.69(4.36)  \\
                             & PGD       & \cellcolor[HTML]{D9D9D9}-               & \cellcolor[HTML]{D9EAD3}24.4/31.5(7.1)  & \cellcolor[HTML]{D9EAD3}15.9/28(12.1)   & \cellcolor[HTML]{D9EAD3}22.4/28.1(5.7)  & \cellcolor[HTML]{D9EAD3}18.8/25.4(6.6)  & \cellcolor[HTML]{D9EAD3}18.3/21.6(3.3)  & \cellcolor[HTML]{D9EAD3}9.6/10.8(1.2)  & \cellcolor[HTML]{D9EAD3}8.3/10.6(2.3)   & \cellcolor[HTML]{D9EAD3}16.81/22.29(5.47)  \\
                             & PGN       & \cellcolor[HTML]{D9D9D9}-               & \cellcolor[HTML]{D9EAD3}28.9/37.4(8.5)  & \cellcolor[HTML]{D9EAD3}19.4/29.9(10.5) & \cellcolor[HTML]{D9EAD3}27.0/33.2(6.2)  & \cellcolor[HTML]{D9EAD3}22.7/29.3(6.6)  & \cellcolor[HTML]{D9EAD3}20.9/27.4(6.5)  & \cellcolor[HTML]{D9EAD3}13.8/15.8(2)   & \cellcolor[HTML]{D9EAD3}8.8/12.5(3.7)   & \cellcolor[HTML]{D9EAD3}20.21/26.5(6.29)   \\
                             & SIA       & \cellcolor[HTML]{D9D9D9}-               & \cellcolor[HTML]{D9EAD3}96.5/96.9(0.4)  & \cellcolor[HTML]{D9EAD3}95.4/95.6(0.2)  & \cellcolor[HTML]{D9EAD3}92.3/93.9(1.6)  & \cellcolor[HTML]{D9EAD3}91.6/92.1(0.5)  & \cellcolor[HTML]{D9EAD3}90.2/90.9(0.7)  & \cellcolor[HTML]{D9EAD3}48.2/50.6(2.4) & \cellcolor[HTML]{D9EAD3}65/67.3(2.3)    & \cellcolor[HTML]{D9EAD3}82.74/83.9(1.16)   \\
                             & SINI-FGSM & \cellcolor[HTML]{D9D9D9}-               & \cellcolor[HTML]{D9EAD3}76.6/87.9(11.3) & \cellcolor[HTML]{D9EAD3}75.5/86.6(11.1) & \cellcolor[HTML]{D9EAD3}72.9/84.3(11.4) & \cellcolor[HTML]{D9EAD3}69.0/81.9(12.9) & \cellcolor[HTML]{D9EAD3}66.9/79.1(12.2) & \cellcolor[HTML]{D9EAD3}32.1/39.6(7.5) & \cellcolor[HTML]{D9EAD3}35/43.7(8.7)    & \cellcolor[HTML]{D9EAD3}61.14/71.87(10.73) \\
                             & SSA       & \cellcolor[HTML]{D9D9D9}-               & \cellcolor[HTML]{D9EAD3}88.3/90.9(2.6)  & \cellcolor[HTML]{D9EAD3}86.4/88.8(2.4)  & \cellcolor[HTML]{D9EAD3}80.7/84.6(3.9)  & \cellcolor[HTML]{D9EAD3}80.0/83.7(3.7)  & \cellcolor[HTML]{D9EAD3}80.7/83.4(2.7)  & \cellcolor[HTML]{D9EAD3}52.2/56(3.8)   & \cellcolor[HTML]{D9EAD3}49.1/51.5(2.4)  & \cellcolor[HTML]{D9EAD3}73.91/76.99(3.07)  \\
\multirow{-10}{*}{Inc-v3}    & TI-FGSM   & \cellcolor[HTML]{D9D9D9}-               & \cellcolor[HTML]{D9EAD3}32.9/40(7.1)    & \cellcolor[HTML]{D9EAD3}16.5/23.1(6.6)  & \cellcolor[HTML]{D9EAD3}24.2/27.8(3.6)  & \cellcolor[HTML]{D9EAD3}20.8/25.5(4.7)  & \cellcolor[HTML]{D9EAD3}20.7/24.3(3.6)  & \cellcolor[HTML]{D9EAD3}14/15.2(1.2)   & \cellcolor[HTML]{D9EAD3}10.6/12.3(1.7)  & \cellcolor[HTML]{D9EAD3}19.96/24.03(4.07)  \\ \midrule
                             & BIM       & \cellcolor[HTML]{D9EAD3}33.7/41.4(7.7)  & \cellcolor[HTML]{D9EAD3}27.7/34(6.3)    & \cellcolor[HTML]{D9D9D9}-               & \cellcolor[HTML]{D9EAD3}24.3/29.7(5.4)  & \cellcolor[HTML]{D9EAD3}20.8/26.2(5.4)  & \cellcolor[HTML]{D9EAD3}20.4/26(5.6)    & \cellcolor[HTML]{D9EAD3}9.5/10.2(0.7)  & \cellcolor[HTML]{D9EAD3}10/12(2)        & \cellcolor[HTML]{D9EAD3}20.91/25.64(4.73)  \\
                             & DI-FGSM   & \cellcolor[HTML]{D9EAD3}55.8/64.2(8.4)  & \cellcolor[HTML]{D9EAD3}48.8/59.3(10.5) & \cellcolor[HTML]{D9D9D9}-               & \cellcolor[HTML]{D9EAD3}40.0/48.7(8.7)  & \cellcolor[HTML]{D9EAD3}38.2/47.1(8.9)  & \cellcolor[HTML]{D9EAD3}37/45.1(8.1)    & \cellcolor[HTML]{D9EAD3}13.6/14.2(0.6) & \cellcolor[HTML]{D9EAD3}16.9/21.2(4.3)  & \cellcolor[HTML]{D9EAD3}35.76/42.83(7.07)  \\
                             & MIG       & \cellcolor[HTML]{D9EAD3}88.2/89.3(1.1)  & \cellcolor[HTML]{D9EAD3}84.2/85.8(1.6)  & \cellcolor[HTML]{D9D9D9}-               & \cellcolor[HTML]{D9EAD3}83.0/83.5(0.5)  & \cellcolor[HTML]{D9EAD3}80.3/83.5(3.2)  & \cellcolor[HTML]{D9EAD3}77.5/81.8(4.3)  & \cellcolor[HTML]{D9EAD3}42.9/43.6(0.7) & \cellcolor[HTML]{D9EAD3}43.8/45.8(2)    & \cellcolor[HTML]{D9EAD3}71.41/73.33(1.91)  \\
                             & MI-FGSM   & \cellcolor[HTML]{D9EAD3}60.1/68.2(8.1)  & \cellcolor[HTML]{D9EAD3}54/61(7)        & \cellcolor[HTML]{D9D9D9}-               & \cellcolor[HTML]{D9EAD3}48.4/54.2(5.8)  & \cellcolor[HTML]{D9EAD3}45.1/52.0(6.9)  & \cellcolor[HTML]{D9EAD3}43.5/47.8(4.3)  & \cellcolor[HTML]{D9EAD3}21/21.4(0.4)   & \cellcolor[HTML]{D9EAD3}20.7/25.2(4.5)  & \cellcolor[HTML]{D9EAD3}41.83/47.11(5.29)  \\
                             & PGD       & \cellcolor[HTML]{D9EAD3}32.1/38.5(6.4)  & \cellcolor[HTML]{D9EAD3}23.7/29.5(5.8)  & \cellcolor[HTML]{D9D9D9}-               & \cellcolor[HTML]{D9EAD3}22.0/26.1(4.1)  & \cellcolor[HTML]{D9EAD3}19.3/21.0(1.7)  & \cellcolor[HTML]{D9EAD3}17.5/20.6(3.1)  & \cellcolor[HTML]{D9EAD3}8.9/10.1(1.2)  & \cellcolor[HTML]{D9EAD3}8.5/10.2(1.7)   & \cellcolor[HTML]{D9EAD3}18.86/22.29(3.43)  \\
                             & PGN       & \cellcolor[HTML]{D9EAD3}39.7/42.2(2.5)  & \cellcolor[HTML]{D9EAD3}30.2/33(2.8)    & \cellcolor[HTML]{D9D9D9}-               & \cellcolor[HTML]{D9EAD3}30.6/31.4(0.8)  & \cellcolor[HTML]{D9EAD3}27.0/28.0(1.0)  & \cellcolor[HTML]{D9EAD3}24.5/26(1.5)    & \cellcolor[HTML]{D9EAD3}13.3/14.9(1.6) & \cellcolor[HTML]{D9EAD3}11.5/12.1(0.6)  & \cellcolor[HTML]{D9EAD3}25.26/26.8(1.54)   \\
                             & SIA       & \cellcolor[HTML]{F4CCCC}96.5/95.7(-0.8) & \cellcolor[HTML]{F4CCCC}95.8/95(-0.8)   & \cellcolor[HTML]{D9D9D9}-               & \cellcolor[HTML]{D9EAD3}92.7/92.8(0.1)  & \cellcolor[HTML]{D9EAD3}91.4/91.6(0.2)  & \cellcolor[HTML]{F4CCCC}91.6/90.9(-0.7) & \cellcolor[HTML]{D9EAD3}50.4/51.4(1)   & \cellcolor[HTML]{D9EAD3}67.2/68.5(1.3)  & \cellcolor[HTML]{D9EAD3}83.66/83.7(0.04)   \\
                             & SINI-FGSM & \cellcolor[HTML]{D9EAD3}87.7/92.5(4.8)  & \cellcolor[HTML]{D9EAD3}84.2/89.7(5.5)  & \cellcolor[HTML]{D9D9D9}-               & \cellcolor[HTML]{D9EAD3}78.8/86.1(7.3)  & \cellcolor[HTML]{D9EAD3}78.2/83.1(4.9)  & \cellcolor[HTML]{D9EAD3}77.6/82.5(4.9)  & \cellcolor[HTML]{D9EAD3}37.1/40.9(3.8) & \cellcolor[HTML]{D9EAD3}41/45.1(4.1)    & \cellcolor[HTML]{D9EAD3}69.23/74.27(5.04)  \\
                             & SSA       & \cellcolor[HTML]{F4CCCC}90.6/89.8(-0.8) & \cellcolor[HTML]{D9EAD3}89.4/89.5(0.1)  & \cellcolor[HTML]{D9D9D9}-               & \cellcolor[HTML]{D9EAD3}86.2/86.3(0.1)  & \cellcolor[HTML]{D9EAD3}84.6/84.6(0.0)  & \cellcolor[HTML]{F4CCCC}85.5/85.3(-0.2) & \cellcolor[HTML]{D9EAD3}60/62.4(2.4)   & \cellcolor[HTML]{D9EAD3}57.3/57.4(0.1)  & \cellcolor[HTML]{D9EAD3}79.09/79.33(0.24)  \\
\multirow{-10}{*}{IncRes-v2} & TI-FGSM   & \cellcolor[HTML]{D9EAD3}42.8/48.6(5.8)  & \cellcolor[HTML]{D9EAD3}40.6/47.8(7.2)  & \cellcolor[HTML]{D9D9D9}-               & \cellcolor[HTML]{D9EAD3}31.5/36.5(5.0)  & \cellcolor[HTML]{D9EAD3}28.1/35.1(7.0)  & \cellcolor[HTML]{D9EAD3}29/34.1(5.1)    & \cellcolor[HTML]{D9EAD3}16.6/20.7(4.1) & \cellcolor[HTML]{D9EAD3}12.9/15.2(2.3)  & \cellcolor[HTML]{D9EAD3}28.79/34.0(5.21)   \\ \midrule
                             & BIM       & \cellcolor[HTML]{D9EAD3}27.4/48.3(20.9) & \cellcolor[HTML]{D9EAD3}21.1/43.2(22.1) & \cellcolor[HTML]{D9EAD3}11.8/29.5(17.7) & \cellcolor[HTML]{D9EAD3}29.6/50.3(20.7) & \cellcolor[HTML]{D9EAD3}26.3/46.8(20.5) & \cellcolor[HTML]{D9D9D9}-               & \cellcolor[HTML]{D9EAD3}9.3/14.2(4.9)  & \cellcolor[HTML]{D9EAD3}10.8/20.8(10)   & \cellcolor[HTML]{D9EAD3}19.47/36.16(16.69) \\
                             & DI-FGSM   & \cellcolor[HTML]{D9EAD3}55.1/58.9(3.8)  & \cellcolor[HTML]{D9EAD3}52.1/55.5(3.4)  & \cellcolor[HTML]{D9EAD3}41.5/45.3(3.8)  & \cellcolor[HTML]{D9EAD3}58.8/65.1(6.3)  & \cellcolor[HTML]{D9EAD3}56.0/63.0(7.0)  & \cellcolor[HTML]{D9D9D9}-               & \cellcolor[HTML]{D9EAD3}14.4/19.7(5.3) & \cellcolor[HTML]{D9EAD3}24.5/31.6(7.1)  & \cellcolor[HTML]{D9EAD3}43.2/48.44(5.24)   \\
                             & MIG       & \cellcolor[HTML]{D9EAD3}66.9/78(11.1)   & \cellcolor[HTML]{D9EAD3}60.5/74.7(14.2) & \cellcolor[HTML]{D9EAD3}52.5/68.2(15.7) & \cellcolor[HTML]{D9EAD3}71.7/81.7(10.0) & \cellcolor[HTML]{D9EAD3}68.5/81.1(12.6) & \cellcolor[HTML]{D9D9D9}-               & \cellcolor[HTML]{D9EAD3}22.2/29.9(7.7) & \cellcolor[HTML]{D9EAD3}31/36.3(5.3)    & \cellcolor[HTML]{D9EAD3}53.33/64.27(10.94) \\
                             & MI-FGSM   & \cellcolor[HTML]{D9EAD3}54.6/67.6(13)   & \cellcolor[HTML]{D9EAD3}48.3/63(14.7)   & \cellcolor[HTML]{D9EAD3}40/53.3(13.3)   & \cellcolor[HTML]{D9EAD3}57.2/70.9(13.7) & \cellcolor[HTML]{D9EAD3}53.0/68.8(15.8) & \cellcolor[HTML]{D9D9D9}-               & \cellcolor[HTML]{D9EAD3}19.4/24.9(5.5) & \cellcolor[HTML]{D9EAD3}24.9/34.9(10)   & \cellcolor[HTML]{D9EAD3}42.49/54.77(12.29) \\
                             & PGD       & \cellcolor[HTML]{D9EAD3}22.6/42.9(20.3) & \cellcolor[HTML]{D9EAD3}18.8/37.4(18.6) & \cellcolor[HTML]{D9EAD3}10.2/24.3(14.1) & \cellcolor[HTML]{D9EAD3}25.5/44.3(18.8) & \cellcolor[HTML]{D9EAD3}21.7/41.7(20.0) & \cellcolor[HTML]{D9D9D9}-               & \cellcolor[HTML]{D9EAD3}9.2/12.8(3.6)  & \cellcolor[HTML]{D9EAD3}8.4/16.6(8.2)   & \cellcolor[HTML]{D9EAD3}16.63/31.43(14.8)  \\
                             & PGN       & \cellcolor[HTML]{D9EAD3}29.8/34.6(4.8)  & \cellcolor[HTML]{D9EAD3}26.1/28.6(2.5)  & \cellcolor[HTML]{D9EAD3}16.7/20.2(3.5)  & \cellcolor[HTML]{D9EAD3}36.5/39.7(3.2)  & \cellcolor[HTML]{D9EAD3}32.7/35.3(2.6)  & \cellcolor[HTML]{D9D9D9}-               & \cellcolor[HTML]{D9EAD3}16.6/17.6(1)   & \cellcolor[HTML]{D9EAD3}12.5/14(1.5)    & \cellcolor[HTML]{D9EAD3}24.41/27.14(2.73)  \\
                             & SIA       & \cellcolor[HTML]{D9EAD3}94.7/95.4(0.7)  & \cellcolor[HTML]{D9EAD3}95.7/96.1(0.4)  & \cellcolor[HTML]{D9EAD3}91.3/93(1.7)    & \cellcolor[HTML]{D9EAD3}95.9/96.9(1.0)  & \cellcolor[HTML]{D9EAD3}95.9/96.3(0.4)  & \cellcolor[HTML]{D9D9D9}-               & \cellcolor[HTML]{D9EAD3}46.3/47(0.7)   & \cellcolor[HTML]{D9EAD3}71.9/75.6(3.7)  & \cellcolor[HTML]{D9EAD3}84.53/85.76(1.23)  \\
                             & SINI-FGSM & \cellcolor[HTML]{D9EAD3}67.8/81.6(13.8) & \cellcolor[HTML]{D9EAD3}62.5/79.9(17.4) & \cellcolor[HTML]{D9EAD3}53.8/73.6(19.8) & \cellcolor[HTML]{D9EAD3}72.4/86.8(14.4) & \cellcolor[HTML]{D9EAD3}68.3/84.2(15.9) & \cellcolor[HTML]{D9D9D9}-               & \cellcolor[HTML]{D9EAD3}22.6/27.9(5.3) & \cellcolor[HTML]{D9EAD3}28.3/38.6(10.3) & \cellcolor[HTML]{D9EAD3}53.67/67.51(13.84) \\
                             & SSA       & \cellcolor[HTML]{D9EAD3}90.3/92.4(2.1)  & \cellcolor[HTML]{D9EAD3}89.2/91.5(2.3)  & \cellcolor[HTML]{D9EAD3}86.4/89(2.6)    & \cellcolor[HTML]{D9EAD3}92.7/93.4(0.7)  & \cellcolor[HTML]{D9EAD3}91.7/92.9(1.2)  & \cellcolor[HTML]{D9D9D9}-               & \cellcolor[HTML]{D9EAD3}62.9/68.2(5.3) & \cellcolor[HTML]{D9EAD3}59.6/66.1(6.5)  & \cellcolor[HTML]{D9EAD3}81.83/84.79(2.96)  \\
\multirow{-10}{*}{Res-152}   & TI-FGSM   & \cellcolor[HTML]{D9EAD3}37.3/43(5.7)    & \cellcolor[HTML]{D9EAD3}38.2/43.9(5.7)  & \cellcolor[HTML]{D9EAD3}21.7/30(8.3)    & \cellcolor[HTML]{D9EAD3}38.3/44.4(6.1)  & \cellcolor[HTML]{D9EAD3}35.4/39.6(4.2)  & \cellcolor[HTML]{D9D9D9}-               & \cellcolor[HTML]{D9EAD3}24.1/29.9(5.8) & \cellcolor[HTML]{D9EAD3}16.7/21.9(5.2)  & \cellcolor[HTML]{D9EAD3}30.24/36.1(5.86)   \\ \midrule
                             & BIM       & \cellcolor[HTML]{D9EAD3}30.4/33.8(3.4)  & \cellcolor[HTML]{D9EAD3}24.3/26.6(2.3)  & \cellcolor[HTML]{D9EAD3}17.4/20.5(3.1)  & \cellcolor[HTML]{D9EAD3}28.4/29.8(1.4)  & \cellcolor[HTML]{D9EAD3}25.4/29.4(4.0)  & \cellcolor[HTML]{D9EAD3}23.5/26.5(3)    & \cellcolor[HTML]{D9D9D9}-              & \cellcolor[HTML]{D9EAD3}22.8/25(2.2)    & \cellcolor[HTML]{D9EAD3}24.6/27.37(2.77)   \\
                             & DI-FGSM   & \cellcolor[HTML]{D9EAD3}49.9/52.1(2.2)  & \cellcolor[HTML]{D9EAD3}44.2/47.2(3)    & \cellcolor[HTML]{D9EAD3}38.3/40.7(2.4)  & \cellcolor[HTML]{D9EAD3}46.7/49.9(3.2)  & \cellcolor[HTML]{D9EAD3}43.2/47.9(4.7)  & \cellcolor[HTML]{D9EAD3}40.6/44.9(4.3)  & \cellcolor[HTML]{D9D9D9}-              & \cellcolor[HTML]{D9EAD3}42.5/48(5.5)    & \cellcolor[HTML]{D9EAD3}43.63/47.24(3.61)  \\
                             & MIG       & \cellcolor[HTML]{D9EAD3}67.9/70.5(2.6)  & \cellcolor[HTML]{D9EAD3}62.1/63.8(1.7)  & \cellcolor[HTML]{D9EAD3}55.8/57.1(1.3)  & \cellcolor[HTML]{D9EAD3}66.0/67.5(1.5)  & \cellcolor[HTML]{D9EAD3}62.4/63.8(1.4)  & \cellcolor[HTML]{D9EAD3}61.1/61.9(0.8)  & \cellcolor[HTML]{D9D9D9}-              & \cellcolor[HTML]{D9EAD3}54.3/56.9(2.6)  & \cellcolor[HTML]{D9EAD3}61.37/63.07(1.7)   \\
                             & MI-FGSM   & \cellcolor[HTML]{D9EAD3}57.7/59(1.3)    & \cellcolor[HTML]{D9EAD3}53.8/54.1(0.3)  & \cellcolor[HTML]{D9EAD3}45.9/47.7(1.8)  & \cellcolor[HTML]{D9EAD3}56.4/56.4(0.0)  & \cellcolor[HTML]{D9EAD3}54.0/54.1(0.1)  & \cellcolor[HTML]{D9EAD3}50.7/52.2(1.5)  & \cellcolor[HTML]{D9D9D9}-              & \cellcolor[HTML]{D9EAD3}48.6/50.7(2.1)  & \cellcolor[HTML]{D9EAD3}52.44/53.46(1.01)  \\
                             & PGD       & \cellcolor[HTML]{D9EAD3}28/30.9(2.9)    & \cellcolor[HTML]{D9EAD3}21.2/23.5(2.3)  & \cellcolor[HTML]{D9EAD3}15.6/18.3(2.7)  & \cellcolor[HTML]{D9EAD3}27.1/29.7(2.6)  & \cellcolor[HTML]{D9EAD3}23.4/26.6(3.2)  & \cellcolor[HTML]{D9EAD3}22.2/24.9(2.7)  & \cellcolor[HTML]{D9D9D9}-              & \cellcolor[HTML]{D9EAD3}21.1/24.2(3.1)  & \cellcolor[HTML]{D9EAD3}22.66/25.44(2.79)  \\
                             & PGN       & \cellcolor[HTML]{D9EAD3}37.2/43.9(6.7)  & \cellcolor[HTML]{D9EAD3}31.8/35.9(4.1)  & \cellcolor[HTML]{D9EAD3}22.9/29.8(6.9)  & \cellcolor[HTML]{D9EAD3}34.2/41.4(7.2)  & \cellcolor[HTML]{D9EAD3}31.2/37.4(6.2)  & \cellcolor[HTML]{D9EAD3}30.2/36(5.8)    & \cellcolor[HTML]{D9D9D9}-              & \cellcolor[HTML]{D9EAD3}28.1/34.6(6.5)  & \cellcolor[HTML]{D9EAD3}30.8/37.0(6.2)     \\
                             & SIA       & \cellcolor[HTML]{D9EAD3}89.9/90.5(0.6)  & \cellcolor[HTML]{F4CCCC}87.6/87.2(-0.4) & \cellcolor[HTML]{D9EAD3}84.2/86(1.8)    & \cellcolor[HTML]{F4CCCC}88.6/88.5(-0.1) & \cellcolor[HTML]{D9EAD3}87.4/87.6(0.2)  & \cellcolor[HTML]{F4CCCC}85.9/85.8(-0.1) & \cellcolor[HTML]{D9D9D9}-              & \cellcolor[HTML]{D9EAD3}89.8/91.1(1.3)  & \cellcolor[HTML]{D9EAD3}87.63/88.1(0.47)   \\
                             & SINI-FGSM & \cellcolor[HTML]{D9EAD3}68.3/70.7(2.4)  & \cellcolor[HTML]{D9EAD3}64.6/66.5(1.9)  & \cellcolor[HTML]{D9EAD3}59.1/59.7(0.6)  & \cellcolor[HTML]{D9EAD3}68.0/68.0(0.0)  & \cellcolor[HTML]{D9EAD3}64.0/64.5(0.5)  & \cellcolor[HTML]{F4CCCC}60.5/60.4(-0.1) & \cellcolor[HTML]{D9D9D9}-              & \cellcolor[HTML]{D9EAD3}54.8/57.5(2.7)  & \cellcolor[HTML]{D9EAD3}62.76/63.9(1.14)   \\
                             & SSA       & \cellcolor[HTML]{D9EAD3}77.2/77.6(0.4)  & \cellcolor[HTML]{D9EAD3}72.1/74.4(2.3)  & \cellcolor[HTML]{D9EAD3}70.3/70.5(0.2)  & \cellcolor[HTML]{F4CCCC}76.1/75.9(-0.2) & \cellcolor[HTML]{D9EAD3}73.3/74.9(1.6)  & \cellcolor[HTML]{D9EAD3}71.9/74(2.1)    & \cellcolor[HTML]{D9D9D9}-              & \cellcolor[HTML]{D9EAD3}62.8/65.2(2.4)  & \cellcolor[HTML]{D9EAD3}71.96/73.21(1.26)  \\
\multirow{-10}{*}{ViT-B/16}  & TI-FGSM   & \cellcolor[HTML]{D9EAD3}32.4/35.2(2.8)  & \cellcolor[HTML]{D9EAD3}29.6/33.2(3.6)  & \cellcolor[HTML]{D9EAD3}19.3/22.8(3.5)  & \cellcolor[HTML]{D9EAD3}26.7/30.4(3.7)  & \cellcolor[HTML]{D9EAD3}26.1/28.9(2.8)  & \cellcolor[HTML]{D9EAD3}26/29.3(3.3)    & \cellcolor[HTML]{D9D9D9}-              & \cellcolor[HTML]{D9EAD3}24.1/28.3(4.2)  & \cellcolor[HTML]{D9EAD3}26.31/29.73(3.41)  \\ \bottomrule
\end{tabular}%
}
\end{table*}

Experimental results are shown in Tables~\ref{tab:nodefense} and~\ref{tab:defense}. Each table's results are separated by a forward slash, with the left side representing the original attack success rates of baseline methods on various models, and the right side displaying the success rates after applying our DBA algorithm. The values in parentheses indicate the difference in attack success rates between with and without the use of the DBA method. The last column presents the average attack success rates for both the baseline methods and DBA. For additional experimental data, please refer to the appendix.

In Table~\ref{tab:nodefense}, the transferability of nearly all attack methods on models without defense training is significantly enhanced under the black-box setting of DBA, with results up to 97\%. Only in a minority of cases does DBA's performance slightly decline. Specifically, when using Res-152 as the surrogate model, DBA exhibited higher improvements than the other three models, with an average increase of 8.66\%, possibly due to the more general decision boundaries of Res-152. Meanwhile, other three models showed average improvements of 5.41\%, 3.45\%, and 2.44\%, respectively.

For models with defense training, DBA also sustained significant performance enhancements. As depicted in Table~\ref{tab:defense}, DBA continued to be highly effective in these more challenging scenarios, achieving effectiveness in 96\% of cases. With defensively trained models, the average Attack Success Rate (ASR) increased by 2.81\% after applying DBA.

It is important to note that while the improvement in transferability for certain methods like SIA and SSA might not appear significant, the consistency in the enhancements provided by our DBA method across various attack methods is crucial. This consistency is valuable, as it indicates that our work can be effectively integrated with future algorithms and methods. Such a feature of consistency is commonly appreciated in fields like object detection, evident in the updates within the YOLO series. Our approach lays a foundational strategy that can be universally applied, underscoring its potential for broad applicability and adaptability in advancing adversarial attack methodologies.

\subsection{Ablation Experiment}
% In this section, we present the ablation study of DBA. Due to the space limits, the full diagrams of the results are included in the appendix.
\begin{figure}[htpb]
    \centering
    \includegraphics[width=\linewidth]{images/LR_ab.pdf}
    \caption{Performance of different attack methods under the influence of various learning rates using the proposed DBA technique on ResNet-152 as the source model.}
    \label{appfig:LR_ab}
\end{figure}
\begin{figure}[htpb]
    \centering
    \includegraphics[width=.9\linewidth]{images/Strategy_ab.pdf}
    \caption{Comparative summary of attack method performances  using the ResNet-152 source model under four distinct training strategies with the implementation of the proposed DBA technique. The strategies include Early Phase (First 50\%), Late Phase (Last 50\%), Alternate Steps (Every 1-step interval), and Full Training (All steps), each demonstrating the effectiveness of the DBA technique in enhancing attack robustness across different phases of training.}
    \label{appfig:St_ab}
\end{figure}
\subsubsection{Impact of Learning Rates on DBA Performance}
In this section, ResNet-152 is employed as the source model to investigate the impact of different learning rates on the performance of DBA. The learning rates are set to 0.001, 0.0001, and 0.00001, respectively. As shown in Figure~\ref{appfig:LR_ab}, it is observed that for enhancing the BIM attack, the learning rate of 0.001 leads to the best performance, while for the TI-FGSM and SSA methods, a learning rate of 0.0001 yields the optimal results. 

\subsubsection{Impact of Training Strategies on DBA Performance}
In this section, ResNet-152 is employed as the source model to investigate the influence of four distinct training strategies on DBA performance. The strategies consist of training for the first 50\% steps, training for the last 50\% steps, training at every 1-step interval, and training all steps. The unified learning rate is set to 0.0001. As shown in Figure~\ref{appfig:St_ab}, when enhancing the BIM attack, the strategies of training for the first 50\% steps and all steps consistently better. On the other hand, for enhancing the TI-FGSM and SSA methods, the performance of training all steps significantly outperform the other three strategies.

\section{Conclusion}\label{sec:conclusion}
% In this paper, we introduced Decision Boundary Adaptation approach, a straightforward, efficient, and easily implementable solution to enhance the transferability of adversarial attacks against various DNN models. Our method leverages the idea that, the decision boundary of the target model may share some similarities with those of the source model, therefore the manipulation of decision boundary can improve the transferability of adversarial examples. We have evaluated our method on several state-of-the-art adversarial attack methods and demonstrate the effectiveness in improving the black-box attack capability of all adversarial attack methods. Our results indicate that, DBA can significantly enhance the transferability of adversarial examples and achieve state-of-the-art performance on different DNN models. However, we acknowledge that DBA introduces some additional computational overhead. While this overhead is manageable and justified by the performance gain, it may present challenges when deployed on servers with limited computing resources. In conclusion, our proposed DBA method offers valuable insights for future research to further enhance the robustness of DNN models against adversarial attacks, particularly in the context of black-box attacks.
In this paper, we introduced the Decision Boundary Adaptation (DBA) approach, a simple, efficient, and easily implementable method to enhance adversarial attack transferability across DNN models. DBA exploits the similarity between source and target model decision boundaries, improving adversarial example transferability through boundary manipulation. We evaluated DBA on state-of-the-art adversarial attacks, demonstrating its effectiveness in enhancing black-box attack performance. Our results show that DBA significantly improves adversarial transferability, achieving state-of-the-art performance. However, DBA introduces additional computational overhead ($\sim5\%$ on average), which, though manageable, may challenge deployment on resource-limited servers. Overall, DBA provides valuable insights for future research on strengthening DNN robustness against black-box adversarial attacks.

% References
\bibliography{uai2025}

\newpage

\onecolumn
\appendix
\section{\emph{Proof} of Theorem~\ref{theorem1}}

\begin{proof}[Proof of Theorem~\ref{theorem1}]
Consider an infinitesimal $\mathcal{O}$.

Given $f(x + \epsilon  \cdot  \Delta x) = f(x) + \epsilon \cdot \Delta x \cdot \frac{\partial f(x)}{\partial x} + \mathcal{O}$, we can express the incremental change in $f(x)$ due to a small perturbation $\epsilon  \cdot  \Delta x$ as:

\begin{equation}
\frac{f(x + \epsilon  \cdot  \Delta x) - f(x)}{\epsilon} = \Delta x \cdot \frac{\partial f(x)}{\partial x} + \frac{\mathcal{O}}{\epsilon}
\end{equation}

This expression implies that the rate of change of $f(x)$ with respect to $\epsilon$ approximates the directional derivative along $\Delta x$, with $\mathcal{O}/\epsilon$ representing the higher-order terms that become negligible as $\epsilon \rightarrow 0$.

To prove $\frac{f(x + \epsilon \cdot \Delta x) - f(x-\epsilon  \cdot \Delta x )}{2\epsilon }$ follows similarly, we utilize the symmetric difference quotient, which accounts for the function value at $x + \epsilon \cdot \Delta x$ and $x - \epsilon \cdot \Delta x$, providing a more accurate approximation of the derivative:

\begin{equation}
\frac{f(x + \epsilon \cdot \Delta x) - f(x-\epsilon  \cdot \Delta x )}{2\epsilon } = \Delta x \cdot \frac{\partial f(x)}{\partial x} + \frac{\mathcal{O}}{\epsilon}
\end{equation}

This symmetric formulation further reduces the error in the approximation, leading to a more accurate estimation of the directional derivative in the limit as $\epsilon \rightarrow 0$.

\end{proof}

\section{\emph{Proof} of Theorem~\ref{Theorem2}:}
\begin{proof}[Proof of Theorem~\ref{Theorem2}]
The Fisher information matrix $\mathcal{I}_\theta (x)$ can be expressed as:
\begin{equation}
\mathcal{I}_\theta (x) = E_{y^c \sim P(y|x,\theta )} \left [ \frac{\partial \log P_{y^c} (x,\theta )^{\top }}{\partial x} \cdot \frac{\partial \log P_{y^c} (x,\theta )}{\partial x}  \right ]
\end{equation}

This can be expanded as:
\begin{equation}
\mathcal{I}_\theta (x) = \sum_{i=1}^{c} \frac{\partial \log P_{y^c} (x,\theta )^{\top }}{\partial x} \cdot \frac{\partial \log P_{y^c} (x,\theta )}{\partial x} \cdot P_{y^c} (x,\theta )  
\end{equation}

Considering the expectation of the second derivative of $P_{y^c} (x,\theta)$ with respect to $x$ normalized by $P_{y^c} (x,\theta)$:
\begin{equation}
E_{y^c \sim P(y|x,\theta )} \left [  \frac{\partial ^2 P_{y^c} (x,\theta)}{\partial x^2} \cdot \frac{1}{P_{y^c} (x,\theta)}  \right] = \frac{\partial ^2}{\partial  x^2} \sum P_{y^c} (x,\theta) = 0
\end{equation}

The second derivative of $\log P_{y^c} (x,\theta)$ is given by:
\begin{equation}
\frac{\partial ^2}{\partial  x^2} \log P_{y^c} (x,\theta) = \frac{\frac{\partial ^2 P_{y^c} (x, \theta )}{\partial x^2} }{P_{y^c} (x, \theta )} - \frac{\left(\frac{\partial P_{y^c} (x, \theta )}{\partial x}\right)^{\top} \cdot \frac{\partial P_{y^c} (x, \theta )}{\partial x} }{P_{y^c}^2(x,\theta )}
\end{equation}

Therefore, the expectation of the product of the gradients of the log likelihoods is equivalent to the negative expectation of the Hessian matrix of the log likelihood:
\begin{equation}
E_{y^c \sim P(y|x,\theta )} \left [ \frac{\partial \log P_{y^c} (x,\theta )^{\top }}{\partial x} \cdot \frac{\partial \log P_{y^c} (x,\theta )}{\partial x}  \right ] = - E_{y^c \sim P(y|x,\theta )} \left [ \mathrm{H}^c  \right ]
\end{equation}

By integrating over the conditional distribution and approximating for all class labels $c$, we can rewrite the Fisher information matrix as:
\begin{equation}
\begin{aligned}
\mathcal{I}_\theta (x) &= \sum_{c=1}^{C} \frac{\partial \log P_{y^c} (x,\theta )}{\partial x} \cdot P_{y^c} (x,\theta ) \cdot \left(\frac{\partial \log P_{y^c} (x,\theta )}{\partial x}\right)^{\top} \\
\end{aligned}
\end{equation}

\end{proof}

\section{\emph{Proof} of Theorem~\ref{theorem3}:}

\begin{proof}[Proof of Theorem~\ref{theorem3}]

The Fisher information matrix $\mathcal{I}_\theta (x)$ is given by:
\begin{equation}
\begin{aligned}
\mathcal{I}_\theta (x) &= E_{y^c \sim P(y|x,\theta )} \left[ \frac{\partial \log P_{y^c} (x,\theta )^{\top }}{\partial x} \cdot \frac{\partial \log P_{y^c} (x,\theta )}{\partial x} \right] \\
&= \sum_{c=1}^{C} P_{y^c} (x,\theta ) \cdot \frac{\partial \log P_{y^c}(x,\theta )^{\top }}{\partial x} \cdot \frac{\partial \log P_{y^c} (x,\theta )}{\partial x} \\
&= \sum_{c=1}^{C} \frac{\partial P_{y^c}(x,\theta )^{\top }}{\partial x} \cdot \frac{\partial \log P_{y^c} (x,\theta )}{\partial x}
\end{aligned}
\end{equation}

The trace of $\mathcal{I}_\theta (x)$ is:
\begin{equation}
\text{tr}(\mathcal{I}_\theta (x)) = \sum_{i=1}^{m} e^i \mathcal{I}_\theta (x) e^i
\end{equation}

Considering the flatness of $\mathcal{I}_\theta (x)$:
\begin{equation}
\begin{aligned}
flat(\mathcal{I}_\theta (x)) &= \Delta x \mathcal{I}_\theta (x) \Delta x^{\top} \\
&= \sum_{c=1}^{C} \Delta x \cdot \frac{\partial P_{y^c} (x,\theta )^{\top }}{\partial x} \cdot \Delta x \cdot \frac{\partial \log P_{y^c} (x,\theta )}{\partial x} \\
&= \left \langle \Delta x \cdot  \frac{\partial P_y(x,\theta )}{\partial x}, \Delta x \cdot \frac{\partial \log  P_y(x,\theta )}{\partial x}  \right \rangle
\end{aligned}
\end{equation}

Given:
\begin{equation}
\frac{\Delta x \cdot  \frac{\partial P_y(x,\theta )}{\partial x}}{2 \epsilon }  \approx \frac{1}{2 \epsilon } \left( \frac{\partial P_y(x+ \epsilon \cdot \Delta x, \theta )}{\partial x} - \frac{\partial P_y(x - \epsilon \cdot \Delta x, \theta )}{\partial x} \right)
\end{equation}

\begin{equation}
\frac{\Delta x \cdot \frac{\partial \log  P_y(x,\theta )}{\partial x}}{2\epsilon }  \approx \frac{1}{2\epsilon }\left( \frac{\partial \log  P_y(x+ \epsilon \cdot \Delta x, \theta )}{\partial x} - \frac{\partial \log P_y(x - \epsilon \cdot \Delta x, \theta )}{\partial x} \right)
\end{equation}

Hence, \textbf{Theorem~\ref{theorem3}} is proven as:
\begin{equation}
\begin{aligned}
flat (\mathcal{I}_\theta (x)) &\approx \frac{1}{4 \varepsilon^2} \left \langle \frac{\partial P_y(x+\varepsilon \cdot \Delta x, \theta)}{\partial x} - \frac{\partial P_y(x-\varepsilon \cdot \Delta x, \theta)}{\partial x}, \right. \\
&\quad \left. \frac{\partial \log P_y(x+\varepsilon \cdot \Delta x, \theta)}{\partial x} - \frac{\partial \log P_y(x-\varepsilon \cdot \Delta x, \theta)}{\partial x} \right \rangle \\
&\propto \left \langle \frac{\partial P_y(x+\varepsilon \cdot \Delta x, \theta)}{\partial x} - \frac{\partial P_y(x-\varepsilon \cdot \Delta x, \theta)}{\partial x}, \right. \\
&\quad \left. \frac{\partial \log P_y(x+\varepsilon \cdot \Delta x, \theta)}{\partial x} - \frac{\partial \log P_y(x-\varepsilon \cdot \Delta x, \theta)}{\partial x} \right \rangle
\end{aligned}
\end{equation}
The same can be proven for $flat (\mathcal{I}_x (\theta))$.

\end{proof}

\section{\emph{Proof} of the Relationship between KL Divergence and Fisher Information Matrix}
\begin{proof}[Proof of the Relationship between KL Divergence and Fisher Information Matrix]
    

The KL divergence between $P_y(x,\theta)$ and $P_y(x+\Delta x, \theta)$ can be approximated as:
\[
KL(P_y(x,\theta ), P_y(x+\Delta x, \theta )) = \frac{1}{2} \Delta x^\top \cdot  \left ( \frac{\partial^2 KL(P_y(x,\theta ), P_y(x+\Delta x, \theta )) }{\partial (x + \Delta x)_i \partial (x + \Delta x)_j}  \right )_{\Delta x \rightarrow 0} \Delta x + \mathcal{O}
\]

The second derivative of the KL divergence at $\Delta x \rightarrow 0$ is given by:
\[
\left ( \frac{\partial^2 KL(P_y(x,\theta ), P_y(x+\Delta x, \theta )) }{\partial (x + \Delta x)_i \partial (x + \Delta x)_j}  \right )_{\Delta x \rightarrow 0} = -\sum_{c=1}^{C} P_y(x,\theta )\cdot  \left ( \frac{\partial^2 \log P_y (x + \Delta x, \theta )}{\partial (x+\Delta x)_i \partial (x+\Delta x)_j} \right )_{\Delta x \rightarrow 0} = [\mathcal{I}(\theta ) ]_{ij}
\]
\end{proof}
This result demonstrates the equivalence between the second derivative of the KL divergence and the Fisher Information Matrix $\mathcal{I}(\theta)$, thereby establishing the relationship between them.


\section{The Method of Drawing Variation Figure of $flat(\cdot)$}
For the computations depicted in the above figures, we calculated $k$ curves corresponding to the number of samples, normalized all $flat(\cdot)$ values to the 0-1 range using their maximum values, and then plotted the average of these $k$ samples.

\section{Practical Implementation Steps of DBA}

More detailed explanations regarding the practical implementation of DBA. The core implementation of DBA is centered around gradient descent operations to adjust model parameters, optimizing the flatness of decision boundaries. Specifically, the process is as follows:

\begin{itemize}
    \item \textbf{Gradient Information Computation:} Compute gradient information for the sample to assess the properties of the decision boundary in its vicinity.
    \item \textbf{Parameter Update:} Use the computed gradients to update the model parameters via gradient descent, ensuring that samples move away from unstable regions near the decision boundary during the attack process.
    \item \textbf{Iterative Optimization:} Repeat this process iteratively to enhance the stability of attack directions and improve the transferability of adversarial examples.
\end{itemize}


This method allows DBA to maintain the efficiency of existing adversarial attack algorithms while significantly enhancing cross-model attack capabilities through parameter optimization. We validated this implementation extensively in our experiments, demonstrating that DBA consistently and reliably improves the performance of existing methods.

\section{DBA implementation} \label{apx:impl}
\begin{algorithm}[htbp]

    \renewcommand{\algorithmicrequire}{\textbf{Input:}} 
    \renewcommand{\algorithmicensure}{\textbf{Output:}} 
    \caption{Decision Boundary Adaptation Algorithm}
    \label{alg:cap}
    \label{code}
    % \label{alg1}
    \begin{algorithmic}[1] 
        \REQUIRE parameter of the source model $\theta$, input $x$, target $y$, learning rate of model boundaries $\alpha_1$, learning rate of perturbations $\alpha_2$, warm up step $w$

        \ENSURE $x_I$
        \STATE $Init$ $ x_0 \leftarrow x$, $\theta_0 \leftarrow \theta$
    \FOR{$ i = 1 \cdots I $}   
    \small
        \STATE $\theta^i = \theta^{i-1} + \left[ \left[ i \geq {w} \right] \right] \cdot \alpha_1 \left( \frac{\partial L\left(y \mid x_{i-1}, \theta^{i-1}\right)}{\partial \theta^{i-1}} \right) $
        \STATE $\eta_i=\alpha_2 \cdot \operatorname{sign}\left( \frac{\partial L\left(y \mid x_{i-1}, \theta^{i-1}\right)}{\partial\left(x_{i-1}+\eta_{i-1}\right)}\right)$ 
    \ENDFOR
    
    \STATE $x_I=x_{i-1} + \eta_i = x_0 + \sum\limits_{j=1}^i \eta_j = x_0 + \eta^i$
    \end{algorithmic} 
\end{algorithm}


\section{Parameters Setting} \label{apx:setup}
All our experiments are conducted on one NVIDIA RTX 6000 Ada graphics card. In our experiments across all models, we set the maximum perturbation of all algorithms to $\frac{16}{255}$, with an attack step size of 10. Notably, we set the batch size to 1; the appendix discusses other batch sizes, highlighting the optimal performance enhancement of DBA at a batch size of 1. Additionally, specific parameters were allocated to each baseline attack method to ensure evaluation consistency and fairness. For the DI-FGSM method, we set the decay parameter to 0, the resize rate to 0.9, and the diversity probability to 0.5. Similarly, for the TI-FGSM method, the decay parameter was set to 0, employing a Gaussian kernel with a length of 15 and a standard deviation of 3. The resize rate and diversity probability were set to 0.9 and 0.5, respectively. For the MI-FGSM method, the decay parameter was set to 1. For the SINI-FGSM method, the decay parameter was set to 1, with parameter $m$ set to 5. For the SSA method, we used a kernel length of 7, standard deviation of 3, momentum of 1, $N$ of 20, $\sigma$ of 16, and $\rho$ of 0.5.

\section{Result of DBA with expended Attack Method}

We have supplemented our experiments with the RAP~\citep{qin2022boosting} method by evaluating its performance both with and without the integration of DBA in Table~\ref{tab:RAP}. The results are summarized in the table below. Using Inc-v3 as the source model, it is evident that our DBA method consistently enhances the transferability of RAP. In nearly all black-box models, performance improvements are observed, with an average improvement of 0.66\%.
% Please add the following required packages to your document preamble:
% \usepackage{booktabs}
% \usepackage{graphicx}
\begin{table}[h]
\centering
\caption{Result of RAP with and without DBA}
\label{tab:RAP}
\resizebox{.25\linewidth}{!}{%
\begin{tabular}{@{}c|ccc@{}}
\toprule
            & RAP     & \begin{tabular}[c]{@{}c@{}}RAP \\ with DBA\end{tabular} & Gap     \\ \midrule
Inc-v3      & 95.20\% & 95.10\%                                                 & -0.10\% \\
Inc-v4      & 46.00\% & 46.90\%                                                 & 0.90\%  \\
IncRes-v2   & 44.00\% & 45.10\%                                                 & 1.10\%  \\
Res-50      & 45.70\% & 45.70\%                                                 & 0.00\%  \\
Res-101     & 41.40\% & 41.70\%                                                 & 0.30\%  \\
Res-152     & 39.80\% & 40.70\%                                                 & 0.90\%  \\
Inc-v3-ens3 & 22.30\% & 23.80\%                                                 & 1.50\%  \\
Inc-v3-ens4 & 22.10\% & 22.80\%                                                 & 0.70\%  \\
IncRes-ens  & 11.70\% & 12.80\%                                                 & 1.10\%  \\
ViT-B/16    & 20.60\% & 20.60\%                                                 & 0.00\%  \\
MaxViT-T    & 21.60\% & 21.70\%                                                 & 0.10\%  \\ \bottomrule
\end{tabular}%
}
\end{table}


\section{Result of DBA with batch gradient descent}
The data in Table~\ref{apptab:nodefense} and Table~\ref{apptab:defense} represent the results of all experiments conducted with a batch size of 10. In scenarios where gradient descent is applied to each sample individually, the batch size does not affect the attack outcome, as the computations for each sample are independent. Furthermore, regarding the optimization method mentioned in Eq.~\ref{eq:batch_size}, we have conducted experiments detailed in this section, which also demonstrate consistent improvement across various settings.


\begin{table}[t]
\centering
\small
\resizebox{\textwidth}{!}{%
\begin{tabular}{@{}l|l|llllll|l@{}}
\toprule
Model                       & Attack    & Inc-v3                                 & Inc-v4                                 & IncRes-v2                              & Res-50                                 & Res-101                                & Res-152                                & Average                                   \\ \midrule
                            & BIM       & {100/98.2\color[HTML]{FE0000}(-1.8)}  & {27.5/32.6\color[HTML]{009901}(5.1)}  & {20.1/24.2\color[HTML]{009901}(4.1)}  & {24.1/29.4\color[HTML]{009901}(5.3)}  & {22.2/24.6\color[HTML]{009901}(2.4)}  & {18/23.5\color[HTML]{009901}(5.5)}    & {35.32/38.75\color[HTML]{009901}(3.43)}  \\
                            & PGD       & {100/100\color[HTML]{009901}(0)}      & {23.8/27.8\color[HTML]{009901}(4)}    & {16.6/19.1\color[HTML]{009901}(2.5)}  & {22.9/23.9\color[HTML]{009901}(1)}    & {19/20.7\color[HTML]{009901}(1.7)}    & {18.3/19.6\color[HTML]{009901}(1.3)}  & {33.43/35.18\color[HTML]{009901}(1.75)}  \\
                            & DI-FGSM   & {99.8/99.7\color[HTML]{FE0000} (-0.1)} & {46/50.5\color[HTML]{009901} (4.5)}    & {38.8/41.3\color[HTML]{009901} (2.5)}  & {37.8/42.8\color[HTML]{009901}(5)}    & {34.2/37.8\color[HTML]{009901} (3.6)}  & {32.7/36\color[HTML]{009901} (3.3)}    & {48.22/51.35\color[HTML]{009901} (3.13)}  \\
                            & TI-FGSM   & {98.6/98.1\color[HTML]{FE0000} (-0.5)} & {32/40.9\color[HTML]{009901} (8.9)}    & {17.3/24.5\color[HTML]{009901} (7.2)}  & {24.8/29.8\color[HTML]{009901}(5)}    & {21.7/25.8\color[HTML]{009901} (4.1)}  & {20.8/23.9\color[HTML]{009901} (3.1)}  & {35.87/40.5\color[HTML]{009901} (4.63)}   \\
                            & MI-FGSM   & {100/99.8\color[HTML]{FE0000} (-0.2)}  & {50.6/64.2\color[HTML]{009901} (13.6)} & {45.7/59\color[HTML]{009901} (13.3)}   & {47.2/58.9\color[HTML]{009901}(11.7)} & {42.2/55.2\color[HTML]{009901} (13)}   & {40.6/54.3\color[HTML]{009901} (13.7)} & {54.38/65.23\color[HTML]{009901} (10.85)} \\
                            & SINI-FGSM & {100/100\color[HTML]{009901} (0)}      & {77.4/88.1\color[HTML]{009901} (10.7)} & {75.2/87.5\color[HTML]{009901} (12.3)} & {73.3/84.6\color[HTML]{009901}(11.3)} & {69.1/82.4\color[HTML]{009901} (13.3)} & {69.4/80.5\color[HTML]{009901} (11.1)} & {77.4/87.18\color[HTML]{009901} (9.78)}   \\
                            & SSA       & {99.3/99.4\color[HTML]{009901} (0.1)}  & {87.5/90.5\color[HTML]{009901} (3)}    & {86.7/88.7\color[HTML]{009901} (2)}    & {81.9/84.6\color[HTML]{009901}(2.7)}  & {79.3/83.7\color[HTML]{009901} (4.4)}  & {80.4/83.3\color[HTML]{009901} (2.9)}  & {85.85/88.37\color[HTML]{009901} (2.53)}  \\
\multirow{-8}{*}{Inc-v3}    & NAA       & {97.4/97.5\color[HTML]{009901} (0.1)}  & {87.8/87.9\color[HTML]{009901} (0.1)}  & {85.8/86.4\color[HTML]{009901} (0.6)}  & {82.8/83.8\color[HTML]{009901}(1)}    & {82.4/82.8\color[HTML]{009901} (0.4)}  & {81.4/81.9\color[HTML]{009901} (0.5)}  & {86.27/86.72\color[HTML]{009901} (0.45)}  \\ \midrule
                            & BIM       & {36.4/41.5\color[HTML]{009901} (5.1)}  & {99.9/99.6\color[HTML]{FE0000} (-0.3)} & {18.9/26.5\color[HTML]{009901} (7.6)}  & {23.4/28\color[HTML]{009901}(4.6)}    & {20.5/25.9\color[HTML]{009901} (5.4)}  & {19.3/23.7\color[HTML]{009901} (4.4)}  & {36.4/40.87\color[HTML]{009901} (4.47)}   \\
                            & PGD       & {33.4/36.6\color[HTML]{009901} (3.2)}  & {99.9/99.3\color[HTML]{FE0000} (-0.6)} & {14.3/21.4\color[HTML]{009901} (7.1)}  & {21.3/25.8\color[HTML]{009901}(4.5)}  & {17.7/20.2\color[HTML]{009901} (2.5)}  & {17.5/20\color[HTML]{009901} (2.5)}    & {34.02/37.22\color[HTML]{009901} (3.2)}   \\
                            & DI-FGSM   & {54.9/59.3\color[HTML]{009901} (4.4)}  & {99.5/98.6\color[HTML]{FE0000} (-0.9)} & {37.4/43.8\color[HTML]{009901} (6.4)}  & {35.7/43.9\color[HTML]{009901}(8.2)}  & {32.3/38.4\color[HTML]{009901} (6.1)}  & {31/39.2\color[HTML]{009901} (8.2)}    & {48.47/53.87\color[HTML]{009901} (5.4)}   \\
                            & TI-FGSM   & {38.7/45.9\color[HTML]{009901} (7.2)}  & {97.8/97.2\color[HTML]{FE0000} (-0.6)} & {17.6/25.3\color[HTML]{009901} (7.7)}  & {23.9/28.6\color[HTML]{009901}(4.7)}  & {20.5/24.9\color[HTML]{009901} (4.4)}  & {21.2/24.3\color[HTML]{009901} (3.1)}  & {36.62/41.03\color[HTML]{009901} (4.41)}  \\
                            & MI-FGSM   & {60.9/68.1\color[HTML]{009901} (7.2)}  & {99.8/99.8\color[HTML]{009901} (0)}    & {45.9/53.6\color[HTML]{009901} (7.7)}  & {45.6/52.4\color[HTML]{009901}(6.8)}  & {42.2/47.1\color[HTML]{009901} (4.9)}  & {41.2/46.9\color[HTML]{009901} (5.7)}  & {55.93/61.32\color[HTML]{009901} (5.39)}  \\
                            & SINI-FGSM & {86.6/92.6\color[HTML]{009901} (6)}    & {100/100\color[HTML]{009901} (0)}      & {78.7/87.2\color[HTML]{009901} (8.5)}  & {77.6/83.8\color[HTML]{009901}(6.2)}  & {73.8/82.2\color[HTML]{009901} (8.4)}  & {72.8/81\color[HTML]{009901} (8.2)}    & {81.58/87.8\color[HTML]{009901} (6.22)}   \\
                            & SSA       & {91.7/91.2\color[HTML]{FE0000} (-0.5)} & {98.9/98.7\color[HTML]{FE0000} (-0.2)} & {86.6/87.9\color[HTML]{009901} (1.3)}  & {82.8/85.7\color[HTML]{009901}(2.9)}  & {80.6/83.1\color[HTML]{009901} (2.5)}  & {82.7/84.7\color[HTML]{009901} (2)}    & {87.22/88.55\color[HTML]{009901} (1.33)}  \\
\multirow{-8}{*}{Inc-v4}    & NAA       & {88/88.2\color[HTML]{009901} (0.2)}    & {98/98\color[HTML]{009901} (0)}        & {83.7/83.9\color[HTML]{009901} (0.2)}  & {82.6/82.2\color[HTML]{FE0000}(-0.4)} & {82.2/82.2\color[HTML]{009901} (0)}    & {79.9/80\color[HTML]{009901} (0.1)}    & {85.73/86.9\color[HTML]{009901} (1.17)}   \\ \midrule
                            & BIM       & {35.4/43.8\color[HTML]{009901} (8.4)}  & {29.3/33.7\color[HTML]{009901} (4.4)}  & {99.2/99.4\color[HTML]{009901} (0.2)}  & {24.5/30.4\color[HTML]{009901}(5.9)}  & {21.2/26.7\color[HTML]{009901} (5.5)}  & {21.4/25\color[HTML]{009901} (3.6)}    & {38.5/43.17\color[HTML]{009901} (4.67)}   \\
                            & PGD       & {30.8/37.9\color[HTML]{009901} (7.1)}  & {23.7/28.4\color[HTML]{009901} (4.7)}  & {99.4/99.5\color[HTML]{009901} (0.1)}  & {21.8/24.8\color[HTML]{009901}(3)}    & {19.7/22.1\color[HTML]{009901} (2.4)}  & {17.7/19.4\color[HTML]{009901} (1.7)}  & {35.52/38.68\color[HTML]{009901} (3.16)}  \\
                            & DI-FGSM   & {56.7/63.8\color[HTML]{009901} (7.1)}  & {49.4/59.2\color[HTML]{009901} (9.8)}  & {98/98.7\color[HTML]{009901} (0.7)}    & {41.7/48.7\color[HTML]{009901}(7)}    & {38.4/44.1\color[HTML]{009901} (5.7)}  & {38.6/45\color[HTML]{009901} (6.4)}    & {53.8/59.92\color[HTML]{009901} (6.12)}   \\
                            & TI-FGSM   & {42.7/51.9\color[HTML]{009901} (9.2)}  & {41.7/49.2\color[HTML]{009901} (7.5)}  & {94.7/94.9\color[HTML]{009901} (0.2)}  & {30.9/37.7\color[HTML]{009901}(6.8)}  & {29.6/35.4\color[HTML]{009901} (5.8)}  & {28.7/36.3\color[HTML]{009901} (7.6)}  & {44.72/50.9\color[HTML]{009901} (6.18)}   \\
                            & MI-FGSM   & {61.4/68.9\color[HTML]{009901} (7.5)}  & {53/60.8\color[HTML]{009901} (7.8)}    & {99.1/99.5\color[HTML]{009901} (0.4)}  & {48.2/55.3\color[HTML]{009901}(7.1)}  & {46.6/51.5\color[HTML]{009901} (4.9)}  & {44.7/50.1\color[HTML]{009901} (5.4)}  & {58.83/64.35\color[HTML]{009901} (5.52)}  \\
                            & SINI-FGSM & {87.4/92.2\color[HTML]{009901} (4.8)}  & {83.6/88\color[HTML]{009901} (4.4)}    & {99.9/99.9\color[HTML]{009901} (0)}    & {79.2/84.2\color[HTML]{009901}(5)}    & {77.3/84.9\color[HTML]{009901} (7.6)}  & {75/81.9\color[HTML]{009901} (6.9)}    & {83.73/88.52\color[HTML]{009901} (4.79)}  \\
                            & SSA       & {89.4/90.5\color[HTML]{009901} (1.1)}  & {89.4/89.5\color[HTML]{009901} (0.1)}  & {97.7/97.8\color[HTML]{009901} (0.1)}  & {85.9/87.3\color[HTML]{009901}(1.4)}  & {83.9/85.6\color[HTML]{009901} (1.7)}  & {85.7/86.3\color[HTML]{009901} (0.6)}  & {88.67/89.5\color[HTML]{009901} (0.83)}   \\
\multirow{-8}{*}{IncRes-v2} & NAA       & {83.9/84.3\color[HTML]{009901} (0.4)}  & {80.5/81.2\color[HTML]{009901} (0.7)}  & {92.2/92.5\color[HTML]{009901} (0.3)}  & {79.5/79.8\color[HTML]{009901}(0.3)}  & {78.1/78.6\color[HTML]{009901} (0.5)}  & {77.4/77.1\color[HTML]{FE0000} (-0.3)} & {81.93/82.25\color[HTML]{009901} (0.32)}  \\ \midrule
                            & BIM       & {25.6/47.8\color[HTML]{009901} (22.2)} & {21.6/42.2\color[HTML]{009901} (20.6)} & {11.7/31.5\color[HTML]{009901} (19.8)} & {28.6/48.6\color[HTML]{009901}(20)}   & {26.1/45.5\color[HTML]{009901} (19.4)} & {24/44.2\color[HTML]{009901} (20.2)}   & {22.93/43.3\color[HTML]{009901} (20.37)}  \\
                            & PGD       & {22.9/39.4\color[HTML]{009901} (16.5)} & {18.4/32\color[HTML]{009901} (13.6)}   & {10.5/23.1\color[HTML]{009901} (12.6)} & {24.5/43.1\color[HTML]{009901}(18.6)} & {21.7/36.8\color[HTML]{009901} (15.1)} & {20.4/36.1\color[HTML]{009901} (15.7)} & {19.73/35.08\color[HTML]{009901} (15.35)} \\
                            & DI-FGSM   & {53.8/59.6\color[HTML]{009901} (5.8)}  & {51.5/55.2\color[HTML]{009901} (3.7)}  & {40.1/45.7\color[HTML]{009901} (5.6)}  & {60.7/63.6\color[HTML]{009901}(2.9)}  & {57.6/62.1\color[HTML]{009901} (4.5)}  & {54.9/60.8\color[HTML]{009901} (5.9)}  & {53.1/57.83\color[HTML]{009901} (4.73)}   \\
                            & TI-FGSM   & {37.7/45.4\color[HTML]{009901} (7.7)}  & {37.3/44.2\color[HTML]{009901} (6.9)}  & {23.3/31.4\color[HTML]{009901} (8.1)}  & {38.2/44.1\color[HTML]{009901}(5.9)}  & {34.6/42.5\color[HTML]{009901} (7.9)}  & {34.7/43.1\color[HTML]{009901} (8.4)}  & {34.3/41.78\color[HTML]{009901} (7.48)}   \\
                            & MI-FGSM   & {55/72.7\color[HTML]{009901} (17.7)}   & {48.8/69.5\color[HTML]{009901} (20.7)} & {39.7/60\color[HTML]{009901} (20.3)}   & {58.3/74.1\color[HTML]{009901}(15.8)} & {52.7/72.5\color[HTML]{009901} (19.8)} & {52/72\color[HTML]{009901} (20)}       & {51.08/70.13\color[HTML]{009901} (19.05)} \\
                            & SINI-FGSM & {68.1/86.5\color[HTML]{009901} (18.4)} & {63.2/83.8\color[HTML]{009901} (20.6)} & {53.6/79.1\color[HTML]{009901} (25.5)} & {72.6/90.4\color[HTML]{009901}(17.8)} & {68.4/88.8\color[HTML]{009901} (20.4)} & {68.3/87\color[HTML]{009901} (18.7)}   & {65.7/85.93\color[HTML]{009901} (20.23)}  \\
                            & SSA       & {90.4/91.7\color[HTML]{009901} (1.3)}  & {89/90.6\color[HTML]{009901} (1.6)}    & {86.6/88.7\color[HTML]{009901} (2.1)}  & {92.4/92.9\color[HTML]{009901}(0.5)}  & {91.9/93\color[HTML]{009901} (1.1)}    & {92.8/93.4\color[HTML]{009901} (0.6)}  & {90.52/91.72\color[HTML]{009901} (1.2)}   \\
\multirow{-8}{*}{Res-152}   & NAA       & {88.1/89\color[HTML]{009901} (0.9)}    & {86.9/87.5\color[HTML]{009901} (0.6)}  & {81.6/83.1\color[HTML]{009901} (1.5)}  & {89.3/91.1\color[HTML]{009901}(1.8)}  & {90.2/90.9\color[HTML]{009901} (0.7)}  & {88.8/89.8\color[HTML]{009901} (1)}    & {87.48/88.57\color[HTML]{009901} (1.09)}  \\ \bottomrule
\end{tabular}%
}
\caption{Attack Success Rates on Non-defensive Training Models. Each data group follows this pattern: Baseline/Baseline+DBA(Gap), with green indicating enhancement by DBA over the original method, while red indicates a decrease. Importantly, it should be noted that, compared to white-box attacks, more attention should be focused on black-box attacks.}
\label{apptab:nodefense}
\end{table}


\begin{table}[t!]
\centering
\caption{Comparison of attack efficiency in frames per second (FPS) with and without DBA. Although DBA introduces a slight computational overhead, the decrease in FPS remains marginal (mostly within 5\% on average), indicating that DBA maintains high efficiency across various models and attack methods.}

\label{tab:efficiency_dba}
\resizebox{.6\linewidth}{!}{%
\begin{tabular}{@{}l|l|l|l|l@{}}
\toprule
Attack               & Model        & FPS (No DBA) & FPS (With DBA) & Overhead \\ \midrule
\multirow{2}{*}{MIG} & Inception-v3 & 0.4524       & 0.4355         & 3.88\%   \\
                     & MaxViT-T     & 0.1786       & 0.1731         & 3.20\%   \\ \midrule
\multirow{2}{*}{PGN} & Inception-v3 & 0.5919       & 0.5490         & 7.83\%   \\
                     & MaxViT-T     & 0.2307       & 0.2121         & 8.73\%   \\ \midrule
\multirow{2}{*}{SIA} & Inception-v3 & 0.1659       & 0.1602         & 3.62\%   \\
                     & MaxViT-T     & 0.1603       & 0.1576         & 1.74\%   \\ \midrule
\multirow{2}{*}{SSA} & Inception-v3 & 0.6360       & 0.5891         & 7.97\%   \\
                     & MaxViT-T     & 0.3074       & 0.2974         & 3.34\%   \\ \bottomrule
\end{tabular}%
}
\end{table}

\begin{table}[b]
\centering
\resizebox{.8\textwidth}{!}{%
\begin{tabular}{@{}l|l|lll|l@{}}
\toprule
Model                       & Attack    & Inc-v3-ens3                            & Inc-v3-ens4                            & IncRes-v2-ens                          & Average                                   \\ \midrule
                            & BIM       & {12.4/13.4\color[HTML]{009901} (1)}    & {13.5/13.3\color[HTML]{FE0000} (-0.2)} & {4.9/6.2\color[HTML]{009901} (1.3)}    & {10.27/10.97\color[HTML]{009901}(0.7)}   \\
                            & PGD       & {12.6/12.3\color[HTML]{FE0000} (-0.3)} & {12.4/13.4\color[HTML]{009901} (1)}    & {6.5/6.2\color[HTML]{FE0000} (-0.3)}   & {10.5/10.63\color[HTML]{009901}(0.13)}   \\
                            & DI-FGSM   & {17/19.2\color[HTML]{009901} (2.2)}    & {17.5/18.9\color[HTML]{009901} (1.4)}  & {8/9.3\color[HTML]{009901} (1.3)}      & {14.17/15.8\color[HTML]{009901}(1.63)}   \\
                            & TI-FGSM   & {22/27\color[HTML]{009901} (5)}        & {24.4/28.9\color[HTML]{009901} (4.5)}  & {12.9/15.5\color[HTML]{009901} (2.6)}  & {19.77/23.8\color[HTML]{009901}(4.03)}   \\
                            & MI-FGSM   & {21.8/25.3\color[HTML]{009901} (3.5)}  & {22.6/26.3\color[HTML]{009901} (3.7)}  & {10.1/12.9\color[HTML]{009901} (2.8)}  & {18.17/21.5\color[HTML]{009901}(3.33)}   \\
                            & SINI-FGSM & {39.5/47.5\color[HTML]{009901} (8)}    & {38.1/45.6\color[HTML]{009901} (7.5)}  & {22.7/27.7\color[HTML]{009901} (5)}    & {33.43/40.27\color[HTML]{009901}(6.84)}  \\
                            & SSA       & {74.5/79.6\color[HTML]{009901} (5.1)}  & {73.4/77.3\color[HTML]{009901} (3.9)}  & {59.2/64.8\color[HTML]{009901} (5.6)}  & {69.03/73.9\color[HTML]{009901}(4.87)}   \\
\multirow{-8}{*}{Inc-v3}    & NAA       & {57.8/57.1\color[HTML]{FE0000} (-0.7)} & {55.2/56.3\color[HTML]{009901} (1.1)}  & {34.7/34.2\color[HTML]{FE0000} (-0.5)} & {49.23/49.2\color[HTML]{FE0000}(-0.03)}  \\ \midrule
                            & BIM       & {10.2/11.7\color[HTML]{009901} (1.5)}  & {12.4/11.9\color[HTML]{FE0000} (-0.5)} & {5.5/6\color[HTML]{009901} (0.5)}      & {9.37/9.87\color[HTML]{009901}(0.5)}     \\
                            & PGD       & {11.7/12.1\color[HTML]{009901} (0.4)}  & {11.5/11.7\color[HTML]{009901} (0.2)}  & {6/5.5\color[HTML]{FE0000} (-0.5)}     & {9.73/9.77\color[HTML]{009901}(0.04)}    \\
                            & DI-FGSM   & {14/16.7\color[HTML]{009901} (2.7)}    & {15.4/16.2\color[HTML]{009901} (0.8)}  & {7.5/9.3\color[HTML]{009901} (1.8)}    & {12.3/14.07\color[HTML]{009901}(1.77)}   \\
                            & TI-FGSM   & {19.3/23\color[HTML]{009901} (3.7)}    & {20.3/26.1\color[HTML]{009901} (5.8)}  & {13.2/15.8\color[HTML]{009901} (2.6)}  & {17.6/21.63\color[HTML]{009901}(4.03)}   \\
                            & MI-FGSM   & {20.6/22.4\color[HTML]{009901} (1.8)}  & {18.3/19.9\color[HTML]{009901} (1.6)}  & {10.8/12.1\color[HTML]{009901} (1.3)}  & {16.57/18.13\color[HTML]{009901}(1.56)}  \\
                            & SINI-FGSM & {46.9/54.2\color[HTML]{009901} (7.3)}  & {43.4/50.8\color[HTML]{009901} (7.4)}  & {29.4/33.8\color[HTML]{009901} (4.4)}  & {39.9/46.27\color[HTML]{009901}(6.37)}   \\
                            & SSA       & {74.9/77.2\color[HTML]{009901} (2.3)}  & {72.1/77.4\color[HTML]{009901} (5.3)}  & {63.8/67\color[HTML]{009901} (3.2)}    & {70.27/73.87\color[HTML]{009901}(3.6)}   \\
\multirow{-8}{*}{Inc-v4}    & NAA       & {59.7/59.9\color[HTML]{009901} (0.2)}  & {56.7/57.3\color[HTML]{009901} (0.6)}  & {39.8/39.5\color[HTML]{FE0000} (-0.3)} & {52.07/52.23\color[HTML]{009901}(0.16)}  \\ \midrule
                            & BIM       & {9.4/11.7\color[HTML]{009901} (2.3)}   & {9.9/12.4\color[HTML]{009901} (2.5)}   & {6.5/7.1\color[HTML]{009901} (0.6)}    & {8.6/10.4\color[HTML]{009901}(1.8)}      \\
                            & PGD       & {11.8/11.6\color[HTML]{FE0000} (-0.2)} & {10.8/12\color[HTML]{009901} (1.2)}    & {7/7.1\color[HTML]{009901} (0.1)}      & {9.87/10.23\color[HTML]{009901}(0.36)}   \\
                            & DI-FGSM   & {16.3/20.4\color[HTML]{009901} (4.1)}  & {16.8/18.6\color[HTML]{009901} (1.8)}  & {10/12.6\color[HTML]{009901} (2.6)}    & {14.37/17.2\color[HTML]{009901}(2.83)}   \\
                            & TI-FGSM   & {28.2/35.3\color[HTML]{009901} (7.1)}  & {28.2/34.8\color[HTML]{009901} (6.6)}  & {25.3/33.7\color[HTML]{009901} (8.4)}  & {27.23/34.6\color[HTML]{009901}(7.37)}   \\
                            & MI-FGSM   & {21.9/25.5\color[HTML]{009901} (3.6)}  & {22.5/24\color[HTML]{009901} (1.5)}    & {13.4/16.3\color[HTML]{009901} (2.9)}  & {19.27/21.93\color[HTML]{009901}(2.66)}  \\
                            & SINI-FGSM & {55.8/62.5\color[HTML]{009901} (6.7)}  & {49/55.1\color[HTML]{009901} (6.1)}    & {40.9/45.6\color[HTML]{009901} (4.7)}  & {48.57/54.4\color[HTML]{009901}(5.83)}   \\
                            & SSA       & {80.7/81.7\color[HTML]{009901} (1)}    & {77.3/79.4\color[HTML]{009901} (2.1)}  & {76.9/77.4\color[HTML]{009901} (0.5)}  & {78.3/79.5\color[HTML]{009901}(1.2)}     \\
\multirow{-8}{*}{IncRes-v2} & NAA       & {63.8/63.7\color[HTML]{FE0000} (-0.1)} & {56.7/57.2\color[HTML]{009901} (0.5)}  & {50.5/51.2\color[HTML]{009901} (0.7)}  & {57/57.37\color[HTML]{009901}(0.37)}     \\ \midrule
                            & BIM       & {11.8/15.3\color[HTML]{009901} (3.5)}  & {11.7/17.1\color[HTML]{009901} (5.4)}  & {5/8.8\color[HTML]{009901} (3.8)}      & {9.5/13.73\color[HTML]{009901}(4.23)}    \\
                            & PGD       & {11.9/12.3\color[HTML]{009901} (0.4)}  & {11.4/12.8\color[HTML]{009901} (1.4)}  & {5.5/7.2\color[HTML]{009901} (1.7)}    & {9.6/10.77\color[HTML]{009901}(1.17)}    \\
                            & DI-FGSM   & {15.9/23\color[HTML]{009901} (7.1)}    & {16.6/24.5\color[HTML]{009901} (7.9)}  & {8.6/13.6\color[HTML]{009901} (5)}     & {13.7/20.37\color[HTML]{009901}(6.67)}   \\
                            & TI-FGSM   & {28.3/36.8\color[HTML]{009901} (8.5)}  & {31.1/38.5\color[HTML]{009901} (7.4)}  & {21.7/29.5\color[HTML]{009901} (7.8)}  & {27.03/34.93\color[HTML]{009901}(7.9)}   \\
                            & MI-FGSM   & {19/28.5\color[HTML]{009901} (9.5)}    & {19.9/28\color[HTML]{009901} (8.1)}    & {10/15.5\color[HTML]{009901} (5.5)}    & {16.3/24\color[HTML]{009901}(7.7)}       \\
                            & SINI-FGSM & {25.4/41.3\color[HTML]{009901} (15.9)} & {26.5/38.7\color[HTML]{009901} (12.2)} & {13.3/21\color[HTML]{009901} (7.7)}    & {21.73/33.67\color[HTML]{009901}(11.94)} \\
                            & SSA       & {78.1/80.4\color[HTML]{009901} (2.3)}  & {77.3/79.1\color[HTML]{009901} (1.8)}  & {67.3/71.2\color[HTML]{009901} (3.9)}  & {74.23/76.9\color[HTML]{009901}(2.67)}   \\
\multirow{-8}{*}{Res-152}   & NAA       & {41.1/43.5\color[HTML]{009901} (2.4)}  & {40/41.2\color[HTML]{009901} (1.2)}    & {23.2/23.9\color[HTML]{009901} (0.7)}  & {34.77/36.2\color[HTML]{009901}(1.43)}   \\ \bottomrule
\end{tabular}
%
}
\caption{Attack Success Rates on Defensive Training Models. Each data group follows this pattern: Baseline/Baseline+DBA(Gap), with green indicating enhancement by DBA over the original method, while red indicates a decrease.}
\label{apptab:defense}
\end{table}

\section{Computational Efficiency Analysis of DBA}
\label{appendix:efficiency_dba}

% Please add the following required packages to your document preamble:
% \usepackage{booktabs}
% \usepackage{multirow}
% \usepackage{graphicx}


Table~\ref{tab:efficiency_dba} presents the detailed frame-per-second (FPS) measurements for different adversarial attack methods applied to two target models, with and without incorporating DBA. The results show that DBA introduces a modest computational overhead across all configurations. The maximum overhead observed is below 9\%, with an average overhead of approximately 5\%. This confirms that DBA is computationally efficient and can be integrated into existing black-box attack pipelines with minimal runtime impact.


\end{document}
